Hi,
I would like to mask or anonymize people's names in a comment text field. For example, "Lee is good person" should be masked to "abc is good person".
Is there any way to do this in Alteryx?
Thanks
Hsac
Ah, I see. Since you do not have a list of expected names, you will need an NLP tool. I suggest looking into Named Entity Recognition, which is part of the Alteryx Intelligence Suite.
https://www.youtube.com/watch?v=se-wnjfP2UY
https://www.alteryx.com/products/intelligence-suite
https://help.alteryx.com/20221/designer/alteryx-intelligence-suite
https://help.alteryx.com/20221/designer/named-entity-recognition
Let us know if this is what you're looking for.
Thanks, Deb
Here's the yxmd workflow.
@harinder1301,
You can extract the list of names from the data source.
Use https://www.mockaroo.com/ to create a list of replacement names.
Then use a join by position to pair each original name with a new one.
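For anyone who wants to see the idea outside of a workflow, here is a rough Python sketch of the same steps: pair the extracted names with generated names by row position, then swap them in the comment text. The two name lists and the `anonymize` helper are made up for illustration; this is not the attached workflow.

```python
import re

# Hypothetical inputs: names extracted from the data and generated replacements.
real_names = ["Lee", "Maria"]
fake_names = ["Alex", "Sam"]

# Pair the two lists row by row, similar in spirit to a join by position.
mapping = dict(zip(real_names, fake_names))

# One pattern that matches any known name as a whole word only.
pattern = re.compile(r"\b(" + "|".join(map(re.escape, mapping)) + r")\b")

def anonymize(comment):
    """Replace each known name in the comment with its paired replacement."""
    return pattern.sub(lambda m: mapping[m.group(1)], comment)

print(anonymize("Lee is good person"))
```

The whole-word match is there so that a name inside a longer word (for example "Lee" in "Leeds") is left alone. This only works when you already have the list of names.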
Workflow attached.
Hope this helps!
Regards,
Hi @harinder1301!
To build on @messi007's suggestion, you might also look into the functionality of the Find and Replace tool.
Thanks. Is there any way to find the names of the people in nearly 5000 comments?
Thanks for the quick reply.
I have attached some sample records for your reference.
Regards
Similar to @ddiesel's workflow:
Hope this helps
Thanks for the reply. Actually, my question is: how can I identify the names in 5000 comment rows?
Is there any tool in Alteryx to identify the names from the comment?
Appreciate your reply.
@harinder1301 Do you have the list of names? What do you want the final output to look like?
Hi,
No, we don't have a list of people. A name can be a first name, a known-as name, a surname, etc. This is the feedback provided. Here is the desired final output:
Thanks for your help!
It doesn't look pretty, but I took a stab at it here. Assuming that all names start with a capital letter, I used RegEx to extract possible names. The main issue is that any capitalized word that is not a name gets picked up as well (though I did exclude single letters like I and A), so you would need to update this list to remove non-names. The other limitation is that every name is anonymized as xyz, so there is no distinct replacement per name.
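As a rough illustration of the RegEx idea, here is a minimal Python sketch. It assumes names are capitalized words of two or more letters, and it uses a hand-maintained exclusion list. The `NON_NAMES` set and the `mask_names` helper are hypothetical and are not taken from the workflow.

```python
import re

# Hypothetical list of capitalized words that are not names; extend as needed.
NON_NAMES = {"The", "He", "She", "Thanks", "This"}

# An uppercase letter followed by at least one lowercase letter,
# so single letters such as "I" and "A" are never matched.
CAPITALIZED = re.compile(r"\b[A-Z][a-z]+\b")

def mask_names(comment, placeholder="xyz"):
    """Replace every capitalized word not in NON_NAMES with the placeholder."""
    def swap(match):
        word = match.group(0)
        return word if word in NON_NAMES else placeholder
    return CAPITALIZED.sub(swap, comment)

print(mask_names("Lee is good person"))
```

Any capitalized word missing from the exclusion list is masked too, so the list needs to be reviewed against your own data.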
Thanks Robin. Could you please share the yxmd file?
Many thanks Robin and Deb,
Both solutions worked for us, but we have chosen NLP as it worked better with our data.