I want to run a similarity check on records that have the same RecordID but different names (strings).
A RecordID can have two or more names, and I want a similarity score between all the names that share a RecordID.
Hey @Puranjaysaprax,
I don't know what your dataset looks like, so it's hard to answer this question. From what you've said, this is the setup I think you'll want:
Essentially, you'll need an exact match on Record ID, then a name match on your names. The bottom box will then let you output the score. The final output shows which rows matched.
HTH,
Ira
Hi @Irawatt, this can be considered a sample dataset. For the names that share a Record ID, I want to find a string similarity score between them.
Record ID | Name for Record ID |
1001 | ABC - XYZ |
1001 | ABC - XYK |
1001 | CBC - XYZ |
1002 | XYZABC |
1002 | XYZAC |
1003 | KLMKLM |
1003 | KLMKLW |
1003 | KLM12M |
1003 | KLMKKM |
Great, thanks @Puranjaysaprax,
So from the workflow I see that rows 1 & 2, 6 & 7, and 6 & 9 match. Of course, you can also lower the required threshold if you want more matches. If you have any questions about how this works, just ask :)
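If it helps to sanity-check the scores outside the workflow, here is a rough Python sketch of the same idea: group rows by Record ID (the exact match), then score every pair of names inside each group. It uses `difflib.SequenceMatcher` from the standard library as a stand-in similarity measure, which is an assumption on my part and not the algorithm the workflow's matching tool uses, so the pairs it flags will not necessarily be the same ones listed above. The row numbers, the `score_pairs` helper, and the `THRESHOLD` value are all made up for illustration.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations

# Sample rows from this thread: (row number, Record ID, name).
# Row numbers are simply the position of each row in the sample table.
ROWS = [
    (1, 1001, "ABC - XYZ"),
    (2, 1001, "ABC - XYK"),
    (3, 1001, "CBC - XYZ"),
    (4, 1002, "XYZABC"),
    (5, 1002, "XYZAC"),
    (6, 1003, "KLMKLM"),
    (7, 1003, "KLMKLW"),
    (8, 1003, "KLM12M"),
    (9, 1003, "KLMKKM"),
]

# Hypothetical cut-off; lower it to get more matches, raise it for fewer.
THRESHOLD = 0.85


def score_pairs(rows):
    """Return (record_id, row_a, row_b, score) for every pair of names
    that share a Record ID. Scores are in the range 0.0 to 1.0."""
    # Step 1: exact match on Record ID, done here by grouping.
    groups = defaultdict(list)
    for row_no, record_id, name in rows:
        groups[record_id].append((row_no, name))

    # Step 2: compare every pair of names within each group.
    scored = []
    for record_id, members in groups.items():
        for (row_a, name_a), (row_b, name_b) in combinations(members, 2):
            # Stand-in metric: difflib's ratio, not the workflow's own score.
            score = SequenceMatcher(None, name_a, name_b).ratio()
            scored.append((record_id, row_a, row_b, score))
    return scored


if __name__ == "__main__":
    for record_id, row_a, row_b, score in score_pairs(ROWS):
        flag = "match" if score >= THRESHOLD else ""
        print(f"{record_id}  rows {row_a} & {row_b}  {score:.3f}  {flag}")
```

Names are only ever compared within the same Record ID, so a group of n names produces n*(n-1)/2 scores (3 for 1001, 1 for 1002, 6 for 1003).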
HTH,
Ira