Hi Team,
I have a table containing a list of names, some of which are the same name typed differently. For example, punctuation is missing or added, a title such as MR, MRS or DR is added or missing, or the last name appears in place of the first name.
I want a match score and also one common name in the Result column.
Kindly help.
Name | Result |
Saif /Name | Saif /Name |
Name/Saif | Saif /Name |
Name/Saif MR | Saif /Name |
MR Name/SaiF | Saif /Name |
Kara/Tim | |
Kara,Tim | |
Dim/Tore Show | |
DimTore/ Show | |
Dim/Show tore | |
ERS/COL | |
RERS/COL | |
ERS/COL, | |
NDLER/JERRY L | |
NDLER/L JERRY | |
NDLER/JERRY | |
NDLER/L, JERRY | |
NDLER/JERRY, | |
NDLER/L JERRY ANDREW | |
HUG/XUEN | |
XUEN/HUG | |
XUEN,HUG | |
XUEN,HUG MS | |
XUEN,HUG DR | |
Hi @mmustkee ,
As a starting point, please look at the output from the Fuzzy Match tool.
Depending on your preference, you may want to calibrate settings such as Match Function and Match Threshold.
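To make the idea of a match score concrete outside of Alteryx, here is a minimal Python sketch. It is not the Fuzzy Match tool's algorithm: it assumes a simple normalization (uppercase, drop punctuation, drop a hypothetical title list, sort the tokens so "Name/Saif" and "Saif /Name" compare equal) and uses the standard library's `difflib.SequenceMatcher` for the similarity. The names `TITLES`, `name_key` and `match_score` are illustrative only.

```python
import re
from difflib import SequenceMatcher

# Assumed list of titles to ignore; extend it to suit the data.
TITLES = {"MR", "MRS", "MS", "DR"}

def name_key(raw: str) -> str:
    """Uppercase, drop punctuation and titles, then sort the remaining tokens."""
    tokens = re.split(r"[^A-Za-z]+", raw.upper())
    kept = [t for t in tokens if t and t not in TITLES]
    return " ".join(sorted(kept))

def match_score(a: str, b: str) -> float:
    """Similarity of the two normalized names on a 0-100 scale."""
    return 100.0 * SequenceMatcher(None, name_key(a), name_key(b)).ratio()

print(name_key("MR Name/SaiF"))                  # NAME SAIF
print(match_score("Saif /Name", "MR Name/SaiF")) # 100.0
print(match_score("HUG/XUEN", "XUEN,HUG DR"))    # 100.0
```

Sorting the tokens handles swapped first and last names, but it will not rescue names whose tokens are fused together, such as "DimTore/ Show", so those still depend on the threshold chosen.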
Hi Yoshiro,
Thanks for your solution.
If the match is more than 85%, I want a Name 2 column showing one common name for all the differently typed names. For example:
Saif /Name | Saif /Name |
Name/Saif | Saif /Name |
Name/Saif MR | Saif /Name |
MR Name/SaiF | Saif /Name |
The booker above is a single person whose name was entered in different ways, so I want the Name 2 column to show one common name next to every differently typed variant.
We also have extra names beyond the original list.
Hi @mmustkee ,
In order to group the names, you would need to know which names are correct and which are not. (I could not figure out a solution without this flag.)
Assuming you can add that flag to the data, you can use the "Merge Mode" of the Fuzzy Match tool, which compares only records from different sources.
As the output from the Fuzzy Match tool mixes records from both sources, I had to add some data cleansing steps.
Workflow
Input
You need to specify "Y" to "correct names" and "N" to "fuzzy names".
Output
I hope this may be of some help.
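For readers who want to see the logic of the flag-based approach in code, the following Python sketch mirrors it under stated assumptions. It is not the Alteryx workflow itself: each fuzzy name is compared only against the names flagged "Y", the original name is left untouched, and a common name is filled in only when the best score reaches a threshold (85 here). The helper `_key`, the function `assign_common_names` and the title list are hypothetical, and the score comes from `difflib.SequenceMatcher`, not from the Fuzzy Match tool.

```python
import re
from difflib import SequenceMatcher

TITLES = {"MR", "MRS", "MS", "DR"}  # assumed title list

def _key(raw):
    """Normalize a name: uppercase, strip punctuation and titles, sort tokens."""
    tokens = re.split(r"[^A-Za-z]+", raw.upper())
    return " ".join(sorted(t for t in tokens if t and t not in TITLES))

def assign_common_names(rows, threshold=85.0):
    """rows: list of (name, flag) pairs; flag "Y" marks a correct name.

    Returns one (original name, best score, common name) tuple per row.
    The common name is empty when no correct name reaches the threshold.
    """
    correct = [name for name, flag in rows if flag == "Y"]
    out = []
    for name, _flag in rows:
        best_name, best_score = "", 0.0
        for candidate in correct:
            ratio = SequenceMatcher(None, _key(name), _key(candidate)).ratio()
            score = 100.0 * ratio
            if score > best_score:
                best_name, best_score = candidate, score
        common = best_name if best_score >= threshold else ""
        out.append((name, round(best_score, 1), common))
    return out

rows = [("Saif /Name", "Y"), ("Name/Saif MR", "N"), ("Kara/Tim", "N")]
for original, score, common in assign_common_names(rows):
    print(original, "|", score, "|", common)
```

In this sample, "Name/Saif MR" is assigned "Saif /Name", while "Kara/Tim" gets an empty common name because no flagged name is close enough.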
Hi Yoshiro,
That looks OK, but I do not want my original name to be changed at all, no matter how many differently typed variants of it appear in column "A". The fuzzy match column should appear next to the original name, together with the match score and one common name.
Kindly help on an urgent basis.
Not working