Alteryx Designer Desktop Discussions

Fuzzy Match on name cleaning

mmustkee
9 - Comet

Hi Team,

 

I have a table with a list of names. Some of the names refer to the same person but are typed differently: for example, punctuation is missing or added, a title such as MR, MRS or DR is added or missing, or the last name appears in place of the first name.

I want a match score and also one common name in the Result column.

Kindly help.

Name | Result
Saif /Name | Saif /Name
Name/Saif | Saif /Name
Name/Saif MR | Saif /Name
MR Name/SaiF | Saif /Name
Kara/Tim |
Kara,Tim |
Dim/Tore Show |
DimTore/ Show |
Dim/Show tore |
ERS/COL |
RERS/COL |
ERS/COL, |
NDLER/JERRY L |
NDLER/L JERRY |
NDLER/JERRY |
NDLER/L, JERRY |
NDLER/JERRY, |
NDLER/L JERRY ANDREW |
HUG/XUEN |
XUEN/HUG |
XUEN,HUG |
XUEN,HUG MS |
XUEN,HUG DR |
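
For readers who want to see the cleaning rules described above spelled out, here is a minimal Python sketch (for example for a Python tool, or outside Alteryx). It is not what the Fuzzy Match tool does internally. The names `TITLES` and `name_key` are hypothetical, and the title list is an assumption taken only from the examples in this post.

```python
import re

# Titles to ignore. Assumption: only the titles seen in the sample data.
TITLES = {"mr", "mrs", "ms", "dr"}

def name_key(raw: str) -> str:
    """Build a comparison key that ignores case, punctuation, titles and word order."""
    # Turn every run of non-alphanumeric characters into a space, then lowercase.
    tokens = re.sub(r"[^A-Za-z0-9]+", " ", raw).lower().split()
    # Drop titles and sort the remaining tokens so swapped first/last names agree.
    return " ".join(sorted(t for t in tokens if t not in TITLES))

if __name__ == "__main__":
    for sample in ["Saif /Name", "MR Name/SaiF", "XUEN,HUG DR", "HUG/XUEN"]:
        print(repr(sample), "->", repr(name_key(sample)))
```

Names that differ only by punctuation, title or word order get the same key. Names with a changed spelling or spacing (such as "DimTore/ Show" versus "Dim/Tore Show") still produce different keys, so a fuzzy comparison is still needed on top of this.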

 

Yoshiro_Fujimori
15 - Aurora

Hi @mmustkee ,

 

As a starting point, please look at the output from the Fuzzy Match tool.

Depending on your preference, you may want to calibrate settings such as Match Function and Match Threshold.
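
To illustrate what a match score and a threshold mean in general, here is a small Python sketch. It uses `difflib.SequenceMatcher` from the standard library as a stand-in; it is not one of the Fuzzy Match tool's own match functions, and the default threshold of 80 is an arbitrary value chosen for illustration. `match_score` and `is_match` are hypothetical helper names.

```python
from difflib import SequenceMatcher

def match_score(a: str, b: str) -> float:
    """Case-insensitive similarity between two strings, on a 0-100 scale."""
    return 100.0 * SequenceMatcher(None, a.lower(), b.lower()).ratio()

def is_match(a: str, b: str, threshold: float = 80.0) -> bool:
    """True when the score reaches the threshold (default is illustrative only)."""
    return match_score(a, b) >= threshold

if __name__ == "__main__":
    pairs = [("Kara/Tim", "Kara,Tim"), ("ERS/COL", "RERS/COL"), ("HUG/XUEN", "XUEN/HUG")]
    for left, right in pairs:
        print(left, right, round(match_score(left, right), 1), is_match(left, right))
```

With this character-based score, "Kara/Tim" and "Kara,Tim" reach 87.5, while "HUG/XUEN" and "XUEN/HUG" reach only 50 because the words are swapped. That is why the choice of match function and any pre-cleaning matter as much as the threshold.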

Yoshiro_Fujimori_0-1682998869046.png

Yoshiro_Fujimori_2-1682998944380.png

Yoshiro_Fujimori_1-1682998909606.png

Yoshiro_Fujimori_3-1682998990572.png

 

mmustkee
9 - Comet

 Hi Yoshiro,

 

Thanks for your solution.

 

If the match score is more than 85%, I want a Name 2 column that holds one common name for all the different spellings. For example:

Saif /Name | Saif /Name
Name/Saif | Saif /Name
Name/Saif MR | Saif /Name
MR Name/SaiF | Saif /Name

 

The booker above is a single person whose name was entered in different ways, so I want the Name 2 column to show one common name next to every differently typed name.
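
One way to picture this requirement is the Python sketch below. It is only an illustration under stated assumptions, not an Alteryx workflow: the first spelling seen in each group is used as the common name, titles are limited to those in the sample data, similarity comes from `difflib.SequenceMatcher`, and all function names are hypothetical. The original name is carried through unchanged next to the common name and score.

```python
import re
from difflib import SequenceMatcher

TITLES = {"mr", "mrs", "ms", "dr"}  # assumption: titles seen in the sample data

def key(name: str) -> str:
    """Lowercase, strip punctuation and titles, and sort tokens."""
    tokens = re.sub(r"[^A-Za-z0-9]+", " ", name).lower().split()
    return " ".join(sorted(t for t in tokens if t not in TITLES))

def score(a: str, b: str) -> float:
    """Similarity of the cleaned keys on a 0-100 scale."""
    return 100.0 * SequenceMatcher(None, key(a), key(b)).ratio()

def assign_common_names(names, threshold=85.0):
    """Return (original, common_name, score) rows; originals are never modified."""
    representatives = []  # first-seen spelling of each group
    rows = []
    for name in names:
        best, best_score = None, 0.0
        for rep in representatives:
            s = score(name, rep)
            if s > best_score:
                best, best_score = rep, s
        if best is None or best_score < threshold:
            # No group is close enough: this spelling starts a new group.
            representatives.append(name)
            best, best_score = name, 100.0
        rows.append((name, best, round(best_score, 1)))
    return rows

if __name__ == "__main__":
    sample = ["Saif /Name", "Name/Saif", "Name/Saif MR", "MR Name/SaiF", "Kara/Tim", "Kara,Tim"]
    for row in assign_common_names(sample):
        print(row)
```

Because the result depends on which spelling appears first, a list of known-correct names would give more predictable common names than this first-seen rule.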

mmustkee
9 - Comet

Also, we have extra names beyond those in the original list.

Yoshiro_Fujimori
15 - Aurora

Hi @mmustkee ,

 

In order to group the names, you would need to know which names are correct and which are not. (I could not figure out a solution without this flag.)

Assuming you can add that flag to the data, you can use the "Merge Mode" of the Fuzzy Match tool, which compares only records from different sources.

 

As the output from the Fuzzy Match tool mixes records from both sources, I had to add a data cleansing step.
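
The idea of comparing only across sources can be sketched in Python as follows. This is a rough illustration, not the Fuzzy Match tool's Merge Mode itself: it assumes each record carries a "Y" (correct) or "N" (fuzzy) flag, scores with `difflib.SequenceMatcher` on the raw lowercase text, and uses hypothetical function names. The 85 threshold is the figure requested earlier in this thread.

```python
from difflib import SequenceMatcher

def score(a: str, b: str) -> float:
    """Case-insensitive similarity on a 0-100 scale."""
    return 100.0 * SequenceMatcher(None, a.lower(), b.lower()).ratio()

def match_against_correct(records, threshold=85.0):
    """For each 'N' record, find the closest 'Y' record; never compare N with N."""
    correct = [name for name, flag in records if flag == "Y"]
    fuzzy = [name for name, flag in records if flag == "N"]
    if not correct:
        return [(name, None, 0.0) for name in fuzzy]
    out = []
    for name in fuzzy:
        best = max(correct, key=lambda c: score(name, c))
        s = score(name, best)
        # Keep the pairing only when it reaches the threshold.
        out.append((name, best if s >= threshold else None, round(s, 1)))
    return out

if __name__ == "__main__":
    records = [
        ("ERS/COL", "Y"), ("Kara/Tim", "Y"),
        ("RERS/COL", "N"), ("ERS/COL,", "N"), ("Kara,Tim", "N"),
    ]
    for row in match_against_correct(records):
        print(row)
```

Restricting comparisons to the flagged correct names means every fuzzy name maps to a name you trust, instead of to another misspelling.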

 

Workflow

Yoshiro_Fujimori_0-1683003155493.png

 

Input

You need to flag the correct names with "Y" and the fuzzy names with "N".

Yoshiro_Fujimori_1-1683003234651.png

 

Output

Yoshiro_Fujimori_2-1683003266459.png

 

I hope this may be of some help.

mmustkee
9 - Comet

Hi Yoshiro,

 

That looks OK, but I do not want my original name to be changed at all, no matter how many different ways it appears in column "A". The fuzzy match columns should appear next to the original name, with the match score and one common name.

Kindly help on an urgent basis.

Yoshiro_Fujimori
15 - Aurora

Hi @mmustkee ,

 

This is what I can do for today.

I hope this helps. Good luck.

 

Workflow

Yoshiro_Fujimori_0-1683010754843.png

Output

Yoshiro_Fujimori_0-1683011080659.png

 

mmustkee
9 - Comet

Not working 
