
Alteryx Designer Desktop Discussions


fuzzy match Problem statement

Rohan139
6 - Meteoroid

Hi peeps, 

 

I'm facing an issue while using the Fuzzy Match tool followed by the Make Group tool.

I am applying Fuzzy Match to the table below:

Record id   NAME
1           P R BISWAS
2           PULAKH RAJ BISWAS
3           RANJAN CHOPRA
4           RAN CHOPRA

 

I am running the fuzzy match on NAME using the Best of Jaro & Levenshtein algorithm, with Double Metaphone to generate keys. I have set the match threshold to 85% and the weight to 100.

 

The output looks like this:

Record id 1   Record id 2   Match Score
1             2             Null
3             4             92

 

 

It shows no match score for record IDs 1 and 2. When I lowered the threshold to 50%, it did show a match score of 50 for them. The problem is that at that threshold it also matches them with record IDs 3 and 4, which is not right: records 3 and 4 are the same person, and records 1 and 2 are the same person, which is clear from the table itself.
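To illustrate why an initials-only name scores poorly against its spelled-out form, here is a minimal Python sketch of a normalized edit-distance similarity. It is not Alteryx's scoring: the `similarity` normalization (dividing by the longer string's length) is an assumption, and Fuzzy Match's "Best of Jaro & Levenshtein" option will generally give different numbers.

```python
def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,           # deletion
                            curr[j - 1] + 1,       # insertion
                            prev[j - 1] + cost))   # substitution
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Edit distance scaled to 0..1 by the longer string (assumed normalization)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1 - levenshtein(a, b) / longest


pairs = [("P R BISWAS", "PULAKH RAJ BISWAS"), ("RANJAN CHOPRA", "RAN CHOPRA")]
for left, right in pairs:
    print(f"{left!r} vs {right!r}: {similarity(left, right):.2f}")
```

Under this sketch the first pair needs 7 edits over 17 characters and the second needs 3 edits over 13, so the initials-only pair lands well below the other one, which is the same ordering the tool reports.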

 

I have tried different algorithms as well, but didn't get the desired results.

 

Please suggest a way to generate the table below after using the Make Group tool:

 

Parent key   Child key
1            1
1            2
3            3
3            4
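The parent/child table above can be read as the result of grouping matched pairs transitively. A rough Python sketch of that idea follows; `make_groups` is a hypothetical helper, and choosing the smallest key in each group as the parent is an assumption made to reproduce the table, not a documented description of how the Make Group tool picks its key.

```python
def make_groups(pairs):
    """Group matched record IDs transitively; the smallest ID in a group is its parent."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps lookups short
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the smaller ID as the root so it ends up as the parent key.
            low, high = sorted((root_a, root_b))
            parent[high] = low

    return sorted((find(x), x) for x in list(parent))


matched_pairs = [(1, 2), (3, 4)]  # the two pairs that should match
for parent_key, child_key in make_groups(matched_pairs):
    print(parent_key, child_key)
```

Because the grouping is transitive, a single extra pair such as (2, 3) would pull all four records under parent key 1, which is why a loose threshold can collapse separate people into one group.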
2 REPLIES
Adrian_T
Alteryx Alumni (Retired)

Hey @Rohan139,

 

I was able to obtain your desired output with your preferred Match Options enabled (Double Metaphone and Best of Jaro & Levenshtein Distance) and the match threshold set to 50%.

Adrian_T_2-1665123690027.png

 

I have attached my workflow for your reference.

 

As a quick tip, you can make your fuzzy matching experience more intuitive by using [NAME] as the Record ID Field. Example below:

Adrian_T_0-1665123634524.png

Adrian_T_1-1665123660219.png

 

Hope this helps! 😊

Rohan139
6 - Meteoroid

Hi @Adrian_T,

 

It's not working for a case like this. As you can see in the table below, it is generating wrong parent names for the last 4 records.

Parent name           Child name
MOHAMMAD IQBAL        MOHAMMAD IQBAL
MOHAMMAD IQBAL        MOHAMMAD IQBAL WANI
P R BISWAS            P R BISWAS
PULAK RANJAN BISWAS   PULAK RANJAN BISWAS
PULAK RANJAN BISWAS   R CHOPRA
PULAK RANJAN BISWAS   RANJAN CHOPRA

 

The output should look like this:

 

Parent name      Child name
MOHAMMAD IQBAL   MOHAMMAD IQBAL
MOHAMMAD IQBAL   MOHAMMAD IQBAL WANI
P R BISWAS       P R BISWAS
P R BISWAS       PULAK RANJAN BISWAS
R CHOPRA         R CHOPRA
R CHOPRA         RANJAN CHOPRA
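The expected grouping seems to follow a token-by-token rule rather than whole-string similarity: a single letter stands in for a full name part, and a shorter name may be the start of a longer one. A minimal Python sketch of such a rule is below. It is a swapped-in technique outside Alteryx, not a Fuzzy Match setting, and `tokens_agree` and `same_person` are hypothetical names introduced only for illustration.

```python
from itertools import combinations


def tokens_agree(a: str, b: str) -> bool:
    """Two name parts agree if equal, or if one is a single-letter initial of the other."""
    if a == b:
        return True
    short, long_ = sorted((a, b), key=len)
    return len(short) == 1 and long_.startswith(short)


def same_person(name_a: str, name_b: str) -> bool:
    """Assumed rule: the shorter name must agree, part by part, with the start of the longer."""
    parts_a, parts_b = name_a.split(), name_b.split()
    if not parts_a or not parts_b:
        return False
    # zip stops at the shorter name, so extra trailing parts are ignored.
    return all(tokens_agree(x, y) for x, y in zip(parts_a, parts_b))


names = [
    "MOHAMMAD IQBAL", "MOHAMMAD IQBAL WANI",
    "P R BISWAS", "PULAK RANJAN BISWAS",
    "R CHOPRA", "RANJAN CHOPRA",
]
matches = [(a, b) for a, b in combinations(names, 2) if same_person(a, b)]
for a, b in matches:
    print(a, "<->", b)
```

On these six names the rule pairs each short form only with its own long form, and PULAK RANJAN BISWAS no longer links to either CHOPRA record. It is deliberately crude: a one-part name such as "P" would agree with any name starting with P, so it would need tightening before use on real data.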

 

I applied the same 50% match threshold and the same algorithm as before.

 

Could you help me out with this?

 

Thanks in advance.
