Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

fuzzy match Problem statement

Rohan139
6 - Meteoroid

Hi peeps, 

 

Facing some issue while using fuzzy match and Further Make group tool.

I am Applying fuzzy match on below given table :

Record id NAME
1P R BISWAS 
2PULAKH RAJ BISWAS 
3RANJAN CHOPRA 
4RAN CHOPRA 

 

I am Doing fuzzy match on Name where I am using ( Best of Jaro & Levenshtein) Algorithm and Double Metaphon for generate keys. I have kept the Match threshold to 85% and Weight to 100.

 

So Output is like below:

Record id 1 Record id 2Match Score 
12Null
3492

 

 

It's not showing match score for record id 1 And 2  , But When I changed the threshold to 50% it does show me the match Score 50.  Problem is in that case it Also shows the match with Record Id 3 and 4 as well which is not right as Person on Record ID 3&4 are same and Person on Record ID 1&2 are same which is very much clear from the table itself.

 

I have tried using different Algorithm as well , but didn't get desired results.

 

Please Suggest some way out so that i can generate below table after using make group tool:

 

Parent key Child key 
11
12
33
34
2 REPLIES 2
Adrian_T
Alteryx Alumni (Retired)

Hey @Rohan139,

 

I was able to obtain your desired output while ensuring your desired Match Options are toggled - Double Metaphone and Best of Jao & Leven Distance, with match threshold set to 50%.

Adrian_T_2-1665123690027.png

 

I have attached my workflow for your reference.

 

As a quick tip, you can make your fuzzy matching experience more intuitive by using [NAME] as the Record ID Field. Example below:

Adrian_T_0-1665123634524.png

Adrian_T_1-1665123660219.png

 

Hope this helps! 😊

Rohan139
6 - Meteoroid

Hi  , @Adrian_T 

 

It's not working with the case like this , you can see in the  below  table its is generating wrong Parent Names for last 4 records.

MOHAMMAD IQBALMOHAMMAD IQBAL
MOHAMMAD IQBALMOHAMMAD IQBAL WANI
P R BISWASP R BISWAS
PULAK RANJAN BISWASPULAK RANJAN BISWAS
PULAK RANJAN BISWASR CHOPRA
PULAK RANJAN BISWASRANJAN CHOPRA

 

Output should be like this :

 

MOHAMMAD IQBALMOHAMMAD IQBAL
MOHAMMAD IQBALMOHAMMAD IQBAL WANI
P R BISWASP R BISWAS
P R BISWASPULAK RANJAN BISWAS
R CHOPRAR CHOPRA
R CHOPRARANJAN CHOPRA

 

I have applied same 50% match threshold and same algorithm as before.

 

Could you help me out with this?

 

Thanks in Advance.

Labels
Top Solution Authors