Get Inspire insights from former attendees in our AMA discussion thread on Inspire Buzz. ACEs and other community members are on call all week to answer!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Workflow Assistance - Fuzzy match outputs

HarryRoles
7 - Meteor

Hi All,

 

Hoping someone could assist.

 

I need to create a workflow that:

1. Uses fuzzy match logic to identify records in a column that are similar to each other in that column e.g. if there was Harry Roles, Harrry Rolles and Harryy Rooles in the column, I want these to be 'matched' as they are similar

2. Should there be a match, flag this in some way (yes output in a column, highlight it yellow etc)

 

I hope this helps.

 

Many thanks in advance.

6 REPLIES 6
echuong1
Alteryx Alumni (Retired)

I'll generally use the fuzzy match in combination with make groups and the find and replace.

 

The fuzzy match will help identify values that are similar to others in the dataset. You can play around with the settings to work with your data. The make groups will find anything that's similar, and create a "normalized" group. The keys are the variations in the dataset, and the group is the normalized version. From there, you can use a find and replace to append the group (normalized) value to the end.

 

echuong1_0-1614363922097.png

 

Emil_Kos
17 - Castor
17 - Castor

Hi,


There is a really good article about how to use fuzzy matching for two data sets:

 

https://community.alteryx.com/t5/Alteryx-Designer-Knowledge-Base/Prepare-two-inputs-for-Fuzzy-Match-...

 

What @echuong1 is much more elegant but I wanted to explore @EricWe approach and I have adjusted his workflow for this exercise:

 

Emil_Kos_0-1614365518514.png

 

 

The output:

 

Emil_Kos_1-1614365528304.png

Hope this helps!

HarryRoles
7 - Meteor

Hey!

 

I think this is almost what I need... a few tweaks and I think I may be there.

 

With the example workflow you provided, how would I output the match score into a column because I would then like to be able to filter out the 100% matches as I am only looking for variations. Would you know how?

 

Thank you!

pedrodrfaria
13 - Pulsar

Hi @HarryRoles 

 

In the Fuzzy Matching tool configuration window, you can select to also output the match score.

 

pedrodrfaria_0-1614861223152.png

 

This way it will populate the match score.

 

pedrodrfaria_1-1614861284439.png

 

 

Pedro.

HarryRoles
7 - Meteor

Hey,

 

I do have that option ticked but i dont see a column for the match score output.

 

I am thinking it is maybe to do with the worklow that echuong suggested which used make group and find replace tool?

 

Cheers

pedrodrfaria
13 - Pulsar

@HarryRoles 

 

Using @echuong1  WF, you can just add the filter right after the fuzzy matching.

pedrodrfaria_1-1614861678080.png

 

 

pedrodrfaria_0-1614861663505.png

 

Pedro.

 

Labels