Bring your best ideas to the AI Use Case Contest! Enter to win 40 hours of expert engineering support and bring your vision to life using the powerful combination of Alteryx + AI. Learn more now, or go straight to the submission form.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Fuzzy Match: Words Levenshtein Distance Approach

mshinn9999
7 - Meteor

Hello all. I have a conceptual question on the Fuzzy Match tool (PURGE).  I am trying to de-dupe/consolidate University names.  After cleansing the data (de-duping, getting rid of spaces, etc.), I ran the Funny match logic and leveraged the "Generate Keys for Each Word" approach, along with the Words - Levenshtein Distance Match function method. 

 

Assuming that this is not too generic of a question, how do I get below sample data set as ONE single record vs 3?  Below represents the types of data patterns that I see.  Is there another setting that I should consider? (data is dummied up).

 

Mike

 

Texas A M Univ

Texas A M University

Texas A M University College Station

 

1 REPLY 1
alexnajm
18 - Pollux
18 - Pollux

This is normal behavior because of the key generation - use a Unique tool afterwards to get a unique list of the fuzzy matches!

Labels
Top Solution Authors