SOLVED
Comparing Options / Fuzzy Match Option
Chad_Cox
7 - Meteor
‎06-14-2024
01:08 PM
I have two files that I have already merged on Commodity Code, and I have removed the rows where the descriptions match exactly. That leaves 6,000+ records, and I want to see whether Fuzzy Match can give me a score for how closely the remaining descriptions match. Is this an option? I'm also open to other suggestions. A sample data file is attached; in the attached image, I want to know how closely the two columns in red match each other.
Labels:
- Fuzzy Match
1 REPLY
jdminton
13 - Pulsar
‎06-14-2024
01:40 PM
Yes, you can do this. You will need to:
- Start with 2 separate files (or split the two description columns into 2 separate streams with two Select tools).
- Add RecordIDs to each and make them unique across both sets (you can add text for DOAS and UGA in front, or make the RecordID on one table start at a number like 1000001).
- Add the filename or some other identifier for each set of data.
- Union the tables so the descriptions and RecordIDs are in the same columns.
- Feed them into a Fuzzy Match tool and select "Merge".
- I customized the Match Style on the description column. You need to start Alteryx as Admin to do this. You can also try some of the standard match styles, but I like to play with things.
- Group the results so that if one record generates keys that match the same pair of records more than once, you keep the highest score and eliminate the lower ones.
- Join back to the first set on RecordID.
- Join back to the second set on RecordID2.
- Sort by RecordID and RecordID2, ascending.
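Since you said you are open to other suggestions: if you only need a score for each pair that is already side by side, a rough sketch like the one below (for example in a Python tool or a standalone script) may be enough. It uses `difflib.SequenceMatcher` from the Python standard library, which is not the algorithm the Fuzzy Match tool uses, so the scores will not line up with Alteryx match scores. The `similarity` helper, the sample rows, and the 0-100 scale are all made up for illustration and are not taken from your attached file.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0-100 score for how closely two descriptions match."""
    # Normalize case and whitespace so trivial differences do not lower the score.
    a_norm = " ".join(a.lower().split())
    b_norm = " ".join(b.lower().split())
    ratio = SequenceMatcher(None, a_norm, b_norm).ratio()
    return round(100 * ratio, 1)


# Hypothetical rows: (commodity code, description in file 1, description in file 2).
rows = [
    ("10001", "Copy paper, white, letter size", "Paper, copy, letter, white"),
    ("10002", "Toner cartridge, black", "Black toner cartridge"),
    ("10003", "Office chair", "Laboratory glassware"),
]

# Score every pair and list the weakest matches first for manual review.
scored = [(code, d1, d2, similarity(d1, d2)) for code, d1, d2 in rows]
scored.sort(key=lambda row: row[3])

for code, d1, d2, score in scored:
    print(f"{code}: {score:5.1f}  {d1!r} vs {d2!r}")
```

Sorting ascending puts the least similar pairs at the top, which is usually where the manual checking starts. Note that this kind of character-based ratio is sensitive to word order, so reordered descriptions can score lower than you might expect.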
 
