Fuzzy Matching - Two data sources by address to bring back match record?
Hello -
I have a customer list and a hospital roster list, and I'd like to do a fuzzy match based on address (excluding name, because naming conventions get wonky). Is it possible to fuzzy match on address and then bring back the "Definitive ID" associated with each customer? There likely won't be a match for every account. I tried pulling something together, but I'm not getting the result I'm after. Any help would be appreciated!
The expected result would be the Customer List with the matching Definitive ID added.
Hi @KFarmer
How does this look? I've created a source column and unioned the two datasets together before fuzzy matching at a 90% threshold (the screenshot shows the setup). I then join the matches to the Hospital Roster to obtain the Definitive ID, and join that back to the Customer List.
It's a bit rough, but hopefully it gives you something to work with. Ideally you would start with a really high threshold, such as 96-98%, and run through multiple fuzzy matches, reducing the threshold as you go. I set it at 90% because that brought through your example above correctly.
Note that duplicate matches do appear, and their number increases with each join, so look out for that.
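For anyone who wants to prototype the same idea outside the workflow, here is a minimal Python sketch of the approach described above. It is not what the Fuzzy Match tool does internally: it assumes the standard-library `difflib.SequenceMatcher` ratio as a stand-in similarity score, and the function names, the `(id, address)` tuple layout, and the sample records are all hypothetical. It runs the passes from the highest threshold down and keeps only the single best roster match per customer, which sidesteps the duplicate matches mentioned above.

```python
from difflib import SequenceMatcher


def normalize(address):
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in address.lower()
    )
    return " ".join(cleaned.split())


def similarity(a, b):
    """Score in [0, 1]; an assumed stand-in, not the Fuzzy Match tool's score."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_definitive_ids(customers, roster, thresholds=(0.98, 0.96, 0.90)):
    """Map each customer ID to (definitive_id, score, pass_threshold) or None.

    customers: list of (customer_id, address) tuples (hypothetical layout)
    roster:    list of (definitive_id, address) tuples (hypothetical layout)
    """
    matches = {customer_id: None for customer_id, _ in customers}
    # Strictest pass first, then progressively looser passes.
    for threshold in sorted(thresholds, reverse=True):
        for customer_id, customer_address in customers:
            if matches[customer_id] is not None:
                continue  # already matched in a stricter pass
            best_id, best_score = None, 0.0
            for definitive_id, roster_address in roster:
                score = similarity(customer_address, roster_address)
                if score > best_score:
                    best_id, best_score = definitive_id, score
            # Keep only the single best roster record, so no duplicates.
            if best_id is not None and best_score >= threshold:
                matches[customer_id] = (best_id, best_score, threshold)
    return matches


# Hypothetical sample records, for illustration only.
customers = [("C1", "123 Main St., Suite 4"), ("C2", "999 Nowhere Rd")]
roster = [("D100", "123 main st suite 4"), ("D200", "45 Oak Avenue")]
result = match_definitive_ids(customers, roster)
```

Customers with no roster address at or above the lowest threshold stay as `None`, which matches the expectation that not every account will have a match. Because only the best match is kept, the assignments are the same as a single pass at 90%; recording the pass threshold simply makes it easy to review the lower-confidence matches separately.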
Does that help?
Ah, that makes sense! Thanks so much!
