Fuzzy String Matching between columns from different datasets
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi All,
I try to match between two string columns from different datasets. The two columns do not contain exact values to match, only a certain part is matching like below. I want to match "origin_city" with "standardized_city_name", and if matching, replace values of "origin_name" to value of "standardized_city_name"
dataset 1 : target data
dataset 2: source data
I already tried "Find and Replace" tool but it was not working. Can someone help to address a way to tackle the problem? thank you.
Solved! Go to Solution.
- Labels:
- Developer
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Check this Video https://community.alteryx.com/t5/Videos/Fuzzy-Matching-for-Beginners/td-p/330575
This may help you.
Thanks,
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi @mpham3000 ,
Here an example showing how to build your fuzzy match method to work.
Best,
Fernando Vizcaino
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi fmvizcaino, thank you for your solution, but the workflow that you attached is not complete as the joint node is missing "origin_city", could you check and re
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi @mpham3000 ,
Solution reattached. I'm not sure why you are getting this error, I downloaded the workflow and it is working as it should.
Best,
Fernando Vizcaino
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Yes it is working now, thank you very much. 🙂
