Hi team, I am trying to fuzzy match data from the data sources shown in the table below (a sample of the data I am using, which I think should be matched):
Title | lower_title | Vendor Name | lower_vendor_name |
Shell- 01.01.18- ID 2534 | Shell- 01.01.18- id 2534 | | |
Shell Phillips | Shell phillips | | |
Shell Purchase Order | Shell purchase order | | |
| | Shell PHILLIPS CHEMICAL CO | Shell phillips chemical co |
The Fuzzy Match tool is configured as shown in the attached pictures.
Any tips on how to "join" the fields 'lower_vendor_name' with 'lower_title'?
Many thanks
@tacd ,
To use the Merge mode of the Fuzzy Match tool, the input data needs to have both a "Source ID Field" and a "Record ID Field".
I have attached a sample workflow.
I hope this helps. Good luck.
Workflow
Input Data to Fuzzy Match tool
text | source |
Shell- 01.01.18- id 2534 | title |
Shell phillips | title |
Shell purchase order | title |
Shell phillips chemical co | vendor name |
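For anyone who wants to see the idea outside Alteryx, the reshaping above can be sketched in plain Python. This is only an illustration under assumptions: the field names `record_id`, `source` and `text` and both function names are hypothetical, and `difflib.SequenceMatcher` is a stand-in similarity measure, not the algorithm the Fuzzy Match tool uses. It stacks both columns into one list with a source label and a record ID, then scores only pairs that come from different sources.

```python
from difflib import SequenceMatcher
from itertools import combinations


def stack_sources(rows):
    """Turn wide rows (one column per source) into one long list of records."""
    stacked = []
    record_id = 1
    for row in rows:
        for source, text in row.items():
            if text:  # skip empty cells
                stacked.append(
                    {"record_id": record_id, "source": source, "text": text}
                )
                record_id += 1
    return stacked


def cross_source_pairs(records):
    """Score every pair of records whose sources differ, best score first."""
    pairs = []
    for a, b in combinations(records, 2):
        if a["source"] != b["source"]:
            # Stand-in similarity; not the Fuzzy Match tool's own scoring.
            score = SequenceMatcher(None, a["text"], b["text"]).ratio()
            pairs.append((a["record_id"], b["record_id"], score))
    return sorted(pairs, key=lambda p: p[2], reverse=True)


# Sample values taken from the table in the question.
wide_rows = [
    {"title": "Shell- 01.01.18- id 2534", "vendor name": ""},
    {"title": "Shell phillips", "vendor name": ""},
    {"title": "Shell purchase order", "vendor name": ""},
    {"title": "", "vendor name": "Shell phillips chemical co"},
]

stacked = stack_sources(wide_rows)
pairs = cross_source_pairs(stacked)

for rec in stacked:
    print(rec["record_id"], rec["text"], rec["source"], sep=" | ")
for left_id, right_id, score in pairs:
    print(left_id, right_id, round(score, 2))
```

Titles are never compared with other titles here, which is the point of carrying a source label alongside the record ID.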
One further addition to the process described by @Yoshiro_Fujimori above is to create a Record ID key by concatenating a record ID and the text. This makes it easier to use the Make Group tool afterwards and to connect the results back to the original data. Concatenating the source and the text can sometimes run into issues where you have deliberately made the text "less alike".
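As a rough illustration of the key idea, here is a small Python sketch. The helper names `make_key` and `split_key` and the `|` separator are assumptions made for the example, not anything from the attached workflow. The key carries the record ID in front of the text, so after grouping you can split it again and get back to the original row.

```python
def make_key(record_id, text, sep="|"):
    """Build a single key field from a record ID and its text."""
    return f"{record_id}{sep}{text}"


def split_key(key, sep="|"):
    """Recover (record_id, text); split only on the first separator."""
    record_id, text = key.split(sep, 1)
    return int(record_id), text


# Hypothetical record IDs paired with sample text from the thread.
records = {2: "Shell phillips", 4: "Shell phillips chemical co"}

keys = [make_key(rid, text) for rid, text in records.items()]
recovered = [split_key(k) for k in keys]

print(keys)
print(recovered)
```

Splitting only on the first separator means the text itself may contain the separator character without breaking the round trip.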
This is the video by Chris Love that I always used to give to clients learning about fuzzy matching. It's a little old now, so don't mind the old icons etc.
https://www.youtube.com/watch?v=35aNGzJ5kYY
The Alteryx sample also uses some simple methods (Right Click > Open Example).
One other resource I used to rely on to show how detailed large Fuzzy Match processes can get is found under "Help > Sample Workflows > Use scripting and automation tools > Build a macro > Merge to master file with fuzzy match".