Hi, i have a company master of more than thirty thousands Company records, the company master data contains company names and other company details. However there is a possibilities that many company names are very similar and on investigation found same.
I have tried using Fuzzy Match tool as well but because the master data contains more than 30 thousands company name records using fuzzy match increasing the output records into millions, therefore i am afraid if i can use the Fuzzy Match tool. Therefore using Alteryx I want to identify the potential duplicate names of the company which are having very similar names and could be potentially the same, please help me how can i achieve this.
Solved! Go to Solution.
It is highly recommended that you first remove any perfect matches from your dataset to reduce the scope of the Fuzzy Match. Next find any other fields that may be used to connect records (e.g. State) and set them to match exactly in the Fuzzy Match Tool.
Hope this helps and Happy Solving!