This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
02-02-2017 05:34 PM - edited 07-23-2021 12:02 PM
Similar to the Excel Fuzzy Lookup, the Fuzzy Match Tool(see it in action here) makes it easy for a user to perform inexact matches in their data. By specifying similarity thresholds, utilizing varying matching algorithms, and specifying other configuration options, you can customize the tool to best fit your data set. Due to the high degree of customization in the tool, we recommend ramping up to speed with our introductoryand intermediate live training videosif more complex applications of the tool are anticipated. We also have a list of frequently asked questionsand Fuzzy Matching Tips and Tricksthat can supplement your use of the tool as well!
The Fuzzy Match Tool provides some pretty amazing flexibility for string joins with inexact values – usually in the case of names, addresses, phone numbers, or zip codes because many of the pre-configured match styles are designed around the formats of those types of string structures. However, taking advantage of the custom match style and carefully configuring the tool specific to human entered keyword strings in your data can also allow you to use the loose string matching feature of the tool to match those values to cleaner dictionary keyword strings. If done properly, it can help you take otherwise unusable strings and, matching by each individual word, recombine your human entered data to a standardized format that can be used in more advanced analyses:
In the attached v10.6 workflow, Fuzzy Keyword Match.yxmd, we provide an example of the technique, which can be replicated for most strings using the steps below:
Following these steps should make it possible to make even the most error-filled strings more useable in sentiment or keyword analyses, refine text to be more readable for reporting, and provide you with ample evidence of being a miracle worker. Good luck!
Matt,
Just to let you know I love two things.
1) Your explanation of fuzzy matching practices.
2) Your use of Kanye West.
Treyson
I like your pic man!
HI Matt
I love this:-)
//anitta
Excellent!
Good