This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
The latest release includes several enhancements designed to improve your Community experience!
Learn More
on
02-02-2017
05:34 PM
- edited on
03-08-2019
01:05 PM
by
SydneyF
Similar to the Excel Fuzzy Lookup, the Fuzzy Match Tool (see it in action here) makes it easy for a user to perform inexact matches in their data. By specifying similarity thresholds, utilizing varying matching algorithms, and specifying other configuration options, you can customize the tool to best fit your data set. Due to the high degree of customization in the tool, we recommend ramping up to speed with our introductory and intermediate live training videos if more complex applications of the tool are anticipated. We also have a list of frequently asked questions and Fuzzy Matching Tips and Tricks that can supplement your use of the tool as well!
The Fuzzy Match Tool provides some pretty amazing flexibility for string joins with inexact values – usually in the case of names, addresses, phone numbers, or zip codes because many of the pre-configured match styles are designed around the formats of those types of string structures. However, taking advantage of the custom match style and carefully configuring the tool specific to human entered keyword strings in your data can also allow you to use the loose string matching feature of the tool to match those values to cleaner dictionary keyword strings. If done properly, it can help you take otherwise unusable strings and, matching by each individual word, recombine your human entered data to a standardized format that can be used in more advanced analyses:
In the attached v10.6 workflow, Fuzzy Keyword Match.yxmd, we provide an example of the technique, which can be replicated for most strings using the steps below:
Following these steps should make it possible to make even the most error-filled strings more useable in sentiment or keyword analyses, refine text to be more readable for reporting, and provide you with ample evidence of being a miracle worker. Good luck!
Matt,
Just to let you know I love two things.
1) Your explanation of fuzzy matching practices.
2) Your use of Kanye West.
Treyson
I like your pic man!
HI Matt
I love this:-)
//anitta
Excellent!
Good