This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
I am trying to demonstrate to some colleagues that Alteryx is the best tool for Fuzzy Matching (as opposed to Kalido MDM, Oracle Text, or others). They are very concerned about Thesaurus matching (the examples they use are matching "William" with "Bill" for names or "Drive" with "Dr" for addresses). I decided to test these specific examples and to my dismay, they FAILED!
So I took a step back and looked at the Fuzzy Match tool documentation:
I decided to test fuzzy matching with Andrew, Andy, and Drew using the Name with Nicknames match style as suggested....
....and Alteryx returned NO MATCHES!!!!
I've attached a workflow showing my failed attempt to match Andrew, Drew, and Andy or Bill with William. I also have there my failed attempt to match "333 Reed Drive" with "333 Reed Dr" with the Address No Suite algorithm.
Please help me figure out how I can get Alteryx to see these as matches! (I have to be missing something if the example in the process documentation doesn't even work!) Also, does anyone know where Alteryx stores the "common Nicknames table" mentioned in the documentation?
P.S. I have already tried lowering the match strength; still no success!
Update: @patrick_digan worked with me offline and discovered that if you edit the Names with Nicknames option to NOT generate keys, all of a sudden Andrew matches with Andy and Drew!! (Note, at least 1 field has to generate keys so we included a dummy field where we gave every record the same last name and chose match style = Exact for that field). Other nicknames worked as well! My conclusion is that the presence of the keys on the Names w/ Nicknames option prevents these Nicknames from matching despite the alleged use of the common nicknames table! Is this an appropriate conclusion? (I hope not because that is the same thing as saying the Nicknames table is not even applied!)
Update: Thanks to @cbridges for notifying me about the webinar! I attended and it added much needed clarity!
In the webinar, @CailinS shared her personal preference to turn OFF key generation when using the Names w/ Nicknames option because the key generation DOES overpower the nicknames table lookup! Perhaps the Alteryx developers kept key generation as part of the Names w/ Nickname option so that the Fuzzy Match tool is less likely to be broken if that's the only field the matching is being performed on (at least one match field must have keys generated). To turn off key generation, click "Edit" next to where you choose the Match Style and change the Generate Keys drop down to "[None]".
If you would like to get a glimpse of the process the Fuzzy Match tool performs behind the scenes, definitely check out the recording of @CailinS's webinar(s) using @cbridges link.