Alteryx designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.

Measuring distance and/or proximity between two strings

Highlighted
6 - Meteoroid
Hello fellow artisans,

I have two columns in csv format. Both are strings. I would like to measure the similarity using the fuzzylogic node in Alteryx. I only need two compare the strings from the same row. Is there a way to do so using using what we have in Alteryx?  If it can be dome, may I ask to write step by step instructions on how to do it?

Below is what the file would look like:

Row number....String1, String2
Row 1....Mike,Michael
Row 2...Joe, Jose A
etc....

For Row 1, I would like to compare Mike and Michael "only" and have Alteryx show percent similarity (or distance measure...) between the two strings.

Apreciate your help, Paolo
Highlighted
Alteryx Alumni (Retired)
Hi Paolo,

The key here is to create group keys for each record that you would like to match.  In order to compare Mike to Michael, they would first need to be grouped together.  Once grouped, you can set your Fuzzy Match tool to Jaro Distance for your name field, then do an Exact match on the group (that way it only compares the groups, or like names together).  

I've emailed the solution to you, but if anyone else is interested please let me know at cmartin@alteryx.com.

Thanks!

Chad
Highlighted
Alteryx Alumni (Retired)

UPDATE: Workflow attached. 

Labels