Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.
Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Fuzzy Match Reporting

Todd_Sturner
5 - Atom

I’m puzzled on how to report some fuzzy match data.  I know what I would like to do, but I don’t know what the term is to search for.  I've attached the Excel file of my fuzzy match results (without Name/Address data as I show below)...

 

What I would like to do is this:

 

Display the UniqueID Record on one row in Excel, then display the UniqueID2 records, and their match scores, on subsequent rows in Excel, then rinse/repeat for the next UniqueID record.  Essentially:

 

UniqueID1           Name                   Address

UniqueID2           Name                   Address                MatchScore        MatchScore_NAME               MatchScore_Address

UniqueID2           Name                   Address                MatchScore        MatchScore_NAME               MatchScore_Address

UniqueID1           Name                   Address

UniqueID2           Name                   Address                MatchScore        MatchScore_NAME               MatchScore_Address

UniqueID1           Name                   Address

UniqueID2           Name                   Address                MatchScore        MatchScore_NAME               MatchScore_Address

UniqueID2           Name                   Address                MatchScore        MatchScore_NAME               MatchScore_Address

UniqueID2           Name                   Address                MatchScore        MatchScore_NAME               MatchScore_Address

 

And so on…

 

I know I can get the Name/Address data from using joins; however, I want to format the output in the way I'm showing above.

 

Does anyone have any ideas?

2 REPLIES 2
danilang
19 - Altair
19 - Altair

Hi @Todd_Sturner 

 

I think this is what you're looking for.

 

danilang_2-1634383525548.png

 

It uses a couple of Multi-Row Formula tools to number the UniqueID1 with a gap of 100 between each and then number each UniqueID2 sequentially within each group.  Use a select tool to rename the UniqueID2 and ID2 fields to UniqueID1 and ID1 and a Summarize tool to get the unique list of UniqueID1s.  Union the 2 datasets and sort on ID1.  

 

the results look like this with a record for the UniqueID1 value, followed by one record record for each of the matching UniqueID2 rows

danilang_1-1634383438849.png

 

Dan

 

Todd_Sturner
5 - Atom

Hi, Dan - that worked with a bit of tweaking (as I copied the items into my Alteryx solution).  I found that the ID2 field wasn't changing to match/count with the ID1 field (example: I was getting 101, 102, 103, 104... instead of 101, 102, 201, 202, 203, 301, 401, 402...).  I was able to fix this by adjusting the expression in the second multi-row formula:

 

if ([Row-1:ID1] = 0) OR ([Row-1:ID1] != ID1) then
ID1+1
else
[Row-1:ID2]+1
endif

 

After that, it worked!  Thank you so much for the help!!

Labels
Top Solution Authors