Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

How I can Improve a Fuzzy match Score

ByranCarter12
7 - Meteor

Hello, 

 

I am trying to compare adresses each other with using Fuzzy Match Tool. 

 

you can see the result below :

 

AdressAdress_matchingIdMatchScoreMatchScore_Adress
605 3rd Avenue605 Third Avenue200
2203 Rowan Street2203 Rowen St38585
1550 W Isaac DR1550 West Isaac DR49191
340 S Patterson DR340 South Patterson DR58484
1555 W 3rd ST1555 Third Street600
2310 S Winslow CT2310 Sth Winslow CT79696
1110 S Covenanter DR1110 Sth Covenanter DR88989
2430 S Brown AVE2430 Sth Brown Avenue99191
1918 W 3rd ST1918  Third Street1000

 

So, When the adress is written like 'Third' instead 3rd, the result return the value 0, even if the two adress are similars.

 

I've used this paramaters to do this fuzzy match 

 

Vanderleck16_0-1625576772074.png

 

So wich options can improve the result when there is a number written in word (1th ==>first , 3rd ==> Third ....). 

 

I've also an another question about the Fuzzy Match Tool, how i can avoid to having duplicates in the result, i give you this example :

 

IdId2MatchScoreMatchScore_Adress
723723100100
7239538282
7239538282
953953100100
6916919191
6919617272
6919617676

 

The Tool calculate the score with others Id and I don't know why. 

 

Thank you for your help.

 

 

 

3 REPLIES 3
ArtApa
Alteryx
Alteryx

Hi @ByranCarter12 - Fuzzy matching is an art. It is a very iterative process. Here are my comments:

 

  1. You may Need to prepare your data set first. In your particular scenario I converted "3rd" to "Third".
  2. Match everything that can be matched before Fuzzy matching
  3. Start with the default Match Style (Address)
  4. The longer the Max Key Length the lower the match score. For "Address" the default length is 8. You may want to reduce it to 6 or 4 to improve the score.
  5. The Unique tool will remove duplicates.

Here is how your solution may look like:

ArtApa_0-1625613863847.png

 

You may want to watch the following videos to learn more about Fuzzy Matching in Alteryx:

 

Enjoy!

ByranCarter12
7 - Meteor

@ArtApa 

 

Thank you, I appreciate your help.

 

I also had this idea to replace some abbreviations to improve the score. But when it comes to words like (third, first, fifth...), it might be long and boring to list all the possibilities to replace. There is no way in the options to ignore ordinal values.

 

In addition, the result obtained is 0 while there are similarities between the two addresses.

 

ArtApa
Alteryx
Alteryx

Hi @ByranCarter12 - Well, if you are to solve a business problem then nothing is boring. Building a lookup table is a valid approach. See example here: https://community.alteryx.com/t5/Weekly-Challenge/Challenge-235-Spell-It-Out-For-Me-Part-2/td-p/6409...

 

The way the Address Match Style works it ignores: Ave, St, Rd, Blvd etc. If you want to change the style to Name, then the Score will jump up for this particular record, but you may have a lower case somewhere else:

 

ArtApa_0-1625646549369.png

 

That's why I wrote, it's an art. This is why it's called Fuzzy.

 

Labels