In case you missed the announcement: Alteryx One is here, and so is the Spring Release! Learn more about these new and exciting releases here!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Fuzzy matching a name within a paragraph

mzsweetumz
8 - Asteroid

Hello Everyone,

 

I need some assistance with a fuzzy matching I am trying to do.

 

I've done some standard fuzzy matches on name and addresses but am struggling to do a match between two files based on a 'name' value separated by a vertical bar.

 

1st file: Names

fuzzy_1.JPG

 

2nd file: Information containing the 'Name' but is separated by vertical bars.

fuzzy_2.JPG

 

I am trying to do a fuzzy match to produce a match score based off the 'name' in both files. In the second file, I bolded 'Cisco' and 'Tripex'.

 

One of the files has 5 million rows...

 

Any help is appreciated!

5 REPLIES 5
atcodedog05
22 - Nova
22 - Nova

Hi @mzsweetumz 

 

I feel fuzzy match might not be the best tool this (then again its only me 😅 maybe others have better suggestions)

 

I would approach this with a find and replace where I would lookup and check whether Manufacturer info contains Manufacturer Name. Here is a learning resource on find and replace tool.

https://community.alteryx.com/t5/Interactive-Lessons/VLookUps-with-Designer/ta-p/80201

 

Give it a try if you are facing any issues let us know.

 

Hope this helps : )

ArtApa
Alteryx
Alteryx

Hi @mzsweetumz - I think that you don't need Fuzzy Matching for this specific use case. Find Replace will do a better job for you.

 

ArtApa_0-1648514177982.png

 

mzsweetumz
8 - Asteroid

Hi @atcodedog05 and @ArtApa ,

 

Thank you! The find and replace works wonders and is a really cool tool that I did not know about.

 

However, I am still stumped - as I didn't give enough background info on my two files. I am trying to match two files to find which 'responses' has the business 'name' in it.

 

The file with the business 'names' has 5 million rows... and the file with the business 'responses' only has 150 rows.

 

I am trying to match the business name and business responses associated with a person (their name), but that's not something I can join on.

 

TiffanyLy27_0-1648561867828.png

 

 

TiffanyLy27_1-1648561912245.png

 

This output looks right, however, I don't think it is as it's not linked to the same person.

 

any help is appreciated!

 

ArtApa
Alteryx
Alteryx

Hi @mzsweetumz - Can you please show how a desired output should look like?

mzsweetumz
8 - Asteroid

Hi @ArtApa , @atcodedog05 

 

TiffanyLy27_0-1649025529137.png

 

Something like this.

 

I have done other fuzzy matches for different match types and will eventually union it all into one big spreadsheet.

 

Labels
Top Solution Authors