Let’s talk Alteryx Copilot. Join the live AMA event to connect with the Alteryx team, ask questions, and hear how others are exploring what Copilot can do. Have Copilot questions? Ask here!
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Create Multiple Filters Outputs using the First Letter of a Name

Mzacr
6 - Meteoroid

Hello, 

 

I´m trying to find a way to compare multiple entities from a data set to ensure that they match. For example, if I have: 

Alteryx Company 
Alteryx Company LLC
Alteryx Company Limited Corporation

 

I want to be able to perform a fuzzy match to find out discrepancies and so forth which I know how to do it. The problem is that it takes too long, hours & hours long, as the data set is over 50 thousand records. I think I found a turnaround and it is to separate each record by sorting them into their first name letter. Which I did and it showed me the following results using the summary tool. So, I thought about filtering the data set into 38 different outputs without manually dragging 38 filter tools into the data set. I´ve asked copilot and it tells me that a batch macro can help me but I´ve not yet become an advance user. Can someone please help me? Or advise me a better way to tackle in the problem? 

 

Thank you!

 

 

First Name LetterRecord Count
H2335
S5481
233
G2432
:1
 14
Z225
412
P3036
Â1
M3772
U992
J1080
F2373
Q185
I1524
Y219
E1952
63
137
K1039
L2177
W2093
V1006
O1339
74
92
85
B4301
R2095
A4514
57
T2480
C6272
N2365
X70
D1750
326
4 REPLIES 4
binuacs
21 - Polaris

@Mzacr updated workflow attached, i used your above input for the first input text tool

 

KGT
13 - Pulsar

You might want to look at some other techniques as well. Fuzzy Match is a rather iterative process as you want the algorithms to get as many matches as possible without a false positive.

 

Take a look at the sample under "Help > Sample Workflows > Scripting and Automation > Build a Macro > Merge to a master file with Fuzzy Match". It gives a good run-down of how you match what you can and then match more. 

 

So, in your list of 57k, you want to match what you can and reduce that list size. Once you figure out matches, start adding them to a master list so that you don't need to match them again.

Mzacr
6 - Meteoroid

You´re amazing! Thank you very much for your quick turnaround!

Mzacr
6 - Meteoroid

Understood! Yes, the idea itself is that I have only one data set, from a source. So, I want to ensure that there are no fishy business in the records. So one test was to ensure that similar records don´t have duplicate values and such. I´ll give a try, I believe that sample, follows a cascade fashion, that would be very helpful indeed, the only thing I had a hard time cracking where the embedded macros hahaha. 

Labels
Top Solution Authors