Hello,
Can someone please advise me. I´ve been working on analyzing a data set which contains over 50 thousand records with multiple columns. One goal is to compare names, for example;
Alteryx Company |
Alteryx Company LLC |
Alteryx Company Limited Corporation |
I would like to find out if different names have different ids or duplicate ones. The issue is that when I perform the fuzzy match, it takes way too long. Hours even.
So, one turnaround I thought of is to divide the data set into categories, which contain the first letter of their name. Like sorting them. As names that start with "C" are too different to even bother comparing them with names that start with "W".
I used a formula and summarized tool to find out that the data set can be grouped by into 38 different categories. (Some names start with numbers and such)
So, my problem is how do I filter out the data set into the 38 different outputs, without manually dragging 38 different filter tools. Is that possible? Or someone has a better advise? I´ve asked copilot and it told me that batch macro could be used, but I´m not well versed yet into how to do it.
Thank you in advance for all your help!!
Best regards,
Esteban
Solved! Go to Solution.
@Mzacr Here is the batch macro which will check all the names starts with a specific letter and write the output into separate sheets based on the letter