Hi everyone! I am trying to do a fuzzy match between two columns, but it takes too long to run. I have to put all the rows in the same column and then perform the fuzzy match, even though I only need the matches between rows that have the same ID.
I have 1,000 rows, each with an identifier. If I group the rows into one column and then perform the fuzzy match, it gives me 1,000,000 combinations, but I am only interested in the ones that have the same ID, so it is a complete waste of time.
I attach an image of a sample of my dataset, once fuzzy match has been performed.
Thanks for your reply. I do have an ID column, but the problem is that I would like to tell Fuzzy Match which rows to match. Otherwise it matches all the possible combinations, and I then have to filter out the ones I am interested in. I attach the input, in case you can help me do this more efficiently than I have so far.
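To make the idea concrete outside the Fuzzy Match tool, here is a minimal Python sketch of "only compare rows that share an ID". It is not how the tool works internally. It assumes the data can be read as (ID, text) pairs, and it uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity score. The function name `fuzzy_match_within_ids`, the 0.8 threshold, and the sample rows are all made up for illustration.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def fuzzy_match_within_ids(rows, threshold=0.8):
    """Score only pairs of rows that share an ID.

    rows is an iterable of (row_id, text) tuples (an assumed layout).
    Returns a list of (row_id, left_text, right_text, score) tuples.
    """
    # Bucket the texts by ID so rows with different IDs are never compared.
    groups = defaultdict(list)
    for row_id, text in rows:
        groups[row_id].append(text)

    matches = []
    for row_id, texts in groups.items():
        # Each unordered pair inside one ID group is scored exactly once.
        for left, right in combinations(texts, 2):
            score = SequenceMatcher(None, left.lower(), right.lower()).ratio()
            if score >= threshold:
                matches.append((row_id, left, right, round(score, 3)))
    return matches


# Hypothetical sample data, not taken from the attached input.
sample = [
    (1, "Acme Corporation"),
    (1, "Acme Corporatoin"),
    (2, "Globex Ltd"),
    (2, "Initech"),
    (3, "Acme Corporation"),
]
result = fuzzy_match_within_ids(sample)
for match in result:
    print(match)
```

With this sample only two pairs are scored (one in ID 1 and one in ID 2) instead of every combination of the five rows. The row under ID 3 is never compared with the identical text under ID 1, because the IDs differ.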