Hi, I have around 8 million data and one of the field need to be matched with another dataset that contain 21 records as the field has a misspelling. However, it took much longer that I thought. I ran it 10 hours ago and it only settle up until 818000. Do any of you know how to quicken the fuzzy match?
Thank you.
Hi @faiqz,
Have you tried using the AMP Engine? You can turn this feature on in the "Runtime" section of the workflow's Configuration, and it could help improve your run speed.
Hi @faiqz !
I second @jbichachi003's suggestion on the AMP engine if you aren't using that already.
I also wanted to add that I recently used some of the tips in this article to optimize one of my Fuzzy Match workflows:
This section has some great tips for optimizing processing time:
7. Optimizing fuzzy matching processing time:
Also, the suggestion to "use a join to remove any exact matches from the fuzzy match process" was especially helpful to my use case.
Take a look and let us know if any of these suggestions work for you.
Thanks,
Deb