Hello,
I have performed a reconciliation on 2 datasets, and have come across 1 issue with some false breaks.
Where 1 side contains 1/2 latin/special characters while other side has full english. (Using UTF-8, have tried every other as well, UTF-8 came out as best)
For Example:
George Fernandes George Ferñandes coming as a break.
Any was to apply a matching % here to figure out high match rows. Tried fuzzy match, but it's too complex and not giving right.
Around 400 rows of data, and around 25 columns, out of which around 8 pair of columns containings strings like addresses are getting compared, and multiple instances of diverse latin/weird/special characters leading to mismatches.