I have a list of manually entered countries that I am fuzzy matching against my master list of countries taken from Wikipedia.
I've defined a source ID and was hoping the output would look as follows:
- my Wikipedia values will appear on the left (as 'country'), and
- manual entries will appear on the right (as 'country2').
However, the left column also seems to contain manually entered values, specifically "ITALY" and "UNITED KINGDOM" in capitals. Why is this the case?
I was then expecting to use the Join tool to compare my original list of manually entered values against the fuzzy match output to see what did not match. But I cannot tell which values in the fuzzy match output are the original ones.
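To illustrate the result I am after, here is a minimal Python sketch of the same idea outside the workflow. It is only an assumption-laden stand-in, not how the fuzzy match tool works internally: it uses the standard-library `difflib`, and the sample names, the `match_to_master` helper, and the 0.8 cutoff are all made up for illustration.

```python
from difflib import get_close_matches

# Hypothetical sample data: master list (Wikipedia) and manual entries.
master = ["Italy", "United Kingdom", "France", "Germany"]
manual = ["ITALY", "UNITED KINGDOM", "Frnace", "Atlantis"]


def match_to_master(manual_entries, master_list, cutoff=0.8):
    """Pair each manual entry with its closest master value, if any.

    Returns (matched, unmatched), where matched is a list of
    (master_value, manual_value) tuples, so the master value is
    always on the left and the manual value on the right.
    """
    # Compare in lower case so "ITALY" can match "Italy".
    lookup = {name.lower(): name for name in master_list}
    matched, unmatched = [], []
    for entry in manual_entries:
        hits = get_close_matches(entry.lower(), list(lookup), n=1, cutoff=cutoff)
        if hits:
            matched.append((lookup[hits[0]], entry))
        else:
            unmatched.append(entry)
    return matched, unmatched


matched, unmatched = match_to_master(manual, master)
for country, country2 in matched:
    print(f"country={country!r}  country2={country2!r}")
print("did not match:", unmatched)
```

Because each pair is built with the master value first, there is never any doubt about which side a value came from, and the manual entries that found no match are collected separately. That is the kind of output I was hoping to get from the fuzzy match plus join.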
Hopefully this makes sense. I've attached my sample workflow.
Saulius