Hello,
I understand that there are two modes in the fuzzy matching tool: purge mode (all records are compared) and merge mode (only records from different sources are compared).
I have many sources (thousands) and would like to match records only WITHIN the same source. Is there any way to do that?
Sample input:
| source | name |
| --- | --- |
| 1 | Xicheng Chang |
| 1 | Xicheng chang |
| 2 | Nicholas Smith |
| 3 | Nicholas Smith |
| 4 | James Page |
| 4 | James Bateman |
| 4 | Stephanie A Picano |
| 4 | Stephanie Ann Picano |
| 5 | Xicheng Chang |
Sample output:
| source | name |
| --- | --- |
| 1 | Xicheng Chang |
| 2 | Nicholas Smith |
| 3 | Nicholas Smith |
| 4 | James Page |
| 4 | James Bateman |
| 4 | Stephanie Ann Picano |
| 5 | Xicheng Chang |
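
In case it helps clarify what I am after, here is a rough Python sketch of the behaviour, not of how the tool works internally. It uses `difflib` from the standard library. The function names, the 0.85 similarity threshold, the case-insensitive comparison, and the rule of keeping the longest name in each matched group are all my own assumptions for illustration.

```python
from difflib import SequenceMatcher


def similar(a, b, threshold=0.85):
    # Case-insensitive similarity; the 0.85 threshold is an assumed value.
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold


def dedupe_within_source(records, threshold=0.85):
    # Group names by source so records are only ever compared
    # with other records from the same source.
    groups = {}
    for source, name in records:
        groups.setdefault(source, []).append(name)

    result = []
    for source, names in groups.items():
        clusters = []
        for name in names:
            # Attach the name to the first cluster whose first member it resembles.
            for cluster in clusters:
                if similar(name, cluster[0], threshold):
                    cluster.append(name)
                    break
            else:
                clusters.append([name])
        # Assumed survivor rule: keep the longest name (first one on ties).
        for cluster in clusters:
            result.append((source, max(cluster, key=len)))
    return result


records = [
    (1, "Xicheng Chang"),
    (1, "Xicheng chang"),
    (2, "Nicholas Smith"),
    (3, "Nicholas Smith"),
    (4, "James Page"),
    (4, "James Bateman"),
    (4, "Stephanie A Picano"),
    (4, "Stephanie Ann Picano"),
    (5, "Xicheng Chang"),
]

deduped = dedupe_within_source(records)
for source, name in deduped:
    print(source, name)
```

The key point is that the source acts as a grouping key: "Nicholas Smith" in sources 2 and 3 and "Xicheng Chang" in sources 1 and 5 are never compared with each other, so they all survive.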
Sincerely,
Kazumi