Hello, I'm a new to Alteryx and still learning the tool, can you help me with following task?
I have a list of the data divided into categories and subcategories (each row have its unique ID), which I want compare to the given list of categories and subcategories. The task is to check if a pair of category and subcategory match the list (that is easy to do with Join tool), but if there is no exact match available, then, Based on the exact match of the categories, suggest the closest match for subcategory if there is any similar value in the reference table.
Additionally, it is nice to have the evaluation how close the best match is to the given value (i.e. 90%)
The sample list is here:
| ID | Category | Subcategory |
| 1 | Classic | Symphony |
| 2 | Jazz | Acid Jazz |
| 3 | Rock | Death and Roll |
| 4 | Rock | Hard Rok |
| 5 | Jazz | Jazz |
The table with reference data:
| Category | Subcategory |
| Classic | Chamber |
| Classic | Opera |
| Classic | Symphonic |
| Jazz | Acid |
| Jazz | Classic |
| Jazz | Fusion |
| Jazz | Modern |
| Rock | Death 'n' Roll |
| Rock | Hard Rock |
Desired output:
| ID | Category | Subcategory | Closest match (Subcategory) |
| 1 | Classic | Symphony | Symphonic |
| 2 | Jazz | Acid Jazz | Acid |
| 3 | Rock | Death and Roll | Death 'n' Roll |
| 4 | Rock | Hard Rok | Hard Rock |
| 5 | Jazz | Jazz | |
I thought about fuzzy match tool, but don't know how to properly configure it. Or maybe is is not the right tool for the task?