Hello - I am looking to cluster categorical (qualitative) data and have never done this before. I have a data set with 500K+ rows of bill of materials data in which every Finished Good is mapped to each of its Subcomponents, as in the example below.
| Finished Good | Component |
| --- | --- |
| 5S4Y | 56-9A |
| 5S4Y | 559-0Y |
| 5S4Y | 14-56-AB |
| 56-SY4-9 | 56-9A |
| 56-SY4-9 | 559-0Y |
What I am looking to do is identify groups of similar finished goods based on the Components they share.
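To make "similar" concrete, here is a rough Python sketch of what I have in mind. It assumes similarity means Jaccard overlap of component sets (shared components divided by all components across the two goods), which is only my guess at a sensible measure, not something I know to be the right choice. The function names are made up for illustration, and the rows are just the five from the example above.

```python
from collections import defaultdict
from itertools import combinations

# Rows copied from the example table: (finished_good, component)
bom_rows = [
    ("5S4Y", "56-9A"),
    ("5S4Y", "559-0Y"),
    ("5S4Y", "14-56-AB"),
    ("56-SY4-9", "56-9A"),
    ("56-SY4-9", "559-0Y"),
]

def component_sets(rows):
    """Collect the set of components used by each finished good."""
    sets = defaultdict(set)
    for finished_good, component in rows:
        sets[finished_good].add(component)
    return dict(sets)

def jaccard(a, b):
    """Jaccard similarity: size of intersection over size of union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def pairwise_similarity(rows):
    """Jaccard similarity for every pair of finished goods (assumed metric)."""
    sets = component_sets(rows)
    return {
        (x, y): jaccard(sets[x], sets[y])
        for x, y in combinations(sorted(sets), 2)
    }

similarities = pairwise_similarity(bom_rows)
for pair, score in similarities.items():
    print(pair, round(score, 3))
```

Comparing every pair like this grows quadratically with the number of finished goods, so for the full 500K+ rows I assume it would need something smarter than brute force; the sketch is only meant to show the kind of similarity I am after.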
Any advice on which type of clustering algorithm I should use in Alteryx?
Thanks