I have data that looks like this:
1 | s-lights | casuals | bungee-shoes | casuals | hook-and-loop-shoes | casuals | light-up-shoes | sneakers | bungee-shoes | sneakers | hook-and-loop-shoes |
2 | usa-casuals | casuals | comfort-shoes | dress-casuals | loafers | dress-casuals | slip-on-shoes | ||||
3 | s-lights | casuals | lace-up-shoes | casuals | light-up-shoes | sneakers | lace-up-shoes |
For each row, I would like to compare the values across columns. If a value is repeated in a later column, I would like to keep its first occurrence and remove the later duplicates. The highlighted values are the ones that are duplicated within a row across different columns. The yellow highlighted values are the ones to keep, and the red highlighted values are the ones to remove.
The expected result should look like:
1 | s-lights | casuals | bungee-shoes | hook-and-loop-shoes | light-up-shoes | sneakers | |||||
2 | usa-casuals | casuals | comfort-shoes | dress-casuals | loafers | slip-on-shoes | |||||
3 | s-lights | casuals | lace-up-shoes | light-up-shoes | sneakers |
I would really appreciate it if someone could show me how to do this type of cleanup in Alteryx!
Are you trying to retain the column order and the blank fields? I think a transpose, unique, and a crosstab tool should do the trick.
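For anyone who wants to see the logic outside of Alteryx, here is a minimal Python sketch of what the transpose, unique, and crosstab steps would do to the sample data. This is not Alteryx configuration: the function names `transpose`, `unique`, and `crosstab` are hypothetical stand-ins for the tools, and treating the first column as a record ID and blank fields as empty strings are assumptions made for illustration.

```python
# Sample rows keyed by record ID; empty strings stand in for blank fields.
rows = {
    1: ["s-lights", "casuals", "bungee-shoes", "casuals", "hook-and-loop-shoes",
        "casuals", "light-up-shoes", "sneakers", "bungee-shoes", "sneakers",
        "hook-and-loop-shoes"],
    2: ["usa-casuals", "casuals", "comfort-shoes", "dress-casuals", "loafers",
        "dress-casuals", "slip-on-shoes", "", "", "", ""],
    3: ["s-lights", "casuals", "lace-up-shoes", "casuals", "light-up-shoes",
        "sneakers", "lace-up-shoes"],
}


def transpose(wide):
    """Wide to long: one (record_id, position, value) tuple per non-blank cell."""
    return [
        (rid, pos, val)
        for rid, vals in wide.items()
        for pos, val in enumerate(vals)
        if val  # drop blank fields
    ]


def unique(long_rows):
    """Keep only the first occurrence of each (record_id, value) pair."""
    seen = set()
    kept = []
    for rid, pos, val in long_rows:
        if (rid, val) not in seen:
            seen.add((rid, val))
            kept.append((rid, pos, val))
    return kept


def crosstab(long_rows):
    """Long back to wide: group values by record ID in original column order."""
    wide = {}
    for rid, _pos, val in sorted(long_rows):
        wide.setdefault(rid, []).append(val)
    return wide


result = crosstab(unique(transpose(rows)))
for rid, vals in result.items():
    print(rid, "|", " | ".join(vals))
```

Note that this sketch shifts the surviving values to the left, so the original column positions and the blank fields are not preserved.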
Hi @acastelazo, I would prefer to retain the column order, and I can lose the blank fields. That said, I think it would work for me even if the column order is different.
Thank you for your guidance! This solution works :)