Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

How to remove duplicate content in a specific data box? pls see details

ryanzhang487
8 - Asteroid

hello community, I would want to remove the duplicate contents in some of my data box, the content are separated by comma (pls see excel workbook attached, eg: A,A,A needs to change to A,  or  A,B,B,C,C needs to change to A,B,C)

How do I do that in alteryx please?

Thank you

 

5 REPLIES 5
SPetrie
13 - Pulsar

I would transpose it using the product# as the key column. Then use a text to column to split it to rows. A unique tool to remove the duplicate values and then cross tab it back into place.,

 

1.PNG2.PNG

ryanzhang487
8 - Asteroid

thank you for your reply, I have another question, my data set has characteristic like " 10,000lb, 10,000 lb  "

there is thousands delimiters in a number and a unit at the end, if I were to use your solution, it would auto use all comma to split the data which is not very correct.

how do we deal with this case? would you please have a solution for this

thank you very much

SPetrie
13 - Pulsar

You may be able to do a replace formula and replace a comma followed by a space (",  ") with | and then use | as the delimiter. 

Without seeing how the data actually looks, it will be more difficult to work around that limitation.

Can you share an example of how the data actually looks? That may make it easier to help with a solution.

ryanzhang487
8 - Asteroid

it looks something like this, excel attached, thank you

SPetrie
13 - Pulsar

Thanks! I think the replace method is going to be the best bet with that.

I updated the example flow with that in place.

1.PNG2.PNG

Labels