Hello Guys,
I have number of rows of data (representative purpose) as shown,
Col_1 | Col_2 | Value |
A | B | 0.45 |
B | A | 0.45 |
A | C | 0.87 |
B | C | 0.34 |
C | B | 0.34 |
And I want the output to consider A & B 0.45 to be same as B&A 0.45 and eliminate one of the row(preferably second. i.e. B & A. So the expected output would be like this.
Col_1 | Col_2 | Value |
A | B | 0.45 |
A | C | 0.87 |
B | C | 0.34 |
Please help, how to do this in Alteryx.
Solved! Go to Solution.
Hi,
I have mocked up the workflow for you:
The output:
Please mark as a solution if it was helpful!
Good Luck!
Thank you @Qiu , but my actual data has 685 columns and more than 480k rows, so changing their name is not an option for me as you have done with record ID then changed it back to col_1 and col_2.
Can you suggest me any other way?
Also the rows which are duplicating may not necessarily be at position +1 row or -1 row. Since I have 480k rows of data, first row may have A+B and last row B+A, for such case I don't think your solution can be used. Please help with your thoughts on this.
@suraj_ford
Instead of Select Tool, we can use Dynamic Rename Tool to name the columns
Hi @suraj_ford,
I needed to prepare another workflow for you.
I believe it should work for your data set:
Thank you @Emil_Kos . That's a great way to do it.
Hi @suraj_ford,
If this post was helpful please mark it as an answer.
One topic can have more than 1 post marked as an answer.
Thank you for a kind word and good luck!