Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Conditional Duplicates

rrafferty2
8 - Asteroid

I need to remove (or flag) duplicates that meet certain criteria.

For example, I split my data set into 2 groups, group 0 and group 1

How do I remove (or flag) the duplicates in group 1 which are dups of group 0

 

Thanks

Robb

4 REPLIES 4
AbhilashR
15 - Aurora
15 - Aurora

Hi @rrafferty2, does the attached solution get you what you are looking for? Give it a try on your larger dataset and let us know if it addresses your ask. I have included some comments in the workflow itself to give an overview of the operations.

AbhilashR_3-1588209354074.png

 

Source Data:

AbhilashR_0-1588209199416.png

De-Duped Dataset:

AbhilashR_1-1588209217136.png

Let us know if this doesn't work.

rrafferty2
8 - Asteroid

Here is my test with your workflow.

Yes, it works!

Thanks You

Robb

rrafferty2
8 - Asteroid

What if the data is not sorted for Multi-row formula?

 

what if duplicates in group 1 are randomly dispersed above and below the first instance in group0?

 

Thanks

Robb

AbhilashR
15 - Aurora
15 - Aurora

Given the multi-row formula does a row-by-row operation, Alteryx will not be able to help you identify duplicates if you don't bring the numerical values together (hence the sort tool).

 

If they are dispersed, the records that are actually duplicate will satisfy the Not Duplicate logic in the multi-row formula tool, causing incorrect results.

 

That was a very wordy explanation, so let me know if it didn't make sense. 

Labels