Alteryx Designer Desktop Discussions

jped · ‎02-05-2018

I am looking to flag duplicates but only if they are duplicates from different data sources. In the example below, there are duplicate values in Dataset 1 (50) and duplicate values in dataset 2 (45) that I don't want. The duplicates I care about would be the 40 that appears in dataset 1 and dataset 2. How would I set up 2 datasets to pull only the duplicated values only if they appear in the other dataset.

Dataset 1	Dataset 2
50	45
20	40
40	35
30	45
50	25

Thanks!

NicoleJohnson · ‎02-05-2018

If you join the two datasets together on the field where you are looking for duplicates, the (J) Join branch in the middle should give you a list of values that are duplicated between the two datasets.

Keep in mind that if you had 40 duplicated in Dataset 1, and then it shows up one time in Dataset 2, you would end up with two records in the Join branch - 1 for the first record in Dataset 1 that matched the 40 in Dataset 2... and another for the second record in Dataset 1 that also matched the 40 in Dataset 2. Just something to be aware of while you are finding matches. :)

Hope that helps!

NJ

jped · ‎02-05-2018

That is what i was looking for- thank you!

Alteryx Designer Desktop Discussions

Duplicates from two different datasets

Re: Row creation

Re: How to select columns dynamically using number...

Re: Batch macro to read 1000+ .xlsx files with var...

Re: Issue when using Block Until Done and Power BI...

Example workflow for setting up a custom list to u...