We are celebrating the 10-year anniversary of the Alteryx Community! Learn more and join in on the fun here.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Duplicate rows after using a Join tool

CristinaLiriano
5 - Atom

Hi everyone,

I have a workflow I use to validate if data was entered correctly into our system. In my workflow I am joining the file with the data to be entered to the file that contains an extract of the data entered into our system. In my join output I am seeing duplicates of certain items. I do not want to use a Unique tool because some items are intentionally duplicate. Can someone please assist in figuring out how to eliminate the unwanted duplicates?

 

This is my current output:

CristinaLiriano_1-1646857761182.png

 

 

This would be my desired output

CristinaLiriano_0-1646857690746.png

 

2 REPLIES 2
Luke_C
17 - Castor
17 - Castor

Hi @CristinaLiriano 

 

Rows 2 and 3 of the data are identical. You are getting the extra records due to a 'cross join', essentially the 2 duplicate Keys map to each other, creating 5 in the output (1 record for the unique key, 4 records for the 2 duplicate keys). The solution is to find a secondary field to join on, but there doesn't appear to be another field you can use to join on that would resolve this since the records are total duplicates. The question becomes do you expect these duplicate records in your data?

SergioCamposs
7 - Meteor

Hi @CristinaLiriano 

 

The question is how do you know what is intentionally duplicated. If you have this answer we can do some logic to segregate, however if it is simply something that another system does, without any support it is difficult.

 

Labels
Top Solution Authors