

Join tool duplicate confusion

Alteryxuserhere
8 - Asteroid

Hi there

Can someone explain in simple terms how I can use the Join tool to check which rows do not have a match? The R output seems to contain more duplicates than were in the Right input file.

How do I get it to show only the exact rows that don't match? The input file has duplicates, which I do not want to remove; I want to keep them in the output file.

When I link the R output to another Join, it generates a lot of rows. How do I avoid that?

 

Thanks!

 

 

OTrieger
12 - Quasar

@Alteryxuserhere The L and R outputs cannot generate duplicates.

OTrieger
12 - Quasar

Duplicates can only be created in J.

L - all the rows from the Left input that have no matching row
R - all the rows from the Right input that have no matching row
J - the only output where you might get duplicated rows
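
The three outputs described above can be imitated in a few lines of plain Python. This is only a rough sketch of the idea, not how Alteryx implements the tool; the function name `join_outputs` and the sample email values are made up for illustration. It shows that rows repeated in an input pass through to L or R as they are, while J gets one row for every matching pair.

```python
from collections import defaultdict

def join_outputs(left, right, key):
    """Return (L, J, R) row lists for a join on a single key field."""
    right_by_key = defaultdict(list)
    for row in right:
        right_by_key[row[key]].append(row)
    left_keys = {row[key] for row in left}

    # L: left rows with no partner on the right, passed through unchanged.
    l_out = [row for row in left if row[key] not in right_by_key]
    # J: one row per matching (left, right) pair, so repeated keys multiply.
    j_out = [(l, r) for l in left for r in right_by_key.get(l[key], [])]
    # R: right rows with no partner on the left, passed through unchanged.
    r_out = [row for row in right if row[key] not in left_keys]
    return l_out, j_out, r_out

# Hypothetical sample data: "a@x.com" appears twice on both sides.
left = [{"email": "a@x.com"}, {"email": "a@x.com"}, {"email": "b@x.com"}]
right = [{"email": "a@x.com"}, {"email": "a@x.com"},
         {"email": "c@x.com"}, {"email": "c@x.com"}]

l_out, j_out, r_out = join_outputs(left, right, "email")
print(len(l_out), len(j_out), len(r_out))  # 1 4 2
```

Here R holds the two "c@x.com" rows exactly as they were in the Right input, and J holds four rows because each of the two left "a@x.com" rows pairs with each of the two right ones.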

Alteryxuserhere
8 - Asteroid

Thanks @OTrieger. The second Join tool, which I connected to the R output, shows that a larger number of rows is processed. Wouldn't these be duplicates then?

OTrieger
12 - Quasar

As mentioned before, the Join cannot create duplicates in R: R contains every row from the Right input that has no match on the Left side.

In your case you are joining data back to its own source. You might get lots of duplicates in J because the second Join uses the same source as the first. I do not think you need the second Join: you are taking all the rows from the Right side that had no match on the Left side and then joining them again with the same source data. The second Join will never have rows in L, because you are using the same data source. Why do you have the second Join?
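
To see why the row count grows when unmatched rows are joined back to their own source, it may help to count the pairs. The sketch below assumes a join on a single field and uses made-up names (`j_row_count`, the surname values); the J row count is the sum, over every key value, of how many times it occurs on the left multiplied by how many times it occurs on the right.

```python
from collections import Counter

def j_row_count(left_keys, right_keys):
    """Number of rows a join would send to J for the given key columns."""
    left_counts = Counter(left_keys)
    right_counts = Counter(right_keys)
    # Each key contributes (occurrences on left) x (occurrences on right).
    return sum(n * right_counts[k] for k, n in left_counts.items())

# Hypothetical: three unmatched rows from a first join ...
unmatched_names = ["Smith", "Smith", "Lee"]
# ... joined back to the source they came from, on a second field.
source_names = ["Smith", "Smith", "Lee", "Jones"]

print(j_row_count(unmatched_names, source_names))  # 5
```

Three rows go in and five come out of J, because each "Smith" row matches both "Smith" rows in the source, including itself.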

OTrieger
12 - Quasar

There is a good explanation here, with diagrams:
https://help.alteryx.com/current/en/designer/tools/join/join-tool.html#idp336712

Alteryxuserhere
8 - Asteroid

For the first Join I'm using email address as the join criterion, and I want to find out which rows do not match on email. The second Join is based on another criterion and tries to match the R output.
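
A two-stage match like this could be sketched in Python as follows. This is only one possible arrangement and an assumption on my part, not necessarily how the workflow in this thread is wired: the second stage compares the rows left over on the Right with the rows left over on the Left, rather than with the full source. The helper `unmatched` and the `email` / `phone` fields are hypothetical.

```python
def unmatched(rows, other_rows, key):
    """Rows whose key value never appears in other_rows (an anti-join)."""
    other_keys = {row[key] for row in other_rows}
    # Filtering never adds rows, so duplicates in `rows` are kept as they are.
    return [row for row in rows if row[key] not in other_keys]

# Hypothetical sample data with a duplicated row on the right.
left = [
    {"email": "a@x.com", "phone": "111"},
    {"email": "old@x.com", "phone": "222"},
]
right = [
    {"email": "a@x.com", "phone": "111"},
    {"email": "new@x.com", "phone": "222"},
    {"email": "z@x.com", "phone": "999"},
    {"email": "z@x.com", "phone": "999"},
]

# Stage 1: rows on each side with no email match on the other side.
right_leftover = unmatched(right, left, "email")
left_leftover = unmatched(left, right, "email")

# Stage 2: of the right leftovers, which also fail to match on phone?
still_unmatched = unmatched(right_leftover, left_leftover, "phone")
print(len(right_leftover), len(still_unmatched))  # 3 2
```

The two "z@x.com" rows survive both stages, still as two rows, because each stage only filters and never pairs rows up.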

OTrieger
12 - Quasar

[Attached image: j.PNG]

OTrieger
12 - Quasar

As you can see, whatever you are doing on the right side, you are doing it with the same data.

Alteryxuserhere
8 - Asteroid

Yes, but I'm using different fields to try to match what the first Join couldn't. Is this wrong?
