Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Join tool duplicate confusion

Alteryxuserhere
8 - Asteroid

Hi there

Can someone explain to me in simple term how I can use the join tool to check what does not have a match? The right output seems to generate more duplicates that what was in the Right input file. 

How do I just get it to show the exact rows that dont match? The input file has duplicates which I would like to not remove and keep in the output file.

When I try to link the R output to another join, it is generating alot of rows? How do I avoid it? 

 

Thanks!

 

 

13 REPLIES 13
OTrieger
13 - Pulsar

@Alteryxuserhere  L and R cannot generate duplicates

OTrieger
13 - Pulsar

Duplicates can be only in J

L - all the items that comes from the Left and does not have matching item
R - all the items that comes from the Right and does not have matching item
J - that is the only output where you might have duplications

Alteryxuserhere
8 - Asteroid

Thanks @OTrieger  For the secondary Join tool i used after the R tool, it shows that a larger number of rows are processed. Wouldn't these be duplicates then?

OTrieger
13 - Pulsar

R cannot contain duplicates, as mentioned before, R contains all the items that comes from Right and there is no match for them from the L side.

In your case as you are merging the same source of the data. You might get lots of duplicates in J as you are joining the same source after the first Join. I do not think that you need the 2nd Join, as what you are doing you are taking all the items from the Right side that did not have a match with the Left side and then join them again with the same source of data. in the 2nd Join you will never going to have items in L as you are using the same data source. Why do you have the 2nd Join?

OTrieger
13 - Pulsar

There is a good explanation here with visual explanation
https://help.alteryx.com/current/en/designer/tools/join/join-tool.html#idp336712 

Alteryxuserhere
8 - Asteroid

For the first join im using email address as a criteira for the join and i want to find out which rows do not match based on email and then the second join is based on another criteira which tries to match the R output

OTrieger
13 - Pulsar

j.PNG

OTrieger
13 - Pulsar

As you can see whatever you are doing or the right side, you do it with the same data

Alteryxuserhere
8 - Asteroid

Yes But im using different fields to try match what the first Join couldnt. Is this wrong?

Labels
Top Solution Authors