We are celebrating the 10-year anniversary of the Alteryx Community! Learn more and join in on the fun here.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Issue in Join's

sriniprad08
11 - Bolide

Hi Team,

Hope you are well. I am trying to add a new column to the existing dataset. 

The first dataset has 463683 records and 

The second dataset has 398754 records. I have a column in second dataset which i am trying to add it to the first dataset.

 

But my final output i am getting the total records as 7834348. Ideally it should be 463683. Not sure why its appending.

 

Please let me know what i am missing,

 

Thanks,

Sri

 

sriniprad08_0-1677576257434.png

 

5 REPLIES 5
ShankerV
17 - Castor

Hi @sriniprad08 

 

This is because, there are values which will not be unique in the other columns in your dataset.

This leads to increase in count.

 

Many thanks

Shanker V

ShankerV
17 - Castor

Hi @sriniprad08 

 

Please find a sample use case to explain your scenario.

 

Dataset 1:

 

ShankerV_0-1677576643269.png

 

Dataset 2:

 

ShankerV_1-1677576676516.png

 

Join:

ShankerV_0-1677577580740.png

 

Output:

 

ShankerV_1-1677577611729.png

 

Many thanks

Shanker V

 

sriniprad08
11 - Bolide

Hi @ShankerV ,

 

This is great. thank you so much. Can you please let me know how to do in my situation? Means what to exclude/include?

 

Cheers,

Sri

ShankerV
17 - Castor

Hi @sriniprad08 

 

Can you please a sample dataset of 1 and 2.

And the expected output.

 

So that I can propose the best solution.

 

ShankerV
17 - Castor

Hi @sriniprad08 

 

In my case, as I have decided to have the first value of unique records on the 3 columns.

 

I have inserted the Unique tool to filter one records for all 3 columns and then joining.

 

ShankerV_0-1677579187856.png

 

ShankerV_1-1677579204804.png

 

Post running the output is:

ShankerV_2-1677579230686.png

 

Note: Logic needs to be decided based on the dataset and the output expected from the dataset.

 

Many thanks

Shanker V

Labels
Top Solution Authors