Hi Team,
Hope you are well. I am trying to add a new column to the existing dataset.
The first dataset has 463683 records and
The second dataset has 398754 records. I have a column in second dataset which i am trying to add it to the first dataset.
But my final output i am getting the total records as 7834348. Ideally it should be 463683. Not sure why its appending.
Please let me know what i am missing,
Thanks,
Sri
Solved! Go to Solution.
Hi @sriniprad08
This is because, there are values which will not be unique in the other columns in your dataset.
This leads to increase in count.
Many thanks
Shanker V
Hi @sriniprad08
Please find a sample use case to explain your scenario.
Dataset 1:
Dataset 2:
Join:
Output:
Many thanks
Shanker V
Hi @ShankerV ,
This is great. thank you so much. Can you please let me know how to do in my situation? Means what to exclude/include?
Cheers,
Sri
Hi @sriniprad08
Can you please a sample dataset of 1 and 2.
And the expected output.
So that I can propose the best solution.
Hi @sriniprad08
In my case, as I have decided to have the first value of unique records on the 3 columns.
I have inserted the Unique tool to filter one records for all 3 columns and then joining.
Post running the output is:
Note: Logic needs to be decided based on the dataset and the output expected from the dataset.
Many thanks
Shanker V
User | Count |
---|---|
63 | |
28 | |
23 | |
22 | |
22 |