Hi,
I've uploaded 2 files in the tool. There are some duplicates between both files. I would like to delete those.
However, the duplicates that are included within 1 file I would like to keep them.
For exemple, here below I would like to keep line 1 2 and 4 and remove line 2.
FILE 1 - XXX
FILE 2 - XXX
FILE 2 - XXX
FILE 1 - YYY
Here below I would like to keep line 1 2 and 5 and remove line 3 and 4.
FILE 1 - XXX
FILE 1 - XXX
FILE 2 - XXX
FILE 2 - XXX
FILE 1 - YYY
Solved! Go to Solution.
I think I understand your requirements now.
I essentially found a count of each value per file. I then added a running total/count per value in the overall dataset. After adding the counts from File 1 to the overall dataset, I was able to use a filter to include any value where File = File 1 or where the running total was higher than that of the file 1 count.