In case you missed the announcement: The Alteryx One Fall Release is here! Learn more about the new features and capabilities here
ACT NOW: The Alteryx team will be retiring support for Community account recovery and Community email-change requests after December 31, 2025. Set up your security questions now so you can recover your account anytime, just log out and back in to get started. Learn more here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
解決済み

Duplicates

GdeH
メテオール

Hi,

I've uploaded 2 files in the tool. There are some duplicates between both files. I would like to delete those.

However, the duplicates that are included within 1 file I would like to keep them. 

For exemple, here below I would like to keep line 1 2 and 4 and remove line 2.
FILE 1 - XXX
FILE 2 - XXX
FILE 2 - XXX
FILE 1 - YYY

Here below I would like to keep line 1 2 and 5 and remove line 3 and 4.

FILE 1 - XXX

FILE 1 - XXX
FILE 2 - XXX
FILE 2 - XXX
FILE 1 - YYY

10件の返信10
echuong1
Alteryx Alumni (Retired)

I think I understand your requirements now.

 

I essentially found a count of each value per file. I then added a running total/count per value in the overall dataset. After adding the counts from File 1 to the overall dataset, I was able to use a filter to include any value where File = File 1 or where the running total was higher than that of the file 1 count.

 

echuong1_0-1613511826492.png

 

ラベル
トップのソリューション投稿者