
SOLVED

Removing Duplicates

itahir04
8 - Asteroid

Hi Team,

I have a training data set. A user can have the same training recorded once with the status "completed" and once with the status "In progress". I would like to remove the "In progress" record if that training has already been completed by the same user.

I have 50 thousand records in the real data set. Duplicates should be removed within each user's data, and the same rule should then be applied to every other user's data. Please assist me here.

My sample data set is attached.

 Thanks!

2 REPLIES
AmeliaG
Alteryx

Hi @itahir04,

 

Thanks for your question! To achieve your desired result:

 

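1. Sort the data on "training status" so that the completed record comes before the in-progress record for the same training (i.e. in descending order).

2. Use the 'Unique' tool to de-dupe the records, with UserID and Training Title as the 'Unique' fields. The tool keeps the first record in each group, so after the sort the completed record is the one that survives.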

 

See workflow below:

 

[Attached image: dedupep.png]
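
For anyone who wants to check the logic outside of Designer, the same sort-then-unique idea can be sketched in plain Python. This is only an illustration, not the attached workflow: the column names (`UserID`, `Training Title`, `Training Status`), the status strings, and the sample rows are all assumptions. It uses an explicit rank for the status instead of an alphabetical sort, since the correct sort direction depends on the exact status text in your data.

```python
# Hypothetical sample rows; column names and status strings are assumptions.
records = [
    {"UserID": 1, "Training Title": "Safety", "Training Status": "In progress"},
    {"UserID": 1, "Training Title": "Safety", "Training Status": "Completed"},
    {"UserID": 1, "Training Title": "Ethics", "Training Status": "In progress"},
    {"UserID": 2, "Training Title": "Safety", "Training Status": "In progress"},
]

# Explicit rank so "Completed" always sorts ahead of "In progress".
STATUS_RANK = {"Completed": 0, "In progress": 1}


def dedupe_trainings(rows):
    """Keep one row per (UserID, Training Title), preferring completed rows."""
    # Step 1 (Sort tool equivalent): completed rows first; sorted() is stable.
    ordered = sorted(rows, key=lambda r: STATUS_RANK[r["Training Status"]])

    # Step 2 (Unique tool equivalent): keep the first row seen for each key.
    seen = set()
    unique = []
    for row in ordered:
        key = (row["UserID"], row["Training Title"])
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


result = dedupe_trainings(records)
for row in result:
    print(row)
```

In this sample, user 1's "In progress" row for Safety is dropped because a completed row exists for the same user and title, while in-progress trainings with no completed counterpart are kept.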

 

Let me know if you have any additional questions. 

 

Thanks!

 

Amelia

itahir04
8 - Asteroid

Thanks a lot @

 
