We are celebrating the 10-year anniversary of the Alteryx Community! Learn more and join in on the fun here.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Delete duplicate based on time submitted help

BMartinCOE
6 - Meteoroid

Hello!

 

I am working with a dataset that contains duplicates it looks something like this:

 

Building IDField 1Field 2

1

1234 Maple St7:00pm 6-6-2022
2123 Yellow St6:00pm 4-5-2022

1

123 Blue St5:33pm 2-5-2022

2

123 Yellow St3:00pm 6-6-2022

3

123 Green Ave3:00pm 4-5-2022

1

1234 Maple St7:10pm 6-6-2022

 

I am trying to remove duplicate based on multiple columns. Some users submit data with the same ID and the same address but it has been submitted at a later time. I am looking to remove the early time as I assume the more recent submission is their intended submission (ex: would want to remove the older 1234 yellow st). This becomes more complicated because some users submit data with the same building ID for different addresses (ex: building ID 1 has two different address but 3 different submissions). In this case it should end up with two submissions, removing the older duplicated address.

 

Does anyone have any suggestions on how I can clean this up?

 

Thanks

 

*Edit: it is okay for their to be duplicate building ID, it is not okay for their to be duplicate field 1

1 REPLY 1
geraldo
13 - Pulsar

@BMartinCOE 

 

 

AN worflow example

 

deletedup.JPG

Labels
Top Solution Authors