I need help solving a problem in Alteryx: removing duplicate records. Out of 1,000,000 rows there are about 70 duplicates, so about 70 extra rows. Most of the duplicates come into the data set with a different date. Could somebody please give advice on how to handle this?
For example, one duplicate pair:
BLAH NUM|BLAH NUM 2|BLAH NUM 3|CODE|DATE
02045321|34567890123|234567|ABCDEF|5/7/23
02045321|34567890123|234567|ABCDEF|7/5/23
I only want to get back this row:
02045321|34567890123|234567|ABCDEF|7/5/23
Any advice?
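If what you shared is a table with 5 columns, then you can sort by [DATE] ascending and use the Summarize tool: group by all columns except the [DATE] column, then take the Last (action) of the [DATE] column. This assumes [DATE] is a real Date field; if it is still a string such as 5/7/23, convert it first, because text sorts character by character and not chronologically.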
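For anyone who wants to check the logic outside Alteryx, here is a rough Python sketch of the same idea: group on every column except the date and keep the row with the latest date. It is not Alteryx code. The function name `keep_latest` is made up for illustration, and the `%m/%d/%y` format is an assumption (month/day/year), chosen because it makes 7/5/23 the later of the two sample dates.

```python
from datetime import datetime

# Sample rows copied from the question (pipe-delimited, DATE is the last field).
SAMPLE = [
    "BLAH NUM|BLAH NUM 2|BLAH NUM 3|CODE|DATE",
    "02045321|34567890123|234567|ABCDEF|5/7/23",
    "02045321|34567890123|234567|ABCDEF|7/5/23",
]


def keep_latest(lines, date_format="%m/%d/%y"):
    """Keep one row per key (all fields except DATE): the one with the latest date.

    Hypothetical helper; date_format is an assumption (month/day/year).
    """
    header, *rows = lines
    latest = {}  # key fields -> (parsed date, original row text)
    for row in rows:
        *key_fields, date_text = row.split("|")
        key = tuple(key_fields)
        parsed = datetime.strptime(date_text, date_format)
        # Replace the stored row only when this one has a later date.
        if key not in latest or parsed > latest[key][0]:
            latest[key] = (parsed, row)
    return [header] + [row for _, row in latest.values()]


if __name__ == "__main__":
    for line in keep_latest(SAMPLE):
        print(line)
```

Comparing parsed dates instead of the raw text is the important part. If the source data is actually day/month/year, pass `date_format="%d/%m/%y"` and the other row would be kept.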
@staceygreene I guess you do not want to split the fields first and then take the rows based on the max date. In that case, you can do something like the below: