Hello everyone,
I need some help cleaning a dataset.
My input file contains customer IDs, record IDs, stages and timestamps.
All of these record IDs are currently set as Main, although they should not all be.
Each customer may have two or more record IDs, but only one of them should be set as Main.
The Main record is the one that has the most advanced stage.
If two or more records for the same customer have the same stage, then the Main one is the most recent one.
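
To make the rule concrete, here is a rough sketch in plain Python of what I think the logic should be. The field names (customer_id, record_id, stage, timestamp, is_main), the stage names and their order, and the rows are all made up for illustration and are not my real data, so please treat it as a statement of the rule rather than a working solution.

```python
from datetime import datetime

# Hypothetical stage ranking (an assumption): higher number = more advanced.
STAGE_RANK = {"Lead": 1, "Qualified": 2, "Proposal": 3, "Closed": 4}


def flag_main(records):
    """Return copies of the records with one is_main=True per customer."""
    best = {}  # customer_id -> ((stage rank, timestamp), record_id)
    for rec in records:
        # Compare by stage first, then by timestamp to break ties.
        key = (STAGE_RANK[rec["stage"]],
               datetime.fromisoformat(rec["timestamp"]))
        cust = rec["customer_id"]
        if cust not in best or key > best[cust][0]:
            best[cust] = (key, rec["record_id"])
    # Assumes record IDs are unique within a customer.
    return [
        {**rec, "is_main": best[rec["customer_id"]][1] == rec["record_id"]}
        for rec in records
    ]


# Made-up rows, not taken from the real file.
rows = [
    {"customer_id": "C1", "record_id": "R1", "stage": "Lead",
     "timestamp": "2023-01-05 10:00:00"},
    {"customer_id": "C1", "record_id": "R2", "stage": "Proposal",
     "timestamp": "2023-01-02 09:00:00"},
    {"customer_id": "C2", "record_id": "R3", "stage": "Qualified",
     "timestamp": "2023-02-01 08:00:00"},
    {"customer_id": "C2", "record_id": "R4", "stage": "Qualified",
     "timestamp": "2023-03-01 08:00:00"},
]

cleaned = flag_main(rows)
for rec in cleaned:
    print(rec["customer_id"], rec["record_id"], rec["is_main"])
```

In this made-up example, R2 would be Main for C1 because its stage is more advanced even though it is older, and R4 would be Main for C2 because the stages are equal and it is the more recent record.
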
Here is a sample of the dataset to clean up.
Your help would be greatly appreciated.
Many thanks
Esme