Hi Community,
I am stuck on a data cleansing exercise where I need to replace data in a few columns once I have identified duplicate rows.
Please find the attached workflow for reference; below is what I am after.
For example, Sales Documents 150502162202 and 6602165304 are duplicated in the dataset and have multiple rows for the same material and customer code (Record IDs 24-31, for example). I want each # or 0 to be replaced with the actual value wherever one of the duplicate rows has one. After that, I want to keep only the clean rows and remove the bad duplicates. Please refer to the attached screenshots and let me know if anything is still unclear.
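To make the intended logic concrete, here is a rough sketch in plain Python. It is not the attached workflow, only an illustration of the rule described above. The column names "Plant" and "Quantity", the sample values, and the function name collapse_duplicates are hypothetical; only the two Sales Document numbers come from the real data, and the assumption is that duplicates are identified by Sales Document, Material, and Customer.

```python
# Values treated as "missing" placeholders, per the description above.
PLACEHOLDERS = {"#", "0", 0}

def collapse_duplicates(rows, key_cols):
    """Merge rows sharing the same key into one row, filling placeholders
    with the first real value found among the duplicates."""
    merged = {}
    for row in rows:
        key = tuple(row[c] for c in key_cols)
        if key not in merged:
            merged[key] = dict(row)  # first row seen for this key
            continue
        kept = merged[key]
        for col, value in row.items():
            # Only overwrite a placeholder, never an existing real value.
            if kept.get(col) in PLACEHOLDERS and value not in PLACEHOLDERS:
                kept[col] = value
    return list(merged.values())

# Made-up sample rows (hypothetical columns and values).
rows = [
    {"Sales Document": "150502162202", "Material": "M1", "Customer": "C1", "Plant": "#", "Quantity": 0},
    {"Sales Document": "150502162202", "Material": "M1", "Customer": "C1", "Plant": "P100", "Quantity": 5},
    {"Sales Document": "6602165304", "Material": "M2", "Customer": "C2", "Plant": "P200", "Quantity": 7},
    {"Sales Document": "6602165304", "Material": "M2", "Customer": "C2", "Plant": "#", "Quantity": 0},
]

result = collapse_duplicates(rows, ["Sales Document", "Material", "Customer"])
for r in result:
    print(r)
```

This makes a single pass over the rows and keeps one dictionary entry per key, so it should scale to around a million rows. If every duplicate in a group has # or 0 in a column, the placeholder is left as-is.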
Note: I have around 1 million rows in my original data set.
Thanks a ton in advance.