Hello!
I have a list of customer numbers coming from two files that I have joined together into a single dataset (labeling them along the way), however there are specific duplicates I am trying to remove. If there is a duplicate customer from each file (sold to/sold with) I want to filter out the duplicative sold with customer. Any duplicates stemming from their original file I want to keep included.
Sample Data:
Customer Number | Source |
01769068001 | Sold To |
01769368001 | Sold To |
01769368001 | Sold To |
01774320001 | Sold To |
01774320001 | Sold With |
01776858001 | Sold To |
01776858001 | Sold With |
01790297001 | Sold To |
01790297001 | Sold To |
01818687001 | Sold To |
01818687001 | Sold To |
01857760001 | Sold With |
01857760001 | Sold With |
01859692001 | Sold With |
01859692001 | Sold With |
01860858001 | Sold To |
01860858001 | Sold With |
01860959001 | Sold To |
01860959001 | Sold With |