How would I go about excluding the duplicates in my data set? In this case the duplicate lines are the 3 claim IDS (699,699 and 699R). My initial data set is :
Customer ID | Claim ID | Fill Date | Day Supply |
123 | 241 | 1/8/2018 | 90 |
123 | 957 | 4/16/2018 | 90 |
123 | 699 | 7/15/2018 | 0 |
123 | 699 | 7/15/2018 | 90 |
123 | 699R | 7/15/2018 | -90 |
123 | 428 | 7/16/2018 | 90 |
123 | 910 | 10/1/2018 | 90 |
My output goal is:
Customer ID | Claim ID | Fill Date | Day Supply |
123 | 241 | 1/8/2018 | 90 |
123 | 957 | 4/16/2018 | 90 |
123 | 428 | 7/16/2018 | 90 |
123 | 910 | 10/1/2018 | 90 |
Thanks!
Solved! Go to Solution.
Hi Carolyne,
You would want to use the Unique tool to remove the duplicates, then join the data back together to remove the initial value as well. The join tool allows you to choose what kind of join you prefer, in this case it would be the right join excluding the inner join. I have attached a sample workflow for you with the data provided so you can see what I did.
Thanks!