Identify and Isolate duplicate values in a column or columns
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Is there a good way to identify and isolate duplicate values in a column? I need to find every duplicate value in one column among many, and work with that data (not caring about the single occurances), so the Unique tool doesn't quite work for my needs. i've been running my data through the Summary tool and counting the columns of focus, Filter tool to isolate count values > 1, and then Join that result back with the original data. This process just feels clunky to me, does anyone know a single-tool formula/process to accomplish the same result? Bonus points if the process can be adapted for multiple columns. Thank you!
Solved! Go to Solution.
- Labels:
- Preparation
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi,
Can you provide some sample data to look at?
Thanks.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi @Rob48
There is no single tool solution to isolate duplicate values. This is a solution using probably the smallest number of tools.
Use the Unique tool to find the duplicates and then a Summarize(or another Unique) to get a unique list. Join this back to the input data on the value column to get all the rows that have duplicates.
The solution for multiple columns is more complex since you need to compare values with other values, but only in the same column
 
Similar process but you transpose the value columns and then summarize the duplicate results to join back on row id.
Dan
 
