Remove Duplicates on one column without removing whole record
Hi,
How do I remove repeated values from one column (a measure) without removing the whole row? I would like to replace every repeated value with null (sometimes the same value appears three or even four times). I used the Multi-Row Formula tool, but it only removed the first duplicate, not the additional ones when more than two records share the same value for this attribute.
Thank you!
Hi @magsbiel
I believe the Multi-Row Formula tool is still what you want to use. You can define which fields (attributes) the comparison is evaluated within by selecting them in the "Group By" section.
Can you share a dummy file or illustration of what you're working with and what you'd like it to look like after?
@magsbiel
I hope I understood your intention correctly.
Do this in 2 steps:
Step 1: Identify the duplicate:
- Sort the data by the relevant column
- Use the Multi-Row Formula tool to compare each row with the previous one and create a column called "isDuplicate" with the value Y or N
Step 2: Eliminate the duplicate
- Use a simple Formula tool to update the FieldName field with: "IF [isDuplicate] = 'Y' THEN Null() ELSE [FieldName] ENDIF"
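For anyone who wants to check the logic outside of Alteryx, the two steps above can be sketched in plain Python. This is only an illustration of the idea, not the workflow itself: the function name `null_repeats`, the sample rows, and the `measure` field are all made up for the example, and it assumes the column has no nulls to begin with (otherwise the sort would need extra handling).

```python
def null_repeats(rows, field):
    """Return copies of the rows, sorted by `field`, with every repeat of a
    value in `field` replaced by None. The first occurrence is kept."""
    # Step 1a: sort by the relevant column (sorted() is stable, so rows with
    # equal values keep their original relative order).
    ordered = sorted(rows, key=lambda row: row[field])

    result = []
    previous = object()  # sentinel that never equals a real value
    for row in ordered:
        # Step 1b: flag the row if its value equals the previous row's value.
        is_duplicate = row[field] == previous
        previous = row[field]

        # Step 2: null out the field on flagged rows, keep the rest of the row.
        new_row = dict(row)
        if is_duplicate:
            new_row[field] = None
        result.append(new_row)
    return result


# Hypothetical sample data: the value 100 appears three times, 250 twice.
sample = [
    {"id": 1, "measure": 100},
    {"id": 2, "measure": 250},
    {"id": 3, "measure": 100},
    {"id": 4, "measure": 100},
    {"id": 5, "measure": 250},
]

cleaned = null_repeats(sample, "measure")
```

Because the comparison runs on sorted data against the original (not yet nulled) previous value, a value that appears three or four times is nulled on every row after the first, which is the case the original question was stuck on. No rows are dropped.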