I have a batch of records that belong to groups, with a field that assigns the group: the field is named "Group Name", and each record holds a value such as "Group 1" or "Group 2". A given record, let's say "John Doe", may appear in the batch several times, in multiple groups. I need the batch deduplicated so that he ultimately appears only once, listed with a single group name.
The tricky part: I first want to count how many records each group contains. I want John Doe to stay in the smallest group, so that his duplicates are pulled out of the larger groups. This should help ensure that groups that start out small are not whittled down to nothing once the dupes are removed.
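To make the logic concrete, here is a rough Python sketch of what I have in mind. It is only an illustration under some assumptions: the records are plain dictionaries, the person is identified by a hypothetical "Name" field (my real data may use something else), group sizes are counted once on the original batch and not updated as duplicates are removed, and ties between equally small groups go to whichever copy appears first.

```python
from collections import Counter


def dedupe_keep_smallest_group(records, name_key="Name", group_key="Group Name"):
    """Keep one copy of each name, choosing the copy in the smallest group.

    "Name" is an assumed field name; "Group Name" is the field from my data.
    """
    # Count how many records each group holds before anything is removed.
    group_sizes = Counter(rec[group_key] for rec in records)

    kept = {}  # name -> the record currently chosen for that name
    for rec in records:
        name = rec[name_key]
        current = kept.get(name)
        # Replace the chosen copy only if this one sits in a strictly smaller group,
        # so ties are resolved in favor of the copy seen first.
        if current is None or group_sizes[rec[group_key]] < group_sizes[current[group_key]]:
            kept[name] = rec
    return list(kept.values())


sample = [
    {"Name": "John Doe", "Group Name": "Group 1"},
    {"Name": "Jane Roe", "Group Name": "Group 1"},
    {"Name": "Ann Lee", "Group Name": "Group 1"},
    {"Name": "John Doe", "Group Name": "Group 2"},
]

for rec in dedupe_keep_smallest_group(sample):
    print(rec["Name"], "->", rec[group_key] if (group_key := "Group Name") else None)
```

In this sample, "Group 1" has three records and "Group 2" has one, so John Doe would stay in "Group 2" and be pulled out of "Group 1".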
Any ideas?