Alteryx Designer Desktop Discussions

kcroninkrein · ‎09-18-2023

Hi all,

I have a Relationship export that has two records representing each unique relationship (example: one record for Parent-Child and another for Child-Parent). I need to de-dupe this dataset so that each relationship only has 1 record. The RelationshipType and InverseRelationships are set up such that a given RelationshipType can have only 1 possible InverseRelationship, and vice versa. I've included an example of the dataset below:

LegacyContactId	LegacyRelatedContactId	RelationshipType	InverseRelationship
123	456	Child	Parent
456	123	Parent	Child
789	1011	Business	Owner
1011	789	Owner	Business
1213	1415	Sibling	Sibling
1415	1213	Sibling	Sibling

Note that for simplicity sake, I have put the relationship pair of records next to each other, but in the real dataset they may be far apart.

I thought that I could resolve this using a self-join with the Join tool, connecting Left: LegacyContact = Right: LegacyRelatedContactId and Left: LegacyRelatedContactId = Right: LegacyContactId, but that gives me the same number of records when I would have expected the count to decrease by half.

How would y'all suggest that I approach this? Thank you!

AncientPandaman · ‎09-18-2023

add 2 new columns, sort them by formula

formula

1: if [LegacyContactId]<[LegacyRelatedContactId] then [LegacyContactId] else [LegacyRelatedContactId] endif

2: if [LegacyContactId]<[LegacyRelatedContactId] then [LegacyRelatedContactId] else [LegacyContactId] endif

then add unique tool and keep unique by the new 2 column.

kcroninkrein · ‎09-20-2023

Thank you, Pang_Hee_Choy! That seems to have worked. I added in a third column that added in a formula for

IF [RelationshipType] < [InverseRelationship] THEN [RelationshipType] ELSE [InverseRelationship] ENDIF

This pulls in whichever relationship is alphabetically earlier than the other (which works because each Relationship has only one InverseRelationship option). That way it isn't removing instances where the two accounts might have a second valid relationship (example: Parent-Child and Team Leader-Team Member). That facet wasn't present in the example dataset, but is in the real data so wanted to mention it for others who might encounter a similar issue.

Alteryx Designer Desktop Discussions

De-Duping Relationship and Inverse Relationship Records

Re: Is there any way the computer vision tools can...

Re: Batch Macro

Re: How to get cell reference address from excel

Re: Replacing Forecast columns with Actual Data

Re: Row creation