Alteryx Designer Desktop Discussions


Duplicate Records

amcgill2
7 - Meteor

Hello Alteryx community! I am brand new to Alteryx and hoping you can help me with the questions below. I need to build a custom workflow to check the following:

  1. Are there duplicate records in the RTT_SUMMARY All reduced.csv table? If so, how many?
  2. After removing duplicate records, does OPA_ACCOUNT_NUM determine STREET_ADDRESS?
  3. Does OPA_ACCOUNT_NUM determine ZIP_CODE?
  4. What should be the primary key for the RTT_SUMMARY All reduced.csv table? (Hint: Data may not currently support that)
mceleavey
17 - Castor

Hi @amcgill2,

I'm not sure what you mean by "determines". Do you mean OPA_ACCOUNT_NUM = STREET_ADDRESS, or that the two relate in some other way?

In the data you have provided, OPA_ACCOUNT_NUM does not equal STREET_ADDRESS or ZIP_CODE.
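If "determines" is meant in the database sense (a functional dependency, where every OPA_ACCOUNT_NUM value maps to exactly one STREET_ADDRESS), the check could be sketched outside Alteryx roughly like this. This is only an illustration of that reading, not the attached workflow: the function name `dependency_violations` and the sample rows are made up, and real rows could come from something like `csv.DictReader`.

```python
from collections import defaultdict

def dependency_violations(rows, key, dependent):
    """Map each key value to its set of dependent values, keeping only keys with more than one."""
    seen = defaultdict(set)
    for row in rows:
        seen[row[key]].add(row[dependent])
    return {k: v for k, v in seen.items() if len(v) > 1}

# Made-up sample rows for illustration; not taken from the real CSV.
sample = [
    {"OPA_ACCOUNT_NUM": "A1", "STREET_ADDRESS": "10 MAIN ST", "ZIP_CODE": "19101"},
    {"OPA_ACCOUNT_NUM": "A1", "STREET_ADDRESS": "10 MAIN ST", "ZIP_CODE": "19102"},
    {"OPA_ACCOUNT_NUM": "B2", "STREET_ADDRESS": "22 OAK AVE", "ZIP_CODE": "19103"},
]

address_conflicts = dependency_violations(sample, "OPA_ACCOUNT_NUM", "STREET_ADDRESS")
zip_conflicts = dependency_violations(sample, "OPA_ACCOUNT_NUM", "ZIP_CODE")
print(address_conflicts)  # empty: each account has a single address in this sample
print(zip_conflicts)      # A1 appears with two ZIP codes, so the dependency fails here
```

An empty result means the key column determines the dependent column in the rows checked; any entry in the result is a counterexample.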

 


I have attached a workflow that checks for duplicates. Since you haven't told us which key defines a duplicate, I've only checked for rows that are exact duplicates across all columns.

I removed these duplicate rows using the Unique tool.
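For anyone who wants to see the logic written out, here is a minimal sketch of the same idea in Python: keep the first occurrence of each row and count the later exact copies. It is not the attached workflow, and `split_exact_duplicates` and the sample rows are made-up names and data.

```python
def split_exact_duplicates(rows):
    """Return (unique_rows, duplicate_count); rows match only if every column matches."""
    seen = set()
    unique_rows = []
    duplicate_count = 0
    for row in rows:
        # A hashable fingerprint of the whole row, independent of column order.
        fingerprint = tuple(sorted(row.items()))
        if fingerprint in seen:
            duplicate_count += 1
        else:
            seen.add(fingerprint)
            unique_rows.append(row)
    return unique_rows, duplicate_count

# Made-up sample rows for illustration; not taken from the real CSV.
sample = [
    {"Document_ID": "D1", "OPA_ACCOUNT_NUM": "A1", "ZIP_CODE": "19101"},
    {"Document_ID": "D1", "OPA_ACCOUNT_NUM": "A1", "ZIP_CODE": "19101"},
    {"Document_ID": "D2", "OPA_ACCOUNT_NUM": "B2", "ZIP_CODE": "19103"},
]

unique_rows, duplicate_count = split_exact_duplicates(sample)
print(len(unique_rows), duplicate_count)
```

The duplicate count here is the number of extra copies, so a row that appears three times contributes two.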

The primary key is unknown. It looks like it should be Document_ID, but with the duplicates present there's no way to confirm.
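Once the exact duplicates are gone, a candidate key can be tested by checking that its values are never empty and never repeat. A rough sketch, assuming the rows are dictionaries keyed by column name; `is_candidate_key` and the sample rows are made up for illustration.

```python
def is_candidate_key(rows, columns):
    """True if the given columns are always filled in and their combined values never repeat."""
    keys = [tuple(row[c] for c in columns) for row in rows]
    if any(value in ("", None) for key in keys for value in key):
        return False
    return len(set(keys)) == len(keys)

# Made-up sample rows for illustration; not taken from the real CSV.
sample = [
    {"Document_ID": "D1", "OPA_ACCOUNT_NUM": "A1"},
    {"Document_ID": "D2", "OPA_ACCOUNT_NUM": "A1"},
    {"Document_ID": "D3", "OPA_ACCOUNT_NUM": "B2"},
]

print(is_candidate_key(sample, ["Document_ID"]))      # unique in this sample
print(is_candidate_key(sample, ["OPA_ACCOUNT_NUM"]))  # A1 repeats in this sample
```

Passing more than one column name tests a composite key, which may be worth trying if no single column turns out to be unique.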

 

M

amcgill2
7 - Meteor

Hi @mceleavey,

You have raised a good question. I will need to investigate further. 

Thank you!

mceleavey
17 - Castor

No problem.


