
Alteryx Tip 2016 from VeraData

VeraData101

VeraData tips for working with duplicates.

 

In this post, we’d like to share some useful tips to help organizations identify the best process to remove duplicates within Alteryx.

 

The most common way to remove duplicate records from your files is to use the Unique Tool, the Tile Tool or, when you need a count of unique records, the Summarize Tool.

 

When using these tools, however, you remove identical records and keep only one record from each group of duplicates. The biggest disadvantage of this method is that you have little control over which record is kept: it comes down to how the records are sorted. Often you need not only to remove the duplicates, but also to keep one specific record from each group, for example the record that has values in particular columns, or the record that contains more information than the others.
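
To make the limitation concrete, here is a minimal Python sketch (not Alteryx itself) of keep-first deduplication, assuming a tool that keeps the first record it encounters for each key. The field names `email` and `phone` and the helper `keep_first` are hypothetical and exist only for this illustration.

```python
def keep_first(records, key):
    """Keep the first record seen for each key value (assumed keep-first behavior)."""
    seen = set()
    kept = []
    for rec in records:
        value = rec[key]
        if value not in seen:
            seen.add(value)
            kept.append(rec)
    return kept


# Two records for the same person; only the second one has a phone number.
records = [
    {"email": "a@example.com", "phone": ""},
    {"email": "a@example.com", "phone": "555-0100"},
]

# The surviving record depends only on input order, not on which one is more complete.
first_order = keep_first(records, "email")
reversed_order = keep_first(list(reversed(records)), "email")
```

With the original order the record without a phone number survives; with the order reversed the more complete record survives. Nothing about the records themselves drives that choice.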

 

In this case, we would like to propose this combination of tools.

 

“Output #1” in the sample below (on the left) illustrates how to remove ALL duplicated records. This first output contains only truly unique records, that is, records that have no duplicates at all.

 

On “Output #2” (on the right), you’ll get a pool of all duplicate records.
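
The two outputs can be sketched outside Alteryx as follows. This is an illustration in plain Python under stated assumptions, not the workflow from the screenshot: the key field `email` and the function `split_by_duplication` are hypothetical names.

```python
from collections import Counter


def split_by_duplication(records, key):
    """Split records into those whose key occurs once and those whose key repeats."""
    counts = Counter(rec[key] for rec in records)
    no_duplicates = [rec for rec in records if counts[rec[key]] == 1]
    duplicate_pool = [rec for rec in records if counts[rec[key]] > 1]
    return no_duplicates, duplicate_pool


records = [
    {"email": "a@example.com", "phone": ""},
    {"email": "b@example.com", "phone": "555-0111"},
    {"email": "a@example.com", "phone": "555-0100"},
]

# output_1 plays the role of "Output #1", output_2 the role of "Output #2".
output_1, output_2 = split_by_duplication(records, "email")
```

Here `output_1` holds the single `b@example.com` record, and `output_2` holds both `a@example.com` records, so nothing is discarded before you decide which duplicate to keep.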

From this pool, you can choose the one record from each group of duplicates that best matches your criteria.
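
One possible criterion is "keep the record with the most filled-in fields". The Python sketch below shows that rule applied to a pool of duplicates. The names `filled_fields` and `pick_most_complete`, the key field `email`, and the rule itself are assumptions made for illustration; your own criteria may differ.

```python
def filled_fields(record):
    """Count the fields that hold a non-empty value."""
    return sum(1 for value in record.values() if value not in ("", None))


def pick_most_complete(duplicate_pool, key):
    """For each key value, keep the record with the most filled-in fields.

    On a tie, the record encountered first is kept.
    """
    best = {}
    for rec in duplicate_pool:
        value = rec[key]
        if value not in best or filled_fields(rec) > filled_fields(best[value]):
            best[value] = rec
    return list(best.values())


duplicate_pool = [
    {"email": "a@example.com", "phone": "", "city": ""},
    {"email": "a@example.com", "phone": "555-0100", "city": "Naples"},
    {"email": "c@example.com", "phone": "555-0122", "city": ""},
    {"email": "c@example.com", "phone": "", "city": ""},
]

chosen = pick_most_complete(duplicate_pool, "email")
```

Each group of duplicates contributes exactly one record to `chosen`, selected by completeness rather than by position in the file.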


You may also find this combination quite useful if you package it as a macro. It’ll save you quite a bit of time, but we recommend practicing the combination in a regular workflow first.

 

 

 

Capture.JPG
