Join the Inspire AMA with Joshua Burkhow, March 31-April 4. Ask, share, and connect with the Alteryx community!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Compare two columns from two different data sets

danniboy3000
5 - Atom

Hi all,

I am brand new, taking the courses, and learning fast. Great tool!

 

I have two data sets with millions of rows that should contain the same set of data. I am trying to find the differences.

For both sets I concatenated three columns to create an unique ID column.

I want to compare the two unique ID columns to see the % match / mismatch.

 

How do I do that? Is there a way to see the % match between the two? The below data table is an example of what I am talking about.

 

Customer NameCustomer IDNumber of purchasesConcat 1
John110John110
Bob211Bob211
Sally312Sally312

 

Customer NameCustomer IDNumber of purchasesConcat 2
John110John110
Bob211Bob211
Jenny413Jenny413
3 REPLIES 3
gawa
16 - Nebula
16 - Nebula

hi @danniboy3000 

You want to know how many columns are match for each row? For example, 2 out of 3 columns match=> 66.7%, 1 out of 3 columns match=>33.3%, correct?

If so, you can transpose table and compare them by JOIN and identify difference by formula tool...so on. Please see attached workflow.

image.png

By the way, I suppose you create unique column by concat first three column, however, I recommend to put a separator like semi-colon ;  vertical bar| etc..

For example, if there is another record like:

Customer Name: John, Customer ID: 11, Number of purchases: 0

Then, concat value would be John110, which is identical to first record, and not a unique one! By using separator, they look like;

John;1;10

John;11;0

These values become unique.

danniboy3000
5 - Atom

Thank you for the response. I truly appreciate it. I will look it over.

ChrisTX
16 - Nebula
16 - Nebula

@danniboy3000 I've seen many options to compare data sets.  If you only need to compare 2 columns, a Join or Transpose may be easiest.  If you ever need to compare many columns, see links below.

 


Community > Support > Knowledge > Designer > How To: Compare Data from Two Data sets
https://community.alteryx.com/t5/Alteryx-Designer-Knowledge-Base/How-To-Compare-Data-from-Two-Data-s...

 

I want to develop something that would highlight the differences in the names.
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/String-Comparison/m-p/388279#M71158

 

CReW Delta macro is in our laboratory and looking for feedback. This macro allows a user to input two files that are expected to be equal. The "Delta" output will include any differences.
https://gallery.alteryx.com/#!app/CReW-Delta--Alpha-/5b41141b826fd3116445fee8

 

Comparing Data from Two Sets of Data, Calling out Differences
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Comparing-Data-from-Two-Sets-of-Data-C...

 

use join tool to find newly inserted or deleted rows. See blog...
https://community.alteryx.com/t5/Engine-Works-Blog/Compare-2-Data-Sets/ba-p/88853

 

Compare 2 Data Sets
https://community.alteryx.com/t5/Engine-Works-Blog/Compare-2-Data-Sets/ba-p/88853

 

The purpose of the macro is to pull in two different sheets and help identify the differences between the two
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/How-To-Update-Fields-Selected-With-Con...

 

Community > Getting Started > Alteryx Use Cases > Data Check Between Two Sources
2018 Excellence Awards Entry
https://community.alteryx.com/t5/Alteryx-Use-Cases/Data-Check-Between-Two-Sources/ta-p/270082

 

I'd like to create a workflow that compares all data for a given row in spreadsheet 1 with spreadsheet 2
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Best-way-to-identify-unique-rows-of-da...

 

Comparing data between two different servers
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Comparing-data-between-two-different-s...

 

Compare two files without a unique key - not able to Join
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Comparing-two-files-that-do-not-have-a...

 

Compare field values Left vs Right
https://community.alteryx.com/t5/Inspire-2019-Buzz/Nested-Macros-Session-Field-Comparison-macro/m-p/...

 

Iterative Data Comparison
I am looking to compare data between 2 files - Pre and Post.
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Iterative-Data-Comparison/td-p/841398/...

 

Combining 2 Data Sets and Highlighting Differences in Output
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Combining-2-Data-Sets-and-Highlighting...

 

 

Chris

Labels
Top Solution Authors