Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Reconcile multiple datasets

barkat
6 - Meteoroid

Hello, 

 

I have multiple datasets coming from different sources, and I need to check that all the 39 fields match for each ID in all the data sources, and fields that have a different value should be flagged. Initially, ii thought I would have to join on the unique ID and 'amount' column. however in my real data sets, there are 39 fields, do I need to do this 39 times? 

 

If anyone could help me with a more efficient way to do this. That would be amazing. I have attached two datasets that are similar to my real datasets. 

 

 

Thank you. 

 

DATA SET 1 : 

 

 

IDAMOUNTRatingTypeflag
11000A+CouponNo
21000AA++CouponYes
33000BBCouponyes
46000CCCouponyes
56000BBCouponyes
67000A-CouponNo
79000n/ACouponNo
89000AAACouponNo
99000BuCouponNo
109000BuCreditNo

 

DATA SET 2:

IDAMOUNTRatingTypeflag
11000A+CouponNo
21000BCouponYes
33000BBCredityes
46000CCCouponyes
56000BBCouponyes
67000A-CreditNo
79000n/ACouponNo
89000AAACouponNo
99000BuCouponNo
109000BuCreditNo

 

5 REPLIES 5
ChrisTX
16 - Nebula
16 - Nebula

There are many posts on this subject.  Here are a few:

 

How To: Compare Data from Two Data sets

Community > Designer > Browse Knowledge > How To: Compare Data from Two Data sets

https://community.alteryx.com/t5/Alteryx-Knowledge-Base/How-To-Compare-Data-from-Two-Data-sets/ta-p/...

 

CReW Delta macro: This macro allows a user to input two files that are expected to be equal. The user can select which fields are to be compared

https://gallery.alteryx.com/#!app/CReW-Delta--Alpha-/5b41141b826fd3116445fee8

 

Comparing Data from Two Sets of Data, Calling out Differences

https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Comparing-Data-from-Two-Sets-of-Data-C...

 

Compare 2 Data Sets

https://community.alteryx.com/t5/Engine-Works-Blog/Compare-2-Data-Sets/ba-p/88853

 

Data Check Between Two Sources

Community > Getting Started > Alteryx Use Cases > Data Check Between Two Sources

https://community.alteryx.com/t5/Alteryx-Use-Cases/Data-Check-Between-Two-Sources/ta-p/270082

 

Best way to identify unique rows of data

https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Best-way-to-identify-unique-rows-of-da...

 

Comparing data between two different servers

https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Comparing-data-between-two-different-s...

 

 

Chris

 

barkat
6 - Meteoroid

Thank you. Yes I did see those, but these are for two data sets only. I have multiple data sets over 20 that need reconciling. Which is where i am stuck. 

ChrisTX
16 - Nebula
16 - Nebula

Do your 20 data sets have any type of ordering?  How would you perform the task manually?  I'm guessing you would start with dataset #1 then compare it to dataset #2, then what would be your next step, manually?

 

If, for example, you were comparing row #5, column C across 20 datasets.  If the value in C5 was:

dataset 1: C5 = 123

dataset 2: C5 = 444

dataset 3: C5 = 123

dataset 4: C5 = 444

 

Manually, would these be denoted as 4 separate values, or only 2 separate values?  How would you order the datasets before you started your comparison?

 

After you identify the steps for a manual task, you should have the logic for your workflow.

 

Chris

 

CarliE
Alteryx Alumni (Retired)

@barkat ,

 

Would this work? Getting a table that tells you what ID and field is different?

CarliE_0-1633444754027.png

 

Attached is the workflow.

 

If this solution helped, please mark as a solution for other users benefit.

 

Thanks,

Carli
DawnDuong
13 - Pulsar
13 - Pulsar

hi @barkat 

Assuming that the only required output is to detect exceptions (i.e. where there are mismatches), I think the most efficient way is to use a single Join Tool.

if you can make sure that the columns in both files are identical in sequence then you can also use the option "Join by Record Position". However in this case, I have just selected all the available fields.

The L and R outputs will point you to the exceptions where there are not identical matches between 2 books.

ARComm_for_Barkat.PNG

Labels
Top Solution Authors