Get Inspire insights from former attendees in our AMA discussion thread on Inspire Buzz. ACEs and other community members are on call all week to answer!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Compare two data sets and flag for discrepancies

mamzel_juju
6 - Meteoroid

I am trying to compare two data sets which have a lot of values in common. I have browsed the community but nothings is working for me. I want to compare for each employee, if they have the same Project ID and corresponding Hours on both data set. I would just need a list of where they are different. I have tried several workflows but somehow it is giving me duplicate rows within the Project ID column and I cannot further. Thanks for your help.

Data set 1

Employee NameEmp IDProject IDProject TypeHours
A11287026-21C40
A22393417-01C40
A33303955-04C40
A44281392-03C32
A44264592-92P8
A55365196-13P40
A66287026-21X40
A77364636-05X40
A88287026-38L40
A99364636-04F16
A99394642-08C24
A1010394642-08C40
A1111281392-03C40

 

Data set 2

Employee NameEmp IDProject IDProject TypeHours
A11287026-21C32
A11CAP/DFZ-00Z8
A22393417-01C40
A33303955-04C40
A44281392-03C32
A44264592-94P8
A55365196-13P40
A66287026-20X40
A77364636-05X40
A88287026-38L40
A99364636-04F16
A99394642-08C24
A1010394642-08C40
A1111281392-03C40
3 REPLIES 3
BenMoss
ACE Emeritus
ACE Emeritus

You need one tool to perform this job, a join tool, example attached...

 

You are interested in the data from the 'Left' and 'Right' anchors; you could then build some flags about what exactly the issue is, whether it is just completely missing from the other file, or just the amount of hours logged is different.

 

2019-03-13_08-36-06.png

 

Ben

mamzel_juju
6 - Meteoroid

Superstar, it was so simple I feel terrible to have asked!

mamzel_juju
6 - Meteoroid

I am still getting duplicated rows in my data though so in my table it is incorrectly telling me that this employee has all those entries when actually he has 20 hours on 365336-34 and 20 hours on 365336-38 in Data set 1, it is just split differently in Data set 2. how do I get rid for these extra lines

 

A295,787PrincipalC365336-34365336-34 2018Difference in Logged Hours
A295,787PrincipalC365336-38365336-38 2018Difference in Logged Hours
A295,787PrincipalC365336-34365336-34 2018Difference in Logged Hours
A295,787PrincipalC365336-38365336-38 2018Difference in Logged Hours
A295,787PrincipalC365336-34365336-34 208Difference in Logged Hours
A295,787PrincipalC365336-38365336-38 208Difference in Logged Hours
A295,787PrincipalC365336-34365336-34 208Difference in Logged Hours
A295,787PrincipalC365336-38365336-38 208Difference in Logged Hours
A295,787PrincipalC365336-34365336-34 204Difference in Logged Hours
A295,787PrincipalC365336-38365336-38 204Difference in Logged Hours
A295,787PrincipalC365336-34365336-34 204Difference in Logged Hours
A295,787PrincipalC365336-38365336-38 204Difference in Logged Hours
A295,787PrincipalC365336-34365336-28Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-38365336-28Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-38365336-34Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-34365336-38Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-34365336-28Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-38365336-28Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-38365336-34Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-34365336-38Difference in Project ID2018Difference in Logged Hours
A295,787PrincipalC365336-34365336-28Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-38365336-28Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-38365336-34Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-34365336-38Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-34365336-28Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-38365336-28Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-38365336-34Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-34365336-38Difference in Project ID208Difference in Logged Hours
A295,787PrincipalC365336-34365336-28Difference in Project ID204Difference in Logged Hours
A295,787PrincipalC365336-38365336-28Difference in Project ID204Difference in Logged Hours
A295,787PrincipalC365336-38365336-34Difference in Project ID204Difference in Logged Hours
A295,787PrincipalC365336-34365336-38Difference in Project ID204Difference in Logged Hours
A295,787PrincipalC365336-34365336-28Difference in Project ID204Difference in Logged Hours
A295,787PrincipalC365336-38365336-28Difference in Project ID204Difference in Logged Hours
A295,787PrincipalC365336-38365336-34Difference in Project ID204Difference in Logged Hours
A295,787PrincipalC365336-34365336-38Difference in Project ID204Difference in Logged Hours
Labels