I am trying to compare Old vs New records in a huge dataset of 973,000 rows and 85 columns.
I already tried the Transpose tool (which is not helpful, as it's creating over 27 million rows), and I don't want to create 85 separate columns, one per comparison, using the Formula tool.
So, is there a quicker way to calculate the Match or Mismatch result in a new column?
Sample data is shown in the screenshot, and the desired output is shown below:
Column A | Column B | Variance/ DELTA/ Comparison | Column C | Column D | Variance/ DELTA/ Comparison |
10 | 20 | Mismatch | Apples | Apples | Match |
5 | 5 | Match | Banana | Strawberry | Mismatch |
37 | 35 | Mismatch | Kale | Celery | Mismatch |
100 | 100 | Match | Mango | Mango | Match |
You've already discovered it: the Transpose tool. I use it regularly for delta checks on over 100 million rows. Alteryx should be able to handle that just fine (assuming you have the recommended RAM and CPUs). The only other way I can think of would be a batch macro that transposes one column at a time and combines the results.
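
To make the transpose-and-compare logic concrete, here is a rough sketch in plain Python rather than Alteryx. It is only an illustration: the sample rows are copied from the table in the question, and the `pairs` list, the `transpose_compare` function, and the output field names are all made up for this example. It turns each wide record into one long row per old/new column pair and flags it as Match or Mismatch, which is roughly what a Transpose followed by a single comparison formula would do.

```python
# Sample rows copied from the table in the question.
rows = [
    {"Column A": 10, "Column B": 20, "Column C": "Apples", "Column D": "Apples"},
    {"Column A": 5, "Column B": 5, "Column C": "Banana", "Column D": "Strawberry"},
    {"Column A": 37, "Column B": 35, "Column C": "Kale", "Column D": "Celery"},
    {"Column A": 100, "Column B": 100, "Column C": "Mango", "Column D": "Mango"},
]

# Assumption: we know which old column pairs with which new column.
pairs = [("Column A", "Column B"), ("Column C", "Column D")]


def transpose_compare(rows, pairs):
    """Yield one long-format record per (row, column pair) with a Match/Mismatch flag."""
    for record_id, row in enumerate(rows, start=1):
        for old_col, new_col in pairs:
            old_value, new_value = row[old_col], row[new_col]
            yield {
                "RecordID": record_id,
                "OldField": old_col,
                "NewField": new_col,
                "OldValue": old_value,
                "NewValue": new_value,
                "Result": "Match" if old_value == new_value else "Mismatch",
            }


results = list(transpose_compare(rows, pairs))

# Keeping only the mismatches shrinks the long table considerably.
mismatches = [r for r in results if r["Result"] == "Mismatch"]

for r in mismatches:
    print(r["RecordID"], r["OldField"], r["OldValue"], r["NewValue"])
```

Because the function is a generator that handles one column pair at a time, it also mirrors the batch macro idea: each pair is processed independently and the results are combined at the end.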