Hi all
I’m currently optimising some older but very complex Alteryx workflows, and I’m trying to establish a standard methodology for ensuring that the output of my newer, optimised workflows exactly matches the output of the originals.
I’m unsure if there is a standard methodology recommended by Alteryx experts for such a task.
Previously, I simply output a CSV file of all the data from my new, optimised workflow, joined it column by column to the data from the older workflow, and checked how many matches there were.
If every record matched, I assumed that the two workflows produce essentially the same output.
However, my intuition is that this approach isn’t robust enough, so I thought I’d ask here.
On other platforms (e.g. R) I have often checked whether two data frames contain the same rows by using anti-joins in both directions.
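To illustrate the kind of check I mean, here is a rough sketch in Python rather than Alteryx. The function names and the sample data are made up for illustration, and it assumes both outputs have been exported as CSV with the same column order. It counts each distinct row on both sides and reports what is left over in either direction, so duplicated or missing rows show up even when a plain join would still find a match.

```python
import csv
import io
from collections import Counter


def read_rows(csv_text):
    """Parse CSV text into a list of rows (each row is a list of strings)."""
    return list(csv.reader(io.StringIO(csv_text)))


def anti_join_both_ways(old_rows, new_rows):
    """Return (only_in_old, only_in_new) as Counters keyed by row tuple.

    Rows are compared as whole tuples, ignoring row order, and the counts
    keep track of duplicates: a row that appears twice in one output but
    once in the other is reported with a count of 1.
    """
    old_counts = Counter(tuple(row) for row in old_rows)
    new_counts = Counter(tuple(row) for row in new_rows)
    # Counter subtraction keeps only the positive differences.
    return old_counts - new_counts, new_counts - old_counts


# Hypothetical sample outputs standing in for the two workflow exports.
old_csv = "id,amount\n1,10.0\n2,20.0\n2,20.0\n"
new_csv = "id,amount\n2,20.0\n1,10.0\n3,30.0\n"

only_in_old, only_in_new = anti_join_both_ways(
    read_rows(old_csv), read_rows(new_csv)
)
print("Only in old output:", dict(only_in_old))
print("Only in new output:", dict(only_in_new))
```

If both results are empty, the two outputs contain the same rows the same number of times. Note that values are compared as text here, so "10" and "10.0" would count as a difference; whether that is desirable depends on the workflow.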
Many thanks in advance for your assistance.
D