A/B test results:
Parameterized datasets X and Y are identical except for their default parameter values.
A -> Parameterized dataset X is used as input for flow 1 and runtime parameters (p1='2000', p2='01', p3='01') are passed in
B -> Parameterized dataset Y is used as input for flow 1 and runtime parameters (p1='2000', p2='01', p3='01') are passed in
From what I understand, the results for flow runs A and B should match, but they don't (row and/or column differences, etc.).
Hi @Remi Shah,
We need more information to understand the issue. Please provide detailed answers to the queries below:
Best,
Apeksha Prasad
1 - The column names are the same, but they are ordered differently in case A (correct output) and B. In B, one column ('col_x') is in column position 6, but in A 'col_x' is the last column.
2 - The total row count matches in both output files
3 - There is a mismatch in the schema between A and B (see point 1). 'col_x' moves from position 6 in the B output file to position -1 (the last column) in the A output file.
4 - Here's an example of the output differences between A and B:
It's quite complex to show the differences, but here you can see that both outputs match up to col_e. After that point the schema changes and some columns are shifted across (col_h in A is shifted to col_f in B), and col_x in B only has NULL values.
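In case it helps to pin the differences down, here is a minimal Python sketch of how the two output files could be compared. It assumes both outputs are exported as CSV with a header row, which may not match your setup. The function names and the tiny sample data are hypothetical and only mimic the symptom described above (one column at a different position and NULL-only in B).

```python
import csv
import io


def column_positions(header):
    """Map each column name to its zero-based position."""
    return {name: idx for idx, name in enumerate(header)}


def moved_columns(header_a, header_b):
    """Return {name: (pos_in_a, pos_in_b)} for shared columns whose position differs."""
    pos_a = column_positions(header_a)
    pos_b = column_positions(header_b)
    return {
        name: (pos_a[name], pos_b[name])
        for name in header_a
        if name in pos_b and pos_a[name] != pos_b[name]
    }


def all_null_columns(header, rows, null_values=("", "NULL")):
    """Return the columns whose value is null in every row (assumed null markers)."""
    return [
        name
        for idx, name in enumerate(header)
        if rows and all(row[idx] in null_values for row in rows)
    ]


def read_csv_text(text):
    """Parse CSV text into (header, rows)."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    return header, list(reader)


# Hypothetical miniature outputs, not the real data from runs A and B.
output_a = "col_a,col_b,col_c,col_x\n1,2,3,9\n4,5,6,8\n"
output_b = "col_a,col_x,col_b,col_c\n1,NULL,2,3\n4,NULL,5,6\n"

header_a, rows_a = read_csv_text(output_a)
header_b, rows_b = read_csv_text(output_b)

print(moved_columns(header_a, header_b))   # columns at different positions
print(all_null_columns(header_b, rows_b))  # columns that are NULL-only in B
```

To run this against the real files, replace the sample strings with the contents of the A and B output files (for example via `open(path).read()`). Any column reported by both checks would be a good candidate to investigate first.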