Hey Alteryx,
I've hit a little wall so I'm reaching out for help.
I'm parsing and prepping some data for a client, who is testing some duplicate billing.
I have a list of around 22k transactions with Project Invoices, Billed amounts, Cost Codes, Dates, Billing Name, Labor Type, and Hours billed. (few other fields if needed).
The only logical way I can think of doing it with tools, would be to sort the data by Master Name, Document Date, and Billed Amount;
Then run and if else as such:
IF
([billed amount] == [row+1:billed amount]
AND [document date] == [row+1:document date]
AND [cost code description] == [row+1:cost code description]
AND [master name] == [row+1:master name]
AND [Labour type] == [row+1:Labour Type]
AND [Qty] = [row+1:Qty])
THEN "YES"
ELSE "NO"
ENDIF
This will match all the billed amounts, BUT WAIT!!! There are bill backs also. IE They bill 5 hours then -5 hours after. So I added a case of:
OR
([billed amount] == ([row+1:billed amount] * -1)
AND [document date] == [row+1:document date]
AND [cost code description] == [row+1:cost code description]
AND [master name] == [row+1:master name]
AND [Labour type] == [row+1:Labour Type]
AND [Qty] = ([row+1:Qty] * -1))
Right now I feel a little far down the rabbit hole and wanted to take a step back to see if anyone would do this any differently.
SAMPLE TAKEN DOWN