Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.
Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Duplicate invoice / payment or vendor number

Hussein982
8 - Asteroid

Hi All: 

 

I have an invoice and payment file (see example below for your reference). My end goal is to find if there was a duplicate invoice #, duplicate vendor # or duplicate payment were made throughput the year.

 

My issue in that  the file list all invoices on a header level. What does this mean is that same invoice number appears multiple time due to the fact that the invoice  was breakdown by line by line item (different products purchased within the same invoice number) (see below table for your reference): 

 

Thank you in advance.

 

Vendor Name Vendor #Invoice number INVOICE DATEINVOICE_AMOUNTPayment amount Line type 
A200073501031162020-11-30 100100L1
A200073501031162020-11-30 100100L2
A200073501031162020-11-30 100100L3
       
B15202020-5-25 200 200L1
B15202020-5-25 200 200L2
B15202020-5-25 200 200L3
3 REPLIES 3
Kenda
16 - Nebula
16 - Nebula

Hi @Hussein982 

 

If you use a summarize tool first to group your data together, you can combine line items that all belong to the same invoice. From there, you can use a unique tool to check for duplicates. 

 

Kenda_0-1617126861576.png

 

Hussein982
8 - Asteroid

@Kenda Thank you Kenda for your quick response. I did follow it and it worked but as you can see below the invoice number 001/2020 appears multiple times but for different suppliers (so false positive).

 

How can I get rid of these so to pick real duplicate invoices if applicable/any? 

 

 

 

 

Hussein982_0-1617129628605.png

 

Kenda
16 - Nebula
16 - Nebula

Hey @Hussein982 

 

In the summarize tool, when you "Group By" a field, Alteryx will combine the records with identical values in that field into a single record. Therefore, by grouping by multiple fields, each unique combination of those fields will result in a separate row. 

 

In your case, if you don't want the level of detail where it's creating multiple rows that you don't want to see, do not include that field in your summarize tool.

 

For example, if this is my input data:

GroupSub GroupField3
Aa1
Aa1
Ab1
Bb1
Bb1

 

Here is how the data will look if I group by both the Group and Sub Group fields:

GroupSub GroupSum_Field3
Aa2
Ab1
Bb2

 

Here is how the data will look if I only group by the Group field:

GroupSum_Field3
A3
B2

 

And here is how the data will look if I only group by the Sub Group field:

Sub GroupSum_Field3
a2
b3

 

 

Hopefully this provides you with enough information to apply to your problem here.

Labels
Top Solution Authors