Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Why is performing a JOIN increasing my total DB value?!

pupmup
7 - Meteor

(I'm going to find it hard to upload sample data - it's proprietary and large - but will try if no-one can make sense of the below.)

 

+ I have data - 779k records. Total value: £13m. 

+ I summarise - 55k records. Total value: £13m. 

+ I run a JOIN. Input:

 

LEFT: My 55k records

RIGHT: A reference sheet, in which I'm pulling in only BARCODE and PRODUCT TITLE, joining on BARCODE and pulling in the PRODUCT TITLE on a match.

 

+ The join output:

 

RIGHT: Irrelevant

LEFT: 290 records, SUMMARISE value £8k

JOIN: 68,618 records, SUMMARISE value £15.5m(!!)

 

I simply do not understand how this happened. All the JOIN should be doing is pulling in a text value if it matches barcodes. How is it increasing the number of records and the value in the db?

 

The Join is reading off an Excel - my current theory is that perhaps that reference excel lists the some barcodes twice but with a different text entry for each listing. Would the JOIN then duplicate the record it's matching to in the non-reference document, so it can put a match one with each of the found text entries? (I have checked and this is not the case)

4 REPLIES 4
Aguisande
15 - Aurora
15 - Aurora

Hi @pupmup

What's happening is that you have more than one matching records per concept in one (or both) datasources.

Remember that Alteryx returns Joined records through the J output, and Unjoined ones through the L & R outputs.

 

Maybe if you summarize(Group by) or Unique the BARCODE for the R input, and then try the join, maybe you'll get close to what you need.

Regards

ChillEDog
6 - Meteoroid

simple example but hope it helps

 

suppose you have one row on the left value 100 and it has  barcode "abc"

on the right you have 10 rows, 3 of which have barcode "abc"

the join on barcode "abc" will produce 3 rows, each having value 100

 

is there additional field(s) that can be added to the join to make it unique ?

Aguisande
15 - Aurora
15 - Aurora

Hi @ChillEDog,
For what I understand of your example, the structure of your data is something like this:

Capture1.PNG

What I would do is to Summarize the INVENTORY table, grouping by [barcode] and SUM([units]). That'll give you a unique value for [barcode] with the total of values summed.
Capture2.PNG

It's result is:

Capture3.PNG

Then you can join against your other table.
If I misinterpreted anything of your use case, please let me know, and we'll see it.

pupmup
7 - Meteor

Thank you. I was already summarising the Inventory by barcode at an earlier stage in the process - the duplication only went away when I added a Unique process to the retail price table, based off barcode.

Labels