Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Compare two columns on 4 datasets for trends/ changes

negurai
6 - Meteoroid

Hi everyone, I'm new to Alteryx and also been a while since looking at any workflows or done any training.

 

  • I have 4 sheets/ data sources (the format is exactly the same on all 4) and I've two columns I want to look at for trends/ changes, so as an example:

 

Column A/ Column B

Score (1 to 10)/ Unique Identifier 

 

  • The unique identifiers are on all 4 sheets, but the scores may be changing, or staying the same (so need to find out what's changing)

 

  • I have added all 4 datasets on my canvas, what's the best design for me at this stage please?

 

  • I have been looking at Join Multiple but I think my problem is configuration, as I'm struggling to set it up correctly so it gives me what I need.


Many thanks,


Ina

 

8 REPLIES 8
negurai
6 - Meteoroid

Workflow Start

BenoitC
Alteryx
Alteryx

Hello,

 

If you want to aggregate all your sheets into one dataset, you can simply replace the name of the sheet by a wildcard (*), it will read all files and aggregate them into one dataset.

 

BenoitC_0-1649162341922.png

 

This will work if you have the same column names on all files.

 

 

To clarify, JOIN are used to merge datasets horizontally, and UNION are used to merge datasets vertically (piling them up together).

 

Does this help?

Benoit Conley

Sales Engineer
Alteryx, Inc.

negurai
6 - Meteoroid

Hi Benoit,

 

Thank you for the quick reply. My apologies, think I'm getting myself confused when it comes to Join and Union.

I've now used Union and I've all data in one vertical set but unsure where to go from here?

 

Aware this is a painful post to read through, but it's really been a while.

 

Thanks,

Ina

Luke_C
17 - Castor
17 - Castor

Hi @negurai 

 

Since the tabs have the dates in them, maybe include the file name as a field so you can parse out the date. Then cross tab grouping by the unique IDs and making a column for each date populated with the scores. 

 

Another option might be to sort the data by date/unique ID and then use a multirow tool to take the difference between each row.

negurai
6 - Meteoroid

Thanks everyone for your help, and sorry I am a bit slow here. I've attached a data sample which is pretty much a replica of what I'm working with, along with a screenshot of my canvas. Can I check I'm on the right track here? I am trying to assess whether scores are changing over time or not based on 'ID' (unique identifier) and perhaps generate a graph.

Luke_C
17 - Castor
17 - Castor

Hi @negurai 

 

Here's an example of the two items I mentioned:

 

Luke_C_0-1649176140558.png

Luke_C_1-1649176163024.png

 

negurai
6 - Meteoroid

Wow thank you Luke, virtual coffee incoming!

 

I don't have the latest version of Alteryx under my key, and it won't let me open the workflow, is there any chance you could attach one last screenshot please so I can zoom in and try and replicate your solution :)


Many thanks,


Ina

Luke_C
17 - Castor
17 - Castor
Labels