i all,
I'm quite new to Alteryx and would like to know if someone can help guide me or provide examples to clean this dataset. This data is too hard to me to clean as I'm just a beginner.
I know that I need to combine all of the data from 2014 to 2021 into one. Should I pivot the data and do Vlook up? What should I do? Thank you for your time!
https://tinyurl.com/eeocdataset
Downloads
There are also some labels under page 6-12
You posted this before - https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/If-you-re-good-at-Alteryx-coul...
I gave you the workflow that combines them all together. In addition, I've also mentioned several points here: https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-do-you-handle-the-nulls-An...
What are you trying to do exactly? How granular must your dimensions be?
Hi @caltang
This is the one that I'm expecting to get this table: (please see the picture)
I can't open and run the file that you shared.
Could you tell me how I do this? Thank you! I'm hoping to get the expected results like the table below
The file I gave you is an Alteryx Packaged file. You have to import the package. You may get a warning saying that the version is different, but you can still proceed.
Now the Directory Tool I used will have an error warning because we cannot package Directory tool contents.
That is why I gave you all the YXDB files. You need to create a folder to save all your YXDB files, then configure the Directory Tool to that folder path tou saved all the YXDBs in. Then it’ll work.
This is far more efficient and effective than your multiple input files and using Union to combine them together. Plus processing will be faster.
Please try that first before going on to your main request.
@caltang Could you please guide me on how to insert 8 different CSV file in the same folder? Do we need to manually drag the input and Union them? I'm a bit lost
My output file is only 2 kb ??
@rively90 If the files have different columns, then yeah, you might have to bring them in via separate Input Data Tools so you can do the necessary fixes before you can Union them. There are quite a number of ways to do this, really.
If all of the files have the same columns/template, then you can put them all in one folder, and use just one Input Data Tool to bring all of them into your Alteryx workflow.
@rively90 kindly follow the steps below...
Download all your YXDBs from 2014 to 2021 into a folder which you can remember. Save it somewere simple like your Desktop. You should have something like this:
So you see, I decided to save it in a simple folder in my desktop - which includes the original .xlsx files along with my newly created .yxdb files.
In the attached workflow, I used a Directory Tool to call the data in because the YXDBs all have different schemas. Because of this, I am unable to use the wildcard to call them in successfully. This is why Step 1 is so important, because I need to know where I saved the files.
So, you can seeI changed the Directory path to where I am storing my data in my folder as decided in Step 1. I also made note to the Directory tool to only call YXDB files.
The error you see comes from the Batch Macro I sent you. You need to connect it to a Directory Tool to call the Full Path as below:
Please make sure to select the Macro, then choose FullPath after the Directory tool.
The steps thereafter should combine all your files into 1 - nice and easy.
Please follow the steps above first, once you finish - report back here. Then we will decide what to do with your data thereafter + how you want to proceed to clean the data.
Hope this helps.
Best,
Calvin
@caltang how do I get yxdb from 2014 to 2021? I don't have it. I only have csv???
I gave you the full set last Thursday… https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/If-you-re-good-at-Alteryx-coul...
Please read through the comments, I’ve attached them there for your convenience.