Community Spring Cleaning week is here! Join your fellow Maveryx in digging through your old posts and marking comments on them as solved. Learn more here!

Welcome & Introductions

Get to know the people behind the avatar! Introduce yourself, welcome peers, build connections & extend your network.

Take a moment to introduce yourself to the Community. We'd love to get to know you!

Learn More

Data Cleansing

Honghao
5 - Atom

Hi Everyone,

 

I am trying to use an excel file to do some practice with Alteryx.  My goal is to make a data visulazition of the Syrian refugee's family size and person count.  I think only the yellow area is useful.  Since the orginal data is not a standard column and row table,  as a beginner Alteryx user, I am having a hard time to extract the data I wanted.

 

I attached the sample dataset here, can someone show me how will you clean the data and extract the information in the yellow highlight? Thanks a lot! 🙂

 

Honghao_0-1602867488205.png

 

3 REPLIES 3
treepruner
9 - Comet

The headers are really the issue. Use the Sample tool to skip the rows with the header. You could then use a select tool to manually label the fields and a filter to skip the total row.

 

A more automated solution is to connect another Sample tool to the spreadsheet to select the headers and then manipulate them into nice field names and recombine them with the numeric data.  Look for information on the Dynamic select/rename tools. 

 

The beginner Weekly Challenges are a really good place to get started and best of all, the old ones already have solutions! You'll learn a lot looking at the official solution, but people also post their individual solutions.

Honghao
5 - Atom

Thank you so much for your help. The sample tool is a good idea, I didn't think of it. 🙂

 

Thanks for the recommendation of the "Weekly challenges", I will definitely check that out.

 

All the best!

Hi ,

 

I have create one sample workflow. I hope it will work for you.

Labels