Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Tool Mastery

Explore a diverse compilation of articles that take an in-depth look at Designer tools.
Become a Tool Master

Learn how you can share your expertise with the Community

LEARN MORE

Tool Mastery | Summarize

MattD
Alteryx Alumni (Retired)
Created

1.png

This article is part of the Tool Mastery Series, a compilation of Knowledge Base contributions to introduce diverse working examples for Designer Tools. Here we’ll delve into uses of the Summarize Tool on our way to mastering the Alteryx Designer:

 

Did you know the average football game lasts 3 hours and 12 minutes and only amounts to roughly 11 minutes of play? Now, I love trying to eat Doritos through my TV screen as much as the next guy, but for me the highlights are definitely a better watch. The Summarize Tool would probably agree - the most effective communication of your data is the most concise summary of it. Whether it’s concatenating strings for storage, merging reports to have better readability, getting your spatial objects to interact, or even calculating averages and other formulas on groupings of data, the Summarize Tool can reframe your data to be more informative. This article provides a few examples on how.

 

  • The Summarize Tool has utility for nearly every data type – numerics, strings, spatial objects, behavior profiles, and even reporting elements. Just to name a few of the useful actions (from the full list of summary processes available here) numerics can be summed, averaged, converted into percentiles, or used to find standard deviation. They can also be used in a number of financial calculations. Strings are easily concatenated with the delimiter of your choice, or
    identified as the longest or shortest strings in your groupings. If you have a knack for spatial, you can combine your spatial objects or use them to create intersections, bounding rectangles, convex hulls, and centroids as well. You can even use the tool to count the records you have in each respective group, or identify their minimum (min) or maximum (max) values. While not required, these processes, or actions, can be aggregated across groupings specified in your data by using the “GroupBy” action on one or more fields. This applies the action to sets of records in your dataset that share a unique value in your chosen group by field. For your reference, we’ve attached a v19.4 workflow (Summarize Tool.yxmd) demonstrating uses for each of the applicable data types the Summarize Tool can transform.
  • If you like the granularity of information your summarized data provides, but want it to supplement your original data set, you can join the two together using the GroupBy field as your join field (be sure to deselect redundant fields in the join!). The attached v19.4 Summarize Count Field Append.yxzp provides an example of this(see the addition of the “CustomerSegment Count” field below):

2.png

 

3.png

 

  • The Summarize Tool can also be used to reshape your data – much like another Transform Tool, Cross-Tab, where rows that are first concatenated can be easily converted to columns (with the Text to Columns Tool). Summarize Reshape.yxzp, attached, shows this reshape in v19.4:

 

4.png

 

5.png


Lastly, if you love using the Summarize Tool but find yourself spending time renaming fields after it adds prefixes for each action (you have high standards, we know, it’s why you use Alteryx), try emulating the approach in the attached v19.4 workflow, Trim Summarize or Join Prefixes.yxzp, to remove them.


By now, you should have expert-level proficiency with the Summarize Tool! If you can think of a use case we left out, feel free to use the comments section below! Consider yourself a Tool Master already? Let us know at community@alteryx.com if you’d like your creative tool uses to be featured in the Tool Mastery Series.

Stay tuned with our latest posts every Tool Tuesday by following Alteryx on Twitter! If you want to master all the Designer tools, consider subscribing for email notifications.

Additional Information
Click on the corresponding language link below to access this article in another language -
Portuguese
Spanish
French
German
Japanese

Attachments
Comments
brad_j_crep
8 - Asteroid

How does it work when grouping by more than one variable?  Does it group by the first then the second or does it take both joined together on the data? 

 

Thank you,

petgrap
8 - Asteroid

After opening, I am not able to see data in the input source. Please advise how to see the data.

DanM
Alteryx Community Team
Alteryx Community Team

 @petgrap which package are you seeing this issue? Did you run the workflow after installing the package? what version of Alteryx are you on?

 

Thanks

DanM

haider
5 - Atom
I had a similar problem as that faced by @petgrap. I cannot see any source data. I am using 11.7 Is it possible to share the data file?
vaughangary
8 - Asteroid

Thanks MattD!  Regarding the comment:  "Lastly, if you love using the Summarize Tool but find yourself spending time renaming fields after it adds prefixes for each action (you have high standards, we know, it’s why you use Alteryx), try emulating the approach in the attached v10.6 workflow, Trim Summarize or Join Prefixes.yxzp, to remove them."

 

Would be nifty - "particularly good, skillful, or effective", if we could just check a box to rename fields (or not).  There is always a work around, but this seems like a pretty obvious feature addition.

 

Regards,

Gary

mbooker
6 - Meteoroid

Hi, the data files for the input data task are not in the downloads. Where can I find them please?

DanM
Alteryx Community Team
Alteryx Community Team

You should be able to find the sample data for these workflows here C:\Program Files\Alteryx\Samples. The data is part of the sample data that you receive with the software.

pawel_zych
7 - Meteor

Hello,

I receive an error while downloading the package: CustomerFile4.yxdb.

Could you please correct the package or guide me how I should use it.

Thanks

Pawel ych

danespoors
8 - Asteroid

Hi Brad_j_crep,

 

The group by tool will follow the order of grouping just like it does with SQL. It's important to know what is the most important field that you're grouping by and have that at the top.

 

e.g. if you're doing a count of people in a country with the same surnames, you'd need to group by country first then group by the surnames before doing a count.

 

Hope that helps,

 

Dane.

 

subhajits11
8 - Asteroid

Hi Guys,

Facing issue with data source data, kindly use this links

Summarize Reshape.yxzp-->  Summarize Reshape  

 

Captureer.PNG

cs11
8 - Asteroid

Use cases: these are minor sub-cases for the ones you've listed above, but as a beginning Alteryx user I find I do use these all the time, so will list here. Use the summarize tool as a starting point for data exploration, ie, duplicate an R fivenum() or summary() function. Second, use it as part of the audit process, generally to recreate an R identical() function to compare two arrays, first creating Boolean expressions to compare cell by cell then using summarize as a checksum. The Alteryx Boolean function is defined as 0 or non-zero for False/True, which doesn't automatically lend itself to summing the True cases, but if the column is converted to Byte it can then be summed - at least so far as I've been able to ascertain thus far the True always converts to a 1.

UgoAgo
Alteryx Alumni (Retired)

@subhajits11  I found the correct location at C:\Program Files\Alteryx\Samples\data\OneToolData\

sangeethadetne
7 - Meteor

hello all,

 

I am trying to retrieve columns along with the other columns of the table. I am not able to do it using summarize. Pls suggest me the way?

 

sangeethadetne_0-1592367526418.png

 

sangeethadetne
7 - Meteor

 

Hello  

 

 

 

 

 

groupbysummarize.PNG

 

 
rajatsadan
8 - Asteroid

 

Hi everyone,

 

I am facing issues in locating the input data files for the Summarize.yxzp exercise. I was able to retrieve the first input data location from another location, however for the 'Create Profile' location, could anyone tell me in which folder can I find the dependency? Thanks!

 

rajatsadan_0-1601089225484.png

 

rajatsadan_1-1601089379631.png

 

 

srilakshmi123
7 - Meteor

After opening, I am not able to see data in the input source. Please advise how to see the data.