
SusanCS
Alteryx Alumni (Retired)

The leaves may be changing colors, but the enthusiasm for data conversation on our Data Science Portal is evergreen!




Image via GIPHY



Mandatory seasonal reference: check. 🍁 Now let’s jump right into September’s top data science conversations.



Time Series Tips

A few good questions came up in September about using time series tools in Designer. First, what if you have multiple time series models that you want to use jointly to generate forecasts? @Fierel raised that question, and @vsoni suggested checking out the TS Forecast Factory Tool.

 

@Aleks_Data also asked how to retrieve the residuals generated when a model is fitted (typically, residuals are the differences between observed values and fitted values). @NeilR and @apathetichell conspired to come up with the right solution for the job, ultimately finding resolution in a blast-from-the-past post by @cwkoops from 2019.
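For readers who want to see the definition in code, here is a minimal sketch of residuals as observed minus fitted values. It is not the Designer workflow from the thread; the function name `residuals` and the sample numbers are made up for illustration.

```python
def residuals(observed, fitted):
    """Return observed minus fitted, element by element."""
    if len(observed) != len(fitted):
        raise ValueError("observed and fitted must have the same length")
    return [o - f for o, f in zip(observed, fitted)]


# Hypothetical values: what actually happened vs. what a model fitted.
observed = [112.0, 118.0, 132.0, 129.0]
fitted = [110.0, 120.0, 130.0, 131.0]

resid = residuals(observed, fitted)
print(resid)  # [2.0, -2.0, 2.0, -2.0]
```

A positive residual means the model fitted a value below what was observed; a negative one means it fitted a value above.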




 If you can’t tour the chocolate factory, the forecast factory will have to do. Image via GIPHY



Tool Palette Trick: Save Your Faves

If you have favorite macros like TS Forecast Factory, why not make them permanent features of your Designer tool palette? @tomtveidt asked how to keep those tools handy in Designer, and @Garabujo7 hopped in with the magic to make that happen. Grab your favorite data science tools and customize your tool categories for convenience with this awesome Designer trick.




 Custom homes for your tools. Image via GIPHY



Dataframes in Columns in Dataframes

@Hamder83 ran into an interesting challenge when using the Python package tabula to extract data from a PDF within Designer. @clmc9601 diagnosed the issue, detecting that pandas was combining dataframes of different sizes within single columns. @dbmurray dropped by to mention that tabula is also available as an R package, for those who prefer that flavor of code.
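The thread's exact fix isn't reproduced here, but the general situation can be sketched. Assuming the PDF extraction step hands back a list of pandas DataFrames with different shapes (the two small tables below are hypothetical stand-ins, not real tabula output), it helps to inspect each table's shape before combining them explicitly:

```python
import pandas as pd

# Hypothetical stand-ins for tables pulled from a PDF; shapes differ on purpose.
tables = [
    pd.DataFrame({"item": ["A", "B"], "qty": [1, 2]}),
    pd.DataFrame({"item": ["C", "D", "E"], "qty": [3, 4, 5], "price": [9.5, 8.0, 7.25]}),
]

# Look at each table on its own before combining anything.
for i, table in enumerate(tables):
    print(i, table.shape)

# Stack rows explicitly; columns missing from a table are filled with NaN.
combined = pd.concat(tables, ignore_index=True, sort=False)
print(combined)
```

Stacking with `pd.concat` keeps one value per cell, so mismatched tables show up as visible missing values rather than as whole dataframes tucked inside a column.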




 Dataframes getting squished. Image via GIPHY



Outlier Observation

Finally, @ArnabSengupta posted questions about observing and dealing with outliers in your dataset, and received two great macro suggestions: @MarqueeCrew offered an outlier detection macro, and @mst3k mentioned the hidden z-score macro in Designer as well. In the Alteryx Intelligence Suite, the Data Health Tool could also be useful; check out this Data Science Blog article about it and other methods of contending with outliers.
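To make the z-score idea concrete, here is a small sketch using only Python's standard library. It is a generic illustration, not the logic of either macro mentioned above; the function names, the sample data, and the cutoff of 3 are all assumptions chosen for the example.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Return each value's distance from the mean in population standard deviations."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All values are identical, so nothing deviates from the mean.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def flag_outliers(values, threshold=3.0):
    """Return the values whose absolute z-score exceeds the threshold."""
    return [v for v, z in zip(values, z_scores(values)) if abs(z) > threshold]


# Hypothetical data with one obviously extreme value.
data = [10, 12, 11, 13, 12, 11, 10, 12, 11, 13, 12, 95]
print(flag_outliers(data))  # [95]
```

Keep in mind that an extreme value inflates the mean and standard deviation it is measured against, so in very small samples a z-score cutoff of 3 may never trigger.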

 

It’s always great to see these helpful, thoughtful conversations. We also had a super fun Data Science Mixer podcast chat in September with Dr. Heather Lynch, whose research on wildlife in Antarctica could inform your data work in business in surprising ways. 

 

Stay tuned for more compelling articles, podcast episodes and discussions by keeping up with the Data Science Portal. See you there!



Blog teaser photo by Steven Wright on Unsplash.

Susan Currie Sivek
Senior Data Science Journalist

Susan Currie Sivek, Ph.D., is the data science journalist for the Alteryx Community. She explores data science concepts with a global audience through blog posts and the Data Science Mixer podcast. Her background in academia and social science informs her approach to investigating data and communicating complex ideas — with a dash of creativity from her training in journalism. Susan also loves getting outdoors with her dog and relaxing with some good science fiction. Twitter: @susansivek

Comments
Aleks_Data
6 - Meteoroid

Loving the post and the format! Cheers Susan 🤗

SusanCS
Alteryx Alumni (Retired)

Glad to hear it, @Aleks_Data! Thank you for your participation! 🌟 So fun to read through all these great discussions.

dbmurray
8 - Asteroid

Thanks for the shout out @SusanCS !