Data Science

Machine learning & data science for beginners and experts alike.
DrDan
Alteryx Alumni (Retired)

The 7.1 release of Alteryx contains 17 new macros that enable data artisans to easily undertake predictive analytics projects. The tools can be placed into several different groups based on the business analytics functions they perform. The groups are predictive modeling, model assessment, grouping, data investigation, and data sampling for predictive analytics. This post is the second in a 5-part series will focus on the model assessment macros.

Model Assessment

The model assessment macros in Alteryx 7.1 allow a data artisan to both fine-tune a model under development, and determine which of several possible candidate models is most effective at predicting new data. The model assessment macros are:

Alteryx Predictive ToolNested Test

Alteryx Predictive ToolStepwise

Alteryx Predictive ToolLift Chart

The macros that help fine-tune the development of a predictive analytics model are focused on linear and logistic regression models, since decision tree and forest models do this implicitly. The Nested Test and Stepwise macros help the data artisan fine-tune a regression model with a large number of variables and reduce it down to the set of variables that are most important based on statistical criteria. The Nested Test macro does this in a selective fashion, in which a data artisan compares two models (one of which contains a subset of the predictor fields contained in the other) and determines whether the set of fields that are different between the two models can safely be removed. The Stepwise tool takes an initial regression model with a large number of predictor fields, and finds the best subset of those predictors using automated methods.

 

The Lift Chart tool allows a data artisan to compare the performance of different models, likely developed using different algorithms, on the metric that matters most, the ability to predict new data (i.e., data that was not used to create the models). In addition, lift chart tools allow a data artisan to quickly develop a high-level economic analysis of a proposed action.

 

In my next post, I will focus on the grouping macros which assist the data artisan in grouping both "units" (customers, outlets, transactions, etc.) and fields to form customer segments, grouping outlets together based on trade area characteristics in order to implement merchandising or pricing programs, or grouping transactions together to look for outliers that might signal fraudulent activity.

Dan Putler
Chief Scientist

Dr. Dan Putler is the Chief Scientist at Alteryx, where he is responsible for developing and implementing the product road map for predictive analytics. He has over 30 years of experience in developing predictive analytics models for companies and organizations that cover a large number of industry verticals, ranging from the performing arts to B2B financial services. He is co-author of the book, “Customer and Business Analytics: Applied Data Mining for Business Decision Making Using R”, which is published by Chapman and Hall/CRC Press. Prior to joining Alteryx, Dan was a professor of marketing and marketing research at the University of British Columbia's Sauder School of Business and Purdue University’s Krannert School of Management.

Dr. Dan Putler is the Chief Scientist at Alteryx, where he is responsible for developing and implementing the product road map for predictive analytics. He has over 30 years of experience in developing predictive analytics models for companies and organizations that cover a large number of industry verticals, ranging from the performing arts to B2B financial services. He is co-author of the book, “Customer and Business Analytics: Applied Data Mining for Business Decision Making Using R”, which is published by Chapman and Hall/CRC Press. Prior to joining Alteryx, Dan was a professor of marketing and marketing research at the University of British Columbia's Sauder School of Business and Purdue University’s Krannert School of Management.