Don't forget to register for our panel discussion with Dean Stoecker, Amy Holland, and Mark Frisch occurring next Wednesday, June 1!

2022-05-26 Updates: Email: If you're not seeing emails be delivered from the Community, please check your spam and mark the Community emails as not junk. Thank you for your patience.

Alteryx Designer Ideas

Share your Designer product ideas - we're listening!
Submitting an Idea?

Be sure to review our Idea Submission Guidelines for more information!

Submission Guidelines

Xgboost Regression as a predictive macro

XGboost regression is now the benchmark for every Kaggle competition and seems to consistently outperform random forest, spline regression, and all of the more basic models. For those of us using predictive modeling on a regular basis in our actual work, this tool would allow for a quick improvement in our model accuracy. And I think, from a marketing standpoint, having a core group of users competing in Kaggle using Alteryx would be a great way to show off Alteryx's power.

 

It is readily available as an R package: https://cran.r-project.org/web/packages/xgboost/index.html

29 Comments
PedrodeOl
9 - Comet

@TimothyL 

 

Thank your for sharing with me!

JohnJPS
15 - Aurora

From my angle (submitter of the idea), this could be marked as completed/implemented: the gallery solution given by @TimothyL is great.

Thanks!

 

marco_zara
8 - Asteroid

XGBoost, Catboost and LightBGM are seriously needed.

Atabarezz
13 - Pulsar

great work here!

Community_Admin
Alteryx
Alteryx
Status changed to: Inactive
 
Community_Admin
Alteryx
Alteryx

The status of this idea has been changed to 'Inactive'. This status indicates that:

 

1. The idea has not had activity in the form of likes or comments in over a year.

2. The idea has not reached ten likes.

3. The idea is still in the 'New Idea' status. 

 

However, this doesn't mean your idea won't be implemented! The Community can still like and comment on this idea. With enough renewed interest, this idea can be brought back into the 'New Idea' status. 

 

Thank you for contributing to the Alteryx Community and the Alteryx Product Idea Boards!

KylieF
Alteryx Community Team
Alteryx Community Team
Status changed to: New Idea

This idea is being returned to the New Idea status, as it has reached the required number of likes to be brought to our product teams attention. Please note it may take time for our product team to provide a response. However, we will do our best to insure this idea is updated accordingly.

 

Thank you for posting and contributing to the product idea boards!

 

bkramer66_dup_418
6 - Meteoroid

I am recommending/requesting that Alteryx add an XG Boost tool written in R to the Predictive toolset. I have just finished productionizing a classification model using the Boosted tool and while it performs well, I derived better predictions using XGBoost (in another software on the same data). I am aware that the Intelligence Suite now has XGB in it...but that is at an additional cost and, quite frankly, after having tested it, more difficult to productionize.

 

A couple of points:

1. Every other toolset that I have used has an XGBoost algorithm as part of the standard package (SPSS Modeler, Statistica, RapidMiner);

2. XGoost is arguably the leading algorithm out there for many/most classification problems; it has been in the winning solution in a disproportionate number of Kaggle competitions (40%-50%?);

3. It is a bit gut-wrenching to tell my colleagues, "No, there is not a supported XGB tool in Alteryx" when that is seen as a litmus test for a DS platform or tool.

4. Yes, we can use R or Python and that is precisely what I will do in iteration 2 of the model...but having the tool already exist would save significant time, especially when building and testing models.

 

Thank you for the consideration!

 

 

Joe_Lipski
13 - Pulsar
13 - Pulsar

Hi @bkramer66_dup_418 

 

I agree it would be nice to have an XGBoost as part of the native R predictive tools, given it's growing popularity. However, I'm pleased to let you know that there are two ways to build XGBoost models in Alteryx out of the box:

 

1) Using Alteryx's new Intelligence Suite, where XGBoost is one of the four model options:

joe_lipski_0-1596612329268.png

 

2) Using an opensource macro published to the Alteryx Gallery by @TimothyL - I haven't used it, so can't verify it's success, but I suggest you give it a go: https://community.alteryx.com/t5/Data-Science/Expand-Your-Predictive-Palette-XGBoost-in-Alteryx/ba-p...

 

More predictive tools that are not natively installed can be found here: https://gallery.alteryx.com/#!districts/56327e37aa690a17f0760bdc

 

Thanks,

Joe

bkramer66_dup_418
6 - Meteoroid

Hi Joe,

 

Thank you for the comment and suggestions. As I mentioned in my original post, I tried the Intelligence Suite and actually used XGB and of course, it outperformed the other three algos. The challenge is that 1) I ran into a bug that we couldn't resolve, 2) it comes at an additional $2K+ price tag, and 3) because of the way it works (guided data cleansing and feature selection), I was leery of trying to productionize it. And of course to do so, we would have had to bite the bullet on procuring the suite. Don't get me wrong - I am anxious for the IS to develop and will probably advocate for its procurement in the next iteration or two.

 

Secondly, I have run the opens source macros from Timothy (who is awesome) in both R and Python. I could never get the Python one to run, largely because of our environment. The R package ran well but the predictions were significantly different from what the IS produced as well as from what the Boosted model produced. And I mean significantly, as in almost opposite probabilities. I reached out to Timothy directly and we had some communication around it but it remains unresolved on why the predictions are so disparate from what the IS and SPSS Modeler produced. I thought it may be reading the label backwards, or the probabilities are reversed or something...but who knows. I am grateful for what is out there but I think Alteryx needs to invest some development in making an R-based one more robust.

 

Thanks for the other tools not natively installed - I do have numerous ones (Model Comparison, TS Factory) installed and that I have used.

 

Thanks again for the comment and for your contributions to the community - I have read many of your posts and they have been so helpful!

 

Brett