Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Ideas

Share your Designer Desktop product ideas - we're listening!
Submitting an Idea?

Be sure to review our Idea Submission Guidelines for more information!

Submission Guidelines

Data scientists, which libraries would you like as Alteryx tools?

It feels that lately Alteryx has been focusing on integration rather than adding more machine learning tools, which sadly are still not on par with many competing products...

Personally I miss having XGboost and multi-core random forest libraries like Ranger (along with a more robust implementation of C5.0).

 

What about you guys? Which R/Python libraries are you missing in Alteryx?

7 Comments
steven_tally
5 - Atom

XGboost should definitely be included.  The R Keras interface with TensorFlow back end and everything that goes with that would also be really nice.

TimothyL
Alteryx Alumni (Retired)

@marco_zara 

Here is the latest XGBoost macro in both R & Python version: https://gallery.alteryx.com/#!app/XGBoost/5ca42dfe826fd31188cd0f08

 

Give a try and let me know if any issue. Will publish a blog to explain the feature soon

marco_zara
8 - Asteroid
Hi, looks like I need some R libraries, Python I'm not sure: Error: XGBoost_R (430): Tool #1: Error in library(readr) : there is no package called 'readr' Error: XGBoost_R (430): Tool #1: The R.exe exit code (1) indicated an error. Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) in 2 # First XGBoost model for Pima Indians dataset 3 from numpy import loadtxt ----> 4 from xgboost import XGBClassifier 5 from sklearn.model_selection import train_test_split 6 from sklearn.metrics import accuracy_score ModuleNotFoundError: No module named 'xgboost' Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) in ----> 1 import xgboost 2 from scipy.stats import uniform, randint 3 from sklearn.model_selection import cross_val_score, GridSearchCV, KFold, RandomizedSearchCV, train_test_split ModuleNotFoundError: No module named 'xgboost' Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 xgboost.__version__ 2 3 #if not, reinstall xgboost package using this line: Alteryx.installPackages("xgboost==0.71") NameError: name 'xgboost' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in 2 seed = 7 3 test_size = 0.10 ----> 4 X_train, X_test, y_train, y_test = train_test_split(X, Y, test_size=test_size, random_state=seed) 5 # fit model on training data 6 model = XGBClassifier() NameError: name 'train_test_split' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 model = XGBClassifier() 2 eval_set = [(X_test, y_test)] 3 model.fit(X_train, y_train, early_stopping_rounds=10, eval_metric="auc", 4 eval_set=eval_set, verbose=True) 5 # make predictions for test data NameError: name 'XGBClassifier' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) in ----> 1 from xgboost import plot_tree 2 from matplotlib import pyplot 3 from xgboost import plot_importance 4 # plot feature importance 5 plot_importance(model, ylabel = "feat_names") ModuleNotFoundError: No module named 'xgboost' Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 sorted_idx = np.argsort(model.feature_importances_)[::-1] NameError: name 'np' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 for index in sorted_idx: 2 print([X_train.columns[index], model.feature_importances_[index]]) NameError: name 'sorted_idx' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in 1 #Select the best model with best acc above ----> 2 model.fit(X_train, y_train) 3 score = model.predict(Z) NameError: name 'model' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 df2 = pd.DataFrame(data=score) NameError: name 'pd' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 Alteryx.write(df2, 1) NameError: name 'df2' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in 1 from sklearn.metrics import confusion_matrix ----> 2 cm = confusion_matrix(predictions, y_test) NameError: name 'predictions' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 df3 = pd.DataFrame(data=cm, columns=['pred_pos','pred_neg'], index=['true_pos','true_neg']) NameError: name 'pd' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 Alteryx.write(df3, 2) NameError: name 'df3' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in 1 from sklearn.metrics import accuracy_score ----> 2 acc = pd.DataFrame([basic_acc,earlystop_acc]) 3 4 Alteryx.write(acc, 4) NameError: name 'pd' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 df5 = pd.DataFrame(data = model.feature_importances_, columns=['feature importance']) 2 df5['variable'] = feat_names 3 df5 = df5[['variable','feature importance']] NameError: name 'pd' is not defined Error: XGBoost_Python (433): Tool #4: --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 Alteryx.write(df5, 3) NameError: name 'df5' is not defined End: Designer x64: Finished running XGBoost.yxmd in 52.1 seconds with 19 errors and 11 field conversion errors and 1 warning
marco_zara
8 - Asteroid
Sorry about last post, looks like the error log format is not forum friendly...
TimothyL
Alteryx Alumni (Retired)

No worries. And yes, you need to install the libraries first. Investigate your error message, for example: No module named 'xgboost' 

 

Then install the corresponding library accordingly.

 

Community_Admin
Alteryx
Alteryx
Status changed to: Inactive
 
Community_Admin
Alteryx
Alteryx

The status of this idea has been changed to 'Inactive'. This status indicates that:

 

1. The idea has not had activity in the form of likes or comments in over a year.

2. The idea has not reached ten likes.

3. The idea is still in the 'New Idea' status. 

 

However, this doesn't mean your idea won't be implemented! The Community can still like and comment on this idea. With enough renewed interest, this idea can be brought back into the 'New Idea' status. 

 

Thank you for contributing to the Alteryx Community and the Alteryx Product Idea Boards!