Showing results for 
Search instead for 
Did you mean: 

Alteryx designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.

Product Price Range Prediction

Alteryx Certified Partner



I have a product catalogue which contains different types of products, it includes the features, materials used, current price, etc. Basically around 20 features (most of them are categorical, such as material -> aluminum, wood, steel, etc.) that are available to be a predictor variables.


What I want to do:

To create a model that can predict the price range based on the catalogue data. Suppose I have a new product, I can enter the nearest features available and the output will be a price range with certain confidence level.


I tried to use linear regression with dummy coding to compansate the categorical data problem. However, the result is not good enough as it gives a very large range which is meaningless. I also checked some research paper that I can use NN/SVM to first reduce the number of predictor variables...but I am not sure how to implement.


Do you have any idea or what are the alternative approach?




ACE Emeritus
ACE Emeritus

HI @kennethli, just some random thoughts...

It's hard to say much without data to play with, but 20 predictors doesn't seem like that many: I'm playing with a Kaggle dataset right now that has over 100 categorical predictors, plus another 20 some numeric predictors.  A couple random thoughts which perhaps you've already looked into...

  • If price ranges are pretty high, you could do log(price) first, and predict that instead
  • Try some other model types... the nice thing about Alteryx is it's mostly just drag and drop to try different models.  (Boosted Model tends to do well in almost any setting, though parameter tweaking will be necessary; Google parameter tuning for R's "gbm" since that's what is used behind the scenes; the config panel in Alteryx's Boosted Model tool should match up pretty well with docs you find online.)



Alteryx Certified Partner

Thanks @JohnJPS,


The key is how can I evaluate the model suitability?

I can test different models but I am not able to select one that most suitable for my case.

For example, how does those classification models work to handle the categorical data? What are the difference between classification models and regression in handling categorical data?

Does it work better than linear regression? If yes, how to evaluate the result's difference?


Probably these questions are also questions of experience......but the most important question I want to ask is how to select the most suitable model to start with.





I'm trying to create a workflow based on my historical sales transaction, I wan't to predict/forecast a suggested price of a certain product?

Hope to get some sample workflow to study.

Thank you