Hi everyone,
I get errors when running these models and do not understand the reason for them. I have done some searching, but the adjustments I have made so far have not helped much. The data I use is 158 MB, about 1.9 million records (1.3 million of them for evaluating the models). I am training models to predict whether an order converted or not. The error messages and the fields I use are as follows:
Account_categories, quote_day_of_date, geo, region, max_quoted_amount. These variables sum up the types of fields I have in the data: 3 integer types and 7 V_WString / string types. I am using a total of 10 fields and am training 2 models for each algorithm to see how the model performs with fewer variables and with all variables.
I am also sharing the R code I used to determine variable importance and an image of the flow (however, I think the problem comes from the error in the model):
I would appreciate any expert advice on the errors I currently have.
Thank you!
One more shot: have you tried increasing the Memory Limit setting in your System Settings? (Options > Advanced Options > System Settings > Engine > General > Memory Limit)
Hi @Roche - just saw your message. Is the problem resolved? If not, would you mind sharing your workflow with a sample of data?
Hi, I changed the Memory Limit in the system settings to 100000, but then the flows produced no output, so I changed it back. Instead I adjusted the settings within the workflows in incremental steps to see if and when a run would succeed. I had no success with the forest model at any setting. My manager has also sent a message to Alteryx to get their input.
The other models run, and I use the same variables for them.
Hi @CathyS_Slalom, thank you for getting back to me. No, it is not resolved yet. I will not be able to share the data since it is confidential, and I do not think I can recreate a similar scenario either. The workflow does run on 10% of the data with no errors. The problem appears when I try to run all 2 million rows or 50% of the data. Is there anything specific you would want to check in the workflow, e.g. the settings in the forest model tool or the variables? At the moment it looks like a capacity problem.
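Since the run works at 10% and fails at 50%, one way to narrow down where it breaks is to step up the sample size while keeping the share of converted orders the same in each sample. Below is a minimal sketch in plain Python of that kind of stratified sampling. The function name `stratified_sample`, the `converted` and `order_id` keys, and the rows themselves are made up for illustration; they are not taken from the actual workflow or data.

```python
import random
from collections import defaultdict


def stratified_sample(rows, label_key, fraction, seed=0):
    """Return about `fraction` of `rows`, sampled separately within each label value."""
    rng = random.Random(seed)

    # Group the rows by their label (e.g. converted = 0 or 1).
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[label_key]].append(row)

    # Draw the same fraction from every group so the class ratio is preserved.
    sample = []
    for group in by_label.values():
        k = max(1, round(len(group) * fraction))
        sample.extend(rng.sample(group, k))

    rng.shuffle(sample)
    return sample


# Synthetic stand-in data: 1000 orders, 20% of them converted.
rows = [{"order_id": i, "converted": int(i % 5 == 0)} for i in range(1000)]

for fraction in (0.10, 0.25, 0.50):
    subset = stratified_sample(rows, "converted", fraction)
    rate = sum(r["converted"] for r in subset) / len(subset)
    print(f"{fraction:.0%}: {len(subset)} rows, conversion rate {rate:.2f}")
```

Running the model on each step (10%, 25%, 50%) would show roughly at which row count the memory runs out, without the class balance changing between runs.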
The Boosted model is also giving me an error. From what I can see, this can also be caused by running out of memory.
Hi @Roche, I was hoping to look at your workflow and find a place to write the random forest model in Python or R directly from Alteryx, instead of using the Forest Model tool under the Predictive tab. Here is a link to an example random forest model in Python:
https://towardsdatascience.com/random-forest-in-python-24d0893d51c0
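For orientation, here is a rough sketch of what a hand-written forest could look like with scikit-learn. It is not the Alteryx Forest Model tool and not the code from the linked article. The field names `geo`, `region` and `max_quoted_amount` come from the original post, but the values are randomly generated, and the parameter settings are only illustrative assumptions aimed at keeping memory use down.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import OrdinalEncoder

rng = np.random.default_rng(0)
n = 5000

# Synthetic stand-in data; the real data is confidential and not reproduced here.
geo = rng.choice(["NA", "EMEA", "APAC"], size=n)
region = rng.choice(["north", "south", "east", "west"], size=n)
max_quoted_amount = rng.integers(100, 10_000, size=n)
converted = (max_quoted_amount + rng.normal(0, 2000, size=n) > 5000).astype(int)

# Encode the string fields as small integer codes instead of wide dummy columns.
categorical = OrdinalEncoder().fit_transform(np.column_stack([geo, region]))
X = np.column_stack([categorical, max_quoted_amount])

X_train, X_test, y_train, y_test = train_test_split(
    X, converted, test_size=0.3, random_state=0, stratify=converted
)

model = RandomForestClassifier(
    n_estimators=100,     # number of trees
    max_depth=12,         # cap tree depth so each tree stays small
    min_samples_leaf=50,  # larger leaves mean fewer nodes to hold in memory
    max_samples=0.2,      # each tree sees only 20% of the training rows
    n_jobs=1,             # one worker; more workers need more memory at once
    random_state=0,
)
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
print(f"hold-out accuracy: {accuracy:.3f}")
```

The point of coding it by hand is mainly control: tree depth, leaf size and rows per tree can be limited explicitly, which is where most of a forest's memory goes.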
Hi @CathyS_Slalom, thank you very much! I appreciate your help. For now I need to see if I can sort out the errors; coding the model manually will be my last resort.