Hello!
I am a brand new Alteryx user and I am working on a predictive modeling workflow where I am using a Random Forest model to predict future values of my target variable on a daily basis out to 2033. My predictors have no null values and are fully filled in through 2033, but my target variable is only filled in through July 31, 2025. My flow is currently filtering so where my target variable is null it is getting fed into a Score tool and where my target variable is filled in, I am running that through the forest model so it can get trained. I have my browse tool attached to this forest model on the R and on the O, it is going to the M input of my Score tool. The browse tool attached to my forest model is showing an MSE of 11 billion and an R-squared of 24%. This seems very off, and I am wondering if I have set up my flow correctly, but the outcomes do not look too bad.
My questions are is using the score tool in this way appropriate for predicting missing target values?
Is there a better way to determine how the model is actually performing?
Is there a better way to validate or improve this model within Alteryx?
Any insights or suggestions would be greatly appreciated!
Thanks
Calvin