Hi all
I'm working with a continuous data set to try and forecast prices for metals (in this example, nickel). I'm looking at trying to compare the relative accuracy of various different models, including a Random Forest model. However when I run my models trained over a period of 3 years between 2015 and 2017 to the validation period (2018 and 2019), as well as a model trained on data prior to 2015, I get the diagnostic plot as attached. In short, the 2015_17 model won't forecast above ~14,500, while the model trained on data pre 2015 won't forecast under ~14,000. This is odd given the target variable has ranged between ~9,000 in 2015 and 18,000 in 2019 and 2014, while looking at the variable importance plots the models have access to all of the predictor variables that are deemed important, which track the target variable quite closely.
Any assistance is much appreciated.