Dear all,
Based on the same dataset I am training a random forest and a decision tree.
As far as I am concerned, the split order demonstrates how important that variable is for information gain, first split variable being the most important one.
A similar report is given by the random forest output via its variable importance plot. The order of variable importance does not overlap with that of decision tree (through manual inspection) and one variable has a big deviation in the ranking (in decision tree it is the first one to be split, in variable importance plot of random forest it is one of the least important).
What would you do in this case to come up with a solid result?
I appreciate your thoughts.
Best,
Atamert