Hi,
It is not clear to me if the field on the score tool called "The target field has an oversampled value", is refering to the training (evaluation) or testing (validation) data.
Could you please advise?
Many thanks
Oversampling is done to adjust the ratio of categories represented in your data and can be accomplished with the Oversample Field tool. The classic example is male/female ratio. If you have collected a population sample to train a model and your sample contains 65% males, you may want to oversample the females in your population sample so that your sample closer represents the actual wider 50/50 population.
When this is done, the Score tool needs to know it is dealing with an oversampled value so that it can help correct for the selection bias.
so when the target value is not around 50%, I need put value in this field?
For example, if my target value is binary 1/0. 1 is 20% and 0 is 80%. So what should I put in "The value of the target field that was oversampled" and what should I put into the percentage?
Thank you.