Hello Alteryx Community,
I am new to Alteryx, so I was going through the Oversample Field tool using single tool sample workflow and the comments say use it before predictive for effective modeling but what I observed in the workflow, starts with a dataset having 226 records and outputs 150 records, that is in effect reducing total data available, though it balances the data, is it good to reduce the available data for effective training? why is it called oversampling tool in fact it is reducing the samples? Little bit confused here, can some Alteryx Gurus clarify it?
In my understanding what it outputs is undersampled balanced data, is my understanding correct?