Community Spring Cleaning week is here! Join your fellow Maveryx in digging through your old posts and marking comments on them as solved. Learn more here!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Samling Tool - How to input the desired number of records N?

pierrelouisbescond
8 - Asteroid

Dear Community,

 

Probably a simple question for advanced users but not for a young Padawan like me ^^

 

I have a file with 5000 lines, out of which more than 85% have a “NOK” output and 15% are “OK” (quite unbalanced). To properly train my model, I would like to feed it with a balanced sampling (50% NOK & 50% OK).

 

I start by isolating the “OK” lines through a filter and I use a random sampling on the “NOK” data.

 

The thing is that I need to manually define what should be the size of “NOK” samples I am looking for… according to the number of “OK”.

 

So I would like to use the number of “OK” samples as an input to request the same “NOK” number :

 

screenshot.2019-02-14 (2).png

 

I have found replies with some inputs but more for batches than for a simple workflow.

Thanks,

Pierre-Louis

 

3 REPLIES 3
paul_houghton
12 - Quasar

Hi Pierre-Louis,

 

You probably want to have a look at the oversampling tool. Sounds like the exact result you are after. 

https://help.alteryx.com/2018.4/Oversample_Field.htm

 

pierrelouisbescond
8 - Asteroid

Thanks a lot @paul_houghton! The option was not ticked on my Alteryx update... so I was not aware of this option :-)

paul_houghton
12 - Quasar

No problem there are a lot of tools in alteryx so knowing which one works best in a situation can be a challenge. Glad that helped.

 

Labels