
Alteryx Designer Desktop Discussions


Random sample based on these criteria

vishtrack
7 - Meteor

Hi folks, I need your expert advice on the scenario below.

 

I am trying to draw random samples from a very large data set (over 60k records). The conditions are as follows:

1. The data set has user names and the case IDs they have worked on. Some users have worked on 30k records, and a few on far fewer (5, 10, etc.).
2. I need to pick samples in proportion to the total number of cases each user has worked on, e.g. users with many cases get more samples and users with few cases get fewer.
3. Each user should have at least 1 sample.
4. The total number of samples should equal 50.
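
Taken together, the four conditions describe a proportional allocation with a floor of one sample per user. Outside Alteryx, the arithmetic can be sketched in Python as below. This is only an illustration under assumptions: the function name `allocate_samples`, the example user names, and the choice of a largest-remainder rounding rule are all hypothetical, not something given in the data set.

```python
def allocate_samples(case_counts, total=50):
    """Split `total` samples across users in proportion to their case counts,
    giving every user at least one sample (largest-remainder rounding)."""
    users = sorted(case_counts)
    if any(case_counts[u] < 1 for u in users):
        raise ValueError("every user needs at least one case")
    if len(users) > total:
        raise ValueError("more users than samples; cannot give each user one")

    # Condition 3: reserve one sample for every user up front.
    quota = {u: 1 for u in users}
    remaining = total - len(users)

    # Cases still available per user once the reserved sample is taken.
    spare = {u: case_counts[u] - 1 for u in users}
    total_spare = sum(spare.values())
    if remaining > total_spare:
        raise ValueError("fewer cases than requested samples")
    if remaining == 0:
        return quota

    # Condition 2: share out the rest in proportion to the spare cases.
    # Integer divmod keeps the arithmetic exact.
    remainders = []
    for u in users:
        whole, frac = divmod(remaining * spare[u], total_spare)
        quota[u] += whole
        remainders.append((frac, u))

    # Condition 4: hand the samples lost to rounding down to the users
    # with the largest fractional remainders, so the total is exact.
    leftover = total - sum(quota.values())
    for _, u in sorted(remainders, reverse=True)[:leftover]:
        quota[u] += 1
    return quota


# Hypothetical case counts, loosely modelled on the description above.
example = allocate_samples({"user_a": 30000, "user_b": 20000, "user_c": 10, "user_d": 5})
print(example)
```

Each user's quota could then be used as the number of records to draw at random from that user's cases.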

 

I have been breaking my head over this for a few days; any help is appreciated.
Thank you in advance!

rzdodson
12 - Quasar

@vishtrack: I would create a batch macro that contains a Random % Sample tool. The percentage you need to sample then becomes one of the questions in the macro, which you can adjust based on your needs. And having a Random % Sample tool in there will ensure that every user always has at least one record present.
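
To make the idea concrete outside Alteryx, here is a rough Python sketch of per-user percentage sampling. It is not the batch macro itself: the function name `sample_per_user` and the `(user, case_id)` record layout are assumptions, and the at-least-one guarantee is enforced explicitly with `max(1, ...)` rather than relying on any tool's behavior.

```python
import math
import random
from collections import defaultdict


def sample_per_user(records, percent, seed=None):
    """Draw `percent` % of each user's cases at random, never fewer than one.

    `records` is an iterable of (user, case_id) pairs (assumed layout).
    Returns a dict mapping each user to a list of sampled case IDs.
    """
    if not 0 < percent <= 100:
        raise ValueError("percent must be in (0, 100]")
    rng = random.Random(seed)

    # Group case IDs by user, like running the macro once per user.
    by_user = defaultdict(list)
    for user, case_id in records:
        by_user[user].append(case_id)

    picked = {}
    for user, cases in by_user.items():
        # Round up, then force a minimum of one record per user.
        k = max(1, math.ceil(len(cases) * percent / 100))
        k = min(k, len(cases))  # guard against float rounding overshoot
        picked[user] = rng.sample(cases, k)
    return picked


# Hypothetical data: one heavy user and one light user.
records = [("heavy", i) for i in range(1000)] + [("light", i) for i in range(5)]
result = sample_per_user(records, percent=1, seed=42)
print({user: len(cases) for user, cases in result.items()})
```

With a single flat percentage, the grand total depends on the percentage chosen, so it would need tuning to land on exactly 50 samples.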

 
