Bring your best ideas to the AI Use Case Contest! Enter to win 40 hours of expert engineering support and bring your vision to life using the powerful combination of Alteryx + AI. Learn more now, or go straight to the submission form.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Random sample based on these criterias

vishtrack
7 - Meteor

Hi Folks, need your expert advise on the below scenario.

 

I am trying to get random samples from a very large data set(over 60k records), the conditions are as below

1. The data set has user names and case IDs which they have worked on. Some users have worked on 30k records and few in less numbers (5, 10 etc)
2. I need to pick samples based on the percentage to the total number of cases they have worked on. Eg. users with large cases needs more sampling and users with less cases fewer samples to be picked. 
3. Each user should contain atleast 1 sample 

4 Total number of samples should be equal to 50

 

Breaking my head over this from few days, any help is appreciated.
Thank you in advance!

1 REPLY 1
rzdodson
12 - Quasar

@vishtrack: I would creating a batch macro that contains a Random Sample % tool in it. The percentage you need to sample will now become one of your questions in the macro that you can now toggle based on your needs. And, by having a Random Sample % tool in there, it will ensure that there are folks who always have at least one record present.

 

Labels
Top Solution Authors