I've run into a bit of a problem that I'll try to explain with a hypothetical scenario.
I have 100 records that I would like to place into 20 groups of 5 based on 3 factors. The factors are gender (M/F), program (5 programs) and activity level (3 levels). I’d like each of the 20 groups to be as similar as possible across those factors. I guess I’m not really grouping the factors but distributing them as evenly as possible across the different groups.
If all of those factors had equal numbers (50 males/50 females, 20 from each program and ~33 at each activity level) each group would have one person from each of the 5 programs, 2 or 3 males and 1 or 2 from each activity level. In reality the data won’t be evenly distributed though.
Does anyone have any suggestions as to what tools I could use to assign people to groups to get as even a spread as possible across the three factors?
Thanks in advance.
best,
DHB.