Generate sample records from the big dataset based on 2 columns
Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
hash_89
8 - Asteroid
‎12-17-2024
11:05 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
I need to generate 3500 samples and need some representation of different combinations.
Here is how the data looks like
Name | Year | Subject |
ABC | 2021 | Math |
DEF | 2022 | English |
GHI | 2022 | Math |
GIH | 2023 | English |
JIK | 2024 | Math |
LHM | 2024 | English |
KIK | 2024 | Science |
I need to generate samples for combination of Year & Subject. It doesn't matter how many in each combination, but the overall sample should have records for all possible combinations in year & subject.
can anyone help?
Labels:
- Labels:
- Expression
4 REPLIES 4
apathetichell
19 - Altair
‎12-17-2024
11:09 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Sample tool. select both columns in the group by. choose an amount. You will have that many items for each grouping.
‎12-17-2024
11:38 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
I did select the 2 columns, but it gives me more than 3500 for each year.
binuacs
21 - Polaris
‎12-17-2024
11:49 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@hash_89 Another approach
apathetichell
19 - Altair
‎12-17-2024
12:16 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@hash_89 N is the number of entries you want per combo. you are putting 3500 samples PER GROUPING...
