I have a dataset with several hundred columns and millions of rows. I'm trying to work out how to group the data by one column and then, for each of the other n columns, get the percentage of filled values within each group. For my purposes, filled can mean either not empty or not null.
Below is a sample input and the expected output with just 3 columns:
ToGroup  A   B   C
1            10  13
1        9   7   5
1        11  4   9
2                6
2        8   5   2
2        6       1

Group  A      B      C
1      66.7%  100%   100%
2      66.7%  33.3%  100%
@jframe How are you calculating the percentage?
Hi, it's [number of filled rows in column] / [number of rows in group]
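For anyone who wants to check that formula outside a workflow tool, here is a minimal Python sketch using only the standard library. It is an illustration under assumptions, not the method from any reply in this thread: the function name `filled_percentages` is made up for the example, the sample is written as CSV with the blank cells placed where the expected output implies they are, and a value is treated as not filled when it is missing, empty, or whitespace only. It also assumes well-formed rows with no extra fields.

```python
import csv
import io
from collections import defaultdict

# Sample data from the question, written as CSV (assumed layout);
# an empty field means "not filled".
SAMPLE = """ToGroup,A,B,C
1,,10,13
1,9,7,5
1,11,4,9
2,,,6
2,8,5,2
2,6,,1
"""


def filled_percentages(rows, group_col):
    """Return {group: {column: percent filled}} for every non-group column."""
    row_counts = defaultdict(int)                          # rows seen per group
    filled_counts = defaultdict(lambda: defaultdict(int))  # filled cells per group and column
    for row in rows:
        group = row[group_col]
        row_counts[group] += 1
        for col, value in row.items():
            if col == group_col:
                continue
            # Missing, empty, and whitespace-only values count as not filled.
            is_filled = value is not None and value.strip() != ""
            filled_counts[group][col] += is_filled  # True adds 1, False adds 0
    # [number of filled rows in column] / [number of rows in group], as a percentage
    return {
        group: {col: 100.0 * n / row_counts[group] for col, n in cols.items()}
        for group, cols in filled_counts.items()
    }


result = filled_percentages(csv.DictReader(io.StringIO(SAMPLE)), "ToGroup")
for group, cols in result.items():
    print(group, {col: f"{pct:.1f}%" for col, pct in cols.items()})
```

Because the rows are read one at a time and only the counts are kept, the same loop should cope with millions of rows and hundreds of columns without loading everything into memory. Note that the group keys come back as strings ("1", "2") since CSV fields are read as text.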
Hey @jframe,
Here's one way of doing this:
You don't need the data cleaning tool at the beginning; I just added that to clean the data after copying yours.
If you have any questions or issues, please ask :)
HTH!
Ira