I have a dataset with several hundred columns over millions of rows. I'm trying to understand how I can group the data over one column, then get the percentage of filled data for each column in that group, for n columns. For my purposes filled can be either not empty or not null.
Below shows a sample input and output with just 3 columns:
ToGroup  A   B   C
1            10  13
1        9   7   5
1        11  4   9
2                6
2        8   5   2
2        6       1
Group    A     B     C
1        66.7% 100%  100%
2        66.7% 33.3% 100%
@jframe How are you calculating the percentage?
Hi, [number filled rows in column] / [number of rows in group]
Hey @jframe,
Here's one way of doing this:
You don't need the data cleaning tool at the beginning I just added that to clean the data after copying yours.
Any questions or issues please ask :)
HTH!
Ira
