This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
I have a dataset with 60000+ rows and 150+ columns. This consists of the various test results done on patients in a hospital.
Given below is a small sample of the data. Rows (P1-15) are the individual patients, Gender M-Male, F-Female. Columns T1 to T5 denote the test results done on them. The null values mean the particular test was not done on the patient.
Now, i would like to find the best combination of patients-tests where there are no/minimum number of nulls with maximum number of patients and tests covered. This could be an optimization problem. Any idea how this can be done in Alteryx?
Your best bet is to pivot the data (this will produce a large number of records due to your column count), and then use the summarize tool finding the Min/Max (whichever you prefer), grouping by patient.
Once complete, then join back to your original dataset to find the trial value. Note, in this instance if there are ties then you'll have to account for that as you will get multiple rows.