Hi All,
I've got an upcoming project that I'm sure Alteryx would be ideal for, but I'm at a bit of a roadblock.
My organisation receives a quarterly data set from another entity whom we grant funding to, and we then use this data for evaluation and research which then drives improvements, monitors performance etc.
What I wish to do has 2 stages to it:
1. Evaluate each column within the data set to output a volume and percentage figure based on whether each field in the column has an entry or not (basically CountA vs CountBlank). The idea is there should be *something* entered in each row of each column.
2. Evaluate each row within selected columns to assess the quality of data within. For example, an Age field should only contain a numeric entry, a Gender field should contain text meeting a limited criteria etc.
This is the same principle as 1. but goes deeper into assessing the quality of the data we are being supplied with and whether or not it meets our requirements.
Ideally a simple output in this scenario of "valid" or "not valid" would be useful in giving an overall rating.
I've experimented with the Field Summary tool and that does parts of the job but may not go as deep as I'd like.
Anyone have any ideas on how to approach this?
Many thanks
D