I'm pulling my Alteryx boots back on after a while now, so bear with me!😀
I'm checking data quality for a data set, an example of which is below..
Client_id | Name | Address | Phone |
1 | Adam | 123 Sesame St | 9123 4567 |
2 | Becky | [Null] | [Null] |
3 | [Null] | 3 Rosedale Cct | 9456 7890 |
My desired output is this
Client_id | Name | Address | Phone | DQ_Issue |
1 | Adam | 123 Sesame St | 9123 4567 | No issue |
2 | Becky | [Null] | [Null] | Missing address |
2 | Becky | [Null] | [Null] | Missing Phone |
3 | [Null] | 3 Rosedale Cct | 9456 7890 | Missing name |
...but using the Generate Rows seems to work iteratively between 2 existing values (usually numeric or date types). AM I barking up the wrong tree here?
The other thing I thought of would be pass each and every record through a formula tool that tests each individual DQ_Issue, then union and deduplicate, but for a million records, it might get expensive.
@gurth
I think we can try something like this.