Hi, I am doing some basic profiling and not sure how to calculate Average Value, Median Value and Std deviation on a string data type file. The file I have is mostly string value hence struggling to do calculation. Thanks
Average, Median, Std are applicable to quantitative data, but not to categorical data.
So if you want to deal with them, you would need to convert the strings to numeric values in some way.
Good luck.
Thank you for your reply. Please see the file here. Thanks for the help. Basically I am very new to the Alteryx and need to calculate these on this data set.
Null Detection |
Blank Detection |
Zero Detection |
Uniqueness (Excludes blanks, nulls and zeros) |
Net Uniqueness (Excludes blanks, nulls and zeros) |
Average character count |
Mean Character Count (Mean and Ave is the same) |
Average Value |
Median Value |
Std Deviation Character count |
Std Deviation |
Special Character |
Unique Defined Sets |
Frequency |
Thanks for the data.
You may want to check the tools in Data Investigation category.
Here are good articles on these tools.
Pre-Predictive: Using the Data Investigation Tools - Part 1 of 4
Pre-Predictive: Using the Data Investigation Tools - Part 2 of 4
Pre-Predictive: Using the Data Investigation Tools - Part 3 of 4
Pre-Predictive: Using the Data Investigation Tools - Part 4 of 4
These tools are purposed to serve the data investigation before analysis.
If you do not find what you need, you may DIY it with other tools.