Hello,
I have two string columns that I would like to correlate with each other. In python, I would just convert to a categorical data type and then do the correlation through pandas.
How would I do this in Alteryx?
Solved! Go to Solution.
Given that you know python, why not use the python tool?
The alternative would also be (depending on the number of categories that you have) to encode them by using a text input and creating a table with each category and a numerical id. You could then use the pearson correlation tool if you wanted.
If you have a large number of categories, then there are methods but it's a bit more involved using a summarize, record ID, etc.
Here you go:
- go from categorical data to numerical using Record ID tool
- use Association Analysis to create correlation matrix
Thanks,
Rafal
#Excuse me, do you speak Alteryx?
Ahhh... I should have stated the company I'm doing this for has not upgrade and is currently on Alteryx 2048.2 which doesn't have the python tool.
2048.2?? Wow I'd imagine that version would incorporate everything!!