How would one correlate two string columns?
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hello,
I have two string columns that I would like to correlate with each other. In python, I would just convert to a categorical data type and then do the correlation through pandas.
How would I do this in Alteryx?
Solved! Go to Solution.
- Labels:
- Data Investigation
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Given that you know python, why not use the python tool?
The alternative would also be (depending on the number of categories that you have) to encode them by using a text input and creating a table with each category and a numerical id. You could then use the pearson correlation tool if you wanted.
If you have a large number of categories, then there are methods but it's a bit more involved using a summarize, record ID, etc.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Here you go:
- go from categorical data to numerical using Record ID tool
- use Association Analysis to create correlation matrix
Thanks,
Rafal
#Excuse me, do you speak Alteryx?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Ahhh... I should have stated the company I'm doing this for has not upgrade and is currently on Alteryx 2048.2 which doesn't have the python tool.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
2048.2?? Wow I'd imagine that version would incorporate everything!!
