
Alteryx Designer Desktop Discussions

SOLVED

How would one correlate two string columns?

CandidDataScientist
7 - Meteor

Hello,

 

I have two string columns that I would like to correlate with each other. In Python, I would just convert them to a categorical data type and then compute the correlation with pandas.
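
For reference, the pandas approach described here might look like the sketch below. The column names and sample values are hypothetical, and it assumes the integer codes of the categorical columns are what gets correlated.

```python
import pandas as pd

# Hypothetical sample data; the column names are illustrative only.
df = pd.DataFrame({
    "size": ["small", "medium", "large", "small", "large", "medium"],
    "price_band": ["low", "mid", "high", "low", "high", "mid"],
})

# Convert each string column to a categorical dtype and take its integer codes.
codes = df.apply(lambda col: col.astype("category").cat.codes)

# DataFrame.corr() computes Pearson correlation by default.
corr = codes.corr()
print(corr)
```

Note that pandas assigns the codes in sorted category order, so the coefficient depends on that ordering unless the categories are given an explicit, meaningful order.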

 

How would I do this in Alteryx? 

4 REPLIES
neilgallen
12 - Quasar

Given that you know Python, why not use the Python tool?

 

The alternative (depending on how many categories you have) would be to encode them: use a Text Input tool to create a table that maps each category to a numerical ID. You could then use the Pearson Correlation tool if you wanted.
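
To illustrate the idea outside Alteryx, here is a minimal Python sketch of the same logic: a hand-written lookup per column, followed by a Pearson correlation on the numeric IDs. The lookup tables, sample rows, and the `pearson` helper are all hypothetical, not part of any Alteryx tool.

```python
from math import sqrt

# Hypothetical lookup tables, standing in for a hand-typed table
# that maps each category to a numeric ID.
size_ids = {"small": 1, "medium": 2, "large": 3}
band_ids = {"low": 1, "mid": 2, "high": 3}

# Hypothetical sample rows: (size, price_band).
rows = [("small", "low"), ("medium", "mid"), ("large", "high"),
        ("small", "mid"), ("large", "mid")]

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Replace each string with its numeric ID (the equivalent of joining to the lookup).
xs = [size_ids[s] for s, _ in rows]
ys = [band_ids[b] for _, b in rows]

r = pearson(xs, ys)
print(round(r, 4))
```

Because the IDs here are chosen by hand, they can follow a meaningful order (small < medium < large), which makes the coefficient easier to interpret than with arbitrary IDs.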

 

If you have a large number of categories, there are ways to build that lookup automatically, but it's a bit more involved, using a Summarize tool, a Record ID tool, etc.
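
As a rough sketch of that logic in Python (not an Alteryx workflow): collect the distinct categories, number them, and map the IDs back onto the rows. The sample values are hypothetical, and numbering in first-seen order is an assumption of this sketch; the order produced inside Alteryx may differ.

```python
# Hypothetical column of category strings with too many values to type by hand.
values = ["red", "blue", "green", "blue", "red", "yellow", "green"]

# Step 1: distinct categories in first-seen order (roughly a group-by on the column).
distinct = list(dict.fromkeys(values))

# Step 2: number them from 1 (roughly what a record-ID step would add).
lookup = {category: record_id for record_id, category in enumerate(distinct, start=1)}

# Step 3: join the IDs back onto the original rows.
encoded = [lookup[v] for v in values]

print(lookup)
print(encoded)
```

Repeating this for the second column gives two numeric columns that can be fed to a correlation step.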

rafalolbert
ACE Emeritus

Hi @CandidDataScientist,

 

Here you go:

 

- go from categorical data to numerical using Record ID tool

- use Association Analysis to create correlation matrix

 

img1.JPG

 

Thanks,

Rafal

 

#Excuse me, do you speak Alteryx?

CandidDataScientist
7 - Meteor

Ahhh... I should have stated that the company I'm doing this for has not upgraded and is currently on Alteryx 2048.2, which doesn't have the Python tool.

DanSmith
5 - Atom

2048.2?? Wow, I'd imagine that version would incorporate everything!!
