community
cancel
Showing results for 
Search instead for 
Did you mean: 

Alteryx designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.
Upgrade Alteryx Designer in 10 Steps

Debating whether or not to upgrade to the latest version of Alteryx Designer?

LEARN MORE
SOLVED

Analogous tool to Association analysis for categorical variables

Highlighted
Aurora
Aurora

Hi there,

 

the association analysis tool is tremendously useful for undersanding if there is a strong correlation relationship between continuous variables and your target continuous target variables.

 

Is there a similar tool (aside from actually performing the regression analysis itself) that operates on categorical variables?   For example - looking to see if there is any correlation between age expectancy and:

- zip code that the person lives in

- their surname

- highest level of education

 

what would be the right predictive tool to do this, and if one doesn't exist - would it make sense to submit this as an idea?

Thank you

Sean

 

 

ACE Emeritus
ACE Emeritus

Hi @SeanAdams,

Not an expert, but you might want to have a look at the Importance Weights tool available in the Predictive District on the Gallery. "The Importance Weight tool provides methods for selecting a set of variables to use in a predictive model based on how strongly related each possible predictor is to the target variable." I've never used it but did download it and can see comments indicating that it considers both categorical and continuous variables.

Hope that helps!

John

Alteryx Certified Partner

Hi Sean,

The appropriate statistical test to determine the relationship between nominal data is Chi-Square. You can find an output for this text and p-value in the Contingency table/Distribution analysis tools. I also saw that the Crew Macro pack includes a Chi square macro but I have not been able to test it myself.  However, it seems you are asking two questions one is to determine the relationship (above) and the second to predict in which case I would use the logistic regression.

Cheers,

Sasha

Alteryx Certified Partner
Alteryx Certified Partner

@alexandra_hanna,

 

I wrote the macro.  I'd appreciate a nod or feedback as to whether or not it passes muster for you.

 

Attached is the table that I used for lookups.

Alteryx ACE & Top Community Contributor

Chaos reigns within. Repent, reflect and reboot. Order shall return.
Labels