community
cancel
Showing results for 
Search instead for 
Did you mean: 

Data Science Blog

Machine learning & data science for beginners and experts alike.
New Data Science Blog

Check out the latest post: All Models Are Wrong

READ MORE
 You are using an unsupported browser for translation. Please switch to another browser.

Community Content Engineer
Community Content Engineer

ALL MODELS ARE WRONG.png

“Essentially, all models are wrong, but some models are useful.” Unpacking the famous George Box quote.

Read more...

Community Content Engineer
Community Content Engineer

 

SPECIFICATION GAMING1.png

Machines are not constrained by human experience or expectations, only by what we give them as inputs. This can be exciting and beautiful, or dangerous.

Read more...

Community Content Engineer
Community Content Engineer

Vector.png

 raster.png

 

 

In the world of spatial analysis, there are two major varieties of data: vector and raster. The divide between these two data types and the people that use them has raged on for decades.

 

Read more...

Community Content Engineer
Community Content Engineer

lda.png

Understanding the topic of a piece of writing is typically an easy task for people. However, there are times where we need to train our computers to find topics in a collection of documents. There might be too many documents for you, a single human, to read through, or you may be interested in discovering underlying themes in a large set of texts. Enter LDA, a popular model for Topic Modeling.

Read more...

Community Content Engineer
Community Content Engineer

embed model.png

After training a Phrases model with Community texts, I wanted to be able to incorporate the model into Alteryx workflows that I was using to process text, and hopefully even be able to share the model with other Alteryx users. After thinking through this, I realized it was a perfect application for the Python SDK.

Read more...

Alteryx
Alteryx

MULTICOLLINEARITY.png

Dr. Dan imparts some intuition behind the problems associated with predictor collinearity (also known as multicollinearity), and provides some rules of thumb about when, and when not, to be concerned.

Read more...

Community Content Engineer
Community Content Engineer

WORD2VEC.png

Word embeddings are vector representations of words, where more similar words will have similar locations in vector space. First developed by a team of researchers at Google led by Thomas Mikolov, and discussed in the paper Efficient Estimation of Word Representations in Vector Space, word2vec is a popular group of models that produce word embeddings by training shallow neural networks. In this blog post, we apply a word2vec model to the Alteryx Community texts to develop Alteryx-specific word embeddings.

Read more...

Community Content Engineer
Community Content Engineer

GOLDACRE BANNER.png

Reproducibility, the open sharing of data, and expanding on the research of others are all at the heart of the scientific process, and we live in an exciting time where it is more possible than ever. This year's Inspire Europe Closing Keynote speaker Dr. Ben Goldacre has recently published a paper examining compliance with the European Commission's guideline that all Clinical Trials registered in the EU Clinical Trials Register must report results to the European Medicines Agency within 12 months of the trial's completion. The bulk of the paper's analysis was performed in the statistical software Stata. With tools like Alteryx or Python, we have easy and open-source ways to process data and derive new knowledge. In this blog, we reproduce some of Goldacre et al.'s analysis in Alteryx and Python and provide both formats for you to further explore the data on your own. 

Read more...

Alteryx
Alteryx

vacancy.jpg

Let's see if we can cut down our energy consumption - FROM 97.7 QUADRILLION BTU!

Read more...

Community Content Engineer
Community Content Engineer

BRAINS.PNG

Neural Networks are an approach to artificial intelligence that was first proposed in 1944.  Modeled loosely on the human brain, Neural Networks consist of a multitude of simple processing nodes (called neurons) that are highly interconnected and send data through these network connections to estimate a target variable. In this article, I will discuss the structure and training of simple neural networks (specifically Multilayer Perceptrons, aka "vanilla neural networks"), as well as demonstrate an example neural network created by the Alteryx Neural Network Tool.

Read more...

Community Content Engineer
Community Content Engineer

cheat sheet.PNG 

Who doesn’t love a good cheat sheet? Nobody, that’s who. Cheat sheets are awesome. They are a great reference for functions you need handy, but don’t have memorized by heart (yet). They can also be a fantastic way for learning and reinforcing components of a programming language. Some people like to keep them saved as a bookmark on their web browser. With all of that in mind, we are proud to present to you an Alteryx – R Cheat Sheet, which features Alteryx specific functions for use in the R Tool. With this cheat sheet, you should be better equipped to take on any R Tool challenges you encounter.

Read more...

Alteryx
Alteryx

LINEAR REGRESSION.png

Building my first linear regression model turned me into an instant celebrity. My roommate, who has acted as a sounding board for my predictive-analytics-learning progress, now believes I can use Linear Regression to predict the winner of the next horse race. While it would be fun to try, a more applicable use case is predicting how much a customer will spend (which, in the case of horse racing could translate to how much someone might spend on a bet). For my use case, I want to predict how much a Lyft driver can expect to receive on their next fare.

Read more...

Alteryx
Alteryx

titanic-c.png

Part 3 of 3: The final portion of predicting which Titanic passengers would survive.

Read more...

Alteryx
Alteryx

titanic-b.png

Part 2 of 3: The imputation portion of predicting which Titanic passengers would survive.

Read more...

Alteryx
Alteryx

titanic-a.png

Part 1 of 3: The feature engineering portion of predicting which Titanic passengers would survive.

Read more...

Atom

BUILD banner.png

Royden Onishi explains how he & Ryan Andrew created their Image Vectorizer tool with the Python SDK and reflects on their experience in the Alteryx BUILD hackathon.

Read more...

Alteryx
Alteryx

football.png

TLDR: the curse is real.

Read more...

Alteryx
Alteryx

Where we dive deep into the simulation techniques used, and see how we did.

Read more...

Alteryx
Alteryx

Dr. Dan shows how he created the model and pauses for a mid-point sanity check.

Read more...

Alteryx
Alteryx

football.png

Just in time before the first game begins, Alteryx predicts which Teams Will Advance out of the Group Round.

Read more...

Alteryx
Alteryx

levelling-up-banner.jpg

Ever wondered how to build a new analytic tool from scratch using the Alteryx Python SDK, but didn’t know where to start? This blog post takes you through the absolute basics to get you up and running - You’ll be creating brand new tools, connectors and advanced analytics in no time with this step-by-step beginners guide!

Read more...

Sr. Community Content Manager
Sr. Community Content Manager

icon.pngHow to create a Gender Classification tool with the Python SDK based on example code in chapter 1 - Language and Computation - of Applied Text Analysis with Python.

Read more...

Community Content Engineer
Community Content Engineer

banner1.pngVoronoi Tesselation and Delaunay Trianglulation both perform spatial calculations on a set of irregular points. Voronoi Cells (sometimes referred to as Thiessen Polygons in the GIS world) make up a Voronoi Tesselation, which is the partitioning of a plane into polygons based on a set of points, so that for each point there is a corresponding polygon where the area in the polygon is closer to the corresponding point than any other point. Delaunay Triangulation is when a set of irregular points are divided into triangles, so that no point in the set is inside the circumcircle of any triangle created from the points.

 

Both of these processes have a bunch of really neat spatial analysis applications. In this article, we will talk about their implementation in Alteryx.

Read more...

Alteryx
Alteryx

decisionTree.png

New to Alteryx and ready to help improve the documentation, I'm clambering up another predictive suite tool: the decision tree.

Read more...

Alteryx
Alteryx
Alteryx
Alteryx

Opioid Prescription Rates for CCGs in England Banner.png

Dr. Dan examines opioid abuse in England and compares it to behavior across the pond.

Read more...

Alteryx
Alteryx

pandas2.png

How does Alteryx Designer perform against Python's Pandas library? In this post I conduct 5 experiments with impressive results (post 2 of 2).

Read more...

Alteryx
Alteryx

pandas1.png

How does Alteryx Designer perform against Python's Pandas library? Here's how I set up the experiment (post 1 of 2).

Read more...

Alteryx
Alteryx

Ye_Olde_Map.jpg

No plan works as expected and not every connection is obvious. Our team discovered this through the use of Alteryx and Neo4j.

Read more...