# Data Science Blog

Machine learning & data science for beginners and experts alike.
## Alteryx Tackles the 2018 FIFA World Cup: Investigating the World Cup Curse

Alteryx

TLDR: the curse is real.

## Alteryx Tackles the 2018 FIFA World Cup: Simulating the Group Results

Alteryx

Where we dive deep into the simulation techniques used, and see how we did.

## Alteryx Tackles the 2018 FIFA World Cup: Creating the Win/Lose/Draw Probability Model

Alteryx

Dr. Dan shows how he created the model and pauses for a mid-point sanity check.

## Alteryx Tackles the 2018 FIFA World Cup: Group Round Predictions

Alteryx

Just before the first game begins, Alteryx predicts which teams will advance out of the Group Round.

## Levelling Up: A Beginner’s Guide to the Python SDK in Alteryx

Alteryx

Ever wondered how to build a new analytic tool from scratch using the Alteryx Python SDK, but didn’t know where to start? This blog post takes you through the absolute basics to get you up and running. You’ll be creating brand new tools, connectors and advanced analytics in no time with this step-by-step beginner’s guide!

## Text Analysis in Alteryx with the Python SDK: Gender Classification

Sr. Community Content Manager

How to create a Gender Classification tool with the Python SDK based on example code in chapter 1 - Language and Computation - of Applied Text Analysis with Python.

## Voronoi (Thiessen) Polygons and Delaunay Triangles in Alteryx

Alteryx

Voronoi Tessellation and Delaunay Triangulation both perform spatial calculations on a set of irregular points. Voronoi Cells (sometimes referred to as Thiessen Polygons in the GIS world) make up a Voronoi Tessellation, which partitions a plane into polygons based on a set of points, so that each point has a corresponding polygon whose area is closer to that point than to any other point. Delaunay Triangulation divides a set of irregular points into triangles, so that no point in the set is inside the circumcircle of any triangle created from the points.

Both of these processes have a bunch of really neat spatial analysis applications. In this article, we will talk about their implementation in Alteryx.
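The two definitions above can be sketched in plain Python. This is only an illustration of the geometry, not how Alteryx implements it: the function names `nearest_site`, `in_circumcircle`, and `is_delaunay` and the sample points are hypothetical, and the check is brute force rather than an efficient construction.

```python
from math import dist

def nearest_site(query, sites):
    """Index of the site closest to query; query lies in that site's Voronoi cell."""
    return min(range(len(sites)), key=lambda i: dist(query, sites[i]))

def in_circumcircle(a, b, c, p):
    """True if p lies strictly inside the circumcircle of triangle (a, b, c)."""
    ax, ay = a[0] - p[0], a[1] - p[1]
    bx, by = b[0] - p[0], b[1] - p[1]
    cx, cy = c[0] - p[0], c[1] - p[1]
    # Standard in-circle determinant, expanded along its third column.
    det = ((ax * ax + ay * ay) * (bx * cy - by * cx)
           - (bx * bx + by * by) * (ax * cy - ay * cx)
           + (cx * cx + cy * cy) * (ax * by - ay * bx))
    # The sign of det flips with the triangle's orientation, so correct for it.
    orient = (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])
    return det * orient > 0

def is_delaunay(points, triangles):
    """Brute-force check: no point falls inside any triangle's circumcircle."""
    for i, j, k in triangles:
        for n, p in enumerate(points):
            if n in (i, j, k):
                continue
            if in_circumcircle(points[i], points[j], points[k], p):
                return False
    return True

# Hypothetical sample points: A, B, C, D.
points = [(0, 0), (2, 0), (1, 2), (1, -1)]
good = [(0, 1, 2), (0, 1, 3)]  # triangles ABC and ABD
bad = [(0, 2, 3), (1, 2, 3)]   # triangles ACD and BCD

print(nearest_site((0.9, 0.1), [(0, 0), (1, 0), (0, 1)]))
print(is_delaunay(points, good), is_delaunay(points, bad))
```

The same four points can be triangulated two ways; only the pair of triangles sharing edge AB passes the circumcircle test.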

## An Alteryx Newbie works through the predictive suite: Decision Tree

Alteryx

New to Alteryx and ready to help improve the documentation, I'm clambering up another predictive suite tool: the decision tree.

## The Drivers of Opioid Prescription Rates in England

Alteryx

Dr. Dan examines opioid abuse in England and compares it to behavior across the pond.

## Benchmarking Alteryx Designer against Pandas: Results

Alteryx

How does Alteryx Designer perform against Python's Pandas library? In this post I conduct 5 experiments with impressive results (post 2 of 2).

## Benchmarking Alteryx Designer against Pandas: Preparing for the Experiment

Alteryx

How does Alteryx Designer perform against Python's Pandas library? Here's how I set up the experiment (post 1 of 2).

## Alteryx + Predictive Analytics + Caffeine = Opiate Prescriber App

Alteryx Alumni (Retired)

## Exploring Hidden Relationships: Alteryx Team 1's Journey at the HHS Opioid Code-a-Thon

Alteryx

No plan works as expected and not every connection is obvious. Our team discovered this through the use of Alteryx and Neo4j.

## Optimizing Fantasy Football (Soccer) Using Alteryx

Alteryx

Want to use your analytics brain to get a competitive edge at fantasy football?  This article contains the secret to beating all your colleagues in your fantasy premier league.

## Optimally Locating Opioid Treatment Facilities

Alteryx

Learn how team Helping Hands built an app to recommend opioid treatment facility locations during the HHS Opioid Code-a-Thon.

## Harness the Power of Your Data Lake with Alteryx Spark Direct

Alteryx

Find out how the new Alteryx Direct Connection for Apache Spark functionality will open the floodgates on your Data Lake, unleashing its full potential.

## An Alteryx Newbie works through the predictive suite: Boosted Model

Alteryx

New to Alteryx and ready to help improve the documentation, I'm jumping in with both feet to tackle some of the wildest: the predictive suite.

## What’s Your Favorite Color?

Alteryx

What's your favorite color in Alteryx?

## Data Wrangling 101: Using Python to Fetch, Manipulate & Visualize NBA Data

Dark Matter

This is meant to be used as a general tutorial for beginners with some experience in Python or R.

## Beginner's Guide to Customer Segmentation

Alteryx

Step by step tutorial on using K-Means clustering to analyze your customer base.

## R for Excel Users

Matter

Why learning new things is hard, plus four fundamental differences between Excel and R.

## Data-Science Design Patterns: Cost-Sensitive Learning

Alteryx Alumni (Retired)

Many if not most supervised-classification problems involve some degree of class imbalance, where at least one class occurs more frequently than the others.  The imbalanced-classification problem illustrates the value of approaching data-science problems as empirical (as well as formal) optimization problems, using techniques termed cost-sensitive learning. This post will show you how to do cost-sensitive binary classification.
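One common form of cost-sensitive binary classification is to shift the decision threshold according to the misclassification costs. The sketch below assumes a model that already outputs a probability for the positive class, zero cost for correct predictions, and made-up costs and data; the helper names are hypothetical and this is not necessarily the approach the post itself takes.

```python
def cost_threshold(cost_fp, cost_fn):
    """Probability above which predicting positive has lower expected cost.

    Predict positive when p * cost_fn > (1 - p) * cost_fp,
    which rearranges to p > cost_fp / (cost_fp + cost_fn).
    """
    return cost_fp / (cost_fp + cost_fn)

def classify(probs, threshold):
    """Turn positive-class probabilities into 0/1 predictions."""
    return [1 if p > threshold else 0 for p in probs]

def total_cost(y_true, y_pred, cost_fp, cost_fn):
    """Sum the cost of every false positive and false negative."""
    cost = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == 1 and truth == 0:
            cost += cost_fp
        elif pred == 0 and truth == 1:
            cost += cost_fn
    return cost

# Made-up example: a missed positive is four times as costly as a false alarm.
probs = [0.1, 0.3, 0.6, 0.9]
labels = [0, 1, 0, 1]
cost_fp, cost_fn = 1, 4

default_preds = classify(probs, 0.5)
sensitive_preds = classify(probs, cost_threshold(cost_fp, cost_fn))

print(total_cost(labels, default_preds, cost_fp, cost_fn))
print(total_cost(labels, sensitive_preds, cost_fp, cost_fn))
```

Lowering the threshold trades extra false positives for fewer of the more expensive false negatives, which is why the cost-aware predictions end up cheaper on this toy data.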

## Why use SVM?

Alteryx

What is a support vector machine, and why should you use it?

## Scikit-Learn Cheat Sheet: Python Machine Learning

Matter

This cheat sheet is a handy reference for using the Scikit-Learn Python package.

## GoogleVis, R and Alteryx! Oh my!

Magnetar

Alteryx has a lot of built in functionality, but the ability to leverage custom R code opens up even more possibilities. After reading an answer on the Alteryx Community many months back, I was inspired to try and integrate Google Charts into an Alteryx workflow by using the R tool.

## Alteryx Data Science Design Patterns: Combining Models

Alteryx Alumni (Retired)

Most real-world data-science design patterns combine several models to solve a single business problem.  This post surveys the most common and effective techniques for combining models.  Once you make it through this post (and its predecessors), you'll be ready to take on the design patterns we'll begin learning in 2017.

## Alteryx Data Science Design Patterns: Cross Validation

Alteryx Alumni (Retired)

Cross validation (CV) is a difficult topic.  There are many ways to do CV, and articles on the subject can be very technical.  This blog post is a gentle introduction to CV.  Read it and you'll find it much easier to understand later posts describing data-science design patterns that use CV.

## Alteryx Data Science Design Patterns: Predictive Model Form, Part Five

Alteryx Alumni (Retired)

Understanding fitting algorithms is the final hurdle between you and some juicy data science design patterns.  Make the leap!

## Pandas Cheat Sheet for Data Science in Python

Matter

This cheat sheet is a quick reference for Pandas beginners.

