Lookup the term "date" in community and you'll get 2,000 results. Hidden in this article are Easter eggs for the finding. Besides learning about dates, see if you can uncover the clues that I've cloaked.
Alteryx Server provides a fully scalable architecture that allows an organization to scale Alteryx to automate data analytics, tackle bigger projects, process larger datasets and put self-service data analytics into the hands of more decision makers. From scaling Worker nodes to Gallery nodes to the MongoDB persistence layer, Alteryx Server allows organizations to efficiently manage their automated and self-service data analytics needs.
.NET Framework is a software framework developed by Microsoft that supports the building and running of apps and XML web services. The framework version can have an impact on the installation and operation of Alteryx.
A common concern in predictive modeling is whether a model has been overfit. In statistics, overfitting refers to the phenomena when an analytical model corresponds too closely (or exactly) to a specific data set, and therefore may fail when applied to additional data or future observations. One common method that can be used to mitigate overfitting is regularization . Regularization places controls on how large the coefficients of the predictor variables grow. In Alteryx, the option of implementing regularized regression is available for the Linear Regression and Logistic Regression Tools.
The subtitle to this article should be a short novel on configuring the Decision Tree Tool in Alteryx . The initial configuration of the tool is very simple, but it you chose to customize the configuration of the tool at all, it can get complicated quickly. In this article, I am focusing on the configuration of the Tool. However, because it is a Tool Mastery, I am covering everything within the configuration of the tool
The Find Replace Tool is one of those tools that goes relatively unused and uncelebrated until you stumble into a data blending technique that would be extremely difficult without it – at which point, it becomes your favorite tool in the Designer. You can find it in the Join Category and it’ll make easy string substitutions in your data that would otherwise require herculean effort to work around. Today, we celebrate Find Replace as a hero.
Database aliases can help users save time and allow easier sharing of data. Let's take a look at how.
The first step to creating a database alias in Designer is to establish the database connection. This can be done with the Input Data tool by selecting any of the database options in the dropdown.
Databases we can connect to with Alteryx, an overview.
Once you’ve established the connection through the connection admin, a database connection string will appear, like the one shown below.
Next, let’s go to the Manage Database Connections window (or Manage In-DB Connections window for In-DB aliases). This will be under Options > Advanced Options > Manage Data Connections. Here we can add new aliases, and edit or delete, any existing ones.
Manage Database Connections overview:
There are two types of aliases:
User: Any user can add a User Alias that only they will be able to access and change.
System: System Aliases can only be added/edited by a local administrator.
Sync All allows you to sync Gallery Data Connections
Add Connections allows you to create a new alias (see steps below)
Additional options can be found when you hover over an existing data connection:
Allows you to edit the password of the Alias.
Allows you to delete an alias
When adding a new alias, Alteryx will assist you in making the connection to both MS Sql Server, and Oracle. For all other database connections, select the 'Other' option.
There will be three things to include for 'Other' database connection types: the connection type (user vs system), an alias name and the connection string. The reason we made the connection first through the Input data tool is so that we can simply copy/paste that string into our alias manager.
When the alias has been created you can access this connection through the Input Data tool dropdown under Saved Data Connections. You’ll notice that the string now says aka:AliasName.
Q: What happens when you have multiple workflows that all reference a connection string and your password changes?
A: If using workflows that reference an alias, the user will only have to update the password in the alias manager. If using a regular connection string, every Input Data tool will have to be updated in every workflow that uses that connection.
Q: Can I share, or export, a workflow that has database connection strings?
A: Yes! If you setup an alias on both user’s machines that are an exact match you will be able to share workflows that use those database connections.
Note: The screenshots and steps taken to create the database connection and alias were built in Designer version 11.8.
Don’t know the area of your polygon? Need the length of your line? Do you want your spatial object’s X and Y coordinates? Don’t Panic! The Spatial Info tool can translate all that information and more!
If you are building a predictive model, inevitably you will want to analyze the effect that your independent variables have on your dependent variable. This article is meant to shed some light on the Alteryx-specific options for this type of analysis!
The Dynamic Replace Tool is an under-utilized tool in the Developer Toolset that is very powerful. It allows for dynamic formulas or conditions to be used in your workflow. It was first introduced in Alteryx 6.1 . It’s one of the few tools that is currently multi-threaded which makes is fast.
The humble histogram is something many people are first exposed to in grade school. Histograms are a type of bar graph that display the distribution of continuous numerical data. Histograms are sometimes confused with bar charts, which are plots of categorical variables.
You want to impress your managers, so you decide to try some predictions on your data – forecasting, scoring potential marketing campaigns, finding new customers… That's great! Welcome to the addictive world of predictive analytics. We have the perfect platform for you to start exploring your data.
I know you want to dive right in and start testing models. It's tempting to just pull some data and start trying out tools, but the first and fundamentally most important part of all statistical analysis is the data investigation.
Your models won't mean much unless you understand your data. Here's where the Data Investigation Tools come in! You can get a statistical breakdown of each of your variables, both string and numeric, check for outliers (categorical and continuous), test correlations to slim down your predictors, and visualize the frequency and dispersion within each of your variables.
Part 1 of this article will give you an overview of the Field Summary Tool (never leave home without it!) Part 2 will touch on the Contingency and Frequency Tables, and Distribution Analysis; Part 3 will be the Association Analysis Tool, and the Pearson and Spearman Correlations; and Part 4 will be all the cool plotting tools.
Always, every day, literally every time you acquire a new data set, you will start with the Field Summary Tool. I cannot emphasize this enough, and I promise it will save you headaches.
There are three outputs to this tool: a data table containing your fields and their descriptive statistics, a static report, and the interactive visualization dashboard that provides a visual profile of your variables. From this output, you can select subsets to view, sort each of the panels, view and zoom in on specific values, and it even includes a visual indicator of data quality.
You'll get a nifty report with plots and descriptive statistics for each of your variables. Likely the most important part of this report is '% Missing' – ideally, you want 0.0% missing. If you are missing values, don't fret. You can remove these records or impute those values (another reason knowing your data is so important).
Also check 'Unique Values' – if you have a single unique value in one of your variables, that won't add anything useful to your model, so consider deselecting that variable.
The Remarks field is also very useful – it will suggest field-type changes for fields with a small number of unique values, perhaps that should be a string field. Or, if some values of your field have a small number of value counts, you may consider combining some value levels together.
The better YOU know your data, the more efficient and accurate your models will be. Only you know your data, your use case, and how your results are going to be applied. But we're here to help you get as familiar as you can with whatever data you have.
Stay tuned for subsequent articles – these tools will be your new best friends. Happy Alteryx-ing!
Ever needed to learn how to use an Iterative macro? This article walks you through the process with a basic mathematical problem highlighting how the data moves through the macro and loops back to continue processing.
The Field Summary Tool analyzes data and creates a summary report containing descriptive statistics of data in selected columns. It’s a great tool to use when you want to make sure your data is structured correctly before using any further analysis, most notably with the suite of models that can be generated with the Predictive Tools.