Alteryx Designer Desktop Knowledge Base

Definitive answers from Designer Desktop experts.

0 Replies 3475 Views 3 likes

6 Replies 59586 Views 18 likes

3 Replies 34956 Views 27 likes

7 Replies 112215 Views 22 likes

0 Replies 619 Views 0 likes

0 Replies 466 Views 0 likes

2 Replies 3937 Views 7 likes

10 Replies 55394 Views 42 likes

1 Replies 12406 Views 4 likes

Often, we get support requests asking if we can do a quick session on best practices for a particular tool or set of tools. The Spatial Match Tool is certainly no stranger to this request. Even though Alteryx is extremely fast and efficient with spatial processing, there are instances where a slight change in settings could speed up your total module runtime. For example, let us assume that we have 10,000 records that we are trying to perform a spatial match against. Upon looking at our data, we notice that the points were derived from the centroid of another polygon, but the previous polygon field was still in the data stream. Removing the unnecessary spatial field is one of the first steps in optimizing your data stream. Why? If you don't need the data, why are you pushing it all the way through your module? That extra data consumes memory, so removing it from your data stream reduces that consumption, increasing the available resources on your computer. Ultimately, this practice can be used in any aspect of your data, but it is especially relevant with spatial processing. For more tips and tricks regarding your spatial match process, download the included zip package. It contains information that can help you optimize your modules that utilize the Spatial Match tool. Special thanks to Paul Treece for helping out with the module! Until next time! - Chad Follow me on Twitter! @AlteryxChad

1 Replies 5530 Views 4 likes

It's not uncommon to have a situation where you need to conditionally join one dataset to another. Perhaps the most common is when you want to join one file to another where a date from the first file is between, greater than or less than a date(s) on a second file. The default tools found in the Join section of the tool palette don't provide a simple way of doing this (it can be done, but you need to string several tools together to make it work. There is a better way! Read on...). There is great macro available in the public Alteryx gallery called Advanced Join (find it here, but spoiler alert... you can download the attached workflow which includes this macro so you don't have to go to the gallery to get it). The Advanced Join gives you greater latitude than the Join tool. Most notably, you can select records from file A that are unique to A AND intersect with file B. Now you may be thinking, “I can do that by unioning the records from an inner join with records from a left join,” and you would be correct. But it takes two tools to do what one Advance Join does. More importantly, the Advanced Join allows you to put a conditional statement on your join which is something you can't do with the Join tool. And it’s this feature - the ability to use conditional statements in a join - which we will focus on for our purpose here. Let's get into some examples. I have a file, 'Fruit List’, which contains data about various fruits. This file contains a Column Id, a Fruit Name, a Start DateTime and an End DateTime: I have a second file, 'Greek Alphabet’, which contains a Column Id, a Greek letter and a Datetime. I want to join the two files on ColumnId where the Datetime from Greek Alphabet (file B) is BETWEEN Start Datetime and End Datetime from Fruit List (file A). Here's the workflow and a screenshot of how to configure the Advanced Join: And here are what my results look like: Only one record from Greek Alphabet matched one from Fruit List on ColumnId where Greek Alphabet's Datetime was between Fruit List's Start Datetime and End Datetime. In the next example, I have the same Fruit List file and want to join it another file, Greek Alphabet that contains just one datetime filed: The first thing to note is both files have a field called 'DateTime.' We'll want to give these unique names to avoid ambiguity when we write our conditional state in the Advance Join configuration. I want to join both files on ColumnId but only when DateTime from Fruit List is LESS THAN DateTime from Greek Alphabet: And the results...: Let's look at one last example. This time, I'm going to use the Fruit List and Greek Alphabet files used in the first example (Fruit List has a Start DateTime and an End DateTime). I'm interested in matching records where DateTime from Greek Alphabet is BETWEEN Start Datetime and End DateTime from Fruit List. I'm not matching on ColumnId this time. For the Advanced Join configuration, I'm going to cross join my files. (CAUTION: the resulting join could contain as many rows as the product of the number of rows of the incoming datasets - a.k.a. Cartesian join - depending on how restrictive your conditional is. This means if you're joining 2 datasets that contain a million records each, the resulting dataset could contain as many as one trillion records! ). If I had wanted to match on ColumnId, I would have had to do that separately using a Join tool. The cross join option only allows you to apply a conditional statement: Results from our 3rd example: Notice how 10 records from Greek Alphabet were joined to just one record from Fruit List. The Advanced Join tool can save you time and a lot of headaches when you want to join files using a conditional statement. It has some limitations - you can only join two datasets and include one conditional statement per tool, cross join limitation mentioned above - but Advanced Join provides greater capability and flexibility than the standard Join tool.

8 Replies 12684 Views 11 likes

1 Replies 6283 Views 15 likes

2 Replies 6411 Views 4 likes

5 Replies 23710 Views 5 likes

0 Replies 61247 Views 12 likes

Scenario: You monitor the mileage of multiple trucks as they deliver shipments over the course of a week and record additional information regarding each truck in a file (Truck Metrics). Each truck’s cumulative mileage per day is recorded in a separate file (Truck Mileage). Your goal is to update the mileage-related fields in Truck Metrics with the values recorded in Truck Mileage. Note: today is Tuesday so only fields for Monday and Tuesday will be updated in in the Truck Metrics file. Methods: Manually Select Fields to Update Whether the data is uniquely identified by a Truck Number (or Record ID) or identically ordered in both files, data from Truck Metrics (the table to be updated) and Truck Mileage (the data used for updating) can be Joined together. Then, using the Select functionality within the Join tool, I can manually choose the fields from Truck Mileage that I want to use to replace the fields that need to be updated (Figure 1). Figure 1: Fields from Truck Mileage (Yellow) replace fields from Truck Metrics (Blue). Note that fields in yellow are selected while fields in blue have been deselected. Fields that need to be included from Truck Metrics (Red) remain selected. Fantastic! A simple, straightforward way to update fields! But, as any analyst knows, working with data is rarely simple or straightforward. What if you’re dealing with 20 fields that need to be updated, not just 2? In that case, manually selecting fields to update is not only tedious but also error-prone. For these types of situations, I recommend a process that allows for a more dynamic approach. 'Verticalize' the Data to Dynamically Update Fields Transposing, or ‘verticalizing’ data, allows for a more dynamic workflow when you have unknowns in your processing. In a scenario such as this one, you may have an unknown or changing number of fields that will need be updated in Truck Metrics. Using this approach, we’ll first Transpose both data sets to configure the Field Name and its associated value in a single row (Figure 2). Figure 2: The data is transposed from Truck Mileage and Truck Metrics. The highlighted fields in Truck Metics indicates which fields will be updated, as the same field exists in Truck Mileage. Then, we’ll Join our datasets based on two fields: Truck Number AND Name. This ensures that fields in Truck Mileage will match to the correct fields in Truck Metrics, assuming the fields names in both tables are named in the same way. The only selecting we’ll have to do is to make sure all fields from the Right data source (in this case, Truck Metrics) are deselected (Figure 3). This allows that, in the situation of matched Truck Numbers and field names, the updated values will be used. Figure 3: The Joined fields indicate the fields that exist in both Truck Mileage and Truck Metrics. Fields in yellow (Truck Mileage) are selected to reflect updated values in downstream tools. Fields in blue (Truck Metrics) are deselected. Note that any unmatched fields have fallen out of the Right side of the Join: To add them back into the data stream, simply Union the Center and Right Joins together, setting the tool to “Auto Configure by Name”. Then, to rearrange the data into its original format, use the Cross Tab tool (Figure 4). And, voila! Your data is updated! Figure 4: The updated fields are highlighted in yellow. All other fields originally included in Truck Metrics are included the in the final results as well.

0 Replies 5625 Views 4 likes

2 Replies 9522 Views 6 likes

3 Replies 2954 Views 7 likes

3 Replies 3032 Views 2 likes

One common reason can be due to mixed cases within the data of the join fields between the two datasets. For example, and as shown below, "Orange" from the first dataset is not going to join to "orange" in the second dataset. And this does not just apply to the leading character. Another common reason can be due to the existence of trailing or leading spaces. As also shown in the example below, "Plum " is not going to join with "Plum". The little red triangle in the upper right-hand corner of "Plum" indicates that there are leading and/or trailing spaces.

3 Replies 7971 Views 28 likes

0 Replies 27276 Views 0 likes

Alteryx Designer Desktop Knowledge Base

How to Combine Join Output Anchors with a Union Tool

Left, Right, and Full Outer Joins are easy!

How to Join on a Date Range

Alteryx for Excel Users: How to do a VLOOKUP in Alteryx

DE30000 / GDC-1220: Snowflake Receive "ERROR: SQLPrepare: SQL compilation error:¶duplicate alias 'Tool1_7c8a' when using Inner Join In-DB Tool

DE29774: Join Multiple tool produces different results with AMP

CS Macro Dev: Preserving Field Order without Using the Select Tool

Why Your Join Is Getting More Records than Expected

How to omit a record if any of the fields are null?

Alteryx Designer Best Practices: Spatial Matching

Advanced Join: value from one file is between, > or < a value from another file

CS Macro Dev: Detour Tool Alternative - Tool Container and Union Approach

When to use Join vs Append

N-Dimensional Join Errors

Cartesian Join/Cartesian Product

Updating Fields in One Input from Another

How To: Create a Distance Matrix or Mileage Chart

ACE Tip for Getting Faster Results from Alteryx

How to Use the Join Tool with a Union Tool and Change Field Names and their Order

Fewer Than Expected Records Output from Join

Error: No valid fields were selected