We are launching an updated process to submit Cloud Quest solutions!
Starting today and moving forward we will provide a JSON workflow start file for each quest. The workflow contains starting data preloaded into a Text Input tool (or tools).
Upload this JSON file into your Alteryx Analytics Cloud Library, then you can open the file and start solving the quest in Designer Cloud!
Similarly, once you are finished, you can export your JSON solution file to attach to your reply on Community.
Review the following instructions to learn how to import and export Alteryx Designer Cloud workflows.
Import the Cloud Quest Start File
Export Your Solution
Now, with that out of the way...
Let’s dive into this week's quest!
You are a data analyst at a sports statistics firm. Your department is focused on analyzing historical data from major international sporting events, and your current project is the FIFA Women’s World Cup™.
You have been provided with a dataset named Womens World Cup dataset. This file contains detailed Women's World Cup match records, including teams, match outcomes, and dates from 2016 through 2019.
Your goal is to use Alteryx Designer Cloud to process this dataset and determine which team won the most matches in years 2016-2018. The outcome should consist of two columns: one for the team and the other for the number of wins.
Hint: Use the IF/THEN expression in the Formula tool to identify the winners. Also, ensure to exclude any matches where there was a tie.
If you find yourself struggling with any of the tasks, feel free to explore these interactive lessons in the Maveryx Academy for guidance:
My solution.
My result | netherlands, 15 |
Expected result | england, 16 |
@Yoshiro_Fujimori Your result is correct. After filtering out all dates from 2019, the Netherlands wins with a count of 15.
Your question about the Sample tool has a nuanced answer. The behavior of tools differs between Cloud Native Mode and Standard Mode. When this quest was created, Standard Mode did not yet exist. The sorting functionality you’re referring to is only available in the Cloud Native version of the Sample tool. In Standard Mode, tools behave as they do in Designer Desktop, so they will function in the way you’re accustomed to.
The reason for this difference lies in how Cloud Native Mode is designed. It is built to connect to massive cloud databases, and the Cloud Native version of tools operates similarly to In-DB tools in Designer Desktop. Specifically, it takes a 10MB random sample of the input data to display in the results window. The entire dataset is processed only when you select Run Job, using pushdown processing.
As a result, tools like the Multi-Row Formula or Generate Rows cannot be used in Cloud Native Mode when the results depend on exact row order. In your small example, you wouldn’t notice a difference in the number of rows between Cloud Native Mode and Standard Mode because the dataset is under 10MB. For small datasets like this, Standard Mode is generally recommended.