This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
It would be useful to be able to select a single container (containing a data input) or multiple containers using Shift, and run those and only those.
When building a new element to a larger workflow, I often enter a new Input in a new container, the ability to run just that container without having to turn off all my other containers would be really useful in speeding up the start of joining things together.
With the 2019.3 release the summarize tool now includes prefixes for grouped fields. While a nice addition, in application it makes using this data downstream (like joining to other tables) more involved because of needing to remove this prefix.
It would be nice to have this as an option (a checkbox to add/remove prefixes maybe) or just revert back to pre-2019.3 behavior...thanks!
When writing a good amount of code, it is easy to get lost in a sea of parentheses. Just when you think you're all done, you get an error that can force you to scour through your code to find the missing, extra, or misplaced parenthesis.
A common feature today is to highlight a parenthesis when its partner is clicked on. This instantly lets you know if you have the wrong number of them and where.
I didn't think this was that important early on in Alteryx, at least for me. Formulas were meant to be short and easily readable at a glance. Now as I dig deeper, there's R, Python, SQL and other text-heavy inputs.
I don't need a full-fledged text editor in Alteryx, but I would love some quality of life features like parentheses matching.
It will be great to make visibility of workflow execution results to other users in same subscription.
As of now, only schedules are visible to all users in a subscription, but not the workflow execution results executed by a user to other users in same subscription.
This will avoid duplicate execution of same workflow by multiple user in a team as it will provide option to cross check the execution results by other users, if executed already, before execution of same workflow.
Current State: In current scenario, once we add a new tool to the workflow and want to see its output, we are required to run the complete workflow.
Challenge: This step takes a lot of time if the input data files are huge in size with million of rows and multiple tools are operating on them.
Suggestion/Idea: We could have a "freeze" functionality inbuilt or in tool palette to freeze a portion of workflow or an entire tool container which is not dependent on the output of the newly added tool. This would save time in running workflow.
Example: For example, in attached workflow snapshot, freezing the Tool container 1 while adding tools in Tool container 2 and then running those tools will take less processing time.
As well as using keyboard shortcuts, many of us are using a mouse / keyboard with program specific assignable shortcut buttons. It is a serious boost to productivity. The ability to instantly enable / disable would be a great tool large complex workflows. In general, it would be great to expand the keyboard shortcuts to offer more Alteryx specific advanced functions.
When the Python Tool operates, it seems to always ingest all the data before processing any of it (i.e. no batch processing). Python can handle this type of functionality with generators, can we update the tool so that it may do some preprocessing (like imports and data prep) and allow a defined generator function to be called repeatedly from a separate input handle and provide batch data frames on output for more parallel-like processing of data?
The Python Tool could be updated as such:
Multi-Input - Same functionality as now, and also allow this data to be used for preprocessing and setting up the Python functions and a single batch function.
Data Input - Ingests data in batches (as most other tools operate) where each batch passes in a dataframe (in this case, a subset of processed entries) into an existing Python function (with a name that is in globals()), and returns another dataframe with that desired output. This can give the option of adding/removing rows as necessary to a subset of the data.
Data Output - Partial set of data after data processing to allow tools further in the chain to process in parallel.
"On Complete" Multi-Outputs - Same functionality as now, to pass process-complete data to the next tool once all data ingested has been processed. Perhaps give the option to pass the complete set from Data Output.
A simple use-case, if a user wanted to use only the Python Tool:
Let's say a user wants to get all URLs from every post in a thread (containing millions of posts) that are in blacklisted domains.
Data prep that sends the list of blacklisted domains into the Python Tool's Multi-Input handle, and that data is transformed and stored in a set within the Python tool once.
A series of posts (strings) are sent in batches (let's say ~10000) to the Data Input of the Python Tool. The tool calls a defined Python function that extracts all the URLs, and filters those in the blacklist.
That data is then transformed into a DataFrame which is then sent to the Data Output of the Python Tool, and only contains results corresponding to the small batch of posts that were ingested. Alteryx can also use this to track progress during execution.
Once all posts have been processed, one of the Python Tool's Multi-Outputs can return a total count of URLs found that were NOT in the blacklist (sure this can be a part of the Data Output, but just for the sake of this example). Could also be used to trigger "on-complete events."
I know I used the term "generators" above, and the design could probably be simplified to instead call an Alteryx Python function that yields from a function to await input from the next batch to use actual Python generators. However, I feel my initial approach could be thought of as a simpler process since generators are more of an intermediate functionality.
I hope this makes sense and is elaborate enough to pursue. Thanks for the consideration!
Everyone on our team would benefit from having access to each other's workflows -- including workflow results and scheduling -- but publishing them to gallery where any user in the company can run it/ download it would present a security issue. Functionality that is not met with Collections. I found a similar idea, but not anything that encompasses the whole thing.
It would be of immense value to have a Team Gallery - something intermediate to the company gallery and the private collection. This would enable our team to have access to modify, publish, schedule workflows as a team.
My specific use case relates to writing to AWS but am sure there are many other use cases for federated user session token support.
Specifically, using the S3 Upload tool or Athena Bulk Write (via SIMBA and Athena ODBC), the configuration works when using a IAM user, access key, and secret access key but when using a federated user via Okta there is no option to enter the session token and authentication fails.
Alteryx desktop should support federated users' session tokens.
When I use the Comment Tool its difficult to select the tools inside it, but when I use the Container Tool the Container Text doesn't support Font Sizes, and doesn't support multiple lines of text so I end up moving the Comment into the Container, but still have problems selecting a group of tools.
So a combined Comment and Container Tools would be wonderful!
Bonus: If the Comment Tool could support Multiple Font Sizes.
I just downloaded Alteryx Designer 2019.2 yesterday and got busy straight away but couldn't help notice that while I like the general look and feel of the tool and general design language, I'm concerned that configuring the tools I work with will require so much scrolling.
Could we add the ability to set the zoom level of the configuration pane like we do in the workflow window or have some form of control on how the config pane sizing of contents.
I have attached the config panes using the crosstab tool as an example with 2018.4 on the left and the new 2019.2 on the right. I took care to snapshot both versions the same dimension for a more apples to apples comparison.
Often as I am scraping web sites, some clever developer has put an invisible character (ASCII or Unicode) in the data which causes terrible trouble.
I've identified 89 instances of zero-width or non-zero-width glyphs that are not visible and/or Alteryx does not classify as whitespace. There are probably more, but Unicode is big y'all.
Unfortunately, the Trim() string function only removes 4 of these characters (Tab, Newline, Carriage Feed, and Space). REGEX_REPLACE with the \s option (which is what the Cleanse macro uses) is a little better but still only removes 20. And it removes all instances, not just leading and trailing.
Now that 2019.2 is officially released I'll raise this here as I know it was raised as part of the beta testing. With the new interactive browse tool when filtering results the record numbering restarts.
For example in this window from a weekly challenge, I originally have this:
Then when I filter on the Allocated column for records where the Allocated amount is 0, I get this:
And as you can see the Record on the left hand side is numbered 1 - 15, so when trying to locate one of these lines to check the formula is working as expected it makes it difficult to isolate, where as if I knew that filtered record 10 was actually record 394 in the data I can then scroll to that point.
I know a solution to this would be to add a record ID field to the data, but this is not always needed.
Love the new updates to the Browse tool in 2019.2! However, if you choose the option Open results in new window, which I do often so I can see my whole dataset, the search/filter/sort functionality goes away. Would be great if that new functionality also worked in the new window. Thanks!
Sometimes formulas get pretty long. There are cases of deeply nested conditionals, concatenation of long strings, cases where multiple casts and parses are used, etc. where formulas get pretty large and unwieldy. The current system of wrapping lines and managing the size of the properties pane can be a hassle, especially if you are trying to use any sort of whitespace formatting to make the formulas more readable.
My solution is this is pretty simple, add a pop-out window for formulas. It could be a context menu option from right-clicking the formula box itself, a button on the bar at the top of each formula, or any number of other things.
A really good example of this is MS Access. You can right-click any text box that takes an expression and open it in the expression editor pop-up window. The current system is more like excel where you're stuck with whatever box size you're given.
Using other data viz tools like Tableau, we often plot yearly timeseries of data onto the same line chart so we can quickly compare year-on-year differences. All data viz tools seem to have complexities but the logical approach is the same. What you do is map all the years data to a relative year, i.e. this year, and then give each year it's own title. See the example below snipped from a Tableau dashboard:
In this example 7 years of data have been plotted on the same chart. Note the x-axis, In Tableau we are able to format the X-Axis labels to only show month and day (Mon-D). This removes the common relative year, i.e 2019.
As expected, Alteryx is awesome at preparing data to do this kind of thing. Using the interactive charting tool you can build really nice charts. However there is currently no way to format the X-Axis label, you must show the relative year too, as shown in the picture below (snipped from the browse tool, outputted from the interactive chart tool):
It was really easy to prepare the 5 year min, max and average lines, which is almost impossible to do in Tableau!
My idea in a nutshell is, please change the interactive chart tool so that the labels on the axis can be formatted to the user's choice, i.e. in this case formatted from datetime to "%B-%d".
Please note, the workflow i'm building in this case, is creating 3 line charts of related data, each by year. The end product is a daily email sent to users.
One thing I have noticed is that for some of the end-users of the apps I have in the gallery is that running the app from server is enough of a barrier that they don't use it at all. I have had to send links repeatedly to gallery, to apps on server for them to run them.
What I would love is a way to create a custom desktop icon (bear with me - I don't have the lexicon.) that an end-user would open and it would launch the app in the server directly (I'm assuming this would be opening a browser of choice, opening the app/workflow to the screen where it gives you the option to 'Run/Download/Schedule' rather than accessing it through a shared collection in the Gallery through a browser.
Possible extensions of this are the ability to create an app for a mobile device where they can access an app/workflow on server directly to run it from a phone/ipad.
I like the new cache option in 2018.3, but I would like it to function a little bit different. Let's say you cache at a certain point and then continue to build after that. If I reach another checkpoint and want to cache, it currently re-runs the entire workflow (ie it ignores my cache upstream and just goes back to the beginning of the workflow); instead, I would rather have it utilize the upstream cache. Personally, caching is usually an iterative effort during development where I keep caching along the way. The current functionality of the cache is not conducive to this. Thanks!