It would be useful to be able to select a single container (containing a data input) or multiple containers using Shift, and run those and only those.
When building a new element to a larger workflow, I often enter a new Input in a new container, the ability to run just that container without having to turn off all my other containers would be really useful in speeding up the start of joining things together.
Hope that makes sense.
The option to open Hyper files in 2019.4 is great! For some of our use cases it would be even better, if we would be able to directly open Hyper files that have been published to Tableau Server.
It should be possible to achieve this by combining the Tableau REST API method Download Data Source, which returns a Tableau Packaged Data Source (.tdsx), which then would need to be converted to a Zip file to be able to navigate to the contained Hyper file.
When I use the Comment Tool its difficult to select the tools inside it, but when I use the Container Tool the Container Text doesn't support Font Sizes, and doesn't support multiple lines of text so I end up moving the Comment into the Container, but still have problems selecting a group of tools.
So a combined Comment and Container Tools would be wonderful!
Bonus: If the Comment Tool could support Multiple Font Sizes.
I suggest that it would be beneficial to add in a column filter that can automatically remove columns based on a condition, such as removing columns where all values are NULL or if they contain something in the values.
Should have a True and False output, like the normal Filter tool, so you can check what is being removed.
e.g. Would help with when you get poorly formatted excel sheets that add in hundreds of redundant columns, or if your workflow has generated NULL columns that should be removed, without having to Transpose, Filter, Cross tab etc to clear them out.
I would love to be able to have an interface tool that allows a user to search through drop down values (when there are more than 100 or so) similar to autocomplete. It would be helpful as a multiselect or single select drop down. I have inserted a very poorly mocked up picture below. It would essentially be a modified version of the drop down as all the values would be in the tool, but the user could type to find what they are looking for.
Alteryx does not currently have to email tool that is configurable to use SMTP Authentication for Microsoft Office 365 or any server requiring authentication. Our office printer can authenticate over SMTP and with TLS enabled why not my Alteryx mail tool - 'mic drop!'.
Further explained, Alteryx is a tool that needs to live within abide by the policies and security standards in the organization not vice versa. Therefore, it shouldn't be a big surprise, or a big ask for that matter, that a mail client should have the ability to authenticate prior to sending email of SMTP. I'm very surprised this tool is so arcane. Please implement quickly. Thank you
There's often a need to do a cascade of filters which would normally be handled in a programming language by a Case or a Switch statement.
- if it's a cat then go left, otherwise go right
- if it's a dog then go left otherwise carry on right
if it's a fish then go left otherwise carry on right
otherwise do xxxx
This could be handled more elegantly by a conditional split tool that allowed you to specify multiple conditions like a case statement, and which then generated multiple output nodes; with the last one for any leftovers.
I've seen this question before and have run into it myself. I'd like to see a new tool that would allow a developer (of a workflow) to choose a path of logic based upon criteria known only during the execution of a module.
If LEFT INPUT Count of records < 10,000 THEN Path1 (e.g. use a calgary join)
ELSE Path 2 (e.g. use a standard join)
A very simple idea, the same principle as the existing filter but there is only one output (true) and there is an on / off button. This could gives users the opportunity to put in a "what-if" gateway into their workflow.
For example, your stakeholder has asked you to use 'all the data'. Whilst building the workflow, you notice that excluding 2016 data gives you a more accurate / representative end result. However, it is not necessarily your decision to exclude the 2016 data. You put in the Valve Filter tool:
Valve Filter On = Excludes the 2016 data.
Valve Filter Off = Allows all data to go through.
You are able to turn the filter on and off, run the workflow and see / demo different end results (instead of using a traditional filter and creating a separate workflow or a duplicated arm of a workflow).
Until your workflow is complete, you do not always know the full impact of the parameters / decisions you make along the way. The Valve Filter could give users more flexibility.
I constantly find my using pre and post SQL Commands in the Output tool to run SQL when I don't actually have any data to output.
One example is when I load data into S3 and want to load it into Redshift. I have SQL code to run but no data to Output - I end up running a dummy row into a temp table.
So can we have an SQL tool that simply acts the same as a Pre-SQL command without the associated data output. Once the command is run we should be able to continue the workflow, so the tool should have an option input and output, like the Run Command tool.
We don't have a seperate ANOVA tool in Alteryx, do you think of any reason?
It's not raw data or row blended data but insights gathered that's important:
Linear Regression Tool has a report for Type II ANOVA based on the model table we provide.
But both type II and other types are not available as standalone statistics tools...
Here is the list of different types of Anova that may be useful;
ANOVA models Definitions
|t-tests||Comparison of means between two groups; if independent groups, then independent samples t-test. If not independent, then paired samples t-test. If comparing one group against a fixed value, then a one-sample t-test.|
|One-way ANOVA||Comparison of means of three or more independent groups.|
|One-way repeated measures ANOVA||Comparison of means of three or more within-subject variables.|
|Factorial ANOVA||Comparison of cell means for two or more between-subject IVs.|
|Comparison of cells means for one or more between-subjects IV and one or more within-subjects IV.|
|ANCOVA||Any ANOVA model with a covariate.|
|MANOVA||Any ANOVA model with multiple DVs. Provides omnibus F and separate Fs.|
Looking forward for the addition of ANOVA tools to the data investigation tool box...
Unsupervised learning method to detect topics in a text document.
Helpful for users interested in text mining.
While In-db tools are very helpful and cut down the time needed to write complex SQL , there are some steps that are faster by directly writing SQL like window functions- OVER (PARTITION BY .....). In Alteryx, we need to create multiple joins and summaries to perform a window function. It would be immensely helpful if there was a SQL editor tool for in-db workflows where we can edit the SQL code at any point in the workflow, or even better, if they can add an "edit" function to every in-db tool where we can customize the SQL code generated and then send to the next tool.
This will cut down the time immensely and streamline the workflow to make Alteryx a true contender for the ETL solution space.
The introduction fo a rank tool would be hugely beneficial. Whilst there are currently means to rank using a combination of other tools formula/running total/multirow etc... a specific "Rank Tool" would be provide a seemless and smoother way to rank your data either for further analysis or purely to output this field.
This tool should include a sort by and group by functionaility as well as options for ranking (such as dense ranking or unique ranking) and in addition multi levels of ranking (ie. Rank by "Field A" Then By "Field B" etc...).
DELETE from Source_Data Where ID in
SELECT ID from My_Temp_Table where FLAG = 'Y'
Essentially, I want to update a DB table with either an update or with the deletion of rows. I can't delete all of the data. My work around will be to create/insert into a table the keys that i want to delete and try to use a input/output tool with SQL that performs the delete. Any other suggestions are welcome, but a tool is best.
Assuming some source control or versioning is in place, a formal compare tool would be a nice addition. This would be useful for determining what is different between two versions of a workflow, and that knowledge is very useful when modifying a production process: when formally moving a new (modified) process into production, part of the checks and balances would be to run a formal comparison against the workflow being replaces, and ensure that all differences are accounted for.
This sort of audit is notoriously difficult when the differences are buried deep in the configuration settings of various tools within Alteryx. I do see that the .yxmd files are XML based, so perhaps we could create our own compare tool based thereon, but it would be better (more trustworthy) to have one formally provided by Alteryx. Thanks!
Adam ( @AdamR ), Mark ( @MarqueeCrew) and many others have done a great job in putting together super helpful add-in macros in the CREW pack - and James ( @jdunkerley79 ) has really done an incredible job of filling in some gaps in a very useful way in the formula tools.
Would be possible to include a subset of these in the core product as part of the next release?
I'm thinking of (but others will chime in here to vote for their favourite):
- Unique only tool (CReW)
- Field Sort (CReW)
- Wildcard XLSX input (CReW) - this would eliminate a whole category of user queries on the discussion boards
- Runner (CReW - although this may have issues with licensing since many people don't have command line permission - Alteryx does really need the ability to do chained dependancy flows in a more smooth way.
- Date Utils (JDunkerly) - all of James's Date utils - again, these would immediately solve many of the support questions asked on the discussion forum
I think that these would really add richness & functionality to the core product, and at the same time get ahead of many of the more common queries raised by users. I guess the only question is whether the authors would have any objection?
Disabled Containers throw errors if it contains any interface tools. It should not throw any error as the user is intentionally disabling the container.
XGboost regression is now the benchmark for every Kaggle competition and seems to consistently outperform random forest, spline regression, and all of the more basic models. For those of us using predictive modeling on a regular basis in our actual work, this tool would allow for a quick improvement in our model accuracy. And I think, from a marketing standpoint, having a core group of users competing in Kaggle using Alteryx would be a great way to show off Alteryx's power.
It is readily available as an R package: https://cran.r-project.org/web/packages/xgboost/index.html
It seems that version 10.6 (still in beta) will have easy to use linear programming tool... We'll be able to allocate assets optimally, optimize our marketing decisions by inputting the predictions we had with predictive tools etc.
But when it comes to Non-linear models what happens? The idea is to add Alteryx designer an evolutionary optimization capability as well...
I've used a similar tool in excel which was very useful called Evolver; http://www.palisade.com/evolver/ It will be awesome to see that in the coming versions...
To note that one optimisation method does not rule them all and evolutionary algorithms are the slowest probably,
But I believe it will enable us to optimize hyperparameters of our models and greatly get better results...