Similar to the Select tool's Unknown Field checkbox, I figured it would be useful for the Data Cleansing tool to have this functionality as well. It would avoid a scenario where, after a cross-tab, you have a new numeric field containing a Null value, so you can't total up multiple fields because the Null prevents the addition from happening. If the Unknown Field box were checked in the Data Cleansing tool, this problem would be avoided.
This feature isn't a must, but it would definitely be a nice-to-have.
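To illustrate the problem outside of Alteryx, here is a minimal Python sketch of totalling fields where one value is Null. The field names and the helper `total_ignoring_nulls` are hypothetical, and Python raises an error on the naive addition rather than yielding Null, so this only approximates what happens in a workflow.

```python
from typing import Optional


def total_ignoring_nulls(*values: Optional[float]) -> float:
    """Sum the given values, treating None (Null) as 0."""
    return sum(v for v in values if v is not None)


# Hypothetical row after a cross-tab: "Feb" is a new field that came out Null.
row = {"Jan": 120.0, "Feb": None, "Mar": 80.0}

# Naive addition breaks as soon as one field is Null.
try:
    naive = row["Jan"] + row["Feb"] + row["Mar"]
except TypeError:
    naive = None

# Cleansing the Nulls first (here: skipping them) lets the total go through.
total = total_ignoring_nulls(*row.values())
```

Cleansing unknown fields automatically would apply the same treatment to "Feb" without having to reconfigure the tool each time a new column appears.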
Similar to Excel having a tab with key figures like average, count, and sum.
It would be a really good idea to do something similar within Alteryx, just to have a quick glance at key figures/functions (example attached; apologies for the rough Paint job, but it would definitely look good with the Alteryx colour scheme).
There is a need, when working with in-Database workflows, to be able to visualize sorted data. This sorting could be done in one of two ways: in a Browse tool, or as a stand-alone Sort tool. Either would address the need. Without such a tool, the only way to sort the data is to "Data Stream Out" and then visualize the data in Alteryx. However, this process violates the premise of the in-DB toolkit, which is to keep your data in-DB and process it using the DB engine. Streaming out big data in order to add a sort is not efficient.
Granted, the in-DB processing doesn't care whether data is sorted or not. However, when attempting to find extreme values after an aggregation, or when trying to identify something as simple as whether null values are present in a field, then a sort becomes extremely useful, and a necessary tool for human consumption of data (regardless of the database's processing needs).
It would be nice to have the option of disabling the append of the "action" to the variable name in the Summarize tool. Sometimes it's useful to leave the variable name as is when making tweaks to your module.
It would be great if there were an option to compute the 'median' of a numerical data column in the 'cross-tab' tool. We trust 'median' a lot more than 'average' in many different computations. I would stretch my suggestion far enough to propose adding quantile computations as well...
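As a rough illustration of why median can be preferable in a cross-tab, here is a small Python sketch that pivots records with a pluggable aggregation. The `cross_tab` function and the sample data are made up for this example and are not how the Alteryx tool is implemented.

```python
from collections import defaultdict
from statistics import mean, median


def cross_tab(records, row_key, col_key, value_key, agg):
    """Pivot records into {row: {column: agg(values)}}."""
    cells = defaultdict(list)
    for rec in records:
        cells[(rec[row_key], rec[col_key])].append(rec[value_key])
    table = defaultdict(dict)
    for (row, col), values in cells.items():
        table[row][col] = agg(values)
    return {row: dict(cols) for row, cols in table.items()}


# Hypothetical data with one outlier in store A, Q1.
records = [
    {"store": "A", "quarter": "Q1", "sales": 10},
    {"store": "A", "quarter": "Q1", "sales": 12},
    {"store": "A", "quarter": "Q1", "sales": 200},
    {"store": "A", "quarter": "Q2", "sales": 20},
]

by_median = cross_tab(records, "store", "quarter", "sales", median)
by_mean = cross_tab(records, "store", "quarter", "sales", mean)
```

With the outlier present, the Q1 median stays at 12 while the mean is pulled up to 74. Passing a quantile function as `agg` would cover the quantile suggestion in the same way.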
In the histogram tool, I would like the ability to specify the bins, not just the number of bins, but the values of the bins. That would be especially helpful when comparing different data sets when I want to see an apples to apples comparison across two different histograms.
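A minimal Python sketch of what "specifying the values of the bins" could look like: both data sets are counted against the same explicit edges, so the histograms are directly comparable. The `histogram` function, the edge values, and the convention that the last bin includes its right edge are all assumptions made for this example.

```python
from bisect import bisect_right


def histogram(values, edges):
    """Count values into bins [edges[i], edges[i+1]).

    The last bin also includes its right edge; values outside the
    overall range are ignored.
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        if v < edges[0] or v > edges[-1]:
            continue  # outside the requested range
        i = min(bisect_right(edges, v) - 1, len(counts) - 1)
        counts[i] += 1
    return counts


edges = [0, 10, 20, 30]  # user-chosen bin boundaries
counts_a = histogram([1, 5, 12, 29, 30], edges)
counts_b = histogram([11, 15, 19, 25, 40], edges)
```

Because `edges` is shared, bin 2 of `counts_a` and bin 2 of `counts_b` cover the same value range, which is the apples-to-apples comparison described above.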
For those with large web and streaming-media server logs, the ability to geocode IP addresses would be an excellent feature, similar to Alteryx's ability to geocode street addresses. Several IP geocoding services exist, with different levels of accuracy and cost. Ideally, the user should be able to choose their own service if they have one, in addition to a default service built in to Alteryx.
Seeing how we use Browse tools to help build out modules, but they slow down the modules because they write out temp files, it would be awesome to have a "record count" option similar to the Input tool. This would allow us to see the data as it's flowing through the module without slowing it down. Adding a Sample tool before every Browse would be fairly cumbersome.
I will sometimes open "New Window(s)" from a single or multiple Browse(s), and arrange them on a second monitor to dig through a module I am unfamiliar with, or when building out complicated models. It would be helpful if I could label these temporary new windows so that I don't spend too much time trying to remember which one is which.
In v10, I am using the summarize tool a lot and getting tired of selecting one or more fields and doing a sum function and having to revisit each summary tool when you add a numeric field upstream... I was hoping there would be a more dynamic method, e.g. select all numeric fields and then doing a SUM on _currentfield_.
Then I remembered the Field Info tool. (On a side note, I'd bet this tool is overlooked a lot.) This tool is great because for each numeric field you get Min, Max, Median, Std Dev, Percent Missing, Unique Values, Mean, etc.
The one thing that's missing is SUM. Can you add it?
Also, can you give the user the option to turn off layouts and reports so it runs faster? I only care about the data side.
Or is there another way to do a sum on dynamically selected numeric fields (including Sum on Unknown fields)?
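For illustration only, this is roughly what "sum every numeric field, including ones added later" means, sketched in Python rather than as an Alteryx workflow. The function name and sample records are hypothetical.

```python
from numbers import Number


def sum_numeric_fields(records):
    """Sum every field whose value is numeric, whatever fields exist."""
    totals = {}
    for rec in records:
        for field, value in rec.items():
            # bool is a Number subclass in Python, so exclude it explicitly.
            if isinstance(value, Number) and not isinstance(value, bool):
                totals[field] = totals.get(field, 0) + value
    return totals


records = [
    {"region": "East", "sales": 100, "cost": 40.5},
    # "returns" is a numeric field that appeared upstream later on.
    {"region": "West", "sales": 250, "cost": None, "returns": 3},
]

totals = sum_numeric_fields(records)
```

The new "returns" field is picked up without touching the summing logic, which is the behaviour a dynamic or Unknown-field SUM would give.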
It appears that the Marketo Input tool is focused only on getting Lead-oriented objects. However, since Marketo is a marketing automation tool, it has many more objects, such as email performance, landing page performance, web activity, program performance, revenue explorer, etc.
I'd like to vote up this idea of the Marketo Input connector having access to extract data from objects other than leads.
Here are the latest Marketo Analytics resources that are available. Has anybody else encountered this bottleneck when trying to source the data into Alteryx?
Within one of my modules I use the “Find Replace” tool quite often to find keywords within a customer name field. By chance, I have found that irrespective of what order my keywords are sorted in, Alteryx will look at each word in the customer name and try to match them in order.
For example, the customer name of EAST RIDING YORKSHIRE COUNCIL, when matched to my list of keywords matches the word RIDING before it matches the word COUNCIL, even though COUNCIL appears higher up my list than RIDING does.
I would like to be able to prioritise my keywords so that Alteryx starts at the top and works its way down my list of keywords, rather than just taking the first word in a customer name that matches any of my keywords. Is this possible?
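To make the difference concrete, here is a small Python sketch of the two matching orders. It is not how the Find Replace tool is implemented; `match_by_position` simply mimics the behaviour described above, `match_by_priority` is the behaviour being requested, and whole-word, case-insensitive matching is an assumption.

```python
def match_by_position(name, keywords):
    """Return the first word in the name that is a keyword."""
    wanted = {k.upper() for k in keywords}
    for word in name.upper().split():
        if word in wanted:
            return word
    return None


def match_by_priority(name, keywords):
    """Return the highest-priority keyword that appears in the name."""
    words = set(name.upper().split())
    for keyword in keywords:  # list order is the priority order
        if keyword.upper() in words:
            return keyword
    return None


keywords = ["COUNCIL", "RIDING"]  # COUNCIL ranks above RIDING
name = "EAST RIDING YORKSHIRE COUNCIL"

by_position = match_by_position(name, keywords)
by_priority = match_by_priority(name, keywords)
```

Here `by_position` is "RIDING" and `by_priority` is "COUNCIL": the only change is which list drives the outer loop.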
Hello! Almost all statistical software packages allow the analyst to choose either a pairwise or a listwise option when applying clustering techniques. This option affects only how the inner distance matrix is built; after that, whichever algorithm you choose is performed. However, in Alteryx, [K-Centroids] does listwise by default, classifying only those records where the selected variables have no nulls.
Please consider adding this option!
PS: the difference is that pairwise builds the distance between two variables from those records that have no nulls in both variables, while listwise builds the distance matrix only after checking for records that are completely non-null across all variables of interest (not one distance calculation at a time).
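A minimal Python sketch of which records each option keeps, following the description in this post. It only selects record indices and does not compute distances or run any clustering; the function names and sample data are hypothetical.

```python
from itertools import combinations


def listwise_rows(records, fields):
    """Indices of records with no nulls in any of the fields."""
    return [
        i for i, rec in enumerate(records)
        if all(rec[f] is not None for f in fields)
    ]


def pairwise_rows(records, fields):
    """For each pair of fields, indices of records non-null in both."""
    return {
        (a, b): [
            i for i, rec in enumerate(records)
            if rec[a] is not None and rec[b] is not None
        ]
        for a, b in combinations(fields, 2)
    }


records = [
    {"x": 1.0, "y": 2.0, "z": 3.0},
    {"x": 4.0, "y": None, "z": 6.0},
    {"x": 7.0, "y": 8.0, "z": None},
]
fields = ["x", "y", "z"]

listwise = listwise_rows(records, fields)
pairwise = pairwise_rows(records, fields)
```

Listwise keeps only record 0, while pairwise still uses records 0 and 2 for the (x, y) pair and records 0 and 1 for the (x, z) pair, so less data is discarded.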
While trying to debug data parsing, I end up with Browse tools all over the place, which makes the palette a mess that is time-consuming to organize.
It would be much nicer to just have a little corner of all icons to have a sub icon or something I can click on to see output. Multi output could be a little more tricky ... so better yet, maybe have small browse icons on the connectors you can click on for results of that feed?
It is disorienting when I create string fields explicitly coded to a specific length and then view my results window, only to find that the values do not line up across rows. If a fixed-width font like Courier were added as an option, this could be avoided.
Right now, if a tool generates an error, there is nothing productive that you can do with the error rows. They are just sent to the error log and, depending on your settings, the entire canvas will fail.
Could we change this in the Designer to work more like SSIS, where almost every tool has an error output, so that you can send the good rows one way and the error rows the other way, and then continue processing? The error rows can be sent to an error table, workflow, or data-quality service, and the good rows can be sent onwards. Because you have access to the error rows, you can also produce run statistics of "successful rows vs. unsuccessful".
This would make a big difference in the velocity of developing a canvas or prepping data.
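The idea can be sketched in a few lines of Python: a step that returns two outputs instead of failing on the first bad row. The function `split_on_error` and the sample rows are hypothetical and only illustrate the pattern, not any Designer or SSIS API.

```python
def split_on_error(rows, transform):
    """Apply transform to each row; return (good_rows, error_rows)."""
    good, bad = [], []
    for row in rows:
        try:
            good.append(transform(row))
        except (ValueError, TypeError) as exc:
            # Keep the original row plus the reason, for an error table.
            bad.append({"row": row, "error": str(exc)})
    return good, bad


rows = ["12", "7.5", "n/a", None, "3"]
good, bad = split_on_error(rows, float)

# Run statistics: successful rows vs. unsuccessful.
success_rate = len(good) / len(rows)
```

The good rows continue downstream, while `bad` can be written to an error table or passed to a data-quality step, and `success_rate` gives the run statistic mentioned above.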