This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
I would like to see more files types supported to be able to be dragged from a folder onto a workflow. More precisely a .txt and a .dat file. This will greatly help my team and I do be able to analyze new and unknown data files that we receive on a daily basis.
It would often be very useful to have the ability to search for a field in a browse too.
At the moment i don't think there's an easy way to manually trace data through a workflow
For example you have created a workflow with various Joins, filters, etc. and notice that the final output is missing data for "ABC limited". The only way to find at what step ABC limited dropped out of the workflow is to add 10 filter tools branching out from before and after each step in the workflow's logic then re-run the workflow (which might take 5-10 minutes) to see if where "ABC limited" has gone. You fix the problem "ABC ltd" didn't join to "ABC Limited", but now you want to also check for XYZ limited so you have to manually edit all 10 filter tools. It seems you have fixed the problem, but now your workflow is a mess of 10 filter tools.
Alternatively you could copy and paste the data from every browse tool into an excel workbook and use their search function instead, but that's obviously a cumbersome and unhelpful process, particularly as the excel sheet will have to be remade with every run of the workflow.
You could also use sort tools throughout before a browse tool, but that is still slow and doesn't help with cases where "ABC Ltd" is matching to "The ABC Co ltd"
Perhaps it would be much easier to just have a small search box in every browse tool?
Or is there a feature that I'm not aware of that makes this process of quality checking your workflow easier already?
Similar to the Select tool's Unknown Field Checkbox, I figured it would be useful for the Data Cleansing tool to have this functionality as well in order to avoid a scenario where after a cross-tab you have a new numeric field, one of which has a Null value, so you can't total up multiple fields because the Null value will prevent the addition from happening. If the Unknown Field box were checked off in the Data Cleansing tool then this problem would be avoided.
I have seen the Browse tool offering a basic level of profiling results in the profile table and also a basic data profile tool under Investigation category. But both of them lack the pattern profiling option. I would like to see a pattern profiling option inside Alteryx too, which can show the pattern distribution of column data something like below (This is from SQL Data Profile viewer).
This can be very helpful in checking the data quality, by picking up data anomalies and checking inconsistencies.
There is a need when visualizing in-Database workflows to be able to visualize sorted data. This sorting could be done 1 of 2 ways: In a browse tool, or as a stand-alone Sort tool. Either would address the need. Without such a tool being present, the only way to sort the data is to "Data Stream Out" and then visualize the data in Alteryx. However, this process violates the premise of the usefulness of the in-DB toolkit, which is to keep your data in-DB and process using the DB engine. Streaming out big data in order to add a sort is not efficient.
Granted, the in-DB processing doesn't care whether data is sorted or not. However, when attempting to find extreme values after an aggregation, or when trying to identify something as simple as whether null values are present in a field, then a sort becomes extremely useful, and a necessary tool for human consumption of data (regardless of the database's processing needs).
It is disorienting when I am creating string fields explicitly coded to a specific length, then viewing my results window and having the values not line up across rows. If a font like courier were added as an option, this could be avoided.
Right now - if a tool generates an error - there is nothing productive that you can do with the error rows, these are just sent to the error log and depending on your settings the entire canvas will fail.
Could we change this in the Designer to work more like SSIS - where almost every tool has an error output, so that you can send the good rows one way, and the error rows the other way, and then continue processing? The error rows can be sent to an error table or workflow or data-quality service; and the good rows can be sent onwards. Because you have access to the error rows, you can also do run stats of "successful rows vs. unsuccessful"
This would make a big difference in the velocity of developing a canvas or prepping data.
This feature isn't a must - but would definitely be a nice to have.
Similar to the excel having a tab with key figures like average, count and sum
It would be a really good idea to do something similar within Alteryx just to have a quick glance on key figures/functions (example attached - apologise for the bad paint job but definitely would look good with Alteryx colour scheme)
Would be nice to have the option of disabling the append of the "action" to the variable in the summarize tool. Sometimes it's useful to leave the variable name as is when making tweeks to your module.
It would be great if there was an option to compute 'median' on numerical data column in 'cross-tab' tool. We trust 'median' a lot more than 'average' in many different computations. I would stretch my suggestion far enough to propose adding quantile computations as well...
In the histogram tool, I would like the ability to specify the bins, not just the number of bins, but the values of the bins. That would be especially helpful when comparing different data sets when I want to see an apples to apples comparison across two different histograms.
For those with large web and streaming-media server logs, the ability to geocode IP addresses be an excellent feature, similar to Alteryx's ability go geocode street addresses. Several IP geocoding services exist, with different levels of accuracy and cost. Ideally, the user should be able to choose their own service if they have one, in addition to a default service built-in to Alteryx.
Seeing how we use browse to tools to help build out modules, but they slow down the modules because they write out temp files, it would be awesome to have a "record count" option similar to the input tool. This would allow us to see the data as it's flowing through the module without slowing it down. Adding a sample tool before every browse would be fairly cumbersome.
I will sometimes open "New Window(s)" from a single or multiple Browse(s), and arrange them on a second monitor to dig through a module I am unfamiliar with, or when building out complicated models. It would be helpful if I could label these tempary new windows so that I don't spend to much time trying to remember which one is which.
In v10, I am using the summarize tool a lot and getting tired of selecting one or more fields and doing a sum function and having to revisit each summary tool when you add a numeric field upstream... I was hoping there would be a more dynamic method, e.g. select all numeric fields and then doing a SUM on _currentfield_.
Then I remembered the Field Info tool. (on a side note, I'd bet this tool is overlooked a lot). This tool is great because for each numeric field you get Min, Max, Median, Std Dev, Percent Missing, Unique Values, Mean, etc.
The one thing that's missing is SUM. Can you add it?
Also, can you give the user option to turn off layouts and reports so it runs faster? I only care out the data side.
or is there another way to do sum on dynamically selected numeric fields? (include Sum on Unknown field)
It appears that the Marketo Input tool only is focused aroudn getting Lead oriented Objects, however since Marketo is a Marketing Automation tool, it has many more objects such as email performance, landing page performance, web activity, program performance, revenue explorer, etc.
I'd like to vote up this idea of having marketo input connector having access to extrac data from other objects outside leads.
Here is the latest Marketo Analytics resources that are available. Anybody else encounter this bottleneck when trying to source the data out of Alteryx?