Free Trial

Alteryx Designer Desktop Ideas

Share your Designer Desktop product ideas - we're listening!
Submitting an Idea?

Be sure to review our Idea Submission Guidelines for more information!

Submission Guidelines

Featured Ideas

I have a problem where bulk upload is failing because the last column of the table that the data is being imported into, is using the DEFAULT data type option. I am not passing through any value to this column as I want the DEFAULT value specified to always be applied.

 

The COPY command fails in this scenario if you don't specify an explicit field list

 

More details of the problem can be seen in this post, along with a workaround:

http://community.alteryx.com/t5/Data-Sources/import-into-redshift-DEFAULT-data-value/m-p/16655#U1665...

 

A tick box option should at least be added to the bulk upload tool to enable explicit field list specification based on column names coming into the bulk upload tool

 

When using the Write Data In-DB component from the In-Database tools, after the insert or update is complete the default behaviour is that the component does a select * from the table.  So if inserting into dbo.customer, once the transaction is complete, the component does a select * from customer.  This behaviour is really useful in some circumstances, e.g. creating reference data then returning all data so you can join back into the workflow, but can also really destroy performance.  Imagine if dbo.customer contains billions of rows.  Of course there are ways of restricting the data that is returned to the workflow, (e.g. use the sample tool), but the query is still executed in the DB.  Therefore my suggestion is to make output configurable.  Would be useful to have a four option output, either:

(1) zero row output (i.e. metadata only),

(2) successful transaction flag & error message if applicable,

(3) count of rows effected, for some DBs you can get this by default (MS SQL Server), or

(4) select * from table, as is now.

 

Thanks, nick

In-DB_idbx2_325x127

 

 

 

If you are connecting to an Oracle database you can get faster results with adding a short hint to your SQL query. For a query like this

 

 

SELECT customers.cust_first_name, customers.cust_last_name, 
MAX(QUANTITY_SOLD), AVG(QUANTITY_SOLD)
FROM sales, customers
WHERE sales.cust_id=customers.cust_id
GROUP BY customers.cust_first_name, customers.cust_last_name;

 

 

add /*+ PARALLEL(4) */  (or depending on your CPU size 😎 and that SQL becomes this;

 

 

SELECT /*+ PARALLEL(4) */ customers.cust_first_name, customers.cust_last_name, 
  MAX(QUANTITY_SOLD), AVG(QUANTITY_SOLD)
FROM sales, customers
WHERE sales.cust_id=customers.cust_id
GROUP BY customers.cust_first_name, customers.cust_last_name;

 

 

There is a similar capability for SQL Server too...

 

SELECT *
FROM Sales.SalesOrderDetail
OPTION (MAXDOP)

 

 

I suggest adding this feature in in-db tools so that no one needs to alter the SQL query itself...

 

You can check out the inner workings for oracle here; https://docs.oracle.com/cd/E11882_01/server.112/e25523/parallel002.htm

Hello All,

 

We are new to Alteryx and we could see that the Supported Data Sources from IBM are of below :

  • IBM DB2
  • IBM Netezza/Pure Data Systems
  • IBM SPSS

How about adding IBM Sterling to this?

We want Alteryx to support connection with IBM Sterling OMS which will help the Business requirements

Can anyone post some suggestions on this? How we can connect to Sterling?

 

Thanks,

Praveen C

 

It is nice that there is a sample node option for In-DB, however it isn't a random sample.  It isn't always feasible for me to stream out and use the random sample % option.  In fact on numerous occasions when I use the Data Stream Out option in DB I often times have workflows crash because it can't handle the number of records I am trying to stream out.

for the past 8 months I have been using alteryx and mostly working with the connect in db components , there are many issues which I am facing and i this this can be improved 

1. there is no such  flexibility of creating a table with the keys defined which is the most important pillar in database, also the options provided are limited , i.e. to create a new table , delete and append, drop table and recreate, 

now, there are many times where in we  need to update the tables based on the keys, which i find missing. Also how the option is defined is create a new table, next time if the job is run it states that the table is already created, for which we need to manually change the option in the next run.

 

2.which switching between in db and alteryx , if the records are more the alteryx lags completely and the job keeps running for hours , how can we achieve the flexibility  of alteryx designer if there is such bottleneck.

 

3.the flexibility that is provided with alteryx designer should also be given to in db components.

4.the parameters defined in the workflow can not be accessed in the in db  formula tools but can be used in the designer formula tools. this reduces the flexibility.

 

please look into the same

Curl currently doesn't have Secure protocols supported. Please find below screen-shot. We are currently using Alteryx 11.7.6

 

curl.png 

 

Can Alteryx take this as feature request and add the secure libraries to existing cURL tool so that it can support the secure SFTP protocol.

 

Was thinking with my peers at work that it might be good to have join module expanded both for desktop and in-database joins.

 

As for desktop join: left and right join shows only these records that are exclusive to that side of operation. Would it be possible to have also addition of data that is in common?

As for in-db join: db join acts like classic join (left with matching, right with matching data). Would it be possible to get as well only-left, only-right join module?

 

 

Greenplum is a new in-db capability; which our company has started using frequently. GP is a distributed database, across many nodes, much like how Redshift and other new database technology works.

 

In order to properly use these databases the Alteryx tools MUST have a way to distribute our tables across the nodes how we desire. Currently the tools for In-DB will distribute by the database default (almost always taking the first column in the table to distribute) which has terrible affects on performance. The work around has been to run SQL outside of the Alteryx tools to run ALTER TABLE DISTRIBUTE BY statements for all tables; also this cannot work on TEMP tables as those tables are created and deleted at the end of the workflow.

 

The idea I want is an additional option for Distributed DB's to choose the distribution column OR choose the "RANDOMLY" option for distribution of tables [DISTRIBUTED BY statement]. This would make my DBAs very happy and make it possible for me to run even more challenging requests to my system.

We recently upgraded our SQL server to 2016 to enable us to use R Server for predictive analytics.  We were excited about the more powerful algorithms and the fact that parallel processing will make things faster on bigger data sets.  

 

We often use stepwise logistic regression, especially in cases where we need to show which attributes are most significant.  The one drawback about the upgrade was that stepwise is not available when running logistic regression in-database.  I know there are ways to get around this e.g. PC etc. but it would be nice to have the ability to do stepwise in-database.

 

I hope there are others like me that will vote this up.  I think it will help a lot of data scientists out there and is probably one of the easier suggestions :-).

Try to create an oracle 32 bit connection.  Type in something wrong in the descriptor or user id or password.  You get a silent failure with no underlying notice about where you went wrong.  

 

You can get meaningful error messages from other database types, I got meaningful errors for ODBC connections to mysql and sql server while poking at this.  

 

Granted, it's only useful for jerks like me who don't type things in correctly and still have 32 bit Oracle instances to get to but somewhere down the chain the Oracle client is telling you that the user id was bad or the tns name couldn't be found or whatever, but you never get that metadata pushed back up to the user.  

I hope this request will be easy to implement.

 

One of the handy features I really like about Alteryx's standard Join tool is the way you can select and rename fields right in the join tool.  It would be great if this feature could be added to the IN-DB join tool.  Whenever I perform a join in DB, I ALWAYS have to add a select tool after it since you always want to de-select one of the redundant fields that the join was based on i.e. (ID and R_ID). 

 

Behind the scenes, I'm sure the select feature would still have to be handled as if it were a separate select tool, but it would just be convenient if the user interface could combine those two features.

 

Clustering your data on a sample and then appending clusters is a common theme

especially if you are in customer relations and marketing related divisions...

 Picture2.png

 

When it comes to appending clusters that you have calculated form a 20K sample and then you're going to "score" a few million clients you still need to download the data and use the append cluster...

 

Why don't we have an In-db append cluster instead,

which will quicken the "distance based" scoring that append cluster does

on SQLServer, Oracle or Teradata...

Picture4.png

 

 

 

 Best

 

 

 

Hi All,

 

It would be great if Alteryx 10.5 supports connectivity to SAS server.

 

Regards,
Gaurav

The object name is too long in NFD/NFC when connected to Teradata Error occurs when you use SQL comments above the SELECT statement within the Connect In-DB customer query box. Once the comments are removed the problem is resolved. It would be great if the Connect In-DB tool could recognise comments as what they are. 

Issue:

Currently there is no "In - Database" node to perform action queries

 

Work around

Use of the pre / post SQL in "Output data" node to perform action queries

 

Impact

Alteryx ease of use of "In - Database" nodes to work on large scale databases

Currently use of Knmie and SSIS is preferred instead of Alteryx

 

Action Requested

Please add an action query node to "In - Database" group.

 

 

 

I'm using Alteryx with an Hadoop Cluster, so I'm using lots of In-DB tools to build my workflows using Simba Hive ODBC Driver.

 

My Hadoop administrator set some king of default properties in order to share the power of the cluster to many people.

 

But some intensive workflow request need to override some properties. For instance, I must adjust the size of the TEZ container in setting specific values for hive.tez.container.size and hive.tez.java.opts.

 

A work-arround is to set those properties in the server side properties panel in ODBC Administrator but if I have many different configuration, I will have a lot of ODBC datasource which is not the ideal.

 

If I could directly set those properties in the Connect In-DB tool, it would be nice.

Currently we resort to using a manual create table script in redshift in order to define a distribution key and a sort key in redshift.

 

See below:

http://docs.aws.amazon.com/redshift/latest/dg/tutorial-tuning-tables-distribution.html

 

It would be great to have functionality similar to the bulk loader for redshift whereby one can define distribution keys and sort keys as these actually improve the performance greatly with larger datasets

We don't have Server.  Sometimes it's easy to share a workflow the old fashioned way - just email a copy of it or drop it in a shared folder somewhere.  When doing that, if the target user doesn't have a given alias on their machine, they'll have issues getting the workflow to run.

 

So, it would be helpful if saving a workflow could save the aliases along with the actual connection information.  Likewise, it would then be nice if someone opening the workflow could add the aliases found therein to their own list of aliases.

 

Granted, there may be difficulties - this is great for connections using integrated authentication, but not so much for userid/password connections. Perhaps (if implemented) it could be limited along these lines.

 

Hello all,

 

It will be great if there is an option to specify sql statement or delete based on condition in write In-DB tool. We have to delete all record even though when we are trying to delete and append only a subset of records. If it allows for "WHERE" statement atleast, it will be very much useful. I have a long post going on about this requirement in http://community.alteryx.com/t5/Data-Preparation-Blending/Is-there-a-way-to-do-a-delete-statement-in... .

 

Regards,

Jeeva.

Top Liked Authors