Greenplum is a new in-db capability; which our company has started using frequently. GP is a distributed database, across many nodes, much like how Redshift and other new database technology works.
In order to properly use these databases the Alteryx tools MUST have a way to distribute our tables across the nodes how we desire. Currently the tools for In-DB will distribute by the database default (almost always taking the first column in the table to distribute) which has terrible affects on performance. The work around has been to run SQL outside of the Alteryx tools to run ALTER TABLE DISTRIBUTE BY statements for all tables; also this cannot work on TEMP tables as those tables are created and deleted at the end of the workflow.
The idea I want is an additional option for Distributed DB's to choose the distribution column OR choose the "RANDOMLY" option for distribution of tables [DISTRIBUTED BY statement]. This would make my DBAs very happy and make it possible for me to run even more challenging requests to my system.