Hi all experts, Current i face problem when i try to use output file or IN-DB, it is very slow. 30MB size csv import to hive database it tooks 2 and half hours to finish. Why is it happen? what factor decide the speed of output data from alteryx to hive database? HDFS? Internal network issues? ODBC config? More details: Im using lastest Simba ODBC to connect and get the data from UAT mysql database server, input data by input tools from mysql and local to browser data, it is fast and takes no more than 10 secound. In-db tools i tried also, read is fast and wirte is super slow. (no different with output tools). With this super slow speed of write data,totally unacceptable speed on real life. At the beginning i think is hdfs or hive issues so I tried to move file to HDFS and copyFromLocal write to hive db used only around 5 secound to finish mapreduce. Please help and i can provide more information if you need, thank you! More inforamtion to let you guys: | Testing UAT env | Hardware | Firewall | | Window server 2019 (Alteryx location) | 8 cores and 32 GB ram | Close | | HDFS - name node | 8 cores and 32 GB ram | Close | | HDFS - data node *2 | 4 cores and 16 GB ram per node | Close | | | | |
FYI,HDFS cluter and hive is using Apache Ambari open source management platform This picture is how Alteryx get data on testing environment. 
Alteryx Output data setting: 
Window Server ODBC config: 

|