on 09-29-2017 05:41 PM - edited on 03-08-2019 12:59 PM by SydneyF
Solution to access data stored in PARQUET format via the Alteryx Designer
Data in parquet format can be stored in hive tables and accessed from the Alteryx Designer via the hive ODBC driver.
Create a table in hive with "STORED AS PARQUET" for hive 0.13 and later.
Check the create table syntax in this article. For data already stored in the PARQUET format in HDFS, use "LOAD DATA" to load the HDFS files into a table in hive.
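As a rough sketch of the two steps above, the HiveQL might look like the following. The table name sales_parquet, its columns, and the HDFS path are hypothetical placeholders, not values from this article. The files at that path are assumed to already be in PARQUET format with a matching schema, since "LOAD DATA" moves files into the table without converting them.

```sql
-- Hypothetical table; hive 0.13 and later accept STORED AS PARQUET.
CREATE TABLE sales_parquet (
  id INT,
  amount DOUBLE,
  sale_date STRING
)
STORED AS PARQUET;

-- Move existing PARQUET files from HDFS into the table.
-- The path is a placeholder; the files are moved, not copied or converted.
LOAD DATA INPATH '/user/example/sales_parquet_files' INTO TABLE sales_parquet;
```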
Alteryx can read and write data from these tables via the hive ODBC driver.
To write the results of an Alteryx workflow back to a new hive table in the PARQUET format, set "hive.default.fileformat=PARQUET" in the Server Side Properties of the ODBC driver configuration, under "Advanced Options".
Another solution uses the Post Create SQL Statement to alter the table to the PARQUET format: "ALTER TABLE table_name SET FILEFORMAT PARQUET;"
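For reference, the two options above correspond roughly to the HiveQL below, shown as a sketch rather than a definitive recipe. The table name workflow_output is a hypothetical placeholder. It is also an assumption here that a Server Side Property in the ODBC driver has the same effect as a session-level SET statement; check the driver documentation for the exact behavior.

```sql
-- Option 1: make PARQUET the session default, so tables created
-- without an explicit STORED AS clause use it.
SET hive.default.fileformat=PARQUET;

-- Option 2: change the file format of an existing table
-- (workflow_output is a placeholder name).
ALTER TABLE workflow_output SET FILEFORMAT PARQUET;
```

Note that, per the hive DDL documentation, ALTER TABLE ... SET FILEFORMAT changes only the table's metadata. Data files already in the table are not converted, so it is worth confirming that rows written by the workflow read back correctly.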