In case you missed the announcement: The Alteryx One Fall Release is here! Learn more about the new features and capabilities here
ACT NOW: The Alteryx team will be retiring support for Community account recovery and Community email-change requests after December 31, 2025. Set up your security questions now so you can recover your account anytime, just log out and back in to get started. Learn more here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Scheduling a flow from on premise hive to s3

NM2
5 - Atom

Hi,

I am totally new to Alteryx.

I am planning to set up a flow which will:

1. Run a hive(on premise) query and upload the output to amazon S3.

2. Then get a file from Amazon S3 and populate an on premise hive table.

 

Can someone point me to right direction/documentation to implement this.

 

Thank you!

1 REPLY 1
MichalM
Alteryx Alumni (Retired)

@NM2 

 

You can use a standard Input Data tool or Alteryx's InDB functionality read data from your Hive instance (see here for more details). Once you get the data into the context of Alteryx, you can use the Amazon S3 upload tool to push the data into an S3 bucket (see here for more details).

 

Here's how it'd look like

 

hive-s3.png

Labels
Top Solution Authors