Be sure to review our Idea Submission Guidelines for more information!
Submission GuidelinesAs you may know, the interrogation of Hive to get the Metadata is actually very slow on Alteryx
A first step of improvement (at least in the Visual Query Builder) has been proposed here
But the real issue for Hive is that the way Alteryx queries the Metadata : it passes "Show table" queries for all the databases. On our cluster, it means more than 400 queries that last each avout 0.5 seconds. The user has to to wait about 4 minutes.
A solution : using an API in java to ask the Hive metastore if it exists (it may be an other tab in the In database configuration). Our cluster admin has an example of a Thrift API in java that we can give you.
Result : 2 seconds for a 38700 tables in more than 500 databases !!
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.