Installing the Impala Driver
You can use the Apache Hive JDBC driver included with Data Collector to perform Impala queries. However, some distributions recommend using a native Impala JDBC driver.
To use the included Apache Hive JDBC driver included with Data Collector, you do not need to perform any additional steps.
- Download the native Impala JDBC driver for the Hive distribution that you use.
- Install the driver as an external library for the stage library used by the Hive
Query executor.
For example, say the executor is configured to use a CDP stage library. If you use Package Manager to install the driver, then in the Install External Libraries dialog box, you select the CDP stage library, then browse to select the Impala driver libraries to install.
For information about installing additional drivers, see Install External LibrariesInstall External Libraries in the Data Collector documentationExternal Resources in the Control Hub documentation.
If you have trouble determining the URL format to use when configuring the executor, check out our Ask StreamSets post.