Hive Streaming (deprecated)
Supported pipeline types:
|
Before you use the destination, verify that your Hadoop implementation supports Hive Streaming.
When configuring Hive Streaming, you specify the Hive metastore and a bucketed table stored in the ORC file format. You define the location of the Hive and Hadoop configuration files and optionally specify additional required properties. By default, the destination creates new partitions as needed.
Hive Streaming writes data to the table based on the matching field names. You can defining custom field mappings that override the default field mappings.
Before you use the Hive Streaming destination with the MapR library in a pipeline, you must perform additional steps to enable Data Collector to process MapR data. For more information, see MapR Prerequisites in the Data Collector documentation.
Hive Properties and Configuration Files
- Configuration files
- The following configuration files are required for the Hive Streaming
destination:
- core-site.xml
- hdfs-site.xml
- hive-site.xml
- Individual properties
- You can configure individual Hive properties in the destination. To add a
Hive property, specify the exact property name and the value. The
destination does not validate the property names or values.Note: Individual properties override properties defined in the configuration files.