MapR Prerequisites

MapR is now HPE Ezmeral Data Fabric. At times, this documentation uses "MapR" to refer to both MapR and HPE Ezmeral Data Fabric. For information about supported versions, see Supported Systems and Versions in the Data Collector documentation.

Due to licensing restrictions, StreamSets cannot distribute MapR libraries with Data Collector. As a result, you must perform additional steps to enable the Data Collector machine to connect to MapR. Data Collector does not display MapR stages in stage library lists nor the MapR Streams statistics aggregator in the pipeline properties until you perform these prerequisites. Install Data Collector on a node in the MapR cluster or on a client machine.

The MapR prerequisites include installing all necessary client libraries on the Data Collector machine. If you use a core installation of Data Collector, you must install the MapR stage libraries. If you use a common installation of Data Collector, you install MapR stage libraries when Data Collector does not include the version that you want to use. Then, you run the command to set up MapR.

If the MapR cluster is enabled with built-in security, you also must configure Data Collector to connect to a secure MapR cluster and ensure that a valid ticket exists for the Data Collector user.