Enterprise Stage Libraries

Enterprise stage libraries provide stages that connect to advanced external systems. Enterprise stage libraries are released separately from Data Collector. As a result, you must install them yourself on every Data Collector installation.
Note: Data Collector accessed through a cloud service provider marketplace automatically includes all Enterprise stage libraries except for the Protector and the SQL Server 2019 Big Data Cluster stage libraries.

Be sure to install a version of each stage library that is supported for the Data Collector version that you are using.

You can install Enterprise stage libraries in one of two ways: using Package Manager, for a tarball Data Collector installation, or as custom stage libraries, for a tarball, RPM, or Cloudera Manager Data Collector installation.

For installation instructions, the list of supported versions, and other prerequisite tasks, see the documentation for the individual stages, listed below.

The release notes for Enterprise stage libraries are available on the StreamSets Documentation page.

StreamSets provides the following Enterprise stage libraries:
| Stage Library | Stage Library Name | Description |
| --- | --- | --- |
| Azure Synapse | streamsets-datacollector-azure-synapse-lib | For Azure Synapse. Includes the Azure Synapse SQL destination. For version information, see Supported Versions for the Azure Synapse stage library. |
| Databricks | streamsets-datacollector-databricks-lib | For Databricks. Includes the Databricks Delta Lake destination and the Databricks Query executor. For version information, see Supported Versions for the Databricks stage library. |
| Google | streamsets-datacollector-google-lib | For Google. Includes the Google BigQuery (Enterprise) destination. For version information, see Supported Versions for the Google stage library. |
| GPSS | streamsets-datacollector-greenplum-lib | For Greenplum. Includes the GPSS Producer destination. For version information, see Supported Versions for the GPSS stage library. |
| MemSQL | streamsets-datacollector-memsql-lib | For MemSQL. Includes the MemSQL Fast Loader destination. For version information, see Supported Versions for the MemSQL stage library. |
| Oracle | streamsets-datacollector-oracle-lib | For bulk loading from Oracle tables. Includes the Oracle Bulkload origin. For version information, see Supported Versions for the Oracle stage library. |
| Protector | streamsets-datacollector-protector-lib | For protecting sensitive data. Includes a set of Protector stages. For a full list, see the Protector release notes. |
| Snowflake | streamsets-datacollector-snowflake-lib | For Snowflake. Includes the Snowflake destination, the Snowflake File Uploader destination, and the Snowflake executor. For version information, see Supported Versions for the Snowflake stage library. |
| Microsoft SQL Server 2019 Big Data Cluster | streamsets-datacollector-sql-server-bdc-lib | For SQL Server 2019 Big Data Cluster. Includes the SQL Server 2019 BDC Multitable Consumer origin and the SQL Server 2019 BDC Bulk Loader destination. For version information, see Supported Versions for the SQL Server 2019 Big Data Cluster stage library. |
| Teradata | streamsets-datacollector-teradata-lib | For Teradata. Includes the Teradata Consumer origin. For version information, see Supported Versions for the Teradata stage library. |
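
If you script your installations, it can help to keep the list above in a machine-readable form. The following Python sketch is one way to do that. It is not part of any StreamSets API: the names ENTERPRISE_STAGE_LIBRARIES, NOT_BUNDLED_WITH_MARKETPLACE, library_for_stage, and needs_manual_install are hypothetical, and the data is copied from the list and the note on this page. The Protector entry is left empty because its stages are listed only in the Protector release notes.

```python
# Illustrative lookup built from the list on this page.
# All names here are hypothetical helpers, not a StreamSets API.
from typing import Optional

# Stage library name -> stages it includes, as listed on this page.
ENTERPRISE_STAGE_LIBRARIES = {
    "streamsets-datacollector-azure-synapse-lib": ["Azure Synapse SQL destination"],
    "streamsets-datacollector-databricks-lib": [
        "Databricks Delta Lake destination",
        "Databricks Query executor",
    ],
    "streamsets-datacollector-google-lib": ["Google BigQuery (Enterprise) destination"],
    "streamsets-datacollector-greenplum-lib": ["GPSS Producer destination"],
    "streamsets-datacollector-memsql-lib": ["MemSQL Fast Loader destination"],
    "streamsets-datacollector-oracle-lib": ["Oracle Bulkload origin"],
    # Stages are listed in the Protector release notes, not on this page.
    "streamsets-datacollector-protector-lib": [],
    "streamsets-datacollector-snowflake-lib": [
        "Snowflake destination",
        "Snowflake File Uploader destination",
        "Snowflake executor",
    ],
    "streamsets-datacollector-sql-server-bdc-lib": [
        "SQL Server 2019 BDC Multitable Consumer origin",
        "SQL Server 2019 BDC Bulk Loader destination",
    ],
    "streamsets-datacollector-teradata-lib": ["Teradata Consumer origin"],
}

# Per the note on this page, a marketplace Data Collector includes every
# Enterprise stage library except these two.
NOT_BUNDLED_WITH_MARKETPLACE = {
    "streamsets-datacollector-protector-lib",
    "streamsets-datacollector-sql-server-bdc-lib",
}


def library_for_stage(stage_name: str) -> Optional[str]:
    """Return the stage library that includes the named stage, or None."""
    wanted = stage_name.strip().lower()
    for library, stages in ENTERPRISE_STAGE_LIBRARIES.items():
        if any(stage.lower() == wanted for stage in stages):
            return library
    return None


def needs_manual_install(library: str, marketplace: bool) -> bool:
    """Return True if the library must be installed separately."""
    if library not in ENTERPRISE_STAGE_LIBRARIES:
        raise KeyError(f"Unknown Enterprise stage library: {library}")
    # Non-marketplace installations always need a separate install.
    return (not marketplace) or library in NOT_BUNDLED_WITH_MARKETPLACE


if __name__ == "__main__":
    lib = library_for_stage("Snowflake executor")
    print(lib, needs_manual_install(lib, marketplace=True))
```

The lookup only tells you which library to install. For the supported versions and prerequisite tasks, you still need the documentation for the individual stage.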