Legacy Kubernetes Integration Overview
Applies to: IBM StreamSets as a Service
You can automatically provision Data Collector containers on a container orchestration framework in your environment, such as Kubernetes.
- Data Collector Docker image
- Customize the public Data Collector Docker image for your configuration requirements. For example, you might need to modify the Data Collector configuration files, install external libraries, or store custom stage libraries. Use Docker to customize the public Data Collector Docker image and then store the private image in your private repository.
- Provisioning Agent
- A Provisioning Agent is a containerized application that runs in a Kubernetes container orchestration framework. The agent communicates with Control Hub to automatically provision Data Collector containers in the Kubernetes cluster in which it runs. Provisioning includes deploying, registering, starting, scaling, and stopping the Data Collector containers. You can configure the Provisioning Agent to provision Data Collector containers enabled for Kerberos authentication.
- Legacy Deployments
- A legacy deployment is a logical grouping of Data Collector containers deployed by a Provisioning Agent to Kubernetes. All Data Collector containers in a deployment are identical and highly available.
Provisioning is especially useful when you require a large number of execution Data Collectors to run jobs. When you provision Data Collectors, you benefit from all of the features that both Docker and Kubernetes offer - including easily scaling Data Collector containers and updating Data Collector containers to a new image with a different Data Collector version or with different configurations.
You can also automatically provision an authoring Data Collector dedicated to pipeline design as long as the authoring Data Collector is provisioned from a unique deployment that doesn't include any execution Data Collectors.