What is the StreamSets Platform?
The StreamSets platform is a cloud-native platform for building, running, and monitoring data pipelines.
A pipeline describes the flow of data from origin to destination systems and defines how to process the data along the way. Pipelines can access multiple types of external systems, including cloud data lakes, cloud data warehouses, and storage systems installed on-premises such as relational databases.
As a pipeline runs, you can view real-time statistics and error information about the data as it flows from origin to destination systems.
The StreamSets platform uses the following components to manage your pipelines:
- Control plane
- The StreamSets control plane consists of StreamSets Control Hub, a public cloud service hosted by StreamSets that you access using a web browser. Use Control Hub to build, manage, and monitor your pipelines.
- Data plane
- The StreamSets data plane provides the following engines to process data:
The following image provides a general overview of the StreamSets platform components when using deployed engines: