Transformer Registration with Control Hub Overview
Transformer is an execution engine that works directly with StreamSets Control Hub. You install Transformer on a machine that is configured to submit Spark jobs to a cluster, such as a Hadoop edge or data node or a cloud virtual machine. After installing Transformer, you register it to work with Control Hub. You can use Transformer with Control Hub cloud or with Control Hub on-premises version 3.11.0 or later.
When you register Transformer, you assign labels to it. The labels determine which Control Hub jobs run on that Transformer.
You can install and register multiple instances of Transformer with Control Hub. For example, you might install multiple instances of Transformer to work with different Hadoop YARN clusters. Or you might use one Transformer installation as a test environment and another installation as a production environment.
You can use each registered Transformer for both authoring and execution in Control Hub. You design pipelines in the Control Hub Pipeline Designer after selecting an available authoring Transformer to use. When you run pipelines from Control Hub jobs, you assign labels to the jobs and to the Transformers to determine the execution Transformer that runs the pipeline.
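The way labels narrow down the execution Transformer can be pictured with a small sketch. This is only an illustration, not Control Hub code or its API: it assumes that a Transformer is eligible for a job when it carries every label assigned to that job, and the function name, Transformer names, and labels are all hypothetical.

```python
def eligible_transformers(job_labels, transformers):
    """Return the names of Transformers that carry every label on the job.

    Assumption: a job can run only on a Transformer that has all of the
    job's labels. `transformers` maps a (hypothetical) Transformer name
    to the set of labels assigned to it at registration.
    """
    required = set(job_labels)
    return [
        name
        for name, labels in transformers.items()
        if required <= set(labels)  # subset test: all job labels are present
    ]


# Hypothetical registered Transformers: one for testing, one for production.
registered = {
    "transformer-test": {"test", "yarn-a"},
    "transformer-prod": {"prod", "yarn-b"},
}

print(eligible_transformers(["prod"], registered))
```

Under this assumption, a job labeled prod would be matched only to the production installation, which mirrors the test and production split described earlier.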
Before you register Transformer, ensure that you have enabled HTTPS for Transformer.