Install StreamSets for Databricks on Amazon Web Services
You can install StreamSets for Databricks on Amazon Web Services (AWS). StreamSets for Databricks includes both StreamSets Data Collector and Transformer.
Data Collector and Transformer are installed as RPM packages on an Amazon Linux 2 machine hosted on EC2. Data Collector and Transformer are available as services on the instance after the deployment is complete.
For more details about StreamSets for Databricks on AWS, see the AWS Marketplace listing.
- In the AWS Marketplace, search for StreamSets, and then subscribe to the StreamSets for Databricks offering.
- Accept the terms and conditions, and then click Continue to Configuration.
- Select the appropriate AWS fulfillment options, and then click Continue to Launch.
- To launch StreamSets for Databricks from the AWS Marketplace website, choose Launch from Website and then complete the following steps:
- To launch StreamSets for Databricks from the AWS EC2 console, choose Launch through EC2 and then complete the following steps:
- When launching the instance, note the instance ID on the Launch Status page.
  The password to Data Collector and Transformer matches the instance ID.
  AWS might require a few minutes to launch an instance.
- To access Data Collector, enter the following URL in the address bar of your browser:
  http://<Public DNS of EC2 instance>:18630
  For example, if your DNS is ec2-12-345-678-999.compute-1.amazonaws.com, enter:
  http://ec2-12-345-678-999.compute-1.amazonaws.com:18630
- To access Transformer, enter the following URL in the address bar of your browser:
  http://<Public DNS of EC2 instance>:19630
  For example, if your DNS is ec2-12-345-678-999.compute-1.amazonaws.com, enter:
  http://ec2-12-345-678-999.compute-1.amazonaws.com:19630
- To log in to either Data Collector or Transformer, enter admin as the user name and the EC2 instance ID as the password.
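The URL and login rules in the steps above can be sketched in a few lines of Python, which may help when scripting access to several instances. This is only an illustration: the function names `streamsets_urls` and `initial_credentials` are hypothetical and are not part of any StreamSets or AWS tooling, and the instance ID in the example is a made-up placeholder. The ports and the admin user name come from the steps above.

```python
# Ports taken from the access URLs in the steps above.
DATA_COLLECTOR_PORT = 18630
TRANSFORMER_PORT = 19630


def streamsets_urls(public_dns: str) -> dict:
    """Return the Data Collector and Transformer URLs for an EC2 public DNS name."""
    host = public_dns.strip()
    if not host:
        raise ValueError("public_dns must not be empty")
    return {
        "data_collector": f"http://{host}:{DATA_COLLECTOR_PORT}",
        "transformer": f"http://{host}:{TRANSFORMER_PORT}",
    }


def initial_credentials(instance_id: str) -> tuple:
    """Return the (user name, password) pair: admin and the EC2 instance ID."""
    return ("admin", instance_id)


if __name__ == "__main__":
    # Example DNS name from this page; the instance ID is a placeholder (assumption).
    urls = streamsets_urls("ec2-12-345-678-999.compute-1.amazonaws.com")
    for name, url in urls.items():
        print(name, url)
    print(initial_credentials("i-0123456789abcdef0"))
```

Substitute the public DNS name and instance ID of your own instance for the example values.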
Tip: If you are new to Data Collector, consider starting with the Databricks Delta Lake solutions. If you are new to Transformer, here are the basics.
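Because AWS might need a few minutes to launch the instance, a script can poll until the Data Collector or Transformer port accepts connections before you open the URL. The following is a minimal sketch under stated assumptions: the helper names `port_is_open` and `wait_until_ready` and the timeout values are illustrative choices, not StreamSets or AWS features, and a successful TCP connection only suggests that the service is listening, not that it has finished starting.

```python
import socket
import time


def port_is_open(host: str, port: int, connect_timeout_s: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=connect_timeout_s):
            return True
    except OSError:
        # Refused, timed out, or DNS not resolvable yet: treat as "not ready".
        return False


def wait_until_ready(probe, timeout_s: float = 600.0, interval_s: float = 10.0,
                     sleep=time.sleep, clock=time.monotonic) -> bool:
    """Call probe() until it returns True or timeout_s elapses.

    sleep and clock are injectable so the loop can be exercised without waiting.
    """
    deadline = clock() + timeout_s
    while True:
        if probe():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


# Example usage (hypothetical host; replace with your instance's public DNS):
# host = "ec2-12-345-678-999.compute-1.amazonaws.com"
# ready = wait_until_ready(lambda: port_is_open(host, 18630))
# print("Data Collector reachable:", ready)
```

To check Transformer instead, pass port 19630 to the probe.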