Install Data Collector on Oracle Cloud

You can install the full Data Collector on Oracle Cloud.

Data Collector is installed as an RPM package on a Linux virtual machine hosted on Oracle Cloud. Data Collector is available as a service on the instance after the deployment is complete.

  1. Sign in to Oracle Cloud.
  2. Before you install StreamSets Data Collector, you must create a network security group for your virtual cloud network to define the range of IP addresses that can access the Data Collector instance.
    1. In the Oracle Cloud navigation panel, click Networking > Virtual Cloud Networks.
    2. Select the virtual cloud network where you want to install Data Collector.
    3. Under Resources, click Network Security Groups.
    4. Click Create Network Security Group.
    5. Enter a name for the network security group, such as sdc, and then click Security Rules.
    6. Add the following ingress rules for destination ports 18630 and 22.
      Important: The default range of 0.0.0.0/0 gives all IP addresses access to Data Collector. Be sure to modify the default value to restrict access to known IP addresses only.
      Ingress Rule Description
      Allow access to TCP port 18630 Range of IP addresses that can access the Data Collector web-based UI on port 18630.
      Allow access to TCP port 22 Range of IP addresses that can use SSH to access the Data Collector virtual machine on port 22 to run the Data Collector command line interface.

      For example, the following configuration allows a range of known IP addresses for each ingress rule:

    7. Click Create.
  3. In the Oracle Cloud navigation panel, click Compute.
  4. Select the compartment to use, and then click Create Instance.
  5. Enter a name for the instance, and then click Change Image Source.
  6. Click Partner Images.
  7. Select StreamSets Data Collector, review and accept the terms of use, and then click Select Image.
  8. Select the appropriate availability domain, and then select Virtual Machine for the instance type.

    You can optionally change the instance shape, but the default settings are sufficient in most cases. If you change the defaults, select values that meet the minimum Data Collector requirements.

    For example, the following configuration creates a compute instance named streamsets-data-collector in the AD 1 availability domain as a virtual machine with the default instance shape settings:

  9. In the Configure networking section, select the appropriate compartment, virtual cloud network, and subnet compartment.
  10. For Subnet, select an available public subnet.
  11. Select Use network security groups to control traffic, and then select the network security group that you created earlier.
  12. Select Assign a public IP address.

    For example, the following configuration creates a compute instance in a public subnet that uses the sdc security group and is assigned a public IP address:

  13. You can optionally change the boot volume size, but the default is sufficient in most cases.

    If you change the default, select a size that meets the minimum Data Collector requirements.

  14. Use your preferred method to add the SSH keys for this compute instance.
  15. Click Create.

    It can take several minutes for Oracle Cloud to deploy Data Collector and start it as a service. When available, the Instance Details page displays the public IP address and the OCID for Data Collector, as follows:

  16. Copy the public IP address and the OCID from the Instance Details page.
  17. To access the Data Collector UI, enter the public IP address in the address bar of your browser, for example:
    http://<public IP address>:18630
  18. To log in to Data Collector, enter admin as the default user account and enter the OCID value as the password.

    For information on administering Data Collector, such as viewing logs and restarting Data Collector, see Administration.

    Tip: If you are new to Data Collector, consider starting with the Tutorial.