Installation Requirements for Self-Managed Deployments

Install Data Collector on a machine that meets the following minimum requirements. To run pipelines in cluster execution mode, each node in the cluster must meet the minimum requirements.

When working with self-managed deployments, you take full control of procuring the resources needed to run a Data Collector engine. You must set up the machine and complete the installation prerequisites required by the engine.

Before launching a Data Collector engine for a self-managed deployment, set up a machine with the following minimum requirements. Then, complete the additional Docker image prerequisites or tarball prerequisites based on the installation type you want to use.

Component Minimum Requirement
Operating system Use one of the following operating systems and versions:
  • Mac OS X
  • Amazon Linux 2
  • CentOS 6.x or 7.x
  • Oracle Linux 6.x - 8.x
  • Red Hat Enterprise Linux 6.x - 8.x
  • Ubuntu 14.04 LTS - 20.04 LTS
Cores 2
RAM 1 GB
Disk space 6 GB
Note: StreamSets does not recommend using NFS or NAS to store Data Collector files.
File descriptors 32768
Java Oracle Java 8 or OpenJDK 8
Browser Use the latest version of one of the following browsers:
  • Chrome
  • Firefox
  • Safari