You can preview data to help build or fine-tune a pipeline. When you preview data, the Data Collector passes data through the pipeline and allows you to review how the data passes through each stage.
You can use data preview to help you configure the pipeline. After you configure the origin and link the stages that you want to review, you start the data preview.
In Preview mode, you cannot add or delete stages, but you can edit stage properties and run the preview again to see how your changes affect preview data. Similarly, you can edit preview data to test and tune pipeline configuration.
You can preview data for one stage at a time or for a group of stages. You can view the data in list or table view. You can also refresh the preview data.
You can preview complete and incomplete pipelines. The Data Preview icon becomes active when data preview is available.
As a tool for development, data preview does not write data to destinations by default.
If you like, you can configure the preview to write data to destinations. We advise against writing preview data to production destinations.
You can use Data Collector to view how data passes through the pipeline.
The following image shows Data Collector in Preview mode:
Area/Icon | Name | Description |
---|---|---|
1 | Pipeline canvas | Displays the pipeline. |
2 | Preview panel |
Displays the data that enters and exits the selected stage or group of stages. It can also display stage properties and preview configuration. |
![]() |
StreamSets Control Hub icon | Provides information about StreamSets Control Hub (SCH) and lets you register this Data Collector with Control Hub. |
![]() |
Home icon | Displays a home page with a list of pipelines and their statuses, allowing you to perform pipeline maintenance and navigate to individual pipelines. |
![]() |
Package Manager icon | Displays the Package Manager which allows you to install additional stage libraries for a core Data Collector installation. |
![]() |
Notifications icon | Displays notifications. |
![]() |
Administration icon | Provides access to Data Collector configuration properties, directories, and log. Also allows you to shut down Data Collector. |
![]() |
User icon | Displays the active user and the roles assigned to the user. Also allows you to log out of Data Collector. |
![]() |
Help icon | Provides context-sensitive help based on the information in the panel. Allows you to
configure display settings and to specify whether to use a local or hosted version of the
help. Provides access to the REST API and the Data Collector version. |
![]() |
Link to a pipeline list | Link to a pipeline list on the Home page. Use to view a list of available pipelines, perform pipeline maintenance like starting or sharing a pipeline, and navigate to individual pipelines. |
![]() |
Records icon | Displays data preview records. |
![]() |
Stage Configuration icon | Displays stage properties. |
![]() |
Preview Configuration icon | Displays data preview properties. |
![]() |
Single button | Displays input and output data for a single stage. |
![]() |
Multiple button | Displays input and output data for a group of stages. |
![]() |
List View icon | Displays preview data in a list. |
![]() |
Table View icon | Displays preview data in a table. |
![]() |
Previous Stage icon | Moves the preview to the previous stage. |
![]() |
Next Stage icon | Moves the preview to the next stage. |
![]() |
Refresh Preview icon | Provides a fresh set of data from the origin. |
![]() |
Run Preview with Changes icon | Runs the data preview using changed data. Use to see how edited data is processed by the pipeline. |
![]() |
Revert Changes icon | Reverts all changes to preview data and returns the preview to the origin. |
![]() |
Close Preview icon | Closes the data preview. |
For information about maintaining pipelines on the Home page, see Data Collector UI - Pipelines on the Home Page.
For information about configuring pipelines, see Data Collector UI - Edit Mode.
For information about pipeline monitoring options, see Data Collector UI - Monitor Mode.
In Preview mode, Data Collector displays different colors for different types of data. Data Collector uses other codes and formatting to highlight changed fields.
Preview Code | Description |
---|---|
Black values | Date data |
Blue values | Numeric data |
Green values | String data |
Red values | Boolean data |
Asterisk | Records that include edited field values |
Red italic field labels | Fields that contain edited data |
Light red background | Fields removed by a stage |
Italic values | Edited data |
Green stage | First stage in a multiple-stage preview |
Red stage | Last stage in a multiple-stage preview |
You can preview data for a single stage. In the Preview panel, you can review the values for each record to determine if the stage transforms data as expected.
You can preview data for a group of linked stages within a pipeline.
When you preview multiple stages, you select the first stage and the last stage in the group. The Preview panel then displays the input data and the output data for the group. The input data is the data that enters the first stage. The output data is the data that passes from the last stage.
In the Preview panel, you can review the values for each record to determine if the group of stages transforms data as expected.
You can edit preview data to view how a stage or group of stages processes the changed data. Edit preview data to test for data conditions that might not appear in the preview data set.
For example, when the stage filters integer data based on an expression, you might change the input data to test positive and negative integer values, as well as zero.
When you edit preview data, you can pass the changed data through the pipeline, or you can revert your changes to return to the original data.
In data preview, you can edit stage properties to see how the changes affect preview data. For example, you might edit the expression in an Expression Evaluator to see how the expression alters data.
When you edit properties, you can test the change with the existing preview data or you can refresh the preview data.