Field Renamer
Use the Field Renamer to rename fields in a record. You can specify individual fields to rename or use regular expressions to rename sets of fields.
You can configure the behavior when a source field does not exist, when a target field with a matching name already exists, and when a source field matches multiple source field expressions.
Fields that are not renamed or overwritten pass to the next stage.
Renaming Sets of Fields
You can use regular expressions, or regex, along with the IBM StreamSets expression language to rename sets of fields. You can use regex to define the set of source fields to rename and to define the target field names. You can also use the IBM StreamSets expression language to define target field names.
- Remove a prefix or suffix
-
Say you want to remove an OPS prefix from a set of fields. You can do this by using the following expression to define the source fields to change:
/'OPS(.*)'
Then use the following expression to remove the OPS prefix:/$1
- Remove special characters
- To remove special characters from field names, you can use the following
expression for the source field
names:
/'([A-Z a-z]*)[^a-z A-Z 0-9]([A-Z a-z 0-9]*)'
- Change capitalization
- To change field names to all uppercase, use the following expression for the
source field names:
/(.*)
Then use the following expression for the target field names:/${str:toUpper("$1")}
tag|attr
, enter the field name as follows:
/'tag|attr'
Configuring a Field Renamer Processor
-
In the Properties panel, on the General tab, configure the
following properties:
General Property Description Name Stage name. Description Optional description. Required Fields Fields that must include data for the record to be passed into the stage. Tip: You might include fields that the stage uses.Records that do not include all required fields are processed based on the error handling configured for the pipeline.
Preconditions Conditions that must evaluate to TRUE to allow a record to enter the stage for processing. Click Add to create additional preconditions. Records that do not meet all preconditions are processed based on the error handling configured for the stage.
On Record Error Error record handling for the stage: - Discard - Discards the record.
- Send to Error - Sends the record to the pipeline for error handling.
- Stop Pipeline - Stops the pipeline.
-
On the Rename tab, configure the following
properties:
Field Renamer Property Description Fields to Rename To rename fields, enter or select the fields to rename in Source Field Expression, and enter the new name for the field in Target Field Expression. Click Add to rename additional fields. Note: The Field Renamer processor cannot rename fields that contain a backslash character ( \ ).To rename sets of fields, use expressions:- You can use regex in both properties.
- You can use the IBM StreamSets expression language in the Target Field Expression property.
To rename an array or map, you can specify a single array index or map element, or you can use the asterisk wildcard to represent all array indices and map elements.
You cannot use regex to select a subset of array indices and map elements. For example, if the Division array contains 20 indices, you cannot use the following regex to rename the field path for the first 10 indices:/Division[0-9]
Note: If you rename fields within a list or list-map field, the processor lists the renamed fields at the end of the list or list-map field. You can use a Field Order processor to reorder fields in a list-map field.Source Field Does Not Exist Behavior when a source field does not exist in a record: - Continue - Continues processing the record, ignoring missing source fields.
- Send to Error - Handles the record based on the error handling configured for the stage.
Target Field Already Exists Behavior when the record includes a field name that matches a proposed target field: - Replace - Replaces the existing field with the renamed field.
- Append Numbers - Appends numbers to any duplicates in the renamed fields.
- Continue - Continues processing the record, making no change for existing fields.
- Send to Error - Handles the record based on the error handling configured for the stage.
Multiple Source Field Matches Behavior when a source field matches more than one source field expression: - Continue - Continues processing the record, skipping fields with multiple matches.
- Send to Error - Handles the record based on the error handling configured for the stage.