Expressions in Pipeline and Stage Properties
- Snowflake SQL for data manipulation
  Snowflake SQL is the relational query language used with Snowflake. Because processing for Transformer for Snowflake pipelines occurs in Snowflake, you must use Snowflake SQL for all expressions that manipulate pipeline data, such as expressions in condition or query properties.
- IBM StreamSets expression language for one-time evaluation
  The IBM StreamSets expression language is based on the JSP 2.0 expression language. If you already use IBM StreamSets, you are probably familiar with it.
- Java regular expressions for pattern matching
  A Java regular expression, also called a regular expression or regex, is a set of characters that represents a pattern used to search for matching text.
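The pattern-matching idea in the last item can be sketched with the standard java.util.regex classes. The class name, the pattern, and the sample strings below are hypothetical and chosen only for illustration; they are not taken from any pipeline or stage property.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class RegexSketch {
    // Hypothetical pattern: the literal prefix "cust_" followed by one or more word characters.
    static final Pattern CUSTOMER_COLUMN = Pattern.compile("cust_\\w+");

    // Returns true only when the entire input matches the pattern.
    static boolean matchesWhole(String input) {
        return CUSTOMER_COLUMN.matcher(input).matches();
    }

    // Returns the first substring that matches the pattern, or null when there is none.
    static String firstMatch(String input) {
        Matcher matcher = CUSTOMER_COLUMN.matcher(input);
        return matcher.find() ? matcher.group() : null;
    }

    public static void main(String[] args) {
        System.out.println(matchesWhole("cust_id"));
        System.out.println(matchesWhole("order_id"));
        System.out.println(firstMatch("select cust_name from t"));
    }
}
```

Note the difference between matches(), which requires the whole string to fit the pattern, and find(), which searches for the pattern anywhere in the text.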
Referencing Columns in Snowflake SQL Expressions
To reference a column that contains a single level of data in a Snowflake SQL expression, you simply specify the column name. Column names are not case-sensitive.
For example, to deduplicate data based on an ID column, you configure a Deduplicate processor to deduplicate based on columns. Then, you can specify ID, Id, iD, or id as the column to use.
When a column contains hierarchical data, you reference a specific element within that column differently, based on the following Snowflake data types:
- Object
  To reference an element within an Object column, use dot notation (.) to specify the path to the element, as follows:
  <top level>.<next level>.<next level>.<element to use>
  For example, customer.transactions.2019.
- Array
  To reference an element within an Array column, use bracket notation ([#]) to indicate the position in a list. Use 0 to indicate the first element in the list, 1 to indicate the second, and so on.
  For example, to reference the second element in an appt_date Array column, enter appt_date[1].
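As a rough mental model of the two notations, the following Java sketch walks a nested map one dot-separated level at a time and reads a list by zero-based position. It only illustrates the path and index semantics described above; it is not how Snowflake evaluates expressions, and the class name, method names, and sample values are all hypothetical.

```java
import java.util.List;
import java.util.Map;

final class PathSketch {
    // Hypothetical row with an Object-like "customer" column.
    static Map<String, Object> sampleRow() {
        return Map.of("customer", Map.of("transactions", Map.of("2019", 42)));
    }

    // Hypothetical Array-like "appt_date" column values.
    static List<String> sampleDates() {
        return List.of("2019-01-05", "2019-02-11", "2019-03-20");
    }

    // Dot notation: follow one key per level; returns null if a level is missing.
    static Object byDots(Map<String, Object> row, String path) {
        Object current = row;
        for (String key : path.split("\\.")) {
            if (!(current instanceof Map)) {
                return null;
            }
            current = ((Map<?, ?>) current).get(key);
        }
        return current;
    }

    // Bracket notation: position 0 is the first element, 1 is the second, and so on.
    static Object byIndex(List<?> values, int index) {
        return values.get(index);
    }

    public static void main(String[] args) {
        System.out.println(byDots(sampleRow(), "customer.transactions.2019"));
        System.out.println(byIndex(sampleDates(), 1));
    }
}
```

Here the path customer.transactions.2019 reaches the innermost value, and index 1 selects the second date, mirroring appt_date[1].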