Clustering Condition Dialog
Use the Clustering Condition dialog to add or edit clustering conditions in the "Records will be clustered if any condition is satisfied" field of the Clustering node. The dialog contains the following elements:
Available fields - Displays the fields that are available to cluster on. You can select multiple fields.
Cluster if - Lists the individual expressions (entities) that make up a clustering condition. The items that are displayed in this list depend on your data sources and any preceding steps in your data job. Within a condition, two records match if any of the specified entities match between them. For example:
- If Email1 is specified as the only entity within a condition, then two records match if their Email1 fields match. This is represented as Match (Email1).
- If Email1 and Email2 are combined into a single entity within a condition, then two records match if the concatenations of Email1 and Email2 match. This is represented as Match (Email1+Email2).
- If Email1 and Email2 are specified as two individual entities within a condition, then two records match if any of these fields in one record matches any of these fields in the other. This scenario includes cross matches: for example, Email1 in one record matches Email2 in another, and so on for all combinations. This type of match is represented as Cross match (Email1, Email2).
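The three cases above can be sketched in a few lines of Python. This is only an illustration of the matching rules as described here, not the product's implementation: the function names `match` and `cross_match`, the dictionary record layout, and the use of exact string equality are all assumptions made for the example.

```python
# Illustrative sketch only: names, record layout, and exact string
# comparison are assumptions, not the Clustering node's actual logic.

def match(rec_a, rec_b, fields):
    """Match (f1+f2+...): compare the concatenation of the listed fields."""
    key_a = "".join(rec_a[f] for f in fields)
    key_b = "".join(rec_b[f] for f in fields)
    return key_a == key_b


def cross_match(rec_a, rec_b, fields):
    """Cross match (f1, f2, ...): any listed field in one record equals
    any listed field in the other record."""
    values_a = {rec_a[f] for f in fields}
    values_b = {rec_b[f] for f in fields}
    return not values_a.isdisjoint(values_b)


# Hypothetical sample records.
r1 = {"Email1": "ann@example.com", "Email2": "a.lee@example.com"}
r2 = {"Email1": "a.lee@example.com", "Email2": "ann@work.example"}

single = match(r1, r2, ["Email1"])               # Match (Email1)
combined = match(r1, r2, ["Email1", "Email2"])   # Match (Email1+Email2)
crossed = cross_match(r1, r2, ["Email1", "Email2"])  # Cross match (Email1, Email2)
```

With these sample records, only the cross match succeeds, because Email2 in the first record equals Email1 in the second.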
The buttons to the right of the Cluster if field enable you to perform the following functions:
- Combine multiple selected entities in the format {entity1}+{entity2}. In practice, the "+" represents string concatenation. After the combine operation, only the combined entity remains in the list. The individual entities that were combined are removed.
- Move selected entities up and down in the list.
- Delete selected entities.
Preview - Provides a preview of the string representation of the clustering condition that is being edited.
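As a rough model of how the combine operation and the preview string relate, the following Python sketch treats each entity as a list of field names. The functions `combine` and `preview` are hypothetical, and two details are assumptions rather than documented behavior: that the combined entity takes the position of the first selected entity, and that a cross match containing a combined entity is displayed as Cross match (Email1+Email2, Phone).

```python
# Illustrative sketch only: function names, entity representation, and the
# position of the combined entity are assumptions, not documented behavior.

def combine(entities, selected):
    """Merge the entities at the selected indices into one combined entity.
    The individual entities are removed; the combined one takes the place
    of the first selected entity (an assumption)."""
    chosen = sorted(selected)
    merged = [field for i in chosen for field in entities[i]]
    result = [e for i, e in enumerate(entities) if i not in chosen]
    result.insert(chosen[0], merged)
    return result


def preview(entities):
    """Build the string representation of a clustering condition."""
    parts = ["+".join(e) for e in entities]  # "+" denotes concatenation
    if len(parts) == 1:
        return f"Match ({parts[0]})"
    return f"Cross match ({', '.join(parts)})"


# Hypothetical starting list of three individual entities.
entities = [["Email1"], ["Email2"], ["Phone"]]
before = preview(entities)
after = preview(combine(entities, [0, 1]))
```

Here `before` is "Cross match (Email1, Email2, Phone)", and after combining the first two entities, `after` is "Cross match (Email1+Email2, Phone)".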