DataFlux Data Management Studio 2.7: User Guide

Standardization Definitions

A standardization definition converts input into a desired standard format.

Input: a string, or input that has already been parsed.

Example:

"10 Main Street, Boston, Mass."

Output: the standardized string

Example:

"10 Main St, Boston, MA"
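
As a rough illustration of the input and output shown above, the following Python sketch replaces words that have a known standard form. The lookup table contents and the function name `standardize` are assumptions made for this example only; they are not taken from any DataFlux definition.

```python
import re

# Hypothetical lookup table mapping variants to a standard form (illustrative only).
STANDARD_FORMS = {
    "STREET": "St",
    "MASS": "MA",
}

def standardize(text: str) -> str:
    """Replace each word that has a known standard form; leave the rest unchanged."""
    def replace(match: re.Match) -> str:
        word = match.group(0)
        # Compare in upper case, ignoring a trailing period such as the one in "Mass."
        key = word.rstrip(".").upper()
        return STANDARD_FORMS.get(key, word)

    # A "word" here is a run of letters with an optional trailing period.
    return re.sub(r"[A-Za-z]+\.?", replace, text)

print(standardize("10 Main Street, Boston, Mass."))  # prints: 10 Main St, Boston, MA
```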

Hierarchy  Node/Group                            Container Group                       Count
1          Standardization Definition Head Node                                        1
2          Tokenization Node                                                           1
3          Standardization Token Group                                                 1 or more (*)
3.1        Normalization Group                   Standardization Token Group           1
3.1.1      Trimming Node                         Normalization Group
3.1.2      Uppercasing Node                      Normalization Group                   1
3.1.3      Pre-Scheme Regex Libraries Group      Normalization Group                   1
3.1.3.1    Regex Library Node                    Pre-Scheme Regex Libraries Group      0 or more
3.2        Transformation Schemes Group                                                1
3.2.1      Transformation Scheme Node            Transformation Schemes Group          0 or more
3.3        Post-Scheme Regex Libraries Group     Standardization Token Group           1
3.3.1      Regex Library Node                    Post-Scheme Regex Libraries Group     0 or more
3.4        Casing Node                                                                 1
4          Concatenation Node                                                          1 (see Note 1)
5          Postprocessing Regex Libraries Group                                        1 (see Note 1)
5.1        Postprocessing Regex Library Node     Postprocessing Regex Libraries Group  0 or 1 (see Note 1)
6          Casing Node                                                                 1 (see Note 1)

Note 1: This node or group is included only when tokenization is configured, as in the Business Title standardization definition in the English locale.
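
One way to read the hierarchy is as a processing pipeline. The Python sketch below assumes that the nodes run in the order of their hierarchy numbers and that tokenization (node 2) has already produced named tokens; this section does not state either point. Every name in it (`TokenGroup`, `standardize_token`, `standardize`, the token names, the sample scheme entries, and the separator) is hypothetical and serves only to show how the per-token steps 3.1 to 3.4 could feed the concatenation and postprocessing steps 4 to 6.

```python
import re
from dataclasses import dataclass, field

@dataclass
class TokenGroup:
    """Settings for one token, loosely mirroring hierarchy entries 3.1 to 3.4."""
    pre_regex: list = field(default_factory=list)   # (pattern, replacement) pairs
    scheme: dict = field(default_factory=dict)      # word -> standard form
    post_regex: list = field(default_factory=list)  # (pattern, replacement) pairs
    casing: str = "proper"                          # "upper", "lower", or "proper"

def apply_regexes(value, rules):
    """Apply each (pattern, replacement) pair in order."""
    for pattern, replacement in rules:
        value = re.sub(pattern, replacement, value)
    return value

def apply_casing(value, casing):
    if casing == "upper":
        return value.upper()
    if casing == "lower":
        return value.lower()
    if casing == "proper":
        return value.title()
    return value  # any other setting (including None) leaves the value unchanged

def standardize_token(value, group):
    value = value.strip()                            # 3.1.1 Trimming
    value = value.upper()                            # 3.1.2 Uppercasing
    value = apply_regexes(value, group.pre_regex)    # 3.1.3 Pre-scheme regexes
    # 3.2 Transformation scheme, applied word by word (an assumption of this sketch)
    value = " ".join(group.scheme.get(w, w) for w in value.split())
    value = apply_regexes(value, group.post_regex)   # 3.3 Post-scheme regexes
    return apply_casing(value, group.casing)         # 3.4 Casing

def standardize(tokens, groups, separator=", ", post_rules=(), final_casing=None):
    parts = [standardize_token(tokens.get(name, ""), group)
             for name, group in groups.items()]
    result = separator.join(p for p in parts if p)   # 4 Concatenation (empty tokens skipped)
    result = apply_regexes(result, post_rules)       # 5 Postprocessing regexes
    return apply_casing(result, final_casing)        # 6 Casing (no change when None)

# Hypothetical token groups and pre-tokenized input.
groups = {
    "title": TokenGroup(pre_regex=[(r"\.", "")],
                        scheme={"SR": "SENIOR", "VP": "VICE PRESIDENT"}),
    "department": TokenGroup(scheme={"MKTG": "MARKETING"}),
}
tokens = {"title": " sr. vp ", "department": "mktg"}
print(standardize(tokens, groups))  # prints: Senior Vice President, Marketing
```

Keeping the settings per token group means that each token can carry its own scheme and casing, which is the reason the table allows one or more Standardization Token Groups.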
