You are here: Customizing Quality Knowledge Bases>Customize - Chop Table>Character Level Options

DataFlux Data Management Studio 2.5: User Guide

Customize - Chop Table Editor - Character Level Options

Character Classifications

Classification Description
LETTER A letter or non-separating symbol
NUMBER A numeric digit (0-9)
FULL SEPARATOR A delimiting character, for example, spaces, dashes, and commas
LEAD SEPARATOR A delimiter attached to the beginning of a word, for example, left parenthesis
TRAIL SEPARATOR A delimiter attached at the end of a word, for example, a period

Operations

Operation Description
USE Use the character as-is in the word list and output tokens
TRIM Omit from word list; trim leading/trailing characters in output tokens
SUPPRESS Omit from the word list and output tokens

Implicit Separators

Implicit separators are an optional feature of string chopping. For each parse definition, you can enable or disable implicit separation. If you enable implicit separators, a separator mark is placed wherever the classification of a character differs from the classification of the previous character in a string. To use implicit separation, on the Chop Table Editor screen, choose Options > Implicit Separation.

For example, using implicit separation, the phrase:

APT124

will be separated into two phrases:

APT 124

assuming that the letter "T" is assigned one classification such as "Letter" and the number "1" is assigned a different classification such as "Number." If both characters have the same classification, they will remain together. With implicit separation disabled, this phrase could be broken apart only by assigning separator functionality to the "T" or to the "1," but that change would affect every instance of those characters.


Related Topics

Documentation Feedback: yourturn@sas.com
Note: Always include the Doc ID when providing documentation feedback.

Doc ID: dfU_Cstm_Chop_18002.html