You are here: Customizing Quality Knowledge Bases>Overview of Customize Features>Nodes>Extraction Definition Head Node

DataFlux Data Management Studio 2.5: User Guide

Extraction Definition Head Node

The first node of an Extraction Definition shows the name of the definition. In this node you can choose to extract substrings into multiple tokens to resolve ambiguities in the input string.

Used in:

Properties

Surface

Check the box to allow the definition to be available to external applications.

Allow extraction of words into multiple tokens

Check this box to extract overlapping segments of the input string into multiple tokens when they are attributed to multiple patterns. This functionality helps you resolve ambiguities in the input string. For example, let’s suppose we pass the input string “gold ring” to an extraction definition that searches for text to insert into the following tokens:

The input word “gold” could be associated with the tokens Colors and Materials. By default, the extraction definition finds the best single token for this word. It this case it could be beneficial to apply the word to two tokens.

Selecting this check box can lengthen processing time.

Output

None

Documentation Feedback: yourturn@sas.com
Note: Always include the Doc ID when providing documentation feedback.

Doc ID: dfU_Cstm_12360.html