DataFlux Data Management Studio 2.7: User Guide

Noise Word Vocabulary Node

The Noise Word Vocabulary Node removes words that are considered "noise" from the input. These are usually words that form a legitimate part of the input but are not important for matching purposes. For example, if matching addresses, the word denoting the type of street (road, lane, drive, trail, and so on) is usually considered unimportant.

Used in:

Properties

Vocabulary

Select a vocabulary to use. By default, only files from the locale and its ancestors appear in the drop-down. If you do not see the desired library, click Tools > Data Management Studio Options. From the column on the left, select QKB Definition Editor. Then, in the pane on the right under Library files, click Show files for all locales. QKB files from all locales will be included.

Click Open vocabulary to edit the selected file.

Sensitivity

Select the sensitivity range for which this node will have an effect.

Output

Message

If any word in the Vocabulary matches the input, the message is "Changes applied". Otherwise, the message will be, "No changes applied".

Result

The string after noise words were removed, if any.

"All noise" message

Will be set as true if every word in the input string was found to be a noise word. If this occurs, no filtering occurs and the string is left intact.

Filtered words

A list of the noise words that were removed.

Documentation Feedback: yourturn@sas.com
Note: Always include the Doc ID when providing documentation feedback.

Doc ID: DMCust_12345.html