DataFlux Data Management Studio 2.6: User Guide

Noise Word Vocabulary Node

The Noise Word Vocabulary Node removes words that are considered "noise" from the input. These are usually words that form a legitimate part of the input but are not important for matching purposes. For example, if matching addresses, the word denoting the type of street (road, lane, drive, trail, and so on) is usually considered unimportant.

Used in:

Properties

Vocabulary

Select a vocabulary to use. By default, only files from the locale and its ancestors appear in the drop-down. If you do not see the desired library, click Tools > Options. Click Display and select Show files for all locales under the Library file selection drop-down lists to view QKB files from all locales.

Click Edit to edit the selected file or create a new file, the appropriate editor opens.

Sensitivity

Select the sensitivity range for which this node will have an effect.

Output

Message

If any word in the Vocabulary matches the input, "Changes were applied". Otherwise, the message will be, "No changes were applied".

Result

The string after noise words were removed, if any.

"All noise" message

Will be set as true if every word in the input string was found to be a noise word. If this occurs, no filtering occurs and the string is left intact.

Filtered words

A list of the noise words that were removed.

Documentation Feedback: yourturn@sas.com
Note: Always include the Doc ID when providing documentation feedback.

Doc ID: DMCust_12345.html