DataFlux Data Management Studio 2.7: User Guide
The Noise Word Vocabulary Node removes words that are considered "noise" from the input. These are usually words that form a legitimate part of the input but are not important for matching purposes. For example, if matching addresses, the word denoting the type of street (road, lane, drive, trail, and so on) is usually considered unimportant.
Used in:
Select a vocabulary to use. By default, only files from the current locale and its ancestors appear in the drop-down. If you do not see the desired file, click Tools > Data Management Studio Options. From the column on the left, select QKB Definition Editor. Then, in the pane on the right under Library files, click Show files for all locales. QKB files from all locales are then included in the drop-down.
Click Open vocabulary to edit the selected file.
Select the sensitivity range for which this node will have an effect.
If any word in the vocabulary matches the input, the message is "Changes applied". Otherwise, the message is "No changes applied".
The input string after any noise words have been removed.
Set to true if every word in the input string is a noise word. In that case, no filtering is performed and the string is left intact.
A list of the noise words that were removed.
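The behavior described above can be illustrated with a minimal Python sketch. This is not the DataFlux implementation. The names `remove_noise_words` and `NoiseWordResult` are hypothetical, and the sketch assumes whitespace tokenization and case-insensitive matching, neither of which is stated in this topic. The sensitivity range is not modeled. For the case where every word is a noise word, the sketch follows the literal wording above (a match produces "Changes applied", the string is left intact) and assumes that the list of removed words is empty.

```python
from dataclasses import dataclass, field


@dataclass
class NoiseWordResult:
    """Hypothetical container mirroring the four outputs described above."""
    message: str            # "Changes applied" or "No changes applied"
    filtered: str           # the string after noise words were removed
    all_noise: bool         # True if every input word was a noise word
    removed: list[str] = field(default_factory=list)  # noise words removed


def remove_noise_words(text: str, vocabulary: set[str]) -> NoiseWordResult:
    # Assumption: matching is case-insensitive and words are split on whitespace.
    vocab = {word.lower() for word in vocabulary}
    words = text.split()
    matched = [word for word in words if word.lower() in vocab]

    if not matched:
        # No vocabulary word matches the input.
        return NoiseWordResult("No changes applied", text, False, [])

    if len(matched) == len(words):
        # Every word is noise: no filtering occurs, the string is left intact.
        # Assumption: nothing is reported as removed in this case.
        return NoiseWordResult("Changes applied", text, True, [])

    kept = [word for word in words if word.lower() not in vocab]
    return NoiseWordResult("Changes applied", " ".join(kept), False, matched)


if __name__ == "__main__":
    street_types = {"road", "lane", "drive", "trail", "street"}
    print(remove_noise_words("100 Main Street", street_types))
    print(remove_noise_words("Lane", street_types))
```

In the first call, "Street" is treated as noise and the remaining words are kept. In the second call, the only word is a noise word, so the input is returned unchanged with the all-noise flag set.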
Documentation Feedback: yourturn@sas.com | Doc ID: DMCust_12345.html