DataFlux Data Management Studio 2.5: User Guide
The Noise Word Vocabulary Node removes words that are considered "noise" from the input. These are usually words that form a legitimate part of the input but are not important for matching purposes. For example, if matching addresses, the word denoting the type of street (road, lane, drive, trail, and so on) is usually considered unimportant.
Used in:
Select a vocabulary to use. By default, only files from the locale and its ancestors appear in the drop-down. If you do not see the desired library, click Tools > Options. Click Display and select Show files for all locales under the Library file selection drop-down lists to view QKB files from all locales.
Click Edit to edit the selected file or create a new file. The appropriate editor opens.
Select the sensitivity range for which this node will have an effect.
If any word in the vocabulary matches a word in the input, the message is "Changes were applied". Otherwise, the message is "No changes were applied".
The string after noise words were removed, if any.
Set to true if every word in the input string is a noise word. In that case, no filtering occurs and the string is left intact.
A list of the noise words that were removed.
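The behavior described above can be illustrated with a short Python sketch. This is not DataFlux code and does not use any DataFlux API. The function name `remove_noise_words`, the `NoiseWordResult` type, whitespace tokenization, and case-insensitive matching are all assumptions made for illustration. The sensitivity range is not modeled. The message and the removed-word list returned when every word is a noise word are also assumptions, because the text above states only that the string is left intact in that case.

```python
from typing import List, NamedTuple, Set


class NoiseWordResult(NamedTuple):
    """Hypothetical container mirroring the four outputs described above."""
    message: str        # "Changes were applied" or "No changes were applied"
    output: str         # the string after noise words were removed, if any
    all_noise: bool     # True if every word in the input was a noise word
    removed: List[str]  # the noise words that were removed


def remove_noise_words(text: str, vocabulary: Set[str]) -> NoiseWordResult:
    # Assumption: words are separated by whitespace and matched
    # case-insensitively. The real node may tokenize differently.
    words = text.split()
    vocab = {entry.lower() for entry in vocabulary}
    noise = [word for word in words if word.lower() in vocab]

    if not noise:
        # Nothing in the vocabulary matched the input.
        return NoiseWordResult("No changes were applied", text, False, [])

    if len(noise) == len(words):
        # Every word is a noise word: no filtering occurs and the string
        # is left intact. Assumption: the message and the removed list
        # reflect that nothing was actually removed.
        return NoiseWordResult("No changes were applied", text, True, [])

    kept = [word for word in words if word.lower() not in vocab]
    return NoiseWordResult("Changes were applied", " ".join(kept), False, noise)


street_types = {"road", "lane", "drive", "trail", "street"}
result = remove_noise_words("100 Main Street", street_types)
```

With the hypothetical `street_types` vocabulary, the street-type word is dropped from the address while the rest of the string is kept. An input made up entirely of noise words, such as "Lane Drive", is returned unchanged with the all-noise flag set.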
Documentation Feedback: yourturn@sas.com | Doc ID: dfU_Cstm_12345.html