The
Text
Rule Builder node generates an ordered set of rules from
small subsets of terms that together are useful in describing and
predicting a target variable. Each rule in the set is associated with
a specific target category. Each target category consists of a conjunction
that indicates the presence or absence of one or a small subset of
terms (for example, “term1” AND “term2”
AND (NOT “term3”)). A particular document matches this
rule if and only if it contains at least one occurrence of term1 and
of term2 but no occurrences of term3.
This set of derived
rules creates a model that is both descriptive and predictive. When
categorizing a new document, the model will proceed through the ordered
set and choose the target that is associated with the first rule that
matches that document. The rules are provided in the syntax that can
be used within SAS Content Categorization Studio, and can be deployed
there.
For more information
about the
Text Rule Builder node, see the
SAS Text Miner Help.
The rest of this chapter
presents an example of how you can use the
Text Rule Builder node.