The
Text
Parsing node enables you to parse a document collection
in order to quantify information about the terms that are contained
therein. You can use the
Text Parsing node
with volumes of textual data such as e-mail messages, news articles,
Web pages, research papers, and surveys. For more information about
the
Text Parsing node, see the SAS Text Miner
Help.
Perform the following
steps to add a
Text Parsing node to the analysis:
-
Select the
Text
Mining tab on the node toolbar, and drag a
Text
Parsing node into the diagram workspace.
-
Connect the
Data
Partition node to the
Text Parsing node.
-
Select the
Text
Parsing node.
The properties for the
Text
Parsing node appear in the Properties Panel.
-
Set the
Different
Parts of Speech property value to
No
.
For the VAERS data,
this setting offers a more compact set of terms.
-
Click the
for the
Synonyms property.
-
The
Select
a SAS Table dialog box appears.
-
Select
No
data set to be specified.
-
Click
OK to
exit the
Select a SAS Table dialog box.
-
Click
OK to
exit the
Synonyms dialog box.
-
Click the
for the
Ignore Parts of Speech property.
The
Ignore
Parts of Speech dialog box appears.
-
Select the following
items, which represent parts of speech:
Note: Hold down the CTRL key to
select more than one.
Any terms with the parts
of speech that you select in the
Ignore Parts of Speech dialog
box are ignored during parsing. The selections indicated here ensure
that the analysis ignores low-content words such as prepositions and
determiners.
-