Entity identification
uses SAS linguistic technologies to classify sequences of words into
predefined classes. These classes are assigned as roles for the corresponding
sequences. For example, "Person," "Location,"
"Company," and "Measurement" are identified as
classes for "George W. Bush," "Boston," "SAS
Institute," "2.5 inches," respectively. The following
table lists the possible entities for English.
|
|
|
Postal address or number
and street name
|
|
|
|
Currency or currency
expression
|
|
|
|
City, county, state,
political or geographical place or region
|
|
Measurement or measurement
expression
|
|
Phrases that contain
multiple words
|
|
Government, legal, or
service agency
|
|
Percentage or percentage
expression
|
|
|
|
|
|
Proper noun with an
ambiguous classification
|
|
|
|
|
|
Measure of time expressions
|
|
Person’s title
or position
|
|
Motor vehicle, including
color, year, make, and model
|