Field Content

Identification Analysis Definition

The Field Content identification analysis definition supports identification of global and locale specific data. The table below describes those elements that are specific to this locale. For a description of elements that are common across all locales, see the Field Content Identification Analysis Definition section in the Identification Analysis Definitions documentation.

Field Content
Description

The Field Content identification analysis definition identifies the type of data that is represented by a string.

Possible Outputs

CITY
CITY-STATE/PROVINCE-POSTAL CODE
COUNTRY
CURRENCY
DATE
DATE/TIME
DELIVERY ADDRESS
E-MAIL
EMPTY
FAMILY NAME
FULL ADDRESS
GEOGRAPHICAL POINT
GIVEN NAME
IBAN
INDIVIDUAL
MONTH
NETWORK ADDRESS
ORGANIZATION
PAYMENT CARD NUMBER
PHONE
POSTAL CODE
SOCIAL SECURITY NUMBER
STATE/PROVINCE
UNKNOWN
URL

Default Identity UNKNOWN
Examples Input Output
January MONTH
Mr. Smith INDIVIDUAL
Mr John Smith INDIVIDUAL
John S McCain III INDIVIDUAL
N Z Smith INDIVIDUAL
Mr. and Mrs. Nathan Guttermoth INDIVIDUAL
SAS Institute ORGANIZATION
Johnson FAMILY NAME
Patricia GIVEN NAME
IEEE ORGANIZATION
Sunrise Club, Inc. ORGANIZATION
Computerland Of Wichita ORGANIZATION
100 SAS Campus Dr, Cary, NC 27513 FULL ADDRESS
123 Main Street, Apt 4B DELIVERY ADDRESS
Cary, NC 27513 CITY-STATE/PROVINCE-POSTAL CODE
Los Angeles CITY
Illinois STATE/PROVINCE
27513 POSTAL CODE
27513-0476 POSTAL CODE
919-531-0000 PHONE
OFFICE -919-883-4242 PHONE
123-45-6789 SOCIAL SECURITY NUMBER
Remarks

Some data has the potential for ambiguity. When conflicts occur, this definition uses the following priorities in this order to produce more accurate results when aggregating over a column of data:

  1. CITY (population > 100,000)
  2. CURRENCY
  3. MONTH
  4. ORGANIZATION
  5. STATE/PROVINCE
  6. COUNTRY
  7. FAMILY NAME
  8. GIVEN NAME
  9. CITY (population < 100,000)