Field Content

Identification Analysis Definition

The Field Content identification analysis definition supports identification of global and locale-specific data. The table below describes the elements that are specific to this locale. For a description of elements that are common across all locales, see the Field Content Identification Analysis Definition section in the Identification Analysis Definitions documentation.

Field Content
Description

The Field Content identification analysis definition identifies the type of data that is represented by a string.

Possible Outputs

CITY
CITY-STATE/PROVINCE-POSTAL CODE
COUNTRY
COUNTY
CURRENCY
DATE
DATE/TIME
DELIVERY ADDRESS
DRIVERS LICENSE
E-MAIL
EMPTY
FAMILY NAME
FULL ADDRESS
GEOGRAPHICAL POINT
GIVEN NAME
IBAN
INDIVIDUAL
MONTH
NATIONAL INSURANCE NUMBER
NETWORK ADDRESS
ORGANIZATION
PAYMENT CARD NUMBER
PHONE
POSTAL CODE
UNKNOWN
URL
VEHICLE REGISTRATION

Default Identity

UNKNOWN

Examples

Input                                                                Output
January                                                              MONTH
Mr. Smith                                                            INDIVIDUAL
Mr John Smith                                                        INDIVIDUAL
John S McCain III                                                    INDIVIDUAL
N Z Smith                                                            INDIVIDUAL
Mr. and Mrs. Nathan Guttermoth                                       INDIVIDUAL
Johnson                                                              FAMILY NAME
Patricia                                                             GIVEN NAME
IEEE                                                                 ORGANIZATION
SAS Institute                                                        ORGANIZATION
Sunrise Club, Inc.                                                   ORGANIZATION
Computerland Of Wichita                                              ORGANIZATION
1st Floor Rennie House Stamford Street London SE1 9LL                FULL ADDRESS
5 Chilton Cottages, Queen's Road Princes Risborough Buckinghamshire  FULL ADDRESS
420 Park Ridge Rd                                                    DELIVERY ADDRESS
1st Floor Rennie House Stamford Street                               DELIVERY ADDRESS
London SE1 9LL                                                       CITY-STATE/PROVINCE-POSTAL CODE
Princes Risborough Buckinghamshire                                   CITY-STATE/PROVINCE-POSTAL CODE
Manchester                                                           CITY
Cumbria                                                              COUNTY
SL7 2EB                                                              POSTAL CODE
(0)20 87817200                                                       PHONE
+44 (0) 1753 272 020 Ext 12                                          PHONE
AA999999A                                                            NATIONAL INSURANCE NUMBER
AA99AAA                                                              VEHICLE REGISTRATION
AAAAA909099AA9AA00                                                   DRIVERS LICENSE

Remarks

The English, United Kingdom implementation of the Field Content identification analysis definition does not identify data as STATE/PROVINCE. Instead, it identifies counties of the United Kingdom as COUNTY.

Some data is ambiguous and can match more than one identity. When such conflicts occur, this definition applies the following priorities, in order, to produce more accurate results when aggregating over a column of data:

  1. CITY (official cities)
  2. COUNTY
  3. MONTH
  4. COUNTRY
  5. FAMILY NAME
  6. GIVEN NAME
  7. CITY (towns)
  8. DELIVERY ADDRESS
  9. INDIVIDUAL
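
The priority order above can be illustrated with a minimal Python sketch. It is not the definition's actual implementation: the names PRIORITY, resolve, and aggregate are hypothetical, and the sketch assumes that some earlier step has already produced a set of candidate identities for each value. How the definition derives those candidates is not described here. Candidates that do not appear in the priority list are assumed to rank last.

```python
from collections import Counter

# Priority order copied from the list above; a lower index wins.
# The two CITY tiers are kept as separate (hypothetical) keys because
# they sit at different ranks, but both report as the output CITY.
PRIORITY = [
    "CITY (official cities)",
    "COUNTY",
    "MONTH",
    "COUNTRY",
    "FAMILY NAME",
    "GIVEN NAME",
    "CITY (towns)",
    "DELIVERY ADDRESS",
    "INDIVIDUAL",
]
RANK = {name: index for index, name in enumerate(PRIORITY)}


def resolve(candidates):
    """Pick the highest-priority identity from a set of candidates.

    Returns the default identity UNKNOWN when there are no candidates.
    Candidates missing from PRIORITY rank last (an assumption).
    """
    if not candidates:
        return "UNKNOWN"
    best = min(candidates, key=lambda name: RANK.get(name, len(PRIORITY)))
    return "CITY" if best.startswith("CITY (") else best


def aggregate(column):
    """Resolve every value's candidates, then count identities per column."""
    return Counter(resolve(candidates) for candidates in column)


# Hypothetical candidate sets for three values in one column.
column = [
    {"FAMILY NAME", "GIVEN NAME"},
    {"CITY (towns)", "FAMILY NAME"},
    {"GIVEN NAME"},
]
counts = aggregate(column)
```

In this sketch, a value that could be either a family name or a town resolves to FAMILY NAME, because FAMILY NAME (rank 5) precedes CITY (towns) (rank 7). Applying the same order to every value keeps the per-column counts consistent.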