Field Content

Identification Analysis Definition

Field Content
Description

The Field Content identification analysis definition identifies the type of data that is represented by a string.

Possible Outputs

CITY
CITY-STATE/PROVINCE-POSTAL CODE
COUNTRY
CURRENCY
DATE
DATE/TIME
DELIVERY ADDRESS
E-MAIL
EMPTY
FAMILY NAME
FULL ADDRESS
GEOGRAPHICAL POINT
GIVEN NAME
IBAN
IDENTITY CARD NUMBER
INDIVIDUAL
MONTH
NATIONAL ID
NETWORK ADDRESS
ORGANIZATION
PAYMENT CARD NUMBER
PHONE
POSTAL CODE
UNKNOWN
URL
VEHICLE REGISTRATION

Default Identity UNKNOWN
Examples Input Output
januari MONTH
Mevrouw Ann-Kristin VAN MECHELEN INDIVIDUAL
Professeur Charles E. Janssens INDIVIDUAL
M. Charles Janssens INDIVIDUAL
A Z Castieau INDIVIDUAL
Dr. & Mevr. Van Godtsenhoven INDIVIDUAL
Mr et Mme DESCAMPS INDIVIDUAL
SAS Institute NV/SA ORGANIZATION
NMBS ORGANIZATION
Lotus Bakeries NV ORGANIZATION
SPRL HENRY MG ORGANIZATION
Mertens FAMILY NAME
Dupont FAMILY NAME
Maria GIVEN NAME
Marie GIVEN NAME
Hertenbergstraat 6/1, 3048 Tervuren FULL ADDRESS
A. Puesstraat 59 Bus 11, CHAPELLE-LEZ-HERLAIMONT 7160 FULL ADDRESS
Avenue Louise 527, 1050 Bruxelles, Belgium FULL ADDRESS
6 Hertenbergstraat DELIVERY ADDRESS
Rue Neuve 111-123 DELIVERY ADDRESS
Brusselsestraat 63 DELIVERY ADDRESS
1640 RHODE-SAINT-GENESE CITY-STATE/PROVINCE-POSTAL CODE
BARVAUX-S-OURTHE 6940 CITY-STATE/PROVINCE-POSTAL CODE
KORTRIJK CITY
COURTRAI CITY
KAPELLE-O/D-BOS CITY
3010 POSTAL CODE
B-8560 POSTAL CODE
0032266242623 PHONE
014/78.12.34 PHONE
GSM: 0032 477 947 193 PHONE
591-8109372-80 IDENTITY CARD NUMBER
720322 077 20 NATIONAL ID
1-CDK-936 VEHICLE REGISTRATION
Remarks

Some data has the potential for ambiguity. When conflicts occur, this definition uses the following priorities in this order to produce more accurate results when aggregating over a column of data:

  1. CITY (populations greater than 30,000)
  2. MONTH
  3. FAMILY NAME
  4. GIVEN NAME
  5. CITY (population less than 30,000)
  6. PHONE
  7. IDENTITY CARD NUMBER