SAS Quality Knowledge Base for Contact Information 26
Definitions for the Finnish, Finland locale are described below.
Case Definitions
Extraction Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Lower (Title) | ||
---|---|---|
Description | The Lower (Title) case definition cases titles. | |
Examples | Input | Output |
DATANOMIHARJOITTELIJA | Datanomiharjoittelija | |
DIPLOMI-INSINÖÖRI, MAA- JA VESIRAKENNUS | Diplomi-insinööri, maa- ja vesirakennus | |
CEO | CEO | |
PHD | PhD | |
Remarks | Titles are generally lowercased, but first character is uppercased. |
Proper (Address) | ||
---|---|---|
Description | The Proper (Address) case definition propercases addresses. | |
Examples | Input | Output |
Petri Pasanen Kuja 12 b | Petri Pasanen kuja 12 B | |
PÄIVIÖNKATU 36 A 4 | Päiviönkatu 36 A 4 | |
AKU RÄDYNTIE 5 | Aku Rädyntie 5 | |
Remarks |
Proper (City) | ||
---|---|---|
Description | The Proper (City) case definition propercases city names. | |
Examples | Input | Output |
ALA-SEPPÄ | Ala-Seppä | |
NOKIA | Nokia | |
ÖSTERBY | Österby | |
Remarks |
Proper (Legal Form) | ||
---|---|---|
Description | The Proper (Legal Form) case definition propercases legal forms for organizations. | |
Examples | Input | Output |
OY | Oy | |
RF | rf | |
GMBH | GmbH | |
Remarks |
Proper (Name) | ||
---|---|---|
Description | The Proper (Name) case definition propercases names of individuals. | |
Examples | Input | Output |
Kaarlo Ylppö | Kaarlo Ylppö | |
MINNA VON KNORRING | Minna von Knorring | |
SALLA SAARNI-YLÄ-KÖNNI | Salla Saarni-Ylä-Könni | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Proper (Organization) case definition propercases organization names. | |
Examples | Input | Output |
Esab Dalsbruk | ESAB Dalsbruk | |
MSC Electronics | MSc elektronics | |
Sa-Tu Logistics | SA-TU Logistics | |
Remarks |
None.
Name | ||
---|---|---|
Description | The Name gender analysis definition determines the gender of a name. | |
Possible Outputs | M F U |
|
Examples | Input | Output |
Sari Saaritsa | F | |
Mika Matinvesi | M | |
P. J. Hannikainen | U | |
Remarks |
Individual/Organization | ||
---|---|---|
Description | The Individual/Organization identification analysis definition determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | ORGANIZATION INDIVIDUAL UNKNOWN |
|
Examples | Input | Output |
Heli Nyman | INDIVIDUAL | |
T:mi Tarja Harakka | ORGANIZATION | |
Maken Kiska | ORGANIZATION | |
Remarks |
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 28 characters | |
Examples | Input | Cluster ID |
Kala-Matti 12 A 9 | 0 | |
Fiskar-Matte 12 A 9 | 0 | |
Remarks |
|
Address (Full) | ||
---|---|---|
Description | The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses. | |
Max Length of Match Code | 40 characters | |
Examples | Input | Cluster ID |
Nihtisillankatu 3 A, 02511 ESPOO | 0 | |
Nihtisillankantie 3 A, 02512 ESPOO | 0 | |
näckensgränd 12-14 a 8, 10900 hangö | 1 | |
Ahdinkuja 12/1 A 8, 10900 HANKO | 1 | |
Remarks |
|
Address (PO Box Only) | ||
---|---|---|
Description | The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
TEKNIIKANTIE 14, PL 123 | 0 | |
Kala-Matti 12 A 9, PL 123 | 0 | |
PL 123 | 0 | |
PL 234 | 0 | |
Remarks |
|
Address (Street Only) | ||
---|---|---|
Description | The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address. | |
Max Length of Match Code | 21 characters | |
Examples | Input | Cluster ID |
TEKNIIKANTIE 14, PL 123 | 0 | |
TEKNIIKANTIE 14 B, PL 234 | 0 | |
PL 123 | 1 | |
PL 234 | 1 | |
Remarks |
|
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
MÄNTYHARJU | 0 | |
MÄNTYHARJU KK | 0 | |
MUURAME | 1 | |
MUURAME 7 | 1 | |
Remarks |
|
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
00250 HELSINKI | 0 | |
00251 HELSINGFORS | 0 | |
65100 VAASA | 1 | |
65101 VASA | 1 | |
Remarks |
|
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 20 characters | |
Examples | Input | Cluster ID |
Diplomi-insinööri Mia Karsten | 0 | |
MIIA-NOORA KARLSTEN | 0 | |
Frederic Virtanen | 1 | |
Fred E. Virtanen | 1 | |
Remarks |
|
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 36 characters | |
Examples | Input | Cluster ID |
MAATALOUDEN LASKENTAKESKUS OY | 0 | |
SUOMEN MAATALOUDEN LASKENTAKESKUS KY | 0 | |
Fennia | 1 | |
KESKINÄINEN VAKUUTUSYHTIÖ FENNIA | 1 | |
Remarks |
|
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
050 3587243 | 0 | |
+358 50 3587244 | 0 | |
Soitella: (050) 358 72 45 | 0 | |
Remarks |
|
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
02150 | 0 | |
00251 | 1 | |
FI-00252 | 1 | |
Remarks |
|
Address | |||
---|---|---|---|
Description | The Address parse definition parses addresses into a set of tokens. | ||
Output Tokens | Building Name Street Building Number Extension Additional Info |
||
Example 1 | Input | Output Token | Output |
Tekniikantie 14, PL 85 | Building Name | ||
Street | Tekniikantie | ||
Building Number | 14 | ||
Extension | PL 85 | ||
Additional Info | |||
Example 2 | Input | Output Token | Output |
Topeliuskatu 41 aA 15 | Building Name | ||
Street | Topeliuskatu | ||
Building Number | 41 | ||
Extension | aA 15 | ||
Additional Info | |||
Remarks | Parsing is following Public Administration Recommendation JHS106 of how to write postal addresses. Address extension and Post Box address are parsed into the same token. |
Address (Full) | |||
---|---|---|---|
Description | The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens. | ||
Output Tokens | Building Name Street Building Number Extension Postal Code City Additional Info |
||
Example 1 | Input | Output Token | Output |
Tekniikantie 14 PL 85, 02151 ESPOO | Building Name | ||
Street | Tekniikantie | ||
Building Number | 14 | ||
Extension | PL 85 | ||
Postal Code | 02151 | ||
City | Espoo | ||
Additional Info | |||
Example 2 | Input | Output Token | Output |
Övre s:t Mariegatan 8 B 3 02400 Åbo | Building Name | ||
Street | Övre s:t Mariegatan | ||
Building Number | 8 | ||
Extension | B 3 | ||
Postal Code | 20400 | ||
City | Åbo | ||
Additional Info | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Example | Input | Output Token | Output |
Vesakkotie 5 A 55 | Recipient | ||
Building/Site | |||
Street | Vesakkotie 5 | ||
Extension | A 55 | ||
PO Box | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens. | ||
Output Tokens | Postal Code City |
||
Example | Input | Output Token | Output |
02151 ESPOO | Postal Code | 02151 | |
City | ESPOO | ||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Example | Input | Output Token | Output |
02151 ESPOO | City | ESPOO | |
State/Province | |||
Postal Code | 02151 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name | |||
---|---|---|---|
Description | The Name parse definition parses names of individuals into a set of tokens. | ||
Output Tokens | Title/Additional Info Prefix Given Name Middle Name Family Name Suffix |
||
Example 1 | Input | Output Token | Output |
MIKAEL T. KLOCKARS | Title/Additional Info | ||
Prefix | |||
Given Name | MIKAEL | ||
Middle Name | T | ||
Family Name | KLOCKARS | ||
Suffix | |||
Example 2 | Input | Output Token | Output |
ROUVA MARJA-LEENA VÄLIAHO | Title/Additional Info | ||
Prefix | ROUVA | ||
Given Name | MARJA-LEENA | ||
Middle Name | |||
Family Name | VÄLIAHO | ||
Suffix | |||
Remarks | The order of name and title is following the JHS106 definition. The commonly recognized order of prefix and title is not followed. |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Example 1 | Input | Output Token | Output |
MIKAEL T. KLOCKARS | Prefix | ||
Given Name | MIKAEL | ||
Middle Name | T. | ||
Family Name | KLOCKARS | ||
Suffix | |||
Title/Additional Info | |||
Example 2 | Input | Output Token | Output |
ROUVA MARJA-LEENA VÄLIAHO | Prefix | ROUVA | |
Given Name | MARJA-LEENA | ||
Middle Name | |||
Family Name | VÄLIAHO | ||
Suffix | |||
Title/Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (Multiple Name) | |||
---|---|---|---|
Description | The Name (Multiple Name) parse definition parses strings that contain the names of two individuals into a set of tokens. | ||
Output Tokens | Name 1 Name 2 |
||
Example 1 | Input | Output Token | Output |
PEKKA MÄKELÄ, LIISA MUSTONEN | Name 1 | PEKKA MÄKELÄ | |
Name 2 | LIISA MUSTONEN | ||
Example 2 | Input | Output Token | Output |
AULIS LAINE JA DOORIS MÄKI | Name 1 | AULIS LAINE | |
Name 2 | DOORIS MÄKI | ||
Remarks |
Organization | |||
---|---|---|---|
Description | The Organization parse definition parses organization names into a set of tokens. | ||
Output Tokens | Legal Form Prefix
Organization Legal Form Suffix Additional Info |
||
Example | Input | Output Token | Output |
Oy Vaasan Kone Ab | Legal Form Prefix | Oy | |
Organization | Vaasan Kone | ||
Legal Form Suffix | Ab | ||
Additional Info | |||
Remarks |
Phone | |||
---|---|---|---|
Description | The Phone parse definition parses phone numbers into a set of tokens. | ||
Output Tokens | Country Code Area Code Phone Number Additional Info |
||
Example 1 | Input | Output Token | Output |
Työ095255721 | Country Code | ||
Area Code | 09 | ||
Phone Number | 52557214 | ||
Additional Info | Työ | ||
Example 2 | Input | Output Token | Output |
GSM +358 50 5258725 | Country Code | 358 | |
Area Code | 50 | ||
Phone Number | 5258725 | ||
Additional Info | GSM | ||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Example 1 | Input | Output Token | Output |
09 5255 7214 Työ | Country Code | ||
Area Code | 09 | ||
Base Number | 52557214 | ||
Extension | |||
Line Type | |||
Additional Info | Työ | ||
Example 2 | Input | Output Token | Output |
+358 50 5258725 | Country Code | +358 | |
Area Code | 50 | ||
Base Number | 5258725 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
None.
Address | ||
---|---|---|
Description | The Address standardization definition standardizes addresses. | |
Examples | Input | Output |
HAAGAN URHEILUTIE 3 AB 11 | Haagan urheilutie 3 aB 11 | |
LEPOLANTIE 105 AS 3 | Lepolantie 105 as 3 | |
Remarks |
Address (Full) | ||
---|---|---|
Description | The Address (Full) standardization definition standardizes complete two line addresses. | |
Example | Input | Output |
LESKIROUVA FREYTAGIN KUJA 3 B 33, 00790 HELSINKI | Leskirouva Freytagin kuja 3 B 33 00790 HELSINKI | |
Remarks |
City | ||
---|---|---|
Description | The City standardization definition standardizes city names. | |
Examples | Input | Output |
YLI-II | Yli-Ii | |
HELSINKI | Helsinki | |
Remarks |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code standardization definition standardizes last line address information. | |
Examples | Input | Output |
02151 Espoo | 02151 ESPOO | |
00250 Helsinki | 00250 HELSINKI | |
Remarks | Follows the instructions given in JHS106 and uppercases city names. |
City (Upper) | ||
---|---|---|
Description | The City (Upper) standardization definition standardizes and uppercases city names. | |
Examples | Input | Output |
Yli-Ii | YLI-II | |
Helsinki | HELSINKI | |
Remarks |
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Examples | Input | Output |
MIINA VON LODE | Miina von Lode | |
Lahtinen, Pyry | Pyry Lahtinen | |
Mrs Liisa Lahtinen | Rouva Liisa Lahtinen | |
Remarks | Name prefixes are translated to Finnish. Finnish name prefixes are not abbreviated. |
Name (No Translation) | ||
---|---|---|
Description |
The Name (No Translation) standardization definition standardizes names of individuals. |
|
Examples | Input | Output |
MIINA VON LODE | Miina von Lode | |
Lahtinen, Pyry | Pyry Lahtinen | |
Rva Liisa Lahtinen | Rouva Liisa Lahtinen | |
Mrs. Liisa Lahtinen | Mrs Liisa Lahtinen | |
Frk. Liisa Lahtinen | Frk Liisa Lahtinen | |
Remarks | Name prefixes are not translated to Finnish. Finnish name prefixes are not abbreviated. |
Name (Swedish) | ||
---|---|---|
Description |
The Name (Swedish) standardization definition standardizes names of individuals. |
|
Examples | Input | Output |
Rva Liisa Lahtinen | Fr Liisa Lahtinen | |
Mrs. Liisa Lahtinen | Fr Liisa Lahtinen | |
Frk. Liisa Lahtinen | Frk Liisa Lahtinen | |
Remarks | Name prefixes are translated to Swedish. Name prefixes are abbreviated. |
Organization | ||
---|---|---|
Description | The Organization standardization definition standardizes organization names. | |
Examples | Input | Output |
OY ADZONE AB | Oy AdZone Ab | |
MATIN RENGAS KY | Matin Rengas Ky | |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Examples | Input | Output |
+358 9525571 | 09 525571 | |
050 525571 | 050 525571 | |
Remarks |
Phone (with Country Code) | ||
---|---|---|
Description | The Phone (with Country Code) standardization definition standardizes phone numbers for international use. | |
Examples | Input | Output |
09525571 | +385 9 525571 | |
003589525571 | +358 9 525571 | |
Remarks |
Postal Code | ||
---|---|---|
Description | The Postal Code standardization definition standardizes postal codes. | |
Examples | Input | Output |
'02150' | 02150 | |
Remarks |
Postal Code (International) | ||
---|---|---|
Description |
The Postal Code (International) standardization definition standardizes postal codes and inserts country prefixes. |
|
Examples | Input | Output |
02150 | FI-02150 | |
22150 | AX-22150 | |
Remarks | Country code AX is inserted for postal codes starting with 22 followed by three digits. |
In addition to the definitions listed on this page, the Finnish, Finland locale also inherits all definitions for the Finnish language and all Global definitions.
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_FIFIN_defs.html |