SAS Quality Knowledge Base for Contact Information 25
Definitions for the Finnish, Finland locale are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Lower (Title) | ||
---|---|---|
Description | The Case definition for Lower (Title) cases titles. | |
Examples | Input | Output |
DATANOMIHARJOITTELIJA | Datanomiharjoittelija | |
DIPLOMI-INSINÖÖRI, MAA- JA VESIRAKENNUS | Diplomi-insinööri, maa- ja vesirakennus | |
CEO | CEO | |
PHD | PhD | |
Remarks | Titles are generally lowercased, but the first character is uppercased. |
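
The casing rule in the remark above can be sketched in a few lines of Python. This is only an illustration of the rule, not the QKB implementation: the function name `lower_title` and the exception table `FIXED_CASE_TITLES` are hypothetical, and the table holds only the two fixed-case titles that appear in the examples.

```python
# Hypothetical exception table (an assumption for this sketch): titles whose
# casing is fixed rather than derived. Only the examples above are included.
FIXED_CASE_TITLES = {"CEO": "CEO", "PHD": "PhD"}


def lower_title(title: str) -> str:
    """Lowercase a title and uppercase its first character,
    unless the title is a known fixed-case exception."""
    stripped = title.strip()
    fixed = FIXED_CASE_TITLES.get(stripped.upper())
    if fixed is not None:
        return fixed
    lowered = stripped.lower()
    # Slicing keeps this safe for an empty string.
    return lowered[:1].upper() + lowered[1:]


print(lower_title("DATANOMIHARJOITTELIJA"))
print(lower_title("DIPLOMI-INSINÖÖRI, MAA- JA VESIRAKENNUS"))
print(lower_title("PHD"))
```

A lookup table is used for the exceptions because abbreviations such as CEO and PhD cannot be derived from a general casing rule.
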
Proper (Address) | ||
---|---|---|
Description | The Case definition for Proper (Address) propercases addresses. | |
Examples | Input | Output |
Petri Pasanen Kuja 12 b | Petri Pasanen kuja 12 B | |
PÄIVIÖNKATU 36 A 4 | Päiviönkatu 36 A 4 | |
AKU RÄDYNTIE 5 | Aku Rädyntie 5 | |
Remarks |
Proper (City) | ||
---|---|---|
Description | The Case definition for Proper (City) propercases city names. | |
Examples | Input | Output |
ALA-SEPPÄ | Ala-Seppä | |
NOKIA | Nokia | |
ÖSTERBY | Österby | |
Remarks |
Proper (Legal Form) | ||
---|---|---|
Description | The Case definition for Proper (Legal Form) propercases organizations' legal forms. | |
Examples | Input | Output |
OY | Oy | |
RF | rf | |
GMBH | GmbH | |
Remarks |
Proper (Name) | ||
---|---|---|
Description | The Case definition for Proper (Name) propercases names. | |
Examples | Input | Output |
Kaarlo Ylppö | Kaarlo Ylppö | |
MINNA VON KNORRING | Minna von Knorring | |
SALLA SAARNI-YLÄ-KÖNNI | Salla Saarni-Ylä-Könni | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Case definition for Proper (Organization) propercases organizations. | |
Examples | Input | Output |
Esab Dalsbruk | ESAB Dalsbruk | |
MSC Electronics | MSc Electronics | |
Sa-Tu Logistics | SA-TU Logistics | |
Remarks |
Name | ||
---|---|---|
Description | The Gender Analysis definition for Name determines an individual's gender based on a name. | |
Possible Outputs | M F U |
|
Examples | Input | Output |
Sari Saaritsa | F | |
Mika Matinvesi | M | |
P. J. Hannikainen | U | |
Remarks |
Individual/Organization | ||
---|---|---|
Description | The Identification Analysis definition for Individual/Organization determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | ORGANIZATION INDIVIDUAL UNKNOWN |
|
Examples | Input | Output |
Heli Nyman | INDIVIDUAL | |
T:mi Tarja Harakka | ORGANIZATION | |
Maken Kiska | ORGANIZATION | |
Remarks |
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 28 characters | |
Examples | Input | Cluster ID |
Kala-Matti 12 A 9 | 0 | |
Fiskar-Matte 12 A 9 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
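
To show how match codes can be used to cluster records, here is a minimal Python sketch that gives the same cluster ID to every record with the same match code. It does not compute real QKB match codes: the `PLACEHOLDER_CODES` lookup and its code strings are made up for illustration, and `cluster_by_match_code` is a hypothetical helper name.

```python
def cluster_by_match_code(records, match_code):
    """Assign the same cluster ID to records whose match codes are equal.

    `match_code` is any callable that maps a record to its match code.
    Cluster IDs are numbered from 0 in order of first appearance.
    """
    cluster_ids = {}
    result = []
    for record in records:
        code = match_code(record)
        if code not in cluster_ids:
            cluster_ids[code] = len(cluster_ids)
        result.append((record, cluster_ids[code]))
    return result


# Stand-in for a match definition (an assumption): made-up codes, not QKB output.
PLACEHOLDER_CODES = {
    "Kala-Matti 12 A 9": "CODE-A",
    "Fiskar-Matte 12 A 9": "CODE-A",
    "Tekniikantie 14": "CODE-B",
}

clusters = cluster_by_match_code(PLACEHOLDER_CODES, PLACEHOLDER_CODES.get)
for record, cluster_id in clusters:
    print(cluster_id, record)
```

In a real job the match code would come from the match definition at the chosen sensitivity. Only the grouping step is shown here.
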
Address (Full) | ||
---|---|---|
Description | The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses. | |
Max Length of Match Code | 40 characters | |
Examples | Input | Cluster ID |
Nihtisillankatu 3 A, 02511 ESPOO | 0 | |
Nihtisillankantie 3 A, 02512 ESPOO | 0 | |
näckensgränd 12-14 a 8, 10900 hangö | 1 | |
Ahdinkuja 12/1 A 8, 10900 HANKO | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address (PO Box Only) | ||
---|---|---|
Description | The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
TEKNIIKANTIE 14, PL 123 | 0 | |
Kala-Matti 12 A 9, PL 123 | 0 | |
PL 123 | 0 | |
PL 234 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address (Street Only) | ||
---|---|---|
Description | The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address. | |
Max Length of Match Code | 21 characters | |
Examples | Input | Cluster ID |
TEKNIIKANTIE 14, PL 123 | 0 | |
TEKNIIKANTIE 14 B, PL 234 | 0 | |
PL 123 | 1 | |
PL 234 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
MÄNTYHARJU | 0 | |
MÄNTYHARJU KK | 0 | |
MUURAME | 1 | |
MUURAME 7 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
00250 HELSINKI | 0 | |
00251 HELSINGFORS | 0 | |
65100 VAASA | 1 | |
65101 VASA | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 20 characters | |
Examples | Input | Cluster ID |
Diplomi-insinööri Mia Karsten | 0 | |
MIIA-NOORA KARLSTEN | 0 | |
Frederic Virtanen | 1 | |
Fred E. Virtanen | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 36 characters | |
Examples | Input | Cluster ID |
MAATALOUDEN LASKENTAKESKUS OY | 0 | |
SUOMEN MAATALOUDEN LASKENTAKESKUS KY | 0 | |
Fennia | 1 | |
KESKINÄINEN VAKUUTUSYHTIÖ FENNIA | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
050 3587243 | 0 | |
+358 50 3587244 | 0 | |
Soitella: (050) 358 72 45 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
02150 | 0 | |
00251 | 1 | |
FI-00252 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address | |||
---|---|---|---|
Description | The Parse definition for Address parses addresses. | ||
Output Tokens | Building Name Street Building Number Extension Additional Info |
||
Example 1 | Input | Output | |
Tekniikantie 14, PL 85 | Building Name | ||
Street | Tekniikantie | ||
Building Number | 14 | ||
Extension | PL 85 | ||
Additional Info | |||
Example 2 | Input | Output | |
Topeliuskatu 41 aA 15 | Building Name | ||
Street | Topeliuskatu | ||
Building Number | 41 | ||
Extension | aA 15 | ||
Additional Info | |||
Remarks | Parsing follows Public Administration Recommendation JHS106 on how to write postal addresses. The address extension and the PO Box address are parsed into the same token. |
Address (Full) | |||
---|---|---|---|
Description | The Parse definition for Address (Full) parses a full two-line address. | ||
Output Tokens | Building Name Street Building Number Extension Postal Code City Additional Info |
||
Example 1 | Input | Output | |
Tekniikantie 14 PL 85, 02151 ESPOO | Building Name | ||
Street | Tekniikantie | ||
Building Number | 14 | ||
Extension | PL 85 | ||
Postal Code | 02151 | ||
City | Espoo | ||
Additional Info | |||
Example 2 | Input | Output | |
Övre s:t Mariegatan 8 B 3 20400 Åbo | Building Name | ||
Street | Övre s:t Mariegatan | ||
Building Number | 8 | ||
Extension | B 3 | ||
Postal Code | 20400 | ||
City | Åbo | ||
Additional Info | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Example | Input | Output | |
Vesakkotie 5 A 55 | Recipient | ||
Building/Site | |||
Street | Vesakkotie 5 | ||
Extension | A 55 | ||
PO Box | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), change them back to Address (Global). |
Address (Global) (v23) | |||
---|---|---|---|
Description |
The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Example | Input | Output | |
Vesakkotie 5 A 55 | Recipient | ||
Building/Site | |||
Street | Vesakkotie 5 | ||
Extension | A 55 | ||
PO Box | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), change them back to Address (Global). |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The Parse definition for City - State/Province - Postal Code parses address "last line" data. | ||
Output Tokens | Postal Code City |
||
Example | Input | Output | |
02151 ESPOO | Postal Code | 02151 | |
City | ESPOO | ||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The Parse definition for City - State/Province - Postal Code (Global) parses address "last line" data into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Example | Input | Output | |
02151 ESPOO | City | ESPOO | |
State/Province | |||
Postal Code | 02151 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name | |||
---|---|---|---|
Description | The Parse definition for Name parses names of individuals. | ||
Output Tokens | Title/Additional Info Prefix Given Name Middle Name Family Name Suffix |
||
Example 1 | Input | Output | |
MIKAEL T. KLOCKARS | Title/Additional Info | ||
Prefix | |||
Given Name | MIKAEL | ||
Middle Name | T. | ||
Family Name | KLOCKARS | ||
Suffix | |||
Example 2 | Input | Output | |
ROUVA MARJA-LEENA VÄLIAHO | Title/Additional Info | ||
Prefix | ROUVA | ||
Given Name | MARJA-LEENA | ||
Middle Name | |||
Family Name | VÄLIAHO | ||
Suffix | |||
Remarks | The order of name and title follows the JHS106 definition; the commonly recognized order of prefix and title is not followed. Jobs run using the Name parse definition may require the Parse Resource Limit to be set to High. |
Name (Global) | |||
---|---|---|---|
Description | Parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Example 1 | Input | Output | |
MIKAEL T. KLOCKARS | Prefix | ||
Given Name | MIKAEL | ||
Middle Name | T. | ||
Family Name | KLOCKARS | ||
Suffix | |||
Title/Additional Info | |||
Example 2 | Input | Output | |
ROUVA MARJA-LEENA VÄLIAHO | Prefix | ROUVA | |
Given Name | MARJA-LEENA | ||
Middle Name | |||
Family Name | VÄLIAHO | ||
Suffix | |||
Title/Additional Info | |||
Remarks | Jobs run using the Name parse definition may require the Parse Resource Limit set to High. Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (Multiple Name) | |||
---|---|---|---|
Description | Parses strings that contain the names of one or two individuals. | ||
Output Tokens | Name 1 Name 2 |
||
Example 1 | Input | Output | |
PEKKA MÄKELÄ, LIISA MUSTONEN | Name 1 | PEKKA MÄKELÄ | |
Name 2 | LIISA MUSTONEN | ||
Example 2 | Input | Output | |
AULIS LAINE JA DOORIS MÄKI | Name 1 | AULIS LAINE | |
Name 2 | DOORIS MÄKI | ||
Remarks | Jobs run using the Name (Multiple Name) parse definition may require the Parse Resource Limit set to High. |
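
The two examples above separate the names with a comma or with the Finnish conjunction "ja". A rough Python sketch of that split follows. It is not the QKB parsing logic: the separator pattern and the function name `split_two_names` are assumptions, and a single inverted name such as "Lahtinen, Pyry" would be split incorrectly by this simplification.

```python
import re

# Assumed separators for this sketch: a comma, or "ja" as a standalone word.
_SEPARATOR = re.compile(r"\s*,\s*|\s+ja\s+", re.IGNORECASE)


def split_two_names(text: str) -> dict:
    """Split a string holding one or two names into Name 1 and Name 2 tokens."""
    parts = _SEPARATOR.split(text.strip(), maxsplit=1)
    name_1 = parts[0]
    name_2 = parts[1] if len(parts) > 1 else ""
    return {"Name 1": name_1, "Name 2": name_2}


print(split_two_names("PEKKA MÄKELÄ, LIISA MUSTONEN"))
print(split_two_names("AULIS LAINE JA DOORIS MÄKI"))
```

Requiring whitespace on both sides of "ja" keeps hyphenated given names such as MARJA-LEENA intact.
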
Organization | |||
---|---|---|---|
Description | Parses strings that contain organization names. | ||
Output Tokens | Legal Form Prefix Organization Legal Form Suffix Additional Info |
||
Example | Input | Output | |
Oy Vaasan Kone Ab | Legal Form Prefix | Oy | |
Organization | Vaasan Kone | ||
Legal Form Suffix | Ab | ||
Additional Info | |||
Remarks |
Phone | |||
---|---|---|---|
Description | Parses Finnish phone numbers. | ||
Output Tokens | Country Code Area Code Phone Number Additional Info |
||
Example 1 | Input | Output | |
Työ0952557214 | Country Code | ||
Area Code | 09 | ||
Phone Number | 52557214 | ||
Additional Info | Työ | ||
Example 2 | Input | Output | |
GSM +358 50 5258725 | Prefix | ||
Country Code | 358 | ||
Area Code | 50 | ||
Phone Number | 5258725 | ||
Additional Info | GSM | ||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | Parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Example 1 | Input | Output | |
09 5255 7214 Työ | Country Code | ||
Area Code | 09 | ||
Base Number | 52557214 | ||
Extension | |||
Line Type | |||
Additional Info | Työ | ||
Example 2 | Input | Output | |
+358 50 5258725 | Country Code | +358 | |
Area Code | 50 | ||
Base Number | 5258725 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Pattern Analysis Definitions: None.
Address | ||
---|---|---|
Description | Standardizes address data. | |
Examples | Input | Output |
HAAGAN URHEILUTIE 3 AB 11 | Haagan urheilutie 3 aB 11 | |
LEPOLANTIE 105 AS 3 | Lepolantie 105 as 3 | |
Remarks |
Address (Full) | ||
---|---|---|
Description | Standardizes full two-line address data. | |
Example | Input | Output |
LESKIROUVA FREYTAGIN KUJA 3 B 33, 00790 HELSINKI | Leskirouva Freytagin kuja 3 B 33 00790 HELSINKI | |
Remarks |
City | ||
---|---|---|
Description | Standardizes city names. | |
Examples | Input | Output |
YLI-II | Yli-Ii | |
HELSINKI | Helsinki | |
Remarks |
City - State/Province - Postal Code | ||
---|---|---|
Description | Standardizes address "last line" data. | |
Examples | Input | Output |
02151 Espoo | 02151 ESPOO | |
00250 Helsinki | 00250 HELSINKI | |
Remarks | Follows the instructions given in JHS106 and uppercases city names. |
City (Upper) | ||
---|---|---|
Description | Standardizes and uppercases city names. | |
Examples | Input | Output |
Yli-Ii | YLI-II | |
Helsinki | HELSINKI | |
Remarks |
Name | ||
---|---|---|
Description | Standardizes names of individuals. | |
Examples | Input | Output |
MIINA VON LODE | Miina von Lode | |
Lahtinen, Pyry | Pyry Lahtinen | |
Mrs Liisa Lahtinen | Rouva Liisa Lahtinen | |
Remarks | Name prefixes are translated to Finnish. Finnish name prefixes are not abbreviated. |
Name (No Translation) | ||
---|---|---|
Description | Standardizes names of individuals. | |
Examples | Input | Output |
MIINA VON LODE | Miina von Lode | |
Lahtinen, Pyry | Pyry Lahtinen | |
Rva Liisa Lahtinen | Rouva Liisa Lahtinen | |
Mrs. Liisa Lahtinen | Mrs Liisa Lahtinen | |
Frk. Liisa Lahtinen | Frk Liisa Lahtinen | |
Remarks | Name prefixes are not translated to Finnish. Finnish name prefixes are not abbreviated. |
Name (Swedish) | ||
---|---|---|
Description | Standardizes names of individuals. | |
Examples | Input | Output |
Rva Liisa Lahtinen | Fr Liisa Lahtinen | |
Mrs. Liisa Lahtinen | Fr Liisa Lahtinen | |
Frk. Liisa Lahtinen | Frk Liisa Lahtinen | |
Remarks | Name prefixes are translated to Swedish. Name prefixes are abbreviated. |
Organization | ||
---|---|---|
Description | Standardizes organization names. | |
Examples | Input | Output |
OY ADZONE AB | Oy AdZone Ab | |
MATIN RENGAS KY | Matin Rengas Ky | |
Remarks |
Phone | ||
---|---|---|
Description | Standardizes Finnish phone numbers. | |
Examples | Input | Output |
+358 9525571 | 09 525571 | |
050 525571 | 050 525571 | |
Remarks |
Phone (with Country Code) | ||
---|---|---|
Description | Standardizes Finnish phone numbers with country code. | |
Examples | Input | Output |
09525571 | +358 9 525571 | |
003589525571 | +358 9 525571 | |
Remarks |
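
The transformation shown in the examples above can be approximated as follows: drop the trunk prefix "0" or the international prefix "00358", then write the number as "+358", area code, and subscriber number. The Python sketch below illustrates this under stated assumptions. `AREA_PREFIXES` is a small, incomplete list chosen for the example, and `with_country_code` is a hypothetical name rather than a QKB function.

```python
import re

# Illustrative subset of Finnish area and mobile prefixes, without the trunk "0".
# This list is an assumption for the sketch and is not complete.
AREA_PREFIXES = ("50", "40", "9", "2", "3")


def with_country_code(raw: str) -> str:
    """Rewrite a Finnish phone number as '+358 <area> <number>'."""
    digits = re.sub(r"\D", "", raw)
    if raw.strip().startswith("+"):
        if not digits.startswith("358"):
            raise ValueError(f"not a Finnish number: {raw!r}")
        national = digits[3:]
    elif digits.startswith("00358"):
        national = digits[5:]
    elif digits.startswith("0"):
        national = digits[1:]
    else:
        raise ValueError(f"unrecognized format: {raw!r}")
    # Try longer prefixes first so "50" is preferred over a one-digit code.
    for prefix in sorted(AREA_PREFIXES, key=len, reverse=True):
        if national.startswith(prefix):
            return f"+358 {prefix} {national[len(prefix):]}"
    return f"+358 {national}"


print(with_country_code("09525571"))
print(with_country_code("003589525571"))
```

Numbers whose prefix is not in the illustrative list are returned without an area-code split rather than guessed at.
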
Postal Code | ||
---|---|---|
Description | Standardizes postal codes. | |
Examples | Input | Output |
'02150' | 02150 | |
Remarks |
Postal Code (International) | ||
---|---|---|
Description | Standardizes postal codes and inserts country prefixes (Finland and Åland). | |
Examples | Input | Output |
02150 | FI-02150 | |
22150 | AX-22150 | |
Remarks | Country code AX is inserted for postal codes starting with 22 followed by three digits. |
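
The prefix rule in the remark above is simple enough to express directly. The Python sketch below is an illustration only: `add_country_prefix` is a hypothetical name, and the five-digit validation is an assumption added so that the function rejects input it cannot classify.

```python
import re


def add_country_prefix(postal_code: str) -> str:
    """Prefix a five-digit postal code with AX (codes starting with 22) or FI."""
    code = postal_code.strip()
    if not re.fullmatch(r"\d{5}", code):
        raise ValueError(f"expected a five-digit postal code, got {postal_code!r}")
    prefix = "AX" if code.startswith("22") else "FI"
    return f"{prefix}-{code}"


print(add_country_prefix("02150"))
print(add_country_prefix("22150"))
```
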
In addition to the definitions listed on this page, the Finnish, Finland locale also inherits all definitions for the Finnish language and all Global definitions.
Doc ID: QKBCI_FIFIN_defs.html |