You are here: Definitions>Finnish Definitions>Finnish, Finland Definitions

SAS Quality Knowledge Base for Contact Information 26

Finnish, Finland Definitions

Definitions for the Finnish, Finland locale are described below. 

Case Definitions
Extraction Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions

Case Definitions

Lower (Title)
Description The Lower (Title) case definition cases titles.
Examples Input Output
DATANOMIHARJOITTELIJA Datanomiharjoittelija
DIPLOMI-INSINÖÖRI, MAA- JA VESIRAKENNUS Diplomi-insinööri, maa- ja vesirakennus
CEO CEO
PHD PhD
Remarks Titles are generally lowercased, but first character is uppercased.

 

Proper (Address)
Description The Proper (Address) case definition propercases addresses.
Examples Input Output
Petri Pasanen Kuja 12 b Petri Pasanen kuja 12 B
PÄIVIÖNKATU 36 A 4 Päiviönkatu 36 A 4
AKU RÄDYNTIE 5 Aku Rädyntie 5
Remarks  

 

Proper (City)
Description The Proper (City) case definition propercases city names.
Examples Input Output
ALA-SEPPÄ Ala-Seppä
NOKIA Nokia
ÖSTERBY Österby
Remarks  

 

Proper (Legal Form)
Description The Proper (Legal Form) case definition propercases legal forms for organizations.
Examples Input Output
OY Oy
RF rf
GMBH GmbH
Remarks  

 

Proper (Name)
Description The Proper (Name) case definition propercases names of individuals.
Examples Input Output
Kaarlo Ylppö Kaarlo Ylppö
MINNA VON KNORRING Minna von Knorring
SALLA SAARNI-YLÄ-KÖNNI Salla Saarni-Ylä-Könni
Remarks  

 

Proper (Organization)
Description The Proper (Organization) case definition propercases organization names.
Examples Input Output
Esab Dalsbruk ESAB Dalsbruk
MSC Electronics MSc elektronics
Sa-Tu Logistics SA-TU Logistics
Remarks  

Extraction Definitions

None.

Gender Analysis Definitions

Name
Description The Name gender analysis definition determines the gender of a name.
Possible Outputs M
F
U
Examples Input Output
Sari Saaritsa F
Mika Matinvesi M
P. J. Hannikainen U
Remarks  

Identification Analysis Definitions

Individual/Organization
Description The Individual/Organization identification analysis definition determines whether a string represents the name of an individual or an organization.
Possible Outputs ORGANIZATION
INDIVIDUAL
UNKNOWN
Examples Input Output
Heli Nyman INDIVIDUAL
T:mi Tarja Harakka ORGANIZATION
Maken Kiska ORGANIZATION
Remarks  

Match Definitions

Address
Description The Address match definition generates match codes which can be used to cluster records containing addresses.
Max Length of Match Code 28 characters
Examples Input Cluster ID
Kala-Matti 12 A 9 0
Fiskar-Matte 12 A 9 0
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Address (Full)
Description The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses.
Max Length of Match Code 40 characters
Examples Input Cluster ID
Nihtisillankatu 3 A, 02511 ESPOO 0
Nihtisillankantie 3 A, 02512 ESPOO 0
näckensgränd 12-14 a 8, 10900 hangö 1
Ahdinkuja 12/1 A 8, 10900 HANKO 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Address (PO Box Only)
Description The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address.
Max Length of Match Code 15 characters
Examples Input Cluster ID
TEKNIIKANTIE 14, PL 123 0
Kala-Matti 12 A 9, PL 123 0
PL 123 0
PL 234 0
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Address (Street Only)
Description The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address.
Max Length of Match Code 21 characters
Examples Input Cluster ID
TEKNIIKANTIE 14, PL 123 0
TEKNIIKANTIE 14 B, PL 234 0
PL 123 1
PL 234 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 15 characters
Examples Input Cluster ID
MÄNTYHARJU 0
MÄNTYHARJU KK 0
MUURAME 1
MUURAME 7 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 15 characters
Examples Input Cluster ID
00250 HELSINKI 0
00251 HELSINGFORS 0
65100 VAASA 1
65101 VASA 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Name
Description The Name match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 20 characters
Examples Input Cluster ID
Diplomi-insinööri Mia Karsten 0
MIIA-NOORA KARLSTEN 0
Frederic Virtanen 1
Fred E. Virtanen 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Organization
Description The Organization match definition generates match codes which can be used to cluster records containing organization names.
Max Length of Match Code 36 characters
Examples Input Cluster ID
MAATALOUDEN LASKENTAKESKUS OY 0
SUOMEN MAATALOUDEN LASKENTAKESKUS KY 0
Fennia 1
KESKINÄINEN VAKUUTUSYHTIÖ FENNIA 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Phone
Description The Phone match definition generates match codes which can be used to cluster records containing phone numbers.
Max Length of Match Code 15 characters
Examples Input Cluster ID
050 3587243 0
+358 50 3587244 0
Soitella: (050) 358 72 45 0
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 15 characters
Examples Input Cluster ID
02150 0
00251 1
FI-00252 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

Parse Definitions

Address
Description The Address parse definition parses addresses into a set of tokens.
Output Tokens Building Name
Street
Building Number
Extension
Additional Info
Example 1 Input Output Token Output
Tekniikantie 14, PL 85 Building Name  
Street Tekniikantie
Building Number 14
Extension PL 85
Additional Info  
Example 2 Input Output Token Output
Topeliuskatu 41 aA 15 Building Name  
Street Topeliuskatu
Building Number 41
Extension aA 15
Additional Info  
Remarks Parsing is following Public Administration Recommendation JHS106 of how to write postal addresses. Address extension and Post Box address are parsed into the same token.

 

Address (Full)
Description The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens.
Output Tokens Building Name
Street
Building Number
Extension
Postal Code
City
Additional Info
Example 1 Input Output Token Output
Tekniikantie 14 PL 85, 02151 ESPOO Building Name  
Street Tekniikantie
Building Number 14
Extension PL 85
Postal Code 02151
City Espoo
Additional Info  
Example 2 Input Output Token Output
Övre s:t Mariegatan 8 B 3 02400 Åbo Building Name  
Street Övre s:t Mariegatan
Building Number 8
Extension B 3
Postal Code 20400
City Åbo
Additional Info  
Remarks  

 

Address (Global)
Description

The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
Example Input Output Token Output
Vesakkotie 5 A 55 Recipient  
Building/Site  
Street Vesakkotie 5
Extension A 55
PO Box  
Additional Info  
  Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens.
Output Tokens Postal Code
City
Example Input Output Token Output
02151 ESPOO Postal Code 02151
City ESPOO
Remarks  

 

City - State/Province - Postal Code (Global)
Description The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
Example Input Output Token Output
02151 ESPOO City ESPOO
State/Province  
Postal Code 02151
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Name
Description The Name parse definition parses names of individuals into a set of tokens.
Output Tokens Title/Additional Info
Prefix
Given Name
Middle Name
Family Name
Suffix
Example 1 Input Output Token Output
MIKAEL T. KLOCKARS Title/Additional Info  
Prefix  
Given Name MIKAEL
Middle Name T
Family Name KLOCKARS
Suffix  
Example 2 Input Output Token Output
ROUVA MARJA-LEENA VÄLIAHO Title/Additional Info  
Prefix ROUVA
Given Name MARJA-LEENA
Middle Name  
Family Name VÄLIAHO
Suffix  
Remarks The order of name and title is following the JHS106 definition. The commonly recognized order of prefix and title is not followed.

 

Name (Global)
Description The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens.
Output Tokens Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info
Example 1 Input Output Token Output
MIKAEL T. KLOCKARS Prefix  
Given Name MIKAEL
Middle Name T.
Family Name KLOCKARS
Suffix  
Title/Additional Info  
Example 2 Input Output Token Output
ROUVA MARJA-LEENA VÄLIAHO Prefix ROUVA
Given Name MARJA-LEENA
Middle Name  
Family Name VÄLIAHO
Suffix  
Title/Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Name (Multiple Name)
Description The Name (Multiple Name) parse definition parses strings that contain the names of two individuals into a set of tokens.
Output Tokens Name 1
Name 2
Example 1 Input Output Token Output
PEKKA MÄKELÄ, LIISA MUSTONEN Name 1 PEKKA MÄKELÄ
Name 2 LIISA MUSTONEN
Example 2 Input Output Token Output
AULIS LAINE JA DOORIS MÄKI Name 1 AULIS LAINE
Name 2 DOORIS MÄKI
Remarks  

 

Organization
Description The Organization parse definition parses organization names into a set of tokens.
Output Tokens Legal Form Prefix
Organization
Legal Form Suffix
Additional Info
Example Input Output Token Output
Oy Vaasan Kone Ab Legal Form Prefix Oy
Organization Vaasan Kone
Legal Form Suffix Ab
Additional Info  
Remarks  

 

Phone
Description The Phone parse definition parses phone numbers into a set of tokens.
Output Tokens Country Code
Area Code
Phone Number
Additional Info
Example 1 Input Output Token Output
Työ095255721 Country Code  
Area Code 09
Phone Number 52557214
Additional Info Työ
Example 2 Input Output Token Output
GSM +358 50 5258725 Country Code 358
Area Code 50
Phone Number 5258725
Additional Info GSM
   
   
Remarks  

 

Phone (Global)
Description The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
Example 1 Input Output Token Output
09 5255 7214 Työ Country Code  
Area Code 09
Base Number 52557214
Extension  
Line Type  
Additional Info Työ
Example 2 Input Output Token Output
+358 50 5258725 Country Code +358
Area Code 50
Base Number 5258725
Extension  
Line Type  
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description The Address standardization definition standardizes addresses.
Examples Input Output
HAAGAN URHEILUTIE 3 AB 11 Haagan urheilutie 3 aB 11
LEPOLANTIE 105 AS 3 Lepolantie 105 as 3
Remarks  

 

Address (Full)
Description The Address (Full) standardization definition standardizes complete two line addresses.
Example Input Output
LESKIROUVA FREYTAGIN KUJA 3 B 33, 00790 HELSINKI Leskirouva Freytagin kuja 3 B 33 00790 HELSINKI
Remarks  

 

City
Description The City standardization definition standardizes city names.
Examples Input Output
YLI-II Yli-Ii
HELSINKI Helsinki
Remarks  

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code standardization definition standardizes last line address information.
Examples Input Output
02151 Espoo 02151 ESPOO
00250 Helsinki 00250 HELSINKI
Remarks Follows the instructions given in JHS106 and uppercases city names.

 

City (Upper)
Description The City (Upper) standardization definition standardizes and uppercases city names.
Examples Input Output
Yli-Ii YLI-II
Helsinki HELSINKI
Remarks  

 

Name
Description The Name standardization definition standardizes names of individuals.
Examples Input Output
MIINA VON LODE Miina von Lode
Lahtinen, Pyry Pyry Lahtinen
Mrs Liisa Lahtinen Rouva Liisa Lahtinen
Remarks Name prefixes are translated to Finnish. Finnish name prefixes are not abbreviated.

 

Name (No Translation)
Description

The Name (No Translation) standardization definition standardizes names of individuals.

Examples Input Output
MIINA VON LODE Miina von Lode
Lahtinen, Pyry Pyry Lahtinen
Rva Liisa Lahtinen Rouva Liisa Lahtinen
Mrs. Liisa Lahtinen Mrs Liisa Lahtinen
Frk. Liisa Lahtinen Frk Liisa Lahtinen
Remarks Name prefixes are not translated to Finnish. Finnish name prefixes are not abbreviated.

 

Name (Swedish)
Description

The Name (Swedish) standardization definition standardizes names of individuals.

Examples Input Output
Rva Liisa Lahtinen Fr Liisa Lahtinen
Mrs. Liisa Lahtinen Fr Liisa Lahtinen
Frk. Liisa Lahtinen Frk Liisa Lahtinen
Remarks Name prefixes are translated to Swedish. Name prefixes are abbreviated.

 

Organization
Description The Organization standardization definition standardizes organization names.
Examples Input Output
OY ADZONE AB Oy AdZone Ab
MATIN RENGAS KY Matin Rengas Ky
Remarks  

 

Phone
Description The Phone standardization definition standardizes phone numbers for domestic use.
Examples Input Output
+358 9525571 09 525571
050 525571 050 525571
Remarks  

 

Phone (with Country Code)
Description The Phone (with Country Code) standardization definition standardizes phone numbers for international use.
Examples Input Output
09525571 +385 9 525571
003589525571 +358 9 525571
Remarks  

 

Postal Code
Description The Postal Code standardization definition standardizes postal codes.
Examples Input Output
'02150' 02150
Remarks  

 

Postal Code (International)
Description

The Postal Code (International) standardization definition standardizes postal codes and inserts country prefixes.

Examples Input Output
02150 FI-02150
22150 AX-22150
Remarks Country code AX is inserted for postal codes starting with 22 followed by three digits.

Inherited Definitions

In addition to the definitions listed on this page, the Finnish, Finland locale also inherits all definitions for the Finnish language and all Global definitions.