SAS Quality Knowledge Base for Contact Information 25
Definitions for the Danish, Denmark locale are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (Address) | ||
---|---|---|
Description | The Proper (Address) case definition propercases addresses. | |
Input | Output | |
Examples | NØRRE ALLÉ 27 | Nørre Allé 27 |
GLUCKSVEJ 13, 1. TV | Glucksvej 13, 1. tv | |
Remarks |
Proper (Address (Full)) | ||
---|---|---|
Description | The Proper (Address (Full)) case definition propercases complete two-line addresses. | |
Input | Output | |
Example | STOREGADEN 10, 1450 KØBENHAVN K | Storegaden 10, 1450 København K |
Remarks |
Proper (City - State/Province - Postal Code) | ||
---|---|---|
Description | The Proper (City - State/Province - Postal Code) case definition propercases last line address information. | |
Input | Output | |
Example | 2450 KØBENHAVN SV | 2450 København SV |
Remarks |
Proper (Name) | ||
---|---|---|
Description | The Proper (Name) case definition propercases names of individuals. | |
Input | Output | |
Examples | SØREN PAHL | Søren Pahl |
KLAUS FRIIS-HANSEN | Klaus Friis-Hansen | |
kim christensen | Kim Christensen | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Proper (Organization) case definition propercases organization names. | |
Input | Output | |
Examples | A/S DANSK RØRINDUSTRI | A/S Dansk Rørindustri |
POLYTEKNISK BOGHANDEL OG FORLAG | Polyteknisk Boghandel og Forlag | |
Remarks | This definition uses a list of known organization names to handle exceptions to propercasing rules. |
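
The remark for Proper (Organization) mentions a list of known names that overrides ordinary propercasing. As a rough illustration of that idea only, and not the QKB's actual logic, the Python sketch below capitalizes each word unless it appears in a small exception table. The `EXCEPTIONS` table and the `propercase_org` function are hypothetical names invented for this example.

```python
# Hypothetical exception table: lowercase word -> required casing.
# The real QKB list is not reproduced here; these two entries are assumptions
# chosen to cover the documented examples.
EXCEPTIONS = {"a/s": "A/S", "og": "og"}


def propercase_org(text: str) -> str:
    """Capitalize each word unless the exception table overrides it."""
    cased = []
    for word in text.split():
        key = word.lower()
        # Fall back to plain capitalization when no exception is known.
        cased.append(EXCEPTIONS.get(key, key.capitalize()))
    return " ".join(cased)


print(propercase_org("A/S DANSK RØRINDUSTRI"))
print(propercase_org("POLYTEKNISK BOGHANDEL OG FORLAG"))
```

Without the exception table, plain capitalization would turn `A/S` into `A/s` and `OG` into `Og`, which is why a lookup of this kind is useful.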
Name | ||
---|---|---|
Description | The Name gender analysis definition determines the gender of a name. | |
Possible Outputs | M, F, U | |
Input | Output | |
Examples | Hr Eli Solomon | M |
Eva Solomon | F | |
Eli Solomon | U | |
Remarks |
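
The three examples for the Name gender analysis definition suggest that a prefix such as "Hr" can settle the gender even when the given name alone cannot. The Python sketch below mimics that behavior with two small lookup tables. It is a simplified illustration under stated assumptions, not the QKB's implementation: `PREFIX_GENDER`, `GIVEN_NAME_GENDER`, and `name_gender` are hypothetical, and "Eli" is deliberately left out of the given-name table so that it falls through to `U`.

```python
# Hypothetical lookup tables; the QKB's real vocabularies are not shown here.
PREFIX_GENDER = {"hr": "M", "herr": "M", "fru": "F"}
GIVEN_NAME_GENDER = {"eva": "F", "hans": "M"}


def name_gender(name: str) -> str:
    """Return 'M', 'F', or 'U' using prefixes first, then given names."""
    words = [w.strip(".,").lower() for w in name.split()]
    # A gendered prefix is treated as the strongest signal.
    for word in words:
        if word in PREFIX_GENDER:
            return PREFIX_GENDER[word]
    # Otherwise fall back to a known given name.
    for word in words:
        if word in GIVEN_NAME_GENDER:
            return GIVEN_NAME_GENDER[word]
    return "U"


for sample in ("Hr Eli Solomon", "Eva Solomon", "Eli Solomon"):
    print(sample, "->", name_gender(sample))
```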
Individual/Organization | ||
---|---|---|
Description | The Individual/Organization identification analysis definition determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | Organization, Individual, Unknown | |
Input | Output | |
Examples | A/S Dansk Rørindustri | Organization |
Bager Ole Olsen | Individual | |
Remarks |
Org/Individual/Address | ||
---|---|---|
Description | The Org/Individual/Address identification analysis definition identifies content as organization, individual, or address. | |
Possible Outputs | Organization, Individual, Address, Unknown | |
Input | Output | |
Examples | A/S Dansk Rørindustri | Organization |
Bager Ole Olsen | Individual | |
Ole Olsens gade 14 | Address | |
Remarks | This definition is highly dependent on the content and formatting of the input data. You should fine-tune the definition for the data on which it is to be used. |
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 55 characters | |
Input | Cluster ID | |
Examples | Strødamsvej 46, Postboks 11 | 0 |
Postboks 11 Strødamsvej 46 | 0 | |
PO Box 11, Store Havnevej 22 | 1 | |
Store Havnevej 22 PO Box 11 | 1 | |
PO Box 11 | 2 | |
Store Havnevej 22 | 3 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). The information used in the match code generated by this definition corresponds to what was used in the Address (Standard) match definition in QKB CI 2013A and earlier. | |
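
The Cluster ID column in the match definition tables can be read as "records with the same match code share a cluster". The Python sketch below shows only that grouping step. The key function `toy_match_key` is a hypothetical stand-in that lowercases the input and sorts its tokens; it is not how QKB match codes are generated, it ignores match sensitivity, and it does not catch spelling variants.

```python
import re


def toy_match_key(address: str) -> str:
    """Hypothetical stand-in for a match code: sorted lowercase tokens."""
    tokens = re.findall(r"\w+", address.lower())
    return " ".join(sorted(tokens))


def cluster_ids(records):
    """Assign a cluster ID per record; equal keys share an ID."""
    seen = {}
    ids = []
    for record in records:
        key = toy_match_key(record)
        if key not in seen:
            seen[key] = len(seen)  # next unused cluster ID
        ids.append(seen[key])
    return ids


records = [
    "Strødamsvej 46, Postboks 11",
    "Postboks 11 Strødamsvej 46",
    "PO Box 11, Store Havnevej 22",
    "Store Havnevej 22 PO Box 11",
    "PO Box 11",
    "Store Havnevej 22",
]
for record, cid in zip(records, cluster_ids(records)):
    print(cid, record)
```

For these six inputs the toy key happens to reproduce the grouping in the Address table, because the paired records differ only in token order and punctuation.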
Address (Full) | ||
---|---|---|
Description | The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses. | |
Max Length of Match Code | 35 characters | |
Input | Cluster ID | |
Examples | Storegaden 10, vær 201, 1450 København | 0 |
Storegaden 10, 1450 København K | 0 | |
Mars Allé 60 | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
Address (PO Box Only) | ||
---|---|---|
Description | The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Strødamsvej 46, Postboks 11 | 0 |
Postboks 11 Strødamsvej 46 | 0 | |
PO Box 11, Store Havnevej 22 | 0 | |
Store Havnevej 22 PO Box 11 | 0 | |
PO Box 11 | 0 | |
Postboks 22 | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
Address (Street Only) | ||
---|---|---|
Description | The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address. | |
Max Length of Match Code | 28 characters | |
Input | Cluster ID | |
Examples | Strødamsvej 46, Postboks 11 | 0 |
Postboks 11 Strødamsvej 46 | 0 | |
Stroedamsvej 46, 1. tv. | 0 | |
PO Box 11, Store Havnevej 22 | 1 | |
Store Havnevej 22 PO Box 11 | 1 | |
Store Havnevej 22, stuen | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Høsholm | 0 |
Hørsholm | 0 | |
Odense | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 1260 København K. | 0 |
1260 København | 0 | |
2630 Tåstrup | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | Erik Petersen | 0 |
Erik Pedersen | 0 | |
Erik Ross Pedersen | 0 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | Boehringer Ingelheim Danmark A/S | 0 |
Boehringer Ingelheim | 0 | |
SAS Institute A/S | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | +4586789345 | 0 |
86 78 93 45 | 0 | |
33 33 93 33 | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). | |
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 1450 | 0 |
2630 | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
Address | |||
---|---|---|---|
Description | The Address parse definition parses addresses into a set of tokens. | ||
Output Tokens | Recipient, Building/Site, Street Name, Street Number, Extension, PO Box, Additional Info | ||
Input | Output | ||
Example 1 | Ny Carlsbergvej 9, 1. tv | Recipient | |
Building/Site | |||
Street Name | Ny Carlsbergvej | ||
Street Number | 9 | ||
Extension | 1. tv | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 2 | Niels Bohrs Vej 17 2. th., Stilling, Postboks 683 | Recipient | |
Building/Site | |||
Street Name | Niels Bohrs Vej | ||
Street Number | 17 | ||
Extension | 2. th. | ||
PO Box | Postboks 683 | ||
Additional Info | Stilling | ||
Input | Output | ||
Example 3 | Bygning 303 Institut for Matematik | Recipient | Institut for Matematik |
Building/Site | Bygning 303 | ||
Street Name | |||
Street Number | |||
Extension | |||
PO Box | |||
Additional Info | |||
Remarks |
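
To make the token layout of the Address parse definition concrete, the Python sketch below splits the simplest kind of input, a street name followed by a number and an optional extension, into three of the documented tokens. It is a minimal illustration assuming that shape of input; the regular expression and the `parse_simple_address` function are hypothetical and do not attempt the Recipient, Building/Site, PO Box, or Additional Info tokens that the QKB definition handles.

```python
import re

# Assumed input shape: "<street name> <number>[, <extension>]".
SIMPLE_ADDRESS = re.compile(
    r"^(?P<street>\D+?)\s+(?P<number>\d+\w*)(?:,?\s*(?P<ext>.*))?$"
)


def parse_simple_address(text: str) -> dict:
    """Split a simple street address into three of the documented tokens."""
    match = SIMPLE_ADDRESS.match(text.strip())
    if match is None:
        # Input does not fit the assumed shape; return empty tokens.
        return {"Street Name": "", "Street Number": "", "Extension": ""}
    return {
        "Street Name": match.group("street").strip(),
        "Street Number": match.group("number"),
        "Extension": (match.group("ext") or "").strip(),
    }


print(parse_simple_address("Ny Carlsbergvej 9, 1. tv"))
```

Inputs such as Example 2, which mix an extension, a locality, and a PO box, would need vocabulary-driven rules rather than a single pattern.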
Address (Full) | |||
---|---|---|---|
Description | The Address (Full) parse definition parses complete two-line addresses into a set of tokens. | ||
Output Tokens | Building Name, Street, Building Number, Extension, Village, Post Number, City, Additional Info | ||
Input | Output | ||
Example | Storegaden 10, 1450 København K | Building Name | |
Street | Storegaden | ||
Building Number | 10 | ||
Extension | |||
Village | |||
Post Number | 1450 | ||
City | København K | ||
Additional Info | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description | The Address (Global) parse definition parses addresses into a globally recognized set of tokens. | ||
Output Tokens | Recipient, Building/Site, Street, Extension, PO Box, Additional Info | ||
Input | Output | ||
Example | Ny Carlsbergvej 9, 1. tv | Recipient | |
Building/Site | |||
Street | Ny Carlsbergvej 9 | ||
Extension | 1. tv | ||
PO Box | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), change them back to Address (Global). |
Address (Global) (v23) | |||
---|---|---|---|
Description | The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens. | ||
Output Tokens | Recipient, Building/Site, Street, Extension, PO Box, Additional Info | ||
Input | Output | ||
Example | Ny Carlsbergvej 9, 1. tv | Recipient | |
Building/Site | |||
Street | Ny Carlsbergvej 9 | ||
Extension | 1. tv | ||
PO Box | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), change them back to Address (Global). |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens. | ||
Output Tokens | Village, Post Number, City | ||
Input | Output | ||
Example | Askov, 6000 Vejen | Village | Askov |
Post Number | 6000 | ||
City | Vejen | ||
Remarks |
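
The last-line example above ("Askov, 6000 Vejen") has a regular shape: an optional village before a comma, a four-digit Danish postal code, then the city. The Python sketch below captures just that shape. It is an illustrative approximation, not the QKB parser; `LAST_LINE` and `parse_last_line` are hypothetical names, and input that does not fit the pattern is returned with empty tokens.

```python
import re

# Assumed shape: "[<village>, ]<4-digit post number> <city>".
LAST_LINE = re.compile(
    r"^(?:(?P<village>[^,]+),\s*)?(?P<post>\d{4})\s+(?P<city>.+)$"
)


def parse_last_line(text: str) -> dict:
    """Split last-line address information into the three documented tokens."""
    match = LAST_LINE.match(text.strip())
    if match is None:
        return {"Village": "", "Post Number": "", "City": ""}
    return {
        "Village": (match.group("village") or "").strip(),
        "Post Number": match.group("post"),
        "City": match.group("city").strip(),
    }


print(parse_last_line("Askov, 6000 Vejen"))
print(parse_last_line("2100 København Ø."))
```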
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens. | ||
Output Tokens | City, State/Province, Postal Code, Additional Info | ||
Input | Output | ||
Example | 2100 København Ø. | City | København Ø. |
State/Province | |||
Postal Code | 2100 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name | |||
---|---|---|---|
Description | The Name parse definition parses names of individuals into a set of tokens. | ||
Output Tokens | Prefix, Given Name, Middle Name, Family Name, Suffix, Title/Additional Info | ||
Input | Output | ||
Example 1 | Fru Helle Sørensen | Prefix | Fru |
Given Name | Helle | ||
Middle Name | |||
Family Name | Sørensen | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 2 | Rasmussen, Irene K. | Prefix | |
Given Name | Irene | ||
Middle Name | K. | ||
Family Name | Rasmussen | ||
Suffix | |||
Title/Additional Info | |||
Remarks |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix, Given Name, Middle Name, Family Name, Suffix, Title/Additional Info | ||
Input | Output | ||
Example 1 | Fru Helle Sørensen | Prefix | Fru |
Given Name | Helle | ||
Middle Name | |||
Family Name | Sørensen | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 2 | Rasmussen, Irene K. | Prefix | |
Given Name | Irene | ||
Middle Name | K. | ||
Family Name | Rasmussen | ||
Suffix | |||
Title/Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (Multiple Name) | |||
---|---|---|---|
Description | The Name (Multiple Name) parse definition parses strings that contain the names of two individuals into a set of tokens. | ||
Output Tokens | Name 1, Name 2 | ||
Input | Output | ||
Example 1 | Fru Eva & Herr Hans Brøndum | Name 1 | Fru Eva Brøndum |
Name 2 | Herr Hans Brøndum | ||
Input | Output | ||
Example 2 | Eva og Hans Brøndum | Name 1 | Eva Brøndum |
Name 2 | Hans Brøndum | ||
Remarks |
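
Both Name (Multiple Name) examples show the family name, given once at the end, being attached to each individual. The Python sketch below reproduces that behavior for strings joined by "og" or "&". It is a simplified illustration: `split_two_names` is a hypothetical function, and it assumes that the last word of the second name is a family name shared by both people, which will not hold for every input.

```python
import re


def split_two_names(text: str) -> dict:
    """Split 'A og B Family' or 'A & B Family' into two full names."""
    parts = re.split(r"\s+(?:og|&)\s+", text.strip(), maxsplit=1)
    if len(parts) != 2:
        # No conjunction found; treat the whole string as a single name.
        return {"Name 1": text.strip(), "Name 2": ""}
    first, second = parts
    # Assumption: the final word of the second name is the shared family name.
    family = second.split()[-1]
    return {"Name 1": f"{first} {family}", "Name 2": second}


print(split_two_names("Fru Eva & Herr Hans Brøndum"))
print(split_two_names("Eva og Hans Brøndum"))
```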
Phone | |||
---|---|---|---|
Description | The Phone parse definition parses phone numbers into a set of tokens. | ||
Output Tokens | Prefix, Country Code, Base Number, Extension | ||
Input | Output | ||
Example | +45 98 10 20 22 - 3991 | Prefix | |
Country Code | +45 | ||
Base Number | 98 10 20 22 | ||
Extension | 3991 | ||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code, Area Code, Base Number, Extension, Line Type, Additional Info | ||
Input | Output | ||
Example | +45 98 10 20 22 - 3991 | Country Code | +45 |
Area Code | |||
Base Number | 98 10 20 22 | ||
Extension | 3991 | ||
Line Type | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
No pattern analysis definitions are listed for this locale.
Address | ||
---|---|---|
Description | The Address standardization definition standardizes addresses. | |
Input | Output | |
Examples | Borgm Christiansens Gade 50 | Borgmester Christiansens Gade 50 |
GL. KONGEVEJ 60 2 TV | Gammel Kongevej 60, 2 tv. | |
Remarks |
Address (Full) | ||
---|---|---|
Description | The Address (Full) standardization definition standardizes complete two-line addresses. | |
Input | Output | |
Example | Borgm Christiansens Gade 50 2400, Kobenhavn NV. | Borgmester Christiansens Gade 50, 2400 København NV |
Remarks |
City | ||
---|---|---|
Description | The City standardization definition standardizes city names. | |
Input | Output | |
Examples | Copenhagen | København |
Lokken | Løkken | |
Remarks | Common city abbreviations are expanded into full names. |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code standardization definition standardizes last line address information. | |
Input | Output | |
Example | 2400, Kobenhavn NV. | 2400 København NV |
Remarks |
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Input | Output | |
Examples | Herr Bjorn Simonsen | Hr Bjørn Simonsen |
Gregersen, Ole | Ole Gregersen | |
Remarks |
Name (Trailing Title) | ||
---|---|---|
Description | The Name (Trailing Title) standardization definition standardizes names of individuals, putting the title at the end. | |
Input | Output | |
Examples | Bager Bjorn Simonsen | Bjørn Simonsen, Bager |
Gregersen, Ole, Advokat | Ole Gregersen, Advokat | |
Remarks |
Organization | ||
---|---|---|
Description | The Organization standardization definition standardizes organization names. | |
Input | Output | |
Examples | BANG OG OLUFSEN | Bang & Olufsen |
teledk a.s. | TDC A/S | |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Input | Output | |
Examples | +45 4442 8633 | 44 42 86 33 |
97180722 lok 11 | 97 18 07 22 - 11 | |
Remarks |
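
The two Phone standardization examples imply a domestic format of four digit pairs, with any extension appended after " - ". The Python sketch below approximates that output for eight-digit Danish numbers. It is a hedged illustration rather than the QKB's rules: `standardize_phone` is a hypothetical function, it treats a trailing "lok" followed by digits as an extension marker (an assumption based on the second example), and it returns anything it does not recognize unchanged.

```python
import re


def standardize_phone(raw: str) -> str:
    """Format an 8-digit Danish number as 'NN NN NN NN[ - ext]'."""
    text = raw.strip()
    extension = ""
    # Assumption: a trailing "lok <digits>" marks an extension.
    ext_match = re.search(r"\blok\.?\s*(\d+)\s*$", text, flags=re.IGNORECASE)
    if ext_match:
        extension = ext_match.group(1)
        text = text[: ext_match.start()]
    # Drop the Danish country code for domestic use.
    if text.startswith("+45"):
        text = text[3:]
    digits = re.sub(r"\D", "", text)
    if len(digits) != 8:
        return raw  # leave unexpected input untouched
    pairs = " ".join(digits[i:i + 2] for i in range(0, 8, 2))
    return f"{pairs} - {extension}" if extension else pairs


print(standardize_phone("+45 4442 8633"))
print(standardize_phone("97180722 lok 11"))
```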
Postal Code | ||
---|---|---|
Description | The Postal Code standardization definition standardizes postal codes. | |
Input | Output | |
Examples | 2400, | 2400 |
-2400 | 2400 | |
Remarks |
In addition to the definitions listed on this page, the Danish, Denmark locale also inherits all definitions for the Danish language and all Global definitions.
Doc ID: QKBCI_DADNK_defs.html