You are here: Definitions>German Definitions>German, Germany Definitions

SAS Quality Knowledge Base for Contact Information 25

German, Germany Definitions

Definitions for the German, Germany locale are described below.

Case Definitions
Gender Analysis Definitions

Identification Analysis Definitions

Match Definitions

Parse Definitions

Pattern Analysis Definitions

Standardization Definitions

Inherited Definitions

Case Definitions

Proper (Address)
Description The Case definition for Proper (Address) propercases addresses.
  Input Output
Examples Oswald-Von-Nell-Breuning-Allee 24 Oswald-von-Nell-Breuning-Allee 24
AHORNWEG Ahornweg
AM SPORTPLATZ 4a Am Sportplatz 4A
Remarks  

 

Proper (City)
Description The Case definition for Proper (City) propercases city names.
  Input Output
Examples claußnitz b mittweida Claußnitz b Mittweida
AICHA UNTER DEN BÄUMEN Aicha unter den Bäumen
Castrop-rauxel Castrop-Rauxel
Remarks  

 

Proper (Organization)
Description The Case definition for Proper (Organization) propercases organization names.
  Input Output
Examples JT-SYSTEMS JT-Systems
PLUS WARENHANDELSGESELLSCHAFT MBH Plus Warenhandelsgesellschaft mbH
VODAFONE D2 GMBH Vodafone D2 GmbH
DAS LEBEN IST EIN MÄRCHEN e.V. Das Leben ist ein Märchen e.V.
Remarks  

Gender Analysis Definitions

None.

Identification Analysis Definitions

Individual/Organization
Description The Identification Analysis definition for Individual/Organization determines whether a string represents the name of an individual or an organization.
Possible Outputs INDIVIDUAL
ORGANIZATION
UNKNOWN
  Input Output
Examples SAS Institute GmbH ORGANIZATION
Manfred Kiefer INDIVIDUAL
Bayerische Landesbank ORGANIZATION
Schneider UNKNOWN
Remarks  

Match Definitions

Address
Description The Address match definition generates match codes which can be used to cluster records containing addresses.
Max Length of Match Code 40 characters
  Input Cluster ID
Examples DANZIGER STRAßE 56 2. Etage 0
DANZIGER STRAßE 56, 2. Stock links, HH 0
Danziger Straße 56, 2. Etg. 0
Postfach 123456 1
POSTFACH 123456 1
Oswald-Von-Nell-Breuning Allee 24 2
Oswald-Von-Nell-Breuning-All. 24 2
Friedenstrasse 101 3
Friedenstr. 101 3
Postfach 8711, Friedenstrasse 100 4
Postfach 8711, Friedenstrasse 101 5
Postfach 8811, Friedenstrasse 101 6
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Address (PO Box Only)
Description The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples In der Neckarhelle 162, Postfach 7114 0
PF 7114 98765 0
Postf. 9471 (Kurfürstenstraße) 1
Kunzendorfer Weg - PF 9471 1
Postfach 8711, Friedenstrasse 100 2
Postfach 8711, Friedenstrasse 101 2
Postfach 8811, Friedenstrasse 101 3
  Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

In the Address (PO Box Only) match definition, street information is ignored.

 

Address (Street Only)
Description The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address.
Max Length of Match Code 25 characters
  Input Cluster ID
Examples In der Neckarhelle 162, Postfach 7114 0
I. der Neckarhelle 162 0
Postfach 123456 1
POSTFACH 123456 1
Postfach 8711, Friedenstrasse 100 2
Postfach 8711, Friedenstrasse 101 3
Postfach 8811, Friedenstrasse 101 3
Remarks

PO Box information is ignored.

NoteNote: The results listed above reflect the default match sensitivity (85).

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 23 characters
  Input Cluster ID
Examples Seebad Ahlbeck 0
Ahlbeck 0
St. Augustin 1
St.. Augustin 1
Sankt Augustin 1
VIERSEN 2
Fiersen 2
Frankfurt 3
Frankfurt am Main 4
frankfurt/main 4
Frankfurt a. Main 4
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 30 characters
  Input Cluster ID
Examples 06120 Halle/Saale 0
D-06120 Halle 0
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Name (with Suggestions)
Description The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 26 characters
  Input Cluster ID
Examples HERMANN BORSCH 1
HERKANN BORSCH 1
HENRY NICKELSON 2
HENRY NICKERSON 2
PAUL HEIDEN 3
PAUL HEIDE 3
PAUL HEIDER 3
PAUL HEIDNER 4
PAUL HEIDER 4
  Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

This definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string; this enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name HERKANN might match the name HERMANN.

Note that a consequence of the generation of multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition.

Another consequence of the generation of multiple match codes is that more processing time is required than when generating a single match code. Generation of match codes using this definition might take up to five times as long as generation of match codes using a traditional match definition.

For more information on suggestion-based matching, refer to the Suggestion-Based Matching section of the DataFlux Data Management Studio Online Help.

 

Organization
Description The Organization match definition generates match codes which can be used to cluster records containing organization names.
Max Length of Match Code 40 characters
  Input Cluster ID
Examples Müller AG 0
Müller Ltd. & Co. KG 0
SAS Institute GmbH 1
SAS Institute Gesellschaft mbH 1
SAS Institute Gesellschaft mit beschränkter Haftung 1
DataFlux GmbH, Heidelberg, A SAS Company 2
DataFlux GmbH, Heidelberg 2
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Phone
Description The Phone match definition generates match codes which can be used to cluster records containing phone numbers.
Max Length of Match Code 22 characters
  Input Cluster ID
Examples 6221-123-456 0
6221 123 - 456 0
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples D-69118 0
69118 0
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

Parse Definitions

Address
Description The Parse definition for Address parses first line address data into a set of tokens.
Output Tokens Street Name
House Number
Extension
Additional Info
  Input Output
Example 1 Kölner Dom Domklosterstr. 2583 1/2 (an der Glocke) Street Name Domklosterstr.
House Number 2583 1/2
Extension Kölner Dom
Additional Info (an der Glocke)
  Input Output
Example 2 Bei den Kornschrannen 1 Street Name Magdeburger Str.
House Number 5 a
Extension  
Additional Info  
Remarks  

 

Address (Global)
Description

The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 Goethehaus Frauenplan 1, Vorderhaus (bei Schiller klingeln) Recipient  
Building/Site Goethehaus
Street Frauenplan 1
Extension Vorderhaus
PO Box  
Additional Info (bei Schiller klingeln)
  Input Output
Example 2 Schneidergasse 13 // R 123 Recipient  
Building/Site  
Street Schneidergasse 13
Extension R 123
PO Box  
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back.

 

Address (Global) (v23)
Description

The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 Goethehaus Frauenplan 1, Vorderhaus (bei Schiller klingeln) Recipient  
Building/Site Goethehaus
Street Frauenplan 1
Extension Vorderhaus
PO Box  
Additional Info (bei Schiller klingeln)
  Input Output
Example 2 Schneidergasse 13 // R 123 Recipient  
Building/Site  
Street Schneidergasse 13
Extension R 123
PO Box  
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back.

 

City
Description The Parse definition for City parses cities into a set of tokens.
Output Tokens City
Region
Neighboring City
  Input Output
Example 1 Berlin/Zehlendorf City Berlin
Region Zehlendorf
Neighboring City  
  Input Output
Example 2 Eppstein im Taunus City Eppstein
Region Taunus
Neighboring City  
  Input Output
Example 3 Kipfenberg/Arnsberg City Kipfenberg
Region  
Neighboring City Arnsberg
Remarks  

 

City - State/Province - Postal Code
Description The Parse definition for City - State/Province - Postal Code parses address last line data into a set of tokens.
Output Tokens City
Region
Neighboring City
Federal State
Postal Code
  Input Output
Example 1 D-85579 Gut Unterbiberg bei München, Bayern City Gut Unterbiberg
Region  
Neighboring City München
Federal State Bayern
Postal Code 85579
  Input Output
Example 2 06120 Halle/Saale City Halle
Region Saale
Neighboring City  
Federal State  
Postal Code 06120
Remarks  

 

City - State/Province - Postal Code (Global)
Description The Parse definition for City - State/Province - Postal Code (Global) parses address last line data into a globally recognized set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
  Input Output
Example 1 D-69118 Heidelberg
Baden-Württemberg
City Heidelberg
State/Province Baden-Württemberg
Postal Code 69118
Additional Info  
  Input Output
Example 2 D-85579 Gut Unterbiberg
bei München
City Gut Unterbiberg
State/Province  
Postal Code 85579
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Organization
Description The Parse definition for Organization parses organization names into a set of tokens.
Output Tokens Name
Legal Form
Site
Additional Info
  Input Output
Example 1 Eon Gastransport AG & Co. KG, Essen, zu Eon Ruhrgas AG Name Eon Gastransport
Legal Form AG & Co. KG
Site Essen
Additional Info zu Eon Ruhrgas AG
  Input Output
Example 2 Kantinen GmbH, Essen und Trinken, Verlagsgruppe Name Kantinen
Legal Form GmbH
Site  
Additional Info Essen und Trinken, Verlagsgruppe
Remarks  

 

Organization (Global)
Description The Parse definition for Organization (Global) parses organization names into a globally recognized set of tokens.
Output Tokens Name
Legal Form
Site
Additional Info
  Input Output
Example 1 Eon Gastransport AG & Co. KG, Essen, zu Eon Ruhrgas AG Name Eon Gastransport
Legal Form AG & Co. KG
Site Essen
Additional Info zu Eon Ruhrgas AG
  Input Output
Example 2 Kantinen GmbH, Essen und Trinken, Verlagsgruppe Name Kantinen
Legal Form GmbH
Site  
Additional Info Essen und Trinken, Verlagsgruppe
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Phone
Description The Parse definition for Phone parses German phone numbers into a set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
  Input Output
Example 1 Büro: +49 06221 415-0 Durchwahl 4629 (fragen Sie nach Mary) Country Code +49
Area Code 06221
Base Number 415-0
Extension 4629
Line Type Büro:
Additional Info (fragen Sie nach Mary)
  Input Output
Example 2 (030) 12345-67
(030) 12345.67
(030) 12345*67
Country Code  
Area Code 030
Base Number 12345-0
Extension 67
Line Type  
Additional Info  
  Input Output
Example 3 (06221)415-0 Country Code  
Area Code 06221
Base Number 415-0
Extension  
Line Type  
Additional Info  
Remarks This Parse definition is intended for German customers to yield results that follow the local convention.

 

Phone (Global)
Description The Parse definition for Phone (Global) parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
  Input Output
Example 1 Büro: +49 06221 415-0 Durchwahl 4629 (fragen Sie nach Mary) Country Code +49
Area Code 06221
Base Number 415-0
Extension 4629
Line Type Büro:
Additional Info (fragen Sie nach Mary)
  Input Output
Example 2 (030) 12345-67
Country Code  
Area Code 030
Base Number 12345-67
Extension  
Line Type  
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Postal Code
Description The Parse definition for Postal Code parses postal codes into a set of tokens.
Output Tokens Country Code
First Two Postal Code Digits
Last Three Postal Code Digits
  Input Output
Example D-69118 Country Code D
First Two Postal Code Digits 69
Last Three Postal Code Digits 118
Remarks  

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description The Standardization definition for Address standardizes the first line portion of address data.
  Input Output
Examples Friedenstrasse 100 Friedenstraße 100
DANZIGER STRAßE 56 Danziger Straße 56
Dorfgasse 1/5 Dorfgasse 1 - 5
Remarks  

 

City
Description The Standardization definition for City standardizes city names.
  Input Output
Examples FRANKFURT AN DER ODER Frankfurt (Oder)
Frankfurt/Main Frankfurt am Main
muenster Münster
Remarks Uses commonly accepted standards for city names that include rivers or other geographic features.

 

City - State/Province - Postal Code
Description The Standardization definition for City - State/Province - Postal Code standardizes city and state/province names.
  Input Output
Examples Beckingen-Düppenweiler D-66701 66701 Beckingen-Düppenweiler
Halle (Saale) 06122 06122 Halle (Saale)
59889 Rottenburg a.N. 59889 Rottenburg am Neckar
72172 Sulz 72172 Sulz am Neckar
Remarks  

 

Organization
Description The Standardization definition for Organization standardizes organization names.
  Input Output
Examples Eon Gastransport AG & Co. KG, Essen (zu Eon Ruhrgas AG) EON Gastransport AG & Co KG, Essen, zu EON Ruhrgas AG
(BMW) AG BMW AG
T - Mobile International AG T-Mobile AG, International
ALFRED C. TOEPFER INTERNATIONAL GMBH Alfred C Toepfer GmbH, International
Remarks  

 

Phone
Description The Standardization definition for Phone standardizes phone numbers.
  Input Output
Example +49 6221 123 - 456 0049 6221 123456
Remarks  

 

Phone (with Country Code)
Description The Standardization definition for Phone (with Country Code) standardizes phone numbers for international use.
  Input Output
Examples 06221 4150 +49 6221 4150
(030) 85802 (NACH 4pm) +49 30 85802, Nach 4PM
(0800) 618353 +49 800 618353
004962214150 +49 6221 4150
06221-4159-1234 +49 6221 4159-1234
030 12345-67 (Büro) +49 30 12345-67, Büro
(030) 85802 (fragen Sie nach Mary) +49 30 85802, Fragen Sie Nach Mary
Remarks  

 

Phone (Electronic)
Description The Standardization definition for Phone (Electronic) standardizes phone numbers for automated calling systems.
  Input Output
Examples 0044 (0)20 12345000 +442012345000
06221 4150 +4962214150
06221 415-1234 +4962214151234
Büro: +49 (12136) 85802-1234 (fragen Sie nach Mary) +4912136858021234
0800 COMETOGERMANY +498002663864376269
Remarks  

 

Postal Code
Description The Standardization definition for Postal Code standardizes postal codes.
  Input Output
Example D-48465 48465
Remarks  

Inherited Definitions

In addition to the definitions listed on this page, the German, Germany locale also inherits all definitions for the German language and all Global definitions.