SAS Quality Knowledge Base for Contact Information 25
Definitions for the German, Germany locale are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (Address) | ||
---|---|---|
Description | The Case definition for Proper (Address) propercases addresses. | |
Input | Output | |
Examples | Oswald-Von-Nell-Breuning-Allee 24 | Oswald-von-Nell-Breuning-Allee 24 |
AHORNWEG | Ahornweg | |
AM SPORTPLATZ 4a | Am Sportplatz 4A | |
Remarks |
Proper (City) | ||
---|---|---|
Description | The Case definition for Proper (City) propercases city names. | |
Input | Output | |
Examples | claußnitz b mittweida | Claußnitz b Mittweida |
AICHA UNTER DEN BÄUMEN | Aicha unter den Bäumen | |
Castrop-rauxel | Castrop-Rauxel | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Case definition for Proper (Organization) propercases organization names. | |
Input | Output | |
Examples | JT-SYSTEMS | JT-Systems |
PLUS WARENHANDELSGESELLSCHAFT MBH | Plus Warenhandelsgesellschaft mbH | |
VODAFONE D2 GMBH | Vodafone D2 GmbH | |
DAS LEBEN IST EIN MÄRCHEN e.V. | Das Leben ist ein Märchen e.V. | |
Remarks |
None.
Individual/Organization | ||
---|---|---|
Description | The Identification Analysis definition for Individual/Organization determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | INDIVIDUAL ORGANIZATION UNKNOWN |
|
Input | Output | |
Examples | SAS Institute GmbH | ORGANIZATION |
Manfred Kiefer | INDIVIDUAL | |
Bayerische Landesbank | ORGANIZATION | |
Schneider | UNKNOWN | |
Remarks |
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 40 characters | |
Input | Cluster ID | |
Examples | DANZIGER STRAßE 56 2. Etage | 0 |
DANZIGER STRAßE 56, 2. Stock links, HH | 0 | |
Danziger Straße 56, 2. Etg. | 0 | |
Postfach 123456 | 1 | |
POSTFACH 123456 | 1 | |
Oswald-Von-Nell-Breuning Allee 24 | 2 | |
Oswald-Von-Nell-Breuning-All. 24 | 2 | |
Friedenstrasse 101 | 3 | |
Friedenstr. 101 | 3 | |
Postfach 8711, Friedenstrasse 100 | 4 | |
Postfach 8711, Friedenstrasse 101 | 5 | |
Postfach 8811, Friedenstrasse 101 | 6 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address (PO Box Only) | ||
---|---|---|
Description | The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | In der Neckarhelle 162, Postfach 7114 | 0 |
PF 7114 98765 | 0 | |
Postf. 9471 (Kurfürstenstraße) | 1 | |
Kunzendorfer Weg - PF 9471 | 1 | |
Postfach 8711, Friedenstrasse 100 | 2 | |
Postfach 8711, Friedenstrasse 101 | 2 | |
Postfach 8811, Friedenstrasse 101 | 3 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
In the Address (PO Box Only) match definition, street information is ignored. |
Address (Street Only) | ||
---|---|---|
Description | The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address. | |
Max Length of Match Code | 25 characters | |
Input | Cluster ID | |
Examples | In der Neckarhelle 162, Postfach 7114 | 0 |
I. der Neckarhelle 162 | 0 | |
Postfach 123456 | 1 | |
POSTFACH 123456 | 1 | |
Postfach 8711, Friedenstrasse 100 | 2 | |
Postfach 8711, Friedenstrasse 101 | 3 | |
Postfach 8811, Friedenstrasse 101 | 3 | |
Remarks |
PO Box information is ignored. Note: The results listed above reflect the default match sensitivity (85). |
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 23 characters | |
Input | Cluster ID | |
Examples | Seebad Ahlbeck | 0 |
Ahlbeck | 0 | |
St. Augustin | 1 | |
St.. Augustin | 1 | |
Sankt Augustin | 1 | |
VIERSEN | 2 | |
Fiersen | 2 | |
Frankfurt | 3 | |
Frankfurt am Main | 4 | |
frankfurt/main | 4 | |
Frankfurt a. Main | 4 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 30 characters | |
Input | Cluster ID | |
Examples | 06120 Halle/Saale | 0 |
D-06120 Halle | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Name (with Suggestions) | ||
---|---|---|
Description | The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 26 characters | |
Input | Cluster ID | |
Examples | HERMANN BORSCH | 1 |
HERKANN BORSCH | 1 | |
HENRY NICKELSON | 2 | |
HENRY NICKERSON | 2 | |
PAUL HEIDEN | 3 | |
PAUL HEIDE | 3 | |
PAUL HEIDER | 3 | |
PAUL HEIDNER | 4 | |
PAUL HEIDER | 4 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
This definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string; this enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name HERKANN might match the name HERMANN. Note that a consequence of the generation of multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition. Another consequence of the generation of multiple match codes is that more processing time is required than when generating a single match code. Generation of match codes using this definition might take up to five times as long as generation of match codes using a traditional match definition. For more information on suggestion-based matching, refer to the Suggestion-Based Matching section of the DataFlux Data Management Studio Online Help. |
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 40 characters | |
Input | Cluster ID | |
Examples | Müller AG | 0 |
Müller Ltd. & Co. KG | 0 | |
SAS Institute GmbH | 1 | |
SAS Institute Gesellschaft mbH | 1 | |
SAS Institute Gesellschaft mit beschränkter Haftung | 1 | |
DataFlux GmbH, Heidelberg, A SAS Company | 2 | |
DataFlux GmbH, Heidelberg | 2 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 22 characters | |
Input | Cluster ID | |
Examples | 6221-123-456 | 0 |
6221 123 - 456 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | D-69118 | 0 |
69118 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address | |||
---|---|---|---|
Description | The Parse definition for Address parses first line address data into a set of tokens. | ||
Output Tokens | Street Name House Number Extension Additional Info |
||
Input | Output | ||
Example 1 | Kölner Dom Domklosterstr. 2583 1/2 (an der Glocke) | Street Name | Domklosterstr. |
House Number | 2583 1/2 | ||
Extension | Kölner Dom | ||
Additional Info | (an der Glocke) | ||
Input | Output | ||
Example 2 | Bei den Kornschrannen 1 | Street Name | Magdeburger Str. |
House Number | 5 a | ||
Extension | |||
Additional Info | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Goethehaus Frauenplan 1, Vorderhaus (bei Schiller klingeln) | Recipient | |
Building/Site | Goethehaus | ||
Street | Frauenplan 1 | ||
Extension | Vorderhaus | ||
PO Box | |||
Additional Info | (bei Schiller klingeln) | ||
Input | Output | ||
Example 2 | Schneidergasse 13 // R 123 | Recipient | |
Building/Site | |||
Street | Schneidergasse 13 | ||
Extension | R 123 | ||
PO Box | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
Address (Global) (v23) | |||
---|---|---|---|
Description |
The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Goethehaus Frauenplan 1, Vorderhaus (bei Schiller klingeln) | Recipient | |
Building/Site | Goethehaus | ||
Street | Frauenplan 1 | ||
Extension | Vorderhaus | ||
PO Box | |||
Additional Info | (bei Schiller klingeln) | ||
Input | Output | ||
Example 2 | Schneidergasse 13 // R 123 | Recipient | |
Building/Site | |||
Street | Schneidergasse 13 | ||
Extension | R 123 | ||
PO Box | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
City | |||
---|---|---|---|
Description | The Parse definition for City parses cities into a set of tokens. | ||
Output Tokens | City Region Neighboring City |
||
Input | Output | ||
Example 1 | Berlin/Zehlendorf | City | Berlin |
Region | Zehlendorf | ||
Neighboring City | |||
Input | Output | ||
Example 2 | Eppstein im Taunus | City | Eppstein |
Region | Taunus | ||
Neighboring City | |||
Input | Output | ||
Example 3 | Kipfenberg/Arnsberg | City | Kipfenberg |
Region | |||
Neighboring City | Arnsberg | ||
Remarks |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The Parse definition for City - State/Province - Postal Code parses address last line data into a set of tokens. | ||
Output Tokens | City Region Neighboring City Federal State Postal Code |
||
Input | Output | ||
Example 1 | D-85579 Gut Unterbiberg bei München, Bayern | City | Gut Unterbiberg |
Region | |||
Neighboring City | München | ||
Federal State | Bayern | ||
Postal Code | 85579 | ||
Input | Output | ||
Example 2 | 06120 Halle/Saale | City | Halle |
Region | Saale | ||
Neighboring City | |||
Federal State | |||
Postal Code | 06120 | ||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The Parse definition for City - State/Province - Postal Code (Global) parses address last line data into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Input | Output | ||
Example 1 | D-69118 Heidelberg Baden-Württemberg |
City | Heidelberg |
State/Province | Baden-Württemberg | ||
Postal Code | 69118 | ||
Additional Info | |||
Input | Output | ||
Example 2 | D-85579 Gut Unterbiberg bei München |
City | Gut Unterbiberg |
State/Province | |||
Postal Code | 85579 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Organization | |||
---|---|---|---|
Description | The Parse definition for Organization parses organization names into a set of tokens. | ||
Output Tokens | Name Legal Form Site Additional Info |
||
Input | Output | ||
Example 1 | Eon Gastransport AG & Co. KG, Essen, zu Eon Ruhrgas AG | Name | Eon Gastransport |
Legal Form | AG & Co. KG | ||
Site | Essen | ||
Additional Info | zu Eon Ruhrgas AG | ||
Input | Output | ||
Example 2 | Kantinen GmbH, Essen und Trinken, Verlagsgruppe | Name | Kantinen |
Legal Form | GmbH | ||
Site | |||
Additional Info | Essen und Trinken, Verlagsgruppe | ||
Remarks |
Organization (Global) | |||
---|---|---|---|
Description | The Parse definition for Organization (Global) parses organization names into a globally recognized set of tokens. | ||
Output Tokens | Name Legal Form Site Additional Info |
||
Input | Output | ||
Example 1 | Eon Gastransport AG & Co. KG, Essen, zu Eon Ruhrgas AG | Name | Eon Gastransport |
Legal Form | AG & Co. KG | ||
Site | Essen | ||
Additional Info | zu Eon Ruhrgas AG | ||
Input | Output | ||
Example 2 | Kantinen GmbH, Essen und Trinken, Verlagsgruppe | Name | Kantinen |
Legal Form | GmbH | ||
Site | |||
Additional Info | Essen und Trinken, Verlagsgruppe | ||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Phone | |||
---|---|---|---|
Description | The Parse definition for Phone parses German phone numbers into a set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | Büro: +49 06221 415-0 Durchwahl 4629 (fragen Sie nach Mary) | Country Code | +49 |
Area Code | 06221 | ||
Base Number | 415-0 | ||
Extension | 4629 | ||
Line Type | Büro: | ||
Additional Info | (fragen Sie nach Mary) | ||
Input | Output | ||
Example 2 | (030) 12345-67 (030) 12345.67 (030) 12345*67 |
Country Code | |
Area Code | 030 | ||
Base Number | 12345-0 | ||
Extension | 67 | ||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 3 | (06221)415-0 | Country Code | |
Area Code | 06221 | ||
Base Number | 415-0 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks | This Parse definition is intended for German customers to yield results that follow the local convention. |
Phone (Global) | |||
---|---|---|---|
Description | The Parse definition for Phone (Global) parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | Büro: +49 06221 415-0 Durchwahl 4629 (fragen Sie nach Mary) | Country Code | +49 |
Area Code | 06221 | ||
Base Number | 415-0 | ||
Extension | 4629 | ||
Line Type | Büro: | ||
Additional Info | (fragen Sie nach Mary) | ||
Input | Output | ||
Example 2 | (030) 12345-67 |
Country Code | |
Area Code | 030 | ||
Base Number | 12345-67 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Postal Code | |||
---|---|---|---|
Description | The Parse definition for Postal Code parses postal codes into a set of tokens. | ||
Output Tokens | Country Code First Two Postal Code Digits Last Three Postal Code Digits |
||
Input | Output | ||
Example | D-69118 | Country Code | D |
First Two Postal Code Digits | 69 | ||
Last Three Postal Code Digits | 118 | ||
Remarks |
None.
Address | ||
---|---|---|
Description | The Standardization definition for Address standardizes the first line portion of address data. | |
Input | Output | |
Examples | Friedenstrasse 100 | Friedenstraße 100 |
DANZIGER STRAßE 56 | Danziger Straße 56 | |
Dorfgasse 1/5 | Dorfgasse 1 - 5 | |
Remarks |
City | ||
---|---|---|
Description | The Standardization definition for City standardizes city names. | |
Input | Output | |
Examples | FRANKFURT AN DER ODER | Frankfurt (Oder) |
Frankfurt/Main | Frankfurt am Main | |
muenster | Münster | |
Remarks | Uses commonly accepted standards for city names that include rivers or other geographic features. |
City - State/Province - Postal Code | ||
---|---|---|
Description | The Standardization definition for City - State/Province - Postal Code standardizes city and state/province names. | |
Input | Output | |
Examples | Beckingen-Düppenweiler D-66701 | 66701 Beckingen-Düppenweiler |
Halle (Saale) 06122 | 06122 Halle (Saale) | |
59889 Rottenburg a.N. | 59889 Rottenburg am Neckar | |
72172 Sulz | 72172 Sulz am Neckar | |
Remarks |
Organization | ||
---|---|---|
Description | The Standardization definition for Organization standardizes organization names. | |
Input | Output | |
Examples | Eon Gastransport AG & Co. KG, Essen (zu Eon Ruhrgas AG) | EON Gastransport AG & Co KG, Essen, zu EON Ruhrgas AG |
(BMW) AG | BMW AG | |
T - Mobile International AG | T-Mobile AG, International | |
ALFRED C. TOEPFER INTERNATIONAL GMBH | Alfred C Toepfer GmbH, International | |
Remarks |
Phone | ||
---|---|---|
Description | The Standardization definition for Phone standardizes phone numbers. | |
Input | Output | |
Example | +49 6221 123 - 456 | 0049 6221 123456 |
Remarks |
Phone (with Country Code) | ||
---|---|---|
Description | The Standardization definition for Phone (with Country Code) standardizes phone numbers for international use. | |
Input | Output | |
Examples | 06221 4150 | +49 6221 4150 |
(030) 85802 (NACH 4pm) | +49 30 85802, Nach 4PM | |
(0800) 618353 | +49 800 618353 | |
004962214150 | +49 6221 4150 | |
06221-4159-1234 | +49 6221 4159-1234 | |
030 12345-67 (Büro) | +49 30 12345-67, Büro | |
(030) 85802 (fragen Sie nach Mary) | +49 30 85802, Fragen Sie Nach Mary | |
Remarks |
Phone (Electronic) | ||
---|---|---|
Description | The Standardization definition for Phone (Electronic) standardizes phone numbers for automated calling systems. | |
Input | Output | |
Examples | 0044 (0)20 12345000 | +442012345000 |
06221 4150 | +4962214150 | |
06221 415-1234 | +4962214151234 | |
Büro: +49 (12136) 85802-1234 (fragen Sie nach Mary) | +4912136858021234 | |
0800 COMETOGERMANY | +498002663864376269 | |
Remarks |
Postal Code | ||
---|---|---|
Description | The Standardization definition for Postal Code standardizes postal codes. | |
Input | Output | |
Example | D-48465 | 48465 |
Remarks |
In addition to the definitions listed on this page, the German, Germany locale also inherits all definitions for the German language and all Global definitions.
Documentation Feedback: yourturn@sas.com
|
Doc ID: QKBCI_DEDEU_defs.html |