SAS Quality Knowledge Base for Contact Information 26
In the SAS Quality Knowledge Base, the German definitions are shared by all German-language locales. Shared German definitions are described below.
Case Definitions
Extraction Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
| Proper (Name) | | |
|---|---|---|
| Description | The Proper (Name) case definition propercases names of individuals. | |
| | Input | Output |
| Examples | JUPP SCHMITZ | Jupp Schmitz |
| | erna meyer | Erna Meyer |
| | t.h. Otto | T.H. Otto |
| Remarks | None. | |
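As a rough illustration of what proper-casing does to the example inputs, the following Python sketch capitalizes each run of letters. It is not the QKB implementation: `naive_propercase` is a hypothetical helper with no locale-specific vocabulary, so a name particle such as "von" would come out as "Von".

```python
import re

# Matches a run of Unicode letters (no digits, no underscore).
_LETTER_RUN = re.compile(r"[^\W\d_]+")


def naive_propercase(name: str) -> str:
    """Uppercase the first letter of every letter run and lowercase the rest."""
    return _LETTER_RUN.sub(lambda m: m.group(0).capitalize(), name)


if __name__ == "__main__":
    # Inputs taken from the Proper (Name) examples.
    for raw in ("JUPP SCHMITZ", "erna meyer", "t.h. Otto"):
        print(raw, "->", naive_propercase(raw))
```

Treating every letter run separately is what lets initials such as "t.h." become "T.H." without special handling of the periods.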
| Name | | |
|---|---|---|
| Description | The Name gender analysis definition determines the gender of a name. | |
| Possible Outputs | M, F, U | |
| | Input | Output |
| Examples | Irene Himpsl | F |
| | Herr M. Kiefer | M |
| | T. Schmidt | U |
| Remarks | | |
| Field Name | | |
|---|---|---|
| Description | The Field Name identification analysis definition identifies database column names. | |
| Possible Outputs | NAME, ORGANIZATION, ADDRESS, CITY, STATE/PROVINCE, POSTALCODE, COUNTRY, PHONE, DATE, UNKNOWN, URL, GENDER, MATCHCODE, PERSONAL_ID, ORGANIZATION_ID, GENERIC_ID, COUNTY, MARITAL_STATUS | |
| | Input | Output |
| Examples | Company Name | ORGANIZATION |
| | Unternehmen | ORGANIZATION |
| | phone | PHONE |
| | Telefonnummer | PHONE |
| | Vorname | NAME |
| | Family_Name | NAME |
| Remarks | This definition is recommended to determine the type of data stored in a database column based on the name of the column. | |
| Field Name | | |
|---|---|---|
| Description | The Field Name match definition generates match codes which can be used to cluster records containing database field names. | |
| Max Length of Match Code | 15 characters | |
| | Input | Cluster ID |
| Examples | Company Name | 0 |
| | Unternehmen | 0 |
| | Telefonnummer | 1 |
| | Phone | 1 |
| | Vornames | 2 |
| | Name | 2 |
| Remarks | This definition should be used to find potential matches between database column names. | |
| Name | | |
|---|---|---|
| Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
| Max Length of Match Code | 20 characters | |
| | Input | Cluster ID |
| Examples | SCHMIDT FREDERICK | 0 |
| | SCHMIDT FREDERICKE | 0 |
| | Paul von Heiden | 1 |
| | Paul von der Heiden | 1 |
| | Karl Heinz Maier-Schmidt | 2 |
| | Karl Heinz Schmidt | 2 |
| | Hans Jürgen Schmidt | 3 |
| Remarks | | |
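The Cluster ID column in the match examples can be read as a simple rule: inputs whose match codes are equal fall into the same cluster. A minimal Python sketch of that grouping step follows. The match codes used here (`"A"` to `"D"`) are made-up placeholders, not real QKB match codes, and `assign_cluster_ids` is a hypothetical helper; producing the codes themselves is the job of the match definition.

```python
from typing import Dict, Iterable, List, Tuple


def assign_cluster_ids(records: Iterable[Tuple[str, str]]) -> List[Tuple[str, int]]:
    """Give each (value, match_code) pair a cluster ID; equal codes share an ID.

    IDs are handed out in the order in which new match codes first appear.
    """
    code_to_id: Dict[str, int] = {}
    result: List[Tuple[str, int]] = []
    for value, match_code in records:
        # A code seen for the first time gets the next free ID.
        cluster_id = code_to_id.setdefault(match_code, len(code_to_id))
        result.append((value, cluster_id))
    return result


# Placeholder codes: "A".."D" stand in for whatever the Name match definition returns.
EXAMPLE_RECORDS = [
    ("SCHMIDT FREDERICK", "A"),
    ("SCHMIDT FREDERICKE", "A"),
    ("Paul von Heiden", "B"),
    ("Paul von der Heiden", "B"),
    ("Karl Heinz Maier-Schmidt", "C"),
    ("Karl Heinz Schmidt", "C"),
    ("Hans Jürgen Schmidt", "D"),
]

if __name__ == "__main__":
    for value, cluster_id in assign_cluster_ids(EXAMPLE_RECORDS):
        print(cluster_id, value)
```

Grouping on exact equality keeps the clustering step trivial; all of the fuzziness lives in how the match code is generated.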
| Text | | |
|---|---|---|
| Description | The Text match definition generates match codes which can be used to cluster records containing general text strings. | |
| Max Length of Match Code | 15 characters | |
| | Input | Cluster ID |
| Examples | Data Management | 0 |
| | Das Data Management Studio | 0 |
| Remarks | Contains generic matching logic only. Use this definition for strings that do not fit any existing data type. | |
| Name | | | |
|---|---|---|---|
| Description | The Name parse definition parses names of individuals into a set of tokens. | | |
| Output Tokens | Prefix, Title, Given Name, Family Name, Suffix, Additional Info | | |
| | Input | Output Token | Output |
| Example 1 | Herr Dr. Heinz Meyer jun, Bürgermeister | Prefix | Herr |
| | | Title | Dr. |
| | | Given Name | Heinz |
| | | Family Name | Meyer |
| | | Suffix | jun |
| | | Additional Info | Bürgermeister |
| | Input | Output Token | Output |
| Example 2 | Vincent Paul Hoffmeister, III | Prefix | |
| | | Title | |
| | | Given Name | Vincent Paul |
| | | Family Name | Hoffmeister |
| | | Suffix | III |
| | | Additional Info | |
| Remarks | | | |
| Name (Global) | | | |
|---|---|---|---|
| Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | | |
| Output Tokens | Prefix, Given Name, Middle Name, Family Name, Suffix, Title/Additional Info | | |
| | Input | Output Token | Output |
| Example 1 | Herr Dr. Heinz Meyer jun, Bürgermeister | Prefix | Herr Dr. |
| | | Given Name | Heinz |
| | | Middle Name | |
| | | Family Name | Meyer |
| | | Suffix | jun |
| | | Title/Additional Info | Bürgermeister |
| | Input | Output Token | Output |
| Example 2 | Frau Gräfin Amelia von Thurn und Taxis | Prefix | Frau |
| | | Given Name | Amelia |
| | | Middle Name | |
| | | Family Name | von Thurn und Taxis |
| | | Suffix | |
| | | Title/Additional Info | Gräfin |
| Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | | |
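Because the Global token set is the same in every locale, parsed results can share one record layout. The sketch below models that layout as a Python dataclass and fills it with the two Name (Global) examples. The class name `GlobalNameTokens` and its field names are illustrative assumptions, not identifiers from the QKB.

```python
from dataclasses import asdict, dataclass


@dataclass
class GlobalNameTokens:
    """One row of parsed output; an empty string stands for a token with no value."""
    prefix: str = ""
    given_name: str = ""
    middle_name: str = ""
    family_name: str = ""
    suffix: str = ""
    title_additional_info: str = ""


# Values copied from Example 1 and Example 2 of the Name (Global) definition.
example_1 = GlobalNameTokens(
    prefix="Herr Dr.",
    given_name="Heinz",
    family_name="Meyer",
    suffix="jun",
    title_additional_info="Bürgermeister",
)
example_2 = GlobalNameTokens(
    prefix="Frau",
    given_name="Amelia",
    family_name="von Thurn und Taxis",
    title_additional_info="Gräfin",
)

if __name__ == "__main__":
    # Both rows expose the same six columns, whatever the input looked like.
    for row in (example_1, example_2):
        print(asdict(row))
```

A fixed set of fields with empty defaults mirrors the tables above, where tokens without a value still appear as empty cells.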
| Name (Multiple Name) | | | |
|---|---|---|---|
| Description | The Name (Multiple Name) parse definition parses strings that contain the names of two individuals into a set of tokens. | | |
| Output Tokens | Name 1, Name 2 | | |
| | Input | Output Token | Output |
| Example 1 | Herr und Frau Heinz Meyer | Name 1 | Herr Heinz Meyer |
| | | Name 2 | Frau Meyer |
| | Input | Output Token | Output |
| Example 2 | Frau Gräfin Amelia von Thurn und Taxis | Name 1 | Frau Amelia von Thurn |
| | | Name 2 | Herr Heinz Meyer |
| Remarks | None. | | |
| Name | | |
|---|---|---|
| Description | The Name standardization definition standardizes names of individuals. | |
| | Input | Output |
| Examples | ERNST AUGUST MÜLLER | Ernst August Müller |
| | Hr Bürgermeister Prof. rer. nat. Ernst Otto,jun. | Herr Prof. rer. nat. Ernst Otto, jun., Bürgermeister |
| | Hertha Müller, Prof. | Prof. Hertha Müller |
| | Bürgermeister Ernst Otto | Ernst Otto, Bürgermeister |
| Remarks | | |
In addition to the definitions listed on this page, all German-language locales also inherit all Global definitions.
Doc ID: QKBCI_DE_defs.html