SAS Quality Knowledge Base for Contact Information 25
In the SAS Quality Knowledge Base, the German definitions are shared by all German-language locales. Shared German definitions are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
| Proper (Name) | | |
|---|---|---|
| Description | The Case definition for Proper (Name) propercases names of individuals. | |
| | Input | Output |
| Examples | JUPP SCHMITZ | Jupp Schmitz |
| | erna meyer | Erna Meyer |
| | t.h. Otto | T.H. Otto |
| Remarks | | |

| Name | | |
|---|---|---|
| Description | The Gender Analysis definition for Name determines the gender of a name. | |
| Possible Outputs | M, F, U | |
| | Input | Output |
| Examples | Irene Himpsl | F |
| | Herr M. Kiefer | M |
| | T. Schmidt | U |
| Remarks | | |

| Field Name | | |
|---|---|---|
| Description | The Field Name identification analysis definition identifies the type of data that a database column name denotes. | |
| Possible Outputs | NAME, ORGANIZATION, ADDRESS, CITY, STATE/PROVINCE, POSTALCODE, COUNTRY, PHONE, DATE, UNKNOWN, URL, GENDER, MATCHCODE, PERSONAL_ID, ORGANIZATION_ID, GENERIC_ID, COUNTY, MARITAL_STATUS | |
| | Input | Output |
| Examples | Company Name | ORGANIZATION |
| | Unternehmen | ORGANIZATION |
| | phone | PHONE |
| | Telefonnummer | PHONE |
| | Vorname | NAME |
| | Family_Name | NAME |
| Remarks | Use this definition to determine the type of data stored in a database column based on the name of the column. The Field Name (v23) identification analysis definition is deprecated and will be removed in a future release of the QKB. The Field Name identification analysis definition has been replaced with a copy of the Field Name (v23) definition, which takes advantage of updated processing. If you changed your jobs to use the Field Name (v23) definition, change them back to use the Field Name definition. | |

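
The following Python sketch illustrates how the categories returned by a field-name identification step might be used to group the columns of a table. It does not call the QKB and does not reproduce its identification logic: the lookup table `EXAMPLE_CATEGORIES` and the functions `identify_field_name` and `group_columns_by_category` are hypothetical stand-ins built only from the six examples documented above, with every other input mapped to UNKNOWN.

```python
# Hypothetical stand-in for the identification step: the mapping below
# contains only the six examples documented above, not the QKB vocabulary.
EXAMPLE_CATEGORIES = {
    "company name": "ORGANIZATION",
    "unternehmen": "ORGANIZATION",
    "phone": "PHONE",
    "telefonnummer": "PHONE",
    "vorname": "NAME",
    "family_name": "NAME",
}


def identify_field_name(column_name: str) -> str:
    """Return the documented category for a column name, or UNKNOWN."""
    key = column_name.strip().lower()
    return EXAMPLE_CATEGORIES.get(key, "UNKNOWN")


def group_columns_by_category(column_names):
    """Group column names by their category, preserving input order."""
    groups = {}
    for name in column_names:
        groups.setdefault(identify_field_name(name), []).append(name)
    return groups
```

In a real job, the category would come from the identification analysis definition itself; only the grouping step would look like this sketch.
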
| Field Name (v23) | | |
|---|---|---|
| Description | The Field Name (v23) identification analysis definition identifies the type of data that a database column name denotes. | |
| Possible Outputs | NAME, ORGANIZATION, ADDRESS, CITY, STATE/PROVINCE, POSTALCODE, COUNTRY, PHONE, DATE, UNKNOWN, URL, GENDER, MATCHCODE, PERSONAL_ID, ORGANIZATION_ID, GENERIC_ID, COUNTY, MARITAL_STATUS | |
| | Input | Output |
| Examples | Company Name | ORGANIZATION |
| | Unternehmen | ORGANIZATION |
| | phone | PHONE |
| | Telefonnummer | PHONE |
| | Vorname | NAME |
| | Family_Name | NAME |
| Remarks | Use this definition to determine the type of data stored in a database column based on the name of the column. The Field Name (v23) identification analysis definition is deprecated and will be removed in a future release of the QKB. The Field Name identification analysis definition has been replaced with a copy of the Field Name (v23) definition, which takes advantage of updated processing. If you changed your jobs to use the Field Name (v23) definition, change them back to use the Field Name definition. | |

| Field Name | | |
|---|---|---|
| Description | The Field Name match definition generates match codes which can be used to cluster records containing database field names. | |
| Max Length of Match Code | 15 characters | |
| | Input | Cluster ID |
| Examples | Company Name | 0 |
| | Unternehmen | 0 |
| | Telefonnummer | 1 |
| | Phone | 1 |
| | Vornames | 2 |
| | Name | 2 |
| Remarks | This definition should be used to find potential matches between database column names. Note: The results listed above reflect the default match sensitivity (85). | |

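
The relationship between match codes and cluster IDs can be sketched in Python as follows. This is a minimal illustration that assumes match codes have already been generated elsewhere: records with identical match codes receive the same cluster ID. The function `assign_cluster_ids` is a hypothetical helper, and the codes `CODE_A` and `CODE_B` are placeholders, not real QKB match codes.

```python
def assign_cluster_ids(records):
    """Assign a cluster ID to each (value, match_code) pair.

    Records that share a match code receive the same ID. IDs are
    numbered from 0 in order of first appearance of each match code.
    """
    ids_by_code = {}
    clustered = []
    for value, match_code in records:
        # Reuse the ID of a known code, or allocate the next free ID.
        cluster_id = ids_by_code.setdefault(match_code, len(ids_by_code))
        clustered.append((value, cluster_id))
    return clustered


# Placeholder match codes (hypothetical, NOT real QKB output).
records = [
    ("Company Name", "CODE_A"),
    ("Unternehmen", "CODE_A"),
    ("Telefonnummer", "CODE_B"),
    ("Phone", "CODE_B"),
]
clusters = assign_cluster_ids(records)
```

With these placeholder codes, `clusters` pairs the first two inputs with cluster 0 and the last two with cluster 1, mirroring the layout of the examples table.
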
| Name | | |
|---|---|---|
| Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
| Max Length of Match Code | 20 characters | |
| | Input | Cluster ID |
| Examples | SCHMIDT FREDERICK | 0 |
| | SCHMIDT FREDERICKE | 0 |
| | Paul von Heiden | 1 |
| | Paul von der Heiden | 1 |
| | Karl Heinz Maier-Schmidt | 2 |
| | Karl Heinz Schmidt | 2 |
| | Hans Jürgen Schmidt | 3 |
| Remarks | Note: The results listed above reflect the default match sensitivity (85). | |

| Text | | |
|---|---|---|
| Description | The Text match definition generates match codes which can be used to cluster records containing general text strings. | |
| Max Length of Match Code | 15 characters | |
| | Input | Cluster ID |
| Examples | Data Management | 0 |
| | Das Data Management Studio | 0 |
| Remarks | This definition contains generic matching logic only. Use it for strings that do not fit any existing data type. Note: The results listed above reflect the default match sensitivity (85). | |

| Name | | | |
|---|---|---|---|
| Description | The Parse definition for Name parses names of individuals into a set of tokens. | | |
| Output Tokens | Prefix, Title, Given Name, Family Name, Suffix, Additional Info | | |
| | Input | Output | |
| Example 1 | Herr Dr. Heinz Meyer jun, Bürgermeister | Prefix | Herr |
| | | Title | Dr. |
| | | Given Name | Heinz |
| | | Family Name | Meyer |
| | | Suffix | jun |
| | | Additional Info | Bürgermeister |
| | Input | Output | |
| Example 2 | Vincent Paul Hoffmeister, III | Prefix | |
| | | Title | |
| | | Given Name | Vincent Paul |
| | | Family Name | Hoffmeister |
| | | Suffix | III |
| | | Additional Info | |
| Remarks | | | |

| Name (Global) | | | |
|---|---|---|---|
| Description | The Parse definition for Name (Global) parses names of individuals into a globally recognized set of tokens. | | |
| Output Tokens | Prefix, Given Name, Middle Name, Family Name, Suffix, Title/Additional Info | | |
| | Input | Output | |
| Example 1 | Herr Dr. Heinz Meyer jun, Bürgermeister | Prefix | Herr Dr. |
| | | Given Name | Heinz |
| | | Middle Name | |
| | | Family Name | Meyer |
| | | Suffix | jun |
| | | Title/Additional Info | Bürgermeister |
| | Input | Output | |
| Example 2 | Frau Gräfin Amelia von Thurn und Taxis | Prefix | Frau |
| | | Given Name | Amelia |
| | | Middle Name | |
| | | Family Name | von Thurn und Taxis |
| | | Suffix | |
| | | Title/Additional Info | Gräfin |
| Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | | |

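
Because the Global token set is the same in every locale, parsed results can share one record layout. The Python sketch below shows one possible layout, with one field per Name (Global) output token, filled with the values from the two documented examples. The class name `GlobalNameTokens` and its field names are assumptions made for illustration; the sketch does not perform any parsing itself.

```python
from dataclasses import dataclass, asdict


@dataclass
class GlobalNameTokens:
    """One field per Name (Global) output token (hypothetical layout)."""
    prefix: str = ""
    given_name: str = ""
    middle_name: str = ""
    family_name: str = ""
    suffix: str = ""
    title_additional_info: str = ""


# Token values copied from documented Example 1 and Example 2.
example_1 = GlobalNameTokens(
    prefix="Herr Dr.",
    given_name="Heinz",
    family_name="Meyer",
    suffix="jun",
    title_additional_info="Bürgermeister",
)
example_2 = GlobalNameTokens(
    prefix="Frau",
    given_name="Amelia",
    family_name="von Thurn und Taxis",
    title_additional_info="Gräfin",
)

# Both results share the same columns, so they fit the same table.
rows = [asdict(example_1), asdict(example_2)]
```

Results parsed with the Name (Global) definition of another locale could be appended to `rows` without changing the layout, which is the point of the remark above.
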
| Name (Multiple Name) | | | |
|---|---|---|---|
| Description | The Parse definition for Name (Multiple Name) parses names of multiple individuals into separate tokens. | | |
| Output Tokens | Name 1, Name 2 | | |
| | Input | Output | |
| Example 1 | Herr und Frau Heinz Meyer | Name 1 | Herr Heinz Meyer |
| | | Name 2 | Frau Meyer |
| | Input | Output | |
| Example 2 | Frau Amelia von Thurn und Herr Heinz Meyer | Name 1 | Frau Amelia von Thurn |
| | | Name 2 | Herr Heinz Meyer |
| Remarks | None. | | |

| Name | | |
|---|---|---|
| Description | The Standardization definition for Name standardizes names of individuals. | |
| | Input | Output |
| Examples | ERNST AUGUST MÜLLER | Ernst August Müller |
| | Hr Bürgermeister Prof. rer. nat. Ernst Otto,jun. | Herr Prof. rer. nat. Ernst Otto, jun., Bürgermeister |
| | Hertha Müller, Prof. | Prof. Hertha Müller |
| | Bürgermeister Ernst Otto | Ernst Otto, Bürgermeister |
| Remarks | | |

In addition to the definitions listed on this page, all German-language locales also inherit all Global definitions.
Doc ID: QKBCI_DE_defs.html |