SAS Quality Knowledge Base for Contact Information 25
In the SAS Quality Knowledge Base, the Arabic definitions are shared by all Arabic-language locales. Shared Arabic definitions are described below.
Note: To obtain optimal results, before performing data cleansing on phone numbers, the area code or governorate (which are often stored in databases separately from the base number) should first be concatenated with the base number.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
None.
Name | ||
---|---|---|
Description | The Name gender analysis definition determines the gender of a name. | |
Possible Outputs | M F U |
|
Input | Output | |
Examples | Mohammed Soliman Abdallah | M |
Eman Soliman Abdallah | F | |
Reda Soliman Abdallah | U | |
Mr. Reda Soliman Abdallah | M | |
Mrs. Reda Soliman Abdallah | F | |
محمد سليمان عبدالله | M | |
ايمان سليمان عبدالله | F | |
رضا سليمان عبدالله | U | |
الأمير رضا سليمان عبدالله | M | |
مدام رضا سليمان عبدالله | F | |
John Smith | M | |
Mary Smith | F | |
J Smith | U | |
Remarks |
None.
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 30 characters | |
Input | Cluster ID | |
Examples | مازن سلمان عاطف | 0 |
مازن صفوت عاطف | 0 | |
مازن صفوت بهجت عاطف | 0 | |
Ahmed Salman Atef | 1 | |
Ahmed Safwat Atef | 1 | |
Ahmed Safwat Bahgat Atef | 1 | |
شريف حسنى محمد | 2 | |
شريف غنيم محمد | 2 | |
إِسْحَاق | 3 | |
إسحاق | 3 | |
طارق جعفر أبو ال وفاء | 4 | |
طارق جعفر أبوال وفاء | 4 | |
طارق جعفر أبو الوفاء | 4 | |
إحمد | 5 | |
أحمد | 5 | |
آحمد | 5 | |
احمد | 5 | |
Hassan Samir Abu Al Regal | 6 | |
Hassan Samir Bin Al Regal | 6 | |
Hassan Samir Al Regal | 6 | |
Hassan Samir Al-Regal | 6 | |
حسن سمير أبو ملكي | 6 | |
حسن سمير بن ملكي | 6 | |
حسن سمير الملكي | 6 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Name | |||
---|---|---|---|
Description | The Name parse definition parses names of individuals into a set of tokens. | ||
Output Tokens | Prefix Given Name Patronym/Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | الأستاذة مازن سلمان عاطف السيد سلمان قاصر | Prefix | الأستاذة |
Given Name | مازن | ||
Patronym/Middle Name | سلمان عاطف السيد | ||
Family Name | سلمان | ||
Suffix | |||
Title/Additional Info | قاصر | ||
Input | Output | ||
Example 2 | أنس الوجود شريف الخولي | Prefix | |
Given Name | أنس الوجود | ||
Patronym/Middle Name | شريف | ||
Family Name | الخولي | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 3 | حسن إبراهيم | Prefix | |
Given Name | حسن | ||
Patronym/Middle Name | |||
Family Name | إبراهيم | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 4 | عبدالله | Prefix | |
Given Name | عبدالله | ||
Patronym/Middle Name | |||
Family Name | |||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 5 | Mr. Mohamed Abdel Maksoud Abdel-Halim, CEO | Prefix | Mr. |
Given Name | Mohamed | ||
Patronym/Middle Name | Abdel Maksoud | ||
Family Name | Abdel-Halim | ||
Suffix | |||
Title/Additional Info | CEO | ||
Input | Output | ||
Example 6 | Mazen Aziz | Prefix | |
Given Name | Mazen | ||
Patronym/Middle Name | |||
Family Name | Aziz | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 7 | John Smith Sr. | Prefix | |
Given Name | John | ||
Patronym/Middle Name | |||
Family Name | Smith | ||
Suffix | Sr. | ||
Title/Additional Info | |||
Input | Output | ||
Example 8 | John | Prefix | |
Given Name | |||
Patronym/Middle Name | |||
Family Name | John | ||
Suffix | |||
Title/Additional Info | |||
Remarks | The Name parse definition parses Arabic names written in the Arabic script and in the Latin script in an equivalent manner. Single-word Arabic names are parsed into Given Name. Western names are parsed according to Western standards. |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | الأستاذة مازن سلمان عاطف السيد سلمان قاصر | Prefix | الأستاذة |
Given Name | مازن | ||
Middle Name | سلمان عاطف السيد | ||
Family Name | سلمان | ||
Suffix | |||
Title/Additional Info | قاصر | ||
Input | Output | ||
Example 2 | أنس الوجود شريف الخولي | Prefix | |
Given Name | أنس الوجود | ||
Middle Name | شريف | ||
Family Name | الخولي | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 3 | حسن إبراهيم | Prefix | |
Given Name | حسن | ||
Middle Name | |||
Family Name | إبراهيم | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 4 | عبدالله | Prefix | |
Given Name | عبدالله | ||
Middle Name | |||
Family Name | |||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 5 | Mr. Mohamed Abdel Maksoud Abdel-Halim, CEO | Prefix | Mr. |
Given Name | Mohamed | ||
Middle Name | Abdel Maksoud | ||
Family Name | Abdel-Halim | ||
Suffix | |||
Title/Additional Info | CEO | ||
Input | Output | ||
Example 6 | Mazen Aziz | Prefix | |
Given Name | Mazen | ||
Middle Name | |||
Family Name | Aziz | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 7 | John Smith Sr. | Prefix | |
Given Name | John | ||
Middle Name | |||
Family Name | Smith | ||
Suffix | Sr. | ||
Title/Additional Info | |||
Input | Output | ||
Example 8 | John | Prefix | |
Given Name | |||
Middle Name | |||
Family Name | John | ||
Suffix | |||
Title/Additional Info | |||
Remarks | The Name (Global) parse definition parses Arabic names written in the Arabic script and in the Latin script in an equivalent manner. Single-word Arabic names are parsed into Given Name. Western names are parsed according to Western standards. | ||
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
None.
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Input | Output | |
Examples | محمود ابراهيم سلمان، ق | محمود ابراهيم سلمان، قاصر |
محمود ابراهيم سلمان (ق) | محمود ابراهيم سلمان، قاصر | |
مازن سلمان عاطف السيد سلمان مدير إداري | مازن سلمان عاطف السيد سلمان، مدير اداري | |
مازن سلمان حمودة (معلومات إضافية) | مازن سلمان حموده، معلومات اضافيه | |
عناية: مازن سلمان حمودة | مازن سلمان حموده | |
KHDRA ABDALLAH ABBASS | Khdra AbdAllah Abbass | |
Professor Mohamed Abdel-Maksoud Abdel-Halim | Prof Mohamed AbdElMaksoud AbdElHalim | |
Mohamed AbdElMaksoud AbdElHalim (Additional Information) | Mohamed AbdElMaksoud AbdElHalim, Additional Information | |
Remarks |
In addition to the definitions listed on this page, all Arabic-language locales also inherit all Global definitions.
Documentation Feedback: yourturn@sas.com
|
Doc ID: QKBCI_AR_defs.html |