SAS Quality Knowledge Base for Contact Information 25
Definitions for the Arabic, Egypt locale are described below.
Note: For best results, concatenate the area code or governorate (which is often stored in a separate database field) with the base number before cleansing phone numbers.
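The concatenation step described in the note can be sketched in Python as follows. This is only an illustration: the function name `combine_phone` and the idea that the two parts arrive as separate strings are assumptions, not part of the QKB.

```python
def combine_phone(area_code, base_number):
    """Join a separately stored area code (or governorate name) with the base number."""
    area = (area_code or "").strip()
    base = (base_number or "").strip()
    # When no area code is stored, the base number is returned unchanged.
    return f"{area} {base}".strip()


print(combine_phone("045", "454 8783"))
print(combine_phone(None, "23927892"))
```

The combined string, rather than the bare base number, would then be passed to the phone definitions.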
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Case Definitions
None.
Gender Analysis Definitions

National ID | ||
---|---|---|
Description | The National ID gender analysis definition determines the gender associated with a National ID number. | |
Possible Outputs | M F U |
|
Input | Output | |
Examples | 28709210101846 | F |
28709210101836 | M | |
287092101018360 | U | |
Remarks |
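The three examples above are consistent with the commonly described layout of the 14-digit Egyptian National ID, in which the thirteenth digit is odd for males and even for females. The Python sketch below assumes that layout; it is not the QKB's implementation, and the function name `national_id_gender` is hypothetical.

```python
def national_id_gender(value):
    """Return 'M', 'F', or 'U' for a National ID string."""
    digits = value.strip()
    # Anything that is not exactly 14 ASCII digits is reported as unknown.
    if len(digits) != 14 or not (digits.isascii() and digits.isdigit()):
        return "U"
    # Assumption: the 13th digit is odd for males and even for females.
    return "M" if int(digits[12]) % 2 == 1 else "F"


for sample in ("28709210101846", "28709210101836", "287092101018360"):
    print(sample, national_id_gender(sample))
```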
Identification Analysis Definitions

Phone | ||
---|---|---|
Description | The Phone identification analysis definition determines the type of phone number. | |
Possible Outputs | LANDLINE MOBILE MULTIPLE FOREIGN INVALID |
|
Input | Output | |
Examples | (045) 454 8783 | LANDLINE |
1401451/دمياط | LANDLINE | |
011/1005002 | MOBILE | |
0100509397-33852428 | MULTIPLE | |
+1 (919) 457-7000 | FOREIGN | |
23927892 | LANDLINE | |
2392789 | INVALID | |
Remarks | The area code or Governorate must be present to be considered a valid domestic phone number, except for eight-digit base numbers, which always imply Cairo. |
Match Definitions

Governorate | ||
---|---|---|
Description | The Governorate match definition generates match codes which can be used to cluster records containing governorate names. | |
Max Length of Match Code | 18 characters | |
Input | Cluster ID | |
Examples | Met Ghamr | 0 |
ميت غمر | 0 | |
El Sheikh Zweed | 1 | |
Al Sheikh Zweed | 1 | |
الشيخ زويد | 1 | |
ال شيخ زويد | 1 | |
Al Ibrahimia | 2 | |
Al-Ibrahimia | 2 | |
الإبراهيمية | 2 | |
الابراهيميه | 2 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 22 characters | |
Input | Cluster ID | |
Examples | 456-7891 | 0 |
456-7890 | 0 | |
456-7800 | 1 | |
2567-8912 | 2 | |
2567-8910 | 2 | |
2567-8900 | 3 | |
+20 48 123 4567 | 4 | |
048 123 4567 | 4 | |
Work: 048 123 4567 | 4 | |
048 123 4567 العمل | 4 | |
(047) 438 5396 | 5 | |
4385396 كفر الشيخ | 5 | |
2545 7930 | 6 | |
(02) 2545 7930 | 6 | |
Remarks |
Note that the number of digits retained in the match codes is a function of the number of digits in the base number and the match sensitivity. Governorates match their corresponding numeric area codes. Eight-digit (Cairo) base numbers match whether the 02 area code is present or not. |
|
Note: The results listed above reflect the default match sensitivity (85). |
Parse Definitions

Phone | |||
---|---|---|---|
Description | The Phone parse definition parses phone numbers into a set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | +20-2-2464-8436 x1234 مكتب | Country Code | +20 |
Area Code | 2 | ||
Base Number | 2464-8436 | ||
Extension | 1234 | ||
Line Type | مكتب | ||
Additional Info | |||
Input | Output | ||
Example 2 | 02-2464-8436 في المساء | Country Code | |
Area Code | 02 | ||
Base Number | 2464-8436 | ||
Extension | |||
Line Type | |||
Additional Info | في المساء | ||
Input | Output | ||
Example 3 | 2337290-طنطا | Country Code | |
Area Code | طنطا | ||
Base Number | 2337290 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 4 | Home: (062) 787 9214 | Country Code | |
Area Code | 062 | ||
Base Number | 787 9214 | ||
Extension | |||
Line Type | Home: | ||
Additional Info | |||
Input | Output | ||
Example 5 | +212-520-123456 | Country Code | +212 |
Area Code | |||
Base Number | 520-123456 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks |
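To make the token layout concrete, here is a minimal Python sketch that reproduces the five examples above. It is an approximation under stated assumptions, not the definition's actual logic: the vocabularies `LINE_TYPES` and `GOVERNORATES` contain only words taken from the examples, and the function name `parse_phone` is hypothetical.

```python
import re

# Hypothetical vocabularies, limited to words that appear in the examples.
LINE_TYPES = ("مكتب", "Home:")
GOVERNORATES = ("طنطا",)
TOKENS = ("Country Code", "Area Code", "Base Number",
          "Extension", "Line Type", "Additional Info")


def parse_phone(text):
    out = dict.fromkeys(TOKENS, "")
    rest = text
    # Extension: an "x" followed by digits.
    m = re.search(r"\bx(\d+)", rest)
    if m:
        out["Extension"] = m.group(1)
        rest = rest.replace(m.group(0), " ")
    # Line type words and governorate names come from the vocabularies.
    for word in LINE_TYPES:
        if word in rest:
            out["Line Type"] = word
            rest = rest.replace(word, " ")
    for word in GOVERNORATES:
        if word in rest:
            out["Area Code"] = word
            rest = rest.replace(word, " ")
    # The numeric part is a run of digits, spaces, hyphens, and parentheses.
    m = re.search(r"\+?\(?\d[\d\s()\-]*\d", rest)
    if m is None:
        out["Additional Info"] = rest.strip()
        return out
    number = m.group(0)
    # Whatever text is left over is treated as additional information.
    out["Additional Info"] = rest.replace(number, " ").strip(" -")
    m = re.match(r"(\+\d+)[\s-]+(.*)", number)
    if m:
        out["Country Code"], number = m.group(1), m.group(2)
    # Only domestic (+20 or no country code) numbers get a numeric area code.
    if out["Country Code"] in ("", "+20") and not out["Area Code"]:
        m = re.match(r"\(?(\d{1,3})\)?[\s-]+(.*)", number)
        if m:
            out["Area Code"], number = m.group(1), m.group(2)
    out["Base Number"] = number.strip(" -")
    return out


print(parse_phone("Home: (062) 787 9214"))
```

Separating line types from additional information depends entirely on the vocabulary, so a real implementation would need a far larger word list than the two entries used here.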
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | +20-2-2464-8436 x1234 مكتب | Country Code | +20 |
Area Code | 2 | ||
Base Number | 2464-8436 | ||
Extension | 1234 | ||
Line Type | مكتب | ||
Additional Info | |||
Input | Output | ||
Example 2 | 02-2464-8436 في المساء | Country Code | |
Area Code | 02 | ||
Base Number | 2464-8436 | ||
Extension | |||
Line Type | |||
Additional Info | في المساء | ||
Input | Output | ||
Example 3 | 2337290-طنطا | Country Code | |
Area Code | طنطا | ||
Base Number | 2337290 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 4 | Home: (062) 787 9214 | Country Code | |
Area Code | 062 | ||
Base Number | 787 9214 | ||
Extension | |||
Line Type | Home: | ||
Additional Info | |||
Input | Output | ||
Example 5 | +212-520-123456 | Country Code | +212 |
Area Code | |||
Base Number | 520-123456 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Phone (Multiple Number) | |||
---|---|---|---|
Description | The Phone (Multiple Number) parse definition parses multiple phone number format input into a set of tokens. | ||
Output Tokens | Phone 1 Phone 2 |
||
Input | Output | ||
Example 1 | 0107902881/010750788 | Phone 1 | 0107902881 |
Phone 2 | 010750788 | ||
Input | Output | ||
Example 2 | 0114698116-27162079 | Phone 1 | 0114698116 |
Phone 2 | 27162079 | ||
Input | Output | ||
Example 3 | 0121176617,37833437 | Phone 1 | 0121176617 |
Phone 2 | 37833437 | ||
Input | Output | ||
Example 4 | 011/1005002 | Phone 1 | 011/1005002 |
Phone 2 | |||
Remarks |
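The four examples above suggest that a separator splits the input only when both sides look like complete numbers; in Example 4 the part before the slash is a three-digit prefix, so the input stays whole. The Python sketch below imitates that behavior with a hypothetical threshold (`min_digits=7`) chosen only to fit the examples; the real definition's rules are not documented here.

```python
import re


def split_phones(text, min_digits=7):
    """Split an input into Phone 1 and Phone 2 when it holds two full numbers."""
    parts = re.split(r"[/,\-]", text)
    digit_counts = [len(re.sub(r"[^0-9]", "", part)) for part in parts]
    # Assumption: each side needs at least min_digits digits to count as a number.
    if len(parts) == 2 and all(count >= min_digits for count in digit_counts):
        return {"Phone 1": parts[0].strip(), "Phone 2": parts[1].strip()}
    return {"Phone 1": text.strip(), "Phone 2": ""}


print(split_phones("0114698116-27162079"))
print(split_phones("011/1005002"))
```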
Pattern Analysis Definitions
None.
Standardization Definitions

Governorate (Arabic to English Transliteration) | ||
---|---|---|
Description | The Governorate (Arabic to English Transliteration) standardization definition translates names of governorates from Arabic to English. | |
Input | Output | |
Examples | العاشر من رمضان | 10th of Ramadan |
الحوامدية | Al Hawamdya | |
كفر الدوار | Kafr El Dawar | |
المنصورة | Mansoura | |
Remarks |
Governorate (English to Arabic Transliteration) | ||
---|---|---|
Description | The Governorate (English to Arabic Transliteration) standardization definition translates names of governorates from English to Arabic. | |
Input | Output | |
Examples | Menia Al Qamh | منيا القمح |
Nasr Al Nawbah | نصر النوبة | |
Safaga | سفاجا | |
6th of October | السادس من أكتوبر | |
Remarks |
Name (Arabic to English Transliteration) | ||
---|---|---|
Description | The Name (Arabic to English Transliteration) standardization definition transliterates personal names from Arabic to English. | |
Input | Output | |
Examples | طارق جعفر ابوالعينين | Tareq Jafar AboAlEnein |
محمود راضى ابراهيم | Mahmoud Rady Ibrahim | |
محمد سمير عبدالسلام | Mohamed Samir AbdElSalam | |
Remarks |
Name (English to Arabic Transliteration) | ||
---|---|---|
Description | The Name (English to Arabic Transliteration) standardization definition transliterates personal names from English to Arabic. | |
Input | Output | |
Examples | Tareq Jafar AboAlEnein | طارق جعفر ابوالعينين |
Mahmoud Rady Ibrahim | محمود راضى ابراهيم | |
Mohamed Samir AbdElSalam | محمد سمير عبدالسلام | |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Input | Output | |
Examples | +20 (0)48 454 8783 | (048) 454 8783 |
+961 01 218-757 | +961 1218757 | |
(02) 2464 8436 العمل | (02) 2464 8436, العمل | |
7483119 الأسكندرية | (03) 748 3119 | |
Alexandria 7483119 | (03) 748 3119 | |
Alexandria (03) 748 3119 | (03) 748 3119 | |
03الأسكندرية748 3119 | (03) 748 3119 | |
2545 7930 | (02) 2545 7930 | |
Work: (02) 2464 8436 | (02) 2464 8436, العمل | |
Remarks | The Phone standardization definition converts governorates to their corresponding numeric area code, inserts the area code 02 onto eight-digit numbers, and translates English line type phrases to Arabic, as shown in the final six examples. |
Phone (Electronic) | ||
---|---|---|
Description | The Phone (Electronic) standardization definition standardizes phone numbers for automated calling systems. | |
Input | Output | |
Examples | (062) 787 9211 x123 مكتب ، في الصباح | +20627879211 |
(0)48 454 8783 | +20484548783 | |
+961 01 218-757 | +9611218757 | |
(101) 961 01 218-757 | +9611218757 | |
00961 01 218-757 | +9611218757 | |
(02) 2464 8436 العمل | +20224648436 | |
7483119 الأسكندرية | +2037483119 | |
Alexandria 7483119 | +2037483119 | |
Alexandria (03) 748 3119 | +2037483119 | |
03الأسكندرية748 3119 | +2037483119 | |
2545 7930 | +20225457930 | |
Remarks | The Phone (Electronic) standardization definition converts governorates to their corresponding numeric area code, and inserts area code 2 onto eight-digit numbers, as shown in the final five examples. |
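The domestic cases in the table above can be approximated with the following Python sketch. It is a simplified illustration, not the QKB's logic: it handles only inputs without a country or international prefix, the `GOVERNORATE_AREA` map contains just the governorate spellings seen in the examples, and the function name `to_electronic` is hypothetical.

```python
import re

# Hypothetical lookup, limited to the governorate spellings in the examples.
GOVERNORATE_AREA = {"الأسكندرية": "3", "Alexandria": "3"}


def to_electronic(text):
    """Return a '+20...' digit string for a domestic input, or None if out of scope."""
    # Inputs that already carry a country or international prefix are not handled.
    if text.lstrip().startswith(("+", "00")):
        return None
    # Drop an extension such as "x123" before collecting digits.
    text = re.sub(r"\bx\d+", "", text)
    area = next((code for name, code in GOVERNORATE_AREA.items() if name in text), "")
    digits = re.sub(r"[^0-9]", "", text)
    if digits.startswith("0"):
        national = digits[1:]        # a leading 0 means the area code is present
    elif area:
        national = area + digits     # governorate name converted to its area code
    elif len(digits) == 8:
        national = "2" + digits      # eight-digit base numbers imply Cairo
    else:
        return None
    return "+20" + national


print(to_electronic("(062) 787 9211 x123 مكتب ، في الصباح"))  # +20627879211
print(to_electronic("2545 7930"))                              # +20225457930
```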
Phone (Legacy Mobile Conversion) | ||
---|---|---|
Description | The Phone (Legacy Mobile Conversion) standardization definition standardizes phone numbers for domestic use and converts legacy mobile codes to the current format. | |
Input | Output | |
Examples | 012 555 5555 | 0122 555 5555 |
018 555 5555 | 0128 555 5555 | |
017 555 5555 | 0127 555 5555 | |
0150 555 5555 | 0120 555 5555 | |
010 555 5555 | 0100 555 5555 | |
016 555 5555 | 0106 555 5555 | |
019 555 5555 | 0109 555 5555 | |
0151 555 5555 | 0101 555 5555 | |
011 555 5555 | 0111 555 5555 | |
014 555 5555 | 0114 555 5555 | |
0152 555 5555 | 0112 555 5555 | |
+20 12 555 5555 | 0122 555 5555 | |
+20 18 555 5555 | 0128 555 5555 | |
+20 17 555 5555 | 0127 555 5555 | |
+20 150 555 5555 | 0120 555 5555 | |
+20 10 555 5555 | 0100 555 5555 | |
+20 16 555 5555 | 0106 555 5555 | |
+20 19 555 5555 | 0109 555 5555 | |
+20 151 555 5555 | 0101 555 5555 | |
+20 11 555 5555 | 0111 555 5555 | |
+20 14 555 5555 | 0114 555 5555 | |
+20 152 555 5555 | 0112 555 5555 | |
Remarks | You can apply the Phone (Legacy Mobile Conversion) standardization definition prior to applying the Phone match definition if matching legacy numbers with their current format is desired. |
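The prefix mapping implied by the examples above can be written as a small lookup table. The Python sketch below is a hedged illustration only: the mapping is copied from the example rows, while the function name `convert_legacy_mobile` and the rule that a legacy number has a seven-digit base are assumptions.

```python
import re

# Legacy prefix -> current prefix, taken from the example rows above.
LEGACY_PREFIXES = {
    "012": "0122", "018": "0128", "017": "0127", "0150": "0120",
    "010": "0100", "016": "0106", "019": "0109", "0151": "0101",
    "011": "0111", "014": "0114", "0152": "0112",
}


def convert_legacy_mobile(text):
    """Return the current-format number, or None if no legacy prefix applies."""
    digits = re.sub(r"[^0-9]", "", text)
    # Replace a leading +20 country code with the domestic trunk prefix 0.
    if text.strip().startswith("+20"):
        digits = "0" + digits[2:]
    # Try four-digit legacy prefixes before three-digit ones.
    for length in (4, 3):
        prefix, base = digits[:length], digits[length:]
        # Assumption: a legacy number has exactly seven digits after the prefix.
        if prefix in LEGACY_PREFIXES and len(base) == 7:
            return f"{LEGACY_PREFIXES[prefix]} {base[:3]} {base[3:]}"
    return None


print(convert_legacy_mobile("012 555 5555"))      # 0122 555 5555
print(convert_legacy_mobile("+20 150 555 5555"))  # 0120 555 5555
```

Numbers that are already in the current format, such as 0100 555 5555, fall through both checks and return None in this sketch, so they are not converted a second time.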
Phone (with Country Code) | ||
---|---|---|
Description | The Phone (with Country Code) standardization definition standardizes phone numbers for international use. | |
Input | Output | |
Examples | (0)48 454 8783 | +20 48 454 8783 |
+961 01 218-757 | +961 1218757 | |
(101) 961 01 218-757 | +961 1218757 | |
00961 01 218-757 | +961 1218757 | |
(02) 2464 8436 العمل | +20 2 2464 8436, العمل | |
7483119 الأسكندرية | +20 3 748 3119 | |
Alexandria 7483119 | +20 3 748 3119 | |
Alexandria (03) 748 3119 | +20 3 748 3119 | |
03الأسكندرية748 3119 | +20 3 748 3119 | |
2545 7930 | +20 2 2545 7930 | |
Work: (02) 2464 8436 | +20 2 2464 8436, العمل | |
Remarks | The Phone (with Country Code) standardization definition converts governorates to their corresponding numeric area code, inserts the area code 2 onto eight-digit numbers, and translates English line type phrases to Arabic, as shown in the final six examples. |
Phone (with Governorate) | ||
---|---|---|
Description | The Phone (with Governorate) standardization definition standardizes phone numbers for domestic use and converts the numeric area code to the Arabic name of the governorate served by that area code. | |
Input | Output | |
Examples | +20 2 2999 9999 | 2999 9999 القاهرة الكبرى |
02 2999 9999 | 2999 9999 القاهرة الكبرى | |
2999 9999 | 2999 9999 القاهرة الكبرى | |
03 499 9999 | 499 9999 الاسكندرية | |
084 499 9999 | 499 9999 الفيوم | |
Remarks | The Phone (with Governorate) standardization definition inserts the القاهرة الكبرى string onto eight-digit numbers, as shown in the third example. |
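The area-code-to-governorate conversion shown above can be sketched in Python as follows. The lookup table `AREA_TO_GOVERNORATE` contains only the three area codes that appear in the examples, and the function name `with_governorate` is hypothetical; this is an illustration of the idea, not the definition's implementation.

```python
import re

# Hypothetical lookup, limited to the area codes in the examples above.
AREA_TO_GOVERNORATE = {"2": "القاهرة الكبرى", "3": "الاسكندرية", "84": "الفيوم"}


def with_governorate(text):
    """Return 'base-number governorate-name', or None if the area code is unknown."""
    digits = re.sub(r"[^0-9]", "", text)
    if text.strip().startswith("+20"):
        digits = digits[2:]          # drop the country code
    elif digits.startswith("0"):
        digits = digits[1:]          # drop the domestic trunk prefix
    elif len(digits) == 8:
        digits = "2" + digits        # eight-digit base numbers imply Cairo
    for area, name in AREA_TO_GOVERNORATE.items():
        base = digits[len(area):]
        # Cairo base numbers have eight digits; the others in this table have seven.
        expected = 8 if area == "2" else 7
        if digits.startswith(area) and len(base) == expected:
            return f"{base[:-4]} {base[-4:]} {name}"
    return None


print(with_governorate("084 499 9999"))
```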
Inherited Definitions
In addition to the definitions listed on this page, the Arabic, Egypt locale also inherits all definitions for the Arabic language and all Global definitions.
Doc ID: QKBCI_AREGY_defs.html |