SAS Quality Knowledge Base for Contact Information 25
Definitions for the Malay, Malaysia locale are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (Address) | ||
---|---|---|
Description | The Case definition for Proper (Address) propercases addresses. | |
Input | Output | |
Example | NO 2, JALAN SS2/14 po box 125 | No 2, Jalan SS2/14 PO Box 125 |
Remarks |
Proper (Name) | ||
---|---|---|
Description | The Case definition for Proper (Name) propercases names of individuals. | |
Input | Output | |
Examples | M. KUPPUSAMY A/L MUTHUSAMY | M. Kuppusamy a/l Muthusamy |
nurul binti normawati | Nurul binti Normawati | |
YB DATIN URVID | YB Datin Urvid | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Case definition for Proper (Organization) propercases organization names. | |
Input | Output | |
Examples | A-tech institute | A-Tech Institute |
BSN MERCHANT BANK BHD | BSN Merchant Bank Bhd | |
Remarks | This definition uses a list of known organization names to handle exceptions to propercasing rules. |
Name | ||
---|---|---|
Description | The Gender Analysis definition for Name determines the gender of a name. | |
Possible Outputs | M F U |
|
Input | Output | |
Examples | Prema Punita | F |
Kamal Lee bin Abdullah | M | |
S. Sothinathan | U | |
Remarks |
Name (Ethnicity) | ||
---|---|---|
Description | The Identification Analysis definition for Name (Ethnicity) identifies the ethnic background of an individual based on the individual's name. | |
Possible Outputs | M C E I P O |
|
Input | Output | |
Examples | Azwar Baharudin | M |
Nang Ching Teck | C | |
John Smith | E | |
Anupam Arjun | I | |
Vinu Singh | P | |
Janusz Wojdecki | O | |
Remarks | M = Malay C = Chinese E = Eurasian I = Indian P = Punjabi O = Other |
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | WISMA KLN, 10 JLN WONG AH FOOK | 0 |
10 Jalan Wong Ah Fook | 0 | |
2 JLN KASKAS | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | KL | 0 |
K. Lumpur | 0 | |
Kota Kinabalu | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 32 characters | |
Input | Cluster ID | |
Examples | 46200 Petaling Jaya Selangor Darul Ehsan | 0 |
46200,PJ,Sel. D.E. | 0 | |
50450, K.LUMPUR | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | Caroline Yong Mei-Lin | 0 |
Miss Mei-Lin Yong | 0 | |
Yong Mei-Lin | 0 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | SRIMANISA SDN BHD | 0 |
Agensi Pekerjaan Srimanisa Sdn Bhd | 0 | |
MCSB Systems Bhd | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers.. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | +603-64219754 | 0 |
60364219754 | 0 | |
03-79901655 | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples |
-46200 | 0 |
46200 | 0 | |
50450 | 1 | |
Remarks | Note: The results listed above reflect the default match sensitivity (85). |
Address | |||
---|---|---|---|
Description | The Parse definition for Address parses addresses into a set of tokens. | ||
Output Tokens | Unit Number Building Name Lot Number Street Type Street Name Additional Street Name Primary Neighborhood Secondary Neighborhood |
||
Input | Output | ||
Example | 5A Wisma Maria, 2, Jalan Kuchai 3, Taman Lian Hoe | Unit Number | 5A |
Building Name | Wisma Maria | ||
Lot Number | 2 | ||
Street Type | Jalan | ||
Street Name | Kuchai 3 | ||
Additional Street Name | |||
Primary Neighborhood | Taman Lian Hoe | ||
Secondary Neighborhood | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example | 5A Wisma Maria, 2, Jalan Kuchai 3, Taman Lian Hoe | Recipient | |
Building/Site | Wisma Maria | ||
Street | 2, Jalan Kuchai 3 | ||
Extension | 5A | ||
PO Box | |||
Additional Info | Taman Lian Hoe | ||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
Address (Global) (v23) | |||
---|---|---|---|
Description |
The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example | 5A Wisma Maria, 2, Jalan Kuchai 3, Taman Lian Hoe | Recipient | |
Building/Site | Wisma Maria | ||
Street | 2, Jalan Kuchai 3 | ||
Extension | 5A | ||
PO Box | |||
Additional Info | Taman Lian Hoe | ||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The Parse definition for City - State/Province - Postal Code parses address last line data into a set of tokens. | ||
Output Tokens | Postal Code Neighborhood City State |
||
Input | Output | ||
Example | 47400 Petaling Jaya Selangor Darul Ehsan | Postal Code | 47400 |
Neighborhood | |||
City | Petaling Jaya | ||
State | Selangor Darul Ehsan | ||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The Parse definition for City - State/Province - Postal Code (Global) parses address last line data into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Input | Output | ||
Example | 47400 Petaling Jaya Selangor Darul Ehsan | City | Petaling Jaya |
State/Province | Selangor Darul Ehsan | ||
Postal Code | 47400 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name | |||
---|---|---|---|
Description | The Parse definition for Name parses names of individuals into a set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | Miss Yong Mei-Lin | Prefix | Miss |
Given Name | Mei-Lin | ||
Middle Name | |||
Family Name | Yong | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 2 | En Burhan Basir a/l Asmi Basir | Prefix | En |
Given Name | Burhan Basir | ||
Middle Name | |||
Family Name | a/l Asmi Basir | ||
Suffix | |||
Title/Additional Info | |||
Remarks |
Name (Global) | |||
---|---|---|---|
Description | The Parse definition for Name (Global) parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | Miss Yong Mei-Lin | Prefix | Miss |
Given Name | Mei-Lin | ||
Middle Name | |||
Family Name | Yong | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 2 | En Burhan Basir a/l Asmi Basir | Prefix | En |
Given Name | Burhan Basir | ||
Middle Name | |||
Family Name | a/l Asmi Basir | ||
Suffix | |||
Title/Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Organization | |||
---|---|---|---|
Description | The Parse definition for Organization parses company and organization information into a set of tokens. | ||
Output Tokens | Name Legal Form Registration Number Site Additional Info |
||
Input | Output | ||
Example | SAS Institute Sdn Bhd | Name | SAS Institute |
Legal Form | Sdn Bhd | ||
Registration Number | |||
Site | |||
Additional Info | |||
Remarks |
Phone | |||
---|---|---|---|
Description | The Parse definition for Phone parses phone numbers into a set of tokens. | ||
Output Tokens | Prefix Country Code Area Code Base Number Extension |
||
Input | Output | ||
Example | (603) 7981-4655 | Prefix | |
Country Code | 60 | ||
Area Code | 3 | ||
Base Number | 7981-4655 | ||
Extension | |||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | The Parse definition for Phone (Global) parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example | (603) 7981-4655 | Country Code | 6 0 |
Area Code | 3 | ||
Base Number | 7981 4655 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
None.
Address | ||
---|---|---|
Description | The Standardization definition for Address standardizes addresses. | |
Input | Output | |
Examples | 2, Jln Kuchai 3 Tmn Lian Hoe | 2, Jalan Kuchai 3, Taman Lian Hoe |
jln ss15/5a |
Jalan SS15/5a | |
Remarks |
City | ||
---|---|---|
Description | The Standardization definition for City standardizes city names. | |
Input | Output | |
Examples | kl | Kuala Lumpur |
Jhr. | Johor | |
Remarks | Common city abbreviations are expanded into full names. |
City - State/Province - Postal Code | ||
---|---|---|
Description | The Standardization definition for City - State/Province - Postal Code standardizes address last line data. | |
Input | Output | |
Example | KL 54000, KL | 54000 Kuala Lumpur, Wilayah Persekutuan |
Remarks |
Name | ||
---|---|---|
Description | The Standardization definition for Name standardizes names of individuals. | |
Input | Output | |
Examples | Miss Mei-Lin Yong | Cik Mei-Lin Yong |
EN DIVYENDU EKNATH | Encik Divyendu Eknath | |
Remarks |
Organization | ||
---|---|---|
Description | The Standardization definition for Organization standardizes organization names. | |
Input | Output | |
Examples | SAS Institute Sdn Berhad | SAS Institute Sdn Bhd |
B.I.M.I.T. COLLEGE | BIMIT College | |
Remarks |
Phone | ||
---|---|---|
Description | The Standardization definition for Phone standardizes phone numbers. | |
Input | Output | |
Examples | (603) 7981-4655 | +603 79814655 |
+6 019-8222123 | +6019 8222123 | |
Remarks |
Postal Code | ||
---|---|---|
Description | The Standardization definition for Postal Code standardizes postal codes. | |
Input | Output | |
Examples | 50450. | 50450 |
-50450 | 50450 | |
Remarks |
In addition to the definitions listed on this page, the Malay, Malaysia locale also inherits all definitions for the Malay language and all Global definitions.
Documentation Feedback: yourturn@sas.com
|
Doc ID: QKBCI_MSMYS_defs.html |