SAS Quality Knowledge Base for Contact Information 25

Malay, Malaysia Definitions

Definitions for the Malay, Malaysia locale are described below.

Case Definitions

Gender Analysis Definitions

Identification Analysis Definitions

Match Definitions

Parse Definitions

Pattern Analysis Definitions

Standardization Definitions

Inherited Definitions

Case Definitions

Proper (Address)
Description The Case definition for Proper (Address) propercases addresses.
  Input Output
Example NO 2, JALAN SS2/14 po box 125 No 2, Jalan SS2/14 PO Box 125
Remarks  

 

Proper (Name)
Description The Case definition for Proper (Name) propercases names of individuals.
  Input Output
Examples M. KUPPUSAMY A/L MUTHUSAMY M. Kuppusamy a/l Muthusamy
nurul binti normawati Nurul binti Normawati
YB DATIN URVID YB Datin Urvid
Remarks  

 

Proper (Organization)
Description The Case definition for Proper (Organization) propercases organization names.
  Input Output
Examples A-tech institute A-Tech Institute
BSN MERCHANT BANK BHD BSN Merchant Bank Bhd
Remarks This definition uses a list of known organization names to handle exceptions to propercasing rules.

Gender Analysis Definitions

Name
Description The Gender Analysis definition for Name determines the gender of a name.
Possible Outputs M
F
U
  Input Output
Examples Prema Punita F
Kamal Lee bin Abdullah M
S. Sothinathan U
Remarks  

Identification Analysis Definitions

Name (Ethnicity)
Description The Identification Analysis definition for Name (Ethnicity) identifies the ethnic background of an individual based on the individual's name.
Possible Outputs M
C
E
I
P
O
  Input Output
Examples Azwar Baharudin M
Nang Ching Teck C
John Smith E
Anupam Arjun I
Vinu Singh P
Janusz Wojdecki O
Remarks M = Malay
C = Chinese
E = Eurasian
I = Indian
P = Punjabi
O = Other
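
When the single-letter output is stored downstream, it can help to translate it back into the labels listed in the remarks above. The following is a minimal sketch in Python, not part of the QKB; the names `ETHNICITY_CODES` and `describe_ethnicity` are hypothetical and introduced only for illustration.

```python
# Labels taken from the Remarks of the Name (Ethnicity) definition.
ETHNICITY_CODES = {
    "M": "Malay",
    "C": "Chinese",
    "E": "Eurasian",
    "I": "Indian",
    "P": "Punjabi",
    "O": "Other",
}


def describe_ethnicity(code: str) -> str:
    """Return the label for a Name (Ethnicity) output code (hypothetical helper)."""
    key = code.strip().upper()
    if key not in ETHNICITY_CODES:
        raise ValueError(f"unknown ethnicity code: {code!r}")
    return ETHNICITY_CODES[key]
```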

Match Definitions

Address
Description The Address match definition generates match codes which can be used to cluster records containing addresses.
Max Length of Match Code 20 characters
  Input Cluster ID
Examples WISMA KLN, 10 JLN WONG AH FOOK 0
10 Jalan Wong Ah Fook 0
2 JLN KASKAS 1
Remarks Note: The results listed above reflect the default match sensitivity (85).

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples KL 0
K. Lumpur 0
Kota Kinabalu 1
Remarks Note: The results listed above reflect the default match sensitivity (85).

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 32 characters
  Input Cluster ID
Examples 46200 Petaling Jaya Selangor Darul Ehsan 0
46200,PJ,Sel. D.E. 0
50450, K.LUMPUR 1
Remarks Note: The results listed above reflect the default match sensitivity (85).

 

Name
Description The Name match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 20 characters
  Input Cluster ID
Examples Caroline Yong Mei-Lin 0
Miss Mei-Lin Yong 0
Yong Mei-Lin 0
Remarks Note: The results listed above reflect the default match sensitivity (85).

 

Organization
Description The Organization match definition generates match codes which can be used to cluster records containing organization names.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples SRIMANISA SDN BHD 0
Agensi Pekerjaan Srimanisa Sdn Bhd 0
MCSB Systems Bhd 1
Remarks Note: The results listed above reflect the default match sensitivity (85).

 

Phone
Description The Phone match definition generates match codes which can be used to cluster records containing phone numbers.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples +603-64219754 0
60364219754 0
03-79901655 1
Remarks Note: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples -46200 0
46200 0
50450 1
Remarks Note: The results listed above reflect the default match sensitivity (85).
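
The Cluster ID columns in the match examples follow from a simple rule: records that receive the same match code fall into the same cluster. The following Python sketch illustrates only that grouping step. It does not generate match codes (that is done by the QKB); the codes `"CODE_A"` and `"CODE_B"` are placeholders, and the function name `assign_cluster_ids` is hypothetical.

```python
from typing import Dict, List, Tuple


def assign_cluster_ids(records: List[Tuple[str, str]]) -> List[Tuple[str, int]]:
    """Map (input value, match code) pairs to (input value, cluster ID).

    Cluster IDs are handed out in order of first appearance of each match code.
    """
    ids_by_code: Dict[str, int] = {}
    clustered = []
    for value, match_code in records:
        if match_code not in ids_by_code:
            ids_by_code[match_code] = len(ids_by_code)
        clustered.append((value, ids_by_code[match_code]))
    return clustered


# Placeholder match codes: the first two postal codes are assumed to share a code.
rows = [("-46200", "CODE_A"), ("46200", "CODE_A"), ("50450", "CODE_B")]
clusters = assign_cluster_ids(rows)
```

With the placeholder codes above, the first two rows share cluster 0 and the third row gets cluster 1, mirroring the Postal Code example.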

Parse Definitions

Address
Description The Parse definition for Address parses addresses into a set of tokens.
Output Tokens Unit Number
Building Name
Lot Number
Street Type
Street Name
Additional Street Name
Primary Neighborhood
Secondary Neighborhood
  Input Output
Example 5A Wisma Maria, 2, Jalan Kuchai 3, Taman Lian Hoe Unit Number 5A
Building Name Wisma Maria
Lot Number 2
Street Type Jalan
Street Name Kuchai 3
Additional Street Name  
Primary Neighborhood Taman Lian Hoe
Secondary Neighborhood  
Remarks  

 

Address (Global)
Description The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 5A Wisma Maria, 2, Jalan Kuchai 3, Taman Lian Hoe Recipient  
Building/Site Wisma Maria
Street 2, Jalan Kuchai 3
Extension 5A
PO Box  
Additional Info Taman Lian Hoe
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), it is suggested that you change them back to use Address (Global).

 

Address (Global) (v23)
Description The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 5A Wisma Maria, 2, Jalan Kuchai 3, Taman Lian Hoe Recipient  
Building/Site Wisma Maria
Street 2, Jalan Kuchai 3
Extension 5A
PO Box  
Additional Info Taman Lian Hoe
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), it is suggested that you change them back to use Address (Global).

 

City - State/Province - Postal Code
Description The Parse definition for City - State/Province - Postal Code parses address last line data into a set of tokens.
Output Tokens Postal Code
Neighborhood
City
State
  Input Output
Example 47400 Petaling Jaya Selangor Darul Ehsan Postal Code 47400
Neighborhood  
City Petaling Jaya
State Selangor Darul Ehsan
Remarks  

 

City - State/Province - Postal Code (Global)
Description The Parse definition for City - State/Province - Postal Code (Global) parses address last line data into a globally recognized set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
  Input Output
Example 47400 Petaling Jaya Selangor Darul Ehsan City Petaling Jaya
State/Province Selangor Darul Ehsan
Postal Code 47400
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.
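
Because Global parse definitions return the same tokens in every locale, their results fit one shared record layout. The following Python sketch shows one way to model that layout for the City - State/Province - Postal Code (Global) tokens. The class name `LastLineGlobal` and its field names are hypothetical; only the token names and the example values come from this page.

```python
from dataclasses import asdict, dataclass


@dataclass
class LastLineGlobal:
    """One field per output token of City - State/Province - Postal Code (Global)."""
    city: str = ""
    state_province: str = ""
    postal_code: str = ""
    additional_info: str = ""


# Token values from the example above; Additional Info is empty in that example.
row = LastLineGlobal(
    city="Petaling Jaya",
    state_province="Selangor Darul Ehsan",
    postal_code="47400",
)

# asdict() yields the same column names for every row, whatever locale produced it.
columns = list(asdict(row).keys())
```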

 

Name
Description The Parse definition for Name parses names of individuals into a set of tokens.
Output Tokens Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info
  Input Output
Example 1 Miss Yong Mei-Lin Prefix Miss
Given Name Mei-Lin
Middle Name  
Family Name Yong
Suffix  
Title/Additional Info  
  Input Output
Example 2 En Burhan Basir a/l Asmi Basir Prefix En
Given Name Burhan Basir
Middle Name  
Family Name a/l Asmi Basir
Suffix  
Title/Additional Info  
Remarks  

 

Name (Global)
Description The Parse definition for Name (Global) parses names of individuals into a globally recognized set of tokens.
Output Tokens Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info
  Input Output
Example 1 Miss Yong Mei-Lin Prefix Miss
Given Name Mei-Lin
Middle Name  
Family Name Yong
Suffix  
Title/Additional Info  
  Input Output
Example 2 En Burhan Basir a/l Asmi Basir Prefix En
Given Name Burhan Basir
Middle Name  
Family Name a/l Asmi Basir
Suffix  
Title/Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Organization
Description The Parse definition for Organization parses company and organization information into a set of tokens.
Output Tokens Name
Legal Form
Registration Number
Site
Additional Info
  Input Output
Example SAS Institute Sdn Bhd Name SAS Institute
Legal Form Sdn Bhd
Registration Number  
Site  
Additional Info  
Remarks  

 

Phone
Description The Parse definition for Phone parses phone numbers into a set of tokens.
Output Tokens Prefix
Country Code
Area Code
Base Number
Extension
  Input Output
Example (603) 7981-4655 Prefix  
Country Code 60
Area Code 3
Base Number 7981-4655
Extension  
Remarks  

 

Phone (Global)
Description The Parse definition for Phone (Global) parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
  Input Output
Example (603) 7981-4655 Country Code 60
Area Code 3
Base Number 7981 4655
Extension  
Line Type  
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description The Standardization definition for Address standardizes addresses.
  Input Output
Examples 2, Jln Kuchai 3 Tmn Lian Hoe 2, Jalan Kuchai 3, Taman Lian Hoe
jln ss15/5a Jalan SS15/5a
Remarks  

 

City
Description The Standardization definition for City standardizes city names.
  Input Output
Examples kl Kuala Lumpur
Jhr. Johor
Remarks Common city abbreviations are expanded into full names.

 

City - State/Province - Postal Code
Description The Standardization definition for City - State/Province - Postal Code standardizes address last line data.
  Input Output
Example KL 54000, KL 54000 Kuala Lumpur, Wilayah Persekutuan
Remarks  

 

Name
Description The Standardization definition for Name standardizes names of individuals.
  Input Output
Examples Miss Mei-Lin Yong Cik Mei-Lin Yong
EN DIVYENDU EKNATH Encik Divyendu Eknath
Remarks  

 

Organization
Description The Standardization definition for Organization standardizes organization names.
  Input Output
Examples SAS Institute Sdn Berhad SAS Institute Sdn Bhd
B.I.M.I.T. COLLEGE BIMIT College
Remarks  

 

Phone
Description The Standardization definition for Phone standardizes phone numbers.
  Input Output
Examples (603) 7981-4655 +603 79814655
+6 019-8222123 +6019 8222123
Remarks  
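
The two examples above suggest an output shape of a plus sign, the country code 60 and area or mobile prefix, a space, and the remaining digits. The following Python sketch reproduces only those two examples. It is a toy approximation, not the QKB's logic: the function name `standardize_phone_sketch` is hypothetical, and the rule for splitting the prefix (two digits when it starts with 1, otherwise one digit) is a simplifying assumption.

```python
import re


def standardize_phone_sketch(raw: str) -> str:
    """Toy normalizer for Malaysian numbers that already include country code 60."""
    digits = re.sub(r"\D", "", raw)  # keep digits only
    if not digits.startswith("60"):
        raise ValueError("sketch only handles numbers that start with country code 60")
    national = digits[2:]
    # Assumption: prefixes starting with 1 are two digits long; all others one digit.
    prefix_len = 2 if national.startswith("1") else 1
    return f"+60{national[:prefix_len]} {national[prefix_len:]}"
```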

 

Postal Code
Description The Standardization definition for Postal Code standardizes postal codes.
  Input Output
Examples 50450. 50450
-50450 50450
Remarks  
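
In both examples above, stray punctuation around a five-digit postal code is dropped. The following Python sketch approximates that behavior for simple cases. It is not the QKB's implementation; the function name `standardize_postal_code_sketch` is hypothetical, and returning unrecognized input unchanged (apart from trimming whitespace) is an assumption.

```python
import re

# Exactly five consecutive digits, not embedded in a longer run of digits.
_FIVE_DIGITS = re.compile(r"(?<!\d)\d{5}(?!\d)")


def standardize_postal_code_sketch(raw: str) -> str:
    """Return the first standalone five-digit group, or the trimmed input if none."""
    match = _FIVE_DIGITS.search(raw)
    return match.group(0) if match else raw.strip()
```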

Inherited Definitions

In addition to the definitions listed on this page, the Malay, Malaysia locale also inherits all definitions for the Malay language and all Global definitions.