SAS Quality Knowledge Base for Contact Information 27

Name

Match Definition

Name
Description The Name match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 21 characters
Example 1 (Sensitivities
90-100)
Input Cluster ID
李友琴先生 1
李友琴 1
李友勤女士 2
黎友琴(总经理) 3
LIYOUQIN 4
Remarks The family name and given name are evaluated. The 5-bit pinyin code to the Family Name token and the 5-bit pinyin code to the Given Name token are applied. The first 11 digits for family name pinyin code and the 12th-21st digits for given name pinyin code are evaluated.
Example 2 (Sensitivities
85-89)
Input Cluster ID
李期勤 1
李期 2
李奇 3
黎友 4
Remarks The family name and given name are evaluated. The 5-bit pinyin code to the Family Name token and the 5-bit pinyin code to the Given Name token are applied. The first 10 digits for family name pinyin code and the 12th-21st digits for given name pinyin code are evaluated.
Example 3 (Sensitivities
80-84)
Input Cluster ID
李友琴 1
李友勤 1
黎友琴 2
LIYOUQIN 3
黎又青 4
Remarks The family name and given name are evaluated. The 5-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 9 digits for family name pinyin code and the 12th-17th digits for given name pinyin code are evaluated.
Example 4 (Sensitivities
75-79)
Input Cluster ID
黎友琴 1
黎又青 1
李期勤 2
李期 3
黎友 4
Remarks The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 8 digits for family name pinyin code and the 12th-16th digits for given name pinyin code are evaluated.
Example 5 (Sensitivities
70-74)
Input Cluster ID
李期 1
李奇 1
黎友 2
欧阳修 3
欧阳休期 4
Remarks The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 7 digits for family name pinyin code and the 12th-15th digits for given name pinyin code are evaluated.
Example 6 (Sensitivities
65-69)
Input Cluster ID
李期勤 1
李期 1
李奇 1
欧阳修 2
欧阳休期 2
Remarks The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 6 digits for family name pinyin code and the 12th-14th digits for given name pinyin code are evaluated.
Example 7 (Sensitivities
60-64)
Input Cluster ID
李友琴 1
李友勤 1
黎友 1
李期勤 2
李奇 2
Remarks The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 5 digits for family name pinyin code and the 12th-13th digits for given name pinyin code are evaluated.
Example 8 (Sensitivities
55-59)
Input Cluster ID
李奇 1
黎友 2
Remarks The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 4 digits for family name pinyin code and the 12th digits for given name pinyin code are evaluated.
Example 9 (Sensitivities
50-54)
Input Cluster ID
李奇 1
黎友 1
Remarks Only the family name is evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 3 digits for family name pinyin code are evaluated.