SAS Quality Knowledge Base for Contact Information 27
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 21 characters | |
Example 1 (Sensitivities 90-100) |
Input | Cluster ID |
李友琴先生 | 1 | |
李友琴 | 1 | |
李友勤女士 | 2 | |
黎友琴(总经理) | 3 | |
LIYOUQIN | 4 | |
Remarks | The family name and given name are evaluated. The 5-bit pinyin code to the Family Name token and the 5-bit pinyin code to the Given Name token are applied. The first 11 digits for family name pinyin code and the 12th-21st digits for given name pinyin code are evaluated. | |
Example 2 (Sensitivities 85-89) |
Input | Cluster ID |
李期勤 | 1 | |
李期 | 2 | |
李奇 | 3 | |
黎友 | 4 | |
Remarks | The family name and given name are evaluated. The 5-bit pinyin code to the Family Name token and the 5-bit pinyin code to the Given Name token are applied. The first 10 digits for family name pinyin code and the 12th-21st digits for given name pinyin code are evaluated. | |
Example 3 (Sensitivities 80-84) |
Input | Cluster ID |
李友琴 | 1 | |
李友勤 | 1 | |
黎友琴 | 2 | |
LIYOUQIN | 3 | |
黎又青 | 4 | |
Remarks | The family name and given name are evaluated. The 5-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 9 digits for family name pinyin code and the 12th-17th digits for given name pinyin code are evaluated. | |
Example 4 (Sensitivities 75-79) |
Input | Cluster ID |
黎友琴 | 1 | |
黎又青 | 1 | |
李期勤 | 2 | |
李期 | 3 | |
黎友 | 4 | |
Remarks | The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 8 digits for family name pinyin code and the 12th-16th digits for given name pinyin code are evaluated. | |
Example 5 (Sensitivities 70-74) |
Input | Cluster ID |
李期 | 1 | |
李奇 | 1 | |
黎友 | 2 | |
欧阳修 | 3 | |
欧阳休期 | 4 | |
Remarks | The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 7 digits for family name pinyin code and the 12th-15th digits for given name pinyin code are evaluated. | |
Example 6 (Sensitivities 65-69) |
Input | Cluster ID |
李期勤 | 1 | |
李期 | 1 | |
李奇 | 1 | |
欧阳修 | 2 | |
欧阳休期 | 2 | |
Remarks | The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 6 digits for family name pinyin code and the 12th-14th digits for given name pinyin code are evaluated. | |
Example 7 (Sensitivities 60-64) |
Input | Cluster ID |
李友琴 | 1 | |
李友勤 | 1 | |
黎友 | 1 | |
李期勤 | 2 | |
李奇 | 2 | |
Remarks | The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 5 digits for family name pinyin code and the 12th-13th digits for given name pinyin code are evaluated. | |
Example 8 (Sensitivities 55-59) |
Input | Cluster ID |
李奇 | 1 | |
黎友 | 2 | |
Remarks | The family name and given name are evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 4 digits for family name pinyin code and the 12th digits for given name pinyin code are evaluated. | |
Example 9 (Sensitivities 50-54) |
Input | Cluster ID |
李奇 | 1 | |
黎友 | 1 | |
Remarks | Only the family name is evaluated. The 3-bit pinyin code to the Family Name token and the 3-bit pinyin code to the Given Name token are applied. The first 3 digits for family name pinyin code are evaluated. |
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_ZHCHN_Match_Name.html |