SAS Quality Knowledge Base for Contact Information 27
| Name | ||
|---|---|---|
| Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
| Max Length of Match Code | 21 characters | |
| Example 1 (Sensitivities 90-100) | Input | Cluster ID |
| | 李友琴先生 | 1 |
| | 李友琴 | 1 |
| | 李友勤女士 | 2 |
| | 黎友琴(总经理) | 3 |
| | LIYOUQIN | 4 |
| Remarks | Both the family name and the given name are evaluated. A 5-bit pinyin code is applied to the Family Name token and a 5-bit pinyin code to the Given Name token. In the match code, the first 11 digits (family name pinyin code) and the 12th through 21st digits (given name pinyin code) are evaluated. | |
| Example 2 (Sensitivities 85-89) | Input | Cluster ID |
| | 李期勤 | 1 |
| | 李期 | 2 |
| | 李奇 | 3 |
| | 黎友 | 4 |
| Remarks | Both the family name and the given name are evaluated. A 5-bit pinyin code is applied to the Family Name token and a 5-bit pinyin code to the Given Name token. In the match code, the first 10 digits (family name pinyin code) and the 12th through 21st digits (given name pinyin code) are evaluated. | |
| Example 3 (Sensitivities 80-84) | Input | Cluster ID |
| | 李友琴 | 1 |
| | 李友勤 | 1 |
| | 黎友琴 | 2 |
| | LIYOUQIN | 3 |
| | 黎又青 | 4 |
| Remarks | Both the family name and the given name are evaluated. A 5-bit pinyin code is applied to the Family Name token and a 3-bit pinyin code to the Given Name token. In the match code, the first 9 digits (family name pinyin code) and the 12th through 17th digits (given name pinyin code) are evaluated. | |
| Example 4 (Sensitivities 75-79) | Input | Cluster ID |
| | 黎友琴 | 1 |
| | 黎又青 | 1 |
| | 李期勤 | 2 |
| | 李期 | 3 |
| | 黎友 | 4 |
| Remarks | Both the family name and the given name are evaluated. A 3-bit pinyin code is applied to the Family Name token and a 3-bit pinyin code to the Given Name token. In the match code, the first 8 digits (family name pinyin code) and the 12th through 16th digits (given name pinyin code) are evaluated. | |
| Example 5 (Sensitivities 70-74) | Input | Cluster ID |
| | 李期 | 1 |
| | 李奇 | 1 |
| | 黎友 | 2 |
| | 欧阳修 | 3 |
| | 欧阳休期 | 4 |
| Remarks | Both the family name and the given name are evaluated. A 3-bit pinyin code is applied to the Family Name token and a 3-bit pinyin code to the Given Name token. In the match code, the first 7 digits (family name pinyin code) and the 12th through 15th digits (given name pinyin code) are evaluated. | |
| Example 6 (Sensitivities 65-69) | Input | Cluster ID |
| | 李期勤 | 1 |
| | 李期 | 1 |
| | 李奇 | 1 |
| | 欧阳修 | 2 |
| | 欧阳休期 | 2 |
| Remarks | Both the family name and the given name are evaluated. A 3-bit pinyin code is applied to the Family Name token and a 3-bit pinyin code to the Given Name token. In the match code, the first 6 digits (family name pinyin code) and the 12th through 14th digits (given name pinyin code) are evaluated. | |
| Example 7 (Sensitivities 60-64) | Input | Cluster ID |
| | 李友琴 | 1 |
| | 李友勤 | 1 |
| | 黎友 | 1 |
| | 李期勤 | 2 |
| | 李奇 | 2 |
| Remarks | Both the family name and the given name are evaluated. A 3-bit pinyin code is applied to the Family Name token and a 3-bit pinyin code to the Given Name token. In the match code, the first 5 digits (family name pinyin code) and the 12th and 13th digits (given name pinyin code) are evaluated. | |
| Example 8 (Sensitivities 55-59) | Input | Cluster ID |
| | 李奇 | 1 |
| | 黎友 | 2 |
| Remarks | Both the family name and the given name are evaluated. A 3-bit pinyin code is applied to the Family Name token and a 3-bit pinyin code to the Given Name token. In the match code, the first 4 digits (family name pinyin code) and the 12th digit (given name pinyin code) are evaluated. | |
| Example 9 (Sensitivities 50-54) | Input | Cluster ID |
| | 李奇 | 1 |
| | 黎友 | 1 |
| Remarks | Only the family name is evaluated. A 3-bit pinyin code is applied to the Family Name token and a 3-bit pinyin code to the Given Name token. In the match code, the first 3 digits (family name pinyin code) are evaluated. | |
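
The Remarks rows can be summarized as a lookup from sensitivity band to the digit positions that are compared. The following Python sketch is illustrative only and is not part of the QKB: the names `SENSITIVITY_BANDS`, `evaluated_digits`, and `mask_code` are hypothetical, the digit ranges are read as 1-based positions within the 21-character match code (an assumption based on the Max Length row), and the sample string is a placeholder rather than a real match code.

```python
from typing import List, NamedTuple, Optional, Tuple

Span = Tuple[int, int]  # 1-based, inclusive positions within the match code


class EvaluatedDigits(NamedTuple):
    family: Span           # positions holding the family name pinyin code
    given: Optional[Span]  # None when the given name is not evaluated


# (lowest, highest) sensitivity -> evaluated positions, copied from the Remarks rows.
SENSITIVITY_BANDS = [
    ((90, 100), EvaluatedDigits((1, 11), (12, 21))),
    ((85, 89), EvaluatedDigits((1, 10), (12, 21))),
    ((80, 84), EvaluatedDigits((1, 9), (12, 17))),
    ((75, 79), EvaluatedDigits((1, 8), (12, 16))),
    ((70, 74), EvaluatedDigits((1, 7), (12, 15))),
    ((65, 69), EvaluatedDigits((1, 6), (12, 14))),
    ((60, 64), EvaluatedDigits((1, 5), (12, 13))),
    ((55, 59), EvaluatedDigits((1, 4), (12, 12))),
    ((50, 54), EvaluatedDigits((1, 3), None)),
]


def evaluated_digits(sensitivity: int) -> EvaluatedDigits:
    """Return the evaluated positions for a sensitivity listed in the examples."""
    for (low, high), digits in SENSITIVITY_BANDS:
        if low <= sensitivity <= high:
            return digits
    raise ValueError(f"sensitivity {sensitivity} is not covered by the examples")


def mask_code(code: str, sensitivity: int) -> str:
    """Keep only the evaluated positions of a code (hypothetical helper)."""
    digits = evaluated_digits(sensitivity)
    spans: List[Span] = [digits.family]
    if digits.given is not None:
        spans.append(digits.given)
    # Convert 1-based inclusive spans to Python slices.
    return "".join(code[start - 1:end] for start, end in spans)


# Placeholder 21-character string, NOT a real match code.
placeholder = "ABCDEFGHIJKLMNOPQRSTU"
print(mask_code(placeholder, 55))  # family positions 1-4 plus given position 12
print(mask_code(placeholder, 50))  # family positions 1-3 only
```

Under this reading, two records fall into the same cluster at a given sensitivity when their codes agree on every evaluated position, which is why lower sensitivities group more of the sample names together.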
Doc ID: QKBCI_ZHCHN_Match_Name.html