Name
Match Definition
Name | |||
---|---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | ||
Max Length of Match Code | 25 characters | ||
Example 1 Sensitivities 90-100 |
ID | Input | Cluster ID |
1 | 鈴木一郎様 | 1 | |
2 | 鈴木一郎 | 1 | |
3 | 鈴木いちろう | 2 | |
4 | すずきいちろう | 3 | |
5 | 医学博士すずきいちろう殿 | 3 | |
6 | 営業課長スズキイチロウ (一級建築士) | 4 | |
7 | スズキイチロウ | 4 | |
8 | Suzuki Ichiro | 5 | |
9 | Yamagawa Hiroshi | 6 | |
For sensitivities 90-100, Family Name and Given Name information is evaluated. Half-width and full-width Katakana are matched. | |||
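To illustrate the width matching described above, here is a minimal Python sketch that folds half-width Katakana into full-width Katakana using Unicode NFKC normalization. It is only an illustration under stated assumptions, not the match definition's actual implementation, and the function name `fold_width` is hypothetical.

```python
import unicodedata


def fold_width(name: str) -> str:
    """Return the name with half-width Katakana folded to full-width.

    NFKC also recombines a half-width base character and its separate
    voiced-sound mark into a single full-width character.
    Note: NFKC folds other compatibility forms too (for example,
    full-width Latin letters), which this sketch accepts as a side effect.
    """
    return unicodedata.normalize("NFKC", name)


half_width = "ｽｽﾞｷｲﾁﾛｳ"       # half-width Katakana spelling
full_width = "スズキイチロウ"  # full-width Katakana, as in ID 7
print(fold_width(half_width) == full_width)
```

After folding, both spellings compare equal, so they would fall into the same cluster in a scheme that groups records by the folded string.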
Example 2 Sensitivities 85-89 |
ID | Input | Cluster ID |
1 | 鈴木一郎様 | 1 | |
2 | 鈴木一郎 | 1 | |
3 | 鈴木いちろう | 2 | |
4 | すずきいちろう | 3 | |
5 | 医学博士すずきいちろう殿 | 3 | |
6 | 営業課長スズキイチロウ (一級建築士) | 3 | |
7 | スズキイチロウ | 3 | |
8 | Suzuki Ichiro | 3 | |
9 | Yamagawa Hiroshi | 4 | |
For sensitivities 85-89, Family Name and Given Name information is evaluated. Half-width and full-width Katakana are matched. Hiragana, Katakana, and Romaji are matched. | |||
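The cross-script matching in this sensitivity range can be pictured as reducing each spelling to a common reading key. The Python sketch below is a rough illustration only: the syllable table is hypothetical and covers just the sample name, the long-vowel handling is a crude assumption, and titles such as those in IDs 5 and 6 are not stripped. It does not reflect how the match definition is actually implemented.

```python
# Hypothetical, deliberately tiny syllable table: just enough for the sample name.
KATAKANA_TO_ROMAJI = {
    "ス": "su", "ズ": "zu", "キ": "ki",
    "イ": "i", "チ": "chi", "ロ": "ro", "ウ": "u",
}


def hiragana_to_katakana(text: str) -> str:
    """Shift Hiragana (U+3041..U+3096) to the matching Katakana code points."""
    return "".join(
        chr(ord(c) + 0x60) if "\u3041" <= c <= "\u3096" else c for c in text
    )


def reading_key(text: str) -> str:
    """Build a script-independent key for a name written in Kana or Romaji."""
    kana = hiragana_to_katakana(text)
    if all(c in KATAKANA_TO_ROMAJI for c in kana):
        folded = "".join(KATAKANA_TO_ROMAJI[c] for c in kana)
    else:
        # Assume the text is Romaji (or contains Kanji): lowercase it and
        # drop whitespace, leaving any Kanji untouched.
        folded = "".join(kana.lower().split())
    # Crude long-vowel collapse so that "ichirou" and "ichiro" agree.
    return folded.replace("ou", "o")


for name in ["すずきいちろう", "スズキイチロウ", "Suzuki Ichiro", "Yamagawa Hiroshi"]:
    print(name, "->", reading_key(name))
```

In this sketch the three spellings of the same name share one key, while `鈴木いちろう` keeps its Kanji family name and therefore stays in a separate group, in line with the table above.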
Example 3 Sensitivities 50-84 |
ID | Input | Cluster ID |
1 | 鈴木一郎様 | 1 | |
2 | 鈴木一郎 | 1 | |
3 | 鈴木いちろう | 2 | |
4 | すずきいちろう | 2 | |
5 | 医学博士すずきいちろう殿 | 2 | |
6 | 営業課長スズキイチロウ (一級建築士) | 2 | |
7 | スズキイチロウ | 2 | |
8 | Suzuki Ichiro | 2 | |
9 | Yamagawa Hiroshi | 3 | |
For sensitivities 50-84, Family Name and Given Name information is evaluated. Half-width and full-width Katakana are matched. For Family Name, Hiragana, Katakana, Romaji, and Kanji are matched. For Given Name, Hiragana, Katakana, and Romaji are matched. | |||
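The different treatment of Family Name and Given Name at these sensitivities can be sketched in Python as follows. This assumes the name has already been split into family and given tokens and that a Kanji-to-reading lookup exists. `FAMILY_READINGS` and `name_key` are hypothetical names invented for this illustration, Romaji handling is left out for brevity, and none of this is the match definition's actual logic.

```python
# Hypothetical lookup from Kanji family names to a Katakana reading.
FAMILY_READINGS = {"鈴木": "スズキ"}


def to_katakana(text: str) -> str:
    """Shift Hiragana (U+3041..U+3096) to the matching Katakana code points."""
    return "".join(
        chr(ord(c) + 0x60) if "\u3041" <= c <= "\u3096" else c for c in text
    )


def name_key(family: str, given: str) -> tuple[str, str]:
    """Build a comparison key from pre-parsed family and given name tokens."""
    # Family Name: a Kanji spelling is replaced by its reading when known.
    family_key = FAMILY_READINGS.get(family, to_katakana(family))
    # Given Name: only Kana scripts are folded; Kanji stays as written.
    given_key = to_katakana(given)
    return family_key, given_key


print(name_key("鈴木", "いちろう") == name_key("すずき", "いちろう"))
print(name_key("鈴木", "一郎") == name_key("鈴木", "いちろう"))
```

The first comparison succeeds because the Kanji family name is mapped to its reading. The second fails because the Kanji given name is not, which mirrors IDs 1 and 2 staying in their own cluster in Example 3.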
Remarks |
If this definition is applied to preparsed data, the following input tokens are available:
Whenever possible, map a corresponding data field to each available token. |