SAS Quality Knowledge Base for Contact Information 27
| Name (with Suggestions) | ||
|---|---|---|
| Description |
The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals. |
|
| Max Length of Match Code | 26 characters | |
| Examples | Input | Cluster ID |
| CHANTAL AMILHAT | 0 | |
| CHANATL AMILHAT | 0 | |
| BERNARD AMILHAT | 1 | |
| BERNARD AMIRHAT | 1 | |
| BRIGITTE LASSALLE | 2 | |
| BRIGTTE LASSALLE | 2 | |
| NATH GUIRAUD | 3 | |
| NATHAIE GUIRAUD | 3 | |
| Remarks |
|
|
|
This match definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string; this enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name CHANTAL might match the name CHANATL, or the name NATHAIE might match the name NATH. Note that a consequence of the generation of multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition. For more information on suggestion-based matching, refer to the Suggestion-Based Matching section of the DataFlux Data Management Studio online Help. |
||
| Some data used in support of this definition were provided by the National Institute for Statistics and Economic Studies in France on February 11, 2013. | ||
|
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_FRFRA_Match_Name-withSuggestions.html |