SAS Quality Knowledge Base for Contact Information 27
Name (with Suggestions) | ||
---|---|---|
Description |
The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals. |
|
Max Length of Match Code | 26 characters | |
Examples | Input | Cluster ID |
CHANTAL AMILHAT | 0 | |
CHANATL AMILHAT | 0 | |
BERNARD AMILHAT | 1 | |
BERNARD AMIRHAT | 1 | |
BRIGITTE LASSALLE | 2 | |
BRIGTTE LASSALLE | 2 | |
NATH GUIRAUD | 3 | |
NATHAIE GUIRAUD | 3 | |
Remarks |
|
|
This match definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string; this enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name CHANTAL might match the name CHANATL, or the name NATHAIE might match the name NATH. Note that a consequence of the generation of multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition. For more information on suggestion-based matching, refer to the Suggestion-Based Matching section of the DataFlux Data Management Studio online Help. |
||
Some data used in support of this definition were provided by the National Institute for Statistics and Economic Studies in France on February 11, 2013. |
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_FRFRA_Match_Name-withSuggestions.html |