SAS Quality Knowledge Base for Contact Information 27
Name (with Suggestions) | ||
---|---|---|
Description | The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 21 characters | |
Examples | Input | Cluster ID |
PRAIS HILTON | 1 | |
PARIS HILTON | 1 | |
HENRY NICKELSON | 2 | |
HENRY NICKERSON | 2 | |
NIKI WONG | 3 | |
ANIKI WONG | 3 | |
NIKI WONG | 4 | |
NICLOE WONG | 4 | |
Remarks |
|
|
This match definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string; this enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name PRAIS might match the name PARIS, or the name NICLOE might match the name NIKI. Note that a consequence of generating multiple match codes is that a record can be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition.
Another consequence of generating multiple match codes is that more
processing time is required than when generating a single match code.
Generating match codes using this definition might take up to five times as
long as generating match codes using a traditional match definition. |
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_ENGBR_Match_Name-withSuggestions.html |