SAS Quality Knowledge Base for Contact Information 27

Name (with Suggestions)

Match Definition

Name (with Suggestions)
Description

The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals.

Max Length of Match Code 26 characters
Examples Input Cluster ID
CHANTAL AMILHAT 0
CHANATL AMILHAT 0
BERNARD AMILHAT 1
BERNARD AMIRHAT 1
BRIGITTE LASSALLE 2
BRIGTTE LASSALLE 2
NATH GUIRAUD 3
NATHAIE GUIRAUD 3
Remarks  

Note Note: The results listed above reflect the default match sensitivity (85).

This match definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string; this enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name CHANTAL might match the name CHANATL, or the name NATHAIE might match the name NATH.

Note that a consequence of the generation of multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition.

For more information on suggestion-based matching, refer to the Suggestion-Based Matching section of the DataFlux Data Management Studio online Help.

Some data used in support of this definition were provided by the National Institute for Statistics and Economic Studies in France on February 11, 2013.