SAS Quality Knowledge Base for Contact Information 27

Address

Match Definition

Address
Description: The Address match definition generates match codes that can be used to cluster records containing addresses.
Max Length of Match Code: 19 characters

                                           Input                    Cluster ID
Example 1                                  Av. du Castel, 89 B5     0
Example 2                                  Appelboomstraat 10       1
Example 3 - Dutch variant (misspelled)     Apelbomstraat 10 bus 2   1
Example 4                                  Rue du Pommier 10/2      1
Example 5 - French variant (misspelled)    Rue du Pomie 10/bte2     2
Remarks

Note: The results listed above reflect the default match sensitivity (85).

This definition uses Dutch phonetic reduction to match words with similar sounds and spellings in Dutch (see examples 2 and 3). Some translation of French data is performed to enable matching across languages (see example 4). Note that translations cannot be performed for French words that are misspelled in the input string (see example 5).

To match misspellings for French, we recommend setting up a job using a scheme to correct misspellings in French before generating match codes with this definition. For more details, see the Remarks for the Street Name match definition.
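
The recommended order of operations (correct French misspellings with a scheme first, then generate match codes) can be illustrated with a minimal Python sketch. This is not the QKB algorithm and does not call any SAS API. The names toy_match_code, apply_scheme, TRANSLATIONS, and FRENCH_SCHEME are hypothetical, the translation and scheme tables are invented for this example, and the "phonetic reduction" is a crude stand-in that only collapses doubled letters.

```python
import re
from collections import defaultdict

# Hypothetical French-to-Dutch street translation table (illustrative only).
TRANSLATIONS = {"rue du pommier": "appelboomstraat"}

# Hypothetical correction scheme: known French misspelling -> standard form.
FRENCH_SCHEME = {"pomie": "pommier"}


def apply_scheme(address, scheme):
    """Replace each word that appears in the scheme with its standard form."""
    return " ".join(scheme.get(word.lower(), word) for word in address.split())


def toy_match_code(address):
    """Toy stand-in for a match code. NOT the real QKB match definition."""
    text = address.lower()
    # Translate known French street names; misspelled names are not found.
    for french, dutch in TRANSLATIONS.items():
        text = text.replace(french, dutch)
    # Treat the first run of digits as the house number.
    number_match = re.search(r"\d+", text)
    number = number_match.group() if number_match else ""
    street = text[:number_match.start()] if number_match else text
    # Crude reduction: keep letters only, then collapse doubled letters.
    letters = re.sub(r"[^a-z]", "", street)
    reduced = re.sub(r"(.)\1+", r"\1", letters)
    return f"{reduced}|{number}"


def cluster(addresses):
    """Group addresses that share the same toy match code."""
    groups = defaultdict(list)
    for address in addresses:
        groups[toy_match_code(address)].append(address)
    return dict(groups)


raw = "Rue du Pomie 10/bte2"
corrected = apply_scheme(raw, FRENCH_SCHEME)  # "Rue du pommier 10/bte2"

# Without correction the misspelled French address falls in its own cluster;
# after correction it shares a code with "Appelboomstraat 10".
before = cluster(["Appelboomstraat 10", "Rue du Pommier 10/2", raw])
after = cluster(["Appelboomstraat 10", "Rue du Pommier 10/2", corrected])
```

In this sketch, before contains two groups and after contains one. That mirrors the behavior described in the Remarks: the translation step only helps once the French input is spelled as expected.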