SAS Quality Knowledge Base for Contact Information 27

Address (Full) (with Combinations)

Match Definition

Address (Full) (with Combinations)
Description The Address (Full) (with Combinations) match definition generates one or more match codes which can be used to cluster records containing complete two-line addresses.
Max Length of Match Code 138 characters
Examples Input Cluster ID Sensitivity Score
Flat 1, 9, Rockleaze, Bristol, BS9 1NE 0 85 72.25
Moorlands 9 Rockleeze Smeed Park Bristol BS9 1NE 0 85 72.25
Philips Centre 420-430 London Road Surrey CR9 3QR 1 85 72.25
The Phillips Centre 420 London Road Croydon Surrey CR9 3QR 1 85 72.25
25 Kings Hill Avenue, Kings Hill, West Malling, Kent, ME19 4TA 2 85 72.25
c/o CAF, 25 Tingshill Avenue, Kingshill, West Malling, Kent, ME19 4TA 2 85 72.25
Highway House 171 Kings Rd Brentwood Essex CM14 4EJ 3 85 80.75
Pegasus House 171 Kings St Brentwood Essex CM14 4EJ 3 85 80.75
69 Morrison Street, Lothian EH3 8YF 4 85 80.75
69 Morrison, Lothian EH3 8YF 4 85 80.75
11th Floor SJMB 100 Old Hall Street Liverpool L70 1AB 5 85 85.00
14th Floor, Sir John Moores Building, 100 Old Hall Street, Liverpool, L70 1AB 5 85 85.00
97 Haymarket Terrace, Edinburgh, Mid Lothian EH12 5HD 6 85 85.00
Donaldson House, 97 Haymarket Terrace, Edinburgh, Lothian EH12 5HD 6 85 85.00
1 Hagley Road Birmingham West Midlands B16 8SS 7 85 85.00
Metropolitan House 1 Hagley Road Edgbaston Birmingham B16 8SS 7 85 85.00
CISD, Saughton House, Broomhouse Drive, Edinburgh, EH11 3XD 8 85 68.00
Saughton House, EH11 3XD 8 85 68.00
Civic Centre High Street Uxbrudge London UB8 1UW 9 85 68.00
Head Office, Civic Centre High Street, London, UB8 1UW 9 85 68.00
171 Kings Rd Brentwood Essex CM14 4EJ 10 80 48.00
171 Kings Rd Brentwood Essex 10 80 48.00
69 Morrison Street, Lothian EH3 8YF 11 80 48.00
69 Morrison, Lothian 11 80 48.00
9 Buckstone Oval, Alwoodley Leeds, W Yorkshire 12 75 30.00
9 Buckstone Oval, Alwoodley Leeds LS175HF 12 75 30.00
PO Box 3 CLS6 Gloucester Road Filton Bristol BS12 7QE 13 85 76.50
PO Box 3 WH-5 Bristol BS12 7QE 13 85 76.50
Dept DDSUPP Marlborough House PO BOX 1810 Bristol BS99 5SN 14 85 76.50
Sun Life Centre P O Box 1810 Bristol BS99 5SN 14 85 76.50
Austalasa House (HBBG), Waterside, PO Box 365, West Drayton, Middlesex UB7 0GB 15 85 76.50
Waterside P O Box 365 Harmonsworth West Drayton Middlesex UB7 0GB 15 85 76.50
PO Box 3 CLS6 Gloucester Road Filton Bristol BS127QE 16 80 63.75
PO Box 3 WH-5 Bristol 16 80 63.75
PO Box 365, West Drayton, Middlesex UB7 0GB 17 80 63.75
Box 365 Harmonsworth West Drayton Middlesex 17 80 63.75
6 Paddent Court, London NW71GY 18 85 80.75
6 Paddent Ct 12 Stockford Avenue London NW71GY 18 85 55.25
6 Paddent Court, Stockford, London NW71GY 18 85 55.25
4 Howes Place Cambridge Cambridgeshire CB30LD 19 85 80.75
4 Howes Place Huntingdon Rd Cambridge Cambridgeshire CB30LD 19 85 55.25
4 Howes Place, 33 Huntingdon Rd Cambridge Cambridgeshire CB30LD 19 85 55.25
Flat G 5 Bank Street Aberdeen AB117ST 20 65 45.50
5A, Bank St, Aberdeen Aberdeenshire AB117ST 20 65 45.50
Bank Street Aberdeen AB117ST 20 65 45.50
32 West Avenue Handsworth Birmingham West Midlands B202LS 21 65 45.50
9 West Boulevard Birmingham B202LS 21 65 45.50
West Rd B202LS 21 65 45.50
Dunsford Exeter Devon EX67AX 22 85 17.00
Dunsford, Exeter, Devon, EX6 7AX 22 85 17.00
E London EC3R7NE 23 85 17.00
East London, EC3R 7NE 23 85 17.00
Remarks

This Address (Full) (with Combinations) match definition generates one or more match codes for each input string. The number of match codes generated for an input string depends on the content of the string. Each match code represents a combination of different portions of the input string; this enables two strings to be matched even when some portions of one or both of the strings differ. See the examples above for an illustration of clusters that may be produced using match codes generated by this definition.

Note that a consequence of generating multiple match codes is that a record can be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition.

A score is assigned to each match code produced by the definition, and might be used as a factor when resolving conflicts between clusters. The score is dependent on the set of rules that produced the match code and the sensitivity used when the job was executed.