SAS Quality Knowledge Base for Contact Information 26
Definitions for the English, Australia locale are described below.
Case Definitions
Extraction Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
None.
None.
None.
Address (Full) | ||
---|---|---|
Description |
The Address (Full) identification analysis definition identifies the address information that is represented by a string. |
|
Possible Output | Contact Extension PO Box Street |
|
Examples | Input | Output |
146 CECIL STREET WILLIAMSTOWN VIC 3016 | Street | |
15 130 RATHMINES RD HAWTHORN EAST 3123 VIC | Extension | |
C/ J & E WHITE 1010 LEONGATHA RD OUTTRIM VIC 3951 | Contact | |
PO BOX 138 MILLMERRAN QLD 4357 | PO Box | |
Remarks |
Phone (Type) | ||
---|---|---|
Description |
The Phone (Type) identification analysis definition determines the type of phone number. |
|
Possible Output | International Landline Mobile Special Invalid |
|
Examples | Input | Output |
+61 2 9561 9294 | Landline | |
61 02-94280410 | Landline | |
610413050521 | Mobile | |
Remarks |
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 20 characters | |
Examples | Input | Cluster ID |
Suite 1, 300 Burns Bay Road | 2 | |
300 Byrnes Bay Rd, ste 1 | 2 | |
770 Byrnes Bay Rd | 3 | |
Remarks |
|
Address (Full) | ||
---|---|---|
Description | The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses. | |
Max Length of Match Code | 28 characters | |
Examples | Input | Cluster ID |
UNIT 1108 163 CITY RD SOUTHBANK 3006 VIC | 2 | |
UNIT 2411 163 CITY RD SOUTHBANK 3006 VIC | 2 | |
UNIT 2411 167 CITY RD SOUTHBANK 3006 VIC | 3 | |
Remarks |
|
Address (PO Box Only) | ||
---|---|---|
Description | The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
17 KURRAJONG PLACE PO BOX 123 | 2 | |
17 KURRAJONG PL PO BOX 124 | 3 | |
15 SIR JOSEPH BANKS ST PO BOX 124 | 3 | |
Remarks |
|
Address (Street Only) | ||
---|---|---|
Description | The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address. | |
Max Length of Match Code | 16 characters | |
Examples | Input | Cluster ID |
17 KURRAJONG PLACE PO BOX 123 | 2 | |
17 KURRAJONG PL PO BOX 124 | 2 | |
15 SIR JOSEPH BANKS ST PO BOX 124 | 3 | |
Remarks |
|
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
North Sydney | 4 | |
N SYD | 4 | |
ABBOTSFORD | 5 | |
Remarks |
|
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
North Quay, Queensland 4002 | 3 | |
N Quay, Qld 4002 | 3 | |
MELBOURNE VIC 3002 | 4 | |
Remarks |
|
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 22 characters | |
Examples | Input | Cluster ID |
1800 HOLIDAY | 1 | |
1800 4654329 | 1 | |
61 02 37141222 | 2 | |
02 37141222 | 2 | |
02 37141222 ext 1234 | 2 | |
07 37141222 | 3 | |
07 37141223 | 3 | |
07 37141233 | 4 | |
Remarks |
|
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
-4002 | 0 | |
4002 | 0 | |
5002 | 1 | |
Remarks |
|
State/Province | ||
---|---|---|
Description | The State/Province match definition generates match codes which can be used to cluster records containing states and provinces. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
New South Wales | 0 | |
NSW | 0 | |
Northern Territory | 1 | |
Remarks |
|
Address | |||
---|---|---|---|
Description | The Address parse definition parses addresses into a set of tokens. | ||
Output Tokens | Street Number Street Name Extension PO Box |
||
Example 1 | Input | Output Token | Output |
15/10 Murray St | Street Number | 10 | |
Street Name | Murray St | ||
Extension | 15 | ||
PO Box | |||
Example 2 | Input | Output Token | Output |
UNIT 26 6-10 SIR JOSEPH BANKS ST | Street Number | 6-10 | |
Street Name | SIR JOSEPH BANKS ST | ||
Extension | UNIT 26 | ||
PO Box | |||
Remarks |
Address (Full) | |||
---|---|---|---|
Description | The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens. | ||
Output Tokens | Building Floor Room Street Number Street PO Box City State/Province Postal Code Organization Department Contact Info Zoning |
||
Example 1 | Input | Output Token | Output |
130 RATHMINES RD HAWTHORN EAST 3123 VIC | Building | ||
Floor | |||
Room | |||
Street Number | 130 | ||
Street | RATHMINES RD | ||
PO Box | |||
City | HAWTHORN EAST | ||
State/Province | VIC | ||
Postal Code | 3123 | ||
Organization | |||
Department | |||
Contact Info | |||
Zoning | |||
Example 2 | Input | Output Token | Output |
LEVEL 54 RIALTO SOUTH TOWER 525 COLLINS ST MELBOURNE VIC 3000 | Building | RIALTO SOUTH TOWER | |
Floor | LEVEL 54 | ||
Room | |||
Street Number | 525 | ||
Street | COLLINS ST | ||
PO Box | |||
City | MELBOURNE | ||
State/Province | VIC | ||
Postal Code | 3000 | ||
Organization | |||
Department | |||
Contact Info | |||
Zoning | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output Token | Output | |
Example 1 | 15/10 Murray St | Recipient | |
Building/Site | |||
Street | 10 Murray St | ||
Extension | 15/ | ||
PO Box | |||
Additional Info | |||
Input | Output Token | Output | |
Example 2 | St George House 4-16 Montgomery Street, Basement | Recipient | |
Building/Site | St George House | ||
Street | 4-16 Montgomery Street | ||
Extension | Basement | ||
PO Box | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Example | Input | Output Token | Output |
Lane Cove, NSW 2066 | City | Lane Cove | |
State/Province | NSW | ||
Postal Code | 2066 | ||
Additional Info | |||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Example | Input | Output Token | Output |
North Quay, QLD 4002 | City | North Quay | |
State/Province | QLD | ||
Postal Code | 4002 | ||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Phone | |||
---|---|---|---|
Description | The Phone parse definition parses phone numbers into a set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Example | Input | Output Token | Output |
Work: 610237141222 Ext 456 (ask for Mary) | Country Code | 61 | |
Area Code | 02 | ||
Base Number | 37141222 | ||
Extension | 456 | ||
Line Type | Work: | ||
Additional Info | (ask for Mary) | ||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Example | Input | Output Token | Output |
Work: 61 02 94280410 ext 44 | Country Code | 61 | |
Area Code | 02 | ||
Base Number | 94280410 | ||
Extension | 44 | ||
Line Type | Work: | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
None.
Address | ||
---|---|---|
Description | The Address standardization definition standardizes addresses. | |
Examples | Input | Output |
10 PLEASANT VIEW CRES | 10 Pleasant View Cres | |
44 O'Briens Lane | 44 OBriens Lane | |
Remarks | Standardization will remove every character that is not an alphanumeric or space. |
City | ||
---|---|---|
Description | The City standardization definition standardizes city names. | |
Examples | Input | Output |
ADAMSTOWN HEIGHTS | Adamstown Heights | |
mel | Melbourne | |
perth | Perth | |
Alice`s Springs | Alices Springs | |
Remarks |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code standardization definition standardizes last line address information. | |
Examples | Input | Output |
Armidale NEW SOUTH WALES 2351 | ARMIDALE NSW 2351 | |
Melbourn`e Victoria 3001 | MELBOURNE VIC 3001 | |
Melbourn“e Victoria 3001 | MELBOURNE VIC 3001 | |
Remarks | Standardization removes every character that is not alphanumeric or a space. |
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Examples | Input | Output |
Cullen, Mister Peter C. | Mr Peter C Cullen | |
Doctor Peter Sergeant | Dr Peter Sergeant | |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Examples | Input | Output |
610246831982 | (02) 4683 1982 | |
2 9561 9294 | (02) 9561 9294 | |
( + 61 ) 03 - 8 2 3 4 - 5 6 7 8 | (03) 8234 5678 | |
412413632 | 0412 413 632 | |
(02) 9876 7654 EXT 456 | (02) 9876 7654 x456 | |
02 69621040 (after 4pm) | (02) 6962 1040, After 4PM | |
Remarks |
Phone (with Country Code) | ||
---|---|---|
Description | The Phone (with Country Code) standardization definition standardizes phone numbers for international use. | |
Examples | Input | Output |
610246831982 | +61 2 4683 1982 | |
001161 (02) 46831982 | +61 2 4683 1982 | |
0246831982 | +61 2 4683 1982 | |
+49-025354102 | +49 25354102 | |
Remarks |
Phone (Electronic) | ||
---|---|---|
Description | The Phone (Electronic) standardization definition standardizes phone numbers for automated calling systems. | |
Examples | Input | Output |
610246831982 | +61246831982 | |
001161 (02) 46831982 | +61246831982 | |
0246831982 | +61246831982 | |
+49-025354102 | +4925354102 | |
1800-HOLIDAY | +18004654329 | |
(02) 9876 7654 EXT 456 | +61298767654 | |
Remarks |
Postal Code | ||
---|---|---|
Description | The Postal Code standardization definition standardizes postal codes. | |
Examples | Input | Output |
2006, | 2006 | |
(2065) | 2065 | |
800 | 0800 | |
Remarks | A 0 will be prepended to the postal code if the input has only three digits. |
In addition to the definitions listed on this page, the English, Australia locale also inherits all definitions for the English language and all Global definitions.
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_ENAUS_defs.html |