
SAS Quality Knowledge Base for Contact Information 25

English, Australia Definitions

Definitions for the English, Australia locale are described below.

Case Definitions
Gender Analysis Definitions

Identification Analysis Definitions

Match Definitions

Parse Definitions

Pattern Analysis Definitions

Standardization Definitions

Inherited Definitions

Case Definitions

None.

Gender Analysis Definitions

None.

Identification Analysis Definitions

Address (Full)
Description The Identification Analysis definition for Address (Full) identifies the type of address data that an input field contains when complex address data has been split arbitrarily across multiple fields.
Possible Output Contact
Extension
PO Box
Street
Examples Input Output
146 CECIL STREET WILLIAMSTOWN VIC 3016 Street
15 130 RATHMINES RD HAWTHORN EAST 3123 VIC Extension
C/ J & E WHITE 1010 LEONGATHA RD OUTTRIM VIC 3951 Contact
PO BOX 138 MILLMERRAN QLD 4357 PO Box
Remarks  

 

Phone (Type)
Description The Identification Analysis definition for Phone (Type) determines what type of phone number an input string represents.
Possible Output International
Landline
Mobile
Special
Invalid
Examples Input Output
+61 2 9561 9294 Landline
61 02-94280410 Landline
610413050521 Mobile
Remarks  

Match Definitions

Address
Description The Address match definition generates match codes which can be used to cluster records containing addresses.
Max Length of Match Code 20 characters
Examples Input Cluster ID
Suite 1, 300 Burns Bay Road 2
300 Byrnes Bay Rd, ste 1 2
770 Byrnes Bay Rd 3
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Address (Full)
Description The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses.
Max Length of Match Code 28 characters
Examples Input Cluster ID
UNIT 1108 163 CITY RD SOUTHBANK 3006 VIC 2
UNIT 2411 163 CITY RD SOUTHBANK 3006 VIC 2
UNIT 2411 167 CITY RD SOUTHBANK 3006 VIC 3
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Address (PO Box Only)
Description The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address.
Max Length of Match Code 15 characters
Examples Input Cluster ID
17 KURRAJONG PLACE PO BOX 123 2
17 KURRAJONG PL PO BOX 124 3
15 SIR JOSEPH BANKS ST PO BOX 124 3
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Address (Street Only)
Description The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address.
Max Length of Match Code 16 characters
Examples Input Cluster ID
17 KURRAJONG PLACE PO BOX 123 2
17 KURRAJONG PL PO BOX 124 2
15 SIR JOSEPH BANKS ST PO BOX 124 3
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 15 characters
Examples Input Cluster ID
North Sydney 4
N SYD 4
ABBOTSFORD 5
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 15 characters
Examples Input Cluster ID
North Quay, Queensland 4002 3
N Quay, Qld 4002 3
MELBOURNE VIC 3002 4
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Name (with Suggestions)
Description The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 21 characters
Examples Input Cluster ID
PRAIS HILTON 1
PARIS HILTON 1
HENRY NICKELSON 2
HENRY NICKERSON 2
NIKI WONG 3
ANIKI WONG 3
NIKI WONG 4
NICLOE WONG 4
Remarks

This definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string. This enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name PRAIS might match the name PARIS, and the name NICLOE might match the name NIKI.

Note that a consequence of the generation of multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition.

For more information on suggestion-based matching, refer to the "Suggestion-Based Matching" section of the DataFlux Data Management Studio Online Help.

Note: The results listed above reflect the default match sensitivity (85).
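The effect of multiple match codes on clustering can be illustrated with a minimal Python sketch. The match codes below (CODE_A, CODE_B) are hypothetical placeholders, not values produced by the QKB, and the grouping function is an illustration rather than the clustering logic of any SAS or DataFlux product. It shows only how a record that carries two match codes ends up in two clusters.

```python
from collections import defaultdict


def cluster_by_match_codes(records):
    """Group record names by every match code they carry.

    `records` maps a name to the set of match codes generated for it.
    A name with several codes is added to several clusters.
    """
    clusters = defaultdict(list)
    for name, codes in records.items():
        for code in codes:
            clusters[code].append(name)
    # Keep only codes shared by at least two records.
    return {code: names for code, names in clusters.items() if len(names) > 1}


# Hypothetical match codes, chosen to mirror the NIKI WONG example above.
records = {
    "NIKI WONG": {"CODE_A", "CODE_B"},
    "ANIKI WONG": {"CODE_A"},
    "NICLOE WONG": {"CODE_B"},
}

clusters = cluster_by_match_codes(records)
for code in sorted(clusters):
    print(code, sorted(clusters[code]))
```

Here NIKI WONG appears in both clusters, which is why the entity resolution step needs to decide how such overlapping clusters are merged or kept apart.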

 

Phone
Description The Phone match definition generates match codes which can be used to cluster records containing phone numbers.
Max Length of Match Code 22 characters
Examples Input Cluster ID
1800 HOLIDAY 1
1800 4654329 1
61 02 37141222 2
02 37141222 2
02 37141222 ext 1234 2
07 37141222 3
07 37141223 3
07 37141233 4
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 15 characters
Examples Input Cluster ID
-4002 0
4002 0
5002 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

State/Province
Description The State/Province match definition generates match codes which can be used to cluster records containing states and provinces.
Max Length of Match Code 15 characters
Examples Input Cluster ID
New South Wales 0
NSW 0
Northern Territory 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

Parse Definitions

Address
Description The Parse definition for Address parses addresses into a set of tokens.
Output Tokens Street Number
Street Name
Extension
PO Box
Example 1 Input Output
15/10 Murray St Street Number 10
Street Name Murray St
Extension 15
PO Box  
Example 2 Input Output
UNIT 26 6-10 SIR JOSEPH BANKS ST Street Number 6-10
Street Name SIR JOSEPH BANKS ST
Extension UNIT 26
PO Box  
Remarks  

 

Address (Full)
Description The Parse definition for Address (Full) parses a complete two-line address into a set of tokens.
Output Tokens Building
Floor
Room
Street Number
Street
PO Box
City
State/Province
Postal Code
Organization
Department
Contact Info
Zoning
Example 1 Input Output
130 RATHMINES RD HAWTHORN EAST 3123 VIC Building  
Floor  
Room  
Street Number 130
Street RATHMINES RD
PO Box  
City HAWTHORN EAST
State/Province VIC
Postal Code 3123
Organization  
Department  
Contact Info  
Zoning  
Example 2 Input Output
LEVEL 54 RIALTO SOUTH TOWER 525 COLLINS ST MELBOURNE VIC 3000 Building RIALTO SOUTH TOWER
Floor LEVEL 54
Room  
Street Number 525
Street COLLINS ST
PO Box  
City MELBOURNE
State/Province VIC
Postal Code 3000
Organization  
Department  
Contact Info  
Zoning  
Remarks  

 

Address (Global)
Description

The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 15/10 Murray St Recipient  
Building/Site  
Street 10 Murray St
Extension 15/
PO Box  
Additional Info  
  Input Output
Example 2 St George House 4-16 Montgomery Street, Basement Recipient  
Building/Site St George House
Street 4-16 Montgomery Street
Extension Basement
PO Box  
Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), change them back to use Address (Global).

 

Address (Global) (v23)
Description The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens.
Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 15/10 Murray St Recipient  
Building/Site  
Street 10 Murray St
Extension 15/
PO Box  
Additional Info  
  Input Output
Example 2 St George House 4-16 Montgomery Street, Basement Recipient  
Building/Site St George House
Street 4-16 Montgomery Street
Extension Basement
PO Box  
Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23), change them back to use Address (Global).

 

City - State/Province - Postal Code
Description The Parse definition for City - State/Province - Postal Code parses address "last line" data into a set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
Example Input Output
Lane Cove, NSW 2066 City Lane Cove
State/Province NSW
Postal Code 2066
Additional Info  
Remarks  
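As a rough illustration of what parsing "last line" data into tokens means, the following Python sketch splits a simple Australian last line with a regular expression. The pattern, the state list, and the function name parse_last_line are assumptions made for this example. The QKB definition is not implemented this way and handles far more variation (full state names, missing commas, additional info).

```python
import re

# Assumed list of Australian state and territory abbreviations.
STATES = ("NSW", "VIC", "QLD", "SA", "WA", "TAS", "NT", "ACT")

LAST_LINE = re.compile(
    r"^\s*(?P<city>.+?)[,\s]+(?P<state>" + "|".join(STATES) + r")"
    r"\s+(?P<postcode>\d{4})\s*$",
    re.IGNORECASE,
)


def parse_last_line(text):
    """Return a dict of tokens, or None if the text does not fit the pattern."""
    match = LAST_LINE.match(text)
    if match is None:
        return None
    return {
        "City": match.group("city"),
        "State/Province": match.group("state").upper(),
        "Postal Code": match.group("postcode"),
    }


print(parse_last_line("Lane Cove, NSW 2066"))
```

Input that does not end in a recognized abbreviation followed by four digits returns None in this sketch, whereas a real parse definition would still attempt to assign tokens.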

 

City - State/Province - Postal Code (Global)
Description The Parse definition for City - State/Province - Postal Code (Global) parses address "last line" data into a globally recognized set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
Example Input Output
North Quay, QLD 4002 City North Quay
State/Province QLD
Postal Code 4002
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Phone
Description The Parse definition for Phone parses Australian phone numbers into a set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
Example Input Output
Work: 610237141222 Ext 456 (ask for Mary) Country Code 61
Area Code 02
Base Number 37141222
Extension 456
Line Type Work:
Additional Info (ask for Mary)
Remarks  

 

Phone (Global)
Description The Parse definition for Phone (Global) parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
Example Input Output
Work: 61 02 94280410 ext 44 Country Code 61
Area Code 02
Base Number 94280410
Extension 44
Line Type Work:
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description The Standardization definition for Address standardizes addresses.
Examples Input Output
10 PLEASANT VIEW CRES 10 Pleasant View Cres
44 O'Briens Lane 44 OBriens Lane
Remarks Standardization removes every character that is not alphanumeric or a space.
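The character removal described in the remark can be approximated with a short Python sketch. It assumes that "alphanumeric" means the ASCII letters and digits, which the source does not state, and it covers only the removal step, not casing or abbreviation handling. The function name is hypothetical.

```python
import re


def strip_non_alphanumeric(text):
    """Remove every character that is not an ASCII letter, digit, or space."""
    return re.sub(r"[^A-Za-z0-9 ]", "", text)


print(strip_non_alphanumeric("44 O'Briens Lane"))
```

Applied to the second example above, the apostrophe is dropped and the remaining characters are left in place.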

 

City
Description The Standardization definition for City standardizes city names and converts them to proper case.
Examples Input Output
ADAMSTOWN HEIGHTS Adamstown Heights
mel Melbourne
perth Perth
Alice`s Springs Alices Springs
Remarks  

 

City - State/Province - Postal Code
Description The Standardization definition for City - State/Province - Postal Code standardizes city and state or province names and converts them to uppercase.
Examples Input Output
Armidale NEW SOUTH WALES 2351 ARMIDALE NSW 2351
Melbourn`e Victoria 3001 MELBOURNE VIC 3001
Melbourn´e Victoria 3001 MELBOURNE VIC 3001
Remarks Standardization removes every character that is not alphanumeric or a space. This definition violates specification S_GENL_2100, and should be flagged as justifiably non-compliant.

 

Name
Description The Standardization definition for Name standardizes names of individuals.
Examples Input Output
Cullen, Mister Peter C. Mr Peter C Cullen
Doctor Peter Sergeant Dr Peter Sergeant
Remarks  

 

Phone
Description The Standardization definition for Phone standardizes phone numbers for domestic use.
Examples Input Output
610246831982 (02) 4683 1982
2 9561 9294 (02) 9561 9294
( + 61 ) 03 - 8 2 3 4 - 5 6 7 8 (03) 8234 5678
412413632 0412 413 632
(02) 9876 7654 EXT 456 (02) 9876 7654 x456
02 69621040 (after 4pm) (02) 6962 1040, After 4PM
Remarks  

 

Phone (with Country Code)
Description The Standardization definition for Phone (with Country Code) standardizes phone numbers for international use.
Examples Input Output
610246831982 +61 2 4683 1982
001161 (02) 46831982 +61 2 4683 1982
0246831982 +61 2 4683 1982
+49-025354102 +49 25354102
Remarks  
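The first three examples above can be reproduced with a small Python sketch. It is an illustration under several assumptions, not the logic of the QKB definition: it handles only Australian landline numbers with a one-digit area code and an eight-digit base number, it treats 0011 as the international access prefix, and it does not cover foreign numbers such as the +49 example. The function name is hypothetical.

```python
import re


def to_international_au(raw):
    """Format an Australian landline number as '+61 A BBBB BBBB'."""
    digits = re.sub(r"\D", "", raw)          # keep digits only
    if digits.startswith("0011"):            # assumed international access prefix
        digits = digits[4:]
    if digits.startswith("61") and len(digits) > 9:   # country code
        digits = digits[2:]
    if digits.startswith("0"):               # national trunk prefix
        digits = digits[1:]
    if len(digits) != 9:
        raise ValueError(f"unexpected number of digits in {raw!r}")
    return f"+61 {digits[0]} {digits[1:5]} {digits[5:]}"


for example in ("610246831982", "001161 (02) 46831982", "0246831982"):
    print(example, "->", to_international_au(example))
```

Input that does not reduce to nine national digits raises a ValueError in this sketch, so that unexpected shapes are surfaced rather than silently reformatted.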

 

Phone (Electronic)
Description The Standardization definition for Phone (Electronic) standardizes phone numbers for automated calling systems.
Examples Input Output
610246831982 +61246831982
001161 (02) 46831982 +61246831982
0246831982 +61246831982
+49-025354102 +4925354102
1800-HOLIDAY +18004654329
(02) 9876 7654 EXT 456 +61298767654
Remarks  
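The 1800-HOLIDAY example relies on translating letters to the digits of a standard telephone keypad. A minimal Python sketch of that single step follows. The function name is hypothetical, and the sketch does not add a country code or handle extensions: a label such as EXT would need to be removed first, because its letters would also be translated.

```python
# Standard telephone keypad letter groups.
KEYPAD = {
    "ABC": "2", "DEF": "3", "GHI": "4", "JKL": "5",
    "MNO": "6", "PQRS": "7", "TUV": "8", "WXYZ": "9",
}
LETTER_TO_DIGIT = str.maketrans(
    {letter: digit for letters, digit in KEYPAD.items() for letter in letters}
)


def vanity_to_digits(raw):
    """Translate keypad letters to digits and drop all other characters."""
    translated = raw.upper().translate(LETTER_TO_DIGIT)
    return "".join(ch for ch in translated if ch.isdigit())


print(vanity_to_digits("1800-HOLIDAY"))
```

The digits produced for HOLIDAY (4654329) agree with the Phone match example earlier on this page, where 1800 HOLIDAY and 1800 4654329 fall in the same cluster.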

 

Postal Code
Description The Standardization definition for Postal Code standardizes postal codes.
Examples Input Output
2006, 2006
(2065) 2065
800 0800
Remarks A 0 will be prepended to the postal code if the input has only three digits.
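The padding rule in the remark, together with the punctuation removal shown in the examples, can be sketched in Python as follows. This is an illustration of the documented behavior for the three examples only. The function name is hypothetical, and how the QKB treats other inputs (for example, five-digit values) is not stated here.

```python
import re


def standardize_postcode(raw):
    """Keep the digits and prepend a 0 when only three digits are present."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) == 3:
        digits = "0" + digits
    return digits


for example in ("2006,", "(2065)", "800"):
    print(example, "->", standardize_postcode(example))
```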

Inherited Definitions

In addition to the definitions listed on this page, the English, Australia locale also inherits all definitions for the English language and all Global definitions.