SAS Quality Knowledge Base for Contact Information 25

English, New Zealand Definitions

Definitions for the English, New Zealand locale are described below.

Case Definitions

Gender Analysis Definitions

Identification Analysis Definitions

Match Definitions

Parse Definitions

Pattern Analysis Definitions

Standardization Definitions

Inherited Definitions

Case Definitions

None.

Gender Analysis Definitions

None.

Identification Analysis Definitions

Phone
Description The Identification Analysis definition for Phone determines what type of phone number a string represents.
Possible Outputs LANDLINE
0800
0900
0508
MOB TELECOM
MOB VODAFONE
MOB TELSTRA
INTL
Examples Input Output
508 123456 0508
+64-9 914 4750 INTL
0800 123456 0800
900-123456 0900
Remarks  
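
The examples above suggest that the phone type can often be read from the leading characters of the number. The following Python sketch reproduces only the four documented examples. It is not the QKB's actual logic: the function name `identify_phone`, the prefix table, and the rule that a prefix may appear without its leading 0 are assumptions inferred from the examples. The LANDLINE and MOB outputs are not covered, so the sketch returns None for them.

```python
import re

# Hypothetical prefix table built from the documented outputs. Accepting a
# prefix without its leading 0 is an assumption based on the example inputs
# "508 123456" and "900-123456".
SPECIAL_PREFIXES = ("0800", "0900", "0508")


def identify_phone(value):
    """Return a phone type for the documented example shapes, else None."""
    text = value.strip()
    # Assumption: a leading "+" marks the international form (see "+64-9 914 4750").
    if text.startswith("+"):
        return "INTL"
    digits = re.sub(r"\D", "", text)  # keep digits only
    for prefix in SPECIAL_PREFIXES:
        if digits.startswith(prefix) or digits.startswith(prefix[1:]):
            return prefix
    return None  # LANDLINE and MOB types are outside this sketch


for sample in ("508 123456", "+64-9 914 4750", "0800 123456", "900-123456"):
    print(sample, "->", identify_phone(sample))
```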

Match Definitions

Address
Description The Address match definition generates match codes which can be used to cluster records containing addresses.
Max Length of Match Code 40 characters
Examples Input Cluster ID
52 Commerce Street 0
52 Commerce St 0
52 Comerce St 0
52 Commerce Street, PO Box 1234 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Address (Extended)
Description The Address (Extended) match definition generates match codes which can be used to cluster records containing addresses.
Max Length of Match Code 15 characters
Examples Input Cluster ID
52 Commerce St 0
52 Commerce Street 0
52 Comarce Street 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Address (PO Box Only)
Description The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address.
Max Length of Match Code 15 characters
Examples Input Cluster ID
PO Box 1234 0
52 Commerce Street, PO Box 1234 0
52 Commerce Street, PO Box 2345 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

The Address (PO Box Only) (v22) match definition is now deprecated and will be removed in a future release of the QKB.

The Address (PO Box Only) match definition has been replaced with a copy of the Address (PO Box Only) (v22) definition, which takes advantage of updated processing. If you changed your jobs to use the Address (PO Box Only) (v22) definition, we suggest that you change them back to the Address (PO Box Only) definition.

 

Address (PO Box Only) (v22)
Description The Address (PO Box Only) (v22) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address.
Max Length of Match Code 15 characters
Examples Input Cluster ID
PO Box 1234 0
52 Commerce Street, PO Box 1234 0
52 Commerce Street, PO Box 2345 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

The Address (PO Box Only) (v22) match definition is now deprecated and will be removed in a future release of the QKB.

The Address (PO Box Only) match definition has been replaced with a copy of the Address (PO Box Only) (v22) definition, which takes advantage of updated processing. If you changed your jobs to use the Address (PO Box Only) (v22) definition, we suggest that you change them back to the Address (PO Box Only) definition.

 

Address (Street Only)
Description

The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address.

Max Length of Match Code 33 characters
Examples Input Cluster ID
52 Commerce Street 0
52 Commerce Street, PO Box 1234 0
123 Main Street, PO Box 1234 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

The Address (Street Only) (v22) match definition is now deprecated and will be removed in a future release of the QKB.

The Address (Street Only) match definition has been replaced with a copy of the Address (Street Only) (v22) definition, which takes advantage of updated processing. If you changed your jobs to use the Address (Street Only) (v22) definition, we suggest that you change them back to the Address (Street Only) definition.

 

Address (Street Only) (v22)
Description The Address (Street Only) (v22) match definition generates match codes which can be used to cluster records containing the street portion of an address.
Max Length of Match Code 33 characters
Examples Input Cluster ID
52 Commerce Street 0
52 Commerce Street, PO Box 1234 0
123 Main Street, PO Box 1234 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

The Address (Street Only) (v22) match definition is now deprecated and will be removed in a future release of the QKB.

The Address (Street Only) match definition has been replaced with a copy of the Address (Street Only) (v22) definition, which takes advantage of updated processing. If you changed your jobs to use the Address (Street Only) (v22) definition, we suggest that you change them back to the Address (Street Only) definition.

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 15 characters
Examples Input Cluster ID
Auckland 0
Tauranga 1
Tarangga 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 28 characters
Examples Input Cluster ID
Wellington 6001 0
Welington, 6001 0
Kaiapoi 8252 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Name (with Suggestions)
Description The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 21 characters
Examples Input Cluster ID
PRAIS HILTON 1
PARIS HILTON 1
HENRY NICKELSON 2
HENRY NICKERSON 2
NIKI WONG 3
ANIKI WONG 3
NIKI WONG 4
NICLOE WONG 4
Remarks

The Name (with Suggestions) match definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string. This enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name PRAIS might match the name PARIS, or the name NICLOE might match the name NIKI.

Note that a consequence of the generation of multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition.

For more information on suggestion-based matching, refer to the Suggestion-Based Matching section of the DataFlux Data Management Studio Online Help.
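
The effect of multiple match codes on clustering can be illustrated with a minimal Python sketch. The match codes below (`CODE_A`, `CODE_B`, `CODE_C`), the record labels, and the function `cluster_by_match_code` are hypothetical placeholders. Real QKB match codes are generated by the definition and look different. The sketch only shows why one record can land in more than one cluster.

```python
from collections import defaultdict

# Hypothetical data: each record carries one or more placeholder match codes,
# mimicking a definition that returns several suggestions per input string.
suggested_codes = {
    "rec1 NIKI WONG": ["CODE_A", "CODE_B"],
    "rec2 ANIKI WONG": ["CODE_A"],
    "rec3 NICLOE WONG": ["CODE_B"],
    "rec4 HENRY NICKELSON": ["CODE_C"],
}


def cluster_by_match_code(codes_by_record):
    """Group records that share at least one match code."""
    clusters = defaultdict(list)
    for record, codes in codes_by_record.items():
        for code in codes:
            clusters[code].append(record)
    # Keep only codes shared by two or more records.
    return {code: recs for code, recs in clusters.items() if len(recs) > 1}


clusters = cluster_by_match_code(suggested_codes)
for code, records in clusters.items():
    print(code, records)
```

In this sketch, rec1 appears in both clusters, which is the situation that calls for care during entity resolution.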

 

Note: The results listed above reflect the default match sensitivity (85).

 

Phone
Description The Phone match definition generates match codes which can be used to cluster records containing phone numbers.
Max Length of Match Code 19 characters
Examples Input Cluster ID
64-9-931-8077 0
+64 (9) 931 8077 0
(9) 931 8077 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 15 characters
Examples Input Cluster ID
6001 0
-6001 0
8252 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Text
Description The Text match definition generates match codes which can be used to cluster records containing general text strings.
Max Length of Match Code 15 characters
Examples Input Cluster ID
they went 0
you are 1
you're 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

Parse Definitions

Address
Description

The Address parse definition parses addresses into a set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
Example Input Output
ABC Industry Park, Mr John Smith, LEVEL 20 135 ALBERT STREET P.O. Box 12345 Recipient Mr John Smith
Building/Site ABC Industry Park
Street 135 ALBERT STREET
Extension LEVEL 20
PO Box P.O. Box 12345
Additional Info  
Remarks  

 

Address (Extended)
Description The Parse definition for Address (Extended) parses addresses into a set of tokens.
Output Tokens Building Number
Street Name
Street Type
Street Direction
Address Extension
Suburb
Postal
Rural Delivery
State Highway
Example Input Output
97 A Kiwi Rd Point Chevalier Building Number 97 A
Street Name Kiwi
Street Type Rd
Street Direction  
Address Extension  
Suburb Point Chevalier
Postal  
Rural Delivery  
State Highway  
Remarks  

 

Address (Full)
Description The Parse definition for Address (Full) parses full multi-line addresses into a set of tokens.
Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Suburb
City
Postal Code
Country
Additional Info
Example Input Output
ABC Industry Park, Mr John Smith, LEVEL 20 135 ALBERT STREET P.O. Box 12345 NELSON 7010 Recipient Mr John Smith
Building/Site ABC Industry Park
Street 135 ALBERT STREET
Extension LEVEL 20
PO Box P.O. Box 12345
Suburb  
City NELSON
Postal Code 7010
Country  
Additional Info  
Remarks  

 

Address (Global)
Description

The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 ABC Industry Park, Mr John Smith, LEVEL 20 135 ALBERT STREET P.O. Box 12345 Recipient Mr John Smith
Building/Site ABC Industry Park
Street 135 ALBERT STREET
Extension LEVEL 20
PO Box P.O. Box 12345
Additional Info  
  Input Output
Example 2 97 A Kiwi Rd Point Chevalier Recipient  
Building/Site  
Street 97 A Kiwi Rd
Extension  
PO Box  
Additional Info Point Chevalier
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use the Address (Global) (v23) definition, we suggest that you change them back to the Address (Global) definition.

 

Address (Global) (v23)
Description

The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 ABC Industry Park, Mr John Smith, LEVEL 20 135 ALBERT STREET P.O. Box 12345 Recipient Mr John Smith
Building/Site ABC Industry Park
Street 135 ALBERT STREET
Extension LEVEL 20
PO Box P.O. Box 12345
Additional Info  
  Input Output
Example 2 97 A Kiwi Rd Point Chevalier Recipient  
Building/Site  
Street 97 A Kiwi Rd
Extension  
PO Box  
Additional Info Point Chevalier
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition, which takes advantage of the new tokens and updated processing. If you changed your jobs to use the Address (Global) (v23) definition, we suggest that you change them back to the Address (Global) definition.

 

Address (with Preferred Delivery Info)
Description The Parse definition for Address (with Preferred Delivery Info) parses addresses into a set of tokens.
Output Tokens Building Name
Building Number
Street Name
Street Type
Street Direction
Address Extension
Address Extension Number
Preferred Delivery Info
Example Input Output
97 A Kiwi Rd Point Chevalier Building Name  
Building Number 97 A
Street Name Kiwi
Street Type Rd
Street Direction  
Address Extension  
Address Extension Number  
Preferred Delivery Info Point Chevalier
Remarks  

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens.
Output Tokens Suburb
City
Postal Code
Additional Info
Example Input Output
BROADFIELD CHRISTCHURCH 8021 Suburb BROADFIELD
City CHRISTCHURCH
Postal Code 8021
Region  
Remarks  

 

City - State/Province - Postal Code (Global)
Description The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
Example Input Output
Wellington 6001 City Wellington
State/Province  
Postal Code 6001
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Phone
Description The Parse definition for Phone parses phone numbers into a set of tokens.
Output Tokens

Country Code
Area Code
Base Number
Extension
Line Type
Additional Info

  Input Output
Example 1 Tel: +64-4 917 6800 SAS Wellington Country Code +64
Area Code 4
Base Number 917 6800
Extension  
Line Type Tel:
Additional Info SAS Wellington
  Input Output
Example 2 09 914 4750 Country Code  
Area Code 09
Base Number 914 4750
Extension  
Line Type  
Additional Info  
  Input Output
Example 3 +1 (919) 447-3000 Country Code +1
Area Code  
Base Number (919) 447-3000
Extension  
Line Type  
Additional Info  
Remarks  

 

Phone (Global)
Description The Parse definition for Phone (Global) parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
Example Input Output
+64-9 914 4750 Country Code 64
Area Code 9
Base Number 914 4750
Extension  
Line Type  
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description The Address standardization definition standardizes addresses.
Examples Input Output
52 Commerce Street 52 Commerce St
5TH avenue 5th Ave
Remarks  

 

Address (Extended)
Description The Address (Extended) standardization definition standardizes addresses.
Examples Input Output
52 Commerce Street 52 Commerce St
5TH avenue 5th Ave
Remarks  

 

City
Description The City standardization definition standardizes city names.
Examples Input Output
wellington Wellington
CHRISTCHURCH Christchurch
Remarks  

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code standardization definition standardizes last line address information.
Example Input Output
wellington 6001 Wellington 6001
Remarks  

 

Name
Description The Name standardization definition standardizes names of individuals.
Examples Input Output
john donovan John Donovan
mister morrison junior Mr Morrison, Jr
Remarks  

 

Phone
Description The Phone standardization definition standardizes phone numbers for domestic use.
Examples Input Output
Tel: +64-4 917 6800 ext 1234 SAS Wellington (04) 917 6800 x1234, Tel, SAS Wellington
(09) 914 4750 (09) 914 4750
+1 (919) 447-3000 +1 9194473000
(0800) 234567 0800 234 567
Remarks Optional geographic area codes are surrounded by parentheses, and non-optional geographic area codes are not surrounded by parentheses.

 

Phone (Electronic)
Description The Phone (Electronic) standardization definition standardizes phone numbers for automated calling systems.
Examples Input Output
Tel: +64-4 917 6800 ext 1234 SAS Wellington +6449176800
(09) 914 4750 +6499144750
+1 (919) 447-3000 +19194473000
(0800) 234567 +64800234567
+1-800-DATAFLUX (USA number) +180032823589
Remarks  
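
The +1-800-DATAFLUX example above implies that letters are converted to digits using the standard telephone keypad layout. The following Python sketch shows that conversion only. The function name `to_dial_string` is hypothetical, and the sketch is not the QKB's implementation: it does not add a country code to domestic numbers, and it does not strip labels, extensions, or comments such as "Tel:", "ext 1234", or "(USA number)".

```python
# Standard telephone keypad letter groups.
KEYPAD = {
    "2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
    "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ",
}
LETTER_TO_DIGIT = {
    letter: digit for digit, letters in KEYPAD.items() for letter in letters
}


def to_dial_string(value):
    """Keep a leading '+', map letters to keypad digits, drop everything else."""
    text = value.strip()
    prefix = "+" if text.startswith("+") else ""
    digits = []
    for ch in text.upper():
        if ch in "0123456789":
            digits.append(ch)
        elif ch in LETTER_TO_DIGIT:
            digits.append(LETTER_TO_DIGIT[ch])
        # Any other character (spaces, hyphens, parentheses) is dropped.
    return prefix + "".join(digits)


print(to_dial_string("+1-800-DATAFLUX"))
print(to_dial_string("+1 (919) 447-3000"))
```

For these two inputs the sketch yields +180032823589 and +19194473000, which agree with the documented outputs.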

 

Phone (with Country Code)
Description The Phone (with Country Code) standardization definition standardizes phone numbers for international use.
Examples Input Output
Tel: +64-4 917 6800 ext 1234 SAS Wellington +64 4 917 6800 x1234, Tel, SAS Wellington
(09) 914 4750 +64 9 914 4750
+1 (919) 447-3000 +1 9194473000
(0800) 234567 +64 800 234 567
Remarks  

 

Postal Code
Description The Postal Code standardization definition standardizes postal codes.
Example Input Output
-8252 8252
Remarks  

Inherited Definitions

In addition to the definitions listed on this page, the English, New Zealand locale also inherits all definitions for the English language and all Global definitions.