You are here: Definitions>English Definitions>English, Canada Definitions

SAS Quality Knowledge Base for Contact Information 25

English, Canada Definitions

Definitions for the English, Canada locale are described below.

Case Definitions
Gender Analysis Definitions

Identification Analysis Definitions

Match Definitions

Parse Definitions

Pattern Analysis Definitions

Standardization Definitions

Inherited Definitions

Case Definitions

Proper (Address)
Description The Case definition for Proper (Address) proper cases addresses.
Examples Input Output
BOX 12 RR 1010 Box 12 RR 1010
po box 125 PO Box 125
Remarks  

 

Proper (Name)
Description The Case definition for Proper (Name) proper cases names of individuals.
Examples Input Output
GWENDOL GUTTERMOTH Gwendol Guttermoth
JOHN O'ROURKE John O'Rourke
ronald chung-hui mcdonald Ronald Chung-Hui McDonald
JEAN D'EGLISE Jean d'Eglise
JEAN L'FLOR Jean l'Flor
Remarks  

 

Proper (Organization)
Description The Case definition for Proper (Organization) proper cases organization names.
Examples Input Output
DATAFLUX CORPORATION DataFlux Corporation
sas institute SAS Institute
HBC, COMPAGNIE DE LA BAIE D'HUDSON HBC, Compagnie de la Baie d'Hudson
MINISTÈRE DE L'ÉDUCATION Ministère de l'Éducation
Remarks  

 

Proper (Site)
Description The Case definition for Proper (Site) proper cases organization sites.
Examples Input Output
of new zealand of New Zealand
uk UK
VAL-D'OR Val-d'Or
SAINT-ROCH-DE-L'ACHIGAN Saint-Roch-de-l'Achigan
Remarks  

Gender Analysis Definitions

None.

Identification Analysis Definitions

Contact Info
Description The Identification Analysis definition for Contact Info identifies the contact information that is represented by a string.
Possible Outputs ADDR
CPP
UNK
ATTN
ADDR2
BLANK
ORG
IND
ACCT
POB
PHONE
Example Input Output
420 Park Ridge Rd ADDR
Calgary, AB T2A2R1 CPP
Guttermoth UNK
c/o Blanch ATTN
#900-A5 ADDR2
  BLANK
DataFlux Corp ORG
James Douglas Morrison IND
PO Box 15 POB
Remarks  

 

Individual/Organization
Description The Identification Analysis definition for Individual/Organization determines whether a string represents the name of an individual or an organization.
Possible Outputs INDIVIDUAL
ORGANIZATION
UNKNOWN
Examples Input Output
DataFlux Corporation ORGANIZATION
Tony Fisher INDIVIDUAL
Guttermoth UNKNOWN
Remarks  

 

Phone (Validation)
Description The Identification Analysis definition for Phone (Validation) determines whether a string represents a valid phone number pattern.
Possible Outputs VALID
INVALID
Examples Input Output
(514) 444-4444 VALID
(511) 444-4444 INVALID
(014) 444-4444 INVALID
(514) 411-4444 INVALID
333-3333 VALID
311-3333 INVALID
055-5555 INVALID
(514) 444-4444 x9 VALID
(514) 444-44449 INVALID
444-4444 x9 VALID
444-44449 INVALID
+51-4-444-444 VALID
+1 555 555 1212 INVALID
232 787 4477 INVALID
(777)777-7777 INVALID
(249)777-7777 VALID
514 728 4421 1234 VALID
Remarks

Requirements for validity:

  • The domestic area code must be within the range [2-9].
  • The second two digits of the domestic area code must not both be 1.
  • The area code must be a valid domestic area code.
  • The first digit of the base number must be within the range [2-9].
  • The second two digits of the base number must not both be 1.
  • In the absence of explicit indicators for extensions or country codes, the number of digits in the input cannot exceed or be less than 7 (if no area code is in the input), or 10 (if the area code is in the input).
  • A number cannot consist of 10 of the same digits.
  • If any or all of these rules fail, INVALID is returned.

 

Postal Code (Validation)
Description The Identification Analysis definition for Postal Code (Validation) determines if the pattern of the input is that of a valid domestic postal code.
Possible Outputs VALID
INVALID
Examples Input Output
N6A 5P2 VALID
N6A 5P2,,, VALID
CA-N6A5P2,,, VALID
M25 7E7 INVALID
R3TR 4T3 INVALID
G3E 1G^ INVALID
27513 INVALID
Remarks  

Match Definitions

Address
Description The Address match definition generates match codes which can be used to cluster records containing addresses.
Max Length of Match Code 51 characters
Examples Input Cluster ID
420 Park Ridge Rd 0
420 Park Ridge Road 0
420 Parkridge Rd 0
C/O John Smith, 420 Parkridge Rd 0
Waterford Apartments, 420 Parkridge Rd 0
888 Premiere rue 1
888 1re rue 1
888 First St 1
888 1st St 1
123 rue principale 2
123 Main St 2
211 Lacombe Rd 3
211 rue Lacombe 3
211 Lacombe Street 3
211 Lacombe Ave 3
14728A Perthshire Rd 5
14728 Perthshire Rd Unit A 5
Apt A-14728 Perthshire Rd 5
A-14728 Perthshire Rd 5
14728 Perthshire Rd #A 5
14728 Perthshire Rd 6
42 rue du Saint-Sacrement, CP 2222 7
42 rue du Saint-Sacrement, case postale 2222 7
42 rue du Saint-Sacrement, PO Box 2222 7
42 rue du Saint-Sacrement, boîte postale 2222 7
General Delivery, 2221 W. Broad Street 8
GD, 2221 W. Broad Street 8
poste restante, 2221 W. Broad Street 8
PR, 2221 W. Broad Street 8
General Delivery Stn Main, 2221 W. Broad Street 8
General Delivery Stn Don Mills, 2221 W. Broad Street 8
RR 29 9
Rural Route 29 9
ROUTE RURALE 29 9
IM 29 9
Itineraire motorise 29 9
Site 3 Comp 7 10
Site 3 Box 7 10
Comp 7 Site 3 10
900 North Leyton Avenue 11
900 Leyton Avenue 11
10-15-397 HUMBERLINE DR 12
397 HUMBERLINE DR, Unit 15, box 10 12
Remarks NoteNote: The results listed above reflect the default match sensitivity (85).

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 40 characters
Examples Input Cluster ID
Alberta 0
Albertta 0
Durham 1
Mont-Saint-Grégoire 2
Mont Saint Grégoire 2
L'Acadie 3
L Acadie 3
LAcadie 3
Duvernay Nord 4
Duvernay Est 4
Duvernay 4
Sainte-Marie 5
Saint Marie 5
St Marie 5
Fredericton Junction 6
Fredericton 6
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 46 characters
Examples Input Cluster ID
Calgary, AB T2A2R1 0
Calgary, Alberta T2A2R1 0
Mississauga, ON L4W1P9 1
Toronto ON A9A 9A9 2
Toronto ON A9A 9B0 2
DDO QC H8Y 3B8 3
D.D.O. QC, H8Y 3B8 3
Dollard-des-Ormeaux, QC, H8Y 3B8 3
Duvernay Nord AB T1S 1L5 4
Duvernay Est AB T1S 1L5 4
Duvernay AB T1S 1L5 4
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Name (with Suggestions)
Description The Name (with Suggestions) match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 21 characters
Examples Input Cluster ID
PRAIS HILTON 1
PARIS HILTON 1
HENRY NICKELSON 2
HENRY NICKERSON 2
NIKI WONG 3
ANIKI WONG 3
NIKI WONG 4
NICLOE WONG 4
Remarks

This match definition generates one or more match codes for each input string. Each match code represents a suggestion for what might be the true value of the input string; this enables two strings to be matched even when one or both strings contain a spelling mistake. For example, the name PRAIS might match the name PARIS, or the name NICLOE might match the name NIKI.

Note that a consequence of the generation of multiple match codes is that a record can be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition.

For more information on suggestion-based matching, refer to the "Suggestion-Based Matching" section of the DataFlux Data Management Studio Online Help.

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Passport
Description The Passport match definition generates match codes which can be used to cluster records containing passport numbers.
Max Length of Match Code 15 characters
Examples Input Cluster ID
AB212709 0
AB212705 0
AB215698 2
AB215695 2
AB015808 3
AB015809 3
AB015807 3
AB128990 4
AB128995 4
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Phone
Description The Phone match definition generates match codes which can be used to cluster records containing phone numbers.
Max Length of Match Code 22 characters
Examples Input Cluster ID
1-800-DATAFLUX 0
1 (800) 328-2358 0
345-6789 1
345-6780 1
345-6700 2
447-3000 ext 1234 3
447-3000 ext 1266 3
+1 (226)-823-5678 4
+1-226-823-5678 4
+1 226-823-5678 (after 4pm) 4
Work: +1 (226) 823-5678 4
MAISON: +1-226-823-5678 4
+1-226-823-5678 (BUREAU) 4
+44 (0)2012345000 5
+44 2012345000 5
011 44 2012345000 5
Remarks

Note that the number of digits retained in the match codes is a function of the number of digits in the base number and the match sensitivity.

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 16 characters
Examples Input Cluster ID
G7S4W9 0
G7S4Z9 0
G7S9Z9 1
CA E8C2R8 2
CDN E8C 2R8 2
E8C2R8-CA 2
E8C2R8-CDN 2
E8C-2R8 2
Postal Code: E8C2R8 2
Code Postal: E8C2R8 2
USA: 67106 3
67106-1234 3
67106 3
6VB 3C2 4
V6B 3C2 4
VB6 3C2 4
M1C 5G5 5
MIC 5G5 5
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Social Security Number
Description The Social Security Number match definition generates match codes which can be used to cluster records containing Social Security Numbers.
Max Length of Match Code 15 characters
Examples Input Cluster ID
121270123 0
121270584 1
121270951 2
121569585 3
121569841 4
101580741 5
101580753 6
101580957 7
110899545 8
110899540 8
Remarks

The match definition removes digits from the right for each sensitivity level.

NoteNote: The results listed above reflect the default match sensitivity (85).

 

State/Province
Description The State/Province match definition generates match codes which can be used to cluster records containing states and provinces.
Max Length of Match Code 15 characters
Examples Input Cluster ID
Manitoba 0
MB 0
NOUVEAU-BRUNSWICK 1
New Brunswick 1
ÎLE-DU-PRINCE-ÉDOUARD 2
ÎLE DU PRINCE ÉDOUARD 2
B.C. 3
BC 3
British Columbia 3
Colombie-Britannique 3
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Text
Description

The Text match definition generates match codes which can be used to cluster records containing general text strings.

Max Length of Match Code 15 characters
Examples Input Cluster ID
they went 0
you are 1
you're 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

Parse Definitions

Address
Description The Parse definition for Address parses addresses into a set of tokens.
Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
Example 1 Input Output
72 West Main St Apt #21 Recipient  
Building/Site  
Street 72 West Main St
Extension Apt #21
PO Box  
Additional Info  
Example 2 Input Output
Care of John Smith, Constitution Square, 360 SE Albert Street, Suite 1600 (Corner of Constitution and Main) Recipient Care of John Smith
Building/Site Constitution Square
Street 360 SE Albert Street
Extension Suite 1600
PO Box  
Additional Info (Corner of Constitution and Main)
Example 3 Input Output
18 rue Demers Recipient  
Building/Site  
Street 18 rue Demers
Extension  
PO Box  
Additional Info  
Example 4 Input Output
201-360-5500 Main Street Recipient  
Building/Site  
Street 5500 Main Street
Extension 360
PO Box 201
Additional Info  
Example 5 Input Output
534 KING GEORGE RD RR 6 Recipient  
Building/Site  
Street 534 KING GEORGE RD
Extension  
PO Box RR 6
Additional Info  
Example 6 Input Output
NE 22-12-21W Recipient  
Building/Site  
Street 22-12-21W
Extension NE
PO Box  
Additional Info  
Remarks  

 

Address (Detailed)
Description The Parse definition for Address (Detailed) parses addresses into a set of tokens with detailed street information.
Output Tokens Recipient
Building/Site
Street Number
Pre-direction
Street Name
Street Type
Post-direction
Extension
PO Box
Additional Info
Example 1 Input Output
72 West Main St Apt #21 Recipient  
Building/Site  
Street Number 72
Pre-direction West
Street Name Main
Street Type St
Post-direction  
Extension Apt #21
PO Box  
Additional Info  
Example 2 Input Output
Care of John Smith, Constitution Square, 360 SE Albert Street, Suite 1600 (Corner of Constitution and Main) Recipient Care of John Smith
Building/Site Constitution Square
Street Number 360
Pre-direction SE
Street Name Albert
Street Type Street
Post-direction  
Extension Suite 1600
PO Box  
Additional Info (Corner of Constitution and Main)
Example 3 Input Output
18 rue Demers Recipient  
Building/Site  
Street Number 18
Pre-direction  
Street Name Demers
Street Type rue
Post-direction  
Extension  
PO Box  
Additional Info  
Example 4 Input Output
201-360-5500 Main Street Recipient  
Building/Site  
Street Number 5500
Pre-direction  
Street Name Main
Street Type Street
Post-direction  
Extension 360
PO Box 201
Additional Info  
Example 5 Input Output
534 KING GEORGE RD RR 6 Recipient  
Building/Site  
Street Number 534
Pre-direction  
Street Name KING GEORGE
Street Type RD
   
Extension  
PO Box RR 6
Additional Info  
Example 6 Input Output
NE 22-12-21W Recipient  
Building/Site  
Street Number 22
Pre-direction  
Street Name 12-21W
Street Type  
Post-direction  
Extension NE
PO Box  
Additional Info  
Remarks  

 

Address (Global)
Description

The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 72 West Main St Apt #21 Recipient  
Building/Site  
Street 72 West Main St
Extension Apt #21
PO Box  
Additional Info  
  Input Output
Example 2 Care of John Smith, Constitution Square, 360 SE Albert Street, Suite 1600 (Corner of Constitution and Main) Recipient Care of John Smith
Building/Site Constitution Square
Street 360 SE Albert Street
Extension Suite 1600
PO Box  
Additional Info (Corner of Constitution and Main)
  Input Output
Example 3 18 rue Demers Recipient  
Building/Site  
Street 18 rue Demers
Extension  
PO Box  
Additional Info  
  Input Output  
Example 4 201-360-5500 Main Street Recipient  
Building/Site  
Street 5500 Main Street
Extension 360
PO Box 201
Additional Info  
  Input Output
Example 5 534 KING GEORGE RD RR 6 Recipient  
Building/Site  
Street 534 KING GEORGE RD
Extension  
PO Box RR 6
Additional Info  
  Input Output  
Example 6 NE 22-12-21W Recipient  
Building/Site  
Street 22-12-21W
Extension NE
PO Box  
Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back.

 

Address (Global) (v23)
Description

The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output
Example 1 72 West Main St Apt #21 Recipient  
Building/Site  
Street 72 West Main St
Extension Apt #21
PO Box  
Additional Info  
  Input Output
Example 2 Care of John Smith, Constitution Square, 360 SE Albert Street, Suite 1600 (Corner of Constitution and Main) Recipient Care of John Smith
Building/Site Constitution Square
Street 360 SE Albert Street
Extension Suite 1600
PO Box  
Additional Info (Corner of Constitution and Main)
  Input Output
Example 3 18 rue Demers Recipient  
Building/Site  
Street 18 rue Demers
Extension  
PO Box  
Additional Info  
  Input Output  
Example 4 201-360-5500 Main Street Recipient  
Building/Site  
Street 5500 Main Street
Extension 360
PO Box 201
Additional Info  
  Input Output
Example 5 534 KING GEORGE RD RR 6 Recipient  
Building/Site  
Street 534 KING GEORGE RD
Extension  
PO Box RR 6
Additional Info  
  Input Output  
Example 6 NE 22-12-21W Recipient  
Building/Site  
Street 22-12-21W
Extension NE
PO Box  
Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB.

The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back.

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
Example 1 Input Output
Calgary, AB T2A2R1 City Calgary
State/Province AB
Postal Code T2A2R1
Additional Info  
Example 2 Input Output
Toronto, Ontario S9H3W4 (CDN) City Toronto
State/Province Ontario
Postal Code S9H3W4
Additional Info (CDN)
Example 3 Input Output
ST-BENOIT LABRE ( LAC POULIN ) QC G0M 1P0 City ST-BENOIT LABRE ( LAC POULIN )
State/Province QC
Postal Code G0M 1P0
Additional Info  
Remarks  

 

City - State/Province - Postal Code (Global)
Description The Parse definition for City - State/Province - Postal Code (Global) parses "last line" address information into a globally recognized set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
Example 1 Input Output
Calgary, AB T2A2R1 City Calgary
State/Province AB
Postal Code T2A2R1
Additional Info  
Example 2 Input Output
Toronto, Ontario S9H3W4 (CDN) City Toronto
State/Province Ontario
Postal Code S9H3W4
Additional Info (CDN)
Example 3 Input Output
ST-BENOIT LABRE ( LAC POULIN ) QC G0M 1P0 City ST-BENOIT LABRE ( LAC POULIN )
State/Province QC
Postal Code G0M 1P0
Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Phone
Description The Parse definition for Phone parses phone numbers into a set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
Example 1 Input Output
Work: +1 418 447-3000 Ext 3118 (ask for Ron) Country Code +1
Area Code 418
Base Number 447-3000
Extension 3118
Line Type Work:
Additional Info (ask for Ron)
Example 2 Input Output
011 44 2012345000 Country Code 011 44
Area Code  
Base Number 2012345000
Extension  
Line Type  
Additional Info  
Example 3 Input Output
Toll Free: (800) 888-4257 Country Code  
Area Code 800
Base Number 888-4257
Extension  
Line Type Toll Free:
Additional Info  
Example 4 Input Output
705 424-1442 3118 (w) Country Code  
Area Code 705
Base Number 424-1442
Extension 3118
Line Type (w)
Additional Info  
Example 5 Input Output
705 424-1442 3118 (Mais) Country Code  
Area Code 705
Base Number 424-1442
Extension 3118
Line Type (Mais)
Additional Info  
Example 6 Input Output
506 447 3000 Poste 1234 Country Code  
Area Code 506
Base Number 447 3000
Extension 1234
Line Type  
Additional Info  
Remarks  

 

Phone (Global)
Description The Parse definition for Phone (Global) parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
Example 1 Input Output
Work: +1 418 447-3000 Ext 3118 (ask for Ron) Country Code +1
Area Code 418
Base Number 447-3000
Extension 3118
Line Type Work:
Additional Info (ask for Ron)
Example 2 Input Output
011 44 2012345000 Country Code 011 44
Area Code  
Base Number 2012345000
Extension  
Line Type  
Additional Info  
Example 3 Input Output
Toll Free: (800) 888-4257 Country Code  
Area Code 800
Base Number 888-4257
Extension  
Line Type Toll Free:
Additional Info  
Example 4 Input Output
705 424-1442 3118 (w) Country Code  
Area Code 705
Base Number 424-1442
Extension 3118
Line Type (w)
Additional Info  
Example 5 Input Output
705 424-1442 3118 (Mais) Country Code  
Area Code 705
Base Number 424-1442
Extension 3118
Line Type (Mais)
Additional Info  
Example 6 Input Output
506 447 3000 Poste 1234 Country Code  
Area Code 506
Base Number 447 3000
Extension 1234
Line Type  
Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Postal Code
Description The Parse definition for Postal Code parses postal codes into a set of tokens.
Output Tokens Forward Sortation Area
Local Delivery Unit
Additional Info
Example 1 Input Output
Postal Code: L6H4W4 Forward Sortation Area L6H
Local Delivery Unit 4W4
Additional Info Postal Code
Example 3 Input Output
4W4L6H Forward Sortation Area L6H
Local Delivery Unit 4W4
Additional Info  
Remarks  

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description The Standardization definition for Address standardizes addresses.
Examples Input Output
420 park ridge road 420 Park Ridge Rd
5th avenue 5th Ave
123 Main Street West 123 Main St W
123 West St 123 West St
18 rue Albert Nord 18 rue Albert N
18 rue Nord 18 rue Nord
18 Première rue 18 1re rue
18 Rue Première 18 1re rue
123 First St 123 1st St
1000 Broadway Ave #932 932-1000 Broadway Ave
U135-6440 Main St 6440 Main St, Unit 135
U5-8682 rue Canvin 8682 rue Canvin, Unité 5
BP 100 CP 100
POB 3 PO Box 3
GENERAL DELIVERY STATION MAIN GD STN Main
RURAL ROUTE #6 RR 6
Remarks  

 

City
Description The Standardization definition for City standardizes city names.
Examples Input Output
vancouver Vancouver
TORONTO Toronto
T.O. Toronto
NOTL Niagara on the Lake
DDO Dollard-des-Ormeaux
Sherbrooke (Ascot) Ascot, Sherbrooke
Mt-Saint-Grégoire Mont-Saint-Grégoire
Montreal Montréal
Notre Dame Notre-Dame
Saint Albert St Albert
Saint Jean Baptiste St Jean Baptiste
Remarks The Standardization definition for City standardizes city names according to the Canada Post SERP database. If two conflicting spellings exist as the SERP standard, this definition selects the English spelling, as seen in the final two examples. Common city abbreviations are expanded into full names. Accents and hyphens are added onto French city names, where applicable.

 

City - State/Province - Postal Code
Description The Standardization definition for City - State/Province - Postal Code standardizes address "last line" data.
Examples Input Output
calgary alberta T2A2R1 Calgary, AB T2A2R1
S9H 3W4 Toronto ON Toronto ON S9H 3W4
Toronto, ON, S9H 3W4 Toronto ON S9H 3W4
E Angus QC J0B 1A0 East Angus QC J0B 1A0
DDO QC H8Y 3B8 Dollard-des-Ormeaux QC H8Y 3B8
Toronto ON CA-S9H3W4 Toronto ON S9H 3W4
Saint Albert AB T8N 1M9 St Albert AB T8N 1M9
Saint Jean Baptiste MB R0G 2B0 St Jean Baptiste MB R0G 2B0
Remarks The Standardization definition for City - State/Province - Postal Code standardizes city names according to the Canada Post SERP database. If two conflicting spellings exist as the SERP standard, this definition selects the English spelling, as seen in the final two examples. Common city abbreviations are expanded into full names. Accents and hyphens are added onto French city names, where applicable.

 

City - State/Province - Postal Code (International)
Description The Standardization definition for City - State/Province - Postal Code (International) standardizes "last line" address information and formats Postal Code with country code prefix.
Examples Input Output
calgary alberta T2A 2R1 CALGARY AB  CA-T2A2R1
!S9H 3W4 Toronto ON TORONTO ON  CA-S9H3W4
Toronto, ON, S9H 3W4 TORONTO ON  CA-S9H3W4
E Angus QC J0B 1A0 EAST ANGUS QC   CA-J0B1A0
DDO QC H8Y 3B8 DOLLARD-DES-ORMEAUX QC  CA-H8Y3B8
Toronto ON CN S9H3W4 TORONTO ON  CA-S9H3W4
Toronto ON CDN-S9H3W4 TORONTO ON  CA-S9H3W4
Saint Albert AB T8N 1M9 ST ALBERT AB  CA-T8N1M9
Saint Jean Baptiste MB R0G 2B0 ST JEAN BAPTISTE MB   CA-R0G2B0
Remarks The Standardization definition for City - State/Province - Postal Code (International) standardizes city names according to the Canada Post SERP database. If two conflicting spellings exist as the SERP standard, this definition selects the English spelling, as seen in the final two examples. Common city abbreviations are expanded into full names. Accents and hyphens are added onto French city names, where applicable. Note that two spaces are inserted before the postal code, per the Universal Postal Union (UPU) standard for addressing mail for international use.

 

Name
Description The Standardization definition for Name standardizes names of individuals.
Examples Input Output
john donovan John Donovan
mister morrison junior Mr Morrison, Jr
Jean Luc Beauregard Jean-Luc Beauregard
FRANCOIS CASGRAIN François Casgrain
CLOE GRESLAND Cloé Gresland
Monsieur Jean Picard Mr Jean Picard
Remarks Adds accents onto French names where appropriate. Translates French honorofics (for example, Monsieur) into English.

 

Organization
Description The Standardization definition for Organization standardizes organization names.
Examples Input Output
Robe River Iron Associates Robe River Iron Associates
Moffat Limited Australia Moffat Ltd, Australia
A.T.W.Products Sp. z o.o. ATW Products SP ZOO
MRC D'ARGENTEUIL MRC d'Argenteuil
MINISTERE DE L'EDUCATION Ministère de l'Education
L'Oréal Limitée L'Oréal Ltée
L'Oréal Enregistrée L'Oréal Enr
Remarks  

 

Passport
Description The Standardization definition for Passport standardizes passport numbers.
Examples Input Output
ΑB123456 ΑB123456
ΑB 123456 ΑB123456
ΑB-123456 ΑB123456
ΑB/123456 ΑB123456
Remarks  

 

Phone
Description The Standardization definition for Phone standardizes phone numbers for domestic use.
Examples Input Output
+1 438 961-1317 (438) 961 1317
Work: (204) 940-6100 (204) 940 6100, Work
4509745060 (450) 974 5060
8009611317 800 961 1317
(212) 987-7654 EXT 456 (212) 987 7654 x456
(212) 987-7654 Poste 456 (212) 987 7654 x456
0044 (0)20 12345000 +44 2012345000
011-81-53-460-2871 +81 534602871
Remarks  

 

Phone (with Country Code)
Description The Standardization definition for Phone (with Country Code) standardizes phone numbers for international use.
Examples Input Output
(438) 987-7654 +1 438 987 7654
1(289)5466306 +1 289 546 6306
(450) 242-8202 - after 4pm +1 450 242 8202, After 4PM
(506) 987-7654 EXT 456 +1 506 987 7654 x456
(506) 987-7654 Poste 456 +1 506 987 7654 x456
800-961-1317 +1 800 961 1317
0044 (0)20 12345000 +44 2012345000
011-81-53-460-2871 +81 534602871
9406100 940 6100
Remarks If a number is recognized as a domestic number, and no area code is present, the domestic country code is not added, as seen in the final example.

 

Phone (Electronic)
Description The Standardization definition for Phone (Electronic) standardizes phone numbers for automated calling systems.
Examples Input Output
Work: +1-418-447-3000 Ext 3118 (ask for Ron) +14184473000
Fax: 226 9611317 +12269611317
306.736.4436 +13067364436
+1-800-DATAFLUX +180032823589
0044 (0)20 12345000 +442012345000
9406100 9406100
Remarks If a number is recognized as a domestic number, and no area code is present, the domestic country code is not added, as seen in the final example.

 

Postal Code
Description The Standardization definition for Postal Code standardizes postal codes.
Examples Input Output
H2N-1Z2 H2N 1Z2
h1n2w2 H1N 2W2
CDN E8C2R8 E8C 2R8
U-27514 US-27514
U.S.A. 27514 US-27514
!V6B 2R7 V6B 2R7
Postal Code: S9H 3W4 S9H 3W4
Code Postal: S9H 3W4 S9H 3W4
V6B C56 V6B 5C6
MIC 5G5 M1C 5G5
FR. 12345 FR-12345
12345 6789 12345-6789
Remarks If a postal code contains 5 digits, followed by a space or hyphen, followed by 4 digits, it is recognized as a United States postal code, and a hyphen is placed between the two sets of digits per U.S. convention.

 

Postal Code (with Country Code)
Description The Standardization definition for Postal Code (with Country Code) standardizes postal codes for international use.
Examples Input Output
H2N-1Z2 CA-H2N1Z2
h1n2w2 CA-H1N2W2
CDN E8C2R8 CA-E8C2R8
U-27514 US-27514
U.S.A. 27514 US-27514
!V6B 2R7 CA-V6B2R7
Postal Code: S9H 3W4 CA-S9H3W4
Code Postal: S9H 3W4 CA-S9H3W4
V6B C56 CA-V6B5C6
MIC 5G5 CA-M1C5G5
FR. 12345 FR-12345
12345 6789 US-12345-6789
Remarks If a postal code contains 5 digits, followed by a space or hyphen, followed by 4 digits, it is recognized as a United States postal code, and a hyphen is placed between the two sets of digits per U.S. convention.

 

Postal Code (Advanced Correction)
Description The Standardization definition for Postal Code (Advanced Correction) performs advanced corrections to malformed domestic postal codes.
Examples Input Output
M5PM 1T7 M5M 1T7
P-B 1M0 P9B 1M0
,0J1P- M0J 1P9
M2N`W3 M2N 1W3
Remarks

This Standardization definition is designed to handle malformed postal codes that have not undergone edit checks. The definition makes the best possible corrections. The Standardization definition for Postal Code (Advanced Correction) can be used especially for postal codes that do not parse correctly using the Postal Code Parse definition. Corrections are based on keyboard proximity and misreading of letters and numbers that look similar (such as, S and 5 ). This definition corrects common typographical errors, such as:

  • A comma in the position where a letter should be is typically an M typo.
  • A 6 in the position where a letter should be is typically a misreading of G.
  • A W in the position where a number should be is typically a 2 typographical error.
  • A $ in the position where a number should be is typically a 4 typographical error.
Corrections are based on common problems found in a survey of 31 million records from Passport Canada.

 

Social Security Number
Description The Standardization definition for Social Security Number standardizes Social Security Numbers.
Examples Input Output
SSN: 220379123 220 379 123
13 04 56 789 130 456 789
130-456-789 130 456 789
Numéro d’assurance sociale: 220379123 220 379 123
Remarks  

 

State/Province
Description The Standardization definition for State/Province standardizes provinces to their full name.
Examples Input Output
MB Manitoba
NB New Brunswick
QC Québec
BC British Columbia
Province of Alberta Alberta
Newfoundland Newfoundland and Labrador
IPE Île-du-Prince-Édouard
NOUVEAU BRUNSWICK Nouveau-Brunswick
Remarks Official Postal ISO abbreviations are expanded to their full English form. Non-official abbreviations are expanded to the full form in the source language of the abbreviation. Following this behavior, translations from French to English do not occur, as evidenced in the final two examples.

 

State/Province (Postal Standard)
Description The Standardization definition for State/Province (Postal Standard) standardizes provinces to their ISO 3166-2 two-letter code.
Examples Input Output
Manitoba MB
NOUVEAU BRUNSWICK NB
New Brunswick NB
Québec QC
British Columbia BC
Province of Alberta AB
Newfoundland NL
Newfoundland and Labrador NL
PQ QC
Remarks  

Inherited Definitions

In addition to the definitions listed on this page, the English, Canada locale also inherits all definitions for the English language and all Global definitions.