SAS Quality Knowledge Base for Contact Information 26
Definitions for the Portuguese, Portugal locale are described below.
Case Definitions
Extraction Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (City) | ||
---|---|---|
Description | The Proper (City) case definition propercases city names. | |
Input | Output | |
Example | V.N. FAMALICAO | V.N. Famalicao |
Remarks |
Proper (Name) | ||
---|---|---|
Description | The Proper (Name) case definition propercases names of individuals. | |
Input | Output | |
Examples | JOÃO MORGADO | João Morgado |
MARIA SOARES D'ALBERGARIA | Maria Soares D'Albergaria | |
Remarks |
None.
Name | ||
---|---|---|
Description | The Name gender analysis definition determines the gender of a name. | |
Possible Outputs | M F U |
|
Input | Output | |
Examples | Celeste de Jesus Cordeiro | F |
Jose Maria Martins | M | |
M. Antunes | U | |
Remarks |
Individual/Organization | ||
---|---|---|
Description | The Individual/Organization identification analysis definition determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | Individual Organization Unknown |
|
Input | Output | |
Examples | Blue Light, Lda | Organization |
Antonio Maria de Jesus Castro | Individual | |
Auto Taxis Antonio Pedro | Organization | |
Marmocari | Unknown | |
Remarks |
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 23 characters | |
Input | Cluster ID | |
Examples | Travessa da Corredoura, 90 - 1º Andar | 0 |
TV DA CORREDOURA N 90 | 0 | |
PCT ANTONIO SERGIO 12 7 DT | 1 | |
Remarks |
|
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
S Martinho de Bougado | 0 | |
Martinho Bougado | 0 | |
Porto | 1 | |
Remarks |
|
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 41 characters | |
Input | Cluster ID | |
Examples | S.Domingos Rana 1100 | 0 |
São Domingos de Rana, 1100 | 0 | |
Vimioso 5230 | 1 | |
Remarks |
|
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 22 characters | |
Input | Cluster ID | |
Examples | Rui Manuel do Carmo Lopes | 0 |
RUI MANUEL CARMO LOPES | 0 | |
Jose Maria Jesus Costa | 1 | |
Remarks |
|
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 32 characters | |
Input | Cluster ID | |
Examples | AGOSTINHO CERAMICA, LDA | 0 |
AGOSTINO CERAMICA LDA | 0 | |
SASINST Software Lda | 1 | |
Remarks |
|
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | +351 218746636 | 0 |
218746636 | 0 | |
210 316 000 | 1 | |
Remarks |
|
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | -1100 | 0 |
1100 | 0 | |
1600 | 1 | |
Remarks |
|
Address | |||
---|---|---|---|
Description | The Address parse definition parses addresses into a set of tokens. | ||
Output Tokens | Street Type Street Name Street Number Extension House Name Neighborhood |
||
Input | Output Token | Output | |
Example | Rua Miguel de Farias 1226 bl. 2 | Street Type | Rua |
Street Name | Miguel de Farias | ||
Street Number | 1226 | ||
Extension | bl. 2 | ||
House Name | |||
Neighborhood | |||
Remarks |
Address (Full) | |||
---|---|---|---|
Description | The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens. | ||
Output Tokens | Street Type Street Name Street Number Extension House Name Neighborhood Locality City Postal Code Postal City |
||
Input | Output Token | Output | |
Example | Rua Elias Garcia 12 3ºB Campo Pequeno, Lisboa | Street Type | Rua |
Street Name | Elias Garcia | ||
Street Number | 12 | ||
Extension | 3ºB | ||
House Name | |||
Neighborhood | Campo Pequeno | ||
Locality | Lisboa | ||
City | |||
Postal Code | |||
Postal City | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output Token | Output | |
Example | Rua Miguel de Farias 1226 bl. 2 | Recipient | |
Building/Site | |||
Street | Rua Miguel de Farias 1226 | ||
Extension | bl. 2 | ||
PO Box | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens. | ||
Output Tokens | Neighborhood Locality City Postal Code Postal City |
||
Input | Output Token | Output | |
Example | Campo Grande, 1600 Lisboa | Neighborhood | Campo Grande |
Locality | |||
City | |||
Postal Code | 1600 | ||
Postal City | Lisboa | ||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Input | Output Token | Output | |
Example | 1600 Lisboa | City | Lisboa |
State/Province | |||
Postal Code | 1600 | ||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name | |||
---|---|---|---|
Description | The Name parse definition parses names of individuals into a set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output Token | Output | |
Example 1 | James Goodnight | Prefix | |
Given Name | James | ||
Middle Name | |||
Family Name | Goodnight | ||
Suffix | |||
Title/Additional Info | |||
Input | Output Token | Output | |
Example 2 | Carlos Augusto Medeiros da Silva | Prefix | |
Given Name | Carlos | ||
Middle Name | Augusto Medeiros | ||
Family Name | da Silva | ||
Suffix | |||
Title/Additional Info | |||
Remarks |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output Token | Output | |
Example 1 | James Goodnight | Prefix | |
Given Name | James | ||
Middle Name | |||
Family Name | Goodnight | ||
Suffix | |||
Title/Additional Info | |||
Input | Output Token | Output | |
Example 2 | Carlos Augusto Medeiros da Silva | Prefix | |
Given Name | Carlos | ||
Middle Name | Augusto Medeiros | ||
Family Name | da Silva | ||
Suffix | |||
Title/Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Phone | |||
---|---|---|---|
Description | The Phone parse definition parses phone numbers into a set of tokens. | ||
Output Tokens | Prefix Country Code Number Extension Comment |
||
Input | Output Token | Output | |
Example | +351218746636 ext 1121 | Prefix | |
Country Code | +351 | ||
Number | 218746636 | ||
Extension | ext 1121 | ||
Comment | |||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output Token | Output | |
Example | +351218746636 ext 1121 | Country Code | +351 |
Area Code | |||
Base Number | 218746636 | ||
Extension | ext 1121 | ||
Line Type | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
None.
Address | ||
---|---|---|
Description | The Address standardization definition standardizes addresses. | |
Input | Output | |
Examples | R JUNQUEIRA N 47 | Rua Junqueira 47 |
r antonio nobre 4 bloco 2 | Rua Antonio Nobre 4, Bl 2 | |
Remarks |
City | ||
---|---|---|
Description | The City standardization definition standardizes city names. | |
Input | Output | |
Examples | S TIAGO | São Tiago |
agueda | Águeda | |
Remarks |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code standardization definition standardizes last line address information. | |
Input | Output | |
Example | 4535 Sta Maria Lamas | 4535 SANTA MARIA DE LAMAS |
Remarks |
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Input | Output | |
Example | ENGENHEIRO ANTONIO MARIA DE JESUS CASTRO | ANTONIO MARIA DE JESUS CASTRO, ENG |
Remarks |
Organization | ||
---|---|---|
Description | The Organization standardization definition standardizes organization names. | |
Input | Output | |
Example | blue light, limitada | BLUE LIGHT LDA |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Input | Output | |
Example | +351218746636 | +351 - 218746636 |
Remarks |
Postal Code | ||
---|---|---|
Description | The Postal Code standardization definition standardizes postal codes. | |
Input | Output | |
Example | ,1600 | 1600 |
Remarks |
In addition to the definitions listed on this page, the Portuguese, Portugal locale also inherits all definitions for the Portuguese language and all Global definitions. The following definitions are not inherited:
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_PTPRT_defs.html |