SAS Quality Knowledge Base for Contact Information 26

Portuguese, Brazil Definitions

Definitions for the Portuguese, Brazil locale are described below.

Case Definitions

Extraction Definitions

Gender Analysis Definitions

Identification Analysis Definitions

Match Definitions

Parse Definitions

Pattern Analysis Definitions

Standardization Definitions

Inherited Definitions

Case Definitions

None.

Extraction Definitions

None.

Gender Analysis Definitions

Name
Description The Name gender analysis definition determines the gender of a name.
Possible Outputs M
F
U
  Input Output
Examples Celeste de Jesus Cordeiro F
Jose Maria Martins M
M. Antunes U
Remarks

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name gender analysis definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

The Name definition accepts parsed input and the input tokens have been changed. The token change will require you to update any jobs using the Gender Analysis (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Gender Analysis node will not require an update.

 

Name (v24)
Description

The Name (v24) gender analysis definition determines the gender of a name.

Possible Outputs M
F
U
  Input Output
Examples M. Antunes U
Jose Maria Martins M
Celeste de Jesus Cordeiro F
Remarks

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name gender analysis definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

 

Identification Analysis Definitions

Name/Organization
Description The Name/Organization identification analysis definition determines whether a string represents the name of an individual or an organization.
Possible Outputs NAME
ORGANIZATION
NAME/ORGANIZATION
UNKNOWN
  Input Output
Examples SAS Brasil ORGANIZATION
Marcel Pasquini NAME
João Fernandes, Oftalmogia Fernandes NAME/ORGANIZATION
PEDRO SA NAME
TEXTILES PEDRO SA ORGANIZATION
PEDRO S/A ORGANIZATION
Doutor João Tomazelli NAME
Doutor do Pneu ORGANIZATION
Almeida, Pasquini, e Pichatelli ORGANIZATION
Fernandes UNKNOWN
Fernandes LTDA ORGANIZATION
Remarks  

 

Offensive
Description The Offensive identification analysis definition identifies potentially offensive words and phrases.
Possible Outputs OFFENSIVE
NOT OFFENSIVE
  Input Output
Examples Sr. João Carlos Mulherengo OFFENSIVE
Sr. João Bumbum Da Silva OFFENSIVE
Baixinho Da Silva OFFENSIVE
Sr. João Carlos Da Silva NOT OFFENSIVE
Remarks  

Match Definitions

Address
Description

The Address match definition generates match codes which can be used to cluster records containing addresses.

Max Length of Match Code 75 characters
  Input Cluster ID
Examples Rua Miguel Sa 1226 0
Rua Miguel Sa 1226 bl. 2 0
Rua General Miguel Sa 1226 0
Rua General Miguel Sa 1226, apto 503 1
Rua Passo do Norte, 21 2
Rua Passo Norte, 21 2
Trav Costa Azul, 23 3
Av Costa Azul, 23 3
Rua Costa Azul, 23 3
Costa Azul, 23 3
Rua Esperança num 23 4
Rua Esperança Nº 23 4
Rua Esperança n 23 4
Rua Esperança 23 4
CXP 12345 5
Caixa Postal 12345 5
C.P. 12345 5
Remarks

Note: The results listed above reflect the default match sensitivity (85).

Area-type extension information (for example, Bloco, Lote) is retained at sensitivities 90 and 95.

 

Address (Full)
Description

The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses.

Max Length of Match Code 131 characters
  Input Cluster ID
Examples Rua Miguel Sa 1226, Itaim Bibi, São Paulo - SP, 04530-000 0
Rua Miguel Sa 1226 bl. 2, Itaim Bibi, São Paulo - SP, 04530-000 0
Rua General Miguel Sa 1226, Itaim Bibi, São Paulo - SP, 04530-000 0
Rua General Miguel Sa 1226, apto 503, Itaim Bibi, São Paulo - SP, 04530-000 1
Rua Passo do Norte, 21, Itaim Bibi, São Paulo - SP, 04530-000 2
Rua Passo Norte, 21, Itaim Bibi, São Paulo - SP, 04530-000 2
Trav Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 3
Av Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 3
Rua Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 3
Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 3
Rua Esperança num 23, Itaim Bibi, São Paulo - SP, 04530-000 4
Rua Esperança Nº 23, Itaim Bibi, São Paulo - SP, 04530-000 4
Rua Esperança n 23, Itaim Bibi, São Paulo - SP, 04530-000 4
Rua Esperança 23, Itaim Bibi, São Paulo - SP, 04530-000 4
CXP 12345, Itaim Bibi, São Paulo - SP, 04530-000 5
Caixa Postal 12345, Itaim Bibi, São Paulo - SP, 04530-000 5
C.P. 12345, Itaim Bibi, São Paulo - SP, 04530-000 5
Avenida Gomes Freire, 430, Rocha, São Gonçalo, RJ 24400-000 6
Avenida Gomes Freire, 430, São Gonçalo, RJ 24400-000 6
Avenida Gomes Freire, 430, São Gonçalo, 24400-000 6
Avenida Gomes Freire, 430, São Gonçalo, 24400-100 6
Avenida Gomes Freire, 430, São Gonçalo, 24401-100 7
Aven. Nova America, 2001, Ouro Branco MG 36420-000 8
Aven. Nova America, 2001, Ouro Branco, Minas Gerais 36420-000 8
Rua Sao Jose, 44, Camaçari BA, CEP 42810-000 9
Rua Sao Jose, 44, Camaçari BA 42810-000 9
Remarks

Note: The results listed above reflect the default match sensitivity (85).

Area-type extension information (for example, Bloco, Lote) is retained at sensitivities 90 and 95.

 

Address (PO Box Only)
Description

The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address.

Max Length of Match Code 15 characters
  Input Cluster ID
Examples Rua Tiradentes 99, CX Postal 112 0
Rua Almeida 123, CX Postal 112 0
CX Postal 112 0
CX Postal 222 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Address (Street Only)
Description

The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address.

Max Length of Match Code 67 characters
  Input Cluster ID
Examples Rua Almeida, 233, Caixa Postal 54321 0
Rua Almeida, 233, Caixa Postal 12345 0
Rua Almeida, 233 0
Rua Almeida, 522 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

Area-type extension information (for example, Bloco, Lote) is retained at sensitivities 90 and 95.

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples Sao Paulo 0
S Paulo 0
São Paulo 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples 20080-003 Rio de Janeiro-RJ 0
Rio de Janeiro RJ 20080-003 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Name
Description The Name match definition generates match codes which can be used to cluster records containing names of individuals.
Max Length of Match Code 28 characters
  Input Cluster ID
Examples Luiz de Souza Cabral 0
LUIS SOUZA E CABRAL 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name match definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

The Name match definition has changed in the following ways:

The Name definition accepts parsed input and the input tokens have been changed. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Match Codes node will not require an update.

The match code length has been changed. This change might require you to update any jobs using the Name definition so that the match code fields can handle the new length.

 

Name (v24)
Description

The Name (v24) match definition generates match codes which can be used to cluster records containing names of individuals.

Max Length of Match Code 28 characters
  Input Cluster ID
Examples Luiz de Souza Cabral 0
LUIS SOUZA E CABRAL 0
  LUIS SILVA 1
  Dr. Gerson da Silva 2
  Gerson Silva 2
  JERSON DANIEL DA SILVA 2
Remarks

Note: The results listed above reflect the default match sensitivity (85).

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name match definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

 

Organization
Description The Organization match definition generates match codes which can be used to cluster records containing organization names.
Max Length of Match Code 20 characters
  Input Cluster ID
Examples Telefonica S/A 0
Telefonica 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Phone
Description

The Phone match definition generates match codes which can be used to cluster records containing phone numbers.

Max Length of Match Code 22 characters
  Input Cluster ID
Examples (66) 4134 3945 0
041 (66) 4134 3945 0
+55 (66) 4134 3945 0
0041 55 (66) 4134 3945 0
+54 11 5208 3458 1
+54 (0)11 5208 3458 1
Trabalho: 31 4501 5452 (Peça para Maria) 2
31 4501 5452 2
(11) 96789-1234 3
(11) 96789-1230 3
(11) 96789-1200 4
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples 20080-003 0
20080003 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).
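
To illustrate how match codes lead to cluster IDs, the following Python sketch groups records that share a match code. The function `toy_match_code` is a hypothetical stand-in that only keeps the digits of the input. It is not the QKB match algorithm and does not produce real match codes; it merely mimics the fact that the two example inputs above fall into the same cluster.

```python
import re


def toy_match_code(postal_code: str) -> str:
    """Hypothetical stand-in for a match code: keep only the digits."""
    return re.sub(r"\D", "", postal_code)


def assign_cluster_ids(values):
    """Give records that share a match code the same cluster ID."""
    cluster_by_code = {}
    result = []
    for value in values:
        code = toy_match_code(value)
        # A code seen for the first time gets the next unused ID (0, 1, 2, ...).
        cluster_id = cluster_by_code.setdefault(code, len(cluster_by_code))
        result.append((value, cluster_id))
    return result


clusters = assign_cluster_ids(["20080-003", "20080003", "04530-000"])
```

Here the first two inputs receive cluster ID 0 and the third, a different postal code, receives cluster ID 1.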

 

State
Description The State match definition generates match codes which can be used to cluster records containing names of states.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples Mato Grosso do Sul 0
ms 0
Mato Groso do Sul 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

The State match definition is being renamed to State/Province. State is now deprecated and will be removed in a future release. Please change your jobs to use the State/Province match definition.

 

State/Province
Description

The State/Province match definition generates match codes which can be used to cluster records containing states and provinces.

Max Length of Match Code 15 characters
  Input Cluster ID
Examples Mato Grosso do Sul 0
ms 0
Mato Groso do Sul 0
Rio Grande do Sul 1
Remarks

Note: The results listed above reflect the default match sensitivity (85).

 

Text
Description The Text match definition generates match codes which can be used to cluster records containing general text strings.
Max Length of Match Code 15 characters
  Input Cluster ID
Example Data Management 0
Remarks

Note: The results listed above reflect the default match sensitivity (85).

Parse Definitions

Address
Description

The Address parse definition parses addresses into a set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output Token Output
Example 1 Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) Recipient Gerson Almeida
Building/Site Condominio Boa Esperança
Street Rua Professor Marcel Pasquini 23
Extension  
PO Box caixa postal 14
Additional Info (informações adicionais)
  Input Output Token Output
Example 2 Av José Andraus Gassani 2464, Apto 3 Recipient  
Building/Site  
Street Av José Andraus Gassani 2464
Extension Apto 3
PO Box  
Additional Info  
  Input Output Token Output
Example 3 SQS 415 BL B APT 400 Recipient  
Building/Site  
Street SQS 415
Extension BL B APT 400
PO Box  
Additional Info  
  Input Output Token Output
Example 4 1a. avenida jose luiz cavalcante 14/piso 6 Recipient  
Building/Site  
Street 1a. avenida jose luiz cavalcante 14
Extension piso 6
PO Box  
Additional Info  
  Input Output Token Output
Example 5 ACF BARAO DE LIMEIRA Caixa postal 25 Recipient  
Building/Site  
Street  
Extension  
PO Box ACF BARAO DE LIMEIRA Caixa postal 25
Additional Info  
  Input Output Token Output
Example 6 Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 Recipient  
Building/Site Condominio da Barra 4
Street Rua Tristão da Silva, 125
Extension Casa 2
PO Box  
Additional Info  
Remarks  

 

Address (Detailed)
Description The Address (Detailed) parse definition parses addresses into a set of tokens with detailed street information.
Output Tokens Recipient
Building/Site
Street Type
Street Name Title
Street Name
Street Number
Extension
PO Box
Additional Info
  Input Output Token Output
Example 1 Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) Recipient Gerson Almeida
Building/Site Condominio Boa Esperança
Street Type Rua
Street Name Title Professor
Street Name Marcel Pasquini
Street Number 23
Extension  
PO Box caixa postal 14
Additional Info (informações adicionais)
  Input Output Token Output
Example 2 Av José Andraus Gassani 2464 Recipient  
Building/Site  
Street Type Av
Street Name Title  
Street Name José Andraus Gassani
Street Number 2464
Extension  
PO Box  
Additional Info  
  Input Output Token Output
Example 3 SQS 415 BL B APT 400 Recipient  
Building/Site  
Street Type  
Street Name Title  
Street Name SQS 415
Street Number  
Extension BL B APT 400
PO Box  
Additional Info  
  Input Output Token Output
Example 4 1a. avenida jose luiz cavalcante 14 Recipient  
Building/Site  
Street Type 1a. avenida
Street Name Title  
Street Name jose luiz cavalcante
Street Number 14
Extension  
PO Box  
Additional Info  
  Input Output Token Output
Example 5 ACF BARAO DE LIMEIRA Caixa postal 25 Recipient  
Building/Site  
Street Type  
Street Name Title  
Street Name  
Street Number  
Extension  
PO Box ACF BARAO DE LIMEIRA Caixa postal 25
Additional Info  
  Input Output Token Output
Example 6 Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 Recipient  
Building/Site Condominio da Barra 4
Street Type Rua
Street Name Title  
Street Name Tristão da Silva
Street Number 125
Extension Casa 2
PO Box  
Additional Info  
Remarks  

 

Address (Full)
Description

The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Neighborhood/Village
City
State/Province
Postal Code
Country
Additional Info
  Input Output Token Output
Example 1 rua Cristalina 4, Itaim Bibi, Sao Paulo - SP 01304-900 Recipient  
Building/Site  
Street rua Cristalina 4
Extension  
PO Box  
Neighborhood/Village Itaim Bibi
City Sao Paulo
State/Province SP
Postal Code 01304-900
Country  
Additional Info  
  Input Output Token Output
Example 2 C.P. 1424 Fortaleza, Ceara 60127-900 Recipient  
Building/Site  
Street  
Extension  
PO Box C.P. 1424
Neighborhood/Village  
City Fortaleza
State/Province Ceara
Postal Code 60127-900
Country  
Additional Info  
  Input Output Token Output
Example 3 Restaurante Fogo de Chao, Av. dos Bandeirantes, 538, Vila Olímpia, São Paulo 04553-000 Recipient Restaurante Fogo de Chao
Building/Site  
Street Av. dos Bandeirantes, 538
Extension  
PO Box  
Neighborhood/Village Vila Olímpia
City  
State/Province São Paulo
Postal Code 04553-000
Country  
Additional Info  
  Input Output Token Output
Example 4 Condominio Bosque Imperial sala 3, Praia Copacabana, Rio de Janeiro RJ 20090-010 Recipient  
Building/Site Condominio Bosque Imperial
Street  
Extension sala 3
PO Box  
Neighborhood/Village Praia Copacabana
City Rio de Janeiro
State/Province RJ
Postal Code 20090-010
Country  
Additional Info  
  Input Output Token Output
Example 5 Rua arcos 3, Monte Carmelo do Rio Novo, Espírito Santo 29767-000 (Brasil) Recipient  
Building/Site  
Street Rua arcos 3
Extension  
PO Box  
Neighborhood/Village  
City Monte Carmelo do Rio Novo
State/Province Espírito Santo
Postal Code 29767-000
Country (Brasil)
Additional Info  
Remarks  

 

Address (Full) (Detailed)
Description The Address (Full) (Detailed) parse definition parses addresses containing complete two-line addresses into a set of tokens with detailed street information.
Output Tokens Recipient
Building/Site
Street Type
Street Name Title
Street Name
Street Number
Extension
PO Box
Neighborhood/Village
City
State/Province
Postal Code
Country
Additional Info
  Input Output Token Output
Example 1 rua Cristalina 4, Itaim Bibi, Sao Paulo - SP 01304-900 Recipient  
Building/Site  
Street Type rua
Street Name Title  
Street Name Cristalina
Street Number 4
Extension  
PO Box  
Neighborhood/Village Itaim Bibi
City Sao Paulo
State/Province SP
Postal Code 01304-900
Country  
Additional Info  
  Input Output Token Output
Example 2 C.P. 1424 Fortaleza, Ceara 60127-900 Recipient  
Building/Site  
Street Type  
Street Name Title  
Street Name  
Street Number  
Extension  
PO Box C.P. 1424
Neighborhood/Village  
City Fortaleza
State/Province Ceara
Postal Code 60127-900
Country  
Additional Info  
  Input Output Token Output
Example 3 Restaurante Fogo de Chao, Av. dos Bandeirantes, 538, Vila Olímpia, São Paulo 04553-000 Recipient Restaurante Fogo de Chao
Building/Site  
Street Type Av.
Street Name Title  
Street Name dos Bandeirantes
Street Number 538
Extension  
PO Box  
Neighborhood/Village Vila Olímpia
City  
State/Province São Paulo
Postal Code 04553-000
Country  
Additional Info  
  Input Output Token Output
Example 4 Condominio Bosque Imperial sala 3, Praia Copacabana, Rio de Janeiro RJ 20090-010 Recipient  
Building/Site Condominio Bosque Imperial
Street Type  
Street Name Title  
Street Name  
Street Number  
Extension sala 3
PO Box  
Neighborhood/Village Praia Copacabana
City Rio de Janeiro
State/Province RJ
Postal Code 20090-010
Country  
Additional Info  
  Input Output Token Output
Example 5 Rua arcos 3, Monte Carmelo do Rio Novo, Espírito Santo 29767-000 (Brasil) Recipient  
Building/Site  
Street Type Rua
Street Name Title  
Street Name arcos
Street Number 3
Extension  
PO Box  
Neighborhood/Village  
City Monte Carmelo do Rio Novo
State/Province Espírito Santo
Postal Code 29767-000
Country (Brasil)
Additional Info  
Remarks  

 

Address (Global)
Description

The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output Token Output
Example 1 Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) Recipient Gerson Almeida
Building/Site Condominio Boa Esperança
Street Rua Professor Marcel Pasquini 23
Extension  
PO Box caixa postal 14
Additional Info (informações adicionais)
  Input Output Token Output
Example 2 Av José Andraus Gassani 2464, Apto 3 Recipient  
Building/Site  
Street Av José Andraus Gassani 2464
Extension Apto 3
PO Box  
Additional Info  
  Input Output Token Output
Example 3 SQS 415 BL B APT 400 Recipient  
Building/Site  
Street SQS 415
Extension BL B APT 400
PO Box  
Additional Info  
  Input Output Token Output
Example 4 1a. avenida jose luiz cavalcante 14/piso 6 Recipient  
Building/Site  
Street 1a. avenida jose luiz cavalcante 14
Extension piso 6
PO Box  
Additional Info  
  Input Output Token Output
Example 5 ACF BARAO DE LIMEIRA Caixa postal 25 Recipient  
Building/Site  
Street  
Extension  
PO Box ACF BARAO DE LIMEIRA Caixa postal 25
Additional Info  
  Input Output Token Output
Example 6 Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 Recipient  
Building/Site Condominio da Barra 4
Street Rua Tristão da Silva, 125
Extension Casa 2
PO Box  
Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.
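
A minimal Python sketch of what the consistent token set allows: one record type whose fields mirror the Address (Global) output tokens, so that parsed results from any locale can share the same columns. The class name, field names, and the `locale` column are illustrative assumptions, not part of the QKB.

```python
from dataclasses import dataclass, asdict


@dataclass
class GlobalAddressTokens:
    """One field per Address (Global) output token; empty string if absent."""
    recipient: str = ""
    building_site: str = ""
    street: str = ""
    extension: str = ""
    po_box: str = ""
    additional_info: str = ""


# Token values copied from Example 2 above; the locale label is illustrative.
tokens = GlobalAddressTokens(street="Av José Andraus Gassani 2464",
                             extension="Apto 3")
row = {"locale": "Portuguese, Brazil", **asdict(tokens)}
columns = list(row)
```

Rows built this way for other locales would have the same `columns`, which is the property the remark above describes.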

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens.
Output Tokens City
State
Postal Code
  Input Output Token Output
Example Rio de Janeiro RJ 20080-003 City Rio de Janeiro
State RJ
Postal Code 20080-003
Remarks

The City - State/Province - Postal Code (v26) parse definition is a temporary definition provided to facilitate an upgrade of the City - State/Province - Postal Code definition.

In a future release, the output tokens of the City - State/Province - Postal Code definition will be changed. This change will require you to update any jobs using the City - State/Province - Postal Code definition so that the tokens specified in those jobs will match the tokens used by the definition. The City - State/Province - Postal Code (v26) definition uses the tokens that will be used by City - State/Province - Postal Code in the future.

If you want to begin using the new tokens and updated processing now rather than waiting for a later release, you can update your jobs to call the City - State/Province - Postal Code (v26) definition. Be aware, however, that the City - State/Province - Postal Code (v26) definition is deprecated and will be removed in a subsequent release when the City - State/Province - Postal Code definition is updated.

 

City - State/Province - Postal Code (Global)
Description The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens.
Output Tokens City
State/Province
Postal Code
Additional Info
  Input Output Token Output
Example 1 Maracana, Rio de Janeiro, RJ 20090-010 City Rio de Janeiro
State/Province RJ
Postal Code 20090-010
Additional Info Maracana
  Input Output Token Output
Example 2 Botafogo, Campinas São Paulo 13091605 City Campinas
State/Province São Paulo
Postal Code 13091605
Additional Info Botafogo
  Input Output Token Output
Example 3 VILA SANTA LUZI SAO BERNARDO DO CAMP SP 09668080 City SAO BERNARDO DO CAMP
State/Province SP
Postal Code 09668080
Additional Info VILA SANTA LUZI
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

City - State/Province - Postal Code (v26)
Description

The City - State/Province - Postal Code (v26) parse definition parses last line address information into a set of tokens.

Output Tokens Neighborhood/Village
City
State/Province
Postal Code
Additional Info
  Input Output Token Output
Example 1 Maracana, Rio de Janeiro, RJ 20090-010 Neighborhood/Village Maracana
City Rio de Janeiro
State/Province RJ
Postal Code 20090-010
Additional Info  
  Input Output Token Output
Example 2 Botafogo, Campinas São Paulo 13091605 Neighborhood/Village Botafogo
City Campinas
State/Province São Paulo
Postal Code 13091605
Additional Info  
  Input Output Token Output
Example 3 VILA SANTA LUZI SAO BERNARDO DO CAMP SP 09668080 Neighborhood/Village VILA SANTA LUZI
City SAO BERNARDO DO CAMP
State/Province SP
Postal Code 09668080
Additional Info  
Remarks

The City - State/Province - Postal Code (v26) parse definition is a temporary definition provided to facilitate an upgrade of the City - State/Province - Postal Code definition.

In a future release, the output tokens of the City - State/Province - Postal Code definition will be changed. This change will require you to update any jobs using the City - State/Province - Postal Code definition so that the tokens specified in those jobs will match the tokens used by the definition. The City - State/Province - Postal Code (v26) definition uses the tokens that will be used by City - State/Province - Postal Code in the future.

If you want to begin using the new tokens and updated processing now rather than waiting for a later release, you can update your jobs to call the City - State/Province - Postal Code (v26) definition. Be aware, however, that the City - State/Province - Postal Code (v26) definition is deprecated and will be removed in a subsequent release when the City - State/Province - Postal Code definition is updated.

 

Name
Description

The Name parse definition parses names of individuals into a set of tokens.

Output Tokens Prefix
Given Name
Middle Name
Family Name Preposition
Family Name
Suffix
Title/Additional Info
  Input Output Token Output
Example 1 Sr. Rodrigo Henrique Garcia Lopes de Souza, filho, Gerente Prefix Sr.
Given Name Rodrigo
Middle Name Henrique Garcia Lopes
Family Name Preposition de
Family Name Souza
Suffix filho
Title/Additional Info Gerente
  Input Output Token Output
Example 2 Carlos Augusto Medeiros da Silva Prefix  
Given Name Carlos
Middle Name Augusto Medeiros
Family Name Preposition da
Family Name Silva
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 3 Pereira Paulo Prefix  
Given Name Paulo
Middle Name  
Family Name Preposition  
Family Name Pereira
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 4 João Filho Prefix  
Given Name João
Middle Name  
Family Name Preposition  
Family Name Filho
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 5 João Almeida Filho Prefix  
Given Name João
Middle Name  
Family Name Preposition  
Family Name Almeida
Suffix Filho
Title/Additional Info  
Remarks

To facilitate local conventions for data storage and data search, this definition includes a separate token for family name prepositions. Any and all names between the first given name and final family name are considered part of the middle name, as seen in Example 1 above.
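
The examples in this document suggest that the Name (Global) parse definition reports the preposition as part of Family Name ("de Souza", "da Silva"), whereas the Name definition keeps it in its own token. The following Python sketch shows how the two locale-specific tokens could be combined into a single family name value. The function name is hypothetical, and the joining rule is inferred from the examples rather than taken from the QKB itself.

```python
def to_global_family_name(preposition: str, family_name: str) -> str:
    """Join the preposition and family name, skipping whichever is empty."""
    return " ".join(part for part in (preposition, family_name) if part)


# Token values taken from Example 1 and Example 3 above.
example_1 = to_global_family_name("de", "Souza")
example_3 = to_global_family_name("", "Pereira")
```

With these inputs, `example_1` is "de Souza" and `example_3` is "Pereira".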

 

Name (Global)
Description The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens.
Output Tokens Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info
  Input Output Token Output
Example 1 Sr. Rodrigo Henrique Garcia Lopes de Souza, filho, Gerente Prefix Sr.
Given Name Rodrigo
Middle Name Henrique Garcia Lopes
Family Name de Souza
Suffix filho
Title/Additional Info Gerente
  Input Output Token Output
Example 2 James Goodnight Prefix  
Given Name James
Middle Name  
Family Name Goodnight
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 3 Carlos Augusto Medeiros da Silva Prefix  
Given Name Carlos
Middle Name Augusto Medeiros
Family Name da Silva
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 4 Pereira Paulo Prefix  
Given Name Paulo
Middle Name  
Family Name Pereira
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 5 João Filho Prefix  
Given Name João
Middle Name  
Family Name Filho
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 6 João Almeida Filho Prefix  
Given Name João
Middle Name  
Family Name Almeida
Suffix Filho
Title/Additional Info  
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Phone
Description

The Phone parse definition parses phone numbers into a set of tokens.

Output Tokens Carrier Selection Code
Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
  Input Output Token Output
Example 1 Trabalho: +55 11 4501-5452 r1234 (Peça para Maria) Carrier Selection Code  
Country Code +55
Area Code 11
Base Number 4501-5452
Extension 1234
Line Type Trabalho:
Additional Info (Peça para Maria)
  Input Output Token Output
Example 2 Comercial: 0041 54 7368468934 ext. 33 (Número na Argentina) Carrier Selection Code 0041
Country Code 54
Area Code  
Base Number 7368468934
Extension 33
Line Type Comercial:
Additional Info (Número na Argentina)
  Input Output Token Output
Example 3 31-32354200 Carrier Selection Code  
Country Code  
Area Code 31
Base Number 32354200
Extension  
Line Type  
Additional Info  
  Input Output Token Output
Example 4 023 11 98661-0495 -- Movel Carrier Selection Code 023
Country Code  
Area Code 11
Base Number 98661-0495
Extension  
Line Type Movel
Additional Info  
  Input Output Token Output
Example 5 RESIDENCIAL: 0054 99999999 Carrier Selection Code  
Country Code 0054
Area Code  
Base Number 99999999
Extension  
Line Type RESIDENCIAL:
Additional Info  
  Input Output Token Output
Example 6 982990183 Carrier Selection Code  
Country Code  
Area Code  
Base Number 982990183
Extension  
Line Type  
Additional Info  
Remarks  

 

Phone (Detailed)
Description The Phone (Detailed) parse definition parses phone numbers into a detailed set of tokens.
Output Tokens Carrier Selection Code
Country Code
Area Code
Base Number Prefix
Base Number Suffix
Base Number (Other)
Extension
Line Type
Additional Info
  Input Output Token Output
Example 1 Trabalho: +55 11 4501-5452 r1234 (Peça para Maria) Carrier Selection Code  
Country Code +55
Area Code 11
Base Number Prefix 4501
Base Number Suffix 5452
Base Number (Other)  
Extension 1234
Line Type Trabalho:
Additional Info (Peça para Maria)
  Input Output Token Output
Example 2 Comercial: 0041 54 7368468934 ext. 33 (Número na Argentina) Carrier Selection Code 0041
Country Code 54
Area Code  
Base Number Prefix  
Base Number Suffix  
Base Number (Other) 7368468934
Extension 33
Line Type Comercial:
Additional Info (Número na Argentina)
  Input Output Token Output
Example 3 31-32354200 Carrier Selection Code  
Country Code  
Area Code 31
Base Number Prefix 3235
Base Number Suffix 4200
Base Number (Other)  
Extension  
Line Type  
Additional Info  
  Input Output Token Output
Example 4 023 11 98661-0495 -- Movel Carrier Selection Code 023
Country Code  
Area Code 11
Base Number Prefix 98661
Base Number Suffix 0495
Base Number (Other)  
Extension  
Line Type Movel
Additional Info  
  Input Output Token Output
Example 5 RESIDENCIAL: 0054 99999999 Carrier Selection Code  
Country Code 0054
Area Code  
Base Number Prefix  
Base Number Suffix  
Base Number (Other) 99999999
Extension  
Line Type RESIDENCIAL:
Additional Info  
  Input Output Token Output
Example 6 982990183 Carrier Selection Code  
Country Code  
Area Code  
Base Number Prefix 98299
Base Number Suffix 0183
Base Number (Other)  
Extension  
Line Type  
Additional Info  
Remarks  
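The examples above suggest that, for domestic numbers of 8 or 9 digits, the last four digits become the Base Number Suffix and the remaining digits become the Base Number Prefix, while foreign numbers are left whole in Base Number (Other). The following Python sketch illustrates that reading of the examples only. The function name `split_base_number` and the `domestic` flag are hypothetical and are not part of the QKB; the actual definition may apply additional rules.

```python
import re


def split_base_number(base_number: str, domestic: bool = True) -> dict:
    """Split a base number into detailed tokens (rule inferred from the examples)."""
    digits = re.sub(r"\D", "", base_number)  # drop hyphens, spaces, and so on
    if domestic and len(digits) in (8, 9):
        # Assumption: the suffix is always the last four digits.
        return {
            "Base Number Prefix": digits[:-4],
            "Base Number Suffix": digits[-4:],
            "Base Number (Other)": "",
        }
    # Anything else (for example, a foreign number) is kept whole.
    return {
        "Base Number Prefix": "",
        "Base Number Suffix": "",
        "Base Number (Other)": base_number,
    }


print(split_base_number("98661-0495"))
print(split_base_number("7368468934", domestic=False))
```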

 

Phone (Global)
Description The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
  Input Output Token Output
Example 1 Trabalho: +55 11 4501-5452 r1234 (Peça para Maria) Country Code +55
Area Code 11
Base Number 4501-5452
Extension 1234
Line Type Trabalho:
Additional Info (Peça para Maria)
  Input Output Token Output
Example 2 Comercial: 0041 54 7368468934 ext. 33 (Número na Argentina) Country Code 0041 54
Area Code  
Base Number 7368468934
Extension 33
Line Type Comercial:
Additional Info (Número na Argentina)
  Input Output Token Output
Example 3 31-32354200 Country Code  
Area Code 31
Base Number 32354200
Extension  
Line Type  
Additional Info  
  Input Output Token Output
Example 4 023 11 98661-0495 -- Movel Country Code  
Area Code 023 11
Base Number 98661-0495
Extension  
Line Type Movel
Additional Info  
  Input Output Token Output
Example 5 RESIDENCIAL: 0054 99999999 Country Code 0054
Area Code  
Base Number 99999999
Extension  
Line Type RESIDENCIAL:
Additional Info  
  Input Output Token Output
Example 6 982990183 Country Code  
Area Code  
Base Number 982990183
Extension  
Line Type  
Additional Info  
  Remarks

Carrier selection codes appearing in international numbers are parsed into the Country Code token, while carrier selection codes appearing in domestic numbers are parsed into the Area Code token, as seen in Examples 2 and 4 above.

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.
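The carrier selection code rule described above can be sketched in Python as a mapping from the Phone token set to the Phone (Global) token set. This is an illustration of the documented behavior, not the QKB implementation; the function name `to_global_tokens` and the use of a plain dictionary for tokens are assumptions.

```python
def to_global_tokens(tokens: dict) -> dict:
    """Fold the Carrier Selection Code into the Country Code or Area Code token."""
    carrier = tokens.get("Carrier Selection Code", "").strip()
    country = tokens.get("Country Code", "").strip()
    area = tokens.get("Area Code", "").strip()

    if carrier and country:
        # International number: the carrier code joins the Country Code.
        country = f"{carrier} {country}"
    elif carrier:
        # Domestic number: the carrier code joins the Area Code.
        area = f"{carrier} {area}".strip()

    return {
        "Country Code": country,
        "Area Code": area,
        "Base Number": tokens.get("Base Number", ""),
        "Extension": tokens.get("Extension", ""),
        "Line Type": tokens.get("Line Type", ""),
        "Additional Info": tokens.get("Additional Info", ""),
    }


# Token values taken from Example 4 of the Phone parse definition.
example_4 = {
    "Carrier Selection Code": "023",
    "Area Code": "11",
    "Base Number": "98661-0495",
    "Line Type": "Movel",
}
print(to_global_tokens(example_4)["Area Code"])  # prints "023 11"
```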

 

Postal Code
Description The Postal Code parse definition parses postal codes into a set of tokens.
Output Tokens Postal Code
Postal Code Extension
  Input Output Token Output
Example 27653-001 Postal Code 27653
Postal Code Extension 001
Remarks The Postal Code parse definition is no longer supported. It is now deprecated and will be removed in a future release of the QKB.

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description The Address standardization definition standardizes addresses.
  Input Output
Examples Rua JUNQUEIRA N 47 Rua Junqueira, 47
R JUNQUEIRA 47 Rua Junqueira, 47
TV PRF MATEUS 55 Travessa Professor Mateus, 55
R. Clara Costa 22, 2º andar Apartamento 24 Rua Clara Costa, 22, AN 2 AP 24
R. Clara Costa 22, 2O ANDAR Apartamento 24 Rua Clara Costa, 22, AN 2 AP 24
R. Clara Costa 22, Segundo AN Apartamento 24 Rua Clara Costa, 22, AN 2 AP 24
Av Paulista, 424, COND. Boa Vista 5 Condomínio Boa Vista 5, Avenida Paulista, 424
R de Campo s-n Rua de Campo, S/N
R de Campo s n Rua de Campo, S/N
smpw Qd 4 cj 5 CH 44 Chácara 44, SMPW, QD 4 CJ 5
Remarks  

 

Address (Abbreviated Street Type and Title)
Description The Address (Abbreviated Street Type and Title) standardization definition standardizes addresses and abbreviates street types and street name titles.
  Input Output
Examples Rua JUNQUEIRA N 47 R Junqueira, 47
R JUNQUEIRA 47 R Junqueira, 47
TRAVESSA PROFESSOR MATEUS 55 Tv Prf Mateus, 55
R. Clara Costa 22, 2º andar Apartamento 24 R Clara Costa, 22, AN 2 AP 24
R. Clara Costa 22, 2O ANDAR Apartamento 24 R Clara Costa, 22, AN 2 AP 24
R. Clara Costa 22, Segundo AN Apartamento 24 R Clara Costa, 22, AN 2 AP 24
Avenida Paulista, 424, COND. Boa Vista 5 Condomínio Boa Vista 5, Av Paulista, 424
R de Campo s-n R de Campo, S/N
R de Campo s n R de Campo, S/N
smpw Qd 4 cj 5 CH 44 Chácara 44, SMPW, QD 4 CJ 5
Remarks  

 

City
Description The City standardization definition standardizes city names.
  Input Output
Examples SAO PAULO São Paulo
rio de janeiro Rio de Janeiro
Remarks  

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code standardization definition standardizes last-line address information (city, state, and postal code).
  Input Output
Example Rio de Janeiro RJ 20080-003 20080-003 RIO DE JANEIRO-RJ
Remarks  

 

ID Number (CPF)
Description The ID Number (CPF) standardization definition standardizes Cadastro de Pessoas Físicas (CPF) numbers.
  Input Output
Examples 12345678901 123.456.789-01
123-456-789-01 123.456.789-01
CPF: 123.456.789-01 123.456.789-01
Cadastro de Pessoa Fisica -- 123.456.789-01 123.456.789-01
Rodrigo Carlos Machado - 123.456.789-01 123.456.789-01, Rodrigo Carlos Machado
informações adicionais: 123.456.789-01 123.456.789-01, Informações Adicionais
9999999 000.099.999-99
999999999 009.999.999-99
00000123.456.789-01 123.456.789-01
Remarks Input with fewer than 11 digits is padded with leading zeros. Excess leading zeros are removed.
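The padding and formatting behavior described in the remarks can be sketched in Python as follows. This is a minimal illustration, not the QKB implementation: the function name `format_cpf` is hypothetical, and the sketch handles only the digits. It does not reproduce the additional text (such as a name) that the definition appends to the output in some of the examples, and it does not validate check digits.

```python
import re


def format_cpf(raw: str) -> str:
    """Format the digits found in `raw` as NNN.NNN.NNN-NN."""
    digits = re.sub(r"\D", "", raw)        # keep digits only
    digits = digits.lstrip("0").zfill(11)  # drop excess leading zeros, then pad to 11
    if len(digits) != 11:
        raise ValueError(f"too many digits for a CPF: {raw!r}")
    return f"{digits[:3]}.{digits[3:6]}.{digits[6:9]}-{digits[9:]}"


print(format_cpf("9999999"))              # prints 000.099.999-99
print(format_cpf("00000123.456.789-01"))  # prints 123.456.789-01
```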

 

ID Number (CPF) (Electronic)
Description The ID Number (CPF) (Electronic) standardization definition standardizes Cadastro de Pessoas Físicas (CPF) numbers and removes all non-numeric characters.
  Input Output
Examples 123.456.789-01 12345678901
123-456-789-01 12345678901
CPF: 123.456.789-01 12345678901
Cadastro de Pessoa Fisica -- 123.456.789-01 12345678901
Rodrigo Carlos Machado - 123.456.789-01 12345678901
informações adicionais: 123.456.789-01 12345678901
9999999 00009999999
999999999 00999999999
0000012345678901 12345678901
Remarks Input with fewer than 11 digits is padded with leading zeros. Excess leading zeros are removed.

 

ID Number (CNPJ)
Description The ID Number (CNPJ) standardization definition standardizes Cadastro Nacional de Pessoas Jurídicas (CNPJ) numbers.
  Input Output
Examples 12345678901234 12.345.678/9012-34
12-345-678-9012-34 12.345.678/9012-34
CNPJ:12345678901234 12.345.678/9012-34
Cadastro Nacional de Pessoas Juridicas -- 12345678901234 12.345.678/9012-34
Gol Linhas Aéreas: 12.345.678/9012-34 12.345.678/9012-34, Gol Linhas Aéreas
12.345.678/9012-34 (informações adicionais) 12.345.678/9012-34, Informações Adicionais
99999999 00.000.099/9999-99
999999999999 00.999.999/9999-99
0000012.345.678/9012-34 12.345.678/9012-34
Remarks Input with fewer than 14 digits is padded with leading zeros. Excess leading zeros are removed.

 

ID Number (CNPJ) (Electronic)
Description The ID Number (CNPJ) (Electronic) standardization definition standardizes Cadastro Nacional de Pessoas Jurídicas (CNPJ) numbers and removes all non-numeric characters.
  Input Output
Examples 12.345.678/9012-34 12345678901234
12-345-678-9012-34 12345678901234
CNPJ:12345678901234 12345678901234
Cadastro Nacional de Pessoas Juridicas -- 12345678901234 12345678901234
Gol Linhas Aéreas: 12.345.678/9012-34 12345678901234
12.345.678/9012-34 (informações adicionais) 12345678901234
99999999 00000099999999
999999999999 00999999999999
0000012345678901234 12345678901234
Remarks Input with fewer than 14 digits is padded with leading zeros. Excess leading zeros are removed.
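As an illustration of the two CNPJ output styles, the Python sketch below derives the 14-digit electronic form and then the punctuated form from the same input. The function names `cnpj_digits` and `format_cnpj` are hypothetical, and the sketch is not the QKB implementation: it ignores any additional text in the input and does not validate check digits.

```python
import re


def cnpj_digits(raw: str) -> str:
    """Return the 14-digit electronic form of the CNPJ found in `raw`."""
    digits = re.sub(r"\D", "", raw)        # remove all non-numeric characters
    digits = digits.lstrip("0").zfill(14)  # drop excess leading zeros, then pad to 14
    if len(digits) != 14:
        raise ValueError(f"too many digits for a CNPJ: {raw!r}")
    return digits


def format_cnpj(raw: str) -> str:
    """Return the punctuated form NN.NNN.NNN/NNNN-NN."""
    d = cnpj_digits(raw)
    return f"{d[:2]}.{d[2:5]}.{d[5:8]}/{d[8:12]}-{d[12:]}"


print(cnpj_digits("0000012345678901234"))  # prints 12345678901234
print(format_cnpj("99999999"))             # prints 00.000.099/9999-99
```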

 

Name
Description The Name standardization definition standardizes names of individuals.
  Input Output
Examples CARLOS AUGUSTO MEDEIROS DA SILVA Carlos Augusto Medeiros da Silva
Professor Joao Vicente Fernandes Prf João Vicente Fernandes
Almeida, Rodrigo Borges Rodrigo Borges Almeida
Andrade Sergio Sergio Andrade
senhor Gerson Antonio F. Pacheco Sr Gerson Antônio F Pacheco
Gerente: FABIO ALVES DE BARROS Fábio Alves de Barros, Gerente
Remarks

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

 

Name (Expanded Prefix)
Description

The Name (Expanded Prefix) standardization definition standardizes names of individuals and expands the prefixes.

  Input Output
Examples Prf Joao Vicente Fernandes Professor João Vicente Fernandes
DR CARLOS AUGUSTO MEDEIROS DA SILVA Doutor Carlos Augusto Medeiros da Silva
Almeida, Rodrigo Borges Rodrigo Borges Almeida
Andrade Sergio Sergio Andrade
sr Gerson Antonio F. Pacheco Senhor Gerson Antônio F Pacheco
Gerente: FABIO ALVES DE BARROS Fábio Alves de Barros, Gerente
Adv. Rodrigo Borges Almeida Advogado Rodrigo Borges Almeida
Remarks

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

 

Phone
Description The Phone standardization definition standardizes phone numbers for domestic use.
  Input Output
Examples +55 11 30375365 (11) 3037 5365
(0)11 30375365 (11) 3037 5365
11-98299-0183 (11) 98299 0183
041-11-98299-0183 041 (11) 98299 0183
08007040465 0800 704 0465
Trabalho: 85-3261-2156 ramal 1234 (85) 3261 2156 r1234, Trabalho
0044 (0)20 12345000 +44 2012345000
0041 44 (0)20 12345000 0041 44 2012345000
Burger King, Buenos Aires: +54 (0)11-4394-9780 +54 1143949780, Burger King, Buenos Aires
Remarks  

 

Phone (Electronic)
Description The Phone (Electronic) standardization definition standardizes phone numbers for automated calling systems.
  Input Output
Examples +55 11 4501 5452 r1234, Trabalho, Peça para Gerson ou Maria +551145015452
(11) 3037-5466 +551130375466
(0)11 30375466 +551130375466
041 (11) 98291 0143 00415511982910143
0044 (0)20 12345000 +442012345000
0041 44 (0)20 12345000 0041442012345000
Remarks  

 

Phone (with Country Code)
Description The Phone (with Country Code) standardization definition standardizes phone numbers for international use.
  Input Output
Examples (11) 3037-5466 +55 11 3037 5466
(0)11 30375466 +55 11 3037 5466
041 (11) 98291 0143 0041 55 11 98291 0143
Trabalho: 85-3261-2156 ramal 1234 +55 85 3261 2156 r1234, Trabalho
Informações Adicionais: (86) 2345 6789 +55 86 2345 6789, Informações Adicionais
0044 (0)20 12345000 +44 2012345000
0041 44 (0)20 12345000 0041 44 2012345000
Remarks  

 

Postal Code
Description The Postal Code standardization definition standardizes postal codes.
  Input Output
Examples 20080003 20080-003
2080003 02080-003
20800 20800-000
Remarks  
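The three examples above suggest the following padding behavior: a 7-digit input gains a leading zero, and a 5-digit input gains the suffix 000. The Python sketch below reproduces only those three cases and rejects any other length, because the examples do not show how other lengths are treated. The function name `format_cep` is hypothetical, and the rules are inferred from the examples rather than taken from the QKB itself.

```python
import re


def format_cep(raw: str) -> str:
    """Format a Brazilian postal code as NNNNN-NNN (rules inferred from the examples)."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) == 5:
        digits += "000"        # assumption: prefix only, so append the 000 suffix
    elif len(digits) == 7:
        digits = "0" + digits  # assumption: a leading zero was dropped
    if len(digits) != 8:
        raise ValueError(f"unsupported postal code length: {raw!r}")
    return f"{digits[:5]}-{digits[5:]}"


for value in ("20080003", "2080003", "20800"):
    print(value, "->", format_cep(value))
```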

 

Postal Code (with Country Code)
Description

The Postal Code (with Country Code) standardization definition standardizes postal codes and adds a domestic country code, unless there is already a country code in the input.

  Input Output
Examples 20080-003 BR-20080003
2080003 BR-02080003
20800 BR-20800000
BR-20800 BR-20800000
USA-27514 US-27514
FR. 12345 FR-12345
CEP-20800 BR-20800000
CODIGO POSTAL 22600123 BR-22600123
Remarks  

 

State (Full Name)
Description The State (Full Name) standardization definition standardizes state names using the long name for the state.
  Input Output
Example RJ Rio de Janeiro
Remarks The State (Full Name) standardization definition is being renamed to State/Province. State (Full Name) is now deprecated and will be removed in a future release. Please change your jobs to use the State/Province standardization definition.

 

State (Two Letter)
Description The State (Two Letter) standardization definition standardizes state names using the standard two-letter state abbreviation.
  Input Output
Example Rio de Janeiro RJ
Remarks The State (Two Letter) standardization definition is being renamed to State/Province (Postal Standard). State (Two Letter) is now deprecated and will be removed in a future release. Please change your jobs to use the State/Province (Postal Standard) standardization definition.

 

State/Province
Description The State/Province standardization definition standardizes state names.
  Input Output
Examples RJ Rio de Janeiro
SP São Paulo
Amapa Amapá
Remarks  

 

State/Province (Postal Standard)
Description

The State/Province (Postal Standard) standardization definition standardizes state names to the postal standard.

  Input Output
Examples Rio de Janeiro RJ
São Paulo SP
Amapá AP
Remarks  

Inherited Definitions

In addition to the definitions listed on this page, the Portuguese, Brazil locale also inherits all definitions for the Portuguese language and all Global definitions.