SAS Quality Knowledge Base for Contact Information 25
Definitions for the Portuguese, Brazil locale are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper | ||
---|---|---|
Description | The Proper case definition performs generic propercasing. | |
Input | Output | |
Example | antônio da silva dos santos | Antônio da Silva dos Santos |
Remarks |
Name | ||
---|---|---|
Description | The Name gender analysis definition determines the gender of a name. | |
Possible Outputs | M F U |
|
Input | Output | |
Examples | Celeste de Jesus Cordeiro | F |
Jose Maria Martins | M | |
M. Antunes | U | |
Remarks |
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
|
The Name (v24) gender analysis definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Gender Analysis (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Gender Analysis node will not require an update. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
Name (v24) | ||
---|---|---|
Description |
The Name (v24) gender analysis definition determines the gender of a name. |
|
Possible Outputs | M F U |
|
Input | Output | |
Examples | M. Antunes | U |
Jose Maria Martins | M | |
Celeste de Jesus Cordeiro | F | |
Remarks |
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
|
The Name (v24) gender analysis definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Gender Analysis (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Gender Analysis node will not require an update. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
Name/Organization | ||
---|---|---|
Description | The Name/Organization identification analysis definition determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | NAME ORGANIZATION NAME/ORGANIZATION UNKNOWN |
|
Input | Output | |
Examples | SAS Brasil | ORGANIZATION |
Marcel Pasquini | NAME | |
João Fernandes, Oftalmogia Fernandes | NAME/ORGANIZATION | |
PEDRO SA | NAME | |
TEXTILES PEDRO SA | ORGANIZATION | |
PEDRO S/A | ORGANIZATION | |
Doutor João Tomazelli | NAME | |
Doutor do Pneu | ORGANIZATION | |
Almeida, Pasquini, e Pichatelli | ORGANIZATION | |
Fernandes | UNKNOWN | |
Fernandes LTDA | ORGANIZATION | |
Remarks |
Offensive | ||
---|---|---|
Description | The Offensive identification analysis definition identifies potentially offensive words and phrases. | |
Possible Outputs | OFFENSIVE NOT OFFENSIVE |
|
Input | Output | |
Examples | Sr. João Carlos Mulherengo | OFFENSIVE |
Sr. João Bumbum Da Silva | OFFENSIVE | |
Baixinho Da Silva | OFFENSIVE | |
Sr. João Carlos Da Silva | NOT OFFENSIVE | |
Remarks |
Address | ||
---|---|---|
Description |
The Address match definition generates match codes which can be used to cluster records containing addresses. |
|
Max Length of Match Code | 75 characters | |
Input | Cluster ID | |
Examples | Rua Miguel Sa 1226 | 0 |
Rua Miguel Sa 1226 bl. 2 | 0 | |
Rua General Miguel Sa 1226 | 0 | |
Rua General Miguel Sa 1226, apto 503 | 1 | |
Rua Passo do Norte, 21 | 2 | |
Rua Passo Norte, 21 | 2 | |
Trav Costa Azul, 23 | 3 | |
Av Costa Azul, 23 | 3 | |
Rua Costa Azul, 23 | 3 | |
Costa Azul, 23 | 3 | |
Rua Esperança num 23 | 4 | |
Rua Esperança Nº 23 | 4 | |
Rua Esperança n 23 | 4 | |
Rua Esperança 23 | 4 | |
CXP 12345 | 5 | |
Caixa Postal 12345 | 5 | |
C.P. 12345 | 5 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
Area-type extension information (ex. Bloco, Lote) is retained at sensitivities 90 and 95. | ||
The Address match definition has been replaced with a copy of the Address (v23) definition. The Address (v23) definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address (v23) definition is now deprecated and will be removed in a future release. If you previously modified your jobs to use the Address (v23) definition, it is suggested that you change them back to use the Address definition. The Address definition no longer accepts parsed input. The removal of input tokens will require you to update any jobs using the Match Codes (Parsed) node for the Address definition, replacing the Match Codes (Parsed) node with the Match Codes node and providing input as a single field. Jobs using the Match Codes node will not require an update. |
Address (Full) | ||
---|---|---|
Description |
The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses. |
|
Max Length of Match Code | 131 characters | |
Input | Cluster ID | |
Examples | Rua Miguel Sa 1226, Itaim Bibi, São Paulo - SP, 04530-000 | 0 |
Rua Miguel Sa 1226 bl. 2, Itaim Bibi, São Paulo - SP, 04530-000 | 0 | |
Rua General Miguel Sa 1226, Itaim Bibi, São Paulo - SP, 04530-000 | 0 | |
Rua General Miguel Sa 1226, apto 503, Itaim Bibi, São Paulo - SP, 04530-000 | 1 | |
Rua Passo do Norte, 21, Itaim Bibi, São Paulo - SP, 04530-000 | 2 | |
Rua Passo Norte, 21, Itaim Bibi, São Paulo - SP, 04530-000 | 2 | |
Trav Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 | 3 | |
Av Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 | 3 | |
Rua Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 | 3 | |
Costa Azul, 23, Itaim Bibi, São Paulo - SP, 04530-000 | 3 | |
Rua Esperança num 23, Itaim Bibi, São Paulo - SP, 04530-000 | 4 | |
Rua Esperança Nº 23, Itaim Bibi, São Paulo - SP, 04530-000 | 4 | |
Rua Esperança n 23, Itaim Bibi, São Paulo - SP, 04530-000 | 4 | |
Rua Esperança 23, Itaim Bibi, São Paulo - SP, 04530-000 | 4 | |
CXP 12345, Itaim Bibi, São Paulo - SP, 04530-000 | 5 | |
Caixa Postal 12345, Itaim Bibi, São Paulo - SP, 04530-000 | 5 | |
C.P. 12345, Itaim Bibi, São Paulo - SP, 04530-000 | 5 | |
Avenida Gomes Freire, 430, Copacabana, Rio de Janeiro, RJ 22060-000 | 6 | |
Avenida Gomes Freire, 430, Rio de Janeiro, RJ 22060-000 | 6 | |
Avenida Gomes Freire, 430, Rio de Janeiro, 22060-000 | 6 | |
Avenida Gomes Freire, 430, Rio de Janeiro, 22060-100 | 6 | |
Avenida Gomes Freire, 430, Rio de Janeiro, 22061-100 | 7 | |
Aven. Nova America, 2001, Ouro Branco MG 36420-000 | 8 | |
Aven. Nova America, 2001, Ouro Branco, Minas Gerais 36420-000 | 8 | |
Rua Sao Jose, 44, Camaçari BA, CEP 42810-000 | 9 | |
Rua Sao Jose, 44, Camaçari BA 42810-000 | 9 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
Area-type extension information (ex. Bloco, Lote) is retained at sensitivities 90 and 95. |
Address (PO Box Only) | ||
---|---|---|
Description |
The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing addresses. Only the PO Box information will be used in the match code. |
|
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Rua Tiradentes 99, CX Postal 112 | 0 |
Rua Almeida 123, CX Postal 112 | 0 | |
CX Postal 112 | 0 | |
CX Postal 222 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address (Street Only) | ||
---|---|---|
Description |
The Address (Street Only) match definition generates match codes which can be used to cluster records containing addresses. Any PO Box information will be excluded from the match code. |
|
Max Length of Match Code | 67 characters | |
Input | Cluster ID | |
Examples | Rua Almeida, 233, Caixa Postal 54321 | 0 |
Rua Almeida, 233, Caixa Postal 12345 | 0 | |
Rua Almeida, 233 | 0 | |
Rua Almeida, 522 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
Area-type extension information (ex. Bloco, Lote) is retained at sensitivities 90 and 95. |
Address (v23) | ||
---|---|---|
Description | The Address (v23) match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 75 characters | |
Input | Cluster ID | |
Examples | Rua Miguel Sa 1226 | 0 |
Rua Miguel Sa 1226 bl. 2 | 0 | |
Rua General Miguel Sa 1226 | 0 | |
Rua General Miguel Sa 1226, apto 503 | 1 | |
Rua Passo do Norte, 21 | 2 | |
Rua Passo Norte, 21 | 2 | |
Trav Costa Azul, 23 | 3 | |
Av Costa Azul, 23 | 3 | |
Rua Costa Azul, 23 | 3 | |
Costa Azul, 23 | 3 | |
Rua Esperança num 23 | 4 | |
Rua Esperança Nº 23 | 4 | |
Rua Esperança n 23 | 4 | |
Rua Esperança 23 | 4 | |
CXP 12345 | 5 | |
Caixa Postal 12345 | 5 | |
C.P. 12345 | 5 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
Area-type extension information (ex. Bloco, Lote) is retained at sensitivities 90 and 95. | ||
The Address match definition has been replaced with a copy of the Address (v23) definition. The Address (v23) definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address (v23) definition is now deprecated and will be removed in a future release. If you previously modified your jobs to use the Address (v23) definition, it is suggested that you change them back to use the Address definition. |
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Sao Paulo | 0 |
S Paulo | 0 | |
São Paulo | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 20080-003 Rio de Janeiro-RJ | 0 |
Rio de Janeiro RJ 20080-003 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Luiz de Souza Cabral | 0 |
LUIS SOUZA E CABRAL | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
||
The Name (v24) match definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Match Codes node will not require an update. In addition, the match code length will be changed. This change might require you to update any jobs using the Name definition so that the match code fields can handle the new length. The Name (v24) definition uses the match code length that will be used by Name in the future. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
Name (v24) | ||
---|---|---|
Description |
The Name (v24) match definition generates match codes which can be used to cluster records containing names of individuals. |
|
Max Length of Match Code | 28 characters | |
Input | Cluster ID | |
Examples | Luiz de Souza Cabral | 0 |
LUIS SOUZA E CABRAL | 0 | |
LUIS SILVA | 1 | |
Dr. Gerson da Silva | 2 | |
Gerson Silva | 2 | |
JERSON DANIEL DA SILVA | 2 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
||
The Name (v24) match definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Match Codes node will not require an update. In addition, the match code length will be changed. This change might require you to update any jobs using the Name definition so that the match code fields can handle the new length. The Name (v24) definition uses the match code length that will be used by Name in the future. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | Telefonica S/A | 0 |
Telefonica | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Phone | ||
---|---|---|
Description |
The Phone match definition generates match codes which can be used to cluster records containing phone numbers. |
|
Max Length of Match Code | 22 characters | |
Input | Cluster ID | |
Examples | (66) 4134 3945 | 0 |
041 (66) 4134 3945 | 0 | |
+55 (66) 4134 3945 | 0 | |
0041 55 (66) 4134 3945 | 0 | |
+54 11 5208 3458 | 1 | |
+54 (0)11 5208 3458 | 1 | |
Trabalho: 31 4501 5452 (Peça para Maria) | 2 | |
31 4501 5452 | 2 | |
(11) 96789-1234 | 3 | |
(11) 96789-1230 | 3 | |
(11) 96789-1200 | 4 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
The Phone (v23) match definition is a temporary definition provided to facilitate an upgrade of the Phone definition. In a future release, the match code length will be changed. This change might require you to update any jobs using the Phone definition so that the match code fields can handle the new length. The Phone (v23) definition uses the match code length that will be used by Phone in the future. If you want to begin using the new definition now rather than waiting for a later release, you can update your jobs to call the Phone (v23) definition. Be aware however that the Phone (v23) definition will be deprecated in a subsequent release after the Phone definition has been updated. |
Phone (v23) | ||
---|---|---|
Description |
The Phone (v23) match definition generates match codes which can be used to cluster records containing phone numbers. |
|
Max Length of Match Code | 22 characters | |
Input | Cluster ID | |
Examples | (66) 4134 3945 | 0 |
041 (66) 4134 3945 | 0 | |
+55 (66) 4134 3945 | 0 | |
0041 55 (66) 4134 3945 | 0 | |
+54 11 5208 3458 | 1 | |
+54 (0)11 5208 3458 | 1 | |
Trabalho: 31 4501 5452 (Peça para Maria) | 2 | |
31 4501 5452 | 2 | |
(11) 96789-1234 | 3 | |
(11) 96789-1230 | 3 | |
(11) 96789-1200 | 4 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
The Address (v23) match definition is now deprecated and will be removed in a future release of the QKB. The Address match definition has been replaced with a copy of the Address (v23) definition which takes advantage of updated processing. If you changed your jobs to use the Address (v23) definition it is suggested that you change them back. |
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 20080-003 | 0 |
20080003 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
State | ||
---|---|---|
Description | The State match definition generates match codes which can be used to cluster records containing names of states and union territories. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Mato Grosso do Sul | 0 |
ms | 0 | |
Mato Groso do Sul | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Text | ||
---|---|---|
Description | The Text match definition generates match codes which can be used to cluster records containing general text strings. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Example | Data Management | 0 |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address | |||
---|---|---|---|
Description |
The Address parse definition parses addresses into a set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) | Recipient | Gerson Almeida |
Building/Site | Condominio Boa Esperança | ||
Street | Rua Professor Marcel Pasquini 23 | ||
Extension | |||
PO Box | caixa postal 14 | ||
Additional Info | (informações adicionais) | ||
Input | Output | ||
Example 2 | Av José Andraus Gassani 2464, Apto 3 | Recipient | |
Building/Site | |||
Street | Av José Andraus Gassani 2464 | ||
Extension | Apto 3 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | SQS 415 BL B APT 400 | Recipient | |
Building/Site | |||
Street | SQS 415 | ||
Extension | BL B APT 400 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 4 | 1a. avenida jose luiz cavalcante 14/piso 6 | Recipient | |
Building/Site | |||
Street | 1a. avenida jose luiz cavalcante 14 | ||
Extension | piso 6 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 5 | ACF BARAO DE LIMEIRA Caixa postal 25 | Recipient | |
Building/Site | |||
Street | |||
Extension | |||
PO Box | ACF BARAO DE LIMEIRA Caixa postal 25 | ||
Additional Info | |||
Input | Output | ||
Example 6 | Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 | Recipient | |
Building/Site | Condominio da Barra 4 | ||
Street | Rua Tristão da Silva, 125 | ||
Extension | Casa 2 | ||
PO Box | |||
Additional Info | |||
Remarks |
The Address (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address parse definition has been replaced with a copy of the Address (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (v23) it is suggested that you change them back. |
Address (Detailed) | |||
---|---|---|---|
Description | The Address (Detailed) parse definition parses addresses into a set of tokens with detailed street information. | ||
Output Tokens | Recipient Building/Site Street Type Street Name Title Street Name Street Number Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) | Recipient | Gerson Almeida |
Building/Site | Condominio Boa Esperança | ||
Street Type | Rua | ||
Street Name Title | Professor | ||
Street Name | Marcel Pasquini | ||
Street Number | 23 | ||
Extension | |||
PO Box | caixa postal 14 | ||
Additional Info | (informações adicionais) | ||
Input | Output | ||
Example 2 | Av José Andraus Gassani 2464 | Recipient | |
Building/Site | |||
Street Type | Av | ||
Street Name Title | |||
Street Name | José Andraus Gassani | ||
Street Number | 2464 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | SQS 415 BL B APT 400 | Recipient | |
Building/Site | |||
Street Type | |||
Street Name Title | |||
Street Name | SQS 415 | ||
Street Number | |||
Extension | BL B APT 400 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 4 | 1a. avenida jose luiz cavalcante 14 | Recipient | |
Building/Site | |||
Street Type | 1a. avenida | ||
Street Name Title | |||
Street Name | jose luiz cavalcante | ||
Street Number | 14 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 5 | ACF BARAO DE LIMEIRA Caixa postal 25 | Recipient | |
Building/Site | |||
Street Type | |||
Street Name Title | |||
Street Name | |||
Street Number | |||
Extension | |||
PO Box | ACF BARAO DE LIMEIRA Caixa postal 25 | ||
Additional Info | |||
Input | Output | ||
Example 6 | Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 | Recipient | |
Building/Site | Condominio da Barra 4 | ||
Street Type | Rua | ||
Street Name Title | |||
Street Name | Tristão da Silva | ||
Street Number | 125 | ||
Extension | Casa 2 | ||
PO Box | |||
Additional Info | |||
Remarks |
Address (Full) | |||
---|---|---|---|
Description |
The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Neighborhood/Village City State/Province Postal Code Country Additional Info |
||
Input | Output | ||
Example 1 | rua Cristalina 4, Itaim Bibi, Sao Paulo - SP 01304-900 | Recipient | |
Building/Site | |||
Street | rua Cristalina 4 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | Itaim Bibi | ||
City | Sao Paulo | ||
State/Province | SP | ||
Postal Code | 01304-900 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 2 | C.P. 1424 Fortaleza, Ceara 60127-900 | Recipient | |
Building/Site | |||
Street | |||
Extension | |||
PO Box | C.P. 1424 | ||
Neighborhood/Village | |||
City | Fortaleza | ||
State/Province | Ceara | ||
Postal Code | 60127-900 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 3 | Restaurante Fogo de Chao, Av. dos Bandeirantes, 538, Vila Olímpia, São Paulo 04553-000 | Recipient | Restaurante Fogo de Chao |
Building/Site | |||
Street | Av. dos Bandeirantes, 538 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | Vila Olímpia | ||
City | São Paulo | ||
State/Province | |||
Postal Code | 04553-000 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 4 | Condominio Bosque Imperial sala 3, Praia Copacabana, Rio de Janeiro RJ 20090-010 | Recipient | |
Building/Site | Condominio Bosque Imperial | ||
Street | |||
Extension | sala 3 | ||
PO Box | |||
Neighborhood/Village | Praia Copacabana | ||
City | Rio de Janeiro | ||
State/Province | RJ | ||
Postal Code | 20090-010 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 5 | Rua arcos 3, Monte Carmelo do Rio Novo, Espírito Santo 29767-000 (Brasil) | Recipient | |
Building/Site | |||
Street | Rua arcos 3 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | |||
City | Monte Carmelo do Rio Novo | ||
State/Province | Espírito Santo | ||
Postal Code | 29767-000 | ||
Country | (Brasil) | ||
Additional Info | |||
Remarks |
Address (Full) (Detailed) | |||
---|---|---|---|
Description | The Address (Full) (Detailed) parse definition parses addresses containing complete two-line addresses into a set of tokens with detailed street information. | ||
Output Tokens | Recipient Building/Site Street Type Street Name Title Street Name Street Number Extension PO Box Neighborhood/Village City State/Province Postal Code Country Additional Info |
||
Input | Output | ||
Example 1 | rua Cristalina 4, Itaim Bibi, Sao Paulo - SP 01304-900 | Recipient | |
Building/Site | |||
Street Type | rua | ||
Street Name Title | |||
Street Name | Cristalina | ||
Street Number | 4 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | Itaim Bibi | ||
City | Sao Paulo | ||
State/Province | SP | ||
Postal Code | 01304-900 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 2 | C.P. 1424 Fortaleza, Ceara 60127-900 | Recipient | |
Building/Site | |||
Street Type | |||
Street Name Title | |||
Street Name | |||
Street Number | |||
Extension | |||
PO Box | C.P. 1424 | ||
Neighborhood/Village | |||
City | Fortaleza | ||
State/Province | Ceara | ||
Postal Code | 60127-900 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 3 | Restaurante Fogo de Chao, Av. dos Bandeirantes, 538, Vila Olímpia, São Paulo 04553-000 | Recipient | Restaurante Fogo de Chao |
Building/Site | |||
Street Type | Av. | ||
Street Name Title | |||
Street Name | dos Bandeirantes | ||
Street Number | 538 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | Vila Olímpia | ||
City | São Paulo | ||
State/Province | |||
Postal Code | 04553-000 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 4 | Condominio Bosque Imperial sala 3, Praia Copacabana, Rio de Janeiro RJ 20090-010 | Recipient | |
Building/Site | Condominio Bosque Imperial | ||
Street Type | |||
Street Name Title | |||
Street Name | |||
Street Number | |||
Extension | sala 3 | ||
PO Box | |||
Neighborhood/Village | Praia Copacabana | ||
City | Rio de Janeiro | ||
State/Province | RJ | ||
Postal Code | 20090-010 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 5 | Rua arcos 3, Monte Carmelo do Rio Novo, Espírito Santo 29767-000 (Brasil) | Recipient | |
Building/Site | |||
Street Type | Rua | ||
Street Name Title | |||
Street Name | arcos | ||
Street Number | 3 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | |||
City | Monte Carmelo do Rio Novo | ||
State/Province | Espírito Santo | ||
Postal Code | 29767-000 | ||
Country | (Brasil) | ||
Additional Info | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) | Recipient | Gerson Almeida |
Building/Site | Condominio Boa Esperança | ||
Street | Rua Professor Marcel Pasquini 23 | ||
Extension | |||
PO Box | caixa postal 14 | ||
Additional Info | (informações adicionais) | ||
Input | Output | ||
Example 2 | Av José Andraus Gassani 2464, Apto 3 | Recipient | |
Building/Site | |||
Street | Av José Andraus Gassani 2464 | ||
Extension | Apto 3 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | SQS 415 BL B APT 400 | Recipient | |
Building/Site | |||
Street | SQS 415 | ||
Extension | BL B APT 400 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 4 | 1a. avenida jose luiz cavalcante 14/piso 6 | Recipient | |
Building/Site | |||
Street | 1a. avenida jose luiz cavalcante 14 | ||
Extension | piso 6 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 5 | ACF BARAO DE LIMEIRA Caixa postal 25 | Recipient | |
Building/Site | |||
Street | |||
Extension | |||
PO Box | ACF BARAO DE LIMEIRA Caixa postal 25 | ||
Additional Info | |||
Input | Output | ||
Example 6 | Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 | Recipient | |
Building/Site | Condominio da Barra 4 | ||
Street | Rua Tristão da Silva, 125 | ||
Extension | Casa 2 | ||
PO Box | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
Address (Global) (v23) | |||
---|---|---|---|
Description |
The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) | Recipient | Gerson Almeida |
Building/Site | Condominio Boa Esperança | ||
Street | Rua Professor Marcel Pasquini 23 | ||
Extension | |||
PO Box | caixa postal 14 | ||
Additional Info | (informações adicionais) | ||
Input | Output | ||
Example 2 | Av José Andraus Gassani 2464, Apto 3 | Recipient | |
Building/Site | |||
Street | Av José Andraus Gassani 2464 | ||
Extension | Apto 3 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | SQS 415 BL B APT 400 | Recipient | |
Building/Site | |||
Street | SQS 415 | ||
Extension | BL B APT 400 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 4 | 1a. avenida jose luiz cavalcante 14/piso 6 | Recipient | |
Building/Site | |||
Street | 1a. avenida jose luiz cavalcante 14 | ||
Extension | piso 6 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 5 | ACF BARAO DE LIMEIRA Caixa postal 25 | Recipient | |
Building/Site | |||
Street | |||
Extension | |||
PO Box | ACF BARAO DE LIMEIRA Caixa postal 25 | ||
Additional Info | |||
Input | Output | ||
Example 6 | Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 | Recipient | |
Building/Site | Condominio da Barra 4 | ||
Street | Rua Tristão da Silva, 125 | ||
Extension | Casa 2 | ||
PO Box | |||
Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
Address (v23) | |||
---|---|---|---|
Description | The Address (v23) parse definition parses addresses into a set of tokens. | ||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Gerson Almeida,Condominio Boa Esperança, Rua Professor Marcel Pasquini 23, caixa postal 14 (informações adicionais) | Recipient | Gerson Almeida |
Building/Site | Condominio Boa Esperança | ||
Street | Rua Professor Marcel Pasquini 23 | ||
Extension | |||
PO Box | caixa postal 14 | ||
Additional Info | (informações adicionais) | ||
Input | Output | ||
Example 2 | Av José Andraus Gassani 2464, Apto 3 | Recipient | |
Building/Site | |||
Street | Av José Andraus Gassani 2464 | ||
Extension | Apto 3 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | SQS 415 BL B APT 400 | Recipient | |
Building/Site | |||
Street | SQS 415 | ||
Extension | BL B APT 400 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 4 | 1a. avenida jose luiz cavalcante 14/piso 6 | Recipient | |
Building/Site | |||
Street | 1a. avenida jose luiz cavalcante 14 | ||
Extension | piso 6 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 5 | ACF BARAO DE LIMEIRA Caixa postal 25 | Recipient | |
Building/Site | |||
Street | |||
Extension | |||
PO Box | ACF BARAO DE LIMEIRA Caixa postal 25 | ||
Additional Info | |||
Input | Output | ||
Example 6 | Condominio da Barra 4 Rua Tristão da Silva, 125 Casa 2 | Recipient | |
Building/Site | Condominio da Barra 4 | ||
Street | Rua Tristão da Silva, 125 | ||
Extension | Casa 2 | ||
PO Box | |||
Additional Info | |||
Remarks |
The Address (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address parse definition has been replaced with a copy of the Address (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (v23) it is suggested that you change them back. |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens. | ||
Output Tokens | City State Postal Code |
||
Input | Output | ||
Example | Rio de Janeiro RJ 20080-003 | City | Rio de Janeiro |
State | RJ | ||
Postal Code | 20080-003 | ||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Input | Output | ||
Example | 20080-003 Rio de Janeiro-RJ | City | Rio de Janeiro |
State/Province | RJ | ||
Postal Code | 20080-003 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name | |||
---|---|---|---|
Description |
The Name parse definition parses names of individuals into a set of tokens. |
||
Output Tokens | Prefix Given Name Middle Name Family Name Preposition Family Name Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | Sr. Rodrigo Henrique Garcia Lopes de Souza, filho, Gerente | Prefix | Sr. |
Given Name | Rodrigo | ||
Middle Name | Henrique Garcia Lopes | ||
Family Name Preposition | de | ||
Family Name | Souza | ||
Suffix | filho | ||
Title/Additional Info | Gerente | ||
Input | Output | ||
Example 2 | Carlos Augusto Medeiros da Silva | Prefix | |
Given Name | Carlos | ||
Middle Name | Augusto Medeiros | ||
Family Name Preposition | da | ||
Family Name | Silva | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 3 | Pereira Paulo | Prefix | |
Given Name | Paulo | ||
Middle Name | |||
Family Name Preposition | |||
Family Name | Pereira | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 4 | João Filho | Prefix | |
Given Name | João | ||
Middle Name | |||
Family Name Preposition | |||
Family Name | Filho | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 5 | João Almeida Filho | Prefix | |
Given Name | João | ||
Middle Name | |||
Family Name Preposition | |||
Family Name | Almeida | ||
Suffix | Filho | ||
Title/Additional Info | |||
Remarks |
To facilitate local conventions for data storage and data search, this definition includes a separate token for family name prepositions. Any and all names between the first given name and final family name are considered part of the middle name, as seen in Example 1 above. |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | Sr. Rodrigo Henrique Garcia Lopes de Souza, filho, Gerente | Prefix | Sr. |
Given Name | Rodrigo | ||
Middle Name | Henrique Garcia Lopes | ||
Family Name | de Souza | ||
Suffix | filho | ||
Title/Additional Info | Gerente | ||
Input | Output | ||
Example 2 | James Goodnight | Prefix | |
Given Name | James | ||
Middle Name | |||
Family Name | Goodnight | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 3 | Carlos Augusto Medeiros da Silva | Prefix | |
Given Name | Carlos | ||
Middle Name | Augusto Medeiros | ||
Family Name | da Silva | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 4 | Pereira Paulo | Prefix | |
Given Name | Paulo | ||
Middle Name | |||
Family Name | Pereira | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 5 | João Filho | Prefix | |
Given Name | João | ||
Middle Name | |||
Family Name | Filho | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 6 | João Almeida Filho | Prefix | |
Given Name | João | ||
Middle Name | |||
Family Name | Almeida | ||
Suffix | Filho | ||
Title/Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Phone | |||
---|---|---|---|
Description |
The Phone parse definition parses phone numbers into a set of tokens. |
||
Output Tokens | Carrier Selection Code Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | Trabalho: +55 11 4501-5452 r1234 (Peça para Maria) | Carrier Selection Code | |
Country Code | +55 | ||
Area Code | 11 | ||
Base Number | 4501-5452 | ||
Extension | 1234 | ||
Line Type | Trabalho: | ||
Additional Info | (Peça para Maria) | ||
Input | Output | ||
Example 2 | Comercial: 0041 54 7368468934 ext. 33 (Número na Argentina) | Carrier Selection Code | 0041 |
Country Code | 54 | ||
Area Code | |||
Base Number | 7368468934 | ||
Extension | 33 | ||
Line Type | Comercial: | ||
Additional Info | (Número na Argentina) | ||
Input | Output | ||
Example 3 | 31-32354200 | Carrier Selection Code | |
Country Code | |||
Area Code | 31 | ||
Base Number | 32354200 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 4 | 023 11 98661-0495 -- Movel | Carrier Selection Code | 023 |
Country Code | |||
Area Code | 11 | ||
Base Number | 98661-0495 | ||
Extension | |||
Line Type | Movel | ||
Additional Info | |||
Input | Output | ||
Example 5 | RESIDENCIAL: 0054 99999999 | Carrier Selection Code | |
Country Code | 0054 | ||
Area Code | |||
Base Number | 99999999 | ||
Extension | |||
Line Type | RESIDENCIAL: | ||
Additional Info | |||
Input | Output | ||
Example 6 | 982990183 | Carrier Selection Code | |
Country Code | |||
Area Code | |||
Base Number | 982990183 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks |
Phone (Detailed) | |||
---|---|---|---|
Description | The Phone (Detailed) parse definition parses phone numbers into a detailed set of tokens. | ||
Output Tokens | Carrier Selection Code Country Code Area Code Base Number Prefix Base Number Suffix Base Number (Other) Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | Trabalho: +55 11 4501-5452 r1234 (Peça para Maria) | Carrier Selection Code | |
Country Code | +55 | ||
Area Code | 11 | ||
Base Number Prefix | 4501 | ||
Base Number Suffix | 5452 | ||
Base Number (Other) | |||
Extension | 1234 | ||
Line Type | Trabalho: | ||
Additional Info | (Peça para Maria) | ||
Input | Output | ||
Example 2 | Comercial: 0041 54 7368468934 ext. 33 (Número na Argentina) | Carrier Selection Code | 0041 |
Country Code | 54 | ||
Area Code | |||
Base Number Prefix | |||
Base Number Suffix | |||
Base Number (Other) | 7368468934 | ||
Extension | 33 | ||
Line Type | Comercial: | ||
Additional Info | (Número na Argentina) | ||
Input | Output | ||
Example 3 | 31-32354200 | Carrier Selection Code | |
Country Code | |||
Area Code | 31 | ||
Base Number Prefix | 3235 | ||
Base Number Suffix | 4200 | ||
Base Number (Other) | |||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 4 | 023 11 98661-0495 -- Movel | Carrier Selection Code | 023 |
Country Code | |||
Area Code | 11 | ||
Base Number Prefix | 98661 | ||
Base Number Suffix | 0495 | ||
Base Number (Other) | |||
Extension | |||
Line Type | Movel | ||
Additional Info | |||
Input | Output | ||
Example 5 | RESIDENCIAL: 0054 99999999 | Carrier Selection Code | |
Country Code | 0054 | ||
Area Code | |||
Base Number Prefix | |||
Base Number Suffix | |||
Base Number (Other) | 99999999 | ||
Extension | |||
Line Type | RESIDENCIAL: | ||
Additional Info | |||
Input | Output | ||
Example 6 | 982990183 | Carrier Selection Code | |
Country Code | |||
Area Code | |||
Base Number Prefix | 98299 | ||
Base Number Suffix | 0183 | ||
Base Number (Other) | |||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | Trabalho: +55 11 4501-5452 r1234 (Peça para Maria) | Country Code | +55 |
Area Code | 11 | ||
Base Number | 4501-5452 | ||
Extension | 1234 | ||
Line Type | Trabalho: | ||
Additional Info | (Peça para Maria) | ||
Input | Output | ||
Example 2 | Comercial: 0041 54 7368468934 ext. 33 (Número na Argentina) | Country Code | 0041 54 |
Area Code | |||
Base Number | 7368468934 | ||
Extension | 33 | ||
Line Type | Comercial: | ||
Additional Info | (Número na Argentina) | ||
Input | Output | ||
Example 3 | 31-32354200 | Country Code | |
Area Code | 31 | ||
Base Number | 32354200 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 4 | 023 11 98661-0495 -- Movel | Country Code | |
Area Code | 023 11 | ||
Base Number | 98661-0495 | ||
Extension | |||
Line Type | Movel | ||
Additional Info | |||
Input | Output | ||
Example 5 | RESIDENCIAL: 0054 99999999 | Country Code | 0054 |
Area Code | |||
Base Number | 99999999 | ||
Extension | |||
Line Type | RESIDENCIAL: | ||
Additional Info | |||
Input | Output | ||
Example 6 | 982990183 | Country Code | |
Area Code | |||
Base Number | 982990183 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks |
Carrier selection codes appearing in international numbers are parsed into the Country Code token, while carrier selection codes appearing in domestic numbers are parsed into the Area Code token, as seen in Examples 2 and 4 above. |
||
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Postal Code | |||
---|---|---|---|
Description | The Postal Code parse definition parses postal codes into a set of tokens. | ||
Output Tokens | Postal Code Postal Code Extension |
||
Input | Output | ||
Example | 27653-001 | Postal Code | 27653 |
Postal Code Extension | 001 | ||
Remarks |
None.
Address | ||
---|---|---|
Description | The Address standardization definition standardizes addresses. | |
Input | Output | |
Examples | Rua JUNQUEIRA N 47 | Rua Junqueira, 47 |
R JUNQUEIRA 47 | Rua Junqueira, 47 | |
TV PRF MATEUS 55 | Travessa Professor Mateus, 55 | |
R. Clara Costa 22, 2º andar Apartamento 24 | Rua Clara Costa, 22, AN 2 AP 24 | |
R. Clara Costa 22, 2O ANDAR Apartamento 24 | Rua Clara Costa, 22, AN 2 AP 24 | |
R. Clara Costa 22, Segundo AN Apartamento 24 | Rua Clara Costa, 22, AN 2 AP 24 | |
Av Paulista, 424, COND. Boa Vista 5 | Condomínio Boa Vista 5, Avenida Paulista, 424 | |
R de Campo s-n | Rua de Campo, S/N | |
R de Campo s n | Rua de Campo, S/N | |
smpw Qd 4 cj 5 CH 44 | Chácara 44, SMPW, QD 4 CJ 5 | |
Remarks |
Address (Abbreviated Street Type and Title) | ||
---|---|---|
Description | The Address (Abbreviated Street Type and Title) standardization definition standardizes addresses and abbreviates street types and street name titles. | |
Input | Output | |
Examples | Rua JUNQUEIRA N 47 | R Junqueira, 47 |
R JUNQUEIRA 47 | R Junqueira, 47 | |
TRAVESSA PROFESSOR MATEUS 55 | Tv Prf Mateus, 55 | |
R. Clara Costa 22, 2º andar Apartamento 24 | R Clara Costa, 22, AN 2 AP 24 | |
R. Clara Costa 22, 2O ANDAR Apartamento 24 | R Clara Costa, 22, AN 2 AP 24 | |
R. Clara Costa 22, Segundo AN Apartamento 24 | R Clara Costa, 22, AN 2 AP 24 | |
Avenida Paulista, 424, COND. Boa Vista 5 | Condomínio Boa Vista 5, Av Paulista, 424 | |
R de Campo s-n | R de Campo, S/N | |
R de Campo s n | R de Campo, S/N | |
smpw Qd 4 cj 5 CH 44 | Chácara 44, SMPW, QD 4 CJ 5 | |
Remarks |
City | ||
---|---|---|
Description | The City standardization definition standardizes city names. | |
Input | Output | |
Examples | São Paulo | SAO PAULO |
rio de janeiro | RIO DE JANEIRO | |
Remarks |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code standardization definition standardizes last line address information. | |
Input | Output | |
Example | Rio de Janeiro RJ 20080-003 | 20080-003 RIO DE JANEIRO-RJ |
Remarks |
ID Number (CPF) | ||
---|---|---|
Description | The ID Number (CPF) standardization definition standardizes the Cadastro de Pessoas Físicas. | |
Input | Output | |
12345678901 | 123.456.789-01 | |
Examples | 123-456-789-01 | 123.456.789-01 |
CPF: 123.456.789-01 | 123.456.789-01 | |
Cadastro de Pessoa Fisica -- 123.456.789-01 | 123.456.789-01 | |
Rodrigo Carlos Machado - 123.456.789-01 | 123.456.789-01, Rodrigo Carlos Machado | |
informações adicionais: 123.456.789-01 | 123.456.789-01, Informações Adicionais | |
9999999 | 000.099.999-99 | |
999999999 | 009.999.999-99 | |
00000123.456.789-01 | 123.456.789-01 | |
Remarks | Pads input that is less than 11 digits with leading 0s. Removes excessive leading 0s. |
ID Number (CPF) (Electronic) | ||
---|---|---|
Description | The ID Number (CPF) (Electronic) standardization definition standardizes the Cadastro de Pessoas Físicas and removes all non-numeric characters. | |
Input | Output | |
Examples | 123.456.789-01 | 12345678901 |
123-456-789-01 | 12345678901 | |
CPF: 123.456.789-01 | 12345678901 | |
Cadastro de Pessoa Fisica -- 123.456.789-01 | 12345678901 | |
Rodrigo Carlos Machado - 123.456.789-01 | 12345678901 | |
informações adicionais: 123.456.789-01 | 12345678901 | |
9999999 | 00009999999 | |
999999999 | 00999999999 | |
0000012345678901 | 12345678901 | |
Remarks | Pads input that is less than 11 digits with leading 0s. Removes excessive leading 0s. |
ID Number (CNPJ) | ||
---|---|---|
Description | The ID Number (CNPJ) standardization definition standardizes the Cadastro Nacional de Pessoas Jurídicas. | |
Input | Output | |
Examples | 12345678901234 | 12.345.678/9012-34 |
12-345-678-9012-34 | 12.345.678/9012-34 | |
CNPJ:12345678901234 | 12.345.678/9012-34 | |
Cadastro Nacional de Pessoas Juridicas -- 12345678901234 | 12.345.678/9012-34 | |
Gol Linhas Aéreas: 12.345.678/9012-34 | 12.345.678/9012-34, Gol Linhas Aéreas | |
12.345.678/9012-34 (informações adicionais) | 12.345.678/9012-34, Informações Adicionais | |
99999999 | 00.000.099/9999-99 | |
999999999999 | 00.999.999/9999-99 | |
0000012.345.678/9012-34 | 12.345.678/9012-34 | |
Remarks | Pads input that is less than 14 digits with leading 0s. Removes excessive leading 0s. |
ID Number (CNPJ) (Electronic) | ||
---|---|---|
Description | The ID Number (CNPJ) (Electronic) standardization definition standardizes the Cadastro Nacional de Pessoas Jurídicas and removes all non-numeric characters. | |
Input | Output | |
Examples | 12.345.678/9012-34 | 12345678901234 |
12-345-678-9012-34 | 12345678901234 | |
CNPJ:12345678901234 | 12345678901234 | |
Cadastro Nacional de Pessoas Juridicas -- 12345678901234 | 12345678901234 | |
Gol Linhas Aéreas: 12.345.678/9012-34 | 12345678901234 | |
12.345.678/9012-34 (informações adicionais) | 12345678901234 | |
99999999 | 00000099999999 | |
999999999999 | 00999999999999 | |
0000012345678901234 | 12345678901234 | |
Remarks | Pads input that is less than 14 digits with leading 0s. Removes excessive leading 0s. |
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Input | Output | |
Examples | CARLOS AUGUSTO MEDEIROS DA SILVA | Carlos Augusto Medeiros da Silva |
Professor Joao Vicente Fernandes | Prf João Vicente Fernandes | |
Almeida, Rodrigo Borges | Rodrigo Borges Almeida | |
Andrade Sergio | Sergio Andrade | |
senhor Gerson Antonio F. Pacheco | Sr Gerson Antônio F Pacheco | |
Gerente: FABIO ALVES DE BARROS | Fábio Alves de Barros, Gerente | |
Remarks |
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
Name (Expanded Prefix) | ||
---|---|---|
Description |
The Name (Expanded Prefix) standardization definition standardizes names of individuals and expands the prefixes. |
|
Input | Output | |
Examples | Prf Joao Vicente Fernandes | Professor João Vicente Fernandes |
DR CARLOS AUGUSTO MEDEIROS DA SILVA | Doutor Carlos Augusto Medeiros da Silva | |
Almeida, Rodrigo Borges | Rodrigo Borges Almeida | |
Andrade Sergio | Sergio Andrade | |
sr Gerson Antonio F. Pacheco | Senhor Gerson Antônio F Pacheco | |
Gerente: FABIO ALVES DE BARROS | Fábio Alves de Barros, Gerente | |
Adv. Rodrigo Borges Almeida | Advogado Rodrigo Borges Almeida | |
Remarks |
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
Organization | ||
---|---|---|
Description | The Organization standardization definition standardizes organization names. | |
Input | Output | |
Example | Telefonica SA | TELEFONICA S/A |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Input | Output | |
Examples | +55 11 30375365 | (11) 3037 5365 |
(0)11 30375365 | (11) 3037 5365 | |
11-98299-0183 | (11) 98299 0183 | |
041-11-98299-0183 | 041 (11) 98299 0183 | |
08007040465 | 0800 704 0465 | |
Trabalho: 85-3261-2156 ramal 1234 | (85) 3261 2156 r1234, Trabalho | |
0044 (0)20 12345000 | +44 2012345000 | |
0041 44 (0)20 12345000 | 0041 44 2012345000 | |
Burger King, Buenos Aires: +54 (0)11-4394-9780 | +54 1143949780, Burger King, Buenos Aires | |
Remarks |
Phone (Electronic) | ||
---|---|---|
Description | The Phone (Electronic) standardization definition standardizes phone numbers for automated calling systems. | |
Input | Output | |
Examples | +55 11 4501 5452 r1234, Trabalho, Peça para Gerson ou Maria | +551145015452 |
(11) 3037-5466 | +551130375466 | |
(0)11 30375466 | +551130375466 | |
041 (11) 98291 0143 | 00415511982910143 | |
0044 (0)20 12345000 | +442012345000 | |
0041 44 (0)20 12345000 | 0041442012345000 | |
Remarks |
Phone (with Country Code) | ||
---|---|---|
Description | The Phone (with Country Code) standardization definition standardizes phone numbers for international use. | |
Input | Output | |
Examples | (11) 3037-5466 | +55 11 3037 5466 |
(0)11 30375466 | +55 11 3037 5466 | |
041 (11) 98291 0143 | 0041 55 11 98291 0143 | |
Trabalho: 85-3261-2156 ramal 1234 | +55 85 3261 2156 r1234, Trabalho | |
Informações Adicionais: (86) 2345 6789 | +55 86 2345 6789, Informações Adicionais | |
0044 (0)20 12345000 | +44 2012345000 | |
0041 44 (0)20 12345000 | 0041 44 2012345000 | |
Remarks |
Postal Code | ||
---|---|---|
Description | The Postal Code standardization definition standardizes postal codes. | |
Input | Output | |
Example | 20080003 | 20080-003 |
Remarks |
State (Full Name) | ||
---|---|---|
Description | The State (Full Name) standardization definition standardizes state names using the long name for the state. | |
Input | Output | |
Example | RJ | Rio de Janeiro |
Remarks |
State (Two Letter) | ||
---|---|---|
Description | The State (Two Letter) standardization definition standardizes state names using the standard two-letter state abbreviation. | |
Input | Output | |
Example | Rio de Janeiro | RJ |
Remarks |
In addition to the definitions listed on this page, the Portuguese, Brazil locale also inherits all definitions for the Portuguese language and all Global definitions.
Documentation Feedback: yourturn@sas.com
|
Doc ID: QKBCI_PTBRA_defs.html |