SAS Quality Knowledge Base for Contact Information 25
Definitions for the Spanish, Mexico locale are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (City - State/Province - Postal Code) | ||
---|---|---|
Description | The Proper (City - State/Province - Postal Code ) case definition propercases last line address information. | |
Input | Output | |
Example | 77520 cancun, q. roo | 77520 Cancun, Q. Roo |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Proper (Organization) case definition propercases organization names. | |
Input | Output | |
Examples | diconsa sa de cv | Diconsa SA de CV |
sas institute | SAS Institute | |
Remarks | The Proper (Organization) case definition uses a list of known organization names to handle exceptions to propercasing rules. |
CURP | ||
---|---|---|
Description |
The CURP gender analysis definition determines the gender of an individual based on their Unique Population Registry Code (Clave Única de Registro de Población). |
|
Possible Outputs | M F U |
|
Input | Output | |
Examples | ZAZD801124MBSYQN13 | F |
NURV140413HQTWGN12 | M | |
NURV140413XQTWGN12 | U | |
Remarks |
The CURP gender analysis definition is designed to provide valid results when operating on valid CURP identity codes. It is recommended that you use the CURP (Validation) identification analysis definition to validate your input and that you pass only valid CURP identity codes into the CURP gender analysis definition. Results from invalid data cannot be guaranteed to be accurate. |
Name | ||
---|---|---|
Description | The Name gender analysis definition determines the gender of a name. | |
Possible Outputs | M F U |
|
Input | Output | |
Examples | Guadalupe Salvador Rivera | U |
Jose Maria Rivera Morales | M | |
Maria De Lourdes Rodriguez | F | |
Remarks |
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
CURP (Validation) | |||
---|---|---|---|
Description |
The CURP (Validation) identification analysis definition determines if a string is a Unique Population Registry Code (Clave Única de Registro de Población). |
||
Possible Outputs | VALID INVALID |
||
Input | Output | Comments | |
Examples | ZAZD801124MBSYQN13 | VALID | |
RUNO390705HSLPTC02 | VALID | ||
IMS120607120 | INVALID | Too short (only 12 characters) | |
KAOB750230MASLHJ07 | INVALID | Invalid date (Feb 30) | |
EOMR330613FDFPKT08 | INVALID | Invalid gender (F instead of H or M) | |
YOOR280801HXXLTK07 | INVALID | Invalid state (XX is not a state) | |
EIMV310211MQRPKT1X | INVALID | Invalid last character (X instead of a value 0-9) | |
L6NF640805MYNTNN10 | INVALID | Invalid second character (6 instead of an A-Z value) | |
MOJL020101HCLKWMXY09 | INVALID | Too long (over 18 characters) | |
Remarks |
This definition validates on the following criteria:
|
Individual/Organization | ||
---|---|---|
Description | The Individual/Organization identification analysis definition determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | INDIVIDUAL ORGANIZATION UNKNOWN |
|
Input | Output | |
Examples | Grupo Dixon S.A. | ORGANIZATION |
Roberto Cruz Morales | INDIVIDUAL | |
Marta Leticia Ceballos Gomez | INDIVIDUAL | |
Guttermoth | UNKNOWN | |
Remarks |
Individual/Organization (RFC) | ||
---|---|---|
Description |
The Individual/Organization (RFC) identification analysis definition determines the identify of an individual or organization based on their Federal Taxpayer Registry Code (Registro Federal de Contribuyentes). |
|
Possible Outputs | INDIVIDUAL ORGANIZATION UNKNOWN |
|
Input | Output | |
Examples | ABC680524F23 | ORGANIZATION |
VECJ680524V23 | INDIVIDUAL | |
ABCDE12345678 | UNKNOWN | |
H1N831130GD9 | UNKNOWN | |
PIA850230ZVA | UNKNOWN | |
BEPJ361225IS@ | UNKNOWN | |
Remarks |
This definition validates using the following criteria:
|
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | Av Los Angeles num 325 Ote | 1 |
Ave. Los Angeles 325Ote. | 1 | |
Ave Las Angeles #325 Oriente | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
The Address (v24) match definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address definition accepts parsed input, but this will be changed in a future release. Address (v24) does not accept parsed input, which will also be true of Address in the future. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Address definition, using the non-parsed Match Codes node, and providing input as a single field. Jobs using the non-parsed input Match Codes node will not require an update. In addition, the match code length will be changed. This change might require you to update any jobs using the Address definition so that the match code fields can handle the new length. The Address (v24) definition uses the match code length that will be used by Address in the future. If you want to begin using the updated processing now rather than waiting for a later release, you can update your jobs to call the Address (v24) definition. Be aware however that the Address (v24) definition will be deprecated in a subsequent release after the Address definition has been updated. |
Address (Full) | ||
---|---|---|
Description | The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses. | |
Max Length of Match Code |
118 characters | |
Input | Cluster ID | |
Examples | Av Independencia 800 Bario De San Agustin Municipio De Jalatlaco CP 05286 Edo De Mexico | 0 |
Avenida Independencia 800 Bario De San Agustin Municipio De Jalatlaco CP 05286 Edo De Mexico | 0 | |
Avenida Independencia 800 Bario De San Agustin Municipio De Jalatlaco CP 5286 Edo De Mexico | 0 | |
Av Independencia #800 Bario De San Agustin Municipio De Jalatlaco CP 5286 Edo De Mexico | 0 | |
Calle Insurgentes 9908 APDO POSTAL 1329 | 1 | |
Av Insurgentes 9908 APDO POSTAL 1329 | 1 | |
Torre Mayor Condominio horizontal 200, 82200 Mazatlan, Sinaloa | 2 | |
Torre Mayor Condominio horizontal 200, 82200 Mazatlan de Norte, Sinaloa | 2 | |
Torre Mayor Condominio horizontal 200, 82200 Mazatlan de sur, Sinaloa | 2 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address (PO Box Only) | ||
---|---|---|
Description |
The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address. |
|
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | APDO POSTAL 1329, Avenida Insurgentes #325 | 0 |
APDO POSTAL 1329, Avenida Insurgentes 123 | 0 | |
Ave. Insurgentes 325 APDO POSTAL #1329 | 0 | |
APDO POSTAL 1329 | 0 | |
APDO 345 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address (Street Only) | ||
---|---|---|
Description |
The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address. |
|
Max Length of Match Code | 68 characters | |
Input | Cluster ID | |
Examples | Av Insurgentes num 325 | 0 |
Ave. Insurgentes 325 | 0 | |
Avenida Insurgentes #325 | 0 | |
APDO POSTAL 1329, Avenida Insurgentes #325 | 0 | |
Ave. Insurgentes 325 APDO POSTAL 1329 | 0 | |
Ave. Universidad 1200 | 1 | |
Ave. Universidad No. 1200 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address (v24) | ||
---|---|---|
Description |
The Address (v24) match definition generates match codes which can be used to cluster records containing addresses. |
|
Max Length of Match Code | 68 characters | |
Input | Cluster ID | |
Examples | Av Insurgentes num 325 | 0 |
Ave. Insurgentes 325 | 0 | |
Avenida Insurgentes #325 | 0 | |
APDO POSTAL 1329, Calle Insurgentes 9908 | 1 | |
Calle Insurgentes 9908 APDO POSTAL 1329 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
The Address (v24) match definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address definition accepts parsed input, but this will be changed in a future release. Address (v24) does not accept parsed input, which will also be true of Address in the future. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Address definition, using the non-parsed Match Codes node, and providing input as a single field. Jobs using the non-parsed input Match Codes node will not require an update. In addition, the match code length will be changed. This change might require you to update any jobs using the Address definition so that the match code fields can handle the new length. The Address (v24) definition uses the match code length that will be used by Address in the future. If you want to begin using the updated processing now rather than waiting for a later release, you can update your jobs to call the Address (v24) definition. Be aware however that the Address (v24) definition will be deprecated in a subsequent release after the Address definition has been updated. |
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Acaspulco | 1 |
Acasppulco | 1 | |
Mexico City | 2 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 11570 Distrito Federal, Mexico | 1 |
Mexico Distrito Federal, CP 11570 | 1 | |
11570 DF Mexico | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Imelda Marina Escobar Hernandez | 0 |
Sra Imelda Escobar Hernandez | 0 | |
Excel·lentíssim i Magnífic Senyor Marcel·lí J. Rigau i Molins | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
If this definition is applied to pre-parsed data, the following input tokens are available: Name Prefix It is recommended that you map a correlating data field to each available token whenever possible. |
||
The Name (v24) match definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Match Codes node will not require an update. In addition, the match code length will be changed. This change might require you to update any jobs using the Name definition so that the match code fields can handle the new length. The Name (v24) definition uses the match code length that will be used by Name in the future. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
Name (v24) | ||
---|---|---|
Description |
The Name (v24) match definition generates match codes which can be used to cluster records containing names of individuals. |
|
Max Length of Match Code | 27 characters | |
Input | Cluster ID | |
Examples | Imelda Marina Escobar Hernandez | 0 |
Sra Imelda Escobar Hernandez | 0 | |
Excel·lentíssim i Magnífic Senyor Marcel·lí J. Rigau i Molins | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
|
If this definition is applied to pre-parsed data, the following input tokens are available: Prefix It is recommended that you map a correlating data field to each available token whenever possible. |
||
The Name (v24) match definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Match Codes node will not require an update. In addition, the match code length will be changed. This change might require you to update any jobs using the Name definition so that the match code fields can handle the new length. The Name (v24) definition uses the match code length that will be used by Name in the future. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Agroasemex SA | 1 |
Agroasemex | 1 | |
SAS Institute | 2 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | +52 (55) 5122 4300 | 1 |
52-55-5122-4300 | 1 | |
52-55-51224300 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 11570 | 1 |
-11570 | 1 | |
13210 | 2 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Address | |||
---|---|---|---|
Description |
The Address parse definition parses addresses into a set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Paseo de la Reforma 505 Piso 38 | Recipient | |
Building/Site | |||
Street | Paseo de la Reforma 505 | ||
Extension | Piso 38 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 2 | Hospital General de México, Calle Dr. Balmis No. 148 | Recipient | |
Building/Site | Hospital General de México | ||
Street | Calle Dr. Balmis No. 148 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | A/A: Linda Salcido AV INDEPENDENCIA 800 | Recipient | A/A: Linda Salcido |
Building/Site | |||
Street | AV INDEPENDENCIA 800 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Remarks |
The Address (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address parse definition has been replaced with a copy of the Address (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (v23) it is suggested that you change them back. |
Address (Full) | |||
---|---|---|---|
Description | The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens. | ||
Output Tokens | Recipient Building/Site Street Extension PO Box Neighborhood/Village Delegation/Municipality State/Province Postal Code Country Additional Info |
||
Input | Output | ||
Example 1 | Paseo de la Reforma 505 Piso 38 Del. Cuauhtémoc C.P. 06500 México, D.F. | Recipient | |
Building/Site | |||
Street | Paseo de la Reforma 505 | ||
Extension | Piso 38 | ||
PO Box | |||
Neighborhood/Village | |||
Delegation/Municipality | Del. Cuauhtémoc | ||
State/Province | México, D.F. | ||
Postal Code | C.P. 06500 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 2 | Hospital General de México, Calle Dr. Balmis No. 148, Col. Doctores, Delegacion Cuauhtemoc, Mexico, D.F., C.P. 06726 | Recipient | |
Building/Site | Hospital General de México | ||
Street | Calle Dr. Balmis No. 148 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | Col. Doctores | ||
Delegation/Municipality | Delegacion Cuauhtemoc | ||
State/Province | México, D.F. | ||
Postal Code | C.P. 06726 | ||
Country | |||
Additional Info | |||
Input | Output | ||
Example 3 | A/A: Linda Salcido AV INDEPENDENCIA 800 BARIO DE SAN AGUSTIN MUNICIPIO DE JALATLACO CP 52860 EDO DE MEXICO | Recipient | A/A: Linda Salcido |
Building/Site | |||
Street | AV INDEPENDENCIA 800 | ||
Extension | |||
PO Box | |||
Neighborhood/Village | BARIO DE SAN AGUSTIN | ||
Delegation/Municipality | MUNICIPIO DE JALATLACO | ||
State/Province | EDO DE MEXICO | ||
Postal Code | CP 52860 | ||
Country | |||
Additional Info | |||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Paseo de la Reforma 505 Piso 38 | Recipient | |
Building/Site | |||
Street | Paseo de la Reforma 505 | ||
Extension | Piso 38 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 2 | Hospital General de México, Calle Dr. Balmis No. 148 | Recipient | |
Building/Site | Hospital General de México | ||
Street | Calle Dr. Balmis No. 148 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | A/A: Linda Salcido AV INDEPENDENCIA 800 | Recipient | A/A: Linda Salcido |
Building/Site | |||
Street | AV INDEPENDENCIA 800 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
Address (Global) (v23) | |||
---|---|---|---|
Description |
The Address (Global) (v23) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Paseo de la Reforma 505 Piso 38 | Recipient | |
Building/Site | |||
Street | Paseo de la Reforma 505 | ||
Extension | Piso 38 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 2 | Hospital General de México, Calle Dr. Balmis No. 148 | Recipient | |
Building/Site | Hospital General de México | ||
Street | Calle Dr. Balmis No. 148 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | A/A: Linda Salcido AV INDEPENDENCIA 800 | Recipient | A/A: Linda Salcido |
Building/Site | |||
Street | AV INDEPENDENCIA 800 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. | ||
The Address (Global) (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address (Global) parse definition has been replaced with a copy of the Address (Global) (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (Global) (v23) it is suggested that you change them back. |
Address (v23) | |||
---|---|---|---|
Description |
The Address (v23) parse definition parses addresses into a set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output | ||
Example 1 | Paseo de la Reforma 505 Piso 38 | Recipient | |
Building/Site | |||
Street | Paseo de la Reforma 505 | ||
Extension | Piso 38 | ||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 2 | Hospital General de México, Calle Dr. Balmis No. 148 | Recipient | |
Building/Site | Hospital General de México | ||
Street | Calle Dr. Balmis No. 148 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Input | Output | ||
Example 3 | A/A: Linda Salcido AV INDEPENDENCIA 800 | Recipient | A/A: Linda Salcido |
Building/Site | |||
Street | AV INDEPENDENCIA 800 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Remarks |
The Address (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Address parse definition has been replaced with a copy of the Address (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Address (v23) it is suggested that you change them back. |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens. | ||
Output Tokens | City Province Postal Code Municipio/Delegación |
||
Input | Output | ||
Example | 11570 Mexico, DF | City | Mexico |
Province | DF | ||
Postal Code | 11570 | ||
Municipio/Delegación | |||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Input | Output | ||
Example | 11570 Mexico, DF | City | Mexico |
State/Province | DF | ||
Postal Code | 11570 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name | |||
---|---|---|---|
Description |
The Name parse definition parses names of individuals into a set of tokens. |
||
Output Tokens | Prefix Given Name Family Name 1 Family Name 2 Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | Dr. Juan Carlos Lopez | Prefix | Dr. |
Given Name | Juan Carlos | ||
Family Name 1 | Lopez | ||
Family Name 2 | |||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 2 | Maria Julia Garcia Arroyo | Prefix | |
Given Name | Maria Julia | ||
Family Name 1 | Garcia | ||
Family Name 2 | Arroyo | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 3 | Mr John D'Onofrio Jr, CEO | Prefix | Mr |
Given Name | John | ||
Family Name 1 | D'Onofrio | ||
Family Name 2 | |||
Suffix | Jr | ||
Title/Additional Info | CEO | ||
Remarks |
The Name (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Name parse definition has been replaced with a copy of the Name (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Name (v23) it is suggested that you change them back. |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | Dr. Juan Carlos Lopez | Prefix | Dr. |
Given Name | Juan Carlos | ||
Middle Name | |||
Family Name | Lopez | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 2 | Maria Julia Garcia Arroyo | Prefix | |
Given Name | Maria Julia | ||
Middle Name | |||
Family Name | Garcia Arroyo | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 3 | Sr. Juan Gomez Gonzales Prof | Prefix | Sr. |
Given Name | Juan | ||
Middle Name | |||
Family Name | Gomez Gonzales | ||
Suffix | |||
Title/Additional Info | Prof | ||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (v23) | |||
---|---|---|---|
Description | The Name (v23) parse definition parses names of individuals into a set of tokens. | ||
Output Tokens | Prefix Given Name Family Name 1 Family Name 2 Suffix Title/Additional Info |
||
Input | Output | ||
Example 1 | Dr. Juan Carlos Lopez | Prefix | Dr. |
Given Name | Juan Carlos | ||
Family Name 1 | Lopez | ||
Family Name 2 | |||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 2 | Maria Julia Garcia Arroyo | Prefix | |
Given Name | Maria Julia | ||
Family Name 1 | Garcia | ||
Family Name 2 | Arroyo | ||
Suffix | |||
Title/Additional Info | |||
Input | Output | ||
Example 3 | Mr John D'Onofrio Jr, CEO | Prefix | Mr |
Given Name | John | ||
Family Name 1 | D'Onofrio | ||
Family Name 2 | |||
Suffix | Jr | ||
Title/Additional Info | CEO | ||
Remarks |
The Name (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Name parse definition has been replaced with a copy of the Name (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Name (v23) it is suggested that you change them back. |
Organization | |||
---|---|---|---|
Description | The Organization parse definition parses company and organization information into a set of tokens. | ||
Output Tokens | Organization Legal Form Site |
||
Input | Output | ||
Example 1 | Grupo Carso S.A. de C.V. | Organization | Grupo Carso |
Legal Form | S.A. de C.V. | ||
Site | |||
Remarks |
The Organization (v24) parse definition is a temporary definition provided to facilitate an upgrade of the Organization definition. In a future release, the output tokens of the Organization definition will be changed. This change will require you to update any jobs using the Organization definition so that the tokens specified in those jobs will match the tokens used by the definition. The Organization (v24) definition uses the tokens that will be used by Organization in the future. If you want to begin using the new tokens and updated processing now rather than waiting for a later release, you can update your jobs to call the Organization (v24) definition. Be aware however that the Organization (v24) definition will be deprecated in a subsequent release after the Organization definition has been updated. |
Organization (v24) | |||
---|---|---|---|
Description |
The Organization (v24) parse definition parses company and organization information into a set of tokens. |
||
Output Tokens |
Name |
||
Input | Output | ||
Example 1 | Grupo Carso S.A. de C.V. | Name | Grupo Carso |
Legal Form | S.A. de C.V. | ||
Site | |||
Additional Info | |||
Input | Output | ||
Example 2 | CEPSA SA CHIHUAHUA - Compañía Española de Petróleos | Name | CEPSA |
Legal Form | SA | ||
Site | CHIHUAHUA | ||
Additional Info | Compañía Española de Petróleos | ||
Input | Output | ||
Example 3 | Hewlett Packard Española SL, Barcelona | Name | Hewlett Packard Española |
Legal Form | SL | ||
Site | Barcelona | ||
Additional Info | |||
Input | Output | ||
Example 4 | BBV TITULIZACION, S.A. (S.G.F.T.H.) | Name | BBV TITULIZACION |
Legal Form | S.A. | ||
Site | |||
Additional Info | (S.G.F.T.H.) | ||
Remarks |
The Organization (v24) parse definition is a temporary definition provided to facilitate an upgrade of the Organization definition. In a future release, the output tokens of the Organization definition will be changed. This change will require you to update any jobs using the Organization definition so that the tokens specified in those jobs will match the tokens used by the definition. The Organization (v24) definition uses the tokens that will be used by Organization in the future. If you want to begin using the new tokens and updated processing now rather than waiting for a later release, you can update your jobs to call the Organization (v24) definition. Be aware however that the Organization (v24) definition will be deprecated in a subsequent release after the Organization definition has been updated. |
Phone | |||
---|---|---|---|
Description |
The Phone parse definition parses phone numbers into a set of tokens. |
||
Output Tokens | Prefix Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | 52 5521234567 | Prefix | |
Country Code | 52 | ||
Area Code | 55 | ||
Base Number | 21234567 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 2 | 044 5521234567 | Prefix | 044 |
Country Code | |||
Area Code | 55 | ||
Base Number | 21234567 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 3 | Oficina: 55 2123 4567 ext 156 | Prefix | |
Country Code | |||
Area Code | 55 | ||
Base Number | 2123 4567 | ||
Extension | 156 | ||
Line Type | Oficina: | ||
Additional Info | |||
Input | Output | ||
Example 4 | Movil: 55 2123 4567 (noches) | Prefix | |
Country Code | |||
Area Code | 55 | ||
Base Number | 2123 4567 | ||
Extension | |||
Line Type | Movil: | ||
Additional Info | (noches) | ||
Remarks | The Prefix token will contain the domestic long-distance dialing code (01), the domestic cellular dialing codes (044 and 045), and their accepted variants. | ||
The Phone (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Phone parse definition has been replaced with a copy of the Phone (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Phone (v23) it is suggested that you change them back. |
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example | +52 (55) 5122 4300 | Country Code | 52 |
Area Code | 55 | ||
Base Number | 51224300 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Phone (IFETEL) | |||
---|---|---|---|
Description |
The Phone (IFETEL) parse definition parses an IFETEL-compliant phone number into a set of tokens. |
||
Output Tokens | Prefix Country Code Cellular Code LADA Serie Numero Extension Additional Info |
||
Input | Output | ||
Example 1 | 520442432604890 | Prefix | |
Country Code | 52 | ||
Cellular Code | 044 | ||
LADA | 243 | ||
Serie | 260 | ||
Numero | 4890 | ||
Extension | |||
Additional Info | |||
Input | Output | ||
Example 2 | 013323456789 | Prefix | 01 |
Country Code | |||
Cellular Code | |||
LADA | 33 | ||
Serie | 2345 | ||
Numero | 6789 | ||
Extension | |||
Additional Info | |||
Input | Output | ||
Example 3 | 8162047148 ext 304 | Prefix | |
Country Code | |||
Cellular Code | |||
LADA | 81 | ||
Serie | 6204 | ||
Numero | 7148 | ||
Extension | 304 | ||
Additional Info | |||
Input | Output | ||
Example 4 | 234 234 5678901 | Prefix | |
Country Code | |||
Cellular Code | |||
LADA | |||
Serie | |||
Numero | |||
Extension | |||
Additional Info | 2 3 4 2 3 4 5 6 7 8 9 0 1 | ||
Remarks | The input string for Example 4 contains too many digits for IFETEL compliance. | ||
The Phone (IFETEL) definition is specific to the Spanish, Mexico locale. The purpose of the definition is to separate a telephone number into the tokens used by the IFETEL national phone number registry. If the input string does not contain a Mexican phone number in IFETEL-compliant format, the output will have a NO SOLUTION parse result and the input string will be output in the Additional Info token as shown in Example 4, with spaces between the digits. The recommended procedure is to use the Result code field in the Parse node of your Data Job to store the results of the parsing operation. Then the data job should use the value of the Result code field to determine how the record should be processed. Records with an "OK" Result Code should be processed as valid IFETEL phone numbers. Those with a "NO SOLUTION" or "NULL" Result Code should be processed separately, if it all. |
|||
Phone (v23) | |||
---|---|---|---|
Description |
The Phone (v23) parse definition parses phone numbers into a set of tokens. |
||
Output Tokens | Prefix Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output | ||
Example 1 | 52 5521234567 | Prefix | |
Country Code | 52 | ||
Area Code | 55 | ||
Base Number | 21234567 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 2 | 044 5521234567 | Prefix | 044 |
Country Code | |||
Area Code | 55 | ||
Base Number | 21234567 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Input | Output | ||
Example 3 | Oficina: 55 2123 4567 ext 156 | Prefix | |
Country Code | |||
Area Code | 55 | ||
Base Number | 2123 4567 | ||
Extension | 156 | ||
Line Type | Oficina: | ||
Additional Info | |||
Input | Output | ||
Example 4 | Movil: 55 2123 4567 (noches) | Prefix | |
Country Code | |||
Area Code | 55 | ||
Base Number | 2123 4567 | ||
Extension | |||
Line Type | Movil: | ||
Additional Info | (noches) | ||
Remarks | The Prefix token will contain the domestic long-distance dialing code (01), the domestic cellular dialing codes (044 and 045), and their accepted variants. | ||
The Phone (v23) parse definition is now deprecated and will be removed in a future release of the QKB. The Phone parse definition has been replaced with a copy of the Phone (v23) definition which takes advantage of the new tokens and updated processing. If you changed your jobs to use Phone (v23) it is suggested that you change them back. |
None.
Address | ||
---|---|---|
Description |
The Address standardization definition standardizes addresses. |
|
Input | Output | |
Examples | Paseo de la Reforma 76 1ER PISO | Pso de la Reforma 76 1er Piso |
Avenida Hidalgo 1524 Colonia Cordoba Americas | Av Hidalgo 1524 Col Cordoba Americas | |
Remarks |
The Address (v24) standardization definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address definition accepts parsed input, but will not accept parsed input in a future release. Address (v24) does not accept parsed input, which will also be true of Address in the future. The removal of input tokens will require you to update any jobs using the Standardization (Parsed) node for the Address definition, replacing the Standardization (Parsed) node with the Standardization node and providing input as a single field. Jobs using the Standardization node will not require an update. If you want to begin using the updated processing now rather than waiting for a later release, you can update your jobs to call the Address (v24) definition. Be aware however that the Address (v24) definition will be deprecated in a subsequent release after the Address definition has been updated. |
Address (v24) | ||
---|---|---|
Description |
The Address (v24) standardization definition standardizes addresses. |
|
Input | Output | |
Examples | Pso de la Reforma 76 Tercero Piso | Paseo de la Reforma 76, Piso 3 |
Av Hidalgo 1524 Col Cordoba Americas | Avenida Hidalgo 1524 (Col Cordoba Americas) | |
Remarks |
The Address (v24) standardization definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address definition accepts parsed input, but will not accept parsed input in a future release. Address (v24) does not accept parsed input, which will also be true of Address in the future. The removal of input tokens will require you to update any jobs using the Standardization (Parsed) node for the Address definition, replacing the Standardization (Parsed) node with the Standardization node and providing input as a single field. Jobs using the Standardization node will not require an update. If you want to begin using the updated processing now rather than waiting for a later release, you can update your jobs to call the Address (v24) definition. Be aware however that the Address (v24) definition will be deprecated in a subsequent release after the Address definition has been updated. |
City | ||
---|---|---|
Description | The City standardization definition standardizes city names. | |
Input | Output | |
Examples | mexico city | Mexico City |
PALACHO | Palacho | |
Remarks |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code standardization definition standardizes last line address information. | |
Input | Output | |
Example | 13210 Mexico Distrito Federal | 13210 Mexico, DF |
Remarks |
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Input | Output | |
Examples | senora maria garcia arroyo | Sra Maria Garcia Arroyo |
LEONEL DE JESUS CAMPOS | Leonel de Jesus Campos | |
Director Tecnico Eduardo Gonzalez De La Peña | Director Tecnico Eduardo Gonzalez de la Peña | |
Reverendo Antonio Parrera | R Antonio Parrera | |
Remarks |
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
|
The Name (v24) standardization definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Standardization (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Standardization node will not require an update. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
Name (v24) | ||
---|---|---|
Description |
The Name (v24) standardization definition standardizes names of individuals. |
|
Input | Output | |
Examples | Antonio la Merced | Antonio La Merced |
JOSE ALEJANDRO ASENCIO DE LA PENA | José Alejandro Asencio de la Pena | |
Señor Marcell d'Marco Camey Martinez | Sr Marcell D'Marco Camey Martinez | |
Luis de Torres Contador Publico | CP Luis de Torres | |
Remarks |
If this definition is applied to pre-parsed data, the following input tokens are available:
It is recommended that you map a correlating data field to each available token whenever possible. |
|
The Name (v24) standardization definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name definition accepts parsed input and the input tokens will be changed in a future release. Name (v24) uses the tokens that will be used by Name in the future. The token change will require you to update any jobs using the Standardization (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Standardization node will not require an update. If you want to begin using the new tokens or the updated processing now rather than waiting for a later release, you can update your jobs to call the Name (v24) definition. Be aware however that the Name (v24) definition will be deprecated in a subsequent release after the Name definition has been updated. |
*
Organization | ||
---|---|---|
Description | The Organization standardization definition standardizes organization names. | |
Input | Output | |
Examples | dataflux | DataFlux |
Aventis Pharma S.A. DE C.V. | Aventis Pharma SA de CV | |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Input | Output | |
Examples | 52 55 5122 4300 | +52 (55) 5122 4300 |
+52 55-5122-4300 | +52 (55) 5122 4300 | |
Remarks |
Phone (Basic) | ||
---|---|---|
Description | The Phone (Basic) standardization definition standardizes phone numbers to a more basic format. | |
Input | Output | |
Examples | 52 55 5122 4300 | 52 55 5122 4300 |
+52 55-5122-4300 | 52 55 5122 4300 | |
Remarks |
Postal Code | ||
---|---|---|
Description | The Postal Code standardization definition standardizes postal codes. | |
Input | Output | |
Examples | -13210 | 13210 |
13210, | 13210 | |
Remarks |
In addition to the definitions listed on this page, the Spanish, Mexico locale also inherits all definitions for the Spanish language and all Global definitions.
Documentation Feedback: yourturn@sas.com
|
Doc ID: QKBCI_ESMEX_defs.html |