You are here: Definitions>Spanish Definitions>Spanish, Mexico Definitions

SAS Quality Knowledge Base for Contact Information 26

Spanish, Mexico Definitions

Definitions for the Spanish, Mexico locale are described below.

Case Definitions
Extraction Definitions
Gender Analysis Definitions

Identification Analysis Definitions

Match Definitions

Parse Definitions

Pattern Analysis Definitions

Standardization Definitions

Inherited Definitions

Case Definitions

Proper (City - State/Province - Postal Code)
Description The Proper (City - State/Province - Postal Code ) case definition propercases last line address information.
  Input Output
Example 77520 cancun, q. roo 77520 Cancun, Q. Roo
Remarks  

 

Proper (Organization)
Description The Proper (Organization) case definition propercases organization names.
  Input Output
Examples diconsa sa de cv Diconsa SA de CV
sas institute SAS Institute
Remarks The Proper (Organization) case definition uses a list of known organization names to handle exceptions to propercasing rules.

Extraction Definitions

None.

Gender Analysis Definitions

CURP
Description

The CURP gender analysis definition determines the gender of an individual based on their Unique Population Registry Code (Clave Única de Registro de Población).

Possible Outputs M
F
U
  Input Output
Examples ZAZD801124MBSYQN13 F
NURV140413HQTWGN12 M
NURV140413XQTWGN12 U
  Remarks

The CURP gender analysis definition is designed to provide valid results when operating on valid CURP identity codes. It is recommended that you use the CURP (Validation) identification analysis definition to validate your input and that you pass only valid CURP identity codes into the CURP gender analysis definition. Results from invalid data cannot be guaranteed to be accurate.

 

Name
Description The Name gender analysis definition determines the gender of a name.
Possible Outputs M
F
U
  Input Output
Examples Guadalupe Salvador Rivera U
Jose Maria Rivera Morales M
Maria De Lourdes Rodriguez F
  Remarks

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Family Name 1
Family Name 2
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

Identification Analysis Definitions

CURP (Validation)
Description

The CURP (Validation) identification analysis definition determines if a string is a Unique Population Registry Code (Clave Única de Registro de Población).

Possible Outputs VALID
INVALID
  Input Output Comments
Examples ZAZD801124MBSYQN13 VALID  
RUNO390705HSLPTC02 VALID  
IMS120607120 INVALID Too short (only 12 characters)
KAOB750230MASLHJ07 INVALID Invalid date (Feb 30)
EOMR330613FDFPKT08 INVALID Invalid gender (F instead of H or M)
YOOR280801HXXLTK07 INVALID Invalid state (XX is not a state)
EIMV310211MQRPKT1X INVALID Invalid last character (X instead of a value 0-9)
L6NF640805MYNTNN10 INVALID Invalid second character (6 instead of an A-Z value)
MOJL020101HCLKWMXY09 INVALID Too long (over 18 characters)
Remarks

This definition validates on the following criteria:

  • The string is exactly 18 characters in length.
  • Characters 1-4 are alphabetic characters (A-Z, a-z), in a combination that is not deemed offensive.
  • Characters 5-10 are a valid date in YYMMDD format.
  • Character 11 is either H or M, representing the male or female gender respectively.
  • Characters 12-13 are a valid 2-letter Mexican state code, or "NE" for individuals who are foreign-born.
  • Characters 14-16 are alphabetic (A-Z, a-z).
  • Character 17 is alphanumeric (0-9, A-Z, a-z).
  • Character 18 is numeric (0-9).

 

Individual/Organization
Description The Individual/Organization identification analysis definition determines whether a string represents the name of an individual or an organization.
Possible Outputs INDIVIDUAL
ORGANIZATION
UNKNOWN
  Input Output
Examples Grupo Dixon S.A. ORGANIZATION
Roberto Cruz Morales INDIVIDUAL
Marta Leticia Ceballos Gomez INDIVIDUAL
Guttermoth UNKNOWN
Remarks  

 

Individual/Organization (RFC)
Description

The Individual/Organization (RFC) identification analysis definition determines the identify of an individual or organization based on their Federal Taxpayer Registry Code (Registro Federal de Contribuyentes).

Possible Outputs INDIVIDUAL
ORGANIZATION
UNKNOWN
  Input Output
Examples ABC680524F23 ORGANIZATION
VECJ680524V23 INDIVIDUAL
ABCDE12345678 UNKNOWN
H1N831130GD9 UNKNOWN
PIA850230ZVA UNKNOWN
BEPJ361225IS@ UNKNOWN
Remarks

This definition validates using the following criteria:

  • For organizations and individuals the string is either 12 or 13 characters in length. The string length determines if the entity is an individual or organization.
  • The first 3 (for 12-character strings) or 4 characters (for 13-character strings) are alphabetic characters (A-Z, a-z), with a combination that is not deemed to be offensive.
  • The following 6 characters are a valid date in YYMMDD format.
  • The last 3 characters are alphanumeric characters (A-Z, a-z, 0-9).

Match Definitions

Address
Description

The Address match definition generates match codes which can be used to cluster records containing addresses.

Max Length of Match Code 68 characters
  Input Cluster ID
Examples Av Insurgentes num 325 0
Ave. Insurgentes 325 0
Avenida Insurgentes #325 0
APDO POSTAL 1329, Calle Insurgentes 9908 1
Calle Insurgentes 9908 APDO POSTAL 1329 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

The Address match definition has been replaced with a copy of the Address (v24) definition. The Address (v24) definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Address (v24) definition, it is suggested that you change them back to use the Address definition.

The Address match definition has changed in the following ways:

The Address definition no longer accepts parsed input. The removal of input tokens will require you to update any jobs using the Match Codes (Parsed) node for the Address definition, replacing the Match Codes (Parsed) node with the Match Codes node and providing input as a single field. Jobs using the Match Codes node will not require an update.

The match code length has been changed. This change might require you to update any jobs using the Address definition so that the match code fields can handle the new length.

 

Address (Full)
Description The Address (Full) match definition generates match codes which can be used to cluster records containing complete two-line addresses.
Max Length
of Match Code
118 characters
  Input Cluster ID
Examples Av Independencia 800 Bario De San Agustin Municipio De Jalatlaco CP 05286 Edo De Mexico 0
Avenida Independencia 800 Bario De San Agustin Municipio De Jalatlaco CP 05286 Edo De Mexico 0
Avenida Independencia 800 Bario De San Agustin Municipio De Jalatlaco CP 5286 Edo De Mexico 0
Av Independencia #800 Bario De San Agustin Municipio De Jalatlaco CP 5286 Edo De Mexico 0
Calle Insurgentes 9908 APDO POSTAL 1329 1
Av Insurgentes 9908 APDO POSTAL 1329 1
Torre Mayor Condominio horizontal 200, 82200 Mazatlan, Sinaloa 2
Torre Mayor Condominio horizontal 200, 82200 Mazatlan de Norte, Sinaloa 2
Torre Mayor Condominio horizontal 200, 82200 Mazatlan de sur, Sinaloa 2
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Address (PO Box Only)
Description

The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address.

Max Length of Match Code 15 characters
  Input Cluster ID
Examples APDO POSTAL 1329, Avenida Insurgentes #325 0
APDO POSTAL 1329, Avenida Insurgentes 123 0
Ave. Insurgentes 325 APDO POSTAL #1329 0
APDO POSTAL 1329 0
APDO 345 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Address (Street Only)
Description

The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address.

Max Length of Match Code 68 characters
  Input Cluster ID
Examples Av Insurgentes num 325 0
Ave. Insurgentes 325 0
Avenida Insurgentes #325 0
APDO POSTAL 1329, Avenida Insurgentes #325 0
Ave. Insurgentes 325 APDO POSTAL 1329 0
Ave. Universidad 1200 1
Ave. Universidad No. 1200 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Address (v24)
Description

The Address (v24) match definition generates match codes which can be used to cluster records containing addresses.

Max Length of Match Code 68 characters
  Input Cluster ID
Examples Av Insurgentes num 325 0
Ave. Insurgentes 325 0
Avenida Insurgentes #325 0
APDO POSTAL 1329, Calle Insurgentes 9908 1
Calle Insurgentes 9908 APDO POSTAL 1329 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

The Address match definition has been replaced with a copy of the Address (v24) definition. The Address (v24) definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Address (v24) definition, it is suggested that you change them back to use the Address definition.

 

City
Description The City match definition generates match codes which can be used to cluster records containing city names.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples Acaspulco 1
Acasppulco 1
Mexico City 2
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

The City (v26) match definition is a temporary definition provided to facilitate an upgrade of the City definition.

In a future release, the match code length will be changed. This change might require you to update any jobs using the City definition so that the match code fields can handle the new length. The City (v26) definition uses the match code length that will be used by City in the future.

If you want to begin using the updated processing now rather than waiting for a later release, you can update your jobs to call the City (v26) definition. Be aware however that the City (v26) definition is deprecated and will be removed in a subsequent release when the City definition is updated.

 

City (v26)
Description

The City (v26) match definition generates match codes which can be used to cluster records containing city names.

Max Length of Match Code 20 characters
  Input Cluster ID
Examples Acaspulco 1
Acasppulco 1
Mexico City 2
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

The City (v26) match definition is a temporary definition provided to facilitate an upgrade of the City definition.

In a future release, the match code length will be changed. This change might require you to update any jobs using the City definition so that the match code fields can handle the new length. The City (v26) definition uses the match code length that will be used by City in the future.

If you want to begin using the updated processing now rather than waiting for a later release, you can update your jobs to call the City (v26) definition. Be aware however that the City (v26) definition is deprecated and will be removed in a subsequent release when the City definition is updated.

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples 11570 Distrito Federal, Mexico 1
Mexico Distrito Federal, CP 11570 1
11570 DF Mexico 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Name
Description

The Name match definition generates match codes which can be used to cluster records containing names of individuals.

Max Length of Match Code 27 characters
  Input Cluster ID
Examples Imelda Marina Escobar Hernandez 0
Sra Imelda Escobar Hernandez 0
Excel·lentíssim i Magnífic Senyor Marcel·lí J. Rigau i Molins 1
  Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Family Name 1
Family Name 2
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name match definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

The Name match definition has changed in the following ways:

The Name definition accepts parsed input and the input tokens have been changed. The token change will require you to update any jobs using the Match Codes (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Match Codes node will not require an update.

The match code length has been changed. This change might require you to update any jobs using the Name definition so that the match code fields can handle the new length.

 

Name (v24)
Description

The Name (v24) match definition generates match codes which can be used to cluster records containing names of individuals.

Max Length of Match Code 27 characters
  Input Cluster ID
Examples Imelda Marina Escobar Hernandez 0
Sra Imelda Escobar Hernandez 0
Excel·lentíssim i Magnífic Senyor Marcel·lí J. Rigau i Molins 1
  Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Family Name 1
Family Name 2
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name match definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

 

Organization
Description The Organization match definition generates match codes which can be used to cluster records containing organization names.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples Agroasemex SA 1
Agroasemex 1
SAS Institute 2
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Phone
Description The Phone match definition generates match codes which can be used to cluster records containing phone numbers.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples +52 (55) 5122 4300 1
52-55-5122-4300 1
52-55-51224300 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

Postal Code
Description The Postal Code match definition generates match codes which can be used to cluster records containing postal codes.
Max Length of Match Code 15 characters
  Input Cluster ID
Examples 11570 1
-11570 1
13210 2
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

 

State/Province
Description

The State/Province match definition generates match codes which can be used to cluster records containing states and provinces.

Max Length of Match Code 15 characters
  Input Cluster ID
Examples Chihuahua 0
Chih 0
CH 0
Campeche 1
Camp. 1
Remarks

NoteNote: The results listed above reflect the default match sensitivity (85).

Parse Definitions

Address
Description

The Address parse definition parses addresses into a set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output Token Output
Example 1 Paseo de la Reforma 505 Piso 38 Recipient  
Building/Site  
Street Paseo de la Reforma 505
Extension Piso 38
PO Box  
Additional Info  
  Input Output Token Output
Example 2 Hospital General de México, Calle Dr. Balmis No. 148 Recipient  
Building/Site Hospital General de México
Street Calle Dr. Balmis No. 148
Extension  
PO Box  
Additional Info  
  Input Output Token Output
Example 3 A/A: Linda Salcido AV INDEPENDENCIA 800 Recipient A/A: Linda Salcido
Building/Site  
Street AV INDEPENDENCIA 800
Extension  
PO Box  
Additional Info  
Remarks  

 

Address (Full)
Description The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens.
Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Neighborhood/Village
Delegation/Municipality
State/Province
Postal Code
Country
Additional Info
  Input Output Token Output
Example 1 Paseo de la Reforma 505 Piso 38 Del. Cuauhtémoc C.P. 06500 México, D.F. Recipient  
Building/Site  
Street Paseo de la Reforma 505
Extension Piso 38
PO Box  
Neighborhood/Village  
Delegation/Municipality Del. Cuauhtémoc
State/Province México, D.F.
Postal Code C.P. 06500
Country  
Additional Info  
  Input Output Token Output
Example 2 Hospital General de México, Calle Dr. Balmis No. 148, Col. Doctores, Delegacion Cuauhtemoc, Mexico, D.F., C.P. 06726 Recipient  
Building/Site Hospital General de México
Street Calle Dr. Balmis No. 148
Extension  
PO Box  
Neighborhood/Village Col. Doctores
Delegation/Municipality Delegacion Cuauhtemoc
State/Province México, D.F.
Postal Code C.P. 06726
Country  
Additional Info  
  Input Output Token Output
Example 3 A/A: Linda Salcido AV INDEPENDENCIA 800 BARIO DE SAN AGUSTIN MUNICIPIO DE JALATLACO CP 52860 EDO DE MEXICO Recipient A/A: Linda Salcido
Building/Site  
Street AV INDEPENDENCIA 800
Extension  
PO Box  
Neighborhood/Village BARIO DE SAN AGUSTIN
Delegation/Municipality MUNICIPIO DE JALATLACO
State/Province EDO DE MEXICO
Postal Code CP 52860
Country  
Additional Info  
Remarks  

 

Address (Global)
Description

The Address (Global) parse definition parses addresses into a globally recognized set of tokens.

Output Tokens Recipient
Building/Site
Street
Extension
PO Box
Additional Info
  Input Output Token Output
Example 1 Paseo de la Reforma 505 Piso 38 Recipient  
Building/Site  
Street Paseo de la Reforma 505
Extension Piso 38
PO Box  
Additional Info  
  Input Output Token Output
Example 2 Hospital General de México, Calle Dr. Balmis No. 148 Recipient  
Building/Site Hospital General de México
Street Calle Dr. Balmis No. 148
Extension  
PO Box  
Additional Info  
  Input Output Token Output
Example 3 A/A: Linda Salcido AV INDEPENDENCIA 800 Recipient A/A: Linda Salcido
Building/Site  
Street AV INDEPENDENCIA 800
Extension  
PO Box  
Additional Info  
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens.
Output Tokens City
Province
Postal Code
Municipio/Delegación
  Input Output Token Output
Example 11570 Mexico, DF City Mexico
Province DF
Postal Code 11570
Municipio/Delegación  
Remarks

The City - State/Province - Postal Code (v26) parse definition is a temporary definition provided to facilitate an upgrade of the City - State/Province - Postal Code definition.

In a future release, the output tokens of the City - State/Province - Postal Code definition will be changed. This change will require you to update any jobs using the City - State/Province - Postal Code definition so that the tokens specified in those jobs will match the tokens used by the definition. The City - State/Province - Postal Code (v26) definition uses the tokens that will be used by City - State/Province - Postal Code in the future.

If you want to begin using the new tokens and updated processing now rather than waiting for a later release, you can update your jobs to call the City - State/Province - Postal Code (v26) definition. Be aware however that the City - State/Province - Postal Code (v26) definition is deprecated and will be removed in a subsequent release when the City - State/Province - Postal Code definition is updated.

 

City - State/Province - Postal Code (Global)
Description

The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens.

Output Tokens City
State/Province
Postal Code
Additional Info
Example 1 Input Output Token Output
San Francisco de Campeche C.P. 06500 Campeche City San Francisco de Campeche
State/Province Campeche
Postal Code C.P. 06500
Additional Info  
Example 2 Input Output Token Output
ZOQUITE,GUADALUPE,ZACATECAS,98631 City GUADALUPE
State/Province ZACATECAS
Postal Code 98631
Additional Info ZOQUITE
Example 3 Input Output Token Output
RIO GRANDE,RIO GRANDE,ZACATECAS,98400 City RIO GRANDE
State/Province ZACATECAS
Postal Code 98400
Additional Info RIO GRANDE
Example 4 Input Output Token Output
LA HORMIGA,SAN CRISTOBAL DE LAS CASAS,CHIAPAS,29215 City SAN CRISTOBAL DE LAS CASAS
State/Province CHIAPAS
Postal Code 29215
Additional Info LA HORMIGA
Remarks

Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

City - State/Province - Postal Code (v26)
Description

The City - State/Province - Postal Code (v26) parse definition parses last line address information into a set of tokens.

Output Tokens Neighborhood/Village
Delegation/Municipality
State/Province
Postal Code
Additional Info
Example 1 Input Output Token Output
San Francisco de Campeche C.P. 06500 Campeche Neighborhood/Village  
Delegation/Municipality San Francisco de Campeche
State/Province Campeche
Postal Code C.P. 06500
Additional Info  
Example 2 Input Output Token Output
ZOQUITE,GUADALUPE,ZACATECAS,98631 Neighborhood/Village ZOQUITE
Delegation/Municipality GUADALUPE
State/Province ZACATECAS
Postal Code 98631
Additional Info  
Example 3 Input Output Token Output
RIO GRANDE,RIO GRANDE,ZACATECAS,98400 Neighborhood/Village RIO GRANDE
Delegation/Municipality RIO GRANDE
State/Province ZACATECAS
Postal Code 98400
Additional Info  
Example 4 Input Output Token Output
LA HORMIGA,SAN CRISTOBAL DE LAS CASAS,CHIAPAS,29215 Neighborhood/Village LA HORMIGA
Delegation/Municipality SAN CRISTOBAL DE LAS CASAS
State/Province CHIAPAS
Postal Code 29215
Additional Info  
Remarks

The City - State/Province - Postal Code (v26) parse definition is a temporary definition provided to facilitate an upgrade of the City - State/Province - Postal Code definition.

In a future release, the output tokens of the City - State/Province - Postal Code definition will be changed. This change will require you to update any jobs using the City - State/Province - Postal Code definition so that the tokens specified in those jobs will match the tokens used by the definition. The City - State/Province - Postal Code (v26) definition uses the tokens that will be used by City - State/Province - Postal Code in the future.

If you want to begin using the new tokens and updated processing now rather than waiting for a later release, you can update your jobs to call the City - State/Province - Postal Code (v26) definition. Be aware however that the City - State/Province - Postal Code (v26) definition is deprecated and will be removed in a subsequent release when the City - State/Province - Postal Code definition is updated.

 

Name
Description

The Name parse definition parses names of individuals into a set of tokens.

Output Tokens Prefix
Given Name
Family Name 1
Family Name 2
Suffix
Title/Additional Info
  Input Output Token Output
Example 1 Dr. Juan Carlos Lopez Prefix Dr.
Given Name Juan Carlos
Family Name 1 Lopez
Family Name 2  
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 2 Maria Julia Garcia Arroyo Prefix  
Given Name Maria Julia
Family Name 1 Garcia
Family Name 2 Arroyo
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 3 Mr John D'Onofrio Jr, CEO Prefix Mr
Given Name John
Family Name 1 D'Onofrio
Family Name 2  
Suffix Jr
Title/Additional Info CEO
Remarks  

 

Name (Global)
Description The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens.
Output Tokens Prefix
Given Name
Middle Name
Family Name
Suffix
Title/Additional Info
  Input Output Token Output
Example 1 Dr. Juan Carlos Lopez Prefix Dr.
Given Name Juan Carlos
Middle Name  
Family Name Lopez
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 2 Maria Julia Garcia Arroyo Prefix  
Given Name Maria Julia
Middle Name  
Family Name Garcia Arroyo
Suffix  
Title/Additional Info  
  Input Output Token Output
Example 3 Sr. Juan Gomez Gonzales Prof Prefix Sr.
Given Name Juan
Middle Name  
Family Name Gomez Gonzales
Suffix  
Title/Additional Info Prof
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Organization
Description

The Organization parse definition parses organization names into a set of tokens.

Output Tokens

Name
Legal Form
Site
Additional Info

  Input Output Token Output
Example 1 Grupo Carso S.A. de C.V. Name Grupo Carso
Legal Form S.A. de C.V.
Site  
Additional Info  
  Input Output Token Output
Example 2 CEPSA SA CHIHUAHUA - Compañía Española de Petróleos Name CEPSA
Legal Form SA
Site CHIHUAHUA
Additional Info Compañía Española de Petróleos
  Input Output Token Output
Example 3 Hewlett Packard Española SL, Barcelona Name Hewlett Packard Española
Legal Form SL
Site Barcelona
Additional Info  
  Input Output Token Output
Example 4 BBV TITULIZACION, S.A. (S.G.F.T.H.) Name BBV TITULIZACION
Legal Form S.A.
Site  
Additional Info (S.G.F.T.H.)
Remarks

The Organization parse definition has been replaced with a copy of the Organization (v24) definition. The Organization (v24) definition is a temporary definition provided to facilitate an upgrade of the Organization definition. The Organization (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Organization (v24) definition, it is suggested that you change them back to use the Organization definition.

The output tokens of the Organization definition have been changed. This change will require you to update any jobs using the Organization definition so that the tokens specified in those jobs will match the tokens used by the definition.

 

Organization (v24)
Description

The Organization (v24) parse definition parses organization names into a set of tokens.

Output Tokens

Name
Legal Form
Site
Additional Info

  Input Output Token Output
Example 1 Grupo Carso S.A. de C.V. Name Grupo Carso
Legal Form S.A. de C.V.
Site  
Additional Info  
  Input Output Token Output
Example 2 CEPSA SA CHIHUAHUA - Compañía Española de Petróleos Name CEPSA
Legal Form SA
Site CHIHUAHUA
Additional Info Compañía Española de Petróleos
  Input Output Token Output
Example 3 Hewlett Packard Española SL, Barcelona Name Hewlett Packard Española
Legal Form SL
Site Barcelona
Additional Info  
  Input Output Token Output
Example 4 BBV TITULIZACION, S.A. (S.G.F.T.H.) Name BBV TITULIZACION
Legal Form S.A.
Site  
Additional Info (S.G.F.T.H.)
Remarks

The Organization parse definition has been replaced with a copy of the Organization (v24) definition. The Organization (v24) definition is a temporary definition provided to facilitate an upgrade of the Organization definition. The Organization (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Organization (v24) definition, it is suggested that you change them back to use the Organization definition.

 

Phone
Description

The Phone parse definition parses phone numbers into a set of tokens.

Output Tokens Prefix
Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
  Input Output Token Output
Example 1 52 5521234567 Prefix  
Country Code 52
Area Code 55
Base Number 21234567
Extension  
Line Type  
Additional Info  
  Input Output Token Output
Example 2 044 5521234567 Prefix 044
Country Code  
Area Code 55
Base Number 21234567
Extension  
Line Type  
Additional Info  
  Input Output Token Output
Example 3 Oficina: 55 2123 4567 ext 156 Prefix  
Country Code  
Area Code 55
Base Number 2123 4567
Extension 156
Line Type Oficina:
Additional Info  
  Input Output Token Output
Example 4 Movil: 55 2123 4567 (noches) Prefix  
Country Code  
Area Code 55
Base Number 2123 4567
Extension  
Line Type Movil:
Additional Info (noches)
Remarks The Prefix token will contain the domestic long-distance dialing code (01), the domestic cellular dialing codes (044 and 045), and their accepted variants.

 

Phone (Global)
Description The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens.
Output Tokens Country Code
Area Code
Base Number
Extension
Line Type
Additional Info
  Input Output Token Output
Example 1 52 5521234567 Country Code 52
Area Code 55
Base Number 21234567
Extension  
Line Type  
Additional Info  
  Input Output Token Output
Example 2 044 5521234567 Country Code  
Area Code 044 55
Base Number 21234567
Extension  
Line Type  
Additional Info  
  Input Output Token Output
Example 3 Oficina: 55 2123 4567 ext 156 Country Code  
Area Code 55
Base Number 2123 4567
Extension 156
Line Type Oficina:
Additional Info  
  Input Output Token Output
Example 4 Movil: 55 2123 4567 (noches) Country Code  
Area Code 55
Base Number 2123 4567
Extension  
Line Type Movil:
Additional Info (noches)
Remarks Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales.

 

Phone (IFETEL)
Description

The Phone (IFETEL) parse definition parses an IFETEL-compliant phone number into a set of tokens.

Output Tokens Prefix
Country Code
Cellular Code
LADA
Serie
Numero
Extension
Additional Info
  Input Output Token Output
Example 1 520442432604890 Prefix  
Country Code 52
Cellular Code 044
LADA 243
Serie 260
Numero 4890
Extension  
Additional Info  
  Input Output Token Output
Example 2 013323456789 Prefix 01
Country Code  
Cellular Code  
LADA 33
Serie 2345
Numero 6789
Extension  
Additional Info  
  Input Output Token Output
Example 3 8162047148 ext 304 Prefix  
Country Code  
Cellular Code  
LADA 81
Serie 6204
Numero 7148
Extension 304
Additional Info  
  Input Output Token Output
Example 4 234 234 5678901 Prefix  
Country Code  
Cellular Code  
LADA  
Serie  
Numero  
Extension  
Additional Info 2 3 4 2 3 4 5 6 7 8 9 0 1
Remarks The input string for Example 4 contains too many digits for IFETEL compliance.

The Phone (IFETEL) definition is specific to the Spanish, Mexico locale. The purpose of the definition is to separate a telephone number into the tokens used by the IFETEL national phone number registry.

If the input string does not contain a Mexican phone number in IFETEL-compliant format, the output will have a NO SOLUTION parse result and the input string will be output in the Additional Info token as shown in Example 4, with spaces between the digits.

The recommended procedure is to use the Result code field in the Parse node of your Data Job to store the results of the parsing operation. Then the data job should use the value of the Result code field to determine how the record should be processed. Records with an "OK" Result Code should be processed as valid IFETEL phone numbers. Those with a "NO SOLUTION" or "NULL" Result Code should be processed separately, if it all.

Pattern Analysis Definitions

None.

Standardization Definitions

Address
Description

The Address standardization definition standardizes addresses.

  Input Output
Examples Pso de la Reforma 76 Tercero Piso Paseo de la Reforma 76, Piso 3
Av Hidalgo 1524 Col Cordoba Americas Avenida Hidalgo 1524 (Col Cordoba Americas)
Remarks

The Address standardization definition has been replaced with a copy of the Address (v24) definition. The Address (v24) definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Address (v24) definition, it is suggested that you change them back to use the Address definition.

The Address definition no longer accepts parsed input. The removal of input tokens will require you to update any jobs using the Standardization (Parsed) node for the Address definition, replacing the Standardization (Parsed) node with the Standardization node and providing input as a single field. Jobs using the Standardization node will not require an update.

 

Address (v24)
Description

The Address (v24) standardization definition standardizes addresses.

  Input Output
Examples Pso de la Reforma 76 Tercero Piso Paseo de la Reforma 76, Piso 3
Av Hidalgo 1524 Col Cordoba Americas Avenida Hidalgo 1524 (Col Cordoba Americas)
Remarks

The Address standardization definition has been replaced with a copy of the Address (v24) definition. The Address (v24) definition is a temporary definition provided to facilitate an upgrade of the Address definition. The Address (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Address (v24) definition, it is suggested that you change them back to use the Address definition.

 

City
Description The City standardization definition standardizes city, delegation, and municipality names.
  Input Output
Examples mexico city Ciudad de México
PALACHO Palacho
Remarks  

 

City - State/Province - Postal Code
Description The City - State/Province - Postal Code standardization definition standardizes last line address information.
  Input Output
Example 13210 Mexico Distrito Federal 13210 Mexico, DF
Remarks  

 

Name
Description

The Name standardization definition standardizes names of individuals.

  Input Output
Examples Antonio la Merced Antonio La Merced
JOSE ALEJANDRO ASENCIO DE LA PENA José Alejandro Asencio de la Pena
Señor Marcell d'Marco Camey Martinez Sr Marcell D'Marco Camey Martinez
Luis de Torres Contador Publico CP Luis de Torres
Remarks

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Family Name 1
Family Name 2
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name standardization definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

The Name definition accepts parsed input and the input tokens have been changed. The token change will require you to update any jobs using the Standardization (Parsed) node for the Name definition so that the tokens specified in that node will match the tokens used by the definition. Jobs using the non-parsed input Standardization node will not require an update.

 

Name (v24)
Description

The Name (v24) standardization definition standardizes names of individuals.

  Input Output
Examples Antonio la Merced Antonio La Merced
JOSE ALEJANDRO ASENCIO DE LA PENA José Alejandro Asencio de la Pena
Señor Marcell d'Marco Camey Martinez Sr Marcell D'Marco Camey Martinez
Luis de Torres Contador Publico CP Luis de Torres
Remarks

If this definition is applied to pre-parsed data, the following input tokens are available:

Prefix
Given Name
Family Name 1
Family Name 2
Suffix
Title/Additional Info

It is recommended that you map a correlating data field to each available token whenever possible.

The Name standardization definition has been replaced with a copy of the Name (v24) definition. The Name (v24) definition is a temporary definition provided to facilitate an upgrade of the Name definition. The Name (v24) definition is now deprecated and will be removed in a future release.

If you previously modified your jobs to use the Name (v24) definition, it is suggested that you change them back to use the Name definition.

 

Organization
Description The Organization standardization definition standardizes organization names.
  Input Output
Examples dataflux DataFlux
Aventis Pharma S.A. DE C.V. Aventis Pharma SA de CV
Remarks  

 

Phone
Description The Phone standardization definition standardizes phone numbers for domestic use.
  Input Output
Examples 52 55 5122 4300 +52 (55) 5122 4300
+52 55-5122-4300 +52 (55) 5122 4300
Remarks  

 

Phone (Basic)
Description The Phone (Basic) standardization definition standardizes phone numbers to a more basic format.
  Input Output
Examples 52 55 5122 4300 52 55 5122 4300
+52 55-5122-4300 52 55 5122 4300
Remarks  

 

Postal Code
Description The Postal Code standardization definition standardizes postal codes.
  Input Output
Examples -13210 13210
13210, 13210
Remarks  

 

Postal Code (with Country Code)
Description

The Postal Code (with Country Code) standardization definition standardizes postal codes and adds a domestic country code, unless there is already a country code in the input.

  Input Output
Examples -13210 MX-13210
13210, MX-13210
20800 MX-20800
MX-20800 MX-20800
USA-27514 US-27514
FR. 12345 FR-12345
CP-20800 MX-20800
CODIGO POSTAL 22600 MX-22600
Remarks  

 

State/Province
Description

The State/Province standardization definition standardizes state names.

  Input Output
Examples q. roo Quintana Roo
JAL. Jalisco
tlax Tlaxcala
San Luis San Luis Potosí
YU Yucatán
Remarks  

 

State/Province (Postal Standard)
Description

The State/Province (Postal Standard) standardization definition standardizes state names to the postal standard.

  Input Output
Examples Baja California BC
Estado Libre y Soberano de Sinaloa SIN
Quintana Roo Q ROO
Estado de México MEX
Coah. COAH
Remarks  

Inherited Definitions

In addition to the definitions listed on this page, the Spanish, Mexico locale also inherits all definitions for the Spanish language and all Global definitions.