SAS Quality Knowledge Base for Contact Information 26
Definitions for the Spanish, Spain locale are described below.
Case Definitions
Extraction Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (Address) | ||
---|---|---|
Description | The Proper (Address) case definition propercases addresses. | |
Input | Output | |
Examples | paseo de recoletos, 23 | Paseo de Recoletos, 23 |
C. ENRIQUE LARRETA, 2 | C. Enrique Larreta, 2 | |
AV DEL CRISTO 25, COL LIBERTAD | Av del Cristo 25, Col Libertad | |
calle D-A4 col lomas de chairel | Calle D-A4 Col Lomas de Chairel | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Proper (Organization) case definition propercases organization names. | |
Input | Output | |
Examples | atofina españa sa | Atofina España SA |
LIGA EUROPEA DE DIABETICOS (LED) | Liga Europea de Diabeticos (LED) | |
BBV TITULIZACION, S.A. (S.G.F.T.H.) | BBV Titulizacion, S.A. (S.G.F.T.H.) | |
Remarks | This definition uses a list of known organization names to handle exceptions to propercasing rules. |
None.
None.
None.
Address | ||
---|---|---|
Description | The Address match definition generates match codes which can be used to cluster records containing addresses. | |
Max Length of Match Code | 30 characters | |
Input | Cluster ID | |
Examples | AVENIDA DE AMERICA, 38 | 0 |
AV DE AMERICO, 38 | 0 | |
C/ AMERIKA, 38 Planta 1 izquierda | 0 | |
Av De Andalucia 38 Planta 3 derecha | 1 | |
APARTADO de CORREOS número 241 | 2 | |
Remarks |
|
Address (PO Box Only) | ||
---|---|---|
Description | The Address (PO Box Only) match definition generates match codes which can be used to cluster records containing the PO Box portion of an address. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | APARTADO de CORREOS número 241 | 0 |
APTDO CORREOS 241 | 0 | |
Ap CORREOS 00241 | 0 | |
AVENIDA DE AMERICA, 38 | 1 | |
Remarks |
|
Address (Street Only) | ||
---|---|---|
Description | The Address (Street Only) match definition generates match codes which can be used to cluster records containing the street portion of an address. | |
Max Length of Match Code | 25 characters | |
Input | Cluster ID | |
Examples | AVENIDA DE AMERICA, 38 | 0 |
AV DE AMERICO, 38 | 0 | |
C/ AMERIKA, 38 Planta 1 izquierda | 0 | |
Av De Andalucia 38 Planta 3 derecha | 1 | |
APARTADO de CORREOS número 241 | 2 | |
Remarks |
|
City | ||
---|---|---|
Description | The City match definition generates match codes which can be used to cluster records containing city names. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | Mataró | 0 |
MATARO | 0 | |
Madrid | 1 | |
Remarks |
|
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code match definition generates match codes which can be used to cluster records containing last line address information. | |
Max Length of Match Code | 19 characters | |
Input | Cluster ID | |
Examples | 28300 Aranjuez (Madrid) | 0 |
28301 Aranjuez - Madrid | 0 | |
28310 Aranjuez | 0 | |
28300 Madrid | 1 | |
08950 Barcelona | 2 | |
Remarks |
|
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 40 characters | |
Input | Cluster ID | |
Examples | BMW Iberica SA | 0 |
BMW Ibérica S.A., Madrid | 0 | |
SAS Institute | 1 | |
Remarks |
|
Phone | ||
---|---|---|
Description | The Phone match definition generates match codes which can be used to cluster records containing phone numbers. | |
Max Length of Match Code | 17 characters | |
Input | Cluster ID | |
Examples | 800 VENGAAESPAÑA | 0 |
800 836422 | 0 | |
926-503-988 | 1 | |
+34 926-503-988 | 1 | |
+682 35664897 | 2 | |
+683 35664897 | 3 | |
(968) 987654 | 4 | |
(968) 987650 | 4 | |
(91) 2411691 Extensión 1234 | 5 | |
(91) 2411691 Extensión 1235 | 5 | |
(91) 2411691 Extensión 1245 | 5 | |
Remarks |
|
Postal Code | ||
---|---|---|
Description | The Postal Code match definition generates match codes which can be used to cluster records containing postal codes. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 28300. | 0 |
28300 | 0 | |
C.P. 28301 | 0 | |
08950 | 1 | |
Remarks |
|
Address | |||
---|---|---|---|
Description | The Address parse definition parses addresses into a set of tokens. | ||
Output Tokens | Street Type Street Name Building Number Extension Building Name Additional Info |
||
Input | Output Token | Output | |
Example 1 | Avda. del Parallelo, 51 | Street Type | Avda. |
Street Name | del Parallelo | ||
Building Number | 51 | ||
Extension | |||
Building Name | |||
Additional Info | |||
Input | Output Token | Output | |
Example 2 | C. MARINA, 16 P38-39 TORRE MAPFRE | Street Type | C. |
Street Name | MARINA | ||
Building Number | 16 | ||
Extension | P38-39 | ||
Building Name | TORRE MAPFRE | ||
Additional Info | |||
Input | Output Token | Output | |
Example 3 | APARTADO DE CORREOS 10015 | Street Type | |
Street Name | |||
Building Number | |||
Extension | APARTADO DE CORREOS 10015 | ||
Building Name | |||
Additional Info | |||
Remarks |
Address (Full) | |||
---|---|---|---|
Description | The Address (Full) parse definition parses addresses containing complete two-line addresses into a set of tokens. | ||
Output Tokens | Street Type Street Name Building Number Extension Building Name Additional Info Postal Code City Province |
||
Input | Output Token | Output | |
Example 1 | Av. de Bruselas, 12 Arroyo de la Vega 28100 Alcobendas Madrid | Street Type | Av. |
Street Name | de Bruselas | ||
Building Number | 12 | ||
Extension | |||
Building Name | |||
Additional Info | Arroyo de la Vega | ||
Postal Code | 28100 | ||
City | Alcobendas | ||
Province | Madrid | ||
Input | Output Token | Output | |
Example 2 | CALLE VALENCIA 3 7 46183 L' ELIANA | Street Type | CALLE |
Street Name | VALENCIA | ||
Building Number | 3 | ||
Extension | 7 | ||
Building Name | |||
Additional Info | |||
Postal Code | 46183 | ||
City | L' ELIANA | ||
Province | |||
Input | Output Token | Output | |
Example 3 | Centro Empresarial Cityparc Ca. de l'Hospitalet, 147-149 Cornellà de Llobregat . Barcelona | Street Type | Ca. |
Street Name | de l'Hospitalet | ||
Building Number | 147-149 | ||
Extension | |||
Building Name | |||
Additional Info | Centro Empresarial Cityparc | ||
Postal Code | |||
City | Cornellà de Llobregat | ||
Province | Barcelona | ||
Remarks |
Address (Global) | |||
---|---|---|---|
Description |
The Address (Global) parse definition parses addresses into a globally recognized set of tokens. |
||
Output Tokens | Recipient Building/Site Street Extension PO Box Additional Info |
||
Input | Output Token | Output | |
Example 1 | Avda. del Parallelo, 51 | Recipient | |
Building/Site | |||
Street | Avda. del Parallelo, 51 | ||
Extension | |||
PO Box | |||
Additional Info | |||
Input | Output Token | Output | |
Example 2 | C. MARINA, 16 P38-39 TORRE MAPFRE | Recipient | |
Building/Site | TORRE MAPFRE | ||
Street | C. MARINA, 16 | ||
Extension | P38-39 | ||
PO Box | |||
Additional Info | |||
Input | Output Token | Output | |
Example 3 | APARTADO DE CORREOS 10015 | Recipient | |
Building/Site | |||
Street | |||
Extension | |||
PO Box | APARTADO DE CORREOS 10015 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
City - State/Province - Postal Code | |||
---|---|---|---|
Description | The City - State/Province - Postal Code parse definition parses last line address information into a set of tokens. | ||
Output Tokens | City Province Postal Code |
||
Input | Output Token | Output | |
Example 1 | 14006 CORDOBA | City | CORDOBA |
Province | |||
Postal Code | 14006 | ||
Input | Output Token | Output | |
Example 2 | ALMUNIA DOÑA GODINA- 50100, Zaragoza | City | ALMUNIA DOÑA GODINA |
Province | Zaragoza | ||
Postal Code | 50100 | ||
Input | Output Token | Output | |
Example 3 | C.P.: 35214 TELDE (LAS PALMAS) | City | TELDE |
Province | (LAS PALMAS) | ||
Postal Code | C.P.: 35214 | ||
Remarks |
City - State/Province - Postal Code (Global) | |||
---|---|---|---|
Description | The City - State/Province - Postal Code (Global) parse definition parses last line address information into a globally recognized set of tokens. | ||
Output Tokens | City State/Province Postal Code Additional Info |
||
Input | Output Token | Output | |
Example 1 | 14006 CORDOBA | City | CORDOBA |
State/Province | |||
Postal Code | 14006 | ||
Additional Info | |||
Input | Output Token | Output | |
Example 2 | ALMUNIA DOÑA GODINA- 50100, Zaragoza | City | ALMUNIA DOÑA GODINA |
State/Province | Zaragoza | ||
Postal Code | 50100 | ||
Additional Info | |||
Input | Output Token | Output | |
Example 3 | C.P.: 35214 TELDE (LAS PALMAS) | City | TELDE |
State/Province | (LAS PALMAS) | ||
Postal Code | C.P.: 35214 | ||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output Token | Output | |
Example 1 | Dr. Juan Carlos Lopez | Prefix | |
Given Name | Juan Carlos | ||
Middle Name | |||
Family Name | Lopez | ||
Suffix | |||
Title/Additional Info | Dr. | ||
Input | Output Token | Output | |
Example 2 | Maria Julia Garcia Arroyo | Prefix | |
Given Name | Maria Julia | ||
Middle Name | |||
Family Name | Garcia Arroyo | ||
Suffix | |||
Title/Additional Info | |||
Input | Output Token | Output | |
Example 3 | Mr John D'Onofrio Jr, CEO | Prefix | Mr |
Given Name | John | ||
Middle Name | |||
Family Name | D'Onofrio | ||
Suffix | Jr | ||
Title/Additional Info | CEO | ||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (Multiple Name) | |||
---|---|---|---|
Description | The Name (Multiple Name) parse definition parses strings that contain the names of two individuals into a set of tokens. | ||
Output Tokens | Name 1 Name 2 |
||
Input | Output Token | Output | |
Example 1 | Eduardo Gonzalez de la Peña y Susana Rebello | Name 1 | Eduardo Gonzalez de la Peña |
Name 2 | Susana Rebello | ||
Input | Output Token | Output | |
Example 2 | Jose y Cristina Perez | Name 1 | Jose Perez |
Name 2 | Cristina Perez | ||
Input | Output Token | Output | |
Example 3 | Don y Doña Mestres i López | Name 1 | Don Mestres i López |
Name 2 | Doña Mestres i López | ||
Remarks |
Phone | |||
---|---|---|---|
Description | The Phone parse definition parses phone numbers into a set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output Token | Output | |
Example 1 | Ofcna.: 34953399220 Ext 208 (después de las 5 pm) | Country Code | 34 |
Area Code | 953 | ||
Base Number | 399220 | ||
Extension | 208 | ||
Line Type | Ofcna.: | ||
Additional Info | (después de las 5 pm) | ||
Input | Output Token | Output | |
Example 2 | 34913399220 | Country Code | 34 |
Area Code | 91 | ||
Base Number | 3399220 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks |
Phone (Global) | |||
---|---|---|---|
Description | The Phone (Global) parse definition parses phone numbers into a globally recognized set of tokens. | ||
Output Tokens | Country Code Area Code Base Number Extension Line Type Additional Info |
||
Input | Output Token | Output | |
Example 1 | Ofcna.: 34953399220 Ext 208 (después de las 5 pm) | Country Code | 34 |
Area Code | 953 | ||
Base Number | 399220 | ||
Extension | 208 | ||
Line Type | Ofcna.: | ||
Additional Info | (después de las 5 pm) | ||
Input |
Output Token |
Output | |
Example 2 | 34913399220 | Country Code | 34 |
Area Code | 91 | ||
Base Number | 3399220 | ||
Extension | |||
Line Type | |||
Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
None.
Address | ||
---|---|---|
Description | The Address standardization definition standardizes addresses. | |
Input | Output | |
Examples | Avda. del Paralelo 51 | Avenida del Paralelo 51 |
Pso. de la Castellana 67 | Paseo de la Castellana 67 | |
Remarks |
City | ||
---|---|---|
Description | The City standardization definition standardizes city names. | |
Input | Output | |
Examples | madrid | Madrid |
EL PUERTO DE Santa María | EL Puerto de Santa María | |
HUETOR - TAJAR | Huétor-Tájar | |
Remarks |
City - State/Province - Postal Code | ||
---|---|---|
Description | The City - State/Province - Postal Code standardization definition standardizes last line address information. | |
Input | Output | |
Examples | 28300 Aranjuez Madrid | 28300 Aranjuez (Madrid) |
SAN JAIME DOMENYS Cód. Postal 43720, Tarragona | 43720 San Jaime Domenys, Tarragona | |
ALMUNIA DOÑA GODINA- 50100, Zaragoza | 50100 La Almunia de Doña Godina, Zaragoza | |
Remarks |
Organization | ||
---|---|---|
Description | The Organization standardization definition standardizes organization names. | |
Input | Output | |
Examples | N. MARTINEZ Y.J.PIERA C.B. | N Martinez YJ Piera CB |
Agrupacio D'interes Economic De La Conca De La Tordera | La Conca de la Tordera AIE | |
Ibm International, inc | IBM Intl Inc | |
Remarks |
Phone | ||
---|---|---|
Description | The Phone standardization definition standardizes phone numbers for domestic use. | |
Input | Output | |
Examples | 34(922)-783-692 | 922 783 692 |
949391000 (después de las 5 pm) | 949 391 000, Después de las 5 PM | |
806123123 | 806 123 123 | |
030 12345-67 | (030) 12345-67 | |
+1 (919) 447-3000 | +1 9194473000 | |
924-936-827 ex 234 | 924 936 827 x234 | |
Remarks |
Phone (Electronic) | ||
---|---|---|
Description | The Phone (Electronic) standardization definition standardizes phone numbers for automated calling systems. | |
Input | Output | |
Examples | 0044 (0)20 12345000 | +442012345000 |
924-936-827 | +34924936827 | |
800 MIESPAÑA | +3480064377262 | |
949391000 (después de las 5 pm) | +34949391000 | |
Remarks |
Phone (with Country Code) | ||
---|---|---|
Description | The Phone (with Country Code) standardization definition standardizes phone numbers for international use. | |
Input | Output | |
Examples | 924-936-827 | +34 924 936 827 |
905640640 | +34 905 640 640 | |
949391799 (Trabajo) | +34 949 391 799, Trabajo | |
0044 (0)20 12345000 | +44 2012345000 | |
Remarks |
Postal Code | ||
---|---|---|
Description | The Postal Code standardization definition standardizes postal codes. | |
Input | Output | |
Examples | 08950- | 08950 |
,08950 | 08950 | |
C.P.08950 | 08950 | |
ES-08950 | 08950 | |
Remarks |
In addition to the definitions listed on this page, the Spanish, Spain locale also inherits all definitions for the Spanish language and all Global definitions.
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_ESESP_defs.html |