SAS Quality Knowledge Base for Contact Information 26
In the QKB, the Italian definitions are shared by all Italian-language locales. Shared Italian definitions are described below.
Case Definitions
Extraction Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (Name) | ||
---|---|---|
Description | The Proper (Name) case definition propercases names of individuals. | |
Input | Output | |
Carlo m. d'Agostino | Carlo M. D'Agostino | |
Examples | dott.ssa lucia di sieno | Dott.ssa Lucia Di Sieno |
prof.ssa Maria Cristina Tozzi | Prof.ssa Maria Cristina Tozzi | |
Jan van de Brick | Jan Van De Brick | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Proper (Organization) case definition propercases organization names. | |
Input | Output | |
Examples | A G CAR SNC DI MURA GIOVANNI E TOFFALI ADRIANO AUTORIPARAZIONI | A G Car SNC di Mura Giovanni e Toffali Adriano Autoriparazioni |
EBISCOM SPA | eBiscom SPA | |
UNIVERSITA' DI PISA | Universita' di Pisa | |
ASSOCIAZIONE VOLONTARIA DI P.S. E P.A. CROCE BIANCA | Associazione Volontaria di P.S. e P.A. Croce Bianca | |
Remarks |
None.
Name | ||
---|---|---|
Description | The Name gender analysis definition determines the gender of a name. | |
Possible Output | M F U |
|
Input | Output | |
Examples | Stefania Barone | F |
Dott.ssa Castaldo | F | |
Sig.ra Brambilla | F | |
Maria Nicola Giotta | F | |
Nicola Maria Giotta | M | |
Signora Bossi | F | |
Signore Bossi | M | |
Sig. Bossi | U | |
Roberto Micalizzi | M | |
Remarks | If the gender is determined by the prefix or suffix, the given name is not analyzed. If the gender is not determined by the suffix, prefix, or given name, the title/additional information is analyzed. If there is more than one given name, only the first is used to determine the gender. |
Individual/Organization | ||
---|---|---|
Description | The Individual/Organization identification analysis definition determines whether a string represents the name of an individual or an organization. | |
Possible Outputs | INDIVIDUAL ORGANIZATION UNKNOWN |
|
Input | Output | |
Examples | Barilla Alimentare SPA | ORGANIZATION |
SERVIZI TECNOLOGIE AMBIENTALI | ORGANIZATION | |
Avv.ssa Eva Simone | ORGANIZATION | |
Acanfora Stefania SRA | INDIVIDUAL | |
Dottor Luigi Nello Villa | INDIVIDUAL | |
Ferrari | UNKNOWN | |
n corsi | UNKNOWN | |
Remarks | Strings that contain titles indicating an independent profession are identified as ORGANIZATION. Examples are lawyer (avvocato) and carpenter (carpentiere). |
Date (DMY) | ||
---|---|---|
Description | The Date (DMY) match definition generates match codes which can be used to cluster records containing dates that have the format DMY. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 10 gennaio 1998 | 0 |
10-1-1998 | 0 | |
Remarks |
|
Date (MDY) | ||
---|---|---|
Description | The Date (MDY) match definition generates match codes which can be used to cluster records containing dates that have the format MDY. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | gennaio 10, 1998 | 0 |
1.10.1998 | 0 | |
Remarks |
|
Date (YMD) | ||
---|---|---|
Description | The Date (YMD) match definition generates match codes which can be used to cluster records containing dates that have the format YMD. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | 1998 gennaio 10 | 0 |
1998.1.10 | 0 | |
Remarks |
|
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 20 characters | |
Input | Cluster ID | |
Examples | Maria Concetta Fabrizi | 0 |
Maria Sonia Fabrizi | 0 | |
Moira Fabrizzi | 0 | |
Mario Rosso | 1 | |
Mario Rossi | 2 | |
Jan v/d Brink | 3 | |
Jan van de Brink | 3 | |
Jan vd. Brink | 3 | |
Alex Brutto | 4 | |
Alexander Lobrutto | 4 | |
Alexandar Lo Brutto | 4 | |
Remarks |
|
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 35 characters | |
Input | Cluster ID | |
Examples | FINSIEL Cantieri Navali - Monfalcone SPA | 0 |
FINSIEL Cantieri Navali SCRL | 0 | |
FINSIEL Cantieri Navali Società Per Azioni | 0 | |
Banca Unicredit SPA | 1 | |
Unicredit Banca SPA | 1 | |
100 Cose Srl | 2 | |
Cento Cose SRL | 2 | |
ASL 1 Bolzano | 3 | |
AZIENDA SANITARIA LOCALE 1 DI BOLZANO | 3 | |
ABB Group | 4 | |
A.B.B. S.p.A. | 4 | |
Remarks |
|
Text | ||
---|---|---|
Description | The Text match definition generates match codes which can be used to cluster records containing general text strings. | |
Max Length of Match Code | 15 characters | |
Input | Cluster ID | |
Examples | Virgilio ti guida a Virgilio | 0 |
Crea la mappa | 1 | |
Remarks |
|
Date (DMY) | |||
---|---|---|---|
Description |
The Date (DMY) parse definition parses dates with format DMY into a set of tokens. |
||
Output Tokens | Day Month Year |
||
Input | Output Token | Output | |
Example 1 | 10/18/2007 | Day | 18 |
Month | 10 | ||
Year | 2007 | ||
Input | Output Token | Output | |
Example 2 | 10/1/07 | Day | 10 |
Month | 1 | ||
Year | 07 | ||
Input | Output Token | Output | |
Example 3 | Ottobre 5, 2007 | Day | 5 |
Month | Ottobre | ||
Year | 2007 | ||
Remarks | If it is not possible to determine whether information in the input is a day, month, or year, it will be determined by the assumed order of day, month, and year. |
Date (MDY) | |||
---|---|---|---|
Description | The Date (MDY) parse definition parses dates with format MDY into a set of tokens. | ||
Output Tokens | Day Month Year |
||
Input | Output Token | Output | |
Example 1 | 10/18/2007 | Day | 18 |
Month | 10 | ||
Year | 2007 | ||
Input | Output Token | Output | |
Example 2 | 10/1/07 | Day | 1 |
Month | 10 | ||
Year | 07 | ||
Input | Output Token | Output | |
Example 3 | Ottobre 5, 2007 | Day | 5 |
Month | Ottobre | ||
Year | 2007 | ||
Remarks | If it is not possible to determine whether information in the input is a day, month, or year, it will be determined by the assumed order of month, day, and year. |
Date (YMD) | |||
---|---|---|---|
Description | The Date (YMD) parse definition parses dates with format YMD into a set of tokens. | ||
Output Tokens | Day Month Year |
||
Input | Output Token | Output | |
Example 1 | 10/12/07 | Day | 07 |
Month | 12 | ||
Year | 10 | ||
Input | Output Token | Output | |
Example 2 | 10/1/2007 | Day | 10 |
Month | 1 | ||
Year | 2007 | ||
Input | Output Token | Output | |
Example 3 | Ottobre 5, 2007 | Day | 5 |
Month | Ottobre | ||
Year | 2007 | ||
Remarks | If it is not possible to determine whether information in the input is a day, month, or year, it will be determined by the assumed order of year, month, and day. |
Name | |||
---|---|---|---|
Description | The Name parse definition parses names of individuals into a set of tokens. | ||
Output Tokens | Prefix Given Name Family Name Suffix Title/Additional Info |
||
Input | Output Token | Output | |
Example 1 | Dr. Mario Rossi Jr., architetto | Prefix | Dr. |
Given Name | Mario | ||
Family Name | Rossi | ||
Suffix | Jr. | ||
Title/Additional Info | architetto | ||
Input | Output Token | Output | |
Example 2 | Dott.ssa Maria Rossi in Foà | Prefix | Dott.ssa |
Given Name | Maria | ||
Family Name | Rossi in Foà | ||
Suffix | |||
Title/Additional Info | |||
Remarks | If you want to separate family name extensions into a separate token you can use the parse definition named Name (with Family Name Extension). |
Name (Global) | |||
---|---|---|---|
Description | The Name (Global) parse definition parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Input | Output Token | Output | |
Example 1 | Dr. Mario Rossi Jr., architetto | Prefix | Dr. |
Given Name | Mario | ||
Middle Name | |||
Family Name | Rossi | ||
Suffix | Jr. | ||
Title/Additional Info | architetto | ||
Input | Output Token | Output | |
Example 2 | Dott.ssa Maria Rossi in Foà | Prefix | Dott.ssa |
Given Name | Maria | ||
Middle Name | |||
Family Name | Rossi in Foà | ||
Suffix | |||
Title/Additional Info | |||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (Multiple Name) | |||
---|---|---|---|
Description | The Name (Multiple Name) parse definition parses strings that contain the names of two individuals into a set of tokens. | ||
Output Tokens | Name 1 Name 2 |
||
Input | Output Token | Output | |
Example 1 | Marco Buzzi e Giovanna Tordi | Name 1 | Marco Buzzi |
Name 2 | Giovanna Tordi | ||
Input | Output Token | Output | |
Example 2 | Mario e Elena Rossi | Name 1 | Mario Rossi |
Name 2 | Elena Rossi | ||
Input | Output Token | Output | |
Example 3 | Signor & Signora Balossi | Name 1 | Signor Balossi |
Name 2 | Signora Balossi | ||
Input | Output Token | Output | |
Example 4 | Signor Paolo Fantozzi | Name 1 | Signor Paolo Fantozzi |
Name 2 | |||
Remarks | If only one name is present in the input, it is parsed into the Name 1 token. |
Name (with Family Name Extension) | |||
---|---|---|---|
Description |
The Name (with Family Name Extension) parse definition parses names of individuals into a set of tokens. |
||
Output Tokens | Prefix Given Name Family Name Family Name Extension Suffix Title/Additional Info |
||
Input | Output Token | Output | |
Example 1 | Dr. Mario Rossi Jr., architetto | Prefix | Dr. |
Given Name | Mario | ||
Family Name | Rossi | ||
Family Name Extension | |||
Suffix | Jr. | ||
Title/Additional Info | architetto | ||
Input | Output Token | Output | |
Example 2 | Dott.ssa Maria Rossi in Foà | Prefix | Dott.ssa |
Given Name | Maria | ||
Family Name | Rossi | ||
Family Name Extension | in Foà | ||
Suffix | |||
Title/Additional Info | |||
Remarks |
Organization | |||
---|---|---|---|
Description | The Organization parse definition parses organization names into a set of tokens. | ||
Output Tokens | Name Legal Form Site Additional Info |
||
Input | Output Token | Output | |
Example 1 | MOC MEDITERRANEO S.R.L. | Name | MOC MEDITERRANEO |
Legal Form | S.R.L. | ||
Site | |||
Additional Info | |||
Input | Output Token | Output | |
Example 2 | RIV-SKF OFFICINE Soc. per az. VILLAR PEROSA | Name | RIV-SKF OFFICINE |
Legal Form | Soc. per az. | ||
Site | VILLAR PEROSA | ||
Additional Info | |||
Input | Output Token | Output | |
Example 3 | Brian Pack S.a.s. di La Scala Alberto | Name | Brian Pack |
Legal Form | S.a.s. | ||
Site | |||
Additional Info | di La Scala Alberto | ||
Remarks |
Organization (Global) | |||
---|---|---|---|
Description | The Organization (Global) parse definition parses organization names into a set of tokens. | ||
Output Tokens | Name Legal Form Site Additional Info |
||
Input | Output Token | Output | |
Example 1 | MOC MEDITERRANEO S.R.L. | Name | MOC MEDITERRANEO |
Legal Form | S.R.L. | ||
Site | |||
Additional Info | |||
Input | Output Token | Output | |
Example 2 | RIV-SKF OFFICINE Soc. per az. VILLAR PEROSA | Name | RIV-SKF OFFICINE |
Legal Form | Soc. per az. | ||
Site | VILLAR PEROSA | ||
Additional Info | |||
Input | Output Token | Output | |
Example 3 | Brian Pack S.a.s. di La Scala Alberto | Name | Brian Pack |
Legal Form | S.a.s. | ||
Site | |||
Additional Info | di La Scala Alberto | ||
Remarks |
Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
None.
Date (DMY) | ||
---|---|---|
Description | The Date (DMY) standardization definition standardizes dates that have format DMY. The output is a zero-padded two-digit day, followed by a zero-padded two-digit month, followed by a four-digit year. The day, month, and year are separated by spaces. | |
Input | Output | |
Examples | 10/01/12 | 10 01 2012 |
10gen30 | 10 01 1930 | |
10.01.05 | 10 01 1930 | |
10 gen 2015 | 10 01 2015 | |
10-01-2015 | 10 01 2015 | |
Remarks | If the input year is a two-digit value, it is assumed to be within the hundred year span with 2029 as the end of the span. For example, a year of 29 will be 2029, but a year of 30 will be 1930. |
Date (MDY) | ||
---|---|---|
Description | The Date (MDY) standardization definition standardizes dates that have format MDY. The output is a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a four-digit year. The month, day, and year are separated by spaces. | |
Input | Output | |
Examples | gennaio 10,12 | 01 10 2012 |
01/10/12 | 01 10 2012 | |
gen 10, 30 | 01 10 1930 | |
01.10.30 | 01 10 1930 | |
gennaio 10, 2015 | 10 10 2015 | |
01-10-2015 | 01 10 2015 | |
Remarks | If the input year is a two-digit value, it is assumed to be within the hundred year span with 2029 as the end of the span. For example, a year of 29 will be 2029, but a year of 30 will be 1930. |
Date (YMD) | ||
---|---|---|
Description | The Date (YMD) standardization definition standardizes dates that have format YMD. The output is a four-digit year, followed by a zero-padded two-digit month, followed by a zero-padded two-digit day. The year, month, and day are separated by spaces. | |
Input | Output | |
Examples | 12gen10 | 2012 01 10 |
12/01/10 | 2012 01 10 | |
30gen10 | 1930 01 10 | |
30.10.10 | 1930 10 10 | |
2015 gennaio 10 | 2015 01 10 | |
2015-01-10 | 2015 01 10 | |
Remarks | If the input year is a two-digit value, it is assumed to be within the hundred year span with 2029 as the end of the span. For example, a year of 29 will be 2029, but a year of 30 will be 1930. |
Name | ||
---|---|---|
Description | The Name standardization definition standardizes names of individuals. | |
Input | Output | |
Examples | Mario Rossi (Professore Associato) | Mario Rossi, Prof. Associato |
Sottotenente Federico Fabris | Federico Fabris, S. Ten. | |
Llorenç Maym'o | Llorenç Maym'o | |
Marta Agui' Berte' | Marta Aguì Bertè | |
Enzo D'Angelo | Enzo D'Angelo | |
Enzo D´Angelo | Enzo D'Angelo | |
Enzo D`Angelo | Enzo D'Angelo | |
Anna - Maria Rossi | Anna-Maria Rossi | |
Dottoressa Marta A. B. Zanotti | Dott.ssa Marta A B Zanotti | |
Remarks |
Organization | ||
---|---|---|
Description | The Organization standardization definition standardizes organization names. | |
Input | Output | |
Examples | I.B.A. INDUSTRIA BIELLESE ARREDAMENTO - S.R.L. | IBA Industria Biellese Arredamento SRL |
ARREDAMENTO - SRL | Arredamento SRL | |
//F.I.A.M.M. Spa FABBRICA ITALIANA ACCUMULATORI MOTOCARRI MONTECCHIO// | FIAMM SPA, Fabbrica Italiana Accumulatori Motocarri Montecchio | |
FIAT Auto - Torino SPA | Fiat Auto SPA, Torino | |
ASL - Azienda Sanitaria Locale N. 1 di Bolzano | ASL 1 di Bolzano | |
Remarks |
In addition to the definitions listed on this page, all Italian-language locales also inherit all Global definitions.
Documentation Feedback: yourturn@sas.com |
Doc ID: QKBCI_IT_defs.html |