SAS Quality Knowledge Base for Contact Information 25
In the Quality Knowledge Base, the English definitions are shared by all English-language locales. Shared English definitions are described below.
Case Definitions
Gender Analysis Definitions
Identification Analysis Definitions
Match Definitions
Parse Definitions
Pattern Analysis Definitions
Standardization Definitions
Inherited Definitions
Proper (Address) | ||
---|---|---|
Description | The Case definition for Proper (Address) propercases addresses. | |
Input | Output | |
Examples | 11TH FLOOR | 11th Floor |
po box 125 | PO Box 125 | |
Remarks |
Proper (Business Title) | ||
---|---|---|
Description | The Case definition for Proper (Business Title) propercases business titles. | |
Input | Output | |
Examples | ceo | CEO |
SALES MANAGER | Sales Manager | |
Remarks |
Proper (Given Name) | ||
---|---|---|
Description | The Case definition for Proper (Given Name) propercases given names of individuals. | |
Examples | Input | Output |
aj | AJ | |
mckensie | McKensie | |
anne-marie | Anne-Marie | |
Remarks |
Proper (Name) | ||
---|---|---|
Description | The Case definition for Proper (Name) propercases names of individuals. | |
Examples | Input | Output |
GWENDOL GUTTERMOTH | Gwendol Guttermoth | |
JOHN O'ROURKE | John O'Rourke | |
ronald chung-hui mcdonald | Ronald Chung-Hui McDonald | |
Remarks |
Proper (Organization) | ||
---|---|---|
Description | The Case definition for Proper (Organization) propercases organization names. | |
Examples | Input | Output |
DATAFLUX CORPORATION | DataFlux Corporation | |
sas institute | SAS Institute | |
Remarks |
Proper (Site) | ||
---|---|---|
Description | The Case definition for Proper (Site) propercases organization sites. | |
Examples | Input | Output |
of new zealand | of New Zealand | |
uk | UK | |
Remarks |
Upper (Legal Form) | ||
---|---|---|
Description | The Case definition for Upper (Legal Form) uppercases legal forms. | |
Examples | Input | Output |
inc | Inc | |
LTD | Ltd | |
sp zoo | SP ZOO | |
Remarks | Legal forms default to uppercasing. Some known forms are propercased. |
Name | ||
---|---|---|
Description | The Gender Analysis definition for Name determines the gender of a name. | |
Possible Outputs | M F U |
|
Examples | Input | Output |
Sandi Baker | F | |
Pat Young, Jr. | M | |
T. Millhouse | U | |
Remarks |
Contact Info | ||
---|---|---|
Description | The Identification Analysis definition for Contact Info identifies the contact information that is represented by a string. | |
Possible Outputs | NAME ORGANIZATION NAME/ORGANIZATION UNKNOWN BLANK |
|
Examples | Input | Output |
DataFlux Corporation | ORGANIZATION | |
Tony Fisher | NAME | |
Joe Smith DBA Some Company Name | NAME/ORGANIZATION | |
john.smith@sas.com | ||
BLANK | ||
Fisher | UNKNOWN | |
Remarks |
Field Name | ||
---|---|---|
Description |
The Field Name identification analysis definition identifies database column names. |
|
Possible Outputs | NAME ORGANIZATION ADDRESS CITY STATE/PROVINCE POSTALCODE COUNTRY PHONE DATE UNKNOWN URL GENDER MATCHCODE PERSONAL_ID ORGANIZATION_ID GENERIC_ID COUNTY MARITAL_STATUS |
|
Examples | Input | Output |
Company Name | ORGANIZATION | |
Address | ADDRESS | |
Telephone | PHONE | |
Remarks |
This definition is recommended to determine the type of data stored in a database column based on the name of the column. |
|
The Field Name (v23) identification analysis definition is now deprecated and will be removed in a future release of the QKB. The Field Name identification analysis definition has been replaced with a copy of the Field Name (v23) definition which takes advantage of updated processing. If you changed your jobs to use the Field Name (v23) definition it is suggested that you change them back. |
Field Name (v23) | ||
---|---|---|
Description |
The Field Name (v23) identification analysis definition identifies database column names. |
|
Possible Outputs | NAME ORGANIZATION ADDRESS CITY STATE/PROVINCE POSTALCODE COUNTRY PHONE DATE UNKNOWN URL GENDER MATCHCODE PERSONAL_ID ORGANIZATION_ID GENERIC_ID COUNTY MARITAL_STATUS |
|
Examples | Input | Output |
Company Name | ORGANIZATION | |
Address | ADDRESS | |
Telephone | PHONE | |
Remarks |
This definition is recommended to determine the type of data stored in a database column based on the name of the column. |
|
The Field Name (v23) identification analysis definition is now deprecated and will be removed in a future release of the QKB. The Field Name identification analysis definition has been replaced with a copy of the Field Name (v23) definition which takes advantage of updated processing. If you changed your jobs to use the Field Name (v23) definition it is suggested that you change them back. |
Offensive | ||
---|---|---|
Description | The Identification Analysis definition for Offensive identifies potentially vulgar, offensive or invalid words and phrases. | |
Possible Outputs | OFF_ENUSA_1 OFF_ENUSA_2 OFF_ENGBR_1 OFF_ENGBR_2 OFF_FRFRA_1 OFF_FRFRA_2 INVALID, PASS |
|
Examples | Input | Output |
ababab | INVALID | |
none of your business | INVALID | |
Mr. Dork | OFF_ENUSA_1 | |
Remarks | Values in some of the libraries associated with this definition would be considered offensive by most. DataFlux in no way condones or approves of the use of this information for any reason other than that for which it was designed. |
Business Title | ||
---|---|---|
Description | The Business Title match definition generates match codes which can be used to cluster records containing business title information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
CEO | 0 | |
Chief Executive Officer | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Country | ||
---|---|---|
Description | The Country match definition generates match codes which can be used to cluster records containing country names. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
United States | 0 | |
Great Britain | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Date (DMY) | ||
---|---|---|
Description | The Date (DMY) match definition generates match codes which can be used to cluster records containing dates that have the format DMY. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
05March1969 | 1 | |
5-3-1969 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Date (DMY) (with Combinations) | ||
---|---|---|
Description | The Date (DMY) (with Combinations) match definition generates match codes which can be used to cluster records containing dates, with a score for each match code, that have the format DMY. | |
Max Length of Match Code | 15 characters | |
Example 1 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 100 |
01/02/2013 | 0 |
Feb-01-2013 | 0 | |
Example 2 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 40 |
Feb-01-2013 | 1 |
01/02/2013 | 1 | |
02/01/2013 | 1 | |
Remarks |
A date with DMY format will match with the date with MDY format. This definition generates one or more match codes for each input string. The number of match codes generated for an input string depends on the content of the string. Each match code represents a combination of different parts of the input string; this enables two strings to be matched even when some parts of one or both of the strings differ. See the examples above for an illustration of clusters that can be produced using match codes generated by this definition. Note that a consequence of generating multiple match codes is that a record can be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition. Generation of multiple match codes is achieved through the use of token-combination rules in the match definition. Each match code generated by the definition is associated with one token-combination rule. There is a weight assigned to each rule; each rule's weight is used to calculate a score that is assigned to the match code that is generated by that rule. The score for a match code is equal to the weight of the rule used to generate the match code times the sensitivity that is selected when the definition is executed. When a record is clustered, the score for the record’s match code represents the confidence with which we can assert that the record belongs in the cluster. Note that when different rules lead to identical clustering results, the scores of the match codes generated by the different rules can be aggregated using the Cluster Aggregation node in a Data Job. The Cluster Aggregation node allows several different methods for aggregating match code scores, such as minimum, maximum, or mean across instances of a record, or minimum, maximum, or mean across all records in a cluster. For information on the Cluster Aggregation node, refer to the documentation provided with the DataFlux Data Management Studio installation. |
Date (MDY) | ||
---|---|---|
Description | The Date (MDY) match definition generates match codes which can be used to cluster records containing dates that have the format MDY. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
8.30.1997 | 0 | |
August 30th, 1997 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Date (MDY) (with Combinations) | ||
---|---|---|
Description | The Date (MDY) (with Combinations) match definition generates match codes which can be used to cluster records containing dates, with a score for each match code, that have the format MDY. | |
Max Length of Match Code | 15 characters | |
Example 1 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 100 |
01/02/2013 | 0 |
Jan-02-2013 | 0 | |
Example 2 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 40 |
Jan-02-2013 | 1 |
02/01/2013 | 1 | |
Remarks |
A date with MDY format will match with the date with DMY format. This definition generates one or more match codes for each input string. The number of match codes generated for an input string depends on the content of the string. Each match code represents a combination of different parts of the input string; this enables two strings to be matched even when some parts of one or both of the strings differ. See the examples above for an illustration of clusters that may be produced using match codes generated by this definition. Note that a consequence of generating multiple match codes is that a record can be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition. Generation of multiple match codes is achieved through the use of token-combination rules in the match definition. Each match code generated by the definition is associated with one token-combination rule. There is a weight assigned to each rule; each rule's weight is used to calculate a score that is assigned to the match code that is generated by that rule. The score for a match code is equal to the weight of the rule used to generate the match code times the sensitivity that is selected when the definition is executed. When a record is clustered, the score for the record’s match code represents the confidence with which we can assert that the record belongs in the cluster. Note that when different rules lead to identical clustering results, the scores of the match codes generated by the different rules can be aggregated using the Cluster Aggregation node in a Data Job. The Cluster Aggregation node allows several different methods for aggregating match code scores, such as minimum, maximum, or mean across instances of a record, or minimum, maximum, or mean across all records in a cluster. For information on the Cluster Aggregation node, refer to the documentation provided with the DataFlux Data Management Studio installation. |
Date (YMD) | ||
---|---|---|
Description | The Date/Time (YMD) match definition generates match codes which can be used to cluster records containing date/time information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
2002dec31 | 1 | |
2002.12.31 | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Date/Time (DMY) | ||
---|---|---|
Description | The Date/Time (DMY) match definition generates match codes which can be used to cluster records containing date/time information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
20 Jul 2969 02:20:32 | 0 | |
20/7/69 02:20:32 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Date/Time (MDY) | ||
---|---|---|
Description | The Date/Time (MDY) match definition generates match codes which can be used to cluster records containing date/time information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
Nov-06-2001 2:32 PM | 1 | |
11/6/01 2:32 PM | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Date/Time (YMD) | ||
---|---|---|
Description | The Date/Time (YMD) match definition generates match codes which can be used to cluster records containing date/time information. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
1999.10.11 22:40:00+04:00 | 0 | |
99/10/11 22:40:00+04:00 | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Field Name | ||
---|---|---|
Description | The Field Name match definition generates match codes which can be used to cluster records containing database field names. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
Company Name | 0 | |
Organization | 0 | |
Home_Address | 1 | |
Address | 1 | |
Telephone | 2 | |
PHONE | 2 | |
First_Name | 3 | |
NAME | 3 | |
Remarks |
This definition should be used to find potential matches between database column names. Note: The results listed above reflect the default match sensitivity (85). |
Name | ||
---|---|---|
Description | The Name match definition generates match codes which can be used to cluster records containing names of individuals. | |
Max Length of Match Code | 27 characters | |
Examples | Input | Cluster ID |
Mary Shafer | 0 | |
Mrs. Marian V. Schaeffer | 0 | |
Shaffer, Mary | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Name (with Combinations) | ||
---|---|---|
Description | The Name (with Combinations) match definition generates match codes which can be used to cluster records containing names. | |
Max Length of Match Code | 27 characters | |
Example 1 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 100 |
Mary Shafer | 0 |
Shaffer, Mary | 0 | |
Example 2 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 20 |
D Harrison Wood | 1 |
Harrison Wood | 1 | |
Example 2 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 80 |
Martha Newell-Warner | 2 |
Martha Newell | 2 | |
Example 3 | Input | Cluster ID |
Sensitivities 50 - 100 Weight 50 |
John Elton | 3 |
Elton John | 3 | |
Remarks |
This definition generates one or more match codes for each input string. The number of match codes generated for an input string depends on the content of the string. Each match code represents a combination of different parts of the input string; this enables two strings to be matched even when some parts of one or both of the strings differ. See the examples above for an illustration of clusters that might be produced using match codes generated by this definition. Note that a consequence of generating multiple match codes is that a record might be placed in more than one cluster by a subsequent clustering operation. Therefore, special attention should be given to the entity resolution process when using this definition. Generation of multiple match codes is achieved through the use of token-combination rules in the match definition. Each match code generated by the definition is associated with one token-combination rule. There is a weight assigned to each rule; each rule's weight is used to calculate a score that is assigned to the match code that is generated by that rule. The score for a match code is equal to the weight of the rule used to generate the match code times the sensitivity that is selected when the definition is executed. When a record is clustered, the score for the record’s match code represents the confidence with which we can assert that the record belongs in the cluster. Note that when different rules lead to identical clustering results, the scores of the match codes generated by the different rules might be aggregated using the Cluster Aggregation node in a Data Job. The Cluster Aggregation node allows several different methods for aggregating match code scores, such as minimum, maximum, or mean across instances of a record, or minimum, maximum, or mean across all records in a cluster. For information on the Cluster Aggregation node, refer to the documentation provided with the DataFlux Data Management Studio installation. |
Organization | ||
---|---|---|
Description | The Organization match definition generates match codes which can be used to cluster records containing organization names. | |
Max Length of Match Code | 35 characters | |
Examples | Input | Cluster ID |
DataFlux Corporation | 0 | |
DataFlex LLC | 0 | |
SAS Institute | 0 | |
SAS Institute, Canada | 0 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Text | ||
---|---|---|
Description | The Text match definition generates match codes which can be used to cluster records containing general text strings. | |
Max Length of Match Code | 15 characters | |
Examples | Input | Cluster ID |
they went | 0 | |
you are | 1 | |
you're | 1 | |
Remarks |
Note: The results listed above reflect the default match sensitivity (85). |
Business Title | |||
---|---|---|---|
Description | The Parse definition for Business Title parses business title information into a set of tokens. | ||
Output Tokens | Position Level Location Department Grade |
||
Example | Input | Output | |
Marketing Director | Position | Director | |
Level | |||
Location | |||
Department | Marketing | ||
Grade | |||
Remarks |
Date (DMY) | |||
---|---|---|---|
Description | The Parse definition for Date (DMY) parses dates with format DMY into a set of tokens. | ||
Output Tokens | Day Month Year |
||
Example | Input | Output | |
05March1969 | Day | 05 | |
Month | March | ||
Year | 1969 | ||
Remarks |
Date (MDY) | |||
---|---|---|---|
Description | The Parse definition for Date (MDY) parses dates with format MDY into a set of tokens. | ||
Output Tokens | Day Month Year |
||
Example | Input | Output | |
03-05-1969 | Day | 05 | |
Month | 03 | ||
Year | 1969 | ||
Remarks |
Date (YMD) | |||
---|---|---|---|
Description | The Parse definition for Date (YMD) parses dates with format YMD into a set of tokens. | ||
Output Tokens | Day Month Year |
||
Example | Input | Output | |
1969.3.5 | Day | 5 | |
Month | 3 | ||
Year | 1969 | ||
Remarks |
Date/Time (DMY) | |||
---|---|---|---|
Description | The Parse definition for Date/Time (DMY) parses dates with format DMY HH:MM:SS into a set of tokens. | ||
Output Tokens | Year Month Day Weekday Hours Minutes Seconds Time Extension |
||
Example | Input | Output | |
20 Jul 1969 02:20:32 | Year | 1969 | |
Month | Jul | ||
Day | 20 | ||
Weekday | |||
Hours | 02 | ||
Minutes | 20 | ||
Seconds | 32 | ||
Time Extension | |||
Remarks |
Date/Time (MDY) | |||
---|---|---|---|
Description | The Parse definition for Date/Time (MDY) parses dates with format MDY HH:MM:SS into a set of tokens. | ||
Output Tokens | Year Month Day Weekday Hours Minutes Seconds Time Extension |
||
Example | Input | Output | |
Nov-06-2001 2:32 PM | Year | 2001 | |
Month | Nov | ||
Day | 06 | ||
Weekday | |||
Hours | 2 | ||
Minutes | 32 | ||
Seconds | |||
Time Extension | PM | ||
Remarks |
Date/Time (YMD) | |||
---|---|---|---|
Description | The Parse definition for Date/Time (YMD) parses dates with format YMD HH:MM:SS into a set of tokens. | ||
Output Tokens | Year Month Day Weekday Hours Minutes Seconds Time Extension |
||
Example | Input | Output | |
1999.10.11 22:40:00+04:00 | Year | 1999 | |
Month | 10 | ||
Day | 11 | ||
Weekday | |||
Hours | 22 | ||
Minutes | 40 | ||
Seconds | 00 | ||
Time Extension | +04:00 | ||
Remarks |
Name | |||
---|---|---|---|
Description | The Parse definition for Name parses names of individuals into a set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Example 1 | Input | Output | |
Dr. James Goodnight, CEO | Prefix | Dr. | |
Given Name | James | ||
Middle Name | |||
Family Name | Goodnight | ||
Suffix | |||
Title/Additional Info | CEO | ||
Example 2 | Input | Output | |
Smith, Sr., Ronald G. | Prefix | ||
Given Name | Ronald | ||
Middle Name | G. | ||
Family Name | Smith | ||
Suffix | Sr. | ||
Title/Additional Info | |||
Remarks |
Name (Global) | |||
---|---|---|---|
Description | The Parse definition for Name (Global) parses names of individuals into a globally recognized set of tokens. | ||
Output Tokens | Prefix Given Name Middle Name Family Name Suffix Title/Additional Info |
||
Example 1 | Input | Output | |
Dr. James Goodnight, CEO | Prefix | Dr. | |
Given Name | James | ||
Middle Name | |||
Family Name | Goodnight | ||
Suffix | |||
Title/Additional Info | CEO | ||
Example 2 | Input | Output | |
Smith, Sr., Ronald G. | Prefix | ||
Given Name | Ronald | ||
Middle Name | G. | ||
Family Name | Smith | ||
Suffix | Sr. | ||
Title/Additional Info | |||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Name (Multiple Name) | |||
---|---|---|---|
Description | The Parse definition for Name (Multiple Name) parses strings that contain the names of two individuals into a set of tokens. | ||
Output Tokens | Name 1 Name 2 |
||
Example 1 | Input | Output | |
Mr Jon and Mrs Sally Smith | Name 1 | Mr Jon Smith | |
Name 2 | Mrs Sally Smith | ||
Example 2 | Input | Output | |
Jon Smith & Pat Young Jr. | Name 1 | Jon Smith | |
Name 2 | Pat Young Jr. | ||
Remarks | If only one name is present in the input, the first token is used. This definition replaces Name (Two Name). |
Organization | |||
---|---|---|---|
Description | The Parse definition for Organization parses organization names into a set of tokens. | ||
Output Tokens | Name
Legal Form Site Additional Info |
||
Example 1 | Input | Output | |
3Com Europe Limited | Name | 3Com Europe | |
Legal Form | Limited | ||
Site | |||
Additional Info | |||
Example 2 | Input | Output | |
Corporate Psychology International (CPI) | Name | Corporate Psychology International | |
Legal Form | |||
Site | |||
Additional Info | (CPI) | ||
Remarks |
Organization (Global) | |||
---|---|---|---|
Description | The Parse definition for Organization (Global) parses organization names into a globally recognized set of tokens. | ||
Output Tokens | Name
Legal Form Site Additional Info |
||
Example 1 | Input | Output | |
3Com Europe Limited | Name | 3Com Europe | |
Legal Form | Limited | ||
Site | |||
Additional Info | |||
Example 2 | Input | Output | |
Corporate Psychology International (CPI) | Name | Corporate Psychology International | |
Legal Form | |||
Site | |||
Additional Info | (CPI) | ||
Remarks | Parse definitions named with the Global keyword use a set of output tokens that is consistent across every locale. Results obtained from these definitions can be stored in the same database fields as the results obtained from definitions of the same name in other locales. |
Organization (Multiple) | |||
---|---|---|---|
Description | The Parse definition for Organization (Multiple) parses strings that contain the names of one or two organizations into a set of tokens. | ||
Output Tokens | Organization 1 Organization 2 |
||
Examples | Input | Output | |
THE GINGER CAT T/A EMBROIDERY PLACE LLC | Organization 1 | THE GINGER CAT | |
Organization 2 | EMBROIDERY PLACE LLC | ||
Remarks |
None.
Business Title | ||
---|---|---|
Description | The Standardization definition for Business Title standardizes business titles using a parse definition as an initial step. | |
Examples | Input | Output |
cfo | CFO | |
Chief Financial Officer | CFO | |
sales director | Director, Sales | |
Remarks |
Business Title (No Parse) | ||
---|---|---|
Description | The Standardization definition for Business Title (No Parse) standardizes business titles without using a parse definition as an initial step. | |
Examples | Input | Output |
bus mngr | Business Manager | |
BUSINESS MGR | Business Manager | |
sales director | Sales Director | |
Remarks |
Country | ||
---|---|---|
Description | The Standardization definition for Country standardizes country names to a common short format. | |
Examples | Input | Output |
UK | UNITED KINGDOM | |
gbr | UNITED KINGDOM | |
Remarks |
Country (FIPS) | ||
---|---|---|
Description | The Standardization definition for Country (FIPS) standardizes country names to the US FIPS code format. | |
Examples | Input | Output |
Czech Republic | CZ | |
gbr | UK | |
Remarks |
Country (Internet) | ||
---|---|---|
Description | The Standardization definition for Country (Internet) standardizes country names to the top-level domain designation. | |
Examples | Input | Output |
Czech Republic | CZ | |
gbr | UK | |
Remarks |
Country (ISO Number) | ||
---|---|---|
Description | The Standardization definition for Country (ISO Number) standardizes country names to the ISO-3166 numeric designation. | |
Examples | Input | Output |
Czech Republic | 203 | |
gbr | 826 | |
Remarks |
Country (ISO 2 Char) | ||
---|---|---|
Description | The Standardization definition for Country (ISO 2 Char) standardizes country names to the ISO-3166 two-character designation. | |
Examples | Input | Output |
Czech Republic | CZ | |
gbr | GB | |
Remarks |
Country (ISO 3 Char) | ||
---|---|---|
Description | The Standardization definition for Country (ISO 3 Char) standardizes country names to the ISO-3166 three-character designation. | |
Examples | Input | Output |
Czech Republic | CZE | |
gbr | GBR | |
Remarks |
Country (Region) | ||
---|---|---|
Description | The Standardization definition for Country (Region) standardizes country names to the United Nations region designation. | |
Examples | Input | Output |
Czech Republic | ASIA | |
gbr | EUROPE | |
Remarks |
Country (Sub-Region) | ||
---|---|---|
Description | The Standardization definition for Country (Sub-Region) standardizes country names to the United Nations sub-region designation. | |
Examples | Input | Output |
Czech Republic | WESTERN ASIA | |
gbr | NORTHERN EUROPE | |
Remarks |
Date (DMY) | ||
---|---|---|
Description | The Standardization definition for Date (DMY) standardizes dates that have format DMY. The output is a zero-padded two-digit day, followed by a zero-padded two-digit month, followed by a four-digit year. The day, month, and year are separated by spaces. | |
Examples | Input | Output |
04/07/02 | 04 07 2002 | |
04July05 | 04 07 1905 | |
04.07.05 | 04 07 1905 | |
04July2005 | 04 07 2005 | |
04-07-2005 | 04 07 2005 | |
Remarks | If the input year is a two-digit value, it is assumed to be within the hundred year span with 2019 as the end of the span. For example, a year of 19 will be 2019, but a year of 20 will be 1920. |
Date (MDY) | ||
---|---|---|
Description | The Standardization definition for Date (MDY) standardizes dates that have format MDY. The output is a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a four-digit year. The month, day, and year are separated by spaces. | |
Examples | Input | Output |
July04, 02 | 07 04 2002 | |
07/04/02 | 07 04 2002 | |
July04, 05 | 07 04 1905 | |
07.04.05 | 07 04 1905 | |
July 4, 2005 | 07 04 2005 | |
07-04-2005 | 07 04 2005 | |
Remarks | If the input year is a two-digit value, it is assumed to be within the hundred year span with 2019 as the end of the span. For example, a year of 19 will be 2019, but a year of 20 will be 1920. |
Date (YMD) | ||
---|---|---|
Description | The Standardization definition for Date (YMD) standardizes dates that have format YMD. The output is a four-digit year, followed by a zero-padded two-digit month, followed by a zero-padded two-digit day. The year, month, and day are separated by spaces. | |
Examples | Input | Output |
02July04 | 2002 07 04 | |
02/07/04 | 2002 07 04 | |
05July04 | 1905 07 04 | |
05.07.04 | 1905 07 04 | |
2005July04 | 2005 07 04 | |
2005-07-04 | 2005 07 04 | |
Remarks | If the input year is a two-digit value, it is assumed to be within the hundred year span with 2019 as the end of the span. For example, a year of 19 will be 2019, but a year of 20 will be 1920. |
Date/Time (DMY) Basic | ||
---|---|---|
Description | The Standardization definition for Date/Time (DMY) Basic standardizes dates that have format DMY HH:MM:SS. The output is a 4-digit year followed by a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a zero-padded two-digit hour, followed by zero-padded two-digit minutes, followed by zero-padded two-digit seconds. The day, month, and year are separated by spaces and the time elements are separated by spaces. The date and the time are separated by a "T". | |
Examples | Input | Output |
04/07/02 | 2002 07 04 | |
15 Oct 1970 | 1970 10 15 | |
11 May 2000 10:12:45 | 2000 05 11T10 12 45 | |
Remarks | The output of this definition is the same as Date/Time (MDY) Basic and Date/Time (YMD) Basic. |
Date/Time (DMY) Extended | ||
---|---|---|
Description | The Standardization definition for Date/Time (DMY) Extended standardizes dates that have format DMY HH:MM:SS. The output is a 4-digit year followed by a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a zero-padded two-digit hour, followed by zero-padded two-digit minutes, followed by zero-padded two-digit seconds. The day, month, and year are separated by dashes and the time elements are separated by colons. The date and the time are separated by a "T". | |
Examples | Input | Output |
04/07/02 | 2002-07-04 | |
15 Oct 1970 | 1970-10-15 | |
11 May 2000 10:12:45 | 2000-05-11T10:12:45 | |
Remarks | The output of this definition is the same as Date/Time (MDY) Extended and Date/Time (YMD) Extended. |
Date/Time (MDY) Basic | ||
---|---|---|
Description | The Standardization definition for Date/Time (MDY) Basic standardizes dates that have format MDY HH:MM:SS. The output is a 4-digit year followed by a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a zero-padded two-digit hour, followed by zero-padded two-digit minutes, followed by zero-padded two-digit seconds. The day, month, and year are separated by spaces and the time elements are separated by spaces. The date and the time are separated by a "T". | |
Examples | Input | Output |
04/07/02 | 2002 04 07 | |
Oct 15 1970 | 1970 10 15 | |
May 11, 2000 10:12:45 | 2000 05 11T10 12 45 | |
Remarks | The output of this definition is the same as Date/Time (DMY) Basic and Date/Time (YMD) Basic. |
Date/Time (MDY) Extended | ||
---|---|---|
Description | The Standardization definition for Date/Time (MDY) Extended standardizes dates that have format MDY HH:MM:SS. The output is a 4-digit year followed by a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a zero-padded two-digit hour, followed by zero-padded two-digit minutes, followed by zero-padded two-digit seconds. The day, month, and year are separated by dashes and the time elements are separated by colons. The date and the time are separated by a "T". | |
Examples | Input | Output |
04/07/02 | 2002-04-07 | |
Oct 15 1970 | 1970-10-15 | |
May 11, 2000 10:12:45 | 2000-05-11T10:12:45 | |
Remarks | The output of this definition is the same as Date/Time (DMY) Extended and Date/Time (YMD) Extended. |
Date/Time (YMD) Basic | ||
---|---|---|
Description | The Standardization definition for Date/Time (YMD) Basic standardizes dates that have format YMD HH:MM:SS. The output is a 4-digit year followed by a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a zero-padded two-digit hour, followed by zero-padded two-digit minutes, followed by zero-padded two-digit seconds. The day, month, and year are separated by spaces and the time elements are separated by spaces. The date and the time are separated by a "T". | |
Examples | Input | Output |
04/07/02 | 2004 07 02 | |
1970 Oct 15 | 1970 10 15 | |
2000 May 11 10:12:45 | 2000 05 11T10 12 45 | |
Remarks | The output of this definition is the same as Date/Time (DMY) Basic and Date/Time (MDY) Basic. |
Date/Time (YMD) Extended | ||
---|---|---|
Description | The Standardization definition for Date/Time (YMD) Extended standardizes dates that have format YMD HH:MM:SS. The output is a 4-digit year followed by a zero-padded two-digit month, followed by a zero-padded two-digit day, followed by a zero-padded two-digit hour, followed by zero-padded two-digit minutes, followed by zero-padded two-digit seconds. The day, month, and year are separated by dashes and the time elements are separated by colons. The date and the time are separated by a "T". | |
Examples | Input | Output |
04/07/02 | 2004-07-02 | |
1970 Oct 15 | 1970-10-15 | |
2000 May 11 10:12:45 | 2000-05-11T10:12:45 | |
Remarks | The output of this definition is the same as Date/Time (DMY) Extended and Date/Time (MDY) Extended. |
Name | ||
---|---|---|
Description | The Standardization definition for Name standardizes names of individuals. | |
Examples | Input | Output |
john donovan | John Donovan | |
mister morrison junior | Mr Morrison, Jr | |
Remarks |
Organization | ||
---|---|---|
Description | The Standardization definition for Organization standardizes organization names. | |
Examples | Input | Output |
Robe River Iron Associates | Robe River Iron Associates | |
Moffat Limited Australia | Moffat Ltd, Australia | |
A.T.W.Products Sp. z o.o. | ATW Products SP ZOO | |
Remarks |
In addition to the definitions listed on this page, all English-language locales also inherit all Global definitions.
Documentation Feedback: yourturn@sas.com
|
Doc ID: QKBCI_EN_defs.html |