DataFlux Data Management Studio 2.7: User Guide
The SAS Quality Knowledge Base (QKB) is a collection of files that store data and logic that define data management operations such as parsing, standardization, and matching. SAS software products refer to the QKB when performing data management operations, also referred to as data cleansing, on your data.
There are two QKB products, QKB for Contact Information and QKB for Product Data. QKB for Contact Information supports the management of commonly used contact information for individuals and organizations, such as names, addresses, company names, and phone numbers. QKB for Product Data supports the management of common attributes related to products and services, such as dimensions, color, materials, packaging terms, and part numbers.
A QKB supports locales organized by language and country, for example, English, United States; English, Canada; and French, Canada. Each QKB is licensed by a locale. You can license support for one or more locales for each QKB for your enterprise. To process data that originates in specific locales, license those locales for the QKB that handles that type of data.
Data and logic in a QKB are organized into a set of objects called definitions. In DataFlux Data Management Studio, these definitions are listed on the Quality Knowledge Base tab of the QKB interface, as shown in the next display.
Definitions on the Quality Knowledge Base Tab
Each definition defines a single, context-sensitive data management operation. For example, a definition in a QKB might contain data and logic used to parse phone numbers. Another definition might contain data and logic used to determine the gender of individuals by analyzing their names.
When you use SAS data management products to process your data, you specify which definitions the software should invoke. For example, if you are standardizing company names with the Standardization node in a data job, you can specify that the node should use the Organization definition when processing data in the Company field in your table.
A QKB contains definitions developed for use with common types of data such as names, addresses, and phone numbers. You can use definitions in a QKB as delivered, or you can use DataFlux Data Management Studio to customize the QKB by modifying definitions or creating new definitions for use with your own business data. Since a QKB is used by DataFlux Data Management Studio, SAS Data Quality Server, and the SAS Data Quality Accelerators, QKB customizations are automatically available to your entire enterprise.
To display the help for QKBs installed at your site, see Accessing Documentation for QKBs. There are periodic updates to a QKB with new definitions and enhancements to existing definitions. To download updates to a QKB, visit the QKB downloads page.
Documentation Feedback: yourturn@sas.com
|
Doc ID: DMCust_QKB.html |