The data quality directives
in SAS Data Loader for Hadoop are supported by SAS
Data Quality Accelerator and the SAS Quality Knowledge Base (QKB).
SAS Data Quality Accelerator is a required component for SAS Data Loader for Hadoop
and is included in SAS In-Database Technologies for Hadoop. The QKB, either
the SAS QKB for Contact Information or the SAS QKB for Product Data,
is a collection of files that store data and logic to support data
management operations. A QKB is specific to a locale, that is, to
a country and language. SAS Data Loader for Hadoop
data quality directives reference the QKB when performing data quality
operations on your data. It is recommended that you periodically update
the QKB. For more information, see
Updating and Customizing the QKB.
Both the SAS Data Quality
Accelerator and the SAS Quality Knowledge Base must be deployed in
the Hadoop cluster.