Because the
SPD Engine organizes data for high-performance processing, an
SPD Engine data set is physically different from a default Base SAS engine data set. The default Base
SAS engine stores data in a single data file that contains both data and data descriptors
for the file (metadata). The SPD Engine creates separate files for the data and data
descriptors. In addition, if the data set is indexed, the SPD Engine creates two index
files for each index. Each of these four files is called an SPD Engine
component file,
and each has an identifier embedded in the filename.
The metadata component
is a single physical file, but it can occupy multiple physical files,
and each file has .mdf embedded in the filename. The data component
is one or more physical files, and each file has .dpf embedded in
the filename. If the index component exists because indexes have been
defined, each index has two physical files:
-
one file with .hbx embedded in
the filename
-
one file with .idx embedded in
the filename
Each of these component files can consist of one or more physical files so that the
component can span volumes, but be referenced as one logical file. For example, the
SPD Engine can create many physical files containing data, but it references the files
containing
data as a single data component in an SPD Engine data set.
The metadata and index components
differ from the data component in two ways:
-
You can specify a fixed-length
partition size for data component files using the PARTSIZE= option. You cannot specify the
partition size for the metadata or index components.
-
The data component files
are created in a cyclical fashion across all defined paths. The metadata
and index component files are created in a single defined path until
that path is full, and then the next defined path is used.