Glossary
-
This glossary defines SAS software terms that
are used in this document as well as terms that relate specifically
to SAS/ACCESS software.
- access descriptor
-
a SAS/ACCESS file that describes data that is
managed by SAS, by a database management system, or by a PC-based
software application such as Microsoft Excel, Lotus 1-2-3, or dBASE.
After creating an access descriptor, you can use it as the basis for
creating one or more view descriptors.
- bulk load
-
to load large amounts of data into a database
object, using methods that are specific to a particular DBMS. Bulk
loading enables you to rapidly and efficiently add multiple rows of
data to a table as a single unit.
- client
-
an application that requests either resources
or services from a server, possibly over a network.
- column
-
a vertical component of a table. Each column has
a unique name, contains data of a specific type, and has particular
attributes. A column is analogous to a variable in SAS terminology.
- column function
-
an operation that is performed for each value
in the column that is named as an argument of the function. For example,
AVG(SALARY) is a column function.
- commit
-
the process that ends a transaction and that makes
permanent any changes to the database that the user made during the
transaction.
- data set
-
a file whose contents are in one of the native
SAS file formats. There are two types of SAS data sets: SAS data files
and SAS data views.
- DATA step view
-
a type of SAS data set that consists of a stored
DATA step program. A DATA step view contains a definition of data
that is stored elsewhere; the view does not contain the physical data.
The view's input data can come from one or more sources, including
external files and other SAS data sets. Because a DATA step view only
reads (opens for input) other files, you cannot update the view's
underlying data.
- data type
-
an attribute of every column in a table or database.
The data type tells the operating system how much physical storage
to set aside for the column, and specifies what type of data the column
will contain. It is similar to the type attribute of SAS variables.
- data value
-
a unit of character, numeric, or alphanumeric
information that is stored as a single item in a data record.
- database
-
an organized collection of related data. A database
usually contains named files, named objects, or other named entities
such as tables, views, and indexes.
- database management system
-
a software application that enables you to create
and manipulate data that is stored in the form of databases. Short
form: DBMS.
- DB2 catalog
-
a set of tables that DB2 maintains for its own
use. These tables contain information about the other objects your
DB2 system manages.
- DBMS
-
a software application that enables you to create
and manipulate data that is stored in the form of databases. Short
form: DBMS.
- engine
-
a component of SAS software that reads from or
writes to a file. Various engines enable SAS to access different types
of file formats.
- explicit pass-through
-
a form of the SQL pass-through facility that passes
the user-written, DBMS-specific SQL query code directly to the particular
DBMS for processing.
- foreign key
-
a SAS/ACCESS file that describes data that is
managed by SAS, by a database management system, or by a PC-based
software application such as Microsoft Excel, Lotus 1-2-3, or dBASE.
After creating an access descriptor, you can use it as the basis for
creating one or more view descriptors.
- format
-
a type of SAS language element that is used to
write or display data values according to the data's type: numeric,
character, date, time, or timestamp. Short form: format.
- function
-
a type of SAS language element that is used to
process one or more arguments and then to return a result that can
be used in either an assignment statement or an expression.
- Hadoop Distributed File System
-
a framework for managing files as blocks of equal
size, which are replicated across the machines in a Hadoop cluster
to provide fault tolerance.
- HDFS
-
See Hadoop Distributed File System.
- implicit pass-through
-
a form of the SQL pass-through facility that translates
SAS SQL query code to the DBMS-specific SQL code, enabling the translated
code to be passed to the particular DBMS for processing.
- index
-
a component of a SAS data set that enables SAS
to access observations in the SAS data set quickly and efficiently.
The purpose of SAS indexes is to optimize WHERE-clause processing
and to facilitate BY-group processing.
- informat
-
a type of SAS language element that is used to
read data values according to the data's type: numeric, character,
date, time, or timestamp. Short form: informat.
- interactive line mode
-
a method of running SAS programs in which you
enter one line of a SAS program at a time at the SAS session prompt.
SAS processes each line immediately after you press the ENTER or RETURN
key. Procedure output and informative messages are returned directly
to your display device.
- interface view engine
-
a type of SAS engine that SAS/ACCESS software
uses to retrieve data from files that have been formatted by another
vendor's software. Each SAS/ACCESS interface has its own interface
view engine, which reads the interface product data and returns the
data in a form that SAS can understand (that is, in a SAS data set).
- key column
-
in Oracle, a column in a table that is used to
create an index on the table. The column contains a unique data value
for each row in a table.
- libref
-
a SAS name that is associated with the location
of a SAS library. For example, in the name MYLIB.MYFILE, MYLIB is
the libref, and MYFILE is a file in the SAS library.
- line mode
-
a method of running SAS programs in which you
enter one line of a SAS program at a time at the SAS session prompt.
SAS processes each line immediately after you press the ENTER or RETURN
key. Procedure output and informative messages are returned directly
to your display device.
- member
-
a SAS file in a SAS library.
- member name
-
a name that is assigned to a SAS file in a SAS
library.
- member type
-
a SAS name that identifies the type of information
that is stored in a SAS file. Member types include ACCESS, AUDIT,
DMBD, DATA, CATALOG, FDB, INDEX, ITEMSTOR, MDDB, PROGRAM, UTILITY,
and VIEW.
- missing value
-
a type of value for a variable that contains no
data for a particular row or column. By default, SAS writes a missing
numeric value as a single period and a missing character value as
a blank space.
- null value
-
a special value that indicates the absence of
information. Null values are analogous to SAS missing values.
- object
-
any entity that can be manipulated by the commands
of a programming language. Examples are values, variables, functions,
and data structures.
- observation
-
a row in a SAS data set. All of the data values
in an observation are associated with a single entity such as a customer
or a state. Each observation contains either one data value or a missing-value
indicator for each variable.
- pass-through facility
-
the technology that enables SQL query code to
be passed to a particular DBMS for processing. Short form: pass-through
facility.
- predicate
-
a component of a SAS WHERE statement or a component
of an SQL WHERE or HAVING clause that is used in view descriptor selection
criteria.
- PROC SQL view
-
a SAS data set that is created by the SQL procedure.
A PROC SQL view contains no data. Instead, it stores information that
enables it to read data values from other files, which can include
SAS data files, SAS/ACCESS views, DATA step views, or other PROC SQL
views. The output of a PROC SQL view can be either a subset or a superset
of one or more files.
- query
-
a set of instructions that requests particular
information from one or more data sources.
- RDBMS
-
a database management system that organizes and
accesses data according to relationships between data items. The main
characteristic of a relational database management system is the two-dimensional
table. Examples of relational database management systems are DB2,
Oracle, Sybase, and Microsoft SQL Server.
- referential integrity
-
a set of rules that a DBMS uses to ensure that
a change to a data value in one table also results in a change to
any related values in other tables or in the same table. Referential
integrity is also used to ensure that related data is not deleted
or changed accidentally.
- relational database management system
-
a database management system that organizes and
accesses data according to relationships between data items. The main
characteristic of a relational database management system is the two-dimensional
table. Examples of relational database management systems are DB2,
Oracle, Sybase, and Microsoft SQL Server.
- rollback
-
a data recovery process that restores a database
after a hardware or software failure, or that returns it to a state
before changes were made.
- row
-
in relational database management systems, the
horizontal component of a table. A row is analogous to a SAS observation.
- SAS/ACCESS view
-
a type of file that retrieves data values from
files that are stored in other software vendors' file formats. You
use the ACCESS procedure of SAS/ACCESS software to create SAS/ACCESS
views.
- SAS data file
-
a type of SAS data set that contains data values
as well as descriptor information that is associated with the data.
The descriptor information includes information such as the data types
and lengths of the variables, as well as the name of the engine that
was used to create the data.
- SAS data set
-
a file whose contents are in one of the native
SAS file formats. There are two types of SAS data sets: SAS data files
and SAS data views.
- SAS data view
-
a type of SAS data set that retrieves data values
from other files. A SAS data view contains only descriptor information
such as the data types and lengths of the variables (columns) plus
other information that is required for retrieving data values from
other SAS data sets or from files that are stored in other software
vendors' file formats. Short form: data view.
- SAS file
-
a type of SAS data set that retrieves data values
from other files. A SAS data view contains only descriptor information
such as the data types and lengths of the variables (columns) plus
other information that is required for retrieving data values from
other SAS data sets or from files that are stored in other software
vendors' file formats. Short form: data view.
- SAS format
-
a type of SAS language element that is used to
write or display data values according to the data's type: numeric,
character, date, time, or timestamp. Short form: format.
- SAS function
-
a type of SAS language element that is used to
process one or more arguments and then to return a result that can
be used in either an assignment statement or an expression.
- SAS informat
-
a type of SAS language element that is used to
read data values according to the data's type: numeric, character,
date, time, or timestamp. Short form: informat.
- SAS library
-
one or more files that are defined, recognized,
and accessible by SAS, and that are referenced and stored as a unit.
Each file is a member of the library.
- SAS metadata
-
metadata that is created by SAS software. Metadata
that is in SAS Open Metadata Architecture format is one example.
- SAS variable
-
a column in a SAS data set or in a SAS data view.
The data values for each variable describe a single characteristic
for all observations (rows).
- serde
-
an interface that enables serialization or deserialization
of one or more file formats. [plural: serdes]
- server
-
software that provides either resources or services
to requesting clients, possibly over a network.
- SQL
-
a standardized, high-level query language that
is used in relational database management systems to create and manipulate
objects in a database management system. SAS implements SQL through
the SQL procedure. Short form: SQL.
- SQL pass-through facility
-
the technology that enables SQL query code to
be passed to a particular DBMS for processing. Short form: pass-through
facility.
- SSID
-
the unique name that identifies each DB2 subsystem
on your machine. Short form: SSID.
- Structured Query Language (SQL)
-
a standardized, high-level query language that
is used in relational database management systems to create and manipulate
objects in a database management system. SAS implements SQL through
the SQL procedure. Short form: SQL.
- subsystem ID
-
the unique name that identifies each DB2 subsystem
on your machine. Short form: SSID.
- table space
-
a named DB2 object that identifies sets of pages
where records for one or more tables are stored. (A page is a 4K-byte
or 32K-byte unit of storage within a table space.)
- temporal data
-
event data that occurs at a particular date and
time, such as an account inquiry. Temporal data is often referred
to as time-sensitive data.
- trigger
-
a type of user-defined stored procedure that is
executed whenever a user issues a data-modification command such as
INSERT, DELETE, or UPDATE for a specified table or column. Triggers
can be used to implement referential integrity or to maintain business
constraints.
- term
-
an attribute of every column in a table or database.
The data type tells the operating system how much physical storage
to set aside for the column, and specifies what type of data the column
will contain. It is similar to the type attribute of SAS variables.
- variable
-
a column in a SAS data set or in a SAS data view.
The data values for each variable describe a single characteristic
for all observations (rows).
- view
-
a definition of a virtual data set that is named
and stored for later use. A view contains no data; it merely describes
or defines data that is stored elsewhere.
- view descriptor
-
a SAS/ACCESS file that defines part or all of
the DBMS data that is described by an access descriptor.
Copyright © SAS Institute Inc. All rights reserved.