This chapter describes the features of base SAS procedures that have
been implemented or enhanced since Release 6.12. If you use SAS
software in an OS/390, CMS, or OpenVMS VAX environment, then also see Additional Version 7 Changes and Enhancements for OS/390, CMS, and OpenVMS VAX.
For detailed information about any of these
new or enhanced features,
see the SAS Procedures Guide. For information about changes and
enhancements to base SAS procedures that are relevant only to a particular
operating environment, see the SAS documentation for that operating environment.
|
| Changes That Affect Multiple Procedures |
Output Delivery System
The Output Delivery System (ODS) is a new feature that enhances your
ability to manage procedure output. Procedures that fully support ODS
- store links to each piece of output in the Results folder in
the SAS Explorer
- can generate HTML output
- can generate output for a high-resolution printer
- can generate output data sets
from procedure output
- provide a way for you to customize the procedure output by creating
templates that you can use whenever you run the procedure.
Integrity Constraints
Integrity constraints are a set of rules that modifications to data
sets (tables) must follow in order to guarantee the validity of the data.
You can create integrity constraints with the DATASETS and SQL procedures.
Integrity constraints have implications for other procedures, such as PROC
SORT and PROC CPORT. (See the information on individual procedures in the SAS Procedures Guide.)
Generation Data Sets
Generation data sets enable you to keep historical versions of SAS data
sets, SAS data views, SAS catalogs, and SAS/ACCESS files. In addition, a historical
record of changes is provided so that you can audit the changes that have
been made to these files.
PROC DATASETS manages the generation group - the set of generation
data sets. Other procedures can use the GENNUM= data set option to read a
specific version of the data.
Changes to the Behavior of the WEIGHT Statement
Prior to Version 7 of the SAS System, all base procedures except PROC
REPORT used a value of 0 for missing weights. PROC REPORT and most SAS/STAT
procedures, such as PROC GLM, excluded observations with missing weight from
the analysis. Now all procedures exclude observations with missing weights
from the analysis.
PROC REPORT and most SAS/STAT procedures have always excluded not only
observations with missing weights but also observations with negative and
zero weights from the analysis. Now, base procedures that do not, by default,
exclude observations with negative and 0 weights support the EXCLNPWGT option
in the PROC statement. EXCLNPWGT excludes observations with negative and 0
weights.
-
The DBCSTAB procedure produces conversion tables for the double-byte character
sets that SAS supports.
- The FSLIST procedure is now a part of base SAS software. You no
longer need a SAS/FSP license to run the procedure.
- The EXPORT procedure reads data from a SAS data set and writes
it to an external data source.
- The IMPORT procedure reads data from an external data source
and writes it to a SAS data set.
- The REGISTRY procedure maintains the SAS Registry.
|
| Changes to Existing Procedures |
PROC CATALOG
- The PROC CATALOG statement supports this new option:
- FORCE
- forces statements to execute when the catalog has been opened
by a process other than the current one.
- The COPY statement supports the NEW option, which overwrites the
destination catalog (specified by OUT=) if it already exists.
PROC CIMPORT
- The CIMPORT procedure transports integrity constraints, passwords,
and generation data sets.
- The PROC CIMPORT statement supports these new options:
- MEMTYPE=
- specifies to move only data sets or only catalogs during
a library import.
- NEW
- creates a new catalog for the imported transport file, and
deletes any existing catalog with the same name.
- The SELECT statement and the EXCLUDE statement replace the SELECT=
option and the EXCLUDE= option.
PROC CORR
-
The WEIGHT statement now excludes observations with missing weight values
from the analysis.
-
The EXCLNPWGT option in the PROC CORR statement excludes observations with
nonpositive weights from the analysis.
PROC CPORT
- The procedure transports integrity constraints, passwords, and
generation data sets.
- The SELECT statement and the EXCLUDE statement replace the SELECT=
option and the EXCLUDE= option.
PROC DATASETS
The SAS System now supports the creation and maintenance of generations
of SAS data sets. A generation group consists of the base version of the
file and a set of historical versions. You can use PROC DATASETS to manage
generation data sets.
The SAS System now supports the creation of integrity constraints.
Integrity constraints are a set of rules that modifications to a data set
(table) must follow in order to guarantee the validity of the data. You can
use PROC DATASETS to create and delete integrity constraints.
- PROC DATASETS supports these new
statements:
-
AUDIT
- initiates and controls event logging to an audit file.
- IC CREATE
- creates an integrity constraint.
- IC DELETE
- deletes an integrity constraint.
- IC REACTIVATE
- reactivates a foreign key integrity constraint that has
been set to inactive.
- INDEX CENTILES
- updates centiles information for indexed variables.
- The APPEND statement supports this new
option:
- APPENDVER=V6
- uses the Version 6 behavior for appending observations to
the BASE= data set.
- The CHANGE statement supports this new option:
- GENNUM=
- restricts processing to a single generation file or to the
entire generation group.
- The CONTENTS statement supports these new options:
- CENTILES
- prints centiles information for indexed variables.
-
OUT2=
- names an output data set to contain information about indexes
and integrity constraints.
- The COPY statement supports this new option:
- CONSTRAINT=
- specifies whether to copy all integrity constraints when
copying a data set.
- The DELETE statement supports this new option:
- GENNUM=
- restricts processing to the specified generation files
- The INDEX CREATE statement supports this new
option:
- UPDATECENTILES=
- specifies when the centiles are updated.
- The IC DELETE statement supports this new
option:
-
_ALL_
- deletes all constraints.
- The INDEX DELETE statement supports this new
option:
-
_ALL_
- deletes all indexes, except for indexes that are owned by
an integrity constraint.
- The MODIFY statement supports these new options:
- GENMAX=
- sets the maximum number of generation files in a generation
group.
- GENNUM=
- restricts processing to the specified generation file.
- The REPAIR statement supports this new
option:
- GENNUM=
- restricts processing to the specified generation files.
PROC FORMAT
- The PICTURE statement supports these new options:
- DATATYPE=
- specifies that the picture value is a template for formatting
date, time, or datetime values.
- DECSEP=
- specifies the separator character for the fractional part
of a number. By default, this character is a period (.)
- DIG3SEP=
- specifies the character that separates each group of three
digits in a number. By default, this character is a comma (,).
- MULTILABEL
- indicates that multiple labels for the same range are allowed.
Secondary labels can be used by certain applications that are designed to
handle multilabel formats. For all other applications, the secondary labels
are ignored.
When MULTILABEL is specified, overlapping ranges are also permitted.
- The VALUE statement supports this new
option:
- MULTILABEL
- indicates that multiple labels for the same range are allowed.
Secondary labels can be used by certain applications that are designed to
handle multilabel formats. For all other applications, the secondary labels
are ignored.
When MULTILABEL is specified, overlapping ranges are also permitted.
PROC FREQ
PROC MEANS
PROC OPTIONS
The OPTION= option in the PROC OPTIONS statement supports two new suboptions,
DEFINE and VALUE, that provide additional information about the specified
option.
PROC PMENU
PROC PMENU supports submenus to enable multiple items to point to a
common submenu.
PROC PRINT
- The PROC PRINT statement supports this new option:
- OBS=
- specifies a column header for the column that identifies
each observation by number.
- The N= option in the PROC PRINT statement has been enhanced so
that you can specify explanatory text to print with the value of N.
PROC REPORT
PROC SQL
- Some PROC SQL views are now updateable. The view must be based
on a single DBMS table or SAS data file and must not contain a join, an ORDER
BY clause, or a subquery.
- Whenever possible, PROC SQL passes joins to the DBMS rather than
doing the joins itself. This enhances performance.
- You can now store DBMS connection information in a view with the
USING LIBNAME clause.
- A new option, DQUOTE=ANSI, enables you to use names that are not
normally permissible in PROC SQL.
- A PROC SQL query can now reference up to 32 views or tables. PROC
SQL can perform joins on up to 32 tables.
- PROC SQL can now create and update tables that contain integrity
constraints.
PROC STANDARD
- The PROC STANDARD statement supports this new option:
- EXCLNPWGT
- excludes observations with nonpositive weights from the
analysis.
- The WEIGHT statement now excludes observations with missing weight
values from the analysis.
PROC SUMMARY
- The PROC SUMMARY statement supports this new option:
- EXCLNPWGT
- excludes observations with nonpositive weights from the
analysis.
- The WEIGHT statement now excludes observations with missing weight
values from the analysis.
PROC TABULATE
PROC UNIVARIATE
-
PROC UNIVARIATE now supports high resolution graphical displays. You can generate
histograms and comparative histograms and optionally superimpose fitted probabilty
density curves for various distributions and kernel density estimates. You
can generate quantile-quantile plots (Q-Q plots) and probability plots to
compare a data distribution with various theoretical distributions. You also
have the ability to inset summary statistics in the graphical displays.
- The WEIGHT statement now excludes observations with missing weight
values from the analysis.
- The output from PROC UNIVARIATE has been reorganized and includes
some new tables:
| The Moments table displays only those statistics that are related to
sample moments. |
| A new table, Tests for Location, shows the Student's t
test, the sign test, and the signed rank test. |
| A new table, Basic Statistical Measures, provides basic measures of
location and variability. |
- The UNIVARIATE procedure supports these new statements:
-
CLASS
statement
- specifies one or two class variables for the analysis.
You can use the CLASS statement with a HISTOGRAM, PROBPLOT, or QQPLOT
statement to create one-way and two-way high resolution comparative plots.
When you use a single class variable, PROC UNIVARIATE displays an array
of componentplots (stacked or side-by-side), for each level of the class variable.
When you use two class variables, PROC UNIVARIATE displays a matrix of component
plots, one for each combination of levels of the class variables
-
HISTOGRAM
statement
- creates a high resolution graph of a histogram and optionally
includes parametric and nonparametric density curve estimates. You can use
the HISTOGRAM statement to specify the midpoints for histogram intervals,
display density curves for fitted theoretical distributions (beta, exponential,
gamma, lognormal, normal, and Weibull) on histograms, request goodness-of-fit
tests for fitted distributions, display kernel density estimates on histograms,
save histogram intervals and parameters of fitted distributions in output
data sets, and request graphical enhancements.
-
INSET
statement
- Places a box or table of summary statistics, called an inset, directly in the graphical display. The inset can display statistics
that PROC UNIVARIATE calculates or display values that you provide in a SAS
data set. The INSET statement does not produce the graphical display. You
must specify a HISTOGRAM, PROBPLOT, or QQPLOT statement. You can use options
in the INSET statement to specify the position of the inset, to specify
a header for the inset, and to specify graphical enhancements, such as background
colors, text colors, text height, text font, and drop shadows
-
PROBPLOT
statement
- creates a high-resolution graphics display of a probability
plot, which compares ordered variable values with the percentiles of a specified
theoretical distribution.
-
QQPLOT
statement
- Creates a graphical display of a quantile-quantile plot
(Q-Q plot), which compares ordered variable values with quantiles of a specified
theoretical distribution.
- The PROC UNIVARIATE statement statement supports these new options:
-
ANNOTATE=
- specifies an input data set that contains annotate variables
as described in SAS/GRAPH documentation. You can use this data set to add
features to your high resolution graphics. PROC UNIVARIATE adds the features
in this data set to every high resolution graph produced in the PROC step.
- EXCLNPWGT
- excludes observations with nonpositive weights from the
analysis.
-
GOUT=
- specifies the SAS catalog in which to save the high resolution
graphics output that the UNIVARIATE procedure produces.
PROC UNIVARIATE also supports the following enhancements:
- The ALL option
requests tables of confidence limits, frequency,
modes, and extreme values, and plots. For unweighted analysis variables, it
also requests location counts, tests for normality, and robust estimators
of scale and location.
- The CIBASIC option produces the Basic Statistical Measures table.
This table provides basic measures of location and variability.
- The EXCLNPWGT option excludes observations with nonpositive weights
from the analysis.
- The LOCCOUNT option produces the Location Counts table. This table
shows the numbers of observations greater than, less than, and equal to the
value of the new MU0= option. Formerly, this information was in the Moments
table. The default value for MU0= is 0.
- Mode is no longer shown for continuous data. If there are multiple
modes, the lowest mode is still shown. The MODES option in the PROC UNIVARIATE
statement displays a table of all modes.
- The NORMAL option produces a table of Tests for Normality. These
tests include empirical distribution function (EDF) goodness-of-fit tests.
The Shapiro-Wilk test is included only if the sample size is less than or
equal to 2000. The Kolmogorov test is always included.
- The ROBUSTSCALE option produces a table with robust estimates
of scale.
- The TRIM= option produces a table of trimmed means.
- The WINSOR= option produces a table of winsorized means.
- A new
table, Basic Confidence Limits, provides basic confidence
intervals for the mean, the standard deviation, and the variance.
- You can request confidence intervals for quantiles with these
options:
- CIPCTLNORMAL
- requests confidence intervals that are based on the assumption
that the data are normal.
- CIPCTLDF
- requests confidence intervals that are distribution free.
- The Extremes table is replaced by two
tables:
- Extreme Observations table
- displays the n lowest observations
and n highest observations, where n is specified with the new option NEXTROBS= in the PROC UNIVARIATE
statement. FREQ and ID variables change the table.
- Extreme Values table
- displays the n lowest unique
and n highest unique values, where n is specified with the new option NEXTRVAL= in the PROC UNIVARIATE
statement.
Copyright 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.