The OUTPUT statement creates a new SAS data set that contains statistics calculated for each observation unit. These
statistics can include the estimated linear predictor and its standard error, residuals, and influence statistics. In addition, this data set includes all the variables from the DATA= input data set.
Only score residuals are available in the OUTPUT data set if the model contains a time-dependent variable that is defined
by means of programming statements.
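As a rough illustration of this restriction, the following sketch defines a time-dependent covariate with a programming statement and requests only score residuals. It assumes the procedure is PROC SURVEYPHREG, which the text here does not name. The data set Study and the variables Time, Status, Age, Wt, and AgeLogT are hypothetical.

```sas
/* Sketch only: Study, Time, Status, Age, Wt, and AgeLogT are hypothetical names. */
proc surveyphreg data=Study;
   weight Wt;
   model Time*Status(0) = Age AgeLogT;
   /* Programming statement: AgeLogT is a time-dependent covariate. */
   AgeLogT = Age * log(Time);
   /* Only score residuals are requested, because the model contains a
      time-dependent variable defined by a programming statement. */
   output out=ScoreRes ressco=(Sco_Age Sco_AgeLogT);
run;
```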
The following list explains specifications in the OUTPUT statement:
-
OUT=SAS-data-set
-
names the output data set. If you omit the OUT= option, the OUTPUT data set is named by using the DATAn
convention. See the section OUT= Data Set for the OUTPUT statement for more information.
-
keyword=name
-
specifies the statistics to include in the OUTPUT data set and names the new variables that contain the statistics. Specify
a keyword for each desired statistic (see the following list of keywords), optionally followed by an equal sign and either a variable name or a list of variable names in parentheses. If
you specify a keyword without a variable name, the procedure uses default names. The keywords that accept a list of variables are RESSCH, RESSCO, and WTRESSCH. For these keywords, you can specify up to as many names as there are explanatory variables in the MODEL statement. If you specify k names and k is less than the number of explanatory variables, those k names are used for the first k statistics, and the procedure assigns default names to the rest. The keywords and the corresponding statistics are as follows:
-
ATRISK
-
specifies the number of subjects at risk at the observation time.
-
RESDEV
-
specifies the deviance residual. This is a transform of the martingale residual to achieve a more symmetric distribution.
-
RESMART
-
specifies the martingale residual. The residual at the observation time t can be interpreted as the difference over [0, t] between the observed number of events and the expected number of events given by the model.
-
RESSCH
-
specifies the Schoenfeld residuals. These residuals are useful in assessing the proportional hazards assumption.
-
RESSCO
-
specifies the score residuals. These residuals are a decomposition of the first partial derivative of the log likelihood. They can be used to assess the
leverage that is exerted by each subject in the parameter estimation. They are also useful in constructing design-based variance
estimators.
-
STDXBETA
-
specifies the standard error of the estimated linear predictor.
-
WTATRISK
-
specifies the weighted number of subjects at risk at the observation time.
-
XBETA
-
specifies the estimate of the linear predictor.
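The specifications above might be combined as in the following minimal sketch. It assumes the procedure is PROC SURVEYPHREG, which this section does not name. The data set Study, the design variables Stratum, PSU, and Wt, the model variables, and all output variable names are hypothetical.

```sas
/* Sketch only: all data set and variable names are hypothetical. */
proc surveyphreg data=Study;
   strata Stratum;
   cluster PSU;
   weight Wt;
   model Time*Status(0) = Age Treatment Stage;
   output out=Diag                   /* OUT= names the output data set        */
          xbeta=LinPred              /* estimated linear predictor            */
          stdxbeta=SE_LinPred        /* standard error of linear predictor    */
          resmart=Mart               /* martingale residual                   */
          resdev=Dev                 /* deviance residual                     */
          atrisk                     /* no name given: a default name is used */
          ressco=(Sco_Age Sco_Trt);  /* two names for three covariates: the   */
                                     /* third score residual gets a default   */
                                     /* name                                  */
run;
```

Because the OUT= data set also carries every variable from the DATA= data set, Diag in this sketch would contain the original Study variables alongside the requested statistics.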