The ESM Procedure |
Data Set Output |
The ESM procedure can create the OUT=, OUTEST=, OUTFOR=, OUTSTAT=, and OUTSUM= data sets. These data sets contain the variables listed in the BY statement and statistics related to the variables listing in the FORECAST statement. In general, if a forecasting step related to an output data set fails, the values of this step are not recorded or are set to missing in the related output data set and appropriate error and/or warning messages are recorded in the log.
The OUT= data set contains the variables specified in the BY, ID, and FORECAST statements. If the ID statement is specified, the ID variable values are aligned and extended based on the ALIGN= and INTERVAL= options. The values of the variables specified in the FORECAST statements are accumulated based on the ACCUMULATE= option, and missing values are interpreted based on the SETMISSING= option. If the REPLACEMISSING option is specified, embedded missing values are replaced by the one-step-ahead predicted values.
These FORECAST variables are then extrapolated based on the forecasts from the fitted models, or extended with missing values when the MODEL=NONE option is specified. If USE=LOWER is specified, the variable is extrapolated with the lower confidence limits; if USE=UPPER, the variable is extrapolated using the upper confidence limits; otherwise, the variable values are extrapolated with the predicted values. If the TRANSFORM= option is specified, the predicted values contain either mean or median forecasts depending on whether or not the MEDIAN option is specified.
If any of the forecasting steps fail for a particular variable, the variable is extended by missing values.
The OUTEST= data set contains the variables specified in the BY statement as well as the variables listed below. For variables listed in FORECAST statements where the option MODEL=NONE is specified, no observations are recorded in the OUTEST= data set. For variables listed in FORECAST statements where the option MODEL=NONE is not specified, the following variables in the OUTEST= data set contain observations related to the parameter estimation step:
variable name
forecasting model
transformation
parameter name
parameter estimate
standard errors
t values
probability values
If the parameter estimation step fails for a particular variable, no observations are output to the OUTEST= data set for that variable.
The OUTFOR= data set contains the variables specified in the BY statement as well as the variables listed below. For variables listed in FORECAST statements where the option MODEL=NONE is specified, no observations are recorded in the OUTFOR= data set for these variables. For variables listed in FORECAST statements where the option MODEL=NONE is not specified, the following variables in the OUTFOR= data set contain observations related to the forecasting step:
variable name
time ID values
actual values
predicted values
prediction standard errors
lower confidence limits
upper confidence limits
prediction errors
If the forecasting step fails for a particular variable, no observations are recorded in the OUTFOR= data set for that variable. If the TRANSFORM= option is specified, the values in the preceding variables are the inverse transform forecasts. If the MEDIAN option is specified, the median forecasts are stored; otherwise, the mean forecasts are stored.
The OUTPROCINFO= data set contains information about the run of the ESM procedure. The following variables are present:
set to the name of the procedure, in this case ESM
name of an item being reported; can be the number of errors, notes, or warnings, number of forecasts requested, and so on
descriptive label for the item in _NAME_
set to the current stage of the procedure, for ESM this is set to ALL
value of the item specified in _NAME_
The OUTSTAT= data set contains the variables specified in the BY statement as well as the variables listed below. For variables listed in FORECAST statements where the option MODEL=NONE is specified, no observations are recorded for these variables in the OUTSTAT= data set. For variables listed in FORECAST statements where the option MODEL=NONE is not specified, the following variables in the OUTSTAT= data set contain observations related to the statistics of fit:
variable name
the region in which the statistics are calculated. Statistics calculated in the fit region are indicated by FIT. Statistics calculated in the forecast region, which happens only if the BACK= option is greater than zero, are indicated by FORECAST.
degrees of freedom error
number of observations
number of observations used
number of missing actuals
number of missing predicted values
number of parameters
total sum of squares
corrected total sum of squares
sum of square error
mean square error
unbiased mean square error
root mean square error
unbiased root mean square error
mean absolute percent error
mean absolute error
mean absolute scaled error
R square
adjusted R square
Amemiya’s adjusted R square
random walk R square
Akaike information criterion
finite sample corrected AIC
Schwarz Bayesian information criterion
Amemiya’s prediction criterion
maximum error
minimum error
minimum percent error
maximum percent error
mean error
mean percent error
median absolute percent error
geometric mean absolute percent error
minimum predictive percent error
maximum predictive percent error
mean predictive percent error
symmetric mean absolute predictive percent error
median absolute predictive percent error
geometric mean absolute predictive percent error
minimum symmetric percent error
maximum symmetric percent error
mean symmetric percent error
symmetric mean absolute percent error
median absolute symmetric percent error
geometric mean absolute symmetric percent error
minimum relative error
maximum relative error
mean relative error
mean relative absolute error
median relative absolute error
geometric mean relative absolute error
minimum absolute error percent of standard deviation
maximum absolute error percent of standard deviation
mean absolute error percent of standard deviation
median absolute error percent of standard deviation
geometric mean absolute error percent of standard deviation
If the statistics of fit cannot be computed for a particular variable, no observations are recorded in the OUTSTAT= data set for that variable. If the TRANSFORM= option is specified, the values in the preceding variables are computed based on the inverse transform forecasts. If the MEDIAN option is specified, the median forecasts are the basis; otherwise, the mean forecasts are the basis.
See Chapter 46, Forecasting Process Details, for more information about the calculation of forecasting statistics of fit.
The OUTSUM= data set contains the variables specified in the BY statement as well as the variables listed below. The OUTSUM= data set records the summary statistics for each variable specified in a FORECAST statement. For variables listed in FORECAST statements where the option MODEL=NONE is specified, the values related to forecasts are set to missing for those variables in the OUTSUM= data set. For variables listed in FORECAST statements where the option MODEL=NONE is not specified, the forecast values are set based on the USE= option.
The following variables related to summary statistics are based on the ACCUMULATE= and SETMISSING= options:
variable name
forecasting status. Nonzero values imply that no forecast was generated for the series.
number of observations
number of nonmissing observations
number of missing observations
minimum value
maximum value
mean value
standard deviation
The following variables related to forecast summation are based on the LEAD= and STARTSUM= options:
forecast summation predicted values
forecast summation prediction standard errors
forecast summation lower confidence limits
forecast summation upper confidence limits
Variance-related computations are computed only when no transformation is specified (TRANSFORM=NONE).
The following variables related to multistep forecast are based on the LEAD= and USE= options:
multistep forecast ( ranges from one to the value of the LEAD= option). If USE=LOWER, this variable contains the lower confidence limits; if USE=UPPER, this variable contains the upper confidence limits; otherwise, this variable contains the predicted values.
If the forecast step fails for a particular variable, the variables that are related to forecasting are set to missing for that variable. The OUTSUM= data set contains both a summary of the (accumulated) time series and optionally its forecasts for all series.
Copyright © SAS Institute, Inc. All Rights Reserved.