Forecasting Methods :: SAS/ETS(R) 13.2 User's Guide

STEPAR Method

In the STEPAR method, PROC FORECAST first fits a time trend model to the series and takes the difference between each value and the estimated trend. (This process is called detrending.) Then, the remaining variation is fit by using an autoregressive model.

The STEPAR method fits the autoregressive process to the residuals of the trend model by using a backwards-stepping method to select parameters. Because the trend and autoregressive parameters are fit in sequence rather than simultaneously, the parameter estimates are not optimal in a statistical sense. However, the estimates are usually close to optimal, and the method is computationally inexpensive.

The STEPAR Algorithm

The STEPAR method consists of the following computational steps:

1. Fit the trend model as specified by the TREND= option by using ordinary least-squares regression. This step detrends the data. The default trend model for the STEPAR method is TREND=2, a linear trend model.
2. Take the residuals from step 1 and compute the autocovariances to the number of lags specified by the NLAGS= option.
3. Regress the current values against the lags, using the autocovariances from step 2 in a Yule-Walker framework. Do not bring in any autoregressive parameter that is not significant at the level specified by the SLENTRY= option. (The default is SLENTRY=0.20.) Do not bring in any autoregressive parameter that results in a nonpositive-definite Toeplitz matrix.
4. Find the autoregressive parameter that is least significant. If the significance level is greater than the SLSTAY= value, remove the parameter from the model. (The default is SLSTAY=0.05.) Continue this process until only significant autoregressive parameters remain. If the OUTEST= option is specified, write the estimates to the OUTEST= data set.
5. Generate the forecasts by using the estimated model and output to the OUT= data set. Form the confidence limits by combining the trend variances with the autoregressive variances.

Missing values are tolerated in the series; the autocorrelations are estimated from the available data and tapered if necessary.

This method requires at least three passes through the data: two passes to fit the model and a third pass to initialize the autoregressive process and write to the output data set.
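The first three computational steps can be illustrated with a small Python sketch. It is not PROC FORECAST's implementation: it assumes a linear trend (TREND=2), a complete series with no missing values, and a full autoregressive fit of order NLAGS. The backward-stepping selection governed by SLENTRY= and SLSTAY=, the positive-definiteness check, and the confidence limits are all omitted. The function names are hypothetical.

```python
def fit_linear_trend(x):
    """Step 1: ordinary least-squares fit of x[t] = a + b*t for t = 1..n."""
    n = len(x)
    t_mean = (n + 1) / 2
    x_mean = sum(x) / n
    sxx = sum((t - t_mean) ** 2 for t in range(1, n + 1))
    sxy = sum((t - t_mean) * (xi - x_mean) for t, xi in enumerate(x, start=1))
    b = sxy / sxx
    a = x_mean - b * t_mean
    return a, b


def autocovariances(resid, nlags):
    """Step 2: sample autocovariances of the residuals at lags 0..nlags."""
    n = len(resid)
    mean = sum(resid) / n
    d = [r - mean for r in resid]
    return [sum(d[i] * d[i + k] for i in range(n - k)) / n
            for k in range(nlags + 1)]


def yule_walker(acov):
    """Step 3 (without selection): solve the Yule-Walker equations.

    Uses the Levinson-Durbin recursion. Returns (phi, sigma2), where
    phi[j - 1] is the coefficient on lag j and sigma2 is the innovation
    variance implied by the autocovariances.
    """
    phi = []
    sigma2 = acov[0]
    for k in range(1, len(acov)):
        acc = acov[k] - sum(phi[j] * acov[k - 1 - j] for j in range(k - 1))
        kappa = acc / sigma2  # partial autocorrelation at lag k
        phi = [phi[j] - kappa * phi[k - 2 - j] for j in range(k - 1)] + [kappa]
        sigma2 *= 1 - kappa ** 2
    return phi, sigma2


def stepar_sketch(x, nlags):
    """Detrend, then fit an AR(nlags) model to the residuals."""
    a, b = fit_linear_trend(x)
    resid = [xi - (a + b * t) for t, xi in enumerate(x, start=1)]
    phi, sigma2 = yule_walker(autocovariances(resid, nlags))
    return (a, b), phi, sigma2
```

Because the trend is removed before the autoregressive parameters are estimated, the two sets of estimates come from separate fits, which is the sequential behavior described above.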

Default Value of the NLAGS= Option

If the NLAGS= option is not specified, the default value of the NLAGS= option is chosen based on the data frequency specified by the INTERVAL= option and on the number of observations in the input data set, if this can be determined in advance. (PROC FORECAST cannot determine the number of input observations before reading the data when a BY statement or a WHERE statement is used or if the data are from a tape format SAS data set or external database. The NLAGS= value must be fixed before the data are processed.)

If the INTERVAL= option is specified, the default NLAGS= value includes lags for up to three years plus one, subject to the maximum of 13 lags or one-third of the number of observations in your data set, whichever is less. If the number of observations in the input data set cannot be determined, the maximum NLAGS= default value is 13. If the INTERVAL= option is not specified, the default is NLAGS=13 or one-third the number of input observations, whichever is less.

If the Toeplitz matrix formed by the autocovariance matrix at a given step is not positive definite, the maximal number of autoregressive lags is reduced.

For example, for INTERVAL=QTR, the default is NLAGS=13 (that is, $4 \times 3 + 1$) provided that there are at least 39 observations. The NLAGS= option default is always at least 3.
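The default rule can be written as a short Python function. This is a sketch of the rule as stated in this section, not the procedure's code. The argument `periods_per_year` (for example, 4 for INTERVAL=QTR or 12 for INTERVAL=MONTH) is a hypothetical stand-in for the INTERVAL= value, and rounding one-third of the observation count down is an assumption.

```python
def default_nlags(periods_per_year=None, nobs=None):
    """Default NLAGS= value under the rule described in this section.

    periods_per_year: periods in a year implied by INTERVAL=, or None
        when INTERVAL= is not specified.
    nobs: number of input observations, or None when it cannot be
        determined before the data are read.
    """
    cap = 13
    if nobs is not None:
        cap = min(cap, nobs // 3)  # assumption: one-third is rounded down
    if periods_per_year is not None:
        candidate = 3 * periods_per_year + 1  # three years of lags plus one
    else:
        candidate = 13
    return max(3, min(candidate, cap))  # the default is never less than 3
```

For instance, `default_nlags(4, 39)` reproduces the quarterly example, and `default_nlags(4, 30)` shows the one-third limit taking effect.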

EXPO Method

Exponential smoothing is used when the METHOD=EXPO option is specified. The term exponential smoothing is derived from the computational scheme developed by Brown and others (Brown and Meyer, 1961; Brown, 1962). Estimates are computed with updating formulas that are developed across time series in a manner similar to smoothing.

The EXPO method fits a trend model such that the most recent data are weighted more heavily than data in the early part of the series. The weight of an observation is a geometric (exponential) function of the number of periods that the observation extends into the past relative to the current period. The weight function is

$w_{\tau} = \omega (1-\omega)^{t-\tau}$

where $\tau$ is the observation number of the past observation, $t$ is the current observation number, and $\omega$ is the weighting constant specified with the WEIGHT= option.
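As a quick numerical illustration of the weight function, the following Python sketch lists the weights for observations 1 through $t$. The function name is hypothetical.

```python
def smoothing_weights(omega, t):
    """Weights w_tau = omega * (1 - omega)**(t - tau) for tau = 1..t."""
    return [omega * (1 - omega) ** (t - tau) for tau in range(1, t + 1)]


weights = smoothing_weights(0.2, 5)
# The last entry is the weight of the current observation (tau = t),
# and each earlier weight is smaller by a factor of (1 - omega).
```

The weights over a finite history sum to $1 - (1-\omega)^{t}$, which approaches 1 as the series grows longer.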

You specify the model with the TREND= option as follows:

TREND=1 specifies single exponential smoothing (a constant model)
TREND=2 specifies double exponential smoothing (a linear trend model)
TREND=3 specifies triple exponential smoothing (a quadratic trend model)

Updating Equations

The single exponential smoothing operation is expressed by the formula

$S_{t} = \omega x_{t} + (1-\omega) S_{t-1}$

where $S_{t}$ is the smoothed value at the current period, $t$ is the time index of the current period, and $x_{t}$ is the current actual value of the series. The smoothed value $S_{t}$ is the forecast of $x_{t+1}$ and is calculated as the smoothing constant $\omega$ times the value of the series, $x_{t}$, in the current period plus $(1-\omega)$ times the previous smoothed value $S_{t-1}$, which is the forecast of $x_{t}$ computed at time $t-1$.
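The updating formula translates directly into a loop. In this minimal Python sketch the starting value `s0` is supplied by the caller. How PROC FORECAST chooses its starting value is not covered in this section, so that choice is left as an assumption of the example.

```python
def single_smooth(x, omega, s0):
    """Apply S_t = omega * x_t + (1 - omega) * S_{t-1} along the series.

    s0 is the smoothed value before the first observation (an assumption
    supplied by the caller). Returns the list of S_t values; each S_t
    serves as the forecast of the next observation.
    """
    s = s0
    smoothed = []
    for xt in x:
        s = omega * xt + (1 - omega) * s
        smoothed.append(s)
    return smoothed
```

With `omega=0.5` and `s0=10`, the series `[10, 20]` yields smoothed values of 10 and 15: the second value moves halfway from the previous forecast toward the new observation.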

Double and triple exponential smoothing are derived by applying exponential smoothing to the smoothed series, obtaining smoothed values as follows:

$S_{t}^{[2]} = \omega S_{t} + (1-\omega) S_{t-1}^{[2]}$

$S_{t}^{[3]} = \omega S_{t}^{[2]} + (1-\omega) S_{t-1}^{[3]}$
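The three smoothing operations form a cascade in which each level smooths the output of the level before it. The Python sketch below applies all three updates at each time step. The starting values in `init` are supplied by the caller and are an assumption of the example, and the handling of missing values is not shown.

```python
def smooth_cascade(x, omega, init):
    """Single, double, and triple smoothed values for each observation.

    init is a tuple (S_0, S_0^[2], S_0^[3]) of starting values chosen by
    the caller. Returns a list of (S_t, S_t^[2], S_t^[3]) tuples.
    """
    s1, s2, s3 = init
    history = []
    for xt in x:
        s1 = omega * xt + (1 - omega) * s1  # smooth the series
        s2 = omega * s1 + (1 - omega) * s2  # smooth the smoothed series
        s3 = omega * s2 + (1 - omega) * s3  # smooth it once more
        history.append((s1, s2, s3))
    return history
```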

Missing values after the start of the series are replaced with one-step-ahead predicted values, and the predicted value is then applied to the smoothing equations.

The polynomial time trend parameters CONSTANT, LINEAR, and QUAD in the OUTEST= data set are computed from $S_{T}$, $S_{T}^{[2]}$, and $S_{T}^{[3]}$, the final smoothed values at observation $T$, the last observation used to fit the model. In the OUTEST= data set, the values of $S_{T}$, $S_{T}^{[2]}$, and $S_{T}^{[3]}$ are identified by _TYPE_=S1, _TYPE_=S2, and _TYPE_=S3, respectively.
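For the linear trend case, the textbook relations from Brown (1962) show how a level and a slope can be recovered from the final smoothed values. The Python sketch below uses those relations with the time origin placed at observation $T$. The exact parameterization that PROC FORECAST writes to the OUTEST= data set is not given in this section and may differ, so treat the function names and the time origin as assumptions.

```python
def brown_linear_trend(s1, s2, omega):
    """Level and slope at time T from S_T and S_T^[2] (Brown's relations)."""
    level = 2 * s1 - s2
    slope = omega / (1 - omega) * (s1 - s2)
    return level, slope


def linear_forecast(level, slope, lead):
    """Forecast `lead` periods after T from the level and slope at T."""
    return level + slope * lead
```

For a series that follows an exact line with slope $b$, the smoothed values settle at $S_{T} = x_{T} - b(1-\omega)/\omega$ and $S_{T}^{[2]} = x_{T} - 2b(1-\omega)/\omega$, and the relations above then return $x_{T}$ and $b$.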

Smoothing Weights

Exponential smoothing forecasts are forecasts for an integrated moving-average process; however, the weighting parameter is specified by the user rather than estimated from the data. Experience has shown that good values for the WEIGHT= option are between 0.05 and 0.3. As a general rule, smaller smoothing weights are appropriate for series with a slowly changing trend, while larger weights are appropriate for volatile series with a rapidly changing trend. If unspecified, the weight defaults to $1 - 0.8^{1/\mathit{trend}}$, where trend is the value of the TREND= option. This produces defaults of WEIGHT=0.2 for TREND=1, WEIGHT=0.10557 for TREND=2, and WEIGHT=0.07168 for TREND=3.
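The default weight formula is easy to check numerically. The following Python sketch evaluates it for each TREND= value. The function name is hypothetical.

```python
def default_weight(trend):
    """Default WEIGHT= value, 1 - 0.8**(1/trend), for TREND=1, 2, or 3."""
    if trend not in (1, 2, 3):
        raise ValueError("trend must be 1, 2, or 3")
    return 1 - 0.8 ** (1 / trend)


for trend in (1, 2, 3):
    print(trend, round(default_weight(trend), 5))
```

One way to read the formula: after $\mathit{trend}$ successive smoothing passes, the factor $(1-\omega)^{\mathit{trend}}$ equals 0.8 for every TREND= value.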

The ESM procedure can be used to forecast time series by using exponential smoothing with smoothing weights that are optimized automatically. See Chapter 14: The ESM Procedure.

The Time Series Forecasting System provides for exponential smoothing models and enables you to either specify or optimize the smoothing weights. See Chapter 46: Getting Started with Time Series Forecasting, for details.

Confidence Limits

The confidence limits for exponential smoothing forecasts are calculated as they would be for an exponentially weighted time trend regression, using the simplifying assumption of an infinite number of observations. The variance estimate is computed by using the mean square of the unweighted one-step-ahead forecast residuals.

More detailed descriptions of the forecast computations can be found in Montgomery and Johnson (1976) and Brown (1962).

WINTERS Method

The WINTERS method uses updating equations similar to exponential smoothing to fit parameters for the model

$x_{t} = (a + b t)\, s(t) + \epsilon_{t}$

where $a$ and $b$ are the trend parameters and the function $s(t)$ selects the seasonal parameter for the season that corresponds to time $t$.

The WINTERS method assumes that the series values are positive. If negative or zero values are found in the series, a warning is printed and the values are treated as missing.

The preceding standard WINTERS model uses a linear trend. However, PROC FORECAST can also fit a version of the WINTERS method that uses a quadratic trend. When TREND=3 is specified for METHOD=WINTERS, PROC FORECAST fits the following model:

$x_{t} = (a + b t + c t^{2})\, s(t) + \epsilon_{t}$

The quadratic trend version of the Winters method is often unstable, and its use is not recommended.

When TREND=1 is specified, the following constant trend version is fit:

$x_{t} = a\, s(t) + \epsilon_{t}$

The default for the WINTERS method is TREND=2, which produces the standard linear trend model.
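The three model forms differ only in the polynomial that multiplies the seasonal factor. The Python sketch below evaluates the model mean (the right-hand side without the error term) for a given TREND= value. It illustrates the model equations only and does not show how the parameters are updated. The function names are hypothetical, and `None` is used here as a stand-in for a missing value.

```python
def winters_mean(t, s_t, a, b=0.0, c=0.0, trend=2):
    """Model mean for the WINTERS method at time t.

    s_t is the seasonal factor s(t) for the season that contains t.
    trend selects the constant (1), linear (2), or quadratic (3) form.
    """
    if trend == 1:
        level = a
    elif trend == 2:
        level = a + b * t
    elif trend == 3:
        level = a + b * t + c * t ** 2
    else:
        raise ValueError("trend must be 1, 2, or 3")
    return level * s_t


def positive_or_missing(value):
    """Treat zero or negative series values as missing (None)."""
    return value if value is not None and value > 0 else None
```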

Seasonal Factors

The notation $s(t)$ represents the selection of the seasonal factor used for different time periods. For example, if INTERVAL=DAY and SEASONS=MONTH, there are 12 seasonal factors, one for each month in the year, and the time index $t$ is measured in days. For any observation, $t$ is determined by the ID variable and $s(t)$ selects the seasonal factor for the month that $t$ falls in. For example, if $t$ is 9 February 1993, then $s(t)$ is the seasonal parameter for February.

When there are multiple seasons specified, $s(t)$ is the product of the parameters for the seasons. For example, if SEASONS=(MONTH DAY), then $s(t)$ is the product of the seasonal parameter for the month that corresponds to the period $t$ and the seasonal parameter for the day of the week that corresponds to period $t$. When the SEASONS= option is not specified, the seasonal factors $s(t)$ are not included in the model. See the section Specifying Seasonality for more information about specifying multiple seasonal factors.
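The selection of seasonal factors can be pictured as a lookup keyed by the date of the observation. The Python sketch below handles the month and day-of-week cases discussed above. The function name, the argument layout, and the factor values in the example are made up for illustration and are not part of PROC FORECAST.

```python
from datetime import date


def seasonal_factor(d, month_factors=None, weekday_factors=None):
    """Return s(t) for the observation dated d.

    month_factors: mapping from month number (1..12) to a factor, or None.
    weekday_factors: sequence of 7 factors indexed Monday=0 .. Sunday=6,
        or None.
    With both supplied, s(t) is the product of the two factors. With
    neither, no seasonal adjustment is applied and the result is 1.0.
    """
    s = 1.0
    if month_factors is not None:
        s *= month_factors[d.month]
    if weekday_factors is not None:
        s *= weekday_factors[d.weekday()]
    return s


# Illustrative factors: February is 20% above average, Tuesday 10% below.
months = {m: 1.0 for m in range(1, 13)}
months[2] = 1.2
weekdays = [1.0, 0.9, 1.0, 1.0, 1.0, 1.0, 1.0]

feb_9 = date(1993, 2, 9)  # a Tuesday
monthly_only = seasonal_factor(feb_9, months)
month_and_day = seasonal_factor(feb_9, months, weekdays)
```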