The ARIMA Procedure

General Notation for ARIMA Models

The order of an ARIMA (autoregressive integrated moving-average) model is usually denoted by the notation ARIMA(p,d,q ), where

p: is the order of the autoregressive part
d: is the order of the differencing
q: is the order of the moving-average process

If no differencing is done (d = 0), the models are usually referred to as ARMA(p, q) models. The final model in the preceding example is an ARIMA(1,1,1) model since the IDENTIFY statement specified d = 1, and the final ESTIMATE statement specified p = 1 and q = 1.

Notation for Pure ARIMA Models

Mathematically the pure ARIMA model is written as

$\text{[math]}$

where

t: indexes time
$\text{[math]}$: is the response series $\text{[math]}$ or a difference of the response series
$\text{[math]}$: is the mean term
$\text{[math]}$: is the backshift operator; that is, $\text{[math]}$
$\text{[math]}$: is the autoregressive operator, represented as a polynomial in the backshift operator: $\text{[math]}$
$\text{[math]}$: is the moving-average operator, represented as a polynomial in the backshift operator: $\text{[math]}$
$\text{[math]}$: is the independent disturbance, also called the random error

The series $\text{[math]}$ is computed by the IDENTIFY statement and is the series processed by the ESTIMATE statement. Thus, $\text{[math]}$ is either the response series Y $\text{[math]}$ or a difference of $\text{[math]}$ specified by the differencing operators in the IDENTIFY statement.

For simple (nonseasonal) differencing, $\text{[math]}$ . For seasonal differencing $\text{[math]}$ , where d is the degree of nonseasonal differencing, D is the degree of seasonal differencing, and s is the length of the seasonal cycle.

For example, the mathematical form of the ARIMA(1,1,1) model estimated in the preceding example is

$\text{[math]}$

Model Constant Term

The ARIMA model can also be written as

$\text{[math]}$

where

$\text{[math]}$

Thus, when an autoregressive operator and a mean term are both included in the model, the constant term for the model can be represented as $\text{[math]}$ . This value is printed with the label "Constant Estimate" in the ESTIMATE statement output.

Notation for Transfer Function Models

The general ARIMA model with input series, also called the ARIMAX model, is written as

$\text{[math]}$

where

$\text{[math]}$: is the ith input time series or a difference of the ith input series at time t
$\text{[math]}$: is the pure time delay for the effect of the ith input series
$\text{[math]}$: is the numerator polynomial of the transfer function for the ith input series
$\text{[math]}$: is the denominator polynomial of the transfer function for the ith input series.

The model can also be written more compactly as

$\text{[math]}$

where

$\text{[math]}$: is the transfer function for the ith input series modeled as a ratio of the $\text{[math]}$ and $\text{[math]}$ polynomials: $\text{[math]}$
$\text{[math]}$: is the noise series: $\text{[math]}$

This model expresses the response series as a combination of past values of the random shocks and past values of other input series. The response series is also called the dependent series or output series. An input time series is also referred to as an independent series or a predictor series. Response variable, dependent variable, independent variable, or predictor variable are other terms often used.

Notation for Factored Models

ARIMA models are sometimes expressed in a factored form. This means that the $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , or $\text{[math]}$ polynomials are expressed as products of simpler polynomials. For example, you could express the pure ARIMA model as

$\text{[math]}$

where $\text{[math]}$ and $\text{[math]}$ .

When an ARIMA model is expressed in factored form, the order of the model is usually expressed by using a factored notation also. The order of an ARIMA model expressed as the product of two factors is denoted as ARIMA(p,d,q) $\text{[math]}$ (P,D,Q).

Notation for Seasonal Models

ARIMA models for time series with regular seasonal fluctuations often use differencing operators and autoregressive and moving-average parameters at lags that are multiples of the length of the seasonal cycle. When all the terms in an ARIMA model factor refer to lags that are a multiple of a constant s, the constant is factored out and suffixed to the ARIMA(p,d,q ) notation.

Thus, the general notation for the order of a seasonal ARIMA model with both seasonal and nonseasonal factors is ARIMA(p,d,q) $\text{[math]}$ (P,D,Q) $\text{[math]}$ . The term (p,d,q) gives the order of the nonseasonal part of the ARIMA model; the term (P,D,Q) $\text{[math]}$ gives the order of the seasonal part. The value of s is the number of observations in a seasonal cycle: 12 for monthly series, 4 for quarterly series, 7 for daily series with day-of-week effects, and so forth.

For example, the notation ARIMA(0,1,2) $\text{[math]}$ (0,1,1) $\text{[math]}$ describes a seasonal ARIMA model for monthly data with the following mathematical form:

$\text{[math]}$