PROC AUTOREG: Generalized Durbin-Watson Tests :: SAS/ETS(R) 9.2 User's Guide

The AUTOREG Procedure

Consider the following linear regression model:

$\text{[math]}$

where $\text{[math]}$ is an $\text{[math]}$ data matrix, $\text{[math]}$ is a $\text{[math]}$ coefficient vector, and $\text{[math]}$ is a $\text{[math]}$ disturbance vector. The error term $\text{[math]}$ is assumed to be generated by the $\text{[math]}$ th-order autoregressive process $\text{[math]}$ where $\text{[math]}$ , $\text{[math]}$ is a sequence of independent normal error terms with mean 0 and variance $\text{[math]}$ . Usually, the Durbin-Watson statistic is used to test the null hypothesis $\text{[math]}$ against $\text{[math]}$ . Vinod (1973) generalized the Durbin-Watson statistic:

$\text{[math]}$

where $\text{[math]}$ are OLS residuals. Using the matrix notation,

$\text{[math]}$

where $\text{[math]}$ and $\text{[math]}$ is a $\text{[math]}$ matrix:

$\text{[math]}$

and there are $\text{[math]}$ zeros between $\text{[math]}$ and 1 in each row of matrix $\text{[math]}$ .

The QR factorization of the design matrix $\text{[math]}$ yields a $\text{[math]}$ orthogonal matrix $\text{[math]}$ :

$\text{[math]}$

where R is an $\text{[math]}$ upper triangular matrix. There exists an $\text{[math]}$ submatrix of $\text{[math]}$ such that $\text{[math]}$ and $\text{[math]}$ . Consequently, the generalized Durbin-Watson statistic is stated as a ratio of two quadratic forms:

$\text{[math]}$

where $\text{[math]}$ are upper n eigenvalues of $\text{[math]}$ and $\text{[math]}$ is a standard normal variate, and $\text{[math]}$ . These eigenvalues are obtained by a singular value decomposition of $\text{[math]}$ (Golub and Van Loan; 1989; Savin and White; 1978).

The marginal probability (or p-value) for $\text{[math]}$ given $\text{[math]}$ is

$\text{[math]}$

where

$\text{[math]}$

When the null hypothesis $\text{[math]}$ holds, the quadratic form $\text{[math]}$ has the characteristic function

$\text{[math]}$

The distribution function is uniquely determined by this characteristic function:

$\text{[math]}$

For example, to test $\text{[math]}$ given $\text{[math]}$ against $\text{[math]}$ , the marginal probability (p-value) can be used:

$\text{[math]}$

where

$\text{[math]}$

and $\text{[math]}$ is the calculated value of the fourth-order Durbin-Watson statistic.

In the Durbin-Watson test, the marginal probability indicates positive autocorrelation ( $\text{[math]}$ ) if it is less than the level of significance ( $\text{[math]}$ ), while you can conclude that a negative autocorrelation ( $\text{[math]}$ ) exists if the marginal probability based on the computed Durbin-Watson statistic is greater than $\text{[math]}$ . Wallis (1972) presented tables for bounds tests of fourth-order autocorrelation, and Vinod (1973) has given tables for a 5% significance level for orders two to four. Using the AUTOREG procedure, you can calculate the exact p-values for the general order of Durbin-Watson test statistics. Tests for the absence of autocorrelation of order p can be performed sequentially; at the $\text{[math]}$ th step, test $\text{[math]}$ given $\text{[math]}$ against $\text{[math]}$ . However, the size of the sequential test is not known.

The Durbin-Watson statistic is computed from the OLS residuals, while that of the autoregressive error model uses residuals that are the difference between the predicted values and the actual values. When you use the Durbin-Watson test from the residuals of the autoregressive error model, you must be aware that this test is only an approximation. See Autoregressive Error Model earlier in this chapter. If there are missing values, the Durbin-Watson statistic is computed using all the nonmissing values and ignoring the gaps caused by missing residuals. This does not affect the significance level of the resulting test, although the power of the test against certain alternatives may be adversely affected. Savin and White (1978) have examined the use of the Durbin-Watson statistic with missing values.

Enhanced Durbin-Watson Probability Computation

The Durbin-Watson probability calculations have been enhanced to compute the $\text{[math]}$ -value of the generalized Durbin-Watson statistic for large sample sizes. Previously, the Durbin-Watson probabilities were only calculated for small sample sizes.

Consider the following linear regression model:

$\text{[math]}$

where $\text{[math]}$ is an $\text{[math]}$ data matrix, $\text{[math]}$ is a $\text{[math]}$ coefficient vector, $\text{[math]}$ is a $\text{[math]}$ disturbance vector, and $\text{[math]}$ is a sequence of independent normal error terms with mean 0 and variance $\text{[math]}$ .

The generalized Durbin-Watson statistic is written as

$\text{[math]}$

where $\text{[math]}$ is a vector of OLS residuals and $\text{[math]}$ is a $\text{[math]}$ matrix. The generalized Durbin-Watson statistic DW $\text{[math]}$ can be rewritten as

$\text{[math]}$

where $\text{[math]}$ .

The marginal probability for the Durbin-Watson statistic is

$\text{[math]}$

where $\text{[math]}$ .

The $\text{[math]}$ -value or the marginal probability for the generalized Durbin-Watson statistic is computed by numerical inversion of the characteristic function $\text{[math]}$ of the quadratic form $\text{[math]}$ . The trapezoidal rule approximation to the marginal probability $\text{[math]}$ is

$\text{[math]}$

where $\text{[math]}$ is the imaginary part of the characteristic function, $\text{[math]}$ and $\text{[math]}$ are integration and truncation errors, respectively. Refer to Davies (1973) for numerical inversion of the characteristic function.

Ansley, Kohn, and Shively (1992) proposed a numerically efficient algorithm that requires O( $\text{[math]}$ ) operations for evaluation of the characteristic function $\text{[math]}$ . The characteristic function is denoted as

	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$

where $\text{[math]}$ and $\text{[math]}$ . By applying the Cholesky decomposition to the complex matrix $\text{[math]}$ , you can obtain the lower triangular matrix $\text{[math]}$ that satisfies $\text{[math]}$ . Therefore, the characteristic function can be evaluated in O( $\text{[math]}$ ) operations by using the following formula:

$\text{[math]}$

where $\text{[math]}$ . Refer to Ansley, Kohn, and Shively (1992) for more information on evaluation of the characteristic function.

Tests for Serial Correlation with Lagged Dependent Variables

When regressors contain lagged dependent variables, the Durbin-Watson statistic ( $\text{[math]}$ ) for the first-order autocorrelation is biased toward 2 and has reduced power. Wallis (1972) shows that the bias in the Durbin-Watson statistic ( $\text{[math]}$ ) for the fourth-order autocorrelation is smaller than the bias in $\text{[math]}$ in the presence of a first-order lagged dependent variable. Durbin (1970) proposes two alternative statistics (Durbin h and t ) that are asymptotically equivalent. The h statistic is written as

$\text{[math]}$

where $\text{[math]}$ and $\text{[math]}$ is the least squares variance estimate for the coefficient of the lagged dependent variable. Durbin’s t test consists of regressing the OLS residuals $\text{[math]}$ on explanatory variables and $\text{[math]}$ and testing the significance of the estimate for coefficient of $\text{[math]}$ .

Inder (1984) shows that the Durbin-Watson test for the absence of first-order autocorrelation is generally more powerful than the h test in finite samples. Refer to Inder (1986) and King and Wu (1991) for the Durbin-Watson test in the presence of lagged dependent variables.

Top of Page