Vector Error Correction Modeling
This section discusses the implication of cointegration for
the autoregressive representation. Assume that the
cointegrated series can be represented by a vector error
correction model
according to the Granger representation theorem
(Engle and Granger 1987).
Consider the vector autoregressive process with
Gaussian errors defined by

or

where the initial values, y-p+1, ... ,y0,
are fixed and
.
Since the AR operator
can be re-expressed as
where
with
, the vector error
correction model is

or

where
.One motivation for the VECM(p) form is to consider the relation
as defining the underlying economic relations
and
assume that the agents react to the disequilibrium error
through the adjustment coefficient
to restore equilibrium; that is, they satisfy the
economic relations. The cointegrating vector,
is sometimes
called the long-run parameters.
You can consider a vector error correction model with a
deterministic term.
The deterministic term Dt can contain a constant, a linear trend,
seasonal dummy variables, or nonstochastic regressors.

where
.The alternative vector error correction representation considers
the error correction term at lag t-p
and is written as

If the matrix
has a full rank (r=k), all components of
yt are
I(0). On the other hand, yt are stationary in difference
if
.When the rank of the matrix
is r < k, there are k-r linear
combinations that are nonstationary and r stationary cointegrating
relations.
Note that the linearly independent vector
is stationary and this transformation is not unique unless r=1.
There does not exist a unique cointegrating matrix
since
the coefficient matrix
can also be decomposed as

where M is an r×r nonsingular matrix.
Test for the Cointegration
The cointegration rank test determines the linearly independent
columns of
.
Johansen (1988, 1995a) and Johansen and Juselius (1990) proposed
the cointegration rank test using the reduced rank regression.
Different Specifications of Deterministic Trends
When you construct the VECM(p) form from the VAR(p) model,
the deterministic
terms in the VECM(p) form can differ from those in the VAR(p) model.
When there are deterministic cointegrated relationships among
variables, deterministic terms in the VAR(p) model will not
be present in the VECM(p) form.
On the other hand, if there are stochastic cointegrated relationships,
deterministic terms appear in the VECM(p) form
via the error correction term or
as an independent term in the VECM(p) form.
- Case 1: There is no separate drift in the VECM(p) form.

- Case 2: There is no separate drift in the VECM(p) form,
but a constant enters only via the error correction term.

- Case 3: There is a separate drift and no separate
linear trend in the VECM(p) form.

- Case 4: There is a separate drift and no separate
linear trend in the VECM(p) form,
but a linear trend enters only via the error correction term.

- Case 5: There is a separate linear trend in the
VECM(p) form.

First, focus on cases 1, 3, and 5 to test the null hypothesis that
there are at most r cointegrating vectors. Let
![Z_{0t}&=&\Delta{y}_t \ Z_{1t}&=&y_{t-1} \ Z_{2t}&=&[\Delta{y}_{t-1}', ... ,\Delt... ...' \ Z_{1} &=& [Z_{11}, ... , Z_{1T}]' \ Z_{2} &=& [Z_{21}, ... , Z_{2T}]'](images/vareq349.gif)
where Dt can be empty for Case 1; 1 for Case 3; (1,t) for Case 5.
Let
be the matrix of parameters consisting of
and A, where parameters A corresponds to
regressors Dt.
Then the VECM(p) form is rewritten in these variables as

The log-likelihood function is given by

The residuals, R0t and R1t, are obtained by regressing
Z0t and Z1t on Z2t, respectively.
The regression equation in residuals is

The crossproducts matrices are computed

Then the maximum likelihood estimator for
is obtained from
the eigenvectors corresponding to the r largest eigenvalues of
the following equation:

The eigenvalues of the preceding equation are
squared canonical correlations between R0t and R1t, and
the eigenvectors corresponding to the r largest eigenvalues
are the r linear combinations of yt-1, which have
the largest
squared partial correlations with the stationary process
after correcting for lags and deterministic terms.
Such an analysis calls for a reduced rank regression of
on yt-1 corrected for
, as
discussed by Anderson (1951).
Johansen (1988) suggested two test statistics to test
the null hypothesis that there are at most r cointegrating vectors

Trace Test

The asymptotic distribution of this statistic is given by

where tr(A) is the trace of a matrix A,
W is the k-r dimensional Brownian motion, and
is the
Brownian motion itself, or the demeaned or detrended Brownian
motion according
to the different specifications of deterministic trends
in the vector error correction model.
Maximum Eigenvalue Test

The asymptotic distribution of this statistic is given by

where max(A) is the maximum eigenvalue of a matrix A.
Osterwald-Lenum (1992) provided the detailed tables of critical values of
these statistics.
In case 2, consider that Z1t = (yt',1)' and
.In case 4, Z1t = (yt',t)' and
.
The following statements use the JOHANSEN option to compute the
Johansen cointegration rank test of integrated order 1:
proc varmax data=simul2;
model y1 y2 / p=2 cointtest=(johansen=(normalize=y1));
run;
|
| Cointegration Rank Test |
| H_0: Rank=r |
H_1: Rank>r |
Eigenvalue |
Trace |
Critical Value |
DriftInECM |
DriftInProcess |
| 0 |
0 |
0.4644 |
61.75 |
15.34 |
Constant |
Linear |
| 1 |
1 |
0.0056 |
0.56 |
3.84 |
|
|
| Cointegration Rank Test under the Restriction |
| H_0: Rank=r |
H_1: Rank>r |
Eigenvalue |
Trace |
Critical Value |
DriftInECM |
DriftInProcess |
| 0 |
0 |
0.5209 |
76.38 |
19.99 |
Constant |
Constant |
| 1 |
1 |
0.0426 |
4.27 |
9.13 |
|
|
|
Figure 4.40: Cointegration Rank Test (JOHANSEN option)
Suppose that the model has an intercept term.
The first table in Figure 4.32 shows
the trace statistics based on case 3;
the second table, case 2. The output indicates that
the series are cointegrated with rank 1.
|
Test of the Restriction when Rank=r |
| Hypo |
DriftInECM |
DriftInProcess |
| H_0 |
Constant |
Linear |
| H_1 |
Constant |
Constant |
| Test of the Restriction when Rank=r |
| Rank |
Eigenvalue On Restrict |
Eigenvalue |
Chi-Square |
DF |
Prob>ChiSq |
| 0 |
0.5209 |
0.4644 |
14.63 |
2 |
0.0007 |
| 1 |
0.0426 |
0.0056 |
3.71 |
1 |
0.0540 |
|
Figure 4.41: Cointegration Rank Test Continued
Figure 4.33 shows which result, case 3 (the hypothesis H_0) and case 2
(the hypothesis H_1), is appropriate.
Since the cointegration rank is chosen to be 1 by the result in
Figure 4.32, look at the last row corresponding to rank=1.
|
| Long-Run Parameter BETA Estimates |
| Variable |
Dummy 1 |
Dummy 2 |
| y1 |
1.00000 |
1.00000 |
| y2 |
-2.04869 |
-0.02854 |
Adjustment Coefficient ALPHA Estimates |
| Variable |
Dummy 1 |
Dummy 2 |
| y1 |
-0.46421 |
-0.00502 |
| y2 |
0.17535 |
-0.01275 |
|
Figure 4.42: Cointegration Rank Test Continued
Figure 4.34 shows the estimates of long-run parameter and
adjustment coefficients based on case 3.
Considering that the cointegration rank is 1,
the long-run relationship of the series is
-
y1t = 2.049 y2t
|
Long-Run Coefficient BETA based on the Restricted Trend |
| Variable |
Dummy 1 |
Dummy 2 |
Dummy 3 |
| y1 |
1.00000 |
1.00000 |
1.00000 |
| y2 |
-2.04366 |
-2.75773 |
-0.73198 |
| 1 |
6.75919 |
101.37051 |
-42.50051 |
Adjustment Coefficient ALPHA based on the Restricted Trend |
| Variable |
Dummy 1 |
Dummy 2 |
Dummy 3 |
| y1 |
-0.48015 |
0.01091 |
5.96583E-14 |
| y2 |
0.12538 |
0.03722 |
-2.834E-13 |
|
Figure 4.43: Cointegration Rank Test Continued
Figure 4.35 shows the estimates of long-run parameter and
adjustment coefficients based on case 2.
Considering that the cointegration rank is 1,
the long-run relationship of the series is
-
y1t = 2.044 y2t - 6.760
Estimation of Vector Error Correction Model
Now consider cases 1, 3, and 5 for the
vector error correction model.
The preceding log-likelihood function is maximized for
![\hat {{\beta}} &=& S_{11}^{-1/2} [v_1, ... ,v_r] \\hat {{\alpha}} &=& S_{01}\hat... ...{2} \hat \Psi' - Z_{1} \hat\Pi')' (Z_{0} - Z_{2} \hat \Psi' - Z_{1} \hat\Pi')/T](images/vareq366.gif)
The estimators of the orthogonal complements of
and
are
![\hat {{\beta}}_{\bot}=S_{11} [v_{r+1}, ... ,v_{k}]](images/vareq367.gif)
and
![\hat {{\alpha}}_{\bot}=S_{00}^{-1} S_{01} [v_{r+1}, ... ,v_{k}]](images/vareq368.gif)
The ML estimators have the following asymptotic properties:
![{\sqrt T} {\rm vec}([\hat \Pi,\hat \Psi] - [\Pi, \Psi]) \stackrel{d}{arrow} N(0, \Sigma_{co})](images/vareq369.gif)
where
![\Sigma_{co}=\Sigma \otimes ( [ {\beta} & 0 \ 0 & I_k ] \Omega^{-1} [ {\beta}' & 0 \ 0 & I_k ] )](images/vareq370.gif)
and
![\Omega={\rm plim} \frac{1}T [ {\beta}'Z_{1}'Z_{1}{\beta} & {\beta}'Z_{1}'Z_{2} \ Z_{2}'Z_{1}{\beta} & Z_{2}'Z_{2} \ ]](images/vareq371.gif)
Test for the Linear Restriction of 
Consider the example with the variables mt, log real money,
yt, log real income, idt, deposit interest rate, and
ibt, bond interest rate. It seems a natural hypothesis that
in the long-run relation, money and income have equal coefficients
with opposite signs.
This can be formulated as the hypothesis that the cointegrated
relation contains only mt and yt through mt - yt.
For the analysis, you can express these restrictions in the
parameterization
of H such that
, where H is a known
k×s matrix
and
is the
parameter matrix to be estimated. For this example, H is given by
![H=[ 1 & 0 & 0 \ -1 & 0 & 0 \ 0 & 1 & 0 \ 0 & 0 & 1 \ ]](images/vareq374.gif)
Restriction
When the linear restriction
is given, it implies
that the same restrictions are imposed on all cointegrating vectors.
You obtain the maximum likelihood estimator of
by reduced
rank regression of
on Hyt-1
corrected for
,
solving the following equation

for the eigenvalues
and eigenvectors
(v1, ... ,vs),
Sij are given in the preceding section.
Then choose
corresponding to
the r largest eigenvalues, and the
is
.The test statistic for
is given by

If the data have no deterministic trend, the constant term should be
restricted by
like Case 2.
Then H is given by
![H=[ 1 & 0 & 0 & 0\ -1 & 0 & 0 & 0\ 0 & 1 & 0 & 0\ 0 & 0 & 1 & 0\ 0 & 0 & 0 & 1\ ]](images/vareq382.gif)
The following statements test that
:
proc varmax data=simul2;
model y1 y2 / p=2 ecm=(rank=1 normalize=y1);
cointeg rank=1 h=(1,-2);
run;
|
Long-Run Coefficient BETA with respect to Hypothesis on BETA |
| Variable |
Dummy 1 |
| y1 |
1.00000 |
| y2 |
-2.00000 |
Adjustment Coefficient ALPHA with respect to Hypothesis on BETA |
| Variable |
Dummy 1 |
| y1 |
-0.47404 |
| y2 |
0.17534 |
Test for Restricted Long-Run Coefficient BETA |
| Index |
EigenvalueOnRestrict |
Eigenvalue |
Chi-Square |
DF |
Prob>ChiSq |
| 1 |
0.4616 |
0.4644 |
0.51 |
1 |
0.4738 |
|
Figure 4.44: Testing of Linear Restriction
(H= option)
Figure 4.36 shows the results of testing
.The input H matrix is H=(1, -2)'. The adjustment coefficient is
reestimated under the restriction, and the test indicates
that you cannot reject the null hypothesis.
Test for the Weak Exogeneity and Restrictions of

Consider a vector error correction model:

Divide the process yt into (y1t',y2t')'
with dimension k1 and k2 and the
into
![\Sigma=[ \Sigma_{11} & \Sigma_{12} \ \Sigma_{21} & \Sigma_{22} ]](images/vareq386.gif)
Similarly, the parameters can be decomposed as follows:
![{\alpha}=[ {\alpha_1} \ {\alpha_2} ] \Phi^*_i=[ \Phi^*_{1i} \ \Phi^*_{2i} ] A=[ A_{1} \ A_{2} ]](images/vareq387.gif)
Then the VECM(p) form can be rewritten using the decomposed parameters
and processes:
![[ \Delta y_{1t} \ \Delta y_{2t} ]=[ {\alpha_1} \ {\alpha_2} ] {\beta}'y_{t... ...elta y_{t-i} + [ A_{1} \ A_{2} ] D_t + [ {\epsilon}_{1t} \ {\epsilon}_{2t} ]](images/vareq388.gif)
The conditional model for y1t given y2t is

and the marginal model of y2t,

where
.The test of weak exogeneity of y2t for the parameters
determines whether
.Weak exogeneity means that there is no information about
in the
marginal model or that the variables y2t do not react to a
disequilibrium.
Restriction
Consider the null hypothesis
,
where J is a k×m matrix with
.From the previous residual regression equation

you can obtain

where
and
is orthogonal to J such
that
.Define

and let
.Then
can be written

Using the marginal distribution of
and
the conditional distribution of
, the new
residuals are computed as

where

In terms of
and
, the
MLE of
is computed by using the reduced rank regression.
Let

Under the null hypothesis
, the MLE
is computed by solving the equation

Then
, where the eigenvectors
correspond to the r largest eigenvalues.
The likelihood ratio test for
is

The test of weak exogeneity of y2t is the special case of
the test
, considering J=(Ik1,0)'.
Consider the previous example with four variables
( mt, yt, itb, itd ). If r=1,
you formulate the weak exogeneity of (yt,itb,itd)
for mt as J=[1, 0, 0, 0]'
and the weak exogeneity of itd for (mt, yt, itb)
as J = [I3,0]'.
The following statements test the weak exogeneity of other variables:
proc varmax data=simul2;
model y1 y2 / p=2 ecm=(rank=1 normalize=y1);
cointeg rank=1 exogeneity;
run;
|
Tests of Weak Exogeneity of Each of Variables |
| Variable |
Chi-Square |
DF |
Prob>ChiSq |
| y1 |
53.46 |
1 |
<.0001 |
| y2 |
8.76 |
1 |
0.0031 |
|
Figure 4.45: Testing of Weak Exogeneity (EXOGENEITY option)
Figure 4.37 shows that each variable is not the weak
exogeneity of other variable.
Forecasting of the VECM
Consider the cointegrated moving-average representation
of the differenced process of yt

Assume that y0 = 0. The linear process yt
can be written

Therefore, for any l > 0,

The l-step-ahead forecast is derived from the preceding equation:

Note that

since
and
. The long-run forecast of the cointegrated system
shows that the cointegrated relationship holds, though
there might exist some deviations from the equilibrium status
in the short-run.
The covariance matrix of the predict error
is
![\Sigma(l)=\sum_{i=1}^l[(\sum_{j=0}^{l-i}\Psi_j)\Sigma (\sum_{j=0}^{l-i}\Psi_j')]](images/vareq422.gif)
When the linear process is represented as a VECM(p) model,
you can obtain

The transition equation is defined as

where
is a state vector and the
transition matrix is
![F=[ I_k & I_k & 0 & ... & 0 \ \Pi &(\Pi+\Phi^*_1)&\Phi^*_2 & ... &\Phi^*_{... ... & 0 \ \vdots & \vdots & \vdots & \ddots & \vdots \ 0 & 0 & ... & I_k & 0 \ ]](images/vareq426.gif)
where <U>0U> is a k-dimensional zero vector.
The observation equation can be written

where H = [Ik,Ik,0, ... ,0].
The l-step-ahead forecast is computed as

Copyright © 2000 by SAS Institute Inc., Cary, NC, USA. All rights reserved.