|
Chapter Contents |
Previous |
Next |
| The MI Procedure |
With m imputations, m different sets of the point
and variance estimates for a parameter Q can be computed.
Suppose
and
are the point and variance estimates
from the ith imputed data set, i=1, 2, ..., m.
Then the combined point estimate for Q from multiple imputation is
the average of the m complete-data estimates:

Suppose
is the within-imputation variance,
which is the average of the m complete-data estimates:

and B is the between-imputation variance

Then the variance estimate associated with
is the total variance (Rubin 1987)

The statistic
is approximately distributed
as t with vm degrees of freedom (Rubin 1987), where
![v_{m}=(m-1) [1 + \frac{{\overline U}}{(1+m^{-1})B} ]^2](images/mieq121.gif)
When the complete-data degrees of freedom v0 is small, and there is only a modest proportion of missing data, the computed degrees of freedom, vm, can be much larger than v0, which is inappropriate. Barnard and Rubin (1999) recommend the use of an adjusted degrees of freedom
![v_{m}^{*}=\, [ \frac{1}{v_{m}} + \frac{1}{\hat{v}_{obs}} ] ^{-1}](images/mieq122.gif)
where
and
.
Note that the MI procedure uses the adjusted degrees of freedom, vm*, for inference.
The degrees of freedom vm depends on m and the ratio

The ratio r is called the relative increase in variance
due to nonresponse (Rubin 1987).
When there is no missing information about Q,
the values of r and B are both zero.
With a large value of m or a small value of r,
the degrees of freedom v will be large and the distribution of
will be approximately normal.
Another useful statistic is the fraction of missing information about Q:

Both statistics r and
are helpful diagnostics for
assessing how the missing data contribute to the uncertainty
about Q.
|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 2001 by SAS Institute Inc., Cary, NC, USA. All rights reserved.