

With m imputations, m different sets of the point and variance estimates for a parameter Q can be computed. Suppose
and
are the point and variance estimates from the ith imputed data set, i = 1, 2, …, m. Then the combined point estimate for Q from multiple imputation is the average of the m complete-data estimates:
![\[ \overline{Q} = \frac{1}{m} \sum _{i=1}^{m} \hat{Q_ i} \]](images/statug_mi0292.png)
Suppose
is the within-imputation variance,
which is the average of the m complete-data estimates,
![\[ \overline{W} = \frac{1}{m} \sum _{i=1}^{m} \hat{W_ i} \]](images/statug_mi0294.png)
and B is the between-imputation variance
![\[ B = \frac{1}{m-1} \sum _{i=1}^{m} (\hat{Q_ i}-\overline{Q})^2 \]](images/statug_mi0295.png)
Then the variance estimate associated with
is the total variance (Rubin 1987)
![\[ T = \overline{W} + (1+\frac{1}{m}) B \]](images/statug_mi0297.png)
The statistic
is approximately distributed as t with
degrees of freedom (Rubin 1987),
where
![\[ v_{m} = (m-1) {\left[ 1 + \frac{\overline{W}}{(1+m^{-1})B} \right]}^2 \]](images/statug_mi0300.png)
The degrees of freedom
depend on m and the ratio
![\[ r = \frac{(1+m^{-1})B}{\overline{W}} \]](images/statug_mi0301.png)
The ratio r is called the relative increase in variance
due to nonresponse (Rubin 1987). When there is no missing information about Q, the values of r and B are both zero. With a large value of m or a small value of r, the degrees of freedom
will be large and the distribution of
will be approximately normal.
Another useful statistic is the fraction of missing information about Q:
![\[ \hat{\lambda } = \frac{r+2/(v_{m}+3)}{r+1} \]](images/statug_mi0302.png)
Both statistics r and
are helpful diagnostics for assessing how the missing data contribute to the uncertainty about Q.
When the complete-data degrees of freedom
are small, and there is only a modest proportion of missing data, the computed degrees of freedom,
, can be much larger than
, which is inappropriate. For example, with m = 5 and r = 10%, the computed degrees of freedom
, which is inappropriate for data sets with complete-data degrees of freedom less than 484.
Barnard and Rubin (1999) recommend the use of adjusted degrees of freedom
![\[ v_{m}^{*} = \, \left[ \frac{1}{v_{m}} + \frac{1}{\hat{v}_{\mathit{obs}}} \right] ^{-1} \]](images/statug_mi0305.png)
where
and
.
Note that the MI procedure uses the adjusted degrees of freedom,
, for inference.