The MIANALYZE Procedure

Multivariate Inferences

Multivariate inference based on Wald tests can be done with $\text{[math]}$ imputed data sets. The approach is a generalization of the approach taken in the univariate case (Rubin 1987, p. 137; Schafer 1997, p. 113). Suppose that $\text{[math]}$ and $\text{[math]}$ are the point and covariance matrix estimates for a $\text{[math]}$ -dimensional parameter $\text{[math]}$ (such as a multivariate mean) from the $\text{[math]}$ imputed data set, $\text{[math]}$ = 1, 2, ..., $\text{[math]}$ . Then the combined point estimate for $\text{[math]}$ from the multiple imputation is the average of the $\text{[math]}$ complete-data estimates:

$\text{[math]}$

Suppose that $\text{[math]}$ is the within-imputation covariance matrix, which is the average of the $\text{[math]}$ complete-data estimates:

$\text{[math]}$

And suppose that $\text{[math]}$ is the between-imputation covariance matrix:

$\text{[math]}$

Then the covariance matrix associated with $\text{[math]}$ is the total covariance matrix

$\text{[math]}$

The natural multivariate extension of the $\text{[math]}$ statistic used in the univariate case is the $\text{[math]}$ statistic

$\text{[math]}$

with degrees of freedom $\text{[math]}$ and

$\text{[math]}$

where

$\text{[math]}$

is an average relative increase in variance due to nonresponse (Rubin 1987, p. 137; Schafer 1997, p. 114).

However, the reference distribution of the statistic $\text{[math]}$ is not easily derived. Especially for small $\text{[math]}$ , the between-imputation covariance matrix $\text{[math]}$ is unstable and does not have full rank for $\text{[math]}$ (Schafer 1997, p. 113).

One solution is to make an additional assumption that the population between-imputation and within-imputation covariance matrices are proportional to each other (Schafer 1997, p. 113). This assumption implies that the fractions of missing information for all components of $\text{[math]}$ are equal. Under this assumption, a more stable estimate of the total covariance matrix is

$\text{[math]}$

With the total covariance matrix $\text{[math]}$ , the $\text{[math]}$ statistic (Rubin 1987, p. 137)

$\text{[math]}$

has an $\text{[math]}$ distribution with degrees of freedom $\text{[math]}$ and $\text{[math]}$ , where

$\text{[math]}$

For $\text{[math]}$ , PROC MIANALYZE uses the degrees of freedom $\text{[math]}$ in the analysis. For $\text{[math]}$ , PROC MIANALYZE uses $\text{[math]}$ , a better approximation of the degrees of freedom given by Li, Raghunathan, and Rubin (1991):

$\text{[math]}$