|
Chapter Contents |
Previous |
Next |
| The MI Procedure |
The monotone data MCMC method was first proposed by Li (1988), and Liu (1993) described the algorithm. The method is useful especially when a data set is close to having a monotone missing pattern. In this case, the method only needs to impute a few missing values to the data set to have a monotone missing pattern in the imputed data set. Compared to a full data imputation that imputes all missing values, the monotone data MCMC method imputes fewer missing values in each iteration and achieves approximate stationarity in fewer iterations (Schafer 1997, p. 227).
You can request the monotone MCMC method by specifying the option IMPUTE=MONOTONE in the MCMC statement. The "Missing Data Patterns" table now denotes the variables with missing values by "." or "O". A "." means that the variable is missing and will be imputed and an "O" means that the variable is missing and will not be imputed. The tables of "Multiple Imputation Variance Information" and "Multiple Imputation Parameter Estimates" are not created.
You must specify the variables in the VAR statement. The variable order in the list determines the monotone missing pattern in the imputed data set. With a different order in the VAR list, the results will be different because the monotone missing pattern to be constructed will be different.
Assuming that the data are from a multivariate normal distribution, then similar to the MCMC method, the monotone MCMC method repeats the following steps:
1. The imputation I-step: Given an estimated mean vector and covariance matrix, the I-step simulates the missing values for each observation independently. Only a subset of missing values are simulated to achieve a monotone pattern of missingness.
2. The posterior P-step: Given a new sample with a monotone pattern of missingness, the P-step simulates the posterior population mean vector and covariance matrix with a noninformative Jeffreys prior. These new estimates are then used in the next I-step.
That is, for the variable Yj, a model

is fitted using nonmissing observations.
The fitted model consists of the regression parameter estimates
and the associated covariance matrix
,where Vj is the usual X'X inverse matrix
from the intercept and variables Y1, Y2, ... , Yj-1.
For each imputation,
new parameters
and
are drawn from the posterior predictive
distribution of the parameters.
That is, they are simulated from
,
, and Vj.
The variance is drawn as


These simulated values of
and
are then used to re-create the parameters
and
.
For a detailed description of how to produce monotone-missingness
with the MCMC method for a multivariate normal data,
refer to Schafer (1997, pp. 226 -235).
|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 2001 by SAS Institute Inc., Cary, NC, USA. All rights reserved.