|
Chapter Contents |
Previous |
Next |
| The MI Procedure |
The MI procedure assumes that the data are from a continuous multivariate distribution and contain missing values that can occur on any of the variables. It also assumes that the data are from a multivariate normal distribution when either the regression method or the MCMC method is used.
Suppose Y is the n×p matrix of complete data, which is not fully observed, and denote the observed part of Y by Yobs and the missing part by Ymis. The SAS MI and MIANALYZE procedures assume that the missing data are missing at random (MAR), that is, the probability that an observation is missing can depend on Yobs, but not on Ymis (Rubin 1976; 1987, p. 53).
To be more precise, suppose that R is the n×p matrix of response indicators whose elements are zero or one depending on whether the corresponding elements of Y are missing or observed. Then the MAR assumption is that the distribution of R can depend on Yobs but not on Ymis.

For example, consider a trivariate data set with variables Y1 and Y2 fully observed, and a variable Y3 that has missing values. MAR assumes that the probability that Y3 is missing for an individual can be related to the individual's values of variables Y1 and Y2, but not to its value of Y3. On the other hand, if a complete case and an incomplete case for Y3 with exactly the same values for variables Y1 and Y2 have systematically different values, then there exists a response bias for Y3, and MAR is violated.
The MAR assumption is not the same as missing completely at random (MCAR), which is a special case of MAR. Under the MCAR assumption, the missing data values are a simple random sample of all data values; the missingness does not depend on the values of any variables in the data set.
Furthermore, the MI and MIANALYZE procedures assume that
the parameters
of the data model and
the parameters
of the model for the missing data indicators
are distinct.
That is, knowing the values of
does not provide any
additional information about
,
and vice versa.
If both the MAR and distinctness assumptions are satisfied,
the missing-data mechanism is said to be ignorable
(Rubin 1987, pp. 50 -54; Schafer 1997, pp. 10 -11) .
|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 2001 by SAS Institute Inc., Cary, NC, USA. All rights reserved.