The MI Procedure |
This example uses the EM algorithm to compute the maximum likelihood estimates for parameters of multivariate normally distributed data with missing values. The following statements invoke the MI procedure and request the EM algorithm to compute the MLE for of a multivariate normal distribution from the input data set FitMiss:
proc mi data=FitMiss seed=1518971 simple nimpute=0; em itprint outem=outem; var Oxygen RunTime RunPulse; run;
Note that when you specify the NIMPUTE=0 option, the missing values are not imputed.
The "Model Information" table in Output 54.1.1 describes the method and options used in the procedure if a positive number is specified in the NIMPUTE= option.
Model Information | |
---|---|
Data Set | WORK.FITMISS |
Method | MCMC |
Multiple Imputation Chain | Single Chain |
Initial Estimates for MCMC | EM Posterior Mode |
Start | Starting Value |
Prior | Jeffreys |
Number of Imputations | 0 |
Number of Burn-in Iterations | 200 |
Number of Iterations | 100 |
Seed for random number generator | 1518971 |
The "Missing Data Patterns" table in Output 54.1.2 lists distinct missing data patterns with corresponding frequencies and percents. Here, a value of "X" means that the variable is observed in the corresponding group and a value of "." means that the variable is missing. The table also displays group-specific variable means.
With the SIMPLE option, the procedure displays simple descriptive univariate statistics for available cases in the "Univariate Statistics" table in Output 54.1.3 and correlations from pairwise available cases in the "Pairwise Correlations" table in Output 54.1.4.
When you use the EM statement, the MI procedure displays the initial parameter estimates for the EM algorithm in the "Initial Parameter Estimates for EM" table in Output 54.1.5.
When you use the ITPRINT option in the EM statement, the "EM (MLE) Iteration History" table in Output 54.1.6 displays the iteration history for the EM algorithm.
EM (MLE) Iteration History | ||||
---|---|---|---|---|
_Iteration_ | -2 Log L | Oxygen | RunTime | RunPulse |
0 | 289.544782 | 47.116179 | 10.688214 | 171.863636 |
1 | 263.549489 | 47.116179 | 10.688214 | 171.863636 |
2 | 255.851312 | 47.139089 | 10.603506 | 171.538203 |
3 | 254.616428 | 47.122353 | 10.571685 | 171.426790 |
4 | 254.494971 | 47.111080 | 10.560585 | 171.398296 |
5 | 254.483973 | 47.106523 | 10.556768 | 171.389208 |
6 | 254.482920 | 47.104899 | 10.555485 | 171.385257 |
7 | 254.482813 | 47.104348 | 10.555062 | 171.383345 |
8 | 254.482801 | 47.104165 | 10.554923 | 171.382424 |
9 | 254.482800 | 47.104105 | 10.554878 | 171.381992 |
10 | 254.482800 | 47.104086 | 10.554864 | 171.381796 |
11 | 254.482800 | 47.104079 | 10.554859 | 171.381708 |
12 | 254.482800 | 47.104077 | 10.554858 | 171.381669 |
The "EM (MLE) Parameter Estimates" table in Output 54.1.7 displays the maximum likelihood estimates for and of a multivariate normal distribution from the data set FitMiss.
You can also output the EM (MLE) parameter estimates to an output data set with the OUTEM= option. The following statements list the observations in the output data set outem:
proc print data=outem; title 'EM Estimates'; run;
The output data set outem in Output 54.1.8 is a TYPE=COV data set. The observation with _TYPE_=‘MEAN’ contains the MLE for the parameter , and the observations with _TYPE_=‘COV’ contain the MLE for the parameter of a multivariate normal distribution from the data set FitMiss.
Copyright © 2009 by SAS Institute Inc., Cary, NC, USA. All rights reserved.