Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
The MIANALYZE Procedure

Examples of the Complete-Data Inferences

For a given parameter of interest, it is not always possible to compute the estimate and associated covariance matrix directly from a SAS procedure. This section gives examples of parameters with their estimates and associated covariance matrices, which provide the input to the MIANALYZE procedure. Some are straightforward, and others require special techniques.

Means

For a population mean vector mu,the usual estimate is the sample mean vector
{\overline y}=\frac{1}n \sum {y_{i}}

A variance estimate for {\overline y} is [1/n]S, where S is the sample covariance matrix

S=\frac{1}{\, n-1 \,} \sum { ( y_{i} - {\overline y} ) ( y_{i} - {\overline y} )' }

These statistics can be computed from a procedure such as CORR. This approach is illustrated in Example 10.1.

Regression Coefficients

Many SAS procedures are available for regression analysis. Among them, REG provides the most general analysis capabilities, and others like LOGISTIC and MIXED provide more specialized analyses.

Some regression procedures, such as REG and LOGISTIC, create an EST type data set that contains both the parameter estimates for the regression coefficients and their associated covariance matrix. You can read an EST type data set in the MIANALYZE procedure with the DATA= option. This approach is illustrated in Example 10.2.

Other procedures, such as GLM, MIXED, and GENMOD, do not generate EST type data sets for regression coefficients. For MIXED and GENMOD, you can use ODS OUTPUT statement to save parameter estimates in a data set and the associated covariance matrix in a separate data set. These data sets are then read in the MIANALYZE procedure with the PARMS= and COVB= options, respectively. This approach is illustrated in Example 10.3 for PROC MIXED and in Example 10.4 for PROC GENMOD.

PROC GLM does not display tables for covariance matrices. However, you can use the ODS OUTPUT statement to save parameter estimates and associated standard errors in a data set and the associated (X'X)-1 matrix in a separate data set. These data sets are then read in the MIANALYZE procedure with the PARMS= and XPXI= options, respectively. This approach is illustrated in Example 10.5.

Correlation Coefficients

For the population correlation coefficient \rho,a point estimate is the sample correlation coefficient r. However, for nonzero \rho, the distribution of r is skewed.

The distribution of r can be normalized through Fisher's Z transformation

z(r) = [1/2]  log ( [(1+r)/(1-r)] )

z(r) is approximately normally distributed with mean z(\rho)and variance 1/(n-3).

With a point estimate \hat z and an approximate 95% confidence interval (z1, z2) for z(\rho),a point estimate \hat r and a 95% confidence interval (r1, r2) for \rho can be obtained by applying the inverse transformation

r = [(e2z - 1)/(e2z + 1)]
to z=\hat{z}, z_{1}, and z2.

This approach is illustrated in Example 10.3.

Ratios of Variable Means

For the ratio {\mu_{1}}/{\mu_{2}} of means for variables Y1 and Y2, the point estimate is {\overline y_{1}}/{\overline y_{2}},the ratio of the sample means. The Taylor expansion and delta method can be applied to the function y1/y2 to obtain the variance estimate (Schafer 1997, p. 196)
\frac 1{n} [ {( \frac {\overline{y}_{1}}{\overline{y}_{2}^2} )}^2 s_{22} - 2 {... ...rac 1{\overline{y}_{2}} )} s_{12} + {( \frac 1{\overline{y}_{2}} )}^2 s_{11} ]
where s11 and s22 are the sample variances of Y1 and Y2, respectively, and s12 is the sample covariance between Y1 and Y2.

A ratio of sample means will be approximately unbiased and normally distributed if the coefficient of variation of the denominator (the standard error for the mean divided by the estimated mean) is 10% or less (Cochran 1977, p. 166; Schafer 1997, p. 196). This approach is illustrated in Example 10.7.

Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
Top
Top

Copyright © 2001 by SAS Institute Inc., Cary, NC, USA. All rights reserved.