Mixed Models |
The example in this section contains information on a study investigating the heights of individuals sampled from different families. The response variable Height measures the height (in inches) of 18 individuals that are classified according to Family and Gender. Since the data occurs in clusters (families), it is very likely that observations from the same family are statistically correlated and not independent. In this case, it is inappropriate to analyze the data using a standard linear model.
A simple way to model the correlation is through the use of a Family random effect. The Family effect is assumed to be normally distributed with mean of zero and some unknown variance. Defining Family as a random effect sets up a common correlation among all observations having the same level of family.
In addition, a female within a certain family may exhibit more correlation with other females in that same family than with the males in that family, and likewise for males. Defining Family*Gender as a random effect models an additional correlation for all observations having the same value of both Family and Gender.
Based on your selections, the Mixed Models task constructs the X matrix by creating indicator variables for the Gender effect and including a column of 1s to model the global intercept. The Z matrix contains indicator variables for both the Family effect and the Family*Gender interaction.
Figure 15.9: Mixed Model: Plots Dialog
When you have completed your selections, click OK in the main dialog to perform the analysis.
Figure 15.10: Mixed Models: Project Tree
Double-click on the Analysis node in the project tree to view the contents in a separate window.
Figure 15.11: Mixed Models: Analysis Results
Figure 15.11 displays the mixed models analysis results for the clustered Heights data. The covariance parameter estimates for Family, Family*Gender, and the residual variance are 2.4, 1.8, and 2.2, respectively. The "Test of Fixed Effects" table contains a significance test for the single fixed effect, Gender. With a p-value of 0.0712, the Type 3 test of Gender is not significant at the level of significance. Note that the denominator degrees of freedom for the Type 3 test are computed using a general Satterthwaite approximation. A benefit of performing a random effects analysis using both Family and Family*Gender as random effects is that you can make inferences about gender that apply to an entire population of families, not necessarily to the specific families in this study.
Figure 15.12: Mixed Models: Residuals Plot
Figure 15.12 displays a plot of the residuals versus predicted values that includes random effects, versus . Plots are useful for checking model assumptions and identifying potential outlying and influential observations. Based on the plot in Figure 15.12, the data seem to exhibit relatively constant variance across predicted values, and there do not appear to be any outliers or influential observations.
Copyright © 2007 by SAS Institute Inc., Cary, NC, USA. All rights reserved.