Previous Page  Next Page 
Box Plots and Mosaic Plots

Multiple Comparison Options

Box plots enable you to examine means in different groups. Statistical questions you might have about the group means include

From the Multiple Comparison Options dialog, you can select a multiple comparison of means test and a confidence level for the test. Multiple comparison tests enable you to infer differences between means and also to construct simultaneous confidence intervals for these differences.

All of the tests implemented in SAS/INSIGHT software are constructed assuming that the displayed variables are independent and normally distributed with identical variance. For details, refer to Hsu (1996).

Each of the tests available in SAS/INSIGHT software is described below. In the descriptions that follow, k is the number of categories (that is, the number of boxes in the box plot), ni is the number of observations for the ith category, \mu_i is the true mean for the ith category, \hat{ \mu }_i is the sample mean for the ith category, \nu= \sum_{i=1}^k (n_i-1) is the total degrees of freedom, and \hat{\sigma} is the root mean square error, also known as the pooled standard deviation. Each test creates a table showing 100(1-\alpha)\% confidence intervals for the difference \hat{ \mu }_i - \hat{ \mu }_j, i \neq j, i = 1 ... k.

mcc2.gif (2950 bytes)

Figure 33.8: Multiple Comparison Options

The Pairwise t-test is not a true simultaneous comparison test, but rather uses a pairwise t test to provide confidence intervals about the difference between two means. These intervals have a half-width equal to t_{\alpha/2,\nu} \hat{\sigma} \sqrt{ n_i^{-1} + n_j^{-1}}. Although each confidence interval was computed at the 100(1-\alpha)\% level, the probability that all of your confidence intervals are correct simultaneously is less than 100(1-\alpha)\%.The actual simultaneous confidence for the t-based intervals is approximately 100(1-k\alpha)\%. For example, for five groups the actual simultaneous confidence for the t-based intervals is approximately only 75%.

The Tukey-Kramer method is a true "multiple comparison" test, appropriate when all pairwise comparisons are of interest; it is the default test used. The test is an exact \alpha-level test if the sample sizes are the same, and it is slightly conservative for unequal sample sizes. The confidence interval around the point-estimate \hat{ \mu }_i - \hat{ \mu }_j has half-width q^* \hat{\sigma} \sqrt{ n_i^{-1} + n_j^{-1}}. It is a common convention to report the quantity \sqrt{2}q^* as the Tukey-Kramer quantile, rather than just q*.

The Pairwise Bonferroni method is also appropriate when all pairwise comparisons are of interest. It is conservative; that is, Bonferroni tests performed at a nominal significance level of \alphaactually have a somewhat greater level of significance. The Bonferroni method uses the t distribution, like the pairwise t test, but returns smaller intervals with half-width t_{\alpha/(k(k-1)),\nu} \hat{\sigma}\sqrt{ n_i^{-1} + n_j^{-1}}.Note that the t probability (\alpha/2, since this is a two-sided test) is divided by the total number of pairwise comparisons (k(k-1)/2). The Bonferroni test produces wider confidence intervals than the Tukey-Kramer test.

Dunnett's Test with Control is a two-sided multiple comparison method used to compare a set of categories to a control group. The quantile that scales the confidence interval is usually denoted |d|. If the ith confidence interval does not include zero, you may infer that the ith group is significantly different from the control. A control group may be a placebo or null treatment, or it may be a standard treatment. While the interactive nature of SAS/INSIGHT enables you to select any category to use as the basis of comparison in Dunnett's test, you should select a category only if it truly is the control group. To select a category, click on the corresponding comparison circle.

Hsu's Test for Best can be used to screen out group means that are statistically less than the (unknown) largest true mean. It forms nonsymmetric confidence intervals around the difference between the largest sample mean and each of the others. If an interval does not properly contain zero in its interior, then you may infer that the associated group is not among the best.

Similary, Hsu's Test for Worst can be used to screen out group means that are statistically greater than the (unknown) smallest true mean. If an interval does not properly contain zero in its interior, then you may infer that the true mean of that group is not equal to the (unknown) smallest true mean.

Previous Page  Next Page  Top of Page

Copyright © 2007 by SAS Institute Inc., Cary, NC, USA. All rights reserved.