Distribution Analyses |
When outliers are present in the data, trimmed and Winsorized means are robust estimators of the population mean that are relatively insensitive to the outlying values. Therefore, trimming and Winsorization are methods for reducing the effects of extreme values in the sample.
The k-times trimmed mean is calculated as
The k-times Winsorized mean is calculated as
For a symmetric distribution, the symmetrically trimmed or Winsorized mean is an unbiased estimate of the population mean. But the trimmed or Winsorized mean does not have a normal distribution even if the data are from a normal population.
The Winsorized sum of squared deviations is defined as
A robust estimate of the variance of the trimmed mean can be based on the Winsorized sum of squared deviations (Tukey and McLaughlin 1963). The resulting trimmed t test is given by
A Winsorized t test is given by
When the data are from a symmetric distribution, the distribution of the trimmed t statistic ttk or the Winsorized t statistic twk can be approximated by a Student's t distribution with n-2k-1 degrees of freedom (Tukey and McLaughlin 1963, Dixon and Tukey 1968).
You can specify the number or percentage of observations to be trimmed or Winsorized from each end either by using the Trimmed/Winsorized Means options dialog or by using the Trimmed/Winsorized Means dialog after choosing Tables:Trimmed/Winsorized Mean:(1/2)N or Tables:Trimmed/Winsorized Mean:(1/2)Percent from the menus.
Figure 38.15: (1/2)N Menu
Figure 38.16: (1/2)Percent Menu
If you specify a percentage, 100 p%, 0<p<1, the smallest integer greater than or equal to np is trimmed or Winsorized from each end.
The Trimmed Mean and Winsorized Mean tables, as shown in Figure 38.17, contain the following statistics:
Figure 38.17: Trimmed Means and Winsorized Means Tables
Copyright © 2007 by SAS Institute Inc., Cary, NC, USA. All rights reserved.