
See SHWBOXA in the SAS/QC Sample Library.
In this example, the BOXCHART statement is used to create a summary data set that can be read later by the SHEWHART procedure (as in the preceding example). The following statements read measurements from the data set Turbine and create a summary data set named Turbhist:
title 'Summary Data Set for Power Output';
proc shewhart data=Turbine;
   boxchart KWatts*Day / outhistory = Turbhist
                         nochart;
run;
The OUTHISTORY= option names the output data set, and the NOCHART option suppresses the display of the chart, which would be identical to the chart in Figure 17.4.
Figure 17.8 contains a partial listing of Turbhist.
Figure 17.8: The Summary Data Set Turbhist
Summary Data Set for Power Output

| Obs | Day | KWattsL | KWatts1 | KWattsX | KWattsM | KWatts3 | KWattsH | KWattsS | KWattsN |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 04JUL | 3180 | 3340.0 | 3487.40 | 3490.0 | 3610.0 | 4050 | 220.260 | 20 |
| 2 | 05JUL | 3179 | 3333.5 | 3471.65 | 3419.5 | 3605.0 | 3849 | 210.427 | 20 |
| 3 | 06JUL | 3304 | 3376.0 | 3488.30 | 3456.5 | 3604.5 | 3781 | 147.025 | 20 |
| 4 | 07JUL | 3045 | 3390.5 | 3434.20 | 3447.0 | 3550.0 | 3629 | 157.637 | 20 |
| 5 | 08JUL | 2968 | 3321.0 | 3475.80 | 3487.0 | 3611.5 | 3916 | 258.949 | 20 |
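
A listing such as Figure 17.8 can be produced with PROC PRINT. The statements used for the figure are not shown here, so the following is only a minimal sketch. It assumes that Turbhist was created by the preceding PROC SHEWHART step and uses the OBS= data set option to limit the output to the first five subgroups.

```sas
/* Sketch: list the first five subgroups of the summary data set. */
/* Assumes Turbhist already exists in the WORK library.           */
proc print data=Turbhist(obs=5);
run;
```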
There are nine variables in the data set Turbhist.
Day is the subgroup variable.
KWattsL contains the subgroup minimums.
KWatts1 contains the first quartiles for each subgroup.
KWattsX contains the subgroup means.
KWattsM contains the subgroup medians.
KWatts3 contains the third quartiles for each subgroup.
KWattsH contains the subgroup maximums.
KWattsS contains the subgroup standard deviations.
KWattsN contains the subgroup sample sizes.
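
To see what each variable holds, comparable statistics can be computed directly from the raw measurements with PROC MEANS. This is a sketch for cross-checking, not the method the SHEWHART procedure uses internally. It assumes Turbine contains the variables Day and KWatts, as implied by the BOXCHART statement. The output data set name TurbCheck is hypothetical. Quartiles and medians may differ slightly from those in Turbhist if the two procedures apply different percentile definitions.

```sas
/* Sketch: recompute the subgroup summary statistics from the raw data. */
/* TurbCheck is a hypothetical name chosen for this illustration.       */
proc means data=Turbine noprint nway;
   class Day;                        /* one output row per subgroup   */
   var KWatts;                       /* the process variable          */
   output out=TurbCheck(drop=_type_ _freq_)
          min=KWattsL  q1=KWatts1  mean=KWattsX  median=KWattsM
          q3=KWatts3   max=KWattsH std=KWattsS   n=KWattsN;
run;
```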
Note that the summary statistic variables are named by adding the suffix characters L, 1, X, M, 3, H, S, and N to the name of the process variable, KWatts, specified in the BOXCHART statement. In other words, the variable naming convention for OUTHISTORY= data sets is the same as that for HISTORY= data sets.
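
Because the naming conventions match, a data set such as Turbhist can be supplied back to the SHEWHART procedure through the HISTORY= option. The following is a minimal sketch, assuming Turbhist was created as shown earlier; the title text is illustrative only.

```sas
/* Sketch: read the saved summary data set and display the box chart. */
title 'Box Chart for Power Output';   /* illustrative title */
proc shewhart history=Turbhist;
   boxchart KWatts*Day;               /* KWatts is given without a suffix */
run;
```

Here KWatts is specified without a suffix, and the procedure locates the summary variables by their suffixed names.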
If you specify the RANGES option, the OUTHISTORY= data set includes a subgroup range variable, rather than a subgroup standard deviation variable, as demonstrated by the following statements:
proc shewhart data=Turbine;
   boxchart KWatts*Day / outhistory = Turbhist2
                         ranges
                         nochart;
run;
Figure 17.9 contains a partial listing of Turbhist2. The variable KWattsR contains the subgroup ranges.
The RANGES option is not recommended when the subgroup sample sizes are greater than 10, nor when you use the NOLIMITS option to create standard side-by-side box-and-whisker plots.
For more information, see OUTHISTORY= Data Set.
Figure 17.9: The Summary Data Set Turbhist2
Summary Data Set for Power Output

| Day | KWattsL | KWatts1 | KWattsX | KWattsM | KWatts3 | KWattsH | KWattsR | KWattsN |
|---|---|---|---|---|---|---|---|---|
| 04JUL | 3180 | 3340.0 | 3487.40 | 3490.0 | 3610.0 | 4050 | 870 | 20 |
| 05JUL | 3179 | 3333.5 | 3471.65 | 3419.5 | 3605.0 | 3849 | 670 | 20 |
| 06JUL | 3304 | 3376.0 | 3488.30 | 3456.5 | 3604.5 | 3781 | 477 | 20 |
| 07JUL | 3045 | 3390.5 | 3434.20 | 3447.0 | 3550.0 | 3629 | 584 | 20 |
| 08JUL | 2968 | 3321.0 | 3475.80 | 3487.0 | 3611.5 | 3916 | 948 | 20 |
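
In the rows listed, each value of KWattsR equals KWattsH minus KWattsL (for 04JUL, 4050 - 3180 = 870). A short DATA step can confirm this for every subgroup. This is a sketch that assumes Turbhist2 was created as shown above. The names RangeCheck and Diff are hypothetical.

```sas
/* Sketch: compare the stored range with maximum minus minimum. */
data RangeCheck;                      /* hypothetical data set name */
   set Turbhist2;
   /* Diff is 0 for each of the five subgroups listed in Figure 17.9 */
   Diff = KWattsR - (KWattsH - KWattsL);
run;

proc print data=RangeCheck;
   var Day KWattsL KWattsH KWattsR Diff;
run;
```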