[See SHWBOXA in the SAS/QC Sample Library] In this example, the BOXCHART statement is used to create a summary data set that can be read later by the SHEWHART procedure (as in the preceding example). The following statements read measurements from the data set Turbine and create a summary data set named Turbhist:
   title 'Summary Data Set for Power Output';
   proc shewhart data=Turbine;
      boxchart KWatts*Day / outhistory = Turbhist
                            nochart;
   run;
The OUTHISTORY= option names the output data set, and the NOCHART option suppresses the display of the chart, which would be identical to the chart in Figure 15.4.
Figure 15.8 contains a partial listing of Turbhist.
Summary Data Set for Power Output

Obs | Day | KWattsL | KWatts1 | KWattsX | KWattsM | KWatts3 | KWattsH | KWattsS | KWattsN |
---|---|---|---|---|---|---|---|---|---|
1 | 04JUL | 3180 | 3340.0 | 3487.40 | 3490.0 | 3610.0 | 4050 | 220.260 | 20 |
2 | 05JUL | 3179 | 3333.5 | 3471.65 | 3419.5 | 3605.0 | 3849 | 210.427 | 20 |
3 | 06JUL | 3304 | 3376.0 | 3488.30 | 3456.5 | 3604.5 | 3781 | 147.025 | 20 |
4 | 07JUL | 3045 | 3390.5 | 3434.20 | 3447.0 | 3550.0 | 3629 | 157.637 | 20 |
5 | 08JUL | 2968 | 3321.0 | 3475.80 | 3487.0 | 3611.5 | 3916 | 258.949 | 20 |
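A listing like the one above can be produced with PROC PRINT. The following is a minimal sketch, not necessarily the statements used for the figure; the OBS=5 data set option is an assumption chosen here to limit the output to the first five subgroups.

```sas
/* Sketch: print the first five observations of the summary data set. */
/* OBS=5 is an assumed choice; the source only says "partial listing". */
proc print data=Turbhist(obs=5);
run;
```

The title set by the earlier TITLE statement remains in effect, so the listing carries the same heading.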
There are nine variables in the data set Turbhist.
Day is the subgroup variable.
KWattsL contains the subgroup minimums.
KWatts1 contains the first quartiles for each subgroup.
KWattsX contains the subgroup means.
KWattsM contains the subgroup medians.
KWatts3 contains the third quartiles for each subgroup.
KWattsH contains the subgroup maximums.
KWattsS contains the subgroup standard deviations.
KWattsN contains the subgroup sample sizes.
Note that the summary statistic variables are named by adding the suffix characters L, 1, X, M, 3, H, S, and N to the process variable name KWatts specified in the BOXCHART statement. In other words, the variable naming convention for OUTHISTORY= data sets is the same as that for HISTORY= data sets.
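Because the naming convention matches, a data set saved with OUTHISTORY= can be supplied back to the procedure through the HISTORY= option. The following is a minimal sketch, assuming Turbhist still exists in the current session; the title text is illustrative only.

```sas
/* Sketch: read the saved subgroup summaries instead of raw measurements. */
/* The procedure looks for KWattsL, KWatts1, KWattsX, KWattsM, KWatts3,   */
/* KWattsH, KWattsS, and KWattsN, plus the subgroup variable Day.         */
title 'Box Chart from Saved Summary Data';   /* illustrative title */
proc shewhart history=Turbhist;
   boxchart KWatts*Day;
run;
```

Reading summaries this way avoids recomputing the subgroup statistics from the raw data each time the chart is drawn.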
If you specify the RANGES option, the OUTHISTORY= data set includes a subgroup range variable, rather than a subgroup standard deviation variable, as demonstrated by the following statements:
   proc shewhart data=Turbine;
      boxchart KWatts*Day / outhistory = Turbhist2
                            ranges
                            nochart;
   run;
Figure 15.9 contains a partial listing of Turbhist2. The variable KWattsR contains the subgroup ranges.
The RANGES option is not recommended when the subgroup sample sizes are greater than 10, nor when you use the NOLIMITS option to create standard side-by-side box-and-whisker plots.
For more information, see OUTHISTORY= Data Set.
Summary Data Set for Power Output

Day | KWattsL | KWatts1 | KWattsX | KWattsM | KWatts3 | KWattsH | KWattsR | KWattsN |
---|---|---|---|---|---|---|---|---|
04JUL | 3180 | 3340.0 | 3487.40 | 3490.0 | 3610.0 | 4050 | 870 | 20 |
05JUL | 3179 | 3333.5 | 3471.65 | 3419.5 | 3605.0 | 3849 | 670 | 20 |
06JUL | 3304 | 3376.0 | 3488.30 | 3456.5 | 3604.5 | 3781 | 477 | 20 |
07JUL | 3045 | 3390.5 | 3434.20 | 3447.0 | 3550.0 | 3629 | 584 | 20 |
08JUL | 2968 | 3321.0 | 3475.80 | 3487.0 | 3611.5 | 3916 | 948 | 20 |
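Each subgroup range is the subgroup maximum minus the subgroup minimum; for 04JUL, 4050 - 3180 = 870. As a quick sanity check, a DATA step along the following lines could confirm that relationship for every subgroup. This is only a sketch: the data set name RangeCheck and the variables RangeFromExtremes and Matches are hypothetical names introduced for illustration.

```sas
/* Sketch: verify that KWattsR equals KWattsH - KWattsL in each subgroup. */
/* RangeCheck, RangeFromExtremes, and Matches are hypothetical names.     */
data RangeCheck;
   set Turbhist2;
   RangeFromExtremes = KWattsH - KWattsL;        /* maximum minus minimum */
   Matches = (RangeFromExtremes = KWattsR);      /* 1 if equal, 0 if not  */
run;

proc print data=RangeCheck(obs=5) noobs;
   var Day KWattsL KWattsH KWattsR RangeFromExtremes Matches;
run;
```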