The OUTBOX= data set saves group summary statistics, decision limits, and outlier values. The following variables can be saved:
the group-variable
the variable _VAR_, containing the analysis variable name
the variable _TYPE_, identifying features of box-and-whisker plots
the variable _VALUE_, containing values of box-and-whisker plot features
the variable _ID_, containing labels for outliers
the variable _HTML_, containing links associated with box-and-whisker plot features
_ID_ is included in the OUTBOX= data set only if one of the keywords SCHEMATICID or SCHEMATICIDFAR is specified with the BOXSTYLE= option. _HTML_ is present only if the HTML= or HTML2= option is specified.
Each observation in an OUTBOX= data set records the value of a single feature of one group’s box-and-whisker plot, such as its mean. The _TYPE_ variable identifies the feature whose value is recorded in _VALUE_. The following table lists valid _TYPE_ variable values:
_TYPE_ Value |
Description |
---|---|
N |
group size |
ALPHA |
significance level |
LIMITN |
nominal sample size associated with decision limits |
LDLX |
lower decision limit for group mean |
UDLX |
upper decision limit for group mean |
RESPMEAN |
overall response variable mean |
MIN |
group minimum value |
Q1 |
group first quartile |
MEDIAN |
group median |
MEAN |
group mean |
Q3 |
group third quartile |
MAX |
group maximum value |
LOW |
low outlier value |
HIGH |
high outlier value |
LOWHISKR |
low whisker value, if different from MIN |
HIWHISKR |
high whisker value, if different from MAX |
FARLOW |
low far outlier value |
FARHIGH |
high far outlier value |
Additionally, the following variables, if specified, are included:
block-variables
symbol-variable
BY variables
ID variables
The OUTLIMITS= data set saves decision limits and decision limit parameters. The following variables can be saved:
Variable |
Description |
---|---|
_ALPHA_ |
significance level |
_DFE_ |
degrees of freedom for mean square error |
_GROUP_ |
group-variable specified in the BOXCHART statement |
_INDEX_ |
optional identifier for the decision limits specified with the OUTINDEX= option |
_LDLX_ |
lower decision limit for group means |
_LIMITK_ |
number of groups |
_LIMITN_ |
sample size associated with the decision limits |
_MEAN_ |
weighted average of group means () |
_MSE_ |
mean square error |
_TYPE_ |
type (estimate or standard value) of _MEAN_ and _MSE_ |
_UDLX_ |
upper decision limit for group means |
_VAR_ |
response specified in the BOXCHART statement |
Notes:
In the unbalanced case, the special missing value V is assigned to the variables _LIMITN_, _LDLX_, and _UDLX_.
Optional BY variables are saved in the OUTLIMITS= data set.
The OUTLIMITS= data set contains one observation for each response specified in the BOXCHART statement. For an example, see Saving Decision Limits.
The OUTSUMMARY= data set saves group summary statistics. The following variables can be saved:
the group-variable
a group minimum variable named by response suffixed with L
a group first-quartile variable named by response suffixed with 1
a group mean variable named by response suffixed with X
a group median variable named by response suffixed with M
a group third-quartile variable named by response suffixed with 3
a group maximum variable named by response suffixed with H
a group standard deviation variable named by response suffixed with S
a group sample size variable named by response suffixed with N
Given a response name that contains 32 characters, the procedure first shortens the name to its first 16 characters and its last 15 characters, and then it adds the suffix.
Group summary variables are created for each response specified in the BOXCHART statement. For example, consider the following statements:
proc anom data=Steel; xchart (Width Diameter)*Lot / outsummary=Summary; run;
The data set Summary contains variables named Lot, WidthL, Width1, WidthX, WidthM, Width3, WidthH, WidthS, WidthN, DiameterL, Diameter1, DiameterX, DiameterM, Diameter3, DiameterH, DiameterS, and DiameterN. Additionally, the following variables, if specified, are included:
BY variables
block-variables
symbol-variable
ID variables
_PHASE_ (if the OUTPHASE= option is specified)
For an example of an OUTSUMMARY= data set, see Saving Summary Statistics for Groups.
The OUTTABLE= data set saves group summary statistics, decision limits, and related information. The following variables can be saved:
Variable |
Description |
---|---|
_ALPHA_ |
significance level |
_EXLIM_ |
decision limit exceeded (if any) |
group |
values of the group variable |
_LDLX_ |
lower decision limit for group mean |
_LIMITN_ |
nominal sample size associated with the decision limits |
_MEAN_ |
central line |
_SUBMAX_ |
group maximum |
_SUBMED_ |
group median |
_SUBMIN_ |
group minimum |
_SUBN_ |
group sample size |
_SUBQ1_ |
group first quartile |
_SUBQ3_ |
group third quartile |
_SUBX_ |
group mean |
_UDLX_ |
upper decision limit for group mean |
_VAR_ |
response specified in the BOXCHART statement |
In addition, the following variables, if specified, are included:
BY variables
block-variables
symbol-variable
ID variables
_PHASE_ (if the READPHASES= option is specified)
Note: The variable _EXLIM_ is a character variable of length 8. The variable _PHASE_ is a character variable of length 48. The variable _VAR_ is a character variable whose length is no greater than 32. All other variables are numeric.
For an example, see Saving Decision Limits.