The BOXPLOT
statement displays a single box if given just a Y argument. It displays
multiple boxes if given both Y and X arguments and X has more than
one unique value. For numeric or character columns, the X-axis is
always of TYPE=DISCRETE.
Two basic
box plot representations can be drawn with the BOXPLOT statement:
a
schematic (Tukey) box plot and a
skeletal box plot.
See the
EXTREME= option for details.
The following
figure illustrates the box plot elements:
As shown
in the figure, the bottom and top edges of the box are located at
the 25th and 75th percentiles of the sample. Within the box you can
display the median (50th percentile) as a line and the mean as a marker
(see
DISPLAY= option).
You can
also display markers and data labels for outliers. Outliers are observations
that are more extreme than the upper and lower fences (
). Outliers that are beyond upper and lower far
fences (
) are called FAR OUTLIERS and can also be identified
and labeled. From a graphical perspective, the location of fences
along the axis are known, but there is no line or marker that displays
a fence. (See
DISPLAY=,
LABELFAR=, and
DATALABEL= options).
Finally,
you can control the range represented by the whiskers. By default,
the whiskers are drawn from the upper edge of the box to the MAX value,
and from the lower edge of the box to the MIN value (see
EXTREME= option).