The BOXPLOT statement
displays a single box if given just a Y argument. It displays multiple
boxes if given both Y and X arguments and X has more than one unique
value.
By default for numeric
or character columns, the category (X) axis is TYPE=DISCRETE. You
can override the default and set the TYPE= to LINEAR or TIME in the
parent layout, provided that the X column is numeric. The axis for
the analysis (Y) column is always LINEAR. When the X axis is LINEAR,
you must use the
INTERVALBOXWIDTH= option to specify the box width.
When
ORIENT=VERTICAL, the X (or X2) axis is used for the X column and
the Y (or Y2) axis is used for the Y column. When ORIENT=HORIZONTAL,
the X (or X2) axis is used for the Y column and the Y (or Y2) axis
is used for the X column.
Two basic box plot representations
can be drawn with the BOXPLOT statement: a
schematic
(Tukey) box plot and a
skeletal
box plot. See the
EXTREME= option for details.
The following figure
illustrates the box plot elements:
As shown in the figure,
the bottom and top edges of the box are located at the 25th and 75th
percentiles of the sample. Within the box, you can display the median
(50th percentile) as a line and the mean as a marker (see
DISPLAY= option).
You can also display
markers and data labels for outliers. Outliers are observations that
are more extreme than the upper and lower fences (
). Outliers that are beyond upper and lower far fences
(
) are called FAR OUTLIERS and can also be identified
and labeled. From a graphical perspective, the location of fences
along the axis are known, but there is no line or marker that displays
a fence. (See
DISPLAY=,
LABELFAR=, and
DATALABEL= options).
Finally, you can control
the range represented by the whiskers. By default, the whiskers are
drawn from the upper edge of the box to the MAX value, and from the
lower edge of the box to the MIN value. (See the
EXTREME= option.)