Statistical Graphics


Histograms

You can use the HISTOGRAM subroutine to create a histogram. The required argument is a vector that contains values of a continuous variable.

The following statements read the MPG_City variable from the Sashelp.Cars data set:

proc iml;
use Sashelp.Cars;
read all var {MPG_City};
close Sashelp.Cars;

The following statements create a simple histogram, which is shown in Figure 15.6:

title "Histogram with Default Properties";
call Histogram(MPG_City);

Figure 15.6: Histogram with Default Options

Histogram with Default Options


For a more complicated example, the following statements create a histogram by using the SCALE=, DENSITY=, REBIN=, GRID=, LABEL=, and XVALUES= options: The result is shown in Figure 15.7.

title "Histogram with Density Curves";
call Histogram(MPG_City)
          scale="Percent"
          density={"Normal" "Kernel"}
          rebin={0 5}
          grid="y"
          label="Miles per Gallon (City)"
          xvalues=do(0, 60, 10);

Figure 15.7: Histogram with Overlaid Densities

Histogram with Overlaid Densities


The following list explains the options that are used to create Figure 15.7:

  • The SCALE= option specifies the scaling that is applied to the vertical axis of the histogram. In Figure 15.6, the vertical axis is scaled to represent the percentage of observations in each bar.

  • The DENSITY= option specifies whether to overlay a density estimate on the histogram. In Figure 15.6, a normal density estimate and a kernel density estimate are overlaid.

  • The REBIN= option specifies two numerical values that set the location of the first bins and the width of bins. In Figure 15.6, the bins are anchored at the value 0 and have a width of 5 units.

  • The GRID= option specifies whether grid lines are displayed for the X and Y axes. Figure 15.6 shows grid lines for the Y axis.

  • The LABEL= option specifies axis labels. In Figure 15.6, a label is specified for the X axis.

  • The XVALUES= option specifies a vector of values for ticks for the X and Y axes. In Figure 15.6, tick marks are placed every 10 units in the interval $[0,60]$.