

You can create an OUTKERNEL= data set with the HISTOGRAM statement. This data set contains information about histogram intervals. Because you can specify multiple HISTOGRAM statements with the UNIVARIATE procedure, you can create multiple OUTKERNEL= data sets.
An OUTKERNEL= data set contains a group of observations for each kernel density estimate requested with the HISTOGRAM statement.
These observations span a range of analysis variable values recorded in the _VALUE_ variable. The procedure determines the increment between values, and therefore the number of observations in the group. The
variable _DENSITY_ contains the kernel density calculated for the corresponding analysis variable value.
When a density curve is overlaid on a histogram, the curve is scaled so that the area under the curve equals the total area
of the histogram bars. The scaled density values are saved in the variable _COUNT_, _PERCENT_, or _PROPORTION_, depending on the histogram’s vertical axis scale, determined by the VSCALE= option. Only one of these variables appears
in a given OUTKERNEL= data set.
Table 4.38 lists the variables in an OUTKERNEL= data set.
Table 4.38: Variables in the OUTKERNEL= Data Set
|
Variable |
Description |
|---|---|
|
_C_ |
standardized bandwidth parameter |
|
_COUNT_ |
kernel density scaled for VSCALE=COUNT |
|
_DENSITY_ |
kernel density |
|
_PERCENT_ |
kernel density scaled for VSCALE=PERCENT (default) |
|
_PROPORTION_ |
kernel density scaled for VSCALE=PROPORTION |
|
_TYPE_ |
kernel function |
|
_VALUE_ |
variable value at which kernel function is calculated |
|
_VAR_ |
variable name |