HISTOGRAM Statement: CAPABILITY Procedure

Summary of Options

The following tables list the HISTOGRAM statement options by function. For detailed descriptions, see Dictionary of Options.

Parametric Density Estimation Options

Table 5.18 lists options that display a parametric density estimate on the histogram.

Table 5.18: Parametric Distribution Options

Option

Description

BETA(beta-options)

fits beta distribution with threshold parameter $\theta $, scale parameter $\sigma $, and shape parameters $\alpha $ and $\beta $

EXPONENTIAL(exponential-options)

fits exponential distribution with threshold parameter $\theta $ and scale parameter $\sigma $

GAMMA(gamma-options)

fits gamma distribution with threshold parameter $\theta $, scale parameter $\sigma $, and shape parameter $\alpha $

GUMBEL(Gumbel-options)

plots Gumbel distribution with location parameter $\mu $ and scale parameter $\sigma $

IGAUSS(iGauss-options)

plots inverse Gaussian distribution with mean $\mu $ and shape parameter $\lambda $

LOGNORMAL(lognormal-options)

fits lognormal distribution with threshold parameter $\theta $, scale parameter $\zeta $, and shape parameter $\sigma $

NORMAL(normal-options)

fits normal distribution with mean $\mu $ and standard deviation $\sigma $

PARETO(Pareto-options)

plots Pareto distribution with threshold parameter $\theta $, scale parameter $\sigma $, and shape parameter $\alpha $

POWER(power-options)

plots power function distribution with threshold parameter $\theta $, scale parameter $\sigma $, and shape parameter $\alpha $

RAYLEIGH(Rayleigh-options)

plots Rayleigh distribution with threshold parameter $\theta $ and scale parameter $\sigma $

SB(SB-options)

fits Johnson $S_{B}$ distribution with threshold parameter $\theta $, scale parameter $\sigma $, and shape parameters $\delta $ and $\gamma $

SU(SU-options)

fits Johnson $S_{U}$ distribution with location parameter $\theta $, scale parameter $\sigma $, and shape parameters $\delta $ and $\gamma $

WEIBULL(Weibull-options)

fits Weibull distribution with threshold parameter $\theta $, scale parameter $\sigma $, and shape parameter c


Table 5.19 lists secondary options that specify parameters for fitted parametric distributions and that control the display of fitted curves. Specify these secondary options in parentheses after the distribution keyword. For example, the following statements fit a normal curve by using the NORMAL option:

proc capability;
   histogram / normal(color=red mu=10 sigma=0.5);
run;

The COLOR= normal-option draws the curve in red, and the MU= and SIGMA= normal-options specify the parameters $\mu =10$ and $\sigma =0.5$ for the curve. Note that the sample mean and sample standard deviation are used to estimate $\mu $ and $\sigma $, respectively, when the MU= and SIGMA= options are not specified.

You can specify lists of values for distribution parameters to display more than one fitted curve from the same distribution family on a histogram. Option values are matched by list position. You can specify the value EST in a list of distribution parameter values to use an estimate of the parameter.

For example, the following code displays two normal curves on a histogram:

proc capability;
   histogram / normal(color=(red blue) mu=10 est sigma=0.5 est);
run;

The first curve is red, with $\mu =10$ and $\sigma =0.5$. The second curve is blue, with $\mu $ equal to the sample mean and $\sigma $ equal to the sample standard deviation.

See the section Formulas for Fitted Curves for detailed information about the families of parametric distributions that you can fit with the HISTOGRAM statement.

Table 5.19: Distribution Options

Option

Description

Options Used with All Parametric Distributions

COLOR=

specifies color of fitted density curve

FILL

fills area under fitted density curve

INDICES

calculates capability indices based on fitted distribution

L=

specifies line type of fitted curve

MIDPERCENTS

prints table of midpoints of histogram intervals

NOPRINT

suppresses printed output summarizing fitted curve

PERCENTS=

lists percents for which quantiles calculated from data and quantiles estimated from fitted curve are tabulated

SYMBOL=

specifies character used for fitted density curve in line printer plots

W=

specifies width of fitted density curve

Beta-Options

ALPHA=

specifies first shape parameter $\alpha $ for fitted beta curve

BETA=

specifies second shape parameter $\beta $ for fitted beta curve

SIGMA=

specifies scale parameter $\sigma $ for fitted beta curve

THETA=

specifies lower threshold parameter $\theta $ for fitted beta curve

Exponential-Options

SIGMA=

specifies scale parameter $\sigma $ for fitted exponential curve

THETA=

specifies threshold parameter $\theta $ for fitted exponential curve

Gamma-Options

ALPHA=

specifies shape parameter $\alpha $ for fitted gamma curve

ALPHADELTA=

specifies change in successive estimates of $\alpha $ at which the Newton-Raphson approximation of $\hat{\alpha }$ terminates

ALPHAINITIAL=

specifies initial value for $\alpha $ in Newton-Raphson approximation of $\hat{\alpha }$

MAXITER=

specifies maximum number of iterations in Newton-Raphson approximation of $\hat{\alpha }$

SIGMA=

specifies scale parameter $\sigma $ for fitted gamma curve

THETA=

specifies threshold parameter $\theta $ for fitted gamma curve

Gumbel-Options

EDFNSAMPLES=

specifies number of samples for EDF goodness-of-fit simulation

EDFSEED=

specifies seed value for EDF goodness-of-fit simulation

MU=

specifies location parameter $\mu $ for fitted Gumbel curve

SIGMA=

specifies scale parameter $\sigma $ for fitted Gumbel curve

IGauss-Options

EDFNSAMPLES=

specifies number of samples for EDF goodness-of-fit simulation

EDFSEED=

specifies seed value for EDF goodness-of-fit simulation

LAMBDA=

specifies shape parameter $\lambda $ for fitted inverse Gaussian curve

MU=

specifies mean $\mu $ for fitted inverse Gaussian curve

Lognormal-Options

SIGMA=

specifies shape parameter $\sigma $ for fitted lognormal curve

THETA=

specifies threshold parameter $\theta $ for fitted lognormal curve

ZETA=

specifies scale parameter $\zeta $ for fitted lognormal curve

Normal-Options

MU=

specifies mean $\mu $ for fitted normal curve

SIGMA=

specifies standard deviation $\sigma $ for fitted normal curve

Pareto-Options

ALPHA=

specifies shape parameter $\alpha $ for fitted Pareto curve

EDFNSAMPLES=

specifies number of samples for EDF goodness-of-fit simulation

EDFSEED=

specifies seed value for EDF goodness-of-fit simulation

SIGMA=

specifies scale parameter $\sigma $ for fitted Pareto curve

THETA=

specifies threshold parameter $\theta $ for fitted Pareto curve

Power-Options

ALPHA=

specifies shape parameter $\alpha $ for fitted power function curve

SIGMA=

specifies scale parameter $\sigma $ for fitted power function curve

THETA=

specifies threshold parameter $\theta $ for fitted power function curve

Rayleigh-Options

EDFNSAMPLES=

specifies number of samples for EDF goodness-of-fit simulation

EDFSEED=

specifies seed value for EDF goodness-of-fit simulation

SIGMA=

specifies scale parameter $\sigma $ for fitted Rayleigh curve

THETA=

specifies threshold parameter $\theta $ for fitted Rayleigh curve

$S_{B}$-Options

DELTA=

specifies first shape parameter $\delta $ for fitted $S_{B}$ curve

FITINTERVAL=

specifies z-value for method of percentiles

FITMETHOD=

specifies method of parameter estimation

FITTOLERANCE=

specifies tolerance for method of percentiles

GAMMA=

specifies second shape parameter $\gamma $ for fitted $S_{B}$ curve

SIGMA=

specifies scale parameter $\sigma $ for fitted $S_{B}$ curve

THETA=

specifies lower threshold parameter $\theta $ for fitted $S_{B}$ curve

$S_{U}$-Options

DELTA=

specifies first shape parameter $\delta $ for fitted $S_{U}$ curve

FITINTERVAL=

specifies z-value for method of percentiles

FITMETHOD=

specifies method of parameter estimation

FITTOLERANCE=

specifies tolerance for method of percentiles

GAMMA=

specifies second shape parameter $\gamma $ for fitted $S_{U}$ curve

OPTBOUNDRANGE=

specifies the sampling range for parameter starting values in MLE optimization

OPTMAXITER=

specifies an interation limit for MLE optimization

OPTMAXSTARTS=

specifies the maximum number of starting points to be used for MLE optimization

OPTPRINT

prints an iteration history for MLE optimization

OPTSEED=

specifies a seed value for MLE optimization

OPTTOLERANCE=

specifies the optimality tolerance for MLE optimization

SIGMA=

specifies scale parameter $\sigma $ for fitted $S_{U}$ curve

THETA=

specifies location parameter $\theta $ for fitted $S_{U}$ curve

Weibull-Options

C=

specifies shape parameter c for fitted Weibull curve

CDELTA=

specifies change in successive estimates of c at which the Newton-Raphson approximation of $\hat{c}$ terminates

CINITIAL=

specifies initial value for c in Newton-Raphson approximation of $\hat{c}$

MAXITER=

specifies maximum number of iterations in Newton-Raphson approximation of $\hat{c}$

SIGMA=

specifies scale parameter $\sigma $ for fitted Weibull curve

THETA=

specifies threshold parameter $\theta $ for fitted Weibull curve


Nonparametric Density Estimation Options

Table 5.20: Kernel Density Estimation Options

Option

Description

KERNEL(kernel-options)

fits kernel density estimates


Specify the options listed in Table 5.21 in parentheses after the keyword KERNEL to control features of kernel density estimates requested with the KERNEL option.

Table 5.21: Kernel-Options

Option

Description

C=

specifies standardized bandwidth parameter c for fitted kernel density estimate

COLOR=

specifies color of the fitted kernel density curve

FILL

fills area under fitted kernel density curve

K=

specifies type of kernel function

L=

specifies line type used for fitted kernel density curve

LOWER=

specifies lower bound for fitted kernel density curve

SYMBOL=

specifies character used for fitted kernel density curve in line printer plots

UPPER=

specifies upper bound for fitted kernel density curve

W=

specifies line width for fitted kernel density curve


General Options

Table 5.22 summarizes general options for the HISTOGRAM statement, including options for enhancing charts and producing output data sets.

Table 5.22: General HISTOGRAM Statement Options

Option

Description

Options to Create Output Data Sets

OUTFIT=

requests information about fitted curves

OUTHISTOGRAM=

requests information about histogram intervals

OUTKERNEL=

creates a data set containing kernel density estimates

General Histogram Layout Options

CLIPCURVES

scales vertical axis without considering fitted curves

CONTENTS=

specifies table of contents entry for histogram grouping

CURVELEGEND=

specifies LEGEND statement for curves

ENDPOINTS=

lists endpoints for histogram intervals

HANGING

constructs hanging histogram

HREF=

specifies reference lines perpendicular to the horizontal axis

HREFLABELS=

specifies labels for HREF= lines

MIDPERCENTS

prints table of histogram intervals

MIDPOINTS=

lists midpoints for histogram intervals

NENDPOINTS=

specifies number of histogram interval endpoints

NMIDPOINTS=

specifies number of histogram interval midpoints

NOBARS

suppresses histogram bars

NOCURVELEGEND

suppresses legend for curves

NOFRAME

suppresses frame around plotting area

NOLEGEND

suppresses legend

NOPLOT

suppresses plot

NOSPECLEGEND

suppresses specifications legend

NOTABCONTENTS

suppresses table of contents entries for tables produced by HISTOGRAM statement

RTINCLUDE

includes right endpoint in interval

SPECLEGEND=

specifies LEGEND statement for specification limits

VREF=

specifies reference lines perpendicular to the vertical axis

VREFLABELS=

specifies labels for VREF= lines

VSCALE=

specifies scale for vertical axis

Options to Enhance Graphical Output

ANNOTATE=

specifies annotate data set

BARLABEL=

produces labels above histogram bars

BARWIDTH=

specifies width for the bars

BMCFILL=

specifies fill color for box-and-whisker plot in bottom margin

BMCFRAME=

specifies fill color bottom margin plot frame

BMCOLOR=

specifies color for bottom margin plot

BMMARGIN=

specifies height of margin for bottom margin plot

BMPLOT=

requests a plot in bottom margin of histogram

CAXIS=

specifies color for axis

CBARLINE=

specifies color for outlines of histogram bars

CFILL=

specifies color for filling under curve

CFRAME=

specifies color for frame

CGRID=

specifies color for grid lines

CHREF=

specifies colors for HREF= lines

CLIPREF

draws reference lines behind histogram bars

CLIPSPEC=

clips histogram bars at specification limits

CSTATREF=

specifies colors for STATREF= lines

CTEXT=

specifies color for text

CVREF=

specifies colors for VREF= lines

DESCRIPTION=

specifies description for plot in graphics catalog

FONT=

specifies software font for text

FRONTREF

draws reference lines in front of histogram bars

GRID

creates a grid

HAXIS=

specifies AXIS statement for horizontal axis

HEIGHT=

specifies height of text used outside framed areas

HMINOR=

specifies number of horizontal minor tick marks

HOFFSET=

specifies offset for horizontal axis

HREFLABPOS=

specifies vertical position of labels for HREF= lines

INFONT=

specifies software font for text inside framed areas

INHEIGHT=

specifies height of text inside framed areas

INTERBAR=

specifies space between histogram bars

LEGEND=

identifies LEGEND statement

LGRID=

specifies a line type for grid lines

LHREF=

specifies line styles for HREF= lines

LSTATREF=

specifies line styles for STATREF= lines

LVREF=

specifies line styles for VREF= lines

MAXNBIN=

specifies maximum number of bins to display

MAXSIGMAS=

limits the number of bins that display to within a specified number of standard deviations above and below mean of data in key cell

MIDPOINTS=

specifies midpoints for histogram intervals

NAME=

specifies name for plot in graphics catalog

NOHLABEL

suppresses label for horizontal axis

NOVLABEL

suppresses label for vertical axis

NOVTICK

suppresses tick marks and tick mark labels for vertical axis

PFILL=

specifies pattern for filling under curve

STATREF=

specifies reference lines at values of summary statistics

STATREFLABELS=

specifies labels for STATREF= lines

STATREFSUBCHAR=

specifies substitution character for displaying statistic values in STATREFLABELS= labels

TURNVLABELS

turns and vertically strings out characters in labels for vertical axis

VAXIS=

specifies AXIS statement or values for vertical axis

VAXISLABEL=

specifies label for vertical axis

VMINOR=

specifies number of vertical minor tick marks

VOFFSET=

specifies length of offset at upper end of vertical axis

VREFLABPOS=

specifies horizontal position of labels for VREF= lines

WAXIS=

specifies line thickness for axes and frame

WBARLINE=

specifies line thickness for bar outlines

WGRID=

specifies line thickness for grid

Options for ODS Graphics Output

ODSFOOTNOTE=

specifies footnote displayed on histogram

ODSFOOTNOTE2=

specifies secondary footnote displayed on histogram

ODSTITLE=

specifies title displayed on histogram

ODSTITLE2=

specifies secondary title displayed on histogram

Options for Comparative Plots

ANNOKEY

applies annotation requested in ANNOTATE= data set to key cell only

CFRAMESIDE=

specifies color for filling frame for row labels

CFRAMETOP=

specifies color for filling frame for column labels

CPROP=

specifies color for proportion of frequency bar

CTEXTSIDE=

specifies color for row labels of comparative histograms

CTEXTTOP=

specifies color for column labels of comparative histograms

INTERTILE=

specifies distance between tiles

NCOLS=

specifies number of columns in comparative histogram

NROWS=

specifies number of rows in comparative histogram

OVERLAY

overlays plots for different class levels (ODS Graphics only)

Options to Enhance Line Printer Plots

HREFCHAR=

specifies line character for HREF= lines

VREFCHAR=

specifies line character for VREF= lines