The CDFPLOT statement plots the observed cumulative distribution function (cdf) of a variable, defined as
where N is the number of nonmissing observations. The cdf is an increasing step function that has a vertical jump of at each value of x equal to an observed value. The cdf is also referred to as the empirical cumulative distribution function (ECDF).
You can use any number of CDFPLOT statements in the UNIVARIATE procedure. The components of the CDFPLOT statement are as follows.
Table 4.2 lists primary options for requesting a theoretical distribution.
Table 4.2: Primary Options for Theoretical Distribution
Option |
Description |
---|---|
plots two-parameter beta distribution function, parameters and assumed known |
|
plots one-parameter exponential distribution function, parameter assumed known |
|
plots two-parameter gamma distribution function, parameter assumed known |
|
plots Gumbel distribution with location parameter and scale parameter |
|
plots inverse Gaussian distribution with mean and shape parameter |
|
plots two-parameter lognormal distribution function, parameter assumed known |
|
plots normal distribution function |
|
plots generalized Pareto distribution with threshold parameter , scale parameter , and shape parameter |
|
plots power function distribution with threshold parameter , scale parameter , and shape parameter |
|
plots Rayleigh distribution with threshold parameter and scale parameter |
|
plots two-parameter Weibull distribution function, parameter assumed known |
Table 4.3 lists secondary options that specify distribution parameters and control the display of a theoretical distribution function. Specify these options in parentheses after the distribution keyword. For example, you can request a normal probability plot with a distribution reference line by specifying the NORMAL option as follows:
proc univariate; cdfplot / normal(mu=10 sigma=0.5 color=red); run;
The COLOR= option specifies the color for the curve, and the normal-options MU= and SIGMA= specify the parameters and for the distribution function. If you do not specify these parameters, maximum likelihood estimates are computed.
Table 4.3: Secondary Distribution Options
Option |
Description |
---|---|
Options Used with All Distributions |
|
specifies color of theoretical distribution function |
|
specifies line type of theoretical distribution function |
|
specifies width of theoretical distribution function |
|
Beta-Options |
|
specifies first shape parameter for beta distribution function |
|
specifies second shape parameter for beta distribution function |
|
specifies scale parameter for beta distribution function |
|
specifies lower threshold parameter for beta distribution function |
|
Exponential-Options |
|
specifies scale parameter for exponential distribution function |
|
specifies threshold parameter for exponential distribution function |
|
Gamma-Options |
|
specifies shape parameter for gamma distribution function |
|
specifies change in successive estimates of at which the Newton-Raphson approximation of terminates |
|
specifies initial value for in the Newton-Raphson approximation of |
|
specifies maximum number of iterations in the Newton-Raphson approximation of |
|
specifies scale parameter for gamma distribution function |
|
specifies threshold parameter for gamma distribution function |
|
Gumbel-Options |
|
specifies location parameter for Gumbel distribution function |
|
specifies scale parameter for Gumbel distribution function |
|
IGauss-Options |
|
specifies shape parameter for inverse Gaussian distribution function |
|
specifies mean for inverse Gaussian distribution function |
|
Lognormal-Options |
|
specifies shape parameter for lognormal distribution function |
|
specifies threshold parameter for lognormal distribution function |
|
specifies scale parameter for lognormal distribution function |
|
Normal-Options |
|
specifies mean for normal distribution function |
|
specifies standard deviation for normal distribution function |
|
Pareto-Options |
|
specifies shape parameter for generalized Pareto distribution function |
|
specifies scale parameter for generalized Pareto distribution function |
|
specifies threshold parameter for generalized Pareto distribution function |
|
Power-Options |
|
specifies shape parameter for power function distribution |
|
specifies scale parameter for power function distribution |
|
specifies threshold parameter for power function distribution |
|
Rayleigh-Options |
|
specifies scale parameter for Rayleigh distribution function |
|
specifies threshold parameter for Rayleigh distribution function |
|
Secondary Weibull-Options |
|
specifies shape parameter c for Weibull distribution function |
|
requests table of iteration history and optimizer details |
|
specifies maximum number of iterations in the Newton-Raphson approximation of |
|
specifies scale parameter for Weibull distribution function |
|
specifies threshold parameter for Weibull distribution function |
Table 4.4 summarizes general options for enhancing cdf plots.
Table 4.4: General Graphics Options
Option |
Description |
---|---|
General Graphics Options |
|
specifies reference lines perpendicular to the horizontal axis |
|
specifies labels for HREF= lines |
|
specifies position for HREF= line labels |
|
suppresses plot of empirical (observed) distribution function |
|
suppresses label for horizontal axis |
|
suppresses label for vertical axis |
|
suppresses tick marks and tick mark labels for vertical axis |
|
specifies reference lines at values of summary statistics |
|
specifies labels for STATREF= lines |
|
specifies substitution character for displaying statistic values in STATREFLABELS= labels |
|
specifies label for vertical axis |
|
specifies reference lines perpendicular to the vertical axis |
|
specifies labels for VREF= lines |
|
specifies position for VREF= line labels |
|
specifies scale for vertical axis |
|
Options for Traditional Graphics Output |
|
specifies annotate data set |
|
specifies color for axis |
|
specifies color for frame |
|
specifies colors for HREF= lines |
|
specifies colors for STATREF= lines |
|
specifies color for text |
|
specifies colors for VREF= lines |
|
specifies description for graphics catalog member |
|
specifies text font |
|
specifies AXIS statement for horizontal axis |
|
specifies height of text used outside framed areas |
|
specifies number of horizontal axis minor tick marks |
|
specifies software font for text inside framed areas |
|
specifies height of text inside framed areas |
|
specifies line types for HREF= lines |
|
specifies line types for STATREF= lines |
|
specifies line types for VREF= lines |
|
specifies name for plot in graphics catalog |
|
suppresses frame around plotting area |
|
turns and vertically strings out characters in labels for vertical axis |
|
specifies AXIS statement for vertical axis |
|
specifies number of vertical axis minor tick marks |
|
specifies line thickness for axes and frame |
|
Options for ODS Graphics Output |
|
specifies footnote displayed on plot |
|
specifies secondary footnote displayed on plot |
|
specifies title displayed on plot |
|
specifies secondary title displayed on plot |
|
overlays plots for different class levels |
|
Options for Comparative Plots |
|
applies annotation requested in ANNOTATE= data set to key cell only |
|
specifies color for filling row label frames |
|
specifies color for filling column label frames |
|
specifies color for proportion of frequency bar |
|
specifies color for row labels |
|
specifies color for column labels |
|
specifies distance between tiles in comparative plot |
|
specifies number of columns in comparative plot |
|
specifies number of rows in comparative plot |
|
Miscellaneous Options |
|
specifies table of contents entry for cdf plot grouping |
The following entries provide detailed descriptions of the options specific to the CDFPLOT statement. See the section Dictionary of Common Options for detailed descriptions of options common to all plot statements.