The RAREEVENTS Procedure(Experimental)

Constructing Rare Events Charts

Each point on the rare events chart indicates the value of an individual measurement from the input data set. You compute the lower probability limit (LPL), median, and upper probability limit (UPL) by solving for their values in the following equations, which use the cumulative distribution function (CDF) of the probability distribution that you specify in the DIST= option:

  • $\mr{cdf(LPL)} = 1 - \alpha _{\mr{LPL}}$

  • $\mr{cdf(median)} = 0.5$

  • $\mr{cdf(UPL)} = \alpha _{\mr{UPL}}$

Probability Limits Based on an Exponential Distribution

The cumulative distribution function of an exponential distribution with scale parameter $\sigma $ and threshold parameter $\theta $ is

\[ \mr{cdf}(x) = 1 - \exp \left(-\frac{(x-\theta )}{\sigma }\right) \]

Solving the equations listed previously, the median and probability limits values are as follows:

  • $\mr{LPL} = \theta - \sigma \ln (1 - \alpha _{\mr{LPL}})$

  • $\mr{median} = \ln (2)$

  • $\mr{UPL} = \theta - \sigma \ln (\alpha _{\mr{UPL}})$

Probability Limits Based on a Geometric Distribution

The cumulative distribution function of a geometric distribution with shift parameter a and probability p is

\[ \mr{cdf}(x) = 1 - (1 - p)^{x - a + 1} \]

Because the geometric distribution is used with integer data, meaningful probability limits must have integer values. Therefore the solutions to the equations listed previously are

  • $\mr{LPL} = \left\lfloor \frac{\ln (1 - \alpha _{\mr{LPL}})}{\ln (1 - p)} + a \right\rfloor $

  • $\mr{median} = \frac{\ln (0.5)}{\ln (1 - p)} + a$

  • $\mr{UPL} = \left\lceil \frac{\ln (\alpha _{\mr{UPL}})}{\ln (1 - p)} + a \right\rceil $

The probability of a value from the distribution being greater than the UPL is as close as possible to $\alpha _{\mr{UPL}}$ without exceeding it, and the probability of a value from the distribution being less than the LPL is as close as possible to $\alpha _{\mr{LPL}}$ without exceeding it. The $\alpha _{\mr{UPL}}$ and $\alpha _{\mr{LPL}}$ values that correspond to the computed limits are displayed in a legend on the rare events chart.

With integer probability limits, it is not unusual for the computed LPL to be equal to the minimum possible data value, so that no data value can be less than the LPL. In that case, the following value is computed:

\[ m = \left\lceil \frac{\ln (\alpha _{\mr{LPL}})}{\ln (p)} \right\rceil \]

The probability of a sequence of m consecutive values from the geometric distribution each being equal to the LPL is as close to $\alpha _{\mr{LPL}}$ as possible without exceeding it. The RAREEVENTS procedure flags any sequence of m consecutive measurements equal to the LPL as a sign of unusual variation.