PROC CAPABILITY and General Statements

Assumptions and Terminology for Capability Indices

One of the fundamental assumptions in process capability analysis is that the process must be in statistical control. Without statistical control, the process is not predictable, the concept of a process distribution does not apply, and quantities related to the distribution, such as probabilities, percentiles, and capability indices, cannot be meaningfully estimated. Additionally, all of the standard process capability indices described in the next section require that the process distribution be normal, or at least approximately normal.

In many industries, statistical control is routinely checked with a Shewhart chart (such as an $\bar{X}$ and R chart) before capability indices such as

$C_{pk} = \min \left( \frac{ \mr{USL} - \mu }{ 3 \sigma } , \frac{ \mr{LSL} - \mu }{ 3 \sigma } \right)$

are computed. The control chart analysis yields estimates for the process mean $\mu$ and standard deviation $\sigma$ , which are based on subgrouped data and can be used to estimate $C_{pk}$ . In particular, $\sigma$ can be estimated by

$s_ R = \bar{R} / d_2$

rather than the ungrouped sample standard deviation

$s = \frac{1}{n-1} \sqrt { \sum _{i=1}{n} (x_ i - \bar{x} )/^2 }$

You can use the SHEWHART procedure to carry out the control chart analysis and to compute capability indices based on $s_ R$ . On the other hand, the CAPABILITY procedure computes indices based on s.

Some industry manuals distinguish these two approaches. For instance, the ASQC/AIAG manual Fundamental Process Control uses the notation $C_{pk}$ for the estimate based on $s_ R$ , and it uses the notation $P_{pk}$ for the estimate based on s. However, assuming that the process is in control and only common cause variation is present, both $s_ R$ and s are estimates of the same parameter $\sigma$ , and so there is fundamentally no difference in the two approaches^[8].

Once control has been established, attention should focus on the distribution of the process measurements, and at this point there is no practical or statistical advantage to working with subgrouped measurements. In fact, the use of s is closely associated with a wide variety of methods that are highly useful for process capability analysis, including tests for normality, graphical displays such as histograms and probability plots, and confidence intervals for parameters and capability indices.

^[8]Statistically, s is a more efficient estimator of $\sigma$ than $s_ R$ .