MACHART Statement: MACONTROL Procedure

Methods for Estimating the Standard Deviation

When control limits are computed from the input data, four methods are available for estimating the process standard deviation $\sigma $. Three methods (referred to as the default, MVLUE, and RMSDF) are available with subgrouped data. A fourth method is used if the data are individual measurements (see Default Method for Individual Measurements).

Default Method for Subgroup Samples

This method is the default for moving average charts using subgrouped data. The default estimate of $\sigma $ is

\[ \hat{\sigma }=\frac{s_{1}/c_{4}(n_{1})+\ldots + s_{N}/c_{4}(n_{N})}{N}  \]

where N is the number of subgroups for which $n_{i}\geq 2$, $s_{i}$ is the sample standard deviation of the ith subgroup

\[  s_{i} = \sqrt { \frac{1}{n_{i} - 1} \sum ^{n_ i}_{j=1}(x_{ij}-\bar{X}_{i})^{2}} \]

and

\[ c_{4}(n_{i})=\frac{\Gamma (n_{i}/2)\sqrt {2/(n_{i}-1)} }{\Gamma ((n_{i}-1)/2)}  \]

Here $\Gamma (\cdot )$ denotes the gamma function, and $\bar{X}_{i}$ denotes the ith subgroup mean. A subgroup standard deviation $s_{i}$ is included in the calculation only if $n_{i}\geq 2$. If the observations are normally distributed, then the expected value of $s_{i}$ is $c_{4}(n_{i})\sigma $. Thus, $\hat{\sigma }$ is the unweighted average of N unbiased estimates of $\sigma $. This method is described in the American Society for Testing and Materials (1976).

MVLUE Method for Subgroup Samples

If you specify SMETHOD=MVLUE, a minimum variance linear unbiased estimate (MVLUE) is computed for $\sigma $. Refer to Burr (1969, 1976) and Nelson (1989, 1994). The MVLUE is a weighted average of N unbiased estimates of $\sigma $ of the form $s_{i}/c_{4}(n_{i})$, and it is computed as

\[ \hat{\sigma }=\frac{h_{1}s_{1}/c_{4}(n_{1})+\ldots + h_{N}s_{N}/c_{4}(n_{N})}{h_{1}+\ldots +h_{N}}  \]

where

\[ h_ i = \frac{[c_4(n_ i)]^{2}}{1 - [c_4(n_ i)]^{2}} \]

A subgroup standard deviation $s_{i}$ is included in the calculation only if $n_{i}\geq 2$, and N is the number of subgroups for which $n_{i}\geq 2$. The MVLUE assigns greater weight to estimates of $\sigma $ from subgroups with larger sample sizes, and it is intended for situations where the subgroup sample sizes vary. If the subgroup sample sizes are constant, the MVLUE reduces to the default estimate.

RMSDF Method for Subgroup Samples

If you specify SMETHOD=RMSDF, a weighted root-mean-square estimate is computed for $\sigma $ as follows:

\[  \hat{\sigma } = \frac{\sqrt {(n_{1} - 1)s_1^{2} + \cdots + (n_{N} - 1)s_{N}^{2}}}{c_{4}(n)\sqrt {n_{1} + \cdots + n_{N} - N}}  \]

The weights are the degrees of freedom $n_{i}-1$. A subgroup standard deviation $s_{i}$ is included in the calculation only if $n_{i}\geq 2$, and N is the number of subgroups for which $n_{i}\geq 2$.

If the unknown standard deviation $\sigma $ is constant across subgroups, the root-mean-square estimate is more efficient than the minimum variance linear unbiased estimate. However, in process control applications it is generally not assumed that $\sigma $ is constant, and if $\sigma $ varies across subgroups, the root-mean-square estimate tends to be more inflated than the MVLUE.

Default Method for Individual Measurements

When each subgroup sample contains a single observation ($n_{i} \equiv 1$), the process standard deviation $\sigma $ is estimated as

\[ \hat{\sigma }=\sqrt {\frac{1}{2(N-1)} \sum _{i=1}^{N-1}{(x_{i+1}-x_{i})^{2}}}  \]

where N is the number of observations, and $x_{1},x_{2},\ldots ,x_{N}$ are the individual measurements. This formula is given by Wetherill (1977), who states that the estimate of the variance is biased if the measurements are autocorrelated.