The SEQDESIGN Procedure

Boundary Scales

The boundaries computed by the SEQDESIGN procedure are applied to test statistics computed during the analysis, and so generally, the scale you select for the boundaries is determined by the scale of the statistics that you will be using.

The following scales are available in the SEQDESIGN procedure:

  • MLE, maximum likelihood estimate

  • standardized Z

  • score statistic S

  • p-value

These scales are all equivalent for a given set of boundary values—that is, there exists a unique transformation between any two of these scales. If you know the boundary values in terms of statistics from one scale, you can uniquely derive the boundary values of statistics for other scales. You can specify the scale with the BOUNDARYSCALE= option; the default is BOUNDARYSCALE=STDZ, the standardized Z scale.

You can also select the boundary scale to better examine the features of an individual group sequential design or to compare features among multiple designs. For example, with the standardized Z scale, the boundary values for the Pocock design are identical across all stages, and the O’Brien-Fleming design has boundary values (in absolute value) that decrease over the stages.

The remaining section demonstrates the transformations from one scale to the other scales. If the maximum likelihood estimate $\hat{\theta }$ is computed by the analysis, then

\[  \hat{\theta } \sim N \left( \,  \theta , \,  \,  \frac{1}{I} \right)  \]

where I is the Fisher information if it does not depend on $\theta $. Otherwise, I is either the expected Fisher information evaluated at $\hat{\theta }$ or the observed Fisher information. See the section Maximum Likelihood Estimator for a detailed description of these statistics.

With the MLE statistic $\hat{\theta }$, the corresponding standardized Z statistic is computed as

\[  Z = \hat{\theta } \,  \sqrt {I} \sim N \left( \,  \theta \,  \sqrt {I}, \,  1 \right)  \]

and the corresponding score statistic is computed as

\[  S = \hat{\theta } \,  I \sim N \left( \,  \theta \,  I, \,  I \right)  \]

Similarly, if a score statistic S is computed by the analysis, then with

\[  S \sim N \left( \,  \theta \,  I, \,  I \right)  \]

where I is the information, either an expected Fisher information ($E_{\theta =0} (I(\theta ))$ or $E_{\theta ={\hat{\theta }}} (I(\theta ))$) or an observed Fisher information ($I(0)$ or $I({\hat{\theta }})$).

The corresponding standardized Z statistic is computed as

\[  Z = \frac{S}{\sqrt {I}} \sim N \left( \,  \theta \sqrt {I}, \,  1 \right)  \]

and the corresponding MLE-scaled statistic is computed as

\[  \hat{\theta } = \frac{S}{I} \sim N \left( \,  \theta , \,  \,  \frac{1}{I} \right)  \]

With a standardized normal Z statistic, the corresponding fixed-sample nominal p-value depends on the type of alternative hypothesis. With an upper alternative, the nominal p-value is defined as the one-sided p-value under the null hypothesis $H_{0}: \theta = 0$ with an upper alternative:

\[  p_{k} = 1 - \Phi ( Z )  \]

With a lower alternative or a two-sided alternative, the nominal p-value is defined as the one-sided p-value under the null hypothesis $H_{0}: \theta = 0$ with a lower alternative:

\[  p_{k} = \Phi ( Z )  \]

which is an increasing function of the standardized Z statistic (Emerson, Kittelson, and Gillen, 2005, p. 12).

The BOUNDARYSCALE= MLE, STDZ, SCORE, and PVALUE options display the boundary values in the MLE, standardize Z, score, and p-value scales, respectively. For example, suppose ${y}_{k1}, {y}_{k2}, \ldots , {y}_{kn_{k}}$ are $n_{k}$ observations of a response variable Y in a data set from a normal distribution with an unknown mean $\mu $ and a known variance ${\sigma }^{2}$. Then

\[  y_{kj} \sim N \left( \,  \mu , \,  {\sigma }^{2} \right)  \]

for $k=1, 2, \ldots , K$, where K is the number of groups and $n_{k}$ is the number of observations at group k.

If $N_{k}$ is the cumulative number of observations for the first k groups, then the sample mean from these $N_{k}$ observations

\[  {\overline{y}}_{k} = \frac{1}{N_{k}} \sum _{j=1}^{N_{k}} y_{kj}  \]

has a normal distribution with mean $\theta $ and variance ${\sigma }^{2}/N_{k}$:

\[  {\overline{y}}_{k} \sim N \left( \,  \theta , \,  \frac{{\sigma }^{2}}{N_{k}} \right)  \]

To test the null hypothesis $\mu = \mu _{0}$, $H_{0}: \theta = 0$, where $\theta = \mu - \mu _{0}$ can be used. The MLE of $\theta $ is $\hat{\theta }_{k} = {\overline{y}}_{k} - \mu _0$ and

\[  \hat{\theta }_{k} \sim N \left( \,  \theta , \,  \frac{1}{I_{k}} \right)  \]

where the information is the inverse of the variance of ${\overline{y}}_{k}$,

\[  I_{k} = \frac{N_{k}}{{\sigma }^{2}}  \]

The corresponding standardized Z statistic is

\[  Z_{k} = \hat{\theta }_{k} \,  I^{\frac{1}{2}}_{k} \sim N \left( \,  \theta \,  I^{\frac{1}{2}}_{k}, \,  1 \right)  \]

The score statistic in the SEQDESIGN procedure is then given by

\[  S_{k} = \hat{\theta }_{k} \,  I_{k} = Z_{k} \,  I^{\frac{1}{2}}_{k} \sim N \left( \,  \theta \,  I_{k}, \,  I_{k} \right)  \]

For a null hypothesis $H_{0}: \theta =0$ with an upper alternative, the nominal p-value of the standardized Z statistic is $p_{k} = 1 - \Phi ( Z_{k} )$. For a null hypothesis $H_{0}: \theta =0$ with a lower alternative or a two-sided alternative, the nominal p-value of the standardized Z statistic is $p_{k} = \Phi ( Z_{k} )$.