The SEQDESIGN Procedure

Two-Sided Fixed-Sample Tests in Clinical Trials

A two-sided test is a test of a hypothesis with a two-sided alternative. Two-sided tests include simple symmetric tests and more complicated asymmetric tests that might have distinct lower and upper alternative references.

Symmetric Two-Sided Tests for Equality

For a symmetric two-sided test with the null hypothesis $\delta = \delta _{0}$ against the alternative $\delta \neq \delta _{0}$, an equivalent null hypothesis is $H_{0}: \theta = 0$ with a two-sided alternative $H_{1}: \theta \neq 0$, where $\theta = \delta - \delta _{0}$. A fixed-sample test rejects $H_{0}$ if $| \hat{\theta } \sqrt {I_{0}} | \geq C_{\alpha /2}$, where $\hat{\theta }$ is a sample estimate of $\theta $ and $C_{\alpha /2}= {\Phi }^{-1}(1-\alpha /2)$ is the critical value.

A common two-sided test is the test for the response difference between a treatment group and a control group. The null and alternative hypotheses are $H_{0}: \theta = 0$ and $H_{1}: \theta \neq 0$, respectively, where $\theta $ is the response difference between the two groups. If a greater value indicates a beneficial effect, then there are three possible results:

  • The test rejects the hypothesis $H_{0}$ of equality and indicates that the treatment is significantly better if the standardized statistic $Z_{0}= \hat{\theta } \sqrt {I_{0}} \geq C_{\alpha /2}$.

  • The test rejects the hypothesis $H_{0}$ and indicates the treatment is significantly worse if the standardized statistic $Z_{0}= \hat{\theta } \sqrt {I_{0}} \leq -C_{\alpha /2}$.

  • The test indicates no significant difference between the two responses if $-C_{\alpha /2} < \hat{\theta } \sqrt {I_{0}} < C_{\alpha /2}$.

The p-value of the test is $2 (1-{\Phi }(Z_{0}))$ if $Z_{0} > 0$ and $2 {\Phi }(Z_{0})$ if $Z_{0} \leq 0$. The hypothesis $H_{0}$ is rejected if the p-value of the test is less than $\alpha $—that is, if $1-{\Phi }(Z_{0}) < \alpha /2$ or ${\Phi }(Z_{0}) < \alpha /2$. A symmetric $(1-\alpha )$ confidence interval for $\theta $ has lower and upper limits

\[ \left( \hat{\theta } - \frac{C_{\alpha /2}}{\sqrt {I_{0}}} \, , \, \, \, \hat{\theta } + \frac{C_{\alpha /2}}{\sqrt {I_{0}}} \right) \]

which is

\[ \left( \frac{1}{ \sqrt {I_{0}} } \left( Z_{0} - C_{\alpha /2} \right) \, , \, \, \, \frac{1}{ \sqrt {I_{0}} } \left( Z_{0} + C_{\alpha /2} \right) \right) \]

The hypothesis $H_{0}$ is rejected if the confidence interval for the parameter $\theta $ does not contain zero. That is, the lower limit is greater than zero or the upper limit is less than zero.

With an alternative reference $\theta = \theta _1 > 0$, a Type II error probability is defined as

\[ \beta = P_{\theta =\theta _1} ( - C_{\alpha /2} \, \, < \, \, Z_{0} \, \, < \, \, C_{\alpha /2} ) \]

which is

\[ \beta = P_{\theta =\theta _1} \left( (-C_{\alpha /2} - \theta _1 \sqrt {I_{0}}) \, \, < \, \, (Z_{0} - \theta _1 \sqrt {I_{0}}) \, \, < \, \, (C_{\alpha /2} - \theta _1 \sqrt {I_{0}}) \right) \]

Thus

\[ \beta = \Phi \left( C_{\alpha /2} - {\theta }_{1} \sqrt {I_{0}} \right) - \Phi \left( -C_{\alpha /2} - {\theta }_{1} \sqrt {I_{0}} \right) \]

The resulting power $1-\beta $ is the probability of correctly rejecting the null hypothesis, which includes the probability for the lower alternative and the probability for the upper alternative. The SEQDESIGN procedure uses only the probability of correctly rejecting the null hypothesis for the correct alternative in the power computation.

Thus, under the upper alternative hypothesis, the power in the SEQDESIGN procedure is computed as the probability of rejecting the null hypothesis for the upper alternative, $1. - \Phi \left( C_{\alpha /2} - \theta _1 \sqrt {I_{0}} \right) = \Phi \left( \theta _1 \sqrt {I_{0}} - C_{\alpha /2} \right)$, and a very small probability of rejecting the null hypothesis for the lower alternative, $\Phi \left( -C_{\alpha /2} - \theta _1 \sqrt {I_{0}} \right)$, is ignored. This power computation is more rational than the power based on the probability of correctly rejecting the null hypothesis (Whitehead 1997, p. 75).

That is,

\[ \beta = P_{\theta =\theta _1} \left( (Z_{0} - \theta _1 \sqrt {I_{0}}) \, \, < \, \, (C_{\alpha /2} - \theta _1 \sqrt {I_{0}}) \right) = \Phi \left( C_{\alpha /2} - \theta _1 \sqrt {I_{0}} \right) \]

Then with ${\Phi }^{-1} (\beta ) = C_{\alpha /2} - \theta _1 \sqrt {I_{0}}$,

\[ \theta _1 \sqrt {I_{0}} = C_{\alpha /2} - {\Phi }^{-1}(\beta ) = {\Phi }^{-1}(1-\alpha /2) + {\Phi }^{-1}(1-\beta ) \]

The drift parameter $\theta _1 \sqrt {I_{0}}$ can be derived for specified $\alpha $ and $\beta $, and the maximum information is given by

\[ I_{0} = \, \left( \frac{ {\Phi }^{-1}(1-\alpha /2) + {\Phi }^{-1}(1-\beta ) }{ \theta _1 } \right)^{2} \]

If the maximum information is available, then the required sample size can be derived. For example, in a one-sample test for mean, if the standard deviation $\sigma $ is known, the sample size n required for the test is

\[ n= {\sigma }^{2} \, I_{0} = \, {\sigma }^{2} \left( \frac{{\Phi }^{-1}(1-\alpha /2) + {\Phi }^{-1}(1-\beta )}{\theta _1} \right)^{2} \]

On the other hand, if the alternative reference $\theta _1$, standard deviation $\sigma $, and sample size n are all known, then $\alpha $ can be derived with a given $\beta $ and, similarly, $\beta $ can be derived with a given $\alpha $.

Generalized Two-Sided Tests for Equality

For a generalized two-sided test with the null hypothesis $\delta = \delta _{0}$ against the alternative $\delta \neq \delta _{0}$, an equivalent null hypothesis is $H_{0}: \theta \leq 0$ with a two-sided alternative $H_{1}: \theta \neq 0$, where $\theta = \delta - \delta _{0}$. A fixed-sample test rejects $H_{0}$ if the standardized statistic $Z_{0}= \hat{\theta } \sqrt {I_{0}} < -C_{\alpha _{l}}$ or $Z_{0}= \hat{\theta } \sqrt {I_{0}} > C_{\alpha _{u}}$, where the critical values $C_{\alpha _{l}}= {\Phi }^{-1}(1-\alpha _{l})$ and $C_{\alpha _{u}}= {\Phi }^{-1}(1-\alpha _{u})$.

With the lower alternative reference $\theta _{1l} < 0$, a lower Type II error probability is defined as

\[ \beta _{l} = P_{\theta =\theta _{1l}} \left( -C_{\alpha _{l}} \, \leq \, Z_{0l} \sqrt {I_{0}} \right) = P_{\theta =\theta _{1l}} \left( -C_{\alpha _{l}} - \theta _{1l} \sqrt {I_{0}}\, \leq \, Z_{0l} \sqrt {I_{0}} - \theta _{1l} \sqrt {I_{0}} \right) \]

This implies

\[ \beta _{l} = 1 - {\Phi }( -C_{\alpha _{l}} - \theta _{1l} \sqrt {I_{0}} ) \]

and the power is the probability of correctly rejecting the null hypothesis for the lower alternative,

\[ 1 - \beta _{l} = {\Phi }( -C_{\alpha _{l}} - \theta _{1l} \sqrt {I_{0}} ) \]

The lower drift parameter is derived as

\[ \theta _{1l} \sqrt {I_{0}} = - \left( \, {\Phi }^{-1}(1-\alpha _{l}) + {\Phi }^{-1}(1-\beta _{l}) \, \right) \]

Then, with specified $\alpha _{l}$ and $\beta _{l}$, if the maximum information is known, the lower alternative reference $\theta _{1l}$ can be derived. If the maximum information is unknown, then with the specified lower alternative reference $\theta _{1l}$, the maximum information required is

\[ I_{0l} = \, \left( \frac{{\Phi }^{-1}(1-\alpha _{l}) + {\Phi }^{-1}(1-\beta _{l})}{-\theta _{1l}} \right)^{2} \]

Similarly, the upper drift parameter is derived as

\[ \theta _{1u} \sqrt {I_{0}} = {\Phi }^{-1}(1-\alpha _{u}) + {\Phi }^{-1}(1-\beta _{u}) \]

For a given $\alpha _{u}$, $\beta _{u}$, and the upper alternative reference $\theta _{1u}$, the maximum information required is

\[ I_{0u} = \, \left( \frac{ {\Phi }^{-1}(1-\alpha _{u}) + {\Phi }^{-1}(1-\beta _{u}) }{ \theta _{1u} } \right)^{2} \]

Thus, the maximum information required for the design is given by

\[ I_{0} = \mr{max} \, ( \, I_{0l}, \, I_{0u} ) \]

Note that with the maximum information level $I_{0}$, if $I_{0l} < I_{0}$, then the derived power from the lower alternative is larger than the specified $1-\beta _{l}$. Similarly, if $I_{0u} < I_{0}$, then the derived power from the upper alternative is larger than the specified $1-\beta _{u}$.

If maximum information is available, the required sample size can be derived. For example, in a one-sample test for mean, if the standard deviation $\sigma $ is known, the sample size n required for the test is $n= {\sigma }^{2} \,  I_{0}$.

On the other hand, if the alternative references, Type I error probabilities $\alpha _ l$ and $\alpha _ u$, standard deviation $\sigma $, and sample size n are all specified, then the Type II error probabilities $\beta _ l$ and $\beta _ u$ and the corresponding powers can be derived.