The SEQDESIGN Procedure

Example 87.2 Creating a One-Sided O’Brien-Fleming Design

This example demonstrates a group sequential design for a clinical study. A clinic is conducting a study on the effect of vitamin C supplements in treating flu symptoms. The study groups consist of patients in the clinic with their first sign of flu symptoms within the last 24 hours. These individuals are randomly assigned to either the control group, which receives the placebo pills, or the treatment group, which receives large doses of vitamin C supplements. At the end of a five-day period, the flu symptoms of each individual are recorded.

Suppose that from past experience, 60% of individuals experiencing flu symptoms have the symptoms disappeared within five days. The clinic wants to detect a 75% symptoms disappearance with a high probability in the trial. A test that compares the proportions directly is to specify a null hypothesis $H_{0}: \theta = p_{a} - p_{b}= 0$ with a Type I error probability level $\alpha =0.025$, where $p_{a}$ and $p_{b}$ are the proportions of symptoms’ disappearance in the treatment group and control group, respectively. A one-sided alternative $H_{1}: \theta > 0$ is also specified with a power of $1-\beta = 0.90$ at $H_{1}: \theta = 0.15$.

For a one-sided fixed-sample design, the critical value for the standardized Z test statistic is given by $C_{\alpha }= \Phi ^{-1} (1-\alpha )= 1.96$. That is, at the end of study, if the test statistic $z \geq C_{\alpha }$, then the null hypothesis is rejected and the efficacy of vitamin C supplements is declared. Otherwise, the null hypothesis is not rejected and the effect of vitamin C supplements is not significant.

To achieve a $1-\beta = 0.90$ power at $H_{1}: \theta = 0.15$ for a fixed-sample design, the information required is given by

\[  I_{0} = \frac{ ( \Phi ^{-1}(1-\alpha ) + \Phi ^{-1}(1-\beta ) )^{2} }{ {0.15}^{2} } = \frac{ (1.96 + 1.28155)^{2} }{ 0.0225 } = 466.99  \]

With an equal sample size on the treatment and control groups, $N_{a}= N_{b}$, the sample size required for each group under $H_{1}$ is computed from the information $I_{0}$:

\[  N_{a} = N_{b} = \left( p_{1a} \,  (1-p_{1a}) + p_{1b} \,  (1-p_{1b}) \right) \,  I_{0}  \]

where $p_{1a}= 0.75$ and $p_{1b}= 0.60$ are proportions in the treatment and control groups under $H_{1}$. That is,

\[  N_{a} = N_{b} = \left( 0.75 \times 0.25 + 0.6 \times 0.4 \right) \times 466.99 = 199.64  \]

Thus, 200 individuals are required for each group in the fixed-sample study. See the section Test for the Difference between Two Binomial Proportions for a detailed derivation of these required sample sizes.

Instead of a fixed-sample design for the trial, a group sequential design is used to stop the trial early for ethical concerns of possible harm or an unexpected strong efficacy outcome of the new drug. It can also save time and resources in the process. The following statements invoke the SEQDESIGN procedure and request a four-stage group sequential design that uses an O’Brien-Fleming method for normally distributed statistics. The design uses a one-sided alternative hypothesis $H_{1}$ with early stopping to reject or accept $H_{0}$.

ods graphics on;
proc seqdesign altref=0.15
               ;
   OneSidedOBrienFleming: design nstages=4
                          method=obf
                          alt=upper   stop=both
                          alpha=0.025 beta=0.10
                          ;
   samplesize model=twosamplefreq(nullprop=0.6 test=prop);
ods output Boundary=Bnd_Prop;
run;
ods graphics off;

In a sequential design, a hypothesis can be rejected, accepted, or continued to the next time point at each interim stage. The STOP=BOTH option specifies early stopping to reject or accept the null hypothesis. The Design Information, Method Information, and Boundary Information tables are displayed by default.

The Design Information table in Output 87.2.1 displays design specifications and derived statistics such as power and maximum information. With a specified alternative reference, ALTREF=0.15, the maximum information $I_{X}$ is derived.

Output 87.2.1: Design Information

The SEQDESIGN Procedure
Design: OneSidedOBrienFleming

Design Information
Statistic Distribution Normal
Boundary Scale Standardized Z
Alternative Hypothesis Upper
Early Stop Accept/Reject Null
Method O'Brien-Fleming
Boundary Key Both
Alternative Reference 0.15
Number of Stages 4
Alpha 0.025
Beta 0.1
Power 0.9
Max Information (Percent of Fixed Sample) 107.6741
Max Information 502.8343
Null Ref ASN (Percent of Fixed Sample) 61.12891
Alt Ref ASN (Percent of Fixed Sample) 75.89782


The Max Information (Percent Fixed-Sample) is the ratio in percentage between the maximum information for the group sequential design and the information required for a corresponding fixed-sample design:

\[  100 \times \frac{I_{X}}{I_{0}} = 100 \times \frac{502.83}{466.99} = 107.67  \]

That is, if the group sequential trial does not stop at any interim stages, the information needed is 7.67% more than is needed for the corresponding fixed-sample design. For a two-sample test for binomial proportions, the information is proportional to the sample size. Thus, 7.67% more observations are needed for the group sequential trial.

The Null Ref ASN (Percent Fixed-Sample) is the ratio in percentage between the expected sample size required under the null hypothesis for the group sequential design and the sample size required for the corresponding fixed-sample design. With a ratio of 61.1%, the expected sample size for the group sequential trial under the null hypothesis is 61.1% of the sample size in the corresponding fixed-sample design.

Similarly, the Alt Ref ASN (Percent Fixed-Sample) is the ratio in percentage between the expected sample size required under the alternative hypothesis for the group sequential design and the sample size required for the corresponding fixed-sample design. With a ratio of 75.9%, the expected sample size for the group sequential trial under the alternative hypothesis is 75.9% of the sample size in the corresponding fixed-sample design.

For a one-sided design with an upper alternative and early stopping to reject or accept the null hypothesis, upper $\alpha $ and $\beta $ boundaries are created. The Method Information table in Output 87.2.2 displays the Type I error probability $\alpha $, the Type II error probability $\beta $, and the derived drift parameter. The drift parameter is the standardized reference improvement between the alternative and null hypotheses at the final stage. It is also the standardized alternative reference at the final stage if the null reference is zero.

Output 87.2.2: Method Information

Method Information
Boundary Method Alpha Beta Unified Family Alternative
Reference
Drift
Rho Tau C
Upper Alpha O'Brien-Fleming 0.02500 . 0.5 0 1.9784 0.15 3.363595
Upper Beta O'Brien-Fleming . 0.10000 0.5 0 1.3852 0.15 3.363595


With the METHOD=OBF option, the O’Brien-Fleming method is used for each boundary. The O’Brien-Fleming method is one of the unified family methods, and the Method Information table displays the corresponding parameter $\rho $ in the unified family method. The table also displays the critical values $C_{\alpha }=1.9784$ for the $\alpha $ boundary and $C_{\beta }=1.3852$ for the $\beta $ boundary. These critical values are used to create the boundary values.

The Boundary Information table in Output 87.2.3 displays information level, alternative reference, and boundary values at each stage. By default (or equivalently if you specify BOUNDARYSCALE=STDZ), the alternative references and boundary values are displayed with the standardized Z statistic scale. The resulting standardized alternative reference at stage k is given by $\theta _1 \sqrt {I_ k}$, where $\theta _1$ is the alternative reference and $I_ k$ is the information level at stage k, $k= 1, 2, 3, 4$.

Output 87.2.3: Boundary Information

Boundary Information (Standardized Z Scale)
Null Reference = 0
_Stage_   Alternative Boundary Values
Information Level Reference Upper
Proportion Actual N Upper Beta Alpha
1 0.2500 125.7086 107.4808 1.68180 -1.08860 3.95679
2 0.5000 251.4171 214.9617 2.37842 0.41946 2.79788
3 0.7500 377.1257 322.4425 2.91296 1.31347 2.28446
4 1.0000 502.8343 429.9233 3.36360 1.97840 1.97840


By default (or equivalently if you specify INFO=EQUAL), equally spaced information levels are used. An information proportion is the proportion of maximum information available at each stage. With the derived maximum information, the actual information level at each stage is also displayed. With the SAMPLESIZE statement, the required sample size N is also displayed under the heading Information Level.

At each interim stage, if the standardized Z test statistic is larger than or equal to the corresponding upper $\alpha $ boundary, then the hypothesis $H_{0}: \theta = 0$ is rejected. If the test statistic is less than the corresponding upper $\beta $ boundary, then the trial is stopped and the hypothesis $H_{0}$ is accepted. Otherwise, the process continues to the next stage. At the final stage, stage 4, the trial stops and the hypothesis $H_{0}$ is rejected if the standardized Z statistic $Z_{4} \geq 1.9784$. Otherwise, the trial is accepted.

The ODS OUTPUT statement with the BOUNDARY=BND_PROP option creates an output data set that contains the resulting boundary information. After the actual data from the clinical trial are collected and analyzed at each stage with a procedure such as PROC GENMOD, the SEQTEST procedure is used to test the resulting statistics at stage 1 with the boundary information stored in the BOUND_PROP data set.

With ODS Graphics enabled, a detailed boundary plot with the rejection and acceptance regions is displayed, as shown in Output 87.2.4.

Output 87.2.4: Boundary Plot


The horizontal axis indicates the information levels for the design. The stages are indicated by vertical lines with accompanying stage numbers. If at any stage a test statistic is in a rejection region, the trial stops and the hypothesis is rejected. If a test statistic is in an acceptance region, then the trial also stops and the hypothesis is accepted. If the statistic is not in a rejection region or an acceptance region, the trial continues to the next stage. The boundary plot also displays the information level and the critical value for the corresponding fixed-sample design.

The SEQDESIGN procedure derives the drift parameter $\theta _{1} \sqrt {I_{X}}$, where $\theta _{1}$ is the alternative reference and $I_{X}$ is the maximum information. If either $\theta _{1}$ or $I_{X}$ is specified, the other can be derived. With the SAMPLESIZE statement, the maximum information is used to compute the required sample size for the study.

The Sample Size Summary table in Output 87.2.5 displays parameters for the sample size computation. With the MODEL=TWOSAMPLEFREQ( NULLPROP=0.6 TEST=PROP) option in the SAMPLESIZE statement, the total sample size in each group for testing the difference between two proportions is computed. By default (or equivalently if you specify REF=PROP in the MODEL=TWOSAMPLEFREQ option), the required sample sizes are computed under the alternative hypothesis. That is,

\[  N_{a} = N_{b} =\left( \,  p_{1a} \,  (1-p_{1a}) + p_{1b} \,  (1-p_{1b}) \,  \right) \,  I_{X}  \]

where $p_{1b}=0.60$ and $p_{1a}=p_{1b}+\theta _{1}=0.75$ are the proportions in the control and treatment groups, respectively, under the alternative hypothesis $H_{1}$. See the section Test for the Difference between Two Binomial Proportions for a detailed description of these parameters.

Output 87.2.5: Sample Size Summary

Sample Size Summary
Test Two-Sample Proportions
Null Proportion 0.6
Proportion (Group A) 0.75
Test Statistic Z for Proportion
Reference Proportions Alt Ref
Max Sample Size 429.9233
Expected Sample Size (Null Ref) 244.0768
Expected Sample Size (Alt Ref) 303.0464


The Sample Sizes (N) table in Output 87.2.6 displays the required sample sizes at each stage, in both fractional and integer numbers. The derived fractional sample sizes are under the heading Fractional N. These sample sizes are rounded up to integers under the heading Ceiling N. In practice, integer sample sizes are used, and the resulting information levels increase slightly. Thus, 54, 108, 162, and 215 individuals are needed in each group for the four stages, respectively.

Output 87.2.6: Derived Sample Sizes

Sample Sizes (N)
Two-Sample Z Test for Proportion Difference
_Stage_ Fractional N Ceiling N
N N(Grp 1) N(Grp 2) Information N N(Grp 1) N(Grp 2) Information
1 107.48 53.74 53.74 125.7 108 54 54 126.3
2 214.96 107.48 107.48 251.4 216 108 108 252.6
3 322.44 161.22 161.22 377.1 324 162 162 378.9
4 429.92 214.96 214.96 502.8 430 215 215 502.9