The SEQDESIGN Procedure

Example 87.2 Creating a One-Sided O’Brien-Fleming Design

This example demonstrates a group sequential design for a clinical study. A clinic is conducting a study on the effect of vitamin C supplements in treating flu symptoms. The study groups consist of patients in the clinic with their first sign of flu symptoms within the last 24 hours. These individuals are randomly assigned to either the control group, which receives the placebo pills, or the treatment group, which receives large doses of vitamin C supplements. At the end of a five-day period, the flu symptoms of each individual are recorded.

Suppose that from past experience, 60% of individuals experiencing flu symptoms have the symptoms disappeared within five days. The clinic wants to detect a 75% symptoms disappearance with a high probability in the trial. A test that compares the proportions directly is to specify a null hypothesis $H_{0}: \theta = p_{a} - p_{b}= 0$ with a Type I error probability level $\alpha =0.025$ , where $p_{a}$ and $p_{b}$ are the proportions of symptoms’ disappearance in the treatment group and control group, respectively. A one-sided alternative $H_{1}: \theta > 0$ is also specified with a power of $1-\beta = 0.90$ at $H_{1}: \theta = 0.15$ .

For a one-sided fixed-sample design, the critical value for the standardized Z test statistic is given by $C_{\alpha }= \Phi ^{-1} (1-\alpha )= 1.96$ . That is, at the end of study, if the test statistic $z \geq C_{\alpha }$ , then the null hypothesis is rejected and the efficacy of vitamin C supplements is declared. Otherwise, the null hypothesis is not rejected and the effect of vitamin C supplements is not significant.

To achieve a $1-\beta = 0.90$ power at $H_{1}: \theta = 0.15$ for a fixed-sample design, the information required is given by

$I_{0} = \frac{ ( \Phi ^{-1}(1-\alpha ) + \Phi ^{-1}(1-\beta ) )^{2} }{ {0.15}^{2} } = \frac{ (1.96 + 1.28155)^{2} }{ 0.0225 } = 466.99$

With an equal sample size on the treatment and control groups, $N_{a}= N_{b}$ , the sample size required for each group under $H_{1}$ is computed from the information $I_{0}$ :

$N_{a} = N_{b} = \left( p_{1a} \, (1-p_{1a}) + p_{1b} \, (1-p_{1b}) \right) \, I_{0}$

where $p_{1a}= 0.75$ and $p_{1b}= 0.60$ are proportions in the treatment and control groups under $H_{1}$ . That is,

$N_{a} = N_{b} = \left( 0.75 \times 0.25 + 0.6 \times 0.4 \right) \times 466.99 = 199.64$

Thus, 200 individuals are required for each group in the fixed-sample study. See the section Test for the Difference between Two Binomial Proportions for a detailed derivation of these required sample sizes.

Instead of a fixed-sample design for the trial, a group sequential design is used to stop the trial early for ethical concerns of possible harm or an unexpected strong efficacy outcome of the new drug. It can also save time and resources in the process. The following statements invoke the SEQDESIGN procedure and request a four-stage group sequential design that uses an O’Brien-Fleming method for normally distributed statistics. The design uses a one-sided alternative hypothesis $H_{1}$ with early stopping to reject or accept $H_{0}$ .

ods graphics on;
proc seqdesign altref=0.15
               ;
   OneSidedOBrienFleming: design nstages=4
                          method=obf
                          alt=upper   stop=both
                          alpha=0.025 beta=0.10
                          ;
   samplesize model=twosamplefreq(nullprop=0.6 test=prop);
ods output Boundary=Bnd_Prop;
run;
ods graphics off;

In a sequential design, a hypothesis can be rejected, accepted, or continued to the next time point at each interim stage. The STOP=BOTH option specifies early stopping to reject or accept the null hypothesis. The “Design Information,” “Method Information,” and “Boundary Information” tables are displayed by default.

The “Design Information” table in Output 87.2.1 displays design specifications and derived statistics such as power and maximum information. With a specified alternative reference, ALTREF=0.15, the maximum information $I_{X}$ is derived.

Output 87.2.1: Design Information

The SEQDESIGN Procedure

Design: OneSidedOBrienFleming

Design Information
Statistic Distribution	Normal
Boundary Scale	Standardized Z
Alternative Hypothesis	Upper
Early Stop	Accept/Reject Null
Method	O'Brien-Fleming
Boundary Key	Both
Alternative Reference	0.15
Number of Stages	4
Alpha	0.025
Beta	0.1
Power	0.9
Max Information (Percent of Fixed Sample)	107.6741
Max Information	502.8343
Null Ref ASN (Percent of Fixed Sample)	61.12891
Alt Ref ASN (Percent of Fixed Sample)	75.89782

The Max Information (Percent Fixed-Sample) is the ratio in percentage between the maximum information for the group sequential design and the information required for a corresponding fixed-sample design:

$100 \times \frac{I_{X}}{I_{0}} = 100 \times \frac{502.83}{466.99} = 107.67$

That is, if the group sequential trial does not stop at any interim stages, the information needed is 7.67% more than is needed for the corresponding fixed-sample design. For a two-sample test for binomial proportions, the information is proportional to the sample size. Thus, 7.67% more observations are needed for the group sequential trial.

The Null Ref ASN (Percent Fixed-Sample) is the ratio in percentage between the expected sample size required under the null hypothesis for the group sequential design and the sample size required for the corresponding fixed-sample design. With a ratio of 61.1%, the expected sample size for the group sequential trial under the null hypothesis is 61.1% of the sample size in the corresponding fixed-sample design.

Similarly, the Alt Ref ASN (Percent Fixed-Sample) is the ratio in percentage between the expected sample size required under the alternative hypothesis for the group sequential design and the sample size required for the corresponding fixed-sample design. With a ratio of 75.9%, the expected sample size for the group sequential trial under the alternative hypothesis is 75.9% of the sample size in the corresponding fixed-sample design.

For a one-sided design with an upper alternative and early stopping to reject or accept the null hypothesis, upper $\alpha$ and $\beta$ boundaries are created. The “Method Information” table in Output 87.2.2 displays the Type I error probability $\alpha$ , the Type II error probability $\beta$ , and the derived drift parameter. The drift parameter is the standardized reference improvement between the alternative and null hypotheses at the final stage. It is also the standardized alternative reference at the final stage if the null reference is zero.

Output 87.2.2: Method Information

Method Information
Boundary	Method	Alpha	Beta	Unified Family			Alternative Reference	Drift
Boundary	Method	Alpha	Beta	Rho	Tau	C	Alternative Reference	Drift
Upper Alpha	O'Brien-Fleming	0.02500	.	0.5	0	1.9784	0.15	3.363595
Upper Beta	O'Brien-Fleming	.	0.10000	0.5	0	1.3852	0.15	3.363595

With the METHOD=OBF option, the O’Brien-Fleming method is used for each boundary. The O’Brien-Fleming method is one of the unified family methods, and the “Method Information” table displays the corresponding parameter $\rho$ in the unified family method. The table also displays the critical values $C_{\alpha }=1.9784$ for the $\alpha$ boundary and $C_{\beta }=1.3852$ for the $\beta$ boundary. These critical values are used to create the boundary values.

The “Boundary Information” table in Output 87.2.3 displays information level, alternative reference, and boundary values at each stage. By default (or equivalently if you specify BOUNDARYSCALE=STDZ), the alternative references and boundary values are displayed with the standardized Z statistic scale. The resulting standardized alternative reference at stage k is given by $\theta _1 \sqrt {I_ k}$ , where $\theta _1$ is the alternative reference and $I_ k$ is the information level at stage k, $k= 1, 2, 3, 4$ .

Output 87.2.3: Boundary Information

Boundary Information (Standardized Z Scale) Null Reference = 0
_Stage_				Alternative	Boundary Values
	Information Level			Reference	Upper
	Proportion	Actual	N	Upper	Beta	Alpha
1	0.2500	125.7086	107.4808	1.68180	-1.08860	3.95679
2	0.5000	251.4171	214.9617	2.37842	0.41946	2.79788
3	0.7500	377.1257	322.4425	2.91296	1.31347	2.28446
4	1.0000	502.8343	429.9233	3.36360	1.97840	1.97840

By default (or equivalently if you specify INFO=EQUAL), equally spaced information levels are used. An information proportion is the proportion of maximum information available at each stage. With the derived maximum information, the actual information level at each stage is also displayed. With the SAMPLESIZE statement, the required sample size N is also displayed under the heading “Information Level.”

At each interim stage, if the standardized Z test statistic is larger than or equal to the corresponding upper $\alpha$ boundary, then the hypothesis $H_{0}: \theta = 0$ is rejected. If the test statistic is less than the corresponding upper $\beta$ boundary, then the trial is stopped and the hypothesis $H_{0}$ is accepted. Otherwise, the process continues to the next stage. At the final stage, stage 4, the trial stops and the hypothesis $H_{0}$ is rejected if the standardized Z statistic $Z_{4} \geq 1.9784$ . Otherwise, the trial is accepted.

The ODS OUTPUT statement with the BOUNDARY=BND_PROP option creates an output data set that contains the resulting boundary information. After the actual data from the clinical trial are collected and analyzed at each stage with a procedure such as PROC GENMOD, the SEQTEST procedure is used to test the resulting statistics at stage 1 with the boundary information stored in the BOUND_PROP data set.

With ODS Graphics enabled, a detailed boundary plot with the rejection and acceptance regions is displayed, as shown in Output 87.2.4.

Output 87.2.4: Boundary Plot

The horizontal axis indicates the information levels for the design. The stages are indicated by vertical lines with accompanying stage numbers. If at any stage a test statistic is in a rejection region, the trial stops and the hypothesis is rejected. If a test statistic is in an acceptance region, then the trial also stops and the hypothesis is accepted. If the statistic is not in a rejection region or an acceptance region, the trial continues to the next stage. The boundary plot also displays the information level and the critical value for the corresponding fixed-sample design.

The SEQDESIGN procedure derives the drift parameter $\theta _{1} \sqrt {I_{X}}$ , where $\theta _{1}$ is the alternative reference and $I_{X}$ is the maximum information. If either $\theta _{1}$ or $I_{X}$ is specified, the other can be derived. With the SAMPLESIZE statement, the maximum information is used to compute the required sample size for the study.

The “Sample Size Summary” table in Output 87.2.5 displays parameters for the sample size computation. With the MODEL=TWOSAMPLEFREQ( NULLPROP=0.6 TEST=PROP) option in the SAMPLESIZE statement, the total sample size in each group for testing the difference between two proportions is computed. By default (or equivalently if you specify REF=PROP in the MODEL=TWOSAMPLEFREQ option), the required sample sizes are computed under the alternative hypothesis. That is,

$N_{a} = N_{b} =\left( \, p_{1a} \, (1-p_{1a}) + p_{1b} \, (1-p_{1b}) \, \right) \, I_{X}$

where $p_{1b}=0.60$ and $p_{1a}=p_{1b}+\theta _{1}=0.75$ are the proportions in the control and treatment groups, respectively, under the alternative hypothesis $H_{1}$ . See the section Test for the Difference between Two Binomial Proportions for a detailed description of these parameters.

Output 87.2.5: Sample Size Summary

Sample Size Summary
Test	Two-Sample Proportions
Null Proportion	0.6
Proportion (Group A)	0.75
Test Statistic	Z for Proportion
Reference Proportions	Alt Ref
Max Sample Size	429.9233
Expected Sample Size (Null Ref)	244.0768
Expected Sample Size (Alt Ref)	303.0464

The “Sample Sizes (N)” table in Output 87.2.6 displays the required sample sizes at each stage, in both fractional and integer numbers. The derived fractional sample sizes are under the heading “Fractional N.” These sample sizes are rounded up to integers under the heading “Ceiling N.” In practice, integer sample sizes are used, and the resulting information levels increase slightly. Thus, 54, 108, 162, and 215 individuals are needed in each group for the four stages, respectively.

Output 87.2.6: Derived Sample Sizes

Sample Sizes (N) Two-Sample Z Test for Proportion Difference
_Stage_	Fractional N				Ceiling N
_Stage_	N	N(Grp 1)	N(Grp 2)	Information	N	N(Grp 1)	N(Grp 2)	Information
1	107.48	53.74	53.74	125.7	108	54	54	126.3
2	214.96	107.48	107.48	251.4	216	108	108	252.6
3	322.44	161.22	161.22	377.1	324	162	162	378.9
4	429.92	214.96	214.96	502.8	430	215	215	502.9