PROC FREQ: Risks and Risk Differences

The FREQ Procedure

Risks and Risk Differences

The RISKDIFF option in the TABLES statement provides estimates of risks (or binomial proportions) and risk differences for $\text{[math]}$ tables. This analysis might be appropriate when comparing the proportion of some characteristic for two groups, where row 1 and row 2 correspond to the two groups, and the columns correspond to two possible characteristics or outcomes. For example, the row variable might be a treatment or dose, and the column variable might be the response. See Collett (1991), Fleiss, Levin, and Paik (2003), and Stokes, Davis, and Koch (2000) for more information.

Let the frequencies of the $\text{[math]}$ table be represented as follows.

	Column 1	Column 2	Total
Row 1	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
Row 2	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
Total	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$

For column 1 and column 2, PROC FREQ provides estimates of the row 1 risk (or proportion), the row 2 risk, the overall risk and the risk difference. The risk difference is defined as the row 1 risk minus the row 2 risk. The risks are binomial proportions of their rows (row 1, row 2, or overall), and the computation of their standard errors and confidence limits follow the binomial proportion computations, which are described in the section Binomial Proportion.

The column 1 risk for row 1 is the proportion of row 1 observations classified in column 1,

$\text{[math]}$

This estimates the conditional probability of the column 1 response, given the first level of the row variable.

The column 1 risk for row 2 is the proportion of row 2 observations classified in column 1,

$\text{[math]}$

and the overall column 1 risk is the proportion of all observations classified in column 1,

$\text{[math]}$

The column 1 risk difference compares the risks for the two rows, and it is computed as the column 1 risk for row 1 minus the column 1 risk for row 2,

$\text{[math]}$

The risks and risk difference are defined similarly for column 2.

The standard error of the column 1 risk for row $\text{[math]}$ is computed as

$\text{[math]}$

The standard error of the overall column 1 risk is computed as

$\text{[math]}$

If the two rows represent independent binomial samples, the standard error for the column 1 risk difference is computed as

$\text{[math]}$

The standard errors are computed in a similar manner for the column 2 risks and risk difference.

Confidence Limits

By default, the RISKDIFF option provides standard Wald asymptotic confidence limits for the risks (row 1, row 2, and overall) and the risk difference. The risks are equivalent to binomial proportions of their corresponding rows, and the computations follow the methods in the section Wald Confidence Limits.

The standard Wald asymptotic confidence limits are based on the normal approximation to the binomial distribution. PROC FREQ computes the Wald confidence limits for the risks and risk differences as

$\text{[math]}$

where $\text{[math]}$ is the estimate, $\text{[math]}$ is the $\text{[math]}$ th percentile of the standard normal distribution, and $\text{[math]}$ is the standard error of the estimate. The confidence level $\text{[math]}$ is determined from the value of the ALPHA= option, which, by default, equals 0.05 and produces 95% confidence limits.

If you specify the CORRECT riskdiff-option or the RISKDIFFC option, PROC FREQ includes continuity corrections in the Wald asymptotic confidence limits for the risks and risk differences. The purpose of a continuity correction is to adjust for the difference between the normal approximation and the binomial distribution, which is discrete. See Fleiss, Levin, and Paik (2003) for more information. With the continuity correction, the asymptotic confidence limits are computed as

$\text{[math]}$

where $\text{[math]}$ is the continuity correction. For the row 1 risk, $\text{[math]}$ ; for the row 2 risk, $\text{[math]}$ ; for the overall risk, $\text{[math]}$ ; and for the risk difference, $\text{[math]}$ . The column 1 and column 2 risks use the same continuity corrections.

PROC FREQ also computes exact (Clopper-Pearson) confidence limits for the column 1, column 2, and overall risks. These confidence limits are constructed by inverting the equal-tailed test based on the binomial distribution. PROC FREQ uses the $\text{[math]}$ distribution to compute the Clopper-Pearson confidence limits. See the section Exact (Clopper-Pearson) Confidence Limits for details.

PROC FREQ does not provide exact confidence limits for the risk difference by default. If you specify the RISKDIFF option in the EXACT statement, PROC FREQ provides exact unconditional confidence limits for the risk difference, which are described in the section Exact Unconditional Confidence Limits for the Risk Difference. Note that the conditional exact approach, which is the basis for other exact tests provided by PROC FREQ such as Fisher’s exact test, does not apply to the risk difference due to nuisance parameters. See Agresti (1992) for more information.

Tests

You can specify riskdiff-options to request tests of the risk (or proportion) difference. You can request tests of equality, noninferiority, superiority, and equivalence for the risk difference. The test of equality is a standard Wald asymptotic test, available with or without a continuity correction. For noninferiority, superiority, and equivalence tests of the risk difference, the following test methods are provided: Wald (with and without continuity correction), Hauck-Anderson, Farrington-Manning, and Newcombe score (with and without continuity correction). You can specify the test method with the METHOD= riskdiff-option. By default, PROC FREQ uses METHOD=WALD.

Equality Test

If you specify the EQUAL riskdiff-option, PROC FREQ computes a test of equality, or a test of the null hypothesis that the risk difference equals zero. For the column 1 (or 2) risk difference, this test can be expressed as $\text{[math]}$ versus the alternative $\text{[math]}$ , where $\text{[math]}$ denotes the column 1 (or 2) risk difference. PROC FREQ computes a standard Wald asymptotic test, and the test statistic is

$\text{[math]}$

By default, the standard error is computed from the sample proportions as

$\text{[math]}$

If you specify the VAR=NULL riskdiff-option, the standard error is based on the null hypothesis that the row 1 and row 2 risks are equal,

$\text{[math]}$

where $\text{[math]}$ estimates the overall column 1 risk.

If you specify the CORRECT riskdiff-option or the RISKDIFFC option, PROC FREQ includes a continuity correction in the asymptotic test statistic. If $\text{[math]}$ , the continuity correction is subtracted from $\text{[math]}$ in the numerator of the test statistic; otherwise, the continuity correction is added to the numerator. The value of the continuity correction is $\text{[math]}$ .

PROC FREQ computes one-sided and two-sided $\text{[math]}$ -values for this test. When the test statistic $\text{[math]}$ is greater than 0, PROC FREQ displays the right-sided p-value, which is the probability of a larger value occurring under the null hypothesis. The one-sided p-value can be expressed as

$\text{[math]}$

where $\text{[math]}$ has a standard normal distribution. The two-sided p-value is computed as $\text{[math]}$ .

Noninferiority Test

If you specify the NONINF riskdiff-option, PROC FREQ provides a noninferiority test for the risk difference, or the difference between two proportions. The null hypothesis for the noninferiority test is

$\text{[math]}$

versus the alternative

$\text{[math]}$

where $\text{[math]}$ is the noninferiority margin. Rejection of the null hypothesis indicates that the row 1 risk is not inferior to the row 2 risk. See Chow, Shao, and Wang (2003) for more information.

You can specify the value of $\text{[math]}$ with the MARGIN= riskdiff-option. By default, $\text{[math]}$ . You can specify the test method with the METHOD= riskdiff-option. The following methods are available for the risk difference noninferiority analysis: Wald (with and without continuity correction), Hauck-Anderson, Farrington-Manning, and Newcombe score (with and without continuity correction). The Wald, Hauck-Anderson, and Farrington-Manning methods provide tests and corresponding test-based confidence limits; the Newcombe score method provides only confidence limits. If you do not specify METHOD=, PROC FREQ uses the Wald test by default.

The confidence coefficient for the test-based confidence limits is $\text{[math]}$ % (Schuirmann 1999). By default, if you do not specify the ALPHA= option, these are $\text{[math]}$ % confidence limits. You can compare the confidence limits to the noninferiority limit, – $\text{[math]}$ .

The following sections describe the noninferiority analysis methods for the risk difference.

Wald Test
If you specify the METHOD=WALD riskdiff-option, PROC FREQ provides an asymptotic Wald test of noninferiority for the risk difference. This is also the default method. The Wald test statistic is computed as

$\text{[math]}$

where ( $\text{[math]}$ ) estimates the risk difference and $\text{[math]}$ is the noninferiority margin.

By default, the standard error for the Wald test is computed from the sample proportions as

$\text{[math]}$

If you specify the VAR=NULL riskdiff-option, the standard error is based on the null hypothesis that the risk difference equals – $\text{[math]}$ (Dunnett and Gent 1977). The standard error is computed as

$\text{[math]}$

where

$\text{[math]}$

If you specify the CORRECT riskdiff-option or the RISKDIFFC option, a continuity correction is included in the test statistic. The continuity correction is subtracted from the numerator of the test statistic if the numerator is greater than zero; otherwise, the continuity correction is added to the numerator. The value of the continuity correction is $\text{[math]}$ .

The p-value for the Wald noninferiority test is $\text{[math]}$ , where $\text{[math]}$ has a standard normal distribution.

Hauck-Anderson Test
If you specify the METHOD=HA riskdiff-option, PROC FREQ provides the Hauck-Anderson test for noninferiority. The Hauck-Anderson test statistic is computed as

$\text{[math]}$

where $\text{[math]}$ and the standard error is computed from the sample proportions as

$\text{[math]}$

The Hauck-Anderson continuity correction $\text{[math]}$ is computed as

$\text{[math]}$

The p-value for the Hauck-Anderson noninferiority test is $\text{[math]}$ , where $\text{[math]}$ has a standard normal distribution. See Hauck and Anderson (1986) and Schuirmann (1999) for more information.

Farrington-Manning Test
If you specify the METHOD=FM riskdiff-option, PROC FREQ provides the Farrington-Manning test of noninferiority for the risk difference. The Farrington-Manning test statistic is computed as

$\text{[math]}$

where $\text{[math]}$ and

$\text{[math]}$

where $\text{[math]}$ and $\text{[math]}$ are the maximum likelihood estimators of $\text{[math]}$ and $\text{[math]}$ under the null hypothesis that the risk difference equals – $\text{[math]}$ . The p-value for the Farrington-Manning noninferiority test is then $\text{[math]}$ , where $\text{[math]}$ has a standard normal distribution.

From Farrington and Manning (1990), the solution to the maximum likelihood equation is

$\text{[math]}$

where

$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$

Newcombe Score Confidence Limits
If you specify the METHOD=SCORE riskdiff-option, PROC FREQ provides the Newcombe hybrid score (or Wilson) confidence limits for the risk difference. The confidence coefficient for the confidence limits is $\text{[math]}$ % (Schuirmann 1999). By default, if you do not specify the ALPHA= option, these are $\text{[math]}$ % confidence limits. You can compare the confidence limits to the noninferiority limit, – $\text{[math]}$ .

The Newcombe score confidence limits for the risk difference are constructed from the Wilson score confidence limits for each of the two individual proportions. The confidence limits for the individual proportions are used in the standard error terms of the Wald confidence limits for the proportion difference. See Newcombe (1998) and Barker et al. (2001) for more information.

Wilson score confidence limits for $\text{[math]}$ and $\text{[math]}$ are the roots of

$\text{[math]}$

for $\text{[math]}$ . The confidence limits are computed as

$\text{[math]}$

See the section Wilson (Score) Confidence Limits for details.

Denote the lower and upper Wilson score confidence limits for $\text{[math]}$ as $\text{[math]}$ and $\text{[math]}$ , and denote the lower and upper confidence limits for $\text{[math]}$ as $\text{[math]}$ and $\text{[math]}$ . The Newcombe score confidence limits for the proportion difference ( $\text{[math]}$ ) are computed as

	$\text{[math]}$
	$\text{[math]}$

If you specify the CORRECT riskdiff-option, PROC FREQ provides continuity-corrected Newcombe score confidence limits. By including a continuity correction of $\text{[math]}$ , the Wilson score confidence limits for the individual proportions are the roots of

$\text{[math]}$

These confidence limits for the individual proportions are then used in the standard error terms of the Wald confidence limits for the proportion difference to compute $\text{[math]}$ and $\text{[math]}$ .

Superiority Test

If you specify the SUP riskdiff-option, PROC FREQ provides a superiority test for the risk difference. The null hypothesis is

$\text{[math]}$

versus the alternative

$\text{[math]}$

where $\text{[math]}$ is the superiority margin. Rejection of the null hypothesis indicates that the row 1 proportion is superior to the row 2 proportion. You can specify the value of $\text{[math]}$ with the MARGIN= riskdiff-option. By default, $\text{[math]}$ .

The superiority analysis is identical to the noninferiority analysis but uses a positive value of the margin $\text{[math]}$ in the null hypothesis. The superiority computations follow those in the section Noninferiority Test by replacing – $\text{[math]}$ by $\text{[math]}$ . See Chow, Shao, and Wang (2003) for more information.

Equivalence Tests

If you specify the EQUIV riskdiff-option, PROC FREQ provides an equivalence test for the risk difference, or the difference between two proportions. The null hypothesis for the equivalence test is

$\text{[math]}$

versus the alternative

$\text{[math]}$

where $\text{[math]}$ is the lower margin and $\text{[math]}$ is the upper margin. Rejection of the null hypothesis indicates that the two binomial proportions are equivalent. See Chow, Shao, and Wang (2003) for more information.

You can specify the value of the margins $\text{[math]}$ and $\text{[math]}$ with the MARGIN= riskdiff-option. If you do not specify MARGIN=, PROC FREQ uses lower and upper margins of –0.2 and 0.2 by default. If you specify a single margin value $\text{[math]}$ , PROC FREQ uses lower and upper margins of – $\text{[math]}$ and $\text{[math]}$ . You can specify the test method with the METHOD= riskdiff-option. The following methods are available for the risk difference equivalence analysis: Wald (with and without continuity correction), Hauck-Anderson, Farrington-Manning, and Newcombe’s score (with and without continuity correction). The Wald, Hauck-Anderson, and Farrington-Manning methods provide tests and corresponding test-based confidence limits; the Newcombe score method provides only confidence limits. If you do not specify METHOD=, PROC FREQ uses the Wald test by default.

PROC FREQ computes two one-sided tests (TOST) for equivalence analysis (Schuirmann 1987). The TOST approach includes a right-sided test for the lower margin $\text{[math]}$ and a left-sided test for the upper margin $\text{[math]}$ . The overall p-value is taken to be the larger of the two p-values from the lower and upper tests.

The section Noninferiority Test gives details about the Wald, Hauck-Anderson, Farrington-Manning and Newcombe score methods for the risk difference. The lower margin equivalence test statistic takes the same form as the noninferiority test statistic but uses the lower margin value $\text{[math]}$ in place of – $\text{[math]}$ . The upper margin equivalence test statistic take the same form as the noninferiority test statistic but uses the upper margin value $\text{[math]}$ in place of – $\text{[math]}$ .

The test-based confidence limits for the risk difference are computed according to the equivalence test method that you select. If you specify METHOD=WALD with VAR=NULL, or METHOD=FM, separate standard errors are computed for the lower and upper margin tests. In this case, the test-based confidence limits are computed by using the maximum of these two standard errors. The confidence limits have a confidence coefficient of $\text{[math]}$ % (Schuirmann 1999). By default, if you do not specify the ALPHA= option, these are $\text{[math]}$ % confidence limits. You can compare the confidence limits to the equivalence limits, $\text{[math]}$ .

Exact Unconditional Confidence Limits for the Risk Difference

If you specify the RISKDIFF option in the EXACT statement, PROC FREQ provides exact unconditional confidence limits for the risk difference. Unconditional computations differ from the exact conditional approach that PROC FREQ uses for other exact statistics such as Fisher’s exact test. (See the section Exact Statistics for more information.) Exact conditional inference does not apply to the risk difference due to nuisance parameters. See Agresti (1992) for details. The unconditional approach eliminates nuisance parameters by maximizing the p-value over all possible values of the nuisance parameters (Santner and Snell 1980).

Denote the proportion difference by $\text{[math]}$ . For a $\text{[math]}$ table with row totals $\text{[math]}$ and $\text{[math]}$ , the joint probability function can be expressed in terms of the table cell frequencies and the parameters $\text{[math]}$ and $\text{[math]}$ ,

$\text{[math]}$

When constructing confidence limits for the proportion difference, the parameter of interest is $\text{[math]}$ and $\text{[math]}$ is a nuisance parameter.

Denote the observed value of the proportion difference by $\text{[math]}$ . The $\text{[math]}$ % confidence limits for $\text{[math]}$ are computed as

	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$

where

	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
	$\text{[math]}$	$\text{[math]}$	$\text{[math]}$

The set $\text{[math]}$ includes all $\text{[math]}$ tables with row sums equal to $\text{[math]}$ and $\text{[math]}$ , and $\text{[math]}$ denotes the value of the proportion difference ( $\text{[math]}$ ) for table $\text{[math]}$ in $\text{[math]}$ . To compute $\text{[math]}$ , the sum includes probabilities of those tables for which ( $\text{[math]}$ ), where $\text{[math]}$ is the observed value of the proportion difference. For a fixed value of $\text{[math]}$ , $\text{[math]}$ is taken to be the maximum sum over all possible values of $\text{[math]}$ . See Santner and Snell (1980) and Agresti and Min (2001) for details.

This method of eliminating the nuisance parameter is considered to be a conservative approach (Agresti and Min 2001). Additionally, the confidence limits are conservative for small samples because this is a discrete problem; the confidence coefficient is not exactly ( $\text{[math]}$ ) but is at least ( $\text{[math]}$ ). See Agresti (1992) for more information.

Top of Page