The FREQ Procedure

Common Risk Difference

PROC FREQ provides Mantel-Haenszel and summary score estimates for the common risk (proportion) difference for multiway $2 \times 2$ tables.

Mantel-Haenszel Estimate of the Common Risk Difference

PROC FREQ computes the Mantel-Haenszel (Mantel and Haenszel, 1959) estimate of the common risk difference as

\[  \hat{d}_{\mi {MH}} = \left( \sum _ h \hat{d}_ h w_ h \right) ~  / ~  \left( \sum _ h w_ h \right)  \]

where $\hat{d}_ h$ is the risk difference in stratum h and

\[  w_ h = n_{h1\cdot } n_{h2\cdot } / n_ h  \]

The column 1 risk difference in stratum h is computed as

\[  \hat{d}_ h = \hat{p}_{h1} - \hat{p}_{h2} = (n_{h11} / n_{h1 \cdot }) - (n_{h21} / n_{h2 \cdot })  \]

where $\hat{p}_{h1}$ is the proportion of row 1 observations that are classified in column 1 and $\hat{p}_{h2}$ is the proportion or row 2 observations that are classified in column 1. The column 2 risk is computed in the same way. For more information, see Agresti (2013, p. 231).

PROC FREQ computes the variance of $\hat{d}_{\mi {MH}}$ (Sato, 1989) as

\[  \hat{\sigma }^2(\hat{d}_{\mi {MH}}) = \left( \hat{d}_{\mi {MH}} \sum _ h P_ h + \sum _ h Q_ h \right) / \left( \sum _ h w_ h \right)^2  \]

where

\[  P_ h = \left( n_{h1\cdot }^2 n_{h21} - n_{h2\cdot }^2 n_{h11} + n_{h1\cdot } n_{h2\cdot } (n_{h2\cdot }-n_{h1\cdot })/2 \right) / n_ h^2  \]
\[  Q_ h = \left( n_{h11} ( n_{h2\cdot } - n_{h21} ) + n_{h21} (n_{h1\cdot } - n_{h11} ) \right) / 2 n_ h  \]

The $100(1-\alpha /2)$% confidence limits for the common risk difference are

\[  \hat{d}_{\mi {MH}} ~  \pm ~  \left( z_{\alpha /2} \times \hat{\sigma }(\hat{d}_{\mi {MH}}) \right)  \]
Summary Score Estimate of the Common Risk Difference

PROC FREQ computes the summary score estimate of the common risk difference as described in Agresti (2013, p. 231). This estimate is computed from Miettinen-Nurminen (score) confidence limits for the stratum risk differences. For more information, see the section Miettinen-Nurminen (Score) Confidence Limits. The score confidence interval for the risk difference in stratum h can be expressed as $\hat{d}_ h^\prime \pm z_{\alpha /2} s_ h^\prime $, where $\hat{d}_ h^\prime $ is the midpoint of the score confidence interval and $s_ h^\prime $ is the width of the confidence interval divided by $2z_{\alpha /2}$. The summary score estimate of the common risk difference is computed as

\[  \hat{d}_{\mi {S}} = \sum _ h \hat{d}_ h^\prime w_ h^\prime  \]

where

\[  w_ h^\prime = ( 1 / {s_ h^\prime }^2 ) / \sum _ i (1/{s_ i^\prime }^2)  \]

The variance of $\hat{d}_{\mi {S}}$ is computed as

\[  \hat{\sigma }^2(\hat{d}_{\mi {S}}) = 1 / \sum _ h ( 1 / {s_ h^\prime }^2 )  \]

The $100(1-\alpha /2)$% summary score confidence limits for the common risk difference are

\[  \hat{d}_{\mi {S}} ~  \pm ~  \left( z_{\alpha /2} \times \hat{\sigma }(\hat{d}_{\mi {S}}) \right)  \]