The SURVEYFREQ Procedure

Rao-Scott Likelihood Ratio Chi-Square Test

The Rao-Scott likelihood ratio chi-square test is a design-adjusted version of the likelihood ratio test, which involves ratios of observed and expected frequencies. For information about design-adjusted chi-square tests, see Lohr (2010, Section 10.3.2), Rao and Scott (1981), Rao and Scott (1984), Rao and Scott (1987), and Thomas, Singh, and Roberts (1996).

PROC SURVEYFREQ provides a first-order Rao-Scott likelihood ratio test by default. If you specify the LRCHISQ(SECONDORDER) option, PROC SURVEYFREQ provides a second-order (Satterthwaite) likelihood ratio chi-square test.

The procedure computes the Rao-Scott likelihood ratio test by applying design adjustments to the weighted likelihood ratio statistic that is based on estimated totals. This computation is identical to the Rao-Scott chi-square test computation except that it uses the likelihood ratio statistic $G^2$ in place of the Pearson chi-square statistic $Q_{\mi{P}}$. For more information, see the section Rao-Scott Chi-Square Test.

One-Way Tables

For one-way tables, the LRCHISQ option provides a Rao-Scott (design-based) goodness-of-fit test for one-way tables. By default, this is a test for the null hypothesis of equal proportions. If you specify null hypothesis proportions in the TESTP= option, the goodness-of-fit test uses the specified proportions.

The Rao-Scott likelihood ratio test uses the likelihood ratio statistic that is based on the estimated totals,

\[ G^2 = 2 ~ ~ (~ n ~ / ~ \widehat{N} ~ ) ~ ~ \sum _ c ~ \widehat{N}_ c ~ \ln \left( \widehat{N}_ c ~ / ~ E_ c \right) \]

where n is the sample size, $\widehat{N}$ is the estimated overall total, $\widehat{N}_ c$ is the estimated total for level c, and $E_ c$ is the expected total for level c under the null hypothesis. For the null hypothesis of equal proportions, the expected total for each level is

\[ E_ c = \widehat{N} ~ / ~ C \]

For specified null proportions, the expected total for level c is

\[ E_ c = \widehat{N} \times P^{~ 0}_ c \]

where $P^{~ 0}_ c$ is the null proportion that you specify for level c.

The computation of the Rao-Scott likelihood ratio test for one-way tables uses $G^2$ in place of $Q_{\mi{P}}$ in the Rao-Scott chi-square test computation and is otherwise identical to the chi-square test computation. For more information, see the sections First-Order Test and Second-Order Test.

If you specify the LRCHISQ(MODIFIED) option, PROC SURVEYFREQ computes the design corrections by using null hypothesis proportions instead of proportion estimates. By default, null hypothesis proportions are equal proportions for all levels of the one-way table. Alternatively, you can specify null proportion values in the TESTP= option.

Two-Way Tables

For two-way tables, the LRCHISQ option provides a Rao-Scott (design-based) test of association between the row and column variables.

The Rao-Scott likelihood ratio test uses the likelihood ratio statistic that is based on the estimated totals,

\[ G^2 = 2 ~ (n / \widehat{N}) ~ \sum _ r \sum _ c \widehat{N}_{rc} ~ \ln \left( \widehat{N}_{rc} ~ / ~ E_{rc} \right) \]

where n is the sample size, $\widehat{N}$ is the estimated overall total, $\widehat{N}_{rc}$ is the estimated total for table cell (r, c), and $E_{rc}$ is the expected total for cell (r, c) under the null hypothesis of no association. The expected total for cell (r, c) is

\[ E_{rc} = \widehat{N}_{r \cdot } ~ \widehat{N}_{\cdot c} ~ / ~ \widehat{N} \]

The computation of the Rao-Scott likelihood ratio test for two-way tables uses $G^2$ in place of $Q_{\mi{P}}$ in the Rao-Scott chi-square test computation and is otherwise identical to the chi-square test computation. For more information, see the sections First-Order Test and Second-Order Test.

By default for a first-order test, PROC SURVEYFREQ computes the design correction from proportion estimates. If you specify the LRCHISQ(MODIFIED) option for a first-order test, the procedure computes the design correction from null hypothesis proportions.

Second-order tests, which you request by specifying the LRCHISQ(SECONDORDER) option, are computed by applying both first-order and second-order design corrections to the weighted likelihood ratio statistic. For second-order tests for two-way tables, PROC SURVEYFREQ always uses null hypothesis proportions to compute both the first-order and second-order design corrections.