PROC FREQ: Chi-Square Tests and Statistics

The FREQ Procedure

Chi-Square Tests and Statistics

The CHISQ option provides chi-square tests of homogeneity or independence and measures of association based on the chi-square statistic. When you specify the CHISQ option in the TABLES statement, PROC FREQ computes the following chi-square tests for each two-way table: the Pearson chi-square, likelihood-ratio chi-square, and Mantel-Haenszel chi-square. PROC FREQ provides the following measures of association based on the Pearson chi-square statistic: the phi coefficient, contingency coefficient, and Cramer’s $\text{[math]}$ . For $\text{[math]}$ tables, the CHISQ option also provides Fisher’s exact test and the continuity-adjusted chi-square. You can request Fisher’s exact test for general $\text{[math]}$ tables by specifying the FISHER option in the TABLES or EXACT statement.

For one-way frequency tables, the CHISQ option provides a chi-square goodness-of-fit test. The other chi-square tests and statistics described in this section are computed only for two-way tables.

All of the two-way test statistics described in this section test the null hypothesis of no association between the row variable and the column variable. When the sample size $\text{[math]}$ is large, these test statistics have an asymptotic chi-square distribution when the null hypothesis is true. When the sample size is not large, exact tests might be useful. PROC FREQ provides exact tests for the Pearson chi-square, the likelihood-ratio chi-square, and the Mantel-Haenszel chi-square (in addition to Fisher’s exact test). PROC FREQ also provides an exact chi-square goodness-of-fit test for one-way tables. You can request these exact tests by specifying the corresponding options in the EXACT statement. See the section Exact Statistics for more information.

Note that the Mantel-Haenszel chi-square statistic is appropriate only when both variables lie on an ordinal scale. The other chi-square tests and statistics in this section are appropriate for either nominal or ordinal variables. The following sections give the formulas that PROC FREQ uses to compute the chi-square tests and statistics. See Agresti (2007), Stokes, Davis, and Koch (2000), and the other references cited for each statistic for more information.

Chi-Square Test for One-Way Tables

For one-way frequency tables, the CHISQ option in the TABLES statement provides a chi-square goodness-of-fit test. Let $\text{[math]}$ denote the number of classes, or levels, in the one-way table. Let $\text{[math]}$ denote the frequency of class $\text{[math]}$ (or the number of observations in class $\text{[math]}$ ) for $\text{[math]}$ . Then PROC FREQ computes the one-way chi-square statistic as

$\text{[math]}$

where $\text{[math]}$ is the expected frequency for class $\text{[math]}$ under the null hypothesis.

In the test for equal proportions, which is the default for the CHISQ option, the null hypothesis specifies equal proportions of the total sample size for each class. Under this null hypothesis, the expected frequency for each class equals the total sample size divided by the number of classes,

$\text{[math]}$

In the test for specified frequencies, which PROC FREQ computes when you input null hypothesis frequencies by using the TESTF= option, the expected frequencies are the TESTF= values that you specify. In the test for specified proportions, which PROC FREQ computes when you input null hypothesis proportions by using the TESTP= option, the expected frequencies are determined from the specified TESTP= proportions $\text{[math]}$ as

$\text{[math]}$

Under the null hypothesis (of equal proportions, specified frequencies, or specified proportions), $\text{[math]}$ has an asymptotic chi-square distribution with $\text{[math]}$ degrees of freedom.

In addition to the asymptotic test, you can request an exact one-way chi-square test by specifying the CHISQ option in the EXACT statement. See the section Exact Statistics for more information.

Pearson Chi-Square Test for Two-Way Tables

The Pearson chi-square for two-way tables involves the differences between the observed and expected frequencies, where the expected frequencies are computed under the null hypothesis of independence. The Pearson chi-square statistic is computed as

$\text{[math]}$

where $\text{[math]}$ is the observed frequency in table cell ( $\text{[math]}$ ) and $\text{[math]}$ is the expected frequency for table cell ( $\text{[math]}$ ). The expected frequency is computed under the null hypothesis that the row and column variables are independent,

$\text{[math]}$

When the row and column variables are independent, $\text{[math]}$ has an asymptotic chi-square distribution with $\text{[math]}$ degrees of freedom. For large values of $\text{[math]}$ , this test rejects the null hypothesis in favor of the alternative hypothesis of general association.

In addition to the asymptotic test, you can request an exact Pearson chi-square test by specifying the PCHI or CHISQ option in the EXACT statement. See the section Exact Statistics for more information.

For $\text{[math]}$ tables, the Pearson chi-square is also appropriate for testing the equality of two binomial proportions. For $\text{[math]}$ and $\text{[math]}$ tables, the Pearson chi-square tests the homogeneity of proportions. See Fienberg (1980) for details.

Likelihood-Ratio Chi-Square Test

The likelihood-ratio chi-square involves the ratios between the observed and expected frequencies. The likelihood-ratio chi-square statistic is computed as

$\text{[math]}$

where $\text{[math]}$ is the observed frequency in table cell ( $\text{[math]}$ ) and $\text{[math]}$ is the expected frequency for table cell ( $\text{[math]}$ ).

When the row and column variables are independent, $\text{[math]}$ has an asymptotic chi-square distribution with $\text{[math]}$ degrees of freedom.

In addition to the asymptotic test, you can request an exact likelihood-ratio chi-square test by specifying the LRCHI or CHISQ option in the EXACT statement. See the section Exact Statistics for more information.

Continuity-Adjusted Chi-Square Test

The continuity-adjusted chi-square for $\text{[math]}$ tables is similar to the Pearson chi-square, but it is adjusted for the continuity of the chi-square distribution. The continuity-adjusted chi-square is most useful for small sample sizes. The use of the continuity adjustment is somewhat controversial; this chi-square test is more conservative (and more like Fisher’s exact test) when the sample size is small. As the sample size increases, the continuity-adjusted chi-square becomes more like the Pearson chi-square.

The continuity-adjusted chi-square statistic is computed as

$\text{[math]}$

Under the null hypothesis of independence, $\text{[math]}$ has an asymptotic chi-square distribution with $\text{[math]}$ degrees of freedom.

Mantel-Haenszel Chi-Square Test

The Mantel-Haenszel chi-square statistic tests the alternative hypothesis that there is a linear association between the row variable and the column variable. Both variables must lie on an ordinal scale. The Mantel-Haenszel chi-square statistic is computed as

$\text{[math]}$

where $\text{[math]}$ is the Pearson correlation between the row variable and the column variable. For a description of the Pearson correlation, see the Pearson Correlation Coefficient. The Pearson correlation and thus the Mantel-Haenszel chi-square statistic use the scores that you specify in the SCORES= option in the TABLES statement. See Mantel and Haenszel (1959) and Landis, Heyman, and Koch (1978) for more information.

Under the null hypothesis of no association, $\text{[math]}$ has an asymptotic chi-square distribution with one degree of freedom.

In addition to the asymptotic test, you can request an exact Mantel-Haenszel chi-square test by specifying the MHCHI or CHISQ option in the EXACT statement. See the section Exact Statistics for more information.

Fisher’s Exact Test

Fisher’s exact test is another test of association between the row and column variables. This test assumes that the row and column totals are fixed, and then uses the hypergeometric distribution to compute probabilities of possible tables conditional on the observed row and column totals. Fisher’s exact test does not depend on any large-sample distribution assumptions, and so it is appropriate even for small sample sizes and for sparse tables.

2 $\text{[math]}$ 2 Tables

For $\text{[math]}$ tables, PROC FREQ gives the following information for Fisher’s exact test: table probability, two-sided $\text{[math]}$ -value, left-sided $\text{[math]}$ -value, and right-sided $\text{[math]}$ -value. The table probability equals the hypergeometric probability of the observed table, and is in fact the value of the test statistic for Fisher’s exact test.

Where $\text{[math]}$ is the hypergeometric probability of a specific table with the observed row and column totals, Fisher’s exact $\text{[math]}$ -values are computed by summing probabilities $\text{[math]}$ over defined sets of tables,

$\text{[math]}$

The two-sided $\text{[math]}$ -value is the sum of all possible table probabilties (conditional on the observed row and column totals) that are less than or equal to the observed table probability. For the two-sided $\text{[math]}$ -value, the set $\text{[math]}$ includes all possible tables with hypergeometric probabilities less than or equal to the probability of the observed table. A small two-sided $\text{[math]}$ -value supports the alternative hypothesis of association between the row and column variables.

For $\text{[math]}$ tables, one-sided $\text{[math]}$ -values for Fisher’s exact test are defined in terms of the frequency of the cell in the first row and first column of the table, the (1,1) cell. Denoting the observed (1,1) cell frequency by $\text{[math]}$ , the left-sided $\text{[math]}$ -value for Fisher’s exact test is the probability that the (1,1) cell frequency is less than or equal to $\text{[math]}$ . For the left-sided $\text{[math]}$ -value, the set $\text{[math]}$ includes those tables with a (1,1) cell frequency less than or equal to $\text{[math]}$ . A small left-sided $\text{[math]}$ -value supports the alternative hypothesis that the probability of an observation being in the first cell is actually less than expected under the null hypothesis of independent row and column variables.

Similarly, for a right-sided alternative hypothesis, $\text{[math]}$ is the set of tables where the frequency of the (1,1) cell is greater than or equal to that in the observed table. A small right-sided $\text{[math]}$ -value supports the alternative that the probability of the first cell is actually greater than that expected under the null hypothesis.

Because the (1,1) cell frequency completely determines the $\text{[math]}$ table when the marginal row and column sums are fixed, these one-sided alternatives can be stated equivalently in terms of other cell probabilities or ratios of cell probabilities. The left-sided alternative is equivalent to an odds ratio less than 1, where the odds ratio equals ( $\text{[math]}$ ). Additionally, the left-sided alternative is equivalent to the column 1 risk for row 1 being less than the column 1 risk for row 2, $\text{[math]}$ . Similarly, the right-sided alternative is equivalent to the column 1 risk for row 1 being greater than the column 1 risk for row 2, $\text{[math]}$ . See Agresti (2007) for details.

R $\text{[math]}$ C Tables

Fisher’s exact test was extended to general $\text{[math]}$ tables by Freeman and Halton (1951), and this test is also known as the Freeman-Halton test. For $\text{[math]}$ tables, the two-sided $\text{[math]}$ -value definition is the same as for $\text{[math]}$ tables. The set $\text{[math]}$ contains all tables with $\text{[math]}$ less than or equal to the probability of the observed table. A small $\text{[math]}$ -value supports the alternative hypothesis of association between the row and column variables. For $\text{[math]}$ tables, Fisher’s exact test is inherently two-sided. The alternative hypothesis is defined only in terms of general, and not linear, association. Therefore, Fisher’s exact test does not have right-sided or left-sided $\text{[math]}$ -values for general $\text{[math]}$ tables.

For $\text{[math]}$ tables, PROC FREQ computes Fisher’s exact test by using the network algorithm of Mehta and Patel (1983), which provides a faster and more efficient solution than direct enumeration. See the section Exact Statistics for more details.

Phi Coefficient

The phi coefficient is a measure of association derived from the Pearson chi-square. The range of the phi coefficient is $\text{[math]}$ for $\text{[math]}$ tables. For tables larger than $\text{[math]}$ , the range is $\text{[math]}$ (Liebetrau 1983). The phi coefficient is computed as

$\text{[math]}$

See Fleiss, Levin, and Paik (2003, pp. 98–99) for more information.

Contingency Coefficient

The contingency coefficient is a measure of association derived from the Pearson chi-square. The range of the contingency coefficient is $\text{[math]}$ , where $\text{[math]}$ (Liebetrau 1983). The contingency coefficient is computed as

$\text{[math]}$

See Kendall and Stuart (1979, pp. 587–588) for more information.

Cramer’s V

Cramer’s $\text{[math]}$ is a measure of association derived from the Pearson chi-square. It is designed so that the attainable upper bound is always 1. The range of Cramer’s $\text{[math]}$ is $\text{[math]}$ for $\text{[math]}$ tables; for tables larger than $\text{[math]}$ , the range is $\text{[math]}$ . Cramer’s $\text{[math]}$ is computed as