## Example 3.6 Output Data Set of Chi-Square Statistics

This example uses the Color data from Example 3.1 to output the Pearson chi-square and the likelihood-ratio chi-square statistics to a SAS data set. The following PROC FREQ statements create a two-way table of eye color versus hair color.

proc freq data=Color order=data;
tables Eyes*Hair / expected cellchi2 norow nocol chisq;
output out=ChiSqData n nmiss pchi lrchi;
weight Count;
title 'Chi-Square Tests for 3 by 5 Table of Eye and Hair Color';
run;

proc print data=ChiSqData noobs;
title1 'Chi-Square Statistics for Eye and Hair Color';
title2 'Output Data Set from the FREQ Procedure';
run;

The EXPECTED option displays expected cell frequencies in the crosstabulation table, and the CELLCHI2 option displays the cell contribution to the overall chi-square. The NOROW and NOCOL options suppress the display of row and column percents in the crosstabulation table. The CHISQ option produces chi-square tests.

The OUTPUT statement creates the ChiSqData output data set and specifies the statistics to include. The N option requests the number of nonmissing observations, the NMISS option stores the number of missing observations, and the PCHI and LRCHI options request Pearson and likelihood-ratio chi-square statistics, respectively, together with their degrees of freedom and -values.

The preceding statements produce Output 3.6.1 and Output 3.6.2. The contingency table in Output 3.6.1 displays eye and hair color in the order in which they appear in the Color data set. The Pearson chi-square statistic in Output 3.6.2 provides evidence of an association between eye and hair color (=0.0073). The cell chi-square values show that most of the association is due to more green-eyed children with fair or red hair and fewer with dark or black hair. The opposite occurs with the brown-eyed children.

Output 3.6.3 displays the output data set created by the OUTPUT statement. It includes one observation that contains the sample size, the number of missing values, and the chi-square statistics and corresponding degrees of freedom and -values as in Output 3.6.2.

**
Output 3.6.1
Contingency Table**