The SURVEYFREQ Procedure

Covariances of Frequency Estimates

The covariance matrix of the table cell frequency estimates is an $rc \times rc$ matrix that contains the pairwise cell frequency covariances. $\widehat{\mb{V}}(\widehat{\mb{N}})$ denotes the covariance matrix and $\widehat{\mr{Cov}}(\widehat{N}_{rc}, \widehat{N}_{ab})$ denotes the pairwise covariances, for $r = 1, \ldots , R$; $c = 1, \ldots , C$; $a = 1, \ldots , R$; and $b = 1, \ldots , C$.

PROC SURVEYFREQ estimates the covariances by using the variance estimation method that you request. If you request BRR variance estimation (by specifying the VARMETHOD=BRR option in the PROC SURVEYFREQ statement), PROC SURVEYFREQ estimates the covariances by using the BRR method. If you request jackknife variance estimation (by specifying the VARMETHOD=JACKKNIFE option), PROC SURVEYFREQ estimates the covariances by using the jackknife method. For more information, see the sections Balanced Repeated Replication (BRR) and The Jackknife Method.

By default, or if you request Taylor series variance estimation, PROC SURVEYFREQ estimates the covariance between frequency estimates for table cells (r,c) and (a,b) as

\[  \widehat{\mr{Cov}}(\widehat{N}_{rc}, \widehat{N}_{ab}) = \sum _{h=1}^ H { \left( \frac{n_ h(1-f_ h)}{n_ h-1} ~  \sum _{i=1}^{n_ h} (n_{rc}^{~ hi} - \bar{n}_{rc}^{~ h}) ~  (n_{ab}^{~ hi} - \bar{n}_{ab}^{~ h}) \right) }  \]