The SURVEYMEANS Procedure

Jackknife Method

The jackknife method of variance estimation deletes one PSU at a time from the full sample to create replicates. The total number of replicates R is the same as the total number of PSUs. In each replicate, the sample weights of the remaining PSUs are modified by the jackknife coefficient $\alpha _ r$. The modified weights are called replicate weights.

The jackknife coefficients and replicate weights are defined as follows.

Without Stratification

If there is no stratification in the sample design (no STRATA statement), the jackknife coefficients $\alpha _ r$ are the same for all replicates:

\[  \alpha _ r=\frac{R-1}{R} \, \, \, \, \,  \mbox{where } r=1, 2, ..., R  \]

Denote the original weight in the full sample for the jth member of the ith PSU as $w_{ij}$. If the ith PSU is included in the rth replicate ($r=1, 2, ..., R$), then the corresponding replicate weight for the jth member of the ith PSU is defined as

\[  w^{(r)}_{ij}={w_{ij}}/{\alpha _ r}  \]
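
The construction above can be sketched outside SAS in a few lines of Python. This is only an illustrative sketch, not the procedure's implementation: the function name, the dictionary layout, and the sample values are hypothetical, and storing a weight of 0 for the deleted PSU is an assumption used here to represent deletion.

```python
def jackknife_weights_unstratified(weights_by_psu):
    """Build delete-one-PSU replicate weights for an unstratified design.

    weights_by_psu maps a PSU id to the list of original weights w_ij
    of its members (hypothetical layout chosen for this sketch).
    """
    psus = list(weights_by_psu)
    R = len(psus)                      # one replicate per PSU
    if R < 2:
        raise ValueError("at least two PSUs are needed")
    alpha = (R - 1) / R                # same coefficient for every replicate

    replicates = []
    for deleted in psus:
        rep = {}
        for psu, w in weights_by_psu.items():
            if psu == deleted:
                # Assumption: the deleted PSU is represented by zero weights.
                rep[psu] = [0.0] * len(w)
            else:
                rep[psu] = [w_ij / alpha for w_ij in w]
        replicates.append(rep)
    return alpha, replicates


# Hypothetical sample: three PSUs with one or two members each.
sample = {"A": [10.0, 10.0], "B": [20.0], "C": [30.0, 30.0]}
alpha, reps = jackknife_weights_unstratified(sample)
```

With three PSUs the coefficient is 2/3, so every retained weight is multiplied by 3/2 in each of the three replicates.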

With Stratification

If the sample design involves stratification, each stratum must have at least two PSUs to use the jackknife method.

Let stratum ${{\tilde h}_ r}$ be the stratum from which a PSU is deleted for the rth replicate. Stratum ${{\tilde h}_ r}$ is called the donor stratum. Let $n_{{{\tilde h}_ r}}$ be the total number of PSUs in the donor stratum ${\tilde h}_ r$. The jackknife coefficients are defined as

\[  \alpha _ r=\frac{n_{{{\tilde h}_ r}}-1}{n_{{\tilde h}_ r}} \, \, \, \, \,  \mbox{where } r=1, 2, ..., R  \]
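
As a small illustration, assume a hypothetical design with $H=2$ strata that contain 2 and 3 PSUs, respectively. Deleting one PSU at a time gives $R=5$ replicates. If replicates 1 and 2 delete a PSU from the first stratum and replicates 3 through 5 delete a PSU from the second stratum (the ordering is an assumption made for this example), the coefficients are

\[  \alpha _1=\alpha _2=\frac{2-1}{2}=\frac{1}{2}, \qquad \alpha _3=\alpha _4=\alpha _5=\frac{3-1}{3}=\frac{2}{3}  \]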

Denote the original weight in the full sample for the jth member of the ith PSU as $w_{ij}$. If the ith PSU is included in the rth replicate ($r=1, 2, ..., R$), then the corresponding replicate weight for the jth member of the ith PSU is defined as

\[  w^{(r)}_{ij}=\left\{  {\begin{array}{ll} w_{ij} &  \mbox{if the $i$th PSU is not in the donor stratum ${{\tilde h}_ r}$} \\ \displaystyle {w_{ij}/\alpha _ r} &  \mbox{if the $i$th PSU is in the donor stratum ${{\tilde h}_ r}$} \end{array}} \right.  \]
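
The stratified rule can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the procedure's implementation: the function name, the nested-dictionary layout, and the stratum and PSU labels are hypothetical, and a weight of 0 is assumed for the deleted PSU.

```python
def jackknife_weights_stratified(design):
    """Build delete-one-PSU replicate weights for a stratified design.

    design maps a stratum id to a dict that maps a PSU id to the list of
    original weights w_ij of its members (hypothetical layout).
    Returns one coefficient and one weight set per replicate.
    """
    for stratum, psus in design.items():
        if len(psus) < 2:
            raise ValueError(f"stratum {stratum!r} has fewer than two PSUs")

    coefs, replicates = [], []
    for donor, donor_psus in design.items():
        n_h = len(donor_psus)
        alpha = (n_h - 1) / n_h        # depends only on the donor stratum
        for deleted in donor_psus:
            rep = {}
            for stratum, psus in design.items():
                for psu, w in psus.items():
                    if stratum != donor:
                        rep[(stratum, psu)] = list(w)          # unchanged
                    elif psu == deleted:
                        # Assumption: deletion is stored as zero weights.
                        rep[(stratum, psu)] = [0.0] * len(w)
                    else:
                        rep[(stratum, psu)] = [w_ij / alpha for w_ij in w]
            coefs.append(alpha)
            replicates.append(rep)
    return coefs, replicates


# Hypothetical design: two strata with 2 and 3 PSUs, one member per PSU.
design = {
    "north": {"A": [10.0], "B": [12.0]},
    "south": {"C": [5.0], "D": [6.0], "E": [7.0]},
}
coefs, reps = jackknife_weights_stratified(design)
```

In the first replicate, PSU A is deleted from stratum "north", so the weight of PSU B is divided by 1/2 while all weights in stratum "south" keep their original values.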

You can use the VARMETHOD=JACKKNIFE(OUTJKCOEFS=) method-option to save the jackknife coefficients in a SAS data set, and the VARMETHOD=JACKKNIFE(OUTWEIGHTS=) method-option to save the replicate weights in a SAS data set.

If you provide your own replicate weights with a REPWEIGHTS statement, then you can also provide corresponding jackknife coefficients with the JKCOEFS= option.

Suppose that $\theta $ is a population parameter of interest. Let $\hat{\theta }$ be the estimate from the full sample for $\theta $. Let $\hat{\theta _ r}$ be the estimate from the rth replicate subsample by using replicate weights. PROC SURVEYMEANS estimates the variance of $\hat{\theta }$ by

\[  \widehat{V}(\hat{\theta }) = \sum _{r=1}^ R \alpha _ r \left( \hat{\theta _ r} - \hat{\theta } \right)^2  \]

with $R-H$ degrees of freedom, where $R$ is the number of replicates and $H$ is the number of strata. When there is no stratification, the degrees of freedom are $R-1$.
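
To make the variance formula concrete, the following Python sketch applies it to a weighted mean. It is an illustration only, not the procedure's implementation: the function names, the flat-list layout, and the data values are hypothetical, and the replicate weights are written out by hand for an unstratified design with three single-member PSUs.

```python
def weighted_mean(values, weights):
    """Weighted mean, used here as the example estimator theta-hat."""
    return sum(w * y for w, y in zip(weights, values)) / sum(weights)


def jackknife_variance(values, full_weights, replicate_weights, coefs,
                       n_strata=None):
    """Sum of alpha_r * (theta_r - theta)^2 over all replicates.

    replicate_weights holds one weight list per replicate and coefs holds
    the matching jackknife coefficients. n_strata=None means that the
    design has no stratification.
    """
    theta = weighted_mean(values, full_weights)
    variance = 0.0
    for alpha, rep_w in zip(coefs, replicate_weights):
        theta_r = weighted_mean(values, rep_w)
        variance += alpha * (theta_r - theta) ** 2
    R = len(replicate_weights)
    df = R - 1 if n_strata is None else R - n_strata
    return theta, variance, df


# Hypothetical data: three PSUs, one member each, equal weights.
y = [2.0, 4.0, 9.0]
w = [1.0, 1.0, 1.0]
alpha = 2 / 3                          # (R - 1) / R with R = 3
rep_w = [
    [0.0, 1.5, 1.5],                   # PSU 1 deleted, others scaled by 1/alpha
    [1.5, 0.0, 1.5],                   # PSU 2 deleted
    [1.5, 1.5, 0.0],                   # PSU 3 deleted
]
theta, variance, df = jackknife_variance(y, w, rep_w, [alpha] * 3)
```

For this equal-weight example the estimate is 5, the variance estimate is 13/3, and the degrees of freedom are 2. The variance agrees with the textbook value $s^2/n$ for the mean of three observations, which is a useful sanity check on the coefficient.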