The SURVEYPHREG Procedure

Variance Ratios and Standard Error Ratios

PROC SURVEYPHREG provides options to compute different variance ratios and standard error ratios.

If you specify the VARRATIO=MODEL option, then the procedure computes the variance ratio of the estimated regression parameter $\hat\beta _ j$ as $\frac{\hat V(\hat\beta _ j)}{\hat V_{M} (\hat\beta _ j)}$, where $\hat V(\hat\beta _ j)$, the estimated variance of $\hat\beta _ j$, uses the complete design information and $\hat V_{M} (\hat\beta _ j)$ is the jth diagonal element of the observed information matrix $\mc {I}^{-1}(\hat{\bbeta })$.

If you specify the VARRATIO=IND option, then the procedure computes the variance ratio of the estimated regression parameter $\hat\beta _ j$ as $\frac{\hat V(\hat\beta _ j)}{\hat V_{\text {IND}} (\hat\beta _ j)}$, where $\hat V(\hat\beta _ j)$, the estimated variance of $\hat\beta _ j$, uses the complete design information and $\hat V_{\text {IND}} (\hat\beta _ j)$ is the jth diagonal element of $\hat V_{\text {IND}} (\hat\bbeta )$. $\hat V_{\text {IND}} (\hat\bbeta )$ is the sandwich variance estimator, which ignores the strata and the clusters and is computed as

\[  \hat V_{\text {IND}} (\hat\bbeta ) = \mc {I}^{-1}(\hat{\bbeta }) \left\{  \frac{n}{n-1} (1-f) \sum _ h \sum _ i \sum _ j (\mb {e}_{hij}-\bar{\mb {e}}_{\cdot \cdot \cdot })’ (\mb {e}_{hij}-\bar{\mb {e}}_{\cdot \cdot \cdot }) \right\}  \mc {I}^{-1}(\hat{\bbeta })  \]

where $e_{hij}$ are the weighted score residuals, f is the overall sampling fraction, and n is the number of observation units. The three sums are over the observation units (j) across the PSUs (i) and the strata (h).

For Taylor series variance estimation, PROC SURVEYPHREG determines the value of f, the overall sampling fraction, based on the RATE= or TOTAL= option. If you do not specify either of these options, PROC SURVEYPHREG assumes that the value of f is negligible and does not use a finite population correction in the analysis. If you specify RATE=value, PROC SURVEYPHREG uses value as the overall sampling fraction f. If you specify TOTAL=value, PROC SURVEYPHREG computes f as the ratio of the number of PSUs in the sample to the specified total.

If you specify stratum sampling rates by using the RATE=SAS-data-set option, then PROC SURVEYPHREG computes stratum totals based on these stratum sampling rates and the number of sample PSUs in each stratum. The procedure sums the stratum totals to form the overall total and then computes f as the ratio of the number of sample PSUs to the overall total. Alternatively, if you specify stratum totals with the TOTAL=SAS-data-set option, then PROC SURVEYPHREG sums these totals to compute the overall total. The overall sampling fraction f is then computed as the ratio of the number of sample PSUs to the overall total.

The replication methods do not use the finite population correction factor $(1-f)$ in the denominator.

Standard error ratios are computed as the square root of the variance ratios.