The GLIMMIX Procedure

Notes on Output Statistics

Table 43.15 lists the statistics computed with the OUTPUT statement of the GLIMMIX procedure and their default names. This section provides further details about these statistics.

The distinction between prediction and confidence limits in Table 43.15 stems from the involvement of the predictors of the random effects. If the random-effect solutions (BLUPs, EBES) are involved, then the associated standard error used in computing the limits are standard errors of prediction rather than standard errors of estimation. The prediction limits are not limits for the prediction of a new observation.

The Pearson residuals in Table 43.15 are “Pearson-type” residuals, because the residuals are standardized by the square root of the marginal or conditional variance of an observation. Traditionally, Pearson residuals in generalized linear models are divided by the square root of the variance function. The GLIMMIX procedure divides by the square root of the variance so that marginal and conditional residuals have similar expressions. In other words, scale and overdispersion parameters are included.

When residuals or predicted values involve only the fixed effects part of the linear predictor (that is, $\widehat{\eta }_ m = \mb {x}’\widehat{\bbeta }$ ), then all model quantities are computed based on this predictor. For example, if the variance by which to standardize a marginal residual involves the variance function, then the variance function is also evaluated at the marginal mean, $g^{-1}(\widehat{\eta }_ m)$ . Thus the residuals $p-\widehat{\eta }$ and $p_ m - \widehat{\eta }_ m$ can also be expressed as $(y-\mu )/\partial \mu$ and $(y-\mu _ m)/\partial \mu _ m$ , respectively, where $\partial \mu$ is the derivative with respect to the linear predictor. To construct the residual $p-\widehat{\eta }_ m$ in a GLMM, you can add the value of _ZGAMMA_ to the conditional residual $p-\widehat{\eta }$ . (The residual $p-\widehat{\eta }_ m$ is computed instead of the default marginal residual when you specify the CPSEUDO option in the OUTPUT statement.) If the predictor involves the BLUPs, then all relevant expressions and evaluations involve the conditional mean $g^{-1}(\widehat{\eta })$ .

The naming convention to add “PA” to quantities not involving the BLUPs is chosen to suggest the concept of a population average. When the link function is nonlinear, these are not truly population-averaged quantities, because $g^{-1}(\mb {x}’\bbeta )$ does not equal $\mr {E}[Y]$ in the presence of random effects. For example, if

$\mu _ i = g^{-1}(\mb {x}_ i’\bbeta + \mb {z}_ i’\bgamma _ i)$

is the conditional mean for subject i, then

$g^{-1}(\mb {x}_ i’\widehat{\bbeta })$

does not estimate the average response in the population of subjects but the response of the average subject (the subject for which $\bgamma _ i = \mb {0}$ ). For models with identity link, the average response and the response of the average subject are identical.

The GLIMMIX procedure obtains standard errors on the scale of the mean by the delta method. If the link is a nonlinear function of the linear predictor, these standard errors are only approximate. For example,

$\mr {Var}[g^{-1}(\widehat{\eta }_ m)] \doteq \left( \frac{\partial g^{-1}(t)}{\partial t}_{|\widehat{\eta }_ m}\right)^2 \mr {Var}[\widehat{\eta }_ m]$

Confidence limits on the scale of the data are usually computed by applying the inverse link function to the confidence limits on the linked scale. The resulting limits on the data scale have the same coverage probability as the limits on the linked scale, but they are possibly asymmetric.

In generalized logit models, confidence limits on the mean scale are based on symmetric limits about the predicted mean in a category. Suppose that the multinomial response in such a model has J categories. The probability of a response in category i is computed as

$\widehat{\mu }_ i = \frac{\exp \left\{ \widehat{\eta }_ i\right\} }{\sum _{j=1}^{J}\exp \left\{ \widehat{\eta }_ i\right\} }$

The variance of $\widehat{\mu }_ i$ is then approximated as

$\mr {Var}[\widehat{\mu }_ i] \doteq \zeta = \bupsilon _ i’\mr {Var}\left[ \begin{array}{cccc} \widehat{\eta }_1 & \widehat{\eta }_2 & \cdots & \widehat{\eta }_ J \end{array} \right] \bupsilon _ i$

where $\bupsilon _ i$ is a $J \times 1$ vector with kth element

$\begin{array}{ll} \widehat{\mu }_ i (1 - \widehat{\mu }_ i) & i = k \\ -\widehat{\mu }_ i \widehat{\mu }_ k & i \not= k \end{array}$

The confidence limits in the generalized logit model are then obtained as

$\widehat{\mu }_ i \pm t_{\nu ,\alpha /2} \sqrt {\zeta }$

where $t_{\nu ,\alpha /2}$ is the $100 \times (1-\alpha /2)$ percentile from a t distribution with $\nu$ degrees of freedom. Confidence limits are truncated if they fall outside the $[0,1]$ interval.