The GLM Procedure

MODEL Statement

  • MODEL dependent-variables = independent-effects </ options>;

The MODEL statement names the dependent variables and independent effects. The syntax of effects is described in the section Specification of Effects. For any model effect involving classification variables (interactions as well as main effects), the number of levels cannot exceed 32,767. If no independent effects are specified, only an intercept term is fit. You can specify only one MODEL statement (in contrast to the REG procedure, for example, which allows several MODEL statements in the same PROC REG run).

Table 46.9 summarizes the options available in the MODEL statement.

Table 46.9: MODEL Statement Options

Option

Description

Produce effect size information

EFFECTSIZE

Adds measures of effect size to each analysis of variance table

Produce tests for the intercept

INTERCEPT

Produces the hypothesis tests associated with the intercept

Omit the intercept parameter from model

NOINT

Omits the intercept parameter from the model

Produce parameter estimates

SOLUTION

Produces parameter estimates

Produce tolerance analysis

TOLERANCE

Displays the tolerances used in the SWEEP routine

Suppress univariate tests and output

NOUNI

Suppresses the display of univariate statistics

Display estimable functions

E

Displays the general form of all estimable functions

E1

Displays the Type I estimable functions and computes sums of squares

E2

Displays the Type II estimable functions and computes sums of squares

E3

Displays the Type III estimable functions and computes sums of squares

E4

Displays the Type IV estimable functions computes sums of squares

ALIASING

Specifies that the estimable functions should be displayed as an aliasing structure

Control hypothesis tests performed

SS1

Displays Type I sum of squares

SS2

Displays Type II sum of squares

SS3

Displays Type III sum of squares

SS4

Displays Type IV sum of squares

Produce confidence intervals

ALPHA=

Specifies the level of significance

CLI

Produces confidence limits for individual predicted values

CLM

Produces confidence limits for a mean predicted value

CLPARM

Produces confidence limits for the parameter estimates

Display predicted and residual values

P

Displays observed, predicted, and residual values

Display intermediate calculations

INVERSE

Displays the augmented inverse (or generalized inverse) $\mb{X}’\mb{X}$ matrix

XPX

Displays the augmented $\mb{X}’\mb{X}$ crossproducts matrix

Tune sensitivity

SINGULAR=

Tunes the sensitivity of the regression routine to linear dependencies

ZETA=

Tunes the sensitivity of the check for estimability for Type III and Type IV functions


The options available in the MODEL statement are described in the following list.

ALIASING

specifies that the estimable functions should be displayed as an aliasing structure, for which each row says which linear combination of the parameters is estimated by each estimable function; also, this option adds a column of the same information to the table of parameter estimates, giving for each parameter the expected value of the estimate associated with that parameter. This option is most useful in fractional factorial experiments that can be analyzed without a CLASS statement.

ALPHA=p

specifies the level of significance p for $100(1-p)$% confidence intervals. By default, p is equal to the value of the ALPHA= option in the PROC GLM statement, or 0.05 if that option is not specified. You can use values between 0 and 1.

CLI

produces confidence limits for individual predicted values for each observation. The CLI option is ignored if the CLM option is also specified.

CLM

produces confidence limits for a mean predicted value for each observation.

CLPARM

produces confidence limits for the parameter estimates (if the SOLUTION option is also specified) and for the results of all ESTIMATE statements.

E

displays the general form of all estimable functions. This is useful for determining the order of parameters when you are writing CONTRAST and ESTIMATE statements.

E1

displays the Type I estimable functions for each effect in the model and computes the corresponding sums of squares.

E2

displays the Type II estimable functions for each effect in the model and computes the corresponding sums of squares.

E3

displays the Type III estimable functions for each effect in the model and computes the corresponding sums of squares.

E4

displays the Type IV estimable functions for each effect in the model and computes the corresponding sums of squares.

EFFECTSIZE

adds measures of effect size to each analysis of variance table displayed by the procedure, except for those displayed by the TEST option in the RANDOM statement, by CONTRAST and TEST statements with the E= option, and by MANOVA and REPEATED statements. The effect size measures include the intraclass correlation and both estimates and confidence intervals for the noncentrality for the F test, as well as for the semipartial and partial correlation ratios. For more information about the computation and interpretation of these measures, see the section Effect Size Measures for F Tests in GLM.

INTERCEPT
INT

produces the hypothesis tests associated with the intercept as an effect in the model. By default, the procedure includes the intercept in the model but does not display associated tests of hypotheses. Except for producing the uncorrected total sum of squares instead of the corrected total sum of squares, the INT option is ignored when you use an ABSORB statement.

INVERSE
I

displays the augmented inverse (or generalized inverse) $\mb{X}’\mb{X}$ matrix:

\[ \left[\begin{array}{rr} (\mb{X}’\mb{X})^- & (\mb{X}’\mb{X})^-\mb{X}’\mb{y} \\ \mb{y}’\mb{X}(\mb{X}’\mb{X})^- & \mb{y}’\mb{y} - \mb{y}’\mb{X}(\mb{X}’\mb{X})^-\mb{X}’\mb{y} \end{array}\right] \]

The upper-left corner is the generalized inverse of $\mb{X}’\mb{X}$, the upper-right corner is the parameter estimates, and the lower-right corner is the error sum of squares.

NOINT

omits the intercept parameter from the model. The NOINT option is ignored when you use an ABSORB statement.

NOUNI

suppresses the display of univariate statistics. You typically use the NOUNI option with a multivariate or repeated measures analysis of variance when you do not need the standard univariate results. The NOUNI option in a MODEL statement does not affect the univariate output produced by the REPEATED statement.

P

displays observed, predicted, and residual values for each observation that does not contain missing values for independent variables. The Durbin-Watson statistic is also displayed when the P option is specified. The PRESS statistic is also produced if either the CLM or CLI option is specified.

SINGULAR=number

tunes the sensitivity of the regression routine to linear dependencies in the design. If a diagonal pivot element is less than $C\times \mi{number}$ as PROC GLM sweeps the $\mb{X}’\mb{X}$ matrix, the associated design column is declared to be linearly dependent with previous columns, and the associated parameter is zeroed.

The C value adjusts the check to the relative scale of the variable. The C value is equal to the corrected sum of squares for the variable, unless the corrected sum of squares is 0, in which case C is 1. If you specify the NOINT option but not the ABSORB statement, PROC GLM uses the uncorrected sum of squares instead.

The default value of the SINGULAR= option, $10^{-7}$, might be too small, but this value is necessary in order to handle the high-degree polynomials used in the literature to compare regression routines.

SOLUTION

produces a solution to the normal equations (parameter estimates). PROC GLM displays a solution by default when your model involves no classification variables, so you need this option only if you want to see the solution for models with classification effects.

SS1

displays the sum of squares associated with Type I estimable functions for each effect. These are also displayed by default.

SS2

displays the sum of squares associated with Type II estimable functions for each effect.

SS3

displays the sum of squares associated with Type III estimable functions for each effect. These are also displayed by default.

SS4

displays the sum of squares associated with Type IV estimable functions for each effect.

TOLERANCE

displays the tolerances used in the SWEEP routine. The tolerances are of the form C/USS or C/CSS, as described in the discussion of the SINGULAR= option. The tolerance value for the intercept is not divided by its uncorrected sum of squares.

XPX

displays the augmented $\mb{X}’\mb{X}$ crossproducts matrix:

\[ \left[\begin{array}{cc} \mb{X}’\mb{X} & \mb{X}’\mb{y} \\ \mb{y}’\mb{X} & \mb{y}’\mb{y} \end{array}\right] \]
ZETA=value

tunes the sensitivity of the check for estimability for Type III and Type IV functions. Any element in the estimable function basis with an absolute value less than the ZETA= option is set to zero. The default value for the ZETA= option is $10^{-8}$.

Although it is possible to generate data for which this absolute check can be defeated, the check suffices in most practical examples. Additional research is needed in order to make this check relative rather than absolute.