The Four Types of Estimable Functions: Type I SS and Estimable Functions

Type I SS and Estimable Functions

In PROC GLM, the Type I SS and the associated hypotheses they test are byproducts of the modified sweep operator used to compute a generalized $\text{[math]}$ -inverse of $\text{[math]}$ and a solution to the normal equations. For the model $\text{[math]}$ , the Type I SS for each effect are as follows:

Effect		Type I SS
$\text{[math]}$		$\text{[math]}$
$\text{[math]}$		$\text{[math]}$
$\text{[math]}$		$\text{[math]}$

Note that some other SAS/STAT procedures compute Type I hypotheses by sweeping $\text{[math]}$ (for example, PROC MIXED and PROC GLIMMIX), but their test statistics are not necessarily equivalent to the results of using those procedures to fit models that contain successively more effects.

The Type I SS are model-order dependent; each effect is adjusted only for the preceding effects in the model.

There are numerous ways to obtain a Type I hypothesis matrix $\text{[math]}$ for each effect. One way is to form the $\text{[math]}$ matrix and then reduce $\text{[math]}$ to an upper triangular matrix by row operations, skipping over any rows with a zero diagonal. The nonzero rows of the resulting matrix associated with $\text{[math]}$ provide an $\text{[math]}$ such that

$\text{[math]}$

The nonzero rows of the resulting matrix associated with $\text{[math]}$ provide an $\text{[math]}$ such that

$\text{[math]}$

The last set of nonzero rows (associated with $\text{[math]}$ ) provide an $\text{[math]}$ such that

$\text{[math]}$

Another more formalized representation of Type I generating sets for $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ , respectively, is

$\text{[math]}$

where

$\text{[math]}$

and

$\text{[math]}$

Using the Type I generating set $\text{[math]}$ (for example), if an $\text{[math]}$ is formed from linear combinations of the rows of $\text{[math]}$ such that $\text{[math]}$ is of full row rank and of the same row rank as $\text{[math]}$ , then SS $\text{[math]}$ .

In the GLM procedure, the Type I estimable functions displayed symbolically when the E1 option is requested are

$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$
$\text{[math]}$	$\text{[math]}$	$\text{[math]}$

As can be seen from the nature of the generating sets $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ , only the Type I estimable functions for $\text{[math]}$ are guaranteed not to involve the $\text{[math]}$ and $\text{[math]}$ parameters. The Type I hypothesis for $\text{[math]}$ can (and often does) involve $\text{[math]}$ parameters, and likewise the Type I hypothesis for $\text{[math]}$ often involves $\text{[math]}$ and $\text{[math]}$ parameters.

There are, however, a number of models for which the Type I hypotheses are considered appropriate. These are as follows:

balanced ANOVA models specified in proper sequence (that is, interactions do not precede main effects in the MODEL statement and so forth)
purely nested models (specified in the proper sequence)
polynomial regression models (in the proper sequence)