PROC CALIS: Estimation Criteria :: SAS/STAT(R) 9.2 User's Guide, Second Edition

The CALIS Procedure

Estimation Criteria

The following five estimation methods are available in PROC CALIS:

unweighted least squares (ULS)
generalized least squares (GLS)
normal-theory maximum likelihood (ML)
weighted least squares (WLS, ADF)
diagonally weighted least squares (DWLS)

An INWGT= data set can be used to specify weight matrices $\mathbf{W}$ other than the default ones for GLS, WLS, and DWLS estimation.

PROC CALIS does not exhaust all estimation methods in the field. As mentioned in the section Overview: CALIS Procedure, partial least squares (PLS) is not implemented. The PLS method is developed under less restrictive statistical assumptions, and it circumvents some computational and theoretical problems encountered by the existing estimation methods in PROC CALIS. However, PLS estimates are generally less efficient. When the statistical assumptions of PROC CALIS are tenable (for example, a large sample size and correct distributional assumptions), the ML, GLS, or WLS methods yield better estimates than the PLS method. Note that there is a SAS/STAT procedure called PROC PLS, which employs the partial least squares technique but for a class of models different from those of PROC CALIS. For example, in a PROC CALIS model each latent variable is typically associated with only a subset of manifest variables (predictor or outcome variables). In PROC PLS, by contrast, latent variables are not assigned subsets of manifest variables. Rather, they are extracted from linear combinations of all manifest predictor variables. Therefore, for general path analysis with latent variables you should consider using PROC CALIS.

In each estimation method, the parameter vector is estimated iteratively by a nonlinear optimization algorithm that optimizes a goodness of fit function $F$. Let $n$ denote the number of manifest variables, $\mathbf{S}$ the given sample covariance or correlation matrix for a sample of size $N$, and $\mathbf{C}$ the predicted moment matrix. The fit function for unweighted least squares estimation is then

$F_{ULS} = 0.5\,\mathrm{Tr}[(\mathbf{S} - \mathbf{C})^2]$

For normal-theory generalized least squares estimation, the function is

$F_{GLS} = 0.5\,\mathrm{Tr}[(\mathbf{S}^{-1}(\mathbf{S} - \mathbf{C}))^2]$

For normal-theory maximum likelihood estimation, the function is

$F_{ML} = \mathrm{Tr}(\mathbf{S}\mathbf{C}^{-1}) - n + \ln(\det(\mathbf{C})) - \ln(\det(\mathbf{S}))$

The first three functions can be expressed by the generalized weighted least squares criterion (Browne 1982):

$F_{GWLS} = 0.5\,\mathrm{Tr}[(\mathbf{W}^{-1}(\mathbf{S} - \mathbf{C}))^2]$

For unweighted least squares, the weight matrix $\mathbf{W}$ is chosen as the identity matrix $\mathbf{I}$; for generalized least squares, the default weight matrix $\mathbf{W}$ is the sample covariance matrix $\mathbf{S}$; and for normal-theory maximum likelihood, $\mathbf{W}$ is the iteratively updated predicted moment matrix $\mathbf{C}$. The values of the normal-theory maximum likelihood function $F_{ML}$ and the generalized weighted least squares criterion $F_{GWLS}$ with $\mathbf{W} = \mathbf{C}$ are asymptotically equivalent.
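The relationship among these criteria can be checked numerically. The following is a minimal sketch in plain Python, not PROC CALIS code: it assumes the usual normal-theory forms (stated in the comments), uses made-up 2 x 2 matrices `S` and `C`, and all helper names are hypothetical.

```python
import math

def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def det2(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def inv2(m):
    """Inverse of a 2x2 matrix."""
    d = det2(m)
    return [[m[1][1] / d, -m[0][1] / d], [-m[1][0] / d, m[0][0] / d]]

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

def f_gwls(s, c, w):
    """Assumed form: F_GWLS = 0.5 * Tr[(W^{-1} (S - C))^2]."""
    resid = [[s[i][j] - c[i][j] for j in range(2)] for i in range(2)]
    m = matmul(inv2(w), resid)
    return 0.5 * trace(matmul(m, m))

def f_uls(s, c):
    """ULS is the GWLS criterion with W = I."""
    return f_gwls(s, c, [[1.0, 0.0], [0.0, 1.0]])

def f_gls(s, c):
    """GLS is the GWLS criterion with the default W = S."""
    return f_gwls(s, c, s)

def f_ml(s, c):
    """Assumed form: F_ML = Tr(S C^{-1}) - n + ln det(C) - ln det(S)."""
    n = len(s)
    return (trace(matmul(s, inv2(c))) - n
            + math.log(det2(c)) - math.log(det2(s)))

# Illustrative (made-up) sample and predicted moment matrices.
S = [[2.0, 0.8], [0.8, 1.5]]
C = [[1.9, 0.7], [0.7, 1.6]]

print("F_ULS        =", f_uls(S, C))
print("F_GLS        =", f_gls(S, C))
print("F_ML         =", f_ml(S, C))
print("F_GWLS (W=C) =", f_gwls(S, C, C))
```

For small residuals such as these, the last two values are close to each other, which is consistent with the asymptotic equivalence noted in the text.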

The goodness of fit function that is minimized in weighted least squares estimation is

$F_{WLS} = \mathrm{Vec}(s_{ij} - c_{ij})' \, \mathbf{W}^{-1} \, \mathrm{Vec}(s_{ij} - c_{ij})$

where $\mathrm{Vec}(s_{ij} - c_{ij})$ denotes the vector of the $n(n+1)/2$ elements of the lower triangle of the symmetric matrix $\mathbf{S} - \mathbf{C}$, and $\mathbf{W} = (w_{ij,kl})$ is a positive-definite symmetric matrix with $n(n+1)/2$ rows and columns.
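As an illustration of this quadratic form, here is a minimal sketch in plain Python. It assumes the criterion is d' W^{-1} d, where d stacks the lower-triangle residuals; the weight matrix `W` is a made-up positive-definite example, and `vech`, `solve`, and `f_wls` are hypothetical helper names, not PROC CALIS functions.

```python
def vech(m):
    """Stack the lower triangle (diagonal included) of a square matrix, row by row."""
    return [m[i][j] for i in range(len(m)) for j in range(i + 1)]

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [list(a[i]) + [b[i]] for i in range(n)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x

def f_wls(s, c, w):
    """Assumed form: F_WLS = d' W^{-1} d with d = vech(S - C)."""
    d = [a - b for a, b in zip(vech(s), vech(c))]
    x = solve(w, d)  # x = W^{-1} d, without forming the inverse explicitly
    return sum(di * xi for di, xi in zip(d, x))

# n = 2 manifest variables, so W has n(n+1)/2 = 3 rows and columns.
S = [[2.0, 0.8], [0.8, 1.5]]
C = [[1.9, 0.7], [0.7, 1.6]]
W = [[4.0, 1.0, 0.5],
     [1.0, 3.0, 1.0],
     [0.5, 1.0, 2.0]]  # made-up positive-definite weights

print("vech(S)  =", vech(S))
print("F_WLS    =", f_wls(S, C, W))
```

Solving the linear system instead of inverting `W` is a design choice of this sketch; for realistic problem sizes the dimension n(n+1)/2 grows quickly with n.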

If the moment matrix $\mathbf{S}$ is considered as a covariance rather than a correlation matrix, the default setting of $\mathbf{W} = (w_{ij,kl})$ consists of the consistent but biased estimators of the asymptotic covariances $\sigma_{ij,kl}$ of the sample covariance $s_{ij}$ with the sample covariance $s_{kl}$, defined as follows:

$w_{ij,kl} = s_{ij,kl} - s_{ij} s_{kl}$

where

$s_{ij,kl} = \frac{1}{N} \sum_{r=1}^{N} (z_{ri} - \bar{z}_i)(z_{rj} - \bar{z}_j)(z_{rk} - \bar{z}_k)(z_{rl} - \bar{z}_l)$

The formula for the asymptotic covariances of uncorrected covariances (using the UCOV or NOINT option) is a straightforward generalization of this expression.
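To make the construction of the default weights concrete, the sketch below computes fourth-order moments from a small raw data set and forms each weight as the fourth-order moment minus the product of the two second-order moments. This is an illustration only: the data are made up, `adf_weights` is a hypothetical name, and the divisor N is assumed for both the second- and fourth-order moments so that the result is an exact covariance matrix of the cross-products.

```python
def adf_weights(data):
    """Return (pairs, W) with W[(i,j),(k,l)] = s_ijkl - s_ij * s_kl.

    data is a list of observations (rows), each a list of n variable values.
    All moments use the divisor N (an assumption of this sketch).
    """
    N = len(data)
    n = len(data[0])
    means = [sum(row[i] for row in data) / N for i in range(n)]
    dev = [[row[i] - means[i] for i in range(n)] for row in data]

    # Lower-triangle index pairs (i >= j), matching the order of Vec(s_ij).
    pairs = [(i, j) for i in range(n) for j in range(i + 1)]
    s = {p: sum(d[p[0]] * d[p[1]] for d in dev) / N for p in pairs}

    def s4(i, j, k, l):
        """Fourth-order central moment about the sample means."""
        return sum(d[i] * d[j] * d[k] * d[l] for d in dev) / N

    W = [[s4(i, j, k, l) - s[(i, j)] * s[(k, l)] for (k, l) in pairs]
         for (i, j) in pairs]
    return pairs, W

# Made-up data: 6 observations on 2 variables.
data = [[1.0, 2.0], [2.0, 1.5], [3.0, 3.5],
        [4.0, 3.0], [5.0, 5.5], [6.0, 4.5]]

pairs, W = adf_weights(data)
for p, row in zip(pairs, W):
    print(p, [round(v, 4) for v in row])
```

Each diagonal entry is the sample variance of a cross-product of deviations, so it cannot be negative apart from rounding error.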

The resulting weight matrix $\mathbf{W}$ is at least positive semidefinite (except for rounding errors). With the ASYCOV option, you can request Browne’s (1984, formula (3.8)) unbiased estimators

$\begin{aligned}
w_{ij,kl} = {} & \frac{N(N-1)}{(N-2)(N-3)} \left( s_{ij,kl} - s_{ij} s_{kl} \right) \\
 & - \frac{N}{(N-2)(N-3)} \left( s_{ik} s_{jl} + s_{il} s_{jk} - \frac{2}{N-1} s_{ij} s_{kl} \right)
\end{aligned}$

There is no guarantee that this weight matrix is positive semidefinite. However, the second part is of order $O(N^{-1})$ and does not destroy the positive semidefinite first part for sufficiently large $N$. For a large number of independent observations, the default settings of the weight matrix $\mathbf{W}$ result in asymptotically distribution-free parameter estimates with unbiased standard errors and a correct $\chi^2$ test statistic (Browne 1982, 1984).
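The claim about the order of the second part can be illustrated numerically. The sketch below assumes the unbiased estimator has the form a(N) (s_ijkl - s_ij s_kl) - b(N) (s_ik s_jl + s_il s_jk - 2/(N-1) s_ij s_kl) with a(N) = N(N-1)/((N-2)(N-3)) and b(N) = N/((N-2)(N-3)); that form and the function names are assumptions of this example, not output of PROC CALIS.

```python
def browne_factors(N):
    """Return (a, b): the assumed multipliers of the first and second parts."""
    a = N * (N - 1) / ((N - 2) * (N - 3))
    b = N / ((N - 2) * (N - 3))
    return a, b

def unbiased_weight(N, s_ijkl, s_ij, s_kl, s_ik, s_jl, s_il, s_jk):
    """Assumed form of the unbiased asymptotic covariance estimator."""
    a, b = browne_factors(N)
    first = a * (s_ijkl - s_ij * s_kl)
    second = b * (s_ik * s_jl + s_il * s_jk - 2.0 / (N - 1) * s_ij * s_kl)
    return first - second

# As N grows, a(N) tends to 1 and b(N) shrinks like 1/N.
for N in (10, 100, 1000, 10000):
    a, b = browne_factors(N)
    print(N, round(a, 6), round(b, 6), round(N * b, 6))

# Illustrative moments for i = j = k = l with unit variance and a
# fourth-order moment of 3 (the value under normality): the biased
# weight is 3 - 1 = 2, and the unbiased weight approaches 2 for large N.
print(unbiased_weight(10000, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))
```

With small N the correction can be large relative to the first part, which is why positive semidefiniteness is not guaranteed in that case.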

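If the moment matrix $\mathbf{S}$ is a correlation (rather than a covariance) matrix, the default setting of $\mathbf{W} = (w_{ij,kl})$ consists of the estimators of the asymptotic covariances $\sigma_{ij,kl}$ of the correlations $\mathbf{S} = (s_{ij})$ (Browne and Shapiro 1986; DeLeeuw 1983):

$\begin{aligned}
w_{ij,kl} = {} & r_{ij,kl} - \frac{1}{2} r_{ij} \left( r_{ii,kl} + r_{jj,kl} \right) - \frac{1}{2} r_{kl} \left( r_{kk,ij} + r_{ll,ij} \right) \\
 & + \frac{1}{4} r_{ij} r_{kl} \left( r_{ii,kk} + r_{ii,ll} + r_{jj,kk} + r_{jj,ll} \right)
\end{aligned}$

where

$r_{ij,kl} = \frac{s_{ij,kl}}{\sqrt{s_{ii} s_{jj} s_{kk} s_{ll}}}$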
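A short numerical sketch can show why the rows of this weight matrix that belong to diagonal elements vanish. It assumes the formula w = r_ijkl - (1/2) r_ij (r_iikl + r_jjkl) - (1/2) r_kl (r_kkij + r_llij) + (1/4) r_ij r_kl (r_iikk + r_iill + r_jjkk + r_jjll), with standardized fourth-order moments r_ijkl; the data are made up and `corr_weights` is a hypothetical name.

```python
import math

def corr_weights(data):
    """Return (pairs, W) for the correlations, using the assumed formula."""
    N, n = len(data), len(data[0])
    means = [sum(row[i] for row in data) / N for i in range(n)]
    dev = [[row[i] - means[i] for i in range(n)] for row in data]
    s2 = [[sum(d[i] * d[j] for d in dev) / N for j in range(n)]
          for i in range(n)]
    r2 = [[s2[i][j] / math.sqrt(s2[i][i] * s2[j][j]) for j in range(n)]
          for i in range(n)]

    def r4(i, j, k, l):
        """Standardized fourth-order moment r_ijkl."""
        m4 = sum(d[i] * d[j] * d[k] * d[l] for d in dev) / N
        return m4 / math.sqrt(s2[i][i] * s2[j][j] * s2[k][k] * s2[l][l])

    def w(i, j, k, l):
        return (r4(i, j, k, l)
                - 0.5 * r2[i][j] * (r4(i, i, k, l) + r4(j, j, k, l))
                - 0.5 * r2[k][l] * (r4(k, k, i, j) + r4(l, l, i, j))
                + 0.25 * r2[i][j] * r2[k][l]
                * (r4(i, i, k, k) + r4(i, i, l, l)
                   + r4(j, j, k, k) + r4(j, j, l, l)))

    pairs = [(i, j) for i in range(n) for j in range(i + 1)]
    W = [[w(i, j, k, l) for (k, l) in pairs] for (i, j) in pairs]
    return pairs, W

# Made-up data: 6 observations on 2 variables.
data = [[1.0, 2.0], [2.0, 1.5], [3.0, 3.5],
        [4.0, 3.0], [5.0, 5.5], [6.0, 4.5]]

pairs, W = corr_weights(data)
for p, row in zip(pairs, W):
    print(p, [round(v, 6) for v in row])
```

The rows for the pairs (0, 0) and (1, 1) are zero up to rounding, because a sample correlation of a variable with itself is always exactly 1 and therefore has no sampling variability.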

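The asymptotic variances of the diagonal elements of a correlation matrix are 0. Therefore, the weight matrix computed by Browne and Shapiro’s formula is always singular. In this case the goodness of fit function for weighted least squares estimation is modified to

$F_{WLS} = \sum_{i=2}^{n} \sum_{j=1}^{i-1} \sum_{k=2}^{n} \sum_{l=1}^{k-1} w^{ij,kl} (s_{ij} - c_{ij})(s_{kl} - c_{kl}) + r \sum_{i=1}^{n} (s_{ii} - c_{ii})^2$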

where $r$ is the penalty weight specified by the WPENALTY=$r$ option and the $w^{ij,kl}$ are the elements of the inverse of the reduced $(n(n-1)/2) \times (n(n-1)/2)$ weight matrix that contains only the nonzero rows and columns of the full weight matrix $\mathbf{W}$. The second term is a penalty term to fit the diagonal elements of the moment matrix $\mathbf{S}$. The default value of $r = 100$ can be decreased or increased by the WPENALTY= option. The often used value of $r = 1$ seems to be too small in many cases to fit the diagonal elements of a correlation matrix properly. If your model does not fit the diagonal of the moment matrix $\mathbf{S}$, you can specify the NODIAG option to exclude the diagonal elements from the fit function.
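The effect of the penalty weight can be seen in a small sketch. It assumes the modified criterion is a quadratic form in the off-diagonal residuals, weighted by the inverse of the reduced weight matrix, plus the penalty weight times the sum of squared diagonal residuals. The matrices, the identity used as the inverse reduced weight matrix, and the function name `f_wls_corr` are made up for illustration.

```python
def f_wls_corr(s, c, w_inv, r):
    """Penalized WLS criterion for a correlation matrix (assumed form).

    w_inv holds the elements of the inverse of the reduced weight matrix,
    ordered by the strictly lower-triangle pairs (1,0), (2,0), (2,1), ...
    r is the penalty weight applied to the diagonal residuals.
    """
    n = len(s)
    pairs = [(i, j) for i in range(1, n) for j in range(i)]
    d = [s[i][j] - c[i][j] for (i, j) in pairs]
    quad = sum(w_inv[a][b] * d[a] * d[b]
               for a in range(len(d)) for b in range(len(d)))
    penalty = r * sum((s[i][i] - c[i][i]) ** 2 for i in range(n))
    return quad + penalty

# Made-up sample correlations and a predicted matrix whose first
# diagonal element (0.9) does not reproduce the unit diagonal.
S = [[1.0, 0.5, 0.3],
     [0.5, 1.0, 0.4],
     [0.3, 0.4, 1.0]]
C = [[0.9, 0.45, 0.3],
     [0.45, 1.0, 0.4],
     [0.3, 0.4, 1.0]]

# n = 3, so the reduced weight matrix has n(n-1)/2 = 3 rows and columns.
W_INV = [[1.0, 0.0, 0.0],
         [0.0, 1.0, 0.0],
         [0.0, 0.0, 1.0]]

for r in (1.0, 100.0):  # two illustrative penalty weights
    print("r =", r, " F =", f_wls_corr(S, C, W_INV, r))
```

With the larger penalty weight, the misfit on the diagonal dominates the criterion, so an optimizer has a much stronger incentive to reproduce the unit diagonal.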

Storing and inverting the huge weight matrix $\mathbf{W}$ in WLS estimation requires considerable computer resources. The DWLS method is a compromise: it uses only the diagonal of the weight matrix $\mathbf{W}$ from WLS estimation in the minimization function

$F_{DWLS} = \mathrm{Vec}(s_{ij} - c_{ij})' \, \mathrm{diag}(\mathbf{W})^{-1} \, \mathrm{Vec}(s_{ij} - c_{ij})$
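Because only the diagonal of the weight matrix enters, the DWLS criterion reduces to a sum of squared residuals, each divided by its own weight. A minimal sketch under that assumption follows; the matrices are made up and `f_dwls` is a hypothetical name.

```python
def f_dwls(s, c, w):
    """Assumed form: sum over lower-triangle pairs of (s_ij - c_ij)^2 / w_pp.

    Only the diagonal of w is read; off-diagonal weights are ignored,
    so no matrix inversion is needed.
    """
    n = len(s)
    pairs = [(i, j) for i in range(n) for j in range(i + 1)]
    return sum((s[i][j] - c[i][j]) ** 2 / w[p][p]
               for p, (i, j) in enumerate(pairs))

S = [[2.0, 0.8], [0.8, 1.5]]
C = [[1.9, 0.7], [0.7, 1.6]]
W = [[4.0, 1.0, 0.5],
     [1.0, 3.0, 1.0],
     [0.5, 1.0, 2.0]]  # made-up weights; only 4.0, 3.0, 2.0 are used

print("F_DWLS =", f_dwls(S, C, W))
```

The storage cost drops from the full n(n+1)/2 by n(n+1)/2 matrix to a single vector of n(n+1)/2 diagonal weights.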

The statistical properties of DWLS estimates are still not known.

In generalized, weighted, or diagonally weighted least squares estimation, you can change from the default settings of weight matrices $\mathbf{W}$ by using an INWGT= data set. Because the diagonal elements $w_{ij,ij}$ of the weight matrix $\mathbf{W}$ are interpreted as asymptotic variances of the sample covariances or correlations, they cannot be negative. The CALIS procedure requires a positive-definite weight matrix that has positive diagonal elements.
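Before supplying your own weights, it can be useful to check these requirements outside of SAS. The following sketch is one possible pre-check, not a description of what PROC CALIS does internally: it tests symmetry, positive diagonal elements, and positive definiteness by attempting a Cholesky factorization. The function name and tolerance are assumptions of this example.

```python
import math

def is_valid_weight_matrix(w, tol=1e-12):
    """Return True if w is symmetric, has positive diagonal elements,
    and is positive definite (checked by a Cholesky factorization)."""
    n = len(w)
    # Symmetry check.
    if any(abs(w[i][j] - w[j][i]) > tol for i in range(n) for j in range(i)):
        return False
    # Diagonal elements are interpreted as variances, so they must be positive.
    if any(w[i][i] <= 0.0 for i in range(n)):
        return False
    # Cholesky factorization w = L L'; it fails if w is not positive definite.
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            acc = w[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                if acc <= tol:
                    return False
                L[i][j] = math.sqrt(acc)
            else:
                L[i][j] = acc / L[j][j]
    return True

good = [[4.0, 1.0, 0.5],
        [1.0, 3.0, 1.0],
        [0.5, 1.0, 2.0]]
indefinite = [[1.0, 2.0],
              [2.0, 1.0]]   # positive diagonal, but not positive definite
negative_diag = [[1.0, 0.0],
                 [0.0, -1.0]]

print(is_valid_weight_matrix(good))
print(is_valid_weight_matrix(indefinite))
print(is_valid_weight_matrix(negative_diag))
```

The second example shows that positive diagonal elements alone are not sufficient: the matrix must also be positive definite as a whole.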
