Canonical discriminant analysis is equivalent to canonical correlation analysis between the quantitative variables and a set
of dummy variables coded from the class variable. In the following notation the dummy variables are denoted by
and the quantitative variables by
. The total sample covariance matrix for the
and
variables is
|
|
When c is the number of groups,
is the number of observations in group t, and
is the sample covariance matrix for the
variables in group t, the within-class pooled covariance matrix for the
variables is
|
|
The canonical correlations,
, are the square roots of the eigenvalues,
, of the following matrix. The corresponding eigenvectors are
.
|
|
Let
be the matrix with the eigenvectors
that correspond to nonzero eigenvalues as columns. The raw canonical coefficients are calculated as follows:
|
|
The pooled within-class standardized canonical coefficients are
|
|
The total sample standardized canonical coefficients are
|
|
Let
be the matrix with the centered
variables as columns. The canonical scores can be calculated by any of the following:
|
|
|
|
|
|
For the multivariate tests based on
,
|
|
|
|
where n is the total number of observations.