The FACTOR Procedure

Example 37.3 Maximum Likelihood Factor Analysis

This example uses maximum likelihood factor analyses for one, two, and three factors. It is already apparent from the principal factor analysis that the best number of common factors is almost certainly two. The one- and three-factor ML solutions reinforce this conclusion and illustrate some of the numerical problems that can occur. The following statements produce Output 37.3.1 through Output 37.3.3:

title3 'Maximum Likelihood Factor Analysis with One Factor';
proc factor data=SocioEconomics method=ml heywood n=1;
run;
title3 'Maximum Likelihood Factor Analysis with Two Factors';
proc factor data=SocioEconomics method=ml heywood n=2;
run;
title3 'Maximum Likelihood Factor Analysis with Three Factors';
proc factor data=SocioEconomics method=ml heywood n=3;
run;

Output 37.3.1 displays the results of the analysis with one factor.

Output 37.3.1: Maximum Likelihood Factor Analysis

Maximum Likelihood Factor Analysis with One Factor

The FACTOR Procedure

Input Data Type Raw Data
Number of Records Read 12
Number of Records Used 12
N for Significance Tests 12

Maximum Likelihood Factor Analysis with One Factor

The FACTOR Procedure
Initial Factor Method: Maximum Likelihood

Prior Communality Estimates: SMC
Population School Employment Services HouseValue
0.96859160 0.82228514 0.96918082 0.78572440 0.84701921

Preliminary Eigenvalues: Total = 76.1165859
Average = 15.2233172
  Eigenvalue Difference Proportion Cumulative
1 63.7010086 50.6462895 0.8369 0.8369
2 13.0547191 12.7270798 0.1715 1.0084
3 0.3276393 0.6749199 0.0043 1.0127
4 -0.3472805 0.2722202 -0.0046 1.0081
5 -0.6195007   -0.0081 1.0000


1 factor will be retained by the NFACTOR criterion.

Iteration Criterion Ridge Change Communalities
1 6.5429218 0.0000 0.1033 0.93828 0.72227 1.00000 0.71940 0.74371
2 3.1232699 0.0000 0.7288 0.94566 0.02380 1.00000 0.26493 0.01487

Convergence criterion satisfied.

Significance Tests Based on 12 Observations
Test DF Chi-Square Pr >
ChiSq
H0: No common factors 10 54.2517 <.0001
HA: At least one common factor      
H0: 1 Factor is sufficient 5 24.4656 0.0002
HA: More factors are needed      

Chi-Square without Bartlett's Correction 34.355969
Akaike's Information Criterion 24.355969
Schwarz's Bayesian Criterion 21.931436
Tucker and Lewis's Reliability Coefficient 0.120231

Squared Canonical
Correlations
Factor1
1.0000000

Eigenvalues of the Weighted Reduced
Correlation Matrix: Total = 0
Average = 0
  Eigenvalue Difference
1 Infty Infty
2 1.92716032 2.15547340
3 -.22831308 0.56464322
4 -.79295630 0.11293464
5 -.90589094  

Factor Pattern
  Factor1
Population 0.97245
School 0.15428
Employment 1.00000
Services 0.51472
HouseValue 0.12193

Variance Explained by Each Factor
Factor Weighted Unweighted
Factor1 17.8010629 2.24926004

Final Communality Estimates and Variable
Weights
Total Communality: Weighted = 17.801063
Unweighted = 2.249260
Variable Communality Weight
Population 0.94565561 18.4011648
School 0.02380349 1.0243839
Employment 1.00000000 Infty
Services 0.26493499 1.3604239
HouseValue 0.01486595 1.0150903


The solution on the second iteration is so close to the optimum that PROC FACTOR cannot find a better solution; hence you receive this message:

   Convergence criterion satisfied.

When this message appears, you should try rerunning PROC FACTOR with different prior communality estimates to make sure that the solution is correct. In this case, other prior estimates lead to the same solution or possibly to worse local optima, as indicated by the information criteria or the chi-square values.

The variable Employment has a communality of 1.0 and, therefore, an infinite weight that is displayed next to the final communality estimate as a missing/infinite value. The first eigenvalue is also infinite. Infinite values are ignored in computing the total of the eigenvalues and the total final communality.

Output 37.3.2 displays the results of the analysis with two factors. The analysis converges without incident. This time, however, the Population variable is a Heywood case.

Output 37.3.2: Maximum Likelihood Factor Analysis: Two Factors

Input Data Type Raw Data
Number of Records Read 12
Number of Records Used 12
N for Significance Tests 12

Prior Communality Estimates: SMC
Population School Employment Services HouseValue
0.96859160 0.82228514 0.96918082 0.78572440 0.84701921

Preliminary Eigenvalues: Total = 76.1165859
Average = 15.2233172
  Eigenvalue Difference Proportion Cumulative
1 63.7010086 50.6462895 0.8369 0.8369
2 13.0547191 12.7270798 0.1715 1.0084
3 0.3276393 0.6749199 0.0043 1.0127
4 -0.3472805 0.2722202 -0.0046 1.0081
5 -0.6195007   -0.0081 1.0000


2 factors will be retained by the NFACTOR criterion.

Iteration Criterion Ridge Change Communalities
1 0.3431221 0.0000 0.0471 1.00000 0.80672 0.95058 0.79348 0.89412
2 0.3072178 0.0000 0.0307 1.00000 0.80821 0.96023 0.81048 0.92480
3 0.3067860 0.0000 0.0063 1.00000 0.81149 0.95948 0.81677 0.92023
4 0.3067373 0.0000 0.0022 1.00000 0.80985 0.95963 0.81498 0.92241
5 0.3067321 0.0000 0.0007 1.00000 0.81019 0.95955 0.81569 0.92187

Convergence criterion satisfied.

Significance Tests Based on 12 Observations
Test DF Chi-Square Pr >
ChiSq
H0: No common factors 10 54.2517 <.0001
HA: At least one common factor      
H0: 2 Factors are sufficient 1 2.1982 0.1382
HA: More factors are needed      

Chi-Square without Bartlett's Correction 3.3740530
Akaike's Information Criterion 1.3740530
Schwarz's Bayesian Criterion 0.8891463
Tucker and Lewis's Reliability Coefficient 0.7292200

Squared Canonical Correlations
Factor1 Factor2
1.0000000 0.9518891

Eigenvalues of the Weighted Reduced Correlation Matrix: Total = 19.7853157 Average = 4.94632893
  Eigenvalue Difference Proportion Cumulative
1 Infty Infty    
2 19.7853143 19.2421292 1.0000 1.0000
3 0.5431851 0.5829564 0.0275 1.0275
4 -0.0397713 0.4636411 -0.0020 1.0254
5 -0.5034124   -0.0254 1.0000

Factor Pattern
  Factor1 Factor2
Population 1.00000 0.00000
School 0.00975 0.90003
Employment 0.97245 0.11797
Services 0.43887 0.78930
HouseValue 0.02241 0.95989

Variance Explained by Each Factor
Factor Weighted Unweighted
Factor1 24.4329707 2.13886057
Factor2 19.7853143 2.36835294

Final Communality Estimates and Variable
Weights
Total Communality: Weighted = 44.218285
Unweighted = 4.507214
Variable Communality Weight
Population 1.00000000 Infty
School 0.81014489 5.2682940
Employment 0.95957142 24.7246669
Services 0.81560348 5.4256462
HouseValue 0.92189372 12.7996793


The results of the three-factor analysis are shown in Output 37.3.3.

Output 37.3.3: Maximum Likelihood Factor Analysis: Three Factors

Input Data Type Raw Data
Number of Records Read 12
Number of Records Used 12
N for Significance Tests 12

Prior Communality Estimates: SMC
Population School Employment Services HouseValue
0.96859160 0.82228514 0.96918082 0.78572440 0.84701921

Preliminary Eigenvalues: Total = 76.1165859
Average = 15.2233172
  Eigenvalue Difference Proportion Cumulative
1 63.7010086 50.6462895 0.8369 0.8369
2 13.0547191 12.7270798 0.1715 1.0084
3 0.3276393 0.6749199 0.0043 1.0127
4 -0.3472805 0.2722202 -0.0046 1.0081
5 -0.6195007   -0.0081 1.0000


3 factors will be retained by the NFACTOR criterion.


Warning: Too many factors for a unique solution.

Iteration Criterion Ridge Change Communalities
1 0.1798029 0.0313 0.0501 0.96081 0.84184 1.00000 0.80175 0.89716
2 0.0016405 0.0313 0.0678 0.98081 0.88713 1.00000 0.79559 0.96500
3 0.0000041 0.0313 0.0094 0.98195 0.88603 1.00000 0.80498 0.96751
4 0.0000000 0.0313 0.0006 0.98202 0.88585 1.00000 0.80561 0.96735

ERROR: Converged, but not to a proper optimum.


Try a different 'PRIORS' statement.

Significance Tests Based on 12 Observations
Test DF Chi-Square Pr >
ChiSq
H0: No common factors 10 54.2517 <.0001
HA: At least one common factor      
H0: 3 Factors are sufficient -2 0.0000 .
HA: More factors are needed      

Chi-Square without Bartlett's Correction 0.0000003
Akaike's Information Criterion 4.0000003
Schwarz's Bayesian Criterion 4.9698136
Tucker and Lewis's Reliability Coefficient 0.0000000

Squared Canonical Correlations
Factor1 Factor2 Factor3
1.0000000 0.9751895 0.6894465

Eigenvalues of the Weighted Reduced Correlation Matrix: Total = 41.5254193 Average = 10.3813548
  Eigenvalue Difference Proportion Cumulative
1 Infty Infty    
2 39.3054826 37.0854258 0.9465 0.9465
3 2.2200568 2.2199693 0.0535 1.0000
4 0.0000875 0.0002949 0.0000 1.0000
5 -0.0002075   -0.0000 1.0000

Factor Pattern
  Factor1 Factor2 Factor3
Population 0.97245 -0.11233 -0.15409
School 0.15428 0.89108 0.26083
Employment 1.00000 0.00000 0.00000
Services 0.51472 0.72416 -0.12766
HouseValue 0.12193 0.97227 -0.08473

Variance Explained by Each Factor
Factor Weighted Unweighted
Factor1 54.6115241 2.24926004
Factor2 39.3054826 2.27634375
Factor3 2.2200568 0.11525433

Final Communality Estimates and Variable
Weights
Total Communality: Weighted = 96.137063
Unweighted = 4.640858
Variable Communality Weight
Population 0.98201660 55.6066901
School 0.88585165 8.7607194
Employment 1.00000000 Infty
Services 0.80564301 5.1444261
HouseValue 0.96734687 30.6251078


In the results, a warning message is displayed:

   WARNING:  Too many factors for a unique solution.

The number of parameters in the model exceeds the number of elements in the correlation matrix from which they can be estimated, so an infinite number of different perfect solutions can be obtained. The criterion approaches zero at an improper optimum, as indicated by this message:

   Converged, but not to a proper optimum.

The degrees of freedom for the chi-square test are –2, so a probability level cannot be computed for three factors. Note also that the variable Employment is a Heywood case again.

The probability levels for the chi-square test are 0.0001 for the hypothesis of no common factors, 0.0002 for one common factor, and 0.1382 for two common factors. Therefore, the two-factor model seems to be an adequate representation. Akaike’s information criterion and Schwarz’s Bayesian criterion attain their minimum values at two common factors, so there is little doubt that two factors are appropriate for these data.