The IRT Procedure (Experimental)

A Simple 2PL Model

This example shows how you can use PROC IRT to fit an item response theory model by using all the default settings. In this example, there are 50 subjects and each subject responds to 10 items. These 10 items are binary responses: 1 indicates correct and 0 indicates incorrect.

The following DATA step creates the SAS data set IrtBinary:

data IrtBinary;
   input item1-item10 @@;
   datalines;
1 0 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 0 0 0 0 1 0 1 0 0 1 1 1 1 1 1
0 0 0 0 0 1 1 1 1 1 0 0 0 1 1 1 1 1 1 1 0 0 1 0 1 1 1 1 1 1 1 1 1 1 1

   ... more lines ...   

1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 1 0 1 1 
; 

The following statements fit an IRT model:

proc irt data=IrtBinary;
   var item1-item10;
run;

The ODS GRAPHICS ON statement invokes the ODS Graphics environment and displays the plots, such as the item characteristic curve plot. For more information about ODS, see Chapter 21: Statistical Graphics Using ODS.

The PROC IRT statement invokes the procedure, and the DATA= option specifies the input data set IrtBinary. The VAR statement names the variables to be used in the model. As you can see from the syntax in this example, fitting a IRT model can be very simple when you use the default settings. These default settings are chosen to reflect the common setups in practice. Some of the important default settings follow:

  • The number of factors is 1.

  • The two-parameter model is assumed for binary variables, and the graded response model is assumed for ordinal variables.

  • The link function is logistic link.

  • The estimation method is based on marginal likelihood.

  • The optimization method is the quasi-Newton algorithm.

  • The quadrature method is adaptive Gauss-Hermite quadrature, in which the number of quadrature points per dimension is determined adaptively.

As a result, the preceding statements fit two-parameter logistic (2PL) models for all the variables that are listed in the VAR statement.

The first table that PROC IRT produces is the Modeling Information table, as shown in Figure 51.1. This table displays basic information about the analysis, such as the name of the input data set, link function, number of items and factors, number of observations, and estimation method.

Figure 51.1: Model Information

The IRT Procedure

Modeling Information
Data Set WORK.IRTBINARY
Link Function Logit
Response Model Graded Response Model
Number of Items 10
Number of Factors 1
Number of Observations Read 100
Number of Observations Used 100
Estimation Method Marginal Maximum Likelihood


The Item Information table, shown in Figure 51.2, is displayed by default and can be used to check the item-level information. In this case, all 10 variables have two levels, and the raw values for these two levels are 0 and 1, respectively.

Figure 51.2: Item Information

Item Information
Item Levels Values
item1 2 0 1
item2 2 0 1
item3 2 0 1
item4 2 0 1
item5 2 0 1
item6 2 0 1
item7 2 0 1
item8 2 0 1
item9 2 0 1
item10 2 0 1


The eigenvalues of polychoric correlations are also computed by default and are shown in Figure 51.3. You can use the information from these eigenvalues to assess a reasonable range for the number of factors.

Figure 51.3: Eigenvalues of Polychoric Correlation

The IRT Procedure

Eigenvalues of the Polychoric Correlation Matrix
  Eigenvalue Difference Proportion Cumulative
1 4.71105177 3.51149453 0.4711 0.4711
2 1.19955723 0.14183502 0.1200 0.5911
3 1.05772221 0.26577735 0.1058 0.6968
4 0.79194486 0.07204549 0.0792 0.7760
5 0.71989938 0.17782491 0.0720 0.8480
6 0.54207446 0.12713664 0.0542 0.9022
7 0.41493782 0.10631770 0.0415 0.9437
8 0.30862012 0.12256183 0.0309 0.9746
9 0.18605829 0.11792444 0.0186 0.9932
10 0.06813385   0.0068 1.0000


Next, the Optimization Information table, shown in Figure 51.4, lists the optimization technique, numeric quadrature method, and number of quadrature points per dimension.

Figure 51.4: Optimization Information

The IRT Procedure

Optimization Information
Optimization Technique Quasi-Newton
Likelihood Approximation Adaptive Gauss-Hermite Quadrature
Number of Quadrature Points 19
Number of Free Parameters 20


Because the estimation of IRT models can be slow, the Iteration History table, shown in Figure 51.5, is also included by default. It is updated after each iteration. For each iteration, the table displays current iteration number, number of function evaluations, objective function value, change of object function value, and maximum value of gradients. You can use this information to monitor the estimation status of the model. You can turn off the display of the Iteration History table by specifying the NOITPRINT option in the PROC IRT statement.

Figure 51.5: Iteration History

Iteration History
Iteration Evaluations Objective
Function
Change Max
Gradient
0 2 5.53317407 5.53317407 0.055021
1 4 5.43362050 -0.09955356 0.017514
2 6 5.41516532 -0.01845518 0.017074
3 8 5.40358346 -0.01158186 0.007844
4 10 5.40165205 -0.00193141 0.007266
5 12 5.40105183 -0.00060022 0.003494
6 15 5.40087699 -0.00017484 0.001484
7 18 5.40081315 -0.00006385 0.000862
8 21 5.40078736 -0.00002579 0.000806
9 24 5.40077714 -0.00001021 0.000441
10 27 5.40077058 -0.00000656 0.000141
11 30 5.40076985 -0.00000073 0.000157
12 32 5.40076956 -0.00000030 0.000202
13 34 5.40076913 -0.00000042 0.000033
14 37 5.40076910 -0.00000003 0.000024
15 40 5.40076909 -0.00000001 0.00001


Following the Iteration History table is the convergence status table, shown in Figure 51.6. It shows whether the optimization algorithm converges successfully or not.

Figure 51.6: Convergence Status

Convergence criterion (GCONV=.000000010) satisfied.


Next is the Model Fit Statistics table, shown in Figure 51.7, which include the log likelihood, Akaike’s information criterion (AIC), Bayesian information criterion (BIC), Pearson’s chi-square, and likelihood ratio.

Figure 51.7: Fit Statistics

The IRT Procedure

Model Fit Statistics
Log Likelihood -540.0769091
AIC (Smaller is Better) 1120.1538182
BIC (Smaller is Better) 1172.257222
Likelihood Ratio 300.3475528


Finally, the Item Parameter Estimates table, shown in Figure 51.8, includes parameter estimates, standard errors, and p-values. Parameters are organized and displayed within each item. The items are listed in the order of their appearance in the modeling statements.

Figure 51.8: Parameter Estimates

The IRT Procedure

Item Parameter Estimates
Item Parameter Estimate Standard
Error
Pr > |t|
item1 Threshold -1.92246 0.53399 0.0002
  Slope 2.22071 0.68479 0.0006
item2 Threshold -2.37769 0.64464 0.0001
  Slope 2.33987 0.76209 0.0011
item3 Threshold -1.99304 0.54014 0.0001
  Slope 2.18731 0.68382 0.0007
item4 Threshold -1.73061 0.45433 <.0001
  Slope 1.87329 0.57549 0.0006
item5 Threshold -1.46288 0.35236 <.0001
  Slope 1.33942 0.42772 0.0009
item6 Threshold -0.57445 0.26843 0.0162
  Slope 1.17667 0.37105 0.0008
item7 Threshold -0.58404 0.25017 0.0098
  Slope 0.94542 0.32647 0.0019
item8 Threshold -0.48604 0.24737 0.0247
  Slope 0.95660 0.32981 0.0019
item9 Threshold -0.45940 0.25892 0.0380
  Slope 1.11730 0.35663 0.0009
item10 Threshold -0.65574 0.26191 0.0061
  Slope 1.05212 0.34962 0.0013