SUPPORT / SAMPLES & SAS NOTES
 

Support

Usage Note 24455: Estimating an odds ratio for a variable involved in an interaction

DetailsAboutRate It

By default, PROC GENMOD does not display odds ratio estimates and PROC LOGISTIC computes odds ratio estimates only for variables not involved in interactions or nested terms. Note that when a variable is involved in an interaction there isn't a single odds ratio estimate for it. Rather, the odds ratio for the variable depends on the level(s) of the interacting variable(s).

In the example below, the variable of interest interacts with a categorical variable. When the interacting variable is continuous, you can estimate the odds ratio at various levels and plot its change as described in SAS Note 69621.

Use the ODDSRATIO statement in PROC LOGISTIC to get odds ratio estimates of simple effects within an interaction. If you select GLM parameterization of CLASS variables in the LOGISTIC, GENMOD, GLIMMIX, or SURVEYLOGISTIC procedures then you can also use the EXP option in the LSMESTIMATE statement, or the DIFF and ODDSRATIO options in the LSMEANS and SLICE statements. The ODDSRATIO statement is the easiest way to obtain odds ratio estimates.

If you fit a logistic model using PROC GLIMMIX, you can use the ODDSRATIO option in the LSMEANS statement in conjunction with DIFF or SLICEDIFF= option to obtain odds ratio estimates. See the example below.

Custom odds ratios can be constructed using the CONTRAST or ESTIMATE statements in PROC LOGISTIC or the ESTIMATE statement in the GENMOD or GLIMMIX procedure. These statements are the most general and flexible for estimating odds ratios and can be used when a continuous predictor is involved or when a parameterization other than GLM parameterization is used for CLASS variables. However, the generality of these statements make them the most difficult to use. Fortunately, most needs can be satisfied by one of the statements discussed above. While assistance with constructing custom odds ratios is beyond the scope of the services provided by Technical Support, SAS Note 24447 illustrates the proper method for determining how to construct CONTRAST and ESTIMATE statements. See the Introduction and Example 3 which uses a logistic model with interaction and shows construction of a custom odds ratio using either of two coding methods for categorical predictors.

The example below and Example 3 in SAS Note 24447 illustrate use of the ODDSRATIO, LSMEANS, LSMESTIMATE, SLICE, ESTIMATE, and CONTRAST statements to compute an odds ratio of interest. SAS Note 42728 uses the LSMEANS, LSMESTIMATE, and ESTIMATE statements to estimate odds ratios in a model without interaction in PROC GENMOD. For details and additional options, see the descriptions of the above statements and options in the documentation for the LOGISTIC, GENMOD, or GLIMMIX procedures (SAS Note 22930).

An Example Using PROC LOGISTIC

In the following example, patients with one of two diagnoses (complicated or uncomplicated) are treated with one of three treatments (A, B, or C) and the result (cured or not cured) is observed.

      data uti;
         input diagnosis : $13. treatment $ response $ count @@;
         datalines;
      complicated    A  cured 78  complicated   A not 28
      complicated    B  cured 101 complicated   B not 11
      complicated    C  cured 68  complicated   C not 46
      uncomplicated  A  cured 40  uncomplicated A not 5
      uncomplicated  B  cured 54  uncomplicated B not 5
      uncomplicated  C  cured 34  uncomplicated C not 6
      ;

The following PROC LOGISTIC step uses all six of the statements mentioned above to estimate the odds ratio comparing the A and C treatments in the complicated diagnosis. Any one of the statements alone would suffice. Some of these statements produce additional odds ratio estimates for comparing other treatment pairs. The ODDSRATIO and CONTRAST statements provide confidence limits by default. Confidence limits are available in the other statements by adding an option (typically, CL). When ODS Graphics is enabled, plots of odds ratios or of log odds differences are also provided.

      proc logistic data=uti;
         freq count;
         class diagnosis treatment / param=glm;
         model response(event="cured") = diagnosis treatment diagnosis*treatment;
         oddsratio treatment / at(diagnosis='complicated');
         lsmeans diagnosis*treatment / ilink oddsratio diff;
         slice diagnosis*treatment / sliceby(diagnosis='complicated') diff oddsratio;
         lsmestimate diagnosis*treatment 'A vs C complicated' 1 0 -1 / exp;
         estimate 'A vs C in complicated' treatment 1 0 -1
                  diagnosis*treatment 1 0 -1 0 0 0 / exp;
         contrast 'A vs C in complicated' treatment 1 0 -1
                  diagnosis*treatment 1 0 -1 0 0 0 / estimate=exp;
         run;

These results are created by the ODDSRATIO statement. The AT option is used to limit the results to the complicated diagnosis. The odds ratio comparing treatments A and C in the complicated diagnosis is estimated to be 1.88. A 95% confidence interval for the odds ratio is (1.064, 3.337). If p-values are desired, specify the ORPVALUE option in the MODEL statement as discussed in SAS Note 53376.

Odds Ratio Estimates and Wald Confidence Intervals
Label Estimate 95% Confidence Limits
treatment A vs B at diagnosis=complicated 0.303 0.142 0.647
treatment A vs C at diagnosis=complicated 1.884 1.064 3.337
treatment B vs C at diagnosis=complicated 6.210 3.005 12.837

The next results are created by the LSMEANS statement. Shown first is the table of LS-means which contains estimates of the log odds (Estimate column), the odds ratio, and the probability of being cured (Mean column) for each treatment and diagnosis combination. The ODDSRATIO option produces the Odds Ratio columns in the Differences table. The ILINK option produces the Mean (probability) column. Note that the estimated odds of being cured by treatment A in the complicated diagnosis are 2.79 and the estimated probability of being cured is 0.74. The table of LS-mean differences which follows contains estimates of the log odds ratio (Estimate column) and the odds ratio (provided by the ODDSRATIO option). This table is produced by the DIFF option. The odds ratio estimate for A vs C in the complicated diagnosis is again found to be 1.88.

Diagnosis*Treatment Least Squares Means
Diagnosis Treatment Estimate Standard Error z Value Pr > |z| Mean Standard Error
of Mean
complicated A 1.0245 0.2203 4.65 <.0001 0.7358 0.04282
complicated B 2.2171 0.3175 6.98 <.0001 0.9018 0.02812
complicated C 0.3909 0.1909 2.05 0.0406 0.5965 0.04595
uncomplicated A 2.0794 0.4743 4.38 <.0001 0.8889 0.04685
uncomplicated B 2.3791 0.4674 5.09 <.0001 0.9152 0.03627
uncomplicated C 1.7346 0.4428 3.92 <.0001 0.8500 0.05646
 
Differences of Diagnosis*Treatment Least Squares Means
Diagnosis Treatment _Diagnosis _Treatment Estimate Standard Error z Value Pr > |z| Odds Ratio
complicated A complicated B -1.1926 0.3864 -3.09 0.0020 0.303
complicated A complicated C 0.6336 0.2915 2.17 0.0297 1.884
complicated A uncomplicated A -1.0549 0.5230 -2.02 0.0437 0.348
complicated A uncomplicated B -1.3546 0.5167 -2.62 0.0088 0.258
complicated A uncomplicated C -0.7101 0.4946 -1.44 0.1511 0.492
complicated B complicated C 1.8262 0.3705 4.93 <.0001 6.210
complicated B uncomplicated A 0.1377 0.5708 0.24 0.8094 1.148
complicated B uncomplicated B -0.1620 0.5650 -0.29 0.7744 0.850
complicated B uncomplicated C 0.4825 0.5449 0.89 0.3759 1.620
complicated C uncomplicated A -1.6885 0.5113 -3.30 0.0010 0.185
complicated C uncomplicated B -1.9882 0.5049 -3.94 <.0001 0.137
complicated C uncomplicated C -1.3437 0.4822 -2.79 0.0053 0.261
uncomplicated A uncomplicated B -0.2997 0.6659 -0.45 0.6527 0.741
uncomplicated A uncomplicated C 0.3448 0.6489 0.53 0.5952 1.412
uncomplicated B uncomplicated C 0.6445 0.6438 1.00 0.3168 1.905

The following is produced by the SLICE statement. The SLICEBY option requests comparisons of treatments within the complicated diagnosis. The first table provides a joint test of the treatment differences in the complicated diagnosis. The DIFF option produces the next table containing log odds ratio (Estimate column) and odds ratio estimates (produced by the ODDSRATIO option). The odds ratio estimate for A vs C in the complicated diagnosis is again 1.88.

Chi-Square Test for Diagnosis*Treatment Least Squares Means Slice
Slice Num DF Chi-Square Pr > ChiSq
Diagnosis complicated 2 24.59 <.0001
 
Simple Differences of Diagnosis*Treatment Least Squares Means
Slice Treatment _Treatment Estimate Standard Error z Value Pr > |z| Odds Ratio
Diagnosis complicated A B -1.1926 0.3864 -3.09 0.0020 0.303
Diagnosis complicated A C 0.6336 0.2915 2.17 0.0297 1.884
Diagnosis complicated B C 1.8262 0.3705 4.93 <.0001 6.210

The LSMESTIMATE statement provides the next table. The LSMESTIMATE statement allows you to specify custom contrasts of the LS-means. Since A and C in the complicated diagnosis are the first and third LS-means as shown in the "diagnosis*treatment Least Squares Means" table from the LSMEANS statement, the specified coefficients (1 0 -1) estimate the difference in log odds (logs odds ratio) between treatments A and C in the complicated diagnosis (0.63). The EXP option adds the Exponentiated column containing the odds ratio estimate (again, 1.88).

Least Squares Means Estimate
Effect Label Estimate Standard Error z Value Pr > |z| Exponentiated
diagnosis*treatment A vs C complicated 0.6336 0.2915 2.17 0.0297 1.8845

The following results come from the ESTIMATE statement. See Example 3 in this note for how the coefficients are determined. The results show the log odds ratio estimate (0.63) and the EXP option provides the odds ratio estimate (1.88).

Estimate
Label Estimate Standard Error z Value Pr > |z| Exponentiated
A vs C in complicated 0.6336 0.2915 2.17 0.0297 1.8845

The CONTRAST statement produces these results. The coefficients are determined in the same way as for the ESTIMATE statement. An overall test of the specified contrast is provided first. The ESTIMATE=EXP option produces the estimated odds ratio (1.88). A test and confidence limits are also provided.

Contrast Test Results
Contrast DF Wald
Chi-Square
Pr > ChiSq
A vs C in complicated 1 4.7246 0.0297

Contrast Estimation and Testing Results by Row
Contrast Type Row Estimate Standard
Error
Alpha Confidence Limits Wald
Chi-Square
Pr > ChiSq
A vs C in complicated EXP 1 1.8845 0.5493 0.05 1.0643 3.3367 4.7246 0.0297

An Example Using PROC GLIMMIX

The following statements produce odds ratio estimates and confidence limits for comparing two treatments at each level of the interacting diagnosis variable using PROC GLIMMIX:

      proc glimmix data=uti;
         freq count;
         class diagnosis treatment;
         model response = diagnosis treatment diagnosis*treatment / dist=binary;
         lsmeans diagnosis*treatment / slicediff=diagnosis oddsratio ilink;
         run;

Following are the results from the LSMEANS statement which are very similar to the LSMEANS results from PROC LOGISTIC above. The first table provides estimates of the log odds (Estimate column) and probability (Mean column) of being cured. The ILINK option produces the mean (probability) estimates. The second table provides estimates of the log odds ratio (Estimate column) and odds ratio for each comparison of treatments. The ODDSRATIO option produces the odds ratio estimates.

diagnosis*treatment Least Squares Means
diagnosis treatment Estimate Standard Error DF t Value Pr > |t| Mean Standard
Error
Mean
complicated A 1.0245 0.2203 470 4.65 <.0001 0.7358 0.04282
complicated B 2.2172 0.3175 470 6.98 <.0001 0.9018 0.02812
complicated C 0.3909 0.1909 470 2.05 0.0412 0.5965 0.04595
uncomplicated A 2.0794 0.4743 470 4.38 <.0001 0.8889 0.04685
uncomplicated B 2.3795 0.4675 470 5.09 <.0001 0.9153 0.03626
uncomplicated C 1.7346 0.4428 470 3.92 0.0001 0.8500 0.05646

Simple Effect Comparisons of diagnosis*treatment Least Squares Means By diagnosis
Simple Effect Level treatment _treatment Estimate Standard Error DF t Value Pr > |t| Odds Ratio
diagnosis complicated A B -1.1927 0.3865 470 -3.09 0.0021 0.303
diagnosis complicated A C 0.6336 0.2915 470 2.17 0.0302 1.884
diagnosis complicated B C 1.8264 0.3705 470 4.93 <.0001 6.211
diagnosis uncomplicated A B -0.3001 0.6660 470 -0.45 0.6525 0.741
diagnosis uncomplicated A C 0.3448 0.6489 470 0.53 0.5954 1.412
diagnosis uncomplicated B C 0.6449 0.6439 470 1.00 0.3170 1.906


Operating System and Release Information

Product FamilyProductSystemSAS Release
ReportedFixed*
SAS SystemSAS/STATAlln/a
* For software releases that are not yet generally available, the Fixed Release is the software release in which the problem is planned to be fixed.