When explanatory variables contain only individual characteristics, the multinomial logit model is defined as
| 
                  
                   | 
               
               
            
 where 
 is a random variable that indicates the choice made, 
 is a vector of characteristics specific to the 
th individual, and 
 is a vector of coefficients specific to the 
th alternative. Thus, this model involves choice-specific coefficients and only individual specific regressors. For model
            identification, it is often assumed that 
. The multinomial logit model reduces to the binary logit model if 
. 
         
The ratio of the choice probabilities for alternatives 
 and 
 (the odds ratio of alternatives 
 and 
) is 
         
| 
                  
                   | 
               
               
            
 Note that the odds ratio of alternatives 
 and 
 does not depend on any alternatives other than 
 and 
. For more information, see the section Independence from Irrelevant Alternatives (IIA). 
         
The log-likelihood function of the multinomial logit model is
| 
                  
                   | 
               
               
            
where
| 
                  
                   | 
               
               
            
This type of multinomial choice modeling has a couple of weaknesses: it has too many parameters (the number of individual
            characteristics times 
), and it is difficult to interpret. The multinomial logit model can be used to predict the choice probabilities, among a
            given set of 
 alternatives, of an individual with known vector of characteristics 
. 
         
The parameters of the multinomial logit model can be estimated with the TYPE=CLOGIT option in the MODEL statement; however, this requires modification of the conditional logit model to allow individual specific effects.
The conditional logit model, sometimes called the multinomial logit model, is similarly defined when choice-specific data
            are available. Using properties of Type I extreme-value (Gumbel) distribution, the probability that individual 
 chooses alternative 
 from among the choices in his choice set 
 is 
         
| 
                  
                   | 
               
               
            
 where 
 is a vector of attributes specific to the 
th alternative as perceived by the 
th individual. It is assumed that there are 
 choices in each individual’s choice set, 
. 
         
The log-likelihood function of the conditional logit model is
| 
                  
                   | 
               
               
            
The conditional logit model can be used to predict the probability that an individual will choose a previously unavailable
            alternative, given knowledge of 
 and the vector 
 of choice-specific characteristics. 
         
The problematic aspect of the conditional logit (and the multinomial logit) model lies in the property of independence from irrelevant alternatives (IIA). The IIA property can be derived from the probability ratio of any two choices. For the conditional logit model,
| 
                     
                      | 
                  
                  
               
 It is evident that the ratio of the probabilities for alternatives 
 and 
 does not depend on any alternatives other than 
 and 
. This was also shown to be the case for the multinomial logit model. Thus, for the conditional and multinomial logit models,
               the ratio of probabilities of any two alternatives is necessarily the same regardless of what other alternatives are in the
               choice set or what the characteristics of the other alternatives are. This is referred to as the IIA property. 
            
The IIA property is useful from the point of view of estimation and forecasting. For example, it allows the prediction of
               demand for currently unavailable alternatives. If the IIA property is appropriate for the choice situation being considered,
               then estimation can be based on the set of currently available alternatives, and then the estimated model can be used to calculate
               the probability that an individual would choose a new alternative not considered in the estimation procedure. However, the
               IIA property is restrictive from the point of view of choice behavior. Models that display the IIA property predict that a
               change in the attributes of one alternative changes the probabilities of the other alternatives proportionately such that
               the ratios of probabilities remain constant. Thus, cross elasticities due to a change in the attributes of an alternative
               
 are equal for all alternatives 
. This particular substitution pattern might be too restrictive in some choice settings. 
            
The IIA property of the conditional logit model follows from the assumption that the random components of utility are identically and independently distributed. The other models in PROC MDC (namely, nested logit, HEV, mixed logit, and multinomial probit) relax the IIA property in different ways.
For an example of Hausman’s specification test of IIA assumption, see Hausman’s Specification Test.