The QLIM Procedure

Heteroscedasticity and Box-Cox Transformation

Heteroscedasticity

If the variance of regression disturbance, ($\epsilon _{i}$), is heteroscedastic, the variance can be specified as a function of variables

\[  E(\epsilon _{i}^{2}) = \sigma _{i}^{2} = f(\mb {z}_{i}’\bgamma )  \]

The following table shows various functional forms of heteroscedasticity and the corresponding options to request each model.

No.

Model

 

Options

1

$ f(\mb {z}_{i}’\bgamma ) = \sigma ^{2}(1+\exp (\mb {z}_{i}’\gamma )) $

 

LINK=EXP (default)

2

$ f(\mb {z}_{i}’\bgamma ) = \sigma ^{2}\exp (\mb {z}_{i}’\gamma ) $

 

LINK=EXP NOCONST

3

$ f(\mb {z}_{i}’\bgamma ) = \sigma ^{2}(1+\sum _{l=1}^{L}\gamma _{l}z_{li}) $

 

LINK=LINEAR

4

$ f(\mb {z}_{i}’\bgamma ) = \sigma ^{2}(1+(\sum _{l=1}^{L}\gamma _{l}z_{li})^2) $

 

LINK=LINEAR SQUARE

5

$ f(\mb {z}_{i}’\bgamma ) = \sigma ^{2}(\sum _{l=1}^{L}\gamma _{l}z_{li}) $

 

LINK=LINEAR NOCONST

6

$ f(\mb {z}_{i}’\bgamma ) = \sigma ^{2}((\sum _{l=1}^{L}\gamma _{l}z_{li})^2) $

 

LINK=LINEAR SQUARE NOCONST

For discrete choice models, $\sigma ^{2}$ is normalized ($\sigma ^{2}=1$) since this parameter is not identified. Note that in models 3 and 5, it may be possible that variances of some observations are negative. Although the QLIM procedure assigns a large penalty to move the optimization away from such region, it is possible that the optimization cannot improve the objective function value and gets locked in the region. Signs of such outcome include extremely small likelihood values or missing standard errors in the estimates. In models 2 and 6, variances are guaranteed to be greater or equal to zero, but it may be possible that variances of some observations are very close to zero. In these scenarios, standard errors may be missing. Models 1 and 4 do not have such problems. Variances in these models are always positive and never close to zero.

The heteroscedastic regression model is estimated using the following log-likelihood function:

\[  \ell = -\frac{N}{2}\ln (2\pi ) - \sum _{i=1}^{N}\frac{1}{2}\ln (\sigma _{i}^{2}) - \frac{1}{2}\sum _{i=1}^{N} (\frac{e_{i}}{\sigma _{i}})^{2}  \]

where $e_{i} = y_{i} - \mb {x}_{i}’\bbeta $.

Box-Cox Modeling

The Box-Cox transformation on $x$ is defined as

\[  x^{(\lambda )} = \left\{  \begin{array}{ll} \frac{x^{\lambda }-1}{\lambda } &  \mr {if} \lambda \neq 0 \\ \ln (x) &  \mr {if} \lambda = 0 \end{array} \right.  \]

The Box-Cox regression model with heteroscedasticity is written as

$\displaystyle  y_{i}^{(\lambda _{0})}  $
$\displaystyle  =  $
$\displaystyle  \beta _{0} + \sum _{k=1}^{K}\beta _{k} x_{ki}^{(\lambda _{k})} + \epsilon _{i}  $
$\displaystyle  $
$\displaystyle  =  $
$\displaystyle  \mu _{i} + \epsilon _{i}  $

where $\epsilon _{i} \sim N(0,\sigma _{i}^{2})$ and transformed variables must be positive. In practice, too many transformation parameters cause numerical problems in model fitting. It is common to have the same Box-Cox transformation performed on all the variables — that is, $\lambda _{0}=\lambda _{1}=\cdots =\lambda _{K}$. It is required for the magnitude of transformed variables to be in the tolerable range if the corresponding transformation parameters are $|\lambda | > 1$.

The log-likelihood function of the Box-Cox regression model is written as

\[  \ell = -\frac{N}{2}\ln (2\pi ) - \sum _{i=1}^{N}\ln (\sigma _{i}) - \frac{1}{2\sigma _{i}^{2}}\sum _{i=1}^{N}e_{i}^{2} + (\lambda _{0}-1)\sum _{i=1}^{N}\ln (y_{i})  \]

where $e_{i} = y_{i}^{(\lambda _{0})} - \mu _{i}$.

When the dependent variable is discrete, censored, or truncated, the Box-Cox transformation can be applied only to explanatory variables.