The TSCSREG Procedure

Estimation Techniques

If the effects are fixed, the models are essentially regression models with dummy variables that correspond to the specified effects. For fixed-effects models, ordinary least squares (OLS) estimation is equivalent to best linear unbiased estimation.

The output from TSCSREG is identical to what one would obtain from creating dummy variables to represent the cross-sectional and time (fixed) effects. The output is presented in this manner to facilitate comparisons to the least squares dummy variables estimator (LSDV). As such, the inclusion of a intercept term implies that one dummy variable must be dropped. The actual estimation of the fixed-effects models is not LSDV. LSDV is much too cumbersome to implement. Instead, TSCSREG operates in a two step fashion. In the first step, the following occurs:

  • One-way fixed-effects model: In the one-way fixed-effects model, the data is transformed by removing the cross-sectional means from the dependent and independent variables. The following is true:

    \[  \tilde{\mi{y}} _\mi {it} = \mi{y_\mi {it}} - \bar{\mi{y}}_\mi {i\cdot }  \]
    \[  \tilde{\mi{\mb{x}}}_\mi {it} = \mi{\mb{x} _\mi {it}} - \bar{\mi{\mb{x}}}_\mi {i\cdot }  \]
  • Two-way fixed-effects model: In the two-way fixed-effects model, the data is transformed by removing the cross-sectional and time means and adding back the overall means:

    \[  \tilde{\mi{y}} _\mi {it} = \mi{y_\mi {it}} - \bar{\mi{y}} _\mi {i \cdot } - \bar{\mi{y}} _\mi {\cdot t} + \bar{\bar{\mi{y}}}  \]
    \[  \tilde{\mi{\mb{x}}} _\mi {it} = \mi{\mb{x} _\mi {it}} - \bar{\mi{\mb{x}}}_\mi {i \cdot } - \bar{\mi{\mb{x}}}_\mi {\cdot t} + \bar{\bar{\mi{\mb{x}}}}  \]

where the symbols:

$\mi{y_\mi {it}}$ and $\mi{\mb{x} _\mi {it}} $ are the dependent variable (a scalar) and the explanatory variables (a vector whose columns are the explanatory variables not including a constant), respectively

$\bar{\mi{y}}_\mi {i \cdot }$ and $\bar{\mi{\mb{x}}}_\mi {i \cdot }$ are cross section means

$\bar{\mi{y}}_\mi {\cdot t}$ and $\bar{\mi{\mb{x}}}_\mi {\cdot t}$ are time means

$\bar{\bar{\mi{y}}}$ and $\bar{\bar{\mi{\mb{x}}}}$ are the overall means

The second step consists of running OLS on the properly demeaned series, provided that the data are balanced. The unbalanced case is slightly more difficult, because the structure of the missing data must be retained. For this case, PROC TSCSREG uses a slight specialization on Wansbeek and Kapteyn.

The other alternative is to assume that the effects are random. In the one-way case, ${\mi{E} ({{\nu }_{i}})=0}$, ${\mi{E} ({{\nu }^{2}_{i}})= {\sigma }^{2}_{{\nu } }}$, and ${\mi{E} ({{\nu }_{i}{\nu }_{j}})=0}$ for ${i {\neq } j}$, and ${{\nu }_{i}}$ is uncorrelated with ${{\epsilon }_{it}}$ for all ${i}$ and ${t}$. In the two-way case, in addition to all of the preceding, ${\mi{E} ({e_{t}} )=0}$, ${\mi{E} ({e^{2}_{t}} )= {\sigma }^{2}_{e} }$, and ${\mi{E} ({e_{t} e_{s}} )=0}$ for ${t {\neq } s}$, and the ${e_{t}}$ are uncorrelated with the ${{\nu }_{i}}$ and the ${{\epsilon }_{it}}$ for all ${i}$and ${t}$. Thus, the model is a variance components model, with the variance components ${ {\sigma }^{2}_{{\nu }} }$, ${ {\sigma }^{2}_{e}}$, and ${ {\sigma }_{{\epsilon }}^{2}}$, to be estimated. A crucial implication of such a specification is that the effects are independent of the regressors. For random-effects models, the estimation method is an estimated generalized least squares (EGLS) procedure that involves estimating the variance components in the first stage and using the estimated variance covariance matrix thus obtained to apply generalized least squares (GLS) to the data.