The COUNTREG Procedure

ZEROMODEL Statement

  • ZEROMODEL dependent variable $\sim $ <zero-inflated regressors> </options>;

The ZEROMODEL statement is required if you specify either ZIP or ZINB in the DIST= option in the MODEL statement. If ZIP or ZINB is specified, then the ZEROMODEL statement must follow immediately after the MODEL statement. The dependent variable in the ZEROMODEL statement must be the same as the dependent variable in the MODEL statement.

The zero-inflated (ZI) regressors appear in the equation that determines the probability ($\varphi _ i$) of a zero count. Each of these q variables has a parameter to be estimated in the regression. For example, let $\mathbf{z}_{i}’$ be the ith observation’s $1 \times (q+1)$ vector of values of the q ZI explanatory variables ($w_0$ is set to 1 for the intercept term). Then $\varphi _ i$ is a function of $\mathbf{z}_{i}’\bgamma $, where $\bgamma $ is the $(q+1) \times 1$ vector of parameters to be estimated. (The ZI intercept is $\gamma _0$; the coefficients for the q ZI covariates are $\gamma _1, \ldots , \gamma _ q$.) If this option is omitted, then only the intercept term $\gamma _0$ is estimated. The “Parameter Estimates” table in the displayed output gives the estimates for the ZI intercept and ZI explanatory variables; they are labeled with the prefix "Inf_". For example, the ZI intercept is labeled "Inf_intercept". If you specify Age (a variable in your data set) as a ZI explanatory variable, then the “Parameter Estimates” table labels the corresponding parameter estimate "Inf_Age".

You can specify the following options after a slash (/):

LINK=value

specifies the distribution function to use to compute probability of zeros. The following distribution functions are supported:

LOGISTIC

specifies the logistic distribution.

NORMAL

specifies the standard normal distribution.

If this option is omitted, then the default ZI link function is logistic.

OFFSET=variable

specifies a variable in the input data set to be used as a zero-inflated (ZI) offset variable. The ZI offset variable is included as a term, with its coefficient restricted to 1, in the equation that determines the probability ($\varphi _ i$) of a zero count. The ZI offset variable cannot be the response variable, the offset variable (if any), or one of the explanatory variables. The name of the data set variable that is used as the ZI offset variable is displayed in the “Model Fit Summary” output, where it is labeled as "Inf_offset".

SELECT=INFO<(option)>
SELECTVAR=INFO<(option)>

requests that the variable selection method be based on an information criterion. For a list of selection-options, see the section Options for Variable Selection Based on an Information Criterion. For more information about this type of variable selection, see the section Variable Selection Using an Information Criterion.