After analyzing the
data, the company selected a subset of 12 predictor (or input) variables
to model whether each applicant defaulted. The response (or target)
variable BAD indicates whether an applicant defaulted on the home
equity line of credit. These variables, along with their model role,
measurement level, and description are shown in the following table.
Note: This book uses uppercase
for variable names. SAS accepts mixed case and lowercase variable
names as well.
|
|
|
|
|
|
|
A value of 1 indicates
that the client defaulted on the loan or is seriously delinquent.
A value of 0 indicates that the client paid off the loan.
|
|
|
|
Age of the oldest credit
line, measured in months
|
|
|
|
|
|
|
|
|
|
|
|
Number of delinquent
credit lines
|
|
|
|
Number of major derogatory
reports
|
|
|
|
|
|
|
|
Amount requested for
the loan
|
|
|
|
Amount due on the existing
mortgage
|
|
|
|
Number of recent credit
inquiries
|
|
|
|
The value DebtCon indicates
that the loan was intended for debt consolidation. The value HomeImp
indicates that the loan was for home improvement.
|
|
|
|
Value of the current
property
|
|
|
|
Years at the applicant’s
current job
|
The SAMPSIO.HMEQ data
set contains 5,960 observations for building and comparing competing
models. The data set is split into training, validation, and test
data sets for analysis.