On the
Explore tab,
drag a
Variable Selection node to your diagram
workspace. Connect the
Data Partition node
to the
Variable Selection node.
Set the value of the
Max
Missing Percentage property to
10
.
This eliminates variables that have more than 10% of their values
missing.
Set the value of the
Target
Model property to
R-Square.
This indicates that the r-square criterion is used to evaluate and
select variables. Notice that the
Chi-Square Options properties
subgroup is now unavailable. Use the default values of the properties
in the
R-Square Options subgroup.
The
R-Square criterion
uses a goodness-of-fit criterion to evaluate variables. It uses a
stepwise method of selecting variables that stops when the improvement
in the r-square value is less than 0.0005. By default, this method
rejects variables whose contribution to the r-square value is less
than 0.005.
The following three-step
process is done when you apply the
R-Square criterion
to a binary target. When the target is non-binary, only the first
two steps are performed.
-
SAS Enterprise Miner
computes the squared correlation for each variable with the target
and then assigns the rejected role to those variables that have a
value less than
Minimum R-Square value.
-
SAS Enterprise Miner
evaluates the remaining variables with a forward stepwise r-square
regression. Variables that have a stepwise r-square improvement less
than the
Stop R-Square value are rejected.
-
SAS Enterprise Miner
performs a logistic regression with the predicted values that are
output from the forward stepwise regression used as the independent
input variable.
Right-click the
Variable
Selection node and click
Run.
In the
Confirmation window, click
Yes.
Click
Results in the
Run Status window.
In the
Variable
Selection window, notice that CLAGE, DEBTINC, DELINQ,
DEROG, G_JOB, NINQ, and YOJ have their
Role set
to
Input. This indicates that they were the
variables selected by the node for inclusion in the preceding neural
network model.
Close the
Results window.