PROC GENESELECT
<options> ;
The PROC GENESELECT statement starts the GENESELECT procedure. Either the DATA= option or the INMODEL= option must appear, but not both. The DATA= option must appear in order to fit a model. The INMODEL= option specifies a previously saved model. Table 7.2 summarizes the options available in the PROC GENESELECT statement.
Table 7.2: PROC GENESELECT Statement Options
Option |
Description |
Default |
---|---|---|
DATA= |
data set |
|
INMODEL= |
data set containing model information |
|
LEAFFRACTION= |
LEAFSIZE as fraction of data |
0.001 |
LEAFSIZE= |
minimum number of observations in a branch |
|
MAXDEPTH= |
maximum depth of a tree |
6 |
MAXSURROGATES= |
maximum number of surrogates rules in a node |
10 |
MINCATSIZE= |
observations needed for each category |
5 |
MODELTYPE= |
type of model to fit |
TREEBOOST |
SEED= |
seed for pseudo-random number generator |
8976153 |
SPLITSIZE= |
minimum number of observations to split a node |
10 |