Number of Input Variables
|
Number of Observations
Processed
|
---|---|
<100
|
80,000
|
100–200
|
40,000
|
>200
|
20,000
|
Condition
|
Rare Event
|
|
---|---|---|
Yes
|
No
|
|
total number of observations < number of observations being processed
OR
total number of events < (0.10*number of observations being processed)
|
Sample the data so that
there is a 10:1 ratio of non-events to events.
|
no sampling
|
total number of events > (0.10*number of observations being processed)
|
Sample the following
proportion of the rare events:
|
stratified sampling
|
Name
|
Age
|
Gender
|
Income
|
Treatment
|
Purchase
|
---|---|---|---|---|---|
Ricardo
|
29
|
M
|
33000
|
Y
|
Y
|
Susan
|
35
|
F
|
51000
|
Y
|
N
|
Jeremy
|
49
|
M
|
110000
|
N
|
Y
|
Role
|
Description
|
---|---|
Roles
|
|
Dependent
variable
|
specifies the value that you want to predict or classify. The dependent variable is also known as the target variable.
|
Decisions
and Priors
|
specifies this information:
|
Additional Roles
|
|
Variables
to exclude from the model
|
specifies the variables
that you do not want to include in your analysis.
|
Frequency
count
|
specifies the variable
to use to represent the frequency value. The data is treated as if
each case is replicated as many times as the value of the frequency
variable.
|
ID variables
|
specifies variables
that are useful for reporting and scoring selection functions. These
variables are not included in the analysis.
|
Option
|
Description
|
---|---|
Output Data Set
|
|
Save Enterprise
Miner project data
|
specifies whether to save the SAS Enterprise Miner data from this task. A model from
the SAS Rapid Predictive Modeler is an example of a SAS Enterprise Miner project.
When you save SAS Enterprise Miner
data, you can use the SAS Enterprise Miner interface to open and edit the model that
you created using the SAS Rapid Predictive Modeler. In SAS Enterprise Miner, you can
save and export your analysis for use outside of SAS Enterprise Miner, and register
your model with a SAS Metadata Repository.
|
Export scoring
code
|
saves the scoring code from this task to the specified location. You can then run this code to score other
sets of data in other SAS products.
|
Score input
data set
|
specifies the name of the output data set that contains the scored values. The values in the input data set are scored by the
model that the SAS Rapid Predictive Modeler builds.
|