Setting the Report Options

About the Reports

The reports identify significant terms in the model and generate common business graphics, such as lift charts. The results include statistics for training and validation data. The SAS Rapid Predictive Modeler process divides the input data into training data and validation data. Training data is used to compute the parameters for each model, resulting in the training fit statistics. Validation data is then scored with each model, resulting in the validation fit statistics. The validation fit statistics are used to compare models and detect overfitting. If the training statistics are significantly better than the validation statistics, then you would suspect overfitting, which occurs when the model is trained to detect random signals in the data. Models with the best validation statistics are generally preferred.
The SAS Rapid Predictive Modeler automatically generates a concise set of core reports that provide a summary of the data source and variables that were used for modeling, a ranking of the important predictor variables, multiple fit statistics that evaluate the accuracy of the model, and a model scorecard.

About the Standard Reports for the SAS Rapid Predictive Modeler

Here are the standard reports that are automatically generated by the SAS Rapid Predictive Modeler:
Gains chart
Gains chart plots are available only for models that have class target variables. This chart shows percentiles of the data ranked by predicted value. Lift is a measure of the ratio of the number of target events that the model identified, compared to the number of target events that were found by random selection.
Receiver Operating Characteristic plot (ROC)
The Receiver Operating Characteristic plot shows the maximum predictive power of a model for the entire sample (rather than for a single decile). The data is plotted as sensitivity versus (1 – specificity). The separation between the model curve and the diagonal line (which represents a random selection model) is called the Kolmogorov-Smirnov (KS) value. Higher KS values represent more powerful models.
Scorecard
The results include a scorecard so that the model's characteristics can be interpreted for business purposes. When the software builds a scorecard, each interval variable is binned into distinct ranges of values. Then, each variable is ranked by model importance and scaled to a maximum of 1,000 points. The distinct value for each variable then receives a portion of the scaled point total.
Project information
The project information shows which user created the model, when the model was created, and where the model's component files are stored.