What's New Table of Contents |
Two parallel versions of Enterprise Miner, either 4.3 or 5.1, are available in SAS 9.1. Enterprise Miner 4.3 is a continuation of the SAS client/SAS server system. Enterprise Miner 5.1 uses the production SAS 9.1 server and is a Web-deployable client application, which was developed using Java technology.
Enterprise Miner 4.3 includes the following new features and enhancements since Enterprise Miner 4.2:
A comparison of Enterprise Miner 4.3 and 5.1 is summarized at the end of this document.
The Link Analysis node is now production and includes the following new features and enhancements:
In Enterprise Miner 4.2 and earlier releases, interactive training is performed in the local SAS session. Beginning with Enterprise Miner 4.3, interactive training is supported through the Enterprise Miner Tree Desktop Application.
The Enterprise Miner Tree Desktop Application in SAS mode is production in Enterprise Miner 4.3. New features and enhancements include the following:
The groupings of a variable in your input data might have been defined by other methods outside of the Interactive Grouping node. If you set the model role to Group and the status to Use, the Interactive Grouping node places each level of the variable in a group, and calculates the weight of evidence and other statistics without applying the node settings.
The Enterprise Miner Model Repository has been enhanced to include the following:
The following list summarizes the advantages of using Enterprise Miner 5.1 instead of Enterprise Miner 4.3:
The following table is a listing of the primary differences between Enterprise Miner 4.3 and Enterprise Miner 5.1.
Configuration | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Server | SAS 9.1 MVA | SAS 9.1 MVA |
Middleware | N/A |
Java
It manages multiple users and model training can be disconnected. |
Client | SAS 9.1 Windows | Java 1.4.1 |
Metadata Server | Optional, but needed for model registration. | Required |
Project Storage | Client and Server | Server |
SAS Environment | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Program Editor, Log, Output windows | Yes | Yes |
Viewer for SAS/GRAPH output | Yes | Yes |
SAS/INSIGHT | Yes | No |
SAS DMS-based solutions | Yes | No |
Enterprise Miner Interfaces | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
SAS System GUI | Yes | No |
Java client GUI | No | Yes |
Batch project execution | No | Yes |
Web application results viewer | Yes | Yes |
Java API | No | Yes (experimental) |
SAS Code node interface | Yes | Yes. It provides better support for macro variables, macros, code generation, and results definition. |
SAS Code node based custom nodes | No | Yes. XML definitions for node properties. |
DMTOOL custom nodes | Yes | No |
Enterprise Miner Procedures Usage | Unsupported | Unsupported |
Metadata | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Table and Column analytical metadata | Yes | Yes |
Batch interface for creating table and column metadata | No | Yes |
Batch interface for creating target decision profiles | No | Yes |
Sample-based metadata calculation statistics | Yes | No |
Complete data-based metadata calculation statistics | No | Yes |
Configurable data advisory rules | No | Yes. You can set thresholds for missing percentages. |
Extensible column attributes | No | Yes. You can add additional column attributes to be included in reports. |
Report column attribute | No | Yes. You can include variables that have the report attribute in most reports such as score rankings and score distributions. |
Hidden variables | No | Yes. You can hide rejected variables from the variable usage user interfaces but retain them in the data for score applications. |
Data Model for entire project | No | Yes. It facilitates batch and GUI execution of common projects. |
GUI Functionality | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
XML diagram exchange | No | Yes |
Diagram copy/paste between projects | No | Yes |
Open multiple diagrams | No | Yes |
Open multiple results windows | No | Yes |
Open multiple child windows in node results | No | Yes |
Common property sheet | No | Yes |
Individual property dialogs | Yes | No |
Group processing | Yes (stratified, bagging, and boosting) | No |
Job Execution | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Stop running diagram | No | Yes |
Run multiple diagrams | No | Yes |
Disconnect while diagram running | No | Yes. Middleware configuration is required. |
Continue work while diagrams running | No | Yes |
Share projects with multiple users | Yes | Yes |
Batch mode execution | No | Yes |
Scheduling | No | No |
Multi-threaded procedures | Yes | Yes |
Multi-tasking projects | No | Yes |
Reporting | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
HTML Reports | Yes | No |
SAS Publish Packages (SPK) | No | Yes |
Web application for viewing stored models | Yes | Yes |
Interactive Analysis | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Tree growing and pruning | Yes | Yes |
Neural network model | Yes | No |
Association rule: WHERE clause | Yes | No |
Transformation bin allocation | Yes | No |
Filter outliers selection | Yes | No |
Link analysis | Yes | No |
Interactive grouping | Yes | No |
Decision threshold charts | Yes | No |
Decision Processing | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Class target profile matrix | Yes | Yes |
Class target loss matrix | Yes | Yes |
Cost values and cost variables | Yes | Yes |
Class target variable number of decisions | Yes | No |
Interval target decisions | Yes | No |
Model Assessment | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Class probability score rankings: gain, lift, etc. | Yes | Yes |
Class probability score distributions | No | Yes |
Classification tables in node output listings | Sometimes | Always |
Decision tables in node output listings | No | Yes |
Type I and Type II error table in node output listings | No | Yes |
Interval target score rankings | No | Yes |
Interval target score distributions | No | Yes |
Interval target prediction vs. actual target | Yes. It is sample based and available in Model Manager. | Yes. It is user-generated plots of exported data. |
Score rankings printed in output listings | No | Yes |
Score distributions printed in output listings | No | Yes |
Post-model decision matrix what-if investigations | Yes | No |
Decision threshold charts | Yes | No |
Komogorov-Smirnov (KS), Receiver Operating Characteristic (ROC) index, GINI statistics | No | Yes |
Validation data assessment | Yes | Yes |
Train and Test data assessment | Optional. You must enable it through Model Manager, | Yes |
ROC chart | Yes | Yes |
Scoring | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
SAS score code | Yes | Yes |
C and Java score code | Yes | Yes |
Separation of residual and non-residual score code | No | Yes |
PMML generation | No | Yes |
Nodes | Enterprise Miner 4.3 | Enterprise Miner 5.1 |
Input Data | Yes | Yes |
Sampling | Yes | Yes |
Partitioning | Yes | Yes |
Time Series | Yes | Yes |
Variable Selection | Yes | Yes |
SOM | Yes | No |
Link Analysis | Yes | No |
Insight | Yes | No. Graphs can be generated from the Results window of any node. |
Distribution Explorer | Yes | No. Graphs can be generated from the Results window of any node. |
Multiplot | Yes | Yes |
StatExplore | No | Yes. This node computes univariate and bivariate distribution statistics for interval and class variables. Target and segment variables are used as by variables and/or correlation terms. |
Merge | No | Yes. This node merges training, test, and validation data sets by row number or by ID variable. It is useful for combining predictions from multiple models or for matching ID in multiple tables. |
Association | Yes | Yes, but the Results window does not support filtering rules and scatter plot for items. This node supports network display of rules. It generates a transposed data set that has one row per customer and variables for rules. The transposed data set can be used to cluster or predict customer behavior by rules. |
Path Analysis | No | Yes. This node uses the new PATH procedure that includes a referrer variable for Web log analysis. |
Transform | Yes | Yes, but this node does not support user-specified equations. It supports the creation of dummy and interactive terms. |
Interactive Grouping | Yes. This node supports user-driven grouping of variable levels and bins based on GINI, Information Gain, and Weight of Evidence (WOE)scores. | No |
Drop | No | Yes. This node drops variables from temporary tables for processing efficiency. |
Filter | Yes | Yes, but this node does not support graphical selection of filter ranges. |
Impute | Yes, this node is the Replacement node. | Yes |
Principle Components | Yes. It is in the Princomp/Dmneural node. | Yes. This node does not support the selection of number of components in the Results window. |
Regression (linear and logistic) | Yes | Yes |
Dmine Regression | No. Dmine regression is available as an optional output from the Variable Selection node. | Yes. This node uses the DMINE procedure to produce models that directly include the Analysis of Variance (AOV), group, and interaction effects for interval and binary targets. |
Decision Tree | Yes. Use the Tree Desktop Application for interactive training. | Yes. Use the Tree Desktop Application for interactive training. |
Neural Network | Yes | Yes. This node does not support interactive training or advanced user network configuration. |
Rule Induction | No | Yes. This node uses an algorithm for building models by recursively identifying target events. It is useful for modeling rare events. This functionality was formerly included in DMTOOL. |
Autoneural | No | Yes. This node uses an algorithm for automated MLP network building. It selects the type and number of activation functions from four different architectures. This functionality was formerly included in DMTOOL. |
Dmneural | Yes. This node is part of the Princomp/Dmneural node. | Yes. |
Two Stage Model | Yes | Yes. You can specify the options for the first and second stage models, and the neural network models that have two targets. |
Memory-Based Reasoning | Yes. This node is not recommended for score deployment, because it requires training table availability. | No |
Ensemble | Yes | Yes. The node supports simple averaging and voting methods. It does not support bagging and boosting models. |
Model Comparison | Yes. It is the Assessment node. | Yes. This node computes ROC and KS and automatically selects a model based on your selection. |
Group Processing | Yes | No |
Subdiagram | Yes | No |
Control Point | Yes | Yes |
SAS Code | Yes | Yes. This node provides extended support through better organized macros and macro variables. It supports building model and model assessment functions, and the creation of report tables and plots. |
Score | Yes | Yes. If score data is defined, the node always scores data to create output view and table. |
Score Converter | Yes | No. C and Java code are included in the SPK results package. PMML code for decision trees is available on a request basis. |
User Defined Model | Yes | No |
Reporter | Yes | No. Reports in the SPK format can be generated from any node. |
Data Set Attributes | Yes | Yes. The Metadata node in Enterprise Miner 5.1 replaces the Data Set Attributes node in Enterprise Miner 4.3. |
Data Mining Database | Yes | N/A |