What's New in SAS Enterprise Miner 5.3

Overview

SAS Enterprise Miner 5.3 marks a major upgrade with the following new features and enhancements:


Improved Usability

SAS Enterprise Miner 5.3 features an improved user interface that incorporates common design principles established for SAS software. These enhancements include the following:


Improved Functionality

New data preparation tools make it easier to manipulate data and to construct a data sample for mining. For data exploration, SAS Enterprise Miner 5.3 now supports the following:


Enhanced Administration and Configuration

The following changes and enhancements improve the installation, administration, and configuration of SAS Enterprise Miner 5.3:


Enhanced User Interface

Nodes Toolbar and Shortcut Toolbar

The SAS Enterprise Miner 5.3 nodes toolbar and the SAS Enterprise Miner 5.3 shortcut toolbar have both been relocated and their contents have been reorganized for consistency. Additional information is available in the SAS Enterprise Miner Reference Help section of the SAS Enterprise Miner 5.3 User Interface.

Diagram Workspace

The SAS Enterprise Miner 5.3 diagram workspace provides improved navigation and greater ease of use when working with large process flow diagrams.

Properties Panel

The SAS Enterprise Miner 5.3 Properties Panel has been reorganized.

Library Wizard

A new SAS Enterprise Miner Library Wizard enables you to select directories in the scope of the SAS session and create simple SAS library definitions. The Library Wizard is especially useful if you are not accustomed to using SAS LIBNAME statements to allocate SAS librefs. For more complex library definitions, use the SAS Enterprise Miner project startup code or the SAS Metadata LIBNAME Engine that is accessed through SAS Management Console. The Browse Directories action in the Library Wizard browses directories on the SAS server, not on the SAS client.

SAS Library Explorer

The revised SAS Library Explorer displays data sets and catalogs, including some catalog contents such as SOURCE entries. You can select a data set in the SAS Library Explorer and initiate SAS Enterprise Miner data source creation. The Refresh button in the SAS Library Explorer updates the list of libraries, data sets, and catalogs.

Target Profile Panel

SAS Enterprise Miner Target Profile panels have been revised in the Data Source Wizard, the Data Source object, and the Decisions node. You can now select inverse proportions for decision weights in the Target Profile panels. Inversely proportioned decision weights are useful when working with very unbalanced data.
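
The general idea behind inversely proportioned weights can be illustrated with a minimal Python sketch. This is not SAS Enterprise Miner code, and the exact formula that the Target Profile panels apply is not stated here; the function name `inverse_proportion_weights` and the sample target values are hypothetical. The sketch assumes that each target level is weighted by the reciprocal of its share of the data, so that a rare level counts as much in total as a common one.

```python
from collections import Counter


def inverse_proportion_weights(target_values):
    """Weight each target level by 1 / (that level's share of the data).

    Assumption: this is one common reading of "inverse proportions";
    it is an illustration, not the documented SAS computation.
    """
    counts = Counter(target_values)
    total = len(target_values)
    return {level: total / count for level, count in counts.items()}


# Hypothetical, very unbalanced target: 95 non-events and 5 events.
target = ["no"] * 95 + ["yes"] * 5
weights = inverse_proportion_weights(target)

# With these weights, both levels carry the same total weight.
total_weight_no = 95 * weights["no"]
total_weight_yes = 5 * weights["yes"]
```

In this example the rare "yes" level receives a much larger weight than the common "no" level, which is why such weights are helpful when one target level would otherwise be swamped.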

Variables Editor

The SAS Enterprise Miner Variables Editor has been revised to include a configurable WHERE clause filter, which is helpful when navigating a long list of variables.

Graph Persistence

You can create new graphs in the Results window as before, but the graphs are now saved, and they are re-created when the Results window is closed and reopened. Graph persistence also applies to the new Graph Explore node, which is the migration path for the Insight and Distribution Explorer nodes from SAS Enterprise Miner 4.3.


Conversion of SAS Enterprise Miner Projects

New Conversion Macro

SAS Enterprise Miner 5.3 contains a SAS macro that enables you to convert SAS Enterprise Miner 4.3 projects to SAS Enterprise Miner 5.3 projects. The conversion macro functions only on the Windows version of SAS.

The conversion macro runs in the SAS language. It can be used to convert multiple SAS Enterprise Miner 4.3 projects to multiple SAS Enterprise Miner 5.3 projects, or to convert and consolidate multiple SAS Enterprise Miner 4.3 projects into a single SAS Enterprise Miner 5.3 project.

Converting diagrams from SAS Enterprise Miner 4.3 to SAS Enterprise Miner 5.3 requires some changes due to differences in the versions.

Basic Diagram Structure

The basic process flow diagram structure is retained during project conversion, including the following:

Retaining the basic diagram and results structures enables you to compare SAS Enterprise Miner 4.3 and SAS Enterprise Miner 5.3 results on an equal basis.

Subdiagrams

SAS Enterprise Miner 4.3 subdiagrams are expanded to the top level. Enter and Exit nodes appear and function in diagrams that were converted, but Enter and Exit nodes cannot be added to new SAS Enterprise Miner 5.3 process flow diagrams.

Replacement Node

When converting a SAS Enterprise Miner 4.3 project whose process flow diagrams contain the Replacement node, the Replacement node can be expanded to a combination of an Impute node and a Replacement node. The combination of Impute and Replacement nodes gives you better control over the processes.

Ensemble Node

When converting a SAS Enterprise Miner 4.3 project whose process flow diagrams contain stratified, bagging, or boosting models that use the Ensemble node, the Ensemble node is replaced by an End Groups node.

Insight and Distribution Explorer Nodes

When converting a SAS Enterprise Miner 4.3 project whose process flow diagrams contain Insight or Distribution Explorer nodes, the nodes are replaced in the SAS Enterprise Miner 5.3 process flow diagram with Graph Explore nodes.

Link Analysis Node

SAS Enterprise Miner 5.3 supports conversion of SAS Enterprise Miner 4.3 projects whose process flow diagrams contain the Link Analysis node. Link Analysis nodes from SAS Enterprise Miner 4.3 process flow diagrams appear and function in diagrams that were converted for use with SAS Enterprise Miner 5.3, but the Link Analysis node cannot be added to a new SAS Enterprise Miner 5.3 diagram. If you want to analyze transaction data, refer to the SAS Enterprise Miner 5.3 Association, Path Analysis, and Market Basket nodes. If you want to analyze training data, continue to use the Link Analysis node from SAS Enterprise Miner 4.3.


New Nodes in SAS Enterprise Miner 5.3

SAS Enterprise Miner 5.3 contains the following new nodes that add functionality to the environment:

Append Node

The Append node has been added to the SAS Enterprise Miner 5.3 Sample tab of the node toolbar. The Append node belongs to the Sample category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. You use the Append node to append data sets that are exported by two or more paths in a single SAS Enterprise Miner process flow diagram.

The Append node can append data according to the data role, such as joining training data to training data, transaction data to transaction data, score data to score data, and so on. The Append node can also combine data that was previously partitioned into train, test, and validate roles into one large training data set.
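
The two modes of appending described above can be sketched in Python. This is only an illustration of the concept, not the node's implementation; the function `append_by_role`, the `combine_into` parameter, and the sample rows are all hypothetical.

```python
def append_by_role(exports, combine_into=None):
    """Concatenate rows exported by several paths.

    exports: list of (role, rows) pairs, one per exported data set.
    combine_into: if given, every row is appended to this single role
                  instead of being kept under its original role.
    """
    result = {}
    for role, rows in exports:
        key = combine_into if combine_into is not None else role
        result.setdefault(key, []).extend(rows)
    return result


# Hypothetical exports from two paths in one process flow diagram.
path_a = [("train", [{"id": 1}, {"id": 2}]), ("validate", [{"id": 3}])]
path_b = [("train", [{"id": 4}]), ("test", [{"id": 5}])]

# Mode 1: append role to role (train to train, and so on).
by_role = append_by_role(path_a + path_b)

# Mode 2: fold every partition into one large training data set.
one_train = append_by_role(path_a + path_b, combine_into="train")
```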

Graph Explore Node

The Graph Explore node belongs to the Explore category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. The Graph Explore node is an advanced visualization tool that enables you to sample data and to interactively explore the data graphically in the Results window. The interactive Results window plots can be saved and used to uncover patterns and trends in the data as well as reveal extreme values in the database. Graph Explore plots are fully interactive. You can rotate a chart to different angles and move it anywhere on the screen to obtain different perspectives on the data.

Variable Clustering Node

The Variable Clustering node belongs to the Explore category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. The Variable Clustering node is a useful tool for data reduction, such as choosing the best variables or cluster components for analysis. Variable clustering can be used to decrease variable redundancy and help to reveal the underlying structure of the input variables in a data set. Large numbers of variables can complicate the task of determining the relationships that might exist between the independent variables and the target variable in a model. Models that are built with correlated variables can destabilize parameter estimates, confound variable interpretation, and increase the computing time that is required to run the model. Variable clustering can reduce the number of variables that are required to build reliable predictive or segmentation models.

Interactive Binning Node

The Interactive Binning node belongs to the Modify category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. The Interactive Binning node is an interactive grouping tool that you use to model nonlinear functions of multiple modes of continuous distributions. The interactive tool computes initial bins by quintiles. You can then interactively split and combine the initial bins.
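
The workflow of computing initial bins and then splitting or combining them can be sketched in Python with the standard library. This is a conceptual sketch only; the helper names (`quintile_cutpoints`, `assign_bin`, `combine_bins`, `split_bin`) and the sample data are hypothetical, and the node's own binning rules may differ in detail.

```python
import statistics
from bisect import bisect_right


def quintile_cutpoints(values):
    """Return the four cut points that divide the values into five bins."""
    return statistics.quantiles(values, n=5)


def assign_bin(value, cutpoints):
    """Return the zero-based index of the bin that contains the value."""
    return bisect_right(cutpoints, value)


def combine_bins(cutpoints, bin_index):
    """Combine a bin with its right-hand neighbor by dropping their shared cut point."""
    return cutpoints[:bin_index] + cutpoints[bin_index + 1:]


def split_bin(cutpoints, new_cut):
    """Split a bin by adding a new cut point inside it."""
    return sorted(cutpoints + [new_cut])


# Hypothetical interval input with values 1 through 100.
values = list(range(1, 101))
cuts = quintile_cutpoints(values)
bin_sizes = [sum(1 for v in values if assign_bin(v, cuts) == b) for b in range(5)]

# Interactive adjustments: combine the first two bins, then split at 50.
combined = combine_bins(cuts, 0)
adjusted = split_bin(combined, 50.0)
```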

Rules Builder Node

The Rules Builder node belongs to the Modify category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. You use the Rules Builder node to create ad hoc sets of rules for your data that result in user-definable outcomes.

Gradient Boosting Node

The Gradient Boosting node belongs to the Model category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. The Gradient Boosting node uses tree boosting to create a series of decision trees that together form a single predictive model. Each tree in the series is fit to the residual of the prediction from the earlier trees in the series. The residual is defined in terms of the derivative of a loss function. Boosting is available for binary, nominal, and interval targets. The Gradient Boosting node is not included with SAS Enterprise Miner for the Desktop.
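
The residual-fitting idea can be sketched in plain Python for an interval target. This is a minimal illustration under stated assumptions, not the node's algorithm: it assumes a squared-error loss (whose negative derivative is simply the observed value minus the current prediction), a single input, one-split trees ("stumps"), and a hypothetical learning rate of 0.5. All function names and data are invented for the example.

```python
def fit_stump(xs, residuals):
    """Fit a one-split tree to the residuals by least squares.

    Returns (threshold, left_value, right_value).
    Assumes xs contains at least two distinct values.
    """
    best = None
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = (sum((r - left_mean) ** 2 for r in left)
                 + sum((r - right_mean) ** 2 for r in right))
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    return best[1:]


def predict(model, x):
    """Sum the base value and the scaled contribution of every tree."""
    base, learning_rate, stumps = model
    total = base
    for threshold, left_value, right_value in stumps:
        total += learning_rate * (left_value if x <= threshold else right_value)
    return total


def fit_boosted_trees(xs, ys, n_trees=50, learning_rate=0.5):
    """Build the series: each new tree is fit to the current residuals."""
    model = (sum(ys) / len(ys), learning_rate, [])
    for _ in range(n_trees):
        # For squared-error loss, the negative derivative is y - prediction.
        residuals = [y - predict(model, x) for x, y in zip(xs, ys)]
        model[2].append(fit_stump(xs, residuals))
    return model


def mean_squared_error(model, xs, ys):
    return sum((y - predict(model, x)) ** 2 for x, y in zip(xs, ys)) / len(ys)


# Hypothetical training data with a step-shaped relationship.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [1.0, 1.0, 1.0, 1.0, 5.0, 5.0, 9.0, 9.0]
model = fit_boosted_trees(xs, ys)
baseline = (sum(ys) / len(ys), 0.5, [])  # the mean alone, with no trees
```

Comparing `mean_squared_error(model, xs, ys)` with the same measure for `baseline` shows how the series of trees improves on the initial constant prediction.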

Model Import Node

The Model Import node belongs to the Model category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. You use the Model Import node to import models to the SAS Enterprise Miner environment that were not created by SAS Enterprise Miner. For example, models that were created using SAS PROC LOGISTIC can now be run, assessed, and modified in SAS Enterprise Miner 5.3.

Partial Least Squares Node

The Partial Least Squares node belongs to the Model category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. You use the Partial Least Squares node to model continuous and binary targets based on SAS/STAT PROC PLS. The Partial Least Squares node produces DATA step score code and standard predictive model assessment results.

Cutoff Node

The Cutoff node belongs to the Assess category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. You use the Cutoff node as a probability cutoff tool that creates a frequency table of probability distributions and graphs to assist you in determining the best cutoff point or points for decision-making models with binary target variables. Probability distributions are aggregated to the 1% level and are not smoothed between bins. Cumulative and marginal distributions are plotted. You specify the cutoff value by using a report property.
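
Aggregating predicted probabilities to the 1% level and reading off results at a chosen cutoff can be sketched in Python. This is an illustration only; the function names, the sample scores, and the choice to report true and false positive rates are assumptions, not a description of the node's actual tables.

```python
import math


def one_percent_bins(probabilities, targets):
    """Count events and non-events in 100 bins that are each 1% wide.

    No smoothing is applied between bins.
    """
    events = [0] * 100
    non_events = [0] * 100
    for p, t in zip(probabilities, targets):
        # The small offset guards against floating-point results such as
        # 0.29 * 100 evaluating to slightly less than 29.
        b = min(math.floor(p * 100 + 1e-9), 99)
        if t == 1:
            events[b] += 1
        else:
            non_events[b] += 1
    return events, non_events


def rates_at_cutoff(events, non_events, cutoff_percent):
    """Cumulative rates when bins at or above the cutoff are classified as events."""
    true_positives = sum(events[cutoff_percent:])
    false_positives = sum(non_events[cutoff_percent:])
    return true_positives / sum(events), false_positives / sum(non_events)


# Hypothetical predicted event probabilities and observed binary targets.
probabilities = [0.05, 0.10, 0.29, 0.40, 0.62, 0.80, 0.91, 0.97]
targets = [0, 0, 0, 1, 0, 1, 1, 1]

events, non_events = one_percent_bins(probabilities, targets)
rates_50 = rates_at_cutoff(events, non_events, 50)
rates_30 = rates_at_cutoff(events, non_events, 30)
```

Evaluating `rates_at_cutoff` over a range of cutoff values gives the kind of cumulative view that helps in choosing a cutoff point.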

Start Groups Node

The Start Groups node belongs to the Utility group. You use the Start Groups node to initiate a group processing segment in the process flow diagram. The Start Groups node performs the following types of group processing:

End Groups Node

The End Groups node belongs to the Utility group. You use the End Groups node to terminate a group processing segment in the process flow diagram. If the group processing function is stratified, bagging, or boosting, the End Groups node will function as a model node and present the final aggregated model. Nodes that follow the End Groups node continue data mining processes normally, after the group processing stage is completed.

Reporter Node

The Reporter node belongs to the Utility group. The Reporter node uses SAS ODS capability to create a single document for the given analysis in *.pdf or *.rtf format. The document includes important SAS Enterprise Miner results such as variable selection, model diagnostic tables, and model results plots. The document can be viewed and saved directly, and will be included in SAS Enterprise Miner report package files.


Enhanced Nodes in SAS Enterprise Miner 5.3

Several nodes in SAS Enterprise Miner 5.3 contain significant enhancements in functionality or have been reorganized into new SAS Enterprise Miner tool groups.

Sample Tools

The following changes have been made to the Sample tools in SAS Enterprise Miner 5.3:

Explore Tools

The following changes have been made to the Explore tools in SAS Enterprise Miner 5.3:

Modify Tools

The following changes have been made to the Modify tools in SAS Enterprise Miner 5.3:

Model Tools

The following changes have been made to the Model node tools in SAS Enterprise Miner 5.3:

Assess Tools

The following enhancements have been made to the existing Assess node tools in SAS Enterprise Miner 5.3:

Utility Tools

The following enhancements have been made to the existing Utility node tools in SAS Enterprise Miner 5.3:

Credit Scoring Tools

The following enhancements have been made to the add-on Credit Scoring node tools of SAS Enterprise Miner 5.3:


Experimental Nodes in SAS Enterprise Miner 5.3

The following experimental nodes are introduced in SAS Enterprise Miner 5.3:

Market Basket Node

The Market Basket node belongs to the Explore category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. You use the experimental Market Basket node to perform association rule mining over transaction data in conjunction with item taxonomy. This node is useful in retail marketing scenarios that involve tens of thousands of distinct items, where the items are grouped into subcategories, categories, departments, and so on, called item taxonomy. The Market Basket node uses the taxonomy data and generates rules at multiple levels in the taxonomy. The experimental Market Basket node is not included with SAS Enterprise Miner for the Desktop.
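
The effect of generating rules at more than one level of an item taxonomy can be sketched in Python. This is a conceptual illustration, not the node's algorithm, which is not described here; the taxonomy, the transactions, and the helper names (`support`, `generalize`, `rule_stats`) are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in the itemset."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def generalize(transactions, taxonomy):
    """Replace each item with its parent category in the taxonomy."""
    return [frozenset(taxonomy[item] for item in t) for t in transactions]


def rule_stats(transactions, antecedent, consequent):
    """Return (support, confidence) for the rule antecedent -> consequent."""
    both = support(transactions, antecedent | consequent)
    ante = support(transactions, antecedent)
    return both, (both / ante if ante else 0.0)


# Hypothetical one-level taxonomy: item -> category.
taxonomy = {
    "cola": "beverages", "lemonade": "beverages",
    "chips": "snacks", "pretzels": "snacks",
    "soap": "household",
}

# Hypothetical transaction data.
transactions = [
    frozenset({"cola", "chips"}),
    frozenset({"lemonade", "pretzels"}),
    frozenset({"cola", "pretzels"}),
    frozenset({"lemonade", "chips"}),
    frozenset({"soap"}),
]

# Item-level rule: cola -> chips.
item_rule = rule_stats(transactions, frozenset({"cola"}), frozenset({"chips"}))

# Category-level rule: beverages -> snacks.
category_rule = rule_stats(
    generalize(transactions, taxonomy),
    frozenset({"beverages"}),
    frozenset({"snacks"}),
)
```

In this small example the category-level rule has much higher support and confidence than any single item-level rule, which illustrates why rules at higher taxonomy levels can surface patterns that are too sparse to see among tens of thousands of distinct items.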

Support Vector Machine Node

The Support Vector Machine node belongs to the Model category of the SAS SEMMA (Sample, Explore, Modify, Model, Assess) data mining process. You use the experimental Support Vector Machine node for classification and regression modeling. The Support Vector Machine node uses hyperplanes to separate points that are mapped on a higher dimensional space. The data points that lie closest to the separating hyperplane, and that therefore determine its position, are called support vectors. The Support Vector Machine node is not included with SAS Enterprise Miner for the Desktop.


Enhanced PMML Capabilities

The following PMML enhancements are introduced in SAS Enterprise Miner 5.3:


Enhanced Capabilities for Creating Extension Tools

In SAS Enterprise Miner 5.3, the capabilities available for SAS programmers to write custom SAS Enterprise Miner extension tools have been significantly enhanced.


Documentation Enhancements

The Getting Started with SAS Enterprise Miner 5.3 book is now available from SAS OnlineDoc 9.1.3 for the Web. This guide introduces you to the core functionality of SAS Enterprise Miner and shows you how to perform basic data mining tasks. You will learn how to use the graphical user interface (GUI) tools to create and manage process flow diagrams and projects, and to export mining results for reporting and integration with other SAS software. The data mining tasks you will learn include sampling, exploring, modifying, modeling, and assessing data in order to create and refine predictive models. Getting Started with SAS Enterprise Miner 5.3 provides step-by-step examples that create a complete process flow diagram, including graphic results. This title is also available online and in hard-copy format.