SAS Enterprise Miner Key Features
Intuitive interfaces
- Interactive GUI for building process flow diagrams
- Batch processing code for scheduling large modeling and scoring jobs
Data preparation, summarization, and exploration
- Access to and integration of structured and unstructured data sources
- Outlier filtering
- Data sampling and partitioning
- File import from various sources
- Tools for merging and appending
- Easy-to-use Graph wizard and Graph Explore node
- Segment profile plots
- Data transformations
- Interactive variable binning
- Data replacement
- Time series data preparation
- Rules Builder node for creating ad hoc data-driven rules and policies
- Market basket analysis
- Sequence and web path analysis
- Link analysis
Dimension-reduction techniques
- Unsupervised and supervised variable selection
- LAR (least angle regression) variable selection\
- Variable clustering
- Principal component analysis
- Time series dimension reduction
Predictive modeling and machine learning
- Clustering
- Self-organizing maps
- Linear and logistic regression
- Decision trees
- Gradient boosting
- Neural networks
- Partial least squares regression
- Two-stage modeling
- Memory-based reasoning
- Model ensembles, including bagging and boosting
- Time series data mining
- Survival analysis
- Incremental response (net lift) models
Open-source integration node enables you to:
- Write code in the R language inside SAS Enterprise Miner
- Use SAS Enterprise Miner data and metadata in your R code and return R results to SAS Enterprise Miner
- Perform model comparison and generate SAS score code for supported models
Select set of high-performance nodes
- HP Data Partition
- HP Explore
- HP Transform
- HP Impute
- HP Variable Selection
- HP Principal Components
- HP Cluster
- HP BN Classifier
- HP Regression
- HP Neural
- HP Forest
- HP Tree
- HP GLM
- HP SVM
Fast, easy, and self-sufficient way for business users to generate models
- SAS® Rapid Predictive Modeler automatically generates predictive models for a variety of business problems and produces concise reports (including variable importance charts, lift charts, ROC charts, and model scorecards) for easy consumption and review
- Business analysts and subject-matter experts work from SAS® Enterprise Guide® or SAS® Add-In for Microsoft Office (Excel only)
- Models can be opened, augmented, and modified in SAS Enterprise Miner
Model comparisons, reporting, and management
- Assessment features for comparing multiple models by using lift curves, statistical diagnostics, and ROI metrics
- The innovative Cutoff node to determine probability cutoff points for binary targets
- Report creation and distribution
- Group processing for multiple targets and segments
- Interactive environment for comparing competing models and assessing the importance of key input variables on the predicted response times
- Integrated environment provided by the Register Model node for registering models into the SAS® Metadata Server
- Models that are developed with SAS code can also be registered into the SAS Metadata Server by using a SAS macro
Automated scoring process
- Interactive scoring in a variety of real-time or batch environments
- Automatic generation of score code in SAS, C, Java, and PMML
- Creation of the code needed to score models directly inside Aster, EMC Pivotal (previously Greenplum), IBM DB2, IBM Netezza, Oracle, and Teradata databases with SAS® Scoring Accelerator
- Integration of SAS Enterprise Miner training and scoring processes directly into other SAS solutions
Open, extensible design
- Extension nodes for easily adding tools and personalized SAS code
- Interactive editor features for training and score code in the SAS Code node
- Integration of text mining for analysis of both structured and unstructured data
- Incorporation of time series, web paths, and association rules as additional input variables into the model development process
Scalable processing
- Both the Java client and the SAS server architecture scale from single-user solutions to large-enterprise solutions
- Server-based processing and storage
- Grid computing, in-database, and in-memory processing options
- Asynchronous model building
- Ability to stop processing cleanly
- Parallel processing