SAS Technical Papers » Data Mining and Text Mining

See other SAS Enterprise Miner technical papers.

See other SAS Credit Scoring technical papers.

Big Data Analytics: Benchmarking SAS, R, and Mahout
Ames, Allison J.; Abbey, Ralph; Thompson, Wayne; SAS Institute, Inc. 2013
This paper benchmarks SAS and open-source products to analyze big data by modeling four classification problems from real customers. The products that were benchmarked are SAS Rapid Predictive Modeler (a component of SAS Enterprise Miner), SAS High-Performance Analytics Server (using Hadoop), R and Apache Mahout. Results were compared in terms of model quality, modeler effort, scalability and completeness.
Scalability of the SAS/STAT HPGENSELECT High-Performance Analytical Procedure: A Comparison with RevoScaleR
Thompson, Wayne; Ames, Jennifer; Ho, Dright; SAS Institute, Inc. 2013
This paper compares the performance of the HPGENSELECT procedure with results cited for the RevoScaleR package by using data that are similar to the insurer's data. The paper also demonstrates the scalability of the HPGENSELECT procedure by using two sizes of data sets and three different computing environments.

A New Age of Data Mining in the High-Performance World
Dean, Jared; Duling, David; Thompson, Wayne; SAS Institute, Inc. 2012
This paper discusses the options and methods available for use in High- Performance Data Mining and uses real data for performance benchmarks.