Contents
About
Credits and Acknowledgments
Credits
Documentation
Software
Testing
Internationalization Testing
Technical Support
Acknowledgments
What’s New in Base SAS High-Performance Procedures 13.2
Overview
Infrastructure Enhancements
GRIDDATASERVER= Option in the PERFORMANCE Statement No Longer Needed
GRIDMODE= Option in the PERFORMANCE Statement No Longer Needed
Multiple Data Access Modes Supported in a Single Procedure Step
Running Asymmetrically Alongside SAP HANA
Procedure Enhancements
HPBIN Procedure
HPCORR Procedure
HPSAMPLE Procedure
Introduction
Overview of Base SAS High-Performance Procedures
About This Book
Chapter Organization
Typographical Conventions
Options Used in Examples
Online Documentation
SAS Technical Support Services
Shared Concepts and Topics
Overview
Processing Modes
Single-Machine Mode
Distributed Mode
Controlling the Execution Mode with Environment Variables and Performance Statement Options
Determining Single-Machine Mode or Distributed Mode
Data Access Modes
Single-Machine Data Access Mode
Distributed Data Access Mode
Determining the Data Access Mode
Alongside-the-Database Execution
Alongside-LASR Distributed Execution
Running High-Performance Analytical Procedures Alongside a SAS LASR Analytic Server in Distributed Mode
Starting a SAS LASR Analytic Server Instance
Associating a SAS Libref with the SAS LASR Analytic Server Instance
Running a High-Performance Analytical Procedure Alongside the SAS LASR Analytic Server Instance
Terminating a SAS LASR Analytic Server Instance
Alongside-LASR Distributed Execution on a Subset of the Appliance Nodes
Running High-Performance Analytical Procedures in Asymmetric Mode
Running in Symmetric Mode
Running in Asymmetric Mode on One Appliance
Running in Asymmetric Mode on Distinct Appliances
Alongside-HDFS Execution
Alongside-HDFS Execution by Using the SASHDAT Engine
Alongside-HDFS Execution by Using the Hadoop Engine
Output Data Sets
Working with Formats
PERFORMANCE Statement
The HPBIN Procedure
Overview: HPBIN Procedure
Bucket Binning
Winsorized Binning
Quantile Binning
Pseudo–Quantile Binning
PROC HPBIN Features
Getting Started: HPBIN Procedure
Syntax: HPBIN Procedure
PROC HPBIN Statement
CODE Statement
FREQ Statement
ID Statement
INPUT Statement
PERFORMANCE Statement
TARGET Statement
Details: HPBIN Procedure
Computing the Quantiles (Percentiles)
Binning Computation and Formulas
Computing the Weight of Evidence and Information Value
Data Output
ODS Tables
Examples
Bucket Binning in Single-Machine Mode
Pseudo–Quantile Binning in Distributed Mode
Quantile Binning in Distributed Mode
Winsorized Binning
Bucket Binning and Weight-of-Evidence Computation
The HPCORR Procedure
Overview: HPCORR Procedure
PROC HPCORR Features
Getting Started: HPCORR Procedure
Syntax: HPCORR Procedure
PROC HPCORR Statement
FREQ Statement
PERFORMANCE Statement
VAR Statement
WEIGHT Statement
WITH Statement
Details: HPCORR Procedure
Pearson Product-Moment Correlation
Multithreading
Output Tables
Output Data Sets
ODS Table Names
Examples: HPCORR Procedure
Computing the Pearson Measure of Association in Single-Machine Mode
Computing the Pearson Measure of Association in Distributed Mode
References
The HPDMDB Procedure
Overview: HPDMDB Procedure
Getting Started: HPDMDB Procedure
Syntax: HPDMDB Procedure
PROC HPDMDB Statement
CLASS Statement
FREQ Statement
PERFORMANCE Statement
VAR Statement
WEIGHT Statement
Details: HPDMDB Procedure
Examples: HPDMDB Procedure
Running PROC HPDMDB on the Client
Running with Client Data on the SAS appliance
Running with Data on the SAS appliance
The HPDS2 Procedure
Overview: HPDS2 Procedure
PROC HPDS2 Features
Single-Machine and Distributed Execution Modes
Getting Started: HPDS2 Procedure
Syntax: HPDS2 Procedure
PROC HPDS2 Statement
DATA Statement
ENDDATA Statement
PERFORMANCE Statement
QUIT Statement
RUN Statement
RUN CANCEL Statement
Details: HPDS2 Procedure
Parallel Execution of DS2 Code
Limitations and Issues
Packages
PERFORMANCE Statement Options
Data Input/Output
Data Types and Declarations
Error Messages
Examples: HPDS2 Procedure
Compute Mandelbrot Set
Aggregate Result Data Set
The HPIMPUTE Procedure
Overview: HPIMPUTE Procedure
PROC HPIMPUTE Features
Single-Machine and Distributed Execution Modes
Getting Started: HPIMPUTE Procedure
Syntax: HPIMPUTE Procedure
PROC HPIMPUTE Statement
CODE Statement
FREQ Statement
ID Statement
IMPUTE Statement
INPUT Statement
PERFORMANCE Statement
Details: HPIMPUTE Procedure
Obtaining the Statistics for Imputation
Random Imputation
Displayed Output
ODS Table Names
Examples: HPIMPUTE Procedure
Running Client Data on the Client
Running Client Data on the SAS Appliance
The HPSAMPLE Procedure
Overview: HPSAMPLE Procedure
Getting Started: HPSAMPLE Procedure
Syntax: HPSAMPLE Procedure
PROC HPSAMPLE statement
CLASS statement
PERFORMANCE statement
TARGET statement
VAR statement
Details: HPSAMPLE Procedure
Class Level
Displayed Output
ODS Table Names
Examples: HPSAMPLE Procedure
Running PROC HPSAMPLE on the Client
Running with Client Data on the SAS Appliance
Running with Data on the SAS Appliance
The HPSUMMARY Procedure
Overview: HPSUMMARY Procedure
PROC HPSUMMARY Features
Single-Machine and Distributed Execution Modes
PROC HPSUMMARY Contrasted with Other Procedures
Getting Started: HPSUMMARY Procedure
Syntax: HPSUMMARY Procedure
PROC HPSUMMARY Statement
Statistic Keywords
CLASS Statement
FREQ Statement
OUTPUT Statement
PERFORMANCE Statement
TYPES Statement
VAR Statement
WAYS Statement
WEIGHT Statement
Details: HPSUMMARY Procedure
How PROC HPSUMMARY Groups Data
Computational Resources
Statistical Computations
Results
Keywords and Formulas
References
Product
Release
Base SAS
9.4_M2
Type
Usage and Reference
Copyright Date
August 2014
Last Updated
05Aug2014