The HPBIN procedure has the following features:
provides a bucket (equal-length) binning method
provides a Winsorized binning method and Winsorized statistics
provides a pseudo–quantile binning method, which is similar to quantile binning
provides a mapping table for the selected binning method
provides a basic statistical table that contains the minimum, maximum, mean, pseudo-median, and so on
provides an estimation of a pseudo–quantile table
calculates the weight of evidence (WOE) and information value (IV) based on binning results
reads input data in parallel and writes output data in parallel when the data source is in a database on the appliance
Because the HPBIN procedure is a high-performance analytical procedure, it also does the following:
enables you to run in distributed mode on a cluster of machines that distribute the data and the computations
enables you to run in single-machine mode on the server where SAS is installed
exploits all the available cores and concurrent threads, regardless of execution mode
For more information, see the section Processing Modes in Chapter 2: Shared Concepts and Topics.