Binning is a common step in the data preparation stage of the model-building process. You can use binning to classify missing variables, reduce the impact of outliers, and generate multiple effects. The generated effects are useful and contain certain nonlinear information about the original interval variables.
The HPBIN procedure conducts high-performance binning by using bucket binning, Winsorized binning, or pseudo–quantile binning. The HPBIN procedure can also calculate the weight of evidence (WOE) and information value (IV) based on binning results.
The HPBIN procedure runs in either single-machine mode or distributed mode.
Note: Distributed mode requires SAS High-Performance Server Distributed Mode .