About the Book
Anticipated publication date: May 2009
The tools used to combine and modify data sets include the SET, MERGE, MODIFY, and UPDATE statements in the DATA step; joins and set operators in PROC SQL; BY-group processing; indexes; hash objects in the DATA step; use of PROC FORMAT and hash tables as table lookups; and generation data sets. Functions and techniques used to reshape and modify observations include those that work with variable type; PROC SQL dictionary tables; creating random samples using the DATA step and PROC SURVEYSELECT; time and date interval calculations; cleaning and validating data values using functions, hash tables, and regular expressions.