This course introduces programming techniques for crafting and engineering meaningful input features that improve predictive modeling performance. It also provides strategies for spotting and avoiding, ahead of time, common pitfalls that compromise the integrity of the data used to build a predictive model. The course relies heavily on SAS programming techniques to accomplish these objectives.
The self-study e-learning includes:
- Annotatable course notes in PDF format.
- Virtual Lab time to practice.
Learn how to:
- Extract data from a relational data table structure.
- Define population qualifications and create a target sample.
- Use feature engineering techniques to transform transactional data into meaningful inputs for a predictive model.
- Transform low-, mid-, and high-cardinality categorical input variables into meaningful predictive modeling inputs.
- Use ZIP codes and latitude/longitude points to calculate great-circle distance, driving distance, and estimated driving time.
- Estimate meaningful predictive modeling inputs with Bayes' theorem, impute missing observations, and partition the target sample into training and validation data sets for honest assessment of the predictive model.
Who should attend
Analysts, data scientists, and IT professionals looking to craft better inputs to improve predictive modeling performance
Standard duration (duration may vary; check your schedule)
6 half-day session(s)
21 hours/180-day license
This course assumes some experience in both predictive modeling and SAS programming. Before attending this course, you should have:
- Exposure to DATA step programming equivalent to the Programación SAS 1: Introducción course.
- Exposure to programming in SQL or the SQL procedure.
- Exposure to querying data in PROC SQL and building and deploying a predictive model.
- Familiarity with the analytical process of building predictive models and scoring new data.
Familiarity with the SAS macro language is helpful but not required.
This course uses Base SAS and SAS/STAT software.