Hadoop Data Management with Hive, Pig, and SAS
There is a new version of this course. Please see Hadoop Data Management with Hive, Pig, and SAS.
In this course, you use processing methods to prepare structured and unstructured big data for analysis. You learn to organize this data into structured tabular form using Apache Hive and Apache Pig. You also learn SAS software technology and techniques that integrate with Hive and Pig and how to leverage these open source capabilities by programming with Base SAS and SAS/ACCESS Interface to Hadoop, and with SAS Data Integration Studio.
The Extended Learning page for this course includes the option to purchase Virtual Lab time to practice.
The e-learning format of this course also includes the option to purchase Virtual Lab time to practice.Learn how to
Who should attendData scientists and programmers, database administrators, applications developers, and ETL developers who are looking for an in-depth technical overview of data management and extraction for big data and the Hadoop ecosystem
A basic understanding of and experience with UNIX and SQL is preferred. For advanced topics such as user-defined functions, prior programming experience is necessary.
This course addresses SAS/ACCESS, Base SAS, SAS Data Integration Studio software.
The Apache Hadoop Project