Prerequisites for the Hadoop Transformations

Prerequisites for the Hive Transformation

The Hive transformation uses the SAS/ACCESS Interface for Hadoop. Accordingly, the Hive transformation has the following prerequisites:
  • Your site must meet the system requirements for SAS/ACCESS Interface for Hadoop, for your SAS release and operating system. You can review system requirements for SAS products at: http://support.sas.com/documentation/installcenter/index.html.
  • You must establish connectivity to Hadoop. This includes registering the Hadoop Server and a “Hadoop via Hive” library on the SAS Metadata Server. See “Establishing Connectivity to Hadoop” in the SAS Intelligence Platform: Data Administration Guide. This book is available at the SAS Intelligence Platform documentation page: http://support.sas.com/documentation/onlinedoc/intellplatform/.
  • You must copy certain JAR files from the Hadoop installation folder to a folder on the SAS Workspace Server that executes SAS Data Integration Studio jobs. For a list of the JAR files required to support the SAS/ACCESS Interface for Hadoop, see SAS Hadoop Configuration Guide for Base SAS and SAS/ACCESS. This book is available at the Third-Party Software page: http://support.sas.com/resources/thirdpartysupport/.

Prerequisites for Other Hadoop Transformations

Unlike the Hive transformation, the following Hadoop transformations use the HADOOP procedure:
  • Hadoop Container
  • Hadoop File Reader
  • Hadoop File Writer
  • Map Reduce
  • Pig
  • Transfer From Hadoop
  • Transfer to Hadoop
Accordingly, these transformations have the following prerequisites:
  • You must establish connectivity to Hadoop. This includes registering the Hadoop Server on the SAS Metadata Server. See “Establishing Connectivity to Hadoop” in the SAS Intelligence Platform: Data Administration Guide. This book is available at the SAS Intelligence Platform documentation page: http://support.sas.com/documentation/onlinedoc/intellplatform/.
  • You must copy certain JAR files from the Hadoop installation folder to a folder on the SAS Workspace Server that executes SAS Data Integration Studio jobs. For a list of the JAR files required to support Base SAS (including the HADOOP procedure), see SAS Hadoop Configuration Guide for Base SAS and SAS/ACCESS. This book is available at the Third-Party Software page: http://support.sas.com/resources/thirdpartysupport/.