Overview of Hadoop Installation and Configuration Steps

To install and configure Hadoop, follow these steps:
  1. If you are upgrading from or reinstalling a previous release, follow the instructions in Upgrading from or Reinstalling a Previous Version before installing the in-database deployment package.
  2. Copy the in-database deployment package install script (sepcorehadp) to the Hadoop master node (the NameNode).
    Note: In the July 2015 release of SAS 9.4, the in-database deployment package install script changed name from tkindbsrv to sepcorehadp. The SAS Embedded Process and the SAS Hadoop MapReduce JAR files are now included in the same script. The SAS Embedded Process is the core technology of the in-database deployment package.
  3. Install the SAS Embedded Process.
    For more information, see Installing the SAS Embedded Process.
  4. Review any additional configuration that might be needed depending on your Hadoop distribution.
Note: If you are installing the SAS Data Loader for Hadoop, you must perform additional steps after you install the SAS Embedded Process. For more information, see Part 3, “Administrator’s Guide for SAS Data Loader for Hadoop”.
Note: If you are installing the SAS High-Performance Analytics environment, you must perform additional steps after you install the SAS Embedded Process. For more information, see SAS High-Performance Analytics Infrastructure: Installation and Configuration Guide.