Getting Started with the Deployment

About the Deployment

This chapter describes deployment of SAS Data Loader 2.4 for Hadoop Cloudera and SAS Data Loader 2.4 for Hadoop Hortonworks. SAS sends an email to a contact person at your business or organization. This email specifies whether the product is SAS Data Loader 2.4 for Hadoop Cloudera or SAS Data Loader 2.4 for Hadoop Hortonworks and includes instructions for downloading a ZIP file. The ZIP file contains all product files that are required for installation of SAS In-Database Technologies for Hadoop on the Hadoop cluster. The contact person is responsible for making the ZIP file available to you.
The individual components of SAS In-Database Technologies for Hadoop are described in the following chapters: SAS In-Database Deployment Package for Hadoop, SAS In-Database Technologies for Data Quality Directives, and SAS Data Management Accelerator for Spark.
Note: For further specific information about SAS In-Database Technologies for Hadoop, see Introduction.

Before Deployment

If you are installing a new version or reinstalling a previous version of SAS In-Database Technologies for Hadoop, you must deactivate or remove other existing SAS In-Database Technologies for Hadoop parcels or stacks after installing the new one. More than one parcel or stack can be deployed on your cluster, but only one parcel can be activated at a time. See Deactivating or Removing Existing Versions.

Overview of Deployment Steps

  1. Configure Kerberos, if appropriate, and then provide required configuration values to the vApp user.
  2. Identify a Windows server in a shared network location that is accessible to vApp users.
  3. Review the Hadoop Environment topic in the system requirements for SAS Data Loader 2.4.
  4. Obtain the ZIP file.
  5. Extract zipped files.
  6. Deploy services using Cloudera Manager or Ambari.
  7. Edit the Hadoop configuration file.
  8. Collect required files from the Hadoop cluster.
  9. Make the required vApp directory available on the Windows server in the shared network location.
  10. Configure the Hadoop cluster, and then provide required configuration values to the vApp user.
Note: If you switch to a different distribution of Hadoop after the initial installation of SAS In-Database Technologies for Hadoop, you must reinstall and reconfigure SAS In-Database Technologies for Hadoop on the new Hadoop cluster.