Getting Started

About Deployment

This chapter describes manual deployment of SAS Data Management Accelerator for Spark. SAS sends an email to a contact person at your business or organization. This email includes instructions for downloading your software to the SAS Software Depot. After downloading the software, you can install it manually. The conditions under which you install manually are described in When to Deploy the SAS In-Database Deployment Package Manually. Although that topic describes the conditions for the SAS In-Database deployment package, the same conditions apply to SAS Data Management Accelerator for Spark.
Note: Deploy SAS Data Management Accelerator for Spark only if Spark is available on the cluster.
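
One informal way to check the condition in the preceding note is to look for the Spark launcher on a cluster node before you begin. The following Python sketch is illustrative only and is not part of the SAS software. It assumes that Spark, when installed, exposes a spark-submit command on the PATH of the node where the script runs; your distribution might install Spark elsewhere, so treat a negative result as a prompt to check with your Hadoop administrator rather than as proof that Spark is absent. The function names are hypothetical.

```python
import shutil


def spark_submit_path(command="spark-submit"):
    """Return the full path of the Spark launcher if it is on PATH, else None.

    Assumption: the presence of spark-submit on PATH indicates that Spark
    is installed on this node. This is a heuristic, not a SAS requirement.
    """
    return shutil.which(command)


def spark_available(command="spark-submit"):
    """Return True if the launcher command can be found on PATH."""
    return spark_submit_path(command) is not None


if __name__ == "__main__":
    path = spark_submit_path()
    if path:
        print(f"Spark launcher found at {path}")
    else:
        print("spark-submit not found on PATH; confirm that Spark is "
              "installed before deploying")
```

Run the script on a node of the cluster where you plan to deploy. Because the check looks only at the local PATH, it says nothing about the other nodes.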

Before Deploying

If you are installing a new version or reinstalling a previous version of SAS Data Management Accelerator for Spark, you must first remove the current version. For this procedure, see -remove -keepconfig under SASDMP_ADMIN.SH Syntax.

Overview of Deployment Steps

Here are the tasks to be completed during deployment:
  1. Configure Kerberos, if appropriate, and then provide required configuration values to the vApp user.
  2. Identify a Windows server in a shared network location that is accessible to vApp users.
  3. Review the Hadoop Environment topic in the system requirements for SAS Data Loader 2.4.
  4. Install SAS Data Management Accelerator for Spark.
  5. Install additional components, if necessary.
  6. Configure the Hadoop cluster, and then provide required configuration values to the vApp user.
Note: If you switch to a different distribution of Hadoop after the initial installation of SAS In-Database Technologies for Hadoop, you must reinstall and reconfigure SAS In-Database Technologies for Hadoop on the new Hadoop cluster.