Deploying SAS In-Memory Statistics

Installation Sequence

If SAS In-Memory Statistics is installed along with a SAS solution such as SAS Visual Analytics, then follow the steps that are provided in the installation guide for the solution. The software is automatically installed when it is delivered with a SAS solution.
If you are not installing the software as part of a solution, then you are performing a "Basic" installation instead of a "Planned" installation. Use the documents in the following sections to install the software.

Software for Your Analytics Cluster

Basic Steps

Information about installing and configuring SAS High-Performance Analytics Environment, SAS Plug-ins for Hadoop, and SAS High-Performance Computing Management Console is available in the SAS High-Performance Analytics Infrastructure: Installation and Configuration Guide. This book is available at the following URL:
Note: SAS High-Performance Analytics Infrastructure: Installation and Configuration Guide directs you to install the SAS Embedded Process. You can install it only after SAS Foundation and SAS/ACCESS are installed. This applies whether you use Hadoop or Teradata.

How Is Hadoop Used?

The SAS Plug-ins for Hadoop provide services that run inside Hadoop to enable working with the SASHDAT format. This software is installed on the same machines in the analytics cluster (co-located) for the purpose of providing a physical location for staging data. By staging the data on the same machines, data access is more efficient.
Typically, the data to analyze are in a data store like Hadoop or Teradata that is remote from the analytics cluster. The LASR procedure can transfer the data from the remote data store and the SAVE statement in the IMSTAT procedure can persist the data in HDFS as a SASHDAT table.

SAS Foundation and Related Software

Install SAS Foundation

On the machine that you will use as the SAS client for writing and submitting SAS programs, run the SAS Deployment Wizard to install SAS Foundation. The type of SAS Studio that is installed depends on your host operating system. See the following sections for details.

About SAS Studio Basic and SAS Studio - Single User

SAS Studio is a development application for writing SAS programs and submitting them. You can access SAS Studio through your web browser.
SAS Studio Basic is included with an order for SAS In-Memory Statistics for Hadoop on Linux for x64.
SAS Studio - Single User is included with an order for SAS In-Memory Statistics for Hadoop on Windows.
See SAS Studio: Administrator’s Guide for information about installing SAS Studio and administration.

UNIX Hosts

Use the SAS Deployment Wizard to install SAS. Refer to the documentation for UNIX hosts at the following URL:
When you run the SAS Deployment Wizard, you can specify to install SAS Studio Basic. Refer to "SAS Studio Basic" in the SAS Studio: Administrator’s Guide.
After the SAS Deployment Wizard installs the software, be sure to follow the instructions for configuring Hadoop JAR files. Refer to Configuration Guide for SAS 9.4 Foundation for UNIX Environments available at the preceding URL.

Windows Hosts

Use the SAS Deployment Wizard to install SAS. Refer to the documentation for Windows hosts at the following URL:
After the SAS Deployment Wizard installs the software, be sure to follow the instructions for configuring Hadoop JAR files. Refer to Configuration Guide for SAS 9.4 Foundation for Microsoft Windows for x64 available at the preceding URL.

SAS In-Database Products, SAS Embedded Process, and Remote Parallel Connections

After SAS Foundation and SAS/ACCESS are installed, follow the instructions in the SAS High-Performance Analytics Infrastructure: Installation and Configuration Guide to install the SAS Embedded Process for Hadoop or Teradata on the remote Hadoop cluster or Teradata appliance. The instructions for Teradata are included in SAS In-Database Products: Administrator’s Guide.
Make sure you follow the steps in section "Configuring the Analytics Environment for a Remote Parallel Connection" to ensure that the LASR procedure can load data in parallel from the data store to SAS LASR Analytic Server.