SAS In-Memory Statistics
is an offering that provides the data scientist or analytical expert
with interactive programming access to in-memory data and integrates
seamlessly with data in Hadoop or a Teradata Data Warehouse Appliance.
This information applies to the SAS In-Memory Statistics 2.6.
In order to use the
offering, the following must be true:
-
You
are using a distributed SAS LASR Analytic Server only.
-
The SAS LASR Analytic Server
is co-located with SAS High-Performance Deployment for Hadoop
or a commercial Hadoop distribution that has been configured with
the services from SAS High-Performance Deployment for Hadoop. The services enable
you to use the
SASHDAT file format for storing tables in
HDFS.
-
Data Access Requirement At least one of the following:
-
For deployments that work with
Hadoop data, SAS/ACCESS Interface to Hadoop must be configured on
a client machine that you use for submitting SAS programs. Be sure
to install the SAS Embedded Process on the machines in the Hadoop
cluster. The SAS/ACCESS engine, the embedded process, and the HDMD
procedure enable you to describe your data that is in Hadoop and access
it directly without an intermediate metadata repository such as Hive.
-
For deployments that work with
Teradata, SAS/ACCESS Interface to Teradata must be configured on a
client machine that you use for submitting SAS programs. Be sure to
install the SAS Embedded Process on the machines in the Teradata appliance.
The SAS/ACCESS engine and the embedded process work together to transfer
data from Teradata into memory on the server.
-
SAS Studio provides an interactive
web-based development application that enables you to write and submit
SAS programs. Make sure that your user ID is configured for passwordless
SSH to the machines the cluster. Also make sure that you have passwordless
SSH access from the machine that hosts SAS Studio to the machines
in the cluster.
For more information,
see Passwordless SSH.