Prerequisites for the High-Performance Analytics Transformations

For SAS Data Integration Studio Users

High-Performance Analytics transformations have the following unique prerequisites:
  • SAS Data in HDFS Loaders require a SAS Data in HDFS library for their target tables. Administrators should let you know which libraries are available to you.
  • SAS LASR Analytic Server Loaders require a SAS LASR Analytic Server library for their target tables. Administrators should let you know which libraries are available to you.
  • In order to submit a job with any High-Performance Analytics transformation, you must have login credentials that are configured for Passwordless Secure Shell (SSH) on the machines in the High-Performance Analytics cluster. Contact your administrator to obtain these login credentials.
Administrators usually set up the servers, libraries, and users that are required by the High-Performance Analytics transformations. SAS Data Integration Studio users are simply told what user credentials to use and which libraries should be specified in a given job. See also Usage Notes for HPA Software and Hadoop.

For Administrators

Libraries, Users, and Servers Are in the Same Metadata Environment

The servers, libraries, and user IDs that are required by the High-Performance Analytics transformations are assumed to be registered in the same metadata repositories on the SAS Metadata Server. This is the usual configuration when SAS Data Integration Studio and SAS LASR Analytic Server are installed as part of a SAS Analytics solution.

Verify HDFS and LASR Libraries

An initial set of libraries might be configured when SAS Data Integration Studio and SAS LASR Analytic Server are installed as part of a SAS Analytics solution. This includes SAS Data in HDFS libraries and SAS LASR Analytic Server libraries. These libraries are used to connect to HDFS and the SAS LASR Analytic Server. Let SAS Data Integration Studio users know which libraries are available to them.
For more information about registering SAS Data in HDFS libraries and SAS LASR Analytic Server libraries, see the” Connecting to Common Data Sources” chapter in SAS Intelligence Platform: System Administration Guide. This book is available at: http://support.sas.com/documentation/onlinedoc/intellplatform/

Set Up Passwordless Secure Shell (SSH) Access for Selected Users

Note: This is a required, post-installation task for administrators.
Each user who wants to submit a job that includes a High-Performance Analytics transformation must log on to SAS Data Integration Studio with a special operating system user ID. This ID must be configured for Passwordless Secure Shell (SSH) on the machines in the High-Performance Analytics cluster. One way to do this is to configure the operating system user IDs of appropriate SAS Data Integration Studio users. The operating system user ID is specified in the metadata connection profile for the SAS Data Integration Studio user.
For more information about this task, see in the sections about SSH in SAS LASR Analytic Server: Reference Guide. This book is available at: http://support.sas.com/documentation/onlinedoc/securedoc/index_lasrserver.html.
Last updated: January 16, 2018