SAS Visual Analytics
offers self-service options for importing data from Cloudera Hadoop,
Hortonworks Data Platform, and InfoSphere BigInsights. Each of these
platforms requires separate setup by your administrator.
A common connection
type for all of these platforms is to connect to Hive or HiveServer2
and then import tables. The SAS system options field
can be used to specify environment variables such as the following:
set=SAS_HADOOP_JAR_PATH="/path/to/files"
The options
keyword
is submitted with any options that you specify in the field.
If the Hadoop cluster
is configured with the SAS Embedded Process, then you can perform
parallel loading from HDFS to SAS LASR Analytic Server. In this case,
the Configuration field must specify the
path to a Hadoop configuration file. You must also specify at least
the HDFS_METADIR= and HDFS_DATADIR= options in the Hadoop
options field. More options might be necessary for your
site. For information about setting up parallel loading from Hadoop,
see SAS Visual Analytics: Administration Guide.