Note: SAS In-Database Technologies for Hadoop must
be installed before the installation of the SAS Data Loader for Hadoop
vApp in order for the vApp to communicate successfully with the Hadoop
cluster.
System requirements
for the Hadoop environment are as follows:
-
Cloudera CDH 5.2
or Hortonworks HDP 2.1.
Note: Both Hive 2 and YARN (MapReduce
2) are supported. MapReduce 1 is not supported.
-
If Kerberos security is supported
on the Hadoop cluster, then the vApp must be configured for Kerberos
on the client machine.
-
SAS Data Loader for Hadoop uses
the SQOOP and OOZIE components of your Hadoop deployment to move data
into or out of a DBMS. These components must be enabled in your Hadoop
cluster in order to communicate with the DBMS to which SAS Data Loader for Hadoop
users need access.
-
The JDBC drivers required by the
DBMS that the SAS Data Loader for Hadoop users need
access to must be installed on the Hadoop cluster.