To use
SPD Server to access files on a
Hadoop server, a set of Hadoop
JAR and configuration files must be available to the SPD Server machine. To make the
required JAR and configuration files available, you must obtain these files from the
Hadoop
cluster, copy them to the SPD Server machine, and specify the Hadoop
parameter file options.
There are two methods to obtain the JAR and configuration files:
-
If you license SAS/ACCESS Interface to Hadoop, use the SAS Deployment Manager.
-
Use the Hadoop tracer script hadooptracer.py in Python that is provided by SAS.
Note: Gathering the JAR and configuration
files is a one-time process (unless you are updating your cluster
or changing Hadoop vendors). If you have already gathered the Hadoop
JAR and configuration files for another SAS component using the SAS
Deployment Manager or the Hadoop tracer script, you do not need to
do it again.