Validating the SPD Server to Hadoop Connection

To validate that SPD Server is correctly configured to access the Hadoop cluster:
  • Ensure that you specified values for the HADOOPCFG= and HADOOPJAR= parameter options in either the spdsserv.parm file or in the libnames.parm file.
  • If you want one or more domains to access Hadoop data that is secured with Kerberos, make sure that you specified values for the HADOOPKEYTAB=, HADOOPREALM, and HADOOPUSER= parameter file options for the domains in either the spdsserv.parm file or the libnames.parm file.
  • After you start SPD Server, check the SPD Server log file to verify that the domains were added to the Name Server. Hadoop option information is logged when each domain is added to the Name Server. The log entry should resemble the following:
    libname KERB added to Name Server (HADOOP=yes, KERBEROS)
       HADOOPCFG=/u/fedadmin/sdm/hadoopcfg/hdp23k1/prod
       HADOOPJAR=/bigdisk/lax/xxxxxx/spds53/hw23_hadoopjar
       HADOOPUSER=spdsmgr
       HADOOPREALM=NA.SAS.COM
       HADOOPKEYTAB=/u/fedadmin/keytab/spdsmgr.keytab
       HADOOPACLPATH=/bigdisk/lax/xxxxxx/spds53/acls.lax/HADOOPACLS/KERB
  • Submit a SASSPDS engine LIBNAME statement to a domain, such as the following example:
    libname foo sasspds 'public' server=myhost.5400 user="anonymous";
    If the LIBNAME statement is successful, then your Hadoop connection has been made.
  • Add MYDOMAIN to your libnames.parm file as a domain libref with a path located in your Hadoop file system.
The following SPD Server log shows the output for a LIBNAME statement and two PROC steps. The log results verify that the Hadoop environment is set up correctly.
SPD Server Log
1       libname foo sasspds "public" host="lax94t01.unx.sas.com" service="14503" user="anonymous";
NOTE: This is a SPD 5.3  Engine 
      executing SAS (r) 9.4 (TS1M3) on the Linux platform.
NOTE: User anonymous(ACL Group ) connected to SPD(LAX) 5.3 server at 10.24.8.107.
NOTE: Libref FOO was successfully assigned as follows: 
      Engine:        SASSPDS 
      Physical Name: :5763/user/spdsmgr/sasxxx/test_domains/lax94t01/public/
2          data foo.atable;
3            x=1;
4          run;

NOTE: The data set FOO.ATABLE has 1 observations and 1 variables.
NOTE: DATA statement used (Total process time):
      real time           1.42 seconds
      cpu time            0.04 seconds
      
2                                                The SAS System                                 

5          proc datasets lib=foo;
                                                             Directory

                  Libref               FOO                                                    
                  Engine               SASSPDS                                                
                  Physical Name        :5763/user/spdsmgr/sasxxx/test_domains/lax94t01/public/
                  Local Host Name      lax94t01                                               
                  Local Host IP addr   10.24.8.107                                            
                  Server Hostname      lax94t01.unx.sas.com                                   
                  Server IP addr       .                                                      
                  Server Portno        37615                                                  
                  Free Space (Kbytes)  9.0071993E15                                           
                  Metapath             '/user/spdsmgr/sasxxx/test_domains/lax94t01/public/'   
                  Indexpath            '/user/spdsmgr/sasxxx/test_domains/lax94t01/public/'   
                  Datapath             '/user/spdsmgr/sasxxx/test_domains/lax94t01/public/'   
                  Hadoop               YES                                                    


                                                                    Member
                                                         #  Name    Type

                                                         1  ATABLE  DATA  
6          run;


NOTE: PROCEDURE DATASETS used (Total process time):
      real time           0.62 seconds
      cpu time            0.09 seconds
      
7          proc contents data=foo.atable;
8          run;

                                                 The SAS System                             1

                                                       The CONTENTS Procedure

                               Data Set Name        FOO.ATABLE               Observations          1 
                               Member Type          DATA                     Variables             1 
                               Engine               SASSPDS                  Indexes               0 
                               Created              05/03/2016 12:06:33      Observation Length    8 
                               Last Modified        05/03/2016 12:06:33      Deleted Observations  0 
                               Protection                                    Compressed            NO
                               Data Set Type                                 Sorted                NO
                               Label                                                                 
                               Data Representation  Default                                          
                               Encoding             latin1  Western (ISO)                            


                                                 Engine/Host Dependent Information

                                               Blocking Factor (obs/block)  131072   
                                               ACL Entry                    NO       
                                               ACL User Access(R,W,A,C)     (Y,Y,Y,Y)
                                               ACL UserName                 ANONYMOU 
                                               ACL OwnerName                ANONYMOU 
                                               Data set is Ranged           NO       
                                               Data set is a Cluster        NO       


                                             Alphabetic List of Variables and Attributes
 
                                                    #    Variable    Type    Len

                                                    1    x           Num       8
NOTE: PROCEDURE CONTENTS used (Total process time):
      real time           0.89 seconds
      cpu time            0.03 seconds
      
NOTE: The PROCEDURE CONTENTS printed page 1.

9          proc print data=foo.atable;
10         run;

                                                 The SAS System                             

                                                              Obs    x

                                                               1     1
NOTE: There were 1 observations read from the data set FOO.ATABLE.
NOTE: The PROCEDURE PRINT printed page 2.
NOTE: PROCEDURE PRINT used (Total process time):
      real time           0.16 seconds
      cpu time            0.00 seconds