Step 8: Start and Configure SAS Data Loader for Hadoop

Follow these steps to start and configure SAS Data Loader for Hadoop:
  1. Open the SAS Data Loader for Hadoop: Information Center if it is not already open.
  2. In the SAS Data Loader: Information Center, click Start SAS Data Loader.
    SAS Data Loader: Information Center
    Note: When starting SAS Data Loader for Hadoop, if an error occurs stating that VT-x or AMD-v is not available, see Troubleshoot the vApp Start Process in SAS Data Loader for Hadoop: User's Guide.
  3. The SAS Data Loader web application opens in a new tab in your web browser. The first time you open the application, the Configuration window appears:
    SAS Data Loader Configuration Window
  4. In the Host field of the Configuration window, enter the fully qualified name of the host that supports your Hadoop cluster.
    Note: Contact your Hadoop administrator as needed to determine Hadoop configuration values.
  5. In the Port field, enter the number of the Hadoop port on the host that supports your cluster.
  6. In the User ID field, enter the name of the user account that will be used to connect to the Hadoop cluster.
  7. In the Oozie URL field, enter the URL to the Oozie Web Console, which is an interface to the Oozie server. The URL is similar to the following example: http://host_name:port_number/oozie/. Oozie is a workflow scheduler system that is used to manage Hadoop jobs.
  8. In the Schema for temporary file storage field, either accept the Hive default schema or click Specify a different schema and enter the name of an existing Hadoop schema.
  9. If you intend to use the directive Load Data to LASR (to copy data to an existing grid of SAS LASR Analytic Servers), then click LASR Analytic Servers. For additional steps, see Load Data to LASR in SAS Data Loader for Hadoop: User's Guide.
  10. At this point you can configure connections to the databases that you will use to copy data to and from Hadoop. To configure database connections now, see Install JDBC Drivers and Add Database Connections in SAS Data Loader for Hadoop: User's Guide.
  11. Click QKB to view the default locale, which is English. To change the default locale, right-click and select from the list.
  12. To configure the processing of profile jobs, click Profiles and see Configure Profile Jobs in SAS Data Loader for Hadoop: User's Guide. Profile jobs report on the structure and quality of the data in one or more Hadoop tables.
  13. Click OK to close the Configuration window.
To configure general preferences, see Step 9: Set General Preferences.