Follow these steps to
start and configure SAS Data Loader for Hadoop:
-
-
In the SAS Data Loader:
Information Center, click
Start SAS Data Loader.
Note: When starting SAS Data Loader
for Hadoop, if an error occurs stating that VT-x or AMD-v is not available,
see Troubleshoot the vApp Start Process in SAS Data Loader for Hadoop: User's Guide.
-
The SAS Data Loader
web application opens in a new tab in your web browser. The first
time you open the application, the
Configuration window
appears:
-
In the
Host field
of the
Configuration window, enter the fully
qualified name of the host that supports your Hadoop cluster.
Note: Contact your Hadoop administrator
as needed to determine Hadoop configuration values.
-
In the
Port field,
enter the number of the Hadoop port on the host that supports your
cluster.
-
In the
User
ID field, enter the name of the user account that will
be used to connect to the Hadoop cluster.
-
In the
Oozie
URL field, enter the URL to the Oozie Web Console, which
is an interface to the Oozie server. The URL is similar to the following
example:
http://host_name:port_number/oozie/.
Oozie is a workflow scheduler system that is used to manage Hadoop
jobs.
-
In the
Schema
for temporary file storage field, either accept the Hive
default schema or click
Specify a different schema and
enter the name of an existing Hadoop schema.
-
If you intend to use
the directive Load Data to LASR (to copy data to an existing grid
of SAS LASR Analytic Servers), then click
LASR Analytic
Servers. For additional steps,
see Load Data to LASR in SAS Data Loader for Hadoop: User's Guide.
-
At this point you can
configure connections to the databases that you will use to copy data
to and from Hadoop. To configure database connections now,
see Install JDBC Drivers and Add Database Connections in SAS Data Loader for Hadoop: User's Guide.
-
Click
QKB to
view the default locale, which is English. To change the default locale,
right-click and select from the list.
-
To configure the processing
of profile jobs, click
Profiles and
see Configure Profile Jobs in SAS Data Loader for Hadoop: User's Guide. Profile jobs report on the structure and quality of the
data in one or more Hadoop tables.
-
Click
OK to
close the
Configuration window.