The directives Copy
Data to Hadoop and Copy Data from Hadoop use JDBC drivers to connect
your vApp host to databases such as Oracle. The JDBC drivers that
are installed in the shared folder of the vApp must be the same as
those that are installed on the Hadoop cluster. The process of copying
drivers to your vApp host was part of the initial vApp configuration
process, as addressed in the SAS Data Loader for Hadoop:
vApp Deployment Guide.
Follow these steps to
add a new JDBC driver, and to add a database connection for that new
driver.
-
To obtain a new JDBC
driver, ask your Hadoop administrator to copy the driver on your Hadoop
cluster into a ZIP file and mail you the ZIP file. This process is
described in the
SAS Data Loader for Hadoop: Administrator’s
Guide.
-
Unzip the ZIP file as
follows:
-
Right-click and select
Open
with WinZip or
Expand All.
-
If you are using WinZip,
click
Unzip.
-
In Windows Explorer,
open the directory that is designated as the Shared Folder for your
vApp. Here is a typical path to the Shared Folder:
C:\Program Files\SAS Data Loader\2.2\SASWorkspace\JDBCDrivers
To find the path to
your Shared Folder, open the
VMware Player Pro window
and select
PlayerManageVirtual Machine Settings. In
the
Virtual Machine Settings window, click
the
Options tab, and then click
Shared
Folders (in the
Settings list.)
On the right side, the path to the Shared Folder is provided in the
Host
Path column.
-
Restart the vApp so
that it can pick up the new JDBC driver.
Check
the Run Status directive
to ensure that all jobs are stopped and saved.
In VMware Player Pro
, select
PlayerPowerRestart Guest. Wait for the
vApp to restart.
-
Open SAS Data Loader
for Hadoop, as described
in Get Started.
-
Click
and select
Configuration.
-
In the
Configuration window,
click
Databases. To add a new database connection,
click
Add . To edit an existing database connection, click the
name of the connection, and then click
Edit .
-
Contact your Hadoop
administrator as needed to enter values into the
Database
Configuration window. The values of
Driver
class and
Connect string are
generated automatically when you select either Teradata or Oracle
in the
Type field. For an Oracle connection
that requires a Service ID (SID), enter the SID in the
Database
name field. If you select
Other,
you must obtain these values from the JDBC driver provider.
-
When the configuration
data is ready, click
Test Connection to verify
that the connection is operational.
-
If the test fails for
a new Oracle connection, then examine the
Connect string field.
If the string has either of the following formats, then change the
string to the other format and test the connection again.
jdbc:oracle:thin:@raintree.us.ourco.com:1521:oadev
jdbc:oracle:thin:@raintree.us.ourco.com:1521/oadev
One version uses a final
colon character. The other version uses a final slash character.
To edit the
Connect
string field, click
Edit .
-
Click
OK to
close the window.
-
Open the SAS Data Loader:
Information Center and the SAS Data Loader for Hadoop and begin copying
data to and from Hadoop with your new JDBC driver.