After you run the SAS Deployment Manager to create the SAS Embedded
Process parcel, you must distribute and activate the parcel on the
cluster. Follow these steps:
Note: More than one SAS Embedded Process parcel can be deployed on
your cluster, but only one parcel can be activated at a time. Before
activating a new parcel, deactivate the old one.
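The one-active-parcel rule can be checked before you activate a new parcel. The sketch below is a minimal illustration and is not part of the SAS procedure. It assumes that you have already retrieved the cluster's parcel list (for example, from the Cloudera Manager REST API) as a list of dictionaries with product, version, and stage keys, and that an activated parcel reports the stage ACTIVATED. The product name SASEP and the version strings are hypothetical placeholders.

```python
def activated_parcels(parcels, product):
    """Return the entries for `product` whose stage is ACTIVATED."""
    return [
        p for p in parcels
        if p.get("product") == product and p.get("stage") == "ACTIVATED"
    ]


# Hypothetical parcel list; real entries come from your own cluster.
sample = [
    {"product": "SASEP", "version": "old-version", "stage": "ACTIVATED"},
    {"product": "SASEP", "version": "new-version", "stage": "DISTRIBUTED"},
    {"product": "CDH", "version": "some-version", "stage": "ACTIVATED"},
]

for parcel in activated_parcels(sample, "SASEP"):
    # Any parcel listed here should be deactivated before a new one is activated.
    print("Deactivate first:", parcel["version"])
```

If the function returns an entry for the SAS Embedded Process product, deactivate that parcel in Cloudera Manager before you continue.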
-
Log on to Cloudera Manager.
-
In Cloudera Manager, choose Hosts > Parcels.
-
Click Distribute to copy the parcel to all nodes. The SASEPHome
directory is created.
Note: If you have licensed and downloaded SAS Data Loader for Hadoop,
some SAS Data Loader for Hadoop data quality components are silently
deployed at the same time as the SAS Embedded Process for Hadoop.
Other configuration is required as noted in step 9.
You can log on to the node and list the contents of the
/opt/cloudera/parcel directory.
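The directory check mentioned above can be scripted. The following is a minimal sketch, assuming that it runs on a cluster node and that the parcel directory is the one named in this step. The function name list_parcel_dir is hypothetical, and the names of the entries that you see depend on your parcel.

```python
from pathlib import Path


def list_parcel_dir(parcel_dir: str = "/opt/cloudera/parcel") -> list[str]:
    """Return the sorted entry names in the parcel directory, or [] if it is missing."""
    root = Path(parcel_dir)
    if not root.is_dir():
        return []
    return sorted(entry.name for entry in root.iterdir())


if __name__ == "__main__":
    # Print one entry per line, similar to running `ls` on the node.
    for name in list_parcel_dir():
        print(name)
```

An empty result means that the directory does not exist on that node or that nothing has been distributed to it yet.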
-
Click Activate. This step creates a symbolic link to the SAS Hadoop
JAR files. You are prompted to either restart the cluster or close
the window.
-
CAUTION: Do not restart the cluster. Do not click Restart; close the
window instead.
-
Use the Add Service Wizard page to add SASEP as a service on any node
where HDFS is a client:
-
Navigate to the Cloudera Manager Home.
-
Select Actions > Add a Service.
-
Select the SASEP service and click Continue.
-
Select the dependencies for the SAS Embedded Process service on the
Add Service Wizard > Select the set of dependencies for your new
service page. Click Continue.
-
Choose a location for the SAS Embedded Process ep-config.xml file on
the Add Service Wizard > Customize Role Assignments page. Click OK.
The ep-config.xml file is created and added to the HDFS /sas/ep/config
directory. This task is done on the host that you select.
Note: If your cluster is secured with Kerberos, the host that you
select must have a valid ticket for the HDFS user.
-
After the SAS Embedded Process ep-config.xml file is created, Cloudera
Manager starts the SAS Embedded Process service. Starting the service
is not required: MapReduce is the only service that is required for
the SAS Embedded Process. You must stop the SAS Embedded Process
service immediately when the task that adds the SAS Embedded Process
is finished. After that, the SAS Embedded Process service does not
need to be started or stopped again.
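The Kerberos note in the ep-config.xml sub-step can be checked before you run the wizard. The sketch below is one possible check, assuming that MIT Kerberos client tools are installed on the selected host and that the script runs as the HDFS user. It relies on klist -s, which prints nothing and exits with status 0 only when the credentials cache is readable and unexpired. The function name has_valid_ticket and the injectable run parameter are hypothetical conveniences for illustration.

```python
import subprocess


def has_valid_ticket(run=subprocess.run) -> bool:
    """Return True if `klist -s` reports a readable, unexpired credentials cache."""
    try:
        result = run(["klist", "-s"], check=False)
    except FileNotFoundError:
        # klist is not installed or not on PATH for this user.
        return False
    return result.returncode == 0


if __name__ == "__main__":
    if has_valid_ticket():
        print("Valid Kerberos ticket found for the current user.")
    else:
        print("No valid Kerberos ticket; obtain one before adding the service.")
```

The check applies only to the user who runs it, so run it under the same account that the note refers to.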
-
Verify that the ep-config.xml file exists in the /sas/ep/config
directory of the host that you selected in step 6e.
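This verification can be scripted. The following is a minimal sketch, assuming that the file is in the HDFS /sas/ep/config directory as described in step 6e and that the hdfs command-line client is on the PATH of the host where the script runs. It uses hdfs dfs -test -e, which exits with status 0 when the path exists. The function name ep_config_exists and the injectable run parameter are hypothetical.

```python
import subprocess

EP_CONFIG = "/sas/ep/config/ep-config.xml"


def ep_config_exists(path: str = EP_CONFIG, run=subprocess.run) -> bool:
    """Return True if `hdfs dfs -test -e` finds the file in HDFS."""
    try:
        result = run(["hdfs", "dfs", "-test", "-e", path], check=False)
    except FileNotFoundError:
        # The hdfs client is not installed or not on PATH.
        return False
    return result.returncode == 0


if __name__ == "__main__":
    state = "found" if ep_config_exists() else "missing"
    print(f"{EP_CONFIG}: {state}")
```

On a cluster that is secured with Kerberos, the check fails without a valid ticket even if the file exists.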
-
Review any additional configuration that might be needed depending on
your Hadoop distribution.
-
Validate the deployment of the SAS Embedded Process by running a
program that uses the SAS Embedded Process and the MapReduce service.
An example is a scoring program.
-
If you have licensed and downloaded any of the following SAS software,
additional configuration is required:
-
SAS Data Loader for Hadoop
For more information, see Part 3, “Administrator’s Guide for SAS Data
Loader for Hadoop”.
-
SAS High-Performance Analytics
For more information, see SAS High-Performance Analytics
Infrastructure: Installation and Configuration Guide.