Deploying the SAS Embedded Process Parcel on Cloudera

After you run the SAS Deployment Manager to create the SAS Embedded Process parcel, you must distribute and activate the parcel on the cluster. Follow these steps:
Note: More than one SAS Embedded Process parcel can be deployed on your cluster, but only one parcel can be activated at one time. Before activating a new parcel, deactivate the old one.
  1. Log on to Cloudera Manager.
  2. In Cloudera Manager, choose Hoststhen selectParcels.
    The SASEP parcel is located under your cluster. The parcel name is the one from Step 6 in Using the SAS Deployment Manager to Create the SAS Embedded Process Parcel or Stack. An example name is 9.43.p0.1.
  3. Click Distribute to copy the parcel to all nodes and the SASEPHome directory is created.
    Note: If you have licensed and downloaded SAS Data Loader for Hadoop, some SAS Data Loader for Hadoop data quality components are silently deployed at the same time as the SAS Embedded Process for Hadoop. Other configuration is required as noted in step 9.
    You can log on to the node and show the contents in the /opt/cloudera/parcel directory.
  4. Click Activate.
    This step creates a symbolic link to the SAS Hadoop JAR files.
    You are prompted to either restart the cluster or close the window.
  5. Click Close the Window.
    CAUTION:
    Do not restart the cluster.
    Do not click Restart.
  6. Use the Add Service Wizard page to add SASEP as a service on any node where HDFS is a client:
    1. Navigate to the Cloudera Manager Home.
    2. Select Actionsthen selectAdd a Service.
    3. Select the SASEP service and click Continue.
    4. Select the dependencies for the SAS Embedded Process service in the Add Service Wizardthen selectSelect the set of dependencies for your new service page. Click Continue.
    5. Choose a location for the SAS Embedded Process ep-config.xml file in the Add Service Wizard then selectCustomize Role Assignments page. Click Select the set of dependencies for your new service page. Click OK.
      The ep-config.xml file is created and added to the HDFS /sas/ep/config directory. This task is done in the host that you select.
      Note: If your cluster is secured with Kerberos, the host that you select must have a valid ticket for the HDFS user.
    6. After the SAS Embedded Process ep-config.xml file is created, Cloudera Manager starts the SAS Embedded Process service. This step is not required. MapReduce is the only service that is required for the SAS Embedded Process. You must stop the SAS Embedded Process service immediately when the task that adds the SAS Embedded Process is finished. The SAS Embedded Process service no longer needs to be stopped or started.
  7. Verify that the ep-config.xml file exists in the /sas/ep/config directory of the host that you selected in step 6e.
  8. Review any additional configuration that might be needed depending on your Hadoop distribution.
  9. Validate the deployment of the SAS Embedded Process by running a program that uses the SAS Embedded Process and the MapReduce service. An example is a scoring program.
  10. If you have licensed and downloaded any of the following SAS software, additional configuration is required:
    • SAS Data Loader for Hadoop
      For more information, see Part 3, “Administrator’s Guide for SAS Data Loader for Hadoop”.
    • SAS High-Performance Analytics
      For more information, see SAS High-Performance Analytics Infrastructure: Installation and Configuration Guide.