Introduction to the In-Database Deployment Package for Hadoop
The in-database deployment
package for Hadoop must be installed and configured on your Hadoop
cluster before you can perform the following tasks:
-
Run a scoring model in Hadoop Distributed
File System (HDFS) using the SAS Scoring Accelerator for Hadoop.
-
Run DATA step scoring programs
in Hadoop.
-
Run DS2 threaded programs in Hadoop
using the SAS In-Database Code Accelerator for Hadoop.
-
Perform data quality operations
in Hadoop, transform data in Hadoop, and extract transformed data
out of Hadoop for analysis in SAS using the SAS Data Loader for Hadoop.
For more information,
see SAS Data Loader for Hadoop: User’s Guide.
-
Deploy and score text analytic
models in Hadoop using SAS Contextual Analysis In-Database Scoring
in Hadoop.
For more information,
see SAS Contextual Analysis In-Database Scoring in Hadoop:
User’s Guide
-
Read and write data to HDFS in
parallel for SAS High-Performance Analytics.
Note: For deployments that use
SAS High-Performance Deployment of Hadoop for the co-located data
provider, and access SASHDAT tables exclusively, SAS/ACCESS and SAS
Embedded Process are not needed.
Note: If you are installing the
SAS High-Performance Analytics environment, you must perform additional
steps after you install the SAS Embedded Process. For more information,
see SAS High-Performance Analytics Infrastructure: Installation
and Configuration Guide.
Copyright © SAS Institute Inc. All Rights Reserved.
Last updated: February 9, 2017