This section describes how to install and configure the in-database deployment
package for Greenplum (the SAS Formats Library for Greenplum and the SAS
Embedded Process).

The in-database deployment package for Greenplum must be installed and
configured before you can perform the following tasks:
- Use the %INDGP_PUBLISH_FORMATS format publishing macro to create or publish
  the SAS_PUT( ) function and to create or publish user-defined formats as
  format functions inside the database.
- Use the %INDGP_PUBLISH_MODEL scoring publishing macro to create scoring files
  and functions inside the database.
- Use the SAS In-Database Code Accelerator for Greenplum to execute DS2 thread
  programs in parallel inside the database. For more information, see the
  SAS DS2 Language Reference.
- Run SAS High-Performance Analytics when the analytics cluster is co-located
  with the Greenplum data appliance or when the analytics cluster uses a
  parallel connection with a remote Greenplum data appliance. The SAS Embedded
  Process, which resides on the data appliance, provides high-speed parallel
  data transfer between the data appliance and the analytics environment
  where the data is processed. For more information, see the SAS
  High-Performance Analytics Infrastructure: Installation and Configuration
  Guide.

The in-database deployment package for Greenplum contains the SAS formats
library, precompiled binary files for the utility functions, and the SAS
Embedded Process.

The SAS formats library is a run-time library that is installed on your
Greenplum system so that the SAS scoring model functions and the SAS_PUT( )
function created in Greenplum can access the routines within it. The SAS
formats library contains the formats that are supplied by SAS.

The %INDGP_PUBLISH_COMPILEUDF macro registers utility functions in the
database. The utility functions are called by the format and scoring
publishing macros, %INDGP_PUBLISH_FORMATS and %INDGP_PUBLISH_MODEL. You must
run the %INDGP_PUBLISH_COMPILEUDF macro before you run the format and scoring
publishing macros.
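
The required ordering can be sketched as follows. This is a minimal sketch
under stated assumptions, not a definitive procedure: the %INDGPPC statement,
the INDCONN macro variable, and the OBJPATH=, DATABASE=, ACTION=, DIR=, and
MODELNAME= arguments are recalled from SAS in-database documentation and are
not stated in this section, so verify them against the macro syntax for your
release. Every value shown (user ID, password, DSN, paths, database name, and
model name) is a hypothetical placeholder.

```sas
/* Assumption: %INDGPPC makes the Greenplum publishing macros available. */
%indgppc;

/* Assumption: INDCONN holds the connection credentials.                 */
/* All values here are placeholders.                                     */
%let indconn = user=youruserid password=yourpwd dsn=yourdsn;

/* Step 1: register the utility functions in the database first.         */
/* OBJPATH is a hypothetical path to the installed SAS binaries.         */
%INDGP_PUBLISH_COMPILEUDF(
   OBJPATH=/path/to/pkglibdir/SAS,
   DATABASE=yourdb,
   ACTION=CREATE
);

/* Step 2: only after step 1 has completed, run the publishing macros,   */
/* which call the utility functions registered above.                    */
%INDGP_PUBLISH_FORMATS(
   DATABASE=yourdb
);

/* DIR and MODELNAME are hypothetical; they point to the scoring files.  */
%INDGP_PUBLISH_MODEL(
   DIR=/path/to/model/files,
   MODELNAME=yourmodel,
   DATABASE=yourdb
);
```

Running the publishing macros before step 1 would leave them without the
utility functions that they call.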

The SAS Embedded Process is a SAS server process that runs within Greenplum
to read and write data. The SAS Embedded Process contains the
%INDGP_PUBLISH_COMPILEUDF_EP macro, run-time libraries, and other software
that is installed on your Greenplum system. The %INDGP_PUBLISH_COMPILEUDF_EP
macro defines the SAS_EP table function in the Greenplum database. You use
the SAS_EP table function to produce scoring models after you run the
%INDGP_PUBLISH_MODEL macro to create the SAS scoring files and publish them
to the scoring model table. The SAS Embedded Process accesses the SAS scoring
files when a scoring operation is performed. You also use the SAS_EP table
function for other SAS software that requires it, such as SAS
High-Performance Analytics.