What’s New in SAS 9.4 In-Database Products

Overview

In SAS 9.4, the following new features and enhancements were added to expand the capabilities of the SAS In-Database products:
  • In the February 2015 release of SAS 9.4, the following changes and additions were made:
    • The SAS In-Database Code Accelerator for Hadoop uses HCatalog to process complex, non-delimited files. This enables the SAS In-Database Code Accelerator for Hadoop to support Avro, ORC, RCFile, and Parquet file types.
    • You can now use the DBCREATE_TABLE_OPTS table option to specify the output SerDe, the output delimiter of the Hive table, the output ESCAPED BY character, and any other CREATE TABLE syntax allowed by Hive.
  • In the August 2014 release of SAS 9.4, the following changes and additions were made:
    • The SAS Scoring Accelerator and in-database processing of Base SAS procedures is available for SAP HANA.
    • Running limited DATA step scoring programs in Hadoop is now production.
    • Numerous changes were made to the installation and configuration script for the SAS Embedded Process for Hadoop.
  • In the April 2014 release of SAS 9.4, documentation enhancements were made in the following areas:
    • installation and configuration of the SAS Embedded Process for Hadoop
    • considerations when creating or modifying DATA step score code
    • semaphore requirements when using the SAS Embedded Process for Greenplum
  • In the December 2013 release of SAS 9.4, the following changes and additions were made:
    • Limited DATA step programs can be run inside Hadoop for scoring.
    • New parameters have been added for the Hadoop INDCONN macro variable.
    • New Hadoop JAR files are now tied to the version of Apache Hadoop that you are using.
    • The SAS In-Database Code Accelerator for Teradata now can run the DS2 data program as well as the thread program inside the database.
    • DS2ACCEL, a new system option, controls whether the DS2 code is executed inside the database. The default value is NONE, which prevents DS2 code from executing inside the database.
    • The PROC DS2 INDB option has changed its name to DS2ACCEL. INDB is still supported. However, the default value for this option has changed from YES to NO, which prevents DS2 code from executing in the database. This is a change in behavior from the initial 9.4 release.
  • In the September 2013 release of SAS 9.4, the following changes and additions were made:
    • In-database processing for Hadoop has been enhanced by the addition of the SAS Scoring Accelerator for Hadoop.
    • The autocall macros that initialized the publishing macros are no longer needed for any DBMS. However, they are still supported.
  • In the July 2013 release of SAS 9.4, the following changes and additions were made:
    • The SAS Scoring Accelerator for SPD Server is available.
  • In the June 2013 release of SAS 9.4, the following changes and additions were made:
    • Greenplum and Teradata in-database processing has been enhanced by the addition of the SAS In-Database Code Accelerators.
    • The SAS In-Database Code Accelerator enables you to publish a DS2 thread program to the database and execute that thread program in parallel inside the database.
    • In-database scoring for Netezza has been enhanced by the addition of the SAS Embedded Process. The SAS Embedded Process is a SAS server process that runs within Netezza to read and write data.
    • Two columns have been added to the model table. The ModelUUID and Notes columns assist in processing scoring models when using the SAS Embedded Process.
    • The Hadoop scripts that install, control, and provide status of the SAS Embedded Process have changed. There is now just one script, sasep-server.sh, that installs both the SAS Embedded Process and the Hadoop JAR files.

SAS In-Database Code Accelerator

February 2015 Release of SAS 9.4: Changes and Enhancements

In the February 2015 release of SAS 9.4, the following changes and additions were made:
  • The SAS In-Database Code Accelerator for Hadoop supports only Cloudera 5.2 and Hortonworks 2.1 or later.
  • The SAS In-Database Code Accelerator for Hadoop uses HCatalog to process complex, non-delimited files.
  • The SAS In-Database Code Accelerator for Hadoop now supports Avro, ORC, RCFile, and Parquet file types.
  • For the SAS In-Database Code Accelerator for Hadoop, you can use the DBCREATE_TABLE_OPTS table option to specify the output SerDe, the output delimiter of the Hive table, the output escaped by, and any other CREATE TABLE syntax allowed by Hive.

August 2014 Release of SAS 9.4: Changes and Enhancements

Hadoop in-database processing has been enhanced by the addition of the SAS In-Database Code Accelerator. The SAS In-Database Code Accelerator enables you to publish a DS2 thread and data program to the database and execute those programs in parallel inside the database.

December 2013 Release of SAS 9.4: Changes and Enhancements

In the December 2013 release of SAS 9.4, the following changes and additions were made:
  • The SAS In-Database Code Accelerator for Teradata now runs the DS2 data program as well as the thread program inside the database.
  • DS2ACCEL, a new system option, controls whether the DS2 code is executed inside the database. The default value is NONE, which prevents DS2 code from executing inside the database.
  • The PROC DS2 INDB option has changed its name to DS2ACCEL. INDB is still supported. However, the default value for this option has changed from YES to NO. This change prevents DS2 code from executing in the database. This is a change in behavior from the initial SAS 9.4 release.

SAS 9.4: Changes and Enhancements

The SAS In-Database Code Accelerator enables you to publish a DS2 thread program to the database and execute that thread program in parallel inside the database. Examples of thread programs include large transpositions, computationally complex programs, scoring models, and BY-group processing. The SAS In-Database Code Accelerator is available for Greenplum and Teradata.
The SAS In-Database Code Accelerator must be licensed at your site.

Greenplum Changes

April 2014 Release of SAS 9.4: Changes and Enhancements

Information about semaphore requirements when using the SAS Embedded Process was added to SAS In-Database Products: Administrator's Guide.

SAS 9.4: Changes and Enhancements

There are several changes for Greenplum:
  • Version 1.2 of the Greenplum Partner Connector (GPPC) is now available and should be installed if you use SAS Embedded Process 9.4.
  • A new script, UninstallSASEPFiles.sh, is available. This script stops and uninstalls the SAS Embedded Process on each database host node.
  • You can now specify a non-default port when you create the connection string to publish formats and models.

Hadoop Changes

August 2014 Release of SAS 9.4: Changes and Enhancements

In the August 2014 release of SAS 9.4, there are several Hadoop changes:
  • You can now specify a fixed record format for the output file of the SAS Scoring Accelerator for Hadoop. Previously, all output was delimited.
  • SPD file formats are supported by the SAS Embedded Process for Hadoop.
  • Instead of manually selecting the Hadoop JAR files to the client machine, the SAS Embedded Process determines which version of the JAR files are required and gathers them into a ZIP file for you to copy to the client machine.
  • You now have the option whether to automatically start the SAS Embedded Process when the installation is complete.

April 2014 Release of SAS 9.4: Changes and Enhancements

The documentation about the installation and configuration of the SAS Embedded Process was enhanced.

December 2013 Release of SAS 9.4: Changes and Enhancements

In the December 2013 release of SAS 9.4, there are several Hadoop changes:
  • The SAS Embedded Process and the SAS Scoring Accelerator for Hadoop support Kerberos and Hive2 for both Cloudera and Hortonworks.
  • The trace log messages for the SAS Embedded Process are now stored in the MapReduce job log.
  • A new option, hdfsuser, is available in the sasep-servers.sh script. hdfsuser specifies the user ID that has Write access to HDFS root directory.
  • A new parameter, HADOOP_CFG=, is available for the INDCONN macro variable. The HADOOP_CFG= parameter specifies the location of the Hadoop configuration file that is used with the %INDHD_PUBLISH_MODEL and the %INDHD_RUN_MODEL macros.
  • The Cloudera JAR files for the SAS Embedded Process have been replaced by a set of Apache JAR files. The new JAR files are based on a release of the Apache Hadoop instead of a particular Hadoop distributor.

September 2013 Release of SAS 9.4: Changes and Enhancements

In-database scoring for Hadoop is available.

SAS 9.4: Changes and Enhancements

The Hadoop scripts that install, control, and provide status of the SAS Embedded Process have changed. There is now just one script, sasep-servers.sh, that installs both the SAS Embedded Process and the Hadoop JAR files. Running this script also enables you to start, stop, and provide status of the SAS Embedded Process.

DATA Step Processing in Hadoop

August 2014 Release of SAS 9.4: Changes and Enhancements

Running limited DATA step scoring programs in Hadoop is now production.

December 2013 Release of SAS 9.4: Changes and Enhancements

Limited DATA step scoring programs can be run inside Hadoop. This feature is pre-production.

Netezza Changes

SAS 9.4: Changes and Enhancements

In-database scoring for Netezza has been enhanced by the addition of the SAS Embedded Process. The SAS Embedded Process is a SAS server process that runs within Netezza to read and write data. The SAS Embedded Process can be used with the SAS Scoring Accelerator for Netezza to run scoring models.

In-Database Processing for SAP HANA

August 2014 Release of SAS 9.4: Changes and Enhancements

In-database scoring for SAP HANA is available. You can also run Base SAS procedures inside SAP HANA.

SAS Scoring Accelerator for SPD Server

July 2013 Release of SAS 9.4: Changes and Enhancements

In-database scoring for the SAS Scalable Performance Data Server is available.

Changes for Running In-Database Procedures

SAS 9.4: Changes and Enhancements

The PRESERVE_NAMES LIBNAME option no longer prevents in-database processing.

SAS Model Manager Changes

April 2014 Release of SAS 9.4: Changes and Enhancements

A new section was added about considerations when creating or modifying DATA step score code.

SAS 9.4: Changes and Enhancements

Two columns have been added to the model table. The ModelUUID and Notes columns assist in processing scoring models when using the SAS Embedded Process.

Autocall Macros

September 2013 Release of SAS 9.4: Changes and Enhancements

The following autocall macros are no longer needed for any DBMS. However, they are still supported. These macros initialized the publishing macros.
%INDACPF %INDB2PF %INDGPPM %INDNZPM
%INDACPM %INDB2PM %INDNZPC %INDORPM
%INDB2PC %INDGPPC %INDNZPF %INDTDPF
%INDB2PD %INDGPPF %INDNZPJ %INDTDPM