What’s New in SAS 9.4 In-Database Products
Overview
In SAS 9.4, the following
new features and enhancements were added to expand the capabilities
of the SAS In-Database products:
-
In the February 2015 release of
SAS 9.4, the following changes and additions were made:
-
The SAS In-Database Code Accelerator
for Hadoop uses HCatalog to process complex, non-delimited files.
This enables the SAS In-Database Code Accelerator for Hadoop to support
Avro, ORC, RCFile, and Parquet file types.
-
You can now use the DBCREATE_TABLE_OPTS
table option to specify the output SerDe, the output delimiter of
the Hive table, the output ESCAPED BY character, and any other CREATE
TABLE syntax allowed by Hive.
-
In the August 2014 release of SAS
9.4, the following changes and additions were made:
-
The SAS Scoring Accelerator and
in-database processing of Base SAS procedures is available for SAP
HANA.
-
Running limited DATA step scoring
programs in Hadoop is now production.
-
Numerous changes were made to the
installation and configuration script for the SAS Embedded Process
for Hadoop.
-
In the April 2014 release of SAS
9.4, documentation enhancements were made in the following areas:
-
installation and configuration
of the SAS Embedded Process for Hadoop
-
considerations when creating or
modifying DATA step score code
-
semaphore requirements when using
the SAS Embedded Process for Greenplum
-
In the December 2013 release of
SAS 9.4, the following changes and additions were made:
-
Limited DATA step programs can
be run inside Hadoop for scoring.
-
New parameters have been added
for the Hadoop INDCONN macro variable.
-
New Hadoop JAR files are now tied
to the version of Apache Hadoop that you are using.
-
The SAS In-Database Code Accelerator
for Teradata now can run the DS2 data program as well as the thread
program inside the database.
-
DS2ACCEL, a new system option,
controls whether the DS2 code is executed inside the database. The
default value is NONE, which prevents DS2 code from executing inside
the database.
-
The PROC DS2 INDB option has changed
its name to DS2ACCEL. INDB is still supported. However, the default
value for this option has changed from YES to NO, which prevents DS2
code from executing in the database. This is a change in behavior
from the initial 9.4 release.
-
In the September 2013 release of
SAS 9.4, the following changes and additions were made:
-
In-database processing for Hadoop
has been enhanced by the addition of the SAS Scoring Accelerator for
Hadoop.
-
The autocall macros that initialized
the publishing macros are no longer needed for any DBMS. However,
they are still supported.
-
In the July 2013 release of SAS
9.4, the following changes and additions were made:
-
The SAS Scoring Accelerator for
SPD Server is available.
-
In the June 2013 release of SAS
9.4, the following changes and additions were made:
-
Greenplum and Teradata in-database
processing has been enhanced by the addition of the SAS In-Database
Code Accelerators.
-
The SAS In-Database Code Accelerator
enables you to publish a DS2 thread program to the database and execute
that thread program in parallel inside the database.
-
In-database scoring for Netezza
has been enhanced by the addition of the SAS Embedded Process. The
SAS Embedded Process is a SAS server process that runs within Netezza
to read and write data.
-
Two columns have been added to
the model table. The ModelUUID and Notes columns assist in processing
scoring models when using the SAS Embedded Process.
-
The Hadoop scripts that install,
control, and provide status of the SAS Embedded Process have changed.
There is now just one script, sasep-server.sh, that installs both
the SAS Embedded Process and the Hadoop JAR files.
SAS In-Database Code Accelerator
February 2015 Release of SAS 9.4: Changes and Enhancements
In the February 2015
release of SAS 9.4, the following changes and additions were made:
-
The SAS In-Database Code Accelerator
for Hadoop supports only Cloudera 5.2 and Hortonworks 2.1 or later.
-
The SAS In-Database Code Accelerator
for Hadoop uses HCatalog to process complex, non-delimited files.
-
The SAS In-Database Code Accelerator
for Hadoop now supports Avro, ORC, RCFile, and Parquet file types.
-
For the SAS In-Database Code Accelerator
for Hadoop, you can use the DBCREATE_TABLE_OPTS table option to specify
the output SerDe, the output delimiter of the Hive table, the output
escaped by, and any other CREATE TABLE syntax allowed by Hive.
August 2014 Release of SAS 9.4: Changes and Enhancements
Hadoop in-database processing
has been enhanced by the addition of the SAS In-Database Code Accelerator.
The SAS In-Database Code Accelerator enables you to publish a DS2
thread and data program to the database and execute those programs
in parallel inside the database.
December 2013 Release of SAS 9.4: Changes and Enhancements
In the December 2013
release of SAS 9.4, the following changes and additions were made:
-
The SAS In-Database Code Accelerator
for Teradata now runs the DS2 data program as well as the thread program
inside the database.
-
DS2ACCEL, a new system option,
controls whether the DS2 code is executed inside the database. The
default value is NONE, which prevents DS2 code from executing inside
the database.
-
The PROC DS2 INDB option has changed
its name to DS2ACCEL. INDB is still supported. However, the default
value for this option has changed from YES to NO. This change prevents
DS2 code from executing in the database. This is a change in behavior
from the initial SAS 9.4 release.
SAS 9.4: Changes and Enhancements
The SAS In-Database
Code Accelerator enables you to publish a DS2 thread program to the
database and execute that thread program in parallel inside the database.
Examples of thread programs include large transpositions, computationally
complex programs, scoring models, and BY-group processing. The SAS
In-Database Code Accelerator is available for Greenplum and Teradata.
The SAS In-Database
Code Accelerator must be licensed at your site.
Greenplum Changes
April 2014 Release of SAS 9.4: Changes and Enhancements
SAS 9.4: Changes and Enhancements
There are several changes
for Greenplum:
-
Version 1.2 of the Greenplum Partner
Connector (GPPC) is now available and should be installed if you use
SAS Embedded Process 9.4.
-
A new script, UninstallSASEPFiles.sh,
is available. This script stops and uninstalls the SAS Embedded Process
on each database host node.
-
You can now specify a non-default
port when you create the connection string to publish formats and
models.
Hadoop Changes
August 2014 Release of SAS 9.4: Changes and Enhancements
In the August 2014 release
of SAS 9.4, there are several Hadoop changes:
-
You can now specify a fixed record
format for the output file of the SAS Scoring Accelerator for Hadoop.
Previously, all output was delimited.
-
SPD file formats are supported
by the SAS Embedded Process for Hadoop.
-
Instead of manually selecting the
Hadoop JAR files to the client machine, the SAS Embedded Process determines
which version of the JAR files are required and gathers them into
a ZIP file for you to copy to the client machine.
-
You now have the option whether
to automatically start the SAS Embedded Process when the installation
is complete.
April 2014 Release of SAS 9.4: Changes and Enhancements
The documentation about
the installation and configuration of the SAS Embedded Process was
enhanced.
December 2013 Release of SAS 9.4: Changes and Enhancements
In the December 2013
release of SAS 9.4, there are several Hadoop changes:
-
The SAS Embedded Process and the
SAS Scoring Accelerator for Hadoop support Kerberos and Hive2 for
both Cloudera and Hortonworks.
-
The trace log messages for the
SAS Embedded Process are now stored in the MapReduce job log.
-
A new option, hdfsuser,
is available in the sasep-servers.sh script. hdfsuser specifies
the user ID that has Write access to HDFS root directory.
-
A new parameter, HADOOP_CFG=, is
available for the INDCONN macro variable. The HADOOP_CFG= parameter
specifies the location of the Hadoop configuration file that is used
with the %INDHD_PUBLISH_MODEL and the %INDHD_RUN_MODEL macros.
-
The Cloudera JAR files for the
SAS Embedded Process have been replaced by a set of Apache JAR files.
The new JAR files are based on a release of the Apache Hadoop instead
of a particular Hadoop distributor.
September 2013 Release of SAS 9.4: Changes and Enhancements
In-database scoring
for Hadoop is available.
SAS 9.4: Changes and Enhancements
The Hadoop scripts that
install, control, and provide status of the SAS Embedded Process have
changed. There is now just one script, sasep-servers.sh, that installs
both the SAS Embedded Process and the Hadoop JAR files. Running this
script also enables you to start, stop, and provide status of the
SAS Embedded Process.
DATA Step Processing in Hadoop
August 2014 Release of SAS 9.4: Changes and Enhancements
Running limited DATA
step scoring programs in Hadoop is now production.
December 2013 Release of SAS 9.4: Changes and Enhancements
Limited DATA step scoring
programs can be run inside Hadoop. This feature is pre-production.
Netezza Changes
SAS 9.4: Changes and Enhancements
In-database scoring
for Netezza has been enhanced by the addition of the SAS Embedded
Process. The SAS Embedded Process is a SAS server process that runs
within Netezza to read and write data. The SAS Embedded Process can
be used with the SAS Scoring Accelerator for Netezza to run scoring
models.
In-Database Processing for SAP HANA
August 2014 Release of SAS 9.4: Changes and Enhancements
In-database scoring
for SAP HANA is available. You can also run Base SAS procedures inside
SAP HANA.
SAS Scoring Accelerator for SPD Server
July 2013 Release of SAS 9.4: Changes and Enhancements
In-database scoring
for the SAS Scalable Performance Data Server is available.
Changes for Running In-Database Procedures
SAS 9.4: Changes and Enhancements
The PRESERVE_NAMES LIBNAME
option no longer prevents in-database processing.
SAS Model Manager Changes
April 2014 Release of SAS 9.4: Changes and Enhancements
A new section was added
about considerations when creating or modifying DATA step score code.
SAS 9.4: Changes and Enhancements
Two columns have been
added to the model table. The ModelUUID and Notes columns assist in
processing scoring models when using the SAS Embedded Process.
Autocall Macros
September 2013 Release of SAS 9.4: Changes and Enhancements
The following autocall
macros are no longer needed for any DBMS. However, they are still
supported. These macros initialized the publishing macros.
%INDACPF |
%INDB2PF |
%INDGPPM |
%INDNZPM |
%INDACPM |
%INDB2PM |
%INDNZPC |
%INDORPM |
%INDB2PC |
%INDGPPC |
%INDNZPF |
%INDTDPF |
%INDB2PD |
%INDGPPF |
%INDNZPJ |
%INDTDPM |
Copyright © SAS Institute Inc. All rights reserved.