Problem Note 57998: The Serializer/Deserializer (SerDe) for the SASĀ® Scalable Performance Data (SPD) Engine on HDFS treats VarChar data types as String data types
The SAS Scalable Performance Data (SPD) Engine creates SAS® data sets on a Hadoop Distributed File System (HDFS). The Serializer/Deserializer (SerDe) provided by SAS enables read access to these tables directly from Hive.
SAS character variables are stored in Hive as a VarChar data type with the length of the original variable definition. However, when executing a query from Hive, the variable values are presented as a String data type padded with spaces to the full length of the variable.
This means that comparison strings used in the query must also be padded with spaces in order to match the data returned from the SerDe.
Click the Hot Fix tab in this note to access the hot fix for this issue.
Operating System and Release Information
SAS System | Base SAS | Solaris for x64 | 9.4 TS1M3 | |
Linux for x64 | 9.4 TS1M3 | |
HP-UX IPF | 9.4 TS1M3 | |
64-bit Enabled Solaris | 9.4 TS1M3 | |
64-bit Enabled AIX | 9.4 TS1M3 | |
*
For software releases that are not yet generally available, the Fixed
Release is the software release in which the problem is planned to be
fixed.
When querying SAS SPD Engine tables from Hive using the Serializer/Deserializer (SerDe), Varchar data types must be treated as String data types with their values padded with spaces.
Type: | Problem Note |
Priority: | high |
Date Modified: | 2017-01-04 14:38:28 |
Date Created: | 2016-04-04 19:43:48 |