Problem Note 60331: Hive character data is truncated when read by SAS® and run in a multi-byte encoded SAS® session
When SAS is run in a multi-byte encoded session such as UTF-8 or various double-byte character sets (DBCSs) such as EUC_CN, the SAS/ACCESS® Interface to Hadoop engine sizes the character columns the same as declared in Hive. For Hive character types (STRING, VARCHAR, CHAR), this behavior might cause data truncation because encoded characters might require multiple bytes when read.
Click the Hot Fix tab in this note to access the hot fix for this issue.
After you apply the hot fix, column lengths for character data from Hive are inflated in SAS. The inflation is two times for SAS DBCS encodings such as EUC_CN and three times for UTF-8.
Operating System and Release Information
SAS System | SAS/ACCESS Interface to Hadoop | Microsoft® Windows® for x64 | 9.4 TS1M3 | |
Microsoft Windows 8 Enterprise 32-bit | 9.4 TS1M3 | |
Microsoft Windows 8 Enterprise x64 | 9.4 TS1M3 | |
Microsoft Windows 8 Pro 32-bit | 9.4 TS1M3 | |
Microsoft Windows 8 Pro x64 | 9.4 TS1M3 | |
Microsoft Windows 8.1 Enterprise 32-bit | 9.4 TS1M3 | |
Microsoft Windows 8.1 Enterprise x64 | 9.4 TS1M3 | |
Microsoft Windows 8.1 Pro 32-bit | 9.4 TS1M3 | |
Microsoft Windows 8.1 Pro x64 | 9.4 TS1M3 | |
Microsoft Windows 10 | 9.4 TS1M3 | |
Microsoft Windows Server 2008 | 9.4 TS1M3 | |
Microsoft Windows Server 2008 R2 | 9.4 TS1M3 | |
Microsoft Windows Server 2008 for x64 | 9.4 TS1M3 | |
Microsoft Windows Server 2012 Datacenter | 9.4 TS1M3 | |
Microsoft Windows Server 2012 R2 Datacenter | 9.4 TS1M3 | |
Microsoft Windows Server 2012 R2 Std | 9.4 TS1M3 | |
Microsoft Windows Server 2012 Std | 9.4 TS1M3 | |
Windows 7 Enterprise 32 bit | 9.4 TS1M3 | |
Windows 7 Enterprise x64 | 9.4 TS1M3 | |
Windows 7 Home Premium 32 bit | 9.4 TS1M3 | |
Windows 7 Home Premium x64 | 9.4 TS1M3 | |
Windows 7 Professional 32 bit | 9.4 TS1M3 | |
Windows 7 Professional x64 | 9.4 TS1M3 | |
Windows 7 Ultimate 32 bit | 9.4 TS1M3 | |
Windows 7 Ultimate x64 | 9.4 TS1M3 | |
64-bit Enabled AIX | 9.4 TS1M3 | |
64-bit Enabled Solaris | 9.4 TS1M3 | |
HP-UX IPF | 9.4 TS1M3 | |
Linux for x64 | 9.4 TS1M3 | |
Solaris for x64 | 9.4 TS1M3 | |
*
For software releases that are not yet generally available, the Fixed
Release is the software release in which the problem is planned to be
fixed.
Type: | Problem Note |
Priority: | medium |
Date Modified: | 2017-12-06 15:31:43 |
Date Created: | 2017-04-20 13:11:56 |