Problem Note 65935: The UNICODE function does not support Numeric Character Representation (NCR) for a surrogate pair
The UNICODE function does not convert an NCR surrogate pair into the corresponding supplementary-plane character (a character outside the Basic Multilingual Plane).
For example, suppose that you run the following code in a SAS® session that uses UTF-8 encoding:
data _null_;
str1='��';
char=unicode(str1, "ncr");
put char=;
run;
The UNICODE function should return the display character that the surrogate pair represents. Instead, the value is written to the SAS log in NCR format:
char=��
Click the Hot Fix tab in this note to access the hot fix for this issue.
After you apply the hot fix, the example code above writes the correct Vietnamese character to the log.
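For background, a surrogate pair is two 16-bit values that together encode one code point above U+FFFF. The following DATA step is a minimal sketch of that arithmetic only. The variable names and the surrogate values are hypothetical, chosen for illustration, and are not the values used in the example above. The step does not call the UNICODE function and is not a workaround for this issue.

```sas
data _null_;
   /* Hypothetical surrogate values, chosen only for illustration */
   high = 55362;   /* 0xD842: high (lead) surrogate, range 0xD800-0xDBFF  */
   low  = 57271;   /* 0xDFB7: low (trail) surrogate, range 0xDC00-0xDFFF  */

   /* Standard UTF-16 decoding:                                           */
   /*   code point = (high - 0xD800) * 0x400 + (low - 0xDC00) + 0x10000   */
   codepoint = (high - 55296) * 1024 + (low - 56320) + 65536;

   /* codepoint is 134071 in decimal, which is U+20BB7 */
   put codepoint= hex6.;
run;
```

Each surrogate contributes 10 bits of the result, which is why the high surrogate's offset is multiplied by 1024 before the low surrogate's offset is added.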
Operating System and Release Information
Product Family | Product | System | Product Release Reported | Product Release Fixed* | SAS Release Reported | SAS Release Fixed* |
SAS System | Base SAS | z/OS | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | z/OS 64-bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft® Windows® for x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8 Enterprise 32-bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8 Enterprise x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8 Pro 32-bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8 Pro x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8.1 Enterprise 32-bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8.1 Enterprise x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8.1 Pro 32-bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 8.1 Pro x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows 10 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2008 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2008 R2 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2008 for x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2012 Datacenter | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2012 R2 Datacenter | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2012 R2 Std | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2012 Std | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Microsoft Windows Server 2016 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Enterprise 32 bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Enterprise x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Home Premium 32 bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Home Premium x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Professional 32 bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Professional x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Ultimate 32 bit | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Windows 7 Ultimate x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | 64-bit Enabled AIX | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | 64-bit Enabled Solaris | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | HP-UX IPF | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Linux for x64 | 9.4_M5 | | 9.4 TS1M5 | |
SAS System | Base SAS | Solaris for x64 | 9.4_M5 | | 9.4 TS1M5 | |
* For software releases that are not yet generally available, the Fixed Release is the software release in which the problem is planned to be fixed.
Using the NCR form of a surrogate pair as an input string to the UNICODE function does not convert the string to the appropriate display character.
Type: | Problem Note |
Priority: | medium |
Date Modified: | 2020-05-06 08:33:48 |
Date Created: | 2020-05-05 15:11:26 |