UNICODEC Function

converts characters in the current SAS session encoding to Unicode characters.

Category:

Character

Syntax

Details

Example

Syntax

STR=UNICODEC(<instr> (,<Unicode type> ))

Required Arguments

str: data string that has been converted to Unicode encoding.

instr: input data string.

Unicode type

Unicode character formats

ESC	Unicode Escape (for example, \u0042 ) ESC is the default format.
NCR	Numeric Character Representation (for example, &#22823 or &#177 ; )
PAREN	Unicode Parenthesis Escape (for example, <u0061>)
UCS2	UCS2 encoding with native endian.
UCS2B	UCS2 encoding with big endian.
UCS2L	UCS2 encoding with little endian.
UCS4	UCS4 encoding with native endian.
UCS4B	UCS4 encoding with big endian.
UCS4L	UCS4 encoding with little endian.
UTF16	UTF16 encoding with big endian.
UTF16B	UTF16 encoding with big endian.
UTF16L	UTF16 encoding with little endian.
UTF8	UTF8 encoding.

Details

This function reads characters that are in the current SAS session encoding and converts them to Unicode encoding.

Example

The following example demonstrates the functionality of the UNICODEC function:

length str4 $20;
dai=unicode('\u5927');
str1=unicodec("ABC");
str2=unicodec("ABC",'esc');
str3=unicodec(dai, 'ncr');
str4=unicodec("ab",'paren');
str5=unicodec(dai, 'ucs2');
str6=unicodec(dai, 'ucs2b');
str7=unicodec(dai, 'ucs2l');
str8=unicodec(dai, 'ucs4');
str9=unicodec(dai, 'ucs4b');
str10=unicodec(dai, 'ucs4l');
str11=unicodec(dai, 'utf8');
str12=unicodec(dai, 'utf16');
str13=unicodec(dai, 'utf16b');
str14=unicodec(dai, 'utf16l');
Results:
str1=414243
str2=414243
str3=
str4=str5=2759
str6=5927
str7=2759
str8=27590000
str9=00005927
str10=27590000
str11=E5A4A7
str12=2759
str13=5927
str14=2759