$UCS4Xw. Informat

Reads a character string that is encoded in 32-bit, UCS4, Unicode encoding, and then converts the character string to the encoding of the current SAS session.
Category: Character

Syntax

$UCS4Xw.

Syntax Description

w
specifies the width of the input field. Specify enough width to accommodate the 32-bit size of the Unicode characters.
Default:4
Range:4–32000

Comparisons

The $UCS4Xw. informat performs processing that is the opposite of the $UCS4XEw. informat. Use the $UCS4Xw. informat when you are processing data within the same operating environment. Use the $UCS4Bw. and $UCS4Lw. informats when you are processing data from different operating environments.

Example

These examples use the Japanese Shift_JIS encoding, which is supported under the UNIX operating environment. This example uses little-endian formatting.
Statements
Results
----+----1----+
ucs4=put('91e5'x,$ucs4x.);
sjis=input(ucs4,$ucs4x.);
put ucs4=$hex8. sjis=$hex8.;
run;
ucs4=27590000
sjis=91E52020