SAS LASR Analytic Server
supports compression for in-memory tables. All the analytic statements,
such as PERCENTILES, LOGISTIC, and so on, in the IMSTAT procedure
are supported for compressed tables as well as regular, uncompressed,
tables. Clients like SAS Visual Analytics can also operate on compressed
tables as well.
All compression is performed
by the server. In other words, when you transfer a table to the server
in a DATA step and specify the SQUEEZE= data set option, the rows
are sent to the server as is, and the server compresses the rows.
The server uses the zlib compression algorithm that is described in
RFC 1950, "ZLIB Compressed Data Format Specification."
All data in a row, both
character and number variables, are compressed. Every row in a table
is compressed, the server does not support some rows in compressed
form and others as uncompressed. The server can report the uncompressed
size of the table, the compressed size, and the compression ratio.
For matrices of computed
doubles (with lots of decimal places), compression might not reduce
the storage requirements at all. For rows with many long character
variables that consist mostly of blanks, the compression ratio can
be very high. For rows with mixed variables where most doubles do
not have fractional parts and most character variables have a small
amount of blank padding, the compression ratio is typically moderate.
As with most cases of using compression, character variables tend
to compress the most and the ratio depends on your data.