Usage Note 5687: "What's New in 8.2" document misleading regarding COMPRESS= system
and data set OPTION
In the document "What's New in SAS Software for Release 8.2" under the
category of System Options, it states the following:
When a request is made to compress a SAS data set with the COMPRESS=
system OPTION, SAS now determines whether the resulting compressed
file will be larger than an uncompressed file. If so, SAS creates an
uncompressed data set and issues a warning message stating that the
file has not been compressed.
This statement is misleading. The overhead associated with a compressed
data set is 12 bytes per observation. If, due to the number and type of
the variables defined in the data set, it is not possible for the
compressed data set to be at least 12 bytes per observation smaller than
an uncompressed version, compression is disabled and a message is
written to the SAS Log.
Here is an example where SAS determines that it is not possible for
the compressed data set to be smaller. Compression is disabled:
data one(compress=yes);
length x y $2;
input x y;
cards;
ab cd
;
run;
The following message is written to the SAS Log:
WARNING: Compression was disabled for data set WORK.ONE because
compression overhead would increase the size of the data
set.
Because both of the SAS-supplied compression algorithms are highly
data-dependant, it is possible for a compressed data set to be larger
than an uncompressed version. The notes written to the SAS Log
indicating the effectiveness of the compression should be used during
application development to determine if data set compression offers
any benefits, given the specific structure and content of the data
set(s) involved.
Here is an example of the message written to the Log if compression is
enabled and the data set increases in size:
NOTE: Compressing data set libref.member increased size by nn.nn
percent. Compressed is n pages; un-compressed would require
n pages.
Operating System and Release Information
| Product Family | Product | System | Reported Release | Fixed Release* |
| SAS System | Base SAS | Microsoft Windows NT Workstation | 8.2 TS2M0 | |
| Windows Millennium Edition (Me) | 8.2 TS2M0 | |
| Microsoft Windows 95/98 | 8.2 TS2M0 | |
| Microsoft Windows 2000 Professional | 8.2 TS2M0 | |
| Microsoft Windows 2000 Server | 8.2 TS2M0 | |
| OpenVMS VAX | 8.2 TS2M0 | |
| Microsoft Windows 2000 Advanced Server | 8.2 TS2M0 | |
| Microsoft Windows 2000 Datacenter Server | 8.2 TS2M0 | |
| Solaris | 8.2 TS2M0 | |
| OS/2 | 8.2 TS2M0 | |
| IRIX | 8.2 TS2M0 | |
| 64-bit Enabled Solaris | 8.2 TS2M0 | |
| z/OS | 8.2 TS2M0 | |
| ABI+ for Intel Architecture | 8.2 TS2M0 | |
| Linux | 8.2 TS2M0 | |
| HP-UX | 8.2 TS2M0 | |
| 64-bit Enabled AIX | 8.2 TS2M0 | |
| OpenVMS Alpha | 8.2 TS2M0 | |
| CMS | 8.2 TS2M0 | |
| 64-bit Enabled HP-UX | 8.2 TS2M0 | |
| AIX | 8.2 TS2M0 | |
| Tru64 UNIX | 8.2 TS2M0 | |
*
For software releases that are not yet generally available, the Fixed
Release is the software release in which the problem is planned to be
fixed.
| Type: | Usage Note |
| Priority: | |
| Topic: | SAS Reference ==> Procedures ==> DATASETS Data Management ==> Manipulation and Transformation ==> Compressing Data Files
|
| Date Modified: | 2008-03-18 14:16:43 |
| Date Created: | 2001-08-21 14:25:20 |