Large SAS transport
files have become an issue for the FDA to process. Large SAS transport
file sizes occur, in part, when the maximum column length of 200 is
used for character variables.
The FDA has requested
that the allotted character length for each column in a data set be
the maximum length of the variable used.
There are valid reasons
to use different column lengths for data collection and data submission.
During data collection,
you might not know the final column length until the study is complete.
The maximum column length of a column that uses a non-extensible codelist
is predictable. The maximum column length of a column that uses an
extensible codelist or no codelist at all is not predictable and cannot
be known until the data collection is complete. To avoid possible
data truncation, longer column lengths can be warranted.
For data submissions,
you might choose to reduce a column length to a shorter length, such
as the maximum observed value or the maximum possible codelist value.
The %CSTUTILMANAGECOLUMNSIZE
macro enables you to determine and set the lengths of columns based
on observed, expected, predetermined, or codelist values. (This functionality
is enabled by the _cstTrimAlgorithm parameter.)
For complete information
about the %CSTUTILMANAGECOLUMNSIZE macro, see the SAS Clinical
Standards Toolkit: Macro API Documentation.