CDISC CRT-DDS 1.0

Purpose

The CDISC CRT-DDS standard defines the metadata structures in a machine-readable XML format. These metadata structures are used to describe the CRT data sets and variables for regulatory submissions. The XML schema that is used to define the metadata structures in an XML format is based on an extension to the CDISC Operational Data Model (ODM).

Release Date

CDISC CRT-DDS, Final Version 1.0, February 10, 2005

Regulatory Basis

(Source: CDISC Case Report Tabulation Data Definition Specification)
In 1999, the FDA standardized the submission of clinical and non-clinical data and metadata in a set of eSubmission guidelines to include metadata descriptions of the data sets and columns within a Data Definition Document (define.pdf). In 2003, the FDA published a set of guidance documents on receiving electronic product applications per the International Conference on Harmonisation (ICH) electronic Common Technical Document (eCTD) specifications. In these specifications, the FDA expanded the acceptable file types to include the XML format.

CDISC CRT-DDS 1.0 Reference Standard

Overview

The domain and column metadata that constitute the SAS representation of CDISC CRT-DDS 1.0 are derived from the global standards library in these formats:
Sample Reference_Tables Record (CDISC CRT-DDS 1.0)
Column Name
Column Value
sasref
REFDATA
table
ItemGroupDefs
label
keys
OID
standard
CDISC-CRTDDS
standardversion
1.0
standardref
comment
xmlelementname
ItemGroupDefs
class
ItemGroupDefs
qualifiers
Sample Reference_Columns Record (CDISC CRT-DDS 1.0)
Column Name
Column Value
sasref
REFDATA
table
DefineDocument
column
FileType
label
File type (Snapshot | Transactional)
order
5
type
C
length
13
displayformat
$13.
standard
CDISC-CRTDDS
standardversion
1.0
standardref
comment
core
Req
xmlcodelist
FILETYPE
qualifiers
As a general rule, the SAS representation of the CDISC CRT-DDS standard is patterned to match the XML element (data set) and attribute (column) structure of define.xml. For example, for CDISC SDTM, domain-level metadata is represented by a define.xml ItemGroupDef element. This metadata is captured in the ItemGroupDefs SAS data set. The TE domain metadata is shown in this code:
<ItemGroupDef OID="docroot.IG.TE"
    Name="TE"
    Repeating="No"
    IsReferenceData="Yes"
    Purpose="Tabulation"
    def:Label="Trial Elements"
    def:Structure="One record per planned element"
    def:DomainKeys="STUDYID,ETCD"
    def:Class="Trial Design"
    def:ArchiveLocationID="ArchiveLocation.te">
    !-- All ItemRefs would be listed here -->
    <def:leaf ID="ArchiveLocation.te"
    xlink:href="te.xpt"> <def:title>te.xpt</def:title>
  </def:leaf>
</ItemGroupDef>
The TE domain metadata is shown in this table.
Sample Data Set Representation: ItemGroupDefs.sas7bdat
Column
Value
OID
docroot.IG.TE
Name
TE
Repeating
No
IsReferenceData
Yes
SASDatasetName
Domain
Origin
Role
Purpose
Tabulation
Comment
Label
Trial Elements
Class
Trial Design
Structure
One record per planned element
DomainKeys
STUDYID, ETCD
ArchiveLocationID
ArchiveLocation.te
FK_MetaDataVersion
Note: Empty or null attributes are not typically included in the XML file.
This table lists the complete set of 39 tables that form the SAS Clinical Standards Toolkit SAS representation of the CDISC CRT-DDS 1.0 standard.
Data Sets in the SAS Representation of the CDISC CRT-DDS 1.0 Standard
Table
Table
AnnotatedCRFs
ItemQuestionTranslatedText
CLItemDecodeTranslatedText
ItemRangeCheckValues
CodeListItems
ItemRangeChecks
CodeLists
ItemRole
ComputationMethods
ItemValueListRefs
DefineDocument
MDVLeaf
ExternalCodeLists
MDVLeafTitles
FormDefArchLayouts
MUTranslatedText
FormDefItemGroupRefs
MeasurementUnits
FormDefs
MetaDataVersion
ImputationMethods
Presentation
ItemAliases
ProtocolEventRefs
ItemDefs
RCErrorTranslatedText
ItemGroupAliases
Study
ItemGroupDefItemRefs
StudyEventDefs
ItemGroupDefs
StudyEventFormRefs
ItemGroupLeaf
SupplementalDocs
ItemGroupLeafTitles
ValueListItemRefs
ItemMURefs
ValueLists
ItemQuestionExternal
The highly structured nature of CDISC CRT-DDS data requires that any mapping to a relational format include a large number of data sets, with foreign key relationships to help preserve the intended non-relational object structure. In the SAS Clinical Standards Toolkit, foreign key relationships are enforced when validating the CDISC CRT-DDS data sets.
Field lengths in the CDISC CRT-DDS data sets are consistent by core data type. CDISC has not specified any limit to the length of most character fields. Arbitrary lengths have been chosen by data type. These lengths are listed in this table. In the table, standard data types are distilled into core data types. To be safe, larger lengths have been chosen to ensure that no data loss occurs in the SAS Clinical Standards Toolkit pre-installed data sets. Production tables might be compressed using SAS mechanisms to preserve disk space.
CDISC CRT-DDS Default Lengths by Data Type
Type Name
Length
Description
oid
128
A unique object identifier or a reference
text
2000
A character field that can accommodate a large number of characters
name
128
A descriptive identifier
value
512
An item of collected or reference data
path
512
An absolute or relative file system path or URL

CDISC CRT-DDS SAS Data Set Construction

CRT-DDS 1.0 SAS Data Sets lists the data sets with member columns that form the CDISC CRT-DDS 1.0 data in the SAS Clinical Standards Toolkit.
It is this set of files, in whole or in part, that defines the CDISC CRT-DDS reference standard.
The SAS Clinical Standards Toolkit CDISC CRT-DDS reference standard supports reading and representing in SAS a define.xml file, building a define.xml file, and validating the structure and content of the SAS representation of a define.xml file. In addition, it validates the structural integrity of the define.xml file. To support this functionality, supplemental files include these global standards library files:
  • A SAS format catalog (crtddsct.sas7bcat) in the formats folder provides valid values for selected columns in the 39 data sets of the SAS representation.
  • The Validation Master data set in the validation/control folder contains the super-set of checks validating the structure and content of the 39 data sets.
  • The Messages data set in the messages folder provides error messaging for all Validation Master checks.
  • SAS code in the macros folder provides CDISC CRT-DDS-specific code that augments code that is provided in the primary SAS Clinical Standards Toolkit autocall library (!sasroot/cstframework/sasmacro).
  • The style sheet folder contains the define1-0-0.xsl file. The style sheet is copied from http://www.cdisc.org/stuff/contentmgr/files/0/464923b10ea16b477151fcaa9f465166/misc/define1_0_0.xsl. A define.xml file can be rendered in a human-readable form if it contains an explicit XML style sheet reference, such as a reference to the default style sheet.