0% found this document useful (0 votes)
156 views

Understanding SDTM

Uploaded by

greeen.pat6918
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
156 views

Understanding SDTM

Uploaded by

greeen.pat6918
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Paper PO08

A Relational Understanding of SDTM Tables


John R. Gerlach, MaxisIT, Inc. Glenn OBrien, ALTANA Pharma US, Inc.

Abstract
The Study Data Tabulation Model (SDTM) is fast becoming the industry standard for processing data in clinical trials. Although the CDISC standard is well defined, with obvious benefits to those who manage and analyze the data, it is not easy to implement. For example, the design of a Case Report Form (CRF) or how the data are presented in reports affords little help to mapping raw data to SDTM variables and their appropriate domains. Indeed, the data pertaining to one SDTM domain may originate from several pages in a CRF; conversely, the data found on one page of a CRF might map to several SDTM domains, such as demography (DM) and inclusion/exclusion criteria (IE). Thus, the process of creating valid SDTM domain data sets requires a thorough understanding of SDTM domains. In order to become more learned about SDTM domains, it is important to develop a relational understanding of SDTM variables across domains, that is, the variables without their domain prefix (e.g., SEQ, not AESEQ). Better still, the relational schema can be class specific, for example, domains pertaining only to the Events class (i.e., Adverse Events, Patient Disposition, Medical History). Even better, the relational schema can indicate the data type and core function (i.e., Required, Expected, Permissible) of each variable, along with its label, across the several domains, assuming the variable exists in the domain. This paper explains a method for learning about SDTM domains by producing class-specific relational schemas.

Introduction
In July 2004 the Clinical Data Interchange Standards Consortium (CDISC) published standards on the design and content of clinical trial tabulation data sets, known as the Study Data Tabulation Model (SDTM). According to the CDISC standard, there are four ways to represent a subject in a clinical study: tabulations, data listings, analysis datasets, and subject profiles. With the implementation of the CDISC standard, trained professionals can use software tools to work more efficiently. Moreover, clinical trials following this standard can be consolidated into a repository for further research. SDTM domains contain observations about a subject that are topic-specific in a study. The variables in each domain are pre-defined for which there are five major categories, illustrated by the following examples. Identifier Topic Timing Qualifier Rule USUBJID LBTYPE LBDTC LBORRESU TEDUR Subject identifier Type of lab test Date / time of lab measurement Units of original lab measurement Rule describing the duration of a Trial Element.

Most SDTM variables are distinguished by a two-character identifier that denotes the domain itself. For example, the timing variable AEDTC is found in the AE (Adverse Events) domain and represents the start date/time of an event following the ISO 8601 convention (i.e., yyyy-mm-ddThh:mm:ss). In order to produce more meaningful relational schemas, that is, to show the existence of a variable across specific-type domains, the prefix of variables denoting the domain will be ignored. Despite the fact that even similar variables (e.g., AESEQ, LBSEQ) are not like common variables (e.g., STUDYID) found in a typical relational data model, the CDISC standard has evolved into a general relational model for representing all types of study data, even defining relationships between records in different domains, as well as between so-called supplemental qualifiers and a parent domain. Consequently, it behooves the SAS professional working in clinical trials to develop a relational understanding of the SDTM model.

More About SDTM Domains


SDTM Domains are grouped by classes, which is useful for producing more meaningful relational schemas. Consider the following domain classes and their respective domains. Special Purpose Class Pertains to unique domains concerning detailed information about the subjects in a study. o Demography (DM) o Comments (CM) Findings Class Collected information resulting from a planned evaluation to address specific questions about the subject, such as whether a subject is suitable to participate or continue in a study. o Electrocardiogram (EG) o Inclusion / Exclusion (IE) o Lab Results (LB) o Physical Examination (PE) o Questionnaire (QS) o Subject Characteristics (SC) o Vital Signs (VS) Events Class Incidents independent of the study that happen to the subject during the lifetime of the study. o Adverse Events (AE) o Patient Disposition (DS) o Medical History (MH) Interventions Class Treatments and procedures that are intentionally administered to the subject, such as treatment coincident with the study period, per protocol, or self-administered (e.g., alcohol and tobacco use). o Concomitant Medications (CM) o Exposure to Treatment Drug (EX) o Substance Usage (SU) Trial Design Class Information about the design of the clinical trial (e.g., crossover trial, treatment arms) including information about the subjects with respect to treatment and visits. o Subject Elements (SE) o Subject Visits (SV) o Trial Arms (TA) o Trial Elements (TE) o Trial Inclusion / Exclusion Criteria (TI) o Trial Visits (TV)

Besides the name and data type of each variable in a domain dataset, the relational schema includes another very important piece of information, the Core function of each variable. This attribute ensures CDISC compliance and provides guidance for those creating the domains. The Core function of a variable falls into three categories: Required A variable that is fundamental or pertinent to the identification of the domain. These variables are always included in the domain data set and cannot contain null values. Expected A variable that makes a record meaningful in the context of its domain. These variables should exist, but may contain null values. Permissible A variable that should exist if it is appropriate, either collected or derived. All timing variables are exemplary of this core function.

The Elements of the Report


In order to produce a meaningful schema, it is necessary to ignore the prefix component of variables denoting the domain (e.g., AESEQ). For example, the ubiquitous variable denoting Sequence Number (<domain>SEQ) becomes SEQ in the schema. Without eliminating the prefix denoting the domain, this very important variable would not be listed in a single row, across domains, indicating existence (data type and core function), which is the primary objective of the report. Again, the purpose is to produce a relational schema of class-specific domains. Also, keep in mind that there are variable names that do not use the domain as part of the variable identifier, such as: STUDYID, DOMAIN, USUBJID, SUBJID; found in most domains, as well as others that are specific to one or more domains (e.g., SEXCD, ARMCD). The CDISC SDTM Implementation Guide (SDS Version 3.1) contains a list of keywords along with their respective variable ID component, called a fragment, that are used to name variables. For example, the fragment DUR denotes duration of an event, which is found in the Adverse Events (AE) domain. Even though the relational schema contains the label of the variables, the following list should help to understand the naming convention for CDISC variables. Keyword ACTION CANCER COMPLIANC E DISABILITY ELAPSED FLAG INDICATION LOINC CODE NOT DONE ONGOING POSITION RESULT SERIOUS START SUBJECT TREATMENT Fragment ACN CAN CP DISAB EL FL INDC LOINC ND ONGO POS RES S, SER ST SUBJ TRT Keyword BASELINE CONDITION CONGENITAL DISPOSITION ELEMENT GROUP INDICATOR LOWER_LIMI T NUMBER ORIGIN REASON RULE SEVERITY STATUS TIME UNIT Fragment BL CND CONG DS ET GRP IND LO NUM ORIG REAS RL SEV STAT TM U Keyword BODY CODE DECODE DURATION EMERGENT HOSPITALIZATIO N LOCATION NAME NUMERIC OUTCOME REGIMENT SEQUENCE SPONSOR SUBCATEGORY TOTAL VALUE Fragment BOD CD DECOD DUR EM HOSP LOC NAM N OUT RGM SEQ SP SUBCAT TOT VAL

The report (relational schema) lists the set of variables (excluding the domain prefix) for a specific class juxtaposed with those domains indicating the data type (Character | Numeric) and core function (Requested | Expected | Permissible) for each variable, that is, if the variable exists in that domain; otherwise, a dash is written to indicate that the variable does not exist in that domain. Consider the illustration below that shows the general layout of the schema. Notice the sub-title that indicates the domain class, such as Findings, such that only those domains (e.g., Physical Exams, Vital Signs, etc.) contribute their variables to the report, most importantly, the data type and core function of each variable. Consequently, the reader can develop a relational understanding of the several domains in the context of the respective domain class.

Relational Schema of Standard SDTM Domains ( <Domain Class> ) Name Variable1 Variable2 Variable3 : Varn * Domain1 <C|N>/<R|E|P> <C|N>/<R|E|P> : Domain2 <C|N>/<R|E|P> : * Domain3 <C|N>/<R|E|P> : ...

Obtaining the Relevant Data


In order to produce schemas on class-specific domains, it is necessary to associate a domain identifier to its class, which is easily accomplished by the user-defined format below.
proc format; value $classf 'AE' 'CO' 'DS' 'EX' 'LB' 'PE' 'SC' 'SU' 'TA' 'TI' 'VS' run; = = = = = = = = = = = 'Events' 'Special Purpose' 'Events' 'Interventions' 'Findings' 'Findings' 'Findings' 'Interventions' 'Trial Design' 'Trial Design' 'Findings'; 'CM' 'DM' 'EG' 'IE' 'MH' 'QS' 'SE' 'SV' 'TE' 'TV' = = = = = = = = = = 'Interventions' 'Special Purpose' 'Findings' 'Findings' 'Events' 'Findings' 'Trial Design' 'Trial Design' 'Trial Design' 'Trial Design'

Besides the $classf format, the following metadata is required to produce the intended report: variable identifier, variable label, data type, and core function. Because the order of SDTM variables is considered as part of the CDISC standard, another variable, called order, will be used to produce the report. In fact, the utility produces a variable call group so that identically named variables will be listed first, followed by the other common variables. Given a data file that contains metadata about CDISC domains, the following SAS code reads the relevant metadata about the standard domains and imputes the other variables germane to the report. For this example, assume that the variable NAME does not contain the domain prefix, such as SEQ, DUR, DTC, rather than AESEQ, AEDUR, CODTC.
filename sdtm 'SDTM_Vars.txt'; data sdtm_vars; length domain name $8 label $60 type $4 core t_c $3; infile sdtm; input order domain name label type core; class = put(domain,$classf.); = upcase(substr(type,1,1)) || '/' || upcase(substr(core,1,1)); t_c select(name); when('STUDYID') do; group=1; order = 1; end; when('DOMAIN') do; group=1; order = 2; end; when('USUBJID') do; group=1; order = 3; end; when('SUBJID') do; group=1; order = 4; end; otherwise do; group=2; end; end; keep domain group order name t_c label; run;

The Utility
The utility, a SAS macro, processes a data set that contains the pertinent information (domain, variable ID, datatype, core-function, label, and class) and produces a class-specific relational schema. Initially, it is necessary to obtain only those observations that are relevant to a given class (e.g., Findings), which the SORT procedure accomplishes easily.
proc sort data=sdtm_vars out=class; by domain name; where upcase(class) eq "%upcase(&class.)"; run;

Next it is necessary to determine the number of domains for that class. For the Special Purpose class, there are only two domains; whereas, for the Findings class, there seven domains. In any case, the number of domains is determined by the completeness of the data source from which the metadata originated. The SQL step below creates a macro variable denoting the number of class-specific domains, and the following SQL step creates n-macro variables denoting the several domains.
proc sql noprint; select left(put(count(distinct domain),best.)) into :ndomains from class;

quit;

proc sql noprint; select distinct(domain) into :domain1 - :domain&ndomains. from class; quit;

Now comes the conceptual hard part. How do you create the appropriate data set that produces the aforementioned report, a relational schema, listing the relevant variables and, in juxtaposition, denoting their respective existence (data type and core function) or non-existence for a class-specific domain? The structure of the acquired metadata is normalized, whose unit of analysis is the domain and variable; whereas, the intended report lists variables across domains, that is, the domains are column headers. Consider the partial listing of the metadata data set found in the AE domain.

Metadata - AE Domain DOMAIN AE AE AE AE AE AE AE AE AE AE AE AE AE AE AE AE AE AE : NAME STUDYID DOMAIN USUBJID SEQ GRPID REFID SPID TERM MODIFY DECOD CAT SCAT OCCUR BODSYS LOC SEV SER ACN : GROUP 1 1 1 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 ORDER 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 : T_C C/R C/R C/R C/R C/P C/P C/P C/R C/P C/R C/P C/P C/P C/E C/P C/P C/E C/E LABEL Study Identifier Domain Abbreviation Unique Subject Identifier Sequence Number Group ID Reference ID Sponsor ID Reported Term for Adverse Event Modified Reported Term Dictionary-Derived Term Category for Adverse Event Subcategory for Adverse Event Adverse Event Ocurrence Body System or Organ Class Location of the Reaction Severity / Intensity Serious Event Action Taken with Study Treatment :

Since the normalized metadata data set contains all the domains, it is necessary to create subset data sets each representing a particular domain, then to perform a match-merge by the NAME variable. However, because the matrix of common variables and specific domains indicates the data type and core function in its cells, it is necessary to rename the T_C variable, appropriately, to the name of the respective domain. But, how do you do this systematically? Recall that the previous SQL steps generated macro variables denoting the number of domains and their names for a specific class. The following Data step uses these macro variables to formulate the needed MERGE statement, along with the WHERE data set option to obtain the subset data set representing each domain. Also, notice that the variable T_C is renamed to its respective domain data set name (e.g., CO, DM). It is sufficient to merge the several data sets by the variable NAME. The variable denoting the data type and core function, called by its domain name, contains a blank for those instances where the domain data set does not contribute an observation to the match merge, since the variable does not exist in that domain. The DO loop with an IF statement supplants the missing value with a dash by using the array. Consider the following Data step that creates the appropriate schema for a specific class of domains.
data schema; array domains{*}$5 %do i = 1 %to &ndomains.; &&domain&i.. %end; ; merge %do i = 1 %to &ndomains.; class(where=(domain eq "&&domain&i..") rename=t_c=&&domain&i..) %end; ; by name; do i = 1 to dim(domains); if domains{i} eq '' then domains{i} = ' - '; end; drop i class domain;

run;

The reporting data set contains all the pertinent information needed to produce the schema that represents several domains pertaining to a class and contains the collection of variables along with an appropriate label and an element denoting the data type and core function of the variable. The REPORT procedure generates the desired report.
proc report data=schema nowindows headline headskip split='!'; columns group order name label ('- Domains -' %do i = 1 %to &ndomains.; &&domain&i.. %end;) ; define group / order noprint; define order / order noprint; define name / order id width=8 'Variable!Name'; define label / display id width=40 'Label'; break after group / skip; title2 "Relational Schema of Standard SDTM Domains"; title3 "( %upcase(&class.) )"; run;

Keep in mind that the previous steps reside in a macro definition called %schema. This macro contains only one keyword parameter, indicating the default class Special Purpose. Now, consider several invocations of the %schema macro that produces the intended relational schema.
%schema(); %schema(class=interventions); %schema(class=findings); %schema(class=trial design); %schema(class=events);

We proceed now to study the several reports with the intent to develop a better understanding of SDTM domains.

Special Purpose Class


Unlike the other classes, there is not much similarity amongst the two Special Purpose domains: Comments (CO) and Demography (DM). In fact, except for the usual set of identifying variables, there is only one common variable, the Date/Time of Collection (DTC), that is, the variables CODTC and DMDTC. Otherwise, the collection of variables is almost mutually exclusive. Also noteworthy is the core function of the DTC variable is consistent (i.e., Permissible), which is surprisingly not always the case.

Relational Schema of Standard SDTM Domains ( SPECIAL PURPOSE ) Variable - Domains -Name Label CO DM ---------------------------------------------------------------STUDYID DOMAIN USUBJID SUBJID AGE AGEU ARM ARMCD BRTHDTC COUNTRY DTC DY ETHNIC EVAL IDVAR IDVARVAL INVID INVNAM RACE RDOMAIN REF RFENDTC RFSTDTC SEQ SEX Study Identifier Domain Abbreviation Unique Subject Identifier Subject Identifier for the Study Age in AGEU at Reference Date/Time Age Units Description of Arm Arm Code Date/Time of Birth Country Date/Time of Collection Study Day of Collection Ethnicity Evaluator Identifier Variable Name Identifier Variable Value Investigator Identifier Investigator Name Race Related Domain Abbreviation Comment Reference Subject Reference End Date/Time Subject Reference Start Date/Time Sequence Number Sex C/R C/R C/R C/P C/P C/P C/P C/E C/P N/R C/R C/R C/R C/R N/E C/E C/R C/R C/P C/R C/P N/P C/P C/P C/P C/E C/R C/R C/R

SITEID VAL

Study Site Identifier Comment

C/R

C/R -

Findings Class
The Findings class pertains to information about a subject related to some kind of evaluation or assessment, which includes: Electrocardiogram (EG), Inclusion / Exclusion (IE), Laboratory Results (LB), Physical Examinations (PE), Questionnaire (QS), Subject Characteristics (SC), and Vital Signs (VS). The relational schema below shows that the primary common variables (Study ID, Domain, and Subject ID) exist in all the domains for that class, as well as date-related variables. Notice that the variables concerning visits are found in all except Subject Characteristic (SC). Also, the variable ORRES is Expected in all domains except for the Inclusion / Exclusion (IE) domain, where it is Required, which makes sense. It is left to the reader to consider other common traits or differences with respect to existence, data type, or core function.

Relational Schema of Standard SDTM Domains ( FINDINGS ) Variable ------------------- Domains ------------------Name Label EG IE LB PE QS SC VS --------------------------------------------------------------------------------------------------STUDYID DOMAIN USUBJID BLFL BODSYS CAT DRVFL DTC DY ELTM EVAL FAST GRPID LOC LOINC METHOD MODIFY NAM NRIND ORNRHI ORNRLO ORRES ORRESU POS REASND REFID SCAT SEQ SPCCND SPEC SPID STAT STNRC STNRHI STNRLO STRESC STRESN STRESU TEST TESTCD TOX TOXGR TPT TPTNUM TPTREF VISIT VISITDY VISITNUM Study Identifier Domain Abbreviation Unique Subject Identifier Baseline Flag Body System or Organ Class Category for Vital Signs Derived Flag Date/Time of Measurements Study Day of Vital Signs Elapsed Time from Reference Point Evaluator Fasting Status Group ID Location of Vital Signs Measurement LOINC Code Method of Test or Examination Modified Reported Term Vendor Name Reference Range Indicator Normal Range Upper Limit in Orig Units Normal Range Lower Limit in Orig Units Result or Finding in Original Units Original Units Vital Signs Position of Subject Reason Not Performed Specimen ID Subcategory for Vital Signs Sequence Number Specimen Condition Specimen Type Sponsor ID Vitals Status Reference Range for Char Rslt-Std Units Normal Range Upper Limit-Standard Units Normal Range Lower Limit-Standard Units Character Result/Finding in Std Format Numeric Result/Finding in Standard Units Standard Units Vital Signs Test Name Vital Signs Test Short Name Toxicity Standard Toxicity Grade Planned Time Point Name Planned Time Point Number Time Point Reference Visit Name Planned Study Day of Visit Visit Number C/R C/R C/R C/E C/P C/P C/E N/P C/P C/E C/P C/P C/P C/P C/ C/E C/P C/E C/P C/P C/P N/R C/P C/P C/E N/P C/P C/R C/R C/P N/P C/P C/P N/P N/R C/R C/R C/R C/R C/E N/P C/R C/P N/R C/P C/R C/R C/R C/P N/P N/P C/R C/R C/R C/E C/E C/P C/E N/P C/P C/P C/P C/P C/P C/P C/E C/E C/E C/E C/E C/P C/P C/P N/R C/P C/P C/P C/P C/P N/E N/E C/E N/E C/E C/R C/R C/P C/P C/P N/P C/P C/P N/P N/R C/R C/R C/R C/E C/P C/P C/E N/P C/P C/P C/P C/P C/E C/P C/P C/P N/R C/P C/P C/E N/E C/E C/R C/R C/P N/P N/E C/R C/R C/R C/E C/R C/P C/E N/P C/P C/P C/E C/P C/P C/P N/R C/P C/P C/E N/P C/P C/R C/R C/P N/P C/P C/P N/P N/E C/R C/R C/R C/P C/E N/P C/P C/E C/P C/P C/P N/R C/P C/P C/E N/P C/P C/R C/R C/R C/R C/R C/E C/P C/P C/E N/P C/P C/P C/P C/P C/E C/E C/E C/P C/P N/R C/P C/P C/E N/E C/E C/R C/R C/P N/P C/P C/P N/P N/R

XFN

ECG External file Name

C/P

Events Class
The Events class pertains to an occurrence or incident independent of the clinical trial, such as an adverse event (AE), occurring during the trial, and medical history (MH), occurring prior to the trial. This class documents protocol milestones such as randomization and patient disposition (DS) (e.g., completed study). Obviously, the AE domain represents most of the data found in this class. Notice that the DS and MH domains have actual visit numbers; whereas, AE does not, which makes sense, since such an event is not planned according to the protocol.

Relational Schema of Standard SDTM Domains ( EVENTS ) Variable ----- Domains ----Name Label AE DS MH ----------------------------------------------------------------------STUDYID DOMAIN USUBJID ACN ACNOTH BODSYS CAT CONTRT DECOD DTC DUR DY ENDTC ENDY ENRF EPOCH GRPID LOC MODIFY OCCUR OUT PATT REASND REFID REL RELNST SCAN SCAT SCONG SDISAB SDTH SEQ SER SEV SHOSP SLIFE SMIE SOD SPID STAT STDTC STDY TERM TOXGR VISIT VISITDY VISITNUM Study Identifier Domain Abbreviation Unique Subject Identifier Action Taken with Study Treatment Other Action Taken Body System or Organ Class Category for Medical History Concomitant or Additional Trtmnt Given Dictionary-Derived Term Date/Time of History Collection Duration of Event Study Day of History Collection End Date/Time of Medical History Event Study Day of End of Event End Relative to Reference Period Trial Epoch Group ID Location of the Reaction Modified Reported Term Medical History Occurrence Outcome of Adverse Event Pattern of Event Reason Medical History Not Collected Reference ID Causality Relationship to Non-Study Treatment Involves Cancer Subcategory for Medical History Congenital Anomaly or Birth Defect Persist or Signif Disability/Incapacity Results in Death Unique Sequence Number Serious Event Severity/Intensity Requires or Prolongs Hospitalization Is Life Threatening Other Medically Important Serious Event Occurred with Overdose Sponsor ID Medical History Status Start Date/Time of Medical History Event Study Day of Start of Disposition Event Reported Term for the Medical History Standard Toxicity Grade Visit Name Planned Study Day of Visit Visit Number C/R C/R C/R C/E C/P C/E C/P C/P C/R C/P C/E N/P C/P C/P C/P C/P C/P C/P C/P C/P C/E C/P C/P C/P C/P C/P C/P N/R C/E C/P C/P C/P C/P C/P C/P C/E N/P C/R C/P C/R C/R C/R C/P C/R C/P C/P C/P C/P C/P N/R C/P C/E N/P C/R C/P N/P N/P C/R C/R C/R C/E C/P C/E C/P N/P C/P C/P C/P C/P C/P C/P C/P C/P N/R C/P C/P C/P C/R C/P N/P N/P

Interventions Class
The Interventions class pertains to information about treatment either as specified by the protocol or prior to the study period. This class includes the domains Concomitant Medications (CM), Exposure (EX), and Substance Use (SU). Once again, the primary common variables exist in all three domains and have the same attributes. Notice that several of the variables change across domains with respect to core function, such as DOSE (Substance Use Consumption) and ENDTC (End Date / Time of Substance Use). Also, the CM and EX domains are not concerned with the variables denoting visits, since visits are scheduled according to the protocol.

Relational Schema of Standard SDTM Domains ( INTERVENTIONS ) Variable ----- Domains ----Name Label CM EX SU ----------------------------------------------------------------------STUDYID DOMAIN USUBJID CAT CLAS CLASCD DECOD DOSE DOSFRM DOSFRQ DOSRGM DOSTOT DOSTXT DOSU DUR ELTM ENDTC ENDY ENRF GRPID INDC LOC LOT MODIFY OCCUR REASND ROUTE SCAT SEQ SPID STAT STDTC STDY STRF TAETORD TPT TPTNUM TPTREF TRT VISIT VISITDY VISITNUM Study Identifier Domain Abbreviation Unique Subject Identifier Category for Substance Use Substance Use Class Substance Use Class Code Standardized Substance Name Substance Use Consumption Dose Form Use Frequency Per Interval Intended Dose Regimen Total Daily Consumption using SUDOSU Substance Use Consumption Text Consumption Units Duration of Substance Use Planned Elapsed Time from Reference Pt End Date/Time of Substance Use Study Day of End of Substance Use End Relative to Reference Period Group ID Indication Location of Dose Administration Lot Number Modified Substance Name SU Occurrence Reason Substance Use Not Collected Route of Administration Subcategory for Substance Use Sequence Number Sponsor ID Substance Use Status Start Date/Time of Substance Use Study Day of Start of Substance Use Start Relative to Reference Period Order of Element within Arm Planned Time Point Name Planned Time Point Number Time Point Reference Name of Substance Visit Name Planned Study Day of Visit Visit Number C/R C/R C/R C/P C/P C/P C/P N/P C/P C/P C/P N/P C/P C/P C/P C/P N/P C/P C/P C/P C/P C/P C/P C/P C/P N/R C/P C/P C/P N/P C/P C/R C/R C/R C/R C/P N/E C/P C/P C/P N/P C/P C/E C/P C/P C/E N/P C/P C/P C/P C/P C/P N/R C/P C/R N/P N/P C/P N/P C/P C/R C/R C/R C/R C/P C/P C/P C/P N/P C/P C/P N/P C/P C/P C/P C/P N/P C/P C/P C/P C/P C/P C/P N/R C/P C/P C/P N/P C/R C/P N/P N/R

Trial Design Class


9

Finally, the Trial Design class represents information about the planned sequence of events and the treatment plan for a clinical trial. Also, this class documents events about the subject during the trial. The domains include: Subject Elements (SE), Subject Visits (SV), Treatment Arms (TA), Trial Elements (TE), Trial Inclusion / Exclusion (TI), and Trial Visits. Unlike the IE domain, the TI domain is not subject oriented, since the IE domain contains records only for inclusion and exclusion criteria that a subject did not meet. Upon inspection of these domains, it is reasonable that the Trial domains do not have a subject identifier, unlike the SE and SV domains. Similarly, date variables are manifest in the Subject domains. Also noteworthy, the variables ARM and ARMCD differ between the TA and TV domains with respect to core function.

Relational Schema of Standard SDTM Domains ( TRIAL DESIGN ) Variable --------------- Domains ---------------Name Label SE SV TA TE TI TV -------------------------------------------------------------------------------------------STUDYID DOMAIN USUBJID ARM ARMCD BRANCH CAT DUR ELEMENT ENDTC ENRL EPOCH ETCD ETORD RL STDTC STRL TEST TESTCD TRANS UPDES VISIT VISITDY VISITNUM Study Identifier Domain Abbreviation Unique Subject Identifier Description of Arm Arm Code Branch Category for Exception Criterion Planned Duration of Element Description of Element End Date/Time of Visit Visit End Rule Trial Epoch Element Code Order of Element within Arm Inclusion/Exclusion Criterion Rule Start Date/Time of Visit Visit Start Rule Exception Criterion Exception Criterion Short Name Transition Rule Description of Unplanned Visit Visit Name Planned Study Day of Visit Visit Number C/R C/R C/R C/P C/E C/R C/E C/P C/R C/R C/R C/E C/E C/P C/P N/P N/R C/R C/R C/R C/R C/E C/P C/P C/R N/R C/E C/R C/R C/P C/R C/R C/R C/R C/R C/R C/R C/P C/R C/R C/R C/R C/P C/E C/P C/P C/R N/P N/R

Conclusion
As with any computer generated report, the outcome is as good as the data. With the growth and development of the CDISC standard, it is obvious that there will be changes, such as variables being added or dropped, attributes being changed, or even the creation of new domains; e.g., the Protocol Deviations domain (DV) found in the CDISC SDTM Implementation Guide (SDS Version 3.1.1). Thus, it is extremely important to have the latest, most complete, version of the metadata in order to produce viable reports. The Study Data Tabulation Model is becoming the industry standard for clinical trials. Proper implementation of this data model requires an understanding of the rules that map clinical data to their appropriate domains. By using metadata concerning these domains, one can develop a relational understanding of SDTM domains in the context of their specific class with respect to the data type of each variable, if it exists in the domain, and its core function. This SAS solution depicts a clever way to generate relational schemas on class-specific SDTM domains and affords a good opportunity to understand the structure and content of those domains.
SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks or SAS Institute Inc. in the USA and other countries. indicates USA registration.

10

You might also like