SDTCG v5.7 March2024 Final
SDTCG v5.7 March2024 Final
SDTCG v5.7 March2024 Final
March 2024
STUDY DATA
TECHNICAL CONFORMANCE GUIDE
March 2024
Revision History
Date Version Summary of Revisions
January 2014 1.0 Initial Version
Revisions based on the public comment period (February 2014 – May 2014); and CDER/CBER
December 2014 2.0 internal review May 2014 – December 2014
Revisions based on comments received to version 2.0. Updates to Sections 2.2 Study Data
Reviewer’s Guide (SDRG)
March 2015 2.1 SDRG, 2.3 Analysis Data Reviewer’s (ADRG), 3.3.1 SAS Transport Format, 3.3.2. Dataset
Size and a revision of Section 4.1.4.5 Data Definition File
Revisions based on comments received to version 2.1. Updates on Table of Contents; Sections
4.1, 4.1.1.2, 4.1.2.3. Updates to Trial Design. Added Exposure as Collected (EC Domain) and
Death Details (DD) subsections. Updates to 4.1.2.2, 4.1.2.3, 4.1.2.4, 4.1.2.5, 4.1.2.6, 4.1.2.8,
June 2015 2.2 4.1.2.9.1, 4.1.2.9.2, 4.1.4 (header and all sub-headers updated to specify which standards
apply), 4.1.4.5, and 4.1.4.6. Added 5.1 subsection; 6.7, 6.7.1, 6.7.1.1. Updates on Section 7,
8.2.2 and Glossary.
Updates to Section 1.3, Exposure as Collected (EC Domain) and Death Details (DD Domain).
October 2015 2.3 Reorganization of Section 4.1.2 and corresponding updates to appropriate sub-sections. Updates
to Sections 4.1.4.5 and 5.1. Added Sections 7.1 and 7.2.
Section 2.2 (Study Data Reviewer’s Guide) – Updated link for SDRG in Footnote 10
Section 3.3.2 (Dataset Size) – Increased Data Set Size
Section 4.1.1.2 (SDTM General Considerations) – Updated to reflect define.xml file and SDRG
reference.
Section 4.1.2.2 (Analysis Data Model – General Considerations) – Updated to reflect define.xml file
and SDRG reference.
Section 4.1.3.2 (Standard for Exchange of Nonclinical Data – General Considerations) – Updated to
reflect define.xml file and SDRG reference.
Section 4.1.4.5 (Data Definition Files for SDTM, SEND, and AdaM) – Updated to reflect define.xml
version 2.0 and data definition specification details
March 2016 3.0 Section 5.1 (Therapeutic Area Standards – General) – Updated to reflect more detailed information
related to Therapeutic Area Standards
Section 5.2 (Supported Therapeutic Area Standards) – Added information related to acceptance
testing on the standard
Section 5.2.1 (Chronic Hepatitis C) – Added Section for this information.
Section 5.2.2 (Dyslipidemia) – Added Section for this information.
Section 6.1.2.1 (Use of the specific controlled term “OTHER”) – Added information related to
controlled terminology and the mapping to “Other”
Section 8.3.1 (Study Data Traceability Overview) – Update to Study Data Traceability flow diagram
reference.
Section 2.1 (Study Data Standardization Plan) Updated to reflect acronym SDSP (Study Data
Standardization Plan) and added footnote 10.
Section 4.1.1.3 (SDTM Domain Specifications) – Updated Trial Design Model (TDM)
Section 4.1.3.3 (SEND Domain Specification) – Added Trial Design (TD)
Section 5.2.3 (Diabetes) – Added Section for this information.
Section 5.2.4 (QT Studies) – Added Section for this information.
July 2016 3.1 Section 5.2.5 (Tuberculosis) – Added Section for this information.
Section 8.2.1.1 (Conformance validation) – Created Section Header and expanded information.
Section 8.2.1.2 (Quality checks) – Created Section Header and updated to reflect study data standard.
Section 8.2.2 (Support on Data Validation Rules) – Expanded information.
Section 3.2 (Portable Document Format) & Glossary – Updated International Council for
Harmonisation (ICH) name
i
Revision History
Date Version Summary of Revisions
Section 2.2.1 (SDRG for Clinical Data) – Added naming convention
Section 2.2.2 (SDRG for Nonclinical Data) -Added naming convention
Section 2.3 (Analysis Data Reviewer’s Guide) – Provided additional information
Section 3.3.3 (Dataset Column Length) – Expanded Information
Section 4.1.1.2 (SDTM General Considerations) – Expanded Adjudication Data
Section 4.1.2.10 ( Software Programs) – Added more detail related to software programs
Section 4.1.3.2 (General Considerations) – Added VISITDY variable information
Section 4.1.3.3 (SEND Domain Specification) – Added Clinical Observations (CL) Domain and
Pharmacokinetic Concentrations (PC) Domain. Expanded Trial Arms and Trial Sets information.
Section 5.1 (General) – Expanded Information
October 2016 3.2 Section 5.2 (Supported Therapeutic Areas) – Expanded Information
Section 7.1 (ECTD File Directory Structure) – Referenced the Guidance to Industry Providing
Regulatory Submissions in Electronic Format: Certain Human Pharmaceutical Product Applications
and Related Submissions Using the Electronic Common Technical Document Specifications and
added footnote
Section 7.2 (ECTD Sample Submission) – Change header to align with detailed information.
Section 8.2.1 (Types of Data Validation Rules) – Expanded Information
Section 8.2.1.1 (Conformance validation) – Expanded Information
Section 8.2.1.2 (FDA Business Rules) – Added new Section
Section 8.2.2 (Support on Data Validation Rules) – updated to reflect conformance rules
Section 8.3.1 (Overview (Study Data Traceability) – added relate counts information
Section 8.2.2 (Support on Data Validation Rules) – Footnote 50 Added reference to the Standards
Webpage.
Section 4.1.3.3 (SEND Domain Specification) – Fixed Typo.
November 2016 3.2.1 Global (Updated naming convention for clinical Study Data Reviewer’s Guide (“csdrg.pdf”) and
the non-clinical Study Data Reviewer’s Guide (“nsdrg.pdf”) to reflect lower case instead of upper
case. eCTD requires lower case file names
Section 1.1 (Background) – Updated tense
Section 1.4 (Organization and Summary of the Guide) – Clarification on terminology section
Section 2.3 (Analysis Data Reviewer’s Guide) – Clarification on ADRG
Section 4.1.1.3 (SDTM Domain Specifications) – Clarification on DS Domain
Section 4.1.2.2 (General Considerations) – Added prior text for desired analysis views for reviews
Section 4.1.3.2 (General Considerations) – Clarification on VISITDY for MA, MI, OM in the DS
Domain
Section 4.1.3.3 (SEND Domain Specification) – Clarification on SUPPQUAL and MI Domain.
Expanded PC Domain, Custom Domain, and Trial Design Model information. Added footnote for
March 2017 3.3 SENDIG
Section 4.1.4.1 (Variables in SDTM and SEND: CDISC Required, Expected, and Permissible) –
Clarification on SEND datasets and Subject Visits in SDTM.
Section 5.1 (General) – Updated and clarified text
Section 5.2 (Supported Therapeutic Areas) – Updated and clarified text, added TA section
Section 8 (Study Validation and Traceability) –Updated and clarified text
Section 8.3.1 (Overview) – Expanded information on traceability
Section 8.3.2 (Legacy Study Data Conversion to Standardized Study Data) – Clarification on legacy
data conversions
Glossary – Additions
ii
Revision History
Date Version Summary of Revisions
Section 1.5 (Relationship to Other Documents) – Updated references
Section 2.1 (Study Data Standardization Plan) – Clarification on SDSP and added footnotes
Section 2.2 (Study Data Reviewer’s Guides) – Clarification on Reviewer Guides
Section 4.1 (Clinical Data Interchange Standards Consortium) – Clarification on terms SDTM, AdaM,
and SEND
Section 4.1.1.2 (SDTM General Considerations) – Updated and clarified text
Section 4.1.1.3 (SDTM Domain Specifications) – Added QS Domain (Questionnaires)
Section 4.1.2.4 (Subject-Level Analysis Data) – Updated and clarified text on baseline characteristics
Section 4.1.2.10 (Software Programs) – Updated and clarified text
Section 4.1.3.1 (Definition) – Updated and clarified text
Section 4.1.3.2 (General Considerations) – Clarification on variable usage
October 2017 4.0 Section 4.1.3.3 (SEND Domain Specification) – Clarification and added text
Section 4.1.4.1 (Variables in SDTM and SEND: CDSIC Required, Expected, and Permissible) –
Added text
Section 4.1.4.6 (Annotated Case Report Form (aCRF) for SDTM) – Updated and clarified text. The
recommendation to use the SDTM Metadata Submission Guidelines was removed pending further
FDA review.
Section 5.2 (Supported Therapeutic Areas) –Added TA sections
Section 6.3.1.1 (General Considerations) – Updated and clarified text
Section 6.7.1.1 (General Considerations) – Added clarification text
Section 8.3.2 (Legacy Study Data Conversion to Standardized Study Data) – Added clarification text
Section 8.3.2.2 (Legacy Data Conversion Plan and Report) – Added clarification text
Section 2.1 (Study Data Standardization Plan) – Clarified text
Section 2.2 (Study Data Reviewer’s Guides) – Updated footnote text
Section 2.3 (Analysis Data Reviewer’s Guide) – Clarified text
Section 3.3.1 (SAS Transport Format) – Updated text
Section 3.3.6 (Variable and Dataset Names) – Updated text
Section 4.1.1.2 (SDTM General Considerations) – Updated and clarified text
Section 4.1.1.3 (SDTM Domain Specifications) – Updated and clarified text
Section 4.1.2.2 (General Considerations) – Clarified text
March 2018 4.1 Section 4.1.2.10 (Software Programs) – Updated and clarified text
Section 4.1.3.3 (SEND Domain Specification) – Clarified text
Section 4.1.4.1 (Variables in SDTM and SEND: CDISC Required, Expected, and Permissible) –
Updated and clarified text
Section 5.2 (Supported Therapeutic Areas) –Added TA sections
Section 6.1.2 (Use of Controlled Terminologies) – Updated and clarified text
Section 6.4.2.1 (General Considerations) – Clarified text
Section 6.6.1.1 (General Considerations) – Updated text
Section 3.3.5 (Special Characters: Variables and Datasets) – Added clarification text
Section 4.1.1.3 (SDTM Domain Specifications) – Additional text and table under DM and Trial
Design Model sections, added DV Domain section
Section 4.1.2.10 (Software Programs) – Clarified text
Section 4.1.3.2 (General Considerations) – Updated text
Section 4.1.3.3 (SEND Domain Specification) – Added Lab Test Results, Body Weight, and
Comments domains, updated Pharmacokinetics Concentrations Domain and Trial Design Model
October 2018 4.2 sections
Section 4.1.4.1 (Variables in SDTM and SEND: CDISC Required, Expected, and Permissible) –
Updated text
Section 5.2 (Supported Therapeutic Areas) –Added TA sections
Section 6.6.1.1 (General Considerations) – Updated text
Section 7.1 (eCTD Specifications) – Updated text
Section 8.2.2 (Support on Data Validation Rules) – Updated text
Appendix B, C, D, E, F, G – Added
Section 4.1.1.3 (SDTM Domain Specifications) – Updated text
December 2018 4.2.1 Section 4.1.3.3 (SEND Domain Specification) – Updated text
Appendix F, G – Removed
iii
Revision History
Date Version Summary of Revisions
Section 2.3 (Analysis Data Reviewer’s Guide) – Clarified text
Section 4.1.1.3 (SDTM Domain Specifications) – Updated text
Section 4.1.3.2 (General Considerations) – Clarified text
Section 4.1.3.3 (SEND Domain Specification) – Added, updated, and clarified text
Section 4.1.4.5 (Data Definition Files for SDTM, SEND, and AdaM) – Clarified text
Section 5.2 (Supported Therapeutic Areas) – Added TA sections
March 2019 4.3 Section 6.4.1.1 (General Considerations) – Updated text
Section 6.4.2.1 (General Considerations) – Updated text
Section 6.5.1 (Medication – Reference Terminology) – Updated text
Section 6.5.1.1 (General Considerations) – Updated text
Section Appendix B – Updated text
Section Appendix C – Added, updated, and clarified text
Section 2.1 (Study Data Standardization Plan) – Added clarifying text
Section 2.2 (Study Data Reviewer’s Guide) – Added clarifying text
Section 2.3 (Analysis Data Reviewer’s Guide) – Added clarifying text
Section 4.1.1.2 (SDTM General Considerations) – Added clarifying text
Section 4.1.1.3 (SDTM Domain Specifications) – Added text
Section 4.1.3.3 (SEND Domain Specification) – Added text
October 2019 4.4 Section 4.1.2.6 (Key Efficacy and Safety Data) – Updated title and text
Section 5.2 (Supported Therapeutic Areas) – Re-ordered by inclusion date, added TA section
Section 6.1 (Use of Controlled Terminologies) – Added clarifying text
Section 6.7.1.1 (General Considerations) – Updated section
Section 7.1 (eCTD Specifications) – Added text
Section 8.2.2 (Support on Data Validation Rules) – Added text
Section 4.1.3.3 (SEND Domain Specification) – Updated and added clarifying text
March 2020 4.5 Section 5.2 (Supported Therapeutic Areas) – Added TA section
Section 5.2.18 (Vaccines Therapeutic Area User Guide v1.1) – Added text
July 2020 4.5.1 Appendix D – Added text
Section 1.2 (Purpose) – Added text
Section 4.1.1.2 (SDTM General Considerations) – Text clarified
Section 4.1.3.2 (General Considerations) – Updated and added clarifying text
Section 4.1.4.1 (Variables in SDTM and SEND: CDISC Required, Expected, and Permissible) – Text
November 2020 4.6 clarified
Section 5.2 (Supported Therapeutic Areas) – Added TA section
Section 6.1.3 (Maintenance of Controlled Terminologies) – Added clarifying text
Appendix C – Added text and notes
Appendix D – Updated text
Section 4.1.1.3 (SDTM Domain Specifications) – Provided clarification on the use of AE domains and
the use of laboratory data units.
Section 4.1.3.2 (General Considerations) – Specified when FOCID should be utilized
Section 4.1.3.3 (SEND Domain Specification) – Provided clarification on the use of BLQ vs BQL,
March 2021 4.7 and the use of custom domains
Section 7.1 (eCTD Specifications) – Provided clarification on the submission of toxicity study data for
nonclinical Weight of Evidence documents
Appendix C: Removed TS code for Interim Study Flag, updated the description for TS code Planned
Dose Frequency, and added new TS code titled Study Report Status
Section 4.1.4.7 (Requirements During Specific Public Health Emergencies Declared by the Secretary
June 2021 4.7.1 of HHS) – Provided clarification about when electronic standardized study data are required as part of
a submission during a declared public health emergency
Section 8.2.2 (Support on Data Validation Rules) – Provided clarification regarding the submission of
4.7.2
August 2021 clinical study data with the use of simplified ts.xpt and technical rejection criteria
Section 4.1.3.2 (General Considerations) – Correction made to indicate that SEND datasets will not be
4.7.2.1 required for CBER submissions until March 15, 2023
Section 4.1.3.4 (Scope of SEND), Section 4.1.3.4.1 (Scope of SEND for SENDIGs v3.0 and v3.1),
September 2021 4.8 Section 4.1.3.4.2 (Scope of SEND for SENDIG – Animal Rule v1.0) – Provided clarification on the
expectation of SEND for studies listed in the referenced SENDIGs.
iv
Revision History
Date Version Summary of Revisions
October 2021 4.8.1 Footnotes were updated to fix web links and other typos and formatting issues.
Section 3.3.3 (Dataset Column Length) – Provided clarification
Section 3.3.7 (Variable and Dataset Labels) – Provided clarification
Section 4.1.1.2 (SDTM General Considerations) – Provided clarification
Section 4.1.1.3 (SDTM Domain Specifications) – Provided clarification
Section 4.1.3.2 (General Considerations) – Provided clarification
Section 5.3 – Added section “List of Technical Specifications Documents”;
Added technical rejection criteria for study data documentation information in the following:
March 2022 4.9 Section 7.1
Section 8.2.2
Appendix F
Appendix G
Section 7.2 (Electronic File Directory) – Added this section ‘to focus on recommended file folder
structure
Appendix B – Provided clarification
Appendix C – Updated table
Document first paragraph – Added link to Docket for entering comments on this document
Purpose – Added clarification to the use of the word ‘require’
Footnotes – Updated links in footnotes referring to FDA Guidance documents
Section 4.1.2.10 – Clarified language concerning acceptable file extensions that aligns with eCTD file
format specification
Section 4.1.3.3 – Under BG Domain (Body Weight Gain), removed ‘CDER’ as this applies to both
October 2022 5.0 CBER and CDER submissions
Section 4.1.4 – General Considerations: SDTM, SEND, and/or AdaM; clarified the use of the word
‘required’
Section 4.1.4.1 – Clarified headings
Section 6.5.1.1 – Updated the link to the document entitled Established Pharmacologic Class Text
Phrase
Appendix B and C – Clarified use of TS Parameters
Footnotes – Updated links in footnotes referring to FDA Guidance documents
Footnotes – Added footnotes relating to the expectation of SEND datasets for nonclinical EFD studies
Section 3.3.1 (v5 Transport Format) – Heading changed, content updated
Section 4.1.1.3 (SDTM Domain Specifications) – Addition of Immunogenicity Domain (IS)
Section 4.1.2.6 (Key Efficacy and Safety Data) – Language added regarding analysis of
immunogenicity
Section 4.1.3.1 (Definition) – SEND definition updated to match the SDTM definition under Section
4.1.1.1
Section 4.1.3.3 (SEND Domain Specification) – Language updated
Section 4.1.3.4.1 (Scope of SEND for SENDIGv3.0 and v3.1) – Language updated to match the Scope
March 2023 5.1 of SENDIG-DARTv.1.1
Section 4.1.3.4.3 (Scope of SEND for SENDIG-DARTv1.1 for CDER) – Provided clarification on the
expectation of SEND
Section 4.1.4.5 (Data Definition Files for SDTM, SEND, and AdaM) – Updated language for Define-
XML
Section 6.5.1.1 (General Considerations) – Updated for clarity
Appendix C – Updated link for MED-RT terminology
Appendix F – Table 6 updated for clarity
Glossary – Addition of MED-RT
Section 4.1.4.7.1 (SEND Requirements During the COVID-19 Public Health Emergency) – Updated
language due to the expiration of the COVID-19 PHE
May 2023 5.2 Appendix H: HHS Declared Public Health Emergencies and Modifications to Data Standards
Requirements – Added
May 2023 5.3 Section 4.1.4.5 (Data Definition Files for SDTM, SEND, and AdaM) – Updated language
June 2023 5.4 Section 4.1.1.3 (SDTM Domain Specification) – Updated language under LB Domain (Laboratory)
v
Revision History
Date Version Summary of Revisions
Footnotes – Added links to the FDA Data Standards Catalog and SAS Help Center
Section 3.3.2 (Dataset Size) – Updated for clarity
Section 3.3.3 (Dataset Column Length) – Updated for clarity
Section 4.1.1.3 (SDTM Domain Specifications) – Added PC Domain (Pharmacokinetic Concentration) and
PP Domain (Pharmacokinetic Parameters)
Section 4.1.1.3 (SDTM Domain Specifications) – Added SV Domain (Subject Visits)
Section 4.1.3.2 (General Considerations) – Updated for clarity
Section 4.1.3.3 (SEND Domain Specifications) – PC Domain (Pharmacokinetic Concentration) updated for
clarity
Section 4.1.3.3 (SEND Domain Specifications) – Updated Custom Domains to include SENDIGv3.1.1
Section 4.1.3.4.1 (Scope of SEND for SENDIGs v3.0, v3.1 and v3.1.1) – Updated to include mentions of
SENDIGv3.1.1
Section 4.1.3.4.1 (Scope of SEND for SENDIGs v3.0, v3.1 and v3.1.1) – Section C updated for clarity.
Section 4.1.3.4.3 (Scope of SEND for SENDIG-DARTv1.1 for CDER) – Sections H, I, J, and K, updated
October 2023 5.5 for clarity
Section 4.1.4.1 (Variables in SDTM and SEND: CDISC Required, Expected, and Permissible) – Updated to
address baselines
Section 4.1.4.7.1 (SEND Requirements During the COVID-19 Public Health Emergency) – Language added
to address the end of the 180-day wind-down period for the modification to the SEND requirement
Section 6.1.3 (Maintenance of Controlled Terminologies) – Updated for clarity
Section 6.5.1.1 (General Considerations) – Language added to reflect CDER preferences
Appendix C – Added the following TSPARMCDs: PPTCNAM, PPTEGID, PPTEGSYM, PPTMDA
Appendix D – Updated list of SDO properties that align or do not align with current CBER and CDER
business needs
Appendix G – Updated for clarity
Appendix H – Language added to address the end of the 180-day wind-down period for the modification to
the SEND requirement
Glossary – Addition of PHE and SDO
Section 4.1.1.3 (SDTM Domain Specifications) – Updated to address LOINC
Section 4.1.3.4.2 (Scope of SEND for SENDIG-Animal Rule v1.0) – Updated for clarity
December 2023 5.6 Section 5.3 (List of FDA Technical Specification Documents) – Section updated
Section 6.7 (Laboratory Tests) – Updated for clarity
Section 4.1.1.3 (SDTM Domain Specifications) – Updates for LB and LC Domain
Section 4.1.3.2 (General Considerations) – Addressed ELTM and DOSDUR
Section 4.1.3.3 (SEND Domain Specifications) – Updates for LB Domain
Section 4.1.4.3 (Naming Conventions in SDTM and SEND) – Addressed naming for drugs and metabolites
March 2024 5.7 Section 8.2.2.3 (Technical Rejection Criteria and Use of a Simplified ts.xpt for Nonclinical Studies) –
Language updated
Appendix B – Added new TSPARMCD
Appendix C – Added reference to Section 4.1.3.2 for the DOSDUR TSPARMCD
Appendix D – Updated lists
vi
Table of Contents
1. INTRODUCTION............................................................................................................... 1
1.1 BACKGROUND ...................................................................................................................................... 1
1.2 PURPOSE ............................................................................................................................................. 1
1.3 DOCUMENT REVISION AND CONTROL ........................................................................................................ 2
1.4 ORGANIZATION AND SUMMARY OF THE GUIDE ........................................................................................... 2
1.5 RELATIONSHIP TO OTHER DOCUMENTS ..................................................................................................... 3
2. PLANNING AND PROVIDING STANDARDIZED STUDY DATA .............................................. 4
2.1 STUDY DATA STANDARDIZATION PLAN ...................................................................................................... 4
2.2 STUDY DATA REVIEWER’S GUIDES ............................................................................................................ 5
2.2.1 SDRG for Clinical Data ........................................................................................................ 5
2.2.2 SDRG for Nonclinical Data.................................................................................................. 5
2.3 ANALYSIS DATA REVIEWER’S GUIDE .......................................................................................................... 5
3. EXCHANGE FORMAT – ELECTRONIC SUBMISSIONS........................................................... 6
3.1 EXTENSIBLE MARK-UP LANGUAGE ............................................................................................................ 6
3.2 PORTABLE DOCUMENT FORMAT .............................................................................................................. 6
3.3 FILE TRANSPORT FORMAT ....................................................................................................................... 6
3.3.1 v5 Transport Format .......................................................................................................... 6
3.3.2 Dataset Size........................................................................................................................ 7
3.3.3 Dataset Column Length...................................................................................................... 7
3.3.4 Variable and Dataset Descriptor Length ............................................................................ 7
3.3.5 Special Characters: Variables and Datasets ....................................................................... 8
3.3.6 Variable and Dataset Names ............................................................................................. 8
3.3.7 Variable and Dataset Labels .............................................................................................. 8
4. STUDY DATA SUBMISSION FORMAT – CLINICAL AND NONCLINICAL ................................. 8
4.1 CLINICAL DATA INTERCHANGE STANDARDS CONSORTIUM ............................................................................. 8
4.1.1 Study Data Tabulation Model ............................................................................................ 9
Definition ..................................................................................................................................... 9
SDTM General Considerations ..................................................................................................... 9
SDTM Domain Specifications ..................................................................................................... 10
4.1.2 Analysis Data Model ........................................................................................................ 15
Definition ................................................................................................................................... 15
General Considerations .............................................................................................................. 16
Dataset Labels ............................................................................................................................ 16
Subject-Level Analysis Data........................................................................................................ 16
Core Variables ............................................................................................................................ 16
Key Efficacy and Safety Data ...................................................................................................... 17
Timing Variables ......................................................................................................................... 17
Numeric Date Variables ............................................................................................................. 17
Imputed Data ............................................................................................................................. 17
Software Programs..................................................................................................................... 17
4.1.3 Standard for Exchange of Nonclinical Data ..................................................................... 18
Definition ................................................................................................................................... 18
General Considerations .............................................................................................................. 18
SEND Domain Specification ........................................................................................................ 20
Scope of SEND ............................................................................................................................ 24
4.1.4 General Considerations: SDTM, SEND, and/or ADaM ...................................................... 30
Variables in SDTM and SEND: CDISC Required, Expected, and Permissible ............................... 30
Dates in SDTM and SEND ........................................................................................................... 31
Naming Conventions in SDTM and SEND ................................................................................... 31
vii
SDTM and SEND Versions .......................................................................................................... 31
Data Definition Files for SDTM, SEND, and ADaM...................................................................... 31
Annotated Case Report Form (aCRF) for SDTM ......................................................................... 32
Modification of Requirements During Specific Public Health Emergencies Declared by the
Secretary of HHS ............................................................................................................................................. 32
viii
6.1.1 Controlled Terminologies ................................................................................................. 38
6.1.2 Use of Controlled Terminologies ...................................................................................... 38
Use of the Specific Controlled Term ‘OTHER’............................................................................. 39
6.1.3 Maintenance of Controlled Terminologies ....................................................................... 39
6.2 CDISC CONTROLLED TERMINOLOGY ....................................................................................................... 40
6.3 ADVERSE EVENTS ................................................................................................................................ 40
6.3.1 MedDRA ........................................................................................................................... 40
General Considerations .............................................................................................................. 40
6.4 MEDICATIONS .................................................................................................................................... 41
6.4.1 FDA Unique Ingredient Identifier ..................................................................................... 41
General Considerations .............................................................................................................. 41
6.4.2 WHODrug Global ............................................................................................................. 41
General Considerations .............................................................................................................. 41
6.5 PHARMACOLOGIC CLASS ....................................................................................................................... 42
6.5.1 Medication Reference Terminology ................................................................................. 42
General Considerations .............................................................................................................. 42
6.6 INDICATION ........................................................................................................................................ 43
6.6.1 SNOMED CT...................................................................................................................... 43
General Considerations .............................................................................................................. 43
6.7 LABORATORY TESTS ............................................................................................................................. 43
6.7.1 LOINC ............................................................................................................................... 43
General Considerations .............................................................................................................. 43
7. ELECTRONIC SUBMISSION FORMAT ............................................................................... 44
7.1 ECTD SPECIFICATIONS..........................................................................................................................
44
7.2 ELECTRONIC FILE DIRECTORY ................................................................................................................. 45
7.3 ECTD SAMPLE SUBMISSION .................................................................................................................. 48
ix
APPENDIX H: HHS DECLARED PUBLIC HEALTH EMERGENCIES AND MODIFICATIONS TO DATA
STANDARDS REQUIREMENTS............................................................................................................... 73
GLOSSARY 76
x
Contains Nonbinding Recommendations
STUDY DATA
TECHNICAL CONFORMANCE GUIDE
This technical specifications document represents the Food and Drug Administration's
(FDA's) current thinking on this topic. It does not create or confer any rights for or on
any person and does not operate to bind FDA or the public. You can use an alternative
approach if the approach satisfies the requirements of the applicable statutes and
regulations. If you want to discuss an alternative approach, contact the FDA staff
responsible for implementing this guidance. If you cannot identify the appropriate FDA
staff, send an email to [email protected] or [email protected]. You can
submit comments to this document online at https://www.regulations.gov and searching
Docket No. FDA-2018-D-12160002.
1. Introduction
1.1 Background
This Study Data Technical Conformance Guide (Guide) provides specifications,
recommendations, and general considerations on how to submit standardized study data
using FDA-supported 1 data standards located in the FDA Data Standards Catalog
(Catalog). 2 The Guide supplements the guidance for industry Providing Regulatory
Submissions in Electronic Format — Standardized Study Data (eStudy Data). The eStudy
Data guidance implements the electronic submission requirements of section 745A(a) of
the Food, Drug, & Cosmetic (FD&C) Act with respect to standardized study data
contained in certain investigational new drug applications (INDs), new drug applications
(NDAs); abbreviated new drug applications (ANDAs); and certain biologics license
applications (BLAs) that are submitted to the Center for Drug Evaluation and Research
(CDER) or the Center for Biologics Evaluation and Research (CBER). 3
1.2 Purpose
This Guide provides technical recommendations to sponsors 4 for the submission of
animal and human study data and related information in a standardized electronic format
in INDs, NDAs, ANDAs, and BLAs. 5 The Guide is intended to complement and promote
interactions between sponsors and FDA review divisions. However, it is not intended to
replace the need for sponsors to communicate directly with review divisions regarding
implementation approaches or issues relating to data standards.
1
For the purposes of this document, “supported” means the receiving Center has established processes and
technology to support receiving, processing, reviewing, and archiving files in the specified file format.
2
Available at http://www.fda.gov/eStudyResources.
3
See Providing Regulatory Submissions in Electronic Format — Standardized Study Data (section II.A)
available at http://www.fda.gov/eStudyResources.
4
For the purposes of this document, the term “sponsor” refers to both “sponsors” and “applicants” who are
submitting study data to the Agency.
5
Docket No. FDA-2018-D-1216
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 1 of 77 March 2024
Contains Nonbinding Recommendations
Not every data element included in a standard’s underlying data model is fit for purpose
for every trial. The use of the word ‘required’ in this document generally indicates a
requirement by the Agency and not any external organization. Any use of the word
‘required’ that would have a different meaning will be explained in the text. For example,
the Study Data Tabulation Model Implementation Guide (SDTMIG) 6 classifies variables
as required, expected, or permissible. This use of the word ‘required’ by the Standards
Data Organization does not necessarily indicate an Agency requirement. What data are
collected and submitted is a decision that should be made based on scientific reasons,
regulation requirements, and discussions with the review division. However, all study-
specific data necessary to evaluate the safety and efficacy of the medical product should
be submitted in conformance with the standards currently supported by FDA and listed in
the Catalog.
This document applies to submissions to CDER and CBER, however some review offices
and multi-disciplinary review teams may have specific technical guidance which provides
additional details on preparing and submitting information that may differ from this
document. In those cases the specific technical guidance should be followed instead of
the information contained herein. If there is a question regarding a specific submission or
a particular data standard implementation, the sponsor should contact the review division
for specific submission questions or the appropriate contact for data standards issues
([email protected] or [email protected]).
This Guide supersedes all previous Study Data Specifications documents (Versions 1.0 -
2.0) and CDER Study Data Common Issues Documents (Versions 1.0 -1.1).
6
See http://www.cdisc.org.
7
The Standards Web page can be accessed at http://www.fda.gov/eStudyResources.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 2 of 77 March 2024
Contains Nonbinding Recommendations
In addition, for clinical studies that will be submitted to CBER, the SDSP appendix
should be provided to the review office no later than the End-of-Phase 2 (EOP2) meeting.
The CBER SDSP appendix should include tables of proposed SDTM domain/variable
usage, supplemental domain usage and proposed analysis.
8
See https://www.fda.gov/drugs/electronic-regulatory-submission-and-review/electronic-common-
technical-document-ectd
9
Available at
https://www.fda.gov/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/default.htm
10
Available at
https://www.fda.gov/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/default.htm
11
A specific template for a Study Data Standardization Plan is not specified. However, an example can be
found at https://advance.phuse.global/display/WEL/Deliverables. The PhUSE SDSP template has been reviewed
by FDA and published in the Federal Register
https://www.federalregister.gov/documents/2016/11/08/2016-26913/intent-to-review-a-study-data-
standardization-plan-template-notice-of-availability-establishment-of. FDA prefers but does not require its
use.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 4 of 77 March 2024
Contains Nonbinding Recommendations
There are two study data reviewer guides (SDRG): clinical and nonclinical. The SDRG
for nonclinical studies (nSDRG) 13 and SDRG for clinical studies (cSDRG) 14 should be
placed with the study data in Module 4 and 5, respectively, in the eCTD. 15 The SDRG
should be file-tagged as ‘data-tabulation-data-definition’, with a clear leaf title.
An SDRG for clinical data should be named cSDRG (the prefix ‘c’ designates ‘clinical’)
and the document should be named ‘csdrg’ and provided as a PDF file upon submission
(csdrg.pdf).
An SDRG for nonclinical data should be named nsdrg (the prefix ‘n’ designates
‘nonclinical’) and the document should be named ‘nsdrg’ and provided as a PDF file
upon submission (nsdrg.pdf).
12
For the purposes of this document, the term ‘Reviewer Guide’ refers only to those located in the m4 or
m5 eCTD folders.
13
A specific template for a Study Data Reviewer’s Guide for nonclinical studies is not specified. However,
an example can be found at https://advance.phuse.global/display/WEL/Deliverables. The PhUSE nSDRG
template has been reviewed by FDA and published in the Federal Register
https://www.federalregister.gov/documents/2015/07/23/2015-18027/intent-to-review-a-study-data-
reviewers-guide-template. FDA prefers but does not require its use.
14
A specific template for a Study Data Reviewer’s Guide for clinical studies is not specified. However, an
example can be found at https://advance.phuse.global/display/WEL/Deliverables. The PhUSE cSDRG template
has been reviewed by FDA and published in the Federal Register
https://www.federalregister.gov/documents/2016/03/04/2016-04791/intent-to-review-a-nonclinical-study-
data-reviewers-guide-template. FDA prefers but does not require its use.
15
The Study Data Reviewer’s Guides are separate documents from an overall reviewer’s guide which is
placed in Module 1 of the eCTD.
16
A specific template for an Analysis Data Reviewer’s Guide is not specified. However, an example can be
found at https://advance.phuse.global/display/WEL/Deliverables.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 5 of 77 March 2024
Contains Nonbinding Recommendations
other submission documentation (e.g., the protocol, statistical analysis plan (SAP),
clinical study report, define.xml) in order to provide FDA reviewers with a single point of
orientation to the analysis datasets. It should be noted that the submission of an ADRG
does not eliminate the requirement to submit a complete and informative define.xml file
corresponding to the analysis datasets.
• The ADRG for a clinical study should be placed with the analysis data in Module
5 of the eCTD. The ADRG should be file-tagged as ‘analysis-data-definition’,
with a clear leaf title.
• An ADRG for clinical data should be called an ADRG and the document should
be a PDF file ‘adrg.pdf’ upon submission.
The Transport Format (XPORT) Version 5 is the file format for the submission of all
electronic datasets. 22 XPORT is an open file format published by SAS Institute for the
exchange of study data. Data can be translated to and from XPORT to other commonly
used formats without the use of programs from any specific vendor. There should be one
dataset per transport file, and the dataset in the transport file should be named the same as
the transport file (e.g., ‘ae’ and ae.xpt, ‘suppae’ and suppae.xpt, ‘lb1’ and lb1.xpt).
17
See http://en.wikipedia.org/wiki/XML.
18
See http://www.w3.org/XML/.
19
Adobe Systems Incorporated, PDF Reference, sixth edition, version 1, Nov. 2006, p. 33.
20
See http://www.ich.org/products/electronic-standards.html.
21
Available at http://www.fda.gov/ectd
22
See http://www.sas.com
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 6 of 77 March 2024
Contains Nonbinding Recommendations
XPORT files can be created by the COPY Procedure in SAS Version 5 and higher of the
SAS Software. SAS Transport files processed by the SAS CPORT cannot be reviewed,
processed, or archived by FDA. Sponsors can find the record layout for SAS XPORT
transport files through SAS technical document TS-140. 23 All SAS XPORT transport
files should use .xpt as the file extension, and the files should not be compressed. Note
also that SAS custom formats should NOT be used in submissions to the FDA.
Each dataset should be provided in a single transport file. The maximum size of an
individual dataset that FDA can process depends on many factors. Datasets greater than 5
gigabytes (GB) in size should be split into smaller datasets no larger than 5 GB. Sponsors
should submit these smaller datasets, in addition to the larger non-split datasets, to better
support regulatory reviewers. The split datasets should be placed in a separate sub-
directory labeled ‘split’ (See section 7.2). A clear explanation regarding how these
datasets were split needs to be presented within the relevant data RG.
The allotted length for each column containing character (text) data should be set to
the maximum length of the variable used across all datasets in the study except for
suppqual datasets. For suppqual datasets, the allotted length for each column
containing character (text) data should be set to the maximum length of the variable
used in the individual dataset. This will significantly reduce file sizes. For example, if
USUBJID has a maximum length of 18, the USUBJID’s column size should be set to
18, not 200. If datasets are split according to section 3.3.2, reduce variable length
before datasets are split. Care should be taken to avoid accidental truncation of data
through dataset merges.
The length of variable names, descriptive labels, and dataset labels should not exceed the
maximum permissible number of characters described in Table 1.
23
http://support.sas.com/techsup/technote/ts140.pdf
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 7 of 77 March 2024
Contains Nonbinding Recommendations
Variable names, as well as variable and dataset labels should include American Standard
Code for Information Interchange (ASCII) text codes only. Variable values are the most
broadly compatible with software and operating systems when they are restricted to
ASCII text codes (printable values below 128). Use UTF-8 for extending character sets;
however, the use of extended mappings is not recommended. Transcoding errors, variable
length errors, and lack of software support for multi byte UTF-8 encodings can result in
incorrect character display and variable value truncations. Ensure that LBSTRESC and
controlled terminology extensions in LBTEST do not contain byte values 160-191 as
some character mappings in that range may interfere with agency processes.
Please implement the SDO technical specifications when preparing your datasets for
submission. Datasets submitted in legacy formats should follow the conventions
described in other documents. 24
Use V7 as the valid variable name option (VALIDVARNAME) and extend for the valid
member name option (VALIDMEMNAME).
Do not submit study data with the following special characters in variable and dataset
labels:
1. Unbalanced apostrophe, e.g., “Parkinson's”
2. Unbalanced single and double quotation marks
3. Unbalanced parentheses, braces or brackets, e.g.,‘(‘, ‘{‘and ‘[‘
Data format specifications for the tabulation datasets of clinical and nonclinical
toxicology studies are provided by SDTM and SEND, respectively, while data format
24
https://documentation.sas.com/doc/en/lrcon/9.4/p18cdcs4v5wd2dn1q0x296d3qek6.htm#n0nfuxt1u6y9jin12
0gszztjh00o
https://documentation.sas.com/doc/en/lesysoptsref/9.4/p124dqdk8zoqu3n1r4nsfqu5vx52.htm
25
See http://www.cdisc.org.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 8 of 77 March 2024
Contains Nonbinding Recommendations
specifications for the analysis datasets of clinical studies are provided by ADaM. It
should be noted that data format specifications for the analysis datasets of nonclinical
toxicology studies have not been developed. As noted in section 1.1, the Catalog provides
a listing of the currently supported data standards with links to reference materials. For
the purposes of this Guide, the terms SDTM, ADaM, and SEND apply to versions only
listed and supported by FDA in the Catalog.
Although the SDTM and SEND formats facilitate review of the data, they do not always
provide the data structured in a way that supports all analyses needed for review.
Analysis files are critical for FDA to understand, on a per subject basis, how the specific
analyses contained in the study report have been created. Therefore, sponsors should
supplement the SDTM with ADaM analysis datasets as described below.
Definition
The Study Data Tabulation Model (SDTM) defines a standard structure for human
clinical trials tabulation datasets.
SDTM General Considerations
It is recommended that sponsors implement the SDTM standard for representation of
clinical trial tabulation data prior to the conduct of the study.
The SDTMIG should be followed unless otherwise indicated in this Guide or in the
Catalog. The conformance criteria listed in the SDTMIG should not be interpreted as the
sole determinant of the adequacy of submitted data. If there is uncertainty regarding
implementation, the sponsor should discuss application-specific questions with the
review division and general standards implementation questions with the specific center
resources identified elsewhere in this Guide (See section 1.2). Each submitted SDTM
dataset should have its contents described with complete metadata in the define.xml file
(See section 4.1.4.5) and within the cSDRG as appropriate (See section 2.2). When
updated datasets (e.g., ‘ae.xpt’, ‘lb.xpt’) are submitted, updated and complete define.xml
and cSDRG covering all datasets should be submitted using the “replace” lifecycle
operator to update the original file.
Except for variables that are defined in the SDTMIG as being coded, numerically coded
variables typically are not submitted as part of the SDTM datasets. Numeric values
generated from validated scoring instruments or questionnaires do not represent codes,
and therefore have no relevance for this issue. There may be special instances when
codes are preferred, hence sponsors should refer to the review division for direction, if
there are any questions.
Sponsors should not add leading or trailing spaces to the USUBJID variable in any
dataset. For example, applications have been previously submitted in which the
USUBJID variable for each individual subject appeared to be the same across datasets;
however, in certain datasets, the actual entry had leading zeros added, or zeros added
elsewhere in the entry. This does not allow for machine-readable matching of individual
subject data across all datasets. Improper implementation of the USUBJID variable is a
common error with applications and often requires sponsors to re-submit their data.
Adjudication Data
There are no existing standards or best practices for the representation of adjudication
data as part of a standard data submission. Until standards for adjudication data are
developed, it is advised that sponsors discuss their proposed approach with the review
division and also include details about the presence, implementation approach, and
location of adjudication data in the SDRG.
Whenever adjudication data are provided, they should be clearly identified so that the
reviewer can distinguish the results of adjudication from data as originally collected.
SDTM Domain Specifications
SUPPQUAL (Supplemental Qualifier)
A SUPPQUAL dataset is a special SDTM dataset that contains non-standard variables
which cannot be represented in the existing SDTM domains. SUPPQUAL should be used
26
CDISC, https://www.cdisc.org/standards/foundational
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 10 of 77 March 2024
Contains Nonbinding Recommendations
only when key data cannot be represented in SDTM domains. In general, variables used
to support key analyses should not be represented in SUPPQUAL. Discussion with the
review division should occur if the sponsor intends to include important variables (e.g.,
that support key analyses) in SUPPQUAL datasets, and this should be reflected in the
SDRG.
DM Domain (Demographics)
In the DM domain, each subject should have only one single record per study.
Screen failures, when provided, should be included as a record in DM with the ARM,
ARMCD, ACTARM, and ACTARMCD field left blank. For subjects who are
randomized in treatment group but not treated, the planned arm variables (ARM and
ARMCD) should be populated, but actual treatment arm variables (ACTARM and
ACTARMCD) should be left blank. 27
For subjects with multiple enrollments within a single study, the primary enrollment
should be submitted in DM. Additional enrollments should be included in a custom
domain with a similar structure to DM. Clarifying statements in the RG would be helpful.
For subjects with multiple screenings and no subsequent enrollment, include the primary
screening in DM with additional screenings in a custom domain with a structure similar
to DM.
For subjects with multiple screenings and subsequent enrollment, include the enrollment
in DM with screenings in a custom domain with a structure similar to DM.
DS Domain (Disposition)
When there is more than one disposition event, the EPOCH or DSSCAT variable should
be used to aid in distinguishing between them. This will allow identification of the
EPOCH in which each event occurred or DSSCAT to differentiate if the disposition is for
treatment or study. If a death of any type occurs, it should be the last record and should
include its associated EPOCH. It is expected that EPOCH variable values will be
determined based on the trial design and thus should be defined clearly and documented
in the define.xml.
27
Although this convention is inconsistent with the SDTMIG, FDA recommends its use so that ‘Screen
Failure’, ‘Not assigned’, and ‘Not treated’ are not specified as a treatment arm.
28
Technical Specification Document(s) can be found on the FDA Study Data Standards Resources
webpage, https://www.fda.gov/industry/fda-resources-data-standards/study-data-standards-resources.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 11 of 77 March 2024
Contains Nonbinding Recommendations
treatment emergent adverse events should be agreed upon with the review division and
specified in the protocol (e.g., any AE after first dose of investigational product
administration, or any AE after first dose of investigational product administration until
X days after the last dose).
The entry of a ‘Y’ for the serious adverse event variable, AESER, should have the
assessment indicated (e.g., as a death, hospitalization, or disability/permanent damage).
Frequently, sponsors omit the assessment information, even when it has been collected on
the CRF. The criteria that led to the determination should be provided. This information
is critical during FDA review to support the characterization of serious AEs.
Custom Domains
The SDTMIG permits the creation of custom domains if the data do not fit into an
existing domain. Prior to creating a custom domain, sponsors should confirm that the data
do not fit into an existing domain. If it is necessary to create custom domains, sponsors
should follow the recommendations in the SDTMIG. In addition, sponsors should present
their implementation approach in the cSDRG. To provide study data that do not fit into
an existing SDTM domain or draft SDTM domain, consider creating a custom dataset
aligned with the Study Data Tabulation Model (SDTM). Questions about custom
domains should be addressed in pre-submission meetings and documented in the SDSP.
All planned samples should have a record in this domain. If a sample analysis was
planned but no result is available, a reason for why the test was not done should be
included in PCREASND. Values for the lower limit of quantitation should be included in
PCLLOQ and correspond to those units in PCSTRESU. Naming for all timepoints and
visits should be consistent with the data provided in the clinical study report. Naming for
all analytes should be consistent with the data represented in the pharmacokinetic
parameters (PP) domain. If data for an analyte per subject is present in PP, there should
be information for that analyte per subject per sample in PC.
Naming for all visits should be consistent with the data provided in the clinical study
report. Naming for all analytes should be consistent with the data represented in the
pharmacokinetic concentrations (PC) domain.
useable. Sponsors should submit these smaller files in addition to the larger non-split
standard LB domain file. Sponsors should submit the split files in a separate sub-
directory/split that is clearly documented in addition to the non-split standard LB domain
file in the SDTM datasets directory (See section 7).
For clinical studies, please submit two separate domains for lab results. The LB domain
should contain SI units in LBSTRESU for the SI results in the LBSTRESC and
LBSTRESN fields. An additional custom domain called LC structured identically to LB
should contain conventional units in --STRESU for the results in conventional units in
the --STRESC and --STRESN variables. It is ideal if both conventional and SI units come
directly from the lab vendor. Submit the results of all tests obtained on subjects, including
the results from unscheduled tests or visits, and results obtained from local laboratories.
Identify all reference ranges used for specific populations in the SDRG and ADRG.
When submitting a full ts.xpt, please refer to the appendix section for a list of study
parameters that should be submitted where relevant for clinical studies. Additional
parameters may be included beyond those listed in the appendix. For clinical studies,
study start date (SSTDTC) is the earliest date of informed consent among any subject that
enrolled in the study. 29
In addition to the study parameters indicated in the appendix section, if the study data
submitted follows a Therapeutic Area User Guide (TAUG) or an FDA Technical
Specification, 30 use the values for TSPARM/TSPARMCD and TSVAL from the table
below in the TS domain. Use of these parameters in TS will allow for tracking and
reporting on the submission rates of study data following a particular TAUG or technical
specification. At this time, it is also helpful to include the version of the CDISC
29
Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/providing-
regulatory-submissions-electronic-format-standardized-study-data.
30
https://www.fda.gov/forindustry/datastandards/studydatastandards/default.htm
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 13 of 77 March 2024
Contains Nonbinding Recommendations
implementation guide (IG) and model used using the parameters indicated in the table
below.
QS Domain (Questionnaires)
Some items in an instrument may be logically skipped per the instrument’s instructions.
Responses for logically skipped items should be (1) recorded and/or scored according to
the instructions provided in the instrument’s user manual, scoring manual, or other
documentation provided by the instrument developer and (2) included in the submission
dataset.
If instructions on how to record and/or score responses to logically skipped items are
available from the instrument developer, then records for logically skipped items should
be included in the submission dataset with the following:
• QSSTAT = ‘NOT DONE’;
• QSREASND = ‘LOGICALLY SKIPPED ITEM’; and
If instructions on how to record and/or score responses to logically skipped items are not
available from the instrument developer, then records for logically skipped items should
be included in the submission dataset with the following:
• QSSTAT = ‘NOT DONE’;
• QSREASND = ‘LOGICALLY SKIPPED ITEM’; and
• QSORRES, QSSTRESC, and QSSTRESN all set to null.
Definition
Specifications for analysis datasets for human drug product clinical studies are provided
by the Analysis Data Model (ADaM) and its implementation by the Analysis Data Model
Implementation Guide (ADaMIG). ADaM datasets should be used to create and to
support the results in clinical study reports (CSRs), Integrated Summaries of Safety (ISS),
and Integrated Summaries of Efficacy (ISE), as well as other analyses required for a
thorough regulatory review. ADaM datasets can contain imputed data or data derived
from SDTM datasets.
General Considerations
Generally, ADaM assists FDA review. However, it does not always provide data
structured in a way that supports all of the analyses that should be submitted for review.
For example, ADaM structures do not support simultaneous analysis of multiple
dependent variables or correlation analysis across several response variables. Therefore,
sponsors should, as needed, supplement their ADaM datasets after discussions with the
specific review division.
One of the expected benefits of analysis datasets that conform to ADaM is that they
simplify the programming steps necessary for performing an analysis. As noted above,
ADaM datasets should be derived from the data contained in the SDTM datasets. There
are features built into the ADaM standard that promote traceability from analysis results
to ADaM datasets and from ADaM datasets to SDTM datasets. To ensure traceability, all
SDTM variables utilized for variable derivations in ADaM should be included in the
ADaM datasets when practical. Each submitted ADaM dataset should have its contents
described with complete metadata in the define.xml file (See section 4.1.4.5) and within
the ADRG as appropriate (See section 2.3).
Dataset Labels
Each dataset should be described by an internal label that is shown in the define.xml file.
The label names of ADaM datasets should be different from those of the SDTM datasets.
For example, the SDTM adverse event dataset (i.e., AE) and the ADaM adverse event
dataset (i.e., ADAE) should not share the exact same dataset label, such as “Adverse
Events.”
Subject-Level Analysis Data
Subject-Level Analysis Data (ADSL) is the subject-level analysis dataset for ADaM. All
submissions containing standard analysis data should contain an ADSL file for each
study. In addition to the variables specified for ADSL in the ADaMIG, such as those
listed below in the core variables section (See section 4.1.2.5), the sponsor should include
multiple additional variables representing various important baseline subject
characteristics / covariates presented in the study protocol. Some examples of baseline
characteristics / covariates for drug studies include, but are not limited to, disease severity
scores such as Acute Physiology and Chronic Health Evaluation (APACHE) scores, 31
baseline organ function measurements such as calculated creatinine clearance or Forced
Expiratory Volume in 1 second (FEV1), range categories for continuous variables, and
numeric date variables in non-International Standards Organization (ISO) formats. Some
examples of baseline characteristics for vaccine studies include, but are not limited to,
past medical history (e.g., prior infection history), immunosuppressive conditions, prior
vaccination history and concomitant medications/vaccines.
Core Variables
Core variables, which include covariates presented in the study protocol that are
necessary to analyze data, should be included in each ADaM dataset, and are typically
already included in the ADSL dataset (See section 4.1.2.4). The core variables included
31
Knaus WA, Draper EA, Wagner DP, Zimmerman JE (1985). “APACHE II: a severity of disease
classification system.” Critical Care Medicine, 13 (10): 818–829.29.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 16 of 77 March 2024
Contains Nonbinding Recommendations
in an ADaM dataset should be necessary for the analysis need in that dataset. Examples
of core variables include study/protocol number, center/site number, geographic region,
country, treatment assignment information, sex, age, race, analysis population flags (e.g.,
Intent-to-Treat (ITTFL), Full Analysis Set (FASFL), Safety (SAFFL), and Per-Protocol
(PPROTFL)), and other important baseline demographic variables. Note that all variables
that contain coded data should be accompanied by a variable that provides the decoded
information.
In addition, it is important to note that SDTM datasets do not have core variables (such as
demographic and population variables) repeated across the different domains. The
duplication of core variables across various domains can be fulfilled through their
inclusion in the corresponding analysis datasets. For example, the SDTM AE dataset does
not allow for the inclusion of variables such as treatment arm, sex, age, or race. These
and other variables should be included in the adverse event ADaM dataset (i.e., ADAE).
Key Efficacy and Safety Data
Sponsors should submit ADaM datasets to support efficacy and safety analyses
(including analysis of immunogenicity). At least one dataset should be referenced in the
data definition file as containing the primary efficacy variables. Further, variables and
parameters pertaining to the primary and secondary endpoints of a study, along with their
derivations (as applicable), should be provided as well as documented appropriately (i.e.,
variable-level metadata or parameter value-level metadata) in the data definition file.
Timing Variables
A variable for relative day of measurement or event, along with timing variables for visit,
should be included when an ADaM dataset contains multiple records per subject (i.e.,
repeated measures data).
Numeric Date Variables
Numeric date variables are needed for analysis and review purposes. Apply formats to all
numeric date variables using a format that is understandable by SAS XPORT Version 5
files as per section 3.3.1 above. The software specific (as opposed to study specific) date
of reference used to calculate numeric dates should be specified within the ADRG. In the
event of partial dates, imputation should be performed only for dates required for analysis
according to the SAP, and appropriate corresponding ADaM imputation flags should be
utilized. When numeric time or date time variables are needed, all considerations apply as
previously discussed for numeric dates.
For traceability purposes, SDTM character dates formatted as ISO 8601 should also be
included in the ADaM datasets.
Imputed Data
When data imputation is utilized in ADaM, sponsors should submit the relevant
supporting documentation (i.e., define.xml and ADRG) explaining the imputation
methods.
Software Programs
Sponsors should provide the source code used to create all ADaM datasets, tables, and
figures associated with primary and secondary efficacy analyses. Sponsors should submit
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 17 of 77 March 2024
Contains Nonbinding Recommendations
source code in single byte ASCII text format. Files with MS Windows executable
extensions (.cmd, .com, and .exe) should NOT be submitted. For a list of acceptable file
extensions, refer to the document entitled Specifications for File Format Types Using
eCTD Specifications. 32
Furthermore, sponsors should submit the source code used to generate additional
information included in Section 14 CLINICAL STUDIES of the Prescribing
Information, 33 if applicable. The specific software utilized (version and operating system)
should be specified in the ADRG.
Definition
The Standard for Exchange of Nonclinical Data (SEND) provides the organization,
structure, and format of standard nonclinical tabulation datasets for regulatory
submission.
General Considerations
The SENDIG provides specific domain models, assumptions, and examples for preparing
standard tabulation datasets that are based on the SDTM model. If there is uncertainty
regarding SEND implementation, the sponsor should discuss the issue with the review
division.
The ideal time to implement SEND is prior to the conduct of the study as it is very
important that the results presented in the accompanying study report be traceable back to
the original data collected. Each submitted SEND dataset should have its contents
described with complete metadata in the define.xml file (See section 4.1.4.5) and within
the nSDRG as appropriate (See section 2.2).
For nonclinical studies, the define.xml StudyName element value should contain the
sponsor’s study identifier, consistent with the study identifier [study-id] used in the eCTD
study tagging file (STF) referenced under the appropriate subsection of Module 4; refer
to Section 7.1 for additional information about the STF. For studies outsourced to a
contract test facility, the alternate study identifier assigned to the study by the testing
facility, which is typically included in the STUDYID field of the SEND datasets, should
be included in the ProtocolName element value in define.xml.
For submissions to CBER and CDER, SEND datasets are required when submitting a
draft report as these data form the basis of regulatory decisions regarding nonclinical
support for clinical development in accordance with the dates specified in the Catalog. 34
If there are changes to the SEND datasets requiring resubmission with the final study
report, resubmit the updated datasets using the ‘replace’ operator. Information about
32
Available at https://www.fda.gov/media/85816/download.
33
Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/labeling-
human-prescription-drug-and-biological-products-implementing-plr-content-and-format.
34
https://www.fda.gov/regulatory-information/search-fda-guidance-documents/data-standards-catalog
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 18 of 77 March 2024
Contains Nonbinding Recommendations
using the ‘replace’ operator to update datasets can be found in Section 7.1. SEND
datasets would not need to be resubmitted with the final report if there were no changes
to the dataset from the draft report. Even when SEND datasets do not need to be
resubmitted, it is recommended that an updated nSDRG is submitted with the final study
report. This updated nSDRG should include the current study report version (Section
1.1), any date (or administrative) changes, and a notation that no changes to SEND
datasets were made or needed other than the notation of the version change (e.g.,
STRPSTAT change) after the draft report was submitted.
Sponsor should use the VISITDY or --NOMDY variable appropriate to the selected
SENDIG version to group observations for summary analysis. This includes grouping
animal data collected over multiple days for a single planned event.
For animals necropsied over multiple grace days for a single scheduled interim, terminal
or recovery termination event, the DS dataset VISITDY or DSNOMDY variables should
contain a single scheduled day for the event. Postmortem findings in DD, MA, MI, OM,
and TF for each planned termination event can then be analyzed together based on the DS
dataset VISITDY or DSNOMDY. When in-life observations such as terminal body
weight or clinical pathology sample collection are scheduled at the time of necropsy, the
VISITDY or --NOMDY associated with those observations should also contain the single
planned day for the termination event.
For other in-life observations, when the defined schedule for an observation covers
multiple days or the schedule is for a specific day but grace days allowed, and animals
are observed/tested over multiple days, VISITY or --NOMDY should contain a single
day under which the data should be grouped for analysis. Some examples:
• ECGs are scheduled for week 1, and some animals are tested on day 1, some
animals are tested on day 2, and some animals are tested on day 3, all animal ECG
results for week 1 should have a common VISITDY or EGNOMDY.
• Urinalysis is scheduled for day 15, but no urine was collected from one animal on
that day so the collection attempted again on day 16 and was successful. In the
Study Report, the data collected for the day 16 urine sample would be analyzed
with the day 15 sample results, so VISITDY or LBNOMDY for day 15 and 16
should be 15.
• Due to the number of animals on study, the protocol allows 1 grace day for
physical exams with vital signs scheduled for day 1. Some animals are examined
on day 1 and some on day 2. All physical exam and vital sign data should be
reported under VISTIDY or --NOMDY day 1.
For tests or observations scheduled relative to dose and having --TPTREF and --
RFTDTC should be filled to describe the dosing event, and --TPT, --TPTNUM and --
ELTM filled to describe the time relative to dose. VISITDY or --NOMDY should always
contain the dose day, not the day of the test or observation. VISITDY should be empty
for records with unscheduled tests or observations. In SENDIGv3.0, an empty VISITDY
identifies data collected for an unplanned event.
Whenever used, FOCID should be consistently represented across domains for the same
focus within a study.
Population of ELTM is preferred for pre-dose collections for our nonclinical visualization
tools. If there is only one pre-dose collection period, ELTM is PT0H to indicate that the
end of the period is immediately prior to dose. If there is more than one period prior to
dose, then two different ELTM values would be used to identify each of them separately.
DOSDUR should be used to indicate the longest planned duration of time between the
first day of dosing and the terminal sacrifice, regardless of the number of doses
administered subsequent to the first dose. For example, DOSDUR would be P28D
whether dosing occurred daily beginning on Day 1, only on Day 1, or biweekly on Days
1 and 14 when the terminal sacrifice is Day 28 (see the latter example in Fig. 1 below).
DOSDUR should not include the recovery phase, defined as the non-dosing period that
follows the main dosing phase of a study. For safety pharmacology studies, DOSDUR
would include the time after the dose was administered to the end of the planned
observation period if the animals are not sacrificed.
Currently, SUPPMA, SUPPMI, and SUPPTF should be used to capture some collected
information (e.g., pathology modifiers) as detailed in the SENDIG.
When using a CDISC CT version dated before 2018 and histopathology severity data are
collected on a severity scale that cannot be represented using the CDISC MISEV codelist
without a loss of scientific accuracy (e.g. data were collected on 3 levels or 4 levels but
MISEV specifies 5 levels), severity scores may be represented in MISEV as ‘1 OF 4’ ‘2
OF 4’ or ‘1 OF 3’as appropriate, where the first number is the score and the second is the
number of available severities in the scale. A score of 1 should be the least severe
finding. Extend the non-extensible MISEV codelist with the necessary terms to describe
the alternative severity scores, include these extended values in the define.xml and
nSDRG, and explain any resulting validation error(s) in the nSDRG.
When a laboratory test result is either above or below the limit of quantification (LOQ)
for the measurement method and this result was used in calculation of group means in the
study report, the value used for calculation should be submitted using the supplemental
qualifier variable LBCALCN.
If an animal is fasted prior to collection of a sample for laboratory testing, all results from
testing of the collected sample should have LBFAST = ‘Y’.
When reporting results for tests that do not have published controlled terminology, it is
important that the test code values defined for LBTESTCD strictly adhere to the
specifications in the SENDIG: It cannot be longer than eight characters, it cannot start
with a number and it cannot contain characters other than letters, numbers and
underscores.
When actual dose dates or date/time values are available for PCRFTDTC/PPRFTDTC,
they can be included.
When a test result is below a lower limit of quantitation (LLOQ), it should be submitted
using the following instructions:
• PCORRES should not contain a specific value. For example, the value in
PCORRES may be ‘<LLOQ’, where LLOQ is the numerical value.
• ‘BLQ’ 35 should be in PCSTRESC to signify that the result is below the LLOQ.
• PCSTRESN should be blank.
• Standardized units for LOQ should be in PCSTRESU.
• PCLLOQ should be populated with the lower limit of quantitation for the analyte.
• When a numeric value has been assigned to a result that is below the LLOQ for
the purpose of group summary statistics, that value should be submitted in
SUPPPC as QNAM = ‘PCCALCN’ to allow the group statistics presented in the
study report to be reproduced. When a value that is below the LLOQ is excluded
from group statistics, no PCCALCN entry is needed.
Custom Domains
To provide study data that does not fit into an existing SENDIG domain, draft SENDIG
domain, or published SDTMIG domain, consider creating a custom dataset aligned with
35
According to the FDA’s Bioanalytical Method Validation Guidance for Industry (May 2018), study
samples with concentrations listed below the LLOQ should be reported as ‘BQL’; however, ‘BLQ’, as
specified in FDA-supported SENDIG versions, is appropriate to use in SEND datasets to report this data.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 22 of 77 March 2024
Contains Nonbinding Recommendations
the SDTM model version associated with the SENDIG version used for submission.
Questions about custom domains should be addressed in pre-submission meetings and
documented in the SDSP.
When immune response data are collected in toxicology studies intended for submission
to CBER, these data ideally should be submitted in a dataset(s); however, these data
currently may be submitted as part of the study report. For data submitted using
SENDIGv3.1 or 3.1.1, use of LB domain or a custom IS domain is acceptable. In these
cases, when a numeric value has been assigned for calculation purpose to a result of
below limit of quantification, the value should be provided in SUPPLB, as QNAM =
‘LBCALCN’; or in SUPPIS, as QNAM = ‘ISCALCN’.
Ensure that Trial Arms and Trial Sets represented in TA and TX closely follow the
SENDIG examples of study designs with recovery and/or toxicokinetic animals.
Recovery and/or toxicokinetic animals should be presented in separate Trial Sets from the
main animals. Trial Sets should be defined to contain animals of both sexes if all other
experimental parameters are the same.
The Trial Sets domain (TX) should be submitted for each study. Every set in the TX
domain should have only one record with each of the following TXPARMCD values:
SPGRPCD (sponsor group code associated with the set), GRPLBL (sponsor group label
associated with the set), PLANMSUB (planned number of males in set), and PLANFSUB
(planned number of females in set). There should be a one-to-one correspondence
between GRPLBL and SPGRPCD entries in the TX domain.
See the appendix section for a list of parameters that should be included in the full Trial
Summary (TS) dataset where relevant for nonclinical studies. Additional parameters can
be included beyond those listed in the appendix. If information for a parameter listed in
the appendix of a full TS.xpt file is not available, the parameter should not be included
for datasets modeled in SENDIGv3.0. If information for a parameter listed in the
appendix of a full TS.xpt file is not available, it can be included with TSVAL blank and
TSVALNF filled for datasets modeled in SENDIGv3.1. For nonclinical studies, study
start date (TSPARMCD= STSTDTC) is the date on which the study protocol or plan is
approved (signed) by the Study Director, also known as the study initiation date. 36
Tumor Dataset
Carcinogenicity studies should include an electronic dataset of tumor findings to allow
for a complete review. At this time, sponsors should continue to include the tumor.xpt
and associated define.pdf files regardless of whether the study is in SEND format. When
both tumor.xpt and SEND are submitted, the sponsor should ensure that data are
36
Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/providing-
regulatory-submissions-electronic-format-standardized-study-data.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 23 of 77 March 2024
Contains Nonbinding Recommendations
consistent and traceable between tumor.xpt and the SEND datasets, with the information
specified in the FDA Business Rules. Any information needed to establish traceability
should be presented in the nSDRG. The Tumor Findings dataset (tf.xpt) is necessary if
the SEND datasets are the basis for creation of the tumor.xpt dataset. If sponsors choose
to not submit Tumor Finding dataset (tf.xpt) with the SEND submission, the algorithm
used to calculate 'Time in days to detection of tumor' should be included in the nSDRG.
CO Domain (Comments)
Comments submitted in the CO domain should be relevant to study interpretation. To
reduce ambiguity, abbreviations in any free text field should be avoided or outlined in the
nSDRG.
Scope of SEND
The following is the Agency’s current thinking of the scope of SEND for studies listed in
the SENDIGv3.0, SENDIGv3.1 and SENDIGv3.1.1, as supported in the FDA Data
Standards Catalog. The intent is to provide clarification on the expectation of SEND for
studies listed in these SENDIGs, specifically addressing the following language:
It is acknowledged that some of these study types can encompass a broad range of study
designs (e.g., number of animals per group, number of endpoints tested) and have
different drug development purposes (e.g., exploratory or tolerability studies versus
standard toxicity studies designed to assess clinical safety). Given the variability of study
design and intent of a nonclinical study, the Agency is providing clarification on what
studies are subject to the SEND requirement. Study types outlined in the FDA-supported
SENDIGs that are out of scope for this discussion include those described in the
SENDIG-ARv1.0 (the scope of SEND for nonclinical natural history and efficacy studies
in Animal Rule submissions are discussed in 4.1.3.4.2). The Agency’s current
interpretation of the scope of SEND is subject to change based on the availability of new
information. This document will be updated to reflect any needed change.
Overall, the expectation of SEND datasets for nonclinical studies is linked to the
pharmacological and toxicological information required to provide FDA with the data
37
See CDISC SENDIGv3.1 (Section 1.1) available at www.cdisc.org.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 24 of 77 March 2024
Contains Nonbinding Recommendations
needed to assess and support the safety of the proposed clinical investigations. 38 These
data form the basis of the rationale on how the sponsor concluded that it is reasonably
safe to conduct the proposed clinical trial. If the nonclinical pharmacology or toxicology
study is required to support a regulatory decision by the Agency, such that the absence of
this study would result in a determination that there is insufficient information to assess
the risks to human subjects, then the nonclinical study would require SEND.
Additionally, SEND datasets are required to be submitted at the same time as the
nonclinical PDF study report for a commercial IND or NDA/BLA. Further clarification
on specific topics is outlined below.
38
21 CFR 312.23(a)(8).
39
See the Data Standards Catalog for the latest version of SEND required and the relevant requirement
dates for specific submission types. Please note all references to SEND being required in this Guide refer to
this standardized format being required for an electronic submission of clinical or nonclinical study data
under section 745A(a) of the FD&C Act.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 25 of 77 March 2024
Contains Nonbinding Recommendations
in the Data Standards Catalog. These studies are used to inform regulatory
decisions related to the risk to human subjects and ultimately impact labeling.
E. The requirement for SEND is not limited to the drug substance. Nonclinical
studies that are modeled in an FDA-supported SENDIG version (e.g., repeat-
dose toxicology) and are conducted to assess the safety of any component or
metabolite of the proposed clinical therapeutic product, require SEND.
Examples of such components include but are not limited to the active moiety
(API), impurities, excipients, leachables, extractables, pro-drugs, combination
products, vaccine adjuvants, and drug/device combinations.
F. The study report status or the finalization of the study report (i.e., draft,
interim or final) does not impact whether the SEND requirement applies.
G. The requirement for SEND is not limited to GLP studies. As both GLP and
non-GLP toxicity studies may be submitted to the FDA to support clinical
safety, the decision for inclusion of SEND is independent of GLP status. In
cases where non-GLP toxicity studies are submitted to support a
determination of safety, as outlined above, such studies must include SEND.
H. SEND is not required for study types that are not listed in an FDA-supported
SENDIG (e.g., primary pharmacology) even if one or more endpoints are able
to be modeled in SEND.
I. If SEND datasets are generated by the sponsor for nonclinical studies that are
not intended to support clinical safety, the Agency would accept these;
however, submission of these datasets would not be required.
Technical Specifications
A. Sponsors are encouraged to use the Study Data Standardization Plan (SDSP)
to communicate the intent to submit SEND datasets during product
development, and to allow for discussion with the review division when there
is any ambiguity on the SEND requirement for a study (See sections 8.2.2 and
2.1 of the Guide).
B. When SEND is not submitted for reasons outlined under Section 4.1.3.4.1
(Scope of SEND for SENDIGv3.0, SENDIGv3.1 and SENDIGv3.1.1), use of
a simplified ts.xpt file may be needed where the value “NA” (Not Applicable)
should be populated in the TSVALNF field (See section 8.2.2 of the Guide).
C. For further information on nonclinical Weight of Evidence documents, refer
to Section 7.1 of the Guide.
On March 11, 2020, FDA published a Federal Register notice (85 FR 14205) announcing the dates that
40
FDA’s support began and requirements become effective for specific Animal Rule data standards. That
submitted to CDER and for which the CDER review division expects a full tabulation of
data (i.e., line listings of the results for each individual animal) to support detailed
review. Although not required, FDA also recommends that sponsors submit SEND
datasets for such studies that are initiated before March 15, 2022, and March 15, 2023, as
applicable. In addition, SEND datasets are recommended for such studies that are
submitted to pre-INDs and FDA’s Animal Model Qualification Program.
Application-specific questions about which natural history and efficacy studies should
include full tabulations of data and datasets should be discussed with the CDER review
division as early as possible during product development. Similarly, questions about
natural history studies that will be submitted to an animal model qualification package
should be discussed with the Animal Model Qualification Program
([email protected]).
Although not currently required by CBER, CBER recommends sponsors submit datasets
modeled with SDTMv1.8 and SENDIG-ARv1.0 for nonclinical natural history or
efficacy studies. CBER also recommends including the immunogenicity (IS) domain
(described in SDTMv1.4 and SDTMIGv3.2 and later versions) to represent
immunogenicity data obtained in animal studies.
Per the ICH S5(R3) guidance for industry, Detection of Reproductive and Developmental
Toxicity for Human Pharmaceuticals, “The EFD toxicity study is designed to assess
maternal toxicity relative to that in nonpregnant females, and to evaluate potential effects
on embryo-fetal survival, intrauterine growth, and morphological development.” For
further information on EFD study design and regulatory context, please refer to ICH
S5(R3).
document omitted the 36-month implementation period for certain INDs as required by FDA’s guidance for
industry Providing Regulatory Submissions in Electronic Format--Standardized Study Data. On June 10,
2021, FDA published a Federal Register notice (86 FR 30960) that corrected that error.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 27 of 77 March 2024
Contains Nonbinding Recommendations
EFD studies can vary by study design (e.g., combination study) or have different drug
development purposes (e.g., dose range, definitive). Given this variability, the Agency is
providing clarification on what EFD studies are subject to the SEND requirement under
SENDIG-DARTv1.1. The Agency’s current interpretation of the scope of SEND is
subject to change based on the availability of new information. This document will be
updated to reflect any needed change.
Similar to the Scope of SEND for SENDIGv3.0, SENDIGv3.1 and SENDIGv3.1.1, the
expectation of SEND datasets for nonclinical EFD studies is linked to the toxicological
information required to provide FDA with the data needed to assess and support the
safety of the proposed clinical investigations. 41 These data form the basis of the rationale
on how the sponsor concluded that it is reasonably safe to conduct the proposed clinical
trial.
A. SEND is required42 for EFD studies that are submitted by the sponsor to support
the safety of a proposed clinical trial under commercial IND development or for
the support of marketing authorization and/or labeling. SEND datasets are
required to be submitted at the same time as the nonclinical PDF study report for
a commercial IND or NDA/BLA. These nonclinical studies generally identify
potential safety concerns, support the dose and population in human clinical trials,
and characterize the toxicologic profile of the test article and proposed clinical
product.
B. The requirement for SEND is not limited to the drug substance. Reproductive and
developmental studies may be conducted for the API and its metabolites when
appropriate, as well as for novel excipients. Nonclinical EFD studies conducted to
address any of these would require SEND if the purpose of that study was to
assess and inform clinical safety.
C. The study report status or the finalization of the study report (i.e., draft or final)
does not impact whether the SEND requirement applies.
D. The requirement for SEND is not limited to GLP studies. As both GLP and non-
GLP toxicity studies may be submitted to the FDA to support clinical safety, the
decision for inclusion of SEND is independent of GLP status. In cases where non-
GLP toxicity studies are submitted to support a determination of safety, as
outlined above, such studies must include SEND. Refer to ICH S5(R3) about the
appropriateness of GLP status for EFD study design.
E. As stated in ICH S5(R3), preliminary EFD toxicity studies have a similar design
to the definitive EFD toxicity study with evaluation of at least six pregnant
females per group (versus 16 per group for a definitive study). In cases where a
preliminary EFD study is being used to assess clinical safety, in lieu of a
definitive EFD study, then SEND is required.
41
21 CFR 312.23(a)(8).
42
See the Data Standards Catalog for the latest version of SEND required and the relevant requirement
dates for specific submission types. Please note all references to SEND being required in this Guide refer to
this standardized format being required for an electronic submission of clinical or nonclinical study data
under section 745A(a) of the FD&C Act.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 28 of 77 March 2024
Contains Nonbinding Recommendations
Technical Specifications
A. Sponsors are encouraged to use the Study Data Standardization Plan (SDSP) to
communicate the intent to submit SEND datasets during product development,
and to allow for discussion with the review division when there is any ambiguity
on the SEND requirement for a study (See sections 8.2.2 and 2.1 of the Guide).
B. If the Technical Rejection Criteria (TRC) is not implemented for an eCTD
Module, use of a simplified ts.xpt file is not needed when SEND is not required
for a study; however, submission of a simplified ts.xpt file to an eCTD Module
without TRC implementation will not interfere with electronic validations.
C. For preparation of complete SEND datasets for EFD studies, CDER’s preference
is that SENDIG-DARTv1.1 should be used in conjunction with the most recently
updated and required SENDIG for nonclinical studies, as listed in the FDA Data
Standards Catalog.
FDA recognizes that SDTM contains certain operationally derived variables that have
standard derivations across all studies (e.g., --STDY, EPOCH). If the data needed to
derive these variables are missing, then these variables cannot be derived and the values
should be null. The following are examples of some of the permissible and expected
variables in SDTM and SEND that should be included, if available:
1. Clinical baseline flags (e.g., last non-missing value prior to first dose) for
laboratory results, vital signs, ECG, pharmacokinetic concentrations, and
microbiology results. Nonclinical baseline flags (e.g., last non-missing value prior
to first dose in parallel design studies) for laboratory results, vital signs, body
weight, cardiovascular test results, respiratory test results, and ECG results.
Currently for SDTM and SEND, baseline flags should be submitted if the data
were collected or can be derived.
43
See CDISC SDTM Implementation Guides and the SEND Implementation Guides at www.cdisc.org for
additional information on variables referenced throughout this Guide.
44
See http://www.cancer.gov/cancertopics/terminologyresources/page6.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 30 of 77 March 2024
Contains Nonbinding Recommendations
To the extent possible, naming for drugs and metabolites should be consistent across
different clinical and nonclinical studies and across domains within a study. This
includes, but it is not limited to, the TS, EX, PC, and PP domains.
SDTM and SEND Versions
When submitting clinical or nonclinical data, sponsors should not mix versions within a
study. As noted above, the Catalog lists the versions that are supported by FDA.
Data Definition Files for SDTM, SEND, and ADaM
The data definition file describes the metadata of the submitted electronic datasets, and is
considered arguably the most important part of the electronic dataset submission for
regulatory review. This data definition specification for submitted datasets defines the
metadata structures that should be used to describe the datasets, variables, possible values
of variables when appropriate, and controlled terminologies and codes. An insufficiently
documented data definition file is a common deficiency that reviewers have noted.
Consequently, the sponsor needs to provide complete detail in this file, especially for the
specifications pertaining to derived variables. In addition, sponsors should also make
certain that the code list and origin for each variable are clearly and easily accessible
from the data definition file. The version of any external dictionary should be clearly
stated both in the data definition file and in the full TS domain when it is submitted. The
internal dataset label should also clearly describe the contents of the dataset. For
example, the dataset label for an efficacy dataset might be ‘Time to Relapse (Efficacy).’
Separate data definition files should be included for each type of electronic dataset
submission, i.e., a separate data definition file for the SDTM datasets of a given clinical
study, a separate data definition file for the SEND datasets of a given nonclinical study,
and a separate data definition file for the ADaM datasets of a given clinical study. The
data definition file should be submitted in XML format, i.e., a properly functioning
45
See http://www.cdisc.org.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 31 of 77 March 2024
Contains Nonbinding Recommendations
The aCRF should include treatment assignment forms, when applicable, and should map
each variable on the CRF to the corresponding variables in the datasets (or database). The
aCRF should include the variable names and coding for each CRF item.
When data are recorded on the CRF but are not submitted, the CRF should be annotated
with the text ‘NOT SUBMITTED.’ There should be an explanation in the relevant RG
stating why these data have not been submitted.
46
See https://www.cdisc.org/standards/data-exchange/define-xml.
47
Detailed FDA PDF specifications are located on FDA’s Electronic Common Technical Document Web
site, http://www.fda.gov/ectd.
48
Previously acrf.pdf was called blankcrf.pdf.
49 As was the case during the COVID-19 PHE, SEND is required at the time of submission of a marketing
application for products with COVID-19 specific indications, even if SEND was not submitted under the
commercial IND, and cross-referencing nonclinical studies submitted to a commercial IND for a COVID-
19 indication does not obviate the requirement of SEND for a commercial IND for a non-COVID-19
indication.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 32 of 77 March 2024
Contains Nonbinding Recommendations
CDER during the COVID-19 PHE and the 180-day wind-down period can be found in
Appendix H.
Therapeutic area (TA) standards are not data standards, but rather extend the CDISC
foundational standards (e.g., SDTM and ADaM) to represent data that pertain to specific
disease areas. CDISC publishes a TA User Guide (TAUG) for each therapeutic area
which includes the extensions as disease-specific metadata, examples and
recommendations for use (https://www.cdisc.org/standards/therapeutic-areas). The
CDISC TAUGs should not be interpreted as FDA guidance.
Questionnaires, Ratings and Scales are often used as outcome measures in clinical
studies. The instruments listed in the TAUGs should not be viewed as FDA
recommended instruments. Sponsors should consult with the appropriate FDA review
division on the best approach for each specific study.
If the study data submitted follows a Therapeutic Area User Guide (TAUG), include the
values for TSPARM/TSPARMCD and TSVAL indicated in the table from section 4.1.1.3
in the TS domain.
The TA extensions that are currently incorporated into FDA supported CDISC
foundational standards include:
5.2.4 Diabetes Therapeutic Area User Guide v1.0 – Supplement for ADaM
The Ebola Virus Disease (EVD) TAUG identified the ISARIC 50 EVD CORE Clinical
Dataset as input; however, only one of the two sets of source data is represented in the
TAUG. The Survivor forms are not included because they contain primarily standard
data seen in many studies. Sponsors should be aware of both components of the
ISARIC CORE Dataset when conducting EVD clinical trials.
The Kidney Transplant TAUG does not address two important data elements. First, the
date of the request for a biopsy is important for review, not just the date the biopsy
50
International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC)
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 34 of 77 March 2024
Contains Nonbinding Recommendations
The TAUG v1.0 does not include a guidance on where to capture “Reason Not Done”
information for the tumor lesions that were Inevaluable (this is a known issue). In
addition, the Agency considers it more accurate use the phrase ‘tumor lesions’ rather
than ‘tumors’.
Based on datasets previously submitted to the Agency, about 10% of scans are not
readable in identifying bone lesions. FDA recommends capturing Image Readability
flag for all scans, but the current TAUG does not address this. Sponsors should consult
with the appropriate FDA review division on the best approach for each specific study.
For the Disease Assessments and Response for Metastatic Disease, in the proposed
Non-Standard Variables (NSV) comparison reference variable CMPREF, FDA
recommends providing a value of ‘First Post Treatment Scan’ instead of ‘Flare’ to
make it more inclusive, as not all subjects will have a flare in the 12 week scans.
The Schizophrenia TAUG does not address two important data elements. First, the
subjects daily living situation for the past 12 months. Second, when a protocol
violation prompts study termination, sponsors should use the existing Disposition
domain as appropriate and provide a referential link to any detailed information
regarding the protocol violation. Sponsors should consult with the appropriate FDA
review division on the best approach for each specific study.
When reviewing the Major Depressive Disorder TAUG, please also reference the
FDA’s Guidance for Industry document for MDD. Additionally, please consult the
Division of Psychiatry Products when planning the submission.
The Vaccine TAUG should be used in conjunction with the FDA Guidance for
Industry “Submitting Study Datasets for Vaccines to the Office of Vaccines Research
and Review.” Investigator determined reactogenicity reporting should follow the
“Interim User Guide for COVID-19” examples on page 32 with the following
revisions:
• inclusion of the Investigator date/time of collection of the event in CE;
• inclusion of additional language in example 2 description first sentence to read
“In the study in this example, subjects kept a diary for 3 days assessing the
severity of symptoms.”;
• change of date/day of investigator assessment in FACE to 2020-04-02 (day 2)
• addition of rows in FACE to report data obtained from the subjects diary from
study day 2 (moderate vomiting) and 3 (no vomiting).
The Vaccine TAUG represents the concept of maximum in the NSV, COLSRT
(Collected Summary Result Type). We assume that a daily value/result will be a
maximum value for the day. The protocol should clarify that a maximum value should
be recorded for each day. If you will be reporting more than one value per day, please
consult with your review team on how the data should be reported.
Issue about Primary Tumor: The TAUG V1.0 does not provide guidance about the
identification, location, or laterality of the primary tumor. Even though this is noted as a
Known Issue, the importance of primary tumor for colorectal cancer is well established
and impacts interpretation of trial results. The FDA recommends that data related to the
primary tumor be provided.
Issue about Prior Therapies: The TAUG does not provide guidance about the importance
of documenting prior therapies and this is considered an oversight given the importance
of these data. The FDA recommends that data related to prior therapies be included in
clinical trial data.
Issue about Non-Target Lesions: The TAUG does not discuss the importance of
providing data to document the change in size of non-target lesions. This information is
required when using certain criteria (e.g., iRECIST). If these data are not provided in the
clinical data base, then the response criteria cannot be confirmed by the Agency.
Therefore, these data on non-target lesions are necessary if criteria, like iRECIST, is used
for trials in colorectal cancer.
5.2.22 Post Traumatic Stress Disorder Therapeutic Area User Guide v1.0
5.3.5 Submitting Select Clinical Trial Data Sets for Drugs Intended to Treat
Human Immunodeficiency Virus-1 Infection
5.3.6 Submitting Study Datasets for Vaccines to the Office of Vaccines Research
and Review
5.3.8 Technical Specifications for Submitting Clinical Trial Data Sets for
Treatment of Noncirrhotic Nonalcoholic Steatohepatitis (NASH)
51
Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/providing-
regulatory-submissions-electronic-format-standardized-study-data.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 37 of 77 March 2024
Contains Nonbinding Recommendations
6. Terminology
6.1 General
Common dictionaries should be used across all clinical studies and throughout the
submission for each of the following: adverse events, concomitant medications,
procedures, indications, study drug names, and medical history. FDA recommends that
sponsors use, where appropriate, the terminologies supported and listed in the Catalog. It
is important that coding standards, if they exist, be followed (e.g., ICH Medical
Dictionary for Regulatory Activities (MedDRA) Term Selection: Points-to-Consider
document). Frequently, sponsors submit data that do not conform to terminology
standards, for example, misspelling of MedDRA or WHODrug Global terms, lack of
conformance to upper / lower case, or the use of hyphens. All controlled terms submitted
in datasets should conform to the exact case and spelling used by the terminology
maintenance organization (e.g., MedDRA, CDISC controlled terminology). These
conformance issues make it difficult to use or develop automated review and analysis
tools. The use of a dictionary that is sponsor-defined or an extension of a standard
dictionary should be avoided if possible, but, if essential, its use should be documented in
the define.xml file and the relevant RGs.
The analysis of study data is greatly facilitated by the use of controlled terms for clinical
or scientific concepts that have standard, predefined meanings and representations. In
electronic study data submissions, sponsors should provide the actual verbatim terms that
were collected (e.g., on the CRF), as well as the coded term.
FDA recognizes that studies are conducted over many years, during which time versions
of a terminology may change. Sponsors should use the most recent version of the
dictionary available at the start of a clinical or nonclinical study. If a new version
becomes available after the start of the study, sponsors may use the most current version
of the dictionary for that clinical or nonclinical study. It is common to have different
studies use different versions of the same dictionary within the same application (e.g.,
NDA, BLA). A submission of study data should describe (e.g., in the SDSP or relevant
RG) the impact, if any, of the use of different versions on the study results. For example,
if the sponsor anticipates pooling coded data across multiple studies, then it may be
desirable to use a single version across those studies to facilitate pooling. If a sponsor
selects this approach, then the approach and the justification should be documented in the
Standardization Plan, or in an update to the plan.
Regardless of the specific versions used for individual studies, pooled analyses (e.g., for
an ISS) should be conducted using a single version of a terminology. The current version
should be used at the time that data across studies are pooled. This will ensure a
consistent and coherent comparison of clinical and scientific concepts across multiple
studies. Sponsors should specify the terminologies and versions used in the study in the
relevant RG.
Use of the Specific Controlled Term ‘OTHER’
It is understood that the expansion of controlled terminology may lag behind scientific
advancement, and that sometimes there may not be a relevant term within a controlled
terminology’s value set to describe a clinical trial event, finding, or observation.
However, it is not recommended to map a collected value to ‘OTHER’ when there is a
controlled term available to match the collected value – even when the terminology
allows for sponsor expansion. Each unique value in a --TERM field mapped to a --
DECODE value of ‘OTHER’ should have a clear rationale outlined in the relevant RGs.
52
See the guidance for industry Providing Clinical Evidence of Effectiveness for Human Drugs and
Biological Products, available at https://www.fda.gov/drugs/guidance-compliance-regulatory-
information/guidances-drugs. We update guidance periodically. To make sure you have the most recent
version of guidance, check the FDA Drugs guidance Web page at https://www.fda.gov/drugs/guidance-
compliance-regulatory-information/guidances-drugs.
53
See the guidance for industry Exposure-Response Relationships — Study Design, Data Analysis, and
Regulatory Applications, https://www.fda.gov/drugs/guidance-compliance-regulatory-
information/guidances-drugs.
54
See the guidance for industry Population Pharmacokinetics, available at
https://www.fda.gov/drugs/guidance-compliance-regulatory-information/guidances-drugs.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 39 of 77 March 2024
Contains Nonbinding Recommendations
within the submission, reference them in the relevant RGs, and use them consistently
throughout the application.
If a sponsor identifies an entire information domain 55 for which FDA has not accepted a
specific standard terminology, the sponsor may select a standard terminology to use, if
one exists. FDA recommends that sponsors include this selection in the Standardization
Plan (See section 2.1) or in an update to the existing plan, and reference it in the relevant
RG. If no controlled terminology exists, the sponsor may define custom terms. For
clinical studies, the non-FDA supported terms (whether from a non-supported standard
terminology or sponsor-defined custom terms) should then be used consistently
throughout all relevant studies within the application. Although the consistent use of non-
FDA supported terms across all nonclinical studies within an application is
recommended, it is understood that that this may not always be possible.
6.2 CDISC Controlled Terminology
Sponsors should use the terminologies and code lists in the CDISC Controlled
Terminology, which can be found at the NCI (National Cancer Institute) Enterprise
Vocabulary Services. 56 For variables for which no standard terms exists, or if the
available terminology is insufficient, the sponsor should propose its own terms. The
sponsor should provide this information in the define.xml file and in the relevant RGs.
6.3 Adverse Events
6.3.1 MedDRA
General Considerations
MedDRA is used for coding adverse events. 57 Generally, the studies included in an
application are conducted over many years and may have used different MedDRA
versions. The expectation is that sponsors or applicants will use the most current version
of MedDRA at the time of study start. However, there is no requirement to recode earlier
studies.
The spelling and capitalization of MedDRA terms should match the way the terms are
presented in the MedDRA dictionary (e.g., spelling and case). Common errors that have
been observed include the incorrect spelling of a System Organ Class (SOC) and other
MedDRA terms.
To avoid potential confusion or incorrect results, the preparation of the adverse event
dataset for the ISS should include MedDRA terms from the most current version of
MedDRA at the time that data across studies are pooled. The reason for an ISS based on a
single version of MedDRA is that reviewers often analyze adverse events across studies,
including the use of Standardized MedDRA Queries. 58 In addition, sponsors should use
55
By information domain, we mean a logical grouping of clinical or scientific concepts that are amenable
to standardization (e.g., adverse event data, laboratory data, and histopathology data, imaging data).
56
See http://www.cancer.gov/cancertopics/terminologyresources/page6.
57
See https://www.meddra.org/.
58
See http://www.meddra.org/standardised-meddra-queries.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 40 of 77 March 2024
Contains Nonbinding Recommendations
the MedDRA-specified hierarchy of terms. The SDTM variables for the different
hierarchy levels should represent MedDRA-specified primary SOC-coded terms.
6.4 Medications
General Considerations
The Unique Ingredient Identifier (UNII) 59 should be used to identify active ingredients
(specifically, active moieties) that are administered to investigational subjects in a study
(either clinical or nonclinical). This information should be provided in the SDTM TS
domain. UNIIs should be included for all active moieties of investigational products
(TSPARMCD= TRT or TRTUNII), active comparators (TSPARMCD= COMPTRT), and
any protocol-specified background treatments (TSPARMCD= CURTRT).
If a medicinal product has more than one active moiety, then multiple records in the full
TS should be provided, one for each active moiety. For example, if the investigational
product is Bactrim (a combination of sulfamethoxazole and trimethoprim), then TS will
contain two records for TSPARMCD= TRT: one for sulfamethoxazole and one for
trimethoprim.
The preferred substance names and UNII codes can be found by searching FDA’s
Substance Registration System, hosted by the National Library of Medicine. 60 We
recognize that unapproved substances may not yet have registered UNII codes. We
recommend that sponsors obtain UNII codes for unapproved substances as early in drug
development as possible, so that relevant information, such as study data, can be
unambiguously linked to those substances.
General Considerations
World Health Organization (WHO) Drug Global 61 is a dictionary maintained and updated
by Uppsala Monitoring Centre. WHODrug Global contains unique product codes for
identifying drug names and listing of medicinal product information, including active
ingredients and therapeutic uses.
59
See http://www.fda.gov/ForIndustry/DataStandards/SubstanceRegistrationSystem-
UniqueIngredientIdentifierUNII/.
60
The Substance Registration System can be accessed at https://precision.fda.gov/uniisearch.
61
See http://www.who-umc.org/.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 41 of 77 March 2024
Contains Nonbinding Recommendations
SUPPCM or FACM domains to populate all ATC Classes associated with the --DECOD
value is acceptable. ATC classes should be submitted at the fourth level or most specific
available as defined within WHODrug Global.
Generally, studies included in a submission are conducted over many years and may have
used different WHODrug Global versions to code concomitant medications. The
expectation is the most current B3-format annual version of WHODrug Global at the
time of study start will be used to code concomitant medications. There is no requirement
to recode earlier studies to align with the WHODrug Global version of later studies.
General Considerations
The Veterans Administration’s Medication Reference Terminology (MED-RT) 62 should
be used to identify the pharmacologic class(es) of all active investigational substances
that are used in a study (either clinical or nonclinical). This information should be
provided in the SDTM TS domain when a full TS is indicated. The information should be
provided as one or more records in TS, where TSPARM = “Pharmacologic Class”.
62
See MED-RT Documentation (nih.gov).
63
See the guidance for industry and review staff Labeling for Human Prescription Drug and Biologic
Products —Determining Established Pharmacologic Class for Use in the Highlights of Prescribing
Information, available at https://www.fda.gov/drugs/guidance-compliance-regulatory-
information/guidances-drugs.
64
The FDA Listing of Established Pharmacologic Class (EPC) Text Phrases is available within Highlights
of Prescribing Information on https://www.fda.gov/drugs/fdas-labeling-resources-human-prescription-
drugs/prescribing-information-resources.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 42 of 77 March 2024
Contains Nonbinding Recommendations
6.6 Indication
6.6.1 SNOMED CT
General Considerations
The International Health Terminology Standards Organization’s (IHTSDO) Systematized
Nomenclature of Medicine – Clinical Terms (SNOMED CT) 65 should be used to identify
the medical condition or problem that the investigational product in a study is intended to
affect (treat, diagnose or prevent, i.e., the indication). This information should be
provided in the SDTM TS domain (the full TS) as a record where TSPARMCD= INDIC
and TSPARMCD= TDIGRP. SNOMED CT was chosen to harmonize with Indication
information in Structured Product Labeling (SPL). 66 Because the granted indication may
include important qualifiers to fulfill the need for adequate directions for use (e.g.,
descriptors of the population to be treated, adjunctive or concomitant therapy, or specific
tests needed for patient selection), the indication section in a label may not be fully
represented by available SNOMED CT codes.
6.7.1 LOINC
General Considerations
The Logical Observation Identifiers Names and Codes (LOINC) is a clinical terminology
housed by the Regenstrief Institute. LOINC codes are universal identifiers for laboratory
and other clinical observations that enable semantically interoperable clinical data
exchange. The laboratory portion of the LOINC database contains the categories of
chemistry, hematology, serology, toxicology, and more. The SDTM standard supports
LOINC codes using the LB.LOINC variable. LOINC codes should not be added to
SEND datasets.
65
Available at http://www.ihtsdo.org/snomed-ct/.
66
See https://www.fda.gov/ForIndustry/DataStandards/StructuredProductLabeling/default.htm.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 43 of 77 March 2024
Contains Nonbinding Recommendations
The study identifier (STUDYID) in trial summary (TS) and [study-id] in the study
tagging file (STF)) should be identical wherever possible. 68 For studies where alignment
of the study identifier across TS and STF is not feasible, the value for [study-id] used in
the STF should be included in TS using the parameter SPREFID. Though SPREFID is
not in the SDTM controlled terminology for TSPARMCD, please use SPREFID to
reconcile study identifiers where necessary for SEND or SDTM studies. FDA will use
SPREFID to match study identifiers across STF and TS to establish the study start date
where necessary for evaluation against the eCTD validation criteria.
Do not use the eCTD ‘append’ lifecycle operator when submitting updated or changed
content within study data files that were previously submitted. Updated files should be
submitted using the ‘replace’ operator.
When nonclinical Weight of Evidence (WOE) documents are submitted to the Agency as
assessments for particular topics or as justification of why a toxicity study is not needed,
it is recommended that these toxicity risk assessments are submitted to the nonclinical
eCTD Modules relevant to the topic. Examples are listed below:
67
See “eCTD Technical Conformance Guide” for further details. Available at
https://www.fda.gov/drugs/electronic-regulatory-submission-and-review/ectd-resources.
68
ICH M2 EWG: The eCTD Backbone File Specification for Study Tagging Files (June 2008)
https://www.ich.org/page/study-tagging-file-specification-and-related-files and CDISC Submission
Metadata Model SDTM Metadata Submission Guidelines v2.0 | CDISC.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 44 of 77 March 2024
Contains Nonbinding Recommendations
Cross-reference to these WOE documents may also be included within eCTD Module 2.4
(Nonclinical Overview summaries). Supporting literature references submitted with any
WOE document should be submitted to eCTD Module 4.3 (Literature References). When
a WOE document is submitted to an eCTD module that is subject to the Technical
Rejection Criteria (e.g., carcinogenicity risk assessment submitted to Module 4.2.3.4), a
simplified ts.xpt file must accompany this document. The TSVALNF field of the
simplified ts.xpt file should be populated with the null value “NA” (Not Applicable) as
further described under Section 8.2.2 (Support on Data Validation Rules) of this
Technical Conformance Guide.
If you need to split a file that exceeds file size limits (See section 3.3.2), you should
submit the smaller split files in the ‘split’ sub-folder in addition to the larger non-split file
in the original data folder. There is no need for a second define.xml file to be submitted
within the split subfolder.
For rodent carcinogenicity studies submitted in 4.2.3.4, the tumor.xpt file and its
associated define.pdf should be placed in analysis\legacy\datasets subfolder under the
study datasets folder.
The file folder structure for study datasets is summarized in Figure 2. Table 2 provides
the study dataset and file folder structure and associated description. For more detailed
examples of file folder structures for clinical and non-clinical datasets in both
standardized and legacy formats, please see Appendix E: Example Study Data Folder
Structure.
69
See http://www.ich.org/products/ctd.html.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 45 of 77 March 2024
Contains Nonbinding Recommendations
2. FDA eCTD Technical Rejection Criteria for Study Data that assess
conformance to the standards listed in the Catalog (See section 7.1, section
8.2.2, and Appendix F).
3. FDA Business and Validator rules to assess that the data support regulatory
review and analysis.
70 See https://www.fda.gov/drugs/electronic-regulatory-submission-and-review/submit-ectd-or-
standardized-data-sample-fda.
71
See http://www.fda.gov/eStudyResources.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 48 of 77 March 2024
Contains Nonbinding Recommendations
In order for the FDA automated eCTD validation process to determine the study start
date (SSD) for the submitted study, FDA relies on the SSD value provided in the Trial
Summary dataset (ts.xpt) that is referenced in the Study Tagging File (STF). 73 This
validation confirms the submission of a valid STF (eCTD validation error 1789) and a
Trial Summary (TS) domain (eCTD validation error 1734). For a nonclinical study that
contains a study report with file tags “pre-clinical-study-report,” “legacy-clinical-study-
report,” or “study-report-body,” and/or an xpt formatted dataset, the expectation for
content in the TS domain (simplified or full) 74 depends on whether the study is
submitted in compliance with a CDISC standard. Appendix G (Examples of ts.xpt
Datasets) provides the appropriate content and an example of the TS domains for each
case. The expectation is that when the SSD is after the established required deadlines,
the study data must comply with the standards in the FDA Data Standards Catalog. The
validation will then identify that the required dataset files (eCTD validation error 1736)
are under the correct file tag within the STF (eCTD validation error 1735). If there are
no high validation errors within the eCTD submission, the submission will continue to
be processed.
72
The FDA Specifications for eCTD Validation Criteria can be accessed through the eCTD Submission
Standards catalog. The catalog is located on the FDA eCTD website available at https://www.fda.gov/ectd.
73
The Study Start Date (SSD) should follow the ISO 8601 standard that provides, at minimum, the year,
month, and day for the study start date.
74
Please refer to Appendix 2 of this document for more information on the simplified TS file and the Study
Data Technical Conformance Guide on the FDA Study Data Standards Resources web page for more
information on the full TS file.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 49 of 77 March 2024
Contains Nonbinding Recommendations
Technical rejection criteria have been added to the Specifications for eCTD Validation
Criteria to determine compliance with the requirements for submitting standardized study
data 75 when xpt formatted datasets are submitted to FDA in TRC applicable sections
within Module 5.
When a xpt formatted dataset is submitted, the STF for the study is then checked for the
presence of a trial summary (TS) file (full or simplified). A full ts.xpt file would be
expected when the study type and study initiation date meet the criteria for requiring
SDTM and ADaM datasets as described in the current FDA Data Standards Catalog.
There are cases in which a xpt formatted dataset submitted to TRC applicable sections
within eCTD Module 5 using one of the STFs (see sections 7.1, 8.2.2.1, and Appendix F)
is not required to include accompanying SDTM and ADaM datasets. In such cases, a
simplified ts.xpt file should be included with the xpt formatted dataset. A simplified
ts.xpt file serves to provide limited machine-readable information such that any submitted
xpt formatted dataset not requiring SDTM and ADaM datasets will be appropriately
identified by the Center’s processing system. 76,77
There may also be cases where SDTM and ADaM are not required even though the study
started after December 17, 2016. The list below comprises possible examples (not an
exhaustive list):
• pilot studies submitted to an ANDA application
• failed studies submitted to an ANDA application
When SDTM and ADaM are not applicable in a study started after December 17, 2016,
see Appendix G, example B for the format of a simplified ts.xpt file that should be used,
where the TSVALNF field is to be populated with the null value “NA” (Not Applicable).
Technical rejection criteria have been added to the Specifications for eCTD Validation
Criteria to determine compliance with the requirements for submitting standardized
study data 78 to any nonclinical study report submitted under eCTD modules 4.2.3.1,
4.2.3.2, or 4.2.3.4 that includes one of the following three file tags: ‘pre-clinical-study-
report’, ‘legacy-clinical-study-report’, or ‘study-report-body’.
75
See pp. 6-9 of the Study Data Guidance.
76
See Appendix F: Technical Rejection Criteria for Study Data Validation Important Information
77
See eCTD Submission Standards located at https://www.fda.gov/eCTD for further information on the
validation tool FDA is currently using and all eCTD validation criteria and rules.
78
See pp. 6-9 of the Study Data Guidance.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 50 of 77 March 2024
Contains Nonbinding Recommendations
When a nonclinical study report is submitted using one of these file tags in the study
tagging file (STF) or the study is submitted with an xpt formatted dataset, the STF for the
study is then checked for the presence of a trial summary (TS) file (full or simplified). A
full ts.xpt file would be expected when the study type and study initiation date meet the
criteria for requiring SEND datasets as described in the current FDA Data Standards
Catalog (e.g., a single dose toxicity study initiated after December 17, 2017 for INDs).
There are cases in which a study report submitted to eCTD Modules 4.2.3.1, 4.2.3.2 or
4.2.3.4 using one of the file tags listed above is not required to include accompanying
SEND datasets. In such cases, a simplified ts.xpt file should be included with the study
report. A simplified ts.xpt file serves to provide limited machine-readable information
such that any submitted study report not requiring SEND will be appropriately identified
by the Center’s processing system. 79,80 A simplified ts.xpt file should not be used when
submitting SEND datasets for a study. Doing so will prevent loading of these datasets
resulting in a need for TS correction and resubmission.
A simplified ts.xpt file would be expected when the study type could be modeled in an
applicable SEND Implementation Guide (SENDIG) version (e.g., repeat dose toxicity)
but the study initiation date is prior to the implementation of the requirement (e.g., before
or on Dec. 17, 2016 for NDAs). When this is the case, the following format of a
simplified ts.xpt file may be used:
There may also be cases where a study initiation date is not relevant. When nonclinical
submissions are primarily text based, do not have tabulated data or line listings, are
specifically sent to or requested by the Agency due to emergent safety concerns (with
prior agreement), or only contain data that are not modeled in an applicable SENDIG, a
simplified ts.xpt file should be used. The list below comprises possible examples of the
types of submissions that meet these criteria (not an exhaustive list):
79
See Appendix F: Technical Rejection Criteria for Study Data Validation Important Information
80
See eCTD Submission Standards located at https://www.fda.gov/eCTD for further information on the
validation tool FDA is currently using and all eCTD validation criteria and rules.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 51 of 77 March 2024
Contains Nonbinding Recommendations
When a study initiation date is not applicable, the following format of a simplified ts.xpt
file should be used, where the TSVALNF field is to be populated with the null value
“NA” (Not Applicable):
It is recommended that the Study Data Standardization Plan (SDSP) should be used
during development (See section 2.1) to communicate the intent to submit SEND
datasets. The SDSP can be updated so that all historical, current, and planned use of study
data standards is included. When appropriate, the SDSP may also be used to further
explain the intended use of simplified ts.xpt files. SDSP instructions are available
(https://advance.phuse.global/display/WEL/Deliverables) and allow flexibility to
accommodate any type of submission. Use of the SDSP will allow for identification of
potential data standardization issues and timely discussion with the review division, if
needed.
For additional information on the Technical Rejection Criteria, please see Appendix F.
The FDA Data Standards Catalog may be found at: https://www.fda.gov/industry/study-
data-standards-resources/study-data-submission-cder-and-cber
The CDER resource ‘Creating Simplified ts.xpt Files’, using free and open-source
software may be found at https://www.fda.gov/industry/study-data-standards-
resources/study-data-submission-cder-and-cber
If there are any questions as to the appropriate use of the simplified ts.xpt file, contact the
CDER eDATA Team at [email protected] or CBER eData Team at cber-
[email protected].
8.3 Study Data Traceability
8.3.1 Overview
An important component of a regulatory review is an understanding of the provenance of
the data (e.g., traceability of the sponsor’s results back to the CRF data). Traceability
permits an understanding of the relationships between the analysis results (tables, listings
and figures in the study report), analysis datasets, tabulation datasets, and source data.
Traceability enables the reviewer to accomplish the following:
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 52 of 77 March 2024
Contains Nonbinding Recommendations
Based upon reviewer experience, establishing traceability is one of the most problematic
issues associated with any data conversion. If the reviewer is unable to trace study data
from the data collection of subjects participating in a study to the analysis of the overall
study data, then the regulatory review of a submission may be compromised. Traceability
can be enhanced when studies are prospectively designed to collect data using a
standardized CRF, e.g., CDASH. Traceability can be further enhanced when a flow
diagram is submitted showing how data move from collection through preparation and
submission to the Agency.
Reviewers evaluating nonclinical studies have similar needs to the above list, though in
the case of nonclinical studies traceability allows the reviewer to understand and trace
relationships between analysis results, single animal listings in the Study Report, and the
tabulation data sets. Traceability between the Study Report and tabulation data can be
enhanced when data in collection systems has a well-defined relationship to the SEND
standard.
Legacy study data are study data in a non-standardized format, not supported by FDA,
and not ever listed in the Catalog. Sponsors should use processes for legacy data
conversion that account for traceability. Generally, a conversion to a standard format will
map every data element as originally collected to a corresponding data element described
in a standard. Some study data conversions will be straightforward and will result in all
data converted to a standardized format. In some instances, it may not be possible to
represent a collected data element as a standardized data element. In these cases, there
should be an explanation in the RG as to why certain data elements could not be fully
standardized or were otherwise not included in the standardized data submission. The
legacy data (i.e., aCRF, legacy tabulation data, and legacy analysis data) may be needed
in addition to the submission of converted data.
In cases where the data were collected on a Case Report Form (CRF) or electronic CRF
but were not included in the converted datasets, the omitted data should be apparent on
the annotated CRF and described in the RG. The tabular list of studies in the
Standardization Plan should indicate which studies contained previously collected non-
standard data that were subsequently converted to a standard format.
For nonclinical studies where data are converted to SEND from a previously
established collection system, instances may arise where it is not possible to represent
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 53 of 77 March 2024
Contains Nonbinding Recommendations
a collected data element as a standardized data element. In these cases, there should
be an explanation in the nSDRG as to why certain data elements could not be fully
standardized or were otherwise not included in the standardized data submission. As
the Study Report should contain a complete representation of the study data in the
individual animal listings, no non-standardized electronic study data should be
submitted.
Traceability Issues with Legacy Data Conversion
FDA does not recommend a particular approach to legacy clinical study data conversion,
but rather explains the issues that should be addressed so that the converted data are
traceable and adequate to support review.
Table 3 presents some of the issues that can be observed during a review when legacy
study data are converted to SDTM and submitted with legacy analysis datasets.
Table 4 presents the issues when legacy study data and legacy analysis data are
independently converted to SDTM and ADaM formats, respectively, rather than ADaM
datasets being created directly from the SDTM datasets (converted from legacy study
data).
Issues
8. Difficulty in understanding the source or derivation methods for imputed or
derived variables in integrated/pooled data, supplemental qualifiers, and related
records.
Table 5 presents the issues when legacy data are converted to SDTM and ADaM formats
in sequence (i.e., converting legacy study data to SDTM and then creating ADaM from
the SDTM). The key concern is the traceability from ADaM to the Tables, Figures and
CSR.
Submission of a Legacy Data Conversion Plan and Report is not expected for nonclinical
studies where data were collected in a previously established data collection system.
This appendix provides some of the guiding principles for the Agency’s long-term study
data standards management strategies. An important goal of standardizing study data
submissions is to achieve an acceptable degree of semantic interoperability (discussed
below). This appendix describes different types of interoperability and how data
standards can support interoperable data exchange now and in the future.
At the most fundamental level, study data can be considered a collection of data elements
and their relationships. A data element is the smallest (or atomic) piece of information
that is useful for analysis (e.g., a systolic blood pressure measurement, a lab test result, a
response to a question on a questionnaire).
A data value is by itself meaningless without additional information about the data (so
called metadata). Metadata is often described as data about data. Metadata is structured
information that describes, explains, or otherwise makes it easier to retrieve, use, or
manage data. 81 For example, the number 44 itself is meaningless without an association
with Hematocrit and the unit of measurement (e.g. "%"). Hematocrit in this example is
metadata that further describes the data.
Just as it is important to standardize the representation of data (e.g., M and F for male and
female, respectively), it is equally important to standardize the metadata. The expressions
Hematocrit = 44; Hct = 44, or Hct Lab Test = 44 all convey the same information to a
human, but an information system or analysis program will fail to recognize that they are
equivalent because the metadata is not standardized. It is also important to standardize the
definition of the metadata, so that the meaning of a hematocrit value is constant across
studies and submissions.
In addition to standardizing the data and metadata, it is important to capture and represent
relationships (also called associations) between data elements in a standard way.
Relationships between data elements are critical to understand or interpret the data.
Consider the following information collected on the same day for one subject in a study:
81
Metadata is said to “give meaning to data” or to put data “in context.” Although the term is now
frequently used to refer to XML (extensible markup language) tags, there is nothing new about the concept
of metadata. Data about a library book such as author, type of book, and the Library of Congress number,
are metadata and were once maintained on index cards. SAS labels and formats are a rudimentary form of
metadata, although they have not historically been referred to as metadata.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 57 of 77 March 2024
Contains Nonbinding Recommendations
Time = 10:20 a.m. Position = lying Systolic Blood Pressure = 110 mmHg
Time = 10:23 a.m. Position = standing Systolic Blood Pressure = 90 mmHg
With these fundamental concepts of data standardization in mind, data standards can be
considered in the context of interoperable data exchange.
Interoperability
Much has been written about interoperability, with many available definitions and
interpretations within the health care informatics community. In August 2006, the
President signed an Executive Order mandating that the Federal Government use
interoperable data standards for health information exchange. 82 Although this order was
directed at Federal agencies that administer health care programs (and therefore not the
FDA), it is relevant to this guidance because it defined interoperability for use by Federal
agencies:
Achieving interoperable study data exchange between sponsors, applicants and FDA is
not an all-or-nothing proposition. Interoperability represents a continuum, with higher
degrees of data standardization resulting in greater interoperability, which in turn makes
the data more useful and increasingly capable of supporting efficient processes and
analyses by the data recipient. It is therefore useful to understand the degree of
interoperability that is desirable for standardized study data submissions.
In 2007, the Electronic Health Record Interoperability Work Group within Health Level
Seven issued a white paper that characterized the different types of interoperability based
on an analysis of how the term was being defined and used in actual practice. 83 Three
types of interoperability were identified: technical, semantic, and process interoperability.
A review of these three types provides insight into the desired level of interoperability for
standardized study data submissions.
82
See Executive Order 13410 available at http://www.cga.ct.gov/2006/rpt/2006-R-0603.htm.
83
See Coming to Terms: Scoping Interoperability for Health Care available at
http://www.hln.com/assets/pdf/Coming-to-Terms-February-2007.pdf.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 58 of 77 March 2024
Contains Nonbinding Recommendations
Because the vast majority of study data are submitted after the study is complete,
achieving process interoperability for study data submissions in a regulatory setting is
relatively unimportant, at least for the foreseeable future. It is reasonable to conclude that
it is most desirable to achieve semantic interoperability in standardized study data
submissions.
In summary, the goal of standardizing study data is to make the data more useful and to
support semantically interoperable data exchange between sponsors, applicants, and the
FDA such that it is commonly understood by all parties.
Sponsors may use additional TS Parameters not listed in the table below if they are
needed to describe the trial.
FDA Desired
TSPARMCD TSPARM FDA Notes
- Clinical
Actual Number of
Y ACTSUB
Subjects
Y ADAPT Adaptive Design
Added on to Existing
Y ADDON
Treatments
Planned Maximum
Y AGEMAX
Age of Subjects
Planned Minimum
Y AGEMIN
Age of Subjects
Comparative
Y COMPTRT
Treatment Name
Confirmed Response
Conditional CRMDUR If applicable.
Minimum Duration
If applicable. the value should be the
CDISC Therapeutic exact listing as in section 5.2 of the
Conditional CTAUG
Area User Guide Technical Conformance Guide.
Use as many rows as needed.
Current Therapy or Where ADDON = ‘Y’. Use as many
Conditional CURTRT
Treatment rows as needed.
Data Cutoff GRPID relates DCUTDTC to
Y DCUTDESC
Description DCUTDESC.
GRPID relates DCUTDTC to
Y DCUTDTC Data Cutoff Date
DCUTDESC.
ECG Reading
Conditional EGBLIND For QT submissions.
Blinded
ECG Continuous
Conditional EGCTMON For QT submissions.
Monitoring
ECG Planned
Conditional EGLEADPR For QT submissions.
Primary Lead
ECG Used Same
Conditional EGLEADSM For QT submissions.
Lead
Conditional EGRDMETH ECG Read Method For QT submissions.
ECG Replicates at
Conditional EGREPLBL For QT submissions.
Baseline
ECG Replicates On-
Conditional EGREPLTR For QT submissions.
Treatment
ECG Twave
Conditional EGTWVALG For QT submissions.
Algorithm
Extension Trial
Y EXTTIND
Indicator
Planned Country of
Y FCNTRY Use as many rows as needed.
Investigational Sites
FDA Desired
TSPARMCD TSPARM FDA Notes
- Clinical
If applicable. the value should be the
FDA Technical exact listing as in the appendix of the
Conditional FDATCHSP
Specification Technical Conformance Guide.
Use as many rows as needed.
Healthy Subject
Y HLTSUBJI
Indicator
Trial
For a healthy volunteer study,
Conditional INDIC Disease/Condition
TSVALNF = 'NA'.
Indication
Where STYPE =
Conditional INTMODEL Intervention Model
‘INTERVENTIONAL’.
Where STYPE =
Conditional INTTYPE Intervention Type
‘INTERVENTIONAL’.
Y LENGTH Trial Length
Planned Number of
Y NARMS
Arms
Number of
Y NCOHORT
Groups/Cohorts
Trial Primary
Y OBJPRIM Use as many rows as needed.
Objective
Trial Secondary
Y OBJSEC Use as many rows as needed.
Objective
Ongoing Study
Y ONGOSIND
Indicator
Exploratory If applicable. Use as many rows as
Conditional OUTMSEXP
Outcome Measure needed.
Primary Outcome
Y OUTMSPRI Use as many rows as needed.
Measure
Secondary Outcome
Conditional OUTMSSEC Use as many rows as needed.
Measure
If STYPE = ‘INTERVENTIONAL’ and
Conditional PCLAS Pharmacologic Class
where applicable for INTTYPE.
Pediatric Postmarket
Y PDPSTIND
Study Indicator
Pediatric Study
Y PDSTIND
Indicator
Pediatric
Y PIPIND Investigation Plan
Indicator
Planned Number of
Y PLANSUB
Subjects
FDA Desired
TSPARMCD TSPARM FDA Notes
- Clinical
Rare Disease
Y RDIND
Indicator
Y REGID Registry Identifier Use as many rows as needed.
Conditional RLPSCRIT Relapse Criteria If applicable.
Stable Disease
Conditional SDMDUR If applicable.
Minimum Duration
Y SENDTC Study End Date
Y SEXPOP Sex of Participants
Clinical Study
Y SPONSOR
Sponsor
The value should be the exact term listed
in the FDA Data Standards Catalog in
Y SDTMVER SDTM Version Column E. If multiple SDTM Versions
are used for a study the every version
should be listed on each row.
The value should be the exact term listed
in the FDA Data Standards Catalog in
Y SDTIGVER SDTM IG Version Column F. If multiple SDTM IG
Versions are used for a study the every
version should be listed on each row.
Sponsor's Study
Conditional SPREFID If applicable.
Reference ID
The term ‘Conditional’ means a parameter might not be relevant to a trial or study design.
If the TS Parameter is relevant to the study design and is listed as ‘Conditional’ in the
table below, it should be included in the SEND dataset submitted to the FDA.
Sponsors may use additional TS Parameters not listed in the table below if they are
needed to describe the trial.
FDA
Desired - TSPARMCD TSPARM FDA Notes
Nonclinical
Age of subjects planned for the study population as an
integer. Either AGE or AGETXT should be populated
See Notes AGE Age
(not both). If the planned age is a range, then use
AGETXT
Age of subjects planned for the study population
expressed as a range. Either AGE or AGETXT should
See Notes AGETXT Age Text
be populated (not both). If an age integer value is
available, populate the AGE variable instead
Y AGEU Age Unit
Conditional ASOCSTDY Associated Study If applicable.
Y DOSDUR Dosing Duration See Section 4.1.3.2
End Date/Time of
Y DOSENDTC
Dose Interval
Start Date/Time
Y DOSSTDTC
of Dose Interval
Experimental End
Y EXPENDTC
Date
Experimental
Y EXPSTDTC
Start Date
Good Laboratory
Y GLPTYP
Practice Type
Time to Interim
Conditional INTSAC Include when the study has an interim sacrifice
Sacrifice
We recognize that pharmacologic class can change
Pharmacologic
Y PCLASS throughout the drug development timeline. Refer to the
Class
MED-RT for terminology.
The planned number of doses administered per a
specific interval, as defined in the SENDIG Animal
Planned Dose
Conditional PDOSFRQ Rule v1.0. Use of PDOSFRQ is recommended for all
Frequency
study types modelled in FDA-supported SENDIG
versions when relevant.
Planned
Pharmacologic
Y PPTCNAM
Target Common
Name
FDA
Desired - TSPARMCD TSPARM FDA Notes
Nonclinical
Planned
Pharmacologic
Y PPTEGID
Target Entrez
Gene Identifier
Planned
Pharmacologic
Y PPTEGSYM
Target Entrez
Gene Symbol
Planned
Pharmacologic
Y PPTMDA
Target Mode of
Action
Conditional RECSAC Recovery Period Include when the study has a recovery sacrifice
Route of
Y ROUTE
Administration
Strain/Substrain
Conditional SBSTRAIN If applicable.
Details
Y SDESIGN Study Design
Sex of
Y SEXPOP
Participants
SEND Controlled
Y SNDCTVER Terminology
Version
SEND
Y SNDIGVER Implementation
Guide Version
Y SPECIES Species
Planned Number
Y SPLANSUB
of Subjects
Test Subject
Y SPLRNAM
Supplier
Sponsor's Study
Y SPREFID
Reference ID
Sponsoring
Y SSPONSOR
Organization
Y SSTYP Study Type
Y STCAT Study Category
Y STDIR Study Director
FDA
Desired - TSPARMCD TSPARM FDA Notes
Nonclinical
The status of the study report associated with the
Study Report dataset, as defined in the SENDIG Animal Rule v1.0.
Y STRPSTAT
Status Use of STRPSTAT is recommended for all study types
modelled in FDA-supported SENDIG versions.
Test Facility
Y TFCNTRY
Country
Time to Terminal
Y TRMSAC
Sacrifice
Investigational
Y TRT Therapy or
Treatment
Primary We recognize that the CAS number may not be
Y TRTCAS Treatment CAS immediately available, especially at the opening IND
Registry Number submission.
Primary We recognize that the UNII code may not be
Y TRTUNII Treatment Unique immediately available, especially at the opening IND
Ingredient ID submission.
Y TRTV Treatment Vehicle
Test Facility
Y TSTFLOC
Location
Test Facility
Y TSTFNAM
Name
The Agency recognizes that there may be additional documents beyond Therapeutic Area
User Guides (TAUGs), Implementation Guides (IGs), and Models that provide technical
information about how to implement a CDISC standard and that these documents fall
outside the scope of the FDA Data Standards Catalog.
Immediately below is a list of SDO properties that have been evaluated by CBER and
CDER and are considered to align with their current business needs.
• CDISC OCCDSv1.0
• CDISC SEND Tumor Combinations-v1.0
• CDISC ADaM popPK Implementation Guide-v1.0
Immediately below is a list of SDO properties that have been evaluated by CBER and
CDER and are not considered to align with their current business needs. Consider
refering to FDA comments that were submitted to the SDO for more details. This list may
not be comprehensive of all properties and absence form this list does not indicate
encouragement to use. Consult with your division for more specific instructions:
• CDISC OCCDSv1.1
• CDISC ADaM Examples of Traceability-v1.0
• CDISC ADaM Metadata Submission Guidlines-v1.0
• CDISC Document: Interim User Guide for COVID-19
• CDISC Document: Guidance for Ongoing Studies Disrupted by COVID-19
• CDISC SENDIG-DARTv1.2
Study_001S
Study_001L
(Standardized Data
(Legacy Data):
Tabulation Datasets):
M4 (Non-Clinical)
Study_002S Study_002L
(Standardized Data): (Legacy Data):
M5 (Clinical)
IMPORTANT
If a file is referenced within a study section in Module 4 or 5, a STF and ts.xpt must be
present to identify the study ID and SSD to which the file belongs. The ts.xpt needs to
contain either a study ID (STUDYID) or Sponsor Reference ID (SPREFID) value that
matches with the STF study ID.
If a clinical study submitted to CDER or CBER started after December 17, 2016, the files
for NDAs, BLAs, and ANDAs 84 must comply with CDISC standards as specified in the
guidance Providing Regulatory Submissions in Electronic Format—Standardized Study
Data.
If a nonclinical study submitted to CDER started after December 17, 2016, the files for
NDAs, BLAs, and ANDAs (or December 17, 2017, for commercial INDs) 85 must comply
with CDISC standards as specified in the guidance Providing Regulatory Submissions in
Electronic Format—Standardized Study Data.
If a nonclinical study submitted to CBER started after March 15, 2023, the files for NDAs,
BLAs, ANDAs, and commercial INDs 86 must comply with CDISC standards as specified
in the guidance Providing Regulatory Submissions in Electronic Format—Standardized
Study Data.
If a clinical study submitted to CDER or CBER started on or prior to December 17, 2016
and the study contains an xpt dataset (other than the ts.xpt), for NDAs, BLAs, and
ANDAs, 87 a simplified ts.xpt (see Appendix G) file should be submitted. 88
If a nonclinical study submitted to CDER started on or prior to December 17, 2016, for
NDAs, BLAs, and ANDAs (or December 17, 2017, for commercial INDs), 89 a
simplified ts.xpt (see Appendix G) file should be submitted whether or not the study
contains an xpt dataset (other than the ts.xpt).
84
This requirement is discussed in the guidance for industry Providing Regulatory Submissions in
Electronic Format—Standardized Study Data (June 2021), available on the FDA guidance web page at
https://www.fda.gov/RegulatoryInformation/Guidances/default.htm, and on the FDA Study Data Standards
Resources web page at https://www.fda.gov/forindustry/datastandards/studydatastandards/default.htm. We
update guidances periodically. For the most recent version of a guidance, check the FDA guidance web
page.
85
Ibid.
86
Ibid.
87
Ibid.
88
The study ID and the SSD are used to determine the start date for the study. For more information, see
the sections 7.1 and 8.2.2.1.
89
See footnote 80.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 69 of 77 March 2024
Contains Nonbinding Recommendations
eCTD validation for study data (Technical Rejection Criteria) WILL APPLY to the
following eCTD sections:
• 4.2 Study Reports
• 5.3 Clinical Study Reports and Related Information
eCTD validation for study data (Technical Rejection Criteria) WILL NOT APPLY to
the following eCTD sections:
• 4.2.1 Pharmacology
• 4.2.2 Pharmacokinetics
• 4.2.3.3 Genotoxicity
• 4.2.3.5 Reproductive and Developmental Toxicity
• 4.2.3.6 Local Tolerance
• 4.2.3.7 Other Toxicity Studies
• 5.3.1.3 In Vitro – In Vivo Correlation Study Reports and Related Information
• 5.3.1.4 Reports of Bioanalytical and Analytical Methods for Human Studies
• 5.3.2 Reports of Studies Pertinent to Pharmacokinetics Using Human Biomaterials
• 5.3.3.5 Population PK Study Reports and Related Information 91
• 5.3.5.3 Reports of Analyses of Data from More than One Study
• 5.3.5.4 Other Study Reports and Related Information
• 5.3.6 Reports of Postmarketing Experience
90
See footnote 80.
91
PK/PD modeling and simulation study reports can be placed under this section, under Module 5.3.3.5.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 70 of 77 March 2024
Contains Nonbinding Recommendations
*Rejection criteria will be applied if a study report with one of the three file tags, ‘pre-clinical-study-
report’, ‘legacy-clinical-study-report’, or ‘study-report-body’ is included, and/or an xpt file (other than
the ts.xpt) is submitted.
92
This table only applies to eCTD validation 1734, 1735, and 1736. An STF must be provided for all
applications and data types for both CDER and CBER (eCTD validation 1789). For more information, see
the Specifications for eCTD Validation Criteria found here: https://www.fda.gov/drugs/electronic-
regulatory-submission-and-review/electronic-common-technical-document-ectd
93
See section 8.2.2 for more information on submitting a simplified TS for non-standardized data.
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 72 of 77 March 2024
Contains Nonbinding Recommendations
Section Details
4.1.4.7.1 HHS Declared PHE: SARS-CoV-2 (COVID-19) pandemic (Jan. 31, 2020 -
May 11, 2023)
Impacted Electronic Data Standards: The Standard for Exchange of
Nonclinical Data (SEND) for commercial INDs submitted to CDER.
Modification to Requirement: FDA will not require datasets in SEND
format until the time of submission of a marketing application for products
with COVID-19 specific indications.
Means of Public Notification: Implementation announced in Version 4.7.1
of the Study Data TCG (June 2021) and expiration announced in Version 5.2
of the Study Data TCG (May 2023).
Rationale for Modification: To help prevent delays in the initiation of
clinical trials for products with a proposed indication to diagnose, cure,
mitigate, treat, or prevent COVID-19 (COVID-19 specific indications).
Section Details
further information and resources including the guidance for industry,
Providing Regulatory Submissions In Electronic Format – Standardized
Study Data, refer to the following website:
https://www.fda.gov/industry/study-data-standards-resources/study-data-
submission-cder-and-cber.
Certain circumstances that were specific to the COVID-19 PHE warrant the
allowance of additional time to transition from the policies adopted during the
COVID-19 PHE to the reinstatement of the SEND requirement for
commercial INDs. To allow for a wind-down period, SEND will not be
required to be submitted for commercial INDs with COVID-19 indications
for an additional 180 days after the expiration of the COVID-19 PHE (i.e.,
through November 7, 2023). A simplified ts.xpt file may continue to be used
U.S. Food & Drug Administration
10903 New Hampshire Avenue
Silver Spring, MD 20903
www.fda.gov Page 74 of 77 March 2024
Contains Nonbinding Recommendations
Section Details
as needed for the additional 180 days, until November 8, 2023. As a
reminder, SEND is required at the time of submission of a marketing
application for products with COVID-19 specific indications, even if SEND
was not submitted under the commercial IND.
Glossary
The following is a list of acronyms and terms used in this Guide: