NESUG 2010

Pharmaceutical Applications

Integrated Summary of Safety and Efficacy Programming for Studies Using Electronic Data Capture
Changhong Shi, Merck & Co., Inc., Rahway, NJ
Qing Xue, Merck & Co., Inc., Rahway, NJ
ABSTRACT
The Integrated Summary of Safety (ISS) and Integrated Summary of Efficacy (ISE) are essential components of a successful
submission. In legacy studies, where different types of data are frequently collected through diverse systems by various
vendors, programming ISS and ISE analyses can be a daunting job because all study data need to be converted and
harmonized to the same format before programming and analysis work can begin. Studies that use an Electronic Data
Capture (EDC) system share similarly structured views, which can greatly ease the harmonization process. However, even
when little harmonization is needed, many unique challenges remain for programmers integrating multi-study data
for ISS and ISE. This paper discusses specific tips and techniques for programming integrated analyses efficiently,
focusing on the following areas: (1) data source checking, (2) "spread and convene" programming approach, and (3) consistent
data and folder structure.
Keywords: ISS, Integrated Summary of Safety, Integrated Summary of Efficacy

INTRODUCTION
The Integrated Summary of Safety (ISS) and Integrated Summary of Efficacy (ISE) are essential components of a successful
submission. They differ from a regular study in that: (a) there is a larger amount of data, (b) each component study has
usually been locked (frozen) before the ISS and ISE work begins, and (c) the individual component studies may use different
folder structures, since they may have been locked over a long period of time and may therefore have followed different
standards. This paper details techniques for handling these challenges efficiently, which include:
(1) data source checking
(2) "spread and convene" programming approach
(3) consistent data and folder structure.

1. DATA SOURCE CHECKING

ISS and ISE typically contain more than one study as well as a large amount of data. To achieve accurate integrated
analyses, the data must be scrutinized to catch important scenarios that need special attention. Because of the number
of patients and the large amount of data involved, it is impossible to "eyeball" everything in an ISS or ISE, as is sometimes
done for a single small study. The following techniques, although simple, prove to be efficient for checking the data source
before programming:
A). Frequency procedure
Checking the values of variables with a frequency procedure shows whether special attention is required, and the
results can be used to propose data handling suggestions to the statistician. The following example checks the values
of "Action Taken with Study Treatment" (AEACN variable in the SDTM AE domain, SDTM IG 3.1.1) across the pooled ISS
studies:

/* One-way frequency of AEACN across the pooled ISS AE data */
proc freq data=iss.ae;
    tables aeacn / list;
run;
Result obtained:

                                          Cumulative   Cumulative
AEACN               Frequency   Percent    Frequency      Percent
------------------------------------------------------------------
DOSE INCREASED              2      0.01            2         0.01
DOSE NOT CHANGED        32999     95.44        33001        95.44
DOSE REDUCED              136      0.39        33137        95.84
DRUG INTERRUPTED          650      1.88        33787        97.72
DRUG WITHDRAWN            556      1.61        34343        99.33
NOT APPLICABLE            230      0.67        34573        99.99
UNKNOWN                     3      0.01        34576       100.00

                     Frequency Missing = 7


In the frequency distribution above, seven AE records have a blank AEACN and three have the value 'UNKNOWN'.
Rather than going directly to table production, we recommend investigating these records further and reporting
them to the statisticians and the database team for a final decision.
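
One way to prepare such a report is sketched below. It is only an illustration: ISS.AE and AEACN come from the example above, while STUDYID, USUBJID, AESEQ, and AETERM are assumed to be present as standard SDTM AE variables. The first step counts the questionable records by study, and the second lists them so they can be sent for review.

```sas
/* ISS.AE and AEACN are from the paper's example; STUDYID, USUBJID,    */
/* AESEQ, and AETERM are assumed standard SDTM AE variables.           */

/* Count blank or UNKNOWN action-taken records by component study */
proc freq data=iss.ae;
    where missing(aeacn) or upcase(aeacn) = 'UNKNOWN';
    tables studyid * aeacn / list missing;
run;

/* List the individual records for the statistician and database team */
proc print data=iss.ae noobs;
    where missing(aeacn) or upcase(aeacn) = 'UNKNOWN';
    var studyid usubjid aeseq aeterm aeacn;
run;
```

The MISSING option on the TABLES statement makes the blank AEACN values appear as a row of their own instead of only in the "Frequency Missing" footnote.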

B). Missing values and blank values are our "friends" in ISS or ISE
Statistical programmers frequently have to deal with missing or blank values, and this is especially important in an
ISS or ISE when dictionary leveling is involved.
Commonly, ISS data are leveled to use the same dictionary version across all studies. This may result in missing data
due to expired terminology. For example, consider the following hierarchy in the drug dictionary:
CMDECOD (Standardized Medication Name)
Then
CMCLAS (Medication Class)
If a CMDECOD term has expired and cannot be leveled to the dictionary version used by the ISS or ISE, no corresponding
CMCLAS can be assigned. This leveling issue can be identified in the resulting ISS or ISE by running
a simple frequency procedure against the leveled variables (CMDECOD and CMCLAS in the above example) to
confirm that no blanks occur. Because of the large amount of data in an ISS or ISE, this programmatic check is
important: such blanks are easily overlooked by manual review. Missing or blank values are therefore our "friends" in an
ISS or ISE, in that they help to identify harmonization issues for integrated analyses.
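
A minimal sketch of such a check follows. The dataset name ISS.CM, the STUDYID variable, and the format name $MISSCHK are assumptions made for illustration. The format collapses every value into "Missing" or "Populated", so the output stays short no matter how many medication names the pooled data contain.

```sas
/* Hypothetical format: collapse character values to Missing / Populated */
proc format;
    value $misschk ' '   = 'Missing'
                   other = 'Populated';
run;

/* ISS.CM and STUDYID are assumed names; CMDECOD and CMCLAS are the    */
/* leveled variables from the example above.                           */
proc freq data=iss.cm;
    format cmdecod cmclas $misschk.;
    tables studyid * cmdecod * cmclas / list missing nocum;
run;
```

Any row showing CMDECOD as "Populated" and CMCLAS as "Missing", or CMDECOD as "Missing", points to terms that were not leveled and identifies the study they came from.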

2. PROGRAMMING APPROACH
It is possible to put the raw data for all studies together and write one set of programs for the ISS or ISE, but with this
approach it becomes difficult to debug and to determine the source of problems, especially when there are a large number of
component studies. To save debugging and validation time, the approach adopted for our 19 ISS studies was to program
each study individually first, reusing code from the clinical study report (CSR) or other existing programs, and then to
compare the results with the existing CSR or other published results.
Once the programs for every component study of the ISS or ISE have been developed and validated, only a simple set of
stacking programs is needed to stack the analysis datasets, which are in ADaM format with the same data
structure. Further work is then done on the stacked analysis data. We called this approach "spread and convene".
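
The "convene" step can be sketched as follows. This is an illustration under stated assumptions, not the production code: the librefs PROT123, PROT456, and ISS and the dataset name ADLB are hypothetical and are assumed to be assigned already, and only two of the component studies are shown.

```sas
/* Hypothetical librefs: one per validated component study, plus ISS. */
/* Each study's ADLB is assumed to have the same structure.           */
data iss.adlb;
    length srcdsn $41;
    set prot123.adlb
        prot456.adlb indsname=_src;
    srcdsn = _src;   /* source dataset name, kept to trace records back */
run;

/* Record counts by source should match the validated per-study datasets */
proc freq data=iss.adlb;
    tables srcdsn / missing;
run;
```

Because the SET statement takes each variable's attributes from the first dataset in which the variable appears, a character variable that is longer in a later study would be truncated. This is one reason the identical data structure discussed in Section 3 matters.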
The advantage of this approach can be seen in the laboratory safety (LAB) and predefined limit of change (PDLC) analyses.
For example, we produced the PDLC listing shown below for an ISS consisting of 19 studies. The listing has
11 columns, as follows:
(1) lab test name
(2) treatment group name
(3) protocol number
(4) patient allocation number
(5) lab test code (CDISC code)
(6) analysis time point (week)
(7) lab measurement day relative to the reference start date (here, the trial start date, i.e. the date of the first
non-zero dose of study medication)
(8) baseline value (for simplicity in this example, baseline is defined as the last measurement with a
measurement day relative to the reference start date <= 1)
(9) test value for the analysis time point
(10) lower limit of normal range (LLN) and upper limit of normal range (ULN)
(11) hit (indicates whether the specific record meets the PDLC criterion in the row header)

In this example, we show three patients: two from Prot123, with allocation numbers (AN) 10001 and 10030, and one from
Prot456, with AN 1000. Because Prot456 was a Phase IIB study and Prot123 was a Phase III study, the study designs
differed somewhat, and so did the baseline definitions. Therefore, to obtain the table below, where baseline
value is a single column, the most efficient approach was to "spread" first, i.e. set up the lab data for Prot456 and Prot123
separately and compare the results against the original CSR or other exploratory outputs, and then to "convene", i.e. stack
the analysis datasets so that baseline value is one column. The same applies to the analysis time point column, because
weeks were defined differently in each study. Note that the table should contain only patients who received at least one
dose of study medication. With data from 19 studies, running the programs under the "spread and convene" approach took
considerably less time than trying to integrate all the data together first.
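
The "spread" step for one study might look like the sketch below. It uses the simplified baseline definition given in the column list above (last measurement with relative day <= 1) and flags each post-baseline measurement that meets the increase criterion of the listing. The dataset WORK.LB_PROT123, the SDTM-style variables USUBJID, LBTESTCD, LBDY, and LBSTRESN, the derived names BASE, CHG, and CRIT, and the restriction of the criterion to measurements after day 1 are all assumptions made for illustration.

```sas
/* Hypothetical input: one record per patient, lab test, and measurement */
proc sort data=work.lb_prot123 out=work.lb_sorted;
    by usubjid lbtestcd lbdy;
run;

/* Baseline: last non-missing result with relative day <= 1.           */
/* WHERE is applied before the BY-group flags are set, so LAST. refers */
/* to the last record among the filtered ones.                         */
data work.base;
    set work.lb_sorted;
    where not missing(lbdy) and lbdy <= 1 and not missing(lbstresn);
    by usubjid lbtestcd;
    if last.lbtestcd;
    base = lbstresn;
    keep usubjid lbtestcd base;
run;

/* Put the baseline on every record and flag measurements meeting the  */
/* criterion: increase of >= 0.3 mg/dL or >= 50% from baseline.        */
data work.adlb_prot123;
    merge work.lb_sorted work.base;
    by usubjid lbtestcd;
    crit = 0;
    if lbdy > 1 and not missing(base) and not missing(lbstresn) then do;
        /* Rounding guards against results such as 1.4 - 1.1 < 0.3 */
        chg  = round(lbstresn - base, 1e-8);
        crit = (chg >= 0.3 or (base > 0 and chg >= round(0.5 * base, 1e-8)));
    end;
run;
```

A study with a different baseline definition would change only the WORK.BASE step, while the output keeps the same BASE column, which is what allows the per-study datasets to be stacked afterwards. The "two or more consecutive" requirement of the listing needs a further pass over CRIT within each patient and test, which is not shown here.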


Listing of Patients With Two or More Consecutive Serum Creatinine Measurements
with an Increase from Baseline of 0.3 mg/dL or of 50%
Pooled Studies

                                                             Endpoint(s)
                                                 Allocation  Assessed            Relative  Baseline   Test
Lab Test                  Treatment    Protocol  Number      for Test      Week       Day     Value  Value  LLN, ULN  Hit
--------------------------------------------------------------------------------------------------------------------------
Criterion: Two or more consecutive measurements with an increase from baseline of >=0.3 mg/dL or of >= 50%
Serum Creatinine (mg/dL)  Non-exposed  123       10001       CREAT          -10       -69       1.1    1.2  0.7, 1.4
Serum Creatinine (mg/dL)                         10001       CREAT            0         1       1.1    1.1  0.7, 1.4
Serum Creatinine (mg/dL)                         10001       CREAT            3        22       1.1    1.4  0.7, 1.4  Yes
Serum Creatinine (mg/dL)                         10001       CREAT            6        36       1.1    1.4  0.7, 1.4  Yes
Serum Creatinine (mg/dL)                         10001       CREAT            6        43       1.1    1.2  0.7, 1.4
Serum Creatinine (mg/dL)                         10001       CREAT           12        91       1.1    1.3  0.7, 1.4
Serum Creatinine (mg/dL)                         10001       CREAT           18       127       1.1    1.2  0.7, 1.4
Serum Creatinine (mg/dL)                         10001       CREAT           18       141       1.1    1.2  0.7, 1.4
Serum Creatinine (mg/dL)                         10030       CREAT           -9       -63       0.9      1  0.7, 1.4
Serum Creatinine (mg/dL)                         10030       CREAT            0         1       0.9    0.9  0.7, 1.4
Serum Creatinine (mg/dL)                         10030       CREAT            3        13       0.9      1  0.7, 1.4
Serum Creatinine (mg/dL)                         10030       CREAT            3        22       0.9      1  0.7, 1.4
Serum Creatinine (mg/dL)                         10030       CREAT            6        43       0.9    1.2  0.7, 1.4  Yes
Serum Creatinine (mg/dL)                         10030       CREAT           12        85       0.9    1.2  0.7, 1.4  Yes
Serum Creatinine (mg/dL)                         10030       CREAT           18       114       0.9    1.2  0.7, 1.4  Yes
Serum Creatinine (mg/dL)                         10030       CREAT           18       125       0.9    1.2  0.7, 1.4  Yes
Serum Creatinine (mg/dL)                         10030       CREAT           24       167       0.9    1.2  0.7, 1.4  Yes
Serum Creatinine (mg/dL)               456       1000        CREAT           -7       -49       0.7    0.8  0.7, 1.4
Serum Creatinine (mg/dL)                         1000        CREAT           -2       -14       0.7    0.8  0.7, 1.4
Serum Creatinine (mg/dL)                         1000        CREAT            0         1       0.7    0.7  0.7, 1.4

3. CONSISTENT DATA AND FOLDER STRUCTURE

For an ISS or ISE that contains only studies whose data were collected using EDC, we may have a consistent data structure at
database lock. However, if standards changed over time, or if the ISS or ISE includes a non-EDC study, the folder and data
structures may differ from study to study. To fully realize the advantage of common structures in ISS and
ISE, each component study needs a consistent data and folder structure with exactly the same naming convention. It is then
possible to use virtually the same code to define the input and output directory paths at startup. Listed below is a folder
structure we found helpful:

[Figure: ISS Directory Structure (legend: folder, file)]
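
With identical folder names in every study, the startup code can be written once and parameterized by study, as in the sketch below. The macro name, the root path, and the subfolder names (data/sdtm, data/adam, output) are hypothetical placeholders and are not taken from the directory structure above.

```sas
/* Hypothetical startup macro: folder names are placeholders and       */
/* should be replaced by the project's own naming convention.          */
%macro setup_study(root=, study=);
    %global outdir;
    libname sdtm "&root/&study/data/sdtm" access=readonly;  /* input  */
    libname adam "&root/&study/data/adam";                  /* output */
    %let outdir = &root/&study/output;                      /* tables */
%mend setup_study;

/* The same call pattern works for every component study */
%setup_study(root=/projects/iss, study=prot123);
```

Only the STUDY parameter changes from one component study to the next, so programs that refer to the SDTM and ADAM librefs and to &OUTDIR can be reused without editing paths.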


The following is a consistent data structure example for our ADSL dataset within each component study:

VARIABLE  LABEL                                  TYPE/LENGTH  DECODE/DERIVATION/COMMENTS
--------------------------------------------------------------------------------------------------------------
STUDYID   Study Identifier                       C/200
USUBJID   Unique Subject Identifier              C/200
SUBJID    Subject Identifier for the Study       C/200        Also known as Randomized Patient Identifier.
SITEID    Study Site Identifier                  C/200
ETHNIL    Ethnicity                              C/200
ETHNIN    Ethnicity, Num                         N/8          1: Hispanic or Latino
                                                              2: Not Hispanic or Latino
AGE       Age                                    N/8          Age in Years
SEX       Sex                                    C/2
RACE      Race                                   C/200        AMERICAN INDIAN OR ALASKA NATIVE:
                                                              American Indian or Alaska Native | ASIAN: Asian
                                                              | BLACK OR AFRICAN AMERICAN: Black or
                                                              African American | MULTI-RACIAL: Multi-Racial
                                                              | NATIVE HAWAIIAN OR OTHER PACIFIC
                                                              ISLANDER: Native Hawaiian Or Other Pacific
                                                              Islander | WHITE: White
FASFL     Full Analysis Set Pop Flag             C/1          Flag to identify FAS population for the primary
                                                              efficacy end point.
FASFN     Full Analysis Set Pop Flag, Num        N/8          1: Included in FAS population
                                                              0: Excluded from FAS population
ARM       Description of Planned Arm             C/200
TRT1P     Planned Treatment for Period 1         C/200
TRT1PN    Planned Treatment Number for Period 1  N/8          1: Placebo
                                                              2: Study drug
RANDDT    Date of Randomization                  N/8
TRTSTDT   Date of First Exposure to Treatment    N/8
TRTENDT   Date of Last Exposure to Treatment     N/8
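
Whether the component studies really follow such a specification can be checked programmatically before stacking. The sketch below compares variable type and length across studies using the DICTIONARY.COLUMNS view; the librefs PROT123 and PROT456 are hypothetical and are assumed to be assigned already.

```sas
/* Hypothetical librefs: one per component study, each holding ADSL. */
proc sql;
    create table work.adsl_mismatch as
    select varname,
           count(distinct attr)    as n_attr,      /* distinct type/length */
           count(distinct libname) as n_studies    /* studies with the var */
    from (select upcase(name)           as varname,
                 libname,
                 catx('/', type, length) as attr
          from dictionary.columns
          where libname in ('PROT123', 'PROT456')
            and memname = 'ADSL')
    group by varname
    having count(distinct attr) > 1
        or count(distinct libname) < 2;
quit;
```

An empty WORK.ADSL_MISMATCH suggests that the two ADSL datasets agree on variable names, types, and lengths. Any row returned names a variable that differs between studies or is absent from one of them. The value 2 in the HAVING clause is the number of studies compared and would change with the list of librefs.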


CONCLUSION
This paper provides some basic techniques and tips for ISS and ISE programming. These steps help enable the efficient and
accurate creation of ISS and ISE analyses across multiple studies. If all the component study data are collected in SDTM
format, the programs for each component study analysis can be standardized further, where applicable, to improve
efficiency even more.

REFERENCES
CDISC Study Data Tabulation Model Implementation Guide: Human Clinical Trials, Version 3.1.1 (SDTM IG 3.1.1)
SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in
the USA and other countries. ® indicates USA registration.

ACKNOWLEDGEMENTS
The authors would like to thank the management team for their review of this paper.

CONTACT INFORMATION
Your comments and questions are valued and encouraged. Contact the authors at:
Changhong Shi
Merck & Co., Inc.
RY34-A320
P.O. Box 2000
Rahway, NJ 07065
[email protected]
Qing Xue
Merck & Co., Inc.
RY34-A320
P.O. Box 2000
Rahway, NJ 07065
[email protected]
