Lecture 2 - Study Methodology and Statistics
Lecture 2 - Study Methodology and Statistics
Ideally: 𝑝𝑇 = 0.5
Treatment Effect
Δ = 𝑝𝑇 − 𝑝𝐶
𝑝𝐶 = 0.25
All patients with non-small
cell lung cancer
In Reality
We can’t give the control and treatment to everyone in the population let alone reach the whole
population.
A p-value measures the likelihood of observing the estimated treatment effect due to random
chance
If the p-value is below a pre-specified cutoff 𝛼 (typically 0.05 or 5%) will reject H0
Power and Type I Error
The Truth
No treatment benefit Treatment Benefit
Δ=0 Δ>0
Conclusion from Fail to reject H0 Type II Error
Hypothesis Test Conclude Δ = 0 False negative
Reject H0 Type I Error
Conclude Δ > 0 False positive
Probability of a Type I Error is called the 𝛼-level:
◦ Is selected ahead of time, typically at 5%
Power is the probability of detecting a treatment effect when one is present
◦ Power = 1 – Prob(Type II error)
◦ Driven by the true treatment effect Δ and the sample size
◦ As Δ increases, power increases
◦ As n increases, power increases
Statistical Bias
The expected difference between the estimated parameter and the true parameter. In other
to incorrectly estimate Δ
words, anything which systematically causes Δ
Let’s assume there is a way to determine if a patient is going to respond to treatment ( ) or not ( )
systematically
Δ systematically
Δ
over-estimates Δ under-estimates Δ
A study endpoint is a specific measurement or observation to assess the effect of the study
intervention.
◦ Succinct but precise definition
◦ Should be prioritized and should correspond to the study objectives and hypothesis being tested
In general, the more hypothesis tests you perform the more difficult it is to claim significance for
an individual test. So, the number of primary/secondary objectives should be kept small
Common Endpoints in Cancer Trials
The choice of endpoint should be based on the specific disease and intervention
Time-to-event Outcomes:
Randomization CR Death
◦ Overall Survival (OS)
◦ Time from randomization to death
◦ Progression-Free survival (PFS)
◦ Time from randomization to death or disease progression OS and PFS
◦ Time to Progression (TTP) TTP is censored
◦ Time from randomization to disease progression
Randomization Progressive Death
Binary Outcomes: Disease
◦ Complete Response (CR)
◦ No detectable evidence of tumor
OS
◦ Objective Response Rate (ORR)
◦ Either a complete or a partial response PFS and TTP
Study Design
Let’s look at an example:
Double Blinded:
◦ Patient and study operator do not know which treatment the patient received
Keeping treatment information hidden helps to reduce the amount of bias introduced into the
trial. For example:
◦ If a patient knows they received the placebo they may drop out or not follow all procedures
◦ If a study operator knows a patient will receive the treatment, they may consciously or unconsciously
give the patient better care introducing additional bias
Number of Treatment Arms
Refers to how many different study interventions will be given
Singe-arms Trials:
◦ A single study intervention is given and the results are compared against a benchmark
◦ Generally, used in early phase (1/2) trials and/or when it would be unethical/impossible to give a
placebo
◦ Provides less information about the true treatment effect Δ since the “control” did not come from the
same trial
◦ Survival data from single-arm trials can be unreliable
Multi-arm Trials:
◦ Allows for randomization and blinding
◦ One arm is the control, while other(s) are interventions
◦ Allows for a better estimate of the treatment effect as the control is given in similar conditions as the
treatment, reducing bias
Common Types of Experimental Designs
Refers to the statistical design used to obtain an estimate of the treatment effect
Parallel Group Design:
◦ Patients are assigned to a single
intervention group and followed until the
end of the study
Crossover Design
◦ Patients are assigned to either treatment
or control, follow by a “wash-out” period
and then the patient is assigned the other
intervention
◦ Allows each patient to be their own
control Treatment
◦ Not all interventions can “wash-out”
Wash-out time
Back to Our Example:
Generally, the study design is summarized in an illustration
Want the study population to be as large as possible while maintaining a clinically meaningful
treatment effect.
◦ Too general a population, the treatment effect may no longer be present
◦ Too specific a population, recruitment could be an issue
Study Intervention
This sections describes in-detail the study intervention(s)/control that are being tested for safety
and efficacy.
Detailing the interventions helps to reduce bias introduce by different study sites and
investigators
Statistical Considerations
This section should specify:
◦ The formal null and alternative hypothesis for primary and key secondary endpoints
◦ How the sample size was determined for the trial
◦ A general description of the statistical analyses to be performed
For time-to-event endpoints, the sample size tells you the number of events needed to achieve
the desired power
◦ Using this number of events can then predict the number of patients needed
Example: Sample Size Determination
From the study of Abemaciclib protocol:
◦ “The primary PFS analysis will be performed after 385 PFS events have occurred. Assuming a hazard
ratio (HR) of 0.703, this sample size yields at least 90% statistical power to detect superiority of
Abemaciclib plus fulvestrant arm over the placebo plus fulvestrant arm with the use of a 1-sided log-
rank test and type I error of 0.025.”
Study targeted at least 90% power, with a predicted treatment effect of HR = 0.703. To achieve this the
study would need 385 PFS events
Using this 385 events, study designers can then estimate how many patients will need to be recruited
based on predicted drop-out rates and screening failure rates
Thank you!
References
Protocol Templates for Clinical Trials: https://grants.nih.gov/policy/clinical-trials/protocol-template.htm
Clinical Trial Endpoints for the Approval of Cancer Drugs and Biologics:
https://www.fda.gov/media/71195/download
A Study of Abemaciclib (LY2835219) Combined With Fulvestrant in Women With Hormone Receptor Positive
HER2 Negative Breast Cancer (MONARCH 2)
◦ https://clinicaltrials.gov/ct2/show/results/NCT02107703
◦ https://clinicaltrials.gov/ProvidedDocs/03/NCT02107703/Prot_000.pdf
◦ Sledge GW, Toi M, Neven P, et al. The Effect of Abemaciclib Plus Fulvestrant on Overall Survival in Hormone Receptor–
Positive, ERBB2-Negative Breast Cancer That Progressed on Endocrine Therapy—MONARCH 2: A Randomized Clinical
Trial. JAMA Oncol. 2020;6(1):116–124. doi:10.1001/jamaoncol.2019.4782