lect23
lect23
0 Survival Analysis
• Answer Questions
1
• Failure Rates
• Reliability of Systems
23.1 Failure Rates
• Increasing Failure Rates describe things which are more likely to fail
with age, such as machines whose parts wear out.
• Decreasing Failure Rates describe things that are less likely to wear
out with time: a business that has lasted two centuries is less likely to go
bankrupt than one that has lasted two years.
4
Often researchers want to make use models that have specific kinds of failure
rate behavior.
Graphs of some of the main kinds of failure rate behavior:
5
Consider the Rayleigh distribution with parameters θ0 and θ1 :
1
F (t) = 1 − exp(−θ0 t − θ1 t2 )
2
so the hazard function or failure rate is
λ(t) = f (t)/S(t)
(θ0 + θ1 t) exp(−θ0 t − 21 θ1 t2 )
=
6
exp(−θ0 t − 21 θ1 t2 )
= θ0 + θ1 t.
The log of the overall hazard function is adjusted up or down according to the
additive linear effects of the covariates (i.e., the explanatory variables).
Note that the Cox model is closely related to nonlinear regression. One has
data on covariates and survival times, and uses those to find coefficient
estimates β̂j . If an estimated coefficient is not significantly different from zero,
then it has little or no effect on the lifespan of the item or the person.
The model assumes that the covariates have the same effect at all ages, and
that there are no unspecified interactions among the variables (e.g., smoking
and working around diesel exhaust may be more unhealthy than just the sum
8
A nice feature is that if one believes the model, one can assess the relative
importance of the different covariates without having to know or even to
estimate the baseline rate λ0 (t). This baseline need not be a simple increasing
or decreasing failure rate—it may be realistically complicated, and the
inference on the covariates is still sound.
Since this is like regression analysis, what happened to β0 , the intercept term?
ln λ(t; x1 , . . . , xp ) = ln λ0 (t) + β1 x1 + · · · + βp xp
Note: Is this a good model for assessing the impact of smoking on human
lifespan? Why or why not?
Note: Is this a good model for assessing the impact of the prime interest rate
on the lifespan of a business? (Hint: The prime rate may change over time.)
The hazard ratio for an entity with a set of covariates (x1 , . . . , xp ) compared
to an entity with covariates (x∗1 , . . . , x∗p ) indicates how much their overall risks
differ. This is often used in medical and insurance settings, where the doctor
or actuary is addressing multiple lifestyle issues.
exp(xβ)
=
exp(x∗ β)
= exp[(x − x∗ )β].
The point estimate for the hazard ratio is exp[(x − x∗ )β̂]. In a more advanced
class, one can set confidence intervals on the hazard ratio.
For technical reasons one cannot estimate β̂1 , . . . , β̂p by directly maximizing
the likelihood function (which is one of our usual strategies). Instead, we must
maximize the partial likelihood:
Y exp(xj β)
L[β; (yj , xj )] = P .
yk ≥yj exp(x k β)
uncensored yj
It turns out that the maximization of the partial likelihood gives you the
estimates that one would have gotten if one had been able to maximize the
full likelihood.
Note: You will not need to know how to maximize the partial likelihoods for
the exam. Here we are just connecting some intellectual dots that underlie
the Cox proportional hazards model. But you should know how to do regular
maximum likelihood estimation for the exam.
A useful alternative to the Cox proportional hazards model is the competing
risks model. In these, an item or a person can fail in multiple ways. For
example, a person can be hit by a car, or struck by lightning, or have a heart
attack.
The competing risks model assumes the these risks operate independently, and
the survival time is a race to see which failure mode occurs first.
12
If each of the k failure modes independently has cdf Fi (t), then the overall
probability of survival beyond time t is
Y
k
S(t) = [1 − Fi (t)]
i=1
second which is connected to the third, and so forth, Then the probability
that the system survives is the probability that none of the components
fail. If component i has lifespan with cdf Fi (t), and if each component fails
independently, then then probability that the system survives beyond time t is
Q
S(t) = [1 − Fi (t)].
From this, one can calculate hazard rate functions and so forth, at least in
principle.
From previous work, we know that the probability that the first subsystem
fails before time t is
17
And we know the probability that the second subsystem fails is F4 (t) ∗ F5 (t).
Therefore the probability that the overall system fails before time t is