Predicting Crash Likelihood and Severity
Predicting Crash Likelihood and Severity
a r t i c l e i n f o a b s t r a c t
Article history: Real-time crash risk prediction using traffic data collected from loop detector stations is useful in dynamic
Received 27 August 2012 safety management systems aimed at improving traffic safety through application of proactive safety
Received in revised form 4 March 2013 countermeasures. The major drawback of most of the existing studies is that they focus on the crash
Accepted 31 March 2013
risk without consideration of crash severity. This paper presents an effort to develop a model that pre-
dicts the crash likelihood at different levels of severity with a particular focus on severe crashes. The
Keywords:
crash data and traffic data used in this study were collected on the I-880 freeway in California, United
Crash severity
States. This study considers three levels of crash severity: fatal/incapacitating injury crashes (KA), non-
Real-time safety management
Crash risk prediction
incapacitating/possible injury crashes (BC), and property-damage-only crashes (PDO). The sequential
Sequential logit model logit model was used to link the likelihood of crash occurrences at different severity levels to various
Freeway traffic flow characteristics derived from detector data. The elasticity analysis was conducted to evaluate
the effect of the traffic flow variables on the likelihood of crash and its severity.The results show that
the traffic flow characteristics contributing to crash likelihood were quite different at different levels of
severity. The PDO crashes were more likely to occur under congested traffic flow conditions with highly
variable speed and frequent lane changes, while the KA and BC crashes were more likely to occur under
less congested traffic flow conditions. High speed, coupled with a large speed difference between adja-
cent lanes under uncongested traffic conditions, was found to increase the likelihood of severe crashes
(KA). This study applied the 20-fold cross-validation method to estimate the prediction performance of
the developed models. The validation results show that the model’s crash prediction performance at each
severity level was satisfactory. The findings of this study can be used to predict the probabilities of crash
at different severity levels, which is valuable knowledge in the pursuit of reducing the risk of severe
crashes through the use of dynamic safety management systems on freeways.
© 2013 Elsevier Ltd. All rights reserved.
1. Introduction et al., 2003; Abdel-Aty et al., 2004, 2005; Zheng et al., 2010; Pande
et al., 2011; Ahmed et al., 2012; Ahmed and Abdel-Aty, 2012; Xu
Real-time crash risk prediction models estimate the likelihood et al., 2012a,b, in press; Li et al., 2012).
of crash occurrence for a given freeway segment over a short time Oh et al. (2001, 2005) applied a Bayesian model to establish the
period, such as 5 min. One of the important practical applications of statistical relationship between the crash risk and the real-time
real-time crash risk prediction models is identification of hazardous traffic flow states. The results showed that the standard deviation
traffic conditions that may lead to a crash. Predicting the crash risk of speed estimated in five-minute intervals was a good indicator of
in real-time is an essential task in freeway dynamic safety man- hazardous traffic conditions where the crash potential was consid-
agement systems. Crash risk prediction helps identify hazardous erably higher than under other traffic conditions. Lee et al. (2003)
traffic conditions where proactive crash prevention strategies are used a log-linear model to estimate crash risks based on real-time
needed to mitigate the high crash risk. In recent years, numerous traffic flow data collected from freeway loop detector stations. It
studies have developed freeway crash risk prediction models that was concluded that the coefficient of variation in speed, traffic den-
link the crash risk with certain traffic flow characteristics measured sity, and speed difference between upstream and downstream loop
with freeway traffic surveillance systems (Oh et al., 2001, 2005; Lee detector stations were significantly correlated with the crash risk.
Abdel-Aty et al. (2004) applied matched case-control logis-
tic regression to link crash likelihood with real-time traffic flow
∗ Corresponding author. Tel.: +86 13801580045.
characteristics. The traffic intervals preceding a crash were cases
E-mail addresses: [email protected] (C. Xu), [email protected] (A.P. Tarko),
that were matched with the crash-free intervals used as controls.
[email protected] (W. Wang), [email protected] (P. Liu). The results showed that the likelihood of crash occurrence was
0001-4575/$ – see front matter © 2013 Elsevier Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.aap.2013.03.035
C. Xu et al. / Accident Analysis and Prevention 57 (2013) 30–39 31
Table 1
Variables considered for the models.
Symbol Variables
equal to 0 vph. The 30-s raw detector readings from two consecu- at different time along the selected I-880 freeway section. Thus,
tive upstream–downstream detector stations were aggregated into the selected non-crash cases could generally represent the normal
5-min intervals and converted into the 22 traffic flow variables traffic flow conditions.
presented in Table 1. The geometric data for the I-880 freeway were also obtained
The traffic flow variables in Table 1 consist of 5-min observations from the PeMS database. As shown in Table 1, 9 geometric vari-
supplemented with a crash indicator (1 if a crash occurred between ables were used in this study. The geometric data for each crash
the upstream and downstream detectors, and 0 otherwise). The case and non-crash case were extracted based on their milepost
researchers extracted traffic data in the time interval between 5 location. The weather data were obtained from National Climate
and 10 min prior to crash occurrence. The purpose of doing so was Data Center (NCDC) website which provides hourly weather infor-
to identify hazardous traffic condition ahead of the crash occur- mation from weather stations across the United States. Weather
rence time to make preemptive measures possible (Pande et al., conditions for each crash case and non-crash case were extracted
2011; Lee et al., 2011; Xu et al., in press). For example, if a crash based on their time and milepost location. Considering the sample
occurred at 10:00 pm, the traffic data were extracted from 9:50 size in each category, the rain and fog were combined as adverse
to 9:55 pm. This time lag was also adopted in previous studies weather conditions. As a result, the study considered two difference
to develop real-time crash risk prediction models (Pande et al., weather conditions, including clear weather and adverse weather.
2011; Lee et al., 2011; Xu et al., in press). For each crash in the
dataset, the researchers randomly selected 20 five-minute inter-
vals without crashes from the crash-free days. These intervals were
supplemented with the 22 traffic flow variables to form crash-free
observations. To generate the dataset of non-crash cases, the time
for each non-crash case was randomly chosen from the 527,0401-
min intervals in 2008 (60 min × 24 h × 366 days in 2008). Similarly,
the upstream and downstream station for each non-crash case was
randomly selected from the 119 loop detector stations. Then, each
randomly selected combination of time and stations was used to
extract 5-min detector data after the assigned time of the non-
crash from the assigned upstream and downstream station of the
non-crash. In addition, it was ensured that there were no crashes
observed at the location of each non-crash case during the whole
day. Each non-crash case was also assigned a random milepost loca-
tion based on its upstream and downstream station. Figs. 2 and 3
illustrate the distributions of non-crash cases over time and space.
The dataset of non-crash cases covers the normal traffic conditions Fig. 2. The distribution of non-crash cases over time.
C. Xu et al. / Accident Analysis and Prevention 57 (2013) 30–39 33
Fig. 3. The distribution of non-crash cases along the I-880 Freeway. In this study, a binary logit model was applied at each stage to
fit the developed sequential logit model. At each stage, a binary
logit model was used to fit a sub-sample that excluded the obser-
The crash data were obtained from the Statewide Integrated vations of a certain level used in the previous stage. The binary
Traffic Records System (SWITRS) maintained by Caltrans. The logit regression model was used in previous studies for predicting
crash dataset provided by SWITRS included crash location, crash a binary dependent variable as a function of the predictor variables
time, and crash severity. The crash severity is divided into five in transportation engineering (Xu and Tian, 2008; Hubbard et al.,
levels, including fatal crash (K), incapacitating injury crash (A), 2009; Liu et al., 2007). Using the binary logit model, the probability
non-incapacitating injury crash (B), possible injury crash (C), and of the occurrence of a crash can be estimated using the following
property-damage-only crash (PDO). The definition of each crash equation:
severity level is:
A fatal injury is any injure that results in death within a 30 day 1
P(xi ) = (i = 1, 2, ...., n) (1)
period after the crash occurred. 1 + e−g(xi )
An incapacitating injury is any injure other than a fatal injure,
where P(xi ) denotes the probability of the occurrence of a crash and
which prevents the injured person from normally continuing the
g(x) is the multiple linear combination of explanatory variables,
activities the person was capable of performing before the injury
which can be expressed as:
occurred.
A non-incapacitating injury is any injure other than a fatal injury P(xi )
or an incapacitating injury, which is evident to observers at the g(x) = ln = ˇ0 + ˇ1 x1i + · · · + ˇk xki (2)
1 − P(xi )
scene of the crash.
A possible injury is any injure that includes complaint of pain where xki denotes the value of variable k for sample i and ˇk is the
without visible injury. coefficient of variable k. The parameters ˇ0 , ˇ1 , ˇ2 , . . ., ˇk can be
The original five levels of severity were combined into three estimated by solving the log-likelihood function for Eq. (2), which
levels: KA, BC, and PDO (Leckrone et al., 2011; Jung et al., 2010). is given by:
This combining process acknowledged the similarity of the com- n
bined levels and increased the number of crashes at each new level,
ln L(ˇ, xi ) = [ˇ0 +ˇ1 x1i +· · ·+ˇki xki − ln(1 + eˇ0 +ˇ1 x1i +···+ˇki xki )]
thereby improving the chance for more significant variables being
i=1
included in the final models. Most of the crashes (56.0%) in the (3)
dataset were rear-end crashes followed by sideswipe crashes with
about 22.3%. About 54.9% and 13.7% of injury crashes in dataset
were rear-end crashes and sideswipe crashes respectively, and As per findings of previous studies (Hauer and Hakkert, 1988;
20.2% were hit object crashes. The crash frequency for each sever- Elvik and Mysen, 1999; Hauer, 2006; Savolainen et al., 2011), less
ity level is shown in Table 2. A total of 794 crash cases and 15,880 severe crashes are more likely to be under-reported and the under-
non-crash cases were included in our database. reporting rate decreases with the increase in severity level. Thus,
accident samples are usually over-represented by high severity
3. Research methodology level crashes. Further, it is prohibitive to include all the observations
of non-crash cases in the dataset. 15,880 observations of non-crash
The ordered probit/logit model is the most common model- cases were randomly selected to represent the normal traffic con-
ing approach that fits the data structure of an ordinal response. ditions. Finally, due to the missing or invalid real-time traffic data,
the crashes that could not be matched with real-time traffic data
were excluded from further data analysis. These factors will make
Table 2
the dataset an outcome (choice)-based sample.
Frequency distribution of observations in the sequential logit model.
When the conventional maximum likelihood estimator (MLE) is
Crash severity level Sequential structure used to the outcome-based samples, the multinomial logit model
Stage 1 Stage 2 Stage 3 could still produce unbiased estimates for model parameters except
the constant terms (Cosslett, 1981a,b; Yamamoto et al., 2008; Patil
Fatal and incapacitating injury (KA) 59 (1a ) 59 (1) 59 (1)
Non-incapacitating and possible injury(BC) 203 (1) 203 (1) 203 (0) et al., 2011). Similar to multinomial logit model, the parameter esti-
Property damage only (PDO) 532 (1) 532 (0) – mates in the binary sequential logit model are unbiased except
Non-crash 15,880 (0) – – constant terms (Yamamoto et al., 2008; Savolainen et al., 2011),
Total 16,674 794 262
because the binary sequential logit model is the combination of
a
SAS coding of crash severity level are in parentheses. several binary logit models.
34 C. Xu et al. / Accident Analysis and Prevention 57 (2013) 30–39
P(Crash) = Pf 1 (7)
(9)
P(KA) = P(Crash)P(KA or BC|Crash)P(KA|KA or BC) = Pf 1 Pf 2 Pf 3
Fig. 4. The structure of the sequential logit model considered in this study. (10)
Table 3
Estimation results for the sequential logit model.
Stage 3 KA vs. BC
AvgSpdu 0.033 0.015 4.900 0.027 2.022 (0.448)
SpdDifu 0.067 0.020 10.723 0.001 0.859 (0.404)
VehCntd −0.117 0.042 7.577 0.006 −1.007 (0.464)
Intercept −3.510 (−1.971) 1.199 8.568 0.003 –
Summary statistics:
−2L(c) = 279.501; −2L(ˇ) = 238.076
−2[L(c) − L(ˇ)] = 41.426 (3df); P < 0.0001
a
The intercept adjustment for each logit model.
b
The standard deviation of elasticity for each variable.
station (represented by SpdDevu ), the speed variance at the down- model (first stage). The positive coefficient of variable DetDistu–d
stream detector station (represented by SpdDevd ), the traffic indicates that the probability of crash grows with the length of
inter-lane imbalance (measured with OccDifd ), the volume differ- the road segment. The findings from the aggregate crash pre-
ence between upstream and downstream station (represented by diction models in previous studies also demonstrated that the
AvgCntu–d ), and the occupancy difference between upstream and increase in road segment resulted in an increase in the number of
downstream station (represented by AvgOccu–d ) are high as well. accidents (Anastasopoulos and Mannering, 2009). Both variables
These results are consistent with the high lane-change activities Widths and Widtho had negative coefficients, indicating that the
postulated by Gazis et al. (1962) for certain conditions and con- crash risk decreases with the increase in road surface width and
firm the findings of previous statistical analyses (Lee et al., 2003; out shoulder width. As indicated by the coefficient of the variable
Abdel-Aty et al., 2004; Ahmed and Abdel-Aty, 2012; Xu et al., Curve, the curve segment could increase the crash risk on freeways.
2012a,b). Generally speaking, the small distances between vehicles These results are consistent with the findings of previous studies
in high-density traffic flow leave less time for taking crash avoid- (Anastasopoulos and Mannering, 2009; Das and Abdel-Aty, 2010;
ance maneuver. Speed fluctuations and traffic imbalances between Shively et al., 2010). The positive coefficient of weather conditions
lanes may encourage drivers to change lanes more frequently – a indicates that adverse weather conditions could increase crash like-
maneuver that can be quite dangerous in dense traffic. lihood on freeways. Surprisingly, the model does not include the
Elasticity tells how many times the crash probability changes if presence of on-ramps and off-ramps between the detector sta-
the explanatory variable changes by 1% while the other variables tions. It seems that the six traffic flow characteristics included in
remain fixed. Unlike the marginal effects, the elasticity is dimen- the model have already captured the impacts of the ramps.
sionless, thus it is more convenient for comparing the effects of As mentioned above, the parameter estimates in the binary
different variables. For example, the average elasticity values for sequential logit model are unbiased except constant terms when
the six traffic flow characteristics (DetOccu , SpdDevu , SpdDevd , MLE is used to the outcome-based samples. To account for the
OccDifd , AvgCntu–d , and AvgOccu–d ) are: 0.467, 0.224, 0.189, 0.299, biases caused by outcome-based sampling scheme, Eq. (4) was used
0.090, and 0.038, respectively. It means that the one-percent to adjust the intercept of the model at the first stage as follows:
increase in these six traffic flow characteristics is associated with
the 0.467%, 0.224%, 0.189%, 0.299%, 0.090%, and 0.038% increases in
Non-crashp
SR Crashs
crash
the crash probability, respectively. offset = −Ln = −Ln ×
PRcrash Crashp Non-crashs
Among the geometric variables, the spacing between upstream
and downstream stations (DetDistu–d ), the road surface width Crashr Crashs Non-crashp
(Widths ), the outer shoulder width (Widtho ) and the curve sec- = −Ln × × (11)
Crashp Crashr Non-crashs
tion (Curve) were found to be significant in the crash probability
36 C. Xu et al. / Accident Analysis and Prevention 57 (2013) 30–39
where SRcrash represents the ratio of crash cases to non-crash cases VehCntd ) were significantly correlated with the risk of fatal or inca-
in the data sample; PRcrash represents the ratio of crash cases to pacitating injury upon crash occurrence. Both variables AvgSpdu
non-crash cases in the total population; Non-crashs and Non-crashp and SpdDifu had positive coefficients, indicating a high risk of a fatal
represent the number of non-crash cases in the data sample and or incapacitating injury outcome from a crash if the crash occurs
total population, respectively; Crashs and Crashp represent the at high speed and considerable speed difference between lanes.
number of crashes in the data sample and total population, respec- On the other hand, the negative coefficient of the downstream
tively; Crashr represents the reported number of crashes. flow variable (VehCntd ) indicates that a fatality and incapacitat-
As shown in Eq. (11), three ratios were included in the offset ing injury are less likely in a crash that occurs in a high-volume
to account for the biased caused by the outcome-based sampling traffic flow. Therefore, crashes that occur in less congested traf-
scheme. The ratio between Non-Crashs and Non-Crashp is used to fic flow conditions with high speeds and high speed differences
account for the reduction in the number of non-crash cases in the between adjacent lanes are prone to produce fatal and incapacitat-
data sample. Due to the missing or invalid real-time traffic data, the ing injuries.
number of crashes in the data sample is smaller than the number The average elasticity for average speed variable (AvgSpdu ) was
of reported crashes. The ratio between Crashs and Crashr is used 2.022; and for the speed difference across lanes (SpdDifd ), the aver-
to account for the reduction in the number of crashes caused by age elasticity was 0.859, indicating that a 1% increase in AvgSpdu
missing or invalid real-time traffic data. Finally, the ratio between and SpdDifu increases the probability of a fatal and incapacitating
Crashr and Crashp is used to account for the effect of underreport- injury crash by 2.022% and 0.859%, respectively. The average elas-
ing of crash data. Although the actual value for reporting rate of all ticity of −1.007 for the flow intensity (represented by VehCntd )
crashes is unknown, this value can be estimated using the results of implies that the probability of a fatal and incapacitating injury crash
previous studies about under reporting of crashes. The study con- decreases 1.007% for 1% increase in VehCntd .
ducted by Elvik and Mysen (1999) reported that the reporting rates
for crash K, A, B, C, O are 95%, 69%, 27%, 11%, and 25%, respectively. By 4.2. Prediction performance
the combination of the above reporting rates and reported crashes
in the crash data, the reporting rate of all crashes can be easily esti- 4.2.1. Test design
mated. After determining the reporting rate of all crashes, the offset This study applied the 20-fold cross-validation method to esti-
for adjusting the intercept of the model at the first stage can be esti- mate the prediction performance of the developed models. The
mated using Eq. (11). The offsets for adjusting the intercept terms whole sample was randomly partitioned into 20 mutually exclusive
at other stages shown in Table 3 can be estimated using the similar sub-samples of approximately equal size. Then, each sub-sample
method shown above. was used as a validation sample and the other 19 sub-samples were
combined as a training sample. The sequential logit models were
4.1.2. Second-stage model – injury crashes developed for different training samples. Note that the explana-
At the second stage, the upstream traffic density (represented tory variables included in the binary logit model at each stage were
by DetOccu ), the downstream traffic volume (represented by the same variables shown in Table 3. Eqs. (7)–(10) were used to
VehCntd ), the weather conditions (represented by weather), the calculate the crash likelihood at different severity levels for the
peak period (represented by Peak), and road surface width (repre- observations in each validation sample.
sented by Widths ) were found to be significantly correlated with
the risk of injury and fatality once a crash happens. The negative 4.2.2. Measure of performance and results
coefficients of the two traffic flow variables indicate that an injury is The prediction accuracy of a model of binary outcome (event = 1
less likely to result from a crash that occurs in more intense and con- and non-event = 0) can be measured with two complementary indi-
gested traffic. The interpretation here is that more congested traffic cators: (1) the proportion of events predicted as an event (true
tends to be slower than less congested conditions; thus crashes positive rate) called sensitivity in SAS, and (2) the proportion of
tend to occur at relatively low speeds, thereby decreasing the risk non-events predicted as a non-event (true negative rate) called
of injury and fatality. The findings from the aggregate crash predic- specificity in SAS. The model predicts an event if the predicted prob-
tion models in previous studies also demonstrated that the crashes ability of event exceeds a pre-specified threshold. The sensitivity
occurred in congested traffic conditions are likely to be less severe and specificity depend on the same threshold varying between 0
(Shefer, 1997; Chang and Xiang, 2003; Wang et al., 2009). The aver- and 1. A convenient and meaningful tool for evaluating the pre-
age elasticity for the upstream occupancy (DetOccu ) was −0.202 diction performance of a logit model is a curve called the Receiver
which indicates that the 1% increase in the upstream occupancy Operating Characteristic (ROC), which compares the sensitivity and
will result in 0.202% reduction in the probability of injury crash. the 1 − specificity for a threshold running from 0 to 1. Thus, the ROC
The average elasticity of −0.303 for the downstream flow indicates curve is a graphical plot of the sensitivity (y-axis) vs. 1 − specificity
that the probability of injury crash decreases by 0.303% for each 1% (x-axis). To generate the ROC curve for each severity level, we cal-
increase in VehCntd . The variable Widths had negative coefficients, culated the sensitivity and 1 − specificity for multiple thresholds by
indicating that the risk of injury and fatality decreases with the using all of the validation samples. Fig. 5 presents the ROC curves
increase in road surface width. Large road surface width gives for four possible outcomes for the sequential logit model: (1) crash
drivers more space for taking crash avoidance maneuver and reduc- at any severity level, (2) property-damage-only crash (PDO), (3)
ing crash severity. Both variables Peak and Weather had negative non-incapacitating or possible injury crash (BC), and (4) fatal or
coefficient, indicating that the peak period and adverse weather incapacitating injury crash (KA).
conditions could decrease the risk of injury and fatality. These are The areas under the ROC curves for the four possible outcomes
consistent with the findings of previous studies (Das and Abdel-Aty, were found to be 0.784, 0.803, 0.758, and 0.797 respectively, indi-
2010; Khattak and Knapp, 2001; Kim et al., 2013). cating that the sequential logit model can provide good predictive
performance. Table 4 summarizes the crash prediction perfor-
4.1.3. Third-stage model – fatal and incapacitating injury crashes mance at different severity levels for the developed sequential logit
The model estimated in the third stage implies that the aver- model. The prediction performance is measured by the percent of
age speed measured at the upstream detector station (AvgSpdu ), predicted crashes at each severity level and for several false alarm
the difference in speeds between adjacent lanes at the upstream (false positive) rates. As shown in Table 4, the prediction accuracy
station (SpdDifu ) and the downstream traffic flow (represented by of crashes increased as the false alarm rate was increased. This
C. Xu et al. / Accident Analysis and Prevention 57 (2013) 30–39 37
Fig. 5. The ROC curve of each severity level for the sequential logit model.
Table 4 trade-off between the prediction accuracy and the false alarm rate
Prediction performance at different false alarm rates.
must be considered when setting a threshold value and needs to
1 − specificity Sensitivity of the sequential model be determined carefully to meet the requirement of the practical
implementation or the preference of a specific traffic agency. After
PDO BC KA
determining the threshold, the ROC curve can be easily used to
0.02 22.8% 16.5% 22.4%
estimate the predictive performance. For example, if a threshold
0.1 48.5% 38.0% 37.9%
0.2 66.2% 59.5% 60.3% value is selected to accept a 30% false alarm rate, the prediction
0.3 75.7% 67.5% 75.9% accuracy of PDO, BC, KA crashes is found to be 75.7%, 67.5% and
0.4 82.3% 76.5% 84.5% 75.9%, respectively on the ROC curve. For comparison purpose, we
0.5 87.2% 84.5% 91.4% also briefly summarized the previous studies regarding the real-
time crash risk prediction modes in Table 5. Compared with the
predictive performance of the models in previous studies in Table 5,
the predictive performance of the models in this study is good.
Table 5
Summary of predictive performance of the real-time crash risk models in previous studies.
Authors Prediction accuracy of Prediction accuracy of False alarm Sample sized of Sample size of
crash non-crash rate crash non-crash
4.2.3. Implementation discussion The sequential logit model was applied to link the likelihood of
One possible real-time implementation of a model such the one crash occurrences at different severity levels with various traffic
presented in this paper is calculating the probabilities of crashes flow characteristics. The real-time traffic and crash data utilized in
at certain levels of severity between specific detector stations and this study were obtained on the I-880 freeway in California, United
warning drivers entering the segment about the high risk. It is States. The model estimation results showed that the traffic flow
imperative, however, in such an implementation, that the rate of characteristics contributing to crash probability were found to vary
false positives is not too high because of the anticipated erosion of substantially across different crash severity level. In general, the
drivers’ attention to the message. Although drivers cannot tell in low severity crashes (PDO) tended to occur in congested traffic flow
advance which alerts will not be followed by actual crash events, conditions with highly variable speed and frequent lane changes.
they will soon realize the high frequency of such alerts. For exam- The injury crashes (KA and BC) were found to occur more often in
ple, let us assume the false alarm rate is 50% and a driver who travels less congested traffic flow conditions. The KA crashes, in particular,
daily to work on a five-mile freeway section with 10 pairs of detec- occurred under uncongested traffic flow conditions as well as with
tor stations and who will be subject to five alarms on average during large differences in speed between adjacent lanes. The elasticity
a one-way trip. Even if drivers are not familiar with the frequency analysis was conducted at each stage of the sequential logit model
of crashes, they may become immune to the alerts quickly if no estimation to evaluate the effects of the traffic flow variables on the
crashes are apparent. If the false alarm rate is set at much lower likelihood of crash severity.
rate, say 1/50, then the traveler in the considered example would The 20-fold cross-validation method was applied to evaluate the
witness one false alarm a week. However, setting the false alarm predictive performance of the developed sequential logit model.
rate at 0.02% makes the prediction performance rather low. Fortun- The validation results demonstrate that the predictive performance
ately, it seems that predictive accuracy of the most severe crashes of the developed models was deemed satisfactory. The predictive
is not too bad. Being able to predict approximately 22% of inca- accuracy of PDO, BC, KA crashes is found to be 75.7%, 67.5% and
pacitating injury or fatal crashes and preventing, hopefully, most 75.9% when the false alarm rate was equal to 30%. As expected, there
of them without eroding the alertness of motorists might be suf- was a strong trade-off between the false alarm rate and the percent-
ficient justification for implementing real-time warning systems age of crashes predicted by the model, which is an important aspect.
utilizing models such the one presented in this paper. A model should have a reasonably low false alarm rate in order to
reduce the danger of losing drivers’ responsiveness to alerts about
high-risk traffic conditions. The set of developed models were able
4.3. Temporal and spatial transferability
to correctly identify high-risk severe crash conditions in 22% of the
cases with the false alarm rate set at a safe level of 2%. This result
Previous studies have demonstrated that the real-time crash
provides hope that the real-time prediction of severe crashes is pos-
prediction models cannot be directly transferred from one road
sible, and implementation of this study’s results, based on warning
to another due to the difference in driver population and traffic
motorists, is practical.
patterns (Pande et al., 2011; Ahmed and Abdel-Aty, 2012). The
Other advanced dynamic safety management systems, such as
Bayesian updating approach could be used to improve the temporal
variable speed limits and ramp metering, may benefit from the
and spatial transferability of the developed sequential model above.
real-time detection of high-risk conditions as well if the connection
The Bayesian updating approach could update the old model as new
between these traffic control methods and safety is better under-
data becomes available. Assuming that a real-time crash prediction
stood. In these cases, the false alarm rate is not as critical as it is
model has been developed based on the historical data Y1 , and that
for warning-based solutions because the road users are not aware
the new data or data from other road Y2 are then obtained, the pos-
of the reason for a reduced speed limit or a ramp metering rate
terior distribution can be updated using Bayes’ theorem as follows:
change.
Anastasopoulos, P., Mannering, F., 2009. A note on modeling vehicle accident Liu, P., Wang, X., Lu, J., Sokolow, G., 2007. Headway acceptance characteristics of U-
frequencies with random-parameters count models. Accident Analysis and Pre- turning vehicles at unsignalized intersections. Transportation Research Record
vention 41, 153–159. 2027, 52–57.
Chang, G., Xiang, H., 2003. The Relationship Between Congestion Levels and Acci- Li, Z., Chung, K., Liu, P., Wang, W., Ragland, D., 2012. Surrogate safety measure
dents. Maryland State Highway Administration, Baltimore, MD. for evaluating rear-end collision risk near recurrent bottlenecks. In: Presented
Cosslett, S.R., 1981a. Efficient estimation of discrete-choice methods. In: Manski, at the 91th Annual Meeting of the Transportation Research Board, CD-ROM,
C., McFadden, D. (Eds.), Structural Analysis of Discrete Choice Data with Econo- Washington, DC.
metric Applications. MIT Press, Cambridge, MA, pp. 51–111. Oh, C., Oh, J., Ritchie, S., 2001. Real-time estimation of freeway accident likelihood.
Cosslett, S.R., 1981b. MLE for choice-based samples. Econometrica 49, 1289– In: Presented at 80th Annual Meeting of the Transportation Research Board,
1316. CD-ROM, Washington, D.C.
Das, A., Abdel-Aty, M., 2010. A genetic programming approach to explore the Oh, C., Oh, J., Ritchie, S., 2005. Real-time hazardous traffic condition warning sys-
crash severity on multi-lane roads. Accident Analysis and Prevention 42, tem: framework and evaluation. IEEE Transactions on Intelligent Transportation
548–557. Systems 6 (3), 265–272.
Elvik, R., Mysen, A.B., 1999. Incomplete accident reporting; meta-analysis of studies Olson, D., Delen, D., 2008. Advanced Data Mining Techniques. Springer, Berlin,
made in 13 countries. Transportation Research Record 1665, 133–140. Germany.
Golob, T., Recker, W., Pavlis, Y., 2008. Probabilistic models of freeway safety perfor- Pande, A., Abdel-Aty, M., 2006. Assessment of freeway traffic parameters lead-
mance using traffic flow data as predictors. Safety Science 46 (9), 1306–1333. ing to lane-change related collisions. Accident Analysis and Prevention 38 (5),
Gazis, D., Herman, R., Weiss, G.H., 1962. Density oscillations between lanes of a 936–948.
multilane highway. Operations Research 10, 658–667. Pande, A., Dasand, A., Abdel-Aty, M., Hassan, H., 2011. Real-time crash risk esti-
Hauer, E., Hakkert, A., 1988. Extent and some implications of incomplete accident mation are all freeways created equal? Transportation Research Record 2237,
reporting. Transportation Research Record 1185, 1–10. 60–66.
Hauer, E., 2006. The frequency-severity indeterminacy. Accident Analysis and Pre- Patil, S., Geedipally, S., Lord, D., 2011. Analysis of crash severities using nested
vention 38, 78–83. logit model—accounting for the underreporting of crashes. Accident Analysis
Hassan, H., Abdel-Aty, M., 2011. Exploring visibility-related crashes on freeways and Prevention 45, 646–653.
based on real-time traffic flow data. In: Presented at 90th Annual Meeting of the Savolainen, P., Mannering, F., Lord, D., Quddus, M., 2011. The statistical analysis
Transportation Research Board, CD-ROM, Washington, D.C. of highway crash-injury severities: a review and assessment of methodological
Hossain, M., Muromachi, Y., 2010. Evaluating location of placement and spacing of alternatives. Accident Analysis and Prevention 43, 1666–1676.
detectors for real-time crash prediction on urban expressways. In: Presented at Scott, A.J., Wild, C.J., 1986. Fitting logistic models under case-control or choice based
89th Annual Meeting of the Transportation Research Board, CD-ROM, Washing- sampling. Journal of the Royal Statistical Society Series B48 (2), 170–182.
ton, D.C. SAS Institute Inc, 2011. SAS/STAT(R) 9.2 User’s Guide, second edition.
Hossain, M., Muromachi, Y., 2011. Understanding crash mechanism and selecting Shively, T., Kockelman, K., Damien, P., 2010. A Bayesian semi-parametric model
appropriate interventions for real-time hazard mitigation on urban express- to estimate relationships between crash counts and roadway characteristics.
ways. Transportation Research Record 2213, 53–62. Transportation Research Part B 44, 699–715.
Hourdos, N., Garg, V., Michalopoulos, G., Davis, G., 2006. Real-time detection Shefer, D., 1997. Congestion and safety on highways: towards an analytical model.
of crash-prone conditions at freeway high-crash locations. Transportation Urban Studies 34 (4), 679–692.
Research Record 1968, 83–91. Wang, C., Quddus, M., Ison, S., 2009. The effects of area-wide road speed and cur-
Hubbard, S., Bullock, D., Mannering, F., 2009. Right turns on green and pedestrian vature on traffic casualties in England. Journal of Transport Geography 17 (5),
level of service: statistical assessment. Journal of Transportation Engineering 385–395.
135 (4), 153–159. Washington, S., Karlaftis, M., Mannering, F., 2003. Statistical and Econometric Meth-
Jung, S., Qin, X., Noyce, D., 2010. Rainfall effect on single-vehicle crash severities ods for Transportation Data Analysis. Chapman & HALL/CRC.
using polychotomous response models. Accident Analysis and Prevention 42 Xu, C., Liu, P., Wang, W., Li, Z., 2012a. Evaluation of the impacts of traffic states on
(1), 213–224. crash risks on freeways. Accident Analysis and Prevention 47, 162–171.
Khattak, A., Knapp, K., 2001. Interstate highway crash injuries during winter snow Xu, F., Tian, Z., 2008. Driver behavior and gap-acceptance characteristics at round-
and nonsnow events. Transportation Research Record 1746, 30–36. abouts in California. Transportation Research Record 2071, 117–124.
Kim, J., Ulfarsson, F., Kim, S., Shankar, V., 2013. Driver-injury severity in single- Xu, C., Liu, P., Wang, W., Li, Z., 2012b. Development of a crash risk index to identify
vehicle crashes in California: a mixed logit analysis of heterogeneity due to age real-time crash risks on freeways. In: Presented at the 91th Annual Meeting of
and gender. Accident Analysis and Prevention 50, 1073–1801. the Transportation Research Board, CD-ROM, Washington, DC.
Lee, C., Saccomanno, F., Hellinga, B., 2003. Real-time crash prediction model for Xu, C., Wang, W., Liu, P. A genetic programming model for real-time crash prediction
the application to crash prevention in freeway traffic. Transportation Research on freeways. IEEE Transactions on Intelligent Transportation Systems, in press.
Record 1840, 67–77. Yamamoto, T., Hashiji, J., Shankar, V., 2008. Underreporting in traffic accident data,
Lee, C., Park, P., Abdel-Aty, M., 2011. Lane-by-lane analysis of crash occurrence based bias in parameters and the structure of injury severity models. Accident Analysis
on driver’s lane-changing and car-following behavior. Journal of Transportation and Prevention 43, 1320–1329.
Safety and Security 3 (2), 108–122. Yu, R., Abdel-Aty, M., 2013. Utilizing support vector machine in real-time crash risk
Leckrone, S.J., Tarko, A.P., Anastasopoulos, P.C., 2011. Improving safety at high-speed evaluation. Accident Analysis and Prevention 51, 252–259.
rural intersections. In: Presented at 3rd International Conference on Road Safety Zheng, Z., Ahna, S., Monsere, C., 2010. Impact of traffic oscillations on freeway crash
and Simulation, CD-ROM, Indianapolis, IN. occurrences. Accident Analysis and Prevention 42, 626–636.