Chapter 7: Statistical Applications in Traffic Engineering
Chapter 7: Statistical Applications in Traffic Engineering
Chapter 7: Statistical Applications in Traffic Engineering
Traffic Engineering
Chapter objectives: By the end of these chapters the student will be
able to (We spend 3 lecture periods for this chapter. We do skip
simple descriptive stats because they were covered in CE361.):
Lecture Lecture Objectives
number (after these lectures you will be able to)
Lecture 3 Apply the basic principles of statistics contained in section 7.1 to traffic data analyses
(Chap 7a file) Explain the characteristics of the normal distribution and read the normal distribution
table correctly (section 7.2) and get necessary values from Excel.
Explain the meaning of confidence bounds and determine the confidence interval of
the mean (section 7.3)
Determine sample sizes of traffic data collection (section 7.4)
Explain how random variables are added (section 7.5)
Explain the implication of the central limit theorem (section 7.5.1)
Explain the characteristics of various probabilistic distributions useful for traffic
engineering studies and choose a correct distribution for the study(section 7.6)
Lecture 4a Explain the special characteristics of the Poisson distribution and its usefulness to
(Chap 7b file) traffic engineering studies (section 7-7)
Conduct a hypothesis test correctly (two-sided, one-sided, paired test, F-test) (section
7-8)
Most of the topics in this section are reviews of what we have learned
in CEEn 361. (Review 7.1.1, 7.1.3 and 7.1.4 by yourself.)
z = (x - )/ (Discuss the 3
procedures in p.
137 left column
= (65 55)/7 top)
= 1.43
Z = 1.43
X
Point estimate of X from a
sample
X
Two-sided interval
X tas/sqrt(n) X + tas/sqrt(n) estimate
7.3 (cont)
Y ai X i
Expected value (or mean) of the random variable Y:
Y ai xi
a
2
Y
2
i
2
xi
work. See the
sample problems in
page 140.
7.5.1 The central limit theorem
f (X )
F(x)
approaches
X
x
X
X distribution X distribution
X ~ any (, 2) X ~ N ( , X 2 )
7.6 The Binomial Distribution Related to the
Bernoulli and Normal Distributions
7.6.1 Bernoulli and the Binomial distribution (discrete
probability functions))
In this inspection,
15% of the unsafe vehicles are determined to be safe Type II error (bad error)
and 5% of the safe vehicles are determined to be unsafe Type I error
(economically bad but safety-wise it is better than Type II error.)
Types of errors
We want to minimize
especially Type II
Reality Decision error.
One-sided upper
Two-sided or one-sided test
12 22
82 82
Y 1.53
|1 - 2| = |60-55| = 5 > zc
n1 n2 55 55
z / 2 1.96 z 1.65
By either test, H0 is
At significance level = 0.05 (See rejected.
Table 7-3.)
7.8.3 Other useful statistical tests
The t-test (for small samples, n<=30) Table 7.6:
x1 x2 n1 1s12 n2 1s22
t sp
sp 1 n1 1 n2 n1 n2 2
When the t-test and other similar means tests are conducted,
there is an implicit assumption made that s1=s2. The F-test can
test this hypothesis.
s12
F 2 The numerator variance > The denominator
variance when you compute a F-value.
s2
If Fcomputed Ftable (n1-1, n2-1, a), then s1s2 at a
asignificance level.
You perform a paired difference test only when you have a control
over the sequence of data collection.
e.g. Simulation You control parameters. You have two different
signal timing schemes. Only the timing parameters are changed. Use
the same random number seeds. Then you can pair. If you cannot
control random number seeds in simulation, you are not able to do a
paired test.
Table 7-8 shows an example showing the benefits of paired testing
The only thing changed is the method to collect speed data. The
same vehicles speed was measure by the two methods.
Paired or not-paired example (table 7.8)
20
15
10
5.0-5.2
5.2-5.4
5.4-5.6
5.6-5.8
5.8-6.0
6.0-6.2
6.2-6.4
6.4-6.6
6.6-6.8
6.8-7.0
Observed Freq Theoretical Freq
The authors intentionally used the uniform distribution to make the computation simple. We will
test a normal distribution I class using Excel.
Steps of Chi-square (2-) test
Chi-square (2-)
Expected test Actual
distribution (or histogram
histogram)