Civil Stat 02
Civil Stat 02
Lecture (2)
Graphical Presentation of Data
Numerical Descriptive Measures
• Frequency Polygon
• Frequency Curve
◼ Find range: 58 - 12 = 46
◼ Select number of classes: 5 (usually between 5 and 15)
◼ The smaller the number of classes, the greater the loss of information
◼ Compute class interval (width): 10 (46/5 then round up)
◼ Determine class boundaries (limits): 10, 20, 30, 40, 50, 60
Lecture 2- Page 3
Dr. Mohamed Gad
Frequency Table
7 6
6 5 No Gaps
Frequency
5 4
4 3
Between
3 2 Bars
2
1 0 0
0
5 15 25 35
36 45 55 More
Class Boundaries
Dr. Mohamed Gad
Class Midpoints Lecture 2 – Page 5
Graphing Numerical Data:
The Frequency Polygon
Data in ordered array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Frequency Polygon
7
6
5
4
3
2
1
0
5 15 25 36 45 55 More
Frequency Curve
7
6
5
4
3
2
1
0
5 15 25 36 45 55 More
0 0
5 15 25 35
36 45 55 More
Relative
Class No. Class Interval Frequency
3 Description Frequency
j I j (m /s) fj
rf j
1 (1200, 1800) 1200 but under 1800 16 0.32
2 (1800, 2400) 1800 but under 2400 18 0.36
3 (2400, 3000) 2400 but under 3000 8 0.16
4 (3000, 3600) 3000 but under 3600 5 0.1
5 (3600, 4200) 3600 but under 4200 2 0.04
6 (4200, 4800) 4200 but under 4800 1 0.02
Total 50 1.00
Relative
Frequency
Histogram
100
90
80
70
60
%F
50
40
30
20
10
0
2 6 10 14 18 22
Classes
Central Tendency
Average or
Arithmetic Mean
1 3 5 7 24
Mean = 8
Dr. Mohamed Gad Lecture 2 – Page 20
Median
◼ The variate value that divides the data into two equal
halves
◼ Not affected by extreme values
1 3 5 7 9 Median = 5
Median = 5
1 3 5 7 24
0 1 2 3 4 5 6
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
No Mode
Mode = 9
Dr. Mohamed Gad Lecture 2 – Page 22
Measures of Variation
Variation
◼ Measure of variation
◼ Difference between the largest and the
smallest observations:
Range = X Largest − X Smallest
◼ Ignores the way in which data are distributed
Range = 12 - 7 = 5 Range = 12 - 7 = 5
7 8 9 10 11 12 7 8 9 10 11 12
( X −X)
◼
2
i
S =
2 i =1
n −1
◼ Population variance:
N
( X −)
2
i
=
2 i =1
( X −X)
2
i
S= i =1
n −1
N
Population standard deviation:
( X −)
2
◼
i
= i =1
N
Dr. Mohamed Gad Lecture 2 – Page 27
Coefficient of Variation
S
CV = 100%
X
◼ Skewness Coefficient
◼ Describes how data is distributed
◼ Measure of shape
(X i − X)
n
3
n
For population or large sample CS = i =1
3/ 2
2
(X i − X )
n
i =1
◼ Corrected form of CS
(X i − X)
n
3
n2 n
For small sample CS = i =1
(n − 1)(n − 2) n 2
3/ 2
(X i − X )
i =1
Dr. Mohamed Gad Lecture 2 – Page 29
Shape of a Distribution
◼ Symmetric or skewed
CS < 0 CS = 0 CS > 0
Left-Skewed Symmetric Right-Skewed
Mean < Median < Mode Mean = Median =Mode Mode < Median < Mean
Sample Mean
f j X Classj
1 k
= f j X Classj
◼ j =1
X= k
f
n j =1
j
j =1
k
1
◼ Sample Variance S2 =
n − 1 j =1
f j ( X Classj − X ) 2
7
6
6
5
5
4
4
3
3
2
2
1
0 0
0
5 15 25 36
35 45 55 More
𝑋ሜ = 32.4 S2=160.57
7
6 𝑘
6 1
5 𝑋ሜ = 𝑓𝑗 𝑋𝐶𝑙𝑎𝑠𝑠𝑗
5 𝑛
4 𝑗=1
4 1
3 = 45 + 150 + 175 + 180 + 110
3
2
50
2 = 33
1
0
0 0
𝑆2
1
5 15 25 35
36 45 55 More
= σ𝑘𝑗=1 𝑓𝑗 (𝑋𝐶𝑙𝑎𝑠𝑠𝑗 ሜ 2 =153.68
− 𝑋)
𝑛−1
◼ Sample Mode
Δ1
M ode = L1 + C
Δ1 + Δ 2
C = L2 − L1
L1 L2
◼ Sample Mode
Δ1
M ode = L1 + C
Δ1 + Δ 2
C = L2 − L1
L1 L2
3
Mode = 20 + × 10 = 27.5
3+1
M edian = L1 +
N /2 − f i
C
fmedian
f median
f i = frequency until L1
C = L2 − L1
L1 L2
M edian = L1 +
N /2 − f i
C
fmedian
f median
f i = frequency until L1
C = L2 − L1
L1 L2
10 − 9
Median = 30 + × 10
5
= 32
Dr. Mohamed Gad Lecture 2 – Page 37
Coefficient of Correlation
◼ Measures the strength of the linear
relationship between two quantitative
variables
n
( X i − X )(Yi − Y )
r= i =1
n n
( X −X) (Y − Y )
2 2
i i
i =1 i =1
X X X
r = -1 r = -.6 r=0
Y Y
X X
Dr. Mohamed Gad
r = .6 r=1 Lecture 2 – Page 40