Lecture 2
Dr Nooraida Yakob
Dr Nor Asniza Ishak
Dr Muzirah Musa
• Summarising and displaying data, describing the distribution
– Data distribution. Constructing charts and graphs: bar chart, histogram, box plot, stem and leaf plot
– Describing the distribution: calculating mean, mode and median, standard deviation and variance, skewness of a distribution
Data distribution.
Constructing charts and graphs – bar chart, histogram,
box plot, stem and leaf plot
Mathematics scores
63, 59, 53, 61, 58, 63, 77, 65, 59, 63
59, 59, 88, 58, 59, 64, 60, 64, 75, 66
62, 61, 60, 70, 64, 63, 59, 62, 65, 76
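A stem and leaf plot of these scores can be built by hand or with a few lines of code. The sketch below is only an illustration: the function name `stem_and_leaf` is made up for this example, and it assumes two-digit scores, with the tens digit as the stem and the units digit as the leaf.

```python
from collections import defaultdict

# The 30 mathematics scores from the slide, in their original order.
scores = [
    63, 59, 53, 61, 58, 63, 77, 65, 59, 63,
    59, 59, 88, 58, 59, 64, 60, 64, 75, 66,
    62, 61, 60, 70, 64, 63, 59, 62, 65, 76,
]


def stem_and_leaf(values):
    """Group values by tens digit (stem); the units digits are the leaves."""
    groups = defaultdict(list)
    for value in sorted(values):
        groups[value // 10].append(value % 10)
    return dict(groups)


plot = stem_and_leaf(scores)
for stem in range(min(plot), max(plot) + 1):
    leaves = " ".join(str(leaf) for leaf in plot.get(stem, []))
    print(f"{stem} | {leaves}")
# 5 | 3 8 8 9 9 9 9 9 9
# 6 | 0 0 1 1 2 2 3 3 3 3 4 4 4 5 5 6
# 7 | 0 5 6 7
# 8 | 8
```

Counting the leaves on each stem (9, 16, 4 and 1) gives the frequencies that a histogram with class width 10 would show.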
[Figure 3.14 (Step 5). Source: Prem Mann, Introductory Statistics, 7/E. Copyright © 2010 John Wiley & Sons. All rights reserved.]
Boxplot
Extreme outliers – ID 13
Outliers – ID 7, 19, 30
IQR – interquartile range
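The case IDs flagged on the boxplot can be reproduced from the mathematics scores. The sketch below assumes a common convention that the slides do not state: a value more than 1.5 × IQR outside the box is an outlier, and more than 3 × IQR outside is an extreme outlier. It also assumes the default quartile rule of Python's `statistics.quantiles`; other software may compute the quartiles slightly differently. The helper name `classify` is made up for this example.

```python
import statistics

# The 30 mathematics scores; the case ID is the position in this list (1 to 30).
scores = [
    63, 59, 53, 61, 58, 63, 77, 65, 59, 63,
    59, 59, 88, 58, 59, 64, 60, 64, 75, 66,
    62, 61, 60, 70, 64, 63, 59, 62, 65, 76,
]

# Quartiles with the default "exclusive" method (an assumption, see above).
q1, median, q3 = statistics.quantiles(scores, n=4)
iqr = q3 - q1


def classify(score):
    """Label a score by how far it lies outside the box (Q1 to Q3)."""
    distance = max(q1 - score, score - q3)
    if distance > 3 * iqr:
        return "extreme outlier"
    if distance > 1.5 * iqr:
        return "outlier"
    return "typical"


flagged = {
    case_id: classify(score)
    for case_id, score in enumerate(scores, start=1)
    if classify(score) != "typical"
}

print(f"Q1 = {q1}, median = {median}, Q3 = {q3}, IQR = {iqr}")
print(flagged)
```

Under these assumptions the flagged cases are ID 7, 19 and 30 (outliers) and ID 13 (extreme outlier), which agrees with the labels on the boxplot.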
Describing the distribution, calculating mean,
mode and median, standard deviation and
variance, skewness of a distribution.
Central Tendency Measurement
A numerical value that represents the centre point of a data set
Indicates where most values in a distribution fall
Describes the whole data set with one value
Three types:
Mode
Median
Mean
Central Tendency Measurement
3, 7, 9, 4, 5, 4, 6, 9, 9
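Mode = 9 (note that the mode ≠ 3: the value 9 occurs 3 times, and the mode is the value, not its frequency)
Median = 6 (sorted data: 3, 4, 4, 5, 6, 7, 9, 9, 9; the middle, 5th, value is 6)
Mean = (3+7+9+4+5+4+6+9+9)/9 = 56/9 = 6.22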
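The three measures for this data set can be checked with Python's standard `statistics` module. This is a minimal sketch for checking the hand calculation, not part of the original slides.

```python
import statistics

data = [3, 7, 9, 4, 5, 4, 6, 9, 9]

mode = statistics.mode(data)      # most frequent value
median = statistics.median(data)  # middle value of the sorted data
mean = statistics.mean(data)      # sum of the values divided by their count

print("Mode   =", mode)
print("Median =", median)
print("Mean   =", round(mean, 2))
```

The mode returned is the value 9, not the count 3, which is the point of the note on the slide.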
Central Tendency Measurement
Mean
  Advantages: used the most, as it is stable; takes all the data into account; suitable for continuous data
  Weaknesses: affected by extreme values, e.g. 22, 25, 21, 26, 67 gives mean = 32.20
Median
  Advantages: not affected by extreme values
  Weaknesses: only takes the value in the middle
Mode
  Advantages: the simplest measurement; suitable for discrete data
  Weaknesses: some data sets have no mode, e.g. 11, 14, 19, 16, 25, 36
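The two small data sets quoted above can be used to see these weaknesses in practice. The sketch below is an illustration only; the variable names are made up for this example, and the comparison "without the extreme value" is an added step that is not on the slide.

```python
import statistics

# Data set with one extreme value (67).
values = [22, 25, 21, 26, 67]
mean_all = statistics.mean(values)               # pulled upward by 67
median_all = statistics.median(values)           # middle value, unaffected by 67
mean_without_67 = statistics.mean(values[:-1])   # mean of 22, 25, 21, 26 only

# Data set in which no value repeats, so there is no mode.
no_repeat = [11, 14, 19, 16, 25, 36]
candidates = statistics.multimode(no_repeat)     # every value ties with count 1
has_mode = len(candidates) < len(no_repeat)

print("Mean with 67    =", mean_all)
print("Mean without 67 =", mean_without_67)
print("Median with 67  =", median_all)
print("Has a mode?     =", has_mode)
```

The mean moves from 23.5 to 32.2 when 67 is included, while the median stays at 25, close to most of the values.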
Measures of Dispersion
A measurement that shows how much the values in a data set differ from one another or from the centre of the data set
Variance: $\sigma^2 = \frac{\sum (x - \bar{x})^2}{N}$
Standard deviation: $\sigma = \sqrt{\sigma^2}$
Data: 4, 6, 9, 3, 5, 12, 10

Mean: $\bar{x} = \frac{\sum x}{N} = \frac{4 + 6 + 9 + 3 + 5 + 12 + 10}{7} = \frac{49}{7} = 7$

x      x − x̄           (x − x̄)²         x²
4      4 − 7 = −3      (−3)² = 9        16
6      6 − 7 = −1      (−1)² = 1        36
9      9 − 7 = 2       (2)² = 4         81
3      3 − 7 = −4      (−4)² = 16       9
5      5 − 7 = −2      (−2)² = 4        25
12     12 − 7 = 5      (5)² = 25        144
10     10 − 7 = 3      (3)² = 9         100
Total                  Σ(x − x̄)² = 68   Σx² = 411

Variance from the deviations: $\sigma^2 = \frac{\sum (x - \bar{x})^2}{N} = \frac{68}{7} = 9.71$

Variance from the squares: $\sigma^2 = \frac{\sum x^2}{N} - \bar{x}^2 = \frac{411}{7} - 7^2 = 9.71$
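Both routes to the variance in this worked example can be checked numerically. The sketch below divides by N, as the slide does; the variable names are made up for this example, and `statistics.pvariance` is used only as a cross-check because it also divides by N.

```python
import math
import statistics

x = [4, 6, 9, 3, 5, 12, 10]
n = len(x)
mean = sum(x) / n  # 49 / 7

# Route 1: mean of the squared deviations from the mean.
sum_sq_dev = sum((value - mean) ** 2 for value in x)  # 68
var_from_deviations = sum_sq_dev / n

# Route 2: mean of the squares minus the square of the mean.
sum_sq = sum(value ** 2 for value in x)  # 411
var_from_squares = sum_sq / n - mean ** 2

# Standard deviation is the square root of the variance.
std_dev = math.sqrt(var_from_deviations)

print("Variance (deviations) =", round(var_from_deviations, 2))
print("Variance (squares)    =", round(var_from_squares, 2))
print("pvariance cross-check =", round(statistics.pvariance(x), 2))
print("Standard deviation    =", round(std_dev, 2))
```

Both routes give 9.71, and the standard deviation comes out at about 3.12, in the same unit as the data.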
Variance & Standard Deviation
Advantages:
More accurate, as the calculation involves all values in the data set
Measures the dispersion of each value from the mean of the data set
The standard deviation is the square root of the variance, so it is not expressed in squared units: its unit is the same as the unit of the data.
Skewness Dispersion
Positive skewness – tail on the right side
Negative skewness – tail on the left side
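The direction of the tail can also be read from a number. The sketch below uses the moment coefficient of skewness (third central moment divided by the cube of the standard deviation), which is one common measure and is not given in the slides; the function name `skewness` and the mirrored data set are made up for this example. A positive result suggests a tail on the right, a negative result a tail on the left.

```python
import math

# The 30 mathematics scores from earlier in the lecture.
scores = [
    63, 59, 53, 61, 58, 63, 77, 65, 59, 63,
    59, 59, 88, 58, 59, 64, 60, 64, 75, 66,
    62, 61, 60, 70, 64, 63, 59, 62, 65, 76,
]


def skewness(values):
    """Moment coefficient of skewness: m3 / m2**1.5 (population moments)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # population variance
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Mirror the scores around the midpoint of the range to flip the tail.
mirrored = [max(scores) + min(scores) - s for s in scores]

skew_scores = skewness(scores)
skew_mirrored = skewness(mirrored)

print("Scores:  ", "positive" if skew_scores > 0 else "negative", "skewness")
print("Mirrored:", "positive" if skew_mirrored > 0 else "negative", "skewness")
```

The scores have a few high values (75 to 88) far from the bulk around 59 to 65, so the coefficient is positive; mirroring the data moves the tail to the left and changes the sign.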
Thank you