Lecture 2 - Descriptive Statistics
Lecture 2 - Descriptive Statistics
DESCRIPTIVE
STATISTIC
DR HAR I ATI ABDULLAH HAS HI M
hariati@ u tm .m y
Measure of central
tendency
Lecture
Content Measure of dispersion
0-2 1
2-4 2 ∑𝑓𝑖𝑥𝑖
4-6 2
𝑥ҧ =
6-8 4
∑𝑓𝑖
8-10 6
10-12 2
12-14 3
Normally used to average indexed and percent
(e.g percent increase in sales, production or other
business from one time to another)
Growth
rate
Geometric 𝑮𝑴 = 𝒏
(𝒙𝟏)(𝒙𝟐)(𝒙𝟑)(𝒙𝟒)(𝒙𝒏)
Mean
OR
𝑛
𝐺𝑀 = (𝑥1)(𝑥2)(𝑥3)(𝑥4)(𝑥𝑛)
3
𝐺𝑀 = 1 + 0.1 1 − 0.2 1 + 0.3
Example
Geometric Mean in Portfolio Return
Consider a portfolio of stocks that goes up from RM100 to RM110 in year one, then declines to
RM80 in year two and goes up to RM150 in year three. The return on portfolio is then
calculated as follows:
3 150
𝐺𝑀 = −1
100
Weighted mean is calculated when
certain values in a data set are more
important than the others.
A weight wi is attached to each of the
Weighted values xi to reflect this importance.
Step 1:
Trim the top and bottom 20% from the data. That leaves us with the middle three values:
Step 2:
Find the mean with the remaining values. The mean is (81 + 83 + 91) / 3 ) = ?.
Measure of Central Tendency
As the data becomes skewed the mean loses its ability to provide
the best central location for the data because the skewed data is
dragging it away from the typical value.
MEDIAN
• Median is the middle value that lies in the centre of the
data when the values are ranked in ascending or
descending order
0 – 10 10
10 – 20 20
20 – 30 30
30 – 40 40
40 – 50 50
50 – 60 30
Marks No. of Students
Step 1 : Obtain the 0 – 10 10
cumulative
10 – 20 20
frequencies
20 – 30 30
30 – 40 40
40 – 50 50
Step 2 : determine 50 – 60 30
the location of
median class interval
using cumulative
frequency column
Step 3 : calculate
the value of
median
Measure of Central Tendency
The mode is useful when the most common item, characteristic or value of a data set
is required.
Standard
Range Variance
Deviation
Range
• Range is the difference between the largest value (maximum) and the smallest (minimum) value
in the data set.
Marks Frequency
20 – 29 4
30 – 39 8
40 – 49 20
50 – 59 16
60 -69 9
70 – 79 3
Standard
Range Variance
Deviation
Variance and SD
Variance and SD are the most common measures of dispersion for continuous data
Variance and SD is used to describe how far the individual value disperse from the mean value
Find out the Mean, Variance and SD for the 5 days of business.
SKEWNESS IN
RELATION TO MEAN,
MODE, MEDIAN
Normal distribution – BELL CURVE
Skewness
▪A distribution can be
symmetric, skewed to right or
skewed to the left
▪Pearson’s coefficient of
skewness is usually used to
measure the skewness of the
distribution
Thank You