Central Tendency - Lecture Notes
Central Tendency - Lecture Notes
Data Distribution
Chapter 1 - Central Tendency
Statistics
Descriptive Inferential
Statistics Statistics
Central Tendency
Central Tendency
Descriptive Statistics Outcome is one value
Definition:
To tell us
Name Age
John 33
Mark
Susan
Joe
28
25
46
? Average
= Mean
Descriptive Inferential
Statistics Statistics
Central Tendency
Name Age
John 33
Mark 28
33 + 28 + 25 + 46 + 32 + 29 + 42 + 21
Susan 25
8
Joe 46
Ema 32
Julia 21
Calculating Mean
Day Sales
Sunday 9500
Monday 100
9500 + 100 + 50 + 150 + 100 + 150 + 100
Tuesday 50
7
Wednesday 150
○ e.g. data[9500, 100, 50, 350], then mean is 9500 + 100 + 50 + 350 / 4 = 2500
○ in above example, mean value is too far high then most of our data in the dataset
● Formula,
○ Mean = 𝝨x/N
Median
● Middle value in our dataset
● Must sort out the data from low to high
○ e.g. data[150, 50, 600, 200, 350]
○ sorted_data[50, 150, 200, 350, 600]
○ therefore, median value is 200
● Formula
○ {(n + 1) ÷ 2}th value
Descriptive Inferential
Statistics Statistics
Central Tendency
Weighted:
Grade Point 85
Frequency Distribution
0-2 11
3-5 8
6-8 5
9-11 3
12-14 1
15-17 2
Mean/Avg = 150/30 = 5
Mean of Grouped Data - Calculation
Descriptive Inferential
Statistics Statistics
Central
Tendency Variability
Descriptive Inferential
Statistics Statistics
Central
Tendency Variability
● the difference between the largest number and the smallest number
○ e.g. data[100, 50, 200, 50, 150]
○ therefore, range is 200 - 50 = 150
● Formula
○ Range = Largest number - Smallest number
● Calculating Variance
Step 2 & 3
● If the variance is too small, then our data is very close to the mean
○ E.g. data[15, 17, 16, 14, 18, 16]
○ Variance = 1.67, Mean = 16
○ Since the value of variance is small, each data point is not much far from mean
● If the variance is large, then our data is very far from the mean
○ E.g. data[13, 3, 40, 12, 3, 25]
○ Variance = 170, Mean = 16
○ Since the value of variance is large, each data point is considered far from mean
Standard Deviation - Interpretation
● the value of standard deviation shows us how far each data is deviated
from the mean
● Formula
○ take the square root of Variance
● Divide the data set into four equal segments after arranging in ascending order
Quartiles
2, 3, 5, 7, 10, 11, 13, 16, 18 ● Step 4: Find the median value of upper half of the data set, i.e. Q3
Step 2 Step 2
Step 2
IQR = Q3 - Q1
OR
Q3
Q3 + 1.5 IQR
(53+57)/2 = 55
55 + (1.5 x 12.5) = 73.75
Q1