02.1 Descriptive Statistics
02.1 Descriptive Statistics
STATISTICAL METHODS
Descriptive Inferential
Tabular Estimation
Numerical
3
Descriptive Statistics
ETHOD
U L A R M
TAB HOD
AL MET
AP H IC THOD
Frequency GR ICAL ME
Bar graph NUMER
Percentage
Line graph Central Tendencies
Cumulative Dot/Scatter plot Measures of Variability
The Mean
!"# $% &'' $(!)*+&,-$.!
• MEAN = ."#()* $% $(!)*+&,-$.!
5
1
Population Mean: 4 = 2 /-
5
-31
Sample Mean: .
1
/ = 2 /-
.
-31
02. DESCRIBING DATA WITH NUMBERS 7
The Mean
Some properties of the Mean:
1. It always exists
2. It is unique
3. It reflects the magnitude of every observation
4. It is easily affected by extreme values
5. The mean of the subgroups can be combined into the
overall mean of all the data, called the weighted
mean.
8
• MEDIAN
The Median is the midpoint of a distribution, the number such
that half the observations are smaller, and the other half are
larger.
The Median
Some properties of the Median:
1. It is a positional value
2. Extreme values do not affect the median
as strongly as they do the mean.
ASLatonio
The Mode
• The Mode is the most common observation in
a data set.
! = 1 62 9 Valen F 4.7
0
234
ASLatonio
The Range
Range
= highest value – lowest value
= HV – LV
ASLatonio
Variance
Variance is the mean of the squared
deviations of the observations from the mean
*
Population variance: 1
!" = & +' − + -
%
'()
0
-
1 -
Sample variance: . = & +' − +
/−1
'()
ASLatonio
Computation formula
for Sample Variance
Sample variance (computation formula):
+ + "
1 1
!" = "
' ,( − ' ,(
%−1 %
()* ()*
ASLatonio
! = !#
• Population SD:
• Sample SD: $ = %#
20
Measures of Variability
Interpretation of Standard Deviation:
- the greater the value of the standard
deviation, the more the observations scatter
from the mean.
Standard Error
The Standard Error is the standard deviation
of the sample mean
Population SE: !
"
#
Sample SE: "
ASLatonio
Coefficient of Variation
!"#$%#&% %'()#")*$
CV = ×-..%
+'#$
0
×100%
=1
ASLatonio
Example: Range
ID Name SEX Hours
Range 1 Adrielle M 3.5
2 Marvin M 4.2
R = HV – LV 3 Arianne F HV =
= 5.8 – 1.3 5.8
= 4.5 hrs 4 Sarah F LV =
1.3
5 Kit M 4.1
The length of hours spent performing a 6 Princess F 4.5
specific task ranges from 1.3 to 5.8 hrs. 7 Angelee F 3.2
8 Barbz F 3.0
9 Valen F 4.7
ASLatonio
ID Hours (X i) X i2
1 1 6 4.5 20.25
!" = 143.61 − 34.3 "
9−1 9
1 1 7 3.2 10.24
= 143.61 − 1176.49
8 9
8 3.0 9.00
1
= 143.61 − 130.72
8
9 4.7 22.09
1
= 12.89
8
= 1.61 squared hrs Total 34.3 143.61
ASLatonio
! = !# 2 Marvin M 4.2
3 Arianne F 5.8
= 1.61 4 Sarah F 1.3
Percentiles
Deciles
Quartiles
29
Measures of Position
Percentile – is a value below which a
certain percentage of observations lie.
Measures of Position
Percentile
!(#$%)
Position of the Pth percentile =
%''
31
• Arrange first the data from lowest to highest. Assign X 1 as the lowest
value and X 22 as the highest value. Identify the 5 th and 6 th values to
solve for the value of the 25 th percentile as follows.
Measures of Position
Decile
Measures of Position
Quartile
Interquartile Range
Measure of Skewness
Measure of Kurtosis
36
Further descriptions
Skewness
Mode Mean
Median
Mean Mode
Mean Median
Median
'
Cs @ 0, approaches normality
! ∑ #$ %& Cs > 0, positively skewed
Cs = !%( (!%*), ' Cs < 0, negatively skewed
. Sk @ 0, approaches normality
-(&%&) Sk > 0, positively skewed
Sk = , Sk < 0, negatively skewed
38
Kurtosis
– is a measure of a distribution’s peakedness
(or flatness)
Coefficient of Kurtosis
C k @ 0, mesokurtic
!" ∑ $% &' (
Ck = !&) (!&+)(!&-). (
−3 C k > 0, leptokurtic
C k < 0, platykurtic
Leptokurtic
Mesokurtic
platykurtic
39
DEPARTMENT OF
STATISTICS
En j oy
d y i n g
Stu
t i s t i c s J
Sta