3.5 Descriptive Statistics
3.5 Descriptive Statistics
Descriptive Statistics
1. Skewness
2. Kurtosis
3. Modality
SKEWNESS
▪ The piling of scores on one end and tapers of
gradually at the other end
▪ The disproportionate concentration of lower or higher
scores is indicated by:
- Tendency of scores to pile on one end
- Tendency of the tail to point at the opposite end
shoulder
tail
KURTOSIS
▪ It tells us about the abundance or scarcity of outliers
(extreme scores)
▪ It is indicated by how heavy or how light the tails are.
▪ Remember:
Interpretation: There are more students who with lower than higher
levels of love shyness.
For Example:
1. Mean
2. Median
3. Mode
MEAN
▪ Also called as “arithmetic mean/average”
▪ Most commonly used average and is the most convenient
and versatile measure of central tendency.
▪ Computed by: adding all the scores in the distribution and
dividing by the number of scores.
▪ Symbols:
population – μ (read as ‘mew’)
sample – M or x̄ (read as x-bar)
MEAN
▪ Formula
Example: 2, 5, 3, 4, 9, 10, 7, 1
∑𝑋
μ= 𝑁
= 2 + 5 + 3 + 4 + 9 + 10 + 7 + 1
= 5.125
MEAN
▪ The mean can also be defined as:
- The amount each individual receives when the
total (∑𝑋) is divided equally among all the
individuals (N) in the distribution.
- The balance point for or the center of gravity
(fulcrum) of the distribution.
MEAN
▪ Consider this:
1, 2, 6, 6, 10
μ=5
MEAN
▪ It considers every score in the distribution. That
is, every scores adds to the total (∑X) and every
score contributes to the number of scores (N or
n).
▪ Changing the value of any score will change the
mean.
▪ Adding or removing a score will also change the
mean (note: unless the score you added has the
same value as the mean)
MEAN
▪ It is sensitive to outliers or extreme values.
▪ The presence of outliers pulls the mean towards
the tail or towards an outlying value.
▪ Keep in mind: only used when the data are
measured on an interval or ratio scale.
MEAN
▪ Weighted Mean
- When it is necessary to get the over all mean of
two or more group, we can compute for the
weighted mean.
Formula:
∑𝑋𝑁
μT =
𝑁𝑇
MEDIAN
▪ Is the value in the exact midpoint (middle) of a
distribution.
▪ The point that divides a distribution into two
equal halves: 50% of the scores above it and
50% below it.
▪ Cannot be found in nominal scales.
MEDIAN
▪ Appropriate to use when the distribution is
skewed and the scores then to concentrate on
one side of the distribution.
▪ It is unaffected by extreme scores and
skewness.
MEDIAN
▪ Obtaining the median when n is an odd number:
1. Array the scores from lowest to highest.
2. The median is the score in the middle of the
distribution.
Example: 26, 9, 3, 2, 13, 16, 5
= 2, 3, 5, 9, 13, 16, 26
=9
MEDIAN
▪ Obtaining the median when n is an even
number:
1. Array the scores from lowest to highest.
2. Add the two middle scores and divide the sum
by 2.
Example: 1, 4, 8, 7, 1, 5
=1, 1, 4, 5, 7, 8
= 4 + 5 / 2 -> 9/2
= 4.5
MODE
▪ It is the score or category that is most common or
typical, that is, it has the greatest frequency.
▪ It is the only central tendency that is applicable to all
four types of measurement scales.
▪ The only measure of CT that can be used for nominal
data
MODE
▪ The only descriptive statistic that can take up more
than one value.
▪ Remember: The mode does not represent the most
number of scores or observations.
▪ It is simply the most common score in the group,
which may or may not represent most number of
individuals.
Measures of CT: Proper
Utility of Each Measure
There are 4 things that we must consider:
Variance:
Sum of Squares!
∑(𝑿𝒊−𝝁)2
σ2 =
𝑵
VARIANCE (Mean Square)
∑(𝑿𝒊−𝝁)2
σ2 =
𝑵
= 10/6
= 1.67 km2
PROBLEMA SA VARIABILITY
Edi…
STANDARD
DEVIATION
STANDARD DEVIATION
Standard Deviation:
∑(𝑿𝒊−𝝁)2
σ=
𝑵
STANDARD DEVIATION
Why???
STANDARD DEVIATION
▪ Because…
∑(𝑿𝒊−𝝁)2
σ2 = σ=
∑(𝑿𝒊−𝝁)2
𝑵
𝑵
VARIANCE & STANDARD DEVIATION for SAMPLES
∑(𝑿𝒊−𝒙̄)2
s2 = s=
∑(𝑿𝒊−𝒙̄)2
𝒏−𝟏
𝒏−𝟏
∑(𝑿𝒊−𝝁)2
s2 = s=
∑(𝑿𝒊−𝝁)2
𝒏−𝟏
𝒏−𝟏
Answer:
➢ Such adjustment is necessary to correct for the bias in sample
variability.
➢ Dividing by a smaller number (n – 1) produces a larger results and
makes the sample variance closer to the population variance.
QUARTILE DEVIATION
QUARTILE DEVIATION
▪ Used when the distribution is skewed or have open-ended
scores
▪ Measures the dispersion only within the middle 50% of the
distribution and ignores the two ends
▪ Thus, the quartile deviation (Q) measures the distance from
the median to the boundaries that define the middle 50% of
the distribution.
𝐐𝟑 −𝐐𝟏
Q=
𝟐
QUARTILE DEVIATION
50%
Q1 Md Q3
Measures of Variability:
Proper Utility of Each
Measure
SD QD RANGE
Advantages • Provides a • Not influenced to • Easily
relatively accurate outliers understood
description of • Can be used in • Obtained
symmetrical skewed easily
distributions distributions
• Not influenced
by open-ended
scores
Disadvantages • Inaccurate for • Only considers • Easily
skewed the middle 50% influenced by
distributions and disregards extreme
the rest scores
SD QD RANGE
Scales • Interval • Ordinal • Nominal
• Ratio • Interval (special
• Ratio cases)
• Ordinal
• Interval
• Ratio
DOGS
Mean SQ = 96 CATS
SQ Range = 85-100 Mean SQ = 110
SD = 1.2 SQ Range = 20-130
SD = 24.3
68% of dist. = 𝝁 ± 1σ
95% of dist. = 𝝁 ± 2σ
99% of dist. = 𝝁 ± 2.5σ
▪ Mean
▪ Standard Deviation
▪ Verbal Interpretation
Sample 1
25
20 23.45
15
10
0
Group Individual
TYPE OF COMPETITION
Steps in Writing the Results
Sample Output
Based on the results of the descriptive statistics implemented,
the philosophy students that is subjected to individual
competition generally had higher scores in the logic quiz
administered. They have a mean score of 37.02 with a standard
deviation of 2.9. This average score can be verbally interpreted
as “High Passed”. On the other hand, those students subjected
to group competition only garnered an average score of 23.45
(SD = 3.2) which has a verbal interpretation of “Passed”
Another Sample