CH.5.
CH.5.
Objectives
At the end of this chapter, the student is expected to:
Introduction
You have learned in the previous lesson’s measures of central tendencies and
variations. In this lesson we will study a form of distribution that described a symmetrical
plot in which the given sets of data are clustered mostly to the center of the mean and the
width of the distribution is the standard deviation. This distribution with mean and standard
deviation as parameters is called the Normal probability distribution, and its graphical
presentation is called Normal curve, where the total area under the curve is always equal to
100% or 1.
.
Definition 5-1
The Normal Probability Distribution is distribution described by a bell-shaped curve
because of its flared shape where the mean, median and mode are approximately equal and the
distribution clustered closely to the center. It is symmetrical and the areas are equal each to the
left and right.
Normal Distribution 88
If the mean, median and mode are equal, then skewness is zero, hence no skewness and
symmetrically distributed.
• heights of students
• Standardized test scores
• Income of employees
• size of materials
• errors in measurements
• blood pressure
• The mean, median, and mode are equal in values and plotted on a central point along the
horizontal axis.
• The scores can be divided into three standard score each to the left(negative Z-score ) and to
the right(positive Z-score)
• The curve is symmetric around the mean and is asymptotic to the horizontal axis; that is, the
curve extends indefinitely in both directions.
• The total area (or probability) under the normal curve is equal to 100% or 1; or 50% or 0.5 to
each side from the center
Normal Distribution 89
2. Positively skewed or spread out to the right is distribution where most values are clustered
around the left tail of the distribution while the right tail of the distribution is longer. If the
mean is higher than the mean and the median, then, the coefficient skewness of skewness is
positive.
3. Negatively skewed or spread out to the left is distribution where most values are clustered
around the left tail of the distribution while the left tail of the distribution is longer. If the
mean is lower than the mode and the median, then, the coefficient of skewness is negative.
Normal Distribution 90
5.3. Kurtosis
Kurtosis is a descriptive statistics that measures the degree to which the data
cluster in the tails. It measures the extent to which the distribution contains outliers, where an
outlier is a data point that differs significantly from other observations when the variability in
the measurements occur,. Kurtosis helps measure how data disperse between a distribution’s
center and tails. It identifies whether a distribution contains extreme values or depicts heavy
tails.
where:
̿ - Mean of samples
𝒙
xi -The ith sample
n - Total sample number
s - Standard Deviation of all samples
Normal Distribution 91
3. Leptokurtic (K> 3) The data is more than normal distribution
Find the coefficient of skewness and kurtosis from the data gathered as to the outputs
in the experimental study 5, 7, 14, 15, 20, 19, 18, 17, 16, 25.
Solution: To find the median, sort the data in ascending order, and find the middle item or
score:
Normal Distribution 92
∑𝑥 156
Solution: Solve for the mean : 𝑥̅ = = = 15.6.
𝑛 10
X X- 𝒙
̅ ̅)2
(x - 𝒙 ̅
X − 𝒙 ̅ 4
X − 𝒙
𝑆 ( )
𝑆
5 -10.6 112.36 -1.79 10.21
7 8.6 73.96 1.45 4.42
14 -1.6 2.56 -0.27 0.01
15 -0.6 0.36 -0.10 0.00
0.07 0.00
16 0.4 0.16
0.24 0.00
17 1.4 1.96
0.40 0.03
18 2.4 5.76
0.57 0.11
19 3.4 11.56 0.74 0.30
20 4.4 19.36 1.59 6.31
25 9.4 88.36
∑ x = 156 ∑(X-𝑋̅)2 = 316.4 ̅ 4
X− 𝒙
∑( ) = 21.39
n = 10 𝑆
̅)𝟐
∑(x − 𝒙 316.4
S=√ = √ , s =5.93
𝒏−𝟏 9
𝑠𝑘= 3(𝑀𝑒𝑎𝑛−𝑀𝑒𝑑𝑖𝑎𝑛)
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑖𝑡𝑖𝑜𝑛
𝑠𝑘= 3(15.6−16.5)
5.93
𝑠𝑘= −0.455 - negatively skewed
Solving for degree of Kurtosis
Normal Distribution 93
𝟏𝟎(𝟏𝟎+𝟏) 𝟑(𝟏𝟎−𝟏)𝟐
K= (𝟏𝟎−𝟏)(𝟏𝟎−𝟐)(𝟏𝟎−𝟑) (21.39) − (𝟏𝟎−𝟐)(𝟏𝟎−𝟑)
𝟏𝟎(𝟏𝟏) 𝟑(𝟖𝟏) 𝟏𝟎(𝟏𝟏) 𝟐𝟒𝟑
K= (𝟗)(𝟖)(𝟕) (21.39) − (𝟖)(𝟕) , K= (𝟗)(𝟖)(𝟕) (21.39) −
𝟓𝟔
𝟏𝟏𝟎
K= 𝟓𝟎𝟒 (21.39) − 4.34, K = (0.2183)(21.39) − 4.34
K = 0.33
*The coefficient of skewness (𝑠𝑘 ) has negative value and degree of kurtosis is greater lower
than 3(K<3), thus, data forms a negatively skewed and platykurtic.
To test whether the data follows a normal distributed, the Kolmogorov-Smirnov test
(K-S test) can be utilized. The complete details and information will be discussed in the
succeeding lessons. If the probability value (also known as significance) is higher than 0.05, the
data is normally distributed, otherwise, data is not normally distributed.
K-S Test,
Step 1: Open the Jamovi software and enter the raw score in the first column. Click the
variable A 2x(double click), and modify to given variable such as Scores, select
“continuous” for measure, integer(numeric)for data type, and then, click the arrow key
to go back to normal view.
Normal Distribution 94
Step 2: Click the “Exploration”, select “descriptive” , move the variable(score) to the right
or variable dialog box, and then, click “ Statistics” below, and Plots if charts are
needed in the presentation
Normal Distribution 95
Step 3: Check the boxes for the desired values of Statistics, such as mean, standard deviation,
skewness, kurtosis, and Shapiro-Wilk for test of normality of data
Step 4: Look up the results or output to the right side, which simultaneously displayed while
working on the selections of Statistics.
Normal Distribution 96
Solution no. 2: By Statistical Package for Social Sciences (SPSS).
Step 1: Open the SPSS in “variable view” and accomplish the information needed in the
as shown:
Step 2: Go to the “data view” enter the raw scores vertically to the scores’ column, and click
“Analyze” select, “Descriptive” , and “Explore”
Normal Distribution 97
Step 3. Move the variable or “Post test result” to the right or variable box, click “option”
and select “ descriptive “ for statistics, and “ continue”
Step 4: Click “Plots”, check the “Normality plots tests” , histogram and other desired values,
click “ continue” and “ both” and “OK”
Normal Distribution 98
Step 5: Just click “no” and close the notices page, and look for the values of coefficient of
skewness and kurtosis which are similar to the computed values in illustrative example 5-1.
The table shows that the P-value(significance) is 0.535 which is greater then 0.05
(P-value > 0.05), the data is normally distributed
Normal Distribution 99
5.4. Areas Under the Normal Curve
The table below is the corresponding area for each standard score(Z)
Z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.0 0.0000 0.0040 0.0080 0.0120 0.0160 0.0199 0.0239 0.0279 0.0319 0.0359
0.1 0.0398 0.0438 0.0478 0.0517 0.0557 0.0596 0.0636 0.0675 0.0714 0.0753
0.2 0.0793 0.0832 0.0871 0.0910 0.0948 0.0987 0.1026 0.1064 0.1103 0.1141
+0.3 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443 0.1480 0.1517
0.4 0.1554 0.1591 0.1628 0.1664 0.1700 0.1736 0.1772 0.1808 0.1844 0.1879
0.5 0.1915 0.1950 0.1985 0.2019 0.2054 0.2088 0.2123 0.2157 0.2190 0.2224
0.6 0.2257 0.2291 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486 0.2517 0.2549
0.7 0.2580 0.2611 0.2642 0.2673 0.2704 0.2734 0.2764 0.2794 0.2823 0.2852
0.8 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078 0.3106 0.3133
0.9 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.3340 0.3365 0.3389
1.0 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577 0.3599 0.3621
1.1 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790 0.3810 0.3830
1.2 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.3980 0.3997 0.4015
1.3 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147 0.4162 0.4177
1.4 0.4192 0.4207 0.4222 0.4236 0.4251 0.4265 0.4279 0.4292 0.4306 0.4319
1.5 0.4332 0.4345 0.4357 0.4370 0.4382 0.4394 0.4406 0.4418 0.4429 0.4441
1.6 0.4452 0.4463 0.4474 0.4484 0.4495 0.4505 0.4515 0.4525 0.4535 0.4545
1.8 0.4641 0.4649 0.4656 0.4664 0.4671 0.4678 0.4686 0.4693 0.4699 0.4706
1.9 0.4713 0.4719 0.4726 0.4732 0.4738 0.4744 0.4750 0.4756 0.4761 0.4767
2.0 0.4772 0.4778 0.4783 0.4788 0.4793 0.4798 0.4803 0.4808 0.4812 0.4817
2.1 0.4821 0.4826 0.4830 0.4834 0.4838 0.4842 0.4846 0.4850 0.4854 0.4857
2.2 0.4861 0.4864 0.4868 0.4871 0.4875 0.4878 0.4881 0.4884 0.4887 0.4890
2.3 0.4893 0.4896 0.4898 0.4901 0.4904 0.4906 0.4909 0.4911 0.4913 0.4916
2.4 0.4918 0.4920 0.4922 0.4925 0.4927 0.4929 0.4931 0.4932 0.4934 0.4936
2.5 0.4938 0.4940 0.4941 0.4943 0.4945 0.4946 0.4948 0.4949 0.4951 0.4952
2.6 0.4953 0.4955 0.4956 0.4957 0.4959 0.4960 0.4961 0.4962 0.4963 0.4964
2.7 0.4965 0.4966 0.4967 0.4968 0.4969 0.4970 0.4971 0.4972 0.4973 0.4974
2.8 0.4974 0.4975 0.4976 0.4977 0.4977 0.4978 0.4979 0.4979 0.4980 0.4981
2.9 0.4981 0.4982 0.4982 0.4983 0.4984 0.4984 0.4985 0.4985 0.4986 0.4986
3.0 0.4987 0.4987 0.4987 0.4988 0.4988 0.4989 0.4989 0.4989 0.4990 0.4990
Source: Socratic Q&A(2017), Area under normal curve: table for normal curve - Search
Images (bing.com)
Assuming the given value of Z = 0.178, what is the area in decimal and in %?
Solution:
Always round-off the z-score to 2 decimal digits (Z=0.18), The first number after
decimal or tenth value is located at the left side of the table, and the second or hundredth value
is on top, then get their intersection. The area is 0.0714or 7.14%
Solution: The negative sign indicates that the area is at the left side of the curve, since
normal curve is symmetrical, the area will be the same with that of the positive one.
Exercise 5-1
Finding the area under the normal curve around the center of mean.
If two Z scores are both positive, or both negative in values subtract the areas.
If two Z scores have different signs, add the areas in the ;eft and right of the mean.
Solution: If z is negative, the area is at the left side of the curve; if z is positive, the
area is at the right of curve.
• If the unknown is to the right of positive Z-value, Subtract its area from 50% or 0.5.
What is the area under the normal curve to the right of Z= 1.52 ( Z > 1.52)
Solution:
From the table of normal distribution, at Z=1.52, Area= 0.4357 or 43.57%,
Subtract the area from Z=0 to Z= 43.57% from 50%
• If the unknown is the area to the left of positive Z- value. Add 0.5 or 50%
What is the area under the normal curve to the left of Z= 1.52 or Z > 1.1?
Solution:
From the The area for Z = 1.52 , A = 43.57%.
Then, total area = 50% + 43.57% = 93.57% answer.
1. Find the area under the normal curve for Z > 1.91.
2. Find the area under the normal curve for Z < 1.91.
If the unknown area is to the left of negative Z-value, subtract its area from 0.5 or 50%
Find the area under the normal curve to the right of Z= −1.62 or Z > −1.62
Solution:
The area bounded by Z= −1.62 is A = 44.74% , subtract its area from 50%
Find the area under the normal curve to the right of Z= −1.62 or Z > −1.62
Solution:
Add 0.5 or 50% if the unknown is the area to the right of negative Z- value.
The area bounded by Z > −1.62 is A = 44.74% , Add its area from 50%
Exercise 5-4
1. z = 1.33
2. z = -2.81
3. From z = 2.0 to z = 1.02
4. From z = -1.02 to z = 3.0
5. From z = 1.09 to z = 2.1.0
6. z > 2.15
7. z < 2.15
8. z > - 3.3 9. z < - 1.11
10. To the right of Z = - 1.85
̅
𝐱− 𝒙 𝐱− µ
Z= , z=
𝐬 𝛔
Solution:
Given: N = 50 , Mean µ= 85 , σ = 3.25,
a. To find the total area in percent of scores 75 - 95, we work with 75-85, and 85-
90
x−µ̅ 75 – 85
𝑍1 = = = −2.22
σ 4.5
x−µ̅ 90 – 85
𝑍2 = = = 1.11
σ 4.5
If two Z values have different signs, add the areas to get the total area.
x−µ̅ 80 – 85
Z= = = −1.11 , From Table, Area = 0.3665 or 36.65%
σ 4.5
If the unknown area is to the left of negative Z-value, subtract its area from 0.5 or
50%:
𝐴𝑇 = 50% - 36.65% = 13.35%
n = 0.1335(50) = 6.68 0r 7
c. Above 80
Add 0.5 or 50% if the unknown is the area to the right of negative Z- value.
n = 0.8665(50) =43.3
Exercise 5-5
2. From Problem No. 1, how many students have a score of 20 and below?
Solving for the value or raw score (X) with a given area or percentage under the
curve.
To find the raw sore (x), we need to manipulate the formula from the Z-score:
X−𝑋̅
Z = , multiply each side by σ
σ
Z (σ) = X − 𝑋̅ , add 𝑋̅ t o each side
Z (σ)+(x̄ ) = X
Therefore: X = Z (𝛔)+(𝐱̄ )
X = 1.25 (12) + 80
X = 95 answer.
A gardener wishes to trim the plants in the garden and decided to select the middle
75% of the height . Assuming the heights are normally distributed, what are the height limits
if the mean height is 1.22m and standard deviation of 0.18 m?
Solution: 75% in the middle means 37.5% (0.375) each from left and right of curve.
𝑥̅ = 1.22 , σ = 0.18
From the table the nearest value corresponding the area of 37.5% or 0.375 is 0.3749
Determining the Z-scores and Areas Under the Normal Curve Using SPSS
Assuming the ages of respondents follow a normal distribution, find the individual
Z-values and the area under the normal using the SPSS software.
20, 19, 18, 17, 16, 25, 50, 60, 18, 48, 52, 43, 35, 55, 49
Solution;
Step 1: Open the SPSS software in variable view and enter any name description of the
variable in single word such as “age”, for type “numeric” and for the label :
“Ages of respondents” and complete other needed information.
Step 3; In the Descriptive dialog box, move the variable output to the right
Check ” Save standardized values….” and then, click “OK”
Close the blank page and click ”no” for notices.
Step 7: Enter the target variable in space provided in the upper left corner , such as,
“Area.from.left.tail” put separator in each word such as dot or underscore because it
does not allow any space for each word, enter “cdf.normal(Zoutputs,0,1), to the right
of the box, then click “OK”
*cdf means cumulative distribution function, and the “0 and 1” are the mean where
Z= 0, and a standard deviation of 1.
Step 8: Just close the output box by clicking “x’ and select” No” for save contents, then, Go
to variable view, and modify the decimal into “4” places, to get exact values similar to
the values in table of normal distribution. Look up the area from left tail of curve in the
“Data view”-
Step 10. Close the output page, click “no” for save contents and then, go to variable view and
modify the decimal “4”, then look up the area of each Z- value in the “Data view”
Find the individual Z-values and the area under the normal curve of each of the
following outputs in experimental study using SPSS.
Step 1: Open the SPSS software in variable view and enter any name description of the
variable in single word such as “outputs”, and complete other needed information.
Step 2: Go to data view and enter the raw scores in the first column, click ”Analyze” and
select “Descriptive Statics” and another ” Descriptive Statistics”
Step 4: Go back to variable view of SPSS, and modify decimal by “2” places, to get the same
values in normal distribution table.
Step 7: Enter the target variable in space in the upper left corner , such as, “Area.fr.left.tail”
each word must be separated by underscore or dot, because it does not allow spaces,
then enter “cdf.normal(Zoutputs,0,1), then click “OK”
*You may also move the variable from left then inside the parenthesis, and the next
Numbers, separated by comma.
*cdf means cumulative distribution function, and the “0 and 1” are the mean where
Z= 0, and a standard deviation of 1.
Step 9: To get the corresponding area of each Z-value from the center or mean(Z = 0),
Click again “Transform” and “Compute Variables” then modify the target variable to
“Area,fr,z=0”, and enter the desired formula to the right side, or move the variable to
the right and subtract 0.5, then click “OK”
Disregard the negative sign in the area because it only indicates the position of area bounded
by the z-score is at the left side of Z=0 or mean
6. The mean salary of employees in a certain city is is10,800 PHP every 15 days and the
standard deviation is 900 PHP., If the expenses are normally distributed, what is the
probability of getting a salary of more than 8,000 in 15 days?
7. The mean production of factory workers is 15 finished products per hour, with a
standard deviation of 1.5 products/hour. What is the probability of finishing: a. less
than 12.8 products/hour? b. more than 12 products/hour?
8. The mean weight of insects for laboratory experiments is 15 g, with the standard
deviation of 1.3 g. What is the probability of randomly selecting insects with weight
of : a. 12 g? b. 13 to 17.5 g?
9 For final line-up of a basketball team, the coach decided to select players based from
middle 70% of their height limits to be fair with other teams. Assuming the heights
are normally distributed, what are the height limits if the mean height is 1.75 meters
and standard deviation of 0.11 m?
10 The height of 100 plants in the garden follows a normal distribution with a mean of 90
cm. and a standard deviation of 8.5 cm. How many plants have a height of
b. higher than 75 cm
II. Find the Z-value and area bounded by the following series of scores in aptitude test
using Jamovi apps. or SPSS.
20, 90, 85, 50, 60, 70, 85, 50, 45, 30, 65, 88