Module 6 Content
Module 6 Content
I. INTRODUCTION
In the previous topic (Pearson’s Moment Correla:on), you are tasked to determine whether the
rela:onship between two variables is significant or not. Also, you were able to draw findings from
sta:s:cal result, contextualize conclusions and provide addi:onal perspec:ve based on the problem.
It was also men:oned that correla:on is not causa:on, however, it is not the case in a regression
analysis. Also, the purpose of regression analysis is to predict by looking at the trend of the regression
line based on the data.
II. OBJECTIVES
2.) For any specific value of the independent variable x, the value of the dependent
variable y must be normally distributed about the regression line.
3.) The standard devia:on of each of the dependent variables must be the same for
each value of the independent variable.
Extrapola+on, or making predic+ons beyond the bounds of the data, must be interpreted cau+ously.
For example, in 1979, some experts predicted that the United States would run out of oil by the year
2003. This predic:on was based on the current consump:on and on known oil reserves at that :me.
However, since then, the automobile industry has produced many new fuel-efficient vehicles. Also,
there are many as yet undiscovered oil fields. Finally, science may someday discover a way to run a car
on something as unlikely but as common as peanut oil. In addi:on, the price of a gallon of gasoline
was predicted to reach $10 a few years later. Fortunately, this has not come to pass. Remember that
when predic+ons are made, they are based on present condi:ons or on the premise that present
1|Page12
trends will con+nue. This assump:on may or may not prove true in the future (Bluman, 2009, pp.
556).
2|Page12
In Example 1, I will show you how to compute linear regression manually (however, you are not
required to compute manually). The purpose is for you to appreciate and understand deeply the
meaning of the best fit line and coefficient of determina:on (r²).
Example 1. The data below are the Mathema:cs and Physics scores of the students. Answer the
following ques:ons:
a. Given the Mathema:cs score alone, guess the Math score of we let another student take the
test.
b. What is the varia:on of the dependent variable (Physics scores)
c. How many percent of the varia:on of the dependent variable (DV) is explained by the
independent variable (IV) and what is the best fit line?
d. Is Mathema:cs score a significant predictor of the Physics scores?
e. What are the possible conclusions can you draw based on the sta:s:cal results?
3|Page12
How many percent of the variation of the DV
(physics score) is explained by the IV (Math
Score)?
4|Page12
No:ce that the graph of the best
fit line has the least
error/residual.
20 30 46 15 30 35 20
2.
1
5.
9
52
71.23 3
.
8
5.
52
6.1
5
28.73
6.1
5
2.
1
9
28.73
71.23
6.1 6.1
5 5
3
.
8
64 64
4 169 4 49
324
2.
1
5. 6.1
52 5
28.7
9
71.23 3
3 .
8
6.1
5
GOOD FIT!
5|Page12
It is easier to use the sta:s:cal soNware compared to manual computa:on, you may follow the
following steps using the Minitab soNware:
6|Page12
You can no:ce that
both manual
computa:on and
using the Minitab
soNware yield the
same result.
7|Page12
Below are the findings based on the Minitab
statistical results.
Note: No:ce that the slope in the best fit line is posi:ve. By compu:ng r using the Pearson’s
moment correla:on, it is indeed posi:ve. It is shown below. However, by looking at the slope
of the best fit line, you will know whether r is posi:ve or nega:ve.
Note: In item 2, it's worth no:ng that while some students excel in mathema:cs, they may
s:ll struggle to develop the cri:cal thinking and problem-solving skills essen:al for processing
and applying mathema:cal concepts to solve physics problem.
8|Page12
Example 2. These data were obtained for the years 1993 through 1998 and indicate the number of
fireworks (in millions) used and the related injuries. Predict the number of injuries if 100 million
fireworks are used during a given year. Use α = 0.05.
Step 3: Transfer the DV to responses and IV to con:nuous predictors. If the IV is nominal or ordinal
data, it must be transferred to categorical predictors.
9|Page12
Step 4: Interpret the results.
Analysis of Variance
Model Summary
Regression Equation
Findings:
Since the p-value is 0.296, the number of fireworks is not a significant predictor of related
injuries.
(1) Increasing the number of fireworks uses will not increase the number of injuries.
(2) Firework is not the cause of injuries, there may be other firecrackers that causes
injuries.
10 | P a g e 1 2
Statistical Result:
Model Summary
S R-sq R-sq(adj) R-sq(pred)
0.246829 88.67% 86.41% 66.49%
Regression Equation
Chlorophyll = -0.240 + 1.093 Soluble Protein
Findings:
(1) Since the p-value is 0.002 is less than 0.01. The soluble protein is a significant predictor of
amount of chlorophyll in plants.
(2) 86.67% of the varia:on of chlorophyll in plants is explained by the amount of soluble
protein.
(3) For every 1 unit increase in soluble protein, there is 1.093 unit increase in the chlorophyll
of plants.
Conclusion/Other perspective/s:
(1) The increase in soluble protein in plants leads to high chlorophyll content. Conversely,
low soluble protein in plants leads to low chlorophyll content.
(2) Ensuring plants have a high concentra:on of soluble protein is vital for promo:ng their
overall health and vitality.
(3) Insufficient soluble protein in plants can lead to diseases and, in severe cases, can result
in death.
11 | P a g e 1 2
IV. REFERENCES
Books
Abbop, M. L., (2017). Using Sta:s:cs In The Social And Health Sciences With Spss® And Excel®.
John Wiley & Sons, Inc
Bluman, A. G., (2009). Elementary Sta:s:cs: A Step by Step Approach (Eight Edi:on). McGraw-Hill
Ho, R., (2018). Understanding Sta:s:cs for the Social Sciences with IBM SPSS. Taylor & Francis
Group, LLC
Navidi, W. & Monk, B., (2019). Elementaty Sta:s:cs (Third Edi:on). McGraw-Hill EducaMon
hpp://eagri.org/eagri50/STAM101/pdf/pract07.pdf
hpps://www.cimt.org.uk/projects/mepres/alevel/fstats_ch7.pdf
Indoria, A. K., Sharma, K. L., Reddy, K. S., & Rao, C. S. (2017). Role of soil physical proper:es in soil
health management and crop produc:vity in rainfed systems-I: Soil physical constraints
and scope. Current science, 2405-2414.
hpps://www.kaggle.com/
hpps://sesricdiag.blob.core.windows.net/oicstatcom/TEXTBOOK-CORRELATION-AND-
REGRESSION-ANALYSIS-EGYPT-EN.pdf
hpps://www.cimt.org.uk/projects/mepres/alevel/stats_ch12.pdf
hpps://02402.compute.dtu.dk/enotes/solu:ons-chapter5.pdf
Utah State University. (2024). What is Iron Chlorosis and What Causes it? | Forestry | Extension.
Usu.edu. hpps://extension.usu.edu/forestry/trees-ci:es-towns/tree-care/causes-iron-
chlorosis#:~:text=The%20primary%20symptom%20of%20iron,as%20the%20plant%20ce
lls%20die.
12 | P a g e 1 2