MATH 101-Week 7-8 - Lesson 4.1 Correlation & Regression Analysis

This document outlines the objectives and methodologies for using correlation and linear regression in statistics to analyze data and make predictions. It explains key concepts such as independent and dependent variables, correlation coefficients, and hypothesis testing, including the formulation of null and alternative hypotheses. The document also provides examples and procedures for conducting statistical analyses, emphasizing the importance of these methods in decision-making.

Uploaded by

Kasten Estolas


OBJECTIVES:

At the end of this lesson, you must be able to:

1. Use the method of correlation and linear
regression to predict the value of a variable given
certain conditions.
2. Recognize the importance of correlation
analyses in making decisions.
INTRODUCTION
Statistics is a branch of Mathematics that examines and
investigates ways to process and analyze gathered data.
It provides procedures for data collection, presentation,
organization, and interpretation so that the results give
meaningful ideas that are useful to decision-makers.
INTRODUCTION
• Collection of data is the process of gathering
relevant information from the population.
• Organization of data is the systematic arrangement
of data into tables, graphs, or charts so that logical
and statistical conclusions can easily be derived
from the collected information.
INTRODUCTION
• Analysis of data refers to the process of deducing
relevant information from the given data so that
a numerical description can be formulated.
• Interpretation of data is all about deriving
conclusions from the data that have been analyzed.
It also involves making predictions and forecasts
about large groups based on data gathered from
small groups.
Two Fields of Statistics
1. Descriptive Statistics consists of the collection,
organization, summarization, and presentation of data.
Here, the statistician tries to describe a given situation,
that is, to tell something about a particular group of observations.
2. Inferential Statistics is the logical process of moving from
sample analysis to a generalized conclusion.
Here, the statistician tries to make inferences from samples
to populations. This area also makes use of the concept of
probability.
IMPORTANT TERMS
Population (N) - consists of all the members of the
group about which conclusions are to be drawn.
Sample (n) - a portion, or part, of the population of
interest selected for analysis.
IMPORTANT TERMS
Parameter
Numerical index describing a characteristic of a
population.
Statistic
Numerical index describing a characteristic of a
sample.
IMPORTANT TERMS
Constant
Characteristic of objects, people, or events that
does not vary.
Example: Boiling temperature of water in degrees centigrade
Variable
Characteristic of objects, people, or events that can
take on different values.
Example: Weight
Types of Variables
CORRELATION ANALYSIS
Independent Variable (x) - The variable used as the
basis of prediction; it usually goes on the x-axis.
Dependent Variable (y) - The dependent variable
(sometimes known as the responding variable) is what is
being studied and measured. It goes on the y-axis.
Example : Hours Studied Vs. Score on Exam
Dependent Variable: Score (Effect)
Independent Variable: Hours Studied (Cause)
CORRELATION ANALYSIS
Correlation Analysis is a method of statistical
evaluation used to study the strength of a relationship
between two, numerically measured, continuous
variables (e.g. height and weight).
If correlation is found between two variables it means
that when there is a systematic change in one variable,
there is also a systematic change in the other; the
variables alter together over a certain period of time.
CORRELATION ANALYSIS
If there is correlation found, depending upon the numerical
values measured, this can be either positive or negative.
Positive correlation exists if one variable
increases/decreases simultaneously with the other, i.e. the
high numerical values of one variable relate to the high
numerical values of the other.
Negative correlation exists if one variable decreases when
the other increases, i.e. the high numerical values of one
variable relate to the low numerical values of the other.
CORRELATION ANALYSIS
Two variables are positively correlated if the values of
the two variables both increase or both decrease.
Two variables are negatively correlated if the values of
one variable increase while the values of the other
decrease.
Two variables are not correlated, or have zero
correlation, if one variable neither increases nor
decreases while the other increases.
SCATTER PLOT/SCATTER DIAGRAM
A scatter plot is a graph of ordered pairs (x, y)
consisting of data from two data sets.
It is drawn so we can analyze whether the two
variables are related. If a correlation is found, it can
be either positive or negative, depending on the
numerical values measured.
SCATTER PLOT
The Correlation Coefficient (r)
The correlation coefficient (r) is a number that describes
how strong the linear relationship between two data sets is.
Correlation coefficients range from -1 (perfect negative
correlation) to 1 (perfect positive correlation). A
correlation coefficient close to zero indicates that the data
sets are most likely not linearly correlated (see Figure 1).
Pearson Product-Moment Correlation Formula (Pearson's r)

$$r = \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{\left[n\sum x^2 - (\sum x)^2\right]\left[n\sum y^2 - (\sum y)^2\right]}}$$
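As a rough illustration, Pearson's r can be computed from the raw sums (n, Σx, Σy, Σxy, Σx², Σy²) in a few lines of Python. This is only a sketch: the function name `pearson_r` and the sample data are hypothetical and are not the Algebra and Geometry scores of Example 1.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson's r computed from the raw sums of the paired data."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    if denominator == 0:
        raise ValueError("r is undefined when one variable is constant")
    return numerator / denominator

# Hypothetical paired data (assumed for illustration only).
hours_studied = [1, 2, 3, 4, 5]
exam_scores = [2, 4, 5, 4, 5]
r = pearson_r(hours_studied, exam_scores)
print(round(r, 4))
```

Keeping the six sums as separate variables mirrors the columns usually built by hand in a computation table (x, y, xy, x², y²).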
CORRELATION ANALYSIS

The table below gives the interpretation of the various
degrees of linear correlation (Blay, 2013).

Value of r            Interpretation
±0.80 to ±0.99        high correlation
±0.60 to ±0.79        moderately high correlation
±0.40 to ±0.59        moderate correlation
±0.20 to ±0.39        low correlation
±0.01 to ±0.19        negligible correlation
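The interpretation bands above can be turned into a small lookup, sketched here in Python. The function name `interpret_r` is hypothetical, and the labels for exactly ±1 ("perfect correlation") and exactly 0 ("no correlation") are assumptions, since the table itself only covers ±0.01 to ±0.99.

```python
def interpret_r(r):
    """Map a correlation coefficient to the verbal label in the lesson's table."""
    size = round(abs(r), 2)  # the table is stated to two decimal places
    if size > 1:
        raise ValueError("r must lie between -1 and 1")
    if size == 1:
        return "perfect correlation"      # assumed label, not in the table
    if size >= 0.80:
        return "high correlation"
    if size >= 0.60:
        return "moderately high correlation"
    if size >= 0.40:
        return "moderate correlation"
    if size >= 0.20:
        return "low correlation"
    if size >= 0.01:
        return "negligible correlation"
    return "no correlation"               # assumed label, not in the table

print(interpret_r(0.81))
print(interpret_r(-0.45))
```

The sign of r is ignored here on purpose: it gives the direction (positive or negative), while the table grades only the strength.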
Example 1
Is there a significant relationship between the two sets of
test scores in Algebra and Geometry of ten students?
Draw a scatter plot. Find the correlation coefficient (r) for
the data and discuss what you think it indicates.
Example 1: Solution

Interpretation: There is a positive correlation between the scores in
Algebra and the scores in Geometry; hence, as scores in Algebra
increase or decrease, scores in Geometry tend to increase or decrease with them.
Linear Regression/Regression Analysis
Regression analysis is a statistical tool used to show how
two or more variables are related to each other. If two
variables are observed to be related, it is helpful if we
can produce an equation to model the relationship.

If this relationship follows a linear pattern, the model is
a linear equation, or in statistics, a linear regression
equation.
Linear Regression/Regression Analysis
Three (3) major uses of regression analysis are:
1. determining the strength of predictors,
2. forecasting an effect, and
3. trend forecasting.
How to Find the Regression Equation
The simplest form of the regression equation, with one independent
variable and one dependent variable, is defined by the formula

$$y = a + bx$$

where: x – score on the independent variable (predictor)
y – estimated score on the dependent variable (criterion measure)
b – regression coefficient (slope)
a – constant (y-intercept)
Example:
Find the equation of the regression line for the data
in Example 1.
Solution:
We already calculated the values needed for each
formula when we found the correlation coefficient in
Example 1.
Substitute into the first formula to find the value of
the slope.
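Regression Equation
Predicted value of y:

$$y = a + bx$$

Slope (b):

$$b = \frac{n\sum xy - (\sum x)(\sum y)}{n\sum x^2 - (\sum x)^2}$$

y-intercept (a):

$$a = \frac{\sum y - b\sum x}{n}$$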
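A minimal Python sketch of the fitting step follows. It assumes the standard least-squares formulas b = (nΣxy − ΣxΣy) / (nΣx² − (Σx)²) and a = (Σy − bΣx) / n. The function name `fit_line` and the data are hypothetical, not the values of Example 1.

```python
def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    slope_denominator = n * sum_x2 - sum_x ** 2
    if slope_denominator == 0:
        raise ValueError("slope is undefined when all x values are equal")
    b = (n * sum_xy - sum_x * sum_y) / slope_denominator  # slope
    a = (sum_y - b * sum_x) / n                           # y-intercept
    return a, b

# Hypothetical paired data (assumed for illustration only).
a, b = fit_line([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(f"y = {a:.2f} + {b:.2f}x")
```

The slope reuses the same sums as Pearson's r, which is why the values already tabulated for the correlation coefficient can be substituted directly.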
Regression Analysis
Substitute into the second formula to find the value of a
(the y-intercept), using b = 0.80.

Substituting the values of a and b, the regression equation is

y = 3.64 + 0.80x
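Once the equation is known, prediction is a single substitution. The sketch below uses the lesson's coefficients (a = 3.64, b = 0.80). The function name `predict_y` and the input value 10 are hypothetical, chosen only to show the arithmetic.

```python
A = 3.64  # constant (y-intercept) from the lesson's regression equation
B = 0.80  # regression coefficient (slope) from the lesson's regression equation

def predict_y(x):
    """Predicted value of the dependent variable for a given x."""
    return A + B * x

# Hypothetical input: an x score of 10 gives 3.64 + 0.80 * 10.
predicted = predict_y(10)
print(round(predicted, 2))
```

Predictions are most trustworthy for x values inside the range of the data used to fit the line.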
Objectives:
At the end of this lesson, you must be able to:
1. formulate the null and alternative
hypotheses.
2. differentiate between the null hypothesis and
the alternative hypothesis.
3. perform the step-by-step procedure for
hypothesis testing.
Hypothesis Testing
Hypothesis testing is a statistical method used in making
statistical decisions using experimental data.

A hypothesis is basically an assumption that we make about
a population parameter.
Hypothesis Testing
There are two (2) types of statistical hypotheses:
a. Null Hypothesis, symbolized by H0, is a statistical
hypothesis that assumes that the
observation is due to a chance factor.

b. Alternative Hypothesis, symbolized by Ha,
states that there is a difference between two
population means (or parameters).
Two (2) types of hypothesis
A null hypothesis (H0) is a hypothesis that says there
is no statistically significant relationship between the two
variables. It is the one which the researcher usually
hopes to reject; it states that there is no significant
difference/relationship.

Example: There is no significant relationship between
the test scores in Algebra and Geometry.
Two (2) types of hypothesis
An alternative hypothesis (Ha) is one that states
there is a statistically significant relationship between
two variables. It challenges H0 and shows a
significant difference/relationship.

Example: There is a significant relationship between
the test scores in Algebra and Geometry.
Why do we need to test a hypothesis?
Hypothesis testing is an essential procedure in
statistics.

A hypothesis test evaluates two mutually exclusive
statements about a population to determine which
statement is best supported by the sample data. When
the data support the alternative hypothesis, we say
that the finding is statistically significant.
Level of Significance
The level of significance (α) is the threshold at which
we decide whether or not to reject the null
hypothesis.

The level of significance is the maximum probability of
committing a Type I error, that is, of rejecting a null
hypothesis that is actually true.

That is, P(Type I error) = α.

Level of Significance
The critical or rejection region is the range of
values of the test value that indicates that there is a
significant difference and that the null hypothesis
(H0) should be rejected.

The noncritical or nonrejection region is the range of
values of the test value that indicates that the
difference was probably due to chance and that the
null hypothesis (H0) should not be rejected.
One Tailed versus Two Tailed
A one-tailed test indicates that H0 should be rejected
when the test value is in the critical region on one side of
the mean.

In a two-tailed test, H0 should be rejected when
the test value is in either of the two critical regions.
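The difference between the two kinds of test can be sketched as a decision function in Python. The function name `reject_h0` and its `tail` argument are hypothetical. The critical values in the usage lines (2.306 two-tailed and 1.860 one-tailed, both at α = 0.05 with 8 degrees of freedom) are standard t-table entries, and the test values are made up for illustration.

```python
def reject_h0(t_value, t_critical, tail="two"):
    """Return True when the test value falls in the critical region."""
    if tail == "two":
        # Critical regions on both sides of the mean.
        return abs(t_value) >= t_critical
    if tail == "right":
        # Single critical region in the upper tail.
        return t_value >= t_critical
    if tail == "left":
        # Single critical region in the lower tail.
        return t_value <= -t_critical
    raise ValueError("tail must be 'two', 'right', or 'left'")

print(reject_h0(-2.5, 2.306, tail="two"))    # |t| exceeds the two-tailed cutoff
print(reject_h0(-2.5, 1.860, tail="right"))  # wrong side for a right-tailed test
print(reject_h0(2.0, 1.860, tail="right"))   # inside the right-tail region
```

Note that a one-tailed test uses a smaller critical value than a two-tailed test at the same α, because the whole rejection probability sits in one tail.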
Procedure in Testing a Hypothesis (t-test for correlation)
Step 1. Formulate the hypotheses (null and alternative).
H0: There is no significant relationship between the
scores in Algebra and Geometry.
Ha: There is a significant relationship between the
scores in Algebra and Geometry.

Step 2. Calculate the value of the correlation coefficient, r.

Step 3. Set the level of significance (α = 0.05).

Procedure in Testing a Hypothesis (t-test for correlation)
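Step 4. Calculate the value of t computed using the formula
below:

$$t = r\sqrt{\frac{n-2}{1-r^2}}$$

where r is the correlation coefficient and n is the number of paired observations.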
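As a sketch of Step 4, the t statistic for a correlation coefficient, t = r·√((n − 2) / (1 − r²)), can be computed in Python as follows. The function name `t_for_correlation` is hypothetical. The inputs r = 0.81 and n = 10 are the values the lesson uses for the Algebra and Geometry example.

```python
from math import sqrt

def t_for_correlation(r, n):
    """t statistic for testing whether a correlation coefficient differs from zero."""
    if n < 3:
        raise ValueError("need at least 3 pairs so that df = n - 2 is positive")
    if not -1 < r < 1:
        raise ValueError("r must be strictly between -1 and 1")
    return r * sqrt((n - 2) / (1 - r ** 2))

# Values from the lesson's example: r = 0.81 for ten students.
t_computed = t_for_correlation(0.81, 10)
print(round(t_computed, 4))  # close to the 3.906 reported in the lesson
```

The guard on r avoids a division by zero: when r is exactly ±1, the denominator 1 − r² vanishes and the statistic is undefined.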
Procedure in Testing a Hypothesis (t-test for correlation)
Step 5. Statistical decision (reject or do not reject)
Calculate the degrees of freedom (df) to find the value of
t-critical in the t-table of values: df = n - 2.
The degrees of freedom give the number of pieces of
independent information available for computing variability.
• df is calculated only from samples.
NOTE: If |t-computed| < t-critical, do not reject H0.
If |t-computed| ≥ t-critical, reject H0.

Step 6. Draw conclusions.

Example: Testing a Hypothesis
Let us test the hypothesis for Example 1 in Lesson 4.1. Is
there a significant relationship between the two sets of test
scores in Algebra and Geometry of ten students? Find the
correlation coefficient for the data and discuss what you
think it indicates.

For this problem r = 0.81; use this coefficient in testing the
hypothesis.
Example: Testing a Hypothesis
Step 1. State the null and alternative hypotheses.
H0: There is no significant relationship between the
scores in Algebra and Geometry.
Ha: There is a significant relationship between the
scores in Algebra and Geometry.

Step 2. Calculate the correlation coefficient (r).
r = 0.81
Example: Testing a Hypothesis
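Step 3. Level of significance: α = 0.05 (fixed before the test is carried out)

Step 4. Calculate the value of t computed, with r = 0.81 and n = 10.

$$t = r\sqrt{\frac{n-2}{1-r^2}} = 0.81\sqrt{\frac{10-2}{1-(0.81)^2}} \approx 3.906$$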

Example: Testing a Hypothesis
Step 5. Statistical Decision
With df = n - 2 = 10 - 2 = 8, the t-table of values gives
t-critical = 2.306 at the 0.05 level of significance (two-tailed).
Since t-computed = 3.906 > t-critical = 2.306,
Decision: Reject H0 and accept Ha.

Step 6. Conclusion
We can conclude that there is a significant, high positive
correlation between Algebra and Geometry scores.
Hence, students with higher scores in Algebra tend to have
higher scores in Geometry, and students with lower scores in
Algebra tend to have lower scores in Geometry.
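The six steps of this example can be strung together in a short Python sketch. It assumes a two-tailed test at α = 0.05. The function name `correlation_t_test` is hypothetical, and the small lookup table holds standard two-tailed t-table values for 1 to 10 degrees of freedom only.

```python
from math import sqrt

# Two-tailed critical t values at alpha = 0.05, keyed by degrees of freedom.
# Standard t-table entries; only df = 1..10 are included in this sketch.
T_CRITICAL_05 = {
    1: 12.706, 2: 4.303, 3: 3.182, 4: 2.776, 5: 2.571,
    6: 2.447, 7: 2.365, 8: 2.306, 9: 2.262, 10: 2.228,
}

def correlation_t_test(r, n):
    """Return (df, t_computed, t_critical, reject_h0) for a two-tailed test."""
    df = n - 2
    if df not in T_CRITICAL_05:
        raise ValueError("this sketch only tabulates df from 1 to 10")
    t_computed = r * sqrt(df / (1 - r ** 2))
    t_critical = T_CRITICAL_05[df]
    reject_h0 = abs(t_computed) >= t_critical
    return df, t_computed, t_critical, reject_h0

# Values from the lesson's example: r = 0.81 for ten students.
df, t_computed, t_critical, reject_h0 = correlation_t_test(0.81, 10)
decision = "reject H0" if reject_h0 else "do not reject H0"
print(f"df = {df}, t = {t_computed:.3f}, critical = {t_critical}, {decision}")
```

With r = 0.81 and n = 10 the computed t exceeds the critical value for 8 degrees of freedom, which matches the decision reached in Step 5.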
t-critical values
References

Prepared by:

Gracia T. Canlas, LPT, MAED

Instructor – MATH 101
Thank you for listening!
