Module 6 Correlation Analysis

Uploaded by

Rennel Mallari
Copyright
© All Rights Reserved
MODULE 6:

CORRELATION
ANALYSIS
Engr Ren
Correlation
• Is a linear association between random variables

• Correlation analysis shows us how to determine both the nature
and strength of the relationship between two variables
• Example: Is there a connection between the age at which a child
speaks its first sentences and later school success?

• The correlation coefficient lies between -1 and +1


Methods of Studying Correlation
• Scatterplot Diagram Method
• Karl Pearson's Coefficient of Correlation Method
• Spearman’s Rank Correlation Method
Scatterplot Diagram Method
• A scatterplot is a type of data display that shows the relationship
between two numerical variables. Each member of the dataset
gets plotted as a point whose x-y coordinates relate to its
values for the two variables.

• Scatterplots help us to understand the association between two
variables using:
• Trend
• Shape
• Strength
When the y variable tends to
increase as the x variable
increases, we say there is
a positive correlation between
the variables.
When the y variable tends to
decrease as the x variable
increases, we say there is
a negative
correlation between the
variables.
• When there is no clear relationship
between the two variables, we say
there is no correlation between the
two variables.
Example
• Construct a scatterplot using the data below concerning the number of roller
coasters in different countries and the amount each country has contributed to
tsunami aid (in millions). Describe the trend, shape and strength of this graph.
Country        Roller Coasters   Tsunami Aid (in millions)
Norway           7               139
Australia       18               193
Netherlands     36               156
New Zealand      3                37
Canada          51               176
Ireland          2                26
Germany        108               313
Sweden          19                86
Switzerland      3                29
Graph:

• The trend is increasing, the shape is roughly linear, and there is a fairly
strong, positive linear association between these variables. Countries with more
roller coasters tend to contribute more to tsunami aid.
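The strength of this association can also be checked numerically. The sketch below is only an illustration: it applies the Pearson coefficient (defined in the next section) in its deviation form to the table above. The function name `pearson_r` is hypothetical and not part of the module. On this data the result comes out near 0.88, which agrees with the "fairly strong, positive" reading of the graph.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson r from deviations about the means (hypothetical helper)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products of deviations (proportional to the covariance)
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (proportional to the variances)
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    s_yy = sum((y - mean_y) ** 2 for y in ys)
    return s_xy / sqrt(s_xx * s_yy)

# Data from the table, in the same country order
roller_coasters = [7, 18, 36, 3, 51, 2, 108, 19, 3]
tsunami_aid = [139, 193, 156, 37, 176, 26, 313, 86, 29]

r = pearson_r(roller_coasters, tsunami_aid)
print(round(r, 2))
```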
Pearson Coefficient of Correlation
• A measure of the strength of the linear relationship between two
variables, defined as the sample covariance of the variables
divided by the product of their standard deviations
• Represented by “r”
• r lies between -1 and +1
• The + and – signs indicate positive linear correlations and
negative linear correlations, respectively
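Written out, the definition above corresponds to the standard formula below. The second form, built from raw sums, is algebraically equivalent and is usually easier for hand computation. The symbols x, y and n (the number of paired observations) are notation assumed here, not taken from the slides.

```latex
r = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum (x_i - \bar{x})^2 \, \sum (y_i - \bar{y})^2}}
  = \frac{n \sum xy - \sum x \sum y}
         {\sqrt{\left[ n \sum x^2 - \left( \sum x \right)^2 \right]
                \left[ n \sum y^2 - \left( \sum y \right)^2 \right]}}
```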
Problem
• Gil, a Physics teacher, is interested in determining the relationship between the
number of hours students spent studying for the Physics final exam and the
final scores they obtained. He selected a random sample of eight students in a Physics
class. The scores obtained by these students are shown in the table. What is
Pearson’s coefficient of correlation?

Hours spent studying (X)        1   3   5   2   4   3   2   0
Physics Final Exam Score (Y)   70  79  90  77  85  81  75  64
Solution
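One way to work this problem is with the raw-sum form of Pearson's r. The sketch below is not the slide's own solution, which is not reproduced here; it simply computes the five sums from the table and combines them. Variable names are chosen for illustration.

```python
from math import sqrt

hours = [1, 3, 5, 2, 4, 3, 2, 0]            # X
scores = [70, 79, 90, 77, 85, 81, 75, 64]   # Y
n = len(hours)

sum_x = sum(hours)                                  # 20
sum_y = sum(scores)                                 # 621
sum_xy = sum(x * y for x, y in zip(hours, scores))  # 1644
sum_x2 = sum(x * x for x in hours)                  # 68
sum_y2 = sum(y * y for y in scores)                 # 48677

numerator = n * sum_xy - sum_x * sum_y              # 732
denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))

r = numerator / denominator
print(round(r, 3))  # 0.993
```

A value this close to +1 indicates a very strong positive linear relationship between hours spent studying and final exam score in this sample.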
Spearman’s Rank Coefficient
• A method for determining correlation when the data are not
available in numerical form, or as an alternative to Pearson's
method. The values of the two variables are converted to ranks
and the correlation is computed from those ranks, which is why
the method is known as rank correlation

$$\rho = 1 - \frac{6 \sum D^2}{n(n^2 - 1)}$$

Where n = number of participants/samples
D = difference between the two ranks of each participant/sample
Problem
• Calculate the Spearman’s rank correlation coefficient of the data in the
table:
X 10 6 9 12 8
Y 8 8 5 6 9
Solution
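The rank-difference calculation can be sketched as follows. This is an illustration under two stated assumptions: values are ranked from smallest (rank 1) to largest, and tied values (the two 8s in Y) share the average of the ranks they occupy. The helper names `average_ranks` and `spearman_rho` are hypothetical.

```python
def average_ranks(values):
    """Rank from smallest (1) to largest; ties get the mean of their ranks."""
    ordered = sorted(values)
    ranks = []
    for v in values:
        first = ordered.index(v) + 1   # first 1-based position of v
        count = ordered.count(v)       # how many times v occurs
        ranks.append(first + (count - 1) / 2)
    return ranks

def spearman_rho(xs, ys):
    """Spearman's rho from the simple rank-difference formula."""
    n = len(xs)
    rank_x = average_ranks(xs)
    rank_y = average_ranks(ys)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

x = [10, 6, 9, 12, 8]
y = [8, 8, 5, 6, 9]

print(average_ranks(x))             # [4.0, 1.0, 3.0, 5.0, 2.0]
print(average_ranks(y))             # [3.5, 3.5, 1.0, 2.0, 5.0]
print(round(spearman_rho(x, y), 3)) # -0.425
```

Here the squared rank differences sum to 28.5, so the simple formula gives 1 - 171/120 = -0.425, a moderate negative association. Because Y contains a tie, tie-corrected versions of the formula give a somewhat different value.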

SW2
Researchers interested in determining if there is a relationship
between death anxiety and religiosity conducted the following
study. Subjects completed a death anxiety scale (high score =
high anxiety) and also completed a checklist designed to
measure an individual's degree of religiosity (belief in a
particular religion, regular attendance at religious services,
number of times per week they regularly pray, etc.) (high score
= greater religiosity). A data sample is provided on the right.
(a) Construct a scatterplot using the data and describe the
trend, shape and strength of this graph.
(b) Determine their relationship using Pearson’s “r”.
(c) Determine their relationship using another tool, Spearman’s
rho.