0% found this document useful (0 votes)

27 views43 pages

Lecture 10 Correlation and Regression

Uploaded by

Senthilkumar Devaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views43 pages

Lecture 10 Correlation and Regression

Uploaded by

Senthilkumar Devaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 43

Correlation

Chapter 15
Correlation
• Sir Francis Galton (Uncle to
Darwin
– Development of behavioral statistics
– Father of Eugenics
– Science of fingerprints as unique
– Retrospective IQ of 200
– Drove himself mad just to prove you
could do it
– Invented the pocket
Defining Correlation

• Co-variation or co-relation between two

variables
• These variables change together
• Usually scale (interval or ratio) variables

• http://www.youtube.com/watch?v=ahp7QhbB8G4
Correlation Coefficient
• A statistic that quantifies a relation between
two variables
• Can be either positive or negative
• Falls between -1.00 and 1.00
• The value of the number (not the sign)
indicates the strength of the relation
Linear Correlation
Linear relationships Curvilinear relationships

Y Y

X X

Y Y

X X
 Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
Linear Correlation
Strong relationships Weak relationships

Y Y

X X

Y Y

X X
Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
Linear Correlation
No relationship

X
Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
Correlation

10
Positive Correlation
Association between variables such that high
scores on one variable tend to have high
scores on the other variable
A direct relation between the variables
Negative Correlation
Association between variables such that high
scores on one variable tend to have low
scores on the other variable
An inverse relation between the variables
A Perfect Positive Correlation
A Perfect Negative Correlation
What is “Linear”?
 Remember this:
 Y=mX+B?

B
What’s Slope?

A slope of 2 means that every 1-unit change in

X yields a 2-unit change in Y.
Simple linear regression

P=.22; not
significant

The linear regression model: intercept

Love of Math = 5 + .01*math SAT score
slope
Check Your Learning

• Which is stronger?
– A correlation of 0.25 or -0.74?
Misleading Correlations

• Something to think about

– There is a 0.91 correlation between ice cream
consumption and drowning deaths.
• Does eating ice cream cause drowning?
• Does grief cause us to eat more ice cream?
Correlation
Correlation is NOT
causation
-e.g., armspan and
height

21
The Limitations of Correlation

• Correlation is not causation.

– Invisible third variables

Three Possible
Causal
Explanations for a
Correlation
The Limitations of Correlation,
cont.
> Restricted Range.
A sample of boys and girls who performed in the
top 2% to 3% on standardized tests - a much
smaller range than the full population from which
the researchers could have drawn their sample.
> Restricted Range, cont.
If we only look at the older students between the
ages of 22 and 25, the strength of this correlation
is now far smaller, just 0.05.
The Limitations of Correlation,
cont.
> The effect of an outlier.
One individual who both studies and uses her cell
phone more than any other individual in the
sample changed the correlation from 0.14, a
negative correlation, to 0.39, a much stronger and
positive correlation!
The Pearson Correlation Coefficient

• A statistic that quantifies a linear relation

between two scale variables.
• Symbolized by the italic letter r when it is a
statistic based on sample data.
• Symbolized by the italic letter p “rho” when it
is a population parameter.
• Pearson correlation coefficient
–r
– Linear relationship

r
 [( X  M X )(Y  M Y )]
( SS X )( SSY )
Correlation Hypothesis
Testing
• Step 1. Identify the population, distribution, and
assumptions
• Step 2. State the null and research hypotheses.
• Step 3. Determine the characteristics of the
comparison distribution.
• Step 4. Determine the critical values.
• Step 5. Calculate the test statistic
• Step 6. Make a decision.
Always Start with a Scatterplot
Correlation and Psychometrics

• Psychometrics is used in the development of

tests and measures.
• Psychometricians use correlation to examine
two important aspects of the development of
measures—reliability and validity.
Reliability
• A reliable measure is one that is consistent.
• One particular type of reliability is test–retest
reliability.
• Correlation is used by psychometricians to help
professional sports teams assess the reliability of
athletic performance, such as how fast a pitcher
can throw a baseball.
Validity
• A valid measure is one that measures what
it was designed or intended to measure.
• Correlation is used to calculate validity,
often by correlating a new measure with
existing measures known to assess the
variable of interest.
• Correlation can also be used to establish
the validity of a personality test.
• Establishing validity is usually much
more difficult than establishing
reliability.
• Most magazines and newspapers never
examine the psychometric
properties of the quizzes
that they publish.
Partial Correlation
• A technique that quantifies the degree of
association between two variables after
statistically removing the association of a third
variable with both of those two variables.
• Allows us to quantify the relation between two
variables, controlling for the correlation of
each of these variables with a third related
variable.
> We can assess the correlation between
number of absences and exam grade, over
and above the correlation of percentage of
completed homework assignments with these
variables.
Partial Correlation
• A partial correlation is the relationship between two variables after
removing the overlap with a third variable completely from both variables.
In the diagram below, this would be the relationship between male literacy
(Y) and percentage living in cities (X2), after removing the influence of
gross domestic product (X1) on both literacy and percentage living in cities

In the calculation of the partial correlation

coefficient rYX2.X1, the area of interest is
section a, and the effects removed are
those in b, c, and d; partial correlation is
the relationship of X2 and Y after the
influence of X1 is completely removed from
both variables. When only the effect of X1
on X2 is removed, this is called a part
correlation; part correlation first removes
from X2 all variance which may be
accounted for by X1 (sections c and b),
then correlates the remaining unique
component of the X2 with the dependent
variable, Y
Statistical Control
• Using Multivariate Analysis
Statistical Control
• Using Multivariate Analysis
Simpson’s Paradox
• In each of these examples, the bivariate
analysis (cross-tabulation or correlation) gave
misleading results
• Introducing another variable gave a better
understanding of the data
– It even reversed the initial conclusions
Another Example
• A study of graduates’ salaries showed
negative association between economists’
starting salary and the level of the degree
– i.e. PhDs earned less than Masters degree
holders, who in turn earned less than those
with just a Bachelor’s degree
– Why?
• The data was split into three employment
sectors
– Teaching, government and private industry
– Each sector showed a positive relationship
– Employer type was confounded with degree
level

Operations Management Chapter 3 - Forecasting
100% (2)
Operations Management Chapter 3 - Forecasting
44 pages
Correlation Research Design - PRESENTASI
100% (1)
Correlation Research Design - PRESENTASI
62 pages
L6 - Biostatistics - Linear Regression and Correlation
No ratings yet
L6 - Biostatistics - Linear Regression and Correlation
121 pages
Correlation Coefficient
No ratings yet
Correlation Coefficient
14 pages
SAMPLING & SAMPLING DISTRIBUTION EDITED ch07
100% (1)
SAMPLING & SAMPLING DISTRIBUTION EDITED ch07
50 pages
202003241550009941rajeev Pandey Correlation Research
No ratings yet
202003241550009941rajeev Pandey Correlation Research
87 pages
Correlation Analysis
No ratings yet
Correlation Analysis
54 pages
Correlation
No ratings yet
Correlation
22 pages
Econometrics With R
No ratings yet
Econometrics With R
56 pages
Correlation
No ratings yet
Correlation
42 pages
L3 Correlation
No ratings yet
L3 Correlation
101 pages
Correlation Analysis
No ratings yet
Correlation Analysis
48 pages
Correlation
No ratings yet
Correlation
8 pages
Additional Textual Learning Material - B5
No ratings yet
Additional Textual Learning Material - B5
114 pages
4456 Et 4456 Et 04et
No ratings yet
4456 Et 4456 Et 04et
11 pages
Correlation Analysis and Its Types
No ratings yet
Correlation Analysis and Its Types
50 pages
Correlation Analysis PDF
No ratings yet
Correlation Analysis PDF
30 pages
Correlation DU Final
No ratings yet
Correlation DU Final
56 pages
12 - The Correlational Research Strategy Short
No ratings yet
12 - The Correlational Research Strategy Short
44 pages
Regression C
No ratings yet
Regression C
48 pages
Correlation Analysis
No ratings yet
Correlation Analysis
102 pages
May 8 2023
No ratings yet
May 8 2023
39 pages
Cce 68 D 4 CC 4
No ratings yet
Cce 68 D 4 CC 4
28 pages
Correlations
No ratings yet
Correlations
30 pages
Correlational Research
No ratings yet
Correlational Research
41 pages
Questions 161261
100% (2)
Questions 161261
3 pages
Correlation
No ratings yet
Correlation
27 pages
Prof. Dr. Moustapha Ibrahim Salem Mansourms@alexu - Edu.eg 01005857099
No ratings yet
Prof. Dr. Moustapha Ibrahim Salem Mansourms@alexu - Edu.eg 01005857099
34 pages
16.. Correlation Analysis - Michael
No ratings yet
16.. Correlation Analysis - Michael
25 pages
Online Class Etiquettes and Precautions For The Students
No ratings yet
Online Class Etiquettes and Precautions For The Students
49 pages
Microsoft PowerPoint Session 4 PDF
No ratings yet
Microsoft PowerPoint Session 4 PDF
86 pages
Inferential Statistics (Inferential Statistics (Correlation AND PARTIAL-Correlation)
No ratings yet
Inferential Statistics (Inferential Statistics (Correlation AND PARTIAL-Correlation)
28 pages
Correlation
No ratings yet
Correlation
35 pages
11 Correlation
No ratings yet
11 Correlation
28 pages
DMUU Assignment 1 - GroupC
No ratings yet
DMUU Assignment 1 - GroupC
4 pages
Chapter1
No ratings yet
Chapter1
55 pages
302 Assignment 1 COMPLETE
No ratings yet
302 Assignment 1 COMPLETE
8 pages
8 Correlation
No ratings yet
8 Correlation
22 pages
Regression and Correlation
No ratings yet
Regression and Correlation
19 pages
Lesson 11 Pearsons R
No ratings yet
Lesson 11 Pearsons R
12 pages
BS Module 2
No ratings yet
BS Module 2
7 pages
Lecture 10 - Correlation and Regression
No ratings yet
Lecture 10 - Correlation and Regression
26 pages
Correlation BMLT
No ratings yet
Correlation BMLT
5 pages
Lectures 5 6 - Correlation Analysis
No ratings yet
Lectures 5 6 - Correlation Analysis
29 pages
Research Paper
No ratings yet
Research Paper
20 pages
Chapter - Six
No ratings yet
Chapter - Six
8 pages
Mix Design - BMCT
No ratings yet
Mix Design - BMCT
56 pages
QTT Lec Correlations
No ratings yet
QTT Lec Correlations
33 pages
Correlation Coefficient
No ratings yet
Correlation Coefficient
8 pages
Unit 3-1
No ratings yet
Unit 3-1
12 pages
Correlation
No ratings yet
Correlation
20 pages
Correlation
No ratings yet
Correlation
20 pages
Wub Ante
No ratings yet
Wub Ante
8 pages
Correlation Notes
No ratings yet
Correlation Notes
8 pages
Presentation On: Correlation and Rank Correlation: Submitted To
100% (3)
Presentation On: Correlation and Rank Correlation: Submitted To
23 pages
Pearson Correlation Analysis
100% (1)
Pearson Correlation Analysis
26 pages
Correlation Analysis
100% (1)
Correlation Analysis
51 pages
Introduction To Correlationand Regression Analysis BY Farzad Javidanrad PDF
No ratings yet
Introduction To Correlationand Regression Analysis BY Farzad Javidanrad PDF
52 pages
Module 2 Unit 4
No ratings yet
Module 2 Unit 4
4 pages
Correlation
No ratings yet
Correlation
34 pages
Biostatistics Stat-301: WWW - Tuf.edu - PK
No ratings yet
Biostatistics Stat-301: WWW - Tuf.edu - PK
16 pages
Correlation: A Mutual Relationship or Connection Between Two or More Things
No ratings yet
Correlation: A Mutual Relationship or Connection Between Two or More Things
6 pages
PMC 500 Statistical Reasoning in Education: Correlation
No ratings yet
PMC 500 Statistical Reasoning in Education: Correlation
45 pages
Midterm Assessment #5: Answers Will Mean A Deduction of Points
No ratings yet
Midterm Assessment #5: Answers Will Mean A Deduction of Points
4 pages
2.1 Stats
No ratings yet
2.1 Stats
3 pages
Lecture10 Correlation
No ratings yet
Lecture10 Correlation
13 pages
Chapter 10
No ratings yet
Chapter 10
167 pages
Kruskal and Wallis 1952
No ratings yet
Kruskal and Wallis 1952
40 pages
CHAPTER THREE - SMEs
No ratings yet
CHAPTER THREE - SMEs
6 pages
Stock Watson 3U ExerciseSolutions Chapter10 Students
No ratings yet
Stock Watson 3U ExerciseSolutions Chapter10 Students
7 pages
TARGET-1000 (JEE ADV-2025) - MATHS - SPL Assignment - Statistics
No ratings yet
TARGET-1000 (JEE ADV-2025) - MATHS - SPL Assignment - Statistics
3 pages
Sta 250 2022 Session 2
No ratings yet
Sta 250 2022 Session 2
9 pages
Econometrics: Problem Set 1: Professor: Mauricio Sarrias
No ratings yet
Econometrics: Problem Set 1: Professor: Mauricio Sarrias
5 pages
A Bound Testing Analysis of Wagners Law in Nigeri
No ratings yet
A Bound Testing Analysis of Wagners Law in Nigeri
18 pages
Chi-Square Test & McNemar Test - D.Boduszek
No ratings yet
Chi-Square Test & McNemar Test - D.Boduszek
25 pages
Foundations of Statistical Inference
No ratings yet
Foundations of Statistical Inference
22 pages
Business Statistics
No ratings yet
Business Statistics
4 pages
final ap statistics qp 6人
No ratings yet
final ap statistics qp 6人
63 pages
Hypothesis For Math
No ratings yet
Hypothesis For Math
42 pages
T-Test-Assignment 3-Act
No ratings yet
T-Test-Assignment 3-Act
4 pages
Contoh: Analisis Bivariat
No ratings yet
Contoh: Analisis Bivariat
1 page
Practical No: 7 STATEMENT: Two Different Types of Drugs D: X X S N N
No ratings yet
Practical No: 7 STATEMENT: Two Different Types of Drugs D: X X S N N
2 pages
Random Variable Exercises
No ratings yet
Random Variable Exercises
5 pages
Gender Inequality A Case Study in Pakistan
No ratings yet
Gender Inequality A Case Study in Pakistan
11 pages
Mean Median Mode
No ratings yet
Mean Median Mode
4 pages
TH TH
No ratings yet
TH TH
1 page
Set C
No ratings yet
Set C
1 page
Effect of Service Quality On Customer Satisfaction (Case Study in Indomaret KM 30)
No ratings yet
Effect of Service Quality On Customer Satisfaction (Case Study in Indomaret KM 30)
7 pages
Econometrics: A Simple Introduction
From Everand
Econometrics: A Simple Introduction
K.H. Erickson
3.5/5 (5)
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)

Lecture 10 Correlation and Regression

Uploaded by

Lecture 10 Correlation and Regression

Uploaded by

Correlation

• Co-variation or co-relation between two

A slope of 2 means that every 1-unit change in

The linear regression model: intercept

• Something to think about

• Correlation is not causation.

• A statistic that quantifies a linear relation

• Psychometrics is used in the development of

In the calculation of the partial correlation

You might also like