Chapter 10
Chapter 10
a. Correlation Analysis
- its purpose is to measure the strength and direction of the linear association between two
variables
-used when we’re interested in describing how two variables change relative to each other
𝐸(𝑋𝑌) − 𝐸(𝑋)𝐸(𝑌)
𝜌=
√𝑉𝑎𝑟(𝑋)𝑉𝑎𝑟(𝑌)
A strong linear relationship doesn’t mean that X causes Y or Y causes X as it is possible that
other variables may have caused the change in both X and Y or due to a coincidence
a.2.a. Pearson’s r
- a point estimator of 𝜌
𝑌𝑖 = 𝛽𝑜 + 𝛽1 𝑋𝑖 + 𝜖1
Legend:
- Value of the response variable for the ith element
- Value of the explanatory for the ith element
- Regression coefficient that gives the Y-intercept of the regression line
It is the value of the mean of Y when X = 0
- Regression coefficient that gives the slope of the regression line
It gives the amount of change in the mean or expected value of Y for every unit
increase in the value of X
- Random error term for the ith element; independent; normally distributed w/ mean
0 and variance
Since 𝜖𝑖 ~𝑁𝑜𝑟𝑚𝑎𝑙 (0, 𝜎 2 ), it follows that the 𝑌𝑖′ 𝑠 follow a normal distribution where the
expected value of 𝑌𝑖 is 𝛽𝑜 + 𝛽1 𝑥 and the variance is still 𝜎 2 . Thus we have 𝑌𝑖 ~𝑁𝑜𝑟𝑚𝑎𝑙 (𝛽𝑜 +
𝛽1 𝑥, 𝜎 2 ), 𝑌𝑖′ 𝑠 independent
𝜖𝑖 = 𝑌𝑖 − 𝐸(𝑌𝑖 ) = 𝑌𝑖 − (𝛽𝑜 + 𝛽1 𝑋𝑖 )
∑ 𝜀𝑖 2
𝑖=1
𝑌̂ = 𝑏𝑜 + 𝑏1 𝑋
Legend:
- Estimated value of Y
- Are the estimates for 𝛽𝑜 and 𝛽1