MULTIPLE LINEAR REGRESSION
JUST A LITTLE POKE
GERÓNIMO MALDONADO-MARTÍNEZ, RPT, MPH, PHD(C)
DIANA M. FERNÁNDEZ-SANTOS, MS, EDD
Sir Francis Galton
Widely promoted regression techniques.
Cousin of Charles Darwin.
Making Sense of Regression
My emphasis here is on understanding the key elements of regression:
Requirements
Application
Limitations
Regression Is a Powerful Analytical Technique
Enables researchers to do two things:
1. Determine the strength of the relationship
The r-squared value
Regression Is a Powerful Analytical Technique
2. Determine the impact of the independent
variable(s) on the dependent variable
The regression coefficient is the predicted
change in the dependent variable for every one
unit of change in the independent variable
Collectively, the regression coefficients enable
researchers to estimate how the dependent
variable will change under different scenarios
for the independent variables
Assumptions
Variables are normally distributed.
Variables are continuous in nature.
Assumption of a linear relationship between the
independent and dependent variables.
Assumption of homoscedasticity (constant error
variance across levels of the independent variables).
A quick diagnostic sketch follows below.
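A minimal diagnostic sketch in Python (the data, variable names, and choice of tests here are illustrative assumptions, not from the slides): it fits a simple model, then checks residual normality and homoscedasticity.

```python
import numpy as np
from scipy import stats
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan

# Fabricated data for illustration only
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 100)
y = 2.0 + 0.5 * x + rng.normal(0, 1, 100)

X = sm.add_constant(x)            # add the intercept column
resid = sm.OLS(y, X).fit().resid  # residuals of the fitted line

# Normality of the residuals: Shapiro-Wilk test
stat, p_norm = stats.shapiro(resid)
print("Shapiro-Wilk p-value:", p_norm)

# Homoscedasticity: Breusch-Pagan test
lm_stat, p_homo, f_stat, f_p = het_breuschpagan(resid, X)
print("Breusch-Pagan p-value:", p_homo)
```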
Multiple Regression Equation
Y = a + b1X1 + b2X2 + ... + bkXk + e
Where:
Y = predicted value of the dependent variable
a = the constant or Y intercept (where the
regression line crosses the Y axis)
b1 ... bk = the regression coefficients
X1 ... Xk = the independent variables
e = error
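A small computational sketch of the equation (the intercept and coefficients below are hypothetical, not from the slides):

```python
def predict(a, coefficients, x_values):
    """Return Y-hat = a + b1*X1 + ... + bk*Xk for one observation."""
    return a + sum(b * x for b, x in zip(coefficients, x_values))

# Hypothetical example: intercept 1.5, two predictors
print(predict(1.5, [0.8, -0.3], [10, 4]))  # 1.5 + 8.0 - 1.2 = 8.3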
Theoretical Linear Model
Linear Regression
Model types (example: X = size of house, Y = cost of house)
Deterministic Model: an equation or set of equations
that allow us to fully determine the value of the dependent
variable from the values of the independent variables.
Probabilistic Model: a method used to capture the
randomness that is part of a real-life process.
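A brief sketch contrasting the two model types using the house example (all numbers are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
size = np.array([1200.0, 1500.0, 2000.0])  # hypothetical house sizes (sq ft)

# Deterministic model: cost (in $1,000s) is fully determined by size
cost_deterministic = 50 + 0.1 * size

# Probabilistic model: the same line plus random real-life scatter
cost_probabilistic = 50 + 0.1 * size + rng.normal(0, 10, size.shape)

print(cost_deterministic)   # always the same for a given size
print(cost_probabilistic)   # varies from house to house
```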
R-square And Its Companions
r = correlation coefficient (overall fit or measure of
association; also called Pearson's r, the Pearson
Product Moment Correlation coefficient, or the
zero-order coefficient).
r-square = proportion of the explained variance of the
dependent variable (also called the coefficient of
determination)
1 minus r-square = proportion of unexplained variance
in the dependent variable
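A quick sketch computing r, r-square, and 1 minus r-square (the data is fabricated for illustration):

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(size=50)
y = 0.6 * x + rng.normal(size=50)

r = np.corrcoef(x, y)[0, 1]  # Pearson's r (zero-order coefficient)
r2 = r ** 2                  # coefficient of determination
print(f"r = {r:.3f}, r-square = {r2:.3f}, unexplained = {1 - r2:.3f}")
```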
Dirty Interpretation
Example: Researchers look at GRE scores and academic
performance in graduate school as measured by grade point
average
The hypothesis is that people who have high GRE scores will
also have high GPAs
From an admissions committee perspective: the belief
that GRE scores are a good predictor of future academic
success and are, therefore, a good criterion for admission
decisions
The researchers report an r-squared of .2
GRE scores explain 20 percent of the variance in GPAs
This means that the remaining 80 percent of the variance
in GPA is left to other factors.
A quick example
Don't get lost
[Scatter plot: X axis = age of planes (5, 10, 20 years); Y axis = plane maintenance costs ($500, $1,000); a line marks the predicted values if the relationship were perfect.]
How It Is Applied
Analysts collect data over the past two years and
crunch it. The computer gives these results:
Y = 100 + .020X
The constant is 100:
If they do not fly at all, the computer estimates
there is still a cost of $100
The .020 is the regression coefficient:
This gets interpreted as: for every mile flown, there
is a $.02 change in maintenance costs.
How It Is Applied
Y = 100 + .020X
Interpreting the regression coefficient:
For every mile flown, maintenance costs
go up by 2 cents.
For every 100 miles flown, costs are $2
For every 1,000 miles, the costs are $20
For every 100,000 miles, the costs are
$2,000
Making Maintenance Cost Estimates
They can then solve the equation:
Assuming 100,000 miles will be flown, how much
will they need to budget for maintenance?
100,000 multiplied by .020 = $2,000
Y = 100 + $2,000 + error
The estimated maintenance cost:
$2,100 + error
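The same arithmetic as a short sketch in Python:

```python
# Slide's result: Y = 100 + .020X, with X = miles flown
a, b = 100, 0.020
miles = 100_000
budget = a + b * miles
print(f"Estimated maintenance: ${budget:,.0f} + error")  # $2,100 + error
```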
Practicality
Simple Regression: Another Example
Hypothesis: If schools have a higher
percentage of poor children, then they
will have lower test scores.
A regression analysis shows:
A regression coefficient of -.04
An r-squared value of .25
Even More
Interpretation?
Regression coefficient: For every one-point increase in
the percent of children in poverty within a school, the
average test score goes down by .04
R-squared: 25% of the variance in test scores is explained
by the percent of children in poverty in the school
Researchers will ask: what other factors might
explain differences in test scores in the schools?
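A simulation sketch of this example (the schools and scores are synthetic; the true slope reuses the slide's -.04, and the noise level is chosen so r-square lands near the slide's .25):

```python
import numpy as np

rng = np.random.default_rng(3)
poverty_pct = rng.uniform(0, 100, 200)  # % of children in poverty per school
score = 80 - 0.04 * poverty_pct + rng.normal(0, 2, 200)

slope, intercept = np.polyfit(poverty_pct, score, 1)
r = np.corrcoef(poverty_pct, score)[0, 1]
print(f"coefficient = {slope:.3f}, r-square = {r ** 2:.3f}")
```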
Multiple Regression Equation
Y = a + b1X1 + b2X2 + b3X3 + b4X4 + e
Y = dependent variable
X1 = independent variable 1, controlling for X2, X3, X4
X2 = independent variable 2, controlling for X1, X3, X4
X3 = independent variable 3, controlling for X1, X2, X4
X4 = independent variable 4, controlling for X1, X2, X3
Multiple Regression Equation
It has the same basic structure as simple
regression
Y is still the dependent variable
There is still a constant (a) and some
amount of error (e) that the computer
calculates
But there are more Xs to represent the
multiple independent variables
Multiple Regression:
An Example
Hypothesis: Income is a function of education
and seniority
We suggest that income (the dependent
variable) will increase as both education and
seniority increase (the two independent
variables)
Y (Income) = a + b1(education) + b2(seniority) +
error
Multiple Regression: Interpretation
Results:
Y= 6000 + 400X1 (education) + 200X2 (seniority)
R square = .67
First look at the R-square: this shows a strong
relationship, so the analysis can continue
Partial regression coefficients:
For every year of education, holding seniority
constant, income increases by $400.
For every year of seniority, holding education
constant, income increases by $200.
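A simulation sketch of how such partial coefficients are estimated (the data-generating numbers simply reuse the slide's coefficients; the sample size, ranges, and noise are assumptions):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(4)
n = 500
education = rng.uniform(8, 20, n)   # years of education (made up)
seniority = rng.uniform(0, 30, n)   # years of seniority (made up)
income = 6000 + 400 * education + 200 * seniority + rng.normal(0, 500, n)

X = sm.add_constant(np.column_stack([education, seniority]))
fit = sm.OLS(income, X).fit()
print(fit.params)    # roughly [6000, 400, 200]: the partial coefficients
print(fit.rsquared)  # proportion of variance explained
```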
Multiple Regression: Application
Estimate the income of someone who has
10 years of education and
5 years of seniority
We solve the regression equation:
Multiply the 10 years of education by the regression
coefficient of 400: equals 4,000
Multiply the 5 years of seniority by the regression coefficient
of 200: equals 1,000
Put it together with the constant and you have
Y=6000 + 400(10) + 200(5) + error
Y = $11,000 + error
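The same calculation as a one-off sketch:

```python
a, b_education, b_seniority = 6000, 400, 200
income_hat = a + b_education * 10 + b_seniority * 5
print(f"${income_hat:,} + error")  # $11,000 + error
```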
Demystifying the monster
[Diagram: statistics turn raw data into information.]
Multivariate regression pitfalls
Multicollinearity
Residual confounding
Overfitting
Multicollinearity
Multicollinearity arises when two variables that
measure the same or similar things (e.g., weight
and BMI) are both included in a multiple regression
model; they will, in effect, cancel each other out
and generally destroy your model.
VIF: values well above 1 (common rules of thumb: >5
or >10) are bad
Tolerance (= 1/VIF): low is bad
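A sketch of computing VIF with statsmodels (the weight/BMI data is fabricated to be nearly collinear; the /2.9 is just a crude stand-in for a fixed height squared):

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(5)
weight = rng.normal(70, 10, 200)              # kg (made up)
bmi = weight / 2.9 + rng.normal(0, 0.5, 200)  # nearly a function of weight

X = sm.add_constant(np.column_stack([weight, bmi]))
for i, name in enumerate(["const", "weight", "bmi"]):
    # VIFs for weight and bmi explode, flagging the collinearity
    print(name, round(variance_inflation_factor(X, i), 1))
```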
Residual Confounding
You cannot completely wipe out confounding simply
by adjusting for variables in multiple regression
unless variables are measured with zero error (which
is usually impossible).
Example: meat eating and mortality
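A simulation sketch of residual confounding (all variables and effect sizes are invented; meat eating has no true effect here, yet adjusting for a noisily measured confounder leaves a spurious coefficient):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(6)
n = 5000
health = rng.normal(size=n)                    # true confounder
meat = health + rng.normal(size=n)             # exposure driven by the confounder
mortality = 2.0 * health + rng.normal(size=n)  # meat has NO true effect

health_measured = health + rng.normal(size=n)  # confounder measured with error

X = sm.add_constant(np.column_stack([meat, health_measured]))
fit = sm.OLS(mortality, X).fit()
print(fit.params[1])  # coefficient on meat stays well away from 0
```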
A clean example in PRISM
A real linear regression output