
Regression Analysis Handouts

The document provides information about regression analysis, including:
• Regression analysis investigates the relationship between dependent and independent variables and is used for prediction, time series modelling, and finding causal relationships.
• The key aspects of regression analysis are the regression line, the dependent and independent variables, and the residuals; it is used for prediction, modelling relationships, and testing hypotheses.
• Simple and multiple regression can each model linear or non-linear relationships between one or more independent variables and a dependent variable.

Uploaded by anderson
Copyright © All Rights Reserved

REGRESSION ANALYSIS

Introduction
REGRESSION ANALYSIS is a predictive modelling technique that investigates the relationship between a dependent variable and one or more independent variables.

This technique is used for forecasting, time series modelling, and investigating cause-and-effect relationships between variables.

Ex. The relationship between drunk driving and the number of road accidents by a driver.

Why do we need REGRESSION ANALYSIS?

Typically, a regression analysis is used for these purposes:

• Prediction of the target variable
• Modelling the relationships between the dependent and independent variables
• Testing of hypotheses

Benefits:
• It indicates the strength of the impact of multiple independent variables on a dependent variable.
• It indicates the significant relationships between the dependent variable and the independent variables.
These benefits help market researchers, data analysts, and data scientists evaluate candidate variables and eliminate weak ones, so that the best set of variables is used for building predictive models.
Regression Model
• The REGRESSION LINE is a single line that best fits the data.
• The DEPENDENT VARIABLE, or the EXPLAINED VARIABLE, is the variable to be predicted.
• The INDEPENDENT VARIABLE, or the EXPLANATORY VARIABLE, is a variable used to predict another variable.
• The RESIDUAL is the vertical distance between a data point and the regression line.


• The REGRESSION LINE represents the pattern of the data. Its slope gives the predicted change in ‘y’ when ‘x’ increases by one unit. That change is either an INCREASE or a DECREASE.
Types of Regression Analysis
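• SIMPLE regression has two variables: one independent and one dependent. It can be LINEAR or NON-LINEAR.
• MULTIPLE regression has more than two variables: two or more independent variables and one dependent variable. It can be LINEAR or NON-LINEAR.

• The graph of a LINEAR REGRESSION is a straight line, and its regression equation is of degree 1.
• The graph of a NON-LINEAR REGRESSION is not a straight line, and its regression equation is NOT A FIRST-DEGREE equation.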
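
As a brief illustration of the degree distinction, the two equations below contrast a first-degree form with a second-degree one. The symbols $b_0$, $b_1$, $b_2$ are assumed notation for fitted coefficients, and the quadratic is only one possible non-first-degree form, chosen here for illustration.

```latex
\begin{align*}
\hat{y} &= b_0 + b_1 x            && \text{(linear: first degree in } x\text{, graph is a straight line)} \\
\hat{y} &= b_0 + b_1 x + b_2 x^2  && \text{(non-linear: second degree in } x\text{, graph is a curve)}
\end{align*}
```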
Simple Linear Regression
• In statistics, it is a linear regression model with a single explanatory variable.
• It estimates the relationship between the independent and dependent variable using a STRAIGHT LINE.

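Simple Linear Regression Model

Estimated Regression Equation:
$\hat{y} = b_0 + b_1 x$

Slope for the Estimated Regression Equation:
$b_1 = \dfrac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sum (x_i - \bar{x})^2}$

y-intercept for the Estimated Regression Equation:
$b_0 = \bar{y} - b_1 \bar{x}$
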
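
The slope and y-intercept can be sketched in a few lines of Python. This is a minimal sketch, assuming the usual least-squares formulas: the slope is the sum of the products of the x and y deviations divided by the sum of the squared x deviations, and the intercept is the mean of y minus the slope times the mean of x. The function name `fit_simple_linear_regression` and the sample points are hypothetical, not part of the handout.

```python
from statistics import mean


def fit_simple_linear_regression(x, y):
    """Return (slope, intercept) of the least-squares line through the points."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")

    x_bar, y_bar = mean(x), mean(y)

    # Sum of products of deviations and sum of squared x deviations.
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    if s_xx == 0:
        raise ValueError("all x values are identical; the slope is undefined")

    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Illustrative points (not from the handout) that lie exactly on y = 1 + 2x.
slope, intercept = fit_simple_linear_regression([0, 1, 2, 3], [1, 3, 5, 7])
print(slope, intercept)
```

The check for identical x values is a design choice: without any spread in x the denominator is zero, so no line can be fitted.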
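• Positive Linear Relationship: the line slopes upward (y increases as x increases).
• Negative Linear Relationship: the line slopes downward (y decreases as x increases).
• No Relationship: the line is horizontal (y does not change as x changes).
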

Example:
REGRESSION MODEL EXAMPLE

Number of TV Ads (x)    Number of Houses Sold (y)
1    2
2    4
3    5
4    4
5    5

[Figure: scatter plot of the number of houses sold against the number of TV ads; both axes run from 0 to 6.]

1. Determine the mean of the independent (x) and dependent (y) variable values.
2. Subtract the mean of x from each x value.
3. Subtract the mean of y from each y value.
4. Square each difference between x and its mean (x̄). Also multiply each x difference by the corresponding y difference, and get the total of each column.
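
The steps above can be sketched as a small deviation table in Python. This is only an illustration using the handout's TV-ads data; the variable names are assumptions made for readability.

```python
# Data from the handout's example.
tv_ads = [1, 2, 3, 4, 5]        # x: number of TV ads
houses_sold = [2, 4, 5, 4, 5]   # y: number of houses sold

# Step 1: means of x and y.
x_bar = sum(tv_ads) / len(tv_ads)
y_bar = sum(houses_sold) / len(houses_sold)

# Steps 2 and 3: deviations of each value from its mean.
x_dev = [x - x_bar for x in tv_ads]
y_dev = [y - y_bar for y in houses_sold]

# Step 4: squared x deviations and products of the x and y deviations.
x_dev_sq = [dx ** 2 for dx in x_dev]
cross = [dx * dy for dx, dy in zip(x_dev, y_dev)]

sum_x_dev_sq = sum(x_dev_sq)
sum_cross = sum(cross)

print(f"{'x':>4} {'y':>4} {'x-xbar':>8} {'y-ybar':>8} "
      f"{'(x-xbar)^2':>11} {'(x-xbar)(y-ybar)':>17}")
for row in zip(tv_ads, houses_sold, x_dev, y_dev, x_dev_sq, cross):
    print("{:>4} {:>4} {:>8.1f} {:>8.1f} {:>11.1f} {:>17.1f}".format(*row))
print(f"Totals: sum (x-xbar)^2 = {sum_x_dev_sq:.1f}, "
      f"sum (x-xbar)(y-ybar) = {sum_cross:.1f}")
```

The two totals are the denominator and numerator of the slope, respectively.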

Slope for the Estimated Regression Equation:
$b_1 = \dfrac{6}{10} = 0.60$

y-intercept for the Estimated Regression Equation:
$b_0 = 4 - 0.60(3) = 2.2$

Estimated Regression Equation:
$\hat{y} = 2.2 + 0.60x$

[Figure: REGRESSION MODEL EXAMPLE plot; both axes run from 0 to 6.]

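
Once the equation is known, predictions and residuals follow directly. The sketch below assumes the estimates from the worked example (intercept 2.2, slope 0.60) and the handout's data; the function name `predict_houses_sold` is hypothetical.

```python
B0 = 2.2    # y-intercept from the worked example
B1 = 0.60   # slope from the worked example


def predict_houses_sold(tv_ads):
    """Predicted number of houses sold for a given number of TV ads."""
    return B0 + B1 * tv_ads


# Observed data from the handout: number of TV ads -> number of houses sold.
observed = {1: 2, 2: 4, 3: 5, 4: 4, 5: 5}

residuals = []
for x, y in observed.items():
    y_hat = predict_houses_sold(x)
    residuals.append(y - y_hat)   # vertical distance from the point to the line
    print(f"x={x}  y={y}  y_hat={y_hat:.1f}  residual={y - y_hat:+.1f}")

print(f"Sum of residuals: {sum(residuals):.1f}")
```

For a least-squares line the residuals sum to zero (up to floating-point rounding), which is a quick sanity check on the fitted coefficients.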
Coefficient of Determination
• The coefficient of determination or R-SQUARED tells us how well a regression line predicts or estimates actual values.

Relationship between SST, SSR, and SSE:
$SST = SSR + SSE$

The COEFFICIENT OF DETERMINATION, or R-SQUARED, is the ratio of the explained variation in y to the total variation in y, that is, $r^2 = SSR / SST$. It can take any value between 0 and 1. The closer the value is to 1, the better the explanatory power.

• SST stands for TOTAL SUM OF SQUARES
• SSR stands for SUM OF SQUARES DUE TO REGRESSION
• SSE stands for SUM OF SQUARES DUE TO ERROR
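
The three sums of squares can be sketched in Python as follows. This is a minimal sketch, assuming the standard definitions: SST sums the squared differences between each y and the mean of y, SSR sums the squared differences between each predicted value and the mean of y, and SSE sums the squared residuals. The function name `sums_of_squares` is hypothetical; the data and coefficients are those of the TV-ads example.

```python
def sums_of_squares(x, y, b0, b1):
    """Return (SST, SSR, SSE) for the line y_hat = b0 + b1 * x."""
    y_bar = sum(y) / len(y)
    y_hat = [b0 + b1 * xi for xi in x]

    sst = sum((yi - y_bar) ** 2 for yi in y)                # total variation
    ssr = sum((yh - y_bar) ** 2 for yh in y_hat)            # explained by the line
    sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))   # left unexplained
    return sst, ssr, sse


# TV-ads example: x = number of TV ads, y = number of houses sold,
# with the estimated equation y_hat = 2.2 + 0.60x.
sst, ssr, sse = sums_of_squares([1, 2, 3, 4, 5], [2, 4, 5, 4, 5], 2.2, 0.60)
r_squared = ssr / sst

print(f"SST = {sst:.1f}, SSR = {ssr:.1f}, SSE = {sse:.1f}")
print(f"r^2 = {r_squared:.2f}")
```

Note that SST = SSR + SSE holds only when the coefficients are the least-squares estimates for the same data; for an arbitrary line the three quantities need not add up.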
Example (continued)
1. Determine the mean of the dependent (y) variable values.
2. Subtract the mean of y from each y value. Take note that the total of these differences should always be equal to 0.
3. Square each difference and sum the results. This total is SST, which is 6 for this example.
4. Using the estimated regression equation that was solved earlier, $\hat{y} = 2.2 + 0.60x$, substitute x with its corresponding values. Subtracting the mean of y from each predicted value, squaring, and summing gives SSR, which is 3.6 for this example.

$r^2 = \dfrac{SSR}{SST} = \dfrac{3.6}{6} = 0.6$

• If $r^2 = 1$, it is a perfect fit.
• If $r^2$ approaches zero, then there is no relationship at all.
• If $r^2$ approaches 0.9, it is a pretty good fit.
Sample Exercises.

Show your solutions for each number and box your final answers.

1. The sales of a company (in million pesos) for each year are shown in the table below:

Year Sales
2007 9
2008 20
2009 39
2010 47
2011 54

a. Find the slope and the y-intercept of the estimated regression line.

b. Use the estimated regression equation to estimate the sales of the company in 2015.

c. Calculate the coefficient of determination.

2. The values of x and their corresponding values of y are shown in the table below:
x 0 1 2 3 4
y 3 4 5 7 6

a. Find the slope and y-intercept of the estimated regression line.

b. Estimate the value of y when x=13.

c. Calculate the coefficient of determination.
