Topic Three STA450 (Part 1)

This document discusses simple linear regression analysis. It defines the simple linear regression model as having one independent variable and one response variable related by a linear equation with an error term. Least squares estimation is used to estimate the model parameters by minimizing the sum of squared errors between the observed and predicted response values. The example shows calculating the correlation coefficient between number of sales calls and machines sold, plotting the scatter diagram, and using the least squares method to estimate the intercept and slope parameters of the linear regression model relating these two variables. The estimated regression equation is then interpreted to describe how machines sold change with the number of sales calls.

STA450: FUNDAMENTALS OF REGRESSION ANALYSIS

TOPIC THREE: SIMPLE LINEAR REGRESSION

3.1 General Concepts

Simple Linear Regression Model is a basic regression model where there is only one independent
variable and one response variable.

The simple linear regression model can be stated as follows:

Yi = β0 + β1 Xi + εi
where,
Yi = the value of the response variable in the ith trial
β0 and β1 = the parameters of the regression equation
Xi = a known constant, the value of the predictor variable in the ith trial
εi = the random error term, with mean E(εi) = 0 and variance σ²(εi) = σ²

Note:
1. The above regression model is said to be simple, linear in the parameters, and linear in
the independent variable.
"simple" – only one predictor variable.
"linear in the parameters" – no parameter appears as an exponent or is multiplied or
divided by another parameter.
"linear in the predictor variable" – the predictor variable appears only to the first power.
2. A model that is linear in the parameters and in the independent variable is also called a
first-order model.
3. The above model is subject to the following conditions.
i. The relationship between X and Y must be linear.
ii. The error variable must be normally distributed
iii. The error variance must be constant
iv. The errors must be independent
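These four conditions can be made concrete with a short simulation. The sketch below is not part of the original notes; the parameter values (β0 = 10, β1 = 1.2, σ = 3) are invented purely for illustration. It generates data from a first-order model whose errors are independent, normally distributed, and of constant variance:

```python
import random
import statistics

# Hypothetical illustration: simulate data satisfying the model conditions.
random.seed(1)  # arbitrary seed, for reproducibility only

beta0, beta1, sigma = 10.0, 1.2, 3.0                # invented parameter values
x = [random.uniform(25, 50) for _ in range(200)]    # predictor values (known constants)
eps = [random.gauss(0, sigma) for _ in range(200)]  # normal, constant variance, independent
y = [beta0 + beta1 * xi + ei for xi, ei in zip(x, eps)]  # Yi = beta0 + beta1*Xi + ei

# The sample mean of the errors should be close to E(ei) = 0.
print(round(statistics.mean(eps), 2))
```

Fitting a regression to simulated data like this and recovering values near β0 and β1 is a useful sanity check of any least squares calculation.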

3.2 Least Square Estimation of the parameters

As we recall the model for simple linear regression:


Yi = β0 + β1 Xi + εi
• β0 is known as the intercept.
If x = 0 is within the range of the data, then β0 is the mean of the distribution of the
response Y when x = 0.
If x = 0 is not within the range, then β0 has no practical interpretation.
• β1 is known as the slope: the change in the mean of the distribution of the response
produced by a unit change in x.
• εi is the random error term.


Example:

A study was undertaken to examine the number of machines sold (Y) per month by a
sales representative and its relationship to the number of sales calls (X) made in a month
for a random sample of ten representatives. The data obtained is shown in the following
table.

Sales Representative Number of calls Number of machines sold


1 50 70
2 35 50
3 40 45
4 50 60
5 40 55
6 50 65
7 30 40
8 35 50
9 25 30
10 50 60

Correlation coefficient

∑x = 405   ∑y = 525   ∑xy = 22200
∑x² = 17175   ∑y² = 28875   n = 10

r = SSXY / √(SSXX · SSYY)

where

SSXY = ∑xy - (∑x)(∑y)/n = 22200 - (405)(525)/10 = 937.5
SSXX = ∑x² - (∑x)²/n = 17175 - (405)²/10 = 772.5
SSYY = ∑y² - (∑y)²/n = 28875 - (525)²/10 = 1312.5

r = 937.5 / √(772.5 × 1312.5) = 0.931

From the above calculation, we have established that the relationship between the number
of calls made and the number of machines sold is linear, and the correlation coefficient
indicates that this relationship is strong. The next step is to construct the model. Hence,
we need to estimate the parameters of the model.
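As a quick check, the value r = 0.931 can be reproduced from the raw data in the table. This sketch is ours, not part of the notes:

```python
import math

# Raw data from the table: number of calls (x) and machines sold (y).
x = [50, 35, 40, 50, 40, 50, 30, 35, 25, 50]
y = [70, 50, 45, 60, 55, 65, 40, 50, 30, 60]
n = len(x)

# Corrected sums of squares and cross-products.
ss_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n  # 937.5
ss_xx = sum(a * a for a in x) - sum(x) ** 2 / n                 # 772.5
ss_yy = sum(b * b for b in y) - sum(y) ** 2 / n                 # 1312.5

r = ss_xy / math.sqrt(ss_xx * ss_yy)
print(round(r, 3))  # 0.931
```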


The scatter diagram for the above data is shown below. When we fit a regression line, we
want the line of best fit. That is, we want a line that is as close as possible to the actual
data.

Best fitted regression line (estimated regression line):

Ŷi = β̂0 + β̂1 X   or   Ŷi = b0 + b1 X

The double-sided arrows in the diagram show the distance between the actual data values
and the regression line, which is the error, ei. We want to minimize this error.

ei = Yi - Ŷi

where Yi is the actual or observed value and Ŷi is the estimated value of Y from the
regression equation. Since we want to minimize the errors, we take the sum of the squared
errors and then differentiate the function:

∑ei² = ∑(Yi - Ŷi)² = ∑(Yi - β̂0 - β̂1 Xi)²

The objective of the least squares method is to find the estimate β̂0 of β0 and the
estimate β̂1 of β1 for which ∑ei² is minimum.
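The "minimum" claim can be illustrated numerically: the least squares estimates (computed here with the closed-form expressions derived next in the notes) give a smaller ∑ei² than any nearby perturbed line. The helper name `sse_of` is our own:

```python
# Illustrating that the least squares line minimizes the sum of squared errors.
x = [50, 35, 40, 50, 40, 50, 30, 35, 25, 50]
y = [70, 50, 45, 60, 55, 65, 40, 50, 30, 60]

def sse_of(b0, b1):
    """Sum of squared errors for the line y = b0 + b1*x."""
    return sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))

b1_hat = 937.5 / 772.5           # slope from the closed-form solution
b0_hat = 52.5 - b1_hat * 40.5    # intercept: mean(y) - b1 * mean(x)

best = sse_of(b0_hat, b1_hat)
# Perturbing either coefficient can only increase the objective.
print(best < sse_of(b0_hat + 0.5, b1_hat))   # True
print(best < sse_of(b0_hat, b1_hat + 0.05))  # True
```

Because the objective is a quadratic in (b0, b1) with a unique minimum, any perturbation strictly increases ∑ei².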


The formulas for the parameters of the regression line are found by differentiating the
above function and equating the derivatives to zero. The formulas are as follows:

β̂1 = SSXY / SSXX = (∑xy - (∑x)(∑y)/n) / (∑x² - (∑x)²/n)

β̂0 = Ȳ - β̂1 X̄

Back to the example:

∑x = 405   ∑y = 525   ∑xy = 22200   ∑x² = 17175   ∑y² = 28875   n = 10

To compute β̂1:

β̂1 = SSXY / SSXX = (22200 - (405)(525)/10) / (17175 - (405)²/10) = 937.5 / 772.5 = 1.2136

To compute β̂0:

β̂0 = Ȳ - β̂1 X̄ = ∑y/n - β̂1(∑x/n) = 525/10 - 1.2136(405/10) = 52.5 - 1.2136(40.5) = 3.3492

Thus, the estimated regression function (Ŷi = β̂0 + β̂1 X) is:

Ŷi = 3.3492 + 1.2136X

Interpretation of the regression function:


For every one-unit increase in the number of sales calls, the number of machines sold
increases by 1.2136 units, on average.
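The two estimates can be reproduced from the summary statistics alone. A small sketch (variable names are ours); note that carrying the slope at full precision gives an intercept of about 3.3495, while the notes round the slope to 1.2136 first and obtain 3.3492:

```python
# Recomputing the least squares estimates from the summary statistics.
n = 10
sum_x, sum_y, sum_xy, sum_x2 = 405, 525, 22200, 17175

ss_xy = sum_xy - sum_x * sum_y / n   # 937.5
ss_xx = sum_x2 - sum_x ** 2 / n      # 772.5

b1 = ss_xy / ss_xx                   # slope estimate
b0 = sum_y / n - b1 * sum_x / n      # intercept estimate

print(round(b1, 4))  # 1.2136
print(round(b0, 4))  # 3.3495 (3.3492 in the notes, from rounding b1 first)
```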


The standard error of the estimate

This measures the deviation of the observations from the regression line. If all the
points on the scatter diagram fell on the regression line, then se would be zero, giving
us a perfect forecast; this never occurs in reality. The standard error of the estimate is
actually a measure of the variability of the error terms. The smaller the value of se, the
better the model. We can compare the value of se with the mean of the dependent
variable: if se is less than 10% of the mean, the model can be considered good.
So how do we measure se? There are two ways of getting this value.

Method 1
As stated above the standard error of the estimate measures the deviation of the data
from the regression line. This is a measure of error.

ei = Yi - Ŷi

The sum of the ei will always be zero. Hence we take the sum of the squared deviations,
∑ei². This is also known as the sum of squares error, SSE. This value is calculated
as shown in the following table (refer to the example of number of calls and number of
machines sold).

X     Y     Ŷ          ei = Yi - Ŷi     ei²
50 70 64.0291 5.9709 35.6513
35 50 45.8252 4.1748 17.4286
40 45 51.8932 -6.8932 47.5163
50 60 64.0291 -4.0291 16.2339
40 55 51.8932 3.1068 9.6522
50 65 64.0291 0.9709 0.9426
30 40 39.7573 0.2427 0.0589
35 50 45.8252 4.1748 17.4286
25 30 33.6893 -3.6893 13.6111
50 60 64.0291 -4.0291 16.2339
∑ei² = 174.7573

SSE = ∑ei² = 174.7573

se = √(SSE/(n - 2)) = √(174.7573/(10 - 2)) = 4.6738

Mean of Y = 52.5
A good model should have se less than 10% of the mean of Y.
10% of 52.5 = 5.25
Thus, our model can be said to be good, since se is less than 10% of the mean of Y (4.6738 < 5.25).
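Method 1 can be verified in a few lines by recomputing the fitted values, residuals, SSE, and se from the raw data. Using unrounded coefficients reproduces SSE = 174.7573:

```python
import math

# Method 1 check: residuals from the fitted line, SSE, and se.
x = [50, 35, 40, 50, 40, 50, 30, 35, 25, 50]
y = [70, 50, 45, 60, 55, 65, 40, 50, 30, 60]
n = len(x)

b1 = 937.5 / 772.5                 # unrounded slope
b0 = sum(y) / n - b1 * sum(x) / n  # unrounded intercept

residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]
sse = sum(e * e for e in residuals)

se = math.sqrt(sse / (n - 2))
print(round(sse, 4))  # 174.7573
print(round(se, 4))   # 4.6738
```

The residuals also sum to zero (up to floating-point error), as claimed above.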


Method 2

We can also work out SSE by using the formula method.

SSE = SSYY - SSXY²/SSXX = SSYY - β̂1 SSXY

SSYY = ∑y² - (∑y)²/n = 28875 - (525)²/10 = 1312.5

SSXY = 937.5

β̂1 = 1.2136

SSE = SSYY - β̂1 SSXY = 1312.5 - 1.2136(937.5) = 174.75

se = √(SSE/(n - 2)) = √(174.75/(10 - 2)) = 4.6738

Class Exercise:
The director of admissions of a small college administered a newly designed entrance test
to 20 students selected at random from the new freshman class, in a study to determine
whether a student's grade point average (GPA) at the end of the freshman year (Y) can be
predicted from the entrance test score (X). The results of the study are as follows.
i 1 2 3 4 5 6 7 8 9 10
Xi 5.5 4.8 4.7 3.9 4.5 6.2 6.0 5.2 4.7 4.3
Yi 3.1 2.3 3.0 1.9 2.5 3.7 3.4 2.6 2.8 1.6

i 11 12 13 14 15 16 17 18 19 20
Xi 4.9 5.4 5.0 6.3 4.6 4.3 5.0 5.9 4.1 4.7
Yi 2.0 2.9 2.3 3.2 1.8 1.4 2.0 3.8 2.2 1.5

a. Obtain the least square estimates of β0 and β1, and state the estimated regression
function.
b. What is the point estimate of the change in the mean response when the entrance test
score increases by one point?
