0% found this document useful (0 votes)

26 views16 pages

Session 18 Regression

Uploaded by

gautamchandan25

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views16 pages

Session 18 Regression

Uploaded by

gautamchandan25

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Department of AI & DS

COURSE NAME: DATA SCIENCE & STATISTICS

COURSE CODE: 23MT2013

Topic
REGRESSION

Session - 18
AIM OF THE
SESSION
To familiarize students with the concept of regression analysis

INSTRUCTIONAL
OBJECTIVES

This Session is designed to:

1. Demonstrate Linear regression
2. Describe Linear and Non linear regression in real life applications
3. List out the two lines of regression

LEARNING OUTCOMES

At the end of this session, you should be able to:

1. Define liner regression
2. Describe the method of least squares to fit a linear and non linear association between two variables
3. Summarize the difference between linear and non linear regression.
SESSION INTRODUCTION
CONTENTS

Linear Regression

Nonlinear Regression
Regression analysis
A reasonable form of a relationship between the dependent variable and the regressors x is the linear relationship Y=α+βx

Where, α is the intercept and β is the slope.

If the relationship is exact, then it is a deterministic relationship between the two variables. However, in the examples
listed above, as well as countless other scientific and engineering phenomena, the relationship is not deterministic and there
will be random component in it. The concept of regression analysis deals with finding the best relationship between Y and
x, and using methods that allow for prediction of the response values for given values of the regressor x.

In many applications there will be more than one regressor. For example, in the case where the dependent variable is the
price of house, one would expect the age of the house to contribute to the explanation of the price so in this case the
multiple regression structure might be written

Y=α+β1X1+β2X2

Where Y is price, X1 is square footage and X2 is age in years. The resulting analysis is termed as multiple regressions while
the analysis of the single regressor case is called simple regression.
Regression analysis

Simple Linear regression model: The dependent variable Y is related to the independent variable x through the
equation

Y=α+βx+ε

Where α and β are unknown intercept and slope parameters respectively, and ε is a random variable that is assumed to
be distributed with E(ε)=0 and Var(ε)=σ2. Since ε is random the quantity Y is a random variable. The value x of the
regressor variable is not random and measured with negligible error. Ε is called random error or random
disturbance, has constant variance. E(ε)=0 implies that at a specific x and y values are distributed around the true or
population regression line Y=α+βx.
Regression analysis

The method of least squares: An aspect of regression analysis is to estimate the parameters α and β. We denote the
estimates a for α and b for β. Then the estimated or fitted regression line is given by

where is the predicted or fitted value. We expect that the fitted line should be closer to the true regression line. When a
large amount of data is available.

Residual: A residual is essentially an error in the fit of the model

Given a set of regression data {(xi, yi), i=1,2,...,n} and a fitted model

, the ith residual εi is given by εi=yi-, i=1,2,...,n.

ACTIVITIES/ CASE STUDIES/ IMPORTANT FACTS RELATED TO THE
SESSION
We shall find a and b, the estimates of α and β, so that the sum of the squares of the residuals is a minimum. The residual
sum of squares is also called the sum of squares of the errors about the regression line and is denoted by SSE. This
minimization procedure for estimating the parameters is called the method of least squares. Hence, we shall find a and b so
as to minimize

Differentiating SSE with respect to a and b, equating the partial derivatives to zero and rearranging the terms to obtain the
equations (called the normal equations)
ACTIVITIES/ CASE STUDIES/ IMPORTANT FACTS RELATED
TO THE SESSION

Which may solved simultaneously to yield the computing formulas for a and b.
EXAMPLES

Example: Engineers fabricating a new transmission-type electron multiplier created an array of silicon nanopillars on a
flat silicon membrane. The precise structure can influence the electrical properties so, subsequently, the height and widths
of 50 nanopillars were measured in nanometres or 10 -9 meters. The summary statistics, with x=width and y=height, are

N=50, Sxx=7239.22, Sxy=17840.1, Syy=66957.2

a) Find the least squares line for predicting height from width

b) Find the least squares line for predicting width from height.

c) Make a scatter plot and show both lines. Comment.

Solution:

a) Here y=height and the least squares estimates are

slope=b=Sxy/Sxx=17840.1/7239.22=2.464 and
EXAMPLES

The fitted line is height =87.88+2.464 width.

b) Width is now the response variable and height the predictor, so x and y must be interchanged.

Slope b= 17,840.1/66976.2=0.266 and

The fitted line is width=6.944+0.266 height.

c) Here we construct the scatter plot and include the two lines of regression. The line from part (b) is written as

Height =-(6.944/0.266)+(1/0.266)width=-26.11+3.759width

Note that both pass through the mean point (

The chice of fitted line depends on which variable you wish to predict.
SUMMARY

In this session,
1. Define Regression analysis and how it is related with correlation discussed
2. Differentiate the linear and nonlinear regressions.
3. Method of least squares in determining the coefficient have described
SELF-ASSESSMENT QUESTIONS

1. In regression analysis, the variable that is being predicted is the

a) response, or dependent, variable

b) independent variable …
c) intervening variable
d) is usually x

In regression, the equation that describes how the response variable (y) is related to
the explanatory variable (x) is:

a) the correlation model

b) the regression model
c) used to compute the correlation
coefficient
d) None of these alternatives is correct.
TERMINAL QUESTIONS
1. Describe the linear and non linear regression

2. List out the properties of regression coefficients

3. Analyze the regression analysis and its importance in practical experiment

4. In the accompanying table, x is the tensile force applied to a steel specimen in thousands of pounds, and y is the resulting
elongation in thousandths of an inch:
X: 1 2 3 4 5 6
Y: 14 33 40 63 76 85
a) Graph the data to verify that it is reasonable to assume that the regression of Y on x is linear.
b) Find the equation of the least squares line, and use it to predict the elongation when the tensile force is 3.5 thousand pounds.
TERMINAL QUESTIONS

5) A professor in the school of business in a university polled a dozen colleagues about the number of professional
meetings professors attended in the past five years (x) and the number of papers submitted by those to refereed journals
(y) during the same period. The summary data are given as follows:
n=12,
Fit a straight line to the given data.
REFERENCES FOR FURTHER LEARNING OF THE
SESSION
Reference Books:
1. Chapter 1 of TP1: William Feller, An Introduction to Probability Theory and Its Applications:
Volume 1, Third Edition, 1968 by John Wiley & Sons,Inc.
2. Richard A Johnson, Miller& Freund’s Probability and statistics for Engineers, PHI, New Delhi,
11th Edition (2011).

Sites and Web links:

1. https://www.statisticshowto.com/probability-and-statistics/correlation-coefficient-formula/
2.https://www.khanacademy.org/math/statistics-probability/describing-relationships-quantitative-data/regression-
library/v/introduction-to-residuals-and-least-squares
3. https://nptel.ac.in/courses/105105150/24
THANK YOU

Team – DATA SCIENCE AND STATISTICS

2024-25

Lecture 6 - Regression Analysis
No ratings yet
Lecture 6 - Regression Analysis
34 pages
Proton Waja 4G18 Engine Service Manual
No ratings yet
Proton Waja 4G18 Engine Service Manual
144 pages
Chapter 5 Regression Analysis
No ratings yet
Chapter 5 Regression Analysis
14 pages
FeelingFaces Cards En-Blank
No ratings yet
FeelingFaces Cards En-Blank
4 pages
1 - Stat-701 Regression
No ratings yet
1 - Stat-701 Regression
18 pages
Module 11 Unit 2 Simple Linear Regression
No ratings yet
Module 11 Unit 2 Simple Linear Regression
10 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
27 pages
Lect5 Math231
No ratings yet
Lect5 Math231
31 pages
Math (Regression Theory)
No ratings yet
Math (Regression Theory)
31 pages
Slide Chap11
No ratings yet
Slide Chap11
19 pages
Linear Regression
No ratings yet
Linear Regression
33 pages
Simple Linear Regression & Correlation Chapter No 14...
No ratings yet
Simple Linear Regression & Correlation Chapter No 14...
43 pages
Engineering - Simple Correlation and Regression - 2024
No ratings yet
Engineering - Simple Correlation and Regression - 2024
35 pages
Chap01-3 (Autosaved)
No ratings yet
Chap01-3 (Autosaved)
51 pages
Linear Models
No ratings yet
Linear Models
92 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
95 pages
Regression and Correlation
No ratings yet
Regression and Correlation
13 pages
Simple Linear Regression Analysis - ReliaWiki
No ratings yet
Simple Linear Regression Analysis - ReliaWiki
29 pages
Unit 2 Regression
No ratings yet
Unit 2 Regression
31 pages
STAT22209 - Chapter 02-Regression Analyisis - 2022
No ratings yet
STAT22209 - Chapter 02-Regression Analyisis - 2022
41 pages
Lecture 4
No ratings yet
Lecture 4
22 pages
Biostat Lecture 10
No ratings yet
Biostat Lecture 10
47 pages
(Mathe) Simple Linear Regression and Correlation
No ratings yet
(Mathe) Simple Linear Regression and Correlation
61 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
25 pages
Lecture6 Regression
No ratings yet
Lecture6 Regression
42 pages
Updated Lecture 7
No ratings yet
Updated Lecture 7
29 pages
Linear Regression Model
No ratings yet
Linear Regression Model
36 pages
QUIZ (Objectives) Identification: - (Residual)
No ratings yet
QUIZ (Objectives) Identification: - (Residual)
5 pages
Bio-L8 - Correlation and Regression Analysis
No ratings yet
Bio-L8 - Correlation and Regression Analysis
15 pages
Linear Regression
No ratings yet
Linear Regression
15 pages
Simple and Multiple Regression
No ratings yet
Simple and Multiple Regression
56 pages
Engineering Analysis & Statistics: Lect. # 11
No ratings yet
Engineering Analysis & Statistics: Lect. # 11
22 pages
Regression Analysis
No ratings yet
Regression Analysis
22 pages
Regression 1
No ratings yet
Regression 1
32 pages
Regression Analysis
No ratings yet
Regression Analysis
38 pages
Topic 8 - Regression Analysis
No ratings yet
Topic 8 - Regression Analysis
51 pages
Week 2
No ratings yet
Week 2
33 pages
5 - Part II - Regression Analysis W-Notes
No ratings yet
5 - Part II - Regression Analysis W-Notes
10 pages
8-Simple Regression Analysis
No ratings yet
8-Simple Regression Analysis
9 pages
Linear Regression Models
No ratings yet
Linear Regression Models
41 pages
Linear Regression
No ratings yet
Linear Regression
15 pages
Unit 3new
No ratings yet
Unit 3new
34 pages
Lecture 6 Simple Linear Regression
No ratings yet
Lecture 6 Simple Linear Regression
36 pages
BST 32202 Linear Regression 6 SLR Assumptions Lse
No ratings yet
BST 32202 Linear Regression 6 SLR Assumptions Lse
20 pages
03 Revisions L Regression
No ratings yet
03 Revisions L Regression
25 pages
Cs3351 Aiml Unit 3 Notes Eduengg
No ratings yet
Cs3351 Aiml Unit 3 Notes Eduengg
38 pages
Lecture9 Regression1 PDF
No ratings yet
Lecture9 Regression1 PDF
22 pages
Regression Course For Second Year (Chap 1-3)
No ratings yet
Regression Course For Second Year (Chap 1-3)
59 pages
Light Activated Switch Circuit Diagram
100% (1)
Light Activated Switch Circuit Diagram
2 pages
Correlation and Linear Regression
No ratings yet
Correlation and Linear Regression
51 pages
325unit 1 Simple Regression Analysis
No ratings yet
325unit 1 Simple Regression Analysis
10 pages
(Revised) Simple Linear Regression and Correlation
No ratings yet
(Revised) Simple Linear Regression and Correlation
41 pages
Handout 4 Regression and Correlation
No ratings yet
Handout 4 Regression and Correlation
13 pages
Artificial Intelligence and Machine Learning - CS3491 - Notes - Unit 3 - Supervised Learning
No ratings yet
Artificial Intelligence and Machine Learning - CS3491 - Notes - Unit 3 - Supervised Learning
37 pages
Regression and Correlation
No ratings yet
Regression and Correlation
14 pages
Lesson 11 Simple Linear Regression and Correlation
No ratings yet
Lesson 11 Simple Linear Regression and Correlation
38 pages
Da Unit 3 R22
No ratings yet
Da Unit 3 R22
15 pages
Simple Linear Regression: Definition of Terms
No ratings yet
Simple Linear Regression: Definition of Terms
13 pages
1486016038da Mod12 Q1 e Text
No ratings yet
1486016038da Mod12 Q1 e Text
11 pages
Unit 2-1
No ratings yet
Unit 2-1
30 pages
Differential Equations A Problem Solving Approach Based On MATLAB by P. Mohana Shankar
No ratings yet
Differential Equations A Problem Solving Approach Based On MATLAB by P. Mohana Shankar
459 pages
5.IMPRESSION TECHNIQUES FOR COMPLETE DENTURES (Shewlett)
100% (1)
5.IMPRESSION TECHNIQUES FOR COMPLETE DENTURES (Shewlett)
45 pages
Factors That Influence Temperature & Rainfall
100% (1)
Factors That Influence Temperature & Rainfall
4 pages
3.2.9. Rubber Closures For Containers For Aqueous Parenteral Preparations, For Powders and For Freeze-Dried Powders
No ratings yet
3.2.9. Rubber Closures For Containers For Aqueous Parenteral Preparations, For Powders and For Freeze-Dried Powders
2 pages
Olivetti - MS-DOS 3.30 - Software Installation Guide
No ratings yet
Olivetti - MS-DOS 3.30 - Software Installation Guide
203 pages
IRD Project 1
No ratings yet
IRD Project 1
16 pages
2020 GKS-U Application Guidelines (Regional University Track)
No ratings yet
2020 GKS-U Application Guidelines (Regional University Track)
28 pages
Parts
No ratings yet
Parts
4 pages
7 - American National Standards Institute (ANSI) Standard
No ratings yet
7 - American National Standards Institute (ANSI) Standard
20 pages
Lab: ARMA (1, 1) Process: T T T T
No ratings yet
Lab: ARMA (1, 1) Process: T T T T
7 pages
2nd Grade Skills Checklist: Reading & Language Arts
No ratings yet
2nd Grade Skills Checklist: Reading & Language Arts
4 pages
23AD2001R Lab Workbook
No ratings yet
23AD2001R Lab Workbook
56 pages
77777
No ratings yet
77777
29 pages
Session-5 - DBMS
No ratings yet
Session-5 - DBMS
18 pages
Assignment Plan
No ratings yet
Assignment Plan
3 pages
CMT Quiz
No ratings yet
CMT Quiz
3 pages
Chap 10
No ratings yet
Chap 10
50 pages
En5922 Db08a-01
No ratings yet
En5922 Db08a-01
2 pages
ME 142L Gear Pump Test Experimentmade
No ratings yet
ME 142L Gear Pump Test Experimentmade
8 pages
Crash 1500
No ratings yet
Crash 1500
77 pages
Macquarie's Secret To Superfast Mortgage Growth
No ratings yet
Macquarie's Secret To Superfast Mortgage Growth
4 pages
Partition
No ratings yet
Partition
52 pages
Industry 4.0
No ratings yet
Industry 4.0
4 pages
Apex Voltage To Current Conversion
No ratings yet
Apex Voltage To Current Conversion
4 pages
Varm All 300 English-1
No ratings yet
Varm All 300 English-1
26 pages
Rait Terminal Questions Amswers
No ratings yet
Rait Terminal Questions Amswers
11 pages
I Am Curious (Yellow)
No ratings yet
I Am Curious (Yellow)
7 pages
Imiforce 200 SC
No ratings yet
Imiforce 200 SC
5 pages
Mesin Skala Industri
No ratings yet
Mesin Skala Industri
2 pages
Question Bank Advanced CO1, CO2
No ratings yet
Question Bank Advanced CO1, CO2
4 pages
Intended VS Implemented VS Achieved
No ratings yet
Intended VS Implemented VS Achieved
9 pages
Abss
No ratings yet
Abss
8 pages
Experience The Mahabharat Through Play CertificationKLVFinal
No ratings yet
Experience The Mahabharat Through Play CertificationKLVFinal
9 pages
CR03 - PPAP-Flammability-IMDS-OTOP Status
No ratings yet
CR03 - PPAP-Flammability-IMDS-OTOP Status
1 page
Exp 11 2
No ratings yet
Exp 11 2
3 pages
Anu CV
No ratings yet
Anu CV
2 pages
Y21 B.Tech In-Semester II Examinations, November-2024 (2024-25 Odd Sem) TimeTable
No ratings yet
Y21 B.Tech In-Semester II Examinations, November-2024 (2024-25 Odd Sem) TimeTable
1 page
Advanced Calculus
From Everand
Advanced Calculus
H.K Nickerson
No ratings yet
Elements of Tensor Calculus
From Everand
Elements of Tensor Calculus
A. Lichnerowicz
3.5/5 (2)
Digital Signal Processing (DSP) with Python Programming
From Everand
Digital Signal Processing (DSP) with Python Programming
Maurice Charbit
No ratings yet

Session 18 Regression

Uploaded by

Session 18 Regression

Uploaded by

Department of AI & DS

COURSE NAME: DATA SCIENCE & STATISTICS

COURSE CODE: 23MT2013

This Session is designed to:

At the end of this session, you should be able to:

Where, α is the intercept and β is the slope.

Residual: A residual is essentially an error in the fit of the model

, the ith residual εi is given by εi=yi­-, i=1,2,...,n.

N=50, Sxx=7239.22, Sxy=17840.1, Syy=66957.2

c) Make a scatter plot and show both lines. Comment.

a) Here y=height and the least squares estimates are

The fitted line is height =87.88+2.464 width.

Slope b= 17,840.1/66976.2=0.266 and

The fitted line is width=6.944+0.266 height.

Note that both pass through the mean point (

1. In regression analysis, the variable that is being predicted is the

a) response, or dependent, variable

a) the correlation model

2. List out the properties of regression coefficients

3. Analyze the regression analysis and its importance in practical experiment

Sites and Web links:

Team – DATA SCIENCE AND STATISTICS

You might also like

, the ith residual εi is given by εi=yi-, i=1,2,...,n.