Supervised Machine Learning - Regression
Sunita Tiwari
Steps involved in developing Machine Learning Solutions
Types of Supervised Learning
• There are two types of supervised learning:
1. Regression
1. Linear Regression
2. Polynomial Regression
3. Support Vector Regression
4. Decision Tree Regression
2. Classification
1. Logistic Regression
2. Decision Tree
3. Random Forest
4. K-Nearest Neighbour
5. Naïve Bayes
Difference between Regression & Classification
Criteria          | Regression                                    | Classification
Objective         | Predict continuous values                     | Assign data to discrete categories
Examples          | Predicting house prices, stock values         | Email spam detection, tumor classification
Algorithms        | Linear Regression, SVR, Polynomial Regression | Logistic Regression, Decision Trees, SVM
Output Evaluation | Metrics like MSE, RMSE                        | Metrics like Accuracy, Precision, Recall
Decision Boundary | No strict boundary; continuous spectrum       | Defines boundaries between classes
Linear vs. Non-Linear Relationship
General Approach- Regression
• Let x denote the set of input variables and y the output variable.
• In general, regression assumes a model, i.e., some mathematical
relation between x and y, involving some parameters, say θ, of the
following form:
y = f(x, θ)
• The function f(x, θ) is called the regression function.
• The ML algorithm optimizes the parameters in the set θ so that the
approximation error is minimized, i.e., the estimates of the dependent
variable y are as close as possible to the correct values given in the
training set (typically by minimizing the sum of squared errors).
Example
• For the housing example:
• if the input variables are “Age”, “Distance” and “Weight”
• the output variable is “Price”
• the model may be
y = f(x, θ)
Price = a0 + a1 × (Age) + a2 × (Distance) + a3 × (Weight)
• For simple linear regression with a single input, y = a + b × x, it can be
shown that the values of a and b can be computed using the formulas derived next.
Formal Derivation of Linear Regression
• Given n inputs x1, …, xn and outputs y1, …, yn, fit the line y = a + b × x by minimizing the sum of squared errors:
E(a, b) = Σ (yi − a − b × xi)²
• Using the chain rule, setting the partial derivatives ∂E/∂a and ∂E/∂b to zero gives the normal equations:
Σ yi = n × a + b × Σ xi
Σ xi yi = a × Σ xi + b × Σ xi²
• Substituting the first equation into the second, expanding, and breaking the summations apart gives the slope:
b = (n × Σ xi yi − Σ xi × Σ yi) / (n × Σ xi² − (Σ xi)²)
• Simplifying, the intercept then follows from the first normal equation:
a = (Σ yi − b × Σ xi) / n
• For the worked example in the slides, this yields:
Y = 0.785 + 0.425 × x
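As a quick sanity check, these closed-form formulas can be evaluated directly in Python. The sketch below uses small made-up x and y lists (illustrative only, not the data behind the 0.785/0.425 line above):

# Minimal sketch: closed-form simple linear regression on illustrative data.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]   # hypothetical inputs
ys = [1.2, 1.9, 2.8, 3.9, 5.1]   # hypothetical outputs

n = len(xs)
sum_x, sum_y = sum(xs), sum(ys)
sum_xy = sum(x * y for x, y in zip(xs, ys))
sum_x2 = sum(x * x for x in xs)

b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)  # slope
a = (sum_y - b * sum_x) / n                                   # intercept
print(f"y = {a:.3f} + {b:.3f} * x")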
Example for implementation
• In this example, we will predict the GPA of a student from their SAT score.
• Our dependent variable is GPA; the independent variable is the SAT score.
Implementation of Linear Regression in Python
• Step-1: Importing the relevant libraries
• The first three are pretty conventional; we will not use numpy for now.
• In addition, the machine learning library we will employ for this linear regression example is statsmodels.
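A sketch of what that import cell might look like (the exact code is not shown in the slides; numpy appears here but is unused for now):

import numpy as np                # conventional, not used for now
import pandas as pd               # loading and inspecting the data
import matplotlib.pyplot as plt   # scatter plots
import statsmodels.api as sm      # ordinary least squares (OLS)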
Contd..
• Step-2: Loading the data
• After running it, the data from the .csv file will be loaded into the data variable.
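The loading line being referred to might look like this; the file name below is a placeholder for the actual .csv holding the SAT/GPA sample:

data = pd.read_csv('sat_gpa.csv')   # placeholder file name, adjust to your file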
Contd..
• You can check the data just by writing data on the next line and executing it.
Contd..
• There are two columns: SAT and GPA.
• Use data.describe() to find further information about the data; this is a pandas method.
• We have a sample of 84 students.
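For reference, the inspection call is simply:

print(data.describe())   # count (84 students), mean, std, min, quartiles, max for SAT and GPA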
Contd..
• Let's create a variable called y, which will contain the GPA (see the sketch below).
1. First, we write the name of the data frame, in this case data.
2. Then, we add in square brackets the relevant column name, which is GPA in our case.
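Putting those two steps together, and also naming the SAT column x1 since the later steps refer to it that way (a reasonable reading of the slides, not shown explicitly here):

y = data['GPA']    # dependent variable
x1 = data['SAT']   # independent variable, referred to as x1 below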
Contd..
• Step-3: Exploring the data using matplotlib
• Each point on the graph represents a different student.
• For instance, the highlighted point below is a student who scored around 1900 on the SAT and graduated with a 3.4 GPA.
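A sketch of the scatter plot described above, assuming x1 and y were defined as in the previous steps:

plt.scatter(x1, y)               # one point per student
plt.xlabel('SAT', fontsize=12)
plt.ylabel('GPA', fontsize=12)
plt.show()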
Contd..
• Observing all data points, we can see that there is a strong relationship between SAT and GPA.
• In general, the higher the SAT of a student, the higher their GPA.
Contd..
• Step-4: Next, we need to create a new variable, which we'll call x.
• We have our x1, but we don't have an x0.
• In fact, in the regression equation there is no explicit x0; the coefficient b0 stands alone.
• That can be represented as b0 * 1. So, if there were an x0, it would always be 1.
• Use add_constant(), as sketched below.
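In code, the constant column can be added like this (add_constant is a standard statsmodels helper):

x = sm.add_constant(x1)   # prepends a column of 1s so the model includes b0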
Contd..
• Step-5: We will create another variable named results.
• It will contain the output of the ordinary least squares (OLS) regression, as sketched below.
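A sketch of this step, assuming x and y were built as above:

results = sm.OLS(y, x).fit()   # fit ordinary least squares
print(results.summary())       # prints the regression table discussed below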
Contd..
• Step-6: Let's plot again.
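A sketch of this plot, drawing the fitted line from the estimated coefficients (the 'const' and 'SAT' names come from add_constant and the SAT column; treat this as an illustration, not the exact slide code):

plt.scatter(x1, y)
yhat = results.params['const'] + results.params['SAT'] * x1   # fitted values
plt.plot(x1, yhat, c='orange', label='regression line')
plt.xlabel('SAT', fontsize=12)
plt.ylabel('GPA', fontsize=12)
plt.legend()
plt.show()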
How to interpret the Regression table
• The regression table has three parts:
• Model summary
• Coefficient table
• Some additional tests
• In the coefficient table, the coef value of the constant (const) is 0.275, which means b0 is 0.275.
• The lower the standard error, the better the estimate!
• The next two values are the t-statistic and its p-value.
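The same quantities can also be read off programmatically as a cross-check (these are standard attributes of a fitted statsmodels OLS results object):

print(results.params)     # coefficients; the const entry is b0 (0.275 here)
print(results.bse)        # standard errors of the coefficients
print(results.tvalues)    # t-statistics
print(results.pvalues)    # p-values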