0% found this document useful (0 votes)

2 views69 pages

ML Unit2

The document covers supervised learning techniques, focusing on various regression methods including linear regression, multiple linear regression, and logistic regression. It explains the concepts of independent and dependent variables, the mathematical representation of regression models, and the use of cost functions and gradient descent in optimizing these models. Additionally, it highlights applications of logistic regression in classification problems and provides examples of its use in predicting outcomes based on input variables.

Uploaded by

pm626560211

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views69 pages

ML Unit2

Uploaded by

pm626560211

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 69

SUPERVISED

LEARNING
UNIT-2
Unit-2

SUPERVISED LEARNING: Linear

Regression, Multiple Linear
Regression, Logistic
Regression, K Nearest
Neighbours, Decision Trees:
ID3, Classification and
Regression Trees, Support
Vector Machines: Linear and
Nonlinear, Kernel Functions.
WHAT ARE INDEPENDENT AND DEPENDENT VARIABLES?

What's an independent variable?

Answer: An independent variable is exactly what it sounds like. It is a
variable that stands alone and isn't changed by the other variables
you are trying to measure. For example, someone's age might be an
independent variable. Other factors (such as what they eat, how much they
go to school, how much television they watch) aren't going to change a
person's age. In fact, when you are looking for some kind of relationship
between variables you are trying to see if the independent variable causes
some kind of change in the other variables, or dependent variables.
WHAT ARE INDEPENDENT AND
DEPENDENT VARIABLES?
What's a dependent variable?
Answer: Just like an independent variable, a dependent variable is exactly
what it sounds like. It is something that depends on other factors. For
example, a test score could be a dependent variable because it could
change depending on several factors such as how much you studied,
how much sleep you got the night before you took the test, or
even how hungry you were when you took it. Usually when you are
looking for a relationship between two things you are trying to find out
what makes the dependent variable change the way it does.
WHAT ARE INDEPENDENT AND
DEPENDENT VARIABLES?
• (Independent variable) causes a change in (Dependent Variable) and it isn't
possible that (Dependent Variable) could cause a change in (Independent
Variable).
• For example:
• (Time Spent Studying) causes a change in (Test Score) and it isn't possible that
(Test Score) could cause a change in (Time Spent Studying).
• We see that "Time Spent Studying" must be the independent variable and
"Test Score" must be the dependent variable because the sentence doesn't
make sense the other way around.
WHAT ARE INDEPENDENT AND DEPENDENT VARIABLES?
WHAT IS LINEAR REGRESSION?

• Linear regression analysis is used to predict the value of a variable based on the
value of another variable.
• The variable you want to predict is called the dependent variable. The
variable you are using to predict the other variable's value is called the
independent variable.
• Linear regression is a supervised machine learning model majorly used in
forecasting. Supervised machine learning models are those where we use the
training data to build the model and then test the accuracy of the model using
the loss function.
• As the name suggests, it assumes a linear relationship between a set of
independent variables to that of the dependent variable (the variable of interest).
A LINE OF
LINEAR
REGRESSION

Positive Regression Line: Negative Regression

X axis independent Line: X axis independent
variable value is variable value is
increasing and so does increasing and so does
the Y axis dependent the Y axis dependent
variable value is variable value is
increasing decreasing
• Below is the equation to the linear regression model, we all
D
are well aware of it
e
y=mx+c p
e
• Where y is the dependent variable n
d
• M refers to the slope of the line e
n
• X is the independent variable and t
• C is the constant-coefficient of the line
Independent variable
• When there is only one independent feature, it is known as
Simple Linear Regression, and when there are more than
one feature, it is known as Multiple Linear Regression.
• Similarly, when there is only one dependent variable, it is
considered Univariate Linear Regression, while when there
are more than one dependent variables, it is known as
Multivariate Regression.
Uses of Regression

• Determining the strength of predictors

• Relationship between Strength of sales and marketing
spending
• Relationship between age and income
• Forecasting an effect
• How much additinational sale income will I get for each
Thousands of Rupee spent on marketing
• Trend Forecasting /Point Estimates
• What will be the price of GOLD in next six months
Mathematical Representation of Linear Regression :
y= a0+a1x+ e
Where
y=taget variable i.e Dependent variable
x=preditor variable i.e Independent variable
a0=intercept of the line
a1=linear regression coefficient
e = random error
PROBLEM TO SOLVE
Linear Regression using Matrix method
Linear Regression using Matrix method
Linear Regression using Matrix method
Linear Regression using Matrix method
Finding the Best fit Line

• The error between predicted values and actual values

are minimized
• The best fit line will have least errors
• The different values for weights or the coefficient of
lines(a0,a1) gives different lines of regression,So we
need to find the calculate the best fist values for a0
and a1 to find bestfit line
• So to calculate this we use cost function
Need of cost function
Need of cost function
Need for cost function
Cost function-

○ The different values for weights or coefficient of lines (a0, a1) gives the

different line of regression, and the cost function is used to estimate the
values of the coefficient for the best fit line.
○ Cost function optimizes the regression coefficients or weights. It measures
how a linear regression model is performing.
○ We can use the cost function to find the accuracy of the mapping function,
which maps the input variable to the output variable. This mapping function is
also known as Hypothesis function.
The cost function used in linear regression are
• Mean Squared error(MSE)
Gradient Descent
○ Gradient descent is used to minimize the MSE by
calculating the gradient of the cost function.
○ A regression model uses gradient descent to update the
coefficients of the line by reducing the cost function.
○ It is done by a random selection of values of coefficient
and then iteratively update the values to reach the
minimum cost function.
Multiple linear regression

Simple Linear Regression:

If a single independent variable is used to predict the
value of a numerical dependent variable, then such a
Linear Regression algorithm is called Simple Linear
Regression.
Multiple Linear Regression :
It involves multiple independent variables/predictors
and one dependent variable
• The multiple regression of two variables x1 and x2 is given as

y = f(x1,x2)
y = ao+a1x1+a2x2
• Similarly,for a given ‘n’ independent variables,the equation is

y = f(x1,x2,..xn)
y = ao+a1x1+a2x2+..+anxn+ε
X1 X2 Y
- - - ● Apply multiple Regression
Product Product Weekly
for the values given in Table
1 Sales 2 sales sales
where weekly sales along
1 4 1 with sales for products x1
2 5 6 and x2 are provided
3 8 8 ● Matrix approach is used to
solve the problem
4 2 12
X1 X2 Y X= 1 1 4
1
- - - 1 2 5 Y= 6
1 3 8
Product 1 Product 2 Weekly 1 4 2
8
12
Sales sales sales CO
1 4 1 B I L1
AS - – X2
L 3
2 5 6 CO

3 8 8 COL2-X1
4 2 12
• The coefficient of multiple linear regression equation is
a0
a= a1
a2

• The regression coefficient for multiple linear regression is

calculated as:
XTX= 1 1 1 1 1 1 4
4 10 19
1 2 3 4 1 2 5
4 5 8 2 1 3 8 = 10 30 46
19 46 109
1 4 2

3.15 -0.59 -0.30

4 10 19 -0.59 0.20 0.016
10 30 -1
46 -0.30 0.016 0.054
19 46 109
(XTX)-1=

A
Now calculate
1 1 1 1
(X X) X =
T -1 T
A * 1 2 3 4
4 5 8 2

0.05 0.47 -1.02 0.19

-0.32 -0.098 0.155 0.26
-0.065 0.005 0.185 -0.125
Final Ans is..

a1 a0
a2
y=a0+a1x1+a2x2

y=-1.69+3.48x1-0.05x2

In the above equation ,if we know the values of

independent variables x1 and x2,then we can predict y
Logistic Regression
Logistic Regression

• Linear regression is suitable when we need to

predict the numerical responses but it is not
suitable for categorical responses
• When categorical problems are involved ,it is
called as classification problem
• Logistic Regression is suitable for Binary
classification problem
The sigmoid function transforms the continuous real number into a range of ( 0 , 1 )
Some of the applications of Logistic
regression are…
• Fraud Detection in credit card
• Email spam or not
• Sentiment Analysis in Twitter analysis
• Image segmentation,recognition, and classification-X-rays,scans
• Object detection through video
• Handwriting recognition
• Disease prediction-Diabetes,cancer etc
• and many more….
For example ,The organization wants to determine the
increase of sal based on the employee performance
In this case linear regression will help where
• emp rating-x-independent variable
• performance-y-dependent variable
• Now,what if the organization wants to know
whether an employee would be given promotion or
not based on performance
• Now the problem statement response is not in
numerical form ..it is in categorical form
• So the linear graph will not be suitable for this kind
of problem stmts
• Solution is SIGMOID CURVE(S
Curve)
• Based on the threshold
values,the organization decides
to give promotion or not
Example:

• The student dataset has entrance marks based on the historic data of those who are

Based on the logistic regression ,the values of the learnt parameters are 𝜷0=1 and
selected or not selected
•
𝜷1=8
• Assume the marks of X=60 and threshold value=0.5,compute the resultant class
p(x)=1/(1+e-(𝜷0+𝜷1x))
𝜷0+𝜷1x=481 where
𝜷0=1,𝜷1=8,x=60
p(x)=1/(1+e-481) =0.44
• 0.44<0.5 ,therefore for the given marks the student will come under the class “NOT
SELECTED”
Logistic Regression example

Integrated Mathematics IA
50% (2)
Integrated Mathematics IA
40 pages
Linear Regression
No ratings yet
Linear Regression
16 pages
Hays 5th Edition Errors
No ratings yet
Hays 5th Edition Errors
3 pages
Multiple Regression Analysis Using SPSS Statistics
No ratings yet
Multiple Regression Analysis Using SPSS Statistics
9 pages
What Are Linear Models in Machine Learning (1) .Docx (Unit3 ML)
No ratings yet
What Are Linear Models in Machine Learning (1) .Docx (Unit3 ML)
60 pages
18-Linear Regression
No ratings yet
18-Linear Regression
29 pages
Unit 2
No ratings yet
Unit 2
19 pages
Mod3 Eda
No ratings yet
Mod3 Eda
16 pages
SML Updated UNIT 3
No ratings yet
SML Updated UNIT 3
41 pages
Linear Regression
No ratings yet
Linear Regression
7 pages
Linear Regression For Machine Learning
No ratings yet
Linear Regression For Machine Learning
9 pages
Linear Regression
No ratings yet
Linear Regression
11 pages
Hanan
No ratings yet
Hanan
9 pages
Data Science
100% (1)
Data Science
14 pages
Unit-4 DS Student
No ratings yet
Unit-4 DS Student
43 pages
Unit-2 ML
No ratings yet
Unit-2 ML
39 pages
ML Unit-2 Final
No ratings yet
ML Unit-2 Final
32 pages
L4a - Supervised Learning
No ratings yet
L4a - Supervised Learning
25 pages
Regression: Unit Iii
No ratings yet
Regression: Unit Iii
54 pages
AAI Lecture 10 SP 25
No ratings yet
AAI Lecture 10 SP 25
37 pages
Linear Regression
No ratings yet
Linear Regression
22 pages
Lecture Note #8 - PEC-CS701E
No ratings yet
Lecture Note #8 - PEC-CS701E
20 pages
Simple Linear Regression With Example Problem
No ratings yet
Simple Linear Regression With Example Problem
12 pages
ML Algorithm
No ratings yet
ML Algorithm
4 pages
Module III (Part II) (Regression and Time Series)
No ratings yet
Module III (Part II) (Regression and Time Series)
118 pages
ML - Module 2
No ratings yet
ML - Module 2
16 pages
Regression
No ratings yet
Regression
14 pages
Simple Linear and Logistic Regression
No ratings yet
Simple Linear and Logistic Regression
81 pages
Linear Regression
No ratings yet
Linear Regression
24 pages
DA Unit-3
No ratings yet
DA Unit-3
13 pages
DS Unit-Iv
No ratings yet
DS Unit-Iv
34 pages
3CP10 Final MJJ Linear Regression
No ratings yet
3CP10 Final MJJ Linear Regression
68 pages
5 - AML Lecture 5 - Linear Regression
No ratings yet
5 - AML Lecture 5 - Linear Regression
56 pages
Linear Regression
No ratings yet
Linear Regression
8 pages
Machine Learning and Deep Learning Course
No ratings yet
Machine Learning and Deep Learning Course
23 pages
Chapter2 1
No ratings yet
Chapter2 1
55 pages
Regression Modelling
No ratings yet
Regression Modelling
25 pages
Unit-2 Notes
No ratings yet
Unit-2 Notes
30 pages
Machine Learning Class Slide
No ratings yet
Machine Learning Class Slide
44 pages
MachineLearning Unit-II
No ratings yet
MachineLearning Unit-II
45 pages
Machine Learning I
No ratings yet
Machine Learning I
61 pages
Supervised Learning Algorithms
No ratings yet
Supervised Learning Algorithms
20 pages
2.1 Regression Analysis
No ratings yet
2.1 Regression Analysis
28 pages
Lecture 4 - Linear Regression
No ratings yet
Lecture 4 - Linear Regression
18 pages
Solving One Variable Linear Equations
No ratings yet
Solving One Variable Linear Equations
10 pages
Chapter 8
No ratings yet
Chapter 8
39 pages
Linear Regression
100% (1)
Linear Regression
8 pages
Unit 3c Linear Regression
No ratings yet
Unit 3c Linear Regression
98 pages
Day 2-Data Science
No ratings yet
Day 2-Data Science
16 pages
OE-ML Unit - 3
No ratings yet
OE-ML Unit - 3
29 pages
ML Exp1 C36
No ratings yet
ML Exp1 C36
13 pages
Unit5 R
No ratings yet
Unit5 R
5 pages
Unit-Iii-1 1
No ratings yet
Unit-Iii-1 1
31 pages
Ssdma Unit 2 Part1
No ratings yet
Ssdma Unit 2 Part1
20 pages
Day.9 SML
No ratings yet
Day.9 SML
23 pages
PA
No ratings yet
PA
28 pages
ML Module3 Regression
No ratings yet
ML Module3 Regression
51 pages
Fai Module 3
No ratings yet
Fai Module 3
67 pages
ML - Module 3 Chapter 5
No ratings yet
ML - Module 3 Chapter 5
10 pages
IDS UNIT 5 Linear Regression
No ratings yet
IDS UNIT 5 Linear Regression
27 pages
Calculus: Maths of the Gods
From Everand
Calculus: Maths of the Gods
Bill Todorovich
No ratings yet
Exercises of Logarithms and Exponentials
From Everand
Exercises of Logarithms and Exponentials
Simone Malacrida
No ratings yet
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
QT Model Paper .2
No ratings yet
QT Model Paper .2
3 pages
Neural Network Assignment Enhanced
No ratings yet
Neural Network Assignment Enhanced
8 pages
Gmail - FWD - Google Maps
No ratings yet
Gmail - FWD - Google Maps
3 pages
Gmail - FWD - ML 7th Labset
No ratings yet
Gmail - FWD - ML 7th Labset
1 page
Mobile Application Development - 6049 - B.voc. It
No ratings yet
Mobile Application Development - 6049 - B.voc. It
2 pages
ISYE 6413: Design and Analysis of Experiments Spring, 2020: Jeffwu@isye - Gatech.edu
No ratings yet
ISYE 6413: Design and Analysis of Experiments Spring, 2020: Jeffwu@isye - Gatech.edu
2 pages
(Ch. 6) Comparisons of Several Multivariate Means MH & NAM
No ratings yet
(Ch. 6) Comparisons of Several Multivariate Means MH & NAM
44 pages
Reporting Statistics in Psychology
No ratings yet
Reporting Statistics in Psychology
7 pages
Chapter 10 QBM
No ratings yet
Chapter 10 QBM
38 pages
Walmart Business Case Study - Ipynb - Colab
No ratings yet
Walmart Business Case Study - Ipynb - Colab
28 pages
MSS Assignment 2
No ratings yet
MSS Assignment 2
4 pages
Exploratory Data Analysis: 2.1 Objectives
No ratings yet
Exploratory Data Analysis: 2.1 Objectives
23 pages
Q2 Module 4 Statistics
No ratings yet
Q2 Module 4 Statistics
11 pages
Scott and Watson CHPT 4 Solutions
No ratings yet
Scott and Watson CHPT 4 Solutions
4 pages
Evaluation and Cross Validation Detailed
No ratings yet
Evaluation and Cross Validation Detailed
2 pages
BEO1106 Business Statistics Assignment Part III AnswerSheet
No ratings yet
BEO1106 Business Statistics Assignment Part III AnswerSheet
4 pages
Lecture 7
No ratings yet
Lecture 7
6 pages
Feature Selection
No ratings yet
Feature Selection
22 pages
Warranty Data Analysis: A Review: Shaomin Wu
No ratings yet
Warranty Data Analysis: A Review: Shaomin Wu
21 pages
4.anova Test
No ratings yet
4.anova Test
55 pages
Dept of Eco Ets Course Content Mphil Econometrics
No ratings yet
Dept of Eco Ets Course Content Mphil Econometrics
18 pages
BSIT PU PP (Solved) MS251 S23
No ratings yet
BSIT PU PP (Solved) MS251 S23
2 pages
Ceng317 Gc32 Final Exam: Two-Way Anova
No ratings yet
Ceng317 Gc32 Final Exam: Two-Way Anova
6 pages
Exercise EC5002 Econometrics All Questions
No ratings yet
Exercise EC5002 Econometrics All Questions
24 pages
MAE 108 - Probability and Statistical Methods For Engineers - Spring 2014 Final Exam, June 10 Instructions
No ratings yet
MAE 108 - Probability and Statistical Methods For Engineers - Spring 2014 Final Exam, June 10 Instructions
8 pages
Session 09 - BS - 2020-Z Score
No ratings yet
Session 09 - BS - 2020-Z Score
32 pages
Forcasting Dimsum
No ratings yet
Forcasting Dimsum
18 pages
A Dynamic Binary Probit Model With Time-Varying Parameters and Shrinkage Prior
No ratings yet
A Dynamic Binary Probit Model With Time-Varying Parameters and Shrinkage Prior
13 pages
Test1 Answers
No ratings yet
Test1 Answers
7 pages
IAC Lecture4 Homework
No ratings yet
IAC Lecture4 Homework
12 pages
MLT UNIT-3 Notes
No ratings yet
MLT UNIT-3 Notes
35 pages
Unit 4-B: Multiple Regression
No ratings yet
Unit 4-B: Multiple Regression
75 pages

ML Unit2

Uploaded by

ML Unit2

Uploaded by

SUPERVISED

SUPERVISED LEARNING: Linear

What's an independent variable?

Positive Regression Line: Negative Regression

• Determining the strength of predictors

• The error between predicted values and actual values

Simple Linear Regression:

• The regression coefficient for multiple linear regression is

3.15 -0.59 -0.30

0.05 0.47 -1.02 0.19

In the above equation ,if we know the values of

• Linear regression is suitable when we need to

You might also like