0% found this document useful (0 votes)

11 views45 pages

Unit 3 - ML - CH-1

Uploaded by

Priyanka Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views45 pages

Unit 3 - ML - CH-1

Uploaded by

Priyanka Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 45

Unit 3

Classification – Logistic Regression & Neural Network

Dr. ABHINANDAN P. SHIRAHATTI

Associate Professor,
Department of Computer Science Engineering,
KIT’s College of Engineering (Autonomous),
Kolhapur
Maharashtra – 416234

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Content
Logistic regression- definition, hypothesis representation, decision boundary, cost
function, gradient descent for logistic regression. Multiclass classification.
Regularization – over fitting & under fitting, cost function, regularized linear
regression, regularized logistic Regression.

Neural networks-neuron representation and model, hypothesis for neuron, cost

function, solution of a problem using single neuron, gradient descent for a neuron.
Multiclass classification with neural network. Learning in neural networks –
feedforward neural network, backpropagation algorithm. Loss function – support
vector machines (SVMs), softmax
regression.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Logistic Regression
• Logistic regression is a supervised machine learning algorithm used for classification
tasks where the goal is to predict the probability that an instance belongs to a given class or
not. Logistic regression is a statistical algorithm which analyze the relationship between two
data factors.
Logistic regression is used for binary classification where we use sigmoid
function, that takes input as independent variables and produces a probability
value between 0 and 1.
For example, we have two classes Class 0 and Class 1 if the value of the logistic
function for an input is greater than 0.5 (threshold value) then it belongs to
Class 1 otherwise it belongs to Class 0. It’s referred to as regression because it
is the extension of linear regression but is mainly used for classification
problems.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Key Points
• Logistic regression predicts the output of a categorical dependent variable. Therefore,
the outcome must be a categorical or discrete value.

• It can be either Yes or No, 0 or 1, true or False, etc. but instead of giving the exact value
as 0 and 1, it gives the probabilistic values which lie between 0 and 1.

• In Logistic regression, instead of fitting a regression line, we fit an “S” shaped logistic
function, which predicts two maximum values (0 or 1).

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Logistic Function – Sigmoid Function
 The sigmoid function is a mathematical function used to map the predicted values to
probabilities.

 It maps any real value into another value within a range of 0 and 1. The value of the
logistic regression must be between 0 and 1, which cannot go beyond this limit, so it
forms a curve like the “S” form.

 The S-form curve is called the Sigmoid function or the logistic function.

 In logistic regression, we use the concept of the threshold value, which defines the
probability of either 0 or 1. Such as values above the threshold value tends to 1, and a
value below the threshold values tends to 0.
KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Types of Logistic Regression
On the basis of the categories, Logistic Regression can be classified into three types:

 Binomial: In binomial Logistic regression, there can be only two possible

types of the dependent variables, such as 0 or 1, Pass or Fail, etc.

 Multinomial: In multinomial Logistic regression, there can be 3 or more

possible unordered types of the dependent variable, such as “cat”, “dogs”, or
“sheep”

 Ordinal: In ordinal Logistic regression, there can be 3 or more possible

ordered types of dependent variables, such as “low”, “Medium”, or “High”.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Terminologies involved in Logistic Regression
Independent variables: The input characteristics or predictor factors applied to the dependent variable’s
predictions.
Dependent variable: The target variable in a logistic regression model, which we are trying to predict.
Logistic function: The formula used to represent how the independent and dependent variables relate to
one another. The logistic function transforms the input variables into a probability value between 0 and
1, which represents the likelihood of the dependent variable being 1 or 0.
Odds: It is the ratio of something occurring to something not occurring. it is different from probability as
the probability is the ratio of something occurring to everything that could possibly occur.
Log-odds: The log-odds, also known as the logit function, is the natural logarithm of the odds. In logistic
regression, the log odds of the dependent variable are modeled as a linear combination of the
independent variables and the intercept.
Coefficient: The logistic regression model’s estimated parameters, show how the independent and
dependent variables relate to one another.
Intercept: A constant term in the logistic regression model, which represents the log odds when all
independent variables are equal to zero.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Cost function for Linear Regression
Logistic regression can be used where the probabilities between two classes is required. Such as
whether it will rain today or not, either 0 or 1, true or false etc.
Logistic regression is based on the concept of Maximum Likelihood estimation. According to this
estimation, the observed data should be most probable.
In logistic regression, we pass the weighted sum of inputs through an activation function that can
map values in between 0 and 1. Such activation function is known as sigmoid function and the
curve obtained is called as sigmoid curve or S-curve. Consider the below image:

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Cost function for Logistic Regression
The cost function for Logistic Regression (not "logistic linear regression") is designed to
measure the error between predicted probabilities and actual class labels in
classification problems. Since Logistic Regression deals with probabilities and binary
classification, the cost function is different from the Mean Squared Error used in
Linear Regression.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Cost Function
.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Cost function for Logistic Regression
.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
.
.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Mathematical intuition
To understand the logistic regression, lets go over the odds of success.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Mathematical intuition

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Problem on Logistic Regression
1. You are tasked with predicting whether students will pass or fail a course based on the
number of hours they studied. The outcome variable is binary:1 (Pass) /0 (Fail)
Given a dataset where each student has studied for a certain number of hours, we want to
build a logistic regression model to predict whether a student will pass or fail.
Data set:
Study Hours (x) Pass/Fail (y)
1 0
2 1
3 1
4 1
5 ?

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Formulation of the Logistic Regression Problem:
1. Hypothesis Function:
Logistic regression is used to model the probability that the dependent variable y
(Pass/Fail) is 1, given the number of study hours x. The hypothesis function is given
by the sigmoid function:

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
2. Cost Function:
For logistic regression, the cost function is based on maximum likelihood
estimation and is defined as:

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
3. Gradient Descent
.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
4. Decision Boundary
.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Solution
Here , the independent variable X is represent in the form of Matrix
XT =[1 2 3 4]
The dependent variable Y is represent in the form of Matrix:
YT =[0 1 1 1]
The data can be given in the matrix form as follows:

The first column is used for setting the bias

 Y=

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Logistic Regression Matrix method
The regression is given below:

 Step by step computation is given below:

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Logistic Regression Matrix Method

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Logistic regression method

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Multiclass classification
 Multiclass classification is the task of assigning an instance to one of three
or more classes, as opposed to binary classification, which involves only two
classes.
 In multiclass classification, we deal with datasets where the target variable
can belong to more than two categories, and the goal is to develop models
that can accurately predict the class for unseen instances.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Binary classification vs. Multi-class classification

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Multiclass classification
Binary Classification
Only two class instances are present in the dataset.
It requires only one classifier model.
Confusion Matrix is easy to derive and understand.
Example:- Check email is spam or not, predicting gender based on height and weight.

Multi-class Classification
Multiple class labels are present in the dataset.
The number of classifier models depends on the classification technique we are applying to.
One vs. All:- N-class instances then N binary classifier models
One vs. One:- N-class instances then N* (N-1)/2 binary classifier models
The Confusion matrix is easy to derive but complex to understand.
Example:- Check whether the fruit is apple, banana, or orange.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
One vs. All (One-vs-Rest)
In one-vs-All classification, for the N-class instances dataset, we have to generate the N-
binary classifier models.
The number of class labels present in the dataset and the number of generated binary
classifiers must be the same.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Multiclass classification
As shown in the above image, consider we have three classes, for example, type 1 for
Green, type 2 for Blue, and type 3 for Red.
Now, as I told you earlier that we have to generate the same number of classifiers as the
class labels are present in the dataset, So we have to create three classifiers here for three
respective classes.
Classifier 1:- [Green] vs [Red, Blue]
Classifier 2:- [Blue] vs [Green, Red]
Classifier 3:- [Red] vs [Blue, Green]
Now to train these three classifiers, we need to create three training datasets. So let’s
consider our primary dataset is as follows,

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Primary Data set

You can see that there are three class labels Green, Blue, and Red present in the
dataset. Now we have to create a training dataset for each class.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Create the training datasets by putting +1 in the class column for that feature value, which is aligned
to that particular class only. For the costs of the remaining features, we put -1 in the class column.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Preparation for training data set

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Now, after creating a training dataset for each classifier, we provide it to our classifier model
and train the model by applying an algorithm

Overfitting, Underfitting, and Regularization are three common concepts in machine

learning that are related to the training of models.

Under fitting in Machine Learning

 Machine learning algorithm is said to have under fitting when a model is too simple to
capture data complexities. It represents the inability of the model to learn the training data
effectively result in poor performance both on the training and testing data. In simple terms,
an underfit model’s are inaccurate, especially when applied to new, unseen examples. It
mainly happens when we uses very simple model with overly simplified assumptions. To
address underfitting problem of the model, we need to use more complex models, with
enhanced feature representation, and less regularization.

 Note: The underfitting model has High bias and low variance.
KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Reasons for Under fitting
• The model is too simple, So it may be not capable to represent the complexities in the data.
• The input features which is used to train the model is not the adequate representations of
underlying factors influencing the target variable.
• The size of the training dataset used is not enough.
• Features are not scaled.
Techniques to Reduce Under fitting
 Increase model complexity.
 Increase the number of features, performing feature engineering.
 Remove noise from the data.
 Increase the number of epochs or increase the duration of training to get better results.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Over fitting in Machine Learning

Machine Learning model said to be overfitted when the model does not make accurate
predictions on testing data.When a model gets trained with so much data, it starts learning
from the noise and inaccurate data entries in our data set. And when testing with test data
results in High variance. Then the model does not categorize the data correctly, because of
too many details and noise. The causes of overfitting are the non-parametric and non-linear
methods because these types of machine learning algorithms have more freedom in building
the model based on the dataset and therefore they can really build unrealistic models.

Note: The Over fitting model has Low bias and high variance.
KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Reasons for Overfitting:
• High variance and low bias.
• The model is too complex.
• The size of the training data.
Techniques to Reduce Overfitting
• Improving the quality of training data reduces overfitting by focusing on meaningful patterns,
mitigate the risk of fitting the noise or irrelevant features.
• Increase the training data can improve the model’s ability to generalize to unseen data and reduce the
likelihood of overfitting.
• Reduce model complexity.
• Early stopping during the training phase (have an eye over the loss over the training period as soon as
loss begins to increase stop training).
• Ridge Regularization and Lasso Regularization.
• Use dropout for neural networks to tackle overfitting.
KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Bias and Variance in Machine Learning
Bias: Bias refers to the error due to overly simplistic assumptions in the learning algorithm.
These assumptions make the model easier to comprehend and learn but might not capture the
underlying complexities of the data. It is the error due to the model’s inability to represent the
true relationship between input and output accurately. When a model has poor performance
both on the training and testing data means high bias because of the simple model, indicating
underfitting.
Variance: Variance, on the other hand, is the error due to the model’s sensitivity to
fluctuations in the training data. It’s the variability of the model’s predictions for different
instances of training data. High variance occurs when a model learns the training data’s noise
and random fluctuations rather than the underlying pattern. As a result, the model performs
well on the training data but poorly on the testing data, indicating overfitting.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Bias and Variance in Machine Learning

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Lasso (L1) (Least Absolute Shrinkage and Selection Operator)
The Lasso (Least Absolute Shrinkage and Selection Operator) is a regression
technique used to perform both variable selection and regularization in order to
enhance the prediction accuracy and interpretability of a model.

Lambda λ is the regularization parameter that controls the degree of regularization.

•If lambda λ=0, Lasso reduces to ordinary least squares regression.
•As lambda λ increases, some of the Wi coefficients shrink to zero, effectively selecting a simpler model.

encourages sparsity in the coefficients, meaning it can set some coefficients to

penalty term exactly zero, eliminating irrelevant features.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Ridge(L2) Regularization
Ridge Regularization, also known as L2 regularization, adds a penalty equal to the square of the weights
associated with each feature variable.
This encourages all coefficients to reduce in size by an amount proportional to their values and reduces
model complexity by shrinking large weights toward zero.
Ridge regularization can be more effective than Lasso when there are many collinear variables because
it prevents individual coefficients from becoming too large and overwhelming others.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Elastic Net (L1 +L2) regularization
Elastic Net is a regularization technique that combines both Lasso (L1 regularization) and
Ridge (L2 regularization). It is useful when there are multiple correlated features, and neither
Lasso nor Ridge alone gives optimal results.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111
Assignment questions
1. Explain the difference between logistic regression and linear regression. Why is logistic regression used for
classification problems?
2. Describe the significance of the sigmoid function in logistic regression. What properties make it suitable for
classification?
3. Given a logistic regression hypothesis function explain how this function maps predicted outputs to a
probability.
4. Illustrate, with an example, how logistic regression handles binary classification problems.
5. Define the concept of a decision boundary in logistic regression. How is it determined from the hypothesis function ?
6. Provide an example where the decision boundary is a straight line. How would this change for a nonlinear decision
boundary?
7. Why can't we use the Mean Squared Error (MSE) cost function in logistic regression?
8. Explain the One-vs-All (OvA) strategy for extending logistic regression to multiclass classification problems. Provide
an example scenario where this approach would be used.
9. How does the Softmax Regression (also known as Multinomial Logistic Regression) work for multiclass classification?
Provide the hypothesis function and cost function for softmax regression.
10.How can we mitigate overfitting in logistic regression? List and explain at least two techniques.
11.Explain how the regularized cost function for logistic regression is formulated. Write the regularized cost function
and explain how it differs from the unregularized version.
12.Implement logistic regression from scratch (without using libraries like Scikit-learn) using gradient descent. Apply it
to a binary classification problem.

KIT | Department of Basic Sciences and Humanities | Course: INTRODUCTION TO PYTHON PROGRAMMING | Course Code : UHSES0111

Lecture 4-Logistic Regression
No ratings yet
Lecture 4-Logistic Regression
20 pages
Lecture 07
No ratings yet
Lecture 07
26 pages
FALLSEM2024-25 BCSE209L TH VL2024250101695 2024-08-12 Reference-Material-II
No ratings yet
FALLSEM2024-25 BCSE209L TH VL2024250101695 2024-08-12 Reference-Material-II
19 pages
Logistic Regression
No ratings yet
Logistic Regression
22 pages
6725133c9e12e9db65ccf8d9 Mopumiwejapov
No ratings yet
6725133c9e12e9db65ccf8d9 Mopumiwejapov
2 pages
Session 5 - Logistic Regression
No ratings yet
Session 5 - Logistic Regression
69 pages
Logistic Regression in Machine Learning - GeeksforGeeks
No ratings yet
Logistic Regression in Machine Learning - GeeksforGeeks
10 pages
4.logistic Regression
No ratings yet
4.logistic Regression
28 pages
Supervised Logistic Regression
No ratings yet
Supervised Logistic Regression
13 pages
DSCSignerServiceVer 4 1 6UserGuidelines
No ratings yet
DSCSignerServiceVer 4 1 6UserGuidelines
75 pages
ML Unit 3
No ratings yet
ML Unit 3
40 pages
All Merged Chap 2
No ratings yet
All Merged Chap 2
19 pages
Logistic Regression
No ratings yet
Logistic Regression
36 pages
29 LogisticRegression
No ratings yet
29 LogisticRegression
15 pages
API Marketplace Engineering Design
No ratings yet
API Marketplace Engineering Design
281 pages
Logistic Regression
No ratings yet
Logistic Regression
17 pages
EXP-2-To Implement Logistic Regression
No ratings yet
EXP-2-To Implement Logistic Regression
5 pages
09 23ECE216 LogisticRegression
No ratings yet
09 23ECE216 LogisticRegression
40 pages
Logistic Regression
No ratings yet
Logistic Regression
10 pages
M2 Logistic Regression Classcopy 4
No ratings yet
M2 Logistic Regression Classcopy 4
7 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
53 pages
Logistic Regression
No ratings yet
Logistic Regression
14 pages
Logistics Regression
No ratings yet
Logistics Regression
8 pages
04 CBLM With Competency Assessment Tools
No ratings yet
04 CBLM With Competency Assessment Tools
73 pages
Lecture 22. GLM
No ratings yet
Lecture 22. GLM
41 pages
Task 1
No ratings yet
Task 1
7 pages
Lecture Note #9 - PEC-CS701E
No ratings yet
Lecture Note #9 - PEC-CS701E
41 pages
Experiment - 4: Aim: Gradient Descent Approach
No ratings yet
Experiment - 4: Aim: Gradient Descent Approach
3 pages
B.Tech V KCS055 Unit2 2
No ratings yet
B.Tech V KCS055 Unit2 2
7 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
10 pages
ML Assignment
No ratings yet
ML Assignment
20 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Logistic Regression
No ratings yet
Logistic Regression
12 pages
Report Logistic Regression
No ratings yet
Report Logistic Regression
21 pages
2+logistic Regression
No ratings yet
2+logistic Regression
10 pages
Logistic Regression
No ratings yet
Logistic Regression
4 pages
Google: Google, LLC Is An American Multinational Technology Company
100% (1)
Google: Google, LLC Is An American Multinational Technology Company
45 pages
Exp 2 121a1047 ML Lavanya Kurup Div C C3
No ratings yet
Exp 2 121a1047 ML Lavanya Kurup Div C C3
8 pages
Aiml Unit 3 1
No ratings yet
Aiml Unit 3 1
9 pages
Samatrix Kaa Kaam
No ratings yet
Samatrix Kaa Kaam
3 pages
Dav Exp4 66
No ratings yet
Dav Exp4 66
5 pages
Vendor Manual
No ratings yet
Vendor Manual
74 pages
2-Logistic Regression
No ratings yet
2-Logistic Regression
15 pages
ML (08-08-2024)
No ratings yet
ML (08-08-2024)
5 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
3 pages
MACHINE LEARNING Presentation Logistic Regression
No ratings yet
MACHINE LEARNING Presentation Logistic Regression
18 pages
A Research Project On Applying Logistic Regression To Predict Result of Binary Classification Problems
No ratings yet
A Research Project On Applying Logistic Regression To Predict Result of Binary Classification Problems
6 pages
Wa0004.
No ratings yet
Wa0004.
9 pages
Logistic Regressions
No ratings yet
Logistic Regressions
11 pages
Logistic Regression
No ratings yet
Logistic Regression
6 pages
Logistic Regression
No ratings yet
Logistic Regression
22 pages
2port-Efr Thu-6404
No ratings yet
2port-Efr Thu-6404
255 pages
Logistic Regression
No ratings yet
Logistic Regression
9 pages
NPD Module 5 PDF
No ratings yet
NPD Module 5 PDF
36 pages
What Is Logistic Regression
No ratings yet
What Is Logistic Regression
20 pages
Experiment No 8
No ratings yet
Experiment No 8
4 pages
Unit 04 Pandas
No ratings yet
Unit 04 Pandas
46 pages
Misc 5
No ratings yet
Misc 5
1 page
Reference Material Logistic Regression
No ratings yet
Reference Material Logistic Regression
11 pages
Chapter 4 Enumeration
No ratings yet
Chapter 4 Enumeration
26 pages
Logistic Regression For Machine Learning Complete TutorialUnderstand This Popular Supervised Classifi
No ratings yet
Logistic Regression For Machine Learning Complete TutorialUnderstand This Popular Supervised Classifi
10 pages
Chapter Two Dss
No ratings yet
Chapter Two Dss
3 pages
Reference Material - Logistic - Regression
No ratings yet
Reference Material - Logistic - Regression
11 pages
Logistic Regression
No ratings yet
Logistic Regression
8 pages
Reference Material - Logistic - Regression
No ratings yet
Reference Material - Logistic - Regression
11 pages
Unit2 ML
No ratings yet
Unit2 ML
79 pages
Logistic Regression
No ratings yet
Logistic Regression
14 pages
Professional Writing Skills Handout © Write It Well Academy
100% (1)
Professional Writing Skills Handout © Write It Well Academy
9 pages
ML DSBA Lab2
No ratings yet
ML DSBA Lab2
4 pages
Grade XI: Computer Science Project Work: Submitted By: Rashihang Rai
No ratings yet
Grade XI: Computer Science Project Work: Submitted By: Rashihang Rai
21 pages
Ds Unit 2 Notes
No ratings yet
Ds Unit 2 Notes
26 pages
Logistic Regression
No ratings yet
Logistic Regression
16 pages
LinkVIeW User Manual EN 1.21
No ratings yet
LinkVIeW User Manual EN 1.21
20 pages
Huawei AC650-128AP Wireless Access Controller Datasheet
No ratings yet
Huawei AC650-128AP Wireless Access Controller Datasheet
15 pages
Ds Unit 1 Notes
No ratings yet
Ds Unit 1 Notes
23 pages
MS Xca
No ratings yet
MS Xca
30 pages
Ds Unit 3 Notes
No ratings yet
Ds Unit 3 Notes
29 pages
Logistic Regression
No ratings yet
Logistic Regression
8 pages
Gauss Jordan Method
No ratings yet
Gauss Jordan Method
19 pages
Syllabus Ns (Network Security)
No ratings yet
Syllabus Ns (Network Security)
6 pages
Jamb Test Manual
No ratings yet
Jamb Test Manual
14 pages
Bimal Maruti Audit Report
No ratings yet
Bimal Maruti Audit Report
25 pages
Singleton Design Pattern in C#
No ratings yet
Singleton Design Pattern in C#
9 pages
Ayya Nadar Janaki Ammal College (Autonomous), Sivakasi
No ratings yet
Ayya Nadar Janaki Ammal College (Autonomous), Sivakasi
3 pages
CEC2010 RealParameterOptimization TechnicalReport
No ratings yet
CEC2010 RealParameterOptimization TechnicalReport
16 pages
References From The Reading
No ratings yet
References From The Reading
16 pages
SolarWinds CSFI Report
No ratings yet
SolarWinds CSFI Report
10 pages
Exam1 f12
No ratings yet
Exam1 f12
15 pages
Tourism Management: Ankita Sharma, Swati Sharma, Monica Chaudhary
No ratings yet
Tourism Management: Ankita Sharma, Swati Sharma, Monica Chaudhary
10 pages
Puter Literacy MS Power Point Q & A SR
No ratings yet
Puter Literacy MS Power Point Q & A SR
11 pages
de3eff8f6907b6b29ecc2014b615d71dd241738f80ca7be596d3983253f3d57d
No ratings yet
de3eff8f6907b6b29ecc2014b615d71dd241738f80ca7be596d3983253f3d57d
2 pages
A Neural Network Approach To Ordinal Regression
No ratings yet
A Neural Network Approach To Ordinal Regression
6 pages
In21 EN2853 ProgrammingAssignment2
No ratings yet
In21 EN2853 ProgrammingAssignment2
3 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet

Unit 3 - ML - CH-1

Uploaded by

Unit 3 - ML - CH-1

Uploaded by

Unit 3

Classification – Logistic Regression & Neural Network

Dr. ABHINANDAN P. SHIRAHATTI

Neural networks-neuron representation and model, hypothesis for neuron, cost

 Binomial: In binomial Logistic regression, there can be only two possible

 Multinomial: In multinomial Logistic regression, there can be 3 or more

 Ordinal: In ordinal Logistic regression, there can be 3 or more possible

The first column is used for setting the bias

 Step by step computation is given below:

Overfitting, Underfitting, and Regularization are three common concepts in machine

Under fitting in Machine Learning

Lambda λ is the regularization parameter that controls the degree of regularization.

encourages sparsity in the coefficients, meaning it can set some coefficients to

You might also like