
LOSS FUNCTIONS

Dr. Umarani Jayaraman


What is a loss function?
 It provides a measure of how well the model is performing on the training data (and on validation data) with respect to its objective.
 In the context of deep learning and optimization, a loss function is a measure of how well a model's predictions match the true values (ground truth) of the dataset.
Various Loss Functions
• Regression Loss Functions
  • Mean Squared Error Loss
  • Mean Absolute Error Loss
  • Huber Loss
• Binary Classification Loss Functions
  • Binary Cross-Entropy
  • Hinge Loss
• Multi-class Classification Loss Functions
  • Multi-class Cross-Entropy Loss
  • Kullback–Leibler Divergence Loss
Loss Functions by Task
 1. Regression (continuous output)
   1. MSE (Mean Squared Error)
   2. MAE (Mean Absolute Error)
   3. Huber loss
 2. Classification
   1. Binary cross-entropy
   2. Categorical cross-entropy
 3. Auto-Encoder
   1. KL Divergence
 4. GAN
   1. Discriminator loss
   2. Minimax GAN loss
 5. Object detection
   1. Focal loss
 6. Word embeddings
   1. Triplet loss
Regression Loss
 Mean Squared Error / Squared loss / L2 loss
 Penalizes large errors more than small errors.
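Its standard definition, for n samples with true values y_i and predictions ŷ_i:

\[
\mathrm{MSE} = \frac{1}{n} \sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2
\]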
Regression Loss
 Mean Absolute Error / L1 loss
 Less sensitive to outliers compared to MSE.
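Its standard definition, with the same notation as above:

\[
\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} \left| y_i - \hat{y}_i \right|
\]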
Huber Loss
 Huber Loss: a combination of MSE and MAE, useful for handling outliers.
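In its standard form, with residual a = y − ŷ and a threshold δ, Huber loss is quadratic (MSE-like) for small errors and linear (MAE-like) for large ones:

\[
L_\delta(a) =
\begin{cases}
\frac{1}{2} a^2 & \text{if } |a| \le \delta \\
\delta \left( |a| - \frac{1}{2}\delta \right) & \text{otherwise}
\end{cases}
\]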
Understanding Sensitivity to Outliers
 Since MSE squares the error, even a single large error (an outlier) can significantly increase the total loss. This happens because squaring amplifies large values more than small ones.
 Example: Effect of an Outlier
 Consider a dataset with actual values (y) and two sets of predictions (ŷ):
 • One set has only normal (small) errors.
 • The other contains an outlier.
MSE Calculation
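The slide's original numbers are not preserved, so the following is a minimal sketch with illustrative values (y, y_hat_normal, and y_hat_outlier are assumptions) showing how a single outlier inflates MSE far more than MAE:

```python
import numpy as np

# Illustrative data (assumed values, not from the original slide)
y = np.array([10.0, 20.0, 30.0, 40.0, 50.0])              # actual values
y_hat_normal = np.array([12.0, 18.0, 33.0, 38.0, 52.0])   # small errors only
y_hat_outlier = np.array([12.0, 18.0, 33.0, 38.0, 90.0])  # last prediction is an outlier

def mse(y, y_hat):
    # Mean of squared errors: squaring amplifies large residuals
    return np.mean((y - y_hat) ** 2)

def mae(y, y_hat):
    # Mean of absolute errors: every unit of error counts equally
    return np.mean(np.abs(y - y_hat))

print("Normal  -> MSE:", mse(y, y_hat_normal), " MAE:", mae(y, y_hat_normal))
print("Outlier -> MSE:", mse(y, y_hat_outlier), " MAE:", mae(y, y_hat_outlier))
# MSE jumps from 5.0 to 324.2 (~65x), while MAE only rises from 2.2 to 9.8 (~4.5x):
# squaring lets the single outlier dominate the loss.
```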
A graph comparing MSE and MAE with outliers (figure):
 MSE (red curve) grows much faster for large errors because of squaring.
 MAE (blue curve) increases linearly, meaning it does not exaggerate large errors.
Why This Is a Problem
 MSE overreacts to outliers because the squared error makes big errors dominate the loss.
 The model may prioritize reducing a few large errors instead of improving overall performance.
Alternatives to Reduce Outlier Sensitivity
 Use MAE or Huber loss, which penalize large errors less aggressively.
Next: Classification…
Classification Loss: Binary Cross Entropy Loss
 Let us start by understanding the term 'entropy'.
 Generally, we use entropy to indicate disorder or uncertainty.
 For a random variable X with probability distribution p(X), it is measured as:

H(X) = − Σ_x p(x) log p(x)
Binary Cross Entropy Loss
 The negative sign is used to make the overall quantity positive (since log p(x) ≤ 0 whenever p(x) ≤ 1).
 A greater value of entropy for a probability distribution indicates greater uncertainty in the distribution.
 Likewise, a smaller value indicates a more certain distribution.
Binary Cross Entropy Loss
 This makes binary cross-entropy suitable as a loss function – you want to minimize its value.
 We use binary cross-entropy loss for classification models which output a probability p:
 • Probability that the element belongs to class 1 (the positive class) = p
 • Probability that the element belongs to class 0 (the negative class) = 1 − p
Binary Cross Entropy Loss
 Then, the cross-entropy loss for output label y (which can take the values 0 and 1) and predicted probability p is defined as:

L = − [y log(p) + (1−y) log(1−p)]
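A quick worked example (values chosen for illustration, using natural logs): for a positive example (y = 1) predicted with p = 0.9, the loss is −log(0.9) ≈ 0.105; if the same example is predicted with p = 0.1, the loss is −log(0.1) ≈ 2.303. Confident wrong predictions are penalized heavily.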


Binary Cross Entropy Loss
 This is also called Log-Loss.
 To calculate the probability p, we can use the sigmoid function, where z is a function of our input features:

p = σ(z) = 1 / (1 + e^(−z))
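A minimal sketch in NumPy (the function names are my own, not from the slides) tying the pieces together: sigmoid turns a raw score z into a probability, and binary cross-entropy scores that probability against the label:

```python
import numpy as np

def sigmoid(z):
    # Maps any real-valued score z into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def binary_cross_entropy(y, p, eps=1e-12):
    # -[y*log(p) + (1-y)*log(1-p)], with p clipped to avoid log(0)
    p = np.clip(p, eps, 1.0 - eps)
    return -(y * np.log(p) + (1 - y) * np.log(1 - p))

z = np.array([2.0, -1.0, 0.5])   # raw model scores (logits)
y = np.array([1.0, 0.0, 1.0])    # ground-truth labels
p = sigmoid(z)                   # predicted probabilities, e.g. [0.881 0.269 0.622]
print(binary_cross_entropy(y, p))         # per-example losses
print(binary_cross_entropy(y, p).mean())  # average loss over the batch
```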
Binary Cross Entropy Loss
 The sigmoid function outputs values in the open interval (0, 1), which makes it suitable for calculating a probability.
Categorical Cross Entropy Loss (Multi-class Cross-Entropy)
 Softmax converts logits into probabilities. The purpose of cross-entropy is to take the output probabilities (P) and measure the distance from the truth values, as shown in the sketch below.
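A minimal sketch (my own function names; the label is assumed to be one-hot) of softmax followed by categorical cross-entropy, CE = −Σ_k y_k log(p_k):

```python
import numpy as np

def softmax(logits):
    # Subtract the max for numerical stability before exponentiating
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()

def categorical_cross_entropy(y_true, p, eps=1e-12):
    # -sum_k y_k * log(p_k), where y_true is a one-hot vector
    return -np.sum(y_true * np.log(np.clip(p, eps, 1.0)))

logits = np.array([2.0, 1.0, 0.1])   # raw scores for 3 classes
y_true = np.array([1.0, 0.0, 0.0])   # one-hot ground truth (class 0)

p = softmax(logits)                  # e.g. [0.659 0.242 0.099]
print(categorical_cross_entropy(y_true, p))  # -log(0.659) ≈ 0.417
```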
Questions
 Why is cross-entropy used more commonly than MSE for classification problems?
Thank you
