Loss Functions
Dr. V. Sowmya,
Associate Professor,
Amrita School of Artificial Intelligence, Coimbatore,
Amrita Vishwa Vidyapeetham,
India.
27-01-2025.
Loss Functions
• During training, a loss function is used to optimize the model's parameters.
• It measures the difference between the model's predicted outputs and the expected (target) outputs.
• The objective of training is to minimize this difference, as the sketch below illustrates.
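As a concrete illustration (a minimal NumPy sketch; the data, initial weight, and learning rate are illustrative assumptions, not taken from the slides):

import numpy as np

# One gradient-descent step on a linear model y = w*x,
# using mean squared error as the loss.
x = np.array([1.0, 2.0, 3.0])
y = np.array([2.0, 4.0, 6.0])        # expected outputs
w = 0.5                              # initial parameter (assumed)

pred = w * x                         # predicted outputs
loss = np.mean((pred - y) ** 2)      # difference between predicted and expected
grad = np.mean(2 * (pred - y) * x)   # d(loss)/dw
w = w - 0.1 * grad                   # update the parameter to reduce the loss
print(loss, w)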
Loss Functions - Properties
Mean Squared Error (MSE) / L2 Loss
Properties:
• Non-negative.
• Sensitive to outliers: squaring the errors lets large residuals dominate the loss.
• Differentiable.
• Convex in the predictions (the overall training objective is non-convex in deep learning due to the multiple layers of non-linear activation functions).
• Usable both as a loss function and as a performance metric.
• Scale-dependent.
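A minimal NumPy sketch of MSE (array values are illustrative assumptions):

import numpy as np

def mse(y_true, y_pred):
    # Mean of squared differences; squaring makes large residuals
    # (e.g., from outliers) dominate the loss, and keeps the result
    # in the squared units of the target (hence scale-dependent).
    return np.mean((y_true - y_pred) ** 2)

print(mse(np.array([3.0, 5.0, 2.0]), np.array([2.5, 5.0, 4.0])))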
Mean Absolute Error (MAE) / L1 Loss
Properties:
• Non-negative.
• Robust to outliers: errors grow only linearly.
• Non-differentiable at zero.
• Convex in the predictions (non-convex overall in deep learning, as with MSE).
• Usable both as a loss function and as a performance metric.
• Scale-dependent.
Use Mean Absolute Percentage Error (MAPE) or Normalized Mean Absolute Error (NMAE) to compare models across different scales or units, as in the sketch below.
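A minimal NumPy sketch of MAE and MAPE (function names and behavior at zero targets are implementation assumptions):

import numpy as np

def mae(y_true, y_pred):
    # Mean of absolute differences; errors grow only linearly,
    # so outliers influence the loss less than in MSE.
    return np.mean(np.abs(y_true - y_pred))

def mape(y_true, y_pred):
    # Percentage error: scale-free, so it can compare models whose
    # targets have different units (requires non-zero y_true).
    return 100.0 * np.mean(np.abs((y_true - y_pred) / y_true))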
Huber Loss
Properties:
• Robust to outliers.
• Differentiable.
• Used in time series forecasting.
Set δ based on the error characteristics of the data: a small δ makes the loss behave like MAE (more robust to noise and outliers), while a large δ makes it behave like MSE (see the sketch below).
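A minimal NumPy sketch of the Huber loss (the default δ = 1.0 is an assumed example):

import numpy as np

def huber(y_true, y_pred, delta=1.0):
    # Quadratic for |error| <= delta, linear beyond it: small residuals
    # are treated like MSE, large ones like MAE.
    err = y_true - y_pred
    quadratic = 0.5 * err ** 2
    linear = delta * (np.abs(err) - 0.5 * delta)
    return np.mean(np.where(np.abs(err) <= delta, quadratic, linear))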
Log-Cosh Loss
Properties:
• Smooth and differentiable.
• Less sensitive to outliers than MSE.
• More sensitive to small errors than the Huber loss.
Huber Loss - use when we have a reason to define a specific point where the loss function should switch from quadratic to linear, depending on the noise characteristics of the data.
Log-Cosh Loss - use when we do not have a clear reason to manually set a transition threshold as in Huber loss.
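A minimal NumPy sketch of the log-cosh loss, written in a numerically stable form (an implementation choice, not from the slides):

import numpy as np

def log_cosh(y_true, y_pred):
    e = np.abs(y_pred - y_true)
    # log(cosh(e)) = |e| + log1p(exp(-2|e|)) - log(2); this form avoids
    # the overflow that a direct np.cosh(e) would hit for large errors.
    return np.mean(e + np.log1p(np.exp(-2.0 * e)) - np.log(2.0))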
Quantile Loss
Used for predicting a quantile (and hence a prediction interval) instead of a single point value.
The loss is scaled by q for underestimations and (1 − q) for overestimations, as in the sketch below.
When q = 0.5, the quantile loss is equivalent to the Mean Absolute Error (MAE) up to a constant factor, making it a generalization of MAE that allows asymmetric penalties for underestimations and overestimations.
Applications: Financial Risk Management, Supply Chain and Inventory Management, Energy Production, Economic Forecasting, Weather Forecasting, Real Estate Pricing, Healthcare.
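A minimal NumPy sketch of the quantile (pinball) loss (q = 0.9 is an assumed example):

import numpy as np

def quantile_loss(y_true, y_pred, q=0.9):
    # Underestimations (y_true > y_pred) are scaled by q,
    # overestimations by (1 - q); q = 0.5 recovers MAE up to a
    # constant factor of 1/2.
    diff = y_true - y_pred
    return np.mean(np.maximum(q * diff, (q - 1.0) * diff))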
Poisson Loss
Used when the target variable represents count data.
Applications: Traffic Modelling, Healthcare, Insurance, Customer Service, Internet Usage, Manufacturing, Crime Analysis.
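A minimal NumPy sketch of the Poisson loss (the constant log(y!) term is dropped, as is standard for optimization):

import numpy as np

def poisson_loss(y_true, y_pred):
    # Negative log-likelihood of a Poisson model for count targets;
    # y_pred must be a strictly positive predicted rate.
    return np.mean(y_pred - y_true * np.log(y_pred))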
Binary Cross Entropy (BCE) and Weighted BCE
Weighted BCE assigns a higher weight to the minority class, helping to balance the influence of each class on the training process (see the sketch below).
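A minimal NumPy sketch of weighted BCE (the class weights w_pos and w_neg are assumed values to be tuned per dataset):

import numpy as np

def weighted_bce(y_true, p, w_pos=5.0, w_neg=1.0):
    # Standard BCE with a larger weight on the positive (minority)
    # class, so its errors contribute more to the loss.
    p = np.clip(p, 1e-12, 1.0 - 1e-12)   # avoid log(0)
    return -np.mean(w_pos * y_true * np.log(p)
                    + w_neg * (1.0 - y_true) * np.log(1.0 - p))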
Categorical Cross Entropy (CCE)
Sparse Categorical Cross Entropy (SCCE)
SCCE computes the same loss as CCE but takes integer class indices as targets instead of one-hot vectors.
Cross-Entropy Loss with Label Smoothing
This technique has been shown to improve the generalization of models, particularly in scenarios with many categories or when the dataset contains noisy labels.
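A minimal NumPy sketch of CCE with label smoothing (the smoothing factor 0.1 is an assumed example):

import numpy as np

def cce_label_smoothing(y_onehot, p, smoothing=0.1):
    # Replace hard 0/1 targets with (1 - s)*y + s/K, so the model is
    # never pushed toward fully confident (and brittle) predictions.
    k = y_onehot.shape[-1]
    y_smooth = (1.0 - smoothing) * y_onehot + smoothing / k
    p = np.clip(p, 1e-12, 1.0)
    return -np.mean(np.sum(y_smooth * np.log(p), axis=-1))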
Negative Log Likelihood (NLL)
The negative log-probability assigned to the correct class; applied to softmax outputs, it coincides with cross-entropy.
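A minimal NumPy sketch of NLL (the array layout, one row of class probabilities per sample, is an assumption):

import numpy as np

def nll(targets, p):
    # Negative log-probability of the true class for each sample;
    # targets are integer class indices into each row of p.
    rows = np.arange(len(targets))
    return -np.mean(np.log(np.clip(p[rows, targets], 1e-12, 1.0)))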
Poly Loss
When ϵ = 0, Poly-1 reduces to the standard cross-entropy loss. When ϵ > 0, the loss function becomes more sensitive to confident predictions, reducing overfitting in imbalanced datasets or tasks requiring higher precision.
Useful when dealing with imbalanced datasets, and it simplifies the hyperparameter optimization process, since only ϵ needs tuning (see the sketch below).
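A minimal NumPy sketch of the Poly-1 loss (ϵ = 1.0 is an assumed default):

import numpy as np

def poly1_loss(y_onehot, p, epsilon=1.0):
    # Poly-1 = cross-entropy + epsilon * (1 - p_t), where p_t is the
    # probability assigned to the true class; epsilon = 0 recovers CE.
    p_t = np.clip(np.sum(y_onehot * p, axis=-1), 1e-12, 1.0)
    ce = -np.log(p_t)
    return np.mean(ce + epsilon * (1.0 - p_t))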
Hinge Loss
Used in maximum-margin classification tasks (e.g., support vector machines).
The product y · f(x) is the raw margin: it measures whether the prediction agrees in sign with the label and how far it lies from the decision boundary.
Squared Hinge Loss
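A minimal NumPy sketch of both hinge variants (labels are assumed to be in {-1, +1} and f_x to be raw model scores):

import numpy as np

def hinge(y, f_x):
    # Zero loss once the margin y*f(x) reaches 1; smaller margins
    # are penalized linearly.
    return np.mean(np.maximum(0.0, 1.0 - y * f_x))

def squared_hinge(y, f_x):
    # Squaring makes the loss differentiable at the margin boundary
    # and penalizes violations more strongly.
    return np.mean(np.maximum(0.0, 1.0 - y * f_x) ** 2)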