Loss Functions
Neural networks use optimization strategies like stochastic gradient descent to minimize the error
in the algorithm. The way we actually compute this error is by using a loss function, which
quantifies how well or how badly the model is performing.
Loss functions can be classified into two major categories depending upon the type of learning
task we are dealing with: regression losses and classification losses.
In classification, we try to predict an output from a finite set of categorical values, e.g. given a
large dataset of images of handwritten digits, categorize each image as one of the digits 0–9.
Regression, on the other hand, deals with predicting a continuous value, e.g. given the floor
area, number of rooms, and size of rooms, predict the price of the house.
NOTE
n - Number of training examples.
i - Index of the ith training example in the data set.
y(i) - Ground truth label for ith training example.
y_hat(i) - Prediction for ith training example.
Regression Losses
1. Mean Square Error/Quadratic Loss/L2 Loss
Mathematical formulation:

MSE = \frac{1}{n}\sum_{i=1}^{n}\left(y^{(i)} - \hat{y}^{(i)}\right)^2

As the name suggests, mean square error is measured as the average of the squared differences
between predictions and actual observations.
It’s only concerned with the average magnitude of the error, irrespective of its direction. However,
due to the squaring, predictions which are far away from the actual values are penalized heavily in
comparison to less deviated predictions. In addition, the gradient of MSE is easy to calculate.
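As a quick sanity check on the formula, scikit-learn's mean_squared_error computes this directly (the toy values below are made up for illustration):
>>> from sklearn.metrics import mean_squared_error
>>> y_true = [3, -0.5, 2, 7]
>>> y_pred = [2.5, 0.0, 2, 8]
>>> mean_squared_error(y_true, y_pred)
0.375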
2. Mean Absolute Error/L1 Loss
Mean absolute error, on the other hand, is measured as the average of the sum of absolute differences
between predictions and actual observations. Like MSE, it also measures the magnitude of the error
without considering its direction.
MAE is more robust to outliers since it does not make use of the square.
Mathematical formulation:

MAE = \frac{1}{n}\sum_{i=1}^{n}\left|y^{(i)} - \hat{y}^{(i)}\right|
>>> from sklearn.metrics import mean_absolute_error
>>> y_true = [3, -0.5, 2, 7]
>>> y_pred = [2.5, 0.0, 2, 8]
>>> mean_absolute_error(y_true, y_pred)
0.5
MAE loss is useful if the training data is corrupted with outliers (i.e. we erroneously receive
unrealistically huge negative/positive values in our training environment, but not our testing
environment).
Deciding which loss function to use
If the outliers represent anomalies that are important for the business and should be detected, then we
should use MSE. On the other hand, if we believe that the outliers just represent corrupted data,
then we should choose MAE as the loss.
L1 loss is more robust to outliers, but its derivatives are not continuous, which makes finding the
solution less efficient. L2 loss is sensitive to outliers, but gives a more stable and closed-form solution.
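To make this concrete, here is a made-up comparison in which the last target value is an outlier:
>>> from sklearn.metrics import mean_squared_error, mean_absolute_error
>>> y_true = [1, 2, 3, 4, 100]
>>> y_pred = [1, 2, 3, 4, 5]
>>> mean_squared_error(y_true, y_pred)
1805.0
>>> mean_absolute_error(y_true, y_pred)
19.0
The single outlier dominates the MSE, while the MAE stays on the same scale as the typical error.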
3. Huber Loss
Mean square error (MSE) is great for learning outliers in the dataset; mean absolute
error (MAE), on the other hand, is good for ignoring them.
But in some cases, data points which look like outliers should not be ignored, yet those
points should not get high priority either. This is where Huber loss comes in.
Huber Loss = Combination of both MSE and MAE
Huber loss combines MSE and MAE: it is quadratic (MSE-like) when the error is small, and
linear (MAE-like) otherwise:

L_\delta(y, \hat{y}) = \begin{cases} \frac{1}{2}\left(y - \hat{y}\right)^2 & \text{for } |y - \hat{y}| \le \delta \\ \delta\left(|y - \hat{y}| - \frac{1}{2}\delta\right) & \text{otherwise} \end{cases}

Here delta is the hyperparameter that defines the boundary between the MAE and MSE regimes;
it can be tuned iteratively to make sure we find the correct delta value.
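A minimal NumPy sketch of Huber loss (the function name huber_loss and the default delta below are our own choices for illustration, not a standard library API):

import numpy as np

def huber_loss(y_true, y_pred, delta=1.0):
    # elementwise prediction error
    error = np.asarray(y_true) - np.asarray(y_pred)
    # quadratic (MSE-like) branch for small errors, linear (MAE-like) branch otherwise
    squared = 0.5 * error ** 2
    linear = delta * (np.abs(error) - 0.5 * delta)
    return np.mean(np.where(np.abs(error) <= delta, squared, linear))

>>> huber_loss([3, -0.5, 2, 7], [2.5, 0.0, 2, 8])
0.1875

With delta = 1.0 every error in this toy example falls in the quadratic region, so the result is simply half the MSE (0.375 / 2).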
Classification Losses
1. Cross-Entropy Loss
Cross-entropy measures the performance of a classification model whose output is a probability
value between 0 and 1. It calculates the average difference between the predicted and actual
probabilities.
Each predicted probability is compared to the actual class output value (0 or 1), and a score is
calculated that penalizes the probability based on its distance from the expected value. The
penalty is logarithmic: small differences (0.1 or 0.2) yield a small score, while large
differences (0.9 or 1.0) yield an enormous score.
This is the most common setting for classification problems. Cross-entropy loss increases as the
predicted probability diverges from the actual label.
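To see the logarithmic penalty concretely, here is a quick check with made-up predicted probabilities for a true label of 1:
>>> import numpy as np
>>> for p in [0.9, 0.6, 0.1]:
...     print(p, round(-np.log(p), 3))
...
0.9 0.105
0.6 0.511
0.1 2.303
A confident correct prediction (0.9) costs almost nothing, while a confident wrong one (0.1) is penalized heavily.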
Consider a 4-class classification task where an image is classified as either a dog, cat,
horse or cheetah.
Let us calculate the probability generated by the first logit after softmax is applied. For a
vector of logits z, softmax is

\text{softmax}(z_i) = \frac{e^{z_i}}{\sum_{j} e^{z_j}}

where e \approx 2.718 is Euler's number.
Softmax converts the logits into probabilities. The purpose of the cross-entropy is then to take
these output probabilities (P) and measure their distance from the truth values.
Cross-entropy is defined as

\text{CrossEntropy} = -\sum_{c=1}^{C} y_c \log(\hat{y}_c)

where C is the number of classes, y_c is 1 for the true class and 0 otherwise, and \hat{y}_c is
the predicted probability for class c.
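Putting softmax and cross-entropy together for the four-class example above (the logit values here are assumptions chosen for illustration, not taken from the original figure):

import numpy as np

# hypothetical logits for one image, ordered (dog, cat, horse, cheetah)
logits = np.array([2.0, 1.0, 0.1, -1.0])

# softmax converts the logits into probabilities P
probs = np.exp(logits) / np.sum(np.exp(logits))

# one-hot ground truth: the image is actually a dog
y_true = np.array([1.0, 0.0, 0.0, 0.0])

# cross-entropy measures the distance of P from the truth values
loss = -np.sum(y_true * np.log(probs))

print(probs)  # approximately [0.638 0.235 0.095 0.032]
print(loss)   # approximately 0.449, i.e. -log(0.638)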