Loss Functions

Uploaded by Gujuluva Karthik

SNS COLLEGE OF TECHNOLOGY

(An Autonomous Institution)


Coimbatore – 35.

DEPARTMENT OF BIOMEDICAL ENGINEERING

UNIT – 1

TRAINING THE NETWORK-LOSS FUNCTIONS

3 KEY LOSS FUNCTIONS


1. Mean Squared Error Loss Function
2. Cross-Entropy Loss Function
3. Mean Absolute Percentage Error

1. Mean Squared Error Loss Function

The mean squared error (MSE) loss function is the mean of the squared differences between the entries of the prediction vector y and the ground truth vector y_hat:

MSE = (1/N) * Σ (y_i − y_hat_i)²

You divide the sum of squared differences by N, which corresponds to the length of the vectors. If the output y of your neural network is a vector with multiple entries, then N is the number of vector entries, with y_i being one particular entry in the output vector.

The mean squared error loss function is the standard choice if you're dealing with a regression problem, that is, if you want your neural network to predict a continuous scalar value.
Examples of regression problems would be predictions of:

 the number of products needed in a supply chain.

 future real estate prices under certain market conditions.

 a stock value.
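As a minimal sketch of the formula above (the demand numbers are hypothetical, plain NumPy):

```python
import numpy as np

def mse_loss(y, y_hat):
    """Mean of the squared differences between prediction y and ground truth y_hat."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    return np.mean((y - y_hat) ** 2)

# Example: predicted vs. actual product demand for three products
y = np.array([100.0, 250.0, 80.0])      # predictions
y_hat = np.array([110.0, 240.0, 80.0])  # ground truth
print(mse_loss(y, y_hat))               # (100 + 100 + 0) / 3 ≈ 66.67
```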

2. Cross-Entropy Loss Function

Regression is only one of two areas where feedforward networks enjoy great
popularity. The other area is classification.

In classification tasks, we deal with predictions of probabilities, which means the output of a neural network must lie in a range between zero and one. A loss function that can measure the error between a predicted probability and the label which represents the actual class is called the cross-entropy loss function.

One important thing we need to discuss before continuing with the cross-
entropy is what exactly the ground truth vector looks like in the case of a
classification problem.
[Figure: One-hot-encoded vector (left) and prediction vector (right).]

The label vector y_hat is one-hot encoded, which means the values in this vector can only take discrete values of either zero or one. The entries in this vector represent different classes. All entries are zero except for a single entry, which is one; this entry tells us the class into which we want to classify the input feature vector x.

The prediction y, however, can take continuous values between zero and
one.
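For instance, for a hypothetical three-class problem where the true class is the second one:

```python
import numpy as np

# One-hot label: zero everywhere except at the index of the true class
num_classes = 3
true_class = 1
y_hat = np.eye(num_classes)[true_class]  # array([0., 1., 0.])

# A prediction vector, by contrast, holds continuous probabilities
y = np.array([0.1, 0.7, 0.2])            # entries between 0 and 1, summing to 1
```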

Given the prediction vector y and the ground truth vector y_hat, you can compute the cross-entropy loss between those two vectors as follows:

CE = − Σ y_hat_i * log(y_i)

First, we sum up the products between the entries of the label vector y_hat and the logarithms of the corresponding entries of the prediction vector y. Then we negate the sum to get a positive value of the loss function.
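This computation can be sketched as follows (the probabilities are hypothetical; a small eps clip is added to guard against log(0)):

```python
import numpy as np

def cross_entropy_loss(y, y_hat, eps=1e-12):
    """Cross-entropy between prediction vector y and one-hot label vector y_hat.

    Predictions are clipped to [eps, 1] so that log(0) never occurs.
    """
    y = np.clip(np.asarray(y, dtype=float), eps, 1.0)
    y_hat = np.asarray(y_hat, dtype=float)
    return -np.sum(y_hat * np.log(y))

# Three-class example: the true class is the second one
y_hat = np.array([0.0, 1.0, 0.0])    # one-hot ground truth
y = np.array([0.1, 0.7, 0.2])        # predicted probabilities
print(cross_entropy_loss(y, y_hat))  # = −log(0.7) ≈ 0.357
```

Because the label is one-hot, only the term for the true class survives the sum, so the loss reduces to the negative log of the probability assigned to the correct class.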

One interesting thing to consider is the plot of the cross-entropy loss function. In the following graph, you can see the value of the loss function (y-axis) vs. the predicted probability y_i, where y_i takes values between zero and one.

[Figure: Cross-entropy loss as a function of the predicted probability y_i.]

We can see clearly that the cross-entropy loss grows without bound as the predicted probability y_i approaches zero. For y_i = 0 the function becomes infinite, while for y_i = 1 the neural network predicts the correct class with full confidence and the loss value goes to zero.

3. Mean Absolute Percentage Error

Finally, we come to the Mean Absolute Percentage Error (MAPE) loss function. This loss function doesn't get much attention in deep learning; for the most part, we use it to measure the performance of a neural network during demand forecasting tasks.

First things first: what is demand forecasting?

Demand forecasting is the area of predictive analytics dedicated to predicting the expected demand for a good or service in the near future. For example:

 In retail, we can use demand forecasting models to determine the amount of a particular product that should be available and at what price.

 In industrial manufacturing, we can predict how much of each product should be produced, the amount of stock that should be available at various points in time, and when maintenance should be performed.

 In the travel and tourism industry, we can use demand forecasting models to set optimal price points for flights and hotels in light of available capacity, decide which destinations should be spotlighted, and choose what types of packages should be advertised.

Although demand forecasting is also a regression task, and minimizing the MSE loss function is an adequate training goal, MSE isn't a suitable metric for judging a model's performance on demand forecasting.

Why is that?

Well, imagine the MSE loss function gives you a value of 100. Can you tell if this is generally a good result? No, because it depends on the situation. If the prediction y of the model is 1000 and the actual ground truth label y_hat is 1010, then an MSE loss of 100 would in fact be a very small error, and the performance of the model would be quite good.

However, in the case where the prediction is five and the label is 15, you would have the same loss value of 100, but the relative deviation from the ground-truth value would be much higher than in the previous case.

This example shows the shortcoming of the mean squared error function as
the loss function for the demand forecasting models. For this reason, I
strongly recommend using mean absolute percentage error (MAPE).

The mean absolute percentage error, also known as mean absolute percentage deviation (MAPD), usually expresses accuracy as a percentage. We define it with the following equation:

MAPE = (100% / N) * Σ |y_i − y_hat_i| / |y_hat_i|

In this equation, y_i is the predicted value and y_hat_i is the label. We divide the absolute difference between y_i and y_hat_i by the actual value y_hat_i. Finally, multiplying by 100 percent gives us the percentage error.
Applying this equation to the example above gives you a more meaningful understanding of the model's performance. In the first case, the deviation from the ground truth label would be only about one percent (|1000 − 1010| / 1010 ≈ 1%), while in the second case it would be about 66 percent (|5 − 15| / 15 ≈ 66.7%).

We see that the performance of these two models is very different, while the MSE loss function would indicate that the performance of both models is the same.
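The two hypothetical cases above can be sketched directly (plain NumPy; both metrics defined as in the equations above):

```python
import numpy as np

def mse(y, y_hat):
    """Mean squared error."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    return np.mean((y - y_hat) ** 2)

def mape(y, y_hat):
    """Mean absolute percentage error, in percent."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    return 100.0 * np.mean(np.abs((y - y_hat) / y_hat))

# Case 1: prediction 1000, label 1010 -> MSE = 100, MAPE ≈ 0.99 %
print(mse([1000], [1010]), mape([1000], [1010]))
# Case 2: prediction 5, label 15     -> MSE = 100, MAPE ≈ 66.67 %
print(mse([5], [15]), mape([5], [15]))
```

The identical MSE values hide the fact that the second model is off by two thirds of the true value, which is exactly what MAPE exposes.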

Reference:

https://builtin.com/machine-learning/loss-functions
