
Week – 5 (Deep Learning)

Q. 1) Explain the architecture of Feed Forward Neural Network or Multilayer Perceptron. (12 marks)

Ans: - Feed Forward Neural Networks, also known as Deep Feed Forward Networks or Multilayer Perceptrons, form the foundation of most deep learning models. For example, Convolutional and Recurrent Neural Networks (which are used extensively in computer vision applications) are based on these networks. Search engines, machine translation, and mobile applications all rely on deep learning technologies. Deep learning works by simulating the human brain's ability to identify and create patterns from various types of input. A feed forward neural network is a key component of this technology since it gives software developers the means to perform pattern recognition and classification, non-linear regression, and function approximation.

A feed forward neural network is a type of artificial neural network in which the connections between nodes do not form a loop. Often referred to as a multilayered network of neurons, feed forward neural networks are so named because all information flows in a forward direction only. The data enters at the input nodes, travels through the hidden layers, and eventually exits at the output nodes. The network has no links that would allow the information leaving the output nodes to be fed back into the network. The purpose of feed forward neural networks is to approximate functions.

Here’s how it works:

There is a classifier using the formula y = f*(x), which assigns an input x to a category y.

The feed forward network learns a mapping y = f(x; θ). It then learns the value of the parameters θ that gives the closest approximation of the function.

Fig: - Feed Forward Neural Network


A Feed Forward Neural Network’s Layers:

The following are the components of a feed forward neural network:

Input Layer:

It contains the neurons that receive the input. The data is subsequently passed on to the next layer. The total number of neurons in the input layer is equal to the number of variables (features) in the dataset.

Hidden Layer:

This is the intermediate layer, which is concealed between the input and output layers. This layer has a large number of neurons that apply transformations to the inputs. They then pass the result on to the output layer.

Output Layer:

It is the final layer, and its structure depends on how the model is constructed. The output layer represents the predicted feature, since the desired outcome is known during training.

Neuron weights:

Weights describe the strength of a connection between neurons. They are typically initialized to small random values (often between 0 and 1) and are adjusted as the network trains.
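A minimal sketch of a forward pass through such a network is shown below. The layer sizes, the use of NumPy, and the choice of a sigmoid activation are illustrative assumptions, not part of the description above.

```python
import numpy as np

def sigmoid(z):
    # Squashes each value into the range (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative layer sizes: 3 input features, 4 hidden neurons, 1 output
rng = np.random.default_rng(0)
W1 = rng.normal(size=(3, 4))   # input -> hidden weights
b1 = np.zeros(4)
W2 = rng.normal(size=(4, 1))   # hidden -> output weights
b2 = np.zeros(1)

def forward(x):
    # Information flows strictly forward: input -> hidden -> output, no loops
    h = sigmoid(x @ W1 + b1)   # hidden layer activations
    y = sigmoid(h @ W2 + b2)   # output layer prediction
    return y

x = np.array([0.5, -1.2, 3.0])  # one example with 3 input variables
print(forward(x))
```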
Q. 2) What is Backpropagation and how does the Backpropagation algorithm work? (6 marks)

Ans: - Backpropagation is the essence of neural network training. It is the method of fine-tuning the weights of a neural network based on the error rate obtained in the previous epoch (i.e., iteration). Proper tuning of the weights allows you to reduce error rates and make the model reliable by improving its generalization.

Backpropagation is short for “backward propagation of errors”. It is a standard method of training artificial neural networks. This method calculates the gradient of a loss function with respect to all the weights in the network.

The backpropagation algorithm computes the gradient of the loss function with respect to a single weight by the chain rule. It efficiently computes the gradients one layer at a time, unlike a naive direct computation. It computes the gradient, but it does not define how the gradient is used. It generalizes the computation in the delta rule.

Consider the following backpropagation neural network example diagram to understand how the algorithm works:

Fig: - Working of Backpropagation Algorithm

1. Inputs X arrive through the preconnected path.
2. The input is modelled using real weights W. The weights are usually selected randomly.
3. Calculate the output for every neuron from the input layer, through the hidden layers, to the output layer.
4. Calculate the error in the outputs:

   Error = Actual Output – Desired Output

5. Travel back from the output layer to the hidden layers to adjust the weights so that the error is decreased.
6. Keep repeating the process until the desired output is achieved.
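A minimal sketch of these steps for a single hidden-layer network with sigmoid activations and squared error is given below. The network shape, learning rate, and single training example are assumptions made purely for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(1)
W1, b1 = rng.normal(size=(2, 3)), np.zeros(3)   # step 2: random weights
W2, b2 = rng.normal(size=(3, 1)), np.zeros(1)
lr = 0.5

x = np.array([[0.1, 0.7]])      # step 1: inputs X
target = np.array([[1.0]])      # desired output

for epoch in range(1000):
    # Step 3: forward pass, layer by layer
    h = sigmoid(x @ W1 + b1)
    out = sigmoid(h @ W2 + b2)

    # Step 4: error at the output
    error = out - target

    # Step 5: propagate the error backwards and adjust the weights
    d_out = error * out * (1 - out)          # gradient at the output layer
    d_h = (d_out @ W2.T) * h * (1 - h)       # gradient at the hidden layer
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * x.T @ d_h
    b1 -= lr * d_h.sum(axis=0)

print(out)  # step 6: after enough iterations, close to the desired output
```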


Q.3) What is Loss Function? Explain types of loss function. (6 marks)

Ans: - At its core, a loss function is incredibly simple: it’s a method of evaluating how well your algorithm models your dataset. If your predictions are totally off, your loss function will output a higher number. If they’re pretty good, it’ll output a lower number. As you change pieces of your algorithm to try to improve your model, your loss function will tell you whether you’re getting anywhere.

Types of loss functions: -

A few of the most popular loss functions currently being used, from simple to more
complex are: -

1. Mean square error:

Mean squared error (MSE) is the workhorse of basic loss functions; it’s easy to
understand and implement and generally works pretty well. To calculate MSE, you
take the difference between your predictions and the ground truth, square it, and
average it out across the whole dataset.
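A minimal NumPy sketch of this calculation is shown below; the prediction and target values are made up for illustration.

```python
import numpy as np

def mse(predictions, targets):
    # Average of the squared differences between predictions and ground truth
    return np.mean((predictions - targets) ** 2)

preds = np.array([2.5, 0.0, 2.1, 7.8])
truth = np.array([3.0, -0.5, 2.0, 7.5])
print(mse(preds, truth))  # 0.15
```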

2. Likelihood loss:

The likelihood function is also relatively simple, and is commonly used in


classification problems. The function takes the predicted probability for each input
example and multiplies them. And although the output isn’t exactly human-
interpretable, it’s useful for comparing models.

For example, consider a model that outputs probabilities of [0.4, 0.6, 0.9, 0.1] for the
ground truth labels of [0, 1, 1, 0]. The likelihood loss would be computed as

(0.6) * (0.6) * (0.9) * (0.9) = 0.2916.

Since the model outputs probabilities for TRUE (or 1) only, when the ground truth
label is 0 we take (1-p) as the probability. In other words, we multiply the model’s
outputted probabilities together for the actual outcomes.
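A small sketch reproducing the worked example above:

```python
import numpy as np

def likelihood(probs, labels):
    # Use p when the true label is 1 and (1 - p) when it is 0, then multiply
    per_example = np.where(labels == 1, probs, 1 - probs)
    return np.prod(per_example)

probs = np.array([0.4, 0.6, 0.9, 0.1])
labels = np.array([0, 1, 1, 0])
print(likelihood(probs, labels))  # 0.6 * 0.6 * 0.9 * 0.9 = 0.2916
```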

3. Log loss (Cross Entropy Loss):

Log loss is a loss function also used frequently in classification problems, and is one of the most popular measures for Kaggle competitions. It is a straightforward modification of the likelihood function with logarithms.

For a single example with true label y and predicted probability p, the binary log loss is −[y · log(p) + (1 − y) · log(1 − p)]. This is essentially the likelihood function with logarithms added in. You can see that when the actual class is 1, the second half of the expression disappears, and when the actual class is 0, the first half drops. That way, we end up with the log of the predicted probability for the ground truth class.

The cool thing about the log loss function is that it has a kick: it penalizes heavily for being very confident and very wrong. When the true label is 1, the loss skyrockets as the predicted probability for that label approaches 0.
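A minimal sketch of binary log loss, using the same probabilities as the likelihood example above:

```python
import numpy as np

def log_loss(probs, labels, eps=1e-15):
    # Clip to avoid log(0); averages -[y*log(p) + (1-y)*log(1-p)] over examples
    p = np.clip(probs, eps, 1 - eps)
    return -np.mean(labels * np.log(p) + (1 - labels) * np.log(1 - p))

probs = np.array([0.4, 0.6, 0.9, 0.1])
labels = np.array([0, 1, 1, 0])
print(log_loss(probs, labels))                     # moderate loss for decent predictions
print(log_loss(np.array([0.99]), np.array([0])))   # ~4.6: very confident and very wrong
```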

Q. 4) What is Gradient descent? Explain the types of Gradient descent. (3 marks)

Ans: - Gradient descent is an optimization algorithm commonly used to train machine learning models and neural networks. Training data helps these models learn over time, and the cost function within gradient descent acts as a barometer, gauging the model's accuracy with each iteration of parameter updates. Until the cost function is close to or equal to zero, the model will continue to adjust its parameters to yield the smallest possible error.

Types of Gradient Descent: -

1. Batch gradient descent:

Batch gradient descent sums the error for each point in the training set and updates the model only after all training examples have been evaluated. This process is referred to as a training epoch. While this batching provides computational efficiency, it can still have a long processing time for large training datasets, as it needs to keep all of the data in memory. Batch gradient descent also usually produces a stable error gradient and convergence, but sometimes that convergence point isn’t ideal, finding a local minimum rather than the global one.
2. Stochastic gradient descent:

Stochastic gradient descent (SGD) runs a training epoch for each example within the dataset, updating the parameters one training example at a time. Since only one training example needs to be held at a time, it is easier to store in memory. While these frequent updates can offer more detail and speed, they can reduce computational efficiency compared to batch gradient descent. The frequent updates also result in noisy gradients, but this noise can be helpful in escaping a local minimum and finding the global one.

3. Mini-batch gradient descent:

Mini-batch gradient descent combines concepts from both batch gradient descent and stochastic gradient descent. It splits the training dataset into small batches and performs an update on each of those batches. This approach strikes a balance between the computational efficiency of batch gradient descent and the speed of stochastic gradient descent.
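The sketch below contrasts the three variants for a simple one-parameter linear regression problem; the data, learning rate, and batch size are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(size=(100, 1))
y = 3.0 * X[:, 0] + rng.normal(scale=0.1, size=100)  # true slope is 3
w, lr = 0.0, 0.1

def gradient(w, X_batch, y_batch):
    # Gradient of mean squared error for a one-parameter linear model
    preds = w * X_batch[:, 0]
    return np.mean(2 * (preds - y_batch) * X_batch[:, 0])

for epoch in range(50):
    # Batch: one update per epoch using every example
    # w -= lr * gradient(w, X, y)

    # Stochastic: one update per individual example
    # for i in range(len(X)):
    #     w -= lr * gradient(w, X[i:i+1], y[i:i+1])

    # Mini-batch: update on small slices, e.g. batches of 10 examples
    for start in range(0, len(X), 10):
        w -= lr * gradient(w, X[start:start+10], y[start:start+10])

print(w)  # converges towards the true slope of 3
```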

Q. 5) Why is the Sigmoid function important in neural networks? (3 marks)

Ans: - If we use a linear activation function in a neural network, the model can only learn linearly separable problems. However, with the addition of just one hidden layer and a sigmoid activation function in the hidden layer, the neural network can easily learn a non-linearly separable problem. Using a non-linear activation produces non-linear decision boundaries, and hence the sigmoid function can be used in neural networks for learning complex decision functions. Traditionally, an activation function is also expected to be monotonically increasing, which is why functions such as sin(x) or cos(x) are generally not used as activation functions. In addition, the activation function should be defined everywhere, continuous everywhere in the space of real numbers, and differentiable over the entire space of real numbers.

Typically, the backpropagation algorithm uses gradient descent to learn the weights of a neural network, and deriving this algorithm requires the derivative of the activation function. The fact that the sigmoid function is monotonic, continuous, and differentiable everywhere, coupled with the property that its derivative can be expressed in terms of itself, σ'(x) = σ(x)(1 − σ(x)), makes it easy to derive the update equations for learning the weights of a neural network with the backpropagation algorithm.
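A short sketch showing the sigmoid function and the property that its derivative can be written in terms of the function itself:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_derivative(x):
    # The derivative is expressed in terms of the sigmoid itself: s * (1 - s)
    s = sigmoid(x)
    return s * (1 - s)

xs = np.linspace(-5, 5, 5)
print(sigmoid(xs))             # smooth, monotonically increasing values in (0, 1)
print(sigmoid_derivative(xs))  # largest near x = 0, vanishes for large |x|
```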
