DL - Assignment 9 Solution
Deep Learning
Assignment- Week 9
TYPE OF QUESTION: MCQ/MSQ
Number of questions: 10 Total marks: 10 × 1 = 10
______________________________________________________________________________
QUESTION 1:
For the following Figure A and Figure B of the loss landscape, choose the correct statement.
[Figure A and Figure B: loss plots]
a. Figure A has a small learning rate, Figure B has a high learning rate
b. Figure A has a high learning rate, Figure B has a small learning rate
c. Figure A and Figure B have different loss functions
d. None of the above
Correct Answer: a
Detailed Solution:
Figure A has a small learning rate, which is evident from the slow convergence before reaching the
optimal valley point. Figure B shows highly fluctuating weight updates and therefore has a high
learning rate. (Figures taken from the book Dive into Deep Learning.)
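As a rough illustration of the two regimes (the toy function and the learning rates below are chosen only for this sketch and are not taken from the figures), the loop runs gradient descent on f(x) = x²: the small learning rate converges slowly and monotonically, while the large one produces fluctuating iterates.

import numpy as np

def gradient_descent(lr, steps=10, x0=5.0):
    # Gradient descent on f(x) = x^2, whose gradient is 2x; returns the iterates.
    x = x0
    path = [x]
    for _ in range(steps):
        x = x - lr * 2 * x          # update: x <- x - lr * f'(x)
        path.append(x)
    return np.array(path)

print(gradient_descent(lr=0.05))    # small learning rate: slow, monotone approach to 0 (Figure A regime)
print(gradient_descent(lr=0.95))    # large learning rate: iterates keep flipping sign (Figure B regime)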
____________________________________________________________________________
QUESTION 2:
Which of the following problems is primarily solved by the residual connection in ResNet?
Correct Answer: a
Detailed Solution:
Residual (skip) connections add an identity path around a block, so gradients can flow directly to
earlier layers. This primarily addresses the vanishing-gradient/degradation problem that makes very
deep plain networks hard to train.
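A minimal NumPy sketch of the idea (the two-layer transform F and all sizes below are made up for illustration): the block outputs F(x) + x, so an identity path always exists for the signal, and during backpropagation the gradient can likewise flow straight through the skip connection.

import numpy as np

def residual_block(x, W1, W2):
    # Sketch of a residual block: y = F(x) + x, where F is a small two-layer transform.
    h = np.maximum(0, x @ W1)       # ReLU(x W1)
    fx = h @ W2                     # F(x)
    return fx + x                   # the skip connection adds the input back

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
W1 = rng.normal(size=(8, 8)) * 0.01   # near-zero weights, so F(x) is almost zero
W2 = rng.normal(size=(8, 8)) * 0.01
y = residual_block(x, W1, W2)
print(np.allclose(y, x, atol=1e-2))   # True: the block falls back to the identity mapping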
____________________________________________________________________________
QUESTION 3:
The following is the equation of the update vector for the momentum optimizer. Which of the
following is true for γ?
V_t = γ V_{t−1} + η ∇_θ J(θ)
a. 𝛾 is the momentum term which indicates how much acceleration you want
b. 𝛾 is the step size
c. 𝛾 is the first order moment
d. 𝛾 is the second order moment
Correct Answer: a
Detailed Solution:
A fraction of the update vector from the past time step is added to the current update vector.
γ is that fraction; it indicates how much acceleration you want, and its value lies between 0 and 1.
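A minimal NumPy sketch of this update rule (the quadratic objective and the values of γ and η are illustrative only):

import numpy as np

def momentum_step(theta, v, grad, gamma=0.9, eta=0.1):
    # One momentum update: v_t = gamma * v_{t-1} + eta * grad, then theta <- theta - v_t.
    v = gamma * v + eta * grad
    return theta - v, v

theta = np.array([5.0])
v = np.zeros_like(theta)
for _ in range(5):
    grad = 2 * theta                 # gradient of J(theta) = theta^2
    theta, v = momentum_step(theta, v, grad)
    print(theta, v)                  # the velocity builds up while the gradient keeps the same sign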
____________________________________________________________________________
QUESTION 4:
Choose the correct option
Statement 1: Stochastic gradient descent is less prone to getting stuck in local minima because
of inherent noise due to minibatch sampling.
Statement 2: Large learning rates with an annealing schedule can be used with a higher mini-batch
size.
a. Statement 1 is True, Statement 2 is True
b. Statement 1 is False, Statement 2 is True
c. Statement 1 is True, Statement 2 is False
d. Statement 1 is False, Statement 2 is False
Correct Answer: a
Detailed Solution:
Stochastic gradient descent does not consider the whole dataset for an update and thus has noisier
updates; because of this noise, the updates are likely to escape shallow local minima. With a higher
mini-batch size, the noise in SGD goes down, so a larger learning rate combined with an annealing
schedule can be used.
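A small sketch of why this holds (toy linear-regression data, all values illustrative): the spread of mini-batch gradients around the full-batch gradient shrinks as the batch size grows, which is what makes a larger, annealed learning rate safe.

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 5))
w_true = rng.normal(size=5)
y = X @ w_true + 0.1 * rng.normal(size=10_000)
w = np.zeros(5)

def batch_grad(Xb, yb, w):
    # Mean-squared-error gradient on a batch.
    return 2 * Xb.T @ (Xb @ w - yb) / len(yb)

full = batch_grad(X, y, w)
for bs in (16, 256, 4096):
    devs = []
    for _ in range(200):
        idx = rng.choice(len(X), size=bs, replace=False)
        devs.append(np.linalg.norm(batch_grad(X[idx], y[idx], w) - full))
    print(bs, np.mean(devs))         # the deviation shrinks roughly like 1 / sqrt(batch size)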
____________________________________________________________________________
QUESTION 5:
Which of the following is the simplest optimizer, in terms of computational requirements, to deal
with oscillations and saddle points?
Correct Answer: b
Detailed Solution:
Mini-batch gradient descent makes a parameter update after seeing just a subset of examples, so
the direction of the update has some variance and the path taken toward convergence will
"oscillate". Using momentum reduces these oscillations and helps move past saddle points, where
gradients vanish, at very little extra computational cost (only one extra velocity vector is stored).
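A toy demonstration of the oscillation damping (the quadratic objective and all constants are chosen only for illustration): on an ill-conditioned bowl, the steep coordinate keeps oscillating under the plain update but is damped once momentum is added.

import numpy as np

def steep_axis_amplitude(gamma, lr=0.02, steps=50):
    # Minimize f(x, y) = x^2 + 50*y^2 with (momentum) gradient descent; gamma = 0 is the plain update.
    # Returns the largest |y| over the last 10 steps, i.e. how much the steep coordinate still oscillates.
    theta = np.array([5.0, 1.0])
    v = np.zeros(2)
    ys = []
    for _ in range(steps):
        grad = np.array([2.0 * theta[0], 100.0 * theta[1]])
        v = gamma * v + lr * grad
        theta = theta - v
        ys.append(abs(theta[1]))
    return max(ys[-10:])

print(steep_axis_amplitude(gamma=0.0))   # plain update: the steep axis keeps oscillating at full amplitude
print(steep_axis_amplitude(gamma=0.9))   # with momentum: the oscillation is strongly damped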
____________________________________________________________________________
QUESTION 6:
Given the following three figures A, B and C, choose the correct option:
[Figure A, Figure B and Figure C: see figures from Dive into Deep Learning]
Correct Answer: b
Detailed Solution:
RMSProp/AdaGrad show less oscillation along the steep slopes of the contour lines. A low value of
momentum makes the optimizer converge with a high degree of oscillation, whereas a high value of
momentum dampens the oscillation in high-gradient regions. (Figures taken from the book Dive
into Deep Learning.)
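A rough sketch of the adaptive-step behaviour (toy quadratic, constants illustrative; an RMSProp-style update is used here as one representative of the AdaGrad family): dividing each coordinate's gradient by a running root-mean-square of its past gradients shrinks the effective step along the steep direction, so the trajectory oscillates far less.

import numpy as np

def descend(adaptive, steps=15, lr=0.1, rho=0.9, eps=1e-8):
    # Minimize f(x, y) = x^2 + 50*y^2; if adaptive, use an RMSProp-style per-parameter step size.
    theta = np.array([5.0, 1.0])
    s = np.zeros(2)
    for _ in range(steps):
        grad = np.array([2.0 * theta[0], 100.0 * theta[1]])
        if adaptive:
            s = rho * s + (1 - rho) * grad ** 2              # running average of squared gradients
            theta = theta - lr * grad / (np.sqrt(s) + eps)   # the steep coordinate gets a smaller effective step
        else:
            theta = theta - lr * grad
    return theta

print(descend(adaptive=False))   # plain gradient descent at this learning rate: the steep (y) coordinate blows up
print(descend(adaptive=True))    # adaptive scaling: both coordinates move steadily towards (0, 0)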
______________________________________________________________________________
QUESTION 7:
For a function f(θ0,θ1), if θ0 and θ1 are initialized at a global minimum, then what should be the
values of θ0 and θ1 after a single iteration of gradient descent?
Correct Answer: b
Detailed Solution:
At a minimum (global or local), the gradient is zero, so a gradient descent step does not change the
parameters.
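A one-line check (using the toy function f(θ0, θ1) = θ0² + θ1², whose global minimum is the origin; the learning rate is arbitrary):

import numpy as np

theta = np.array([0.0, 0.0])      # initialized at the global minimum of f = theta0^2 + theta1^2
grad = 2 * theta                  # the gradient is zero at the minimum
theta_next = theta - 0.1 * grad   # one gradient descent step with eta = 0.1
print(theta_next)                 # [0. 0.] -> the parameters do not change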
______________________________________________________________________________
QUESTION 8:
What can be one of the practical problems of exploding gradient?
a. Too large update of weight values leading to unstable network
b. Too small update of weight values inhibiting the network to learn
c. Too large update of weight values leading to faster convergence
Correct Answer: a
Detailed Solution:
Exploding gradients are a problem where large error gradients accumulate and result in very
large updates to the neural network's weights during training. This makes the model unstable and
unable to learn from the training data.
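A small numerical illustration with made-up values: at the same learning rate, an exploded gradient moves the weight far outside any reasonable range, while clipping the gradient norm (a common practical remedy) keeps the update bounded.

import numpy as np

eta = 0.01
w = np.array([0.5])
normal_grad = np.array([2.0])
exploded_grad = np.array([2.0e6])        # a gradient blown up by repeated multiplication through deep layers

print(w - eta * normal_grad)             # [0.48]     -> a sensible, small update
print(w - eta * exploded_grad)           # [-19999.5] -> the weight jumps to an extreme value; training destabilizes

clipped = exploded_grad * min(1.0, 5.0 / np.linalg.norm(exploded_grad))   # clip the gradient norm to 5
print(w - eta * clipped)                 # [0.45]     -> the update stays bounded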
____________________________________________________________________________
QUESTION 9:
Two versions of SGD are implemented as follows:
SGD1: samples data points in the same order in every epoch while constructing minibatches
SGD2: samples data points in a random order in every epoch to construct minibatches
Correct Answer: b
Detailed Solution:
The stochasticity of gradient descent adds noise, which makes it less likely to be attracted
towards local minima. The deterministic variant is more likely to get trapped, as it follows the
same sequence of gradient updates in every epoch.
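A minimal sketch of the two sampling schemes (the helper name and sizes are hypothetical, NumPy only): SGD1 walks the data in a fixed order every epoch, whereas SGD2 reshuffles the indices at the start of each epoch, which is what injects fresh stochasticity into the gradient sequence.

import numpy as np

def minibatch_indices(n, batch_size, shuffle, rng):
    # Yield the mini-batch index arrays for one epoch.
    order = rng.permutation(n) if shuffle else np.arange(n)
    for start in range(0, n, batch_size):
        yield order[start:start + batch_size]

rng = np.random.default_rng(0)
n, batch_size = 8, 4

for epoch in range(2):   # SGD1: identical batches every epoch (a deterministic gradient sequence)
    print("SGD1 epoch", epoch, [b.tolist() for b in minibatch_indices(n, batch_size, False, rng)])

for epoch in range(2):   # SGD2: batches drawn from a fresh permutation each epoch (noisier updates)
    print("SGD2 epoch", epoch, [b.tolist() for b in minibatch_indices(n, batch_size, True, rng)])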
______________________________________________________________________________
QUESTION 10:
Choose the correct statement with regard to GoogLeNet.
a. Multiple auxiliary classifiers are used at different depth levels to avoid the vanishing
gradient problem
b. Bottleneck layers reduce the number of learnable weights
c. The inception module captures information of the image at varying resolutions
d. All of the above
Correct Answer: d
Detailed Solution:
Auxiliary classifiers attached at intermediate depths inject additional gradient signal and help
mitigate the vanishing gradient problem; 1×1 bottleneck layers reduce the number of channels and
hence the number of learnable weights; and the inception module applies filters of several sizes in
parallel, capturing information at varying resolutions. Hence all three statements are correct.
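A quick parameter count (the channel numbers are made up for illustration) of the bottleneck idea in statement (b): a 1×1 convolution first reduces the number of channels, so the following 3×3 convolution needs far fewer learnable weights.

in_ch, mid_ch, out_ch = 256, 64, 256             # assumed channel counts, bias terms ignored
direct = 3 * 3 * in_ch * out_ch                  # a 3x3 conv applied directly: 589,824 weights
bottleneck = 1 * 1 * in_ch * mid_ch + 3 * 3 * mid_ch * out_ch   # 1x1 reduction then 3x3: 163,840 weights
print(direct, bottleneck)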
____________________________________________________________________________
************END*******