Insem2 Scheme
SECOND SESSIONAL EXAMINATION APRIL 2023: SCHEME
SUBJECT: DEEP LEARNING (DSE 5251)
(18/04/2023)
Time: 10:30-11:30 AM MAX. MARKS: 15
1 Consider an RNN with a single hidden layer containing 50 hidden units. The input to the network is a
sequence of 100-dimensional vectors, and the output is a sequence of 10-dimensional vectors. If there are
5 time steps, compute the total number of parameters in the network, assuming no biases are used. (1 mark)
Answer:
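A minimal worked computation, assuming the standard (Elman) RNN parameterization with weights shared across all time steps:
Input-to-hidden weights: 100 × 50 = 5000
Hidden-to-hidden (recurrent) weights: 50 × 50 = 2500
Hidden-to-output weights: 50 × 10 = 500
Total = 5000 + 2500 + 500 = 8000 parameters. Since the weights are shared across all 5 time steps, the number of time steps does not change the count.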
2 Consider the task of Video Captioning using an encoder-decoder network, where the input is a video and
the output is the caption. State the equation(s) for the encoder part of this network. (1 mark)
Answer:
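A representative answer, assuming the per-frame feature vectors x_1, …, x_T of the video are fed to an RNN encoder (notation is illustrative):
h_t = RNN(h_{t-1}, x_t), e.g. h_t = tanh(W x_t + U h_{t-1} + b), for t = 1, …, T
The final encoder state h_T initializes the decoder (s_0 = h_T).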
3 How does the input gate in an LSTM regulate the flow of new information? (1 mark)
Answer:
An LSTM cell consists of three main components: the input gate, the forget gate, and the output gate.
These gates control the flow of information in and out of the memory cell, allowing LSTMs to effectively
capture long-term dependencies in sequential data.
The input gate in an LSTM determines how much of the new input should be stored in the cell state and
how much of the existing cell state should be preserved. It is typically implemented as a sigmoid
activation applied to a combination of the current input and the previous hidden state (output) of the
LSTM; the sigmoid squashes values to between 0 and 1, realizing the gating mechanism. The equation for
the input gate of the LSTM is:
i_t = σ(W_i x_t + U_i h_{t-1} + b_i)
The output of the input gate (i_t) acts as a filter for the new input. A value close to 0 means that
the gate is closed, and no new information is allowed into the cell state; a value close to 1
means that the gate is open, and all the new information is allowed to flow into the cell state.
The input gate activation is then element-wise multiplied with the candidate values, which are
derived from the current input and the previous hidden state. The candidate values represent the
information that can be added to the cell state. This multiplication allows the input gate to
selectively update the cell state, preserving only the relevant information.
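For completeness, one standard formulation of the candidate values and the cell-state update the paragraph describes (notation follows the input-gate equation above and is illustrative):
c̃_t = tanh(W_c x_t + U_c h_{t-1} + b_c)
c_t = f_t ⊙ c_{t-1} + i_t ⊙ c̃_t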
Scheme: Description of the input gate (0.5 marks) + brief explanation of how it regulates the
flow of information (0.5 marks)
4 Briefly explain the solution for the exploding gradient problem in RNNs. (1 mark)
Answer:
The solution for the exploding gradient problem in RNNs is gradient clipping, which involves
rescaling the gradients if they exceed a certain threshold. This prevents the gradients from
growing too large and helps stabilize the training process. Gradient clipping can be implemented
in the following ways:
I. Clipping by value: each component g_i of the gradient g is clamped element-wise:
   if g_i > max_threshold then g_i ← max_threshold
   if g_i < min_threshold then g_i ← min_threshold
   end if
II. Clipping by norm: the whole gradient vector is rescaled if its norm exceeds a threshold:
   if ‖g‖ > threshold then g ← threshold · g / ‖g‖
   end if
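A minimal PyTorch sketch of both forms of clipping; the model, shapes, and threshold values below are illustrative, and in practice only one of the two calls would be used per training step:

import torch
import torch.nn as nn

rnn = nn.RNN(input_size=10, hidden_size=20)   # illustrative model
out, _ = rnn(torch.randn(5, 3, 10))           # (seq_len, batch, features)
out.sum().backward()                          # populate .grad on the parameters

# I. Clipping by value: clamp every gradient component to [-0.5, 0.5]
torch.nn.utils.clip_grad_value_(rnn.parameters(), clip_value=0.5)

# II. Clipping by norm: rescale so the total gradient norm is at most 1.0
torch.nn.utils.clip_grad_norm_(rnn.parameters(), max_norm=1.0)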
5 A model applies an attention mechanism over the output feature map of a CNN, whose spatial size is 56 × 56. How many locations can the attention mechanism attend to? (1 mark)
Answer:
The attention mechanism typically operates on a spatial grid, attending to different locations
within the feature map. The number of locations corresponds to the spatial dimensions of the
feature map, which is 56 × 56 = 3136.
Scheme: Correct answer (1 mark)
6 Consider an RNN-based network which takes as input yesterday's and today's values to predict tomorrow's
value (the weights and biases are pretrained and mentioned in the diagram). Write down the
computations to predict tomorrow's value, given that yesterday's input was 0.2 and today's input
is 0.5. (2 marks)
Answer:
Unroll the RNN over the two time steps, using the pretrained weights and biases from the diagram
(w1: input weight, w2: recurrent weight, w3: output weight; b1, b2: biases):
Yesterday (x = 0.2): ReLU output h1 = ReLU(0.2 · w1 + b1)
Today (x = 0.5): ReLU output h2 = ReLU(0.5 · w1 + h1 · w2 + b1)
Predicted tomorrow's value: h2 · w3 + b2 = 0.693
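A minimal NumPy sketch of the unrolled computation; the weight and bias values below are placeholders, since the diagram's pretrained values are not reproduced here (the output equals 0.693 only with the actual pretrained values):

import numpy as np

def relu(z):
    return np.maximum(0.0, z)

# Placeholder parameters; the real values come from the diagram.
w1, w2, w3 = 1.0, 1.0, 1.0   # input, recurrent, output weights (hypothetical)
b1, b2 = 0.0, 0.0            # biases (hypothetical)

h1 = relu(0.2 * w1 + b1)             # ReLU output for yesterday (x = 0.2)
h2 = relu(0.5 * w1 + h1 * w2 + b1)   # ReLU output for today (x = 0.5)
y = h2 * w3 + b2                     # predicted tomorrow's value
print(y)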
Scheme:
Computing the ReLU output for today (0.5 marks)
Computing tomorrow's predicted value (1.5 marks)
7 Briefly explain how the issues of vanishing and exploding gradients arise in an RNN, using relevant
equations. (2 marks)
Answer:
The problem of vanishing and exploding gradients occurs during backpropagation through time
in RNNs, specifically while computing gradients. For instance, consider the following equation
for the gradient of the loss at time step t with respect to the recurrent weight W:
∂L_t/∂W = Σ_{k=1}^{t} (∂L_t/∂y_t)(∂y_t/∂h_t)(∂h_t/∂h_k)(∂h_k/∂W)
Let us analyze the term ∂h_t/∂h_k, as it is the most dependent on all the previous time steps:
∂h_t/∂h_k = Π_{i=k+1}^{t} ∂h_i/∂h_{i-1} --- (1)
We know that h_i = σ(W h_{i-1} + U x_i), where σ is the activation function. So:
∂h_i/∂h_{i-1} = diag(σ′(·)) W
As the activation functions and their derivatives are bounded functions, we can rewrite the
above as:
‖∂h_i/∂h_{i-1}‖ ≤ γ ‖W‖, where γ represents the bound on the derivative of the activation.
If λ = ‖W‖, then:
‖∂h_t/∂h_k‖ ≤ (γλ)^{t−k}
Hence, as t − k grows, the product in (1) shrinks exponentially when γλ < 1 (vanishing
gradients) and grows exponentially when γλ > 1 (exploding gradients).
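A quick numeric illustration of the bound, with assumed values γ = 1 and t − k = 50: if λ = 0.9, then (γλ)^{50} ≈ 0.005 and the gradient contribution all but vanishes; if λ = 1.1, then (γλ)^{50} ≈ 117 and it explodes.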
Scheme:
Briefly explaining the problem of vanishing and exploding gradients (0.5 marks)
The proof with equation (1.5 marks)
8 You have been given the task of designing a deep learning model which takes as input the names of
people in English and converts them to the closest corresponding letters in Hindi. With the help of a neat
block/architecture diagram and computations, explain the working of the encoder-decoder model for this
task. (3 marks)
Answer:
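A sketch of the computations such an answer would include, assuming a character-level RNN encoder-decoder; all symbols are illustrative:
Encoder (English characters x_1, …, x_T): h_t = RNN(h_{t-1}, x_t), with s_0 = h_T
Decoder (Hindi characters): s_t = RNN(s_{t-1}, e(ŷ_{t-1})), where e(·) embeds the previously generated character
Output distribution over the Hindi alphabet: P(y_t | y_{<t}, x) = softmax(V s_t + c)
Decoding starts from a <GO> token and stops when <STOP> is produced.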
Scheme:
Block diagram with all inputs, outputs, parameters marked (1 mark)
Computation with explanation (2 marks)
9 You have been given the task of designing a deep learning model for document classification. With the
help of a neat block/architecture diagram and computations, explain the working of the encoder-decoder
model with attention mechanism for this task. (3 marks)
Answer:
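A sketch of the attention computations such an answer would include, assuming an RNN encoder over the document's words and illustrative notation:
Encoder: h_j = RNN(h_{j-1}, x_j), for j = 1, …, T
Alignment scores: e_{jt} = v⊤ tanh(U_att s_{t-1} + W_att h_j)
Attention weights: α_{jt} = exp(e_{jt}) / Σ_k exp(e_{kt})
Context vector: c_t = Σ_j α_{jt} h_j
The context vector feeds the decoder state s_t, and the final state is passed through a softmax layer to produce the class probabilities.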
Scheme:
Block diagram (1.5 marks)
Computation (1.5 marks)