HW4 Supplement Quiz
NAME: Shruthi
Date: 02/18/2022
1. Suppose your training examples are sentences (sequences of words). Which of the following refers to the j-th word in the i-th training example?
x(i)<j>
x<i>(j)
x(j)<i>
x<j>(i)
2. Consider this RNN: [architecture diagram omitted]. This specific type of architecture is appropriate when:
Tx=Ty
Tx<Ty
Tx>Ty
Tx=1
3. To which of these tasks would you apply a many-to-one RNN architecture? (Check all that
apply).
Speech recognition (input an audio clip and output a transcript)
Sentiment classification (input a piece of text and output a 0/1 to denote positive or negative
sentiment)
Image classification (input an image and output a label)
Gender recognition from speech (input an audio clip and output a label indicating the speaker’s
gender)
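For reference, a many-to-one architecture reads an entire input sequence but produces a single output from the final hidden state, which is what the sentiment and gender-recognition tasks above need. Below is a minimal numpy sketch; the function name, weight shapes, and example dimensions are illustrative choices, not part of the quiz.

```python
import numpy as np

def many_to_one_rnn(x_seq, Waa, Wax, Wya, ba, by):
    """Run a vanilla RNN over a whole sequence, emitting one output at the end.

    x_seq holds the inputs x<1>, ..., x<Tx>; only the final hidden
    state a<Tx> is used to produce the single label y.
    """
    a = np.zeros((Waa.shape[0], 1))            # a<0>, the initial hidden state
    for x_t in x_seq:                          # one step per input x<t>
        a = np.tanh(Waa @ a + Wax @ x_t + ba)  # a<t> from a<t-1> and x<t>
    z = Wya @ a + by                           # read out only once, at t = Tx
    return 1 / (1 + np.exp(-z))                # sigmoid for a 0/1 label

# Example: Tx = 10 steps of 3-dim inputs, 5 hidden units, one scalar output.
rng = np.random.default_rng(0)
x_seq = [rng.standard_normal((3, 1)) for _ in range(10)]
Waa, Wax = rng.standard_normal((5, 5)), rng.standard_normal((5, 3))
Wya, ba, by = rng.standard_normal((1, 5)), np.zeros((5, 1)), np.zeros((1, 1))
print(many_to_one_rnn(x_seq, Waa, Wax, Wya, ba, by))  # one value in (0, 1)
```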
4. You are training this RNN language model. At the t-th time step, what is the RNN doing? Choose the best answer.
Estimating P(y<1>,y<2>,…,y<t−1>)
Estimating P(y<t>)
Estimating P(y<t>∣y<1>,y<2>,…,y<t−1>)
Estimating P(y<t>∣y<1>,y<2>,…,y<t>)
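For context (a standard identity, not part of the quiz): a language model factorizes the joint probability of a sentence by the chain rule,
P(y<1>,…,y<Ty>) = P(y<1>) · P(y<2>∣y<1>) · … · P(y<Ty>∣y<1>,…,y<Ty−1>),
so at time step t the network's softmax is estimating exactly one of these conditional factors.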
5. You have finished training a language model RNN and are using it to sample random sentences, as follows: [sampling diagram omitted]
What are you doing at each time step t?
(i) Use the probabilities output by the RNN to pick the highest probability word for that time-step
as y^<t>. (ii) Then pass the ground-truth word from the training set to the next time-step.
(i) Use the probabilities output by the RNN to randomly sample a chosen word for that time-step
as y^<t>. (ii) Then pass the ground-truth word from the training set to the next time-step.
(i) Use the probabilities output by the RNN to pick the highest probability word for that time-step
as y^<t>. (ii) Then pass this selected word to the next time-step.
(i) Use the probabilities output by the RNN to randomly sample a chosen word for that time-step
as y^<t>. (ii) Then pass this selected word to the next time-step.
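To make the sampling loop concrete, here is a minimal numpy sketch of the sample-then-feed-forward procedure; rnn_step, a0, and the other names are hypothetical stand-ins for a trained model, not code from the course.

```python
import numpy as np

def sample_sentence(rnn_step, a0, vocab_size, eos_idx, max_len=50, seed=0):
    """Sample one sentence from a trained RNN language model.

    rnn_step(x, a) -> (probs, a_next) is one forward pass of the trained
    model: a softmax distribution over the vocabulary plus the next
    hidden state. a0 is the initial hidden state a<0>.
    """
    rng = np.random.default_rng(seed)
    x, a, sentence = np.zeros(vocab_size), a0, []
    for _ in range(max_len):
        probs, a = rnn_step(x, a)              # P(word | words sampled so far)
        idx = rng.choice(vocab_size, p=probs)  # randomly sample, NOT argmax
        if idx == eos_idx:                     # stop at the <EOS> token
            break
        sentence.append(idx)
        x = np.zeros(vocab_size)               # one-hot of the *sampled* word
        x[idx] = 1.0                           # becomes the input at t + 1
    return sentence
```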
6. You are training an RNN, and find that your weights and activations are all taking on the
value of NaN (“Not a Number”). Which of these is the most likely cause of this problem?
Vanishing gradient problem.
Exploding gradient problem.
ReLU activation function g(.) used to compute g(z), where z is too large.
Sigmoid activation function g(.) used to compute g(z), where z is too large.
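For context, the standard fix for exploding gradients (and the NaNs they produce) is gradient clipping. A minimal numpy sketch of clipping by global norm follows; the threshold of 5.0 is an illustrative choice, not a value from the course.

```python
import numpy as np

def clip_gradients(grads, max_norm=5.0):
    """Rescale a list of gradient arrays so their global L2 norm is at
    most max_norm; the standard remedy for exploding gradients."""
    total_norm = np.sqrt(sum(np.sum(g ** 2) for g in grads))
    if total_norm > max_norm:
        grads = [g * (max_norm / total_norm) for g in grads]
    return grads

# Example: an "exploded" gradient is rescaled back to norm 5.0.
g = [np.full((2, 2), 1e6)]
print(np.linalg.norm(clip_gradients(g)[0]))  # ~5.0
```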
7. Alice proposes to simplify the GRU by always removing Γu, i.e., setting Γu = 1. Betty proposes to simplify the GRU by removing Γr, i.e., setting Γr = 1 always. Which of these models is more likely to work without vanishing gradient problems, even when trained on very long input sequences?
Alice’s model (removing Γu), because if Γr≈0 for a timestep, the gradient can propagate back through that timestep without much decay.
Alice’s model (removing Γu), because if Γr≈1 for a timestep, the gradient can propagate back
through that timestep without much decay.
Betty’s model (removing Γr), because if Γu≈0 for a timestep, the gradient can propagate back
through that timestep without much decay.
Betty’s model (removing Γr), because if Γu≈1 for a timestep, the gradient can propagate back through that timestep without much decay.
8. Here are the equations for the GRU and the LSTM:
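(For reference, the standard GRU and LSTM update rules, in the Γ notation used in questions 7 and 8:)

GRU:
c̃<t> = tanh(Wc[Γr * c<t−1>, x<t>] + bc)
Γu = σ(Wu[c<t−1>, x<t>] + bu)
Γr = σ(Wr[c<t−1>, x<t>] + br)
c<t> = Γu * c̃<t> + (1 − Γu) * c<t−1>
a<t> = c<t>

LSTM:
c̃<t> = tanh(Wc[a<t−1>, x<t>] + bc)
Γu = σ(Wu[a<t−1>, x<t>] + bu)
Γf = σ(Wf[a<t−1>, x<t>] + bf)
Γo = σ(Wo[a<t−1>, x<t>] + bo)
c<t> = Γu * c̃<t> + Γf * c<t−1>
a<t> = Γo * tanh(c<t>)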
From these, we can see that the Update Gate and Forget Gate in the LSTM play a role similar to
_______ and ______ in the GRU. What should go in the blanks?
Γu and 1−Γu
Γu and Γr
1−Γu and Γu
Γr and Γu
9. You have a pet dog whose mood is heavily dependent on the current and past few days’
weather. You’ve collected data for the past 365 days on the weather, which you represent as
a sequence x<1>,…,x<365>. You’ve also collected data on your dog’s mood, which you
represent as y<1>,…,y<365>. You’d like to build a model to map from x→y. Should you use
a Unidirectional RNN or Bidirectional RNN for this problem?
Bidirectional RNN, because this allows the prediction of mood on day t to take into account more
information.
Bidirectional RNN, because this allows backpropagation to compute more accurate gradients.
Unidirectional RNN, because the value of y<t> depends only on x<1>,…,x<t>, but not on x<t+1>,…,x<365>.
Unidirectional RNN, because the value of y<t> depends only on x<t>, and not other days’
weather.