Lab RNN Intro

This document introduces recurrent neural networks (RNNs) and their applications. It gives examples of RNN tasks such as image captioning, sentiment classification, translation, and video classification. It then explains the vanilla RNN model, how RNNs are unfolded in time, and backpropagation through time, followed by truncated backpropagation, teacher forcing, and warm-starting. Finally, it introduces long short-term memory (LSTM) cells, their components and equations, and their practical use in large sequence-to-sequence models.


Machine Learning

- Intro to Recurrent Neural Networks -


RNN Tasks

RNN Tasks

Vanilla RNNs

Source: CS231n Lecture 10

RNN Tasks

e.g. Image Captioning


Image → sequence of words
Source: CS231n Lecture 10

RNN Tasks

e.g. Sentiment Classification


Sequence of words → sentiment
Source: CS231n Lecture 10

RNN Tasks

e.g. Translation
Sequence of words → sequence of words
Source: CS231n Lecture 10

RNN Tasks

e.g. Video classification on frame level

Source: CS231n Lecture 10
RNN Model

Vanilla RNN Model

[Figure: RNN cell with input x^(t), hidden state h^(t), output y^(t), and weights w_ih, w_hh, w_ho]

Current state depends on current inputs and previous state

RNNs can yield outputs at each time step:

h^(t) = f_{w_hh}( h^(t-1), f_{w_ih}( x^(t) ) )

y^(t) = f_{w_ho}( h^(t) ),  ∀ t ∈ {1 ... τ}

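As a concrete reading of the two equations above, here is a minimal NumPy sketch of a vanilla RNN forward pass, assuming tanh nonlinearities and a linear readout (one common instantiation); the names rnn_forward, W_ih, W_hh, W_ho are illustrative, not from the slides.

import numpy as np

def rnn_forward(xs, h0, W_ih, W_hh, W_ho, b_h, b_y):
    """xs: list of input vectors x^(t); h0: initial hidden state h^(0)."""
    h, ys = h0, []
    for x in xs:
        # h^(t) = f_{w_hh}(h^(t-1), f_{w_ih}(x^(t))), here with tanh as the nonlinearity
        h = np.tanh(W_hh @ h + W_ih @ x + b_h)
        # y^(t) = f_{w_ho}(h^(t)), here a linear readout
        ys.append(W_ho @ h + b_y)
    return ys, h
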
Unfolding RNN in time

[Figure: the recurrent cell unfolded across time steps]
Source: NN Lectures, Tudor Berariu, 2016
Backpropagation through time

Forward through the entire sequence to compute the loss, then backward through the entire sequence to compute the gradient.

[Figure: loss accumulated over the fully unrolled sequence]
Source: CS231n Lecture 10


Truncated Backpropagation through time

Run forward and backward through chunks of the sequence instead of the whole sequence.

Source: CS231n Lecture 10


Truncated Backpropagation through time

Carry hidden states forward in time forever, but only backpropagate for some smaller number of steps.

Source: CS231n Lecture 10




Truncated BPTT


Used in practice.

Summary of the algorithm (see the sketch below):
– Present a sequence of k1 time steps of input/output pairs to the network.
– Unroll the network, then calculate and accumulate errors across k2 time steps.
– Roll up the network and update the weights.
– Repeat.
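
The sketch below shows this loop in PyTorch for the common special case k1 = k2 = k. The names model, criterion, optimizer and the inputs/targets tensors are assumptions used only for illustration; model is any recurrent module with the (output, hidden) interface of torch.nn.RNN.

import torch

def train_truncated_bptt(model, criterion, optimizer, inputs, targets, k=35):
    # inputs, targets: tensors of shape (seq_len, batch, features)
    h = None                         # hidden state carried forward across chunks
    for start in range(0, inputs.size(0), k):
        x = inputs[start:start + k]
        y = targets[start:start + k]
        if h is not None:
            h = h.detach()           # keep the value, but cut the backprop graph here
        out, h = model(x, h)         # unroll the network over this chunk only
        loss = criterion(out, y)     # accumulate errors across the k time steps
        optimizer.zero_grad()
        loss.backward()              # backpropagate through the chunk, not the whole sequence
        optimizer.step()
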
Teacher Forcing and Warm-start


When training an RNN to generate a sequence, the predictions (outputs y^(t)) of the RNN cell are often used as the input of the cell at the next time step.

Teacher Forcing: at training time, use the targets of the sequence, instead of the RNN's own predictions, as inputs to the next step.

Warm-start: when using an RNN to predict the next value conditioned on previous predictions, it is sometimes necessary to give the RNN some context (known ground-truth elements) before letting it predict on its own. Both are sketched below.
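
A minimal sketch of the two strategies for a single-step recurrent cell. cell (returning an output and a new hidden state) and to_input (mapping a target or prediction to the next input, e.g. an embedding lookup) are hypothetical helpers, named here only for illustration.

def unroll_teacher_forcing(cell, to_input, x0, h, targets):
    """At every step, feed the ground-truth target as the next input."""
    x, outputs = x0, []
    for y_true in targets:
        y_pred, h = cell(x, h)
        outputs.append(y_pred)
        x = to_input(y_true)          # teacher forcing: next input comes from the target
    return outputs, h

def unroll_with_warm_start(cell, to_input, x0, h, context, n_steps):
    """Condition on known ground truth first, then predict on the model's own outputs."""
    x = x0
    for y_true in context:            # warm-start: feed known context elements
        _, h = cell(x, h)
        x = to_input(y_true)
    outputs = []
    for _ in range(n_steps):          # free running: condition on own predictions
        y_pred, h = cell(x, h)
        outputs.append(y_pred)
        x = to_input(y_pred)
    return outputs, h
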
LSTM

LSTM Cell

Img source: https://medium.com/@kangeugine/


Input Gate (i ∈ (0, 1), sigmoid) – scales the input to the cell (write)

Output Gate (o ∈ (0, 1), sigmoid) – scales the output from the cell (read)

Forget Gate (f ∈ (0, 1), sigmoid) – scales the old cell values (reset memory)
LSTM Cell - Equations

i_t = σ( θ_xi x^(t) + θ_hi h^(t-1) + b_i )

f_t = σ( θ_xf x^(t) + θ_hf h^(t-1) + b_f )

o_t = σ( θ_xo x^(t) + θ_ho h^(t-1) + b_o )

g_t = tanh( θ_xg x^(t) + θ_hg h^(t-1) + b_g )

c_t = f_t ⊙ c^(t-1) + i_t ⊙ g_t

h_t = o_t ⊙ tanh(c_t), where ⊙ is elementwise multiplication

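A minimal NumPy sketch of one LSTM step that follows the equations above; the parameters are packed into a plain dict p whose keys mirror the θ and b symbols (the packing and the name lstm_step are illustrative).

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, p):
    """One time step: (x^(t), h^(t-1), c^(t-1)) -> (h_t, c_t)."""
    i = sigmoid(p["theta_xi"] @ x_t + p["theta_hi"] @ h_prev + p["b_i"])   # input gate (write)
    f = sigmoid(p["theta_xf"] @ x_t + p["theta_hf"] @ h_prev + p["b_f"])   # forget gate (reset memory)
    o = sigmoid(p["theta_xo"] @ x_t + p["theta_ho"] @ h_prev + p["b_o"])   # output gate (read)
    g = np.tanh(p["theta_xg"] @ x_t + p["theta_hg"] @ h_prev + p["b_g"])   # cell candidate
    c = f * c_prev + i * g            # c_t = f_t ⊙ c^(t-1) + i_t ⊙ g_t (elementwise)
    h = o * np.tanh(c)                # h_t = o_t ⊙ tanh(c_t)
    return h, c
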
LSTMs in practice


Sutskever et al., Sequence to Sequence Learning with Neural Networks, NIPS 2014
– Models are huge :-)
– 4 layers, 1000 LSTM cells per layer
– Input vocabulary of 160k
– Output vocabulary of 80k
– 1000-dimensional word embeddings
