
Copyright Notice

These slides are distributed under the Creative Commons License.

DeepLearning.AI makes these slides available for educational purposes. You may not use or distribute
these slides for commercial purposes. You may make copies of these slides and use or distribute them for
educational purposes as long as you cite DeepLearning.AI as the source of the slides.

For the rest of the details of the license, see https://creativecommons.org/licenses/by-sa/2.0/legalcode


Recurrent Neural Networks

deeplearning.ai

Why sequence models?
Examples of sequence data

Speech recognition: audio clip → “The quick brown fox jumped over the lazy dog.”
Music generation: ∅ → music
Sentiment classification: “There is nothing to like in this movie.” → star rating
DNA sequence analysis: AGCCCCTGTGAGGAACTAG → labeled subsequence of AGCCCCTGTGAGGAACTAG
Machine translation: “Voulez-vous chanter avec moi?” → “Do you want to sing with me?”
Video activity recognition: video frames → “Running”
Named entity recognition: “Yesterday, Harry Potter met Hermione Granger.” → the same sentence with the names tagged

Andrew Ng
Recurrent Neural Networks

Notation
Motivating example

x: Harry Potter and Hermione Granger invented a new spell.
   x<1>, x<2>, x<3>, …, x<9>  (Tx = 9)

y: 1 1 0 1 1 0 0 0 0  (1 where the word is part of a person’s name)

Representing words

Each word is represented by its index in a dictionary, e.g.:

And = 367
Invented = 4700
A = 1
New = 5976
Spell = 8376
Harry = 4075
Potter = 6830
Hermione = 4200
Gran… = 4000

x<t> is then the one-hot vector for word t: all zeros except a 1 at that word’s index.
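The one-hot representation above is straightforward to build. A minimal numpy sketch, reusing the slide’s dictionary indices (Harry = 4075, Potter = 6830, And = 367) and assuming a 10,000-word vocabulary:

```python
import numpy as np

def one_hot(index, vocab_size):
    """One-hot column vector: all zeros except a 1 at the word's index."""
    v = np.zeros((vocab_size, 1))
    v[index] = 1.0
    return v

# Indices taken from the slide's dictionary; vocabulary size assumed 10,000.
vocab_size = 10000
word_to_index = {"harry": 4075, "potter": 6830, "and": 367}

x1 = one_hot(word_to_index["harry"], vocab_size)
print(x1.shape)       # (10000, 1)
print(x1[4075, 0])    # 1.0
```

Each x<t> in the sequence is produced the same way, one vector per word.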
Recurrent Neural Networks

Recurrent Neural Network Model
Why not a standard network?

[Diagram: a standard fully connected network taking x<1>, …, x<Tx> as inputs and producing y<1>, …, y<Ty> as outputs.]

Problems:
- Inputs and outputs can be different lengths in different examples.
- It doesn’t share features learned across different positions of text.
Recurrent Neural Networks

He said, “Teddy Roosevelt was a great President.”


He said, “Teddy bears are on sale!”
Forward Propagation

[Diagram: the RNN unrolled over time. Starting from a<0> = 0, each step takes x<t> and a<t-1> and produces a<t> and ŷ<t>.]

a<t> = g(Waa a<t-1> + Wax x<t> + ba)
ŷ<t> = g(Wya a<t> + by)
Simplified RNN notation

a<t> = g(Waa a<t-1> + Wax x<t> + ba)
ŷ<t> = g(Wya a<t> + by)

Stacking Waa and Wax side by side into a single matrix Wa = [Waa ; Wax] gives the simplified form:

a<t> = g(Wa [a<t-1>, x<t>] + ba)
ŷ<t> = g(Wy a<t> + by)
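The simplified notation maps directly to code. A minimal numpy sketch of one forward step; the layer sizes and random initialization below are illustrative, not from the slides:

```python
import numpy as np

def rnn_step(a_prev, x_t, Wa, ba, Wy, by):
    """a<t> = tanh(Wa [a<t-1>, x<t>] + ba); yhat<t> = softmax(Wy a<t> + by)."""
    concat = np.vstack([a_prev, x_t])   # [a<t-1>, x<t>] stacked into one vector
    a_t = np.tanh(Wa @ concat + ba)
    z = Wy @ a_t + by
    e = np.exp(z - z.max())             # numerically stable softmax
    return a_t, e / e.sum()

rng = np.random.default_rng(0)
n_a, n_x, n_y = 5, 3, 4                 # hidden, input, output sizes (made up)
Wa = rng.standard_normal((n_a, n_a + n_x)) * 0.1
ba = np.zeros((n_a, 1))
Wy = rng.standard_normal((n_y, n_a)) * 0.1
by = np.zeros((n_y, 1))

a0 = np.zeros((n_a, 1))                 # a<0> = zero vector
a1, y1 = rnn_step(a0, rng.standard_normal((n_x, 1)), Wa, ba, Wy, by)
print(a1.shape, y1.shape)               # (5, 1) (4, 1)
```

Calling `rnn_step` in a loop over t, feeding each a<t> back in, is exactly the unrolled forward propagation of the previous slide.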
Recurrent Neural Networks

Backpropagation through time
Forward propagation and backpropagation

[Diagram: forward propagation runs left to right through a<1>, …, a<Tx>, producing ŷ<1>, …, ŷ<Ty>; backpropagation flows right to left through the same graph.]

ℒ<t>(ŷ<t>, y<t>) = −y<t> log ŷ<t> − (1 − y<t>) log(1 − ŷ<t>)

ℒ(ŷ, y) = Σt ℒ<t>(ŷ<t>, y<t>)

Backpropagation through time: the gradient of ℒ is passed backwards through every time step, from t = Ty down to t = 1, which is what gives the procedure its name.
Recurrent Neural Networks

Different types of RNNs
Examples of RNN architectures

[Slide diagrams of the different input/output configurations, summarized in the next slide.]
Summary of RNN types

[Diagrams of the five configurations:]
- One to one: a standard network; a single input x, a single output ŷ.
- One to many: one input x<1>, a sequence of outputs ŷ<1>, …, ŷ<Ty> (each output fed back in as the next input).
- Many to one: inputs x<1>, …, x<Tx>, a single output ŷ.
- Many to many (Tx = Ty): one output per input position.
- Many to many (Tx ≠ Ty): an encoder reads x<1>, …, x<Tx>, then a decoder emits ŷ<1>, …, ŷ<Ty>.
Recurrent Neural Networks

Language model and sequence generation
What is language modelling?

Speech recognition:
“The apple and pair salad.”
“The apple and pear salad.”

P(The apple and pair salad) =
P(The apple and pear salad) =

A language model assigns a probability to each sentence, letting the recognizer prefer the far more likely second transcription.
Language modelling with an RNN

Training set: a large corpus of English text.

Cats average 15 hours of sleep a day. <EOS>

The Egyptian Mau is a breed of cat. <EOS>
RNN model

[Diagram: at each step the RNN takes the previous word as input (x<1> = 0 to start) and outputs a softmax distribution over the next word.]

Cats average 15 hours of sleep a day. <EOS>

ℒ<t>(ŷ<t>, y<t>) = −Σi yi<t> log ŷi<t>

ℒ = Σt ℒ<t>(ŷ<t>, y<t>)
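The two equations above sum the per-step cross-entropies over the sequence. A small numpy sketch; the probability values below are made-up illustrations:

```python
import numpy as np

def sequence_loss(y_hats, ys):
    """L = sum over t of ( -sum over i of y_i<t> * log yhat_i<t> )."""
    return sum(-np.sum(y * np.log(y_hat)) for y_hat, y in zip(y_hats, ys))

# Two time steps over a 3-word toy vocabulary; y<t> is one-hot for the true word.
y_hats = [np.array([[0.7], [0.2], [0.1]]),
          np.array([[0.1], [0.8], [0.1]])]
ys = [np.array([[1.0], [0.0], [0.0]]),
      np.array([[0.0], [1.0], [0.0]])]

loss = sequence_loss(y_hats, ys)
print(round(float(loss), 4))   # -log 0.7 - log 0.8 ≈ 0.5798
```

Because y<t> is one-hot, each term simply picks out −log of the probability the model gave to the correct next word.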
Recurrent Neural Networks

Sampling novel sequences
Sampling a sequence from a trained RNN

[Diagram: start with a<0> = 0 and x<1> = 0. Sample ŷ<1> from the first softmax, feed the sampled word back in as x<2>, sample ŷ<2>, and so on, until <EOS> (or a maximum length) is reached.]
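The sampling procedure above is a short loop. A minimal numpy sketch; the toy cell with random weights below is a stand-in for a trained model, and the vocabulary size, <EOS> index, and seed are arbitrary choices:

```python
import numpy as np

def sample_sequence(step_fn, vocab_size, n_a, max_len=10, eos_index=0, seed=1):
    """Start from a<0> = 0 and x<1> = 0; at each step, sample a word index
    from the softmax yhat<t> and feed its one-hot back in as the next input.
    Stop when <EOS> (index eos_index) is sampled or max_len is reached."""
    rng = np.random.default_rng(seed)
    a = np.zeros((n_a, 1))
    x = np.zeros((vocab_size, 1))
    indices = []
    for _ in range(max_len):
        a, y_hat = step_fn(a, x)
        idx = int(rng.choice(vocab_size, p=y_hat.ravel()))  # sample, don't argmax
        indices.append(idx)
        if idx == eos_index:
            break
        x = np.zeros((vocab_size, 1))
        x[idx] = 1.0
    return indices

# Toy "trained" cell: random weights over a 6-word vocabulary.
rng = np.random.default_rng(0)
V, n_a = 6, 4
Wa = rng.standard_normal((n_a, n_a + V)) * 0.1
Wy = rng.standard_normal((V, n_a)) * 0.1

def toy_step(a_prev, x_t):
    a_t = np.tanh(Wa @ np.vstack([a_prev, x_t]))
    e = np.exp(Wy @ a_t)
    return a_t, e / e.sum()

seq = sample_sequence(toy_step, V, n_a)
print(seq)   # a short list of sampled word indices
```

Sampling (rather than taking the argmax) is what makes each generated sequence novel.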
Character-level language model

ŷ<t> is a softmax vector the same length as the vocabulary. The entry with the highest probability gives the word for ŷ<t> (or, instead of the argmax, pick by random choice weighted by the probabilities). Each softmax is thus P(y<t> | y<1>, …, y<t-1>), so the probability of the whole sentence is the product P(y<1>) · P(y<2> | y<1>) · ….

Word-level vocabulary = [a, aaron, …, zulu, <UNK>]
A character-level model instead uses a vocabulary of individual characters (letters, digits, punctuation).

[Diagram: the same sampling network as above, unrolled over characters instead of words.]
Sequence generation

If the model is trained on a news corpus, the generated text has no romance or metaphor in it; if it is trained on Shakespeare, the output is lyrical and metaphorical, full of figures of speech.

News:
President enrique peña nieto, announced sench’s sulk former coming football langston paring.
“I was not at all surprised,” said hich langston.
“Concussion epidemic”, to be examined.
The gray football the told some and this has on the uefa icon, should money as.

Shakespeare:
The mortal moon hath her eclipse in love.
And subject of this thou art another this fold.
When besser be my love to me see sabl’s.
For whose are ruse of mine eyes heaves.
Recurrent Neural Networks

Vanishing gradients with RNNs
Vanishing gradients with RNNs

Example: translating from Vietnamese to English, “đã là” must be rendered as “was” or “were”. The RNN needs information from much earlier in the sentence (e.g. the word “cat” or “cats”) to choose, but that word is far away: the history does get carried along, yet across such a long gap the gradient cannot update it. This is why the LSTM is needed.

The cat, which already ate …, was full.
The cats, which already ate …, were full.

[Diagram: the unrolled RNN; the gradient from a late output ŷ<Ty> must flow back through many steps to influence early inputs, shrinking along the way.]

Exploding gradients can also occur; they are easier to detect (parameters blow up to NaN) and can be handled with gradient clipping.
Recurrent Neural Networks

Gated Recurrent Unit (GRU)
RNN unit

a<t> = g(Wa [a<t-1>, x<t>] + ba)
GRU (simplified)

c<t> = memory cell, with c<t> = a<t> in the GRU.

c̃<t> = tanh(Wc [c<t-1>, x<t>] + bc)
Γu = σ(Wu [c<t-1>, x<t>] + bu)
c<t> = Γu ∗ c̃<t> + (1 − Γu) ∗ c<t-1>

The cat, which already ate …, was full.

[Cho et al., 2014. On the properties of neural machine translation: Encoder-decoder approaches]
[Chung et al., 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling]
Full GRU

Previously (simplified): c<t> = c̃<t>, i.e. c<t> = tanh(Wc [c<t-1>, x<t>] + bc).
Replaced by: c<t> = Γu ∗ c̃<t> + (1 − Γu) ∗ c<t-1>.

c̃<t> = tanh(Wc [Γr ∗ c<t-1>, x<t>] + bc)
Γu = σ(Wu [c<t-1>, x<t>] + bu)
Γr = σ(Wr [c<t-1>, x<t>] + br)
c<t> = Γu ∗ c̃<t> + (1 − Γu) ∗ c<t-1>
a<t> = c<t>

Idea: replace the plain activation a = tanh(…) with a candidate c̃<t> = tanh(…), then let a gate Γu, varying between 0 and 1, act as a coefficient (a switch adjusting the “volume” at each step): c<t> = Γu ∗ c̃<t> + (1 − Γu) ∗ c<t-1>. Γu is the scale between c̃<t> and c<t>: it controls how much the history influences the candidate in forming c<t>. Normally c<t> would just equal c̃<t>, but with the gate we weigh in part of the past value; Γu is the percentage of present versus past.

The cat, which already ate …, was full.
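The full-GRU equations above translate directly into a step function. A minimal numpy sketch; the shapes and random initialization are illustrative:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_step(c_prev, x_t, Wc, bc, Wu, bu, Wr, br):
    """Full GRU step: relevance gate Gamma_r, candidate c~<t>, update gate
    Gamma_u, then c<t> = Gamma_u * c~<t> + (1 - Gamma_u) * c<t-1>."""
    concat = np.vstack([c_prev, x_t])                   # [c<t-1>, x<t>]
    gamma_r = sigmoid(Wr @ concat + br)                 # relevance gate
    gamma_u = sigmoid(Wu @ concat + bu)                 # update gate
    c_tilde = np.tanh(Wc @ np.vstack([gamma_r * c_prev, x_t]) + bc)
    c_t = gamma_u * c_tilde + (1.0 - gamma_u) * c_prev  # gated memory update
    return c_t                                          # a<t> = c<t> in the GRU

rng = np.random.default_rng(0)
n_c, n_x = 4, 3
shape = (n_c, n_c + n_x)
Wc, Wu, Wr = (rng.standard_normal(shape) * 0.1 for _ in range(3))
bc = bu = br = np.zeros((n_c, 1))

c1 = gru_step(np.zeros((n_c, 1)), rng.standard_normal((n_x, 1)),
              Wc, bc, Wu, bu, Wr, br)
print(c1.shape)   # (4, 1)
```

When Γu is near 0 the cell simply copies c<t-1> forward, which is how the GRU carries “the cat” across a long gap.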
Recurrent Neural Networks

LSTM (long short-term memory) unit
GRU and LSTM

GRU:
c̃<t> = tanh(Wc [Γr ∗ c<t-1>, x<t>] + bc)
Γu = σ(Wu [c<t-1>, x<t>] + bu)
Γr = σ(Wr [c<t-1>, x<t>] + br)
c<t> = Γu ∗ c̃<t> + (1 − Γu) ∗ c<t-1>
a<t> = c<t>

(In the simplified GRU there is no Γr at all; only Γu and (1 − Γu).)

Unlike the GRU, the LSTM has a separate gate for each term: Γu for c̃<t> and a forget gate Γf for c<t-1>, plus an output gate Γo that turns c<t> into a<t>. The effective coefficient on c̃<t> is therefore Γo ∗ Γu, and on c<t-1> it is Γo ∗ Γf.

[Hochreiter & Schmidhuber 1997. Long short-term memory]


LSTM units

GRU:
c̃<t> = tanh(Wc [Γr ∗ c<t-1>, x<t>] + bc)
Γu = σ(Wu [c<t-1>, x<t>] + bu)
Γr = σ(Wr [c<t-1>, x<t>] + br)
c<t> = Γu ∗ c̃<t> + (1 − Γu) ∗ c<t-1>
a<t> = c<t>

LSTM:
c̃<t> = tanh(Wc [a<t-1>, x<t>] + bc)
Γu = σ(Wu [a<t-1>, x<t>] + bu)
Γf = σ(Wf [a<t-1>, x<t>] + bf)
Γo = σ(Wo [a<t-1>, x<t>] + bo)
c<t> = Γu ∗ c̃<t> + Γf ∗ c<t-1>
a<t> = Γo ∗ c<t>

[Hochreiter & Schmidhuber 1997. Long short-term memory]
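The LSTM column above, written as a step function. A minimal numpy sketch following the slide’s equations (a<t> = Γo ∗ c<t>); shapes and initialization are illustrative:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(a_prev, c_prev, x_t, Wc, bc, Wu, bu, Wf, bf, Wo, bo):
    """One LSTM step: update gate Gamma_u, forget gate Gamma_f, output gate
    Gamma_o; c<t> = Gamma_u * c~<t> + Gamma_f * c<t-1>; a<t> = Gamma_o * c<t>."""
    concat = np.vstack([a_prev, x_t])           # gates all read [a<t-1>, x<t>]
    c_tilde = np.tanh(Wc @ concat + bc)         # candidate memory
    gamma_u = sigmoid(Wu @ concat + bu)
    gamma_f = sigmoid(Wf @ concat + bf)
    gamma_o = sigmoid(Wo @ concat + bo)
    c_t = gamma_u * c_tilde + gamma_f * c_prev  # separate update and forget gates
    a_t = gamma_o * c_t
    return a_t, c_t

rng = np.random.default_rng(0)
n_a, n_x = 4, 3
shape = (n_a, n_a + n_x)
Wc, Wu, Wf, Wo = (rng.standard_normal(shape) * 0.1 for _ in range(4))
bc = bu = bf = bo = np.zeros((n_a, 1))

a1, c1 = lstm_step(np.zeros((n_a, 1)), np.zeros((n_a, 1)),
                   rng.standard_normal((n_x, 1)),
                   Wc, bc, Wu, bu, Wf, bf, Wo, bo)
print(a1.shape, c1.shape)   # (4, 1) (4, 1)
```

Note the key difference from the GRU in code: the cell keeps two running vectors, a<t> and c<t>, and the forget gate replaces the (1 − Γu) term.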
LSTM in pictures

[Diagram of one LSTM cell: a<t-1> and x<t> feed the forget gate Γf, the update gate Γu, the output gate Γo (all sigmoids) and the tanh candidate c̃<t>; c<t> = Γu ∗ c̃<t> + Γf ∗ c<t-1>; the output gate then produces a<t>, which also passes through a softmax to give ŷ<t>. Chaining cells passes both c<t> and a<t> to the next step: c<0>, a<0> → cell → c<1>, a<1> → cell → c<2>, a<2> → …]

The a path carries the value; the c path carries the history, storing past values that supplement a.


Recurrent Neural Networks

Bidirectional RNN
Getting information from the future

He said, “Teddy bears are on sale!”
He said, “Teddy Roosevelt was a great President!”

[Diagram: a forward-only RNN unrolled over the seven words with outputs ŷ<1>, …, ŷ<7>. At the word “Teddy” the network has only seen x<1>, …, x<3>, so it cannot tell whether “Teddy” starts a person’s name without the words that come later.]
Bidirectional RNN (BRNN)

[Diagram: two unrolled RNNs over the same input, one running forward and one running backward; each output combines the two activations:]

ŷ<t> = g(Wy [a→<t>, a←<t>] + by)

Wy does not only have to map the forward activations to the right outputs; it must satisfy the backward direction as well. Roughly: if Wy is trained so that “I love you” gives “anh yêu em”, then the reversed reading “You love I” must likewise give “em yêu anh”. In other words, Wy jointly solves the two-way problem over both the forward and the backward activations.
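The bidirectional idea can be sketched as two recurrent passes plus a joint output layer computing ŷ<t> = Wy [a→<t>, a←<t>] + by. A minimal numpy sketch with toy random-weight cells standing in for trained ones (a final activation g is omitted for brevity):

```python
import numpy as np

def brnn_forward(xs, step_fwd, step_bwd, Wy, by, n_a):
    """Run one recurrent pass left-to-right and one right-to-left, then
    compute each output from both activations at that position."""
    a = np.zeros((n_a, 1))
    fwd = []
    for x in xs:                        # forward pass: t = 1 .. Tx
        a = step_fwd(a, x)
        fwd.append(a)
    a = np.zeros((n_a, 1))
    bwd = [None] * len(xs)
    for t in reversed(range(len(xs))):  # backward pass: t = Tx .. 1
        a = step_bwd(a, xs[t])
        bwd[t] = a
    # yhat<t> combines a-forward<t> and a-backward<t>
    return [Wy @ np.vstack([f, b]) + by for f, b in zip(fwd, bwd)]

rng = np.random.default_rng(0)
n_a, n_x, n_y, T = 4, 3, 2, 5
W_f = rng.standard_normal((n_a, n_a + n_x)) * 0.1
W_b = rng.standard_normal((n_a, n_a + n_x)) * 0.1
step_f = lambda a, x: np.tanh(W_f @ np.vstack([a, x]))
step_b = lambda a, x: np.tanh(W_b @ np.vstack([a, x]))
Wy = rng.standard_normal((n_y, 2 * n_a)) * 0.1
by = np.zeros((n_y, 1))

xs = [rng.standard_normal((n_x, 1)) for _ in range(T)]
ys = brnn_forward(xs, step_f, step_b, Wy, by, n_a)
print(len(ys), ys[0].shape)   # 5 (2, 1)
```

Because the backward pass must finish before any output is computed, a BRNN needs the entire input sequence up front.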
Recurrent Neural Networks

Deep RNNs
Deep RNN example

[Diagram: three stacked recurrent layers. Layer l has activations a[l]<t>; each a[l]<t> depends on a[l]<t-1> (same layer, previous step) and a[l-1]<t> (layer below, same step). The inputs x<1>, …, x<4> feed layer 1, and the outputs ŷ<1>, …, ŷ<4> come off the top layer, e.g. through per-step softmaxes.]

Example: translating “I love you too” with an English vocabulary of 1023 words and a Vietnamese vocabulary of 923 words. Each input word (“I”, “love”, “you”, “too”) is a 1×1023 one-hot vector; each output softmax is 1×923 (over “tôi”, “yêu”, …). For a classifier over the classes (dog, cat), the softmax would instead be 1×2; an image input would likewise be flattened into a single vector.

A deep RNN makes each block deeper, so the number of weight matrices grows from one set for a single-layer RNN to one set per layer, i.e. it scales with the depth.
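Stacking works by feeding each layer’s activation sequence into the next layer as its input sequence. A minimal numpy sketch of a two-layer deep RNN; all sizes are made up:

```python
import numpy as np

def deep_rnn_forward(xs, layers):
    """Stacked RNN: layer l's activation sequence is layer l+1's input sequence.
    `layers` is a list of (Wa, ba) pairs; each cell computes
    a[l]<t> = tanh(Wa [a[l]<t-1>, input<t>] + ba)."""
    seq = xs
    for Wa, ba in layers:
        a = np.zeros((ba.shape[0], 1))   # a[l]<0> = 0
        out = []
        for x in seq:
            a = np.tanh(Wa @ np.vstack([a, x]) + ba)
            out.append(a)
        seq = out                        # this layer's outputs feed the next
    return seq                           # top-layer activations, one per step

rng = np.random.default_rng(0)
n_x, n1, n2, T = 3, 5, 4, 4
layers = [
    (rng.standard_normal((n1, n1 + n_x)) * 0.1, np.zeros((n1, 1))),
    (rng.standard_normal((n2, n2 + n1)) * 0.1, np.zeros((n2, 1))),
]
xs = [rng.standard_normal((n_x, 1)) for _ in range(T)]
top = deep_rnn_forward(xs, layers)
print(len(top), top[0].shape)   # 4 (4, 1)
```

Each (Wa, ba) pair is one layer’s weight set, which is exactly why the parameter count grows with depth.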
