Artificial Neural Networks and Deep Learning
- Recurrent Neural Networks -
Artificial Intelligence and Robotics Laboratory
Politecnico di Milano
Sequence Modeling
So far we have considered only «static» datasets
[Figure: a static feedforward network with inputs x_1 … x_I, weights w_ji, and outputs g_1(x|w) … g_K(x|w)]
Sequence Modeling
So far we have considered only «static» datasets
[Figure: a time-indexed dataset, i.e., a sequence of input vectors X_0, X_1, X_2, X_3, …, X_t, each with components x_1 … x_I]
Sequence Modeling
Different ways to deal with «dynamic» data:
Memoryless models (fixed lag):
• Autoregressive models
• Feedforward neural networks

Models with memory (unlimited):
• Linear dynamical systems
• Hidden Markov models
• Recurrent Neural Networks
• ...

[Figure: the time-indexed sequence of input vectors X_0, X_1, X_2, X_3, …, X_t]
Memoryless Models for Sequences (1/2)
Autoregressive models
• Predict the next input from previous ones using «delay taps»

Linear models with fixed lag
• Predict the next output from previous inputs using «delay taps» (a small sketch follows)

[Figure: time-unrolled diagrams where weights W_{t-2}, W_{t-1} connect the past inputs to the current prediction X_t or Y_t]
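As an illustration of the «delay taps» idea, here is a minimal NumPy sketch (not from the slides) of a fixed-lag linear autoregressive predictor fitted by least squares; the lag p and the toy sine-wave sequence are assumptions made for the example.

```python
import numpy as np

# Toy sequence (assumption: a noisy sine wave) and a fixed lag p ("delay taps").
rng = np.random.default_rng(0)
x = np.sin(np.linspace(0, 20, 200)) + 0.1 * rng.standard_normal(200)
p = 3  # number of delay taps

# Build the lagged design matrix: row t holds [x_{t-1}, ..., x_{t-p}, 1].
X = np.column_stack([x[p - k - 1 : len(x) - k - 1] for k in range(p)] + [np.ones(len(x) - p)])
y = x[p:]  # target: the next value of the sequence

# Fit the fixed-lag linear model by least squares (the "memoryless" predictor).
w, *_ = np.linalg.lstsq(X, y, rcond=None)

# One-step-ahead prediction from the last p observed values.
x_next = np.concatenate([x[-p:][::-1], [1.0]]) @ w
print("predicted next value:", x_next)
```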
Memoryless Models for Sequences (2/2)
Feedforward neural networks
• Generalize autoregressive models using non-linear hidden layers

Feedforward neural networks with delays
• Predict the next output from previous inputs and previous outputs using «delay taps»

[Figure: the same time-unrolled diagrams, now with a hidden layer between the delayed inputs/outputs and the current prediction]
Dynamical Systems (Models with Memory)
Generative models with a hidden state which cannot be observed directly
• The hidden state has some dynamics, possibly affected by noise, and produces the output
• To compute the output we need to infer the hidden state
• Inputs are treated as driving inputs

In linear dynamical systems this becomes:
• State is continuous with Gaussian uncertainty
• Transformations are assumed to be linear
• State can be estimated using Kalman filtering

Stochastic systems ...

[Figure: a state-space model unrolled in time: driving inputs X_0, X_1, …, X_t feed a chain of hidden states, which emit the outputs Y_0, Y_1, …, Y_t]
Dynamical Systems (Models with Memory)
Generative models with a hidden state which cannot be observed directly
• The hidden state has some dynamics, possibly affected by noise, and produces the output
• To compute the output we need to infer the hidden state
• Inputs are treated as driving inputs

In hidden Markov models this becomes:
• State is assumed to be discrete; state transitions are stochastic (transition matrix)
• Output is a stochastic function of the hidden states
• State can be estimated via the Viterbi algorithm

Stochastic systems ...

[Figure: the same state-space model unrolled in time]
Recurrent Neural Networks
Deterministic systems ...

Memory via recurrent connections:
• A distributed hidden state allows information to be stored efficiently
• Non-linear dynamics allow complex hidden state updates

"With enough neurons and time, RNNs can compute anything that can be computed by a computer."
(Computation Beyond the Turing Limit, Hava T. Siegelmann, 1995)

[Figure: a recurrent network with inputs x_1 … x_I, hidden units h_j^t(x^t, W^(1), c^{t-1}, V^(1)), context units c_b^t(x^t, W_B^(1), c^{t-1}, V_B) fed by the previous context c_1^{t-1} … c_B^{t-1}, and output g^t(x|w)]
Recurrent Neural Networks

Memory via recurrent connections:
• A distributed hidden state allows information to be stored efficiently
• Non-linear dynamics allow complex hidden state updates

$$g^t(x_n \mid w) = g\left(\sum_{j=0}^{J} w_{1j}^{(2)} \cdot h_j^t(\cdot) + \sum_{b=0}^{B} v_{1b}^{(2)} \cdot c_b^t(\cdot)\right)$$

$$h_j^t(\cdot) = h_j\left(\sum_{i=0}^{I} w_{ji}^{(1)} \cdot x_{i,n}^t + \sum_{b=0}^{B} v_{jb}^{(1)} \cdot c_b^{t-1}\right)$$

$$c_b^t(\cdot) = c_b\left(\sum_{i=0}^{I} w_{bi}^{(1)} \cdot x_{i,n}^t + \sum_{b'=0}^{B} v_{bb'}^{(1)} \cdot c_{b'}^{t-1}\right)$$

[Figure: the same recurrent network, annotating the output g^t(x|w), the hidden units h_j^t and the context units c_b^t fed by the previous context c_1^{t-1} … c_B^{t-1}]
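A minimal NumPy sketch of one forward pass through the equations above, assuming tanh for both the hidden and context nonlinearities, a linear output, and arbitrary small sizes I, J, B:

```python
import numpy as np

rng = np.random.default_rng(0)
I, J, B = 4, 5, 3          # assumed sizes: inputs, hidden units, context units
T = 10                     # sequence length

# Weight matrices (the bias terms are folded in via an extra "1" at index 0).
W1 = rng.standard_normal((J, I + 1)) * 0.1   # w_ji^(1): input   -> hidden
V1 = rng.standard_normal((J, B + 1)) * 0.1   # v_jb^(1): context -> hidden
WB = rng.standard_normal((B, I + 1)) * 0.1   # w_bi^(1): input   -> context
VB = rng.standard_normal((B, B + 1)) * 0.1   # v_bb'^(1): context -> context
W2 = rng.standard_normal((J + 1,)) * 0.1     # w_1j^(2): hidden  -> output
V2 = rng.standard_normal((B + 1,)) * 0.1     # v_1b^(2): context -> output

x = rng.standard_normal((T, I))              # toy input sequence
c = np.zeros(B)                              # initial context c^0

for t in range(T):
    x_t = np.concatenate(([1.0], x[t]))      # prepend bias input
    c_prev = np.concatenate(([1.0], c))      # prepend bias context
    h = np.tanh(W1 @ x_t + V1 @ c_prev)      # h_j^t
    c = np.tanh(WB @ x_t + VB @ c_prev)      # c_b^t (new context, fed back next step)
    g = W2 @ np.concatenate(([1.0], h)) + V2 @ np.concatenate(([1.0], c))  # g^t(x|w)
    print(f"t={t}  g^t = {g:.4f}")
```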
Backpropagation Through Time
[Figure: the recurrent network from the previous slide, with hidden units h_j^t and context units c_b^t, to be unrolled in time]
Backpropagation Through Time
All these weights should be the same.

[Figure: the network unrolled over several time steps; the input-to-hidden weights W and the context weights V are replicated at every step]
Backpropagation Through Time
1
• Perform network unroll for U steps 𝑤11 ℎ𝑗𝑡 𝑥 𝑡 , W 1
, 𝑐 𝑡−1 , V 1
x1
• Initialize WB , 𝑉𝐵 replicas to be the same …
• Compute gradients and update replicas 𝑤𝑗𝑖 1
with the average of their gradients xi
𝑈−1 𝑈−1
… …
1 𝜕𝐸 1 𝜕𝐸 𝑡
𝑊𝐵 = 𝑊𝐵 − 𝜂 ⋅ 𝑉 = 𝑉𝐵 − 𝜂 ⋅
𝑈 𝜕𝑊𝐵𝑡−𝑢 𝐵 𝑈 𝜕𝑉𝐵𝑡−𝑢 xI 𝑤𝐽𝐼 𝑔𝑡 𝑥 w
𝑢=0 𝑢=0
… … … … 1 (1)
… 𝑐𝑏𝑡 𝑥 𝑡 , W𝐵 , 𝑐 𝑡−1 , VB
𝑉𝐵𝑡−3 𝑉𝐵𝑡−2 𝑉𝐵𝑡−1 𝑉𝐵𝑡
13
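A hedged PyTorch sketch of truncated backpropagation through time: the network is unrolled for U steps with shared weights, autograd sums the gradient contributions of the U replicas for each shared parameter (the 1/U averaging above amounts to a rescaling of the learning rate), and the hidden state is detached between windows. The model, data and hyperparameters are assumptions for the example.

```python
import torch
import torch.nn as nn

# Assumed toy setup: predict the next value of a scalar sequence with a vanilla RNN.
torch.manual_seed(0)
rnn = nn.RNN(input_size=1, hidden_size=16, batch_first=True)
readout = nn.Linear(16, 1)
opt = torch.optim.SGD(list(rnn.parameters()) + list(readout.parameters()), lr=0.01)

seq = torch.sin(torch.linspace(0, 30, 300)).reshape(1, -1, 1)  # (batch, time, features)
U = 20                                                         # truncation length (unroll steps)

h = torch.zeros(1, 1, 16)                                      # initial hidden state
for start in range(0, seq.size(1) - U - 1, U):
    x = seq[:, start : start + U, :]                           # U-step window
    y = seq[:, start + 1 : start + U + 1, :]                   # next-step targets
    out, h = rnn(x, h)                                         # unroll U steps (weights shared)
    loss = ((readout(out) - y) ** 2).mean()
    opt.zero_grad()
    loss.backward()        # gradients of the shared weights accumulate over the U replicas
    opt.step()
    h = h.detach()         # truncate: do not backpropagate into earlier windows
print("final window loss:", loss.item())
```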
How much should we go back in time?
Sometimes the output might be related to some input that happened quite long before:

    Jane walked into the room. John walked in too.
    It was late in the day. Jane said hi to <???>

However, backpropagation through time was not able to train recurrent
neural networks significantly back in time ...
This was due to not being able to backpropagate through many layers ...

[Figure: the recurrent network with hidden units h_j^t and context units c_b^t]
How much can we go back in time?
To better understand why it was not working, consider a simplified case:

$$h^t = h(v^{(1)} \cdot h^{t-1} + w^{(1)} \cdot x) \qquad y^t = g(w^{(2)} \cdot h^t)$$

Backpropagation over an entire sequence S is computed as

$$\frac{\partial E}{\partial w} = \sum_{t=1}^{S} \frac{\partial E^t}{\partial w} \qquad
\frac{\partial E^t}{\partial w} = \frac{\partial E^t}{\partial y^t}\,\frac{\partial y^t}{\partial h^t}\,\frac{\partial h^t}{\partial h^k}\,\frac{\partial h^k}{\partial w} \qquad
\frac{\partial h^t}{\partial h^k} = \prod_{i=k+1}^{t} \frac{\partial h^i}{\partial h^{i-1}} = \prod_{i=k+1}^{t} v^{(1)}\, h'\!\left(v^{(1)} \cdot h^{i-1} + w^{(1)} \cdot x\right)$$

If we consider the norm of these terms

$$\left\lVert \frac{\partial h^i}{\partial h^{i-1}} \right\rVert \le \left|v^{(1)}\right| \cdot \lVert h' \rVert \le \gamma_v \cdot \gamma_{h'} \qquad
\left\lVert \frac{\partial h^t}{\partial h^k} \right\rVert \le \left(\gamma_v \cdot \gamma_{h'}\right)^{t-k}$$

If γ_v ⋅ γ_{h'} < 1 this converges to 0 ...
With Sigmoids and Tanh we have vanishing gradients (a numeric check follows).
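A small NumPy check, as an illustration only, of how the product of Jacobian factors shrinks for a tanh unit once γ_v ⋅ γ_{h'} < 1; the scalar weights and the constant input are arbitrary assumptions.

```python
import numpy as np

# Scalar RNN from the slide: h^t = tanh(v1 * h^{t-1} + w1 * x), with a constant input x.
v1, w1, x = 0.9, 0.5, 1.0
h = 0.0
grad = 1.0  # running product of the factors dh^i/dh^{i-1}

for i in range(1, 31):
    a = v1 * h + w1 * x
    h = np.tanh(a)
    dtanh = 1.0 - np.tanh(a) ** 2         # h'(a) <= 1 for tanh
    grad *= v1 * dtanh                    # one factor of the product ∂h^t/∂h^k
    if i % 10 == 0:
        print(f"after {i:2d} steps  |∂h^t/∂h^0| ≈ {abs(grad):.2e}")  # geometric decay
```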
Which Activation Function?
Sigmoid activation function

$$g(a) = \frac{1}{1 + \exp(-a)} \qquad g'(a) = g(a)\,(1 - g(a)) \qquad
g'(0) = g(0)\,(1 - g(0)) = \frac{1}{1+\exp(0)} \cdot \frac{\exp(0)}{1+\exp(0)} = 0.25$$

Tanh activation function

$$g(a) = \frac{\exp(a) - \exp(-a)}{\exp(a) + \exp(-a)} \qquad g'(a) = 1 - g(a)^2 \qquad
g'(0) = 1 - g(0)^2 = 1 - \left(\frac{\exp(0) - \exp(0)}{\exp(0) + \exp(0)}\right)^{2} = 1$$
Dealing with Vanishing Gradient
Force all gradients to be either 0 or 1:

$$g(a) = \mathrm{ReLU}(a) = \max(0, a) \qquad g'(a) = \mathbb{1}_{a > 0}$$

Build Recurrent Neural Networks using small modules that are designed
to remember values for a long time:

$$h^t = v^{(1)} \cdot h^{t-1} + w^{(1)} \cdot x \qquad y^t = g(w^{(2)} \cdot h^t) \qquad v^{(1)} = 1$$

With v^(1) = 1 it only accumulates the input ... (a short check follows)
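A tiny NumPy check (illustration only) that a linear memory cell with v^(1) = 1 simply accumulates its inputs, so the gradient through the recurrent loop is exactly 1 at every step; the input sequence is arbitrary.

```python
import numpy as np

v1, w1 = 1.0, 1.0                  # fixed recurrent weight v^(1) = 1 (the "accumulator")
x = np.array([0.5, -1.0, 2.0, 0.0, 1.5])

h = 0.0
for t, x_t in enumerate(x, start=1):
    h = v1 * h + w1 * x_t          # h^t = h^{t-1} + x^t
    print(f"h^{t} = {h:+.2f}   (running sum = {x[:t].sum():+.2f})")
# dh^t/dh^{t-1} = v1 = 1 at every step: the gradient neither vanishes nor explodes.
```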
Long Short-Term Memories (LSTM)
Hochreiter & Schmidhuber (1997) solved the vanishing gradient problem by
designing a memory cell using logistic and linear units with
multiplicative interactions:
• Information gets into the cell whenever its "write" gate is on.
• The information stays in the cell as long as its "keep" gate is on.
• Information is read from the cell by turning on its "read" gate.

We can backpropagate through this since the loop has a fixed weight.
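Below is a minimal NumPy sketch of one step of an LSTM cell in the common modern formulation, where the slide's "write", "keep" and "read" gates correspond roughly to the input, forget and output gates; the sizes and random weights are assumptions.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

rng = np.random.default_rng(0)
n_in, n_hid = 4, 8                      # assumed sizes
x = rng.standard_normal(n_in)           # current input x^t
h = np.zeros(n_hid)                     # previous hidden state h^{t-1}
c = np.zeros(n_hid)                     # previous cell state  c^{t-1}

# One weight matrix per gate, acting on [x^t, h^{t-1}] (biases omitted for brevity).
z = np.concatenate([x, h])
W_i, W_f, W_o, W_g = (rng.standard_normal((n_hid, n_in + n_hid)) * 0.1 for _ in range(4))

i = sigmoid(W_i @ z)      # input ("write") gate: how much new information enters the cell
f = sigmoid(W_f @ z)      # forget ("keep") gate: how much of the old cell state is kept
o = sigmoid(W_o @ z)      # output ("read") gate: how much of the cell is exposed
g = np.tanh(W_g @ z)      # candidate cell update

c = f * c + i * g         # cell state: the additive loop that lets gradients flow
h = o * np.tanh(c)        # new hidden state
print("h^t:", np.round(h, 3))
```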
RNN vs. LSTM
[Figure: a chain of repeating vanilla RNN cells vs. a chain of LSTM cells]
Images from: https://colah.github.io/posts/2015-08-Understanding-LSTMs/
Long Short-Term Memory
[Figure: the internals of an LSTM cell]
Long Short-Term Memory
Input gate
[Figure: the LSTM cell with the input gate highlighted]
Long Short-Term Memory
Forget gate
[Figure: the LSTM cell with the forget gate highlighted]
Long Short-Term Memory
Memory gate
[Figure: the LSTM cell with the memory (cell state) update highlighted]
Long Short-Term Memory
Output gate
[Figure: the LSTM cell with the output gate highlighted]
Gated Recurrent Unit (GRU)
It combines the forget and input gates into a single “update gate.” It also
merges the cell state and hidden state, and makes some other changes.
[Figure: the internals of a GRU cell]
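For comparison, a brief PyTorch sketch (with assumed sizes) showing that nn.LSTM carries both a hidden state and a separate cell state, while nn.GRU, having merged them, carries only a hidden state:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
x = torch.randn(2, 15, 10)                     # (batch=2, time=15, features=10), assumed sizes

lstm = nn.LSTM(input_size=10, hidden_size=32, batch_first=True)
gru = nn.GRU(input_size=10, hidden_size=32, batch_first=True)

out_lstm, (h_n, c_n) = lstm(x)                 # LSTM keeps a separate cell state c_n
out_gru, h_gru = gru(x)                        # GRU has merged cell and hidden state

print(out_lstm.shape, h_n.shape, c_n.shape)    # (2, 15, 32), (1, 2, 32), (1, 2, 32)
print(out_gru.shape, h_gru.shape)              # (2, 15, 32), (1, 2, 32)
```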
LSTM Networks
You can build a computation graph with continuous transformations.
[Figure: a chain of hidden (LSTM) blocks unrolled in time, mapping inputs X_0, X_1, …, X_t to outputs Y_0, Y_1, …, Y_t]
Multiple Layers and Bidirectional LSTM Networks
A computation graph in time with continuous transformations.
Hierarchical representation

[Figure: a stack of two LSTM layers followed by a ReLU layer, unrolled in time, mapping inputs X_0, X_1, …, X_t to outputs Y_0, Y_1, …, Y_t]
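A hedged PyTorch sketch of such a stack, using num_layers=2 and a small ReLU readout in place of the output layer drawn in the figure; all sizes are assumptions.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
x = torch.randn(4, 25, 8)                        # (batch, time, features), assumed sizes

# Two stacked LSTM layers, as in the figure, plus a per-time-step readout.
stack = nn.LSTM(input_size=8, hidden_size=64, num_layers=2, batch_first=True)
readout = nn.Sequential(nn.Linear(64, 32), nn.ReLU(), nn.Linear(32, 1))

out, _ = stack(x)          # out: hidden states of the top LSTM layer, shape (4, 25, 64)
y = readout(out)           # one output per time step, shape (4, 25, 1)
print(y.shape)
```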
Tips & Tricks
When conditioning on the full input sequence, bidirectional RNNs can exploit it:
• Have one RNN traverse the sequence left-to-right
• Have another RNN traverse the sequence right-to-left
• Use the concatenation of their hidden layers as the feature representation
Multiple Layers and Bidirectional LSTM Networks
A computation graph in time with continuous transformations.
Hierarchical representation

[Figure: two stacked bidirectional LSTM networks, one processing X_0 … X_t left-to-right and one processing X_t … X_0 right-to-left ("bidirectional processing")]
Tips & Tricks
When conditioning on the full input sequence, bidirectional RNNs can exploit it:
• Have one RNN traverse the sequence left-to-right
• Have another RNN traverse the sequence right-to-left
• Use the concatenation of their hidden layers as the feature representation

When initializing an RNN we need to specify the initial state:
• We could initialize it to a fixed value (such as 0)
• It is better to treat the initial state as learned parameters (see the sketch below):
  • Start off with random guesses of the initial state values
  • Backpropagate the prediction error through time all the way to the initial state values and compute the gradient of the error with respect to these
  • Update these parameters by gradient descent
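A brief PyTorch sketch (illustrative, with assumed sizes) of treating the initial state as learned parameters: registering h_0 as an nn.Parameter lets backpropagation through time reach it, so gradient descent updates it together with the weights.

```python
import torch
import torch.nn as nn

class GRUWithLearnedInit(nn.Module):
    """Toy GRU regressor whose initial hidden state is a trainable parameter."""

    def __init__(self, n_in=3, n_hid=16):
        super().__init__()
        self.gru = nn.GRU(n_in, n_hid, batch_first=True)
        self.h0 = nn.Parameter(torch.randn(1, 1, n_hid) * 0.1)  # learned initial state
        self.out = nn.Linear(n_hid, 1)

    def forward(self, x):
        h0 = self.h0.expand(1, x.size(0), -1).contiguous()      # one copy per batch element
        y, _ = self.gru(x, h0)
        return self.out(y)

torch.manual_seed(0)
model = GRUWithLearnedInit()
x, target = torch.randn(4, 12, 3), torch.randn(4, 12, 1)        # assumed toy data
loss = ((model(x) - target) ** 2).mean()
loss.backward()                                                 # the gradient reaches h0 as well
print(model.h0.grad.abs().mean())                               # non-zero: h0 is being learned
```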
Sequential Data Problems
• Fixed-sized input to fixed-sized output (e.g., image classification)
• Sequence output (e.g., image captioning: takes an image and outputs a sentence of words)
• Sequence input (e.g., sentiment analysis: a given sentence is classified as expressing positive or negative sentiment)
• Sequence input and sequence output (e.g., machine translation: an RNN reads a sentence in English and then outputs a sentence in French)
• Synced sequence input and output (e.g., video classification, where we wish to label each frame of the video)

Image credits: Andrej Karpathy
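As an illustration only (sizes assumed), the tensor shapes some of these patterns correspond to around a single nn.LSTM in PyTorch:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
lstm = nn.LSTM(input_size=10, hidden_size=32, batch_first=True)
x = torch.randn(1, 20, 10)            # one sequence of 20 steps, 10 features each
out, (h_n, _) = lstm(x)

# Synced sequence input and output (e.g. per-frame labels): one prediction per time step.
per_step = nn.Linear(32, 5)(out)      # shape (1, 20, 5)

# Sequence input, single output (e.g. sentiment): use only the final hidden state.
single = nn.Linear(32, 2)(h_n[-1])    # shape (1, 2)

# Sequence input and sequence output (e.g. translation): the final state would be passed
# to a separate decoder RNN that generates the output sequence step by step.
print(per_step.shape, single.shape)
```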
Sequence to Sequence Learning Examples (1/3)
Image Captioning: input a single image and get a sequence of words that
describes it as output. The image has a fixed size, but the output has
varying length.
Sequence to Sequence Learning Examples (2/3)
Sentiment Classification/Analysis: input a sequence of characters or
words, e.g., a tweet, and classify the sequence into positive or negative
sentiment. The input has varying length; the output is of a fixed type and size.
Sequence to Sequence Learning Examples (3/3)
Language Translation: having some text in a particular language, e.g.,
English, we wish to translate it into another, e.g., French. Each language has
its own semantics, and the same sentence has varying lengths across languages.