Deep Learning Unit-III
Unit-III
Recurrent Neural Network
Dr. Rajesh Thumma
Associate Professor
Anurag University
Recurrent Neural Network (RNN)
• A recurrent neural network (RNN) is a type of artificial neural network that works with sequential data or time series data.
• RNNs are used for speech recognition, time series prediction, natural language processing, and image captioning; they are incorporated into popular applications such as Siri, voice search, and Google Translate.
• Like ANNs and CNNs, RNNs utilize training data to learn. They are distinguished by their “memory”: they take information from prior inputs to influence the current input and output. While traditional deep neural networks assume that inputs and outputs are independent of each other, the output of an RNN depends on the prior elements within the sequence.
Recurrent Neural Network
• The most important feature of an RNN is its hidden state, which remembers some information about a sequence. This state is also referred to as the memory state, since it remembers the previous input to the network.
The nodes in the different layers of the neural network are compressed to form a single layer of the recurrent neural network. A, B, and C are the parameters of the network.
Recurrent Neural Network
Fig: Fully connected Recurrent Neural Network
Here, “x” is the input layer, “h” is the hidden layer, and “y” is the output layer. A, B, and C are the network parameters used to improve the output of the model. At any given time t, the current state is computed from the input x(t) together with the state carried over from the previous time step, and the output at each time step is fed back into the network to improve subsequent outputs.
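Using the labels in the figure, and assuming A weights the input, B the recurrent (hidden-to-hidden) connection, and C the output, the computation at each time step can be sketched as:

    h(t) = g(A · x(t) + B · h(t-1))
    y(t) = C · h(t)

where g is a non-linear activation such as tanh; this is the standard recurrence rather than a formula given on the slides.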
Recurrent Neural Network
Why Recurrent Neural Networks?
RNNs were created because feed-forward neural networks have a few limitations: they cannot handle sequential data, they consider only the current input, and they cannot memorize previous inputs.
The solution to these issues is the RNN. An RNN can handle sequential data, accepting both the current input and previously received inputs, and it can memorize previous inputs thanks to its internal memory.
The input layer ‘x’ takes in the input to the neural network, processes it, and passes it on to the middle layer. The middle layer ‘h’ can consist of multiple hidden layers, each with its own activation functions, weights, and biases. The recurrent neural network standardizes these activation functions, weights, and biases so that each hidden layer has the same parameters. Then, instead of creating multiple hidden layers, it creates one and loops over it as many times as required, as sketched below.
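A minimal NumPy sketch of this "one layer looped over time" idea follows; the sizes, the tanh activation, and the weight names Wx, Wh, Wy are illustrative assumptions, not taken from the slides.

```python
import numpy as np

# Illustrative sizes (assumptions): 4 input features, 8 hidden units, 3 outputs
input_size, hidden_size, output_size = 4, 8, 3

# One set of parameters, shared across every time step
Wx = np.random.randn(hidden_size, input_size) * 0.1   # input-to-hidden weights
Wh = np.random.randn(hidden_size, hidden_size) * 0.1  # hidden-to-hidden weights (the "loop")
Wy = np.random.randn(output_size, hidden_size) * 0.1  # hidden-to-output weights
bh = np.zeros(hidden_size)

def rnn_forward(xs):
    """Run the single recurrent layer over a sequence xs of shape (T, input_size)."""
    h = np.zeros(hidden_size)                 # initial hidden (memory) state
    outputs = []
    for x_t in xs:                            # the same layer is looped over the T time steps
        h = np.tanh(Wx @ x_t + Wh @ h + bh)   # new state from current input and previous state
        outputs.append(Wy @ h)                # per-step output
    return np.array(outputs), h

sequence = np.random.randn(5, input_size)     # a toy sequence of 5 time steps
ys, h_final = rnn_forward(sequence)
print(ys.shape)                               # (5, 3): one output for every time step
```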
Training through RNN
1. The network takes a single time step of the input.
2. The current state is calculated from the current input and the previous state.
3. The current state ht then becomes ht-1 for the next time step.
4. One can go through as many time steps as the problem requires and join the information from all the previous states.
5. Once all the time steps are completed, the final current state is used to calculate the output.
6. The error is then computed as the difference between the actual output and the predicted output.
7. The error is backpropagated to the network to adjust the weights and produce a better outcome.
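To make the steps concrete, here is a hedged PyTorch sketch; the layer sizes, the MSE loss, and the toy data are assumptions made only for illustration. Steps 1-5 correspond to the forward pass through nn.RNN, and steps 6-7 to the loss computation and loss.backward().

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

rnn = nn.RNN(input_size=4, hidden_size=8, batch_first=True)  # weights shared across time
head = nn.Linear(8, 1)                                       # maps the final state to an output
optimizer = torch.optim.SGD(list(rnn.parameters()) + list(head.parameters()), lr=0.01)
loss_fn = nn.MSELoss()

x = torch.randn(2, 5, 4)        # toy batch: 2 sequences, 5 time steps, 4 features
target = torch.randn(2, 1)      # toy targets

outputs, h_n = rnn(x)           # steps 1-4: states computed time step by time step
pred = head(h_n[-1])            # step 5: output calculated from the final state
loss = loss_fn(pred, target)    # step 6: error between predicted and actual output
loss.backward()                 # step 7: error backpropagated (through time) to the weights
optimizer.step()
optimizer.zero_grad()
```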
Applications of Recurrent Neural Networks
Typical applications include speech recognition, time series prediction, natural language processing, and image captioning, as in assistants such as Siri, voice search, and Google Translate.
Types of Recurrent Neural Networks
There are four types of Recurrent Neural Networks:
1. One to One
2. One to Many
3. Many to One
4. Many to Many
Types of Recurrent Neural Networks
One to One RNN: This type of neural network is known as the Vanilla Neural Network. It is used for general machine learning problems that have a single input and a single output.
Types of Recurrent Neural Networks
One to Many RNN: This type of neural network has a single input and multiple outputs. An example of this is image captioning.
Types of Recurrent Neural Networks
Many to One RNN: This RNN takes a sequence of inputs and generates a single
output. Sentiment analysis is a good example of this kind of network where a
given sentence can be classified as expressing positive or negative sentiments.
Types of Recurrent Neural Networks
Many to Many RNN: This RNN takes a sequence of inputs and generates a
sequence of outputs. Machine translation is one of the examples.
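The practical difference between these types is mainly which inputs are fed in and which outputs are kept. A hedged PyTorch sketch of the Many to One and Many to Many cases (all sizes and output heads are illustrative assumptions):

```python
import torch
import torch.nn as nn

rnn = nn.RNN(input_size=10, hidden_size=16, batch_first=True)
x = torch.randn(1, 7, 10)                     # one sequence of 7 time steps

outputs, h_n = rnn(x)                         # outputs: one hidden state per time step

# Many to One (e.g. sentiment analysis): use only the final state
sentiment_head = nn.Linear(16, 2)             # positive / negative
sentiment_logits = sentiment_head(h_n[-1])    # shape (1, 2)

# Many to Many (e.g. tagging or translation-style outputs): use every time step
tag_head = nn.Linear(16, 5)                   # 5 hypothetical output classes per step
tag_logits = tag_head(outputs)                # shape (1, 7, 5)

# One to Many (e.g. image captioning) would instead feed a single input
# and decode an output sequence step by step.
```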
Advantages of Recurrent Neural Network
• Ability To Handle Variable-Length Sequences: RNNs are designed to handle input
sequences of variable length, which makes them well-suited for tasks such as speech
recognition, natural language processing, and time series analysis.
• Memory of Past Inputs: RNNs have a memory of past inputs, which allows them to
capture information about the context of the input sequence. This makes them useful
for tasks such as language modeling, where the meaning of a word depends on the
context in which it appears.
• Parameter Sharing: RNNs share the same set of parameters across all time steps,
which reduces the number of parameters that need to be learned and can lead to better
generalization.
Advantages of Recurrent Neural Network
• Non-Linear Mapping: RNNs use non-linear activation functions, which allows
them to learn complex, non-linear mappings between inputs and outputs.
• Sequential Processing: RNNs process input sequences step by step, which lets them model order and timing naturally (although, as noted under the disadvantages, this sequential nature limits parallelism).
• Flexibility: RNNs can be adapted to a wide range of tasks and input types,
including text, speech, and image sequences.
• Improved Accuracy: RNNs have been shown to achieve state-of-the-art
performance on a variety of sequence modeling tasks, including language
modeling, speech recognition, and machine translation.
Disadvantages of Recurrent Neural Network
• Vanishing And Exploding Gradients: RNNs can suffer from the problem of
vanishing or exploding gradients, which can make it difficult to train the
network effectively. This occurs when the gradients of the loss function with
respect to the parameters become very small or very large as they propagate
through time.
• Computational Complexity: RNNs can be computationally expensive to train,
especially when dealing with long sequences. This is because the network has to
process each input in sequence, which can be slow.
• Difficulty In Capturing Long-Term Dependencies: Although RNNs are
designed to capture information about past inputs, they can struggle to capture
long-term dependencies in the input sequence. This is because the gradients can
become very small as they propagate through time, which can cause the
network to forget important information.
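A tiny numeric illustration of the vanishing-gradient effect; the 0.5 factor is an arbitrary stand-in for a per-step gradient scaling smaller than 1:

```python
grad = 1.0
for _ in range(50):      # backpropagating the error through 50 time steps
    grad *= 0.5          # each step shrinks the gradient a little
print(grad)              # about 8.9e-16: the earliest time steps receive almost no learning signal
```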
Disadvantages of Recurrent Neural Network
• Lack Of Parallelism: RNNs are inherently sequential, which makes it difficult to parallelize the computation. This can limit the speed and scalability of the network.
• Difficulty In Choosing The Right Architecture: There are many different variants of RNNs, each with its own advantages and disadvantages. Choosing the right architecture for a given task can be challenging and may require extensive experimentation and tuning.
• Difficulty In Interpreting The Output: The output of an RNN can be difficult to interpret, especially when dealing with complex inputs such as natural language or audio. This can make it difficult to understand how the network is making its predictions.
Backpropagation Through Time (BPTT)
When we apply the backpropagation algorithm to an RNN whose input is time series data, we call it backpropagation through time.
In a normal forward pass, a single input is sent into the network at a time and a single output is obtained. In backpropagation through time, however, the error at a given time step depends not only on the current input but also on all of the prior inputs in the sequence, so the network is unrolled over its time steps and the error is propagated backward through every one of them.
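Because the same weights are reused at every time step, BPTT sums the gradient contributions from each step of the unrolled network. In standard notation (not taken from the slides):

    ∂L/∂W = Σ (t = 1 … T) ∂L(t)/∂W

where each term ∂L(t)/∂W is itself propagated backward through the states h(t), h(t-1), …, h(1). In practice the unrolling is often truncated to a fixed number of steps to keep training tractable.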
Variant RNN Architectures
1. Long Short-Term Memory (LSTM) Networks
2. Gated Recurrent Unit (GRU) Networks
3. Bidirectional RNNs
4. Encoder-Decoder RNNs
Long Short-Term Memory (LSTM) Networks
LSTM is a type of RNN that is designed to handle the vanishing gradient problem.
LSTM networks are a modified version of RNNs, which makes it easier to remember
past data in memory.
It can process not only single data points (such as images) but also entire sequences of data (such as speech or video). Examples: connected handwriting recognition and speech recognition.
A general LSTM unit is composed of a cell, an input gate, an output gate, and a forget gate. The cell remembers values over arbitrary time intervals, and the three gates regulate the flow of information into and out of the cell.
LSTM is well-suited to classifying, processing, and predicting time series when there are time lags of unknown duration.
Long Short-Term Memory (LSTM) Networks
1. Input gate: It discovers which values from the input should be used to modify the memory. A sigmoid function decides which values to let through (0 or 1), and a tanh function gives weightage to the values that are passed, deciding their level of importance on a scale from -1 to 1.
2. Forget gate: It discovers which details should be discarded from the block. A sigmoid function decides this: it looks at the previous state (ht-1) and the current input (Xt) and outputs a number between 0 (omit this) and 1 (keep this) for each number in the cell state Ct-1.
3. Output gate: The input and the memory of the block are used to decide the output. A sigmoid function decides which values to let through (0 or 1), and a tanh function gives weightage to the values that are passed, deciding their level of importance from -1 to 1; the result is multiplied by the output of the sigmoid.
Workings of LSTMs in RNN
LSTMs work in a 3-step process.
Working of LSTM
Step 1: Decide How Much Past Data It Should Remember
The first step in the LSTM is to decide which information should be omitted from the cell state in that particular time step. The forget gate’s sigmoid function determines this: it looks at the previous state (ht-1) along with the current input (xt) and, for each value in the cell state, outputs a number between 0 (forget) and 1 (keep).
Working of LSTM
Step 2: Decide How Much This Unit Adds to the Current State
This step has two parts: a sigmoid function and a tanh function. The sigmoid function decides which values to let through (0 or 1), and the tanh function gives weightage to the values that are passed, deciding their level of importance (-1 to 1).
With the current input at x(t), the input gate analyzes the important information: John plays football, and the fact that he was the captain of his college team is important.
“He told me yesterday over the phone” is less important; hence it is forgotten. This process of adding some new information is done via the input gate.
Working of LSTM
Step 3: Decide What Part of the Current Cell State Makes It to the Output
The third step is to decide what the output will be. First, we run a sigmoid layer, which decides
what parts of the cell state make it to the output. Then, we put the cell state through tanh to push
the values to be between -1 and 1 and multiply it by the output of the sigmoid gate.
Let’s consider this example to predict the next word in the sentence: “John played tremendously
well against the opponent and won for his team. For his contributions, brave ____ was awarded
player of the match.” There could be many choices for the empty space. The current input brave
is an adjective, and adjectives describe a noun. So, “John” could be the best output after brave.
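A hedged PyTorch sketch of this kind of next-word prediction setup; the vocabulary size, embedding size, and toy token ids are assumptions for illustration only.

```python
import torch
import torch.nn as nn

vocab_size, embed_size, hidden_size = 1000, 32, 64    # illustrative sizes

embed = nn.Embedding(vocab_size, embed_size)
lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
next_word = nn.Linear(hidden_size, vocab_size)

# Toy token ids standing in for "John played tremendously well ... brave"
tokens = torch.randint(0, vocab_size, (1, 12))

outputs, (h_n, c_n) = lstm(embed(tokens))   # the cell state c_n carries the long-term memory
logits = next_word(h_n[-1])                 # scores for the word that should follow "brave"
predicted_id = int(logits.argmax(dim=-1))   # after training, ideally the id of "John"
```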
Variant RNN Architectures
Gated Recurrent Unit (GRU) Networks:
GRU is another type of RNN that is designed to address the vanishing gradient
problem.
It has two gates: the reset gate and the update gate.
The reset gate determines how much of the previous state should be forgotten.
The update gate determines how much of the new state should be remembered.
This allows the GRU network to selectively update its internal state based on the
input sequence.
Working of GRU
• A GRU uses a reset gate and an update gate to address the vanishing gradient problem. These gates decide what information is sent to the output, and they can retain information from far back in the sequence without it diminishing as training continues.
Working of GRU
Reset gate: The reset gate determines how much of the past information needs to be forgotten. It uses the same form of equation as the update gate (see the equations below).
Working of GRU
Update gate: It is responsible for long-term memory. It determines how much of the information from the previous steps must be passed further. In the update gate, we multiply Xt, the current input, by its weight and ht-1 by a separate weight, add the two, and apply a sigmoid.
The candidate state is built in a similar way, except that the Hadamard product, i.e., the element-wise product, is first taken between the output of the reset gate (rt) and the weighted previous state Uht-1; the result is added to the weighted input WXt and passed through the tanh activation function.
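Putting the pieces together in standard notation (a sketch with assumed symbols; conventions for which gate scales the old state versus the candidate vary between references):

    z(t) = σ(Wz · x(t) + Uz · h(t-1))              update gate
    r(t) = σ(Wr · x(t) + Ur · h(t-1))              reset gate (same form)
    ĥ(t) = tanh(W · x(t) + r(t) ⊙ (U · h(t-1)))    candidate state described above
    h(t) = z(t) ⊙ h(t-1) + (1 - z(t)) ⊙ ĥ(t)       final state: a blend of old and new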
This is how the GRU addresses the vanishing gradient problem: it keeps the relevant information and passes it down to the next step. It can perform excellently if trained correctly.
Variant RNN Architectures
Bidirectional RNNs: Bidirectional RNNs are designed to process input
sequences in both forward and backward directions. This allows the network to
capture both past and future context, which can be useful for speech recognition
and natural language processing tasks.
Encoder-Decoder RNNs: Encoder-decoder RNNs consist of two RNNs: an
encoder network that processes the input sequence and produces a fixed-length
vector representation of the input and a decoder network that generates the output
sequence based on the encoder's representation. This architecture is commonly
used for sequence-to-sequence tasks such as machine translation.
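A hedged PyTorch sketch of the encoder-decoder idea using two GRUs; all sizes, the start-of-sentence id, and the greedy decoding loop are illustrative assumptions, not a complete translation system.

```python
import torch
import torch.nn as nn

src_vocab, tgt_vocab, embed_size, hidden_size = 1000, 1200, 32, 64  # assumed sizes

src_embed = nn.Embedding(src_vocab, embed_size)
tgt_embed = nn.Embedding(tgt_vocab, embed_size)
encoder = nn.GRU(embed_size, hidden_size, batch_first=True)
decoder = nn.GRU(embed_size, hidden_size, batch_first=True)
generator = nn.Linear(hidden_size, tgt_vocab)

# Encoder: compress the whole source sentence into a fixed-length vector (its final state)
src_tokens = torch.randint(0, src_vocab, (1, 9))
_, context = encoder(src_embed(src_tokens))          # context: (1, 1, hidden_size)

# Decoder: generate the output sequence one token at a time from that context
token = torch.zeros(1, 1, dtype=torch.long)          # assumed start-of-sentence id 0
state = context
generated = []
for _ in range(6):                                    # decode at most 6 tokens (toy limit)
    out, state = decoder(tgt_embed(token), state)
    token = generator(out[:, -1]).argmax(dim=-1, keepdim=True)  # greedy choice of next word
    generated.append(token.item())
print(generated)                                      # ids of the generated output sequence
```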