LSTM

Long Short-Term Memory (LSTM) is an advanced version of Recurrent Neural Networks (RNN) that effectively captures long-term dependencies in sequential data, making it suitable for tasks such as language translation and speech recognition. LSTMs address the vanishing and exploding gradient problems faced by traditional RNNs by incorporating a memory cell controlled by three gates: input, forget, and output. This architecture allows LSTMs to selectively manage information flow, enabling them to learn long-term patterns more effectively.

What is LSTM – Long Short-Term Memory?

Long Short-Term Memory (LSTM) is an enhanced version of the Recurrent Neural Network (RNN) designed by Hochreiter & Schmidhuber. LSTMs can capture long-term dependencies in sequential data, making them well suited for tasks like language translation, speech recognition and time series forecasting. Unlike traditional RNNs, which pass a single hidden state through time, LSTMs introduce a memory cell that holds information over extended periods, addressing the challenge of learning long-term dependencies.
Problem with Long-Term Dependencies in RNNs
Recurrent Neural Networks (RNNs) are designed to handle sequential data by maintaining a hidden state that captures information from previous time steps. However, they often struggle to learn long-term dependencies, where information from distant time steps is crucial for making an accurate prediction at the current step. This difficulty shows up as the vanishing gradient or exploding gradient problem; a small numerical sketch of both effects follows the list below.
• Vanishing Gradient: As gradients are propagated back through many time steps during training, they can shrink toward zero. This makes it hard for the model to learn long-term patterns, since the contribution of earlier inputs becomes almost irrelevant.

• Exploding Gradient: Conversely, gradients can grow too large, causing instability. The resulting weight updates become erratic and unpredictable, making it difficult for the model to learn properly.
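The Python sketch below illustrates both effects with assumed numbers (a single scalar recurrent factor and 50 time steps); it is not taken from any real network. Backpropagation through time multiplies the gradient by one such factor per step, so a factor slightly below 1 vanishes and a factor slightly above 1 explodes.

steps = 50
for w in (0.9, 1.1):           # assumed recurrent factors: one just below 1, one just above
    grad = 1.0
    for _ in range(steps):
        grad *= w              # one multiplicative factor per time step
    print(f"factor {w}: gradient after {steps} steps is about {grad:.6f}")

With these values the gradient shrinks to roughly 0.005 for the 0.9 factor (vanishing) and grows to roughly 117 for the 1.1 factor (exploding), even though both factors are close to 1.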
LSTM Architecture
The LSTM architecture is built around a memory cell controlled by three gates: the input gate, the forget gate and the output gate. These gates decide what information is added to, removed from and output from the memory cell.
•Input gate: Controls what information is added to the memory cell.

•Forget gate: Determines what information is removed from the memory cell.

•Output gate: Controls what information is output from the memory cell.

This lets LSTM networks selectively retain or discard information as it flows through the network, which is what allows them to learn long-term dependencies. The network also has a hidden state, which acts as its short-term memory; this memory is updated using the current input, the previous hidden state and the current state of the memory cell. A minimal sketch of one forward step is given below.
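The following NumPy sketch shows one forward step of an LSTM cell under the standard gate formulation described above. The function and parameter names (lstm_step, W_f, U_f, b_f and so on) are chosen for this example and do not come from any particular library.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x_t, h_prev, c_prev, p):
    # One forward step of an LSTM cell; p holds illustrative weight matrices and biases.
    f = sigmoid(p["W_f"] @ x_t + p["U_f"] @ h_prev + p["b_f"])   # forget gate
    i = sigmoid(p["W_i"] @ x_t + p["U_i"] @ h_prev + p["b_i"])   # input gate
    o = sigmoid(p["W_o"] @ x_t + p["U_o"] @ h_prev + p["b_o"])   # output gate
    g = np.tanh(p["W_g"] @ x_t + p["U_g"] @ h_prev + p["b_g"])   # candidate cell values
    c_t = f * c_prev + i * g        # forget part of the old cell state, add new information
    h_t = o * np.tanh(c_t)          # hidden state: a gated view of the memory cell
    return h_t, c_t

# Tiny usage example with random weights (input size 3, hidden size 4).
rng = np.random.default_rng(0)
n_in, n_hid = 3, 4
p = {}
for gate in ("f", "i", "o", "g"):
    p[f"W_{gate}"] = rng.normal(scale=0.1, size=(n_hid, n_in))
    p[f"U_{gate}"] = rng.normal(scale=0.1, size=(n_hid, n_hid))
    p[f"b_{gate}"] = np.zeros(n_hid)

h, c = np.zeros(n_hid), np.zeros(n_hid)
for x_t in rng.normal(size=(5, n_in)):   # a short sequence of 5 input vectors
    h, c = lstm_step(x_t, h, c, p)
print("final hidden state:", h)

Note how the forget gate scales the previous cell state while the input gate scales the new candidate values; the output gate then decides how much of the memory cell is exposed as the hidden state.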
