Unit 5.6: LSTM


PYTHON PROGRAMMING & DATA SCIENCE

Long Short-Term Memory Networks (LSTM)


Long Short-Term Memory Networks:
Long Short-Term Memory networks (LSTMs) are the most popular and effective way to deal with the gradient problems (vanishing and exploding gradients) that affect standard recurrent networks.
Long-Term Dependencies:
Suppose we want to predict the last word in the text:
“The clouds are in the ______.”
The most obvious answer is “sky.”
Consider this sentence:
“I have been staying in Spain for the last 10 years…I can speak fluent ______.”
The word we predict will depend on the previous few words in context.
Here we need the context of Spain to predict the last word in the text, and the
most suitable answer to this sentence is “Spanish.”
The gap between the relevant information and the point where it is needed can become very large. LSTMs help us solve this problem.
Long Short-Term Memory Networks:
LSTMs are a special kind of recurrent neural network, capable of learning long-term dependencies; remembering information for long periods is their default behavior.
All recurrent neural networks have the form of a chain of repeating modules of neural network. In standard RNNs, this repeating module has a very simple structure, such as a single tanh layer.
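For comparison, one step of such a standard RNN can be sketched in a few lines of NumPy; the layer sizes and random weights are illustrative, not taken from the slides:

import numpy as np

hidden_size, input_size = 4, 3
rng = np.random.default_rng(42)

W_h = rng.normal(size=(hidden_size, hidden_size))   # recurrent weights
W_x = rng.normal(size=(hidden_size, input_size))    # input weights
b = np.zeros(hidden_size)

h_prev = np.zeros(hidden_size)      # previous hidden state h_{t-1}
x_t = rng.normal(size=input_size)   # current input x_t

# The whole repeating module of a vanilla RNN: a single tanh layer.
h_t = np.tanh(W_h @ h_prev + W_x @ x_t + b)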
LSTMs also have this chain-like structure, but the repeating module is structured differently: instead of a single neural network layer, there are four layers interacting in a very special way.
Workings of LSTMs:
Diagrammatically, the working of an LSTM can be broken into a three-step process.


Step 1: Decide how much past data it should remember
The first step in the LSTM is to decide which information should be omitted from the cell state at that particular time step.
A sigmoid function (the forget gate) determines this.
It looks at the previous hidden state h_{t-1} along with the current input x_t and outputs, for each number in the cell state, a value between 0 (completely forget) and 1 (completely keep).
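A minimal NumPy sketch of this forget gate; the dimensions and random weights are illustrative, since the slides give no concrete numbers:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

hidden_size, input_size = 4, 3
rng = np.random.default_rng(0)

W_f = rng.normal(size=(hidden_size, hidden_size + input_size))  # forget-gate weights
b_f = np.zeros(hidden_size)                                     # forget-gate bias

h_prev = np.zeros(hidden_size)      # previous hidden state h_{t-1}
x_t = rng.normal(size=input_size)   # current input x_t

# f_t = sigmoid(W_f . [h_{t-1}, x_t] + b_f): 0 means forget, 1 means keep.
f_t = sigmoid(W_f @ np.concatenate([h_prev, x_t]) + b_f)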
Step 2: Decide how much this unit adds to the current state 
This second layer has two parts: a sigmoid function and a tanh function.
The sigmoid function (the input gate) decides which values to let through (0 to 1).
The tanh function gives weight to the values that are passed, deciding their level of importance (-1 to 1).
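A matching NumPy sketch of this second step; the sizes are again illustrative, and f_t stands in for the forget-gate output from step 1:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

hidden_size, input_size = 4, 3
rng = np.random.default_rng(1)

W_i = rng.normal(size=(hidden_size, hidden_size + input_size))  # input-gate weights
W_c = rng.normal(size=(hidden_size, hidden_size + input_size))  # candidate-value weights
b_i, b_c = np.zeros(hidden_size), np.zeros(hidden_size)

h_prev = np.zeros(hidden_size)      # previous hidden state h_{t-1}
x_t = rng.normal(size=input_size)   # current input x_t
c_prev = np.zeros(hidden_size)      # previous cell state C_{t-1}
f_t = np.full(hidden_size, 0.5)     # placeholder forget gate from step 1

z = np.concatenate([h_prev, x_t])
i_t = sigmoid(W_i @ z + b_i)        # sigmoid: which values to let through (0 to 1)
c_tilde = np.tanh(W_c @ z + b_c)    # tanh: importance of candidate values (-1 to 1)

# New cell state: forget part of the old state, then add the gated candidate.
c_t = f_t * c_prev + i_t * c_tilde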
Step 3: Decide what part of the current cell state makes it to the output
The third step is to decide what the output will be.
First, we run a sigmoid layer, which decides what parts of the cell state make
it to the output.
Then, we put the cell state through tanh to push the values to be between -1
and 1 and multiply it by the output of the sigmoid gate.
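A sketch of this final step in the same style; c_t stands in for the cell state produced in step 2:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

hidden_size, input_size = 4, 3
rng = np.random.default_rng(2)

W_o = rng.normal(size=(hidden_size, hidden_size + input_size))  # output-gate weights
b_o = np.zeros(hidden_size)

h_prev = np.zeros(hidden_size)      # previous hidden state h_{t-1}
x_t = rng.normal(size=input_size)   # current input x_t
c_t = rng.normal(size=hidden_size)  # placeholder cell state from step 2

# The sigmoid gate decides which parts of the cell state make it to the output.
o_t = sigmoid(W_o @ np.concatenate([h_prev, x_t]) + b_o)

# Push the cell state to between -1 and 1 with tanh, then gate it.
h_t = o_t * np.tanh(c_t)            # new hidden state / output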
Applications of LSTM:
Some well-known applications of LSTM include:
1. Language modelling
2. Machine translation
3. Image captioning
4. Handwriting generation
5. Question-answering chatbots
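In practice these gates are rarely written by hand; a deep-learning library supplies the LSTM layer. A minimal Keras sketch for a sequence task such as language modelling (the vocabulary size, sequence length, and layer sizes below are illustrative, not from the slides):

import tensorflow as tf

vocab_size, embed_dim, seq_len = 5000, 64, 40   # illustrative values

model = tf.keras.Sequential([
    tf.keras.layers.Embedding(vocab_size, embed_dim),          # token embeddings
    tf.keras.layers.LSTM(128),                                 # one LSTM layer
    tf.keras.layers.Dense(vocab_size, activation="softmax"),   # next-word distribution
])

model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
model.build(input_shape=(None, seq_len))   # batches of token-id sequences
model.summary()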
