
Deep Learning Application for Communication Engineering
Subject Code: ECE7419

By

Dr. RAM SEWAK SINGH


Associate Professor

Electronics and Communication Engineering Department


School of Electrical Engineering and Computing
Adama Science and Technology University,
Ethiopia, P.O. Box: 1888
Chapter 5: Sequence Models

Lecture Overview

• Problems with ML, ANN, and CNN

• Recurrent Neural Networks (RNN) for sequences

• Backpropagation Through Time

• Vanishing and Exploding Gradients and Remedies

• RNNs using Long Short-Term Memory (LSTM)
ANN
Recurrent Neural Network

A recurrent neural network (RNN) is a deep learning structure that uses past
information to improve the performance of the network on current and future
inputs. What makes an RNN unique is that the network contains a hidden state and
loops.

The looping structure allows the network to store past information in the hidden
state and operate on sequences.

A recurrent neural network (RNN) is a network architecture for deep learning that makes predictions on time-series or sequential data.
RNNs are particularly effective for working with sequential data that varies in length, and for solving problems such as natural signal classification, language processing, and video analysis.

What is a sequence, really?

• Data inside a sequence are not i.i.d. (independently, identically distributed).

• The next “word” depends on the previous “words”, ideally on all of them.

• We need context, and we need memory!

• How do we model context and memory?
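As a rough illustration of the answer, here is a minimal NumPy sketch of an RNN that keeps context in a hidden state. The weight names (Wxh, Whh, Why) and the layer sizes are illustrative assumptions, not taken from the slides.

    import numpy as np

    # Illustrative sizes; assumptions for this sketch only
    input_size, hidden_size, output_size = 8, 16, 8

    rng = np.random.default_rng(0)
    Wxh = rng.standard_normal((hidden_size, input_size)) * 0.01   # input -> hidden
    Whh = rng.standard_normal((hidden_size, hidden_size)) * 0.01  # hidden -> hidden (the loop / memory)
    Why = rng.standard_normal((output_size, hidden_size)) * 0.01  # hidden -> output

    def rnn_forward(inputs):
        """Run the RNN over a sequence; inputs is a list of (input_size, 1) vectors."""
        h = np.zeros((hidden_size, 1))            # hidden state: the network's memory
        outputs = []
        for x in inputs:
            # the new hidden state mixes the current input with the previous state,
            # so information from earlier steps can influence later predictions
            h = np.tanh(Wxh @ x + Whh @ h)
            outputs.append(Why @ h)
        return outputs, h

    sequence = [rng.standard_normal((input_size, 1)) for _ in range(5)]
    outputs, final_state = rnn_forward(sequence)
    print(len(outputs), final_state.shape)        # 5 outputs, (16, 1) final hidden state

The same weights are reused at every time step, which is what lets the looping structure operate on sequences of any length.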


Mathematical Model

Example: Predict Sequence of Characters
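The worked example itself appears in the slide figures. As a hedged companion, the setup for predicting the next character typically looks like the toy sketch below; the word "hello" and the one-hot encoding are illustrative assumptions.

    import numpy as np

    # Hypothetical toy corpus; the slides' actual example is shown in the figures
    text = "hello"
    chars = sorted(set(text))                       # vocabulary: ['e', 'h', 'l', 'o']
    char_to_ix = {c: i for i, c in enumerate(chars)}

    def one_hot(c):
        """Encode a character as a one-hot column vector."""
        v = np.zeros((len(chars), 1))
        v[char_to_ix[c]] = 1.0
        return v

    # Each character is paired with the character that follows it:
    # 'h' -> 'e', 'e' -> 'l', 'l' -> 'l', 'l' -> 'o'
    training_pairs = [(one_hot(a), char_to_ix[b]) for a, b in zip(text, text[1:])]
    print(len(training_pairs))                      # 4 (input, next-character) pairs

An RNN like the one sketched earlier would then be trained so that, at each step, its output scores the correct next character highest.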

Problems with RNNs

Long Short-Term Memory (LSTM) Networks

Long Short-Term Memory networks, usually just called “LSTMs”, are a special kind of RNN, capable of learning long-term dependencies.

• LSTMs are explicitly designed to avoid the long-term dependency problem. Remembering information for long periods of time is practically their default behavior, not something they struggle to learn!

• All recurrent neural networks have the form of a chain of repeating modules of neural network. In standard RNNs, this repeating module will have a very simple structure, such as a single tanh layer.

LSTMs also have this chain-like structure, but the repeating module has a different structure. Instead of having a single neural network layer, there are four, interacting in a very special way.

The repeating module of an LSTM contains: (1) a memory cell, (2) a forget gate, (3) an input gate, and (4) an output gate.

In the above diagram, each line carries an entire vector, from the output of one node to the inputs of others.

The pink circles represent pointwise operations, like vector addition, while the
yellow boxes are learned neural network layers.

Lines merging denote concatenation, while a line forking denotes its content being copied and the copies going to different locations.

The Core Idea Behind LSTMs

The key to LSTMs is the cell state, the horizontal line running through the top of the
diagram. The cell state is kind of like a conveyor belt. It runs straight down the entire
chain, with only some minor linear interactions. It’s very easy for information to just
flow along it unchanged.


The LSTM does have the ability to remove or add information to the cell state,
carefully regulated by structures called gates.

Gates are a way to optionally let information through. They are composed of a sigmoid neural net layer and a pointwise multiplication operation.
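As a small numerical illustration (the values below are made up), the sigmoid layer outputs numbers between 0 and 1, and the pointwise multiplication then decides how much of each component passes through.

    import numpy as np

    cell_state = np.array([0.8, -1.2, 0.5])
    gate = np.array([0.99, 0.02, 0.60])   # sigmoid outputs: ~1 means "let through", ~0 means "block"
    print(gate * cell_state)              # pointwise product: [ 0.792, -0.024, 0.3 ]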

Step-by-Step LSTM Walk-Through
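The step-by-step figures are not reproduced here. As a hedged summary, one LSTM step with the standard gate equations can be sketched as follows; the weight names, the concatenation of h_prev and x, and the layer sizes are assumptions made for illustration.

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    hidden_size, input_size = 16, 8                    # illustrative sizes
    rng = np.random.default_rng(1)
    # One learned layer (weight matrix) per gate, acting on the concatenation [h_prev; x]
    Wf, Wi, Wc, Wo = (rng.standard_normal((hidden_size, hidden_size + input_size)) * 0.01
                      for _ in range(4))
    bf, bi, bc, bo = (np.zeros((hidden_size, 1)) for _ in range(4))

    def lstm_step(x, h_prev, c_prev):
        z = np.vstack([h_prev, x])       # concatenation (merging lines in the diagram)
        f = sigmoid(Wf @ z + bf)         # forget gate: what to discard from the cell state
        i = sigmoid(Wi @ z + bi)         # input gate: what new information to store
        c_tilde = np.tanh(Wc @ z + bc)   # candidate values to add to the cell state
        c = f * c_prev + i * c_tilde     # updated cell state (the "conveyor belt")
        o = sigmoid(Wo @ z + bo)         # output gate: which parts of the cell state to expose
        h = o * np.tanh(c)               # new hidden state / output of this step
        return h, c

    x = rng.standard_normal((input_size, 1))
    h0 = np.zeros((hidden_size, 1))
    c0 = np.zeros((hidden_size, 1))
    h1, c1 = lstm_step(x, h0, c0)
    print(h1.shape, c1.shape)            # (16, 1) (16, 1)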

Variants on Long Short-Term Memory

Autoencoders

Autoencoders are a specific type of feedforward neural network where the input is the same as the output. They compress the input into a lower-dimensional code and then reconstruct the output from this representation.

• The code is a compact “summary” or “compression” of the input, also called the latent-space representation.

• An autoencoder consists of 3 components: encoder, code, and decoder. The encoder compresses the input and produces the code; the decoder then reconstructs the input using only this code.
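A minimal sketch of this encoder–code–decoder layout, written with Keras; the layer sizes, the 784-dimensional input (e.g. a flattened 28x28 image), and the variable names are illustrative assumptions rather than anything prescribed by the slides.

    from tensorflow.keras import layers, Model

    input_dim, code_dim = 784, 32                         # assumed sizes for illustration

    inputs = layers.Input(shape=(input_dim,))
    # Encoder: compress the input down to the code (latent-space representation)
    x = layers.Dense(128, activation="relu")(inputs)
    code = layers.Dense(code_dim, activation="relu")(x)
    # Decoder: reconstruct the input using only the code
    x = layers.Dense(128, activation="relu")(code)
    outputs = layers.Dense(input_dim, activation="sigmoid")(x)

    autoencoder = Model(inputs, outputs)
    autoencoder.compile(optimizer="adam", loss="mse")
    autoencoder.summary()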

Autoencoders are mainly a dimensionality reduction (or compression) algorithm with a couple of important properties:

• Data-specific: Autoencoders are only able to meaningfully compress data similar to what they have been trained on. Since they learn features specific to the given training data, they are different from a standard data compression algorithm like Gzip.

• Lossy: The output of the autoencoder will not be exactly the same as the input; it will be a close but degraded representation. If you want lossless compression, they are not the way to go.

• Unsupervised: To train an autoencoder we don’t need to do anything fancy, just throw the raw input data at it. Autoencoders are considered an unsupervised learning technique since they don’t need explicit labels to train on. But, to be more precise, they are self-supervised because they generate their own labels from the training data.
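In code, this self-supervision simply means the input serves as its own target; continuing the hypothetical Keras model sketched above:

    import numpy as np

    # Hypothetical unlabeled data standing in for the raw inputs
    x_train = np.random.rand(1000, 784).astype("float32")

    # No separate labels: the input itself is the reconstruction target
    autoencoder.fit(x_train, x_train, epochs=10, batch_size=64, validation_split=0.1)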

Architecture

Applications include anomaly detection, text generation, image generation, image denoising, and digital communications.

• Autoencoders will naturally ignore any input noise as the encoder is trained. This feature is ideal for removing noise or detecting anomalies when the inputs and outputs are compared (see Figures 2 and 3).

• The latent representation can also be used to generate synthetic data. For example, you can automatically create realistic-looking handwriting or phrases of text (Figure 4).

• Time-series-based autoencoders can also be used to detect anomalies in signal data. For example, in predictive maintenance, an autoencoder can be trained on normal operating data from an industrial machine (Figure 5).

Figure 5: Training on normal operating data for predictive maintenance.


• The trained autoencoder is then tested on new incoming data. A large variation between the input and the autoencoder’s output indicates abnormal operation, which could require investigation (Figure 6).
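One common way to turn this comparison into a detector is to threshold the reconstruction error. A hedged sketch, continuing the hypothetical autoencoder above; the mean-plus-three-standard-deviations threshold is an assumption for illustration.

    import numpy as np

    def reconstruction_error(model, x):
        """Per-sample mean squared error between the input and its reconstruction."""
        x_hat = model.predict(x, verbose=0)
        return np.mean((x - x_hat) ** 2, axis=1)

    # Calibrate a threshold on normal operating data only
    normal_errors = reconstruction_error(autoencoder, x_train)
    threshold = normal_errors.mean() + 3 * normal_errors.std()

    def is_anomalous(model, x_new):
        """Flag samples whose reconstruction error is unusually large."""
        return reconstruction_error(model, x_new) > threshold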


Thank you for your attention!!
