
Neural Network and Deep Learning

Md Shad Akhtar
Research Scholar
IIT Patna
Neural Network
• Mimics the functionality of a brain.
• A neural network is a graph with neurons (nodes, units, etc.) connected by links.
Neural Network: Neuron
Neural Network: Perceptron
• Network with only a single layer.
• No hidden layers.
Neural Network: Perceptron
Exercise: choose weights and a threshold t so that a single neuron computes each gate.
• AND gate: inputs X1, X2; weights W1 = ?, W2 = ?; threshold t = ?
• OR gate: inputs X1, X2; weights W1 = ?, W2 = ?; threshold t = ?
• NOT gate: input X1; weight W1 = ?; threshold t = ?
Neural Network: Perceptron
Solution (the output a fires when the weighted input sum exceeds t; these settings are checked in the sketch below):
• AND gate: W1 = 1, W2 = 1, t = 1.5
• OR gate: W1 = 1, W2 = 1, t = 0.5
• NOT gate: W1 = -1, t = -0.5
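A minimal sketch (not from the slides) verifying these weight/threshold settings against the gate truth tables; the perceptron here outputs 1 exactly when the weighted input sum exceeds the threshold t:

```python
import numpy as np

def perceptron(inputs, weights, threshold):
    """Fire (output 1) iff the weighted sum of the inputs exceeds the threshold."""
    return int(np.dot(inputs, weights) > threshold)

# AND: W1 = W2 = 1, t = 1.5
for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "AND ->", perceptron([x1, x2], [1, 1], 1.5))

# OR: W1 = W2 = 1, t = 0.5
for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "OR  ->", perceptron([x1, x2], [1, 1], 0.5))

# NOT: W1 = -1, t = -0.5
for x1 in (0, 1):
    print(x1, "NOT ->", perceptron([x1], [-1], -0.5))
```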
Neural Network: Multi Layer Perceptron
(MLP) or Feed-Forward Network (FNN)
• Network with n + 1 layers: one output layer and n hidden layers.
Training: Backpropagation algorithm
• A gradient descent algorithm.
1. Initialize the network with random weights.
2. For all training cases (called examples):
   a. Present the training inputs to the network and calculate the output.
   b. For all layers (starting with the output layer, back to the input layer):
      i. Compare the network output with the correct output (error function).
      ii. Adapt the weights in the current layer.
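As an illustration of this loop, here is a hedged numpy sketch; the network size (2-4-1), sigmoid activations, squared error, learning rate, and the XOR training set are all assumptions, not from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

# Assumed toy task: learn XOR with a 2-4-1 sigmoid network.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
Y = np.array([[0], [1], [1], [0]], dtype=float)

# Step 1: initialize the network with random weights.
W1, b1 = rng.normal(0, 1, (2, 4)), np.zeros((1, 4))
W2, b2 = rng.normal(0, 1, (4, 1)), np.zeros((1, 1))

lr = 1.0
for epoch in range(5000):
    # Step 2a: present the training inputs and calculate the output.
    h = sigmoid(X @ W1 + b1)
    y = sigmoid(h @ W2 + b2)

    # Step 2b-i: compare the network output with the correct output.
    delta_out = (y - Y) * y * (1 - y)             # error signal at the output
    delta_hid = (delta_out @ W2.T) * h * (1 - h)  # propagated back one layer

    # Step 2b-ii: adapt the weights in each layer.
    W2 -= lr * h.T @ delta_out
    b2 -= lr * delta_out.sum(0, keepdims=True)
    W1 -= lr * X.T @ delta_hid
    b1 -= lr * delta_hid.sum(0, keepdims=True)

print(y.round(2))  # should approach [[0], [1], [1], [0]]
```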
Deep Learning
What is Deep Learning?
• A family of methods that uses deep architectures to learn high-level feature representations.
Example 1: [Figure: an input image recognized as "MAN"]
Example 2: [Figure]
Why are Deep Architectures hard to train?

• Vanishing/exploding gradient problem in backpropagation: the error signal is multiplied through many layers, so it can shrink towards zero or grow without bound before it reaches the early layers.
Layer-wise Pre-training
• First, train one layer at a time, optimizing the data-likelihood objective P(x).
• Then, train the second layer, optimizing the data-likelihood objective P(h) (treating the first layer's hidden activations h as data).
• Finally, fine-tune the labelled objective P(y|x) by backpropagation.
Deep Belief Nets
• Uses Restricted Boltzmann Machines (RBMs).
• Hinton et al. (2006), "A fast learning algorithm for deep belief nets."
Restricted Boltzmann Machine (RBM)
• RBM is a simple energy-based model:

  P(x, h) = exp(−E(x, h)) / Z

  where E(x, h) = −hᵀWx − bᵀx − dᵀh, and Z = Σ_{x,h} exp(−E(x, h)) is the partition function.

Example:
• Let the weights (h1, x1) and (h1, x3) be positive, all others zero, and b = d = 0.
• Which configuration (x, h) has the highest probability p(x, h)?
• Ans: the maximum is at p(x1 = 1, x2 = 0, x3 = 1, h1 = 1, h2 = 0, h3 = 0).
Restricted Boltzmann Machine (RBM)
• P(x, h) = P(h|x) P(x)
• P(h|x): easy to compute (the hidden units are conditionally independent given x).
• P(x): hard to compute (the partition function Z sums over exponentially many configurations).

Contrastive Divergence: approximate the gradient of log P(x) with only a few steps of Gibbs sampling, instead of exact expectations under the model.
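A minimal numpy sketch of CD-1 for a binary RBM, following the W, b, d naming in the energy above; the learning rate, training pattern, and number of steps are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

n_visible, n_hidden, lr = 3, 3, 0.1
W = rng.normal(0, 0.1, (n_hidden, n_visible))   # pairwise weights (h_i, x_j)
b = np.zeros(n_visible)                         # visible biases
d = np.zeros(n_hidden)                          # hidden biases

x = np.array([1.0, 0.0, 1.0])                   # the pattern from the example
for step in range(1000):
    # Positive phase: P(h|x) factorizes, so it is easy to compute and sample.
    ph = sigmoid(W @ x + d)
    h = (rng.random(n_hidden) < ph).astype(float)
    # Negative phase: one Gibbs step x -> h -> x' -> h', instead of sampling
    # from the model's (intractable) equilibrium distribution.
    px = sigmoid(W.T @ h + b)
    x_neg = (rng.random(n_visible) < px).astype(float)
    ph_neg = sigmoid(W @ x_neg + d)
    # CD-1 update: data statistics minus reconstruction statistics.
    W += lr * (np.outer(ph, x) - np.outer(ph_neg, x_neg))
    b += lr * (x - x_neg)
    d += lr * (ph - ph_neg)
```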
Deep Belief Nets (DBN) = Stacked RBM
Auto-Encoders: a simpler alternative to RBMs
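For concreteness, a hedged sketch of a one-hidden-layer auto-encoder with tied weights (a common but assumed design choice): it encodes x into features h and is trained to reconstruct x from h, so the layer can be pre-trained without labels.

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

X = rng.integers(0, 2, (32, 8)).astype(float)   # toy binary data (assumed)
W = rng.normal(0, 0.1, (8, 4))                  # encoder weights; decoder = W.T
b_h, b_x, lr = np.zeros(4), np.zeros(8), 0.5

for epoch in range(2000):
    h = sigmoid(X @ W + b_h)          # encode: the learned feature layer
    x_hat = sigmoid(h @ W.T + b_x)    # decode: reconstruction of the input
    # Backpropagate the squared reconstruction error through both halves.
    d_out = (x_hat - X) * x_hat * (1 - x_hat)
    d_hid = (d_out @ W) * h * (1 - h)
    W -= lr * (X.T @ d_hid + d_out.T @ h) / len(X)   # tied-weight gradient
    b_x -= lr * d_out.mean(0)
    b_h -= lr * d_hid.mean(0)

print(((x_hat - X) ** 2).mean())  # reconstruction error shrinks with training
```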
Deep Learning: Architectures
• Recurrent Neural Network (RNN)
• Convolutional Neural Network (CNN)
Recurrent Neural Network (RNN)
• Enables networks to do temporal processing and learn sequences.

Character-level language model (vocabulary: [h, e, l, o])
[Figure: the network reads one character at a time and predicts the next one]
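A minimal sketch of one forward step of such a character-level model, using the U, W, V weight names from the BPTT slide below; the hidden size and initialization are assumptions:

```python
import numpy as np

vocab = ['h', 'e', 'l', 'o']
V_size, H = len(vocab), 3
rng = np.random.default_rng(0)
U = rng.normal(0, 0.1, (H, V_size))   # input  -> hidden
W = rng.normal(0, 0.1, (H, H))        # hidden -> hidden (the recurrence)
V = rng.normal(0, 0.1, (V_size, H))   # hidden -> output

def step(ch, h_prev):
    """Consume one character; return next-char probabilities and new state."""
    x = np.zeros(V_size)
    x[vocab.index(ch)] = 1.0                  # one-hot input
    h = np.tanh(U @ x + W @ h_prev)           # new hidden state
    logits = V @ h
    p = np.exp(logits) / np.exp(logits).sum() # softmax over the vocabulary
    return p, h

h = np.zeros(H)
for ch in "hell":            # feed "hell"; the training targets would be "ello"
    p, h = step(ch, h)
print(dict(zip(vocab, p.round(2))))  # untrained, so roughly uniform
```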

Training of RNN: BPTT
[Figure: the network unrolled through time, with weight matrices U (input to hidden), W (hidden to hidden), and V (hidden to output); the legend distinguishes predicted from actual outputs]

RNN input-output configurations:
• One to many: sequence output (e.g. image captioning takes an image and outputs a sentence of words).
• Many to one: sequence input (e.g. sentiment analysis, where a given sentence is classified as expressing positive or negative sentiment).
• Many to many: sequence input and sequence output (e.g. machine translation: an RNN reads a sentence in English and then outputs a sentence in French).
• Many to many (synced): sequence input and output in lockstep (e.g. language modelling, where we wish to predict the next word at every step).
RNN Extensions
• Bidirectional RNN
• Deep (Bidirectional) RNNs
RNN (Cont..)
• "the clouds are in the sky"
[Figure: after reading "the clouds are in the", the network predicts "sky" from nearby context]

RNN (Cont..)
• "India is my home country. I can speak fluent Hindi."
[Figure: predicting "Hindi" requires remembering "India" from many steps earlier]

It is very hard for an RNN to learn such "long-term dependencies".


LSTM
• Capable of learning long-term dependencies.

[Figures: the repeating module in a simple RNN vs. in an LSTM]
LSTM
• The LSTM removes or adds information to the cell state, carefully regulated by structures called gates.

• Cell state: the "conveyor belt" of the cell.
LSTM
• Gates
– Forget Gate
– Input Gate
– Output Gate
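A minimal numpy sketch of a single LSTM step showing the three gates regulating the cell state; the weight shapes, one-matrix-per-gate layout, and sizes are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

n_in, n_hid = 4, 3
# One weight matrix per gate, plus one for the candidate cell update.
Wf, Wi, Wo, Wc = (rng.normal(0, 0.1, (n_hid, n_in + n_hid)) for _ in range(4))
bf, bi, bo, bc = (np.zeros(n_hid) for _ in range(4))

def lstm_step(x, h_prev, c_prev):
    z = np.concatenate([h_prev, x])  # gates see the previous state and input
    f = sigmoid(Wf @ z + bf)         # forget gate: what to erase from the cell
    i = sigmoid(Wi @ z + bi)         # input gate: how much new info to write
    o = sigmoid(Wo @ z + bo)         # output gate: what to expose as h
    c_tilde = np.tanh(Wc @ z + bc)   # candidate values to write
    c = f * c_prev + i * c_tilde     # the "conveyor belt": cell state update
    h = o * np.tanh(c)
    return h, c

h, c = np.zeros(n_hid), np.zeros(n_hid)
h, c = lstm_step(rng.normal(size=n_in), h, c)
print(h.round(3), c.round(3))
```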
LSTM: Variants
Convolutional Neural Network (CNN)

• A special kind of multi-layer neural network.
• Implicitly extracts relevant features.
• A fully-connected network architecture does not take the spatial structure of the input into account; in contrast, a CNN tries to take advantage of this spatial structure.
Convolutional Neural Network (CNN)

1. Convolutional layer
2. Pooling layer
3. Fully connected layer
Convolutional Neural Network (CNN)

1. Convolutional layer

Convolution filter (3 x 3):
1 0 1
0 1 0
1 0 1

Image (5 x 5):
1 1 1 0 0
0 1 1 1 0
0 0 1 1 1
0 0 1 1 0
0 1 1 0 0

The filter slides across the image, producing one output value per position.
• Local receptive field: each output unit is connected only to a small patch of the image.
• Shared weights: the same filter weights are reused at every position.
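The sliding-window computation, spelled out with the exact 3x3 filter and 5x5 image above (implemented as cross-correlation, as is conventional in CNNs):

```python
import numpy as np

kernel = np.array([[1, 0, 1],
                   [0, 1, 0],
                   [1, 0, 1]])
image = np.array([[1, 1, 1, 0, 0],
                  [0, 1, 1, 1, 0],
                  [0, 0, 1, 1, 1],
                  [0, 0, 1, 1, 0],
                  [0, 1, 1, 0, 0]])

out = np.zeros((3, 3), dtype=int)    # 5 - 3 + 1 = 3 valid positions per axis
for i in range(3):
    for j in range(3):
        # Each output cell sees only a 3x3 local receptive field, and every
        # position reuses the same (shared) filter weights.
        out[i, j] = (image[i:i+3, j:j+3] * kernel).sum()

print(out)
# [[4 3 4]
#  [2 4 3]
#  [2 3 4]]
```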
Convolutional Neural Network (CNN)

2. Pooling layer
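A small sketch of max pooling, which downsamples each feature map by keeping the strongest activation in each window; the 2x2 window, stride 2, and the toy feature map are assumptions:

```python
import numpy as np

feature_map = np.array([[1, 3, 2, 1],
                        [4, 6, 5, 0],
                        [3, 1, 1, 2],
                        [0, 2, 4, 3]])

# Split the 4x4 map into 2x2 blocks, then take the max within each block.
pooled = feature_map.reshape(2, 2, 2, 2).max(axis=(1, 3))
print(pooled)   # [[6 5]
                #  [3 4]]
```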
Convolutional Neural Network (CNN)

3. Fully connected layer
[Figure: the pooled features feed into an ordinary fully-connected network]
Convolutional Neural Network (CNN)

Putting it all together:
Input matrix -> 3 convolution filters -> convolution features -> pooling -> pooled features -> flatten -> fully-connected layers -> labels
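To make the stage order concrete, a hedged shape walk-through of the pipeline; the 28x28 input, random filter values, 2x2 pooling, and 10 output labels are all assumptions (the slide only fixes the order of the stages):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.random((28, 28))                       # input matrix

filters = rng.normal(0, 0.1, (3, 3, 3))        # 3 convolution filters (3x3)
conv = np.stack([
    np.array([[(x[i:i+3, j:j+3] * f).sum()     # valid cross-correlation
               for j in range(26)] for i in range(26)])
    for f in filters])                         # -> (3, 26, 26) feature maps

pooled = conv.reshape(3, 13, 2, 13, 2).max(axis=(2, 4))   # -> (3, 13, 13)
flat = pooled.reshape(-1)                                  # -> (507,)

W_fc = rng.normal(0, 0.1, (10, flat.size))     # fully-connected layer
logits = W_fc @ flat
probs = np.exp(logits) / np.exp(logits).sum()  # probabilities over 10 labels
print(conv.shape, pooled.shape, flat.shape, probs.shape)
```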


Example 1: CNN for Image
Example 2: CNN for Text
