Chapter 5 Deep Learning
Neural Network
Dr. Võ Như Thành
Department of Mechatronics
Faculty of Mechanical Engineering
Email: [email protected]
Tel 0903532083
Content
Machine Learning vs. Neural Networks and Deep Learning
Advantages and Disadvantages
General Structure of Deep Learning and CNNs
Typical Famous CNN Architectures
Learning Types and Implementation
Reinforcement Learning
- Supervised and unsupervised learning work on static datasets, while RL works with data from a dynamic environment.
- The goal of RL is not to cluster or label the data, but to find the best sequence of actions that will generate the optimal outcome.
- RL allows an agent (a piece of software) to explore, interact with, and learn from the environment on a reward-and-punishment basis (see the sketch after this list).
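As a rough illustration of the reward-and-punishment idea, the update below is the core of tabular Q-learning, one common RL algorithm (a minimal MATLAB sketch; the state/action sizes and the sample transition are made-up values, not from this lecture):

% Toy tabular Q-learning update (illustrative values).
nStates = 16; nActions = 4;
alpha = 0.1;    % learning rate
gamma = 0.9;    % discount factor for future reward
Q = zeros(nStates, nActions);
% After taking action a in state s, landing in state s2 with reward r,
% the agent nudges its action-value estimate toward the observed return:
s = 1; a = 2; s2 = 5; r = -1;   % one example transition
Q(s,a) = Q(s,a) + alpha * (r + gamma * max(Q(s2,:)) - Q(s,a));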
Deep Learning (DL)
- Deep neural networks are the basis of deep learning.
- The term "deep" usually refers to the number of hidden layers in the neural network. Traditional neural networks contain only 2-3 hidden layers, while deep networks can have as many as 150 layers.
- Deep learning models are trained using large sets of labeled data and neural network architectures that learn features directly from the data, without the need for manual feature extraction.
Limitations
- Deep learning requires huge amounts of training data.
- Deep learning requires extensive computing power.
- Architectures can be complex and must often be highly tailored to a
specific application.
- The resulting models may not be easily interpretable.
Implementation
- GPUs (Graphics Processing Units) and TPUs (Tensor Processing Units) are exploited for heavy computations.
- For low-end processing, FPGAs (Field-Programmable Gate Arrays) and CPUs are used.
Machine Learning (ML):
- It relies on feature extraction from the input images. In the Cat vs. Dog case, features may be whiskers, ears, eyes, etc. The feature extraction parameters are defined by us.
- On the basis of these features, a classifier gives the output.
Deep Learning (DL):
- Deep learning takes ML one step further: it automatically finds the features that are important for classification.
Convolutional Neural Network (CNN)
One of the most popular types of deep neural networks is the convolutional neural network (CNN or ConvNet). A CNN convolves learned features with the input data and uses 2D convolutional layers, making this architecture well suited to processing 2D data such as images and sounds.
Convolution
- Convolution puts the input images through a set of convolutional filters, each of which activates certain features from the images.
- Pooling simplifies the output by performing nonlinear down-sampling, reducing the number of parameters that the network needs to learn.
- The rectified linear unit (ReLU) allows for faster and more effective training by mapping negative values to zero (see the numeric sketch below).
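To make these three steps concrete, here is a small numeric MATLAB sketch on a toy 5x5 image (all values are illustrative; note that conv2 flips the filter, i.e., true convolution, whereas CNN layers use cross-correlation, but the sliding-window idea is the same):

A = magic(5);                 % toy 5x5 image
K = [1 0; 0 -1];              % 2x2 convolutional filter
C = conv2(A, K, 'valid');     % convolution: 4x4 feature map
R = max(C, 0);                % ReLU: negative values mapped to zero
% 2x2 max pooling with stride 2: keep the largest value in each block
P = zeros(2, 2);
for i = 1:2
    for j = 1:2
        block = R(2*i-1:2*i, 2*j-1:2*j);
        P(i,j) = max(block(:));
    end
end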
SOME CNN STRUCTURES
ALEXNET
- It is a simple yet powerful network architecture with 23 layers.
- It utilizes a CNN with convolution, ReLU, pooling, and fully connected classification layers.
- The key feature of AlexNet is training on a GPU architecture, which speeds up training.
VGG Net
- The VGG network was introduced by researchers at the Visual Geometry Group at Oxford.
- This network is specially characterized by its pyramidal shape: the bottom layers, which are closer to the image, are wider, whereas the top layers are narrower.
- It has 19 layers and is slow to train.
SOME CNN STRUCTURES
GoogLeNet
- GoogLeNet (or the Inception network) is a class of architectures designed by researchers at Google.
- GoogLeNet was the winner of ImageNet 2014, where it proved to be a powerful model.
- It has 144 layers.
- It offers a parallel architecture, a drastic change from the sequential architectures of previously used models.
ResNet - Residual Networks
- ResNet is one of the monster architectures that truly defines how deep a deep learning architecture can be.
- It uses 152 layers.
- Residual Networks (ResNet for short) consist of multiple subsequent residual modules, which are the basic building blocks of the ResNet architecture.
SOME CNN STRUCTURES
RCNN (Region-Based CNN)
- The Region-Based CNN architecture is said to be among the most influential of all deep learning architectures.
- What RCNN does is attempt to draw a bounding box around every object present in the image, and then recognize which object is in each box.
SOME CNN STRUCTURES
YOLO (You Only Look Once)
- YOLO is a state-of-the-art real-time system built on deep learning for solving image detection problems.
- It first divides the image into defined bounding boxes, and then runs a recognition algorithm in parallel over all of these boxes to identify the object class in each.
- After identifying these classes, it intelligently merges the boxes to form an optimal bounding box around each object.
How to Create and Train Deep Learning Models
Transfer Learning
- A process that involves fine-tuning a pretrained model.
- We start with an existing network, such as AlexNet or GoogLeNet, and feed in new data containing previously unknown classes.
- After making some tweaks to the network, we can perform a new task, such as categorizing only dogs and cats instead of 1,000 different objects (see the sketch after this list).
- This also has the advantage of needing much less data (processing thousands of images rather than millions), so computation time drops to minutes or hours.
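A minimal MATLAB sketch of this workflow, assuming the Deep Learning Toolbox with the AlexNet support package is installed and that imdsTrain is an imageDatastore of the new classes (the training settings here are assumptions; inspect net.Layers for the exact layer positions):

net = alexnet;                 % pretrained on 1000 ImageNet classes
layers = net.Layers;
numNewClasses = 2;             % e.g., dog vs. cat
% Swap the final fully connected and classification layers so the
% network predicts the new classes instead of the original 1000.
layers(end-2) = fullyConnectedLayer(numNewClasses, 'Name', 'FC_new');
layers(end)   = classificationLayer('Name', 'Output_new');
% AlexNet expects 227x227x3 input; an augmentedImageDatastore resizes.
augTrain = augmentedImageDatastore([227 227], imdsTrain, 'ColorPreprocessing', 'gray2rgb');
options = trainingOptions('sgdm', 'MaxEpochs', 5, 'MiniBatchSize', 32);
newNet = trainNetwork(augTrain, layers, options);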
How to Implement Deep Learning Models
Design CNN example
- Using the MNIST data previously introduced in the last lecture.
- The test data set has 10,000 grayscale images:
- 10 folders, with 1,000 images in each
- 28x28 pixels each
- Make sure the folders are in the right path.
Design CNN example
- Check the path and make sure it is accessible.
- For training, use 750 images from each folder; the total number of training images is therefore 7,500.
- The remaining 250 images in each folder are used for validation; the total number of validation images is therefore 2,500.
- This distribution of images will be done by MATLAB code, not manually (see the sketch after this list).
- Test images will be selected manually.
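A minimal sketch of this split in MATLAB, using imageDatastore and splitEachLabel (the root folder name mnist_test is an assumption; substitute your actual path):

% Label each image by the name of the folder that contains it.
imds = imageDatastore('mnist_test', ...
    'IncludeSubfolders', true, 'LabelSource', 'foldernames');
% 750 images per class go to training; the remaining 250 go to validation.
[imdsTrain, imdsValidation] = splitEachLabel(imds, 750, 'randomized');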
Design CNN example – Layers in the model
Layers in the model
Layers of a CNN
Image Input Layer: This is the layer where we specify the image size.
imageInputLayer([M N n], 'Name', 'Input')
Ex. imageInputLayer([28 28 1], 'Name', 'Input')
Layers in the model
Layers of a CNN
Batch Normalization Layer: It normalizes the activations and gradients, making network training an easier optimization problem; it speeds up network training and reduces the sensitivity to network initialization.
batchNormalizationLayer('Name','BN_1')
ReLU Layer
The most common nonlinear activation function is the Rectified Linear Unit (ReLU).
reluLayer('Name','Relu_1')
Layers in the model
Max Pooling Layer
It is a down-sampling operation that reduces the spatial size of the feature map and removes redundant spatial information.
maxPooling2dLayer(PoolSize,'Stride',n,'Name','MaxPool_1')
maxPooling2dLayer(2,'Stride',2,'Name','MaxPool_1')
Fully Connected Layer
The last fully connected layer combines the features to classify the images.
fullyConnectedLayer(outputSize,Name,Value)
fullyConnectedLayer(10,'Name','FC')
Layers in the model
Softmax Layer:
The softmax activation function normalizes the output of the fully connected layer; the output of the softmax layer consists of positive numbers that sum to one, which can then be used as classification probabilities by the classification layer (see the numeric sketch below).
softmaxLayer('Name', Name)
softmaxLayer('Name', 'SoftMax')
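As a quick numeric illustration (the raw scores are made-up values):

z = [2.0 1.0 0.1];            % raw outputs of the fully connected layer
p = exp(z) ./ sum(exp(z));    % softmax: p is approx. [0.66 0.24 0.10], sum(p) is 1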
Classification Layer:
The final layer is the classification layer. This layer uses the probabilities returned by the softmax activation function for each input to assign the input to one of the mutually exclusive classes and to compute the loss.
classificationLayer('Name',Name)
classificationLayer('Name','Output Classification')
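Putting the pieces together, a minimal layer array for the 28x28 MNIST example might look like the sketch below. The convolution2dLayer (eight 3x3 filters) is an assumed addition, since a CNN needs at least one convolutional layer between the input and the classifier:

layers = [
    imageInputLayer([28 28 1], 'Name', 'Input')
    convolution2dLayer(3, 8, 'Padding', 'same', 'Name', 'Conv_1')  % assumed filter size/count
    batchNormalizationLayer('Name', 'BN_1')
    reluLayer('Name', 'Relu_1')
    maxPooling2dLayer(2, 'Stride', 2, 'Name', 'MaxPool_1')
    fullyConnectedLayer(10, 'Name', 'FC')
    softmaxLayer('Name', 'SoftMax')
    classificationLayer('Name', 'Output Classification')];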
Layers in the model
Training Parameters
trainingOptions(solverName, Name, Value)
trainingOptions('sgdm', 'LearnRateSchedule', 'piecewise', ...
    'LearnRateDropFactor', 0.2, 'LearnRateDropPeriod', 5, ...
    'MaxEpochs', 20, 'MiniBatchSize', 64, 'Plots', 'training-progress')
SolverName:
• 'sgdm': Stochastic Gradient Descent with Momentum (SGDM) optimizer. It requires a momentum rate.
• 'rmsprop': RMSProp optimizer. It requires the decay rate of the squared-gradient moving average.
• 'adam': Adam optimizer. It requires the decay rates of the gradient and squared-gradient moving averages.
Hardware Options:
'ExecutionEnvironment' - Hardware resource for training the network: 'auto' (default) | 'cpu' | 'gpu' | 'multi-gpu' | 'parallel'
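With the datastores and layer array from the earlier sketches, training is then a single call (a sketch; variable names follow the previous snippets, and ValidationData is an assumed addition):

options = trainingOptions('sgdm', ...
    'LearnRateSchedule', 'piecewise', ...
    'LearnRateDropFactor', 0.2, ...
    'LearnRateDropPeriod', 5, ...
    'MaxEpochs', 20, ...
    'MiniBatchSize', 64, ...
    'ValidationData', imdsValidation, ...
    'Plots', 'training-progress');
net = trainNetwork(imdsTrain, layers, options);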
Testing
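A minimal testing sketch, reusing net and imdsValidation from the earlier snippets (the test file name is an assumption):

img = imread('test_digit.png');     % a 28x28 grayscale test image
label = classify(net, img)          % predicted class for one image
% Overall accuracy on the held-out validation images:
predicted = classify(net, imdsValidation);
accuracy = mean(predicted == imdsValidation.Labels)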
Faculty of Mechanical Engineering
Vo Nhu Thanh, Ph.D., Senior Lecturer