
Advanced Convolutional Neural Networks

Nguyen Quang Uy

1
Outline
1. AlexNet

2. VGGNet

3. GoogLeNet

4. ResNet

5. MobileNet

6. EfficientNet
2
Legends

3
Layers

4
Activation functions

5
Modules/Blocks

6
Repeated layers

7
AlexNet

8
Overview
• Paper: ImageNet Classification with Deep Convolutional Neural Networks
• Published in: NeurIPS 2012.
• Considered one of the most impactful papers in computer vision.

9
Novelties
• Uses Rectified Linear Units (ReLUs) as activation functions.
• Uses dropout layers.
• Uses data augmentation.
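A minimal PyTorch sketch of these three ideas (illustrative only, not the authors' original code; the head sizes follow torchvision's AlexNet classifier):

```python
import torch.nn as nn
from torchvision import transforms

# Data augmentation: random crops and horizontal flips (the paper also
# used PCA-based colour jitter, omitted here for brevity).
augment = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
])

# ReLU activations and dropout in an AlexNet-style fully-connected head.
head = nn.Sequential(
    nn.Dropout(p=0.5),
    nn.Linear(256 * 6 * 6, 4096),
    nn.ReLU(inplace=True),
    nn.Dropout(p=0.5),
    nn.Linear(4096, 4096),
    nn.ReLU(inplace=True),
    nn.Linear(4096, 1000),
)
```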

10
Architecture
• AlexNet has 8 layers — 5 convolutional and 3 fully-connected.
• AlexNet has 60M parameters.
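The count is easy to check against the torchvision implementation (a quick sketch, assuming a recent torchvision is installed):

```python
from torchvision.models import alexnet

model = alexnet(weights=None)  # random initialisation, no download needed
n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e6:.1f}M parameters")  # roughly 61M
```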

11
Results
• Top-1 error rate is 37.5%
• Top-5 error rate is 17.0%

12
VGG

13
Overview
• VGG: Visual Geometry Group
• Paper: Very Deep Convolutional Networks for Large-Scale Image
Recognition
• Published in: arXiv, 2014.

14
Novelties
• Designs deeper networks (roughly twice as deep as AlexNet) by stacking uniform convolutions.
• Uses only 3×3 kernels, as opposed to AlexNet's 11×11. This decreases the number of parameters (see the sketch below).
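A back-of-the-envelope check of the saving: two stacked 3×3 convolutions cover the same 5×5 receptive field as one 5×5 convolution, with fewer weights (C is an arbitrary example channel count):

```python
C = 256                        # example channel count
one_5x5 = 5 * 5 * C * C        # one 5x5 convolution, C in / C out, bias ignored
two_3x3 = 2 * 3 * 3 * C * C    # two stacked 3x3 convolutions, same receptive field
print(one_5x5, two_3x3)        # 1638400 vs 1179648 -> ~28% fewer parameters
```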

15
Architecture
• VGG has 13 convolutional and 3 fully-connected layers.
• This network essentially stacks more layers onto the AlexNet design.
• It has 138M parameters.
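A quick sketch splitting that count between the convolutional stack and the fully-connected head (assuming torchvision's vgg16):

```python
import torch.nn as nn
from torchvision.models import vgg16

model = vgg16(weights=None)
conv = sum(p.numel() for m in model.features if isinstance(m, nn.Conv2d)
           for p in m.parameters())
fc = sum(p.numel() for m in model.classifier if isinstance(m, nn.Linear)
         for p in m.parameters())
print(f"conv: {conv / 1e6:.1f}M, fc: {fc / 1e6:.1f}M")  # ~14.7M vs ~123.6M
```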

16
VGG results
• Top-1 accuracy is 71.5%
• Top-5 accuracy is 90.1%

17
GoogLeNet

18
Overview
• Also known as Inception-v1
• Paper: Going Deeper with Convolutions
• Published in: 2015 IEEE Conference on Computer Vision and Pattern
Recognition (CVPR).
• Achieved results competitive with human performance.

19
Novelties
• Builds networks from modules/blocks, instead of simply stacking convolutional layers.
• 1×1 convolutions are used for dimensionality reduction to remove computational bottlenecks.
• Runs parallel convolutions with filters at 1×1, 3×3 and 5×5, followed by concatenation (see the sketch below).
• Uses two auxiliary classifiers to encourage discrimination in the lower stages.
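A minimal sketch of one Inception module (channel sizes borrowed from the paper's inception(3a) block; ReLUs omitted for brevity, so this is an illustration rather than the full network):

```python
import torch
import torch.nn as nn

class InceptionBlock(nn.Module):
    """Inception-v1 style module: parallel 1x1, 3x3, 5x5 and pooling branches,
    with 1x1 convolutions as dimensionality reduction, concatenated along the
    channel axis."""
    def __init__(self, c_in):
        super().__init__()
        self.b1 = nn.Conv2d(c_in, 64, kernel_size=1)
        self.b2 = nn.Sequential(
            nn.Conv2d(c_in, 96, kernel_size=1),               # reduce channels first
            nn.Conv2d(96, 128, kernel_size=3, padding=1))
        self.b3 = nn.Sequential(
            nn.Conv2d(c_in, 16, kernel_size=1),
            nn.Conv2d(16, 32, kernel_size=5, padding=2))
        self.b4 = nn.Sequential(
            nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
            nn.Conv2d(c_in, 32, kernel_size=1))

    def forward(self, x):
        return torch.cat([self.b1(x), self.b2(x), self.b3(x), self.b4(x)], dim=1)

x = torch.randn(1, 192, 28, 28)
print(InceptionBlock(192)(x).shape)  # torch.Size([1, 256, 28, 28])
```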

20
Architecture

21
Architecture
• Stem and Inception module.

22
Results
• Top-1 accuracy is 78.2%
• Top-5 accuracy is 94.1%
• Human error is 5%-8%.

23
ResNet

24
Overview
• Paper: Deep Residual Learning for Image Recognition.
• Published in: 2016 IEEE Conference on Computer Vision and Pattern
Recognition (CVPR).
• The first network to achieve better results than humans.

25
Novelties
• Popularised skip connections (they weren't the first to use them).
• Designed even deeper CNNs (up to 152 layers) without compromising the model's generalisation power (see the sketch below).
• Among the first to use batch normalisation.
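A minimal residual block sketch in PyTorch (identity shortcut only; strided/projection shortcuts omitted):

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Basic residual block: two 3x3 conv + batch-norm layers, with the
    input added back through the skip connection."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)   # skip connection: F(x) + x
```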

26
Architecture
• Conv block and Identity module.

27
Architecture
• Conv block and Identity module.

28
ResNet results
• Top-1 accuracy is 87.0%.
• Top-5 accuracy is 96.3%.
• Top-5 human accuracy: 95.0%.

29
MobileNet

30
Overview
• Paper: MobileNets: Efficient Convolutional Neural Networks for Mobile Vision
Applications
• Published in: arXiv, 2017.
• Specially designed for mobile devices.

31
Novelties
• MobileNet uses depthwise separable convolutions, which significantly reduce the number of parameters.
• It introduces two shrinking hyperparameters, a width multiplier and a resolution multiplier, that efficiently trade off between latency and accuracy.

32
Architecture

33
Architecture
• Depthwise separable convolution.

34
Architecture
• Depthwise convolution:
• In a normal convolution, every filter spans all input channels and produces one feature map.
• In a depthwise convolution, each input channel is convolved with its own single-channel filter, producing one feature map per channel.

35
Architecture
• Pointwise convolution:
• In a normal convolution, producing 256 output channels from a 3-channel input takes 256 filters of size 5×5×3.
• In a pointwise convolution, we only need 256 filters of size 1×1×3 (see the sketch below).
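A sketch of that factorisation in PyTorch, using the same example numbers (3 input channels, 256 output channels, 5×5 kernel; groups=3 makes the first convolution depthwise):

```python
import torch
import torch.nn as nn

depthwise = nn.Conv2d(3, 3, kernel_size=5, padding=2, groups=3)  # one 5x5 filter per channel
pointwise = nn.Conv2d(3, 256, kernel_size=1)                     # 256 filters of size 1x1x3

x = torch.randn(1, 3, 32, 32)
print(pointwise(depthwise(x)).shape)  # torch.Size([1, 256, 32, 32])
```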

36
Computation cost
• Standard convolution

• The computational cost can be calculated as

  DK × DK × M × N × DF × DF

• where DF is the spatial dimension of the (square) input feature map, DK is the size of the (square) convolution kernel, and M and N are the number of input and output channels respectively.
37
Computation cost
• Depthwise convolution

• The computational cost can be calculated as

  DK × DK × M × DF × DF

38
Computation cost
• Pointwise convolution

• The computational cost can be calculated as

  M × N × DF × DF

39
Computation cost
• The total computational cost of a depthwise separable convolution can be calculated as

  DK × DK × M × DF × DF + M × N × DF × DF

• Comparing it with the computational cost of standard convolution gives the reduction in computation:

  (DK × DK × M × DF × DF + M × N × DF × DF) / (DK × DK × M × N × DF × DF) = 1/N + 1/DK²
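A quick numeric check of that ratio (M and DF are example values; they cancel out):

```python
D_K, N = 3, 256          # 3x3 kernels, 256 output channels
M, D_F = 64, 56          # example input channels and feature-map size

standard  = D_K * D_K * M * N * D_F * D_F
separable = D_K * D_K * M * D_F * D_F + M * N * D_F * D_F
print(separable / standard)      # ~0.115
print(1 / N + 1 / D_K ** 2)      # same value: 1/N + 1/DK^2
```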

40
Results
• MobileNet matches GoogLeNet and VGG in accuracy with a much lower number of operations and parameters.

41
EfficientNet

42
Overview
• Paper: EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
• Published in: International Conference on Machine Learning, 2019.
• It is still considered to be among the state of the art today.

43
Novelties
• Compound Scaling from B0 to B7.
• The EfficientNet architecture (developed using Neural Architecture Search).

44
Architecture

45
Compound scaling
• The most common ways to scale up ConvNets were depth (number of layers), width (number of channels) or image resolution (image size).
• EfficientNets perform compound scaling: all three dimensions are scaled together while maintaining a balance between them.

46
Compound scaling
• This idea of compound scaling makes sense: if the input image is bigger, then the network needs more layers (depth) and more channels (width) to capture more fine-grained patterns.

47
Neural Architecture Search
• This is a reinforcement-learning-based approach, used to develop EfficientNet-B0 by leveraging a multi-objective search that optimizes for both accuracy and FLOPS.

48
Neural Architecture Search
• The objective function can formally be defined as

  ACC(m) × [FLOPS(m) / T]^w

• where m is the candidate model, T is the target FLOPS, and w = −0.07 is a hyperparameter controlling the accuracy/FLOPS trade-off.
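A tiny sketch of that reward (search_reward is a hypothetical helper name; w = −0.07 is the value reported in the paper):

```python
def search_reward(acc, flops, target_flops, w=-0.07):
    """MnasNet-style multi-objective reward: trades accuracy against FLOPS."""
    return acc * (flops / target_flops) ** w
```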

49
Mobile inverted bottleneck convolution (MBConv)
• MBConv without squeeze and excitation operation

50
Mobile inverted bottleneck convolution (MBConv)
• MBConv with squeeze and excitation operation

51
Squeeze and excitation operation
• Gives access to global information.
• Models channel interdependencies.
• Can be regarded as a self-attention function over channels (see the sketch below).
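A minimal squeeze-and-excitation sketch (the reduction ratio is illustrative):

```python
import torch.nn as nn

class SqueezeExcite(nn.Module):
    """Squeeze-and-excitation: global average pooling ('squeeze'), a small
    bottleneck MLP ('excite'), then channel-wise rescaling."""
    def __init__(self, channels, reduction=4):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):
        b, c, _, _ = x.shape
        s = x.mean(dim=(2, 3))             # squeeze: global average pooling
        w = self.fc(s).view(b, c, 1, 1)    # excite: per-channel weights in (0, 1)
        return x * w                       # rescale channels
```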

52
Scaling Efficient-B0 to get B1-B7
• Let the network depth (d), width (w) and input image resolution (r) be:

  d = α^φ,  w = β^φ,  r = γ^φ
  subject to α × β² × γ² ≈ 2 and α, β, γ ≥ 1

• We then fix α, β, γ as constants and scale up the baseline network with different φ using Equation 3, to obtain EfficientNet-B1 to B7 (see the sketch below).
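A small numeric sketch using the B0 coefficients reported in the paper (α = 1.2, β = 1.1, γ = 1.15):

```python
alpha, beta, gamma = 1.2, 1.1, 1.15   # depth, width, resolution coefficients

for phi in range(1, 4):
    d, w, r = alpha ** phi, beta ** phi, gamma ** phi
    print(f"phi={phi}: depth x{d:.2f}, width x{w:.2f}, resolution x{r:.2f}")

print(alpha * beta ** 2 * gamma ** 2)  # constraint check: roughly 2
```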
53
Results

54
Q&A
Thank you!

55
