Unit 2 CNN
CNN Architecture
• Convolutional neural networks are biologically inspired
networks that are used in computer vision for image
classification and object detection.
• In the convolutional neural network architecture, each
layer of the network is 3-dimensional, with a spatial
extent and a depth corresponding to the number of
features (channels).
• The notion of depth of a single layer in a convolutional
neural network is distinct from the notion of depth in
terms of the number of layers.
• In the input layer, these features correspond to the color
channels like RGB (i.e., red, green, blue), and in the
hidden layers these features represent hidden feature
maps that encode various types of shapes in the image.
• If the input is in grayscale (like LeNet-5), then the input
layer will have a depth of 1, but later layers will still be 3-
dimensional.
• The architecture contains two types of layers, referred to
as convolution layers and subsampling layers.
• For the convolution layers, a convolution operation is
defined, in which a filter is used to map the activations
from one layer to the next.
• A convolution operation uses a 3-dimensional filter of
weights with the same depth as the current layer but with
a smaller spatial extent.
• The dot product between all the weights in the filter and
any choice of spatial region (of the same size as the filter)
in a layer defines the value of the hidden state in the next
layer.
• The operation between the filter and the spatial regions in
a layer is performed at every possible position in order to
define the next layer, as in the sketch below.
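To make the layer-to-layer mapping concrete, the following is a minimal NumPy sketch of the convolution operation described above. The variable names and the example sizes (a 32 × 32 × 3 input with six 5 × 5 × 3 filters) are illustrative assumptions, not values taken from the text.

import numpy as np

def conv_layer(activations, filters, stride=1):
    """activations: (H, W, D) layer; filters: (K, F, F, D) bank of K filters of depth D."""
    H, W, D = activations.shape
    K, F, _, _ = filters.shape
    out_h = (H - F) // stride + 1
    out_w = (W - F) // stride + 1
    out = np.zeros((out_h, out_w, K))
    for k in range(K):                      # one feature map per filter
        for i in range(out_h):
            for j in range(out_w):
                # spatial region of the same size (and depth) as the filter
                region = activations[i*stride:i*stride+F, j*stride:j*stride+F, :]
                # dot product of all the filter weights with the region
                out[i, j, k] = np.sum(region * filters[k])
    return out

x = np.random.rand(32, 32, 3)     # e.g., a small RGB input layer
w = np.random.rand(6, 5, 5, 3)    # six filters of size 5 x 5 x 3
print(conv_layer(x, w).shape)     # (28, 28, 6): spatial size (32 - 5)/1 + 1 = 28, depth 6

Each output value is the dot product of one filter with one spatial region, so the depth of the next layer equals the number of filters used.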
AlexNet
• AlexNet was the winner of the 2012 ILSVRC competition.
• It has 8 layers with learnable parameters.
• The input to the Model is RGB images.
• It has 5 convolution layers with a combination of max-
pooling layers.
• Then it has 3 fully connected layers.
• The activation function used in all the hidden layers is ReLU.
• It used two Dropout layers.
• The activation function used in the output layer is
Softmax.
• The total number of parameters in this architecture is 62.3
million.
• AlexNet starts with 224 × 224 × 3 images and uses 96
filters of size 11 × 11 × 3 in the first layer.
• A stride of 4 is used. This results in a first layer of size 55
× 55 × 96.
• After the first layer has been computed, a max-pooling
layer is used.
• The ReLU activation function was applied after each
convolutional layer, which was followed by response
normalization and max-pooling.
• The second convolutional layer uses the response-
normalized and pooled output of the first convolutional
layer and filters it with 256 filters of size 5 × 5 × 96.
• The sizes of the filters of the third, fourth, and fifth
convolutional layers are 3 × 3 × 256 (with 384 filters), 3
× 3 × 384 (with 384 filters), and 3 × 3 × 384 (with 256
filters), respectively.
• All max-pooling layers used 3 × 3 filters at stride 2.
• The fully connected layers have 4096 neurons each. The final
set of 4096 activations can be treated as a 4096-
dimensional representation of the image.
• The final layer of AlexNet uses a 1000-way softmax in
order to perform the classification (see the sketch below).
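The layer sizes listed above can be summarized in a hedged PyTorch sketch. This is a single-network view (the original AlexNet split some layers across two GPUs), and the padding values below are assumptions chosen so that the spatial sizes come out to 55, 27, 13, and 6; the final softmax is normally applied by the loss function rather than inside the model.

import torch
import torch.nn as nn

alexnet = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=2),   # 224 x 224 x 3 -> 55 x 55 x 96
    nn.ReLU(),
    nn.LocalResponseNorm(5),                                  # response normalization
    nn.MaxPool2d(kernel_size=3, stride=2),                    # -> 27 x 27 x 96
    nn.Conv2d(96, 256, kernel_size=5, padding=2),             # -> 27 x 27 x 256
    nn.ReLU(),
    nn.LocalResponseNorm(5),
    nn.MaxPool2d(kernel_size=3, stride=2),                    # -> 13 x 13 x 256
    nn.Conv2d(256, 384, kernel_size=3, padding=1),            # -> 13 x 13 x 384
    nn.ReLU(),
    nn.Conv2d(384, 384, kernel_size=3, padding=1),            # -> 13 x 13 x 384
    nn.ReLU(),
    nn.Conv2d(384, 256, kernel_size=3, padding=1),            # -> 13 x 13 x 256
    nn.ReLU(),
    nn.MaxPool2d(kernel_size=3, stride=2),                    # -> 6 x 6 x 256
    nn.Flatten(),
    nn.Dropout(0.5),
    nn.Linear(256 * 6 * 6, 4096),                             # first fully connected layer
    nn.ReLU(),
    nn.Dropout(0.5),
    nn.Linear(4096, 4096),                                    # 4096-dimensional representation
    nn.ReLU(),
    nn.Linear(4096, 1000),                                    # 1000-way classification scores
)

x = torch.randn(1, 3, 224, 224)
print(alexnet(x).shape)                                       # torch.Size([1, 1000])
print(sum(p.numel() for p in alexnet.parameters()))           # roughly 62.3 million parameters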
VGG
• VGG stands for Visual Geometry Group; it is a
standard deep Convolutional Neural Network
(CNN) architecture with multiple layers.
• The “deep” refers to the number of layers, with
VGG-16 and VGG-19 consisting of 16 and 19 layers
with weights (convolutional plus fully connected), respectively.
• The VGG16 model achieves about 92.7% top-5 test accuracy
on ImageNet. ImageNet is a dataset of more than 14
million images; the ILSVRC benchmark used here covers 1000 classes.
• Moreover, it was one of the most popular models submitted to
ILSVRC-2014.
• It replaces large kernel-sized filters (such as the 11 × 11
and 5 × 5 filters in AlexNet) with several 3 × 3 filters
one after the other, thereby making significant
improvements over AlexNet (see the sketch after this list).
• The VGG16 model was trained using Nvidia Titan Black
GPUs for multiple weeks.
• The VGG-16 consists of 13 convolutional layers and
three fully connected layers.
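The effect of stacking 3 × 3 filters can be seen in a small, hedged PyTorch comparison; the channel count below (256) is an assumption chosen only for illustration. Two stacked 3 × 3 convolutions cover the same 5 × 5 receptive field as a single 5 × 5 convolution, but with fewer parameters and an extra non-linearity in between, which is the trade-off VGG exploits throughout its 13 convolutional layers.

import torch
import torch.nn as nn

c = 256  # assumed number of input and output channels

one_5x5 = nn.Conv2d(c, c, kernel_size=5, padding=2)
two_3x3 = nn.Sequential(
    nn.Conv2d(c, c, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(c, c, kernel_size=3, padding=1), nn.ReLU(),
)

count = lambda m: sum(p.numel() for p in m.parameters())
print(count(one_5x5))   # 25*c*c + c    = 1,638,656 weights
print(count(two_3x3))   # 2*(9*c*c + c) = 1,180,160 weights

x = torch.randn(1, c, 56, 56)
print(one_5x5(x).shape, two_3x3(x).shape)   # both preserve the 56 x 56 spatial size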