Convolutional Neural Network (CNN)
LeNet-5 Architecture
LeNet-5 was originally developed for handwritten digit classification.
● Subsampling Layers (Sx): LeNet-5 uses average pooling in its subsampling layers, which reduces the spatial resolution of the
feature maps. Pooling also makes the network more robust to variations in the position of features, since small shifts in the input
image do not significantly alter the output. Together with the reduced dimensionality, this helps combat overfitting and keeps the
network computationally inexpensive (a pooling sketch follows this list).
● Fully Connected Layers (Fx): The fully connected layers are tasked with making the final decisions or predictions based on the
features extracted by the convolutional and subsampling layers. By connecting all neurons in these layers, the network is able to
combine and weigh features learned across all previous layers, effectively integrating global information from the image to produce
the final output (in LeNet's case, predicting the digit).
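The effect of the subsampling step shows up directly in the tensor shapes. A minimal sketch, assuming PyTorch (the notes do not name a framework): a 2x2 average pool with stride 2 halves each spatial dimension of the feature maps, mirroring LeNet-5's S2 step (the original subsampling layers also had a learned coefficient and bias, omitted here).

import torch
import torch.nn as nn

# 2x2 average pooling with stride 2, as in LeNet-5's subsampling layers
pool = nn.AvgPool2d(kernel_size=2, stride=2)

x = torch.randn(1, 6, 28, 28)   # 6 feature maps of size 28x28 (the size of C1's output)
y = pool(x)
print(y.shape)                  # torch.Size([1, 6, 14, 14]) -- spatial resolution halved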
Architectural Efficiency
LeNet-5 was designed to be both computationally efficient and effective, which was critical at a time when computational resources
were limited. The alternating pattern of convolution and pooling ensures that relevant features are captured while progressively
reducing the spatial complexity of the data. The structured receptive fields ensure that the network captures local patterns in a way that
supports the hierarchical learning of features.
This layered approach of progressively extracting more abstract and complex representations in later layers, followed by fully
connected layers to make the final decision, became a hallmark of modern CNNs.
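The alternating pattern can be written down compactly. A minimal sketch of a LeNet-5-style stack, again assuming PyTorch; it simplifies the original (tanh activations throughout, no trainable subsampling coefficients, no partial C3 connectivity, and a plain linear output instead of the RBF layer).

import torch
import torch.nn as nn

lenet5 = nn.Sequential(
    nn.Conv2d(1, 6, kernel_size=5),   # C1: 1x32x32 -> 6x28x28
    nn.Tanh(),
    nn.AvgPool2d(2, stride=2),        # S2: -> 6x14x14
    nn.Conv2d(6, 16, kernel_size=5),  # C3: -> 16x10x10
    nn.Tanh(),
    nn.AvgPool2d(2, stride=2),        # S4: -> 16x5x5
    nn.Flatten(),
    nn.Linear(16 * 5 * 5, 120),       # C5, acting here as a fully connected layer
    nn.Tanh(),
    nn.Linear(120, 84),               # F6
    nn.Tanh(),
    nn.Linear(84, 10),                # output: 10 digit classes
)

logits = lenet5(torch.randn(1, 1, 32, 32))
print(logits.shape)                   # torch.Size([1, 10])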
VGG-16 Architecture:
The name "VGG-16" reflects the fact that it has 16 layers with learnable weights
(13 convolutional layers and 3 fully connected layers). The architecture follows a
straightforward design philosophy: stacking small convolutional filters (3x3) with
stride 1, padding 1, and using max pooling to reduce the spatial dimensions.
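One such block, sketched in PyTorch (an assumed framework): 3x3 convolutions with stride 1 and padding 1 leave the spatial size unchanged, and the 2x2 max pool then halves it, which is the pattern VGG-16 repeats throughout.

import torch
import torch.nn as nn

# A VGG-style block (channel counts here match VGG-16's second block)
block = nn.Sequential(
    nn.Conv2d(64, 128, kernel_size=3, stride=1, padding=1),
    nn.ReLU(inplace=True),
    nn.Conv2d(128, 128, kernel_size=3, stride=1, padding=1),
    nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=2, stride=2),
)

x = torch.randn(1, 64, 112, 112)
print(block(x).shape)   # torch.Size([1, 128, 56, 56])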
Key Features of VGG-16
● Strengths:
○ Performance: VGG-16 performs exceptionally well on image classification tasks, especially on large-scale datasets like
ImageNet.
○ Generalization: The architecture is highly transferable, meaning it works well on other tasks via fine-tuning (e.g., object
detection and segmentation); see the fine-tuning sketch after this list.
○ Simplicity: The uniform use of 3x3 filters and 2x2 max pooling makes the network design simple and effective.
● Weaknesses:
○ Computationally Expensive: VGG-16 has a very large number of parameters (about 138 million), making it
computationally expensive to train and requiring significant memory and processing power.
○ Not the Most Efficient: While deep, the architecture is not the most computationally efficient compared to more modern
architectures (e.g., ResNet, EfficientNet) that use more innovative strategies like residual connections or depthwise
separable convolutions.
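A minimal fine-tuning sketch, assuming torchvision's VGG-16 implementation and a hypothetical 10-class target task: freeze the convolutional feature extractor and replace only the final classifier layer.

import torch.nn as nn
from torchvision import models

model = models.vgg16()   # load ImageNet weights here for real transfer learning

# Freeze the convolutional feature extractor
for p in model.features.parameters():
    p.requires_grad = False

# Replace the last fully connected layer (4096 -> 1000) with a new 10-class head
model.classifier[6] = nn.Linear(4096, 10)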
VGG-19 Architecture:
VGG-19 contains 19 layers with learnable parameters: 16 convolutional layers and
3 fully connected layers.
Key Features of VGG-19
● Strengths:
○ Performance: VGG-19 performs very well in image classification tasks and has strong
generalization ability.
○ Modularity: The repeated blocks make the network design modular and easy to implement.
○ Transfer Learning: VGG-19, like VGG-16, is popular for transfer learning, meaning it can be used
as a pre-trained model for other tasks.
● Weaknesses:
○ High Computational Cost: VGG-19 has about 143 million parameters, making it very expensive
in terms of memory and computation, especially compared to more recent architectures like ResNet (see the parameter count check after this list).
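A quick parameter count check, assuming torchvision is available (the notes name no library); in both models the total is dominated by the three fully connected layers.

from torchvision import models

vgg16 = models.vgg16()
vgg19 = models.vgg19()
print(sum(p.numel() for p in vgg16.parameters()))   # ~138 million
print(sum(p.numel() for p in vgg19.parameters()))   # ~143 million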
AlexNet Architecture:
Key Innovations in AlexNet
1. ReLU Activation:
○ AlexNet popularized the use of the ReLU activation function, which introduced non-linearity into the model. ReLU is
computationally efficient and helps mitigate the vanishing gradient problem that plagued earlier models using sigmoid or
tanh activations.
2. GPU Training:
○ AlexNet was one of the first deep learning models to make extensive use of GPUs to accelerate training. In fact, AlexNet
was trained on two Nvidia GTX 580 GPUs, splitting the model across GPUs and processing mini-batches in parallel.
3. Dropout:
○ Dropout, which AlexNet helped popularize, is a regularization technique that randomly sets a fraction of neurons to zero during
training, which helps prevent overfitting (the sketch after this list shows it alongside the other pieces).
4. Data Augmentation:
○ AlexNet used data augmentation techniques like random cropping, horizontal flipping, and image translations to artificially
increase the size of the training set and improve generalization.
5. Local Response Normalization (LRN):
○ LRN was used to normalize activations by introducing competition across neurons in a local neighborhood. It is closely
associated with AlexNet and is rarely used in modern architectures, which typically rely on Batch Normalization instead.
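A minimal sketch of these pieces in PyTorch/torchvision (assumed here, not specified in the notes): an augmentation pipeline of the kind AlexNet used, the ReLU non-linearity, LRN with AlexNet's published hyperparameters, and dropout with p = 0.5 as used in AlexNet's fully connected layers.

import torch
import torch.nn as nn
from torchvision import transforms

# Data augmentation in the spirit of AlexNet: random 224x224 crops and horizontal flips
augment = transforms.Compose([
    transforms.RandomCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
])

relu = nn.ReLU(inplace=True)
lrn = nn.LocalResponseNorm(size=5, alpha=1e-4, beta=0.75, k=2.0)  # AlexNet's LRN settings
dropout = nn.Dropout(p=0.5)                                       # used in AlexNet's FC layers

x = torch.randn(1, 96, 55, 55)   # the shape of AlexNet's first conv output
y = dropout(lrn(relu(x)))        # chained here only to show that shapes are preserved
print(y.shape)                   # torch.Size([1, 96, 55, 55])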