25-Deep Convolutional Models - ResNet, AlexNet, InceptionNet and Others-12/09/2024
Problem Statement:
Describe the architecture of AlexNet, including the number and types of
layers, and the size of the input it accepts.
Solution:
AlexNet consists of 8 learnable layers: 5 convolutional layers followed by 3 fully
connected layers. It accepts RGB input images of size 227x227x3.
Explanation:
AlexNet's architecture was designed to process large-scale image
datasets. The convolutional layers extract features from the input image,
while the fully connected layers interpret these features for classification.
The use of ReLU activations and dropout were innovative at the time and
helped improve training speed and reduce overfitting.
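The layer breakdown above can be sketched in plain Python. The filter counts and sizes below are the commonly cited AlexNet configuration, included here for illustration rather than stated in the text:

```python
# Sketch of AlexNet's 8 learnable layers (commonly cited configuration;
# pooling and normalization layers are omitted since they have no weights).
alexnet_layers = [
    {"name": "Conv1", "type": "conv", "filters": 96,  "size": 11, "stride": 4},
    {"name": "Conv2", "type": "conv", "filters": 256, "size": 5,  "stride": 1},
    {"name": "Conv3", "type": "conv", "filters": 384, "size": 3,  "stride": 1},
    {"name": "Conv4", "type": "conv", "filters": 384, "size": 3,  "stride": 1},
    {"name": "Conv5", "type": "conv", "filters": 256, "size": 3,  "stride": 1},
    {"name": "FC6",   "type": "fc",   "units": 4096},
    {"name": "FC7",   "type": "fc",   "units": 4096},
    {"name": "FC8",   "type": "fc",   "units": 1000},  # 1000 ImageNet classes
]

num_conv = sum(1 for layer in alexnet_layers if layer["type"] == "conv")
num_fc = sum(1 for layer in alexnet_layers if layer["type"] == "fc")
print(num_conv, num_fc)  # 5 3
```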
Problem Statement:
Given an input image of size 227x227x3, calculate the output size of the
first convolutional layer (Conv1) in AlexNet.
Solution:
To calculate the output size, we use the formula:
Output size = (N - F + 2P) / S + 1
Where:
N = Input size
F = Filter size
P = Padding
S = Stride
For Conv1: N = 227, F = 11, P = 0, S = 4, so
Output size = (227 - 11 + 0) / 4 + 1 = 55
The output volume of Conv1 is therefore 55x55x96.
Explanation:
This calculation shows how the spatial dimensions are reduced in the first
convolutional layer due to the large filter size (11x11) and stride (4). The
depth becomes 96 because there are 96 filters in this layer.
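The formula above can be checked numerically with a small helper (the function and variable names here are illustrative, not from the text):

```python
def conv_output_size(n, f, p, s):
    """Output spatial size of a conv layer: (N - F + 2P) / S + 1."""
    return (n - f + 2 * p) // s + 1

# Conv1 in AlexNet: 227x227 input, 11x11 filters, no padding, stride 4.
out = conv_output_size(227, 11, 0, 4)
print(out)  # 55 -> the output volume is 55x55x96 (96 filters give the depth)
```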
Problem Statement:
Calculate the number of learnable parameters in the first convolutional
layer (Conv1) of AlexNet.
Solution:
To calculate the number of parameters, we need to consider both the
weights and biases:
1. Weights: 11 x 11 x 3 x 96 = 34,848
2. Biases: 96 (one per filter)
Total: 34,848 + 96 = 34,944 learnable parameters
Explanation:
Each filter in the convolutional layer has weights for each pixel in its
receptive field (11x11) for each input channel (3 for RGB). Additionally,
each filter has one bias term. The large number of parameters in this layer
contributes to AlexNet's ability to learn complex features from the input
images.
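The same count can be reproduced in a few lines of Python, multiplying the receptive-field area by the input channels and number of filters:

```python
# Parameter count for Conv1: weights per filter x number of filters,
# plus one bias term per filter.
filter_h, filter_w, in_channels, num_filters = 11, 11, 3, 96

weights = filter_h * filter_w * in_channels * num_filters  # 34,848
biases = num_filters                                       # 96
total = weights + biases
print(weights, biases, total)  # 34848 96 34944
```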
Problem Statement:
Calculate the receptive field size of a neuron in the Conv5 layer of AlexNet
with respect to the input image.
Solution:
To calculate the receptive field, we need to work backwards from Conv5
to the input, applying RF_in = (RF_out - 1) x S + F at each layer.
Calculation (convolutional layers only; pooling layers are omitted here):
Conv5 (3x3, stride 1): (1 - 1) x 1 + 3 = 3
Conv4 (3x3, stride 1): (3 - 1) x 1 + 3 = 5
Conv3 (3x3, stride 1): (5 - 1) x 1 + 3 = 7
Conv2 (5x5, stride 1): (7 - 1) x 1 + 5 = 11
Conv1 (11x11, stride 4): (11 - 1) x 4 + 11 = 51
The receptive field size is 51x51 pixels in the original input image.
Explanation:
This calculation shows how neurons in deeper layers of the network have
a larger receptive field in the original image. This allows later layers to
capture more complex and larger-scale features of the input image.
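The backward recurrence RF_in = (RF_out - 1) x S + F can be expressed as a short loop. The sketch below walks only the five conv layers, skipping the pooling layers, which matches the 51x51 figure above; the filter/stride list is the standard AlexNet configuration:

```python
# (filter size, stride) for Conv1..Conv5; pooling layers are omitted here.
conv_layers = [(11, 4), (5, 1), (3, 1), (3, 1), (3, 1)]

rf = 1  # start from a single neuron in Conv5's output
for f, s in reversed(conv_layers):
    rf = (rf - 1) * s + f  # receptive field in the layer below
print(rf)  # 51
```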
Problem Statement:
Explain the purpose and impact of using ReLU (Rectified Linear Unit)
activation functions in AlexNet.
Solution:
ReLU activation functions in AlexNet serve several important purposes:
1. Non-saturating gradient: for positive inputs the gradient is 1, which
mitigates the vanishing-gradient problem of sigmoid and tanh.
2. Computational efficiency: ReLU is a simple threshold, max(0, x), far
cheaper to compute than the exponentials in sigmoid or tanh.
3. Sparse activations: negative inputs are zeroed out, so only a subset
of neurons is active for any given input.
Impact: networks using ReLU trained several times faster than otherwise
identical networks using tanh, which made training a network of
AlexNet's depth on ImageNet practical.
Explanation:
The introduction of ReLU in AlexNet was a key innovation that helped
overcome limitations of previous activation functions. It allowed for the
effective training of deeper networks and contributed significantly to
AlexNet's breakthrough performance in image classification tasks.
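For reference, ReLU itself is just max(0, x); a minimal sketch:

```python
def relu(x):
    """ReLU: passes positive inputs through unchanged, zeroes out negatives."""
    return max(0.0, x)

# Unlike sigmoid/tanh, the slope is exactly 1 for any positive input,
# so gradients do not shrink (saturate) as activations grow.
print([relu(v) for v in [-2.0, -0.5, 0.0, 1.5, 3.0]])  # [0.0, 0.0, 0.0, 1.5, 3.0]
```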