0% found this document useful (0 votes)

12 views11 pages

Alexnet - Number of Parameters and Tensor Sizes in A Convolutional Neural Network (CNN)

Uploaded by

sankeerthmanmadhan2002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views11 pages

Alexnet - Number of Parameters and Tensor Sizes in A Convolutional Neural Network (CNN)

Uploaded by

sankeerthmanmadhan2002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

Number of Parameters and Tensor Sizes in a Convolutional

Neural Network (CNN)

In this post, we share some formulas for calculating the sizes of

tensors (images) and the number of parameters in a layer in a
Convolutional Neural Network (CNN).

This post does not define basic terminology used in CNN and
assumes you are familiar with them. In this post, the word Tensor
simply means an image with an arbitrary number of channels.

We will show the calculations using AlexNet as an example. So,

here is the architecture of AlexNet for reference.
AlexNet has the following layers
1. Input: Color images of size 227x227x3. The AlexNet
paper mentions the input size of 224×224 but that is a
typo in the paper.
2. Conv-1: The first convolutional layer consists of 96
kernels of size 11×11 applied with a stride of 4 and
padding of 0.
3. MaxPool-1: The maxpool layer following Conv-1
consists of pooling size of 3×3 and stride 2.
4. Conv-2: The second conv layer consists of 256 kernels
of size 5×5 applied with a stride of 1 and padding of 2.
5. MaxPool-2: The maxpool layer following Conv-2
consists of a pooling size of 3×3 and a stride of 2.
6. Conv-3: The third conv layer consists of 384 kernels of
size 3×3 applied with a stride of 1 and padding of 1.
7. Conv-4: The fourth conv layer has the same structure as
the third conv layer. It consists of 384 kernels of size
3×3 applied with a stride of 1 and padding of 1.
8. Conv-5: The fifth conv layer consists of 256 kernels of
size 3×3 applied with a stride of 1 and padding of 1.
9. MaxPool-3: The maxpool layer following Conv-5
consists of a pooling size of 3×3 and a stride of 2.
10. FC-1: The first fully connected layer has 4096 neurons.
11. FC-2: The second fully connected layer has 4096
neurons.
12. FC-3: The third fully connected layer has 1000 neurons.

Next, we will use the above architecture to explain

1. How to calculate the tensor size at each stage
2. How to calculate the total number of parameters in the
network

Size of the Output Tensor (Image) of a Conv Layer

Let’s define
= Size (width) of output image.
= Size (width) of input image.
= Size (width) of kernels used in the Conv Layer.
= Number of kernels.
= Stride of the convolution operation.
= Padding.
The size ( ) of the output image is given by

The number of channels in the output image is equal to the number

of kernels .

Example: In AlexNet, the input image is of size 227x227x3. The first

convolutional layer has 96 kernels of size 11x11x3. The stride is 4 and
padding is 0. Therefore the size of the output image right after the first
bank of convolutional layers is

So, the output image is of size 55x55x96 ( one channel for each kernel ).
We leave it for the reader to verify the sizes of the outputs of the Conv-
2, Conv-3, Conv-4 and Conv-5 using the above image as a guide.

Size of Output Tensor (Image) of a MaxPool Layer

Let’s define
= Size (width) of output image.
= Size (width) of input image.
= Stride of the convolution operation.
= Pool size.
The size ( ) of the output image is given by

Note that this can be obtained using the formula for the convolution
layer by making padding equal to zero and keeping same as the
kernel size. But unlike the convolution llayer, the number of channels in
the maxpool layer’s output is unchanged.
Example: In AlexNet, the MaxPool layer after the bank of convolution
filters has a pool size of 3 and stride of 2. We know from the previous
section, the image at this stage is of size 55x55x96. The output image
after the MaxPool layer is of size

So, the output image is of size 27x27x96.

We leave it for the reader to verify the sizes of the outputs of MaxPool-2
and MaxPool-3.
Size of the output of a Fully Connected Layer
A fully connected layer outputs a vector of length equal to the number of
neurons in the layer.
Summary: Change in the size of the tensor through AlexNet
In AlexNet, the input is an image of size 227x227x3. After Conv-1, the
size of changes to 55x55x96 which is transformed to 27x27x96 after
MaxPool-1. After Conv-2, the size changes to 27x27x256 and following
MaxPool-2 it changes to 13x13x256. Conv-3 transforms it to a size of
13x13x384, while Conv-4 preserves the size and Conv-5 changes the
size back go 27x27x256. Finally, MaxPool-3 reduces the size to
6x6x256. This image feeds into FC-1 which transforms it into a vector
of size 4096×1. The size remains unchanged through FC-2, and finally,
we get the output of size 1000×1 after FC-3.
Next, we calculate the number of parameters in each Conv Layer.

Number of Parameters of a Conv Layer

In a CNN, each layer has two kinds of parameters : weights and biases.
The total number of parameters is just the sum of all weights and biases.
Let’s define,
= Number of weights of the Conv Layer.
= Number of biases of the Conv Layer.
= Number of parameters of the Conv Layer.
= Size (width) of kernels used in the Conv Layer.
= Number of kernels.
= Number of channels of the input image.
In a Conv Layer, the depth of every kernel is always equal to the number
of channels in the input image. So every kernel has parameters,
and there are such kernels. That’s how we come up with the above
formula.
Example: In AlexNet, at the first Conv Layer, the number of channels (
) of the input image is 3, the kernel size ( ) is 11, the number of
kernels ( ) is 96. So the number of parameters is given by

Readers can verify the number of parameters for Conv-2, Conv-3, Conv-
4, Conv-5 are 614656 , 885120, 1327488 and 884992 respectively. The
total number of parameters for the Conv Layers is therefore 3,747,200.
Think this is a large number? Well, wait until we see the fully connected
layers. One of the benefits of the Conv Layers is that weights are shared
and therefore we have fewer parameters than we would have in case of a
fully connected layer.
Number of Parameters of a MaxPool Layer
There are no parameters associated with a MaxPool layer. The pool size,
stride, and padding are hyperparameters.
Number of Parameters of a Fully Connected (FC) Layer
There are two kinds of fully connected layers in a CNN. The first FC
layer is connected to the last Conv Layer, while later FC layers are
connected to other FC layers. Let’s consider each case separately.
Case 1: Number of Parameters of a Fully Connected (FC) Layer
connected to a Conv Layer
Let’s define,
= Number of weights of a FC Layer which is connected to a
Conv Layer.
= Number of biases of a FC Layer which is connected to a
Conv Layer.
= Size (width) of the output image of the previous Conv Layer.
= Number of kernels in the previous Conv Layer.
= Number of neurons in the FC Layer.

Example: The first fully connected layer of AlexNet is connected to a

Conv Layer. For this layer, , and . Therefore,

That’s an order of magnitude more than the total number of parameters

of all the Conv Layers combined!
Case 2: Number of Parameters of a Fully Connected (FC) Layer
connected to a FC Layer
Let’s define,
= Number of weights of a FC Layer which is connected to an
FC Layer.
= Number of biases of a FC Layer which is connected to an
FC Layer.
= Number of parameters of a FC Layer which is connected to
an FC Layer.
= Number of neurons in the FC Layer.
= Number of neurons in the previous FC Layer.

In the above equation, is the total number of connection

weights from neurons of the previous FC Layer the neurons of the
current FC Layer. The total number of biases is the same as the number
of neurons ( ).
Example: The last fully connected layer of AlexNet is connected to an
FC Layer. For this layer, and . Therefore,
We leave it for the reader to verify the total number of parameters for
FC-2 in AlexNet is 16,781,312.
Number of Parameters and Tensor Sizes in AlexNet
The total number of parameters in AlexNet is the sum of all
parameters in the 5 Conv Layers + 3 FC Layers. It comes out to a
whopping 62,378,344! The table below provides a summary.

Layer Name Tensor Weights Biases Parameter

Size s

Input Image 227x227x3 0 0 0

Conv-1 55x55x96 34,848 96 34,944

MaxPool-1 27x27x96 0 0 0

Conv-2 27x27x256 614,400 256 614,656

MaxPool-2 13x13x256 0 0 0

Conv-3 13x13x384 884,736 384 885,120

Conv-4 13x13x384 1,327,10 384 1,327,488

Conv-5 13x13x256 884,736 256 884,992

MaxPool-3 6x6x256 0 0 0

FC-1 4096×1 37,748,7 4,096 37,752,832

36
FC-2 4096×1 16,777,2 4,096 16,781,312
16

FC-3 1000×1 4,096,00 1,000 4,097,000

Output 1000×1 0 0 0

Total 62,378,344

BS en 60584-1-2013
100% (2)
BS en 60584-1-2013
72 pages
Bio-Stats Step 3
100% (6)
Bio-Stats Step 3
9 pages
Final Priced BOQ's - Residential Hse PDF
50% (2)
Final Priced BOQ's - Residential Hse PDF
55 pages
CNN Short
No ratings yet
CNN Short
61 pages
Guide - Making Money Online
91% (11)
Guide - Making Money Online
324 pages
Complete Kitcar - March 2017
No ratings yet
Complete Kitcar - March 2017
84 pages
Coma
100% (1)
Coma
42 pages
Rue Morgue 11.12 2021
100% (2)
Rue Morgue 11.12 2021
64 pages
Ch06 Roth3e
100% (1)
Ch06 Roth3e
85 pages
Colony Earth PDF
No ratings yet
Colony Earth PDF
144 pages
Business Plan Group 2
No ratings yet
Business Plan Group 2
48 pages
Design and Layout of Spiral Separation Plant: (Industrial Project For Indian Rare Earths LTD., Chavara)
No ratings yet
Design and Layout of Spiral Separation Plant: (Industrial Project For Indian Rare Earths LTD., Chavara)
53 pages
Convolution Neural Network: CP - 6 Machine Learning M S Prasad
No ratings yet
Convolution Neural Network: CP - 6 Machine Learning M S Prasad
28 pages
Mos Cabin R1
100% (1)
Mos Cabin R1
13 pages
Java JVM Troubleshooting Guide
100% (1)
Java JVM Troubleshooting Guide
127 pages
Banana Fibre Extracting Project
No ratings yet
Banana Fibre Extracting Project
2 pages
Principles of Convolutional Neural Networks
No ratings yet
Principles of Convolutional Neural Networks
9 pages
Massachusetts Parent Letter Refusing MCAS
No ratings yet
Massachusetts Parent Letter Refusing MCAS
1 page
CS601 - Machine Learning - Unit 3 - Notes - 1672759761
No ratings yet
CS601 - Machine Learning - Unit 3 - Notes - 1672759761
15 pages
Building Your Money Making Machine
100% (1)
Building Your Money Making Machine
2 pages
Fluid Mechanics and Hydraulics - Gillesania
No ratings yet
Fluid Mechanics and Hydraulics - Gillesania
308 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
55 pages
Gravitation Revision Notes (JEE Mains)
No ratings yet
Gravitation Revision Notes (JEE Mains)
33 pages
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
No ratings yet
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
7 pages
Convolutional Neural Networks: Shusen Wang
No ratings yet
Convolutional Neural Networks: Shusen Wang
75 pages
Module5 Quiz
100% (1)
Module5 Quiz
34 pages
CNN For Visual Recognition
No ratings yet
CNN For Visual Recognition
4 pages
(W F + 2P) /S + 1: Use of Zero-Padding
No ratings yet
(W F + 2P) /S + 1: Use of Zero-Padding
3 pages
CNN and Autoencoder
No ratings yet
CNN and Autoencoder
56 pages
ENVI Classic Tutorial: Target Detection
No ratings yet
ENVI Classic Tutorial: Target Detection
18 pages
Convolutional Neural Network (CNN)
No ratings yet
Convolutional Neural Network (CNN)
63 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
61 pages
Joan Batayo Profile
No ratings yet
Joan Batayo Profile
2 pages
Convolutional Neural Networks - Annotated
No ratings yet
Convolutional Neural Networks - Annotated
83 pages
Additional CNN
No ratings yet
Additional CNN
82 pages
CNN Midterm
No ratings yet
CNN Midterm
103 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
161 pages
Intro To CNN
No ratings yet
Intro To CNN
17 pages
Money and Banking Notes Part 2
No ratings yet
Money and Banking Notes Part 2
3 pages
Unit 3 - Machine Learning
No ratings yet
Unit 3 - Machine Learning
29 pages
Cbs 350 Chapter 08
No ratings yet
Cbs 350 Chapter 08
18 pages
Student Notes: Convolutional Neural Networks (CNN) Introduction
No ratings yet
Student Notes: Convolutional Neural Networks (CNN) Introduction
9 pages
Summary Notes of CNN
No ratings yet
Summary Notes of CNN
23 pages
Typical CNN (Convolutional Neural Network) Architecture: CHARAN S (1VE20CA005) Cse-Ai, Svce
No ratings yet
Typical CNN (Convolutional Neural Network) Architecture: CHARAN S (1VE20CA005) Cse-Ai, Svce
13 pages
CS231n Convolutional Neural Networks For Visual Recognition
No ratings yet
CS231n Convolutional Neural Networks For Visual Recognition
2 pages
Unit 3 ML
No ratings yet
Unit 3 ML
27 pages
Sinopsis Muhammad Haris Yulianto-1
No ratings yet
Sinopsis Muhammad Haris Yulianto-1
6 pages
5-Convolutional Neural Network
No ratings yet
5-Convolutional Neural Network
43 pages
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
No ratings yet
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
65 pages
THE Infinite Game: Simon Sinek
No ratings yet
THE Infinite Game: Simon Sinek
27 pages
Introduction To Convolution Neural Network
No ratings yet
Introduction To Convolution Neural Network
6 pages
Lec 8
No ratings yet
Lec 8
60 pages
Nria20-Dl - Unit-3 Notes-Final
No ratings yet
Nria20-Dl - Unit-3 Notes-Final
23 pages
06 Activity 1
No ratings yet
06 Activity 1
3 pages
Unit 3
No ratings yet
Unit 3
80 pages
Introduction To Artificial Neural Networks - Neural Networks and Deep Learning
No ratings yet
Introduction To Artificial Neural Networks - Neural Networks and Deep Learning
26 pages
Untitled Document
No ratings yet
Untitled Document
15 pages
Convolutional Neural Networks (CNN) : Convolutions
No ratings yet
Convolutional Neural Networks (CNN) : Convolutions
17 pages
CV Lec6
No ratings yet
CV Lec6
57 pages
26-Deep Convolutional Models - ResNet, AlexNet, InceptionNet and Others-16!09!2024
No ratings yet
26-Deep Convolutional Models - ResNet, AlexNet, InceptionNet and Others-16!09!2024
6 pages
Assignment 5
No ratings yet
Assignment 5
2 pages
10 - Mark - CNN Architecture and Training
No ratings yet
10 - Mark - CNN Architecture and Training
7 pages
Notice To IEA Dwarka Museum
No ratings yet
Notice To IEA Dwarka Museum
2 pages
Solution
No ratings yet
Solution
4 pages
Mod 5
No ratings yet
Mod 5
96 pages
Assignment 10
No ratings yet
Assignment 10
2 pages
CSE4261 Lecture-11
No ratings yet
CSE4261 Lecture-11
35 pages
Where Bible Says Eat and Joy With You Wife in The Bible - Google Search
No ratings yet
Where Bible Says Eat and Joy With You Wife in The Bible - Google Search
1 page
586 114 216 Convolutional Neural Networks
No ratings yet
586 114 216 Convolutional Neural Networks
48 pages
Fortum Investor Presentation May 2019 0
No ratings yet
Fortum Investor Presentation May 2019 0
56 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
26 pages
Convolutional Neural Network: Vnuk - NCT & TTH
No ratings yet
Convolutional Neural Network: Vnuk - NCT & TTH
41 pages
Gradient Descent
No ratings yet
Gradient Descent
12 pages
Lecture 3
No ratings yet
Lecture 3
92 pages
Week 7
No ratings yet
Week 7
24 pages
07 Ais302 CNN
No ratings yet
07 Ais302 CNN
56 pages
T SNE1
No ratings yet
T SNE1
20 pages
DL Mod3
No ratings yet
DL Mod3
102 pages
Info Sec Unit-1 Note-05
No ratings yet
Info Sec Unit-1 Note-05
5 pages
Worksheet Geography CH 4
No ratings yet
Worksheet Geography CH 4
2 pages
Convolutional Neural Networks: 1. Basics of Cnns
No ratings yet
Convolutional Neural Networks: 1. Basics of Cnns
8 pages
Lecture 3 Updated
No ratings yet
Lecture 3 Updated
56 pages
4 A 14 CSC Operating System
No ratings yet
4 A 14 CSC Operating System
2 pages
Convolutional Neural Networks - Part 2
No ratings yet
Convolutional Neural Networks - Part 2
49 pages
0 - Try To Remember
No ratings yet
0 - Try To Remember
4 pages
02.02 15645872 Obc 2024
No ratings yet
02.02 15645872 Obc 2024
1 page
16-Optimization and Loss Functions in Classifiers, Convolution Layers, Max Pool Layers-24!08!2024
No ratings yet
16-Optimization and Loss Functions in Classifiers, Convolution Layers, Max Pool Layers-24!08!2024
36 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
6 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
98 pages
CNN Calculations
No ratings yet
CNN Calculations
2 pages
CNN Detailed Explanation
No ratings yet
CNN Detailed Explanation
3 pages
Unit 5th Ig Ann
No ratings yet
Unit 5th Ig Ann
112 pages
Chapter 8 (Convolution Neural Network)
No ratings yet
Chapter 8 (Convolution Neural Network)
73 pages
CNN (Neural Network)
No ratings yet
CNN (Neural Network)
32 pages
Unit 5 Ann
No ratings yet
Unit 5 Ann
28 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
32 pages
CNN 1
No ratings yet
CNN 1
19 pages
Day 3 - Math & Convolution
No ratings yet
Day 3 - Math & Convolution
4 pages