
ECSE 6965

Programming Assignment 3
Sergei Bugrov

Backpropagation in convolutional neural networks

1. $\nabla \hat{y} = \hat{y} - y$, when the loss function is cross entropy.

2. $\nabla W_o = \dfrac{\partial \hat{y}}{\partial W_o}\,\nabla\hat{y}$, $\quad \nabla b_o = \dfrac{\partial \hat{y}}{\partial b_o}\,\nabla\hat{y}$, $\quad \nabla FC = \dfrac{\partial \hat{y}}{\partial FC}\,\nabla\hat{y}$
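A minimal NumPy sketch of steps 1–2, assuming a softmax output layer over logits $W_o \cdot FC + b_o$ and a one-hot label $y$ (the function and variable names are illustrative, not taken from the assignment code):

```python
import numpy as np

def output_layer_backward(FC, W_o, b_o, y):
    """Backward pass through the softmax output layer (steps 1-2).

    FC  : (d,)   activations of the fully connected layer
    W_o : (10, d) output weights;  b_o : (10,) output biases
    y   : (10,)  one-hot label
    """
    logits = W_o @ FC + b_o
    y_hat = np.exp(logits - logits.max())
    y_hat /= y_hat.sum()                  # softmax probabilities

    grad_logits = y_hat - y               # step 1: gradient at the output, y_hat - y
    grad_W_o = np.outer(grad_logits, FC)  # step 2: gradient w.r.t. W_o
    grad_b_o = grad_logits                # step 2: gradient w.r.t. b_o
    grad_FC = W_o.T @ grad_logits         # step 2: gradient w.r.t. FC, passed further back
    return grad_W_o, grad_b_o, grad_FC
```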

3. $\nabla P[r][c] = \nabla FC[(r-1)\,N_c^{P} + c]$, i.e. the FC-layer gradient is un-flattened (row by row) back into the shape of the pooled map $P$.

4. $\nabla A = \dfrac{\partial P}{\partial A}\,\nabla P = \displaystyle\sum_{r=1}^{N_r^P} \dfrac{\partial P[r]}{\partial A}\,\nabla P[r] = \sum_{r=1}^{N_r^P}\sum_{c=1}^{N_c^P} \dfrac{\partial P[r][c]}{\partial A}\,\nabla P[r][c]$, where

$\dfrac{\partial P[r][c]}{\partial A} = \begin{bmatrix} \dfrac{\partial P[r][c]}{\partial A[1][1]} & \cdots & \dfrac{\partial P[r][c]}{\partial A[1][N_c^A]} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial P[r][c]}{\partial A[N_r^A][1]} & \cdots & \dfrac{\partial P[r][c]}{\partial A[N_r^A][N_c^A]} \end{bmatrix}, \qquad \dfrac{\partial P[r][c]}{\partial A[k][l]} = \begin{cases} 1 & \text{if } k = i^* \text{ and } l = j^* \\ 0 & \text{otherwise} \end{cases}$

where $i^*, j^* = \operatorname*{argmax}_{\substack{r \le k \le r+d-1 \\ c \le l \le c+d-1}} A[k][l]$ ($d$ is the pooling kernel size) and stride = 1.
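A NumPy sketch of steps 3–4 for a single feature map, assuming $P$ was flattened row by row into the FC input and max pooling uses a $d \times d$ window with stride 1 (names are illustrative):

```python
import numpy as np

def maxpool_backward(A, grad_FC, d):
    """Backward pass through max pooling (steps 3-4), stride = 1.

    A       : (H, W) pre-pooling activations of one feature map
    grad_FC : flattened gradient w.r.t. the pooled map P of this feature map
    d       : pooling window size
    """
    Hp, Wp = A.shape[0] - d + 1, A.shape[1] - d + 1
    grad_P = grad_FC.reshape(Hp, Wp)      # step 3: un-flatten dL/dFC into dL/dP
    grad_A = np.zeros_like(A)
    for r in range(Hp):
        for c in range(Wp):
            window = A[r:r + d, c:c + d]
            i, j = np.unravel_index(np.argmax(window), window.shape)
            grad_A[r + i, c + j] += grad_P[r, c]  # step 4: gradient flows only to the argmax
    return grad_A
```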

5. $\nabla C = \dfrac{\partial A}{\partial C}\,\nabla A = \displaystyle\sum_{r=1}^{N_r^A}\sum_{c=1}^{N_c^A} \dfrac{\partial A[r][c]}{\partial C}\,\nabla A[r][c] = \sum_{r=1}^{N_r^A}\sum_{c=1}^{N_c^A} \begin{bmatrix} \dfrac{\partial A[r][c]}{\partial C[1][1]} & \cdots & \dfrac{\partial A[r][c]}{\partial C[1][N_c^A]} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial A[r][c]}{\partial C[N_r^A][1]} & \cdots & \dfrac{\partial A[r][c]}{\partial C[N_r^A][N_c^A]} \end{bmatrix} \nabla A[r][c]$, where

$\dfrac{\partial A[r][c]}{\partial C[i][j]} = \begin{cases} 1 & \text{if } i = r,\ j = c,\ \text{and } C[i][j] > 0 \\ 0 & \text{otherwise} \end{cases}$
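Step 5 reduces to an element-wise ReLU mask, since $A = \mathrm{ReLU}(C)$. A minimal NumPy sketch (names are illustrative):

```python
import numpy as np

def relu_backward(C, grad_A):
    """Step 5: A = ReLU(C), so the gradient passes only where C > 0."""
    return grad_A * (C > 0)
```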

6. $\nabla W_x = \dfrac{\partial C}{\partial W_x}\,\nabla C = \displaystyle\sum_{r=1}^{N_r^C}\sum_{c=1}^{N_c^C} \dfrac{\partial C[r][c]}{\partial W_x}\,\nabla C[r][c] = \sum_{r=1}^{N_r^C}\sum_{c=1}^{N_c^C} \begin{bmatrix} \dfrac{\partial C[r][c]}{\partial W_x[1][1]} & \cdots & \dfrac{\partial C[r][c]}{\partial W_x[1][K]} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial C[r][c]}{\partial W_x[K][1]} & \cdots & \dfrac{\partial C[r][c]}{\partial W_x[K][K]} \end{bmatrix} \nabla C[r][c]$, where

$\dfrac{\partial C[r][c]}{\partial W_x[i][j]} = \begin{bmatrix} \dfrac{\partial C[r][c]}{\partial W_x[i][j][1]} \\ \dfrac{\partial C[r][c]}{\partial W_x[i][j][2]} \\ \vdots \\ \dfrac{\partial C[r][c]}{\partial W_x[i][j][D]} \end{bmatrix}, \qquad \dfrac{\partial C[r][c]}{\partial W_x[i][j][l]} = X[r+i-1][c+j-1][l]$

Similarly for the biases,

$\nabla b_x = \dfrac{\partial C}{\partial b_x}\,\nabla C = \displaystyle\sum_{r=1}^{N_r^C}\sum_{c=1}^{N_c^C} \dfrac{\partial C[r][c]}{\partial b_x}\,\nabla C[r][c]$, where $\dfrac{\partial C[r][c]}{\partial b_x[i][j]} = \begin{cases} 1 & \text{if } i = r,\ j = c \\ 0 & \text{otherwise} \end{cases}$

7. $\nabla X = \dfrac{\partial C}{\partial X}\,\nabla C = \displaystyle\sum_{r=1}^{N_r^C}\sum_{c=1}^{N_c^C} \dfrac{\partial C[r][c]}{\partial X}\,\nabla C[r][c] = \sum_{r=1}^{N_r^C}\sum_{c=1}^{N_c^C} \begin{bmatrix} \dfrac{\partial C[r][c]}{\partial X[1][1]} & \cdots & \dfrac{\partial C[r][c]}{\partial X[1][N_c^X]} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial C[r][c]}{\partial X[N_r^X][1]} & \cdots & \dfrac{\partial C[r][c]}{\partial X[N_r^X][N_c^X]} \end{bmatrix} \nabla C[r][c]$, where

$\dfrac{\partial C[r][c]}{\partial X[i][j]} = \begin{bmatrix} \dfrac{\partial C[r][c]}{\partial X[i][j][1]} \\ \dfrac{\partial C[r][c]}{\partial X[i][j][2]} \\ \vdots \\ \dfrac{\partial C[r][c]}{\partial X[i][j][D]} \end{bmatrix}$ and

$\dfrac{\partial C[r][c]}{\partial X[i][j][l]} = \begin{cases} W_x[i-r+1][j-c+1][l] & \text{if } r \le i \le r+K-1 \text{ and } c \le j \le c+K-1 \\ 0 & \text{otherwise} \end{cases}$ when stride = 1.
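A NumPy sketch of steps 6–7 for one feature map with stride 1 and no padding; here I treat $b_x$ as a single scalar bias shared across the feature map (a common choice), so its gradient is the sum of $\nabla C$ over all output positions (names are illustrative):

```python
import numpy as np

def conv_backward(X, W_x, grad_C):
    """Backward pass through one conv feature map (steps 6-7), stride = 1, no padding.

    X      : (H, W, D) input tensor
    W_x    : (K, K, D) kernel of this feature map
    grad_C : (H-K+1, W-K+1) gradient w.r.t. this feature map's pre-activation C
    """
    K = W_x.shape[0]
    grad_W = np.zeros_like(W_x)
    grad_b = 0.0
    grad_X = np.zeros_like(X)
    for r in range(grad_C.shape[0]):
        for c in range(grad_C.shape[1]):
            patch = X[r:r + K, c:c + K, :]
            grad_W += patch * grad_C[r, c]   # step 6: dC[r][c]/dW_x[i][j][l] = X[r+i][c+j][l]
            grad_b += grad_C[r, c]           # scalar bias: every output position contributes
            grad_X[r:r + K, c:c + K, :] += W_x * grad_C[r, c]  # step 7: scatter W_x back onto X
    return grad_W, grad_b, grad_X
```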

Model architecture.

• Input: tensor X ∈ R^(32x32x3);
• W1, tensor of weights of shape [5, 5, 3, 32]; vector of biases b1 ∈ R^32;
• 1st convolution layer with 32 kernels of size 5x5, stride = 1;
• 1st pooling layer with a kernel of size 2x2, stride = 1;
• W2, tensor of weights of shape [5, 5, 32, 32]; vector of biases b2 ∈ R^32;
• 2nd convolution layer with 32 kernels of size 5x5, stride = 1;
• 2nd pooling layer with a kernel of size 2x2, stride = 1;
• W3, tensor of weights of shape [3, 3, 32, 64]; vector of biases b3 ∈ R^64;
• 3rd convolution layer with 64 kernels of size 3x3, stride = 1;
• W4, matrix of weights of shape [192, 65536]; vector of biases b4 ∈ R^192;
• FC, fully connected layer ∈ R^192;
• W5, matrix of weights of shape [10, 192]; vector of biases b5 ∈ R^10;
• Output layer ∈ R^10;
• Loss function – cross-entropy error: loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits_v2(logits=output, labels=y))
• Optimizer: tf.train.RMSPropOptimizer(learning_rate=1e-3).minimize(loss) (a TensorFlow sketch of the full model follows below)
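A minimal TensorFlow 1.x sketch of the architecture above. SAME padding, truncated-normal initialization, and a ReLU on the FC layer are my assumptions (the write-up does not specify them), and variable names are illustrative:

```python
import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 32, 32, 3])
y = tf.placeholder(tf.float32, [None, 10])

def conv(inp, shape):
    """Conv layer: weights of the given shape, zero biases, ReLU activation."""
    W = tf.Variable(tf.truncated_normal(shape, stddev=0.1))
    b = tf.Variable(tf.zeros([shape[-1]]))
    return tf.nn.relu(tf.nn.conv2d(inp, W, strides=[1, 1, 1, 1], padding='SAME') + b)

def pool(inp):
    """2x2 max pooling with stride 1, as in the architecture list."""
    return tf.nn.max_pool(inp, ksize=[1, 2, 2, 1], strides=[1, 1, 1, 1], padding='SAME')

h = pool(conv(x, [5, 5, 3, 32]))    # 1st conv + pool
h = pool(conv(h, [5, 5, 32, 32]))   # 2nd conv + pool
h = conv(h, [3, 3, 32, 64])         # 3rd conv -> 32x32x64 = 65536 features with SAME padding
flat = tf.reshape(h, [-1, 32 * 32 * 64])

W4 = tf.Variable(tf.truncated_normal([32 * 32 * 64, 192], stddev=0.1))
b4 = tf.Variable(tf.zeros([192]))
fc = tf.nn.relu(tf.matmul(flat, W4) + b4)   # ReLU on FC is an assumption

W5 = tf.Variable(tf.truncated_normal([192, 10], stddev=0.1))
b5 = tf.Variable(tf.zeros([10]))
output = tf.matmul(fc, W5) + b5

loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits_v2(logits=output, labels=y))
train_op = tf.train.RMSPropOptimizer(learning_rate=1e-3).minimize(loss)
```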

Hyperparameters.
Batch size = 250
Number of epochs = 200

Discussion. Stride > 1 and/or a pooling kernel larger than 2x2 would speed up training with roughly the same performance. A deeper model and/or dropout layers would improve generalization. Training took over 2 hours, and I ran it for only 200 epochs instead of the 6000 epochs in the instructions; hence the relative underperformance.

1st Convolution Layer Filters:
[Figure: visualization of the learned first-layer filters]

Loss
[Figure: training (Train_Loss) and validation (Val_Loss) loss vs. epoch]

Terrible overfitting. I would blame the gigantic fully connected layer; well-tuned dropout would improve the situation.
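Dropout would only take a couple of lines on top of the TensorFlow sketch above (keep_prob = 0.5 is an assumed value; `fc`, `W5`, and `b5` refer to that sketch):

```python
keep_prob = tf.placeholder(tf.float32)   # feed 0.5 during training, 1.0 at evaluation time
fc_drop = tf.nn.dropout(fc, keep_prob)   # randomly zeroes FC activations and rescales the rest
output = tf.matmul(fc_drop, W5) + b5     # the output layer now reads from the dropped-out FC
```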
Accuracy
[Figure: training (Train_Accu) and validation (Val_Accu) accuracy vs. epoch]

Test accuracy (per class)


Class Accuracy
0 0.6782786885245902
1 0.7465346534653465
2 0.51953125
3 0.386317907444668
4 0.5641025641025641
5 0.5594262295081968
6 0.7535641547861507
7 0.6565656565656566
8 0.7857142857142857
9 0.7309941520467836

Class 3 is far below the average accuracy (only 0.386), and classes 2, 4, and 5 are also noticeably below it (average train_accu: 99.50%, average valid_accu: 63.80%).
