Convolutional Neural Networks
1 Initializers
Task:
Implement four classes Constant, UniformRandom, Xavier and He in the file “Initializers.py” in the folder “Layers”. Each of them has to provide the method initialize(weights_shape, fan_in, fan_out), which returns an initialized tensor of the desired shape.
• Extend the method append_layer(layer) in the class NeuralNetwork such that it initializes trainable layers with the stored initializers.
You can verify your implementation using the provided testsuite by providing the command-line parameter TestInitializers.
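As a rough sketch of the expected interface, the four classes could look as follows. This assumes NumPy, a default constant value of 0.1, and the common choice of zero-mean Gaussians with σ = √(2 / (fan_in + fan_out)) for Xavier and σ = √(2 / fan_in) for He; check your lecture slides for the exact distributions required.

```python
import numpy as np


class Constant:
    def __init__(self, value=0.1):  # default value 0.1 is an assumption
        self.value = value

    def initialize(self, weights_shape, fan_in, fan_out):
        # every weight gets the same constant value
        return np.full(weights_shape, self.value)


class UniformRandom:
    def initialize(self, weights_shape, fan_in, fan_out):
        # uniform samples in [0, 1)
        return np.random.uniform(0.0, 1.0, weights_shape)


class Xavier:
    def initialize(self, weights_shape, fan_in, fan_out):
        # zero-mean Gaussian with sigma = sqrt(2 / (fan_in + fan_out))
        sigma = np.sqrt(2.0 / (fan_in + fan_out))
        return np.random.normal(0.0, sigma, weights_shape)


class He:
    def initialize(self, weights_shape, fan_in, fan_out):
        # zero-mean Gaussian with sigma = sqrt(2 / fan_in)
        sigma = np.sqrt(2.0 / fan_in)
        return np.random.normal(0.0, sigma, weights_shape)
```

Because all four classes share the same initialize signature, append_layer can hand any of them to a trainable layer without knowing which scheme is in use.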
Deep Learning Exercises [DL E] DL Tutors Team
Exercise 2 April 20, 2024
2 Advanced Optimizers
More advanced optimization schemes can increase the speed of convergence. We implement a popular per-parameter adaptive scheme named Adam and a common scheme improving stochastic gradient descent called momentum.
Task:
Implement the classes SgdWithMomentum and Adam in the file “Optimizers.py” in the folder “Optimization”. Both classes have to provide the method
calculate_update(weight_tensor, gradient_tensor).
• The SgdWithMomentum constructor receives the learning_rate and the momentum_rate, in this order.
• The Adam constructor receives the learning_rate, mu and rho, exactly in this order. In the literature, mu is often referred to as β1 and rho as β2 .
You can verify your implementation using the provided testsuite by providing the command-line parameter TestOptimizers2.
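One way these two update rules are commonly realized is sketched below; this is an illustration, not the reference solution. The velocity buffer v, the moment buffers v and r, the bias correction via an update counter k, and the machine epsilon from np.finfo are implementation choices.

```python
import numpy as np


class SgdWithMomentum:
    def __init__(self, learning_rate, momentum_rate):
        self.learning_rate = learning_rate
        self.momentum_rate = momentum_rate
        self.v = 0.0  # velocity, accumulated across updates

    def calculate_update(self, weight_tensor, gradient_tensor):
        # v_k = mu * v_{k-1} - eta * gradient
        self.v = self.momentum_rate * self.v - self.learning_rate * gradient_tensor
        return weight_tensor + self.v


class Adam:
    def __init__(self, learning_rate, mu, rho):
        self.learning_rate = learning_rate
        self.mu = mu    # beta_1 in the literature
        self.rho = rho  # beta_2 in the literature
        self.v = 0.0    # first-moment estimate
        self.r = 0.0    # second-moment estimate
        self.k = 0      # update counter for bias correction

    def calculate_update(self, weight_tensor, gradient_tensor):
        self.k += 1
        g = gradient_tensor
        self.v = self.mu * self.v + (1.0 - self.mu) * g
        self.r = self.rho * self.r + (1.0 - self.rho) * g * g
        # bias-corrected moment estimates
        v_hat = self.v / (1.0 - self.mu ** self.k)
        r_hat = self.r / (1.0 - self.rho ** self.k)
        eps = np.finfo(float).eps
        return weight_tensor - self.learning_rate * v_hat / (np.sqrt(r_hat) + eps)
```

Note that both optimizers are stateful: one optimizer object per trainable layer is needed, since the buffers belong to that layer's weights.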
3 Flatten Layer
The flatten layer reshapes the multi-dimensional input into a one-dimensional feature vector. This is especially useful when connecting a convolutional or pooling layer to a fully connected layer.
Task:
Implement a class Flatten in the file “Flatten.py” in the folder “Layers”. This class has to provide the methods forward(input_tensor) and backward(error_tensor).
• Implement a method forward(input_tensor), which reshapes and returns the input_tensor.
• Implement a method backward(error_tensor), which reshapes and returns the error_tensor.
You can verify your implementation using the provided testsuite by providing the command-line parameter TestFlatten.
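A minimal sketch of such a layer, assuming a non-trainable layer marked with a trainable flag as in the previous exercise: the forward pass only needs to remember the incoming shape so the backward pass can undo the reshape.

```python
import numpy as np


class Flatten:
    def __init__(self):
        self.trainable = False  # no parameters to update
        self.input_shape = None

    def forward(self, input_tensor):
        # remember the original shape so backward can undo the reshape
        self.input_shape = input_tensor.shape
        batch_size = input_tensor.shape[0]
        # keep the batch dimension, flatten everything else
        return input_tensor.reshape(batch_size, -1)

    def backward(self, error_tensor):
        # restore the multi-dimensional layout of the forward input
        return error_tensor.reshape(self.input_shape)
```

For example, a (b, c, y, x) = (2, 3, 2, 2) tensor becomes a (2, 12) matrix on the way forward, and the error tensor is reshaped back to (2, 3, 2, 2) on the way back.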
4 Convolutional Layer
While fully connected layers are theoretically well suited to approximate any function, they struggle to efficiently classify images due to extensive memory consumption and overfitting. Using convolutional layers, these problems can be circumvented by restricting the layer’s parameters to local receptive fields.
Task:
Implement a class Conv in the file “Conv.py” in the folder “Layers”. This class has to provide the methods forward(input_tensor) and backward(error_tensor).
• Write a constructor for this class, receiving the arguments stride_shape, convolution_shape and num_kernels defining the operation. Note the following:
– This layer has trainable parameters, so set the inherited member trainable accordingly.
– stride_shape can be a single value or a tuple. The latter allows for different strides in the spatial dimensions.
– convolution_shape determines whether this object provides a 1D or a 2D convolution layer. For 1D, it has the shape [c, m], whereas for 2D, it has the shape [c, m, n], where c represents the number of input channels and m, n represent the spatial extent of the filter kernel.
– num_kernels is an integer value.
Initialize the parameters of this layer uniformly at random in the range [0, 1).
• To be able to test the gradients with respect to the weights, the members for the weights and biases should be named weights and bias. Additionally, provide two properties, gradient_weights and gradient_bias, which return the gradients with respect to the weights and the bias after they have been computed in the backward pass.
• Implement a method forward(input_tensor) which returns a tensor that serves as the input tensor for the next layer. Note the following:
– The input layout for 1D is defined in b, c, y order; for 2D, in b, c, y, x order. Here, b stands for the batch, c represents the channels and x, y represent the spatial dimensions.
– You can calculate the output shape at the beginning, based on the input_tensor and the stride_shape.
– Use zero-padding for convolutions/correlations (“same” padding). This allows input and output to have the same spatial shape for a stride of 1.
• Implement a property optimizer storing the optimizer for this layer. Note that you need two copies of the optimizer object if you handle the bias separately from the other weights.
You can verify your implementation using the provided testsuite by providing the command-line parameter TestConv. For further debugging purposes, we provide optional unittests in “SoftConvTests.py”. Please read the instructions there carefully in case you need them.
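To illustrate the forward pass described above, here is a forward-only sketch for the 2D case. It is not the full solution: the backward pass, the 1D case, the single-value stride and the optimizer/initializer plumbing are omitted, and the use of scipy.signal.correlate with the c // 2 channel slice is one particular implementation trick (the “same”-mode centering of SciPy may also differ from the testsuite’s convention for even-sized kernels).

```python
import numpy as np
from scipy.signal import correlate


class Conv:
    def __init__(self, stride_shape, convolution_shape, num_kernels):
        self.trainable = True  # this layer has trainable parameters
        self.stride_shape = stride_shape
        self.convolution_shape = convolution_shape  # [c, m] or [c, m, n]
        self.num_kernels = num_kernels
        # parameters initialized uniformly at random in [0, 1)
        self.weights = np.random.uniform(0.0, 1.0, (num_kernels, *convolution_shape))
        self.bias = np.random.uniform(0.0, 1.0, num_kernels)

    def forward(self, input_tensor):
        # 2D case only in this sketch; layout is (b, c, y, x)
        b, c = input_tensor.shape[:2]
        output = np.zeros((b, self.num_kernels, *input_tensor.shape[2:]))
        for sample in range(b):
            for k in range(self.num_kernels):
                # cross-correlate over all channels at once with "same"
                # zero-padding; slice c // 2 along the channel axis is the
                # position where the kernel fully overlaps the input channels
                same = correlate(input_tensor[sample], self.weights[k], mode='same')
                output[sample, k] = same[c // 2] + self.bias[k]
        # subsample according to the strides (assumes a 2-tuple here)
        sy, sx = self.stride_shape
        return output[:, :, ::sy, ::sx]
```

Computing the full “same”-padded correlation first and then striding, as done here, is simple but wasteful; computing only the strided output positions directly is the more efficient variant.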
5 Pooling Layer
Pooling layers are typically used in conjunction with the convolutional layer. They reduce the
dimensionality of the input and therefore also decrease memory consumption. Additionally,
they reduce overfitting by introducing a degree of scale and translation invariance. We will
implement max-pooling as the most common form of pooling.
Task:
Implement a class Pooling in the file “Pooling.py” in the folder “Layers”. This class has to provide the methods forward(input_tensor) and backward(error_tensor).
• Write a constructor receiving the arguments stride_shape and pooling_shape, with the same ordering as specified in the convolutional layer.
• Implement a method forward(input_tensor) which returns a tensor that serves as the input tensor for the next layer. Hint: Keep in mind to store the information necessary for the backward pass.
– In contrast to the convolutional layer, the pooling layer only has to be implemented for the 2D case.
– Use “valid” padding for the pooling layer. This means, unlike in the convolutional layer, no zero-padding is applied. This may discard border elements of the input tensor; take this into account when creating your output tensor.
You can verify your implementation using the provided testsuite by providing the command-line parameter TestPooling.
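A possible max-pooling sketch under the assumptions above (2D only, “valid” padding, stride_shape and pooling_shape given as 2-tuples). The max_positions buffer is one way to store the information the hint asks for: the backward pass routes each error value back to the position that won the max in the forward pass, accumulating where overlapping windows select the same element.

```python
import numpy as np


class Pooling:
    def __init__(self, stride_shape, pooling_shape):
        self.trainable = False
        self.stride_shape = stride_shape
        self.pooling_shape = pooling_shape

    def forward(self, input_tensor):
        b, c, y, x = input_tensor.shape
        py, px = self.pooling_shape
        sy, sx = self.stride_shape
        # "valid" pooling: windows must fit entirely inside the input
        out_y = (y - py) // sy + 1
        out_x = (x - px) // sx + 1
        self.input_shape = input_tensor.shape
        self.max_positions = np.zeros((b, c, out_y, out_x, 2), dtype=int)
        output = np.empty((b, c, out_y, out_x))
        for i in range(out_y):
            for j in range(out_x):
                window = input_tensor[:, :, i * sy:i * sy + py, j * sx:j * sx + px]
                flat = window.reshape(b, c, -1)
                idx = flat.argmax(axis=2)
                output[:, :, i, j] = flat.max(axis=2)
                # remember the argmax coordinates for routing gradients back
                self.max_positions[:, :, i, j, 0] = i * sy + idx // px
                self.max_positions[:, :, i, j, 1] = j * sx + idx % px
        return output

    def backward(self, error_tensor):
        grad = np.zeros(self.input_shape)
        b, c, out_y, out_x = error_tensor.shape
        for n in range(b):
            for ch in range(c):
                for i in range(out_y):
                    for j in range(out_x):
                        yy, xx = self.max_positions[n, ch, i, j]
                        # overlapping windows may select the same input
                        # element, so the gradients must be accumulated
                        grad[n, ch, yy, xx] += error_tensor[n, ch, i, j]
        return grad
```

For a 4×4 input with 2×2 pooling and stride 2, each of the four windows contributes one output value, and in the backward pass each error value lands exactly on its window's maximum.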
Task:
Debug your implementation until every test in the suite passes. You can run all tests by providing no command-line parameter. To run the unittests, you can either execute them with Python in the terminal or with the dedicated unittest environment of PyCharm [1]. We recommend the latter, as it provides a better overview of all tests. For the automated computation of the bonus points achieved in one exercise, run the unittests with the bonus flag in a terminal; check out the manual for the exact invocation and for dispatching your folder.
[1] https://www.jetbrains.com/help/pycharm/creating-and-editing-run-debug-configurations.html