
Deep Learning (IST, 2021-22)

Practical 8: Convolutional Neural Networks


João Santinha, José Maria Moreira, Luís Sá-Couto, and Andreas Wichert

Introduction
We have now learned about the Perceptron, Linear and Logistic Regression, the Multi-layer Perceptron, Auto-encoders, and the gradient backpropagation algorithm.
Today we will talk about Convolutional Neural Networks (CNNs). We will start with a pen-and-paper exercise and then move to Google Colab, where we will do some hands-on programming exercises.
CNNs provide a clever way of exploiting input structure. Natural images are an example of data where this structure exists and where CNNs have been particularly successful.
Important properties of CNNs:

• Invariance → useful representations can be learned with fewer parameters

  – Locality (learning small convolutional kernels/filters dramatically decreases the number of trainable parameters, and the extracted features are translation invariant)
  – Spatial Invariance / Equivariant Representations

• Sparse Interactions (induced by kernels that are smaller than the input)

• Parameter Sharing

Their name comes from the fact that the network employs the mathematical operation called convolution, which can be represented as follows (with a kernel of size 2 and stride 1):

[Figure: two-dimensional convolution example, from Goodfellow I., Bengio Y., Courville A., Deep Learning, MIT Press, 2016.]
In its simplest form this operation is, strictly speaking, a cross-correlation between the two-dimensional input and the kernel (followed by the addition of a bias).
It is possible to change the stride and the kernel shape (e.g., s_w = 2, s_h = 1, and a 3 × 3 kernel), leading to an output with a different shape.
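As a quick illustration, here is a minimal NumPy sketch of the cross-correlation described above, with a configurable stride; the function name corr2d and the example arrays are our own choices and not part of the practical.

import numpy as np

def corr2d(X, K, stride=(1, 1)):
    # 2-D cross-correlation of input X with kernel K (no padding).
    sh, sw = stride
    kh, kw = K.shape
    out_h = (X.shape[0] - kh) // sh + 1
    out_w = (X.shape[1] - kw) // sw + 1
    Y = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            window = X[i * sh:i * sh + kh, j * sw:j * sw + kw]
            Y[i, j] = (window * K).sum()
    return Y

X = np.arange(12).reshape(3, 4)            # toy 3 x 4 input
print(corr2d(X, np.ones((2, 2))))          # 2 x 2 kernel, stride 1 -> 2 x 3 output
print(corr2d(X, np.ones((3, 3)), (1, 2)))  # 3 x 3 kernel, s_h = 1, s_w = 2 -> 1 x 1 output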

Another common operation in CNNs is pooling, which is similar to convolution but has no kernel of learned weights. Instead, pooling applies a fixed operation to each window of the input it slides over. Two examples are max pooling and average pooling, which take the maximum value or the average of the values in the window, respectively.
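A matching sketch of pooling (again with names of our own choosing) highlights the difference: there are no learned kernel weights, only a fixed operation applied to each window.

import numpy as np

def pool2d(X, window=(2, 2), stride=(2, 2), mode="max"):
    # Max or average pooling of a 2-D array X (no padding).
    wh, ww = window
    sh, sw = stride
    out_h = (X.shape[0] - wh) // sh + 1
    out_w = (X.shape[1] - ww) // sw + 1
    Y = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = X[i * sh:i * sh + wh, j * sw:j * sw + ww]
            Y[i, j] = patch.max() if mode == "max" else patch.mean()
    return Y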

Question 1
Consider the following input image:

    ⎛ 20  35  35  35  35  20 ⎞
    ⎜ 29  46  44  42  42  27 ⎟
I = ⎜ 16  25  21  19  19  12 ⎟
    ⎜ 66 120 116 154 114  62 ⎟
    ⎜ 74 216 174 252 172 112 ⎟
    ⎝ 70 210 170 250 170 110 ⎠

1. What is the output provided by a convolution layer with the following properties:

• Stride: (1,1)
• Kernel:
⎛ 1 1 1 ⎞
⎜ 1 0 1 ⎟
⎝ 1 1 1 ⎠

Solution:

    ⎛  225  258  250  209 ⎞
C = ⎜  458  566  552  472 ⎟
    ⎜  708  981  887  802 ⎟
    ⎝ 1000 1488 1320 1224 ⎠

2. Take the output from 1. and apply a max pooling layer with the following properties:

• Stride: (2,2)
• Window Shape: (2,2)

Solution:

M = ⎛  566  552 ⎞
    ⎝ 1488 1320 ⎠
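These two results can be checked quickly in PyTorch; note that F.conv2d actually implements the cross-correlation discussed in the introduction, which is exactly what the exercise asks for.

import torch
import torch.nn.functional as F

I = torch.tensor([[20, 35, 35, 35, 35, 20],
                  [29, 46, 44, 42, 42, 27],
                  [16, 25, 21, 19, 19, 12],
                  [66, 120, 116, 154, 114, 62],
                  [74, 216, 174, 252, 172, 112],
                  [70, 210, 170, 250, 170, 110]], dtype=torch.float32)
K = torch.tensor([[1, 1, 1],
                  [1, 0, 1],
                  [1, 1, 1]], dtype=torch.float32)

# conv2d expects (batch, channels, height, width) tensors.
C = F.conv2d(I.view(1, 1, 6, 6), K.view(1, 1, 3, 3), stride=1)
M = F.max_pool2d(C, kernel_size=2, stride=2)
print(C.squeeze())   # should reproduce the 4 x 4 matrix from part 1
print(M.squeeze())   # should reproduce [[566, 552], [1488, 1320]]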

3. For the following kernels, describe what kind of feature they extract from the image:

     ⎛ −10 −10 −10 ⎞
F1 = ⎜   5   5   5 ⎟
     ⎝ −10 −10 −10 ⎠

     ⎛ 2   2  2 ⎞
F2 = ⎜ 2 −12  2 ⎟
     ⎝ 2   2  2 ⎠

     ⎛ −20 −10   0   5  10 ⎞
     ⎜ −10   0   5  10   5 ⎟
F3 = ⎜   0   5  10   5   0 ⎟
     ⎜   5  10   5   0 −10 ⎟
     ⎝  10   5   0 −10 −20 ⎠

Solution:
F1: detects thin horizontal lines (a bright horizontal stripe between darker rows).
F2: detects a frame with an empty centre (the surround is weighted positively, the centre negatively).
F3: detects diagonal lines.

Question 2
Consider input images of size 227×227×3 and filters of size 11×11×3 in the first convolution layer. In the case of AlexNet, this first convolution layer has a total of 96 filters, a stride of 4, and a padding of 0.
1. Determine the width/height of the output of this first convolution layer.

Solution:

Output width = (input width − kernel width + 2 × padding width) / stride + 1
             = (227 − 11 + 2 × 0) / 4 + 1
             = 55

Since our input, filter/kernel, and padding are symmetric, we have

Output height = Output width = 55

2. Determine the number of units within the first convolutional layer.

Solution:
Number of units = output width × output height × number of filters
                = 55 × 55 × 96
                = 290,400
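Both results from parts 1 and 2 can be verified by pushing a dummy tensor through the layer in PyTorch (the variable names below are our own):

import torch
import torch.nn as nn

conv1 = nn.Conv2d(in_channels=3, out_channels=96, kernel_size=11, stride=4, padding=0)
x = torch.randn(1, 3, 227, 227)                    # one dummy 227 x 227 x 3 image
out = conv1(x)
print(out.shape)                                   # torch.Size([1, 96, 55, 55])
print(out.shape[1] * out.shape[2] * out.shape[3])  # 290400 units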

3. Determine the number of trainable parameters/weights within the convolutional layer with
weight sharing.

Solution:
Number of trainable weights = number of filters × ((kernel width × kernel height × number of channels) + bias)
                            = 96 × ((11 × 11 × 3) + 1)
                            = 34,944

4. How many parameters would we have if we had used a FF layer with 256 units instead of a
convolutional layer?

Solution:
Number of trainable weights = number of units × ((input width × input height × number of channels) + bias)
                            = 256 × ((227 × 227 × 3) + 1)
                            = 39,574,528
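Again, both counts can be checked in PyTorch by summing the number of elements of each parameter tensor (the layer objects below are ours, used only for the check):

import torch.nn as nn

conv1 = nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=0)
ff = nn.Linear(227 * 227 * 3, 256)

n_conv = sum(p.numel() for p in conv1.parameters())  # 96 * (11*11*3) weights + 96 biases
n_ff = sum(p.numel() for p in ff.parameters())       # 256 * (227*227*3) weights + 256 biases
print(n_conv)   # 34944
print(n_ff)     # 39574528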

Question 3
Now it is time to try Convolutional Neural Networks on real data and compare them with other approaches seen in previous practical sessions.

1. Download and create the MNIST train and validation datasets using torchvision:

from torchvision import datasets, transforms


mnist_train_dataset = datasets.MNIST('../data', download=True, train=True,
transform=mnist_transform)
mnist_val_dataset = datasets.MNIST('../data', download=True, train=False,
transform=mnist_transform)

where

mnist_transform = transforms.Compose([transforms.ToTensor(),
transforms.Normalize((0.1307,), (0.3081,))])

Implement a Logistic Regression using scikit-learn and a Feedforward Neural Network with 3 hidden layers of dimensions 256, 128, and 64 using PyTorch. Train the Feedforward Neural Network using SGD as the optimizer, with a learning rate of 0.01 and a momentum of 0.5, and nn.NLLLoss() as the loss function. These models will be our baselines to compare against the CNN.
Run your implementations of the Logistic Regression and the Feedforward Neural Network on this dataset. Measure the training and validation accuracy.
Plot the coefficients of the Logistic Regression for each class.
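One possible PyTorch sketch of the feedforward baseline is given below; it reuses the mnist_train_dataset defined above, and the class name, batch size, and number of epochs are our own choices, not requirements of the exercise.

import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader

class FeedforwardNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(28 * 28, 256)
        self.fc2 = nn.Linear(256, 128)
        self.fc3 = nn.Linear(128, 64)
        self.out = nn.Linear(64, 10)

    def forward(self, x):
        x = x.view(x.size(0), -1)                 # flatten the 28 x 28 image
        x = F.relu(self.fc1(x))
        x = F.relu(self.fc2(x))
        x = F.relu(self.fc3(x))
        return F.log_softmax(self.out(x), dim=1)  # log-probabilities for nn.NLLLoss

model = FeedforwardNet()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.5)
criterion = nn.NLLLoss()

train_loader = DataLoader(mnist_train_dataset, batch_size=64, shuffle=True)
for epoch in range(3):
    for images, labels in train_loader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()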

2. Implement a CNN with the following structure:

• convolutional layer with 10 output channels, a kernel size of 5 × 5, and default padding and stride.
• max pooling layer with a kernel size of 2 × 2.
• a ReLU activation function.
• convolutional layer with 20 output channels, a kernel size of 5 × 5, and default padding and stride.
• dropout layer with default probability.
• max pooling layer with a kernel size of 2 × 2.
• a ReLU activation function.
• affine transformation with 50 output features.
• a ReLU activation function.
• affine transformation with 10 output features followed by a log_softmax().

Use the Adam optimizer with a learning rate of 0.001 and the negative log-likelihood loss (nn.NLLLoss()).
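A sketch of one way to read this architecture in PyTorch follows; the class name is ours, and the input size of the first affine layer (320 = 20 × 4 × 4) assumes the 28 × 28 MNIST inputs used above.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 10, kernel_size=5)   # default stride=1, padding=0
        self.conv2 = nn.Conv2d(10, 20, kernel_size=5)
        self.drop = nn.Dropout2d()                     # default probability p=0.5
        self.fc1 = nn.Linear(320, 50)                  # 20 channels * 4 * 4 after pooling
        self.fc2 = nn.Linear(50, 10)

    def forward(self, x):
        x = F.relu(F.max_pool2d(self.conv1(x), 2))
        x = F.relu(F.max_pool2d(self.drop(self.conv2(x)), 2))
        x = x.view(x.size(0), -1)
        x = F.relu(self.fc1(x))
        return F.log_softmax(self.fc2(x), dim=1)

model = SimpleCNN()
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
criterion = nn.NLLLoss()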

3. Determine the number of learnable parameters of the Feedforward Neural Network and the
CNN.
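For either model, the count can be obtained by summing the sizes of all parameter tensors that require gradients; the helper below assumes the FeedforwardNet and SimpleCNN sketches above.

def count_parameters(model):
    # Total number of learnable (trainable) parameters in a model.
    return sum(p.numel() for p in model.parameters() if p.requires_grad)

print(count_parameters(FeedforwardNet()))  # 242,762 for the sketch above
print(count_parameters(SimpleCNN()))       # 21,840 for the sketch above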

4. Change the number of hidden layers and the respective numbers of units of the Feedforward Neural Network so that its number of learnable parameters approximates that of the CNN. Run your implementation and measure the training and validation accuracy of your new Neural Network.
Are you able to maintain your performance?

5. What happens to each of the classifiers if, for some reason, the digits in your validation set are not always centered in the input image?
Hint:

mnist_transform_val = transforms.Compose([transforms.ToTensor(),
transforms.RandomAffine(0, translate=[0.1, 0]),
transforms.Normalize((0.1307,), (0.3081,))])
mnist_val_dataset = datasets.MNIST('../data', download=True, train=False,
transform=mnist_transform_val)

What property of CNNs allows them to maintain their performance?
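To measure the effect, the same accuracy computation can be run on both the centered and the shifted validation sets; the helper name below is our own.

import torch
from torch.utils.data import DataLoader

def accuracy(model, dataset, batch_size=256):
    # Fraction of correctly classified examples (the model outputs log-probabilities).
    loader = DataLoader(dataset, batch_size=batch_size)
    correct = 0
    model.eval()
    with torch.no_grad():
        for images, labels in loader:
            preds = model(images).argmax(dim=1)
            correct += (preds == labels).sum().item()
    return correct / len(dataset)

print(accuracy(model, mnist_val_dataset))   # validation set with the random shift applied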
