Machine Learning-Lecture 17 (Student)
… the probability of any given output class.
⚫ How does a convolutional neural network build up this hierarchy?
◼ It combines two specialized types of hidden layers, called convolution
layers and pooling layers.
◼ Convolution layers search for instances of small patterns in the
image, whereas pooling layers downsample these to select a prominent
subset.
⚫ Convolution Layers (the following content is mostly from Deep Learning with
Python, by F. Chollet)
◼ The fundamental difference between a densely connected layer and a
convolution layer is this: Dense layers learn global patterns in their
input feature space, whereas convolution layers learn local patterns
(see figure 5.1):
◼ In the case of images, patterns found in small 2D windows of the inputs. In
the previous example, these windows were all 3 × 3.
◼ Convnets can learn spatial hierarchies of patterns: a first convolution
layer learns small local patterns such as edges, a second
convolution layer will learn larger patterns made of the features of the first
layers, and so on. This allows convnets to efficiently learn increasingly
complex and abstract visual concepts (because the visual world is
fundamentally spatially hierarchical).
https://towardsdatascience.com/types-of-convolution-kernels-simplified-f040cb307c37
Kernel vs Filter
Before we dive into it, I just want to make the distinction between the terms ‘kernel’
and ‘filter’ very clear because I have seen a lot of people use them interchangeably.
A kernel is, as described earlier, a matrix of weights which are multiplied with the
input to extract relevant features. The dimensionality of the kernel matrix is how the
convolution gets its name. For example, in 2D convolutions, the kernel matrix is a
2D matrix.
A filter, however, is a concatenation of multiple kernels, each kernel assigned to a
particular channel of the input. Filters are always one dimension more than the
kernels. For example, in 2D convolutions, filters are 3D matrices (which is
essentially a concatenation of 2D matrices, i.e. the kernels). So for a CNN layer
with kernel dimensions h*w and input channels k, the filter dimensions are
k*h*w.
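To make the kernel/filter distinction concrete, here is a minimal sketch (assuming
TensorFlow/Keras, the library used in Chollet's book) that inspects the weight tensor
of a Conv2D layer. Note that Keras stores the axes as h × w × k × (number of filters)
rather than the article's k*h*w ordering, but the idea is the same: each output filter
stacks one h × w kernel per input channel.

    import tensorflow as tf

    # One conv layer: 32 output filters, 3 x 3 kernels, over a 3-channel input.
    layer = tf.keras.layers.Conv2D(filters=32, kernel_size=(3, 3))
    layer.build(input_shape=(None, 28, 28, 3))  # e.g. 28 x 28 RGB images

    kernels, bias = layer.get_weights()
    print(kernels.shape)  # (3, 3, 3, 32): h, w, input channels k, filters
    print(bias.shape)     # (32,): one bias per output filter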
The following four pictures are from “人工智慧 Artificial Intelligence,” by 張志勇 et al.
◼ A convolution works by sliding these windows of size 3 × 3 or 5 × 5 over the
3D input feature map, stopping at every possible location, and extracting the 3D
patch of surrounding features.
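As a toy illustration of this sliding-window process, the following NumPy sketch
(single channel, stride 1, no padding; the helper name is my own) extracts each
3 × 3 patch and computes its weighted sum with a kernel:

    import numpy as np

    def conv2d_valid(image, kernel):
        """Slide `kernel` over `image`, stopping at every valid location."""
        kh, kw = kernel.shape
        out_h = image.shape[0] - kh + 1  # output shrinks: the border effect
        out_w = image.shape[1] - kw + 1
        out = np.zeros((out_h, out_w))
        for i in range(out_h):
            for j in range(out_w):
                patch = image[i:i + kh, j:j + kw]   # extract the window
                out[i, j] = np.sum(patch * kernel)  # weighted sum of features
        return out

    image = np.arange(25, dtype=float).reshape(5, 5)
    kernel = np.ones((3, 3)) / 9.0            # simple averaging kernel
    print(conv2d_valid(image, kernel).shape)  # (3, 3)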
◼ Note that the output width and height may differ from the input width and
height. They may differ for two reasons:
➢ Border effects, which can be countered by padding the input feature map
➢ The use of strides, which I’ll define in a second
◼ Understanding border effects
➢ Consider a 5 × 5 feature map (25 tiles total). There are only 9 tiles around
which you can center a 3 × 3 window, forming a 3 × 3 grid (see figure 5.5).
Hence, the output feature map will be 3 × 3.
➢ It shrinks a little: by exactly 2 tiles alongside each dimension, in this case.
➢ You can see this border effect in action in the earlier example: you start
with 28 × 28 inputs, which become 26 × 26 after the first convolution layer.
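The shrinkage follows the standard output-size arithmetic. A small helper (my own,
not from the text) makes the numbers above easy to check:

    def conv_output_size(n_in, kernel, padding=0, stride=1):
        # Standard formula: out = floor((n_in + 2*padding - kernel) / stride) + 1
        return (n_in + 2 * padding - kernel) // stride + 1

    print(conv_output_size(5, 3))              # 3: the 5 x 5 -> 3 x 3 example
    print(conv_output_size(28, 3))             # 26: the 28 x 28 -> 26 x 26 example
    print(conv_output_size(28, 3, padding=1))  # 28: padding cancels the border effect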
◼ Understanding convolution strides
➢ The other factor that can influence output size is the notion of strides.
➢ The distance between two successive windows is a parameter of the
convolution, called its stride, which defaults to 1.
➢ It’s possible to have strided convolutions: convolutions with a stride higher
than 1. In figure 5.7, you can see the patches extracted by a 3 × 3
convolution with stride 2 over a 5 × 5 input.
➢ Using stride 2 means the width and height of the feature map are
downsampled by a factor of 2 (in addition to any changes induced by border
effects).
➢ Strided convolutions are rarely used in practice.
➢ To downsample feature maps, instead of strides, we tend to use the
max-pooling operation.
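A quick sketch (again assuming TensorFlow/Keras; the output shapes are the point
here) contrasting the default stride-1 convolution with a stride-2 one over the same
5 × 5 input:

    import tensorflow as tf

    x = tf.random.normal((1, 5, 5, 1))  # one 5 x 5 single-channel input

    conv_s1 = tf.keras.layers.Conv2D(1, 3)             # stride 1 (the default)
    conv_s2 = tf.keras.layers.Conv2D(1, 3, strides=2)  # strided convolution

    print(conv_s1(x).shape)  # (1, 3, 3, 1): border effect only
    print(conv_s2(x).shape)  # (1, 2, 2, 1): width and height roughly halved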
◼ The max-pooling operation
➢ In the convnet example, you may have noticed that the size of the feature
maps is halved after every MaxPooling2D layer.
➢ For instance, before the first MaxPooling2D layer, the feature map is 26 ×
26, but the max-pooling operation halves it to 13 × 13.
➢ That’s the role of max pooling: to aggressively downsample feature maps,
much like strided convolutions.
➢ Max pooling consists of extracting windows from the input feature maps
and outputting the max value of each channel.
➢ A big difference from convolution is that max pooling is usually done with
2 × 2 windows and stride 2, in order to downsample the feature maps by
a factor of 2.
➢ On the other hand, convolution is typically done with 3 × 3 windows and no
stride (stride 1).
➢ Note that max pooling isn’t the only way you can achieve such
downsampling. As you already know, you can also use strides in the prior
convolution layer.
➢ But max pooling tends to work better than these alternative solutions.
➢ It’s more informative to look at the maximal presence of different features
than at their average presence.
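A toy NumPy sketch of 2 × 2 max pooling with stride 2 (single channel; the helper
name is my own) matching the description above:

    import numpy as np

    def max_pool2d(feature_map, size=2, stride=2):
        out_h = (feature_map.shape[0] - size) // stride + 1
        out_w = (feature_map.shape[1] - size) // stride + 1
        out = np.zeros((out_h, out_w))
        for i in range(out_h):
            for j in range(out_w):
                window = feature_map[i * stride:i * stride + size,
                                     j * stride:j * stride + size]
                out[i, j] = window.max()  # keep only the maximal response
        return out

    fm = np.random.rand(26, 26)
    print(max_pool2d(fm).shape)  # (13, 13): halved, as in the convnet example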