0% found this document useful (0 votes)

11 views34 pages

Lecture2 CNN Network Design

The document outlines an introductory course on applied machine learning, specifically focusing on Convolutional Neural Networks (CNN) for image classification. It discusses the architecture of CNNs, including the importance of receptive fields, parameter sharing, and the simplifications that enhance the model's efficiency in detecting patterns. The course is taught by Dr. Tao Han at the New Jersey Institute of Technology, utilizing materials inspired by Prof. Hung-yi Lee’s courses.

Uploaded by

ra734

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views34 pages

Lecture2 CNN Network Design

Uploaded by

ra734

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

ECE 498:

Introduction to Applied Machine Learning

• Tao Han, Ph.D.

• Associate Professor
• Electrical and Computer Engineering
• Newark College of Engineering
• New Jersey Institute of Technology

• https://tao-han-njit.netlify.app

Slides are designed based on Prof. Hung-yi Lee’s Machine Learning courses at National Taiwan University
Network Architecture Design:

Convolutional Neural Networks (CNN)

Image Classification
⋮ ⋮
0.2 dog 0
0.7 cat 1
0.1 tree 0
⋮ ⋮
Model 𝒚′ ෝ
𝒚
Cross
entropy

100 x 100

(All the images to be classified have the

same size.)

3
Image Classification
3 channels 100 x 100

3-D 100
tensor
100 x 100

100 x 100 100

100 x 100
value represents intensity

4
Fully Connected Network

100 x 100 𝑥𝑖 ……

……
𝑥𝑗 3 x 107 ……
100 x 100
……

𝑥𝑘 ……
100 x 100 x 3 1000
100 x 100

Do we really need “fully connected”

in image processing?
5
Observation 1
Identifying some critical patterns

Input Layer 1 Layer 2

x1 ……
x2 Bird?
……
……

……

……
xN ……

Perhaps human also identify birds in a similar way … ☺

6
7
Observation 1 A neuron does not have to
see the whole image.
Need to see the Input Layer 1 Layer 2
whole image? x ……
1

x2 bird
……
……

……

……
xN ……
basic advanced
detector detector
Some patterns are much smaller than the whole image.
8
3x3x3
Simplification 1 weights

…...
3x3

bias
1

…...
Receptive 1 0 0 0 0 1 3x3
11 00 00 00 00 11
field 0 1 0 0 1 0
00 11 00 00 11 00
0 0 1 1 0 0
00 00 11 11 00 00
1 0 0 0 1 0
11 00 00 00 11 00

…...
0 1 0 0 1 0 3x3
00 11 00 00 11 00
0 0 1 0 1 0
00 00 11 00 11 00

9
• Can different neurons have
different sizes of receptive field?
Simplification 1 • Cover only some channels?
• Not square receptive field?

3 x 3 x 3 weights
the same
1 0 0 0 0 1 receptive field
Receptive 11 00 00 00 00 11
field 0 1 0 0 1 0
00 11 00 00 11 00
0 0 1 1 0 0
00 00 11 11 00 00
1 0 0 0 1 0
11 00 00 00 11 00
0 1 0 0 1 0 Can be
00 11 00 00 11 00
0 0 1 0 1 0 overlapped
00 00 11 00 11 00

10
Simplification 1 – Typical Setting
Each receptive field has a set of neurons (e.g., 64 neurons).

stride = 2 overlap
all channels
1 0 0 0 0 1
11 00 00 00 00 11
kernel size 0 1 0 0 1 0 padding
00 11 00 00 11 00
(e.g., 3 x 3) 0 0 1 1 0 0
00 00 11 11 00 00
1 0 0 0 1 0
11 00 00 00 11 00
0 1 0 0 1 0 The receptive fields
00 11 00 00 11 00
0 0 1 0 1 0 cover the whole
00 00 11 00 11 00 image.
11
Observation 2
• The same patterns appear in different regions.
I detect “beak” in
my receptive field.

Each receptive field

needs a “beak” detector?

I detect “beak” in
my receptive field.
12
3x3x3
Simplification 2 weights

…...
bias
1 0 0 0 0 1 1
11 00 00 00 00 11

…
0 1 0 0 1 0 parameter sharing
00 11 00 00 11 00
0 0 1 1 0 0
00 00 11 11 00 00
1 0 0 0 1 0 3x3x3
11 00 00 00 11 00 weights

…...
0 1 0 0 1 0
00 11 00 00 11 00
0 0 1 0 1 0
00 00 11 00 11 00
bias
1
…

13
𝑥1 𝜎 𝑤1 𝑥1 + 𝑤2 𝑥2 + ⋯
𝑥2 𝑤1
Simplification 2

…...
𝑤2

bias
1 0 0 0 0 1 1
11 00 00 00 00 11

…
0 1 0 0 1 0
00 11 00 00 11 00
0 0 1 1 0 0 𝑥1′ 𝜎 𝑤1 𝑥1′ + 𝑤2 𝑥2′ + ⋯
00 00 11 11 00 00
1 0 0 0 1 0
11 00 00 00 11 00 𝑥2′ 𝑤1

…...
0 1 0 0 1 0 𝑤2
00 11 00 00 11 00
0 0 1 0 1 0
00 00 11 00 11 00
bias
Two neurons with the same receptive
1
field would not share parameters.
…

14
Simplification 2 – Typical Setting
Each receptive field has a set of neurons (e.g., 64 neurons).

1 0 0 0 0 1
11 00 00 00 00 11
0 1 0 0 1 0
00 11 00 00 11 00
0 0 1 1 0 0
00 00 11 11 00 00
1 0 0 0 1 0
11 00 00 00 11 00
0 1 0 0 1 0

……
00 11 00 00 11 00
……

0 0 1 0 1 0
00 00 11 00 11 00
15
Simplification 2 – Typical Setting
Each receptive field has a set of neurons (e.g., 64 neurons).
Each receptive field has the neurons with the same set of
parameters.

filter 1 1 0 0 0 0 1 filter 1
11 00 00 00 00 11
filter 2 0 1 0 0 1 0 filter 2
00 11 00 00 11 00
0 0 1 1 0 0
filter 3 00 00 11 11 00 00 filter 3
1 0 0 0 1 0
filter 4 11 00 00 00 11 00 filter 4
0 1 0 0 1 0

……
00 11 00 00 11 00
……

0 0 1 0 1 0
00 00 11 00 11 00
16
Benefit of Convolutional Layer
Fully Connected Layer Jack of all trades,
master of none
Receptive Field

Parameter Sharing
Convolutional Layer Larger model bias
(for image)

• Some patterns are much smaller than the whole image.

• The same patterns appear in different regions.
17
Another story based on filter ☺
Convolutional Layer

Filter 1
3 x 3 x channel
tensor

Convolution
Filter 2
3 x 3 x channel
tensor
……

……

channel = 3 (colorful) Each filter detects a small

channel = 1 (black and white) pattern (3 x 3 x channel).
18
Consider channel = 1
Convolutional Layer (black and white image)

1 -1 -1
1 0 0 0 0 1 -1 1 -1 Filter 1
0 1 0 0 1 0 -1 -1 1
0 0 1 1 0 0
-1 1 -1
1 0 0 0 1 0 Filter 2
-1 1 -1
0 1 0 0 1 0
-1 1 -1
0 0 1 0 1 0

6 x 6 image ……
(The values in the filters
are unknown parameters.)
19
1 -1 -1
Convolutional Layer -1 1 -1 Filter 1
-1 -1 1
stride=1

1 0 0 0 0 1
0 1 0 0 1 0 3 -1 -3 -1
0 0 1 1 0 0
1 0 0 0 1 0 -3 1 0 -3
0 1 0 0 1 0
0 0 1 0 1 0 -3 -3 0 1

6 x 6 image 3 -2 -2 -1

20
-1 1 -1
Convolutional Layer -1 1 -1 Filter 2
-1 1 -1
stride=1 Do the same process for
1 0 0 0 0 1 every filter
0 1 0 0 1 0 3 -1 -3 -1
-1 -1 -1 -1
0 0 1 1 0 0
1 0 0 0 1 0 -3 1 0 -3
-1 -1 -2 1
0 1 0 0 1 0 Feature
0 0 1 0 1 0 -3 -3 Map0 1
-1 -1 -2 1
6 x 6 image 3 -2 -2 -1
-1 0 -4 3

21
Convolutional Layer 3
-1
-1
-1
-3
-1
-1
-1
-3 1 0 -3
-1 -1 -2 1

-3 -3 0 1
-1 -1 -2 1
3 -2 -2 -1
-1 0 -4 3
64
Convolution
filters “Image” with 64 channels

Convolution
……
Multiple
3 -1 -3 -1
Convolutional Layers -1 -1 -1 -1
-3 1 0 -3
-1 -1 -2 1

-3 -3 0 1
-1 -1 -2 1
3 -2 -2 -1
-1 0 -4 3
64
Convolution
filters “Image” with 64 channels

Convolution
Filter:
3 x 3 x 64
……

64 23
1 0 0 0 0 1
Multiple
0 1 0 0 1 0
Convolutional Layers
0 0 1 1 0 0
1 0 0 0 1 0
0 1 0 0 1 0
0 0 1 0 1 0

64 3 -1 -3 -1
Convolution -1 -1 -1 -1
filters
-3 1 0 -3
-1 -1 -2 1
Convolution
-3 -3 0 1
-1 -1 -2 1
……

3 -2 -2 -1
-1 0 -4 3
24
Comparison of Two Stories

1 -1 -1 Filter
…...

-1 1 -1 3 x 3 x channel
-1 -1 1 tensor

Receptive
field (ignore bias in this slide)

25
The neurons with different receptive

…...
fields share the parameters.

bias
1 0 0 0 0 1 1
11 00 00 00 00 11

…
0 1 0 0 1 0
00 11 00 00 11 00
0 0 1 1 0 0
00 00 11 11 00 00
1 0 0 0 1 0
11 00 00 00 11 00

…...
0 1 0 0 1 0
00 11 00 00 11 00
0 0 1 0 1 0
00 00 11 00 11 00
bias
Each filter convolves over the 1
…

input image. 26
Convolutional Layer

Neuron Version Story Filter Version Story

Each neuron only considers There are a set of filters

a receptive field. detecting small patterns.

The neurons with different

Each filter convolves
receptive fields share the
over the input image.
parameters.

They are the same story.

27
Observation 3
• Subsampling the pixels will not change the object

bird
bird

subsampling

28
Pooling – Max Pooling
1 -1 -1 -1 1 -1
-1 1 -1 Filter 1 -1 1 -1 Filter 2
-1 -1 1 -1 1 -1

3 -1 -3 -1 -1 -1 -1 -1

-3 1 0 -3 -1 -1 -2 1

-3 -3 0 1 -1 -1 -2 1

3 -2 -2 -1 -1 0 -4 3
29
Convolutional Layers
3 -1 -3 -1
+ Pooling -1 -1 -1 -1
-3 1 0 -3
-1 -1 -2 1

-3 -3 0 1
-1 -1 -2 1
3 -2 -2 -1
-1 0 -4 3
Convolution
“Image” with 64 channels
Repeat

Pooling 3 0
-1 1

3 1
……

0 3
30
The whole CNN
cat dog ……
Convolution
softmax

Pooling

Fully Connected
Layers Convolution

Pooling

Flatten 31
Application: Playing Go

Next move
Network (19 x 19
positions)
19 x 19 classes
19 x 19 matrix
19(image)
x 19 vector
Black: 1 Fully-connected
48 channels network can be used
white: -1
in Alpha Go
none: 0 But CNN performs much better.
32
Why CNN for Go playing?
• Some patterns are much smaller than the whole
image

Alpha Go uses 5 x 5 for first layer

• The same patterns appear in different regions.

33
More Applications

Speech/Signal
Processing
https://dl.acm.org/doi/10.110
9/TASLP.2014.2339736

05introduction To Convolutional Neural Networks
No ratings yet
05introduction To Convolutional Neural Networks
72 pages
Handwriting To Text Conversion
No ratings yet
Handwriting To Text Conversion
7 pages
Deep Learning Quiz Answers 1-50
No ratings yet
Deep Learning Quiz Answers 1-50
4 pages
AI Old Handbook Class X
No ratings yet
AI Old Handbook Class X
131 pages
02 CNN Slides
No ratings yet
02 CNN Slides
77 pages
L11 Learning III Neural Network Architectures
No ratings yet
L11 Learning III Neural Network Architectures
35 pages
Project File Alzheimer's Disease
No ratings yet
Project File Alzheimer's Disease
22 pages
Seminar
No ratings yet
Seminar
10 pages
K-Max Pooling Operation
No ratings yet
K-Max Pooling Operation
134 pages
CNN (Neural Network)
No ratings yet
CNN (Neural Network)
32 pages
ML 11
No ratings yet
ML 11
62 pages
CNN Basic Beak of Bird
100% (1)
CNN Basic Beak of Bird
20 pages
Computer Vision: Field of AI That Enables Computers To Derive Meaningful Information From
No ratings yet
Computer Vision: Field of AI That Enables Computers To Derive Meaningful Information From
26 pages
PNAL9 CNNs
No ratings yet
PNAL9 CNNs
61 pages
Deep Learning LectureCNN
No ratings yet
Deep Learning LectureCNN
28 pages
Week6 - Intro To Convolutional Neural Networks
No ratings yet
Week6 - Intro To Convolutional Neural Networks
25 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
102 pages
Final Report
No ratings yet
Final Report
56 pages
Rigorous Analysis of Data Orthogonalization For Self-Organizing Maps in Machine Learning Cyber Intrusion Detection For LoRa Sensors
No ratings yet
Rigorous Analysis of Data Orthogonalization For Self-Organizing Maps in Machine Learning Cyber Intrusion Detection For LoRa Sensors
20 pages
Images and Convolutional Neural Networks: Practical Deep Learning
No ratings yet
Images and Convolutional Neural Networks: Practical Deep Learning
34 pages
ML 2
No ratings yet
ML 2
70 pages
Sarma CNN Vce Oct 2022
No ratings yet
Sarma CNN Vce Oct 2022
63 pages
DL Unit-Ii
No ratings yet
DL Unit-Ii
34 pages
Probability Models
No ratings yet
Probability Models
4 pages
앵무새감정 분류
No ratings yet
앵무새감정 분류
6 pages
A Multimodal Driver Emotion Recognition Algorithm Based On The Audio and Video Signals in Internet of Vehicles Platform
No ratings yet
A Multimodal Driver Emotion Recognition Algorithm Based On The Audio and Video Signals in Internet of Vehicles Platform
12 pages
Convolutional Neural Networks - Part 2
No ratings yet
Convolutional Neural Networks - Part 2
49 pages
Chapter Convolutional Neural Networks
No ratings yet
Chapter Convolutional Neural Networks
7 pages
CNNs
No ratings yet
CNNs
88 pages
Aiml Ece Unit-5
No ratings yet
Aiml Ece Unit-5
48 pages
Convolutional Networks
No ratings yet
Convolutional Networks
37 pages
Genconvit: Deepfake Video Detection Using Generative Convolutional Vision Transformer
No ratings yet
Genconvit: Deepfake Video Detection Using Generative Convolutional Vision Transformer
10 pages
A Computer Vision Based Approach For Driver Distraction Recognition Using Deep Learning and Genetic Algorithm Based Ensemble
No ratings yet
A Computer Vision Based Approach For Driver Distraction Recognition Using Deep Learning and Genetic Algorithm Based Ensemble
12 pages
Deep Neural Networks and Tabular Data: A Survey
No ratings yet
Deep Neural Networks and Tabular Data: A Survey
22 pages
Convolutional Networks 2024
No ratings yet
Convolutional Networks 2024
44 pages
Video Processing
No ratings yet
Video Processing
40 pages
Intro To CNN
No ratings yet
Intro To CNN
93 pages
DEEP LEARNING Unit-2 NOTES For Post Graduation
No ratings yet
DEEP LEARNING Unit-2 NOTES For Post Graduation
11 pages
Convolution Neural Networks: S. Sumitra Department of Mathematics Indian Institute of Space Science and Technology
No ratings yet
Convolution Neural Networks: S. Sumitra Department of Mathematics Indian Institute of Space Science and Technology
123 pages
Unit 3
No ratings yet
Unit 3
105 pages
Convolutional Neural Networks - Part 1
No ratings yet
Convolutional Neural Networks - Part 1
44 pages
Deep 2
No ratings yet
Deep 2
57 pages
Harnessing Deep Learning For Early Breast Cancer Diagnosis
No ratings yet
Harnessing Deep Learning For Early Breast Cancer Diagnosis
19 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
11 pages
CSE
No ratings yet
CSE
20 pages
CNN2
No ratings yet
CNN2
70 pages
None 1b0764e7
No ratings yet
None 1b0764e7
7 pages
Lec5 CNN RNN Attention
No ratings yet
Lec5 CNN RNN Attention
71 pages
Music Source Separation Presentation - PPSX
No ratings yet
Music Source Separation Presentation - PPSX
6 pages
NN Bnu1
No ratings yet
NN Bnu1
31 pages
Facial Final Mini
No ratings yet
Facial Final Mini
38 pages
CNNs
No ratings yet
CNNs
22 pages
Introduction To AI and ML
No ratings yet
Introduction To AI and ML
22 pages
Aiml Ece Unit-5
No ratings yet
Aiml Ece Unit-5
48 pages
Lecture 3
No ratings yet
Lecture 3
48 pages
DL Unit Iv
No ratings yet
DL Unit Iv
18 pages
CNN Slides PDF
No ratings yet
CNN Slides PDF
81 pages
MLT UNIT-4 & 5 Imp Sol
No ratings yet
MLT UNIT-4 & 5 Imp Sol
22 pages
Draft Syllabus of B.E. Sem VII & VIII Biomedical Engg.
No ratings yet
Draft Syllabus of B.E. Sem VII & VIII Biomedical Engg.
60 pages
CS601 Machine Learning Unit 3
No ratings yet
CS601 Machine Learning Unit 3
47 pages
Pattern Recognition
No ratings yet
Pattern Recognition
14 pages
AE556 2024 Topic4 CNN
No ratings yet
AE556 2024 Topic4 CNN
26 pages
Module-4 DL
No ratings yet
Module-4 DL
22 pages
Yoga-82: A New Dataset For Fine-Grained Classification of Human Poses
No ratings yet
Yoga-82: A New Dataset For Fine-Grained Classification of Human Poses
9 pages
Fast and Accurate Traffic Sign Recognition For Self Driving Cars Using RetinaNet Based Detector
No ratings yet
Fast and Accurate Traffic Sign Recognition For Self Driving Cars Using RetinaNet Based Detector
7 pages
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
No ratings yet
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
64 pages
Guidelines - Deep Learning
No ratings yet
Guidelines - Deep Learning
2 pages
Ml@ok Questions
No ratings yet
Ml@ok Questions
16 pages
21CS743 Module4 Notes
No ratings yet
21CS743 Module4 Notes
15 pages
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
No ratings yet
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
44 pages
Emotion-Based Music Recommendation System
No ratings yet
Emotion-Based Music Recommendation System
5 pages
Lab 5 - Intro To Convolutional Neural Networks
No ratings yet
Lab 5 - Intro To Convolutional Neural Networks
52 pages
Unit - 2
No ratings yet
Unit - 2
51 pages
Visual and Audio Signal Processing Lab University of Wollongong
No ratings yet
Visual and Audio Signal Processing Lab University of Wollongong
20 pages
CV PPT Mt101
No ratings yet
CV PPT Mt101
16 pages
Guddu Jha - Organized
No ratings yet
Guddu Jha - Organized
3 pages
HODL Lec 3 DNNs For Vision 1
No ratings yet
HODL Lec 3 DNNs For Vision 1
36 pages
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
No ratings yet
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
26 pages
Matconvnet: Convolutional Neural Networks For Matlab
No ratings yet
Matconvnet: Convolutional Neural Networks For Matlab
55 pages
Deep Learning Notes For Easy Access
No ratings yet
Deep Learning Notes For Easy Access
14 pages
Electronics 11 02162
No ratings yet
Electronics 11 02162
18 pages
Research Article: Indian Classical Dance Action Identification and Classification With Convolutional Neural Networks
No ratings yet
Research Article: Indian Classical Dance Action Identification and Classification With Convolutional Neural Networks
11 pages
Matconvnet Manual
No ratings yet
Matconvnet Manual
59 pages
Ch3 CNN
No ratings yet
Ch3 CNN
64 pages
Spiro Project Titles 2021 2022
No ratings yet
Spiro Project Titles 2021 2022
41 pages
Image Recognition Using Neural Networks
No ratings yet
Image Recognition Using Neural Networks
18 pages
SAS Viya 3.5 New Features Updated 10082019
No ratings yet
SAS Viya 3.5 New Features Updated 10082019
38 pages
Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
Convolutional Neural Networks: Fundamentals and Applications for Analyzing Visual Imagery
From Everand
Convolutional Neural Networks: Fundamentals and Applications for Analyzing Visual Imagery
Fouad Sabry
No ratings yet
Convolutional Neural Networks in Python: Beginner's Guide to Convolutional Neural Networks in Python
From Everand
Convolutional Neural Networks in Python: Beginner's Guide to Convolutional Neural Networks in Python
Frank Millstein
No ratings yet