CNN
Convolutional Neural Networks
2018 / 02 / 23
Buzzword: CNN
Convolutional neural networks (CNN, ConvNet) are a class of deep,
feed-forward (not recurrent) artificial neural networks that are applied to
analyzing visual imagery.
Buzzword: CNN
● Convolution
From Wikipedia, the standard definition is shown below.
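For reference, the continuous and discrete convolution of two functions f and g (the definition the slide cites from Wikipedia) can be written as:

(f * g)(t) = \int_{-\infty}^{\infty} f(\tau) \, g(t - \tau) \, d\tau

(f * g)[n] = \sum_{m=-\infty}^{\infty} f[m] \, g[n - m]

In a CNN, the filter plays the role of g and is slid over the input f.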
Buzzword: CNN
● Neural Networks
Background: Visual Signal Perception
Background: Signal Relay
Starting from the V1 primary visual cortex, the visual signal is transmitted upward, becoming
more complex and abstract at each stage.
Background: Neural Networks
Convolutional neural networks are usually composed of a set of layers that can be
grouped by their functionality.
Sample Architecture
Convolution Layer
● The process is a 2D convolution on the inputs.
● The “dot products” between weights and inputs are “integrated” across “channels”.
● Filter weights are shared across receptive fields. Each filter has the same number of channels as the input volume, and the output volume has the same “depth” as the number of filters. A minimal sketch follows.
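A minimal NumPy sketch of this operation (strictly a cross-correlation, as in most deep-learning frameworks; the shapes, names, and valid-padding choice are illustrative assumptions):

import numpy as np

def conv_layer(x, filters):
    """Naive valid-padding convolution layer.

    x:        input volume,  shape (H, W, C_in)
    filters:  filter bank,   shape (K, K, C_in, C_out)
    returns:  output volume, shape (H-K+1, W-K+1, C_out)
    """
    H, W, C_in = x.shape
    K, _, _, C_out = filters.shape
    out = np.zeros((H - K + 1, W - K + 1, C_out))
    for i in range(H - K + 1):
        for j in range(W - K + 1):
            patch = x[i:i + K, j:j + K, :]  # one receptive field
            for f in range(C_out):
                # dot product "integrated" across all input channels
                out[i, j, f] = np.sum(patch * filters[:, :, :, f])
    return out

x = np.random.randn(32, 32, 3)   # e.g., a small RGB image
w = np.random.randn(5, 5, 3, 8)  # 8 filters, each 5x5x3
y = conv_layer(x, w)             # shape (28, 28, 8): depth = number of filters

Each output channel reuses one filter across every receptive field, which is exactly what makes the weights “shared”.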
Activation Layer
● Used to increase the non-linearity of the network without affecting the receptive fields of the conv layers.
● Prefer ReLU: it results in faster training.
● LeakyReLU addresses the vanishing-gradient (“dying ReLU”) problem by keeping a small gradient for negative inputs.
Other types: Leaky ReLU, Randomized Leaky ReLU, Parameterized ReLU, Exponential Linear Units (ELU), Scaled Exponential Linear Units (SELU), Tanh, hardtanh, softtanh, softsign, softmax, softplus...
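A quick NumPy sketch of the two activations named above (illustrative):

import numpy as np

def relu(x):
    # zeroes out all negative activations
    return np.maximum(0.0, x)

def leaky_relu(x, alpha=0.01):
    # keeps a small slope (alpha) for negative inputs,
    # so the gradient never goes fully to zero
    return np.where(x > 0, x, alpha * x)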
Softmax
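For reference, softmax maps a vector of scores z in R^K to a probability distribution:

\mathrm{softmax}(z)_i = \frac{e^{z_i}}{\sum_{j=1}^{K} e^{z_j}}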
Pooling Layer
● Convolutional layers provide activation maps.
● The pooling layer applies non-linear downsampling to the activation maps.
● Pooling is aggressive (it discards information); the trend is to use smaller filter sizes or to abandon pooling altogether. A sketch of max pooling follows.
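A minimal max-pooling sketch in NumPy (2x2 window, stride 2; names and shapes are illustrative assumptions):

import numpy as np

def max_pool(x, size=2, stride=2):
    """Max pooling over an activation volume of shape (H, W, C)."""
    H, W, C = x.shape
    out_h = (H - size) // stride + 1
    out_w = (W - size) // stride + 1
    out = np.zeros((out_h, out_w, C))
    for i in range(out_h):
        for j in range(out_w):
            window = x[i*stride:i*stride+size, j*stride:j*stride+size, :]
            out[i, j, :] = window.max(axis=(0, 1))  # keep only the strongest activation
    return out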
FC Layer
● A regular neural network.
● Can be viewed as the final learning phase, which maps the extracted visual features to the desired outputs.
● Usually suited to classification/encoding tasks.
● A common output is a vector, which is then passed through softmax to represent the confidence of classification.
● The outputs can also be used as a “bottleneck”.
In the example above, the FC layer generates a single number, which is then passed through a sigmoid to represent the grasp success probability.
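A sketch of an FC layer followed by softmax (NumPy; the 512-dimensional feature vector and 10 classes are illustrative assumptions):

import numpy as np

def fc_layer(x, W, b):
    # fully connected: every input feature connects to every output unit
    return x @ W + b

def softmax(z):
    z = z - z.max()          # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

features = np.random.randn(512)            # flattened conv features (assumed size)
W = 0.01 * np.random.randn(512, 10)        # weights for 10 output classes (assumed)
b = np.zeros(10)
probs = softmax(fc_layer(features, W, b))  # class confidences; sums to 1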
Loss Layer
● L1, L2 loss
● Cross-entropy loss (works well for classification, e.g., image classification); binary and general cases are shown below
● Hinge loss
● Huber loss: more resilient to outliers, with a smooth gradient
● Mean Squared Error (works well for regression tasks, e.g., Behavioral Cloning)
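The binary and general cross-entropy forms referenced above, for a true label y and predicted probability \hat{y}:

Binary case:
L = -\left[ y \log \hat{y} + (1 - y) \log(1 - \hat{y}) \right]

General case (K classes, one-hot y):
L = -\sum_{k=1}^{K} y_k \log \hat{y}_k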
Regularization
● L1 / L2
● Dropout
● Batch norm
● Gradient clipping
● Max norm constraint
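Of the techniques listed, dropout is the easiest to sketch; a minimal inverted-dropout implementation in NumPy (the rate p=0.5 is an illustrative default):

import numpy as np

def dropout(x, p=0.5, training=True):
    """Inverted dropout: randomly zero activations during training.

    Scaling the surviving activations by 1/(1-p) keeps their expected
    value unchanged, so no rescaling is needed at test time.
    """
    if not training:
        return x
    mask = (np.random.rand(*x.shape) >= p) / (1.0 - p)
    return x * mask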
Software 1.0 is what we’re all familiar with — it is written in languages such as Python,
C++, etc. It consists of explicit instructions to the computer written by a programmer. By
writing each line of code, the programmer is identifying a specific point in program space
with some desirable behavior.
Software 2.0 is written in neural network weights. No human is involved in writing this
code because there are a lot of weights (typical networks might have millions). Instead,
we specify some constraints on the behavior of a desirable program (e.g., a dataset of
input-output pairs of examples) and use the computational resources at our disposal to
search the program space for a program that satisfies the constraints.
Is CNN the Answer?
Capsule?