Building Convolutional Neural Networks For Image Classification Slides

1. Convolutional neural networks (CNNs) are commonly used for image classification. They contain alternating convolutional and pooling layers, followed by fully connected layers that output class probabilities.
2. CNNs apply filters to local regions of input images to extract features; zero-padding and stride size determine the size of the output feature maps.
3. Batch normalization is applied before the activation function to help address the vanishing/exploding gradient problem and to speed up training of CNNs.


Building Convolutional Neural Networks for Image Classification

Janani Ravi
CO-FOUNDER, LOONYCORN
www.loonycorn.com
Narrow and wide convolution
Zero-padding and the feature map sizes
Overview of convolutional layers
Calculating feature map dimensions
Batch normalization of input images
Building and training a CNN for image classification
Changing model hyperparameters
Convolutional Neural Networks
Two Kinds of Layers in CNNs

Convolution: local receptive field
Pooling: subsampling of inputs
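
A minimal sketch (assuming PyTorch, with illustrative channel counts and image size) of both layer kinds in code:

import torch
import torch.nn as nn

conv = nn.Conv2d(in_channels=1, out_channels=8, kernel_size=3)  # 3x3 local receptive field
pool = nn.MaxPool2d(kernel_size=2)                              # subsample by a factor of 2

x = torch.randn(1, 1, 28, 28)   # one 28x28 single-channel image
features = conv(x)              # shape: (1, 8, 26, 26)
subsampled = pool(features)     # shape: (1, 8, 13, 13)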
Typical CNN Architecture

[Diagram: Convolutional (ReLU) -> Pooling -> Convolutional (ReLU)]

Alternating convolutional and pooling layers

Typical CNN Architecture

[Diagram: Convolutional (ReLU) -> Pooling -> Convolutional (ReLU)]

This entire set of layers is then fed into a regular, feed-forward NN
Typical CNN Architecture

[Diagram: CNN layers (convolutional with ReLU, pooling) feed into fully connected feed-forward layers, ending in a SoftMax prediction layer that emits P(Y = 0) ... P(Y = 9)]

This is the output layer, emitting probabilities
Typical CNN Architecture

[Diagram: input image -> CNN -> output probabilities P(Y = 0) ... P(Y = 9)]

Input is an image; outputs are probabilities
Feature Maps

[Diagram: a 3 x 3 filter
1 0 1
0 1 0
1 0 1
slides over the image pixels to produce the feature map]
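
A minimal sketch (assuming PyTorch; the 6 x 6 image values are random placeholders) of applying this exact filter to produce a feature map:

import torch
import torch.nn.functional as F

kernel = torch.tensor([[1., 0., 1.],
                       [0., 1., 0.],
                       [1., 0., 1.]]).reshape(1, 1, 3, 3)

image = torch.rand(1, 1, 6, 6)        # a single-channel 6x6 "image"
feature_map = F.conv2d(image, kernel) # each output value sums the pixels under the 1s
print(feature_map.shape)              # torch.Size([1, 1, 4, 4])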
Zero-padding, Stride Size
Narrow vs. Wide Convolution

[Diagram: input matrix, i.e. image, and its convolution result]

Narrow Convolution: little zero padding; output narrower than input
Wide Convolution: lots of zero padding; output wider than input
Without Zero Padding

Input matrix (6 x 6):
0    0    0    0    0    0
0.2  0.8  0    0.3  0.6  0
0.2  0.9  0    0.3  0.8  0
0.3  0.8  0.7  0.8  0.9  0
0    0    0    0.2  0.8  0
0    0    0    0.2  0.2  0

Convolution matrix (3 x 3 kernel):
1 0 1
0 1 0
1 0 1

Result: a 4 x 4 feature map, narrower than the 6 x 6 input
Zero Padding

The same 6 x 6 input matrix, padded with two rows and columns of zeros on every side, becomes 10 x 10.

Convolution matrix (3 x 3 kernel):
1 0 1
0 1 0
1 0 1

Result: an 8 x 8 feature map, wider than the original 6 x 6 input
Zero Padding

The same 6 x 6 input matrix, padded with three rows and columns of zeros on every side, becomes 12 x 12.

Convolution matrix (3 x 3 kernel):
1 0 1
0 1 0
1 0 1

Result: a 10 x 10 feature map
Zero Padding

With zero-padding, every element of the input matrix is passed into the filter
- You can decide the number of zero rows and columns to pad with
- Use it to get output larger than the input
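
A minimal sketch (assuming PyTorch) of the effect, reusing the 6 x 6 input and 3 x 3 kernel dimensions from the examples above:

import torch
import torch.nn as nn

x = torch.rand(1, 1, 6, 6)

narrow = nn.Conv2d(1, 1, kernel_size=3, padding=0)  # no zero-padding
wide = nn.Conv2d(1, 1, kernel_size=3, padding=2)    # two rings of zeros

print(narrow(x).shape)  # torch.Size([1, 1, 4, 4]) -- narrower than the input
print(wide(x).shape)    # torch.Size([1, 1, 8, 8]) -- wider than the input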


Stride Size

[Diagram: the 3 x 3 kernel slides over the 6 x 6 input matrix, advancing one column at a time (horizontal stride of 1) and one row at a time (vertical stride of 1)]

Stride size is an important hyperparameter in CNNs
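
A minimal sketch (assuming PyTorch) of how this hyperparameter changes the output, again with a 6 x 6 input and a 3 x 3 kernel:

import torch
import torch.nn as nn

x = torch.rand(1, 1, 6, 6)

stride1 = nn.Conv2d(1, 1, kernel_size=3, stride=1)  # kernel advances one pixel at a time
stride2 = nn.Conv2d(1, 1, kernel_size=3, stride=2)  # kernel skips every other position

print(stride1(x).shape)  # torch.Size([1, 1, 4, 4])
print(stride2(x).shape)  # torch.Size([1, 1, 2, 2])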
Batch Normalization
Training via Back Propagation

[Diagram: an ML-based classifier built from layers that detect pixels -> edges -> corners -> object parts; the error is fed back through an optimizer]
Vanishing and Exploding Gradients

Back propagation fails if
- gradients are vanishing
- gradients are exploding
Vanishing Gradient Problem

[Diagram: loss surface over weights W and bias b, from the initial value of loss toward the smallest value; the gradient becomes zero and stops changing before reaching the minimum]
Exploding Gradient Problem

[Diagram: the same loss surface; the gradient changes abruptly and "explodes" instead of descending to the smallest value of loss]
Coping with Vanishing/Exploding Gradients

- Proper initialization
- Non-saturating activation function
- Batch normalization
- Gradient clipping
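
As one concrete illustration, a minimal sketch (assuming PyTorch; the model and data here are placeholders) of gradient clipping inside a training step:

import torch
import torch.nn as nn

model = nn.Linear(10, 2)                       # placeholder model
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

x = torch.randn(4, 10)                         # placeholder batch
y = torch.tensor([0, 1, 0, 1])

optimizer.zero_grad()
loss = loss_fn(model(x), y)
loss.backward()
# Rescale gradients so their overall norm never exceeds 1.0,
# preventing any single update from "exploding"
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()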


Batch Normalization

Just before applying the activation function:
- First, "normalize" inputs
- Second, "scale and shift" inputs
Batch Normalization

"Normalize" inputs
- subtract the mean
- divide by the standard deviation

"Scale and shift" inputs
- scale = multiply by a constant
- shift = add a constant
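
A minimal sketch of these two steps written out by hand (assuming PyTorch tensors; gamma, beta, and the epsilon are illustrative names for the learned scale, the learned shift, and a small numerical-stability constant):

import torch

x = torch.randn(32, 8)                 # a batch of 32 examples, 8 features
gamma, beta = torch.ones(8), torch.zeros(8)

# Step 1: normalize -- subtract mean, divide by standard deviation
mean = x.mean(dim=0)
std = x.std(dim=0, unbiased=False)
x_hat = (x - mean) / (std + 1e-5)      # epsilon avoids division by zero

# Step 2: scale and shift -- multiply by gamma, add beta
out = gamma * x_hat + beta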
Batch Normalization

Supported in PyTorch

Many other benefits:
- allows a much larger learning rate
- reduces overfitting
- speeds up convergence of training
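
A minimal sketch of that PyTorch support: nn.BatchNorm2d placed between a convolution and its ReLU activation (channel counts are illustrative):

import torch
import torch.nn as nn

block = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3),
    nn.BatchNorm2d(8),   # normalizes, then applies learned scale and shift
    nn.ReLU(),
)
out = block(torch.randn(16, 1, 28, 28))
print(out.shape)  # torch.Size([16, 8, 26, 26])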
Choice of Activation Function
A Neural Network

Once a neural network is trained, all edges have weights which help it make predictions
Operation of a Single Neuron

[Diagram: inputs X1 ... Xn, weighted by W1 ... Wn with bias b, pass through an affine transformation (Wx + b) and then an activation function (max(Wx + b, 0))]

Each neuron only applies two simple functions to its inputs


Operation of a Single Neuron

[Diagram: the same neuron]

The affine transformation alone can only learn linear relationships between the inputs and the output
Operation of a Single Neuron

[Diagram: the same neuron]

The affine transformation is just a weighted sum with a bias added: W1x1 + W2x2 + … + Wnxn + b

The weights and biases of individual neurons are determined during the training process
Operation of a Single Neuron

[Diagram: the same neuron]

The combination of the affine transformation and the activation function can learn any arbitrary relationship
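
A minimal sketch (assuming PyTorch; the weights and inputs are made-up numbers) of those two functions:

import torch

W = torch.tensor([0.5, -0.3, 0.8])   # weights W1 ... Wn
x = torch.tensor([1.0, 2.0, 3.0])    # inputs x1 ... xn
b = 0.1                              # bias

affine = torch.dot(W, x) + b         # the weighted sum with a bias added
output = torch.relu(affine)          # the activation: max(Wx + b, 0)
print(affine.item(), output.item())  # approximately 2.4 and 2.4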
Activation Function

ReLU, logit, tanh, step

Various choices of activation functions exist and drive the design of your neural network
Importance of Activation

The choice of activation function is crucial in determining performance
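
A minimal sketch (assuming PyTorch) evaluating the activation choices listed above on the same inputs:

import torch

z = torch.tensor([-2.0, -0.5, 0.0, 0.5, 2.0])

print(torch.relu(z))      # ReLU: max(z, 0)
print(torch.sigmoid(z))   # logit/sigmoid: squashes values into (0, 1)
print(torch.tanh(z))      # tanh: squashes values into (-1, 1)
print((z > 0).float())    # step: outputs 0 or 1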
Feature Map Size Calculations
Without Zero Padding

Input matrix (6 x 6):
0    0    0    0    0    0
0.2  0.8  0    0.3  0.6  0
0.2  0.9  0    0.3  0.8  0
0.3  0.8  0.7  0.8  0.9  0
0    0    0    0.2  0.8  0
0    0    0    0.2  0.2  0

Convolution matrix (3 x 3 kernel):
1 0 1
0 1 0
1 0 1

Result: a 4 x 4 feature map, narrower than the 6 x 6 input
Zero Padding

The same 6 x 6 input matrix, padded with two rows and columns of zeros on every side, becomes 10 x 10.

Convolution matrix (3 x 3 kernel):
1 0 1
0 1 0
1 0 1

Result: an 8 x 8 feature map, wider than the original 6 x 6 input
O = (W - K + 2P) / S + 1

Formula for dimension calculations
Handy in getting the dimensions of CNN layers right
O = (W - K + 2P) / S + 1

O = output dimension: height/width of the output
W = input dimension: height/width of the input image
K = kernel size: height/width of the kernel
P = padding (if any): may be zero
S = stride: how far the kernel advances in each step
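
A minimal sketch of this formula as a helper function (the function name is illustrative), using floor division as convolution layers do:

def output_dim(w, k, p=0, s=1):
    """Height/width of a feature map for input w, kernel k, padding p, stride s."""
    return (w - k + 2 * p) // s + 1

print(output_dim(6, 3))         # 4  -- no padding (narrow convolution)
print(output_dim(6, 3, p=2))    # 8  -- padding of 2 (wide convolution)
print(output_dim(6, 3, p=3))    # 10 -- padding of 3
print(output_dim(6, 3, s=2))    # 2  -- stride of 2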
Stride Size

[Diagram: the 3 x 3 kernel slides over the 6 x 6 input matrix, advancing one column at a time (horizontal stride of 1) and one row at a time (vertical stride of 1)]
O = (W - K + 2P) / S + 1

Formula for dimension calculations
Handy in getting the dimensions of CNN layers right
Demo

- Image classification using convolutional neural networks (CNNs)
- Hyperparameter tuning
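
A minimal sketch (assuming PyTorch; layer sizes are illustrative, not the demo's exact values) of the kind of model the demo builds: alternating convolution and pooling, batch normalization before the activation, then fully connected layers emitting class probabilities:

import torch
import torch.nn as nn

class SimpleCNN(nn.Module):
    def __init__(self, num_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),
            nn.BatchNorm2d(16),
            nn.ReLU(),
            nn.MaxPool2d(2),                     # 28x28 -> 14x14
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.BatchNorm2d(32),
            nn.ReLU(),
            nn.MaxPool2d(2),                     # 14x14 -> 7x7
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(32 * 7 * 7, 64),
            nn.ReLU(),
            nn.Linear(64, num_classes),          # logits for P(Y = 0) ... P(Y = 9)
        )

    def forward(self, x):
        return self.classifier(self.features(x))

model = SimpleCNN()
logits = model(torch.randn(8, 1, 28, 28))
probs = torch.softmax(logits, dim=1)             # probabilities per class
print(probs.shape)                               # torch.Size([8, 10])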
Narrow and wide convolution
Zero-padding and the feature map sizes
Summary of convolutional layers
Calculating feature map dimensions
Batch normalization of input images
Building and training a CNN for image classification
Changing model hyperparameters
