
Fundamentals of Deep Learning

Pontificia Universidad Católica del Perú


Summer Camp en IA
2025

Notes adapted from Dr. César Beltrán (PUCP) and Dr. Ivan Serina (UNIBS)
Review
Scalars

● A scalar is a single number

● Integers, real numbers, rational numbers, etc.


Vectors

● A vector is a 1-D array of numbers:

● Can be real, binary, integer, etc.


● Example notation for type and size:
Matrices

● Multiplications (matrix and vector)


Matrix (Dot) Product
Tensors

A tensor is an array of numbers that may have

● zero dimensions, and be a scalar

● one dimension, and be a vector

● two dimensions, and be a matrix

● or more dimensions.
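
As a quick illustration (not on the original slide), NumPy arrays of increasing rank mirror these cases; the values are arbitrary:

import numpy as np

scalar = np.array(3.5)                  # 0-D tensor: a scalar
vector = np.array([1.0, 2.0, 3.0])      # 1-D tensor: a vector
matrix = np.array([[1, 2], [3, 4]])     # 2-D tensor: a matrix
tensor3 = np.zeros((2, 3, 4))           # 3-D tensor
print(scalar.ndim, vector.ndim, matrix.ndim, tensor3.ndim)  # 0 1 2 3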
Review Scalar Derivative
Gradients
Chain Rule
Gradient Descent
Approximate Optimization
History Review
Mark I Perceptron
Frank Rosenblatt ~1958
Mark I Perceptron
The first page of Rosenblatt's
article, “The Design of an
Intelligent Automaton,” in
Research Trends, a Cornell
Aeronautical Laboratory
publication, Summer 1958.

An image of the perceptron from Rosenblatt's "The Design of an Intelligent Automaton," Summer 1958.

Rosenblatt and the perceptron. Images courtesy of Cornell Chronicle (2019).
Perceptron
Perceptron training rule
Adaline/Madaline
Widrow and Hoff ~1960

Adaptive Linear Neuron (Adaline)

https://www.youtube.com/watch?v=IEFRtz68m-8
Neocognitron: a self-organizing neural network
model for a mechanism of pattern recognition
unaffected by shift in position.
Fukushima K. 1980
https://www.youtube.com/watch?v=Qil4kmvm2Sw
Learning representations by back-
propagating errors
Rumelhart et al., 1986
Sigmoid unit
Cost Function
Gradient Descent

Each weight is adjusted by a small amount in the direction (positive or negative) opposite to the gradient, i.e. the direction that most reduces the error E.
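
A minimal sketch of this update rule (illustrative only; the error function E and the learning rate below are made up for the example):

# Minimize E(w) = (w - 3)^2 by repeatedly stepping against the gradient.
w = 0.0
learning_rate = 0.1
for _ in range(100):
    grad = 2 * (w - 3)                # dE/dw at the current w
    w = w - learning_rate * grad      # small step in the direction that reduces E
print(w)  # converges to ~3.0, the minimizer of E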
Backpropagation Algorithm
Gradient-based learning applied to document
recognition
Y. LeCun et al., 1998
Reducing the
Dimensionality of
Data with Neural
Networks
Hinton and Salakhutdinov 2006
Imagenet classification with deep convolutional
neural networks
Alex Krizhevsky, Ilya Sutskever, Geoffrey E Hinton, 2012
Classification

[Krizhevsky 2012]

Detection Segmentation

[Faster R-CNN: Ren, He, Girshick, Sun 2015] [Farabet et al., 2012]

Convolutional Neural
Networks
CNN
● The main task of a CNN architecture is feature extraction, performed through 2D or 3D convolution operations.

● A simple CNN framework involves four layer types: convolutional, activation, pooling, and fully connected.

LeNet
What is a convolution?

1 0 1
0 1 0
1 0 1

Kernel
Convolution Layer

[Figure: a 32x32x3 input image (32 width, 32 height, 3 depth)]

http://setosa.io/ev/image-kernels/
Convolution Layer

[Figure: convolving a 32x32x3 image with a 5x5x3 filter produces a 28x28x1 activation map]
Convolution Layer

[Figure: the 5x5x3 filter slides over the 32x32x3 image; each dot product gives one entry of the 28x28x1 activation map]
Convolution Layer: a second filter

[Figure: a second 5x5x3 filter convolved with the same 32x32x3 image produces a second 28x28x1 activation map]
Convolution Layer

[Figure: stacking the activation maps produced by all the filters]

If we have 6 filters, the result has shape 28x28x6.

Convolution Layer

[Figure: 32x32x3 input convolved into a 28x28x6 output]

● Kernel size = 5
● # kernels = 6
● padding = 0
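
A small shape check of this configuration (a sketch using Keras; the zero-filled input is just a stand-in for a real image):

import numpy as np
from tensorflow.keras.layers import Conv2D

x = np.zeros((1, 32, 32, 3), dtype="float32")              # batch of one 32x32x3 image
y = Conv2D(filters=6, kernel_size=5, padding="valid")(x)   # 6 kernels of size 5, padding 0
print(y.shape)  # (1, 28, 28, 6)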
7x7 input, 3x3 filter

[Figure: sliding a 3x3 filter over a 7x7 input at stride 1 => 5x5 output]
Padding

[Figure: 7x7 input, 3x3 filter, padding 1 (a border of zeros around the input) => 7x7 output!]

https://ezyang.github.io/convolution-visualizer/index.html
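
The same kind of shape check applies to padding (a sketch; the input is a dummy 7x7 single-channel image):

import numpy as np
from tensorflow.keras.layers import Conv2D

x = np.zeros((1, 7, 7, 1), dtype="float32")
print(Conv2D(1, 3, padding="same")(x).shape)   # (1, 7, 7, 1): zero padding keeps the 7x7 size
print(Conv2D(1, 3, padding="valid")(x).shape)  # (1, 5, 5, 1): no padding shrinks it to 5x5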
Pooling layer

Max Pooling

Single depth slice (x across, y down):

1 1 2 4
5 6 7 8
3 2 1 0
1 2 3 4

max pool with 2x2 filters and stride 2:

6 8
3 4
Avg Pooling

Single depth slice (x across, y down):

1 1 2 4
5 6 7 8
3 2 1 0
1 2 3 4

avg pool with 2x2 filters and stride 2:

3.25 5.25
2 2
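
Both results can be reproduced with Keras pooling layers on the same 4x4 slice (a sketch, reshaped to the (batch, height, width, channels) layout Keras expects):

import numpy as np
from tensorflow.keras.layers import MaxPool2D, AveragePooling2D

x = np.array([[1, 1, 2, 4],
              [5, 6, 7, 8],
              [3, 2, 1, 0],
              [1, 2, 3, 4]], dtype="float32").reshape(1, 4, 4, 1)

print(MaxPool2D(pool_size=2, strides=2)(x).numpy()[0, :, :, 0])         # [[6. 8.] [3. 4.]]
print(AveragePooling2D(pool_size=2, strides=2)(x).numpy()[0, :, :, 0])  # [[3.25 5.25] [2. 2.]]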
Activation Function
Fully Connected Layer

[Figure: the 32x32x3 input (3072 values) is stretched into a single 3072-dimensional vector and connected to every output neuron]
Fully Connected Layer
Keras code
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Conv2D, MaxPool2D, Flatten

model = Sequential([
    Conv2D(16, 3, activation='relu', input_shape=(28, 28, 1)),
    MaxPool2D(),
    Conv2D(32, 3, activation='relu'),
    MaxPool2D(),
    Flatten(),
    Dense(10, activation='softmax')
])
Well-known architectures
LeNet-5
[LeCun et al., 1998]

Conv filters were 5x5, applied at stride 1.
Subsampling (pooling) layers were 2x2, applied at stride 2.
i.e. the architecture is [CONV-POOL-CONV-POOL-CONV-FC]
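
A rough Keras sketch of this CONV-POOL-CONV-POOL-CONV-FC pattern (the filter counts 6/16/120/84 follow the 1998 paper; the activations and pooling type here are simplifying assumptions):

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, AveragePooling2D, Flatten, Dense

lenet = Sequential([
    Conv2D(6, 5, strides=1, activation='tanh', input_shape=(32, 32, 1)),
    AveragePooling2D(2, strides=2),
    Conv2D(16, 5, strides=1, activation='tanh'),
    AveragePooling2D(2, strides=2),
    Conv2D(120, 5, strides=1, activation='tanh'),
    Flatten(),
    Dense(84, activation='tanh'),
    Dense(10, activation='softmax')
])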
AlexNet
[Krizhevsky et al. 2012]

Full (simplified) AlexNet architecture:
[227x227x3] INPUT
[55x55x96] CONV1: 96 11x11 filters at stride 4, pad 0
[27x27x96] MAX POOL1: 3x3 filters at stride 2
[27x27x96] NORM1: Normalization layer
[27x27x256] CONV2: 256 5x5 filters at stride 1, pad 2
[13x13x256] MAX POOL2: 3x3 filters at stride 2
[13x13x256] NORM2: Normalization layer
[13x13x384] CONV3: 384 3x3 filters at stride 1, pad 1
[13x13x384] CONV4: 384 3x3 filters at stride 1, pad 1
[13x13x256] CONV5: 256 3x3 filters at stride 1, pad 1
[6x6x256] MAX POOL3: 3x3 filters at stride 2
[4096] FC6: 4096 neurons
[4096] FC7: 4096 neurons
[1000] FC8: 1000 neurons (class scores)

Details/Retrospectives:
- first use of ReLU
- used Norm layers (not common anymore)
- heavy data augmentation
- dropout 0.5
- batch size 128
- SGD Momentum 0.9
- Learning rate 1e-2, reduced by 10 manually when val accuracy plateaus
- L2 weight decay 5e-4
- 7 CNN ensemble: 18.2% -> 15.4%
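
The layer list above can be transcribed almost directly into Keras (a simplified sketch: the normalization layers are omitted and the dropout placement is an assumption):

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPool2D, Flatten, Dense, Dropout

alexnet = Sequential([
    Conv2D(96, 11, strides=4, activation='relu', input_shape=(227, 227, 3)),  # CONV1
    MaxPool2D(3, strides=2),                                                  # MAX POOL1
    Conv2D(256, 5, padding='same', activation='relu'),                        # CONV2 (pad 2)
    MaxPool2D(3, strides=2),                                                  # MAX POOL2
    Conv2D(384, 3, padding='same', activation='relu'),                        # CONV3
    Conv2D(384, 3, padding='same', activation='relu'),                        # CONV4
    Conv2D(256, 3, padding='same', activation='relu'),                        # CONV5
    MaxPool2D(3, strides=2),                                                  # MAX POOL3
    Flatten(),
    Dense(4096, activation='relu'), Dropout(0.5),                             # FC6
    Dense(4096, activation='relu'), Dropout(0.5),                             # FC7
    Dense(1000, activation='softmax')                                         # FC8
])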
VGGNet
[Simonyan and Zisserman, 2014]

Only 3x3 CONV stride 1, pad 1


and 2x2 MAX POOL stride 2

best model
7.3% top 5 error
GoogLeNet

[Szegedy et al., 2014]

Inception module

ILSVRC 2014 winner (6.7% top 5 error)


Inception module (Keras code)

from tensorflow.keras.layers import Conv2D, MaxPool2D, concatenate

# 1x1 convolution tower
tower_1 = Conv2D(64, 1, padding='same', activation='relu')(input_img)

# 1x1 convolution followed by a 3x3 convolution
tower_2 = Conv2D(64, 1, padding='same', activation='relu')(input_img)
tower_2 = Conv2D(64, 3, padding='same', activation='relu')(tower_2)

# 1x1 convolution followed by a 5x5 convolution
tower_3 = Conv2D(64, 1, padding='same', activation='relu')(input_img)
tower_3 = Conv2D(64, 5, padding='same', activation='relu')(tower_3)

# 3x3 max pooling followed by a 1x1 convolution
tower_4 = MaxPool2D(3, strides=(1, 1), padding='same')(input_img)
tower_4 = Conv2D(64, 1, padding='same', activation='relu')(tower_4)

# concatenate the four towers along the channel axis
output = concatenate([tower_1, tower_2, tower_3, tower_4], axis=3)


GoogLeNet
ResNet [He et al., 2015] ILSVRC 2015 winner (3.6% top 5 error)

[Figure: the 224x224x3 input is reduced to a spatial dimension of only 56x56 early in the network]
ResNet [He et al., 2015]
- Batch Normalization after every CONV layer
- Xavier/2 initialization from He et al.
- SGD + Momentum (0.9)
- Learning rate: 0.1, divided by 10 when validation error plateaus
- Mini-batch size 256
- Weight decay of 1e-5
- No dropout used

ResNet [He et al., 2015]
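
The core idea is the shortcut connection; a minimal sketch of one basic residual block follows (the 56x56x64 shape and filter counts are illustrative assumptions, not taken from the slides):

from tensorflow.keras import Input, Model
from tensorflow.keras.layers import Conv2D, BatchNormalization, ReLU, Add

x_in = Input(shape=(56, 56, 64))
x = Conv2D(64, 3, padding='same')(x_in)
x = BatchNormalization()(x)
x = ReLU()(x)
x = Conv2D(64, 3, padding='same')(x)
x = BatchNormalization()(x)
x = Add()([x, x_in])   # identity shortcut: add the block input back in
x = ReLU()(x)
block = Model(x_in, x)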
YOLO [Redmon et al., 2016]
SqueezeNet
[Iandola et al., 2017]
Thank You!

Susan Palacios Salcedo


PhD Candidate,
PUCP
[email protected]
