Deep Learning Computer Vision NLP
Deep Learning for Computer Vision
Thang Nguyen
Viet Nguyen
youtube.com/@vietnh10091
Contents
01 Overview of Deep Learning
04 Image Classification
3
Tổng quan về Deep Learning
4
Deep Learning for Computer Vision
5
Deep Learning for Computer Vision
6
Deep Learning for Computer Vision
7
Neural Network vs Deep Learning
8
Image Classification
9
Dataset
10
Linear Classifier - score function
Matrix multiplication
Element-wise multiplication
11
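A minimal sketch of the linear score function s = Wx + b, assuming a 10-class classifier on flattened 28x28 images (sizes are illustrative, not from the slide), contrasting matrix multiplication with element-wise multiplication:

import torch

x = torch.rand(784)        # a flattened 28x28 image
W = torch.rand(10, 784)    # one row of weights per class
b = torch.rand(10)         # one bias per class

scores = W @ x + b         # matrix multiplication: a vector of 10 class scores
elementwise = W * x        # element-wise multiplication: shape (10, 784), not class scores
print(scores.shape, elementwise.shape)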
Linear Classifier explanation
MNIST dataset
12
Linear Classifier - Weight visualization
13
Linear Classifier - Weight visualization
14
Linear Classifier - Weight visualization
15
Linear Classifier - Weight visualization
CIFAR dataset
16
Biological Neuron vs Artificial Neuron
18
Activation function
19
Neural Network Architecture
20
Neural Network
Forward propagation
Feedforward neural network
Forward pass Demo
21
Complete example of Neural Network
Demo
22
Deep Neural Network
Demo
23
Loss function
24
Loss function
Regression vs. Classification
Huber Loss
25
Loss function
Mean Absolute Error Loss
26
Loss function
Mean Squared Error Loss
27
Loss function
Cross-Entropy Loss
28
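A minimal sketch of the losses named on the slides above, using PyTorch's built-in modules (tensor values are made up for illustration):

import torch
import torch.nn as nn

pred = torch.tensor([2.5, 0.0, 2.0])        # regression predictions (illustrative)
target = torch.tensor([3.0, -0.5, 2.0])

mae = nn.L1Loss()(pred, target)             # Mean Absolute Error loss
mse = nn.MSELoss()(pred, target)            # Mean Squared Error loss
huber = nn.HuberLoss()(pred, target)        # Huber loss: quadratic near zero, linear for large errors

logits = torch.tensor([[1.2, 0.3, -0.8]])   # raw class scores for one sample
label = torch.tensor([0])                   # ground-truth class index
ce = nn.CrossEntropyLoss()(logits, label)   # Cross-Entropy loss for classification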
Gradient
29
Optimization function
Gradient Descent
30
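A minimal sketch of one gradient-descent update w := w - lr * dL/dw, here on a single parameter of y = w*x fit with squared error (all values are illustrative):

import torch

w = torch.tensor(2.0, requires_grad=True)    # the parameter to learn
x, y_true = torch.tensor(3.0), torch.tensor(9.0)
lr = 0.01                                    # learning rate

loss = (w * x - y_true) ** 2                 # squared error
loss.backward()                              # compute dloss/dw
with torch.no_grad():
    w -= lr * w.grad                         # one gradient-descent step
    w.grad.zero_()                           # clear the gradient for the next step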
Optimization function
Gradient Descent in Linear Classifier vs Neural network
31
Gradient Descent: Different versions
Batch Gradient Descent
Stochastic Gradient Descent
(Stochastic) Mini-batch Gradient Descent
32
How are parameters updated?
33
Challenge with Gradient Descent: Loss function
34
Challenge with Gradient Descent: Local minima
35
Challenge with Gradient Descent: Saddle points
36
Challenge with Gradient Descent: Ravines
37
Momentum
38
Momentum
39
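A minimal sketch of the momentum idea: a velocity term accumulates past gradients, and in PyTorch it is enabled with a single argument (the 0.9 value is a common choice, not from the slides):

import torch

# Classic momentum update (illustrative):
#   v = mu * v - lr * grad
#   w = w + v
# torch.optim.SGD uses a slightly different but equivalent-in-effect formulation:
model = torch.nn.Linear(4, 2)                                   # placeholder model
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)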
Adaptive Gradient Descent (1)
40
Adaptive Gradient Descent (2)
41
Adaptive Gradient Descent (3)
42
Adaptive Gradient Descent (4)
43
Adaptive Gradient Descent (4)
44
Root Mean Square Propagation
45
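A minimal sketch of RMSProp in PyTorch (hyperparameter values are common defaults, not from the slides):

import torch

model = torch.nn.Linear(4, 2)   # placeholder model
# RMSProp divides the learning rate by a running average of squared gradients,
# so directions with large or oscillating gradients get smaller effective steps.
optimizer = torch.optim.RMSprop(model.parameters(), lr=0.001, alpha=0.99)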
BackPropagation
46
BackPropagation
47
Backpropagation
48
BackPropagation - example
forward
backward
49
BackPropagation - example
forward
backward
50
BackPropagation - example
forward
backward
51
BackPropagation - example
forward
backward
52
BackPropagation - example
forward
backward
53
BackPropagation - example
forward
backward
54
BackPropagation - example
forward
backward
55
BackPropagation - explanation
56
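A minimal sketch of the forward/backward idea above using autograd on a tiny expression (values are illustrative):

import torch

x = torch.tensor(2.0, requires_grad=True)
y = torch.tensor(-3.0, requires_grad=True)

z = (x * y + 1.0) ** 2   # forward pass builds the computation graph
z.backward()             # backward pass applies the chain rule node by node

print(x.grad)            # dz/dx = 2*(x*y + 1)*y = 2*(-5)*(-3) = 30
print(y.grad)            # dz/dy = 2*(x*y + 1)*x = 2*(-5)*2  = -20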
Neural network’s learning process
Demo
57
Parameters vs Hyperparameters
58
Neural Network
59
How a grayscale image is represented in Computer Vision
60
How a color image is represented in Computer Vision
61
Data preprocessing
62
Train a (Convolutional) Neural Network in Pytorch
Step 1: Setup a dataset - build a dataset from a collection of images
Step 2: Define a model - define the model architecture
Step 3: Define a loss function - choose or implement one or more loss functions
Step 4: Define an optimizer - choose an optimizer
Step 5: Train - train the model on the training data
Step 6: Validate - evaluate during training on the validation data
Step 7: Test - evaluate the final model on the test data
(a minimal end-to-end code sketch of these seven steps follows below)
63
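A minimal end-to-end sketch of the seven steps above; the dataset (CIFAR10), the tiny model, and all hyperparameters are illustrative assumptions, not values from the slides:

import torch
import torch.nn as nn
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

# Step 1: setup a dataset (CIFAR10 is used here only as an example)
transform = transforms.ToTensor()
train_set = datasets.CIFAR10(root="data", train=True, download=True, transform=transform)
test_set = datasets.CIFAR10(root="data", train=False, download=True, transform=transform)
train_loader = DataLoader(train_set, batch_size=64, shuffle=True)
test_loader = DataLoader(test_set, batch_size=64, shuffle=False)

# Step 2: define a model (a deliberately tiny CNN)
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.BatchNorm2d(16), nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Linear(16 * 16 * 16, 10),
)

# Step 3: define a loss function; Step 4: define an optimizer
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

# Step 5: train; Step 6: validate (here the test split doubles as validation for brevity)
for epoch in range(5):
    model.train()
    for images, labels in train_loader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()

    # Step 7: test / final evaluation
    model.eval()
    correct = 0
    with torch.no_grad():
        for images, labels in test_loader:
            correct += (model(images).argmax(dim=1) == labels).sum().item()
    print(f"epoch {epoch}: accuracy {correct / len(test_set):.3f}")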
Pytorch: tensor
64
Pytorch: tensor
65
Pytorch: tensor
66
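A minimal sketch of creating and manipulating tensors (values are illustrative):

import torch

a = torch.tensor([[1., 2.], [3., 4.]])   # from a Python list
b = torch.zeros(2, 2)                    # filled with zeros
c = torch.rand(2, 2)                     # uniform random values

print(a.shape, a.dtype)                  # torch.Size([2, 2]) torch.float32
print(a + c)                             # element-wise addition
print(a @ c)                             # matrix multiplication
print(a.view(4))                         # reshape to a 1-D tensor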
Pytorch: Image to tensor
67
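A minimal sketch of turning an image file into a tensor with torchvision (the file path is a placeholder):

from PIL import Image
from torchvision import transforms

image = Image.open("some_image.jpg")   # placeholder path
to_tensor = transforms.ToTensor()      # HxWxC uint8 in [0, 255] -> CxHxW float in [0.0, 1.0]
tensor = to_tensor(image)
print(tensor.shape, tensor.min(), tensor.max())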
Pytorch: Datasets
Built-in datasets (e.g. the CIFAR dataset)
68
Pytorch: Datasets
ImageFolder (e.g. a Butterfly dataset)
69
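A minimal sketch of both routes above: a built-in dataset (CIFAR10) and a folder-per-class dataset via ImageFolder (the butterfly folder path is a placeholder):

from torchvision import datasets, transforms

transform = transforms.ToTensor()

# Built-in dataset: downloaded automatically on first use
cifar = datasets.CIFAR10(root="data", train=True, download=True, transform=transform)

# ImageFolder: expects root/<class_name>/<image files>
butterflies = datasets.ImageFolder(root="butterfly_dataset/train", transform=transform)
print(len(cifar), len(butterflies), butterflies.classes)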
Pytorch: Built-in datasets
All of Pytorch's built-in datasets
70
Pytorch: Datasets
71
Step 1: Setup a dataset
72
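A minimal sketch of Step 1: wrapping a dataset in a DataLoader so training sees shuffled mini-batches (CIFAR10 and the batch size are illustrative choices):

from torch.utils.data import DataLoader
from torchvision import datasets, transforms

transform = transforms.Compose([transforms.ToTensor()])
train_set = datasets.CIFAR10(root="data", train=True, download=True, transform=transform)
train_loader = DataLoader(train_set, batch_size=64, shuffle=True, num_workers=2)

images, labels = next(iter(train_loader))
print(images.shape, labels.shape)   # torch.Size([64, 3, 32, 32]) torch.Size([64])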
Step 2: Define a Neural Network
73
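A minimal sketch of Step 2: a small fully connected network written as an nn.Module subclass (the layer sizes assume 32x32x3 inputs and 10 classes, which is an assumption, not from the slide):

import torch.nn as nn

class SimpleNet(nn.Module):
    def __init__(self, num_classes=10):
        super().__init__()
        self.flatten = nn.Flatten()
        self.fc1 = nn.Linear(3 * 32 * 32, 256)
        self.fc2 = nn.Linear(256, num_classes)
        self.relu = nn.ReLU()

    def forward(self, x):
        x = self.flatten(x)          # (N, 3, 32, 32) -> (N, 3072)
        x = self.relu(self.fc1(x))
        return self.fc2(x)           # raw class scores (logits)

model = SimpleNet()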
Step 3: Define a loss function and an optimizer
74
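A minimal sketch of Step 3, continuing the Step 2 sketch above: pair a classification loss with an optimizer over the model's parameters (the learning rate is a common default, not from the slide):

import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()                 # suits multi-class classification
optimizer = torch.optim.SGD(model.parameters(),   # 'model' comes from the Step 2 sketch
                            lr=0.01, momentum=0.9)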
Step 4: Train the network
75
Step 4: Train the network
76
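A minimal sketch of Step 4, continuing the Step 1-3 sketches above: the canonical PyTorch training loop (zero the gradients, forward, loss, backward, step); the epoch count is illustrative:

num_epochs = 10
for epoch in range(num_epochs):
    model.train()
    running_loss = 0.0
    for images, labels in train_loader:    # from the Step 1 sketch
        optimizer.zero_grad()              # clear old gradients
        outputs = model(images)            # forward pass
        loss = criterion(outputs, labels)  # compute the loss
        loss.backward()                    # backward pass (compute gradients)
        optimizer.step()                   # update the parameters
        running_loss += loss.item()
    print(f"epoch {epoch}: loss {running_loss / len(train_loader):.4f}")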
Convolutional Neural Network
77
Problems in Neural Network
Variations:
● Position
● Orientation
● Scale
Scalability:
● Memory
● Calculation
78
Convolutional Neural Network
79
Convolutional Neural Network
80
Convolutional operation
81
Convolutional operation
82
Convolutional operation on Volume
83
Convolutional operation on Volume
84
Convolutional operation with multiple Filters/Kernels
85
1x1 Convolutional operation
86
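A minimal sketch of convolution on a volume with multiple filters (each filter spans all input channels and produces one output channel), plus a 1x1 convolution that only mixes channels; all sizes are illustrative:

import torch
import torch.nn as nn

x = torch.rand(1, 3, 32, 32)                       # batch of 1 RGB image, 32x32
conv = nn.Conv2d(in_channels=3, out_channels=8,    # 8 filters, each 3x3x3
                 kernel_size=3, stride=1, padding=1)
y = conv(x)
print(y.shape)                                     # torch.Size([1, 8, 32, 32])

one_by_one = nn.Conv2d(8, 4, kernel_size=1)        # 1x1 conv: changes channel count only
print(one_by_one(y).shape)                         # torch.Size([1, 4, 32, 32])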
Convolutional layer
87
Convolutional layer’s advantage
Local Connectivity
Parameter Sharing
88
Convolutional layer’s advantage
89
Pooling layer
90
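A minimal sketch of max pooling halving the spatial size of a feature map (sizes are illustrative):

import torch
import torch.nn as nn

feature_map = torch.rand(1, 8, 32, 32)
pool = nn.MaxPool2d(kernel_size=2, stride=2)   # keep the max of each 2x2 window
print(pool(feature_map).shape)                 # torch.Size([1, 8, 16, 16])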
Internal Covariate Shift
91
Batch Normalization layer
Batch Normalization
92
Why Normalization helps?
93
Batch Normalization: Training phase
94
Batch Normalization: Inference phase
95
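A minimal sketch of BatchNorm behaviour in the two phases above: batch statistics during training, stored running statistics at inference (sizes are illustrative):

import torch
import torch.nn as nn

bn = nn.BatchNorm2d(num_features=8)   # one mean/var (and gamma/beta) per channel
x = torch.rand(16, 8, 32, 32)

bn.train()                            # training phase: normalize with the batch statistics
y_train = bn(x)                       # and update the running mean/var

bn.eval()                             # inference phase: normalize with the stored
y_eval = bn(x)                        # running statistics instead of the batch's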
Normalization layer
96
Dropout layer
97
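A minimal sketch of dropout: active during training (random units zeroed), disabled at evaluation; the 0.5 rate is a common choice, not from the slide:

import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)   # zero each element with probability 0.5
x = torch.ones(1, 10)

drop.train()
print(drop(x))             # roughly half the values zeroed, survivors scaled by 1/(1-p)

drop.eval()
print(drop(x))             # identity at inference time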
Common CNN’s layer pattern (part 1)
The most common CNN layout: the layers are arranged in the following order (see the sketch after this list):
1. A few blocks of Conv->(BatchNorm)->ReLU
2. Max-pooling
3. Repeat steps 1 and 2 until the feature map is small enough
4. Flatten the feature map
5. One (or a few) fully connected layers
98
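A minimal sketch of the pattern above: repeated Conv->BatchNorm->ReLU blocks with max pooling, then flatten and fully connected layers (channel counts and the 3x32x32 input size are illustrative assumptions):

import torch.nn as nn

model = nn.Sequential(
    # block 1: Conv -> BatchNorm -> ReLU, then max-pooling
    nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.BatchNorm2d(32), nn.ReLU(),
    nn.MaxPool2d(2),
    # block 2: repeat until the feature map is small enough
    nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.BatchNorm2d(64), nn.ReLU(),
    nn.MaxPool2d(2),
    # flatten the feature map and finish with fully connected layers
    nn.Flatten(),
    nn.Linear(64 * 8 * 8, 128), nn.ReLU(),
    nn.Linear(128, 10),
)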
Receptive Field
99
Receptive Field
100
Receptive Field
How to increase the Receptive Field:
01 Add more Conv layers (make the network deeper)
101
Common CNN’s layer pattern (part 2)
Several Conv layers with small filters/kernels work better than one Conv layer with a large filter/kernel:

                                2 3x3-kernel Conv layers    1 5x5-kernel Conv layer
Receptive field of last layer   5x5                         5x5
Number of parameters            2*3*3 (= 18)                5*5 (= 25)
Number of non-linear layers     2                           1
102
Effective Receptive Field
103
Use Pooling layer
104
Dilated/Atrous Convolutions
105
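A minimal sketch of a dilated convolution: the kernel stays 3x3 but its taps are spread out, enlarging the receptive field without extra parameters (sizes are illustrative):

import torch
import torch.nn as nn

x = torch.rand(1, 8, 32, 32)
dilated = nn.Conv2d(8, 8, kernel_size=3, dilation=2, padding=2)  # covers a 5x5 area
print(dilated(x).shape)                                          # torch.Size([1, 8, 32, 32])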
Common CNN’s layer pattern (part 3)
How should you design a CNN architecture?
Rule 1: Don't be a hero. Use whatever works best on ImageNet.
106
Common CNN’s layer pattern (part 3)
How should you design a CNN architecture?
Rule 2: If rule 1 does not work, consider the following questions:
● How many layers should the model have?
● What parameters should each layer have?
○ How many channels should each convolutional layer have? What kernel size?
○ Which activation function should be used?
○ What kernel size should pooling use?
○ Where should the feature map be flattened?
○ How many fully connected layers should there be?
○ Should BatchNorm, Dropout, ... be used?
107
Common CNN architectures
108
ImageNet Large Scale Visual Recognition Challenge
109
LeNet (1998)
110
AlexNet (2012)
111
LeNet vs AlexNet
112
VGGNet (2014)
113
VGGNet (2014)
114
Universal Approximation Theorem
115
ResNet (2015)
116
ResNet (2015)
117
Problems in Neural Network: Vanishing gradients
118
Sigmoid vs Vanishing gradients
119
ReLU vs Vanishing gradients
120
Leaky ReLU vs Vanishing gradients
121
Other variants of ReLU
122
SWISH
123
Problems in Neural Network: Exploding gradients
The opposite of vanishing gradients.
Symptoms
● The model's parameters grow very quickly (exponential growth)
● The model's parameters may become NaN during training
● The loss becomes NaN during training
● The loss stays large even after long training
Causes
● Improper initial weights => large weight updates
● Poor initial learning rate
● Poor loss function => large loss values => large weight updates
124
Solutions for Vanishing/Exploding gradients
● Proper activation functions, such as ReLU and its variants
● Batch Normalization
● Regularization
● Proper weight initialization
● Gradient clipping (against exploding gradients)
○ By values
○ By norm
125
Weight initialization
● The performance of a CNN/NN depends on weight initialization
● Benefits of good weight initialization:
○ Reproducible model
○ Faster model convergence
○ Avoids vanishing/exploding gradients
● Common approaches:
○ Zero/Constant/Random initialization
126
Weight initialization: Zero/Constant initialization
● All parameters are initialized to 0 or to a single fixed value
● Every node learns in exactly the same way => fails to break symmetry
● Always leads to poor results
127
Weight initialization: Random initialization
● Solves the symmetry-breaking problem
● Can introduce a new problem: vanishing/exploding gradients
● Random initialization from a distribution:
○ Random Normal
○ Random Uniform
128
Weight initialization: Xavier/Glorot initialization
● Applied to layers (fully connected layers, Conv layers) whose activation function is sigmoid or tanh
● Comes in 2 versions:
129
Weight initialization: He/Kaiming initialization
● Applied to layers (fully connected layers, Conv layers) whose activation function is ReLU or one of its variants
● Comes in 2 versions:
130
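A minimal sketch of applying Xavier/Glorot and He/Kaiming initialization with torch.nn.init (layer sizes are illustrative):

import torch.nn as nn

fc_tanh = nn.Linear(256, 128)                   # layer followed by tanh/sigmoid
nn.init.xavier_uniform_(fc_tanh.weight)         # Xavier/Glorot (uniform version)
nn.init.zeros_(fc_tanh.bias)

conv_relu = nn.Conv2d(3, 32, kernel_size=3)     # layer followed by ReLU
nn.init.kaiming_normal_(conv_relu.weight, nonlinearity="relu")   # He/Kaiming (normal version)
nn.init.zeros_(conv_relu.bias)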
Gradient clipping
131
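A minimal sketch of the two clipping options mentioned earlier (by value and by norm), applied between backward() and the optimizer step; the model, loss, and thresholds are placeholders:

import torch
import torch.nn as nn
import torch.nn.utils as utils

model = nn.Linear(10, 2)                                     # placeholder model
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss = model(torch.rand(4, 10)).sum()                        # placeholder loss

loss.backward()                                              # gradients are computed first
utils.clip_grad_value_(model.parameters(), clip_value=0.5)   # clip each gradient element
utils.clip_grad_norm_(model.parameters(), max_norm=1.0)      # or rescale the global norm
optimizer.step()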
Transfer Learning
132
Transfer Learning: Use CNN as fixed feature extractor
133
Transfer Learning: Fine-tune CNN
134
Transfer Learning: How to choose approach
135
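A minimal sketch of the two approaches above on a torchvision ResNet (assuming a recent torchvision): freeze the backbone to use it as a fixed feature extractor, or leave it trainable to fine-tune; the class count is an illustrative assumption:

import torch.nn as nn
from torchvision import models

# Load a CNN pretrained on ImageNet
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Option 1: fixed feature extractor - freeze every pretrained parameter
for param in model.parameters():
    param.requires_grad = False

# In both options, replace the final layer for the new task (e.g. 10 classes)
model.fc = nn.Linear(model.fc.in_features, 10)

# Option 2: fine-tuning - skip the freezing loop (or unfreeze some layers)
# and train the whole network with a small learning rate.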
Data augmentation
136
Data augmentation: Position manipulation
139
Data augmentation: Example
140
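A minimal sketch of position-manipulation augmentations with torchvision transforms, applied only to the training split (the specific transforms, parameters, and dataset are illustrative):

from torchvision import datasets, transforms

train_transform = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),   # mirror left-right
    transforms.RandomRotation(degrees=10),    # small random rotation
    transforms.RandomCrop(32, padding=4),     # shift by cropping a padded image
    transforms.ToTensor(),
])

train_set = datasets.CIFAR10(root="data", train=True, download=True,
                             transform=train_transform)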