DLCV Ch1 Introduction

This document provides an introduction to deep learning for computer vision. It discusses collecting labeled training data and building models based on feature spaces. It also covers bounding boxes, gradient descent, and using PyTorch to build a linear regression model for localization. Examples are provided on computing gradients for bounding box coordinate updates, as well as calculating precision and recall for object detection tasks.

Uploaded by

Mario Parot

Deep Learning for Computer Vision

Introduction to Deep Learning

Prof. G.S. Jison Hsu 徐繼聖

• Artificial Vision Laboratory
• National Taiwan University of Science and Technology

https://www.youtube.com/watch?v=kE5QZ8G_78c [9:39]
• Collection of training data
• Model built upon features (or more precisely, on the feature space)
• Model-based prediction when new data is given

What is a Bounding Box?

Ground Truth:
• $x_{center}$: 443
• $y_{center}$: 346
• $W$ (width): 167
• $H$ (height): 158

Initial Box:
• $\hat{x}_{center}$: 678
• $\hat{y}_{center}$: 105
• $W$ (width): 167
• $H$ (height): 158
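The two boxes above are given in center format (center point plus width and height). As a minimal sketch, the conversion to corner coordinates might look like the following. The function name `center_to_corners` and the (x_min, y_min, x_max, y_max) ordering are assumptions made for illustration, not something the slides define.

```python
def center_to_corners(x_center, y_center, width, height):
    """Convert a center-format box to (x_min, y_min, x_max, y_max)."""
    half_w = width / 2.0
    half_h = height / 2.0
    return (x_center - half_w, y_center - half_h,
            x_center + half_w, y_center + half_h)


# Boxes from the slide: same width and height, different centers.
ground_truth = center_to_corners(443, 346, 167, 158)
initial_box = center_to_corners(678, 105, 167, 158)
```

Corner coordinates are convenient when the overlap between two boxes has to be measured, since the overlap along each axis is then a simple difference of minima and maxima.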


Confidence Score of Bounding Box
• The confidence is defined as Pr(Class) × IoU(pred, truth). If no object exists in that cell, the confidence score should be zero. Otherwise, we want the confidence score to be as high as possible.
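The confidence definition can be sketched in a few lines. This assumes boxes in the (x_center, y_center, width, height) format used on the bounding box slide; the names `iou_center_format`, `confidence`, and `class_prob` are hypothetical helpers chosen for this example.

```python
def iou_center_format(box_a, box_b):
    """IoU of two boxes given as (x_center, y_center, width, height)."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    # Overlap along each axis, clamped at zero when the boxes are disjoint.
    overlap_w = max(0.0, min(ax + aw / 2, bx + bw / 2) - max(ax - aw / 2, bx - bw / 2))
    overlap_h = max(0.0, min(ay + ah / 2, by + bh / 2) - max(ay - ah / 2, by - bh / 2))
    intersection = overlap_w * overlap_h
    union = aw * ah + bw * bh - intersection
    return intersection / union if union > 0 else 0.0


def confidence(class_prob, pred_box, truth_box):
    """Confidence as defined on the slide: probability times IoU."""
    return class_prob * iou_center_format(pred_box, truth_box)


truth = (443, 346, 167, 158)
initial = (678, 105, 167, 158)
iou_initial = iou_center_format(initial, truth)  # boxes do not overlap
iou_perfect = iou_center_format(truth, truth)    # identical boxes
```

The initial box does not overlap the ground truth at all, so its IoU, and therefore its confidence, is zero regardless of the probability term.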

Initial Parameters
• Iterations = 2
• Learning rate = 0.3
• $P$ = (678, 105), center of the initial box (score: 0)
• $G$ = (443, 346), center of the ground-truth box (score: 1)


Iteration Results

                   x     y     S
Initial position   678   105   0
Iteration 1        537   249   0.15
Iteration 2        481   307   0.85
Ground Truth       443   346   1

S = IoU score for the bounding box


Gradient Descent
• Loss function
$L_x = (P_x - G_x)^2$
$L_y = (P_y - G_y)^2$
• Derivative of the loss function (gradient)
$d_x = 2(P_x - G_x)$
$d_y = 2(P_y - G_y)$
• Update position ($lr$ is the learning rate)
$\hat{x} = x - lr \cdot d_x$
$\hat{y} = y - lr \cdot d_y$
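The three steps above (loss, gradient, update) can be sketched in plain Python. The function name `gradient_descent_step` and the `history` list are assumptions made for illustration; the starting point, target, and learning rate are the values from the Initial Parameters slide.

```python
def gradient_descent_step(position, target, lr):
    """One update of (x, y) under the per-coordinate squared-error loss."""
    new_position = []
    for p, g in zip(position, target):
        grad = 2 * (p - g)              # derivative of (p - g)^2 with respect to p
        new_position.append(p - lr * grad)
    return tuple(new_position)


P = (678.0, 105.0)   # initial box center
G = (443.0, 346.0)   # ground-truth box center
lr = 0.3

history = [P]
for _ in range(2):   # two iterations, as on the slide
    P = gradient_descent_step(P, G, lr)
    history.append(P)
```

With this loss, each step scales the remaining error by 1 − 2·lr = 0.4, which is why the box moves most of the way toward the ground truth in the first iteration and then slows down.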


• Iteration 1
Loss:
$L_{1x} = (678 - 443)^2 = 55225$
$L_{1y} = (105 - 346)^2 = 58081$
Gradient:
$d_{1x} = 2(678 - 443) = 470$
$d_{1y} = 2(105 - 346) = -482$
Update position:
$537 = 678 - 0.3 \times 470$
$249.6 = 105 - 0.3 \times (-482)$

• Iteration 2
Loss:
$L_{2x} = (537 - 443)^2 = 8836$
$L_{2y} = (249.6 - 346)^2 = 9292.96$
Gradient:
$d_{2x} = 2(537 - 443) = 188$
$d_{2y} = 2(249.6 - 346) = -192.8$
Update position:
$480.6 = 537 - 0.3 \times 188$
$307.44 = 249.6 - 0.3 \times (-192.8)$
Update Position

                   x       y        Loss of x   Loss of y   Gradient of x   Gradient of y
Initial position   678     105      55225       58081       470             -482
Iteration 1        537     249.6    8836        9292.96     188             -192.8
Iteration 2        480.6   307.44   1413.76     1486.87     75.2            -77.12
Ground Truth       443     346      0           0           0               0


We use PyTorch to build our first linear regression model. We employ SGD as the optimizer and the MSE loss function to train the model. We can also visualize the training process.


• Set up the base structure of this model in PyTorch

import torch
import torch.nn as nn
import numpy as np
import matplotlib.pyplot as plt
plt.ion()  # interactive mode, so plots can refresh during training

# Toy dataset
x_train = np.array([[3.3], [4.4], [5.5]], dtype=np.float32)
y_train = np.array([[1.7], [2.76], [2.09]], dtype=np.float32)

# Hyper-parameters
num_epochs = 50
learning_rate = 0.001
# Initial parameters of the line y = ax + b
a = -0.5
b = 1


• Initialize the model type and declare the forward pass

# Define Linear regression model

model = nn.Linear(1, 1)
# Initialize parameter
model.weight.data.fill_(a)
model.bias.data.fill_(b)


Use the Mean Squared Error (MSE), which is the most commonly used regression loss function
# Define loss
criterion = nn.MSELoss()

Use the Stochastic Gradient Descent (SGD) optimizer to update the model parameters (weight and bias)
# Define optimizer
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)
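The slides define the model, loss, and optimizer but do not show the training loop itself. The following is one plausible sketch of it, not the course's own code. It assumes the same toy data and initial values as above, builds the tensors directly with `torch.tensor` instead of NumPy arrays, and leaves out the plotting; the `losses` list is an addition for this example.

```python
import torch
import torch.nn as nn

# Same toy data and starting point as the slides (y = ax + b, a = -0.5, b = 1).
x_train = torch.tensor([[3.3], [4.4], [5.5]])
y_train = torch.tensor([[1.7], [2.76], [2.09]])
num_epochs = 50
learning_rate = 0.001

model = nn.Linear(1, 1)
model.weight.data.fill_(-0.5)
model.bias.data.fill_(1.0)
criterion = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)

losses = []  # one MSE value per epoch, e.g. for plotting afterwards
for epoch in range(num_epochs):
    outputs = model(x_train)             # forward pass
    loss = criterion(outputs, y_train)   # MSE between prediction and target
    optimizer.zero_grad()                # clear gradients from the previous step
    loss.backward()                      # backpropagate to get d(loss)/d(weight, bias)
    optimizer.step()                     # gradient descent update of weight and bias
    losses.append(loss.item())
```

Because all three samples are passed in every step, each update here is a full-batch gradient step, the same kind of update as in the bounding box example but applied to the weight and bias.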


Example 1.2
Please use the initial position and the ground truth to compute the gradient, and then complete the table below.

                   x     y
Initial position   678   105
Iteration 1        537   249
Iteration 2        481   307
Iteration 3
Iteration 4
Iteration 5
Ground Truth       443   346
Example 1.2
• Iteration 1
Loss:
$L_{1x} = (678 - 443)^2$
$L_{1y} = (105 - 346)^2$
• Iteration 2
Loss:
$L_{2x} = (537 - 443)^2$
$L_{2y} = (249.6 - 346)^2$
Three dogs in the image.

TP = 2, FP = 2, FN = 1

$\text{Precision} = \frac{TP}{TP + FP} = \frac{2}{2 + 2} = 0.5$

$\text{Recall} = \frac{TP}{TP + FN} = \frac{2}{2 + 1} \approx 0.667$


Example 1.3 Face Detection

TP = 4, FP = 2, FN = 1

$\text{Precision} = \frac{TP}{TP + FP} = \frac{4}{4 + 2} \approx 0.667$

$\text{Recall} = \frac{TP}{TP + FN} = \frac{4}{4 + 1} = 0.8$
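Both worked examples apply the same two formulas, so they can be checked with a small helper. This is a minimal sketch; the function name `precision_recall` and the choice to return 0.0 when a denominator is zero are assumptions for this example, not conventions stated in the slides.

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) from detection counts."""
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    return precision, recall


dog_precision, dog_recall = precision_recall(tp=2, fp=2, fn=1)     # dog example
face_precision, face_recall = precision_recall(tp=4, fp=2, fn=1)   # Example 1.3
```

Precision drops when the detector reports boxes that match no object (FP), while recall drops when real objects are missed (FN), so the two numbers have to be read together.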


Example 1.3 Confusion Matrix
Basic Form

GT\Pred    Class 1   Class 2
Class 1    TP        FN
Class 2    FP        TN

True positive = TP
False positive = FP
True negative = TN
False negative = FN

Example 1.2

GT\Pred    Dog   Others
Dog        2     1
Others     2     0

Example 1.3

GT\Pred    Face   Others
Face       4      1
Others     2      0
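As a sketch of how such a matrix is filled in, the counts can be tallied from paired ground-truth and predicted labels, with rows indexed by ground truth and columns by prediction. The label lists below are hypothetical and arranged only to reproduce the dog matrix; `confusion_counts` is an illustrative helper, not part of the course material.

```python
def confusion_counts(ground_truth, predicted, positive):
    """Tally TP, FN, FP, TN for one positive class from paired labels."""
    counts = {"TP": 0, "FN": 0, "FP": 0, "TN": 0}
    for gt, pred in zip(ground_truth, predicted):
        if gt == positive and pred == positive:
            counts["TP"] += 1      # object present and detected
        elif gt == positive:
            counts["FN"] += 1      # object present but missed
        elif pred == positive:
            counts["FP"] += 1      # detection with no matching object
        else:
            counts["TN"] += 1      # correctly rejected
    return counts


# Hypothetical labels: three dogs (one missed) plus two false detections.
gt_labels = ["dog", "dog", "dog", "others", "others"]
pred_labels = ["dog", "dog", "others", "dog", "dog"]
counts = confusion_counts(gt_labels, pred_labels, positive="dog")
```

The resulting counts match the dog example: two correct detections, one missed dog, and two false detections.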


True positive (TP) = correctly identified
False positive (FP) = incorrectly identified
True negative (TN) = correctly rejected
False negative (FN) = incorrectly rejected

$\text{Precision} = \frac{TP}{TP + FP}$

$\text{Recall} = \frac{TP}{TP + FN}$


https://www.youtube.com/watch?v=prWyZhcktn4&ab_channel=Simplilearn [4:28 – 24:46]

Training and Testing Sets
Training Set
– A set of data that is known to the system and is used to build the classification/regression model.
– For example, in a face recognition neural network, the face images used to train the network.

Face image from Multi-PIE

Training and Testing Sets
Testing Set
– A set of data that the system has not seen during training and is asked to recognize.
– For example, the face images to be recognized by the trained face recognition network.

Face Image from Celeb-HQ
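When a single collection of labeled samples is available, the two sets are often obtained by splitting it so that no sample appears in both. The following is a minimal sketch under that assumption; the helper `train_test_split`, the 20% test fraction, the fixed seed, and the file names are all hypothetical choices for illustration.

```python
import random


def train_test_split(samples, test_fraction=0.2, seed=0):
    """Shuffle a copy of the samples and split it into (train, test)."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)   # fixed seed keeps the split repeatable
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical file names standing in for face images.
image_ids = [f"face_{i:03d}.png" for i in range(10)]
train_set, test_set = train_test_split(image_ids)
```

Keeping the test images out of training is what makes the test result a fair estimate of how the network behaves on faces it has never seen.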

Summary
