CVI Week 2 1 Pre Note

The document discusses feature detection and extraction in computer vision, focusing on traditional visual descriptors like points, patches, edges, and contours, as well as deep learning-based approaches. It highlights the SIFT algorithm and its steps, advantages, and applications, while also contrasting shallow and deep learning architectures. The document emphasizes the importance of learning feature hierarchies through deep learning for improved performance in various domains.


Feature Detection

and Deep Learning


Akila Subasinghe
School of Computer Science
University of Birmingham

[Computer Vision and Imaging]


Feature Detection/Extraction

Deep Learning-based Features

Nvidia to train 100,000 developers on deep learning AI


Outline
▪ Traditional visual feature descriptors
□ Points and patches
□ Edges and contours
□ Lines

▪ Deep learning-based visual feature extraction
□ How to?
□ Convolutional Neural Networks
□ Applications of DL4CV
Traditional visual feature descriptors

Szeliski, R. (2022). Computer vision: algorithms and applications. Springer Nature.




Traditional visual feature descriptors
Points and patches
Patches with gradients in at least two (significantly) different orientations are the easiest to localize


Traditional visual feature descriptors
Points and patches
Adaptive non-maximal suppression (ANMS)
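As an illustration, ANMS can be sketched in a few lines of numpy (the keypoint coordinates, response values, and the robustness constant `c_robust` below are made-up examples, not from the slides): each keypoint's suppression radius is its distance to the nearest significantly stronger keypoint, and the points with the largest radii are kept, which spreads detections evenly over the image.

```python
import numpy as np

def anms(xy, strength, n_keep, c_robust=0.9):
    """Adaptive non-maximal suppression: keep the n_keep points whose
    distance to the nearest significantly-stronger point is largest."""
    xy = np.asarray(xy, float)
    strength = np.asarray(strength, float)
    n = len(strength)
    radii = np.full(n, np.inf)
    for i in range(n):
        # points whose (robustified) strength dominates point i
        stronger = strength > strength[i] / c_robust
        stronger[i] = False
        if stronger.any():
            d = np.linalg.norm(xy[stronger] - xy[i], axis=1)
            radii[i] = d.min()
    # keep the points with the largest suppression radii
    return np.argsort(-radii)[:n_keep]

# toy data: four well-separated corners plus a weak point crowded next to one
pts = np.array([[0, 0], [10, 0], [0, 10], [10, 10], [1, 0]])
resp = np.array([9.0, 8.0, 7.0, 6.0, 5.0])
kept = anms(pts, resp, n_keep=4)
```

Here the weak point crowded next to a strong corner gets a small suppression radius and is dropped, while the four well-separated corners survive.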


Traditional visual feature descriptors
Points and patches
Scale invariance



Traditional visual feature descriptors
Points and patches
Rotational invariance

Traditional visual feature descriptors
Points and patches
Affine invariance



Traditional visual feature descriptors
Points and patches
multi-scale oriented patches (MOPS)



SIFT: Pet Example
SIFT Algorithm

1. Scale-space peak/feature selection
2. Keypoint localization
3. Orientation assignment
4. Keypoint descriptor
5. Keypoint matching (not strictly part of SIFT)
SIFT Step 1: Constructing the Scale Space

 Use multiple scales (octaves), each a downsampled version of the image

 At each octave, use Gaussian blurring at increasing σ to create progressively smoothed versions of the same image
SIFT Step 1: Difference of Gaussians (DoG)

 Subtract adjacent blurred images within each octave to obtain difference-of-Gaussian (DoG) images
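The construction above can be sketched in plain numpy (the σ schedule, the number of octaves, and the 3σ kernel truncation are illustrative assumptions, not SIFT's exact constants): blur at increasing σ within an octave, subtract adjacent blurs to get DoG images, then halve the resolution for the next octave.

```python
import numpy as np

def gaussian_blur(img, sigma):
    """Separable Gaussian blur with reflective padding (plain numpy)."""
    r = int(3 * sigma)                       # truncate the kernel at 3*sigma
    x = np.arange(-r, r + 1)
    k = np.exp(-x**2 / (2 * sigma**2))
    k /= k.sum()
    pad = np.pad(img, r, mode='reflect')
    # filter rows, then columns (separability of the Gaussian)
    tmp = np.apply_along_axis(lambda v: np.convolve(v, k, mode='valid'), 1, pad)
    return np.apply_along_axis(lambda v: np.convolve(v, k, mode='valid'), 0, tmp)

def scale_space_and_dog(img, sigmas=(1.0, 1.6, 2.56), n_octaves=2):
    """Blur at increasing sigma within each octave, downsample by 2 between
    octaves, and take differences of adjacent blurs (DoG)."""
    octaves, dogs = [], []
    for _ in range(n_octaves):
        blurred = [gaussian_blur(img, s) for s in sigmas]
        octaves.append(blurred)
        dogs.append([b1 - b0 for b0, b1 in zip(blurred, blurred[1:])])
        img = img[::2, ::2]                  # next octave: half resolution
    return octaves, dogs

img = np.random.default_rng(0).random((32, 32))
octaves, dogs = scale_space_and_dog(img)
```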
SIFT Algorithm

1. Scale-space peak/feature selection
2. Keypoint localization
3. Orientation assignment
4. Keypoint descriptor
5. Keypoint matching (not strictly part of SIFT)
SIFT Step 2: Keypoint Localization

 Detect maxima and minima of the difference-of-Gaussian images in scale space

 Each point is compared to its 8 neighbours in the current image and 9 neighbours each in the scales above and below (26 neighbours in total)
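A minimal numpy sketch of this 26-neighbour comparison on a toy three-level DoG stack (real SIFT additionally refines the location with a quadratic fit and discards low-contrast and edge responses):

```python
import numpy as np

def local_extrema_3d(dogs):
    """Given 3 stacked DoG images (shape 3 x H x W), mark interior pixels of
    the middle image that are strictly greater (or strictly smaller) than all
    26 neighbours: 8 in-plane plus 9 in each adjacent scale."""
    d = np.asarray(dogs, float)
    assert d.shape[0] == 3
    centre = d[1, 1:-1, 1:-1]
    is_max = np.ones_like(centre, bool)
    is_min = np.ones_like(centre, bool)
    for s in range(3):
        for dy in (-1, 0, 1):
            for dx in (-1, 0, 1):
                if s == 1 and dy == 0 and dx == 0:
                    continue  # skip the centre pixel itself
                nb = d[s, 1 + dy:d.shape[1] - 1 + dy, 1 + dx:d.shape[2] - 1 + dx]
                is_max &= centre > nb
                is_min &= centre < nb
    return is_max | is_min

# toy stack with a single bright blob in the middle scale
stack = np.zeros((3, 7, 7))
stack[1, 3, 3] = 5.0
ext = local_extrema_3d(stack)
```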
SIFT Algorithm

1. Scale-space peak/feature selection
2. Keypoint localization
3. Orientation assignment
4. Keypoint descriptor
5. Keypoint matching (not strictly part of SIFT)
SIFT Step 3: Orientation Assignment

 Assign an orientation to each keypoint:
1. Calculate the gradient magnitude and orientation around the keypoint
2. Build a histogram of orientations, weighted by gradient magnitude; the peak gives the keypoint's dominant orientation
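A minimal numpy sketch of the two sub-steps (SIFT additionally Gaussian-weights the contributions, smooths the histogram, and interpolates the peak; the 8×8 ramp patch is a toy input):

```python
import numpy as np

def orientation_histogram(patch, n_bins=36):
    """Gradient magnitude/orientation over a patch, and a 36-bin orientation
    histogram weighted by magnitude; the peak bin gives the dominant
    orientation."""
    gy, gx = np.gradient(patch.astype(float))      # gradients along y then x
    mag = np.hypot(gx, gy)
    ang = np.degrees(np.arctan2(gy, gx)) % 360.0   # orientation in [0, 360)
    hist, _ = np.histogram(ang, bins=n_bins, range=(0, 360), weights=mag)
    dominant = (np.argmax(hist) + 0.5) * (360.0 / n_bins)  # peak bin centre
    return hist, dominant

# toy patch: intensity increasing left-to-right -> gradient points along +x
patch = np.tile(np.arange(8.0), (8, 1))
hist, theta = orientation_histogram(patch)
```

For the ramp patch the gradient points along +x everywhere, so all the histogram mass lands in the first 10° bin.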
SIFT Step 3: Orientation Assignment
SIFT Algorithm

1. Scale-space peak/feature selection
2. Keypoint localization
3. Orientation assignment
4. Keypoint descriptor
5. Keypoint matching (not strictly part of SIFT)
SIFT Step 4: Keypoint Descriptor

% Convert to grayscale; MATLAB's feature detectors expect a single-channel image
I = rgb2gray(imread('image.jpg'));
points = detectSIFTFeatures(I);                   % SIFT keypoints
[features, valid] = extractFeatures(I, points);   % 128-D descriptors
SIFT advantages.
 Locality: features are local, so robust to
occlusion and clutter (no prior segmentation)
 Distinctiveness: individual features can be
matched to a large database of objects
 Quantity: many features can be generated for
even small objects
 Efficiency: close to real-time performance
 Extensibility: can easily be extended to a wide range of differing feature types
SIFT Mapping in Action…
Traditional visual feature descriptors
Points and patches
Applications: Large-scale matching and retrieval



Traditional visual feature descriptors
Edges and contours



Traditional visual feature descriptors
Edges and contours
Sobel edge detector/operator


https://en.wikipedia.org/wiki/Sobel_operator
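A minimal numpy sketch using the kernels from the page above (the small correlation helper and the step-edge test image are illustrative):

```python
import numpy as np

# Sobel kernels for horizontal and vertical gradients
KX = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], float)
KY = KX.T

def conv2_valid(img, k):
    """2-D 'valid' correlation with a 3x3 kernel (plain numpy)."""
    h, w = img.shape
    out = np.zeros((h - 2, w - 2))
    for i in range(3):
        for j in range(3):
            out += k[i, j] * img[i:h - 2 + i, j:w - 2 + j]
    return out

def sobel_magnitude(img):
    gx = conv2_valid(img, KX)   # response to horizontal intensity change
    gy = conv2_valid(img, KY)   # response to vertical intensity change
    return np.hypot(gx, gy)

# vertical step edge -> strong response along the step, zero elsewhere
img = np.zeros((6, 6))
img[:, 3:] = 1.0
mag = sobel_magnitude(img)
```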
Traditional visual feature descriptors
Edges and contours
Canny edge detector
1. Apply a Gaussian filter to smooth the image in order to remove noise
2. Find the intensity gradients of the image
3. Apply gradient magnitude thresholding or lower-bound cut-off suppression to get rid of spurious responses to edge detection
4. Apply a double threshold to determine potential edges
5. Track edges by hysteresis: finalize the detection of edges by suppressing all other edges that are weak and not connected to strong edges

https://en.wikipedia.org/wiki/Canny_edge_detector
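Steps 4 and 5 can be sketched in numpy on a toy gradient-magnitude array (the thresholds are arbitrary; steps 1–3 are omitted):

```python
import numpy as np

def double_threshold_hysteresis(mag, low, high):
    """Label strong (>= high) and weak (>= low) pixels, then keep weak
    pixels only if 8-connected to a strong one (grown iteratively)."""
    strong = mag >= high
    weak = (mag >= low) & ~strong
    edges = strong.copy()
    changed = True
    while changed:  # grow strong edges into connected weak pixels
        grown = edges.copy()
        grown[1:, :] |= edges[:-1, :]
        grown[:-1, :] |= edges[1:, :]
        grown[:, 1:] |= edges[:, :-1]
        grown[:, :-1] |= edges[:, 1:]
        grown[1:, 1:] |= edges[:-1, :-1]
        grown[:-1, :-1] |= edges[1:, 1:]
        grown[1:, :-1] |= edges[:-1, 1:]
        grown[:-1, 1:] |= edges[1:, :-1]
        new_edges = edges | (grown & weak)
        changed = bool((new_edges != edges).any())
        edges = new_edges
    return edges

# weak pixels touching the strong one survive; the isolated weak one does not
mag = np.array([[0.0, 0.4, 0.9, 0.4, 0.0, 0.0],
                [0.0, 0.0, 0.0, 0.0, 0.0, 0.4]])
edges = double_threshold_hysteresis(mag, low=0.3, high=0.8)
```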
Traditional visual feature descriptors
Edges and contours
Contour detection


Edge and Contour Detection with OpenCV and Python


Traditional visual feature descriptors
(straight) Lines and vanishing points



Traditional visual feature descriptors
(straight) Lines and vanishing points
Hough transforms: having edges “vote” for plausible line locations

https://en.wikipedia.org/wiki/Hough_transform
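A minimal numpy sketch of the voting scheme (the 1-pixel × 1-degree accumulator resolution and the toy point set are assumptions): each edge point votes for every line ρ = x·cosθ + y·sinθ passing through it, and collinear points pile their votes into the same accumulator cell.

```python
import numpy as np

def hough_lines(edge_points, shape, n_theta=180):
    """Accumulate votes: every edge point (y, x) votes for all lines
    rho = x*cos(theta) + y*sin(theta) that pass through it."""
    h, w = shape
    diag = int(np.ceil(np.hypot(h, w)))              # max possible |rho|
    thetas = np.deg2rad(np.arange(n_theta))          # theta = 0..179 degrees
    acc = np.zeros((2 * diag + 1, n_theta), int)     # rho in [-diag, diag]
    for y, x in edge_points:
        rhos = np.round(x * np.cos(thetas) + y * np.sin(thetas)).astype(int)
        acc[rhos + diag, np.arange(n_theta)] += 1
    return acc

# ten collinear points on the horizontal line y = 2 in a 10x10 image
pts = [(2, x) for x in range(10)]
acc = hough_lines(pts, shape=(10, 10))
```

All ten points on the line y = 2 vote for the cell (ρ = 2, θ = 90°), which collects the maximum vote count.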

Traditional Image Categorization: Training phase

Training Images + Training Labels → Image Features → Classifier Training → Trained Classifier

Traditional Image Categorization: Testing phase


Training: Training Images + Training Labels → Image Features → Classifier Training → Trained Classifier

Testing: Test Image → Image Features → Trained Classifier → Prediction (e.g. “Outdoor”)

Features have been the key…

Hand-crafted:
SIFT [Lowe IJCV 04], HOG [Dalal and Triggs CVPR 05], DPM [Felzenszwalb et al. PAMI 10], Color Descriptor [Van De Sande et al. PAMI 10], SPM [Lazebnik et al. CVPR 06]
What about learning the features?
• Learn a feature hierarchy all the way from pixels to classifier
• Each layer extracts features from the output of the previous layer
• Layers have (nearly) the same structure
• Train all layers jointly (“end-to-end”)

Image/Video Pixels → Layer 1 → Layer 2 → Layer 3 → Simple Classifier
Learning Feature Hierarchy
Goal: Learn useful higher-level features from images
Feature representation, from input data upward:
Pixels → 1st layer: “Edges” → 2nd layer: “Object parts” → 3rd layer: “Objects”
[Lee et al., ICML 2009; CACM 2011]

Slide credit: Rob Fergus


Learning Feature Hierarchy
• Better performance

• Other domains (unclear how to hand-engineer features):
– Kinect
– Video
– Multispectral

Slide credit: Rob Fergus


“Shallow” vs. “Deep” architectures


Slide credit: Yann LeCun


“Shallow” vs. “Deep” architectures

Layer 1 … Layer k

Slide adapted from: Yann LeCun



Why deep learning?


https://towardsdatascience.com/what-is-deep-learning-and-how-does-it-work-2ce44bb692ac
Types of Learning & History
▪ Brain
▪ Supervised learning
▪ Unsupervised learning
▪ Modern architectures

https://towardsdatascience.com/supervised-vs-unsupervised-learning-in-2-minutes-72dad148f242 https://medium.com/analytics-vidhya/cnns-architectures-lenet-alexnet-vgg-googlenet-resnet-and-more-666091488df5

General learning types


▪ Supervised learning
□ Learn to predict an output when given an input vector.
▪ Reinforcement learning
□ Learn to select an action to maximize payoff.
▪ Unsupervised learning
□ Discover a good internal representation of the input.
□ Self-supervised learning

Courtesy: Hinton & LeCun
A brief history

FCN, U-Net, YOLO, Generative Adversarial Network (GAN), Mask R-CNN, Capsule Network, Graph Neural Network (GNN), Vision Transformer (ViT), Neural Radiance Field (NeRF) … today
Basic definition
Deep Learning → Deep Neural Network, inspired by the human brain (demo video)

Image courtesy: https://medium.com/autonomous-agents/mathematical-foundation-for-activation-functions-in-artificial-neural-networks-a51c9dd7c089


Basic definition
• Nonlinear
• Can approximate any continuous function to arbitrary accuracy, given sufficiently many hidden units

Figure from Christopher Bishop


Basic definition
• Activations: a_j = Σ_i w_ji x_i + b_j (weighted sums of the inputs)

• Nonlinear activation function h (e.g. sigmoid, ReLU): z_j = h(a_j)

Figure from Christopher Bishop
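These definitions amount to a few lines of numpy (the input, weight, and bias values below are arbitrary toy numbers):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def relu(a):
    return np.maximum(0.0, a)

# a unit's activation: weighted sum of inputs plus bias, then nonlinearity h
x = np.array([0.5, -1.0, 2.0])   # inputs
w = np.array([0.1, 0.4, -0.2])   # weights
b = 0.05                         # bias
a = w @ x + b                    # activation (pre-nonlinearity)
z_sig = sigmoid(a)               # sigmoid unit output
z_relu = relu(a)                 # ReLU unit output
```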


Basic definition
• Layer 2: a_k = Σ_j w_kj z_j + b_k, applied to the layer-1 outputs z_j

• Layer 3 (final)

• Outputs: e.g. sigmoid (binary) or softmax (multiclass)

• Putting everything together: the network is a composition of layers, y(x) = output(h(h(x · W1) · W2) · W3)
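Putting everything together as a minimal numpy sketch (the layer sizes and the random, untrained weights are arbitrary): each layer is an affine map followed by a nonlinearity, with a softmax on the final layer.

```python
import numpy as np

def softmax(a):
    e = np.exp(a - a.max())   # subtract max for numerical stability
    return e / e.sum()

def forward(x, params):
    """Two hidden layers with ReLU, softmax output: a composition of
    layer-wise affine maps and nonlinearities."""
    W1, b1, W2, b2, W3, b3 = params
    z1 = np.maximum(0, W1 @ x + b1)    # layer 1
    z2 = np.maximum(0, W2 @ z1 + b2)   # layer 2
    return softmax(W3 @ z2 + b3)       # layer 3 (final): class probabilities

rng = np.random.default_rng(0)
dims = [4, 8, 8, 3]   # input -> hidden -> hidden -> 3 classes
params = []
for din, dout in zip(dims, dims[1:]):
    params += [rng.normal(0, 0.5, (dout, din)), np.zeros(dout)]

y = forward(rng.normal(size=4), params)
```

The output is a valid probability distribution over the three classes, even with untrained weights; training adjusts the weights so the distribution matches the labels.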
Basic definition
• Lots of hidden layers
• Depth = power (usually)
Weights to learn at every layer!

Figure from http://neuralnetworksanddeeplearning.com/chap5.html


The way to learn? Gradient descent


Credit: Andrew Ng, Alexei Efros, Samuel Velasco/Quanta Magazine


https://towardsdatascience.com/a-visual-explanation-of-gradient-descent-methods-momentum-adagrad-rmsprop-adam-f898b102325c
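A minimal numpy sketch of the update rule w ← w − η·∇f(w), applied to a toy quadratic whose minimum is known in closed form:

```python
import numpy as np

def gradient_descent(grad, w0, lr=0.1, n_steps=100):
    """Repeatedly step against the gradient: w <- w - lr * grad(w)."""
    w = np.asarray(w0, float)
    for _ in range(n_steps):
        w = w - lr * grad(w)
    return w

# minimise f(w) = (w0 - 3)^2 + (w1 + 1)^2, whose gradient is 2*(w - [3, -1])
grad = lambda w: 2 * (w - np.array([3.0, -1.0]))
w_star = gradient_descent(grad, w0=[0.0, 0.0])
```

With a fixed learning rate the iterates converge geometrically to the minimizer (3, −1); in deep networks the same rule is applied to all the layer weights at once, with the gradient computed by backpropagation.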
