0% found this document useful (0 votes)

53 views34 pages

Sift Detector and Descriptor: (Scale Invariant Feature Transform)

1. The SIFT detector and descriptor were developed by David Lowe to detect and describe local features in images that are invariant to changes in scale, rotation, and illumination. 2. SIFT extracts distinctive invariant features from images based on location and scale, orientation, and local image gradients. 3. Keypoints are filtered and described by histograms of local image gradient orientations to provide features that can be matched between different images.

Uploaded by

Nguyen Viet Anh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views34 pages

Sift Detector and Descriptor: (Scale Invariant Feature Transform)

Uploaded by

Nguyen Viet Anh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

The SIFT (Scale Invariant Feature

Transform) Detector and Descriptor

developed by David Lowe
University of British Columbia
Initial paper ICCV 1999
Newer journal paper IJCV 2004

Review: Matt Browns Canonical Frames

11/1/2010

Multi-Scale Oriented Patches

Extract oriented patches at multiple scales

11/1/2010

[ Brown, Szeliski, Winder CVPR 2005 ]

Application: Image Stitching

11/1/2010

[ Microsoft Digital Image Pro version 10 ]

Ideas from Matts Multi-Scale Oriented Patches

1. Detect an interesting patch with an interest

operator. Patches are translation invariant.
2. Determine its dominant orientation.
3. Rotate the patch so that the dominant
orientation points upward. This makes the
patches rotation invariant.
4. Do this at multiple scales, converting them
all to one scale through sampling.
5. Convert to illumination invariant form

11/1/2010

Implementation Concern:
How do you rotate a patch?

Start with an empty patch whose dominant

direction is up.
For each pixel in your patch, compute the
position in the detected image patch. It will be
in floating point and will fall between the
image pixels.
Interpolate the values of the 4 closest pixels
in the image, to get a value for the pixel in
your patch.

11/1/2010

Rotating a Patch
(x,y)

T
(x,y)

empty canonical patch

x = x cos y sin
y = x sin + y cos

patch detected in the image

counterclockwise rotation

11/1/2010

Using Bilinear Interpolation

Use all 4 adjacent samples

I01

I11

I00

11/1/2010

I10

SIFT: Motivation

The Harris operator is not invariant to scale and

correlation is not invariant to rotation1.

For better image matching, Lowes goal was to

develop an interest operator that is invariant to scale
and rotation.

Also, Lowe aimed to create a descriptor that was

robust to the variations corresponding to typical
viewing conditions. The descriptor is the most-used
part of SIFT.

1But

Schmid and Mohr developed a rotation invariant descriptor for it in 1997.

11/1/2010

Idea of SIFT
Image content is transformed into local feature
coordinates that are invariant to translation, rotation,
scale, and other imaging parameters

SIFT Features
11/1/2010

Claimed Advantages of SIFT

Locality: features are local, so robust to occlusion

and clutter (no prior segmentation)

Distinctiveness: individual features can be

matched to a large database of objects

Quantity: many features can be generated for even

small objects

Efficiency: close to real-time performance

Extensibility: can easily be extended to wide range

of differing feature types, with each adding
robustness

11/1/2010

Overall Procedure at a High Level

1. Scale-space extrema detection
Search over multiple scales and image locations.

2. Keypoint localization
Fit a model to detrmine location and scale.
Select keypoints based on a measure of stability.

3. Orientation assignment
Compute best orientation(s) for each keypoint region.

4. Keypoint description
Use local image gradients at selected scale and rotation
to describe each keypoint region.
11/1/2010

1. Scale-space extrema detection

Goal: Identify locations and scales that can be

repeatably assigned under different views of the
same scene or object.
Method: search for stable features across multiple
scales using a continuous function of scale.
Prior work has shown that under a variety of
assumptions, the best function is a Gaussian
function.
The scale space of an image is a function L(x,y,)
that is produced from the convolution of a Gaussian
kernel (at different scales) with the input image.

11/1/2010

Aside: Image Pyramids

And so on.
3rd level is derived from the
2nd level according to the same
funtion
2nd level is derived from the
original image according to
some function

Bottom level is the original image.

11/1/2010

Aside: Mean Pyramid

And so on.
At 3rd level, each pixel is the mean
of 4 pixels in the 2nd level.
At 2nd level, each pixel is the mean
of 4 pixels in the original image.
mean

Bottom level is the original image.

11/1/2010

Aside: Gaussian Pyramid

At each level, image is smoothed and
reduced in size.
And so on.

Apply Gaussian filter

At 2nd level, each pixel is the result

of applying a Gaussian mask to
the first level and then subsampling
to reduce the size.

Bottom level is the original image.

11/1/2010

Example: Subsampling with Gaussian pre-filtering

G 1/8
G 1/4

Gaussian 1/2

11/1/2010

Lowes Scale-space Interest Points

Laplacian of Gaussian kernel

Scale normalised (x by scale2)

Proposed by Lindeberg

Scale-space detection

Find local maxima across scale/space

A good blob detector

11/1/2010

[ T. Lindeberg IJCV 1998 ]

Lowes Scale-space Interest Points:

Difference of Gaussians

11/1/2010

Gaussian is an ad hoc
solution of heat
diffusion equation

Hence

k is not necessarily very

small in practice
19

Lowes Pyramid Scheme

Scale space is separated into octaves:
Octave 1 uses scale
Octave 2 uses scale 2
etc.
In each octave, the initial image is repeatedly convolved
with Gaussians to produce a set of scale space images.
Adjacent Gaussians are subtracted to produce the DOG
After each octave, the Gaussian image is down-sampled
by a factor of 2 to produce an image the size to start
the next level.
11/1/2010

Lowes Pyramid Scheme

s+2 filters
s+1=2(s+1)/s0
.
.
i=2i/s0
.
.
2=22/s0
1=21/s0
0
11/1/2010

s+3
images
including
original
The parameter s determines the number of images per octave.

s+2
difference
images
21

Key point localization

Detect maxima and

minima of difference-ofGaussian in scale space
Each point is compared
to its 8 neighbors in the
current image and 9
neighbors each in the
scales above and below

11/1/2010

s+2 difference images.

top and bottom ignored.
s planes searched.

Resample
Blur
Subtract

For each max or min found,

output is the location and
the scale.

Scale-space extrema detection: experimental results over 32 images

that were synthetically transformed and noise added.
% detected

average no. detected

% correctly matched
average no. matched

Stability

Expense

Sampling in scale for efficiency

How many scales should be used per octave? S=?

11/1/2010

More scales evaluated, more keypoints found

S < 3, stable keypoints increased too
S > 3, stable keypoints decreased
S = 3, maximum stable keypoints found
23

Keypoint localization

Once a keypoint candidate is found, perform a

detailed fit to nearby data to determine

location, scale, and ratio of principal curvatures

In initial work keypoints were found at location and

scale of a central sample point.
In newer work, they fit a 3D quadratic function to
improve interpolation accuracy.
The Hessian matrix was used to eliminate edge
responses.

11/1/2010

Eliminating the Edge Response

Reject flats:

< 0.03

Reject edges:
Let be the eigenvalue with
larger magnitude and the smaller.

Let r = /.
So = r

r < 10
What does this look like?

11/1/2010

(r+1)2/r is at a
min when the
2 eigenvalues
are equal.

3. Orientation assignment

Create histogram of
local gradient directions
at selected scale
Assign canonical
orientation at peak of
smoothed histogram
Each key specifies
stable 2D coordinates
(x, y, scale,orientation)

If 2 major orientations, use both.

11/1/2010

Keypoint localization with orientation

233x189

832
initial keypoints

729
keypoints after
gradient threshold

11/1/2010

536
keypoints after
ratio threshold

4. Keypoint Descriptors

At this point, each keypoint has

location
scale
orientation

Next is to compute a descriptor for the local

image region about each keypoint that is

11/1/2010

highly distinctive
invariant as possible to variations such as
changes in viewpoint and illumination
28

Normalization

Rotate the window to standard orientation

Scale the window size based on the scale at

which the point was found.

11/1/2010

Lowes Keypoint Descriptor

(shown with 2 X 2 descriptors over 8 X 8)

In experiments, 4x4 arrays of 8 bin histogram is used,

a total of 128 features for one keypoint
11/1/2010

Lowes Keypoint Descriptor

use the normalized region about the keypoint

compute gradient magnitude and orientation at each
point in the region
weight them by a Gaussian window overlaid on the
circle
create an orientation histogram over the 4 X 4
subregions of the window
4 X 4 descriptors over 16 X 16 sample array were
used in practice. 4 X 4 times 8 directions gives a
...
vector of 128 values.

11/1/2010

Using SIFT for Matching Objects

11/1/2010

Uses for SIFT

Feature points are used also for:

11/1/2010

Image alignment (homography, fundamental

matrix)
3D reconstruction (e.g. Photo Tourism)
Motion tracking
Object recognition
Indexing and database retrieval
Robot navigation
many others
[ Photo Tourism: Snavely et al. SIGGRAPH 2006 ]

Ilovepdf Merged
No ratings yet
Ilovepdf Merged
510 pages
Analysis of Complex Sample Survey Data: Multinomial and Ordinal Logistic Regression For Complex Samples
No ratings yet
Analysis of Complex Sample Survey Data: Multinomial and Ordinal Logistic Regression For Complex Samples
39 pages
Scale Invariant Feature Transform (SIFT) : CS 763 Ajit Rajwade
No ratings yet
Scale Invariant Feature Transform (SIFT) : CS 763 Ajit Rajwade
52 pages
Document From Sindhu Reddy... ??
No ratings yet
Document From Sindhu Reddy... ??
94 pages
Comparis I On
No ratings yet
Comparis I On
68 pages
Features Extraction DR - Tamizhselvan
No ratings yet
Features Extraction DR - Tamizhselvan
56 pages
Verilog HDL - Samir Palnitkar
No ratings yet
Verilog HDL - Samir Palnitkar
403 pages
SRM Ramapuram Digital Image Processing Unit 5 DIP
No ratings yet
SRM Ramapuram Digital Image Processing Unit 5 DIP
41 pages
L4 - Features & Filters I
No ratings yet
L4 - Features & Filters I
25 pages
9-2e. SIFT-21-08-2024
No ratings yet
9-2e. SIFT-21-08-2024
66 pages
Recognition Local Features
No ratings yet
Recognition Local Features
41 pages
Features
No ratings yet
Features
60 pages
Harris InterestPoints Andfeatures 1
No ratings yet
Harris InterestPoints Andfeatures 1
64 pages
Database Relasional
No ratings yet
Database Relasional
61 pages
Unit II - Chapter 4 - Feature Detection
No ratings yet
Unit II - Chapter 4 - Feature Detection
56 pages
Local Features Scale Invariant Feature Transform
No ratings yet
Local Features Scale Invariant Feature Transform
17 pages
CV - Unit 2
No ratings yet
CV - Unit 2
30 pages
Featuredescriptor
No ratings yet
Featuredescriptor
45 pages
Sift
No ratings yet
Sift
28 pages
SIFT
No ratings yet
SIFT
33 pages
SIFT Transform
No ratings yet
SIFT Transform
50 pages
Unified Modeling Language (UML)
100% (2)
Unified Modeling Language (UML)
24 pages
SIFT - The Scale Invariant Feature Transform
No ratings yet
SIFT - The Scale Invariant Feature Transform
62 pages
Lecture 13
No ratings yet
Lecture 13
12 pages
4.01 08 2022 - FeatureDescriptors
No ratings yet
4.01 08 2022 - FeatureDescriptors
46 pages
Detailed Guide Powerful Sift Technique Image Matching Python
No ratings yet
Detailed Guide Powerful Sift Technique Image Matching Python
12 pages
Q1 - SIFT - Distinctive Image Features From Scale-Invariant Keypoints
No ratings yet
Q1 - SIFT - Distinctive Image Features From Scale-Invariant Keypoints
20 pages
Lou's Pseudo 3d Page (Backup)
No ratings yet
Lou's Pseudo 3d Page (Backup)
18 pages
S-SIFT A Simple SIFT Algorithm With High Efficiency
No ratings yet
S-SIFT A Simple SIFT Algorithm With High Efficiency
3 pages
CVML Mulakat Notlari
No ratings yet
CVML Mulakat Notlari
8 pages
Digital Image Processing
No ratings yet
Digital Image Processing
88 pages
Bayesian Information Criterion
No ratings yet
Bayesian Information Criterion
3 pages
Computer Vision
No ratings yet
Computer Vision
6 pages
Acknowledgement: Sift:Scale Invariant Feature Transform
No ratings yet
Acknowledgement: Sift:Scale Invariant Feature Transform
25 pages
MS Excel Instruction Steps in Matrimony Conjoint Analysis
No ratings yet
MS Excel Instruction Steps in Matrimony Conjoint Analysis
8 pages
SIFT - The Scale Invariant Feature Transform
No ratings yet
SIFT - The Scale Invariant Feature Transform
62 pages
SIFT - The Scale Invariant Feature Transform
No ratings yet
SIFT - The Scale Invariant Feature Transform
62 pages
Feature Detection: Jayanta Mukhopadhyay Dept. of Computer Science and Engg
No ratings yet
Feature Detection: Jayanta Mukhopadhyay Dept. of Computer Science and Engg
54 pages
Timeseries PPT 1O
No ratings yet
Timeseries PPT 1O
64 pages
Illumination Scale Rotation
No ratings yet
Illumination Scale Rotation
16 pages
Feed-Forward Neural Networks (Part 2: Learning)
No ratings yet
Feed-Forward Neural Networks (Part 2: Learning)
17 pages
SIFT White
No ratings yet
SIFT White
55 pages
Regresi Linear Sederhana
No ratings yet
Regresi Linear Sederhana
10 pages
Lecture 3 Simulaton
100% (1)
Lecture 3 Simulaton
32 pages
DataBase System CH - 1 and 2
No ratings yet
DataBase System CH - 1 and 2
65 pages
Implicit Data Type Conversion: Query-1
No ratings yet
Implicit Data Type Conversion: Query-1
29 pages
Chapter - 5 (New) PDF
No ratings yet
Chapter - 5 (New) PDF
17 pages
Feature Description & Extraction: FAST (Features From Accelerated Segment Test)
No ratings yet
Feature Description & Extraction: FAST (Features From Accelerated Segment Test)
11 pages
Topic: Sift (Scale Invariant Feature Transform) Method For Key Location Detection
No ratings yet
Topic: Sift (Scale Invariant Feature Transform) Method For Key Location Detection
6 pages
Id Name Age: Celebs
No ratings yet
Id Name Age: Celebs
3 pages
Distinctive Image Feature From Scale-Invariant Keypoints: David G. Lowe, 2004
No ratings yet
Distinctive Image Feature From Scale-Invariant Keypoints: David G. Lowe, 2004
27 pages
s16 Ramy Final
No ratings yet
s16 Ramy Final
51 pages
DBMS Lab 05 21102020 013902pm
No ratings yet
DBMS Lab 05 21102020 013902pm
9 pages
Panorama Stitching Based On SIFT Algorithm and Lev
No ratings yet
Panorama Stitching Based On SIFT Algorithm and Lev
8 pages
Sift Preprint
No ratings yet
Sift Preprint
28 pages
Scale Invariant Feature Transform (SIFT)
No ratings yet
Scale Invariant Feature Transform (SIFT)
24 pages
Frontsim
100% (1)
Frontsim
2 pages
Object Oriented Thinking: Asfar
No ratings yet
Object Oriented Thinking: Asfar
63 pages
Local Features Tutorial:: (C) 2004 F. Estrada & A. Jepson & D. Fleet
No ratings yet
Local Features Tutorial:: (C) 2004 F. Estrada & A. Jepson & D. Fleet
25 pages
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
No ratings yet
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
7 pages
Scale Invariant Feature Transform: Tom Duerig
No ratings yet
Scale Invariant Feature Transform: Tom Duerig
30 pages
Distinctive Image Features From Scale-Invariant Keypoints
No ratings yet
Distinctive Image Features From Scale-Invariant Keypoints
26 pages
Akai A4162, A4191 DVD Player, SM
No ratings yet
Akai A4162, A4191 DVD Player, SM
28 pages
A Survey of Content-Based Image Retrieval Systems Using Scale-Invariant Feature Transform (SIFT)
No ratings yet
A Survey of Content-Based Image Retrieval Systems Using Scale-Invariant Feature Transform (SIFT)
5 pages
SIFT - Distinctive Image Features From Scale-Invariant Keypoints
No ratings yet
SIFT - Distinctive Image Features From Scale-Invariant Keypoints
16 pages
Scale Invariant Feature Transform by David Lowe Short Explanation of The Approach by Michela Lecca
No ratings yet
Scale Invariant Feature Transform by David Lowe Short Explanation of The Approach by Michela Lecca
22 pages
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
No ratings yet
An Implementation of SIFT Detector and Descriptor: Andrea Vedaldi University of California at Los Angeles
7 pages
Feature Matching: "What Stuff in The Left Image Matches With Stuff On The Right?"
No ratings yet
Feature Matching: "What Stuff in The Left Image Matches With Stuff On The Right?"
62 pages
Chapter 2 - Relational Data Model
No ratings yet
Chapter 2 - Relational Data Model
26 pages
Scale Estimation and Keypoint Description: Li Yicheng
No ratings yet
Scale Estimation and Keypoint Description: Li Yicheng
10 pages
ECE181B Proj02 Report
No ratings yet
ECE181B Proj02 Report
12 pages
ER Diagram Problem
No ratings yet
ER Diagram Problem
5 pages
Object Recognition From Local Scale-Invariant Features (SIFT)
No ratings yet
Object Recognition From Local Scale-Invariant Features (SIFT)
24 pages
Scale-Invariant Feature Transform
No ratings yet
Scale-Invariant Feature Transform
19 pages
What Is A UML Class Diagram
No ratings yet
What Is A UML Class Diagram
5 pages
A Comparison of SIFT and Harris Conner Features For Correspondence Points Matching
No ratings yet
A Comparison of SIFT and Harris Conner Features For Correspondence Points Matching
4 pages
Mywork - Hrs
No ratings yet
Mywork - Hrs
4 pages
Sys Verilog
No ratings yet
Sys Verilog
115 pages
Computer Aided Design
No ratings yet
Computer Aided Design
7 pages
Data Visualisation Handout
100% (1)
Data Visualisation Handout
4 pages
Scale Invariant Feature Transfrom: A Seminar On
No ratings yet
Scale Invariant Feature Transfrom: A Seminar On
8 pages
Interim Report: Improving SIFT Features
No ratings yet
Interim Report: Improving SIFT Features
8 pages
Improved SIFT Algorithm Image Matching
No ratings yet
Improved SIFT Algorithm Image Matching
7 pages
Recognizing Pictures at An Exhibition Using SIFT
No ratings yet
Recognizing Pictures at An Exhibition Using SIFT
5 pages
About The Program
No ratings yet
About The Program
3 pages
Analysis and Classification of Feature Extraction Techniques: A Study
No ratings yet
Analysis and Classification of Feature Extraction Techniques: A Study
6 pages
ArchiMAG - BIM Evolution by Victor Silva
No ratings yet
ArchiMAG - BIM Evolution by Victor Silva
1 page
Oops ABAP
No ratings yet
Oops ABAP
53 pages

Sift Detector and Descriptor: (Scale Invariant Feature Transform)

Uploaded by

Sift Detector and Descriptor: (Scale Invariant Feature Transform)

Uploaded by

The SIFT (Scale Invariant Feature

Transform) Detector and Descriptor

Review: Matt Browns Canonical Frames

Multi-Scale Oriented Patches

Extract oriented patches at multiple scales

[ Brown, Szeliski, Winder CVPR 2005 ]

Application: Image Stitching

[ Microsoft Digital Image Pro version 10 ]

Ideas from Matts Multi-Scale Oriented Patches

1. Detect an interesting patch with an interest

Start with an empty patch whose dominant

empty canonical patch

patch detected in the image

Using Bilinear Interpolation

Use all 4 adjacent samples

The Harris operator is not invariant to scale and

For better image matching, Lowes goal was to

Also, Lowe aimed to create a descriptor that was

Schmid and Mohr developed a rotation invariant descriptor for it in 1997.

Claimed Advantages of SIFT

Locality: features are local, so robust to occlusion

Distinctiveness: individual features can be

Quantity: many features can be generated for even

Efficiency: close to real-time performance

Extensibility: can easily be extended to wide range

Overall Procedure at a High Level

1. Scale-space extrema detection

Goal: Identify locations and scales that can be

Aside: Image Pyramids

Bottom level is the original image.

Aside: Mean Pyramid

Bottom level is the original image.

Aside: Gaussian Pyramid

Apply Gaussian filter

At 2nd level, each pixel is the result

Bottom level is the original image.

Example: Subsampling with Gaussian pre-filtering

Lowes Scale-space Interest Points

Laplacian of Gaussian kernel

Scale normalised (x by scale2)

Find local maxima across scale/space

[ T. Lindeberg IJCV 1998 ]

Lowes Scale-space Interest Points:

k is not necessarily very

Lowes Pyramid Scheme

Lowes Pyramid Scheme

Key point localization

Detect maxima and

s+2 difference images.

For each max or min found,

Scale-space extrema detection: experimental results over 32 images

average no. detected

Sampling in scale for efficiency

How many scales should be used per octave? S=?

More scales evaluated, more keypoints found

Once a keypoint candidate is found, perform a

location, scale, and ratio of principal curvatures

In initial work keypoints were found at location and

Eliminating the Edge Response

If 2 major orientations, use both.

Keypoint localization with orientation

At this point, each keypoint has

Next is to compute a descriptor for the local

Rotate the window to standard orientation

Scale the window size based on the scale at

Lowes Keypoint Descriptor

In experiments, 4x4 arrays of 8 bin histogram is used,

Lowes Keypoint Descriptor

use the normalized region about the keypoint

Using SIFT for Matching Objects

Uses for SIFT

Feature points are used also for:

Image alignment (homography, fundamental

You might also like