0% found this document useful (0 votes)

19 views50 pages

07 Iaor

The document outlines the concepts of image analysis and object recognition, focusing on shape detection techniques such as the Hough transform for line and circle fitting. It discusses the challenges of fitting models to noisy data and introduces the use of voting schemes to identify parameters that best represent detected features. Additionally, it covers Fourier descriptors for shape recognition, emphasizing their ability to provide translation and scale invariance in shape comparisons.

Uploaded by

bhagath mamillapally

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views50 pages

07 Iaor

Uploaded by

bhagath mamillapally

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 50

Image Analysis and Object Recognition

Basic Concepts from

Image Processing to Image Understanding

BOX

Lectures SoSe 2024

(Course notes for internal use only!)
Overview
• Shape detection Fitting model into image

– Hough transform
• Line fitting
• Circle fitting known radius and unkown radius
• General contours

– Fourier descriptors is used for shape detection.

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 2
Hough: Model fitting
Example: Line fitting
• Why fit lines?
– Many (artificial) objects are characterized by presence of straight lines

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 4
Difficulty of line fitting
this is the data.

• Noise in measured edge points, orientations:

– how to detect underlying parameters?
• Only some parts of each line detected,
and some parts are missing:
– how to find a line that bridges missing
information?
• Extra edge points (clutter), multiple models:
– which points go with which line, if any?

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 5
Hough transform
• Find imperfect objects with an early type of a voting scheme
Basically : Fitting noised data into a model

• General outline:
– Discretize dual parameter space into bins (accumulator array)
– For each feature point in the image, put a vote in every bin in the
parameter space that could have generated this point
– Find bins that have the most votes
x b

y m
y = mx + b 5

Image space Hough (parameter) space

Data will be available in pixels
© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 6
Hough: Duality of points and lines
y b

This line has one point in parameter space.

x m0 m
Image space Hough (parameter) space

Connection between image (x, y) and Hough (m, b) spaces

 A line in the image corresponds to one point in Hough space
 To go from image space to Hough space:
 Given a set of points (x, y), find all (m, b) such that y = mx + b

x0 x m
Image space Hough (parameter) space

Connection between image (x, y) and Hough (m, b) spaces

 What does a point (x , y ) in the image space map to?
0 0
 Answer: the solutions of b = -x0m + y0
 This is a line in Hough space

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 8
Hough transform: Lines
y b
(x1, y1)
b = –x0m + y0
(x0, y0)

b = –x1m + y1

x m
Image space Hough (parameter) space

What are the line parameters for the line

that contains both (x0, y0) and (x1, y1)?
 It is the intersection of the lines
b = –x0m + y0 and b = –x1m + y1

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 9
Voting
• It is not feasible to check all combinations of features
by fitting a model to each possible subset.

• Voting is a general technique where we let each feature

vote for all models that are compatible with it
– Cycle through features, cast votes for model parameters.
– Look for model parameters that receive a lot of votes.

• Noise and clutter features will cast votes too, but typically their votes
should be inconsistent with the majority of “good” features.

accumulator
array

x b
m 1 0 0 0 1 0
0 1 0 1 0 0
0 0 2 0 0 0
0 1 0 1 0 0
1 0 0 0 1 0
b
© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 11
Example: Voting 2/2
y m

x b
m 1 2 0 1 2 2
1 3 3 3 3 2
1 3 7 5 2 0
3 3 1 2 2 1
2 1 0 1 2 1
b
© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 12
Parameter space representation
• Problems with the (m, b) space:
– Unbounded parameter domain
– Vertical lines require infinite m
• Alternative: polar representation (Hessian normal form)
This has to be used in assignment.

x cosθ + y sinθ = ρ

ρ (rho) is the shortest distance

from the line to origin
rho shouldn´t be more than diagonal of the image.

θ (theta) is the angle the

perpendicular (normal vector)
makes with the x-axis.

Each point will add a sinusoid in the (θ, ρ) parameter space

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 13
Example: Polar representation
More points are required to come to conclusion on the rho and theta values.

y ρ

ρ
θ
x θ

image space Hough (parameter) space

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 14
Algorithm outline
• Initialize discreet accumulator
H(0…179, -d…d) for d = sqrt(w²+h²)
to all zeros
• for each edge point (x, y) in the image ρ
for θ = 0 to 179 Only looping around angle. No need of rho.
ρ = x cos θ + y sin θ
H(θ, ρ) = H(θ, ρ) + 1
end θ
end
• Find the value(s) of (θ, ρ)
where H(θ, ρ) is a local maximum
– The detected line in the image is given by
ρ = x cos θ + y sin θ

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 15
Extension: Incorporating image gradients
• Recall: when we detect an
edge point, we also know
its gradient direction
• This means that the line
is uniquely determined!

• Modified Hough transform:

For each edge point (x, y)
θ = gradient orientation at (x, y)
ρ = x cos θ + y sin θ Instead of divided theta, we calculate the gradient at the point.
This will speed up the process.

H(θ, ρ) = H(θ, ρ) + 1
end
© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 16
Example: Straight line detection

original edge detection found lines

parameter space
Visualize the parameter space.

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 18
Practical details
• Try to get rid of irrelevant features Things to consider in implementation.

– Take only edge points with significant gradient magnitude

• Choose a good grid / discretization
– Too coarse: large votes obtained when too many
different lines correspond to a single bucket
– Too fine: miss lines because some points that are not
exactly collinear cast votes for different buckets

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 19
Example: Basic illustration
Perfect lines -> perfect parameters

features votes

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 20
Effect of noise
Peak gets fuzzy and hard to locate
Use smoothing to avoid this clusters.
Run smoothing filter through accumulator array.

features votes
© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 21
Practical details
• Try to get rid of irrelevant features
– Take only edge points with significant gradient magnitude
• Choose a good grid / discretization
– Too coarse: large votes obtained when too many
different lines correspond to a single bucket
– Too fine: miss lines because some points that are not
exactly collinear cast votes for different buckets
• Increment neighboring bins
(smoothing in accumulator array)
• Who belongs to which line?
– Tag the votes See next page for its application

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 25
Extension: Cascaded Hough transform
• Let’s go back to the original (m, b) parameterization
• A line in the image maps to a pencil of lines in the
Hough space
• What do we get with parallel lines or a pencil of lines?
– Collinear peaks in the Hough space!
• So we can apply a second Hough transform to the
output of the first transform to find vanishing points

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 27
Hough: Circle Fitting
Finding circles by Hough transform

Equation of a circle: y

( xi − a ) 2 + ( yi − b) 2 = r 2 r
b (xi, yi)

If radius r is known:
Accumulator array H (a, b) x
a
(2D Hough space)
b

original edges (note noise)

Hr1 (Penny) Hr2 (Quarters)

Note: because of the

different sizes,
a separate Hough
transform was used!

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 32
Circles with unknown radius
• Main idea: The gradient direction θ at an
edge pixel points from/to the center of the circle
• Circle equations:
a = xi - r cos(θ) a, b, r are parameters
b = yi - r sin(θ)

(xi, yi)
r
(a, b)

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 33
Hough transform for circles
2D image space 3D Hough parameter space (volume)

y r

−r ⋅∇f ( x, y )

(x,y)
a
+ r ⋅∇f ( x, y )

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 35
Hough: Universal transform
Generalized Hough transform
• We want to find a shape defined by its boundary points
p and a reference point a
• For every boundary point, we can compute the
displacement vector r = a – p as a function of the
gradient orientation θ

θ r(θ)
p

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 37
Example: Generalized Hough transform
image of model r table

P2 P3 Φ ri
0 - 40°
90° 90° 40 - 80°
offline phase
80 --120°
80 120°
r2 r3 120 - 160°
(model 160 - 200°
generation) r1
315° o 200 - 240°
240 - 280°
P1 280 - -320°
280 320°
320 - 360°

online phase r3 r2
r2 r3
search image
(object
recognition)
r1

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 38
Hough transform
• Pros
– Can deal with non-locality and occlusion
– Can detect multiple instances of a model in a single pass
– Some robustness to noise: noise points unlikely to
contribute consistently to any single bin
• Cons
– Complexity of search time increases exponentially with
the number of model parameters
– Non-target shapes can produce wrong peaks in
parameter space
– It’s hard to pick a good grid size

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 39
Fourier descriptors
Recognition using Fourier descriptors
- Given: N points at the boundary of a closed 2D region
- Coordinate system is interpreted as the complex plane
- Complex number
xi + j yi y point on

imaginary
region
boundary
is i-th of the N points region (N = 24)

- 1D-DFT of the sequence x

(using the FFT) of N points
real
will be referred to as
Fourier descriptor F
of the contour
- Manipulations in frequency domain allow elimination
of the dependence on position, size and orientation.
© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 41
Normalization 1/3
• Fourier component F0 corresponds to centroid ≙ translation
• Leaving out F0 when comparing shapes
→ translation invariance

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 42
Normalization 2/3
• F1 corresponds to the radius
• Standard size: Fourier component F1 = 1
• Then normalization is a division of all coefficients by the absolute value of F1
→ scale invariance

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 43
Normalization 3/3
• Orientation and starting position affect only the phase:
Simple solution: Remove all phase information and consider only
the absolute values of the descriptor
→ rotation invariance and invariance regarding starting point.

Reconstruction of the shapes of letters “L” and “T” using 2, 3, 4 and 8 Fourier components

• Shape differences are measurable by

© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 46
Example: Detect aero planes
• 512 contour points
• Calculation of the normalized Fourier descriptor
• Work only on the first halve (256 elements),
the second halve must be mirrored!
• Fine details can be hidden by leaving out high frequency
components
• Keep the first low frequency components,
set the rest to 0
• Then, the inverse Fourier transform provides an
approximation of the original data
→ The first 32 low-frequency components are adequate
for the classification of the aero plane shapes
© Volker Rodehorst Lecture Image Analysis & Object Recognition - SoSe 2024 47
Example: Recognizing leaves

Berlin university study.

Tuan 6 Hough Transform Principle2
No ratings yet
Tuan 6 Hough Transform Principle2
88 pages
Computer Vision and Object Recognition
No ratings yet
Computer Vision and Object Recognition
76 pages
19 Cool Acoustic Guitar Tabs
100% (2)
19 Cool Acoustic Guitar Tabs
19 pages
03-04 Hough Transform and Image Features
No ratings yet
03-04 Hough Transform and Image Features
102 pages
Computer Vision - All - Units
No ratings yet
Computer Vision - All - Units
101 pages
1 Fitting
No ratings yet
1 Fitting
56 pages
Lect09 HoughTransform
No ratings yet
Lect09 HoughTransform
79 pages
IPMV Module 4
No ratings yet
IPMV Module 4
113 pages
06 Iaor
No ratings yet
06 Iaor
45 pages
Lec 8
No ratings yet
Lec 8
40 pages
Chap09 - Boundary Detection-Hough Transform
No ratings yet
Chap09 - Boundary Detection-Hough Transform
37 pages
CV Cat 2
No ratings yet
CV Cat 2
20 pages
Interpolation
No ratings yet
Interpolation
87 pages
Chap 6 Image Segmentation DD
No ratings yet
Chap 6 Image Segmentation DD
71 pages
Unit3 CV
No ratings yet
Unit3 CV
27 pages
06 Hough Transform (Cont. Chpter 10)
No ratings yet
06 Hough Transform (Cont. Chpter 10)
30 pages
8-2d. Hough Transform-17-08-2024
No ratings yet
8-2d. Hough Transform-17-08-2024
43 pages
Image Segmentation: Deepak Mishra Image Processing 2022
No ratings yet
Image Segmentation: Deepak Mishra Image Processing 2022
23 pages
DIP Mod 4 Segement Part B
No ratings yet
DIP Mod 4 Segement Part B
39 pages
Image Segmentation
No ratings yet
Image Segmentation
36 pages
CVI Week 4 1 Pre Note
No ratings yet
CVI Week 4 1 Pre Note
32 pages
S09.s1 - Material
No ratings yet
S09.s1 - Material
31 pages
Hough Transform - Pattern Recognition
No ratings yet
Hough Transform - Pattern Recognition
36 pages
Image Segmentation I
No ratings yet
Image Segmentation I
62 pages
l10 Shape1
No ratings yet
l10 Shape1
32 pages
5 Hough Transform
No ratings yet
5 Hough Transform
27 pages
Computer Vision - Hough Transform
No ratings yet
Computer Vision - Hough Transform
20 pages
Unit 3 CV and Di
No ratings yet
Unit 3 CV and Di
32 pages
Gauss Nodes Revolution: Numerical Integration Theory Radically Simplified And Generalised
From Everand
Gauss Nodes Revolution: Numerical Integration Theory Radically Simplified And Generalised
Rob Porter
No ratings yet
24 Documento - Planilha Nps Bale Da Cidade Marco 2018 2 1548267841
No ratings yet
24 Documento - Planilha Nps Bale Da Cidade Marco 2018 2 1548267841
2,967 pages
Computer Vision 2nd Assignment
No ratings yet
Computer Vision 2nd Assignment
20 pages
Ip Unit 4 One Shot
No ratings yet
Ip Unit 4 One Shot
20 pages
Lecture 05 Expo
No ratings yet
Lecture 05 Expo
14 pages
Line Detection: By: Tom Madison
100% (1)
Line Detection: By: Tom Madison
18 pages
Briefing Doc: Hough Transform and Homography
No ratings yet
Briefing Doc: Hough Transform and Homography
10 pages
Hough
No ratings yet
Hough
9 pages
Wind Energy Supplier Pitch Deck by Slidesgo
No ratings yet
Wind Energy Supplier Pitch Deck by Slidesgo
7 pages
ME5286 Lecture9 PDF
No ratings yet
ME5286 Lecture9 PDF
73 pages
Image Segmentation: Subject: FIP (181102) Prof. Asodariya Bhavesh ECD, SSASIT, Surat
No ratings yet
Image Segmentation: Subject: FIP (181102) Prof. Asodariya Bhavesh ECD, SSASIT, Surat
59 pages
Hough Transform
No ratings yet
Hough Transform
7 pages
Lumbang Integrated National High School
No ratings yet
Lumbang Integrated National High School
3 pages
E0005E - Industrial Image Analysis: The Hough Transform Matthew Thurley
No ratings yet
E0005E - Industrial Image Analysis: The Hough Transform Matthew Thurley
26 pages
Rohini 89299003921
No ratings yet
Rohini 89299003921
3 pages
Assignment4 40168195
No ratings yet
Assignment4 40168195
10 pages
Hough Transform
No ratings yet
Hough Transform
2 pages
Banana Paper Paper Making Process Technology Compa
0% (1)
Banana Paper Paper Making Process Technology Compa
13 pages
22am602 LM17
No ratings yet
22am602 LM17
6 pages
Hough Transform: Vineeth N Balasubramanian
No ratings yet
Hough Transform: Vineeth N Balasubramanian
34 pages
Main
No ratings yet
Main
57 pages
Hough Transform
No ratings yet
Hough Transform
24 pages
Digital Image Processing Workshop
No ratings yet
Digital Image Processing Workshop
50 pages
E0005e Lecture05 Hough Transform - Dvi PDF
No ratings yet
E0005e Lecture05 Hough Transform - Dvi PDF
26 pages
Hough2 PDF
No ratings yet
Hough2 PDF
20 pages
Eye Detection Project Report. 7/19/2005 Project Description: 1. Edge Detection - Canny Edge Detector
No ratings yet
Eye Detection Project Report. 7/19/2005 Project Description: 1. Edge Detection - Canny Edge Detector
8 pages
Laboratory 4. Image Features and Transforms: 4.1 Hough Transform For Lines Detection
No ratings yet
Laboratory 4. Image Features and Transforms: 4.1 Hough Transform For Lines Detection
13 pages
Complete
No ratings yet
Complete
14 pages
Lec09 Hough
No ratings yet
Lec09 Hough
48 pages
The Concept of The General Will in The Writings of Rousseau, Sièyes, and Robespierre by Dr. Stephen Carruthers
No ratings yet
The Concept of The General Will in The Writings of Rousseau, Sièyes, and Robespierre by Dr. Stephen Carruthers
10 pages
Hough Transform: Presentation by Sumit Tandon
No ratings yet
Hough Transform: Presentation by Sumit Tandon
27 pages
Dip-Lab#10:Hough Transform: Objective
No ratings yet
Dip-Lab#10:Hough Transform: Objective
9 pages
Iconex O2 Catalog
No ratings yet
Iconex O2 Catalog
103 pages
Edge 1
No ratings yet
Edge 1
55 pages
11 Rules of English Grammar
No ratings yet
11 Rules of English Grammar
4 pages
BPY Module
No ratings yet
BPY Module
10 pages
Harvard Decision Making
No ratings yet
Harvard Decision Making
64 pages
Edge Detection and Hough Transform
No ratings yet
Edge Detection and Hough Transform
5 pages
A Novel Method of Object Tracking Using Hough Transform
No ratings yet
A Novel Method of Object Tracking Using Hough Transform
6 pages
Hough Transform
No ratings yet
Hough Transform
20 pages
Topic:-Optical Fiber Connectors: Prepared by
No ratings yet
Topic:-Optical Fiber Connectors: Prepared by
23 pages
Hough 09gr820
No ratings yet
Hough 09gr820
7 pages
LAC
No ratings yet
LAC
38 pages
Moving Coil Galvanometer Porject Class 12
No ratings yet
Moving Coil Galvanometer Porject Class 12
25 pages
RI 2022 H3 Test 2 (Questions and Solutions)
No ratings yet
RI 2022 H3 Test 2 (Questions and Solutions)
8 pages
An Approach To Physical Performance Analysis For Judo
No ratings yet
An Approach To Physical Performance Analysis For Judo
8 pages
Hough Transform For Straight Lines: Mini-Project in Image Processing, 7th Semester 2007 by Jeppe Jensen. Group 721
No ratings yet
Hough Transform For Straight Lines: Mini-Project in Image Processing, 7th Semester 2007 by Jeppe Jensen. Group 721
4 pages
Hanja - Wiki
No ratings yet
Hanja - Wiki
8 pages
09 Iaor
No ratings yet
09 Iaor
66 pages
Soal Uts B. Inggris Kelas 9 Semester 1
No ratings yet
Soal Uts B. Inggris Kelas 9 Semester 1
5 pages
R S Aggarwal Solution Class 11 Maths Chapter 22 Parabola
No ratings yet
R S Aggarwal Solution Class 11 Maths Chapter 22 Parabola
9 pages
English Test Grade 10 (L2)
No ratings yet
English Test Grade 10 (L2)
2 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
76 pages
2019.08.19 Jowers's Answer, Affirmative Defenses, Counterclaims, and Third-Party Complaint (Filed) PDF
No ratings yet
2019.08.19 Jowers's Answer, Affirmative Defenses, Counterclaims, and Third-Party Complaint (Filed) PDF
71 pages
Words Worth Essay
No ratings yet
Words Worth Essay
1 page
Affirmative Action in Malaysia: Education and Employment Outcomes Since The 1990s
No ratings yet
Affirmative Action in Malaysia: Education and Employment Outcomes Since The 1990s
37 pages
Resume 2011
No ratings yet
Resume 2011
2 pages
BLENDER & MATLAB Subd
No ratings yet
BLENDER & MATLAB Subd
5 pages
Artificial Intelligence The Death of Creativity
No ratings yet
Artificial Intelligence The Death of Creativity
2 pages
HBS Neighborhood Map
No ratings yet
HBS Neighborhood Map
1 page
Nurbs Plotting From
No ratings yet
Nurbs Plotting From
2 pages
Gearmax Ep: Gearmax Ep 680 Es Una Mezcla Con Aceites Y Aditivos de Base
No ratings yet
Gearmax Ep: Gearmax Ep 680 Es Una Mezcla Con Aceites Y Aditivos de Base
2 pages
Republic of The Philippines: Cebu Normal University
No ratings yet
Republic of The Philippines: Cebu Normal University
2 pages
The Adjacent Elements Must Deform Without Causing Openings, Overlaps or Discontinuities
No ratings yet
The Adjacent Elements Must Deform Without Causing Openings, Overlaps or Discontinuities
1 page
MCEN2001 Lab Report 1
No ratings yet
MCEN2001 Lab Report 1
8 pages
Oid Esp All Eat A Paper Thailand Final
No ratings yet
Oid Esp All Eat A Paper Thailand Final
6 pages
Program
No ratings yet
Program
1 page
Jom Faperta Vol 4 No 2 Oktober 2017 1
No ratings yet
Jom Faperta Vol 4 No 2 Oktober 2017 1
12 pages
Hough Transform: Unveiling the Magic of Hough Transform in Computer Vision
From Everand
Hough Transform: Unveiling the Magic of Hough Transform in Computer Vision
Fouad Sabry
No ratings yet

07 Iaor

Uploaded by

07 Iaor

Uploaded by

Image Analysis and Object Recognition

Basic Concepts from

Lectures SoSe 2024

– Fourier descriptors is used for shape detection.

• Noise in measured edge points, orientations:

Image space Hough (parameter) space

This line has one point in parameter space.

Connection between image (x, y) and Hough (m, b) spaces

Connection between image (x, y) and Hough (m, b) spaces

What are the line parameters for the line

• Voting is a general technique where we let each feature

ρ (rho) is the shortest distance

θ (theta) is the angle the

Each point will add a sinusoid in the (θ, ρ) parameter space

image space Hough (parameter) space

• Modified Hough transform:

original edge detection found lines

– Take only edge points with significant gradient magnitude

original edges (note noise)

Hr1 (Penny) Hr2 (Quarters)

Note: because of the

- 1D-DFT of the sequence x

• Shape differences are measurable by

Berlin university study.

You might also like