0% found this document useful (0 votes)

39 views

Computer Vision How To Approach Vision Problems

This document discusses how to approach computer vision problems. It provides a list of techniques that could be considered when presented with a vision problem, including image preprocessing, binary vision, color vision, region-based vision, and edge-based vision. It also discusses recognition problems. The document then provides an example exam question and sample answer that demonstrates how to apply edge-based computer vision techniques to determine if a bottle label is straight and torn using edge detection, non-maxima suppression, thresholding, and boundary chain coding.

Uploaded by

Lydia Burke

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views

Computer Vision How To Approach Vision Problems

Uploaded by

Lydia Burke

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Computer Vision

How to Approach Vision Problems

15 How to approach vision problems

Computer vision has many practical applications and to demonstrate an understanding of the
area it is necessary to be able to come up with potential solutions to real problems. When
presented with an application problem the student should ask themselves what general type of
approach would be most appropriate? The following list is not exhaustive but instead is
intended to give the student a notion of the types of issues they might want to consider:

Image preprocessing. In Binary vision do we need to use an opening and/or a closing, etc?
In gray-scale or colour imaging do we need to use a Smoothing technique? If there is any
non-linearity do we need to geometrically correct the image?

Binary vision. Is the domain such that a sufficient distinction could be created between the
foreground and background to allow segmentation by thresholding?

Colour vision. Do we need to do some sort of analysis of the colours present in the scene?
Could we solve the problem using some sort of summary representation such as a histogram
or K-means.

Region based vision. If the data cannot be segmented by binary vision but needs to be
broken into distinct objects or regions would a region based technique such as split and
merge allow reliable segmentation?

Edge based vision. If the boundary of the objects would be sufficient to allow recognition
(or whatever is needed in the question) then perhaps the edges should be detected and
analyzed in some fashion (e.g. Hough, features, etc.). Also what sort of edge detector
should we use (Roberts, first derivative, second derivative, ) and does this need to be
followed by Contour Following, and the extraction of straight line segments, etc?

Recognition. If approaching a recognition problem we typically need to extract some

features (using one of the preceding technique) and then perform some explicit recognition
step (e.g. Hough Transform, Template Matching, Chamfer Matching, Statistical Pattern
Recognition, a cascade of strong classifiers, etc.).

In the vision course associated with these notes application problems come up in two
instances. In tutorials open vision problems are presented to the class to be tackled in groups,
the idea being that each group would develop a series of steps to solve the problem. It is
essential when doing so that the solution starts at the correct starting point (e.g. the specified
images) and finishes at the specified target (e.g. recognized characters).
139

Computer Vision

How to Approach Vision Problems

The second instance that these problems are faced is during the examination but in this case
typically questions are asked in two parts. The first part asks about some specific area of
vision, and the second part usually builds upon this to solve an application problem. It is
essential that student provide details of the techniques that they use, and to give some idea of
what is required a sample question and sample answer are provided here. Note that the level
of detail required varies depending on the complexity of the answer. If there are 5 steps to
solve a problem then less detail is needed to describe each these steps than if there were only
2 steps to solve the problem.

Sample Exam Question

a. Compare and contrast a first derivative edge detector with a second
derivative edge detector.
[8 marks]
b. Using edge based computer vision, for the bottles below, describe
how to automatically determine both if the label is straight and if it is
torn (e.g. a corner missing). Sample images are shown below.
[17 marks]

Sample Answer
a. First derivative edge detectors measure the rate of change of the
image intensity colour (for grey-scale images), whereas the second
derivative edge detectors look at the rate of change of the rate of
change of the image intensity. Considering a single row of an image
with two significant intensity changes:

140

Computer Vision

How to Approach Vision Problems

First derivatives edge detectors compute the gradient of the edge as

a combination (typically Root-Mean-Square) of two orthogonal partial
derivatives:

In a similar manner they combine the two orthogonal partial

derivatives to also calculate the orientation of the edge:

There are a large number of first derivative edge detectors, one of

the best known of which is Sobel. The partial derivative in a digital
image are computed by convolution with two masks h1(i,j) and
h3(i,j):

Note that in these filters there is effectively smoothing on each side

of the central point.
To locate a first derivative edge we typically threshold the gradient
image (although to avoid thick edges we need to apply non-maxima
suppression first).
Second derivative edge detectors look at the rate of change of the
rate of change and are located by searching for a zero-crossing in
the resultant derivative image. Second derivative edge detectors are
141

Computer Vision

How to Approach Vision Problems

computed in the discrete domain using a single convolution filter the

most common second derivative filter is the Laplacian, two discrete
approximations of which are:

In these approximations it is notable that the centre pixel has a

significant weighting and this causes problems in the presence of
noise. Hence Laplacian filtering is normally preceded by some type
of smoothing (typically Gaussian). The combination of these two
techniques is the Laplacian of Gaussian edge detector which is 1-D
looks like

which for obvious reasons is referred to as the Mexican hat.

Similarities:
-

Both first and second derivative edge detectors incorporate some

level of smoothing
Both can be used to locate an edge and to determine the gradient
(in the second derivative it is the slope of the zero-crossing)

Differences
-

The orientation of an edge point can only be determined in the

first derivative.
The second derivative is better for located edges.
A single convolution is required to compute the Laplacian of
Gaussian while two are required for the Sobel operator.
The Laplacian of Gaussian take a much larger area into account
(depending on the value of the sigma of the Gaussian) and this
sigma can be varied in order to analyze the image at different
scales.

142

Computer Vision

How to Approach Vision Problems

b. To address this problem we assume that we have an image of just one bottle.
Then we apply
- Sobel edge detection (as in part a).
- Non maxima suppression:
If the gradient image is thresholded (to identify significant edges) the
resulting edges are generally quite wide. Hence at this stage nonmaxima suppression is used:
Quantize edge orientations
for all points (i,j)
Look at the 2 points orthogonal to edge
if gradient(i,j) < gradient(either of these 2 points)
output(i,j) = 0
else output(i,j) = gradient(i,j)
-

Thresholding to identify edge points. The output is thresholded, so that

maximums less than some threshold are set to 0 and all above the
threshold are set to 1 (or typically 255).
for every point
if the edge gradient in any channel >= a threshold value
set the output to 1 for that point
else set the output to 0 for that point

This gives us a binary scene (each point is either 1 or 0)

To identify if the label is straight we

consider three different scan lines on the
bottle and find the first and second
significant edge points from the left and
right. The first points represent the bottle
whereas the second points represent the
label and if we calculate the distances
shown on the bottle below then for a
straight label:
A B C and D E F

To determine if the label is torn we must

trace around the label and ensure that the
shape is correct and consistent. Based
on the column values found for the left
and right of the label we can determine
the middle column of the label just by
averaging these. If we can determine a
chain of edge points we can then find the
143

Computer Vision

How to Approach Vision Problems

two on the middle line and compare edge points on each side relative to
those points (sort of like folding in the middle to see if the two sides
match).
-

We represent a chain of edge points like these using a boundary chain

code (BCC). A boundary chain code consists of a start point and
a list of orientations to other connected edge points (i.e. a chain
of points). The start point is specified by the (row,column) pair,
and each orientation is typically specified simply as a value from
0 to 7 (i.e. the 8 possible directions to a neighbouring pixel).
e.g.

To extract the chain of edge points we can start from any one of
the label edge points already located and then use a technique
similar to heuristic search for images borders to build up the
rest of the chain of edge points. Starting at the chosen label
point search forwards repeatedly for other edge points (there
should be just one) until reaching the start point again. This
would not work for all contours but would work for the label in
this case.

144

unit-4-dip
No ratings yet
unit-4-dip
115 pages
Edge Detection-Application of (First and Second) Order Derivative in Image Processing
No ratings yet
Edge Detection-Application of (First and Second) Order Derivative in Image Processing
11 pages
7image Segmentation
No ratings yet
7image Segmentation
57 pages
Simplified Novel Method For Edge Detection in Digital Images
No ratings yet
Simplified Novel Method For Edge Detection in Digital Images
6 pages
Fundamentals of Image Processing: Lecture #7 Edge Detection
No ratings yet
Fundamentals of Image Processing: Lecture #7 Edge Detection
65 pages
Edge Detection FPCV-2-1
No ratings yet
Edge Detection FPCV-2-1
22 pages
Algorithms For Edge Detection: Srikanth Rangarajan 105210122
No ratings yet
Algorithms For Edge Detection: Srikanth Rangarajan 105210122
17 pages
IVP Unit-V & VI
No ratings yet
IVP Unit-V & VI
51 pages
IT5409 Ch4.1 Edges - en
No ratings yet
IT5409 Ch4.1 Edges - en
45 pages
Image Segmentation: Unit-Iv
No ratings yet
Image Segmentation: Unit-Iv
108 pages
Dip Unit 4
No ratings yet
Dip Unit 4
58 pages
Canny Edge Detection Tutorial PDF
No ratings yet
Canny Edge Detection Tutorial PDF
17 pages
Unit-5 Edge Detetction
No ratings yet
Unit-5 Edge Detetction
10 pages
21ai601 CV LM5 23-24
No ratings yet
21ai601 CV LM5 23-24
9 pages
Chapter 10
No ratings yet
Chapter 10
63 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
74 pages
CV_UNIT_4
No ratings yet
CV_UNIT_4
9 pages
Dip unit 5 ppt
No ratings yet
Dip unit 5 ppt
57 pages
Lecture 9 Image Enhancement - 1st - 2nd Order
No ratings yet
Lecture 9 Image Enhancement - 1st - 2nd Order
55 pages
Computer Vision - Edge Detection
No ratings yet
Computer Vision - Edge Detection
30 pages
Chapter7 IPPR
No ratings yet
Chapter7 IPPR
16 pages
CE632 EdgeDetection
No ratings yet
CE632 EdgeDetection
33 pages
A Study of Region-Based and Contourbased Image Seg PDF
No ratings yet
A Study of Region-Based and Contourbased Image Seg PDF
8 pages
Lecture 7 Segmentation
No ratings yet
Lecture 7 Segmentation
43 pages
Edge Detection Project
100% (1)
Edge Detection Project
22 pages
AS R - B A C - B I S: Tudy OF Egion Ased ND Ontour Ased Mage Egmentation
No ratings yet
AS R - B A C - B I S: Tudy OF Egion Ased ND Ontour Ased Mage Egmentation
8 pages
Edge and Boundary Detection
No ratings yet
Edge and Boundary Detection
52 pages
A Survey On Edge Detection Methods PDF
No ratings yet
A Survey On Edge Detection Methods PDF
36 pages
Week 12 - L2
No ratings yet
Week 12 - L2
30 pages
Algorithms For Edge Detection
No ratings yet
Algorithms For Edge Detection
17 pages
Edge Detectors: Deptofcs& E
No ratings yet
Edge Detectors: Deptofcs& E
26 pages
Lecture 3 of Computer Vision
No ratings yet
Lecture 3 of Computer Vision
45 pages
Dip 7
No ratings yet
Dip 7
27 pages
lec2
No ratings yet
lec2
4 pages
Unit-3
No ratings yet
Unit-3
9 pages
Computer Vision ch4
No ratings yet
Computer Vision ch4
100 pages
Digital Image Processing
No ratings yet
Digital Image Processing
12 pages
Image Feature Extraction and Segmentation
No ratings yet
Image Feature Extraction and Segmentation
36 pages
3. Feature Extraction
No ratings yet
3. Feature Extraction
32 pages
Lab10 Image Segmentation1
No ratings yet
Lab10 Image Segmentation1
14 pages
CV Unit 2
No ratings yet
CV Unit 2
39 pages
RAT292 M3 Part 2 Sensors and Actuators
No ratings yet
RAT292 M3 Part 2 Sensors and Actuators
55 pages
Lecture2 Edges
No ratings yet
Lecture2 Edges
46 pages
Chapter 10-Updated
No ratings yet
Chapter 10-Updated
80 pages
JP Iacv Viva
No ratings yet
JP Iacv Viva
34 pages
Edge Detection On Fpga
No ratings yet
Edge Detection On Fpga
19 pages
Computer Vision: Edge Detection
No ratings yet
Computer Vision: Edge Detection
23 pages
Unit 2 CV
No ratings yet
Unit 2 CV
17 pages
03 Filters Contours Segmentation - SP
No ratings yet
03 Filters Contours Segmentation - SP
96 pages
T2310 - TDS3651 - L04 - Edges v2
No ratings yet
T2310 - TDS3651 - L04 - Edges v2
75 pages
3lec02 Edge for Web
No ratings yet
3lec02 Edge for Web
35 pages
Assignment No.: 5: Aim: Theory
No ratings yet
Assignment No.: 5: Aim: Theory
3 pages
Segment 5
No ratings yet
Segment 5
100 pages
chapter-2
No ratings yet
chapter-2
37 pages
(IP'22) Lecture 5 - Segmentation I - Edge-Based - Thresholding
No ratings yet
(IP'22) Lecture 5 - Segmentation I - Edge-Based - Thresholding
97 pages
Chapter 10
No ratings yet
Chapter 10
93 pages
Dip - 06 Edge
No ratings yet
Dip - 06 Edge
41 pages
Edge Detection: Exploring Boundaries in Computer Vision
From Everand
Edge Detection: Exploring Boundaries in Computer Vision
Fouad Sabry
No ratings yet
Canny Edge Detector: Unveiling the Art of Visual Perception
From Everand
Canny Edge Detector: Unveiling the Art of Visual Perception
Fouad Sabry
No ratings yet
Computer Vision Graph Cuts: Exploring Graph Cuts in Computer Vision
From Everand
Computer Vision Graph Cuts: Exploring Graph Cuts in Computer Vision
Fouad Sabry
No ratings yet
Final Polarbearexemplar
No ratings yet
Final Polarbearexemplar
6 pages
Nonlinear Integral Equations of The Hammerstein Type (0
No ratings yet
Nonlinear Integral Equations of The Hammerstein Type (0
19 pages
Ranajit Guha Interview 2.2.11
No ratings yet
Ranajit Guha Interview 2.2.11
3 pages
Icd
No ratings yet
Icd
111 pages
Tajuk Tugasan 2
No ratings yet
Tajuk Tugasan 2
5 pages
Charles Taylor - Varieties of Religion Today - William James Revisited (2002)
100% (3)
Charles Taylor - Varieties of Religion Today - William James Revisited (2002)
139 pages
Mcr3u Student Syllabus
No ratings yet
Mcr3u Student Syllabus
4 pages
Market Driven Strategy
75% (4)
Market Driven Strategy
20 pages
Learning Styles: Visual, Auditory, Read/Write, Kinesthetic: Know Your Tutee's Learning Style. Why?
100% (1)
Learning Styles: Visual, Auditory, Read/Write, Kinesthetic: Know Your Tutee's Learning Style. Why?
2 pages
For Other People Named Max Weber, See
No ratings yet
For Other People Named Max Weber, See
8 pages
Jennifer McArthur 10cb818b7f
No ratings yet
Jennifer McArthur 10cb818b7f
20 pages
R Crus
No ratings yet
R Crus
69 pages
Tribal Culture or Commodity
No ratings yet
Tribal Culture or Commodity
7 pages
Ferroresonance Study in Voltage Transformers Connecting Metal Oxide Varistor
No ratings yet
Ferroresonance Study in Voltage Transformers Connecting Metal Oxide Varistor
4 pages
GE 20 - 2nd Act
100% (2)
GE 20 - 2nd Act
3 pages
Philosophy of Music Education
No ratings yet
Philosophy of Music Education
6 pages
Daily Lesson Plan Date: Lesson 1: Soft Skills
No ratings yet
Daily Lesson Plan Date: Lesson 1: Soft Skills
7 pages
Critique Paper 3 - Marxism
No ratings yet
Critique Paper 3 - Marxism
2 pages
Douglas M. Kellner - Critical Theory, Marxism and Modernity-Polity Press (1989)
No ratings yet
Douglas M. Kellner - Critical Theory, Marxism and Modernity-Polity Press (1989)
284 pages
Critical Incident Management in Sport
No ratings yet
Critical Incident Management in Sport
18 pages
PRMS Application Guidelines
No ratings yet
PRMS Application Guidelines
26 pages
FCE - Phrasal Verbs
No ratings yet
FCE - Phrasal Verbs
5 pages
Author-Guidelines-IJMS-sri Lanka
No ratings yet
Author-Guidelines-IJMS-sri Lanka
3 pages
Thaddius Barker - The Book of Whichcraft PDF
100% (4)
Thaddius Barker - The Book of Whichcraft PDF
45 pages
Ethics in Advertisement
No ratings yet
Ethics in Advertisement
50 pages
Recognition of Open and Distance Learning (ODL) Institutions - Handbook 2009
No ratings yet
Recognition of Open and Distance Learning (ODL) Institutions - Handbook 2009
60 pages
August 02 Team Dewey Gened Socsci Discussion
No ratings yet
August 02 Team Dewey Gened Socsci Discussion
65 pages
Maths Etp
No ratings yet
Maths Etp
4 pages
National Institute of Design Paldi, Ahmedabad 380 007 Web: WWW - Nid.edu
No ratings yet
National Institute of Design Paldi, Ahmedabad 380 007 Web: WWW - Nid.edu
29 pages
Punctuation - The Question Mark
No ratings yet
Punctuation - The Question Mark
4 pages

Computer Vision How To Approach Vision Problems

Uploaded by

Computer Vision How To Approach Vision Problems

Uploaded by

Computer Vision

How to Approach Vision Problems

15 How to approach vision problems

Recognition. If approaching a recognition problem we typically need to extract some

How to Approach Vision Problems

Sample Exam Question

How to Approach Vision Problems

First derivatives edge detectors compute the gradient of the edge as

In a similar manner they combine the two orthogonal partial

There are a large number of first derivative edge detectors, one of

Note that in these filters there is effectively smoothing on each side

How to Approach Vision Problems

computed in the discrete domain using a single convolution filter the

In these approximations it is notable that the centre pixel has a

which for obvious reasons is referred to as the Mexican hat.

Both first and second derivative edge detectors incorporate some

The orientation of an edge point can only be determined in the

How to Approach Vision Problems

Thresholding to identify edge points. The output is thresholded, so that

This gives us a binary scene (each point is either 1 or 0)

To identify if the label is straight we

To determine if the label is torn we must

How to Approach Vision Problems

We represent a chain of edge points like these using a boundary chain

You might also like