Towards Real-Time Terrain Modeling For Robot Navigation Using A Monocular Camera
Towards Real-Time Terrain Modeling For Robot Navigation Using A Monocular Camera
Introduction
Methodology
Experiments and Results
Conclusion and Future Work
Muhammad Tufail
[email protected]
M Eng. in Mechatronics
School of Engineering and Technology
Final Presentation
July 2007
Outline of Topics
1 Introduction
2 Methodology
3 Experiments and Results
4 Conclusion and Future Work
Motivation
Problem Statement
The need is to develop a wide-baseline monocular stereo vision
technique for landmine detecting mobile robots that combines local
motion estimation and robust image matching in order to build a
3D model of the terrain.
Objectives
Objectives
Objectives
Objectives
Camera Model
Figure: Camera model. Figure 5.1 from [Hartley and Zisserman (2000)].
Camera Model
K
X
z
}| {
x αx 0 x0 1 0 0 0 T
y = 0 R t Y
αy y0 0 1 0 0
0T
3 1 Z
1 0 0 1 0 0 0 1
| {z } | {z } 1
x P | {z }
X
(1)
1
Image courtesy of Svoboda, T. http://cmp.felk.cvut.cz
Muhammad Tufail Asian Institute of Technology 9 / 41
Outline
Introduction
Methodology System Model
Experiments and Results
Conclusion and Future Work
2
Image courtesy of Svoboda, T. http://cmp.felk.cvut.cz
Muhammad Tufail Asian Institute of Technology 10 / 41
Outline
Introduction
Methodology System Model
Experiments and Results
Conclusion and Future Work
3
Image courtesy of Svoboda, T. http://cmp.felk.cvut.cz
Muhammad Tufail Asian Institute of Technology 11 / 41
Outline
Introduction
Methodology System Model
Experiments and Results
Conclusion and Future Work
We already know: e0 = P0 C
Projection to the camera 2: x0 9 = P0 (λP+ x9 + C)
Line is a cross product of the points lying on it: e0 × x0 9 = l0 9
Putting together: e0 × (P0 λP+ x9 + P0 C) = l0 9
As e0 × P0 C = 0, then: e0 × λP0 P+ x9 = l0 9
Since the point x0 9 must lie on the line l0 9 , therefore l0 T 0
9 x 9 = 0.
0 −e3 e2
Let [e]x = e3 0 −e1 The cross project is then written
−e2 e1 0
as a matrix multiplication l0 T 0 0 +
9 = ([e ]x λP P )x9
Inserting into l0 T 0
9 x 9 = 0 gives us:
0 0 + T 0
xT
9 ([e ]x λP P ) x 9 = 0 (2)
| {z }
F
Epipolar geometry4
x0 i Fxi = 0 (3)
xx 0 y x 0 x 0 xy 0 y y 0 y 0 x y 1 f = 0
(4)
Rectification
Rectification
H0 = GRT (6)
Dense correspondence
xp3T − p1T
y p3T − p2T
AX = x 0 p0 3T − p0 1T = 0
(8)
0 0
y 0 p 3T − p 2T
where piT are the rows of P. Then the scene point X can be
obtained by least squares solution of the homogenous linear system
using the SVD.
Muhammad Tufail Asian Institute of Technology 23 / 41
Outline
Introduction Tools
Methodology Available data
Experiments and Results Results
Conclusion and Future Work
Tools
5
http://www.intel.com/technology/computing/opencv/
6
http://www.irfanview.com/
Muhammad Tufail Asian Institute of Technology 24 / 41
Outline
Introduction Tools
Methodology Available data
Experiments and Results Results
Conclusion and Future Work
Tools
5
http://www.intel.com/technology/computing/opencv/
6
http://www.irfanview.com/
Muhammad Tufail Asian Institute of Technology 24 / 41
Outline
Introduction Tools
Methodology Available data
Experiments and Results Results
Conclusion and Future Work
Tools
5
http://www.intel.com/technology/computing/opencv/
6
http://www.irfanview.com/
Muhammad Tufail Asian Institute of Technology 24 / 41
Outline
Introduction Tools
Methodology Available data
Experiments and Results Results
Conclusion and Future Work
Availabe Data
Camera Calibration
A camera calibration data is assumped to be provided by
[Nakarmi (2007)].
Input images
(a) (b)
Figure: “Head and Lamp” image pair. (a) Left image. (b) Right image.
Input images
(a) (b)
Figure: “Bread and Oranges” image pair. (a) Left image. (b) Right
image.
Rectification
Rectification
Figure: Rectified left and right images. Corresponding points lie on the
same horizontal line.
Dense correspondence
(a) (b)
Figure: Disparity map for “Head and Lamp” images. (a) Input left
image. (b) Estimated disparity map for “Head and Lamp” image pair.
Dense correspondence
(a) (b)
Figure: Disparity map for “Head and Lamp” images. (a) Estimated
disparity map. (b) Ground truth disparity map.
Dense correspondence
(a) (b)
Figure: Estimated disparity map for “Bread and Oranges” images. (a)
Input left image. (b) Estimated disparity map.
3D reconstruction
3D reconstruction
3D reconstruction
End
References
References
References
References