A New Approach For Stereo Matching Algorithm With Dynamic Programming
A New Approach For Stereo Matching Algorithm With Dynamic Programming
Volume: 3 Issue: 2
ISSN: 2321-8169
691 - 696
_______________________________________________________________________________________________
Abstract Stereo matching algorithms are one of heavily researched topic in binocular stereo vision. Massive 3D information can be obtained
by finding correct correspondence of different points between images captured from different views. Development of stereo mat ching algorithm
is done for obtaining disparity maps i.e. depth information. When disparities computed for scan lines then dense reconstruction becomes time
consuming for vision navigation systems. So for pair of stereo images proposed method extracts features points those are at contours in images
and then a dynamic program is used to find the corresponding points from each image and calculates disparities. Also to reduce the noise which
may lead to incorrect results in stereo correspondence, a new stereo matching algorithm based on the dynamic programming is proposed.
Generally dynamic programming finds the global minimum for independent scan lines in polynomial time. While efficient, its performance is far
from desired one because vertical consistency between scan lines is not enforced. This method review the use of dynamic programming for
stereo correspondence by applying it to a contour instead to individual scan lines. Proposed methodology will obtain the global minimum for
contour array in linear time using Longest Common Subsequent (LCS) dynamic programming method with no disparity space image (DSI).
Keywords- Stereo Correspondence, Contour, Dynamic Programming, Disparity space Image, Longest common subsequent.
________________________________________________________*****_______________________________________________________
I.
INTRODUCTION
_______________________________________________________________________________________
ISSN: 2321-8169
691 - 696
_______________________________________________________________________________________________
test bed for stereo correspondence algorithms available, from
vision.middlebury.edu/stereo. This paper reviews stereo
correspondence algorithms and also proposes a new approach
for dynamic programming on contour.
II.
EXSITING SYSTEM
PROPOSED METHOD
Figure 1. This figure describes how disparity space image is generated [8].
_______________________________________________________________________________________
ISSN: 2321-8169
691 - 696
_______________________________________________________________________________________________
contour i.e. DSI on contour also takes polynomial time for
execution; this new method uses dynamic programming
directly on contour array which will execute in liner time. Sum
of points needed to be computed will be considerably reduced
when algorithm will compute stereo correspondence with high
speed and more accurate than dense reconstruction.
A. Canny edge Detection
The Canny edge detector is an edge detection operator that
uses a multi-stage algorithm to detect a wide range of edges in
images. Edges or contours in images are areas with strong
intensity contrasts. As the contour is the place where the great
changes have taken place on the image plane, we can remove
the area of near gray level effectively. Edge detecting an
image significantly reduces the amount of data and filters out
useless information, while preserving the important structural
properties in an image.
There are many traditional algorithms such as Sobel operator,
Prewitt operator, Laplace operator and so on. This entire
operator can perform at high speed. But detected edges are
always incomplete and uncontinuous and unsuited to be used
directly. By contrast, The canny operator is an optimization
algorithm for high signal to noise and precision of detection,
extensive used [12].
Apply canny edge detection algorithm for contour pixel
extraction of left and right image. The algorithm runs in 5
steps, apply each step on separate image of stereo pair ie. Left
and Right image.
Smoothing:
Blurring of the image to remove noise.
Finding gradients:
The edges should be marked where the gradients of the
image has large magnitudes.
Non-maximum suppression:
Only local maxima should be marked as edges.
Double thresholding:
Potential edges are determined by thresholding.
Edge tracking by hysteresis:
Final edges are determined by suppressing all edges that are
not connected to a very certain (strong) edge.
In proposed algorithm canny edge detection has been taken
directly. Which when applied on left and right images,
generated output gives detail edges with intensity 1 for all
contours and every non contour pixel appear with intensity 0.
This resultant canny left and right images pair used further for
getting stereo correspondence on contour.
_______________________________________________________________________________________
ISSN: 2321-8169
691 - 696
_______________________________________________________________________________________________
E(f) = Edata(f) + Esmooth(f) .........................(1)
Edata(f) =
Esmooth(f) =
Dp(fp),
p
qN(p)Vpq(fp, fq)
C(I,j,d(i,j)).......(3)
let i, j be the coordinates of the pixel in the left image and d(i,j)
be the value of disparity map d at pixel. Let m(d(i,j)) be the
matching penalty for assigning disparity d(i,j) to pixel, in our
frame work m(d(i,j)) can be sum of absolute differences (SAD)
to compare image contours,
m(d(i,j)) = SAD(i,j,d) = | Il(i+u, j+v) - Ir(i+u, j+v-d) |
.(5)
This means that for every contour pixel in the left image
search is performed along the same contour in right image that
best matches it, where d h. Note, if an image has j columns
and number of possible disparity values is h, then
straightforward dynamic programming take O(jh 2) time, but
running time can be reduced to O(jh). Let, s(dn) be the
smoothness penalty for assigning disparity dn and is disparity
penalty.
Thus the energy function required to be optimized is given by
equation (2) as follows,
E(d(i,j)) = m(d(i,j)) + s(dn)(6)
Then the minimum value of the energy in equation (6) can be
written as,
E(d(i,j)) = min(m(d(i,j)) + s(dn))..(7)
The optimal disparity assignment for d(i,j) in terms of LCS
dynamic programming can be written as follows, Let x = {x 1,
x2, x3xm} and y = {y1, y2, y3yn} be left and right
contour sequences and let z = {z1, z2, z3zk} be any LCS of
x and y.
_______________________________________________________________________________________
ISSN: 2321-8169
691 - 696
_______________________________________________________________________________________________
and place operations where size, shape and distance of object
is needed. Further stereo matching algorithm is applied on
actual contour extracted image.
Result is computed for different image pair from Middlebury
database to find depth information in which object near is
appear more brighter whereas object which is far is appear
darker.
The Figure below is the computed results for Midd 2 stereo
images pair from Middlebury database for DSI and LCS
mechanism on contour pixels. Detail evaluation is done in this
section.
(b)
(c)
Figure 5 Midd 2 stereo pair and disparity map.
Figure 5(a) is left view image and Figure 5(b) is right view
image. Both images are given as input.. In left view object
which is near is more shifted towards right and in right view
that object shifted to left. On which canny edge detection and
actual contour extraction algorithm is applied to generate
disparity map. Figure 5(c) is ground truth disparity map
obtained by dense disparity map given in Middleburry
database for evaluation of accuracy. Cap is near so it more
brighter comparing to background in disparity map image
Figure 5(c).
Stereo
pair.
RMS value
Bad Pixels
Percentage %
Accuracy
Percentage
DSI
LCS
DSI
LCS
DSI
LCS
Wood 1
7.65
3.22
6.56
6.85
93.44
93.15
Midd 1
1.04
1.06
9.87
1.008
90.13
98.99
Midd 2
1.06
1.07
9.92
1.002
90.08
98.99
Baby 1
5.46
5.37
4.81
4.51
95.19
95.49
Baby 2
4.72
4.61
3.89
3.83
96.11
96.17
Baby 3
8.47
8.57
7.46
7.80
92.54
92.2
Bowling
1
9.41
9.32
8.05
8.04
91.95
91.96
Flower
1.03
9.75
8.84
8.14
91.16
91.86
Wood 2
8.17
7.87
6.91
6.73
93.09
93.27
_______________________________________________________________________________________
ISSN: 2321-8169
691 - 696
_______________________________________________________________________________________________
V.
CONCLUSION
ACKNOWLEDGMENT
I thank to our college Shah & Anchor kutchhi Engineering
college, for providing its resources and other facilities;
helping us to come up with the topic in excellent manner. I
sincerely acknowledge my internal guide Mrs. Bhakti
Sonawane for her invaluable guidance in giving form and
substances to this paper. Her guidance and invaluable
suggestions has been major motivating factor. It is her
enthusiasm and sincere efforts that have given a sense of
direction and purpose to this.
I also thank the principal sir Prof. Vinit Kotak and the
Head of Department, Prof. Uday Bhave sir, for their able
guidance and inputs in implementing the presented topic. And
lastly our non-teaching staff for making our premises one
where working can be a pleasure.
REFERENCES
[1] http://vision.middlebury.edu/stereo/2010
[2] Scharstein D, and Szeliski R, A taxonomy and evaluation
of dense two- frame stereo correspondence algorithms,
International Journal of Computer Vision, 2002, 47(1-3):742.
[3] Fua, P., A Parallel Stereo Algorithm that Produces Dense
Depth Maps and Preserves Image Features, Machine
Vision and Applications, 1993, 6(1), 35-49.
[4] Hannah, M.J, A System for Digital Stereo Image
Matching, Phot. Eng. and Rem. Sens, 2001, 55(12), 17651770.
[5] Ohta, Y., and Kanade, T., Stereo by Intra- and InterScanline Search Using Dynamic Programming, IEEE
transactions on Pattern Analysis and Machine Intelligence,
1985, Vol. 7 No. 2, pp. 139-54.
[6] Y. Boykov, O. Veksler, and R. Zabih. Fast approximate energy minimization via graph cuts. In International Conference on Computer Vision, pages 377384, 1999.
[7] L.Junqin, D. Yan., X. Hu., Q. Xing., B. Yang., Dynamic
Programming algorithm for Stereo Correspondence of
Cntour, IEEE 5th International conference on Image and
Signal Processing, 2012.
696
IJRITCC | February 2015, Available @ http://www.ijritcc.org
_______________________________________________________________________________________