0% found this document useful (0 votes)

93 views7 pages

GrayLineExtraction PDF

This paper presents a new approach for Text Line Segmentation that works directly on gray-scale document images. Our algorithm constructs distance transform directly on the gray-scale images. The medial seam determines a text line and the separating seams define the upper and lower boundaries of the text line.

Uploaded by

kitin_511

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

93 views7 pages

GrayLineExtraction PDF

Uploaded by

kitin_511

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Text Line Segmentation for Gray Scale Historical

Document Images
Abedelkadir Asi1 Raid Saabni2,3 Jihad El-Sana1
1

Ben-Gurion University of the Negev, Beer Sheva, Israel

2
Tel-Aviv University, Tel-Aviv, Israel
3
TRDC, Kafr Qarea, Israel

{abedas,saabni,el-sana}@cs.bgu.ac.il
ABSTRACT
In this paper we present a new approach for text line segmentation that works directly on gray-scale document images.
Our algorithm constructs distance transform directly on the
gray-scale images, which is used to compute two types of
seams: medial seams and separating seams. A medial seam
is a chain of pixels that crosses the text area of a text line
and a separating seam is a path that passes between two consecutive rows. The medial seam determines a text line and
the separating seams define the upper and lower boundaries
of the text line. The medial and separating seams propagate
according to energy maps, which are defined based on the
constructed distance transform. We have performed various experimental results on different datasets and received
encouraging results.

Keywords
Seam Carving, Line Extraction, Multilingual, Signed Distance Transform, Dynamic programming, Handwriting

INTRODUCTION

Historical handwritten documents are valuable cultural heritage, as they provide insights into both tangible and intangible cultural aspects from the past. The need to preserve
these documents demands global emerging efforts to analyze
and manipulate them by utilizing techniques from various
science fields. Handwritten historical documents pose real
challenges for automatic processing, such as image binarization, writer identification, page segmentation, and keyword
searching and indexing. A considerable number of algorithms address these tasks; some provide acceptable results
and already integrated into working systems.
Document image segmentation into text lines is a major
prerequisite procedure for various document image analysis
tasks, such as word spotting, key-word searching, and text
alignment [1, 2, 3, 4, 5, 6]. Extracting text lines from handwritten document images poses different challenges than those
Permission to make digital or hard copies of part or all of this work for
personal or classroom use is granted without fee provided that copies are not
made or distributed for profit or commercial advantage and that copies bear
this notice and the full citation on the first page. Copyrights for components
of this work owned by others than ACM must be honored. Abstracting with
credit is permitted. To copy otherwise, to republish, to post on servers or to
redistribute to lists, requires prior specific permission and/or a fee.
HIP 11, September 16 - September 17 2011, Beijing, China
Copyright 2011 ACM 978-1-4503-0916-5/11/09...$10.00.

in machine-printed documents [7]. These challenges can be

related roughly to two main factors: writing style and image
quality. Writing styles differs among writers and give rise to
various text-line segmentation difficulties. Baseline fluctuation due to pin movement, and as a result the baseline may
be straight, sequence of straight segments, or curved. Variability in skew among different text lines is a real challenge
that complicates the extraction process. Crowded writing
styles muddle text line boundaries as interlines spaces become narrow and increase the overlap of components bouding boxes among adjacent text lines. The presence of touching components from two adjacent text lines poses an obstacle for both text line segmentation approaches; those that
search for separating lines and those that search for aligned
physical units. Punctuation and diacritic symbols, which
are located in-between text lines, complicate the deciphering
process of the physical structure of handwritten text lines.
Historical document images are usually of low quality due
to aging, frequent handling, and storage conditions. They
often include various types of noise, such as holes, spots,
broken strokes, which may entangle the extraction process
and produce binarization errors once binarization is applied.
Several text-line extraction algorithms for handwritten documents have been proposed (see Section 2). Saabni and ElSana [8] presented an interesting line extraction algorithm,
which is based on the seam carving technique introduced
by Avidan and Shamir [9]. Their algorithm is limited to
binary images and as a result it inherits the limitations of
image binarization, which introduce noise and various artifacts. Preprocessing and post-processing steps were introduced to determine text line boundaries, cope with touching/overlapping components, and collect additional strokes.
In this paper, we built upon their work and developed a robust line extraction algorithm that works directly on grayscale images and overcomes the limitations mentioned above.
Our algorithm constructs distance transform directly on the
gray-scale images and computes medial seams and separating seams, which determine the text lines in a document
image (see Figure 1). The medial seam determines the middle of the text row and the separating seams, which are
generated with respect to a medial seam, define the upper
and lower boundaries of the text line. The medial and separating seams propagate according to different energy maps,
which are defined based on the constructed distance transform. The inability to determine the boundaries of text lines
forces Saabni and El-Sana [8] to recompute the energy map

(a)

(b)

segment text-line in document images [23, 24, 25, 26, 27,

28, 29]. These approaches are applied to binary images,
as they often require the isolation of basic elements, such
as strokes and connected components. Level-set techniques
are applied to extract text lines and handle multiple orientations, touching, and overlapping characters [5, 30]. A
painting technique is employed to smear the foreground of
the document image and enhance the separability between
the foreground and background to simplify the detection of
text-lines [31]. Dynamic programming is used to determine
text lines through computing minimum cost paths between
consecutive text lines [32].

3.
(c)

(d)
Figure 1: Algorithm flow: (a) the seam-map, (b)
medial seam (blue) and seam fragments (green and
red), (c) seam seeds (green and red), and (d) medial
seam and separating seams.
for the entire page image after the extraction of each text
line. The separating seams determine the text line boundaries, define the region to be updated, and overcome the
limitation of recomputing the energy map.
In the rest of this paper we briefly review related work, describe our approach in detail, and report experimental results. Finally we conclude and discuss directions for future
work.

RELATED WORK

Determining the text lines of a document image is a basic procedure for various document processing applications
and have received tremendous attention over the last several
decades.
Image smearing was among the earliest approaches used to
determine text lines; Wong et al. [10] applied image smearing
to binarized printed document images and Bar-Yosef et al. [11]
used it for historical documents. Projection profiles along
a predetermined direction is used in top-down approaches
to estimate the paths separating consecutive text lines [12,
13, 14, 11, 15, 16]. Adaptive local projection profiles is employed to handle multi-skew in document images [11, 17].
Hough transform is used to compute the direction to apply projection profile; and to generate good text line hypotheses [18, 19]. Fuzzy run length matrices and adaptive
local connectivity maps are applied directly to the gray-scale
document images [20, 4, 21]. Tracking minima points to
follow the white-most and black-most pixels along horizontal paths are used to estimate the boundaries and baselines
of text lines [22]. Seam carving approach is used to find
the seams, which resemble the baseline of text row, using
a signed-distance-transform based energy map [8]. Various
grouping techniques, such as heuristic rules, learning algorithms, nearest neighbor, and search trees are applied to

OUR APPROACH

Humans tend to perceive text line patterns by tracking the

concentration of ink along lines without actually reading the
written text. Spaces between text lines play a major role in
perceiving the layout of text lines despite the existence of
touching/overlapping components, which are usually sparse
and rarely disrupt the human text line recognition. These
observations motivated the development of our approach,
which find the medial seams that determine the text line
and the two seams that separate it from its previous and
next text lines.
We build upon the approach proposed by Saabni and ElSana [8], which applies distance transform to binary images
to generate energy maps; and then computes the minimal energy seams, which crosses the components along text lines.
Connected components labeling in binary document images
is a crucial step in their algorithm. Our approach works
directly on the gray-scale images and it does not require
binarizing the image neither labeling the components. It
extracts a stripe that resembles the text line directly from
the the gray-scale image, which eliminates the need for postprocessing to determine the components of the text line that
do not intersect the computed seam. In addition, separating
seams naturally cope with overlapping components during
seam propagation. Saabni and El-Sana [8] recompute the
energy map for the entire page after the extraction of each
text line, because of the inability to bound the influence of
the extracted text line. In contrast, our approach locally
updates the energy map to cancel the influence of the extracted lines region, and thus saves recomputing the energy
map.
Next we discuss in detail the two main steps of the line extraction procedure: constructing the energy maps and computing seams.

3.1

Energy Map

The search for a chain of pixels that either passes across

a text line (i.e., the medial seam) or lies as far as possible
from text lines (i.e., separating seam) calls for an energy
function that provide a sufficient distance measure. The
distance transform was adopted to generate the energy map,
where local minimum points determine medial seams and
maximum points define the separating seams.
To determine a seam that passes along the medial axis of
a text line and crosses its components, we use an energy
map based on a gray-level distance transform, introduced
by Levi and Montanari [33]. Gray-level distance transform

is defined as modified geodesic distances; i.e., the distance

between pixels p and q is the minimum of the lengths of the
paths [p = p0 , p1 , ..., pn = q]. The length of the path, l(p), is
defined by Equation 1, where f is a distance and d(pi , pi+1 )
corresponds to the slope between two consecutive pixels.

l(p) =

n1
X

f (d(pi , pi+1 ), I(pi ), I(pi+1 ))

(1)

which forces the medial seam to propagate along the text

lines, as shown in Figure 2(b).

(a)

(b)

i=0

Toivanen [34] developed the distance transform on curved

space (DTOCS) and the weighted distance transform on
curved space, where the distance metric is defined as difference between the gray values of the pixels along the minimal
path; i.e., d(pi , pi+1 ) = |I(pi ) I(pi+1 )| + 1.
The Gray-level Distance Transform(GDT) assigns values to
pixels according to their distance from the nearest minimal
points (reference points). In contrast to binary document
images, paths between components pixels and background
pixels in gray-scale images are curved and not straight lines,
as the straight lines between two pixels can be blocked by
obstacles consisting of higher or lower gray-values.
The distance transform of noisy document images may include small fluctuation that influence seam generation. To
overcome this limitation, we apply Gaussian filter to smooth
the image before generating the distance transform.

3.2

3.2.2 Separating Seams

The separating seams define the upper and lower boundaries
of text lines; i.e., determine the text strip, which is necessary
to assign in-between component to the right text lines and
accurately determine the pixels that need to be updated in
the seam-map to avoid recomputing the seam-map after each
line extraction. Separating seams of a text line are generated
with respect to the medial seam of their text line.
Seam seeds are defined with respect to a medial seam as
the global maxima (on the Gray-scale Distance Transform)
along the vertical segment connecting two consecutive medial seams. Separating seams are grown from seam seeds
toward the two sides of a page image (left and right). However, the need to determine the separating seams of a medial
seam before determining its neighboring medial seams complicates computing the seam seeds.

Seam Generation

Seams are computed using dynamic programming which relies on generating an energy map that encodes the minimal
cost of the valid paths. We refer to this energy map as the
seam-map, which is computed similar to [8] with slight modification to generate salient line structure. We replace the
equal weights for the horizontal and diagonal distances by
different weights that reflects the actual distance on the image (see Equation 2, where w0 = 1 and w1 = w1 = 12 ).
We found that this modification generates accurate energy
maps and produce robust seams. The algorithm determines
the minimal cost path by starting with the minimal cost on
the last column (right column) and traversing the seam-map
backward from right to left.

map[i, j] =

3.2.1

Figure 2: Medial seam generation using (a) one pass,

(b) two passes.

2GDT (i, j) + min1l=1 (wl map[i + l, j 1])

(2)

Medial Seam

We noticed that computing the seam-map in one pass as

shown above generates a seam-map which rows are more
salient on the right side than on the left side, as shown in
Figure 3(b). The weak seam-map on the left side allows
the medial seam to jump to adjacent lines as shown in Figure 2(a). To improve the saliency of the rows and retain the
seam on the medial of text lines, we construct the seam map
using two passes from left-to-right and from right-to-left
and then bi-linearly interpolate the resulting two seam maps
into the final seam-map, as shown in Figure 3(d). In this
scheme, the generated seam-map is well-defined along text
lines and faithfully resembles their structure (see Figure 3),

(a)

(b)

(c)

(d)

Figure 3: Seam-map generation: (a)original image,

(b)left-to-right, (c)right-to-left seams, and (d)the interpolated final seam-map.
To generate the seam seeds for a medial seam, sm , from each
pixel, px on the medial seam we search for the maximum
points on its lower and upper sides along the column including px . The absence of the adjacent medial seams makes
it hard to distinguish between local and global maximum
points. To resolve such dilemma, for each suspected maximum point, pmax , we proceed searching until we reach the
first minimum point, pmin . If pmin is connected through a
minima path (valley) to the starting seam, sm , then pmax is
a local maximum, which belongs to the processes seam, and
there is a need to continue searching for another maximum
point, otherwise pmax is considered a global maximum (an
in-between-lines point). To verify that pmin does not belong

to the seam sm , we extend pmin along minimal points with

respect to the GDT map and test whether the extended path
reaches the processed medial seam, sm , or not. If the path
returns to the sm , then pmin belongs to the processed seam
sm , otherwise it is not. Since local minima cannot be completely avoided, a noncontinuous seams that consists of separated short seams, which are denoted seam fragments, are
generated (see Figure 1). Erroneous fragments are filtered
out based one the length of the fragment and its distance
from the medial seam.
Short fragments are not reliable, as they may indicate local
minima with respect to the segment between the two medial seams. Therefore, seam fragments are sorted in ascending order according to their length and the fragments in the
lower fraction are filtered out (in current implementation we
ignore the lower half). For each of the remaining fragment
we associate a certainty value, certainty(f, s), which aims to
resemble the probability of a seam fragment to coincide with
the corresponding separating seam. The certainty value is
the sum of the distances of fragment pixels from the medial
seam, as shown in Equation 3, where fs and fe are the first
and the last column of the fragment f , and ms is its corresponding medial seam. We sort the remaining fragments
according to their certainty value in a descending order and
ignore the lower fraction (in our current implementation we
ignore the lower half).

certainty(f, ms) =

fe
X

|(ms(i) f (i))|

(a)

(b)

Figure 4: Spring model: (a) the medial seam (blue)

and a possible separating seam (red), (b) the resulting separating seam after applying spring force.

the seam (see Figure 4). The spring factor k was determined
experimentally, and we have found out that we need small
values of k, usually 1/dr .

(3)

i=fs

The fragments with the highest certainty values form the

seed candidate set. We extend each seed candidate to the
left and right sides of the page image, by propagating the
seam along the maximal points of the GDT map. We then
prioritize the candidates based on the number of fragments
each extended seam passes through. The candidate with the
maximal priority is taken as the seam seed. Note that we
select two seam seeds one below the medial seam and one
above it.

3.2.3

Seam Propagation

Growing a seam seed into a separating seam should maintain an appropriate distance from the corresponding medial
seam. The separating seam is guided by the GDT map,
which is computed based on the topography (gray levels) of
the image. The fork of ridges leads to the existence of separating seams with small differences in their weights, where
the maximal-weight seam may not be the sought seam (see
Figure 5). To overcome this limitation we incorporate the
distance from the medial seam into the propagation scheme
of the separating seam by integrating a spring model within
the seam prorogation scheme. The applied force of the
spring model is used as a weight in the propagation scheme;
i.e., F = k(|dr d|), where dr and d are the rest distance
and the distance from the medial seam, respectively, and k
is the spring constant. The rest distance is the average distance between the medial seam and the currently computed
separating seam. This scheme pushes the separating seam
away from the medial seam, when it is too close and attracts
the seam toward the medial seam when it moves aways from

Figure 5: A document image and its distance transform, where two fork examples are marked with red
rectangles.

EXPERIMENTAL RESULTS

Several evaluation methods for line extraction algorithms

have been proposed in the literature. Some, evaluate the
results manually, while others use predefined line areas to
count misclassified pixels. Since the proposed approach works
directly on gray-scale images and it is not possible to rely on
connected components for automatic evaluation, the experimental results were evaluated manually. The correctness of
an extracted text line is evaluated based on its three seams:
the medial and the two separating seams. The medial seam
is expected to go through the middle of the same text line
the ink (dark) area. The fraction of a medial seam that goes
between text lines or jump to another text line is defined
as faulty. Similarly, a separating seam is expected to go between the lines and faulty fractions go through (or touch)
text area or jump to another in-between area. We measure
the correctness, correctness(s), of a seam s as the ratio of
the correct fractions; i.e., the ratio of the sum of pixels in
the correct fractions to the line width in pixels. We measure
the correctness of an extracted text line, l, using Equation 4,
where medial(l), upper(l), and lower(l) are the medial, upper, and lower seams of the text line l. Equation 4 return
a value in the range [0.0, , 1.0], 1 for perfectly extracted
lines and 0 for completely wrong extraction.

correctness(medial(l))
2
correctness(upper(l))
+
4
correctness(lower(l))
+
4

DataSet
Wadod
Al-Majid
AUB
Congress L.

correctness(l) =

(4)

Equation 4 nicely measure the correctness of a text line, but

underestimates the intersection of a separating seam with a
descender or ascender, as it usually occupy a small number of
pixels. Therefore, we count the number of such intersection
for each line and measure the percentage of such intersection
separately, as shown in Table 1.

Medial
99
98
96
95

Correctness(%)
Upper Lower
97
97
96
97
95
94
93
94

Line
98
97
95
94.2

Stroke (%)
Crossing
9
2
9
11

Table 1: The performance of our algorithm on various dataset written in different languages.
Our approach enables the separating seams to split touching components along the path passing between the lines
and separate them, not necessarily on the optimal position.
This procedure may split fractions of bracket-shape ascenders or descenders that besiege the propagating seam and
force it to pass through, as shown in Figure 7. Nevertheless,
it is easy to fix this in a post-processing procedure that examines the cases where the separating seam passes through
local minima. Propagating along local minima path usually
reveals whether the crossed shape was a touching component, ascender, or descender.

Figure 7: The last word on the second line(right-toleft), descender besieges the propagating seam and
force it to pass through.

Figure 6: Random samples from the tested document images: Arabic, English and Spanish.
The absence of publicly availability database for evaluating line extraction algorithm on gray-scale images drove the
development of our own dataset, which consist of various
historical manuscripts in different languages. We have evaluated our system using 97 Arabic pages (900 lines) from
Juma Al-Majid Center for Culture and Heritage [35], 70
pages (1050 lines) from Wadod Center for Manuscripts [36],
40 pages (420 lines) from a 19th-century master thesis collection in the American University of Beirut(AUB) [37] and
10 pages (150 lines) from Thomas Jefferson manuscripts located at the Congress Libray. Our dataset includes Arabic, English, and Spanish handwritten document images.
The images have been selected to have multi-skew, touching/ overlapping components and both regular and irregular
spacing between lines.
Table 1 shows the average performance of our algorithm using various datasets of different qualities. Figure 6 presents
samples from the tested datasets. As can be seen, it performs well independent of the used script and manages to
generate very good results for languages that include delayed
strokes, dots, and diacritics.

In languages that include many dots and diacritics, such as

Arabic, hand-writers may not respect the closeness rule and
place dots or diacritics closer to the above or below text line.
Our algorithm may fail to detect such a case, as it requires
recognizing the written text to determine for which text line
those dots or diacritics belong. However, it is noticeable that
even for a human it is not easy to assign the misclassified
diacritics to the adequate text line without reading the text.
Zhixin et al. [4] presented an interesting approach for text
line segmentation on gray scale images. They adaptively
binarize the local connectivity map to focus on the line locations and superimposed the binarized ALCM on a binary
version of the original document image to collect components that touch the line patterns. Their approach still requires adaptive binarization for component extraction and
labeling, whereas our approach is binarazation-free and does
not include any component labeling step. We also used documents from the Congress Library to test our approach, but
since their documents were randomly selected for testing, it
is not easy to provide accurate comparison.

CONCLUSIONS AND FUTURE WORK

We presented a language independent approach for text line

segmentation for gray-scale images. Our approach constructs
an energy map directly on the gray-scale document image;
and then computes a medial seam and separating seams for

each text line. A medial seam passes along the presumed

text lines and the separating seams define the upper and
lower boundaries of the text line.
[9]
Our approach avoids applying image binarization, which introduces noise and various artifacts. It also does not extract
connected components and does not need to deal with text
fragmentation. Instead, it directly computes the distance
transform on the gray-scale images. Determining the boundary of text line enables updating the seam-map locally, and
hence saves recomputing the seam-map and distance transform for the entire image after each text line extraction.
We see the scope on future work on extending this approach
to determine the page layout and the component of a text
line directly on gray-sale document images.

ACKNOWLEDGMENT

This research was supported in part by the Israel Science

Foundation grant no. 1266/09, DFG-Trilateral Grant no.
8716, the Lynn and William Frankel Center for Computer
Sciences at Ben-Gurion University, Israel. We would like to
thank the reviewers for their insightful comments which led
to several improvements in the presentation of this paper.

[10]

[11]

[12]

[13]

[14]

[15]

REFERENCES

[1] T. Rath and R. Manmatha, Word image matching

using dynamic time warping, in Computer Vision and
Pattern Recognition, 2003. Proceedings. 2003 IEEE
Computer Society Conference on, vol. 2, 18-20 June
2003, pp. II521II527vol.2.
[2] T. M. Rath, R. Manmatha, and V. Lavrenko, A
search engine for historical manuscript images,
Annual ACM Conference on Research and
Development in Information Retrieval, pp. 369376,
2004.
[3] E. M. Kornfield, R. Manmatha, and J. Allan, Text
alignment with handwritten documents, Document
Image Analysis for Libraries, International Workshop
on, vol. 0, p. 195, 2004.
[4] S. Zhixin, S. Srirangaraj, and G. Venu, Text
extraction from gray scale historical document images
using adaptive local connectivity map, in ICDAR 05:
Proceedings of the Eighth International Conference on
Document Analysis and Recognition. Washington,
DC, USA: IEEE Computer Society, 2005, pp. 794798.
[5] Y. Li, Y. Zheng, D. Doermann, and S. Jaeger,
Script-independent text line segmentation in freestyle
handwritten documents, Pattern Analysis and
Machine Intelligence, IEEE Transactions on, vol. 30,
no. 8, pp. 1313 1329, aug. 2008.
[6] R. Saabni and J. El-Sana, Keyword searching for
arabic handwriting, in International conference of
frontiers in Handwriting recognition ICFHR,
Montreal, Canada., August 2008, pp. 271276.
[7] L. Likforman-Sulem, A. Zahour, and B. Taconet,
Text line segmentation of historical documents: a
survey, International Journal on Document Analysis
and Recognition, vol. 9, pp. 123138, 2007,
10.1007/s10032-006-0023-z. [Online]. Available:
http://dx.doi.org/10.1007/s10032-006-0023-z
[8] R. Saabni and J. El-Sana, Language-independent text

[16]

[17]

[18]

[19]

[20]

[21]

[22]

lines extraction using seam carving, in International

Conference on Document Analysis and Recognition,
(to appear), 2011.
S. Avidan and A. Shamir, Seam carving for
content-aware image resizing, ACM Trans. Graph.,
vol. 26, no. 3, p. 10, 2007.
K. Y. Wong, R. G. Casey, and F. M. Wahl,
Document analysis system, IBM Journal of Research
and Development, vol. 26, no. 6, pp. 647656, 1982.
I. B. Yosef, N. Hagbi, K. Kedem, and I. Dinstein, Line
segmentation for degraded handwritten historical
documents, in ICDAR, 2009, pp. 11611165.
T.Pavlidis and J.Zhou, Page segmentation by white
streams, in 1st Int. Conf. Document Analysis and
Recognition. (ICDAR) Int. Assoc. Pattern
Recognition, 1991, pp. 945953.
S. Nagy and S. Stoddard, Document analysis with
expert system, in Procedings of Pattern Recognition
conference in practice II, 1985.
E. Bruzzone and M. Coffetti, An algorithm for
extracting cursive text lines, in in ICDAR 99:
Proceedings of the Fifth International Conference on
Document Analysis and Recognition. IEEE Computer
Society, 1999, p. 749.
J. He and A. C. Downton, User-assisted archive
document image analysis for digital library
construction, in ICDAR 03: Proceedings of the
Seventh International Conference on Document
Analysis and Recognition. Washington, DC, USA:
IEEE Computer Society, 2003, p. 498.
A. Zahour, B. Taconet, P. Mercy, and S. Ramdane,
Arabic hand-written text-line extraction, in ICDAR,
2001, pp. 281285.
F. LeBourgeois, Robust multifont ocr system from
gray level images, in ICDAR 97: Proceedings of the
4th International Conference on Document Analysis
and Recognition. Washington, DC, USA: IEEE
Computer Society, 1997, pp. 15.
S. Vladimir, G. Georgi, and S. Vassil, Handwritten
document image segmentation and analysis, Pattern
Recogn. Lett., vol. 14, no. 1, pp. 7178, 1993.
L. Likforman-Sulem, A. Hanimyan, and C. Faure, A
hough based algorithm for extracting text lines in
handwritten documents, Document Analysis and
Recognition, International Conference on, vol. 2, p.
774, 1995.
S. Zhixin and G. Venu, Line separation for complex
document images using fuzzy runlength, in DIAL 04:
Proceedings of the First International Workshop on
Document Image Analysis for Libraries (DIAL04).
Washington, DC, USA: IEEE Computer Society, 2004,
p. 306.
Z. Shi, S. Setlur, and V. Govindaraju, A steerable
directional local profile technique for extraction of
handwritten arabic text lines, Document Analysis and
Recognition, International Conference on, vol. 0, pp.
176180, 2009.
A. Nicolaou and B. Gatos, Handwritten text line
segmentation by shredding text into its lines, in
ICDAR 09: Proceedings of the 2009 10th
International Conference on Document Analysis and
Recognition. Washington, DC, USA: IEEE Computer

Society, 2009, pp. 626630.

[23] L. Gorman, The document spectrum for pagelay-out
analysis, IEEE Trans. Pattern Analysis and Machine

Intelligence., vol. 15, no. 11, pp. 1162U1173

1993.

[24] L. forman Sulem and C.Faure, Extracting text lines

in handwritten documents by perceptual grouping, in
Advances in handwriting and drawing:a
multidisciplinary approach .Winter Eds,

Europia,Paris, p. 117U135,
1994.
[25] I. S. I. Abuhaiba, S. Datta, and M. J. J. Holt, Line
extraction and stroke ordering of text pages, in
ICDAR, 1995, p. 390.
[26] A. Simon, J.-C. Pret, and A. P. Johnson, A fast
algorithm for bottom-up document layout analysis,
IEEE Trans. Pattern Anal. Mach. Intell., vol. 19,
no. 3, pp. 273277, 1997.
[27] Y.Pu and Z.Shi, Anatural learning algorithm based
on hough transform for text lines extraction in hand
written documents, in in Proceedings sixth
International Workshop on Frontiers of Handwriting
646.
Recognition, 1998, p. 637 U
[28] K. Koichi, S. Akinori, and I. Motoi, Segmentation of
page images using the area voronoi diagram, Comput.
Vis. Image Underst., vol. 70, no. 3, pp. 370382, 1998.
[29] S. Nicolas, T. Paquet, and L. Heutte, Text line
segmentation in handwritten document using a
production system, in Proceedings of the Ninth
International Workshop on Frontiers in Handwriting

Recognition, 2004, p. 245U250.

[30] S. S. Bukhari, F. Shafait, and T. M. Breuel,
Script-independent handwritten textlines
segmentation using active contours, in ICDAR, 2009,
pp. 446450.
[31] A. Alaei, U. Pal, and P. Nagabhushan, A new scheme
for unconstrained handwritten text-line
segmentation, Pattern Recognition, vol. 44, no. 4, pp.
917 928, 2011.
[32] M. Liwicki, E. Indermuhle, and H. Bunke, On-line
handwritten text line detection using dynamic
programming, in Proceedings of the Ninth
International Conference on Document Analysis and
Recognition - Volume 01. Washington, DC, USA:
IEEE Computer Society, 2007, pp. 447451. [Online].
Available:
http://portal.acm.org/citation.cfm?id=1304595.1304766
[33] G. Levi and U. Montanari., A grey-weighted
skeleton. Information Control, vol. 17, pp. 6291,
1970.
[34] P. Toivanen, New geodesic distance transforms for
gray scale images. Pattern Recognition Letters,
vol. 17, pp. 437450, 1996.
[35] Juma Al-Majid Center for Culture and Heritage,
http://www.almajidcenter.org, online; accessed June,
2011.
[36] Wadod center for manuscripts, http://wadod.com,
online; accessed June, 2011.
[37] Master thesis in pharmacy, American University of
Beirut, http://ddc.aub.edu.lb, online; accessed June,
2011.

Character Recoganization
No ratings yet
Character Recoganization
6 pages
Smart Note Taker: A Seminar Report On
No ratings yet
Smart Note Taker: A Seminar Report On
32 pages
Kiumarsi 2018
No ratings yet
Kiumarsi 2018
6 pages
2014 IJDAR DFernandez
No ratings yet
2014 IJDAR DFernandez
21 pages
Manuscript Extraction
No ratings yet
Manuscript Extraction
14 pages
Text Line Detection in Handwritten Documents Louloudis2008
No ratings yet
Text Line Detection in Handwritten Documents Louloudis2008
15 pages
Binarization and Segmentation of Kannada Handwritten Document Images
No ratings yet
Binarization and Segmentation of Kannada Handwritten Document Images
6 pages
Document Mosaicing: Unlocking Visual Insights through Document Mosaicing
From Everand
Document Mosaicing: Unlocking Visual Insights through Document Mosaicing
Fouad Sabry
No ratings yet
Saqib Baby Snakes DAS08
No ratings yet
Saqib Baby Snakes DAS08
8 pages
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
No ratings yet
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
11 pages
Coupled Snakelet Model For Curled Textline Segmentation of Camera-Captured Document Images
No ratings yet
Coupled Snakelet Model For Curled Textline Segmentation of Camera-Captured Document Images
5 pages
Text Line Segmentation and Word Recognition in A System For General Writer Independent Handwriting Recognition
No ratings yet
Text Line Segmentation and Word Recognition in A System For General Writer Independent Handwriting Recognition
11 pages
Text-Image Separation in Document Images Using Boundary/Perimeter Detection
No ratings yet
Text-Image Separation in Document Images Using Boundary/Perimeter Detection
5 pages
Confluence 2018 8442875
No ratings yet
Confluence 2018 8442875
4 pages
Method For Line Segmentation in Handwrit
No ratings yet
Method For Line Segmentation in Handwrit
6 pages
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
From Everand
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
Fouad Sabry
No ratings yet
Chowdhury 2007
No ratings yet
Chowdhury 2007
5 pages
Ofta Online Version
No ratings yet
Ofta Online Version
5 pages
ICIP 2012 Deshredding GallagherDeever
No ratings yet
ICIP 2012 Deshredding GallagherDeever
4 pages
Text Line Identification in Tagore's Manuscript: Chandranath Adak Bidyut B. Chaudhuri
No ratings yet
Text Line Identification in Tagore's Manuscript: Chandranath Adak Bidyut B. Chaudhuri
4 pages
Procedural Surface: Exploring Texture Generation and Analysis in Computer Vision
From Everand
Procedural Surface: Exploring Texture Generation and Analysis in Computer Vision
Fouad Sabry
No ratings yet
A Robust and Fast Text Extraction in Images and Video Frames
No ratings yet
A Robust and Fast Text Extraction in Images and Video Frames
7 pages
OCR Using Image Processing
No ratings yet
OCR Using Image Processing
8 pages
Character Extraction Algorithm For Handwritten Character Recognition Systems
No ratings yet
Character Extraction Algorithm For Handwritten Character Recognition Systems
8 pages
Maliki 2012
No ratings yet
Maliki 2012
8 pages
Learning Text-Line Localization With Shared and Local Regression Neural Networks
No ratings yet
Learning Text-Line Localization With Shared and Local Regression Neural Networks
6 pages
Text Recognition Handwritten Words
No ratings yet
Text Recognition Handwritten Words
18 pages
A Robust Algorithm For Text String Separation From Mixed Text/Graphics Images
No ratings yet
A Robust Algorithm For Text String Separation From Mixed Text/Graphics Images
9 pages
Graph Layout Support for Model-Driven Engineering
From Everand
Graph Layout Support for Model-Driven Engineering
Miro Spönemann
No ratings yet
Kang Orientation Robust Text 2014 CVPR Paper
No ratings yet
Kang Orientation Robust Text 2014 CVPR Paper
8 pages
Design and Implementation Recognition System For Handwritten Hindi/Marathi Document
No ratings yet
Design and Implementation Recognition System For Handwritten Hindi/Marathi Document
5 pages
Camera Ready Copy An Improved Method For Text Segmentation and Skew Normalization of Handwriting Image - Edited
No ratings yet
Camera Ready Copy An Improved Method For Text Segmentation and Skew Normalization of Handwriting Image - Edited
13 pages
Breaking The PayPal HIP A Comparison of
No ratings yet
Breaking The PayPal HIP A Comparison of
17 pages
De Nardin A One-Shot Learning Approach To Document Layout Segmentation of Ancient WACV 2024 Paper
No ratings yet
De Nardin A One-Shot Learning Approach To Document Layout Segmentation of Ancient WACV 2024 Paper
10 pages
Scene Text Recognition by Using EE-MSER and Optical Character Recognition For Natural Images-35843
No ratings yet
Scene Text Recognition by Using EE-MSER and Optical Character Recognition For Natural Images-35843
5 pages
A Recursive Otsu Thresholding Method For Scanned Document Binarization
No ratings yet
A Recursive Otsu Thresholding Method For Scanned Document Binarization
8 pages
Title: Spatial Cohesion Refers To The Fact That Text
No ratings yet
Title: Spatial Cohesion Refers To The Fact That Text
6 pages
Segmentation of Arabic Handwritten Documents Into PDF
No ratings yet
Segmentation of Arabic Handwritten Documents Into PDF
8 pages
Thesis Research Proposal
No ratings yet
Thesis Research Proposal
5 pages
Research Paper On Character Recognition
No ratings yet
Research Paper On Character Recognition
8 pages
Layout Analysis For Arabic Historical Document Images
No ratings yet
Layout Analysis For Arabic Historical Document Images
6 pages
Seminar Report On Image Text Segmentation
No ratings yet
Seminar Report On Image Text Segmentation
31 pages
Optical Character Recognition
100% (1)
Optical Character Recognition
36 pages
Stroke Width Transform
No ratings yet
Stroke Width Transform
8 pages
IJERT Segmentation and Detection of Text
No ratings yet
IJERT Segmentation and Detection of Text
6 pages
Automatically Detect and Recognize Text in Natural Images
No ratings yet
Automatically Detect and Recognize Text in Natural Images
19 pages
Text Color Images
No ratings yet
Text Color Images
6 pages
Handwritten Text Recgnition Final
No ratings yet
Handwritten Text Recgnition Final
5 pages
Word Segmentation Method For Handwritten
No ratings yet
Word Segmentation Method For Handwritten
5 pages
Vaidhi Ayush Gurkirat Jatin Project Synopsis Format
No ratings yet
Vaidhi Ayush Gurkirat Jatin Project Synopsis Format
6 pages
Latest Base Paper
No ratings yet
Latest Base Paper
4 pages
Sourabh Art1
No ratings yet
Sourabh Art1
15 pages
Bofinal
No ratings yet
Bofinal
10 pages
Recognition of English Characters by Codes Generated Using Neighbour Identification
No ratings yet
Recognition of English Characters by Codes Generated Using Neighbour Identification
5 pages
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
No ratings yet
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
4 pages
Optical Character Recognition For Devanagari Script
No ratings yet
Optical Character Recognition For Devanagari Script
5 pages
Analogic Preprocessing and Segmentation Algorithms For Off-Line Handwriting Recognition
No ratings yet
Analogic Preprocessing and Segmentation Algorithms For Off-Line Handwriting Recognition
20 pages
Text Detection and Character Recognition From Images: Nesib Eyyubov
No ratings yet
Text Detection and Character Recognition From Images: Nesib Eyyubov
14 pages
PAA RevisedVersionFinal PDF
No ratings yet
PAA RevisedVersionFinal PDF
30 pages
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
From Everand
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
Fouad Sabry
No ratings yet
Vector Graphics: Mastering Vector Graphics in Computer Vision
From Everand
Vector Graphics: Mastering Vector Graphics in Computer Vision
Fouad Sabry
No ratings yet
JOCC Volume 2 Issue 1 Page 9 19
No ratings yet
JOCC Volume 2 Issue 1 Page 9 19
11 pages
C1 Projectreport
No ratings yet
C1 Projectreport
58 pages
High-Performance OCR For Printed English and Fraktur Using LSTM Networks
No ratings yet
High-Performance OCR For Printed English and Fraktur Using LSTM Networks
5 pages
An Overview of Smart Quill
No ratings yet
An Overview of Smart Quill
3 pages
Performance Analysis of Artificial Neural Network Algorithms For Automatic Handwritten Devanagari Text Generation in Marathi Styles
No ratings yet
Performance Analysis of Artificial Neural Network Algorithms For Automatic Handwritten Devanagari Text Generation in Marathi Styles
8 pages
SVM, KNN, and Neural Networks Investigated For Machine Learning in Written Word Decoding
No ratings yet
SVM, KNN, and Neural Networks Investigated For Machine Learning in Written Word Decoding
9 pages
A Deep Learning Approach For Recognizing The Cursi
No ratings yet
A Deep Learning Approach For Recognizing The Cursi
15 pages
ManishGiri G 2018465 34
No ratings yet
ManishGiri G 2018465 34
12 pages
Improving Amharic Handwritten Word Recognition Usi
No ratings yet
Improving Amharic Handwritten Word Recognition Usi
11 pages
Smart Note Taker
78% (9)
Smart Note Taker
35 pages
Literature Review in Computer Science Research
100% (2)
Literature Review in Computer Science Research
7 pages
Section3NewR3 Merged
No ratings yet
Section3NewR3 Merged
72 pages
Handwriting Biometrics
No ratings yet
Handwriting Biometrics
16 pages
Bilingual OCR Report
No ratings yet
Bilingual OCR Report
10 pages
Documentation of Tablet PC
No ratings yet
Documentation of Tablet PC
34 pages
Motion Tablet F5 F5V F5T User Guide
No ratings yet
Motion Tablet F5 F5V F5T User Guide
28 pages
IBM-19100-1662601873 - Project Report
No ratings yet
IBM-19100-1662601873 - Project Report
37 pages
Presentation of Project
No ratings yet
Presentation of Project
13 pages
Neural Networks Based Off-Line Signature Verifcation
No ratings yet
Neural Networks Based Off-Line Signature Verifcation
30 pages
Offline Handwritten Text Recognition Using Hybrid CNNBLSTM Network
No ratings yet
Offline Handwritten Text Recognition Using Hybrid CNNBLSTM Network
6 pages
Introduction To PDA: What's A Handheld PC (H/PC) ?
No ratings yet
Introduction To PDA: What's A Handheld PC (H/PC) ?
19 pages
Recognition of Off-Line Kannada Handwritten Charac PDF
No ratings yet
Recognition of Off-Line Kannada Handwritten Charac PDF
11 pages
Off-Line Arabic Handwriting Character Recognition Using Word Segmentation
No ratings yet
Off-Line Arabic Handwriting Character Recognition Using Word Segmentation
6 pages
Information Search: Smart Quill
No ratings yet
Information Search: Smart Quill
7 pages
Technobot Using Python: Project Report
No ratings yet
Technobot Using Python: Project Report
22 pages
Handwriting Recognition With Large Multidimensional Long Short-Term Memory Recurrent Neural Networks
No ratings yet
Handwriting Recognition With Large Multidimensional Long Short-Term Memory Recurrent Neural Networks
6 pages
PDF 20
No ratings yet
PDF 20
5 pages
E3 Chap 10
No ratings yet
E3 Chap 10
30 pages
Handwritten Character Recognition System Black Book For Capstone Project
No ratings yet
Handwritten Character Recognition System Black Book For Capstone Project
33 pages

GrayLineExtraction PDF

Uploaded by

GrayLineExtraction PDF

Uploaded by

Text Line Segmentation for Gray Scale Historical

Ben-Gurion University of the Negev, Beer Sheva, Israel

in machine-printed documents [7]. These challenges can be

segment text-line in document images [23, 24, 25, 26, 27,

Humans tend to perceive text line patterns by tracking the

The search for a chain of pixels that either passes across

is defined as modified geodesic distances; i.e., the distance

f (d(pi , pi+1 ), I(pi ), I(pi+1 ))

which forces the medial seam to propagate along the text

Toivanen [34] developed the distance transform on curved

3.2.2 Separating Seams

Figure 2: Medial seam generation using (a) one pass,

2GDT (i, j) + min1l=1 (wl map[i + l, j 1])

We noticed that computing the seam-map in one pass as

Figure 3: Seam-map generation: (a)original image,

to the seam sm , we extend pmin along minimal points with

Figure 4: Spring model: (a) the medial seam (blue)

The fragments with the highest certainty values form the

Several evaluation methods for line extraction algorithms

Equation 4 nicely measure the correctness of a text line, but

In languages that include many dots and diacritics, such as

CONCLUSIONS AND FUTURE WORK

We presented a language independent approach for text line

each text line. A medial seam passes along the presumed

This research was supported in part by the Israel Science

[1] T. Rath and R. Manmatha, Word image matching

lines extraction using seam carving, in International

Society, 2009, pp. 626630.

Intelligence., vol. 15, no. 11, pp. 1162U1173

[24] L. forman Sulem and C.Faure, Extracting text lines

Recognition, 2004, p. 245U250.

You might also like