Reading Text in the Wild with Convolutional Neural Networks

Jaderberg, Max; Simonyan, Karen; Vedaldi, Andrea; Zisserman, Andrew

Computer Science > Computer Vision and Pattern Recognition

arXiv:1412.1842 (cs)

[Submitted on 4 Dec 2014]

Title:Reading Text in the Wild with Convolutional Neural Networks

Authors:Max Jaderberg, Karen Simonyan, Andrea Vedaldi, Andrew Zisserman

View PDF

Abstract:In this work we present an end-to-end system for text spotting -- localising and recognising text in natural scene images -- and text based image retrieval. This system is based on a region proposal mechanism for detection and deep convolutional neural networks for recognition. Our pipeline uses a novel combination of complementary proposal generation techniques to ensure high recall, and a fast subsequent filtering stage for improving precision. For the recognition and ranking of proposals, we train very large convolutional neural networks to perform word recognition on the whole proposal region at the same time, departing from the character classifier based systems of the past. These networks are trained solely on data produced by a synthetic text generation engine, requiring no human labelled data.
Analysing the stages of our pipeline, we show state-of-the-art performance throughout. We perform rigorous experiments across a number of standard end-to-end text spotting benchmarks and text-based image retrieval datasets, showing a large improvement over all previous methods. Finally, we demonstrate a real-world application of our text spotting system to allow thousands of hours of news footage to be instantly searchable via a text query.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1412.1842 [cs.CV]
	(or arXiv:1412.1842v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1412.1842

Submission history

From: Max Jaderberg [view email]
[v1] Thu, 4 Dec 2014 21:14:59 UTC (5,818 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2014-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Max Jaderberg
Karen Simonyan
Andrea Vedaldi
Andrew Zisserman

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Reading Text in the Wild with Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Reading Text in the Wild with Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators