Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks

Yousef, Mohamed; Hussain, Khaled F.; Mohammed, Usama S.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1812.11894 (cs)

[Submitted on 31 Dec 2018]

Title:Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks

Authors:Mohamed Yousef, Khaled F. Hussain, Usama S. Mohammed

View PDF

Abstract:Unconstrained text recognition is an important computer vision task, featuring a wide variety of different sub-tasks, each with its own set of challenges. One of the biggest promises of deep neural networks has been the convergence and automation of feature extractors from input raw signals, allowing for the highest possible performance with minimum required domain knowledge. To this end, we propose a data-efficient, end-to-end neural network model for generic, unconstrained text recognition. In our proposed architecture we strive for simplicity and efficiency without sacrificing recognition accuracy. Our proposed architecture is a fully convolutional network without any recurrent connections trained with the CTC loss function. Thus it operates on arbitrary input sizes and produces strings of arbitrary length in a very efficient and parallelizable manner. We show the generality and superiority of our proposed text recognition architecture by achieving state of the art results on seven public benchmark datasets, covering a wide spectrum of text recognition tasks, namely: Handwriting Recognition, CAPTCHA recognition, OCR, License Plate Recognition, and Scene Text Recognition. Our proposed architecture has won the ICFHR2018 Competition on Automated Text Recognition on a READ Dataset.

Comments:	Submitted for publication
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1812.11894 [cs.CV]
	(or arXiv:1812.11894v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.11894

Submission history

From: Mohamed Yousef [view email]
[v1] Mon, 31 Dec 2018 16:53:21 UTC (320 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators