Handwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text Conversion
Volume: 3 | Issue: 3 | Mar-Apr 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 - 6470
@ IJTSRD | Unique Paper ID – IJTSRD23508 | Volume – 3 | Issue – 3 | Mar-Apr 2019 Page: 1826
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
Flow Chart
Conclusion
Handwritten Character Recognition from images is very
essential these days. Character Recognition from images uses
feature extraction using character geometry and gradient
technique [4]. The feature extraction methods have
performed well in classification when fed to the neural
Figure2. Process flow network and preprocessing of image using edge detection
and normalization are the ideal choice for degraded noisy
Methodology images. The method of training neural network with
This project is developed using Tesseract tess-2 module extracted features front sample images of each character has
software which is a Computer vision API library [3] , the detection accuracy to a greater extent. The proposed
modelis pretrained with the dataset containingthe literals of methodology has produced good results for images
the language, which are inturn compared to the input image containing handwritten text written in different styles,
file to produce the required output. different size and alignment with varying background. The
system is developed and evaluated for a set of sample images
Advantages of System containing handwritten text [5]. We discussed a NN which is
1. Converting handwritten text to digital text. able to recognize text in images. The NN consists of 5 CNN
2. We can store it in our versatile itself. and 2 RNN layers and outputs a character-probability matrix.
3. Copy the converted digital text. This matrix is either used for CTC loss calculation or for CTC
4. Share the converted digital text via mail, whatsapp, etc,. decoding. An implementation using TF is provided.
Improvements References
1. It can be trained more to get accurate results. [1] R. Smith, “A Simple and Efficient Skew Detection
2. It can be trained on multiple data sets to adapt to Algorithm via Text Row Accumulation”, Proc. of the 3rd
different languages. Int. Conf. on Document Analysis and Recognition (Vol.
3. Text to speech feature can be added. 2), IEEE 1995, pp. 1145-1148.
[2] S.V. Rice, F.R. Jenkins, T. A. Nartker, The Fourth Annual
Result and Analysis Test of OCR Accuracy, Technical Report 95-03,
We have tested the performance of our proposed system on Information Science Research Institute, University of
many samples of handwritten text. Nevada, Las Vegas, July 1995.
Here are few screenshots of the result
[3] R.W. Smith, The Extraction and Recognition of Text
from Multimedia Document Images, PhD Thesis,
University of Bristol, November 1987.
[4] Chirag I Patel, Ripal Patel. Palak Patel “Handwritten
Character Recognition Using Neural Networks",
International Journal of Scientific & Engineering
Research Volume 2, Issue 5, May- 2011.
[5] Kauleshwar Prasad, Devvrat C Nigam,
AshmikaLakhotiya, Dheeren Umre "Character
Recognition Using Neural Toolbox", International
Journal of u- and e- Service, Science and Technology
Vol. 6, No. 1, February, 2013.
Screenshot 1: output screen
@ IJTSRD | Unique Paper ID - IJTSRD23508 | Volume – 3 | Issue – 3 | Mar-Apr 2019 Page: 1827