Recognition of Handwritten Digits Using Machine Learning Techniques IJERTV6IS050456 PDF
Recognition of Handwritten Digits Using Machine Learning Techniques IJERTV6IS050456 PDF
I. INTRODUCTION
Handwritten image recognition is probably one of the
most interesting and challenging applications in the field
of pattern recognition. Handwritten recognition is divided
into two types of techniques: on-line and off-line. Off-line
techniques include reading the character using an image
capture device, such as a camera. while the technique
which is being dealt here is Off-line which means to
convert a handwritten image into a machine readable
form. Fig. 1. Steps of Pre-Processing
The major factor behind choosing this particular Finally the image is resized to 25 by 25 pixels. The
application is its numerous applications such as Automatic images used for testing consists of more than one digits
Number Plate Recognition, assisting blind and visually which needs to be separated into individual digits before
impaired people,automatic check processing for banks, applying the pre-processing steps.
and to process huge number of documents in industries
like healthcare, legal, education, and finance the focus if
the work described in this paper is on handwritten digits.
The paper will further be covering data collection, image IV. FEATURE EXTRACTION
preprocessing, feature extraction, and finally Blob Analysis: Blob or Binary Large Object is a large
classification. image which needs to be managed, and consists of binary
II. DATA ACQUISITION data. In this project we used images that consists of a
sequence of digits. The image is then converted into a
The data used in this project is a set of handwritten
binary image using the bwlabel function in matlab, where
digits from 1 to 10. The data has been divided into two
the image is processed according to the connected
categories which form the training set and the test set.
components concept.
Sets of data were collected which are the phone numbers,
zip-codes, and address plates for testing purpose. Connected Components: The concept is based on
grouping similar pixels according to pixel connectivity.
The connected components have similar levels of pixel binary class classification, that is one against all other
intensity, and after grouping, each pixel is labelled classes which builds one SVM per class. Here the
according to the component it belongs to. K-connected technique used is oneagainst-one which builds one SVM
components (here K=8), algorithm is used in this work. for each pair of classes. This method constructs n(n-1)/2
classifiers where each one is trained on data from two
The process of classification is performed using classes. For training data from ith and jth classes we use the
template matching, SVM and Neural Network approach. following binary classification ,
V. TEMPLATE MATCHING
Template Matching is a computer vision technique
which is used to recognize the elements in the image by
matching it with a predefined template. The process is
elaborated in the flow diagram in Fig. 2.
Neural Network
The input is explained in the following GUI.
IX. CONCLUSION
In this project we used Template Matching, Support
Vector Machine, and Artificial Neural Network for digit
recognition. It turned out that all the three methods were
very promising but Neural Networks was very challenging
to apply and yielded very good results, followed by SVM
and Template Matching. Due to time constraints our
project was restricted to digits, and for future work it will
be interesting to investigate characters and more advanced
applications could involve facial or handwriting
recognition.
X. ACKNOWLEDGEMENT
We are very thankful to our professors and specially to our
guide Dr. Selvam Venkatesan for his significant help in
Fig. 9 Neural Network Gui completing this project.