Table of Content

The document discusses handwritten digit recognition using machine learning techniques. It describes existing and proposed systems, algorithms used including SVM-KNN, model implementation and evaluation achieving 97% accuracy on the MNIST dataset.

Uploaded by

Amrutha reddy karumuru

TABLE OF CONTENT

Table of Content
Abstract
Introduction
Existing system
Proposed system
Problem statement
Algorithms and techniques
Implementation
Model evaluation and validation
Conclusion
References

ABSTRACT
The human visual system is one of the wonders of the world. Consider the following
sequence of handwritten digits:

Most people effortlessly recognize those digits as 5, 6, 8. That ease is deceptive. We carry in
our heads a supercomputer, tuned by evolution over hundreds of millions of years, and
superbly adapted to understand the visual world. Recognizing handwritten digits isn't easy.
Rather, we humans are stupendously, astoundingly good at making sense of what our eyes
show us. But nearly all that work is done unconsciously. And so we don't usually appreciate
how tough a problem our visual systems solve.
The difficulty of visual pattern recognition becomes apparent if you attempt to write a
computer program to recognize digits like those above. What seems easy when we do it
ourselves suddenly becomes extremely difficult. Simple intuitions about how we recognize
shapes - "a 9 has a loop at the top, and a vertical stroke in the bottom right" - turn out to be
not so simple to express algorithmically. When you try to make such rules precise, you
quickly get lost in a morass of exceptions and caveats and special cases. It seems hopeless.

INTRODUCTION

Handwritten character recognition is a field of research in artificial intelligence, computer
vision, and pattern recognition. A computer performing handwriting recognition is said to be
able to acquire and detect characters in paper documents, pictures, touch-screen devices,
emails, and bank cheques, and to convert them into machine-encoded form. Typical tasks
include recognizing vehicle number plates and processing bank cheque amounts. Its
application is found in optical character recognition, transcription of handwritten documents
into digital documents, and more advanced intelligent character recognition systems.

Handwritten character recognition can be thought of as a subset of the image recognition
problem.

The general flow of an image recognition algorithm.

Basically, the algorithm takes an image (of a handwritten digit) as input and outputs
the likelihood that the image belongs to each class (the machine-encoded digits, 0–9). In
this blog post, I will elaborate on my approach to solving this problem with a combination of
machine learning techniques.
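
The input and output described above can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: the 28×28 image size matches the 784 features mentioned later in this document, while the `flatten`, `softmax`, and `predict` helpers and the raw scores are hypothetical and stand in for whatever classifier produces the likelihoods.

```python
import math


def flatten(image):
    """Turn a 28x28 grid of pixel intensities into a flat 784-element vector."""
    return [pixel for row in image for pixel in row]


def softmax(scores):
    """Convert raw class scores into likelihoods that sum to 1 (an assumed choice)."""
    shift = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict(likelihoods):
    """Return the digit (0-9) with the highest likelihood."""
    return max(range(len(likelihoods)), key=lambda d: likelihoods[d])


image = [[0.0] * 28 for _ in range(28)]  # blank stand-in for a handwritten digit
features = flatten(image)

# Made-up scores, one per digit class 0-9; a real classifier would compute these.
scores = [0.1, 0.3, 0.2, 2.5, 0.0, 0.4, 0.1, 0.2, 0.6, 0.3]
likelihoods = softmax(scores)
digit = predict(likelihoods)
```

Here `digit` is simply the index of the largest likelihood, which is how a vector of class likelihoods is usually turned into a single machine-encoded digit.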

Existing System:
The existing system breaks each image into pixels and identifies numbers from their
sequences of pixels. This makes identification very difficult, and the results vary
in real time.

Disadvantages of Existing System:

• When the data size increases, computation becomes challenging.
• Visualization is poor.
• Parallel processing is not possible in clustering analysis.
• Mining techniques do not work well on big data when the analysis involves
multiple columns.

Proposed System:
We're focusing on handwriting recognition because it's an excellent prototype
problem for learning about neural networks in general. As a prototype it hits a sweet spot: it's
challenging - it's no small feat to recognize handwritten digits - but it's not so difficult as to
require an extremely complicated solution, or tremendous computational power.

Advantages:
• We propose a system that applies statistical analysis to sampled data.
• It includes data visualization, which is not done in big-data analysis.
• Python has good graphical libraries.
• Output presented with Python's graphical libraries is more effective.

Problem Statement
Handwritten digits are not always the same size, width, or orientation, nor are they always
justified to the margins, because handwriting differs from person to person. The general
problem is therefore to classify digits despite the similarity between pairs such as 1 and 7,
5 and 6, 3 and 8, 2 and 5, and 2 and 7. The problem becomes harder when many people write
the same digit in different handwriting styles, since individual variation influences the
formation and appearance of the digits. We now introduce the concepts and algorithms of
deep learning and machine learning.

Algorithms and Techniques

It has been shown that Support Vector Machines (SVMs) can be applied to image and
handwritten character recognition [4]. SVMs are effective in high-dimensional spaces, so it
makes sense to use them for this study given the high dimensionality of our input space, i.e.
784 features (one per pixel of a 28×28 image). However, SVMs do not scale well to large
datasets, because training time can grow as fast as cubically with the number of samples.
This could be an issue, as our dataset contains 42,000 samples. To deal with it, we adopt a
technique proposed by a study conducted at the University of California, Berkeley, which
trains a support vector machine on the collection of nearest neighbours, a solution the
authors called “SVM-KNN” [2]. Training an SVM on the entire data set is slow, and the
extension of SVM to multiple classes is not as natural as for Nearest Neighbor (NN).
However, in the neighbourhood of a small number of examples and a small number of classes,
SVMs often perform better than other classification methods.

We use NN as an initial pruning stage and perform SVM on the smaller but more relevant set
of examples that require careful discrimination. This approach reflects the way humans
perform coarse categorization: when presented with an image, human observers can answer
coarse queries such as presence or absence of an animal in as little as 150ms, and of course,
can tell what animal it is given enough time [6]. This process of a quick categorization,
followed by successive finer but slower discrimination was the inspiration behind the “SVM-
KNN” technique.

Implementation

Our simple implementation of SVM-KNN goes as follows: for a query, we compute the
Euclidean distances from the query to all training examples and pick the K nearest
neighbours. If the K neighbours all have the same label, the query is given that label and we
stop. Otherwise, we compute the pairwise distances between the K neighbours, convert the
distance matrix to a kernel matrix, and apply a multiclass SVM. Finally, we use the resulting
classifier to label the query.
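
The steps above can be sketched in Python as follows. This is a minimal illustration, assuming NumPy and scikit-learn are available; it is not the code from the linked repository. The function name `svm_knn_predict` is hypothetical. The text does not say how the distance matrix is converted to a kernel matrix, so the sketch assumes the identity K(x, y) = ½(d(x, 0) + d(y, 0) − d(x, y)), with d the squared Euclidean distance and the zero vector as origin, which reduces to the ordinary dot product.

```python
import numpy as np
from sklearn.svm import SVC


def sq_dist(A, B):
    """Pairwise squared Euclidean distances between the rows of A and the rows of B."""
    return np.sum((A[:, None, :] - B[None, :, :]) ** 2, axis=2)


def svm_knn_predict(query, X_train, y_train, k=3, C=0.5):
    """Label one query with a simplified SVM-KNN procedure (illustrative only)."""
    # Stage 1: distances from the query to all training examples, keep the K nearest.
    dists = sq_dist(query[None, :], X_train).ravel()
    nn_idx = np.argsort(dists)[:k]
    X_nn, y_nn = X_train[nn_idx], y_train[nn_idx]

    # If all K neighbours share one label, assign it and stop.
    if np.unique(y_nn).size == 1:
        return y_nn[0]

    # Stage 2: convert distances to a kernel matrix (assumed conversion, see lead-in).
    origin = np.zeros((1, X_train.shape[1]))
    d0_nn = sq_dist(X_nn, origin).ravel()
    d0_q = sq_dist(query[None, :], origin).ravel()
    K_train = 0.5 * (d0_nn[:, None] + d0_nn[None, :] - sq_dist(X_nn, X_nn))
    K_query = 0.5 * (d0_q[:, None] + d0_nn[None, :] - sq_dist(query[None, :], X_nn))

    # Train an SVM on the neighbours only; SVC handles more than two classes itself.
    clf = SVC(kernel="precomputed", C=C)
    clf.fit(K_train, y_nn)
    return clf.predict(K_query)[0]
```

A call such as `svm_knn_predict(x, X_train, y_train, k=3, C=0.5)` labels a single flattened image `x`. Because the SVM is trained only on K neighbours, its cost per query stays small even when the training set is large; the distance computation over the whole training set is then the dominant cost.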

Model Evaluation and Validation

In our initial implementation, we extract 60 principal components and use parameter values
of k=2 for KNN and C=1.0 for SVM. During development, a validation set was used to
evaluate the model. I split the dataset into training and test sets. The final hyperparameters
were chosen because they performed best among the combinations tried: a final value of
k=3 and C=0.5 yielded the best results. A low k value makes sense for our model because
we are trying to find the few samples where NN has a hard time establishing a decision
boundary and apply SVM to perform a more fine-grained classification.
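
The reduction to 60 principal components mentioned above can be sketched with plain NumPy. This is an illustration rather than the project's actual code: the helper names `pca_fit` and `pca_transform` are hypothetical, and random numbers stand in for the flattened images.

```python
import numpy as np


def pca_fit(X, n_components=60):
    """Return the data mean and the top principal directions of X."""
    mean = X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by decreasing variance.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:n_components]


def pca_transform(X, mean, components):
    """Project X onto the principal directions."""
    return (X - mean) @ components.T


rng = np.random.default_rng(0)
X = rng.random((200, 784))  # stand-in for 200 flattened 28x28 images
mean, components = pca_fit(X, n_components=60)
X_reduced = pca_transform(X, mean, components)  # shape (200, 60)
```

The mean and components should be fitted on the training split only and then reused to transform validation and test images, so that no information from held-out data leaks into the features.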

To verify the robustness of the final model, I use a cross-validation technique
(StratifiedShuffleSplit) on the dataset to ensure that the model generalizes well by using the
entire dataset for both training and testing. The model consistently categorized the
handwritten characters with 97% accuracy.
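
A validation loop of this kind might look like the sketch below, which uses scikit-learn's `StratifiedShuffleSplit`. Several things here are assumptions made for illustration: the small 8×8 digits dataset bundled with scikit-learn stands in for the 42,000-sample dataset, a plain `KNeighborsClassifier` stands in for the SVM-KNN model, and the number of splits and the test size are arbitrary.

```python
import numpy as np
from sklearn.datasets import load_digits
from sklearn.model_selection import StratifiedShuffleSplit
from sklearn.neighbors import KNeighborsClassifier

# Stand-in data: 8x8 digit images shipped with scikit-learn (not the report's dataset).
X, y = load_digits(return_X_y=True)

# Each split is a random train/test partition that preserves class proportions.
splitter = StratifiedShuffleSplit(n_splits=5, test_size=0.2, random_state=0)

scores = []
for train_idx, test_idx in splitter.split(X, y):
    model = KNeighborsClassifier(n_neighbors=3)  # placeholder for the real model
    model.fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[test_idx], y[test_idx]))

mean_accuracy = float(np.mean(scores))
```

Reporting the spread of `scores` alongside `mean_accuracy` shows whether the accuracy is stable from split to split, which is what "consistently" refers to above.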

CONCLUSION

The classification accuracy of 0.9714 is better than that of the benchmark (0.93514).
We can therefore conclude that our model is adequate for classifying handwritten
characters in the MNIST dataset, with an accuracy quite close to that of humans. However,
our model is useful only in a limited domain. Some changes would have to be made to solve
the bigger problem of recognizing multiple digits in an image or recognizing arbitrary
multi-digit text in unconstrained natural images.

Source Code

https://github.com/briceicle/capstone/blob/master/model.py
https://github.com/briceicle/capstone/tree/master/data

References

[1] http://yann.lecun.com/exdb/publis/pdf/matan-90.pdf
[2] http://www.vision.caltech.edu/Image_Datasets/Caltech101/nhz_cvpr06.pdf
[3] http://www.johnwinn.org/Publications/papers/WinnCriminisi_cvpr2006_video.pdf
[4] http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.441.6897&rep=rep1&type=pdf
[5] https://en.wikipedia.org/wiki/Support_vector_machine#Applications
[6] https://www.ncbi.nlm.nih.gov/pubmed/8632824
[7] https://github.com/chefarov/ocr_mnist/blob/master/papers/knn_MNIST.pdf
