0% found this document useful (0 votes)
45 views11 pages

Optical Character Recognition: Divyanshu Sagar Ahmed Zaid Faizee Vidyut Singhania

This document discusses optical character recognition (OCR) and a student project to develop an OCR system using machine learning algorithms and neural networks. It provides background on OCR, describes the current progress made in implementing various machine learning techniques, and outlines the planned technology use and timeline for the project.

Uploaded by

Nimit Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views11 pages

Optical Character Recognition: Divyanshu Sagar Ahmed Zaid Faizee Vidyut Singhania

This document discusses optical character recognition (OCR) and a student project to develop an OCR system using machine learning algorithms and neural networks. It provides background on OCR, describes the current progress made in implementing various machine learning techniques, and outlines the planned technology use and timeline for the project.

Uploaded by

Nimit Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 11

OPTICAL CHARACTER

RECOGNITION
Divyanshu Sagar
Ahmed Zaid Faizee
Vidyut Singhania
INTRO
1. Ingenious piece of software.
2.Involves the mechanical/electronic
conversion of scanned imagesof
typewritten/printed text into machine-
encoded/computer-readable text.
3. Heavily used in the
industry.
INTRO ii

Common method of digitizing printed texts

Subtle software which is as highly overlooked as it is


simple.

Numerous applications and uses editing, scanning,


searching, comparison, compact storage and many
more!

OCR is a field of research inpattern


recognition,artificial intelligenceandcomputer vision.
Problem Statement
Ever since Charles Babbage invented the computer back in the early
19th century, Computer machines have held man's imagination for
numerous reasons - the primary being what all is this collection of nuts,
bolts and wires capable of doing.
Character Recognition is one such concept which has beheld mankinds
attention. There can be no greater testimony to the same than the fact
that people werealreadyworking on this idea-a few decades before John
McCarthy even coined the term "Artificial Intelligence".
Today, especially, Character Recognition plays a very important part of
our daily lives as they are incorporated so subtly that we even forget
their presence. Some examples are their implementation in Microsoft
Word, Adobe Acrobat and even Pen computing.
Optical Character Recognition (OCR)is the mechanicalor
electronicconversion of scanned or photoed imagesof typewritten or
printed text into machine-encoded/computer-readable text. This text can
then be used in numerous ways - ranging from assisting the visually
impaired (text-to-speech), extracting information from the
image, pen computing and so on.Optical Character Recognition
(OCR)is a result of cross-linking various avenues of technology like
Machine Learning, Artificial Intelligence and Neural Networks. We propose
to develop a system based on mathematical algorithms and
principles which involve all the aforementioned technologies. That being
Tech Jargon - I
Pre-processing
Used to improve the successful
recognition of the image (include De-
skew, Layout analysis, Despeckle)
Character/glyph recognition
Post-processing
Application specific optimization

Tweaking the system to better deal


with specific or different inputs.
Tech Jargon - II
Segmentation
Includes two important phases:
1) Obtaining training
samples
2) Recognizing new images
after
training

Feature Extraction
Feature of the character are
extracted and hence are
compared with the glyph

Classification
After the extraction, neural
Our Current Progress
We started with the Neural Networks / Machine Learning
aspect of the project.

We have implemented Univariate / Multivariate


Linear/Regularized Linear Regression, Gradient Descent for
Multiple Variables and Logistic/ Regularized Logistic
Regression.

Currently, we are studying & working on the


implementation of Neural Nets using Forward Propogation.

We plan on tackling character segmentation and feature


extraction next.
Technology to be used
We are using the following
technology platforms :
GNU Octave
To develop and test the OCR software.
5MP HD camera (720p @ 30fps)
To take images for detection
Timeline
Development
of Machine
Requirement Learning algo/
Analysis Begins Neural Nets
Dec, 2013. 2nd March,
2014

Designing Begins Feature


Extraction &
end of Jan, 2014 Segmentation
14th March, 2014
Development Testing 1st April,
begins - mid of 2014
Feb, 2014 Deployment 8th
April, 2014
Literature Review

Microsoft One Note

Adobe PDF scanner

HP scanner

You might also like