0% found this document useful (0 votes)

788 views27 pages

Optical Character Recognition

Optical character recognition (OCR) is a process that uses machine and electronic translation to recognize text in images and convert it to editable text. The document discusses the stages of OCR including preprocessing, feature extraction, model estimation, and classification. It also provides examples of implementing OCR using MATLAB and with the Tesseract library for Android applications. Key advantages of OCR are increasing efficiency, recovering valuable space from documents, and providing greater accessibility of text.

Uploaded by

Amit Srivastava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

788 views27 pages

Optical Character Recognition

Uploaded by

Amit Srivastava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 27

OPTICAL CHARACTER RECOGNITION (OCR)

Contents

Introduction Stages in OCR MATLAB Implementation Steps in MATLAB Implementation Android Implementation Advantages Applications Conclusion References
2

INTRODUCTION
Motivation:Text detection and recognition in general have quite a lot of relevant application for automatic indexing or information retrieval such document indexing, content-based image retrieval, and license car plate recognition which further opens up the possibility for more improved and advanced systems.

OCR:OCR is the mechanical or electronic translation of images of handwritten, typewritten or printed text (usually captured by a scanner) into machine-editable text.

Aims and Objectives

OCR
Recognition Recognize each of the character in the detected text region using a suitable algorithm

Segmentation Separate the text region into its individual characters.

The goal of Optical Character Recognition (OCR) is to classify optical patterns (often contained in a digital image) corresponding to alphanumeric or other characters.

STAGES IN OCR
TRAINING

Pre - processing

Feature Extraction

Model Estimation OCR Pre - processing TESTING Feature Extraction

Classification

PRE-PROCESSING
The raw data is subjected to a number of preliminary processing steps to make it usable in the descriptive stages of character analysis. Pre-processing aims to produce data that are easy for the OCR systems to operate accurately. The main objectives of pre-processing are : Binarization Noise reduction Stroke width normalization Skew correction Slant removal

BINARIZATIO N

Binarization (thresholding) refers to the

conversion of a gray-scale image into a binary image. Two categories of thresholding are: Global - picks one threshold value for the entire document image which is often based on an estimation of the background level from the intensity histogram of the image. Adaptive (local) - uses different values for each pixel according to the local area information

Noise Reduction Normalization

Noise reduction improves the quality of the document. Normalization provides a tremendous reduction in data size, thinning extracts the shape information of the characters. Two main approaches:

Filtering (masks) Morphological Operations (erosion, dilation, etc)

6/10/13

FEATURE EXTRACTION
In feature extraction stage each character is represented as a feature vector, which becomes its identity. The major goal of feature extraction is to extract a set of features, which maximizes the recognition rate with the least amount of elements. Due to the nature of handwriting with its high degree of variability and imprecision obtaining these features, is a difficult task.

MODEL ESTIMATION
Given

labelled sets of features for many characters, where the labels correspond to the particular classes that the characters belong to, we wish to estimate a statistical model for each character class.

CLASSIFICATION
According

to Tou and Gonzalez, The principal function of a pattern recognition system is to yield decisions concerning the class membership of the patterns with which it is confronted. In the context of an OCR system, the recognizer is confronted with a sequence feature patterns from which it must determine the character classes.

MATLAB IMPLEMENTATION Flowchart:Preprocess

Segmentation

Recognition

Snapshot of MATLAB Application

Make Template
To create templete.mat to be use for classification:

36 images of characters Size = 60 X 55

Matrix siz e 55 X 60 X 36 Saved as template .mat

Preprocess
Raw Image Noise Filter Binarize

Resizing

Baunding

Complimenting

Preprocessed Image

Segmentation Connected Components

The segmentation character involves the following steps:

Scan the image from left to right to find on pixel. If on pixel been found, all on pixel connected to the detected on pixel will be extracted segmented as a pixel. The process will be repeated until it reach end right of the image.

Corr2
Where is the mean of the input matrix i and is the mean of the input matrix j. 0 < r < 1 1 mean i and j is exactly same while 0 mean the i and j not same at all.

Recognition - Template Correlations

temp = templates(:,:,j); in = chars(:,:,i); allCorrs(j) = corr2(temp, in); Source image Image Template

allcorrs(j)

0.82011

0.57395

0.43850

Android Implementation

The same OCR application we build for Android devices named MyOCR using open source library Tesseract by Google.

Tesseract Background:Developed on HP-UX at HP between 1985 and 1994 to run in a desktop scanner. Came neck and neck with Caere and XIS in the 1995 UNLV test. Never used in an HP product. Open sourced in 2005. Now on: http://code.google.com/p/tesseract-ocr Highly portable.

Tesseract OCR Architecture

ADVANTAGE
Increase efficiency OCR Recover valuable space Eliminates Retyping Need Greater accessibility

APPLICATION
Document reading machines used for Banking Applications Automatic address reading for mail sorting

Data entry

Process automation

Aid for blind Automatic number-plate readers

Other Applications

Text Entry
Page readers for text entry, mainly used in Office Automation

Typical errors in OCR

Variations in shape
Due to serifs and style variations.

Deformations
Caused by broken characters, smudged characters and speckle.

Variations in spacing
Due to subscripts, superscripts, skew and variable spacing

Mixture of text and graphics

Future needs
Need constrained OCR will be decreasing Omni font OCR Systems

Recognition of manually produced documents

Recognition of entire words instead of individual

REFRENCES

http://www.uri.edu/~hansenj/projects/ele585/OCR / J.T. Tou and R.C. Gonzalez, Pattern Recognition Principles, Addison-Wesley Publishing Company, Inc., Reading, Massachusetts, 1974

M. Szmurlo, Masters Thesis, Oslo, May 1995, (users.info.unicaen.fr/~szmurlo/papers/masters/ master.thesis.ps.gz)

THANK YOU
Special Thanks To: Google.com Mathwoks.com

Application of UAV
No ratings yet
Application of UAV
22 pages
Optical Character Recognition
No ratings yet
Optical Character Recognition
27 pages
Computer Graphics Module1
No ratings yet
Computer Graphics Module1
39 pages
MDG RETAIL Limitation
No ratings yet
MDG RETAIL Limitation
25 pages
Ocr
No ratings yet
Ocr
16 pages
Back Propagation Technique
No ratings yet
Back Propagation Technique
24 pages
FPGA Based Design and Implementation of Image Edge Detection Using Xilinx System Generator
No ratings yet
FPGA Based Design and Implementation of Image Edge Detection Using Xilinx System Generator
4 pages
Unit II Requirements Elicitation
No ratings yet
Unit II Requirements Elicitation
23 pages
BE Computer Engineering 2012
0% (1)
BE Computer Engineering 2012
60 pages
Evolution of Machine Learning Algorithm
No ratings yet
Evolution of Machine Learning Algorithm
21 pages
Image Segmentation For Object Detection Using Mask R-CNN in Colab
No ratings yet
Image Segmentation For Object Detection Using Mask R-CNN in Colab
5 pages
15EC72 - Digital Image Processing 2018-19 PDF
No ratings yet
15EC72 - Digital Image Processing 2018-19 PDF
62 pages
Image Recognition and Its Language Translation Using OCR
No ratings yet
Image Recognition and Its Language Translation Using OCR
8 pages
Steganography Project Report For Major Project in B Tech
No ratings yet
Steganography Project Report For Major Project in B Tech
74 pages
Ed 3 Book
No ratings yet
Ed 3 Book
636 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
FRAS Final Report CSE 299
No ratings yet
FRAS Final Report CSE 299
55 pages
Sugar Crystal Size Characterization Using Digital Image Processing
No ratings yet
Sugar Crystal Size Characterization Using Digital Image Processing
131 pages
Object Detection Using Deep Learning
No ratings yet
Object Detection Using Deep Learning
45 pages
Introduction To Modeling and Simulation
100% (2)
Introduction To Modeling and Simulation
7 pages
Text Retrieval From Scanned Forms Using Optical Character Recognition Springerlink
No ratings yet
Text Retrieval From Scanned Forms Using Optical Character Recognition Springerlink
10 pages
Naïve Bayes Classifier
No ratings yet
Naïve Bayes Classifier
3 pages
Jagruthi Institute of Engineering and Technology: Optical Character Recognition
No ratings yet
Jagruthi Institute of Engineering and Technology: Optical Character Recognition
28 pages
Introduction To Resnet
No ratings yet
Introduction To Resnet
14 pages
GUIDELINES FOR PREPARATION OF PROJECT REPORT - III and Above
No ratings yet
GUIDELINES FOR PREPARATION OF PROJECT REPORT - III and Above
15 pages
Optical Character Recognition
100% (1)
Optical Character Recognition
17 pages
ICEF 2020 Keynote Prith Banerjee
No ratings yet
ICEF 2020 Keynote Prith Banerjee
23 pages
Ab5 PDF
No ratings yet
Ab5 PDF
93 pages
Feature Extraction
No ratings yet
Feature Extraction
70 pages
Optical Character Recognition Project Report
No ratings yet
Optical Character Recognition Project Report
71 pages
A Matlab Project in Optical Character Recognition
No ratings yet
A Matlab Project in Optical Character Recognition
7 pages
Ch. 9: Introduction To Convolution Neural Networks (CNN) and Systems
No ratings yet
Ch. 9: Introduction To Convolution Neural Networks (CNN) and Systems
96 pages
Vision-Face Recognition Attendance Monitoring System For Surveillance Using Deep Learning Technology and Computer Vision
No ratings yet
Vision-Face Recognition Attendance Monitoring System For Surveillance Using Deep Learning Technology and Computer Vision
5 pages
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
No ratings yet
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
24 pages
Face Detection in Python Using OpenCV
No ratings yet
Face Detection in Python Using OpenCV
17 pages
2 Convolutional Neural Network For Image Classification
No ratings yet
2 Convolutional Neural Network For Image Classification
6 pages
Sign Language Detection
No ratings yet
Sign Language Detection
5 pages
Currency Recognition On Mobile Phones Proposed System Modules
No ratings yet
Currency Recognition On Mobile Phones Proposed System Modules
26 pages
Project Detecto!: A Real-Time Object Detection Model
No ratings yet
Project Detecto!: A Real-Time Object Detection Model
3 pages
Final Year Project
No ratings yet
Final Year Project
57 pages
Introduction To Dimensionality Reduction
No ratings yet
Introduction To Dimensionality Reduction
5 pages
Review On Optical Character Recognition of Devanagari Script Using Neural Network
No ratings yet
Review On Optical Character Recognition of Devanagari Script Using Neural Network
6 pages
Lecture 12 - Deep Learning
No ratings yet
Lecture 12 - Deep Learning
25 pages
جميع اسئلة الرؤيا
No ratings yet
جميع اسئلة الرؤيا
13 pages
Car Parking System
No ratings yet
Car Parking System
53 pages
SL NO. Name Usn Number Roll No
No ratings yet
SL NO. Name Usn Number Roll No
10 pages
Image Processing in GRASS GIS
No ratings yet
Image Processing in GRASS GIS
9 pages
Deep Learning-Based Approach For Sign Language Gesture Recognition With Efficient Hand Gesture Representation
No ratings yet
Deep Learning-Based Approach For Sign Language Gesture Recognition With Efficient Hand Gesture Representation
16 pages
Object Detection
No ratings yet
Object Detection
7 pages
Multiple Object Tracking Using Deep Learning With Yolo v5 IJERTCONV9IS13010
No ratings yet
Multiple Object Tracking Using Deep Learning With Yolo v5 IJERTCONV9IS13010
5 pages
Semi-Supervised Medical Image Classification With Relation-Driven Self-Ensembling Model
No ratings yet
Semi-Supervised Medical Image Classification With Relation-Driven Self-Ensembling Model
12 pages
Handwritten Digit Recognition
0% (1)
Handwritten Digit Recognition
10 pages
Bird Species Identification Using Deep Learning IJERTV8IS040112 6
No ratings yet
Bird Species Identification Using Deep Learning IJERTV8IS040112 6
5 pages
Air Quality Prediction
No ratings yet
Air Quality Prediction
21 pages
Research Methods in Machine Learning: A Content Analysis: Jackson Kamiri Geoffrey Mariga
No ratings yet
Research Methods in Machine Learning: A Content Analysis: Jackson Kamiri Geoffrey Mariga
14 pages
A Novel Segmentation Approach in GA and Its Application in Antenna Array
No ratings yet
A Novel Segmentation Approach in GA and Its Application in Antenna Array
7 pages
Optical Character Recognition (OCR) System
No ratings yet
Optical Character Recognition (OCR) System
5 pages
Machine Learning in The Field of Optical Character Recognition OCR
No ratings yet
Machine Learning in The Field of Optical Character Recognition OCR
5 pages
Syllabus 8TH Sem
No ratings yet
Syllabus 8TH Sem
6 pages
DeepXDE A Deep Learning Library For Solving Differ
No ratings yet
DeepXDE A Deep Learning Library For Solving Differ
17 pages
Swin-Unet: Unet-Like Pure Transformer For Medical Image Segmentation
No ratings yet
Swin-Unet: Unet-Like Pure Transformer For Medical Image Segmentation
14 pages
Automatics Vehicle License Plate Recognition Using MATLAB
No ratings yet
Automatics Vehicle License Plate Recognition Using MATLAB
5 pages
Image Segmentation Using Watershed Transform: Amandeep Kaur, Aayushi
No ratings yet
Image Segmentation Using Watershed Transform: Amandeep Kaur, Aayushi
4 pages
Project Word Report
No ratings yet
Project Word Report
17 pages
Computer Vision Based Attendance Management System For Students
No ratings yet
Computer Vision Based Attendance Management System For Students
6 pages
Proposal To Enhance Fingerprint Recognit
No ratings yet
Proposal To Enhance Fingerprint Recognit
13 pages
Inception Net
No ratings yet
Inception Net
88 pages
A Matlab Project in Optical Character Recognition (OCR) : Introduction: What Is OCR?
No ratings yet
A Matlab Project in Optical Character Recognition (OCR) : Introduction: What Is OCR?
6 pages
Article - An Implicit Model of Consumer Behaviour
No ratings yet
Article - An Implicit Model of Consumer Behaviour
15 pages
Bioimage Data Analysis Workflows
No ratings yet
Bioimage Data Analysis Workflows
178 pages
Segment Anything Is Not Always Perfect: An Investigation of SAM On Different Real-World Applications
No ratings yet
Segment Anything Is Not Always Perfect: An Investigation of SAM On Different Real-World Applications
9 pages
Skilldzire Report PDF
0% (1)
Skilldzire Report PDF
37 pages
Attendance Management System Using Face
No ratings yet
Attendance Management System Using Face
7 pages
Radiographic Bone Texture Analysis Using Deep Learning Models For Early Rheumatoid Arthritis Diagnosis
No ratings yet
Radiographic Bone Texture Analysis Using Deep Learning Models For Early Rheumatoid Arthritis Diagnosis
15 pages
Philippine License Plate Character Recognition Using Faster R-CNN With Inceptionv2
No ratings yet
Philippine License Plate Character Recognition Using Faster R-CNN With Inceptionv2
5 pages
Library Attendance Using QR
No ratings yet
Library Attendance Using QR
7 pages
Biomedicines 11 00184 v3
No ratings yet
Biomedicines 11 00184 v3
22 pages
Fake Account Detection Using Machine Learning and Data Science
No ratings yet
Fake Account Detection Using Machine Learning and Data Science
58 pages
Computer Vision
No ratings yet
Computer Vision
5 pages
Bcse403l Digital-Image-Processing TH 1.1 0 Bcse403l
No ratings yet
Bcse403l Digital-Image-Processing TH 1.1 0 Bcse403l
2 pages
Dynamic SLAM A Visual SLAM in Outdoor Dynamic Scen
No ratings yet
Dynamic SLAM A Visual SLAM in Outdoor Dynamic Scen
15 pages
Spares Segementation
No ratings yet
Spares Segementation
19 pages
Uncertainty-Informed Mutual Learning For Joint Medical Image Classification and Segmentation
No ratings yet
Uncertainty-Informed Mutual Learning For Joint Medical Image Classification and Segmentation
11 pages
IPCV Unit 04
No ratings yet
IPCV Unit 04
12 pages
Image Processing
No ratings yet
Image Processing
2 pages
Monocular 3D Lane Line Detection in Autonomous Driving - A Review - by Patrick Langechuan Liu - Towards Data Science
No ratings yet
Monocular 3D Lane Line Detection in Autonomous Driving - A Review - by Patrick Langechuan Liu - Towards Data Science
12 pages
Computer Vision Module-3 Notes
No ratings yet
Computer Vision Module-3 Notes
25 pages
Dip Unit 5
No ratings yet
Dip Unit 5
57 pages
Buildings 15 02404
No ratings yet
Buildings 15 02404
22 pages
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
From Everand
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
Sebastian Thelen
5/5 (1)