Micro-project OCR Finally

The document outlines a project seminar on Object Character Recognition (OCR) at St. Vincent Pallotti College of Engineering and Technology for the academic year 2024-25. It discusses the need for an automated OCR system to efficiently extract text from images and scanned documents, highlighting existing challenges and literature in the field. The project aims to leverage machine learning techniques to enhance text digitization, with specified hardware and software requirements.

Uploaded by

officialnishant2005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

Micro-project OCR Finally

Uploaded by

officialnishant2005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

St.

Vincent Pallotti College of Engineering and Technology

DEPARTMENT OF ARTIFICIAL INTELLIGENCE

Academic Year 2024-25

Project Seminar
On
“OBJECT CHARACTER
RECOGNITION”
Name of Guide: Project Members:
Mayank Charde (23010042)
Prof. Aparitosh Gahankari Tanay Makde(23010041)
Aditya Menon(23010062)
Nishant Tiwari(23010061)
Table of Content
• Introduction
• Problem Definition
• Literature Survey
• Comparative analysis of literature survey
• Tentative project flow(optional)
• Hardware & Software Requirements
• References
Introduction
Object Character Recognition (OCR) is a technology that detects and extracts text from images or scanned
documents. It converts printed or handwritten characters into machine-readable text, enabling editing,
searching, and processing. Commonly used in digitizing documents, license plate recognition, and receipt
scanning.

Problem Statement
Extracting text from images and scanned documents manually is time-consuming and error-prone.
Traditional methods do not efficiently convert printed or handwritten text into a digital format, limiting
accessibility, searchability, and editability of important documents. There is a need for an automated solution
to accurately recognize and digitize text from various sources.

Objective
Develop an Optical Character Recognition (OCR) system that leverages machine learning to accurately
extract text from images and scanned documents. The system should efficiently convert printed or
handwritten text into digital form, enabling easier editing, searching, and storage of textual information.
Literature Survey
Sr. no Title of paper Author name Name of conference / journal Year Findings

1 Optical Character Alaa Najmi Prachi Tiwari, Meetkumar 2024 An ml project OCR and NER for highly
Recognition and Named confidential docs focuses on accurately
Entity Recognition for Patel extracting text while ensuring sensitive data
Highly Confidential is securely identified and redacted.
Documents
2 General OCR Theory Anonymous Rajat Sharma, Adweteeya 2025 Model generalization, data and accuracy,
user-centric performance
Dwivedi
3 A Comprehensive Study of Ravi Raj A. Kos June 2022 This study examines the evolution of OCR
Optical Character technologies, highlighting their applications in
Recognition" document scanning and translation services. It
discusses the challenges related to accuracy and
processing time, emphasizing the need for
improved methodologies to enhance OCR
performance across various languages and scripts.
4 Handwritten Optical Character Jamshed Memon, Ahmed Khan Jan 2020 This systematic literature review analyzes research
Recognition (OCR) Maira Sami, Rizwan conducted on handwritten OCR from 2000 to 2018.
It synthesizes findings from 142 selected articles,
providing insights into the techniques used for
character recognition and identifying research gaps
for future exploration.

08-04-2025 4
Algorithm-Techniques-Tools
Basic Setup (For Small-Scale OCR)

CPU : Intel i5 / AMD Ryzen 5 or higher

RAM: 8GB (16GB recommended)
Storage: 256GB SSD or more
GPU : Integrated GPU (optional for basic OCR)

OS: Windows, Linux (Ubuntu), macOS Programming Language

Libraries & Tools : Flask==2.3.2

Flask-CORS==4.0.0
pytesseract==0.3.10
Pillow==10.0.0
opencv-python-headless==4.8.0.74
numpy==1.24.3
Workflow of OCR Model
Modules of OCR System

• Flask (2.3.2)
A lightweight web framework for building web applications and APIs in Python.
• Flask-CORS (4.0.0)
A Flask extension that enables Cross-Origin Resource Sharing (CORS)
support for your web app/API.
• Pytesseract (0.3.10)
A Python wrapper for Google’s Tesseract-OCR engine, used to extract text
from images.
• Pillow (10.0.0)
A powerful image processing library in Python; a modern fork of the PIL
(Python Imaging Library)
• Opencv-python-headless (4.8.0.74)
A headless version of OpenCV for image and video processing (no GUI
features included, ideal for servers).
• Numpy (1.24.3)
A core library for numerical computing in Python, often used for handling arrays
and matrices.
Completion Status
Screenshots of Work Done
Thank you!

08-04-2025 8
Any suggestions ?

08-04-2025 9

Harvard STAT E100
No ratings yet
Harvard STAT E100
5 pages
Optical Character Recognition:: An Illustrated Guide To The Frontier
No ratings yet
Optical Character Recognition:: An Illustrated Guide To The Frontier
197 pages
A12REVIEW
No ratings yet
A12REVIEW
18 pages
Optical Character Recognition: Presented By: - Vikas Shukla - Raj Singh
No ratings yet
Optical Character Recognition: Presented By: - Vikas Shukla - Raj Singh
11 pages
3 M&a
No ratings yet
3 M&a
24 pages
Adarsh Kumar Singh ( (1NH21MC004) )
No ratings yet
Adarsh Kumar Singh ( (1NH21MC004) )
28 pages
Optical Character Recognition: Kaivan Gandhi 60001160012 Rahul Jha 60001160019 Shagun Vasmatkar 60001160061
No ratings yet
Optical Character Recognition: Kaivan Gandhi 60001160012 Rahul Jha 60001160019 Shagun Vasmatkar 60001160061
7 pages
OCR PRESENTATION
No ratings yet
OCR PRESENTATION
15 pages
Optical Character Recognition: Article
No ratings yet
Optical Character Recognition: Article
5 pages
A Survey of Modern Optical Character Rec PDF
No ratings yet
A Survey of Modern Optical Character Rec PDF
37 pages
Handwritten Optical Character Recognition
No ratings yet
Handwritten Optical Character Recognition
2 pages
Handwritten Optical Character Recognition (OCR) : A Comprehensive Systematic Literature Review (SLR)
No ratings yet
Handwritten Optical Character Recognition (OCR) : A Comprehensive Systematic Literature Review (SLR)
28 pages
IP MINI GD (Ver02) FINAL DG
No ratings yet
IP MINI GD (Ver02) FINAL DG
18 pages
OCR Project Report PDF
No ratings yet
OCR Project Report PDF
24 pages
Optical Character Recognition: Article
No ratings yet
Optical Character Recognition: Article
5 pages
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
No ratings yet
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
24 pages
Ocr PDF
No ratings yet
Ocr PDF
5 pages
Optical Character Recognition Project Report
No ratings yet
Optical Character Recognition Project Report
71 pages
ANN Miniproject Report
No ratings yet
ANN Miniproject Report
11 pages
Optical_character_recognition_system_using_artific
No ratings yet
Optical_character_recognition_system_using_artific
7 pages
Optical Character Recognition - Project Report
100% (1)
Optical Character Recognition - Project Report
84 pages
Optical Character Recognition
No ratings yet
Optical Character Recognition
3 pages
Optical Character Recognizer: Team Member
No ratings yet
Optical Character Recognizer: Team Member
7 pages
Optical Character Recognition: Divyanshu Sagar Ahmed Zaid Faizee Vidyut Singhania
No ratings yet
Optical Character Recognition: Divyanshu Sagar Ahmed Zaid Faizee Vidyut Singhania
11 pages
OCR PPT GRP 12
No ratings yet
OCR PPT GRP 12
10 pages
Optical Character Recognition - OCR Text Recognition
No ratings yet
Optical Character Recognition - OCR Text Recognition
11 pages
OCR Using Tesseract
100% (2)
OCR Using Tesseract
37 pages
Optical Character Recognition: Made By: Dhairya Goel-02814803115 Madhwan Sharma-60214803115
No ratings yet
Optical Character Recognition: Made By: Dhairya Goel-02814803115 Madhwan Sharma-60214803115
15 pages
fin_irjmets1684836352
No ratings yet
fin_irjmets1684836352
7 pages
Raj Synopsis12
No ratings yet
Raj Synopsis12
5 pages
OCR Presentation
No ratings yet
OCR Presentation
16 pages
Fi Pdflatex mk4 - Bezdeklarace
No ratings yet
Fi Pdflatex mk4 - Bezdeklarace
41 pages
Ocr
No ratings yet
Ocr
16 pages
ML Report
No ratings yet
ML Report
5 pages
Vaidhi Ayush Gurkirat Jatin Project Synopsis Format
No ratings yet
Vaidhi Ayush Gurkirat Jatin Project Synopsis Format
6 pages
Build Your Own Optical Character Recognition (Ocr) System Using Google'S Tesseract and Opencv
No ratings yet
Build Your Own Optical Character Recognition (Ocr) System Using Google'S Tesseract and Opencv
10 pages
Jagruthi Institute of Engineering and Technology: Optical Character Recognition
No ratings yet
Jagruthi Institute of Engineering and Technology: Optical Character Recognition
28 pages
Text Detector (OCR)
No ratings yet
Text Detector (OCR)
12 pages
Design of An OCR System and Its Hardware Implementation
No ratings yet
Design of An OCR System and Its Hardware Implementation
18 pages
Extraction of Information From Handwriting Using Optical Character Recognition and Neural Networks
No ratings yet
Extraction of Information From Handwriting Using Optical Character Recognition and Neural Networks
6 pages
Machine Learning in The Field of Optical Character Recognition OCR
No ratings yet
Machine Learning in The Field of Optical Character Recognition OCR
5 pages
SL NO. Name Usn Number Roll No
No ratings yet
SL NO. Name Usn Number Roll No
10 pages
Optical Character Recognition System
No ratings yet
Optical Character Recognition System
41 pages
10 1109@icirca48905 2020 9183326
No ratings yet
10 1109@icirca48905 2020 9183326
6 pages
IJMIE1April24 55698
No ratings yet
IJMIE1April24 55698
7 pages
Bilingual_OCR_Report
No ratings yet
Bilingual_OCR_Report
10 pages
Development of Text Extraction Technique 3acb33e9
No ratings yet
Development of Text Extraction Technique 3acb33e9
8 pages
Optical Character Recognition: Selected Topics in Computer Science
No ratings yet
Optical Character Recognition: Selected Topics in Computer Science
7 pages
Mini Project-04,52 00
No ratings yet
Mini Project-04,52 00
85 pages
Bengal College of Engineering and Technology, Durgapur: "Handwritten Text Recognition"
No ratings yet
Bengal College of Engineering and Technology, Durgapur: "Handwritten Text Recognition"
15 pages
Surrvey Paper On Intelligent Reader For Visually Impaired People
No ratings yet
Surrvey Paper On Intelligent Reader For Visually Impaired People
5 pages
9589-First Manuscript-57755-2-10-20220620 - X
No ratings yet
9589-First Manuscript-57755-2-10-20220620 - X
12 pages
Review On Optical Character Recognition of Devanagari Script Using Neural Network
No ratings yet
Review On Optical Character Recognition of Devanagari Script Using Neural Network
6 pages
Hand Written Character Recognition Using Neural Network: BACHELOR OF ENGINEERING (Computer Engineering)
No ratings yet
Hand Written Character Recognition Using Neural Network: BACHELOR OF ENGINEERING (Computer Engineering)
46 pages
Text Detection in Natural Scene Images Using Ocr Algorithm
No ratings yet
Text Detection in Natural Scene Images Using Ocr Algorithm
3 pages
Research Paper On OCR
No ratings yet
Research Paper On OCR
4 pages
Seminar Report On Optical Character Recognition: Submitted By
No ratings yet
Seminar Report On Optical Character Recognition: Submitted By
27 pages
Charter & WBS For OCR
No ratings yet
Charter & WBS For OCR
3 pages
Optical Character Recognition: Types of OCR
No ratings yet
Optical Character Recognition: Types of OCR
1 page
Optical Character Recognition: Fundamentals and Applications
From Everand
Optical Character Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
The Art of Rust: Professional Patterns for Clean, Efficient, and Maintainable Code
From Everand
The Art of Rust: Professional Patterns for Clean, Efficient, and Maintainable Code
Aarav Joshi
No ratings yet
Rishi Tapase
No ratings yet
Rishi Tapase
1 page
TSD
No ratings yet
TSD
38 pages
2025 Design Briefing OaklandUniversity Car%23138.PDF
No ratings yet
2025 Design Briefing OaklandUniversity Car%23138.PDF
54 pages
Micro Project[1] Aipt Practical
No ratings yet
Micro Project[1] Aipt Practical
7 pages
Permutation & Combination
No ratings yet
Permutation & Combination
5 pages
DSS Ch02
No ratings yet
DSS Ch02
7 pages
DSAP-Lecture 5 - Array Based Sequences
No ratings yet
DSAP-Lecture 5 - Array Based Sequences
45 pages
Ooo
No ratings yet
Ooo
19 pages
F 2 CMG 04
No ratings yet
F 2 CMG 04
374 pages
New Rich Text Document
No ratings yet
New Rich Text Document
8 pages
Data Structures CW
No ratings yet
Data Structures CW
4 pages
RAG Based Question-Answering For Contextual Response Prediction System
No ratings yet
RAG Based Question-Answering For Contextual Response Prediction System
10 pages
Serial Interface Installation and Set-Up Guide V 1.4
No ratings yet
Serial Interface Installation and Set-Up Guide V 1.4
21 pages
Log
No ratings yet
Log
2 pages
4.how To Secure A Network With Linux
No ratings yet
4.how To Secure A Network With Linux
10 pages
Winsmart Academy Cit303 Exam Summary 08024665051
No ratings yet
Winsmart Academy Cit303 Exam Summary 08024665051
43 pages
Network Simulator (NS-2)
100% (1)
Network Simulator (NS-2)
60 pages
ADVENTURE WORKS
No ratings yet
ADVENTURE WORKS
14 pages
Array Implementation of List ADT
No ratings yet
Array Implementation of List ADT
5 pages
BPMN Pro Poster
No ratings yet
BPMN Pro Poster
2 pages
Mitsubishi EVO IV-VIII Installation Manual
No ratings yet
Mitsubishi EVO IV-VIII Installation Manual
27 pages
Folder Spectrolyser v3 en Web
No ratings yet
Folder Spectrolyser v3 en Web
2 pages
Hotel Management Project Report
No ratings yet
Hotel Management Project Report
6 pages
UserManual en Parte3
No ratings yet
UserManual en Parte3
150 pages
In The Key of Your Commands
No ratings yet
In The Key of Your Commands
8 pages
5149-1 - Vilink 4.0 For Filmarray 2.0 and Filmarray Torch - Att. 2 - Rev 1 - Bfr0001-6528 Connecting Filmarray 2.0 and Torch To Vilink
No ratings yet
5149-1 - Vilink 4.0 For Filmarray 2.0 and Filmarray Torch - Att. 2 - Rev 1 - Bfr0001-6528 Connecting Filmarray 2.0 and Torch To Vilink
35 pages
9500 MPR Users Manual Tempest Telecom Solutions
100% (1)
9500 MPR Users Manual Tempest Telecom Solutions
188 pages
Net User Command For Windows Server 2012
No ratings yet
Net User Command For Windows Server 2012
4 pages
David latest cv
No ratings yet
David latest cv
1 page
PL-SQL: Unit 5
No ratings yet
PL-SQL: Unit 5
21 pages
Cyber Forensics Principles: Jayaram P Cdac
No ratings yet
Cyber Forensics Principles: Jayaram P Cdac
54 pages
Design of A Web-Based Personalized E-Learning Plat
No ratings yet
Design of A Web-Based Personalized E-Learning Plat
7 pages
CG Lab Report
No ratings yet
CG Lab Report
167 pages
04 - 000durability Analysis 101
No ratings yet
04 - 000durability Analysis 101
4 pages

Micro-project OCR Finally

Uploaded by

Micro-project OCR Finally

Uploaded by

St.

Vincent Pallotti College of Engineering and Technology

DEPARTMENT OF ARTIFICIAL INTELLIGENCE

Academic Year 2024-25

CPU : Intel i5 / AMD Ryzen 5 or higher

OS: Windows, Linux (Ubuntu), macOS Programming Language

Libraries & Tools : Flask==2.3.2

You might also like