0% found this document useful (0 votes)

101 views7 pages

COURSE - Digital Image Processing PDF

This document provides information about a computer vision based text scanner project. It discusses a group of 6 students working on developing a system that can scan images, such as a sudoku puzzle, and extract the text. The methodology involves image acquisition, preprocessing, detecting the sudoku grid, warping the image, and using a neural network to recognize digits in each tile. The results demonstrate the image processing steps and digit recognition. The conclusion discusses potential applications and improvements to the technology.

Uploaded by

Ganesh Inguva

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views7 pages

COURSE - Digital Image Processing PDF

Uploaded by

Ganesh Inguva

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

COURSE: Digital Image Processing

PROJECT: Computer Vision Based Text Scanner

GROUP NUMBER: 22

TEAM:

NAME ID NUMBER

Arnav Jain 2017B3AA1378H

Vasu Sood 2017B4A31476H

Mihir Vilas Shende 2017B5A31157H

Thambabathula Omana 2018A3PS0553H

I M V S Ganesh 2018AAPS0389H

Toran Maheshwari 2017B3A80948H

INTRODUCTION

We humans, have a very robust visual system, which helps us to identify people and objects,
play sports, perform operations, drive vehicles, read, and so on.

Although it might seem that we do not put any special effort to do most of these tasks human
visual system is fairly complex to replicate and implement.

Computer Vision, in the simplest terms, is the automation of such a visual system, so that
computers or machines, in general, can obtain high level understanding of the environment from
digital images and videos.

In the manufacturing sector identifying defective products and ensuring quality and

accuracy is of utmost importance.

Object detection, deals with detecting instances of objects of a certain class, such as humans,
buildings, or cars in digital images and videos. Computer Vision is vital in implementing object
detection from digital images.
OBJECTIVE

To develop a computer vision based text scanner that will scan through any image (Example: a
Sudoku Puzzle from the Newspaper) to obtain the respective text from it.

METHODOLOGY

Any computer vision application starts with Image acquisition (Image acquisition is the digital
representation of the visual characteristics of the physical world).Image sensors are used to
detect and capture the information required to make an image.

The images acquired are then processed in the next stage. In this step, the signals in the acquired
images are filtered to remove the noise or any irrelevant frequencies. If needed the images are
padded and transformed to a different space, so as to make them ready for the actual analysis.

The processed images are then analysed to extract useful information, this involves pattern
identification, colour recognition, object recognition, feature extraction, motion tracking, image
segmentation, etc.

Finally, the high dimensional data obtained from all the above steps is used to produce
meaningful numerical information, which leads to making decisions.

SCRIPTS

main.py

This script combines all the scripts given below.

christopher.py

This script consists of a Convolutional Neural Network trained on a custom dataset.

basic.py
This script is used to take as input the original image, apply pre-processing, get the corner
points of the board, warp the image and separate out the individual smaller grids (tiles)
containing the individual digits/blanks.

sud.py

This script is used to take the individual tiles, does a bit of pre-processing and predict the digits
in each tile. As the grid is a 9x9, the number of tiles are 81.

RESULTS

1) Image Processing

The image is converted to grayscale and further Adaptive Thresholding and Dilation are applied
to the image to reduce noise and enhance contours. After this happens, the coordinates of the
maximum area (the Sudoku grid) in the image are found.

2) Warping

Using the coordinates found, we warp the image and form individual grids on the image. These
individual grids will help in extracting out the smaller tiles which contain a single digit or a blank.

3) Digit Recognition

The individual grids are passed into a convolutional neural network (Christopher) which is
pre-trained on a custom dataset. These grids are identified and returned in the form of a list.
CONCLUSION

We can scan a sudoku puzzle of an image using our camera scanner and then convert it into text,
using warping & Artificial intelligence. But in future we plan to work on how to scan any image
to successfully obtain respective texts on it. Further improvements and adaptations of this
technology can help us, it improves the Searching ability of our computers, as our data records
continue to get bigger and more complex, computers with OCR will make record searching
much easier. Computers with OCR can scan a document and store it in a database, making it
easier to quickly retrieve it in the future. Inclusion of AI, such that it can read text from images
or banners etc. can use in-built software to quickly translate it into your preferred language,
boosting communication. Also, AI-enabled with OCR would be able to read paper bills and
records, analyse complex charts, offer recommendations and take business decisions. AI that is
capable of recognizing facial expressions can understand how people around them are feeling.
This offers benefits in hospitality and healthcare sectors. Assembly processing robots with
computer vision will be able to identify defective products or rotten produce and separate them
from quality products.

REFERENCES

A. Rosenfeld and A. C. Kak, Digital Picture Processing, Academic Press, New York, 1976

K. S. Fu and A. Rosenfeld, Pattern recognition and image processing,

IEEE Trans, on Computers C-25, 1976, 1336–1346.

A. R. Hanson and E. M. Riseman, Design of VISIONS: segmentation

and interpretation of images, in Conference Record, 1976 Joint
Workshop on Pattern Recognition and Artificial Intelligence (IEEE
Publ. 76CH1169-2C), pp. 135-144.

https://www.cv-foundation.org/openaccess/content_cvpr_2015/html/Zhang_Symmetry-Bas
ed_Text_Line_2015_CVPR_paper.html

http://iihm.imag.fr/publs/2003/procam03_magictable_berard.pdf

http://opencv-python-tutroals.readthedocs.io/en/latest/py_tutorials/py_imgproc/py_table_o
f_contents_imgproc/py_table_of_contents_imgproc.html

https://scholar.google.co.in/citations?hl=en&vq=eng_computervisionpatternrecognition&vi
ew_op=list_hcore&venue=x0SOFhwf7eMJ.2020

Plagiarism Scan Report: Plagiarised Unique Words Characters
No ratings yet
Plagiarism Scan Report: Plagiarised Unique Words Characters
2 pages
Unit 1
No ratings yet
Unit 1
20 pages
Visual Based Product Identification For Blind: Project Report On
No ratings yet
Visual Based Product Identification For Blind: Project Report On
23 pages
Hussien 2021 J. Phys. Conf. Ser. 1973 012002
No ratings yet
Hussien 2021 J. Phys. Conf. Ser. 1973 012002
9 pages
CH 3
No ratings yet
CH 3
22 pages
CO1 Notes
No ratings yet
CO1 Notes
105 pages
Image Processing
No ratings yet
Image Processing
105 pages
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Report OF Dip
No ratings yet
Report OF Dip
13 pages
Chap 1
No ratings yet
Chap 1
6 pages
Unit 1 To 5 Computer Vision and Image Processing
No ratings yet
Unit 1 To 5 Computer Vision and Image Processing
56 pages
Image Recognition in Artificial Intelligence
100% (2)
Image Recognition in Artificial Intelligence
11 pages
Digital Image Processing
No ratings yet
Digital Image Processing
30 pages
Computer Vision
No ratings yet
Computer Vision
23 pages
Computer Vision
No ratings yet
Computer Vision
23 pages
"Introduction To Computer Vision": Submitted by
No ratings yet
"Introduction To Computer Vision": Submitted by
45 pages
Image Processing Projects FALL 2024
No ratings yet
Image Processing Projects FALL 2024
36 pages
DIGITAL IMAGE PROCESSING Full Report
No ratings yet
DIGITAL IMAGE PROCESSING Full Report
10 pages
Digital Image Processing Full Report
No ratings yet
Digital Image Processing Full Report
9 pages
"Camera Based Product Information Reading For Blind People": Priyanka Patil, Sonali Solat, Shital Hake
No ratings yet
"Camera Based Product Information Reading For Blind People": Priyanka Patil, Sonali Solat, Shital Hake
4 pages
52 BDB
No ratings yet
52 BDB
3 pages
Computational Approaches To Image Understanding: Michael Brady
No ratings yet
Computational Approaches To Image Understanding: Michael Brady
69 pages
Computer Security
No ratings yet
Computer Security
23 pages
Computer Vision
No ratings yet
Computer Vision
13 pages
Object Sorting in Manufacturing Industries Using Image Processing
No ratings yet
Object Sorting in Manufacturing Industries Using Image Processing
9 pages
Computer Vision Advancement Rebecca
No ratings yet
Computer Vision Advancement Rebecca
17 pages
Computer Vision
No ratings yet
Computer Vision
19 pages
Unit - I Computer Vision Fundamentals
No ratings yet
Unit - I Computer Vision Fundamentals
25 pages
Computer Vision: Dr. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
No ratings yet
Computer Vision: Dr. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
21 pages
Computer Vision XTH
No ratings yet
Computer Vision XTH
9 pages
Implementation of Handwritten Digit Recognizer Using CNN: Vinjit, Bhojak, Kumar and Nikam
No ratings yet
Implementation of Handwritten Digit Recognizer Using CNN: Vinjit, Bhojak, Kumar and Nikam
9 pages
Computer Vision
No ratings yet
Computer Vision
14 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
Synopsis of Real Time Security System: Submitted in Partial Fulfillment of The Requirements For The Award of
No ratings yet
Synopsis of Real Time Security System: Submitted in Partial Fulfillment of The Requirements For The Award of
6 pages
Digital Image Processing
No ratings yet
Digital Image Processing
10 pages
Object Detection and Currency Recognition Using CNN
No ratings yet
Object Detection and Currency Recognition Using CNN
6 pages
Final Project Requriment
No ratings yet
Final Project Requriment
5 pages
DGANG
No ratings yet
DGANG
12 pages
Understand Computer Vision
No ratings yet
Understand Computer Vision
2 pages
Project Report Final 1
No ratings yet
Project Report Final 1
63 pages
IJRPR11842
No ratings yet
IJRPR11842
6 pages
SE Project
No ratings yet
SE Project
15 pages
CAD Phase5
No ratings yet
CAD Phase5
10 pages
Project Synopsis22
No ratings yet
Project Synopsis22
9 pages
What Is Computer Vision
No ratings yet
What Is Computer Vision
18 pages
Computer Vision
No ratings yet
Computer Vision
15 pages
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
No ratings yet
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
48 pages
Image Manipulation Finall
No ratings yet
Image Manipulation Finall
7 pages
Intro Ai Group3
No ratings yet
Intro Ai Group3
35 pages
Computer Vision
No ratings yet
Computer Vision
29 pages
Computer Vision Project
No ratings yet
Computer Vision Project
33 pages
Report Digital Image Processing On Edge Detection of Image
100% (2)
Report Digital Image Processing On Edge Detection of Image
15 pages
Computer Vision
No ratings yet
Computer Vision
36 pages
Digital Image Processing
No ratings yet
Digital Image Processing
10 pages
Real Time Object Detection Using Deep Learning Andmachine Learning Project
No ratings yet
Real Time Object Detection Using Deep Learning Andmachine Learning Project
56 pages
Rendering Computer Graphics: Exploring Visual Realism: Insights into Computer Graphics
From Everand
Rendering Computer Graphics: Exploring Visual Realism: Insights into Computer Graphics
Fouad Sabry
No ratings yet
Fundamentals of Digital Image Processing
From Everand
Fundamentals of Digital Image Processing
Dandak Kaniyar
No ratings yet
Percept: Fundamentals and Applications
From Everand
Percept: Fundamentals and Applications
Fouad Sabry
No ratings yet
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
From Everand
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
Fouad Sabry
No ratings yet
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
From Everand
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
Fouad Sabry
No ratings yet
Auto-Tuning of PID Controller For A Boost Converter Using Modified Relay Feedback Test
No ratings yet
Auto-Tuning of PID Controller For A Boost Converter Using Modified Relay Feedback Test
5 pages
Pega (PRPC) Concepts PDF
No ratings yet
Pega (PRPC) Concepts PDF
14 pages
Mathematics of Codes: Topics (And Subtopics)
No ratings yet
Mathematics of Codes: Topics (And Subtopics)
19 pages
Delft3D-WAVE User Manual PDF
No ratings yet
Delft3D-WAVE User Manual PDF
226 pages
14 NLP
No ratings yet
14 NLP
20 pages
Log
No ratings yet
Log
119 pages
QC Module 3 (Methods of Marker Planning)
No ratings yet
QC Module 3 (Methods of Marker Planning)
18 pages
HUAWEI FLA-LX3 9.1.0.116 (C605E5R1P1) Release Notes
No ratings yet
HUAWEI FLA-LX3 9.1.0.116 (C605E5R1P1) Release Notes
10 pages
Vortex Tube Thesis
100% (3)
Vortex Tube Thesis
8 pages
NTCC Sem VI Major Project WPR
No ratings yet
NTCC Sem VI Major Project WPR
12 pages
5543978
No ratings yet
5543978
2 pages
Reda Hps PDF
100% (1)
Reda Hps PDF
1 page
Candidate Handbook
No ratings yet
Candidate Handbook
66 pages
Test48 - Google Search
No ratings yet
Test48 - Google Search
3 pages
Microsoft FLow Offficial Documentation
100% (2)
Microsoft FLow Offficial Documentation
538 pages
2005 RG Body
No ratings yet
2005 RG Body
1,402 pages
Lenovo IdeaPad Flex 5 14 2-In-1 Touchscreen Lapt
No ratings yet
Lenovo IdeaPad Flex 5 14 2-In-1 Touchscreen Lapt
1 page
Some Introductory Concepts On Fiberr Optic System
No ratings yet
Some Introductory Concepts On Fiberr Optic System
36 pages
DP-200 Dump
No ratings yet
DP-200 Dump
164 pages
Ihp w22 Model Answer Paper 22655
No ratings yet
Ihp w22 Model Answer Paper 22655
14 pages
Homecharger: Type 1 Plug Type 2 Plug Type 2 Socket
No ratings yet
Homecharger: Type 1 Plug Type 2 Plug Type 2 Socket
2 pages
PDF 132821 67441
No ratings yet
PDF 132821 67441
10 pages
BB - Cac Phuong Phap Dieu Khien Tien Tien Nham Nang Cao Chat Luong Va TKNL - 11tr
No ratings yet
BB - Cac Phuong Phap Dieu Khien Tien Tien Nham Nang Cao Chat Luong Va TKNL - 11tr
11 pages
Chapter 2
100% (1)
Chapter 2
40 pages
Setting Up OpenVPN Server On Ubuntu
No ratings yet
Setting Up OpenVPN Server On Ubuntu
35 pages
Journal of Computer Science and Informat
No ratings yet
Journal of Computer Science and Informat
192 pages
ZYAROCK Artec Pot Leaflet (En)
No ratings yet
ZYAROCK Artec Pot Leaflet (En)
2 pages
ME990-IH-Section 2a - LongBoltFlangeDesignProblems
No ratings yet
ME990-IH-Section 2a - LongBoltFlangeDesignProblems
15 pages
Intermediate Level
No ratings yet
Intermediate Level
41 pages
How To Write An Email in English
No ratings yet
How To Write An Email in English
58 pages

COURSE - Digital Image Processing PDF

Uploaded by

COURSE - Digital Image Processing PDF

Uploaded by

COURSE:​ Digital Image Processing

PROJECT:​ Computer Vision Based Text Scanner

Arnav Jain 2017B3AA1378H

Vasu Sood 2017B4A31476H

Mihir Vilas Shende 2017B5A31157H

Thambabathula Omana 2018A3PS0553H

Toran Maheshwari 2017B3A80948H

accuracy is of utmost importance.

This script combines all the scripts given below.

This script consists of a Convolutional Neural Network trained on a custom dataset.

K. S. Fu and A. Rosenfeld, Pattern recognition and image processing,

A. R. Hanson and E. M. Riseman, Design of VISIONS: segmentation

You might also like

COURSE: Digital Image Processing

PROJECT: Computer Vision Based Text Scanner