0% found this document useful (0 votes)
39 views20 pages

Text Extraction From Image: Team Members CH - Suneetha (19mcmb22) Mohit Sharma (19mcmb13)

The document discusses text extraction from images using optical character recognition (OCR). It begins by defining image processing and text extraction. Next, it describes the technologies used, including Python, Django, HTML, CSS, SQLite, and the OCR library Pytesseract. The implementation section explains that Pytesseract takes an input image and outputs the text. It then outlines the process: users input images, images are stored in a SQLite database using Django as the backend, and Pytesseract extracts text. Future work involves extracting text from videos. Applications are listed as document analysis, license plate recognition, paper analysis, and video subtitles. The document concludes by citing references used.

Uploaded by

suneetha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
39 views20 pages

Text Extraction From Image: Team Members CH - Suneetha (19mcmb22) Mohit Sharma (19mcmb13)

The document discusses text extraction from images using optical character recognition (OCR). It begins by defining image processing and text extraction. Next, it describes the technologies used, including Python, Django, HTML, CSS, SQLite, and the OCR library Pytesseract. The implementation section explains that Pytesseract takes an input image and outputs the text. It then outlines the process: users input images, images are stored in a SQLite database using Django as the backend, and Pytesseract extracts text. Future work involves extracting text from videos. Applications are listed as document analysis, license plate recognition, paper analysis, and video subtitles. The document concludes by citing references used.

Uploaded by

suneetha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

Text Extraction From Image

Team Members
CH.Suneetha(19mcmb22)
Mohit sharma(19mcmb13)
Agenda
● What is image processing
● What is text extraction
● Technologies
● Implementation
● Future Implementations
● Applications
What is Image Processing
Image processing is a method to perform some operations on an image, in order
to get an enhanced image or to extract some useful information from it.

Input:Image

Output:may be image or characteristics/features associated with that image

Steps In the Image Processing:

1. Importing the image via image acquisition tools;


2. Analysing and manipulating the image;
3. Output in which result can be altered image or report that is based on image
analysis.
What Is Text Extraction
Process of extracting information from the text is text extraction also known as
information extraction.

Input: Digital Image, video

Output: Text Which is Present in the image or video


Technologies and Softwares
Technologies:

● Python
● Django
● Html
● Css
● SqlLite
● Ocr(Optimal Character Recognition)

Softwares

● Atom as text editor


● Operationg System Ubuntu Processor intel core i5 4th gen 4GB RAM
Implementation
To extract a text from image we are using optimal character recognition(OCR)
image processing technique

Optical character recognition (OCR) refers to both the technology and


process of reading and converting typed, printed or handwritten
characters into machine-encoded text or something that the computer
can manipulate.
Implementation Of OCR
We are using python to implement OCR and to Implement there is One Python
Library which is pytesseract.

Pytesseract take the image path as the input and it outputs the text which is
present in the image as output to the user
Process
● First we are design a page for the user to input their image by using html and
css techniques
● The images given by the user will be taken as url and it can be stored in
SqlLite database to connect the front end with the database we are using
django as the backend technology.
● By using pytesseract package in python we are implementing OCR image
processing algorithm that will give a text from the image that te user given
Future Implementations

In Future we are extending this technique to the text extraction from the video
streaming using some image processing techniques
Applications
● Document Analysis
● Licence Plate Extraction from vehicle
● Technical Paper Analysis
● Video Subtitles extraction
Research Paper
References

 1. OCR for Devnagari Script by Mahesh Goyani


 2. Edge Based Text Extraction From Complex Images by Xiaoqing Liu
and Jagath Samarbandhu
 3. Automatic Text Detection using Morphological Operations and
Inpainting by Khyati Vaghela
 4. Font and Background Color Independent Text Binarization by T.Kasar
, J.Kumar , A.G. Ramkrishnan
 Research paper link http://www.ijera.com/papers/Vol8_issue5/Part-
5/D0805052733.pdf
Thank You

You might also like