OCR Web App Prototype Assignment

Uploaded by

Sidharth

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views3 pages

OCR Web App Prototype Assignment

Uploaded by

Sidharth

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Assignment: OCR and Document Search Web Application Prototype

Note - DO NOT REPLY TO THE EMAIL TO SAY “I ACCEPT” OR ANYTHING.

DIRECTLY SUBMIT THE ASSIGNMENT TO - [Link]

DO NO SEND ANY EMAILS UNLESS ABSOLUTELY NECESSARY.

FEEL FREE TO MAKE ANY ASSUMPTIONS IF ANYTHING IS UNCLEAR & MENTION IT IN

YOUR NOTE.

Objective

Develop and deploy a web-based prototype that demonstrates the ability to perform Optical
Character Recognition (OCR) on an uploaded image (in picture format) containing text in both
Hindi and English. The web application should also implement a basic keyword search
functionality based on the extracted text. The prototype must be accessible via a live URL.

Scope of the Assignment

This assignment focuses on creating a web application that allows users to upload a single
image, processes the image to extract text using OCR, and provides a basic search feature.
The application must be deployed and accessible online.

Tasks

Task 1: Setup and OCR Implementation

1. Environment Setup:
○ Set up a Python environment with the necessary libraries, including Huggingface
Transformers, PyTorch, and any other dependencies required for OCR.
○ Explore the following OCR models and choose one to implement:
■ ColPali implementation of the new Byaldi library + Huggingface
transformers for Qwen2-VL.
■ General OCR Theory (GOT), a 580M end-to-end OCR 2.0 model.
2. OCR Model Integration:
○ Implement the chosen OCR model to process a single uploaded image (JPEG,
PNG, or other common picture formats) containing text in both Hindi and English.
○ Ensure the model successfully extracts text from the image and returns the
extracted text in a structured format (JSON or plain text).

Task 2: Web Application Development

1. Web Application:
○ Develop a simple web application using Gradio or Streamlit.
○ The application should allow users to:
■ Upload an image file for OCR processing.
■ Display the extracted text from the image.
■ Enter keywords to search within the extracted text.
○ Display search results on the same page, highlighting the matching sections.

Task 3: Deployment

1. Deploy the Web Application:

○ Deploy the web application on platforms like Hugging Faces, Streamlit Sharing,
or any other suitable platform.
○ Ensure the application is accessible via a public URL.

Deliverables

1. Code Submission:
○ Python scripts for the web application, including the OCR processing and search
functionality.
○ A README file explaining how to set up the environment, run the web
application locally, and details about the deployment process.
2. Live Web Application:
○ The live URL of the deployed web application where the OCR and search
functionalities can be tested.
3. Extracted Text and Search Output:
○ JSON or plain text output of the extracted text from the uploaded image.
○ Demonstration of the search functionality with example keywords.

Evaluation Criteria

● Accuracy: How well the OCR model extracts text from both Hindi and English sections
of the image.
● Functionality: The web application should correctly handle image uploads, extract text,
and allow keyword searches.
● User Interface: The web interface should be simple, intuitive, and functional.
● Deployment: The application must be accessible online, with a reliable deployment
process.
● Clarity: Clear and concise documentation and code structure.
● Completeness: All deliverables are submitted and demonstrate the required
functionality.

Deadline

● Submission Deadline: 1 week from receiving the assignment.

Instructions for Submission

● Submit a ZIP file containing all your code, the README file, and any additional
resources (e.g., screenshots of the web application).
● Provide the live URL of the deployed web application.

IT ProjectManagement
No ratings yet
IT ProjectManagement
13 pages
Anas Anwer
No ratings yet
Anas Anwer
2 pages
Without MCP Claude Output
No ratings yet
Without MCP Claude Output
12 pages
Bilingual OCR Report
No ratings yet
Bilingual OCR Report
10 pages
Jayadhi Entry-Level Internship Assignment
No ratings yet
Jayadhi Entry-Level Internship Assignment
5 pages
OCR Model Training Assignment
No ratings yet
OCR Model Training Assignment
3 pages
2 Years Experience
No ratings yet
2 Years Experience
3 pages
Tannistha Maharana (22052605)
No ratings yet
Tannistha Maharana (22052605)
8 pages
Byte Brawl
No ratings yet
Byte Brawl
11 pages
Backend Test Raft
No ratings yet
Backend Test Raft
5 pages
Ocr 2
No ratings yet
Ocr 2
42 pages
AKM Deep Learning Project.
No ratings yet
AKM Deep Learning Project.
4 pages
Post-Interview Evaluation Test1
No ratings yet
Post-Interview Evaluation Test1
2 pages
Cream Neutral Minimalist New Business Pitch Deck Present
No ratings yet
Cream Neutral Minimalist New Business Pitch Deck Present
14 pages
Multilingual PDF Search Solution
No ratings yet
Multilingual PDF Search Solution
4 pages
Synopsis
No ratings yet
Synopsis
3 pages
Python AI Engineer Hiring Assignment
No ratings yet
Python AI Engineer Hiring Assignment
5 pages
Project Ocr Timeline - Sheet1
No ratings yet
Project Ocr Timeline - Sheet1
2 pages
EasyOCR: Multilingual Text Recognition
No ratings yet
EasyOCR: Multilingual Text Recognition
11 pages
NLP Text Summarization App
No ratings yet
NLP Text Summarization App
3 pages
YouTube Transcript Summarizer Guide
100% (1)
YouTube Transcript Summarizer Guide
11 pages
Ocr&Promptengineering
No ratings yet
Ocr&Promptengineering
6 pages
6874faecd848a Adobe India Hackathon - Challenge
No ratings yet
6874faecd848a Adobe India Hackathon - Challenge
10 pages
Local Chatbot SoW
No ratings yet
Local Chatbot SoW
3 pages
Product Manager Task334
No ratings yet
Product Manager Task334
2 pages
IDEH Assignment
No ratings yet
IDEH Assignment
4 pages
Harshit AI ML Engineer
No ratings yet
Harshit AI ML Engineer
4 pages
AI Project Challenges for Developers
No ratings yet
AI Project Challenges for Developers
6 pages
AI Toolkit Project Synopsis
No ratings yet
AI Toolkit Project Synopsis
3 pages
Multilingual Ocr System
No ratings yet
Multilingual Ocr System
3 pages
Main Capstone PDF
No ratings yet
Main Capstone PDF
14 pages
Gen AI Content
No ratings yet
Gen AI Content
7 pages
Tanvir Updated Resume 2024-03-19
No ratings yet
Tanvir Updated Resume 2024-03-19
4 pages
Chat With Multiple PDF and Sign Letter Detection
No ratings yet
Chat With Multiple PDF and Sign Letter Detection
10 pages
4 Days Plan
No ratings yet
4 Days Plan
4 pages
Document RAG Assignment
No ratings yet
Document RAG Assignment
4 pages
Backend Django-Python Intern Assignment
No ratings yet
Backend Django-Python Intern Assignment
2 pages
Python Developer with IT Experience
No ratings yet
Python Developer with IT Experience
3 pages
OCR & Text Recognition for CS Students
100% (2)
OCR & Text Recognition for CS Students
37 pages
Python Image Processing Pipeline
100% (1)
Python Image Processing Pipeline
31 pages
Jaypee University of Engineering and Technology Raghogarh, Guna (M.P) Software Engineering
No ratings yet
Jaypee University of Engineering and Technology Raghogarh, Guna (M.P) Software Engineering
25 pages
Project
No ratings yet
Project
4 pages
OpenMic Ai AI Product Engineer (Full Stack Engineer
No ratings yet
OpenMic Ai AI Product Engineer (Full Stack Engineer
4 pages
Internship Task
No ratings yet
Internship Task
4 pages
Shamas Developer
No ratings yet
Shamas Developer
4 pages
Ocr PPT GRP 12
No ratings yet
Ocr PPT GRP 12
10 pages
Duy Le Thanh: Developer
No ratings yet
Duy Le Thanh: Developer
5 pages
ConversaiLabs Assignment - 1
No ratings yet
ConversaiLabs Assignment - 1
1 page
React Node Research
No ratings yet
React Node Research
3 pages
Assignment
No ratings yet
Assignment
3 pages
Frontend Task - Shareable Notes
No ratings yet
Frontend Task - Shareable Notes
3 pages
1998 - 1000 - DOC - AI-Powered Code Generation
No ratings yet
1998 - 1000 - DOC - AI-Powered Code Generation
5 pages
RAI AI Engineer Intern Assignments
No ratings yet
RAI AI Engineer Intern Assignments
3 pages
Python Django Robot Framework API
No ratings yet
Python Django Robot Framework API
3 pages
Backend Developer Assignment
No ratings yet
Backend Developer Assignment
3 pages
Python Developer Profile
No ratings yet
Python Developer Profile
4 pages
Task
No ratings yet
Task
3 pages
Web Developer Resume of Abhishek Chauhan
No ratings yet
Web Developer Resume of Abhishek Chauhan
3 pages
Fullstack Internship Assignment
No ratings yet
Fullstack Internship Assignment
2 pages
IELTS Speaking Tips: People & Places
No ratings yet
IELTS Speaking Tips: People & Places
12 pages
Orca Share Media1556693030998
No ratings yet
Orca Share Media1556693030998
18 pages
Analyzing Politics Ellen Grigsby Full Access
100% (1)
Analyzing Politics Ellen Grigsby Full Access
137 pages
The Lore of Ben 10 Explained
No ratings yet
The Lore of Ben 10 Explained
2 pages
Manual 34 Road Rescue
No ratings yet
Manual 34 Road Rescue
143 pages
NTPC
No ratings yet
NTPC
301 pages
317 Prep Work Day #2 Samuel Vasquez
No ratings yet
317 Prep Work Day #2 Samuel Vasquez
5 pages
Financial Management and Corporate Finance: Theories of Capital Structure: Relevance & Irrelevance Approach
No ratings yet
Financial Management and Corporate Finance: Theories of Capital Structure: Relevance & Irrelevance Approach
12 pages
Part 1. Introducing ABC Learning Design July 20
No ratings yet
Part 1. Introducing ABC Learning Design July 20
10 pages
STR-5 Stair Case & Midlanding Beam Schedule
No ratings yet
STR-5 Stair Case & Midlanding Beam Schedule
1 page
9 10 AI MCQ5 Sol
No ratings yet
9 10 AI MCQ5 Sol
10 pages
02 May Ashish Pune-Bilaspur
No ratings yet
02 May Ashish Pune-Bilaspur
2 pages
Ver2. Music From La La Land For Saxophone Quartet-Tenor - Saxophone-1
No ratings yet
Ver2. Music From La La Land For Saxophone Quartet-Tenor - Saxophone-1
4 pages
1A Surface Mount Bridge Rectifiers
No ratings yet
1A Surface Mount Bridge Rectifiers
2 pages
White Paper Pics GMP Guide Annex 1 Revisions and Interpretations
No ratings yet
White Paper Pics GMP Guide Annex 1 Revisions and Interpretations
12 pages
Kinginang Maneco
No ratings yet
Kinginang Maneco
5 pages
Jeeny Case Study - GTP 2025
No ratings yet
Jeeny Case Study - GTP 2025
3 pages
Johan de Meij - Canticles (Piano)
No ratings yet
Johan de Meij - Canticles (Piano)
28 pages
Resume For College Ojt
100% (1)
Resume For College Ojt
9 pages
Kryolan MSDS
No ratings yet
Kryolan MSDS
7 pages
6567-Article Text-27355-1-10-20230123
No ratings yet
6567-Article Text-27355-1-10-20230123
10 pages
Renewable Energy Sources Overview
No ratings yet
Renewable Energy Sources Overview
4 pages
B3-30 Ac DS Series 5 PM F-663 1217
No ratings yet
B3-30 Ac DS Series 5 PM F-663 1217
126 pages
Financial Accounting II - FARM FRESH BERHAD
No ratings yet
Financial Accounting II - FARM FRESH BERHAD
25 pages
Direcional Kill Sheet Blank Form
No ratings yet
Direcional Kill Sheet Blank Form
6 pages
12 Volatile Oils Cinnamon Fennel Coriander
No ratings yet
12 Volatile Oils Cinnamon Fennel Coriander
22 pages
Understanding Evolution: Evidence and Theories
No ratings yet
Understanding Evolution: Evidence and Theories
2 pages
IELTS Speaking 30 Day Course Updated
No ratings yet
IELTS Speaking 30 Day Course Updated
12 pages
Ling 101 F2025 Syllabus 8.15.25-1
No ratings yet
Ling 101 F2025 Syllabus 8.15.25-1
13 pages
Arena Software Installation Guide
No ratings yet
Arena Software Installation Guide
34 pages