IT Project Management

The project report titled 'Pixel Prompt' presents a web-based application developed by students at Techno India University that facilitates text-to-image and image-to-text conversions using Python, Tesseract OCR, and Flask. The report outlines the project's goals, scope, resources, and methodologies, including Agile principles for development and risk management strategies. Future enhancements may include multilingual OCR support and improved user interface features.

Uploaded by

harshitaj2022
Copyright
© All Rights Reserved

DEPARTMENT OF COMPUTER SCIENCE AND BUSINESS SYSTEM

TECHNO INDIA UNIVERSITY


EM4, SECTOR V, SALT LAKE, KOLKATA – 700091
2024-2025

PIXEL PROMPT

Project Report submitted by


Pramangshu Das: 211001020010
Sanket Sur: 211001020014
Ujjaini Ghosh: 211001020021
Subham Garai: 211001020032
Utsab Kundu: 211001020038
Certificate
This is to certify that this project report titled “Pixel Prompt”, submitted in partial fulfillment of the requirements
for the award of the degree of Bachelor of Technology (B. Tech) in Computer Science and Business System of Techno
India University, is a faithful record of the original work carried out by
SUBHAM GARAI, Roll no: 211001020032
PRAMANGSHU DAS, Roll no: 211001020010
UTSAB KUNDU, Roll no: 211001020038
SANKET SUR, Roll no: 211001020014
UJJAINI GHOSH, Roll no: 211001020021

under my guidance and supervision.

It is further certified that it contains no material which to a substantial extent has been submitted for the award
of any degree/diploma in any institute or has been published in any form, except the assistance drawn from
other sources, for which due acknowledgement has been made.

Date: …………

Dr. JAYANTA POREY
Professor and HOD
Computer Science & Business System Department
TECHNO INDIA UNIVERSITY

Prof. KALPITA DUTTA
Assistant Professor
Computer Science & Engineering Department
TECHNO INDIA UNIVERSITY
Declaration
We hereby declare that this project report titled “Pixel Prompt”, a web-based application, is
our own original work carried out as undergraduate students at Techno India University,
except to the extent that assistance from other sources is duly acknowledged. All sources used
for this project report have been fully and properly cited. It contains no material which to a
substantial extent has been submitted for the award of any degree/diploma in any institute or
has been published in any form, except where due acknowledgment is made.

Student’s Name Signature Date


Subham Garai
Pramangshu Das
Utsab Kundu
Sanket Sur
Ujjaini Ghosh
Acknowledgment
I would like to express my heartfelt gratitude to my mentor, Kalpita Dutta, for their invaluable
guidance and support throughout this project. My sincere thanks to Dr. Jayanta Porey for
providing the necessary resources and encouragement. I am also thankful to my peers and
family for their constant support.

Introduction
This project focuses on developing a web application that converts:

• Text to images, allowing users to visualize text content.
• Images to text using Optical Character Recognition (OCR).

By leveraging Python, Tesseract OCR, and Flask, the system aims to facilitate seamless
interaction between text and image data.

Project Overview

Goal
To design and implement a user-friendly application that:

• Accurately extracts text from images.
• Creates visually appealing images from text.
• Provides a simple web interface for users to interact with these functionalities.

Scope
The scope of the "Text-to-Image and Image-to-Text Conversion System" includes:
Inclusions:
1. Image-to-Text Conversion:
• Upload images in formats like JPEG and PNG.
• Extract and display text using Tesseract OCR.
2. Text-to-Image Conversion:
• Input text through a user-friendly interface.
• Generate and download images containing the entered text.
3. Web Interface:
• Flask-based web application for seamless interaction.
• Integration of both functionalities in a single platform.
Exclusions:
• Advanced image preprocessing for low-quality images (future enhancement).
• Multilingual OCR (can be added later).
Boundaries:
• Applicable for text extraction from clear, printed documents.
• Suitable for generating images with a fixed font style and size.
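
The text-to-image path with a fixed font style and size can be sketched with Pillow as below. This is a minimal illustration under stated assumptions, not the project's actual code: the function names `layout_lines` and `render_text_image`, the 40-character wrap width, and the pixel metrics are all hypothetical values chosen for the example.

```python
import textwrap


def layout_lines(text, width_chars=40):
    """Wrap text into lines of at most width_chars characters, keeping blank lines."""
    lines = []
    for paragraph in text.splitlines() or [""]:
        # textwrap.wrap returns [] for an empty paragraph; keep it as a blank line.
        lines.extend(textwrap.wrap(paragraph, width=width_chars) or [""])
    return lines


def render_text_image(text, out_path, width_chars=40, padding=20, line_height=16):
    """Draw wrapped text in black on a white canvas and save it to out_path."""
    # Pillow is imported here so the layout helper works without it installed.
    from PIL import Image, ImageDraw, ImageFont

    lines = layout_lines(text, width_chars)
    font = ImageFont.load_default()  # fixed font style and size
    # Assumption: roughly 7 pixels per character is enough for the default font.
    width = 2 * padding + 7 * width_chars
    height = 2 * padding + line_height * len(lines)

    image = Image.new("RGB", (width, height), "white")
    draw = ImageDraw.Draw(image)
    for row, line in enumerate(lines):
        draw.text((padding, padding + row * line_height), line, fill="black", font=font)
    image.save(out_path)  # the format is inferred from the file extension
    return out_path
```

Calling `render_text_image("Pixel Prompt converts text to images.", "output.png")` would write a PNG that the web interface could then offer for download.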

Resources
The resources required for the project include:
Hardware Resources:
1. Development System:
• Minimum Configuration:
• Processor: Intel i5 or equivalent.
• RAM: 8 GB.
• Storage: 100 GB free disk space.
• Preferred Configuration:
• Processor: Intel i7 or equivalent.
• RAM: 16 GB.
2. Hosting Platform:
• Cloud Service: AWS, Heroku, or local server for deployment.

Software Resources:
1. Languages and Frameworks:
• Python, Flask.
2. Libraries:
• Tesseract OCR.
• PIL (Pillow) for text-to-image.
3. Other Tools:
• Tesseract-OCR executable installation.
• IDE (e.g., VS Code, PyCharm).

Stakeholders
The stakeholders involved in the project are:
Primary Stakeholders:
1. Project Developer:
• Responsible for coding, integration, and deployment.
2. Mentor/Guide:
• Provides technical guidance and monitors progress.

Secondary Stakeholders:
1. End Users:
• Researchers, students, and organizations needing OCR-based solutions.
2. Institution/Organization:
• Benefits from the project as a demonstration of technical expertise.

Characteristics
The key characteristics of the project include:
Functional Characteristics:
1. Image-to-Text:
• Efficiently extracts text from images using Tesseract OCR.
2. Text-to-Image:
• Converts user-entered text into image format with high clarity.

Non-Functional Characteristics:
1. Scalability:
• Can handle multiple requests simultaneously when hosted on a server.
2. Usability:
• Intuitive user interface for seamless interaction.
3. Accuracy:
• High text extraction accuracy with clear images.
4. Maintainability:
• Modular codebase for easy updates and enhancements.

Feasibility
Technical Feasibility
• Open-source tools like Tesseract OCR ensure reliable text extraction.
• Flask provides a lightweight and scalable framework for building the web application.
Economic Feasibility

• Minimal development costs as most tools and libraries used are free.
• Cost-effective deployment options available on platforms like Heroku or AWS.

Market Demand Analysis
• Growing Need: OCR applications are widely used in digitization, accessibility tools, and
data extraction.
• Target Users: Researchers, content creators, and visually impaired users.
• Competitors: Existing tools lack an integrated text-to-image and image-to-text interface,
which gives this project a competitive advantage.

SWOT Analysis
Strengths
• High accuracy using Tesseract OCR.
• Easy-to-use web interface.
Weaknesses
• Limited to the accuracy of OCR for certain image qualities.
• Dependency on preprocessing for better results.
Opportunities
• Integration with multilingual OCR.
• Expansion into mobile platforms.
Threats
• Competition from established OCR and image processing tools.

Risk Analysis
1. Technical Risks
• OCR Accuracy: Errors in text extraction from low-quality images.
Mitigation: Use image preprocessing (e.g., noise reduction).
• Library Limitations: Limited language or handwriting support.
Mitigation: Integrate additional OCR tools.
• Performance Issues: High latency for large inputs.
Mitigation: Optimize code and system resources.
2. Operational Risks
• Server Downtime: Hosting platform failures.
Mitigation: Use reliable hosting and backups.
• User Errors: Uploading unsupported formats.
Mitigation: Add input validation and error messages.
3. Project Management Risks
• Time Overrun: Delays in development.
Mitigation: Regular reviews and prioritization.
• Resource Unavailability: Missing tools or dependencies.
Mitigation: Prepare all tools beforehand.
4. Security Risks
• Data Privacy: Sensitive data in uploaded images.
Mitigation: Secure image handling and deletion post-processing.
• Unauthorized Access: System misuse by unauthorized users.
Mitigation: Add authentication mechanisms.
5. Market Risks
• Competition: Advanced competitors in the market.
Mitigation: Highlight unique features.
• Demand Fluctuations: Reduced interest in OCR tools.
Mitigation: Enhance usability and target niche markets.
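
The input-validation mitigation for unsupported uploads might look like the sketch below. The allowed formats (JPEG and PNG) come from the project scope; the helper name `validate_upload`, the `/ocr` route, and the `image` form field are hypothetical names introduced for illustration.

```python
from pathlib import Path

# Formats listed in the project scope: JPEG and PNG.
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def validate_upload(filename):
    """Return (ok, message) describing whether the uploaded file name is acceptable."""
    if not filename:
        return False, "No file selected."
    suffix = Path(filename).suffix.lower()
    if suffix not in ALLOWED_EXTENSIONS:
        return False, "Unsupported format. Please upload a JPEG or PNG image."
    return True, "OK"


def create_app():
    """Build a small Flask app that rejects invalid uploads with an error message."""
    # Flask is imported lazily so validate_upload can be used and tested without it.
    from flask import Flask, jsonify, request

    app = Flask(__name__)

    @app.route("/ocr", methods=["POST"])
    def ocr_upload():
        upload = request.files.get("image")
        ok, message = validate_upload(upload.filename if upload else "")
        if not ok:
            return jsonify(error=message), 400
        # The OCR step would run here; it is omitted from this sketch.
        return jsonify(status="accepted", filename=upload.filename)

    return app
```

Checking only the file extension is a first line of defence; a stricter version could also try to open the file as an image before passing it to OCR.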

Project Scheduling
Phases of the Project
1. Phase 1: Planning and Research (Week 1)
• Define objectives, scope, and resources.
• Research Tesseract OCR and Flask integration.
2. Phase 2: Development (Weeks 2–5)
• Build Image-to-Text module.
• Develop Text-to-Image module.
• Design Flask-based web interface.
3. Phase 3: Testing and Debugging (Weeks 6–7)
• Perform unit and integration testing.
• Fix bugs and optimize performance.
4. Phase 4: Deployment (Week 8)
• Host the application on a server.
• Ensure functionality and user accessibility.
5. Phase 5: Documentation and Submission (Week 9)
• Prepare project report and user manual.

Milestones
1. Week 1: Finalize project plan and tools.
2. Week 2: Complete Image-to-Text functionality.
3. Week 4: Complete Text-to-Image functionality.
4. Week 5: Integrate modules into the web interface.
5. Week 7: Complete testing phase.
6. Week 8: Deploy the application.
7. Week 9: Submit project report.

Dependencies
1. Tesseract OCR: Required for text extraction.
2. Flask Framework: Backbone of the web application.
3. Image Processing Libraries: Required for Text-to-Image module.
4. Hosting Platform: Necessary for deployment and user access.

Deadlines
• Planning and Research: End of Week 1.
• Development: End of Week 5.
• Testing and Debugging: End of Week 7.
• Deployment: End of Week 8.
• Documentation: End of Week 9.

Gantt Chart
A Gantt Chart is a graphical representation of the project timeline, with tasks and their
corresponding weeks.
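
As a rough illustration, the phase schedule given under Project Scheduling can be rendered as a text Gantt chart. The week ranges are taken from the phases listed in this report; the function name `render_gantt` and the `##`/`..` cell markers are choices made for this sketch.

```python
# (phase name, first week, last week), as listed under "Phases of the Project".
PHASES = [
    ("Planning and Research", 1, 1),
    ("Development", 2, 5),
    ("Testing and Debugging", 6, 7),
    ("Deployment", 8, 8),
    ("Documentation and Submission", 9, 9),
]


def render_gantt(phases, total_weeks=9):
    """Return a text chart with one row per phase and one column per week."""
    label_width = max(len(name) for name, _, _ in phases)
    weeks = range(1, total_weeks + 1)
    header = " " * label_width + " | " + " ".join(f"W{week}" for week in weeks)
    rows = [header]
    for name, start, end in phases:
        # "##" marks a week in which the phase is active, ".." an idle week.
        cells = ["##" if start <= week <= end else ".." for week in weeks]
        rows.append(f"{name:<{label_width}} | " + " ".join(cells))
    return "\n".join(rows)


print(render_gantt(PHASES))
```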
PERT Chart
A PERT (Program Evaluation and Review Technique) chart visually maps out project tasks and
their dependencies to identify the critical path.
Steps in PERT Analysis:
1. Identify all tasks.
2. Define dependencies.
3. Assign time estimates (Optimistic, Most Likely, Pessimistic).
4. Determine the critical path.
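
Step 3 is usually turned into a single figure with the standard PERT weighted average, expected time = (O + 4M + P) / 6, with standard deviation (P − O) / 6. The sketch below applies it; the report gives no three-point estimates, so the example values for the Development phase (3, 4, and 6 weeks) are purely hypothetical.

```python
def pert_estimate(optimistic, most_likely, pessimistic):
    """Return (expected_time, standard_deviation) using the standard PERT formulas."""
    if not optimistic <= most_likely <= pessimistic:
        raise ValueError("Estimates must satisfy optimistic <= most likely <= pessimistic.")
    expected = (optimistic + 4 * most_likely + pessimistic) / 6
    std_dev = (pessimistic - optimistic) / 6
    return expected, std_dev


# Hypothetical estimates in weeks for the Development phase (not from the report).
expected, std_dev = pert_estimate(optimistic=3, most_likely=4, pessimistic=6)
print(f"Expected: {expected:.2f} weeks, standard deviation: {std_dev:.2f} weeks")
```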
Critical Path
Planning → Development → Testing → Deployment → Documentation

Project Cost Control


1. Key Areas of Cost Control
1. Budget Planning:
• Allocate funds for tools, hosting, and development.
• Maintain a contingency fund for unforeseen expenses.
2. Resource Management:
• Use open-source tools like Tesseract OCR and Flask to reduce costs.
• Optimize team resources to avoid overstaffing or underutilization.
3. Regular Monitoring:
• Track expenses against the planned budget weekly.
• Use tools like Excel or project management software for cost tracking.
4. Milestone-Based Approvals:
• Release funds only after achieving specific project milestones.
• Prevent overspending by linking expenses to tangible deliverables.
5. Vendor and Hosting Costs:
• Choose cost-effective hosting services (e.g., Heroku, AWS Free Tier).
• Negotiate for better deals with any paid software or APIs.
Estimated Cost Breakdown:

Project Audit
Purpose of Project Audit
• To ensure the project adheres to planned timelines, budgets, and objectives.
• Identify discrepancies and areas for improvement during development.
Key Audit Areas
1. Performance Audit:
• Evaluate if milestones are achieved on time.
• Assess OCR accuracy and system performance.
2. Financial Audit:
• Compare actual expenses with the planned budget.
• Verify resource allocation and cost optimization.
3. Quality Audit:
• Ensure modules meet predefined quality standards.
• Validate functionality, such as accurate text-to-image conversion.
4. Compliance Audit:
• Confirm adherence to security and privacy standards.
• Ensure the use of legal and ethical tools.
Audit Frequency
• Conduct audits at the end of each project phase (planning, development, testing, and
deployment).

Agile Principles
1. Customer Satisfaction: Deliver working software early and continuously.
2. Embrace Change: Welcome evolving requirements to improve product value.
3. Deliver Frequently: Release increments of the project within short timeframes.
4. Collaborative Teamwork: Ensure close communication among developers and
stakeholders.
5. Sustainable Development: Maintain a consistent pace throughout the project.
6. Technical Excellence: Focus on high-quality code and design.
7. Continuous Improvement: Reflect regularly to enhance processes and outputs.

Agile Methodology
Steps in Agile for This Project
1. Initiation: Define scope, resources, and objectives.
2. Sprint Planning: Break the project into sprints (e.g., weekly deliverables).
3. Development Sprints: Focus on creating the image-to-text and text-to-image
modules.
4. Scrum Meetings: Conduct daily or bi-weekly meetings to track progress and address
challenges.
5. Testing and Review: Test each feature incrementally to ensure quality.
6. Deployment: Deploy a minimum viable product (MVP) and gather user feedback.
7. Retrospective: Reflect on the sprint to identify and implement improvements.

Project Path Analysis


Critical Path Analysis
• Identify Tasks: Determine all tasks (e.g., planning, development, testing).
• Sequence Tasks: Establish dependencies (e.g., testing depends on module
completion).
• Estimate Time: Assign durations to each task.
• Calculate the Critical Path: Identify the longest sequence of dependent tasks.
Critical Path for This Project
Planning → Development (Image-Text) → Development (Text-Image) → Testing →
Deployment → Documentation
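
The critical path analysis steps above can be sketched as a longest-path search over the task dependencies. The durations below are the phase lengths from the project schedule (1, 4, 2, 1, and 1 weeks), with the two development tasks treated as a single Development phase; the function name `critical_path` and the dictionary layout are assumptions made for this example, and the task graph is assumed to contain no cycles.

```python
from functools import lru_cache

# task name -> (duration in weeks, list of tasks it depends on)
TASKS = {
    "Planning": (1, []),
    "Development": (4, ["Planning"]),
    "Testing": (2, ["Development"]),
    "Deployment": (1, ["Testing"]),
    "Documentation": (1, ["Deployment"]),
}


def critical_path(tasks):
    """Return (total_duration, path) for the longest chain of dependent tasks."""

    @lru_cache(maxsize=None)
    def longest_to(name):
        # Longest chain that ends with this task, including its own duration.
        duration, dependencies = tasks[name]
        best_length, best_path = 0, ()
        for dependency in dependencies:
            length, path = longest_to(dependency)
            if length > best_length:
                best_length, best_path = length, path
        return best_length + duration, best_path + (name,)

    total, path = max(longest_to(name) for name in tasks)
    return total, list(path)


total_weeks, path = critical_path(TASKS)
print(" -> ".join(path), f"({total_weeks} weeks)")
```

Because every phase in this project depends on the one before it, the critical path is the whole chain; the same function also handles schedules in which tasks run in parallel.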
Conclusion
This project demonstrates a practical implementation of OCR and Flask to create a robust
text-to-image and image-to-text application. By adhering to Agile principles and leveraging
efficient project management techniques, the system is built with high quality, within the
planned budget, and on schedule. The integration of risk analysis, cost control, and audits
ensures the project's success while meeting market demands.
Future enhancements may include multilingual OCR support, improved user interface, and
advanced image processing features to stay competitive and user-focused.
