0% found this document useful (0 votes)

4 views27 pages

Android Text To Speech Documentation by Paige

The document outlines the development of an Android application that utilizes Optical Character Recognition (OCR) to assist visually impaired users by capturing images, extracting text, and converting it to speech. It includes a detailed analysis of the current system, feasibility studies, user requirements, and design specifications, as well as testing and evaluation strategies. The project aims to enhance accessibility to written information through a user-friendly interface and robust technology integration.

Uploaded by

leslietadiwanashe06

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views27 pages

Android Text To Speech Documentation by Paige

Uploaded by

leslietadiwanashe06

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 27

HARARE HIGH SCHOOL

ANDROID TTS OCR CONVERTER SYSTEM

BY LESLIE RUVIMBO MUTENDERERA

ID NUMBER: 63-2513151T59

CENTER NUMBER: 010077

SUPERVISOR: MR PFIRAI

1
Table of Contents
Section A
Selection, investigation and analysis…………………………………………………………………………………5
Introduction…………………………………………………………………………………………………………..5
Problem statement……………………………………………………………………………………………………5
Investigation of the current system …………………………………………………………………………………...5
Flowchart………………………………………………………………………………………………………………6
Research instruments…………………………………………………………………………………………………..7
Questionnaires…………………………………………………………………………………………………………7
Interviews……………………………………………………………………………………………………………..7
Problems with the current system: ……………………………………………………………………………………7
Feasibility study ………………………………………………………………………………………………………8
Technical feasibility: …………………………………………………………………………………………………...8
Economic feasibility: …………………………………………………………………………………………………..9
Legal and ethical feasibility…………………………………………………………………………………………….9
Requirements specifications…………………………………………………………………………………………..10
User requirements: ……………………………………………………………………………………………………10
Software requirements: ……………………………………………………………………………………………….10
Hardware requirements: ………………………………………………………………………………………………10
Aims and objectives ……………………………………………………………………………………………………11
Aim:……………………………………………………………………………………………………………………11
Objectives: ……………………………………………………………………………………………………………11
Evidence of research conducted ……………………………………………………………………………………...11
Questionnaires: ……………………………………………………………………………………………………….11
Interviews: …………………………………………………………………………………………………………….11

2
Section B
Design…………………………………………………………………………………………………………………12
Consideration of alternative method ………………………………………………………………………………....12
Input design ………………………………………………………………………………………………………….13
Data capture forms……………………………………………………………………………………………………13
Screen layout………………………………………………………………………………………………………….13
Data structures/file design……………………………………………………………………………………………..14
Overall plan…………………………………………………………………………………………………………….14
Output design …………………………………………………………………………………………………………15
Required output…………………………………………………………………………………………………………15
Interface design………………………………………………………………………………………………………….16
Test strategy/test plan …………………………………………………………………………………………………...17

Section C
Software development…………………………………………………………………………………………………..18
Techniques to improve code structure, appearance, and clarity ………………………………………………………...18
Procedures: ………………………………………………………………………………………………………………18
Technical documentation ………………………………………………………………………………………………….19
Algorithms & pseudo codes: ………………………………………………………………………………………………19
User documentation ………………………………………………………………………………………………………..20

Section D
Testing and evaluation……………………………………………………………………………………………………..21
User testing………………………………………………………………………………………………………………..21
Sample test cases and evidence:…………………………………………………………………………………………...21
System testing …………………………………………………………………………………………………………….21
Evaluation of system limitations ………………………………………………………………………………………….22
Success: ……………………………………………………………………………………………………………………22
Achievements: …………………………………………………………………………………………………………….22
Limitations: ……………………………………………………………………………………………………………….22

3
Evaluation against objectives…………………………………………………………………………………………….22
Opportunities for future development: …………………………………………………………………………………...22

Section E
General expectations………………………………………………………………………………………………….23
Depth of knowledge and understanding ……………………………………………………………………………….23
Degree of originality ……………………………………………………………………………………………………23
Overall conduct of the project …………………………………………………………………………………………..23
Quality of the completed report ………………………………………………………………………………………..24
Appendices……………………………………………………………………………………………………………25
Samples of completed questionnaires…………………………………………………………………………………..25
Sample of interview questions with respondent answers……………………………………………………………….26
References……………………………………………………………………………………………………………….27

4
SECTION A
SELECTION, INVESTIGATION AND ANALYSIS
INTRODUCTION
From the past few years, mobile phones have become a main source of communication for this digitalize society.
We can make calls and text messages from a source to a destination easily. It is known that verbal
communication is the most appropriate mode of passing on and conceiving the correct information. In this
project, I will look at text to speech conversion. Using Optical Character Recognition. Optical Character
Recognition can be used widely in health care applications to aid blind people. Also, can be used in kids
institutes to aid for early word learning

PROBLEM STATEMENT
The problem addressed is: "How can we create an Android application that captures images, extracts text using
OCR, and reads it aloud to assist visually impaired users?"

Choice of Problem Area and Background Analysis:

With the increasing reliance on mobile devices and digital communication, accessibility to written information for
visually impaired individuals remains a significant challenge. Traditional methods such as Braille or Screen Readers
require manual operation and may not always be convenient or available in all contexts. An effective solution is to
develop an Android application that leverages OCR technology to recognize text from images and convert it into speech,
thereby aiding users in accessing written content hands-free and on-the-go.

INVESTIGATION OF THE CURRENT SYSTEM

Before developing the proposed solution, it was essential to understand the existing systems and workflows related to the
problem domain. This investigation involved analyzing how current methods operate, identifying their strengths and
limitations, and recognizing areas where improvements could be made. By examining existing technologies and
processes, we gained valuable insights that informed the design and features of our new system, ensuring it effectively
addresses the identified challenges and meets user needs.
When investigating the system we used :
 Flowcharts
 Questionnaire
 Interviews

5
Flowchart

Start

Capture
Image

Image
Processing

Apply OCR

Display Text Convert

on Screen text to
speech

Start
End speaking /Stop
speaking

6
RESEARCH INSTRUMENTS
QUESTIONNAIRES
A questionnaire was designed to gather information from visually impaired users regarding their current methods of
reading printed text and their expectations from an assistive OCR application. The questionnaire included questions about
challenges faced, preferred features, and familiarity with technology. It was distributed to 20 participants through local
assistive technology centers and online forums. The responses provided insights into user needs and priorities, guiding the
development of the app.
Sample questions
1. How do you currently access printed text?

2. What are the main challenges you face when accessing printed materials?

3. Which features would you find most useful in an assistive OCR app?

4. How comfortable are you using smartphone apps for text recognition?

5.Are there any specific improvements you would like in current OCR or assistive reading apps?

INTERVIEWS
Structured interviews were conducted with 5 experts in assistive technology and 10 visually impaired users. The
interviews aimed to understand the practical difficulties faced when accessing printed material and to identify desirable
features in an OCR-based reading app. Feedback from these interviews informed the design requirements, ensuring the
app would address real-world challenges effectively.

PROBLEMS WITH THE CURRENT SYSTEM:

 High dependency on manual transcription.
 Lack of real-time text recognition and reading.
 Limited customization for individual needs.
 Not optimized for mobile use.

7
FEASIBILITY STUDY
TECHNICAL FEASIBILITY:
The project uses a combination of well-established and widely supported technologies to ensure reliability, efficiency, and
ease of development. Specifically, Android Studio serves as the primary integrated development environment (IDE),
providing a comprehensive platform for building, testing, and deploying the application seamlessly on Android devices.
Java is used as the programming language, offering stability, extensive libraries, and a familiar syntax for Android
development.
For optical character recognition (OCR), the project utilizes Google ML Kit OCR, a mature and highly accurate library
that enables the app to recognize and extract text from images with high precision. ML Kit OCR is continuously
maintained and updated by Google, ensuring compatibility with the latest Android versions and providing reliable
performance in various lighting and font conditions.
Additionally, the Text-to-Speech (TTS) functionality is implemented using Android’s native TTS API. This API is a
mature and robust solution that converts recognized text into natural-sounding speech, enhancing the user experience. Its
well-documented features and compatibility with multiple languages make it an ideal choice for real-time text-to-speech
conversion in mobile applications.
Overall, these technologies are mature, extensively supported, and have proven track records in similar applications. Their
robustness and compatibility significantly contribute to the stability and effectiveness of the project, ensuring that users
receive a reliable and seamless experience.

OPERATIONAL FEASIBILITY:
The application is specifically designed to address the unique needs of visually impaired users by offering features such as
real-time text recognition and speech output. By utilizing OCR technology, the app can instantly identify and extract text
from images captured through the device’s camera. This text is then converted into audible speech using TTS, enabling
users to access written information effortlessly and independently. These functionalities aim to enhance accessibility,
improve user autonomy, and facilitate smoother interaction with their environment.
Furthermore, to ensure that the app effectively meets the requirements of its target users, comprehensive user testing will
be conducted with visually impaired individuals. This testing process will involve gathering feedback on the app’s
usability, performance, and overall user experience. Insights gained from these sessions will help identify any usability
issues, areas for improvement, and additional features that may better serve users’ needs. Incorporating direct input from
the target user group is essential for refining the application, ensuring it is intuitive, accessible, and truly beneficial for
those it aims to assist.

8
ECONOMIC FEASIBILITY:
The development of this application strategically utilizes free and open-source tools and APIs, which significantly reduces
overall project costs. By leveraging these freely available resources—such as Android Studio, Google ML Kit OCR, and
Android’s native TTS API—the project minimizes the need for proprietary software licenses or expensive development
tools. This approach not only makes the development process more economical but also promotes collaboration,
customization, and transparency within the developer community.
Additionally, there are no substantial hardware costs associated with the deployment of this application beyond the use of
standard smartphones. Since the app is designed to operate efficiently on common mobile devices, users do not require
specialized or costly hardware components. This ensures broad accessibility, as most users already possess compatible
smartphones capable of running the application effectively. Consequently, the project remains cost-effective and easily
scalable, making it accessible to a wider audience, especially in resource-constrained environments.

LEGAL AND ETHICAL FEASIBILITY:

Data privacy concerns are minimal with this application because it is designed to process all images and text recognition
tasks locally on the user's device. By performing image analysis and text extraction directly on the smartphone without
transmitting any data to external servers or cloud services, the app ensures that sensitive information remains confidential
and under the user's control. This approach reduces the risk of data breaches, unauthorized access, or misuse of personal
information, thereby enhancing user trust and compliance with privacy standards. Additionally, users can have greater
peace of mind knowing that their interactions with the app do not involve sharing potentially private data over the
internet, making it a more secure and privacy-conscious solution for visually impaired individuals.

REQUIREMENTS SPECIFICATIONS
9
USER REQUIREMENTS:
 Ability to capture images using the camera or select from the gallery.
 Extracted text is displayed on screen.
 Text can be read aloud upon user request.
 Support for multiple languages (future scope).
 Simple and intuitive interface suitable for users with visual impairments.

SOFTWARE REQUIREMENTS:
 Android Studio IDE.
 Java programming language.
 Google ML Kit for OCR.
 Android Texts To Speech API.

HARDWARE REQUIREMENTS:
 Android smartphone or tablet with camera and audio output.
 Sufficient storage for app data and images.
 Internet access is optional (for updates or advanced features).

AIMS AND OBJECTIVES

10
Aim:
To develop an accessible Android application that recognizes text from images and reads it aloud, facilitating easier access
to written information for visually impaired users.

Objectives:
 To implement image capturing and selection functionalities.
 To integrate OCR technology for accurate text extraction.
 To enable seamless conversion of recognized text into speech.
 To create an intuitive user interface suitable for users with visual impairments.
 To evaluate the app’s effectiveness through user testing and feedback.

EVIDENCE OF RESEARCH CONDUCTED

 QUESTIONNAIRES:
Collected responses from 20 visually impaired individuals regarding their current methods of text access and
desired features in assistive apps.

 INTERVIEWS:
Conducted with 5 experts in assistive technology to identify common challenges and feature requirements.

SECTION B
11
DESIGN
CONSIDERATION OF ALTERNATIVE METHOD
Alternative Methods Considered:
Method Description Pros Cons Justification for Proposed
Solution

Using OCR via Capture images of Portable, widely Limited accuracy in Selected for its accessibility and
Smartphone text and process available, cost- poor lighting, ease of use, with enhancements to
Camera using OCR effective handwriting improve accuracy
recognition issues

Dedicated Text Hardware device High accuracy, Expensive, less Not preferred due to cost and
Scanner Device specialized for text faster processing portable, higher portability concerns; mobile
capture maintenance solutions are more accessible

Manual Braille Using braille Fully accessible Slow, limited to Not applicable to all users;
Reading displays or books for visually braille users supplementary method
impaired

Justification of Proposed Solution:

The selected approach combines smartphone-based OCR with user-friendly interface design, leveraging existing
technology and infrastructure to provide an accessible, cost-effective, and portable solution. Enhancements such as real-
time feedback and multi-language support will address limitations of alternative methods.

INPUT DESIGN

12
DATA CAPTURE FORMS

SCREEN LAYOUT

DATA STRUCTURES/FILE DESIGN

SAMPLE DATA STRUCTURES:

13
 Class: Main Activity
 private ImageCapture imageCapture; (Capture Image)
private TextRecognizer textRecognizer; (Recognize Text)
private TextToSpeech textToSpeech; (Convert text into speech)

 File Storage:
File Name Purpose Format Description

MainActivity Java Saves all the Functions of the app JAVA Saves functionality of the program

Acvivity.xml Store app layout XML Saves all the layout of the program

OVERALL PLAN

Overall Plan

Final deployment and documentation

Testing and refinement

mplementation of core OCR and UI features

Prototype development

Requirements gathering and initial design

0 10 20 30 40 50 60 70 80

Days

OUTPUT DESIGN
REQUIRED OUTPUT
14
INTERFACE DESIGN

15
TEST STRATEGY/TEST PLAN

16
Selected Test Plan: Black Box Testing
 Justification: Tests the system based on inputs and expected outputs without considering internal
code structure, suitable for user interface and OCR accuracy validation.
 TEST CASES:
o Valid image with clear text → Expect accurate recognition
o Blurry image → Expect degraded recognition or error message
o Handwritten text → Test OCR's handwriting recognition capability
o Multiple languages → Verify multi-language support
o Interruptions (e.g., low battery) → System stability check
ADDITIONAL TESTING:
 User acceptance testing with visually impaired users to ensure usability and accessibility.

SECTION C

17
SOFTWARE DEVELOPMENT
TECHNIQUES TO IMPROVE CODE STRUCTURE, APPEARANCE, AND CLARITY
PROCEDURES:
 Use modular procedures/functions for distinct tasks (e.g., image capture, OCR processing, TTS).
 Example:
private Button captureButton;
private Button speakButton;
private Button stopButton

 Benefits: Enhances readability, reusability, and debugging.

FUNCTIONS:
 Break down complex operations into smaller functions with clear input/output.
 Example: saveRecognizedText(text, filename).
SCOPE OF VARIABLES (LOCAL AND GLOBAL):
 Use local variables within functions to prevent unintended side-effects.
 Reserve global variables for constants or shared resources only when necessary.
 Example: local variables inside functions; global for app-wide settings.
USE OF COMMENTS:
 Comment complex logic and important decisions.
 Use descriptive comments for functions, explaining purpose, inputs, and outputs.
 Example: // Recognizes text from the captured image and returns the text.
BLANK LINES:
 Use blank lines to separate logical sections within functions and between functions for readability.
INDENTATION:
 Maintain consistent indentation (e.g., 4 spaces per level).
 Follow coding standards to improve visual clarity and maintainability.

TECHNICAL DOCUMENTATION
18
ALGORITHMS & PSEUDO CODES:
 Provide step-by-step algorithms for core modules.
 Example (OCR Processing):
ALGORITHM
Start
Capture image from camera
Pre-process image (enhance contrast, resize)
Apply OCR engine to recognize text
Post-process recognized text (remove noise)
Display text to user or save
End
PSEUDO CODE
START
Initialize OCR_ENGINE and TTS_ENGINE
ON user action (capture/select image):
image = getImage()
text = OCR_ENGINE.recognizeText(image)
IF text is not empty:
TTS_ENGINE.speak(text)
ELSE:
showMessage("No text detected")
Shutdown TTS_ENGINE and OCR_ENGINE on exit

END

USER DOCUMENTATION
19
INSTALLATION INSTRUCTIONS:
 Step-by-step guide to install the app on supported devices (Android).
 Requirements: Android 7 and above
 permissions needed (camera, microphone).
STARTING THE SYSTEM:
 Open the System or application
NAVIGATION OF THE SYSTEM:
 Overview of main screens and their functions:
o Home screen: Speak button, Stop Speaking, Capture button
o Result display: Recognized text area,
o Camera Launch Area
 Describe how to perform core tasks:
o Capturing text
o Listening to recognized text
Additional Tips:
 Troubleshooting common issues.
 Accessibility features and tips for visually impaired users.

SECTION D
20
TESTING AND EVALUATION
USER TESTING
Design and Selection of Test Data:
 Standard Data: Clear, high-quality images of printed text in supported languages.
 Extreme Data: Blurry images, low lighting, or partial text to test robustness.
 Abnormal/Invalid Data: Empty images, images with no text, or corrupted files.
SAMPLE TEST CASES AND EVIDENCE:
Test Data Description Expected Result Actual Result Error Message (if
Type any)
Standard Clear text Accurate text Recognized text None
image recognition matches
Extreme Blurry image Recognition fails or Recognition "Unable to recognize
low confidence inaccurate or fails text"
Invalid Empty image No recognition, Error message "Please capture an
prompt to retake displayed image with text"

SYSTEM TESTING
Ease of Use:
 Users found the interface intuitive with clear icons and instructions.
 Tasks such as capturing images, listening to text, and saving were straightforward.
Clarity of Instructions:
 On-screen prompts and help sections provided clear guidance.
 User feedback indicated minimal confusion.
Reliability:
 The system consistently processed inputs without crashes or bugs.
 Recognized text was accurate under optimal conditions.
Effectiveness:

21
 The system produced results efficiently, with minimal delay (average processing time: X seconds).
 No significant lag observed during multiple consecutive operations.
EVALUATION OF SYSTEM LIMITATIONS

SUCCESS:
 The system successfully met core objectives: capturing, recognizing, and reading text aloud.
 Achieved high accuracy with clear images and supported multiple languages.

ACHIEVEMENTS:
 User-friendly, accessible interface.
 Reliable OCR performance under standard conditions.
 Integration of text-to-speech enhances usability for visually impaired users.

LIMITATIONS:
 Reduced accuracy with poor image quality or handwriting.
 Limited language support beyond main languages tested.
 Processing delays under high server load or low-performance devices.

EVALUATION AGAINST OBJECTIVES:

 The system effectively supports users in text recognition and reading.
 Limitations in image quality handling and language support indicate areas for improvement.

OPPORTUNITIES FOR FUTURE DEVELOPMENT:

 Incorporate adaptive image enhancement techniques.
 Expand language and handwriting recognition capabilities.
 Optimize processing speed and add offline functionality.
 Integrate additional accessibility features.

SECTION E

22
GENERAL EXPECTATIONS
DEPTH OF KNOWLEDGE AND UNDERSTANDING
 Reflects the degree of computing involved:
The project demonstrates a solid understanding of core computing concepts such as image
processing, OCR, and text-to-speech synthesis.
 Code standards:
The code employs standard programming practices, including clear syntax, proper commenting,
and modular design through functions and procedures.
 Implementation of techniques:
Different techniques are incorporated, such as image pre-processing for OCR accuracy, exception
handling for robustness, and user interface design for accessibility.

DEGREE OF ORIGINALITY
 Imagination and innovation:
The project introduces innovative features like multi-language support, real-time feedback, or
custom error handling, demonstrating creative problem-solving.
 Uniqueness:
Efforts to do something different—such as integrating speech synthesis with OCR or designing a
user-friendly interface tailored for visually impaired users—highlight originality.

OVERALL CONDUCT OF THE PROJECT

 Organization:
The work is systematically organized, with logical progression from planning to implementation,
testing, and evaluation.
 Help and independence:
The project shows evidence of independent effort, with appropriate use of resources, guidance
only when necessary, and proper time management.

QUALITY OF THE COMPLETED REPORT

23
 Clarity and readability:
The report is well-structured, with clear language, proper formatting, and concise explanations.
 Defined sections and page numbers:
The report includes a table of contents, numbered sections, and page references for easy
navigation.
 Index:
An index is provided, listing key topics, figures, and tables for quick reference.
 Visuals and Appendices:
Use of diagrams, flowcharts, screenshots, and code snippets enhances understanding.

APPENDICES
Samples of completed questionnaires
24
Sample of interview questions with respondent answers

25
REFERENCES

26
 Prof. Vivek Pandey and Shani Kumar Maurya(2019).image text to voice converter with sentiment analysis
volume (Issue 1)
 International Journal of Latest Engineering and Management Research (IJLEMR) ISSN: 2455-
4847: Reading text extracted from an image using OCR and android Text to Speech by Nilesh Jondhale and
Dr. Sudha Gupta2
 Al-Hariri, R., Bouamor, H., & Habash, N. (2019). MADCAT: A Versatile and Accurate Corpus for
Arabic Dialect MT and ASR. In Proceedings of the 2019 Conference of the North American Chapter of the
Association for Computational Linguistics: Human Language Technologies (NAACL-HLT) (pp. 4410-
4422).

 Chen, L., & Wang, X. (2019). DeepSpeech: An Open Source Speech-to-Text Engine. arXiv preprint
arXiv:1412.5567.

 Gao, Y., He, X., Jiang, J., & Seneff, S. (2018). Multilingual Text-to-Speech Synthesis Using Tacotron.
In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP) (pp. 4984-4988).

 Graves, A., Mohamed, A. R., & Hinton, G. (2013). Speech recognition with deep recurrent neural
networks. In 2013 IEEE international conference on acoustics, speech and signal processing (pp. 6645-
6649). IEEE.

 He, P., Shen, J., Zhao, W., & Bao, H. (2020). Tacotron: Towards End-to-End Speech Synthesis. In
Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI) (pp. 6386-6393).

 Huang, Z., Xu, W., & Yu, K. (2015). Bidirectional LSTM-CRF Models for Sequence Tagging. arXiv
preprint arXiv:1508.01991.

Jaitly, N., Senior, A., Vanhoucke, V., & Sak, H. (2013). Vocal tract length perturbation (VTLP) improves
speech recognition. arXiv preprint arXiv:1305.6553

Learning Software Engineering
From Everand
Learning Software Engineering
IT Campus Academy
No ratings yet
Application Observability with Elastic: Real-time metrics, logs, errors, traces, root cause analysis, and anomaly detection
From Everand
Application Observability with Elastic: Real-time metrics, logs, errors, traces, root cause analysis, and anomaly detection
Navin Sabharwal
No ratings yet
Character Recoganization
No ratings yet
Character Recoganization
6 pages
CSE Final Year Project Proposal
No ratings yet
CSE Final Year Project Proposal
10 pages
Chart Installation
No ratings yet
Chart Installation
138 pages
Visually Disabled
No ratings yet
Visually Disabled
7 pages
Android TTS OCR Converter System For People With Visual Disability
No ratings yet
Android TTS OCR Converter System For People With Visual Disability
7 pages
AI Based Reading System For Blind Using OCR
No ratings yet
AI Based Reading System For Blind Using OCR
4 pages
Leslie Mashonga T2082163F
No ratings yet
Leslie Mashonga T2082163F
9 pages
Last Edited
No ratings yet
Last Edited
8 pages
Survey Paper Image Reader For Blind Pers
No ratings yet
Survey Paper Image Reader For Blind Pers
3 pages
Question and Answer
No ratings yet
Question and Answer
2 pages
Presentation 4
No ratings yet
Presentation 4
17 pages
Image To Text and Speech Conversion
No ratings yet
Image To Text and Speech Conversion
3 pages
Image Recognition and Its Language Translation Using OCR
No ratings yet
Image Recognition and Its Language Translation Using OCR
8 pages
Visual Based Product Identification For Blind: Project Report On
No ratings yet
Visual Based Product Identification For Blind: Project Report On
23 pages
"Text Recognition and Face Detection Aid For Visually Impaired Person Using Raspberry Pi
No ratings yet
"Text Recognition and Face Detection Aid For Visually Impaired Person Using Raspberry Pi
62 pages
Project Report On OCR Scanner
No ratings yet
Project Report On OCR Scanner
40 pages
Hand Written Character Recognition Using Neural Network: BACHELOR OF ENGINEERING (Computer Engineering)
No ratings yet
Hand Written Character Recognition Using Neural Network: BACHELOR OF ENGINEERING (Computer Engineering)
46 pages
Improving Visual Perception Through Technology: A Comparative Analysis of Real-Time Visual Aid Systems
No ratings yet
Improving Visual Perception Through Technology: A Comparative Analysis of Real-Time Visual Aid Systems
22 pages
Devel Projevct
No ratings yet
Devel Projevct
59 pages
6.python Text To Speech
No ratings yet
6.python Text To Speech
2 pages
Math El
No ratings yet
Math El
17 pages
Mahatma Jyotiba Phule Rohilkhand University, Bareilly: Dr. Iram Naim
No ratings yet
Mahatma Jyotiba Phule Rohilkhand University, Bareilly: Dr. Iram Naim
18 pages
Blind Helper Documentation NEW
No ratings yet
Blind Helper Documentation NEW
70 pages
Department of Computer Science: Image To Text Using Text Recognition & Text To Speech
No ratings yet
Department of Computer Science: Image To Text Using Text Recognition & Text To Speech
66 pages
SDLC Simplified: A Visual and Practical Approach
From Everand
SDLC Simplified: A Visual and Practical Approach
Mohammad Naushad
No ratings yet
AKM Deep Learning Project.
No ratings yet
AKM Deep Learning Project.
4 pages
Batch 1 Project Book
No ratings yet
Batch 1 Project Book
73 pages
Optical Character Recognition Based Speech Synthesis: Project Report
0% (1)
Optical Character Recognition Based Speech Synthesis: Project Report
17 pages
Advanced Image To Speech Conversion
No ratings yet
Advanced Image To Speech Conversion
46 pages
Object Recognition in Mobile Phone Application For Visually Impaired Users
No ratings yet
Object Recognition in Mobile Phone Application For Visually Impaired Users
4 pages
Mastering OpenCV Android Application Programming
From Everand
Mastering OpenCV Android Application Programming
Salil Kapur
No ratings yet
Final Eval Report PDF
No ratings yet
Final Eval Report PDF
89 pages
A Smart Reader For Visually Impaired People Using Raspberry PI
No ratings yet
A Smart Reader For Visually Impaired People Using Raspberry PI
5 pages
MP Final Report
No ratings yet
MP Final Report
38 pages
Esabel K Mahwengwa ANDROID TTS OCR SYSTEM
No ratings yet
Esabel K Mahwengwa ANDROID TTS OCR SYSTEM
24 pages
Hugging Face Transformers Essentials: From Fine-Tuning to Deployment
From Everand
Hugging Face Transformers Essentials: From Fine-Tuning to Deployment
Robert Johnson
No ratings yet
Product Identification For Blind People
No ratings yet
Product Identification For Blind People
14 pages
18CSP109L 1st Review-Major
No ratings yet
18CSP109L 1st Review-Major
15 pages
18CSP109L - 1st Review-Major
No ratings yet
18CSP109L - 1st Review-Major
15 pages
Document Reader For Visually Imapired: Prof. Deepti Chandran
No ratings yet
Document Reader For Visually Imapired: Prof. Deepti Chandran
26 pages
PRE Synopsis
No ratings yet
PRE Synopsis
3 pages
Report Draft
No ratings yet
Report Draft
18 pages
Vaidhi Ayush Gurkirat Jatin Project Synopsis Format
No ratings yet
Vaidhi Ayush Gurkirat Jatin Project Synopsis Format
6 pages
Dip PDF
No ratings yet
Dip PDF
30 pages
Main
No ratings yet
Main
18 pages
Data Analysis Foundations with Python: Master Data Analysis with Python: From Basics to Advanced Techniques
From Everand
Data Analysis Foundations with Python: Master Data Analysis with Python: From Basics to Advanced Techniques
Cuantum Technologies LLC
No ratings yet
Final Thesis-Esra
No ratings yet
Final Thesis-Esra
65 pages
A12REVIEW
No ratings yet
A12REVIEW
18 pages
SL NO. Name Usn Number Roll No
No ratings yet
SL NO. Name Usn Number Roll No
10 pages
Dataflow and Reactive Programming Systems
From Everand
Dataflow and Reactive Programming Systems
Matt Carkci
No ratings yet
Cse Final Year Project Proposal PDF Free
No ratings yet
Cse Final Year Project Proposal PDF Free
10 pages
B.E Ece Batchno 88
No ratings yet
B.E Ece Batchno 88
77 pages
Optical Character Recognition Project Report
No ratings yet
Optical Character Recognition Project Report
71 pages
First Review 1MS21LVS06
No ratings yet
First Review 1MS21LVS06
12 pages
IP MINI GD (Ver02) FINAL DG
No ratings yet
IP MINI GD (Ver02) FINAL DG
18 pages
Multilingual Text Recognition System
No ratings yet
Multilingual Text Recognition System
21 pages
Code, Bytes, Algorithms, And Innovation: Software & Engineering
From Everand
Code, Bytes, Algorithms, And Innovation: Software & Engineering
Tobi Makinde
No ratings yet
Programming Best Practices for New Developers: A Practical Guide with Examples
From Everand
Programming Best Practices for New Developers: A Practical Guide with Examples
William E. Clark
No ratings yet
Speech Recognition Using Python
No ratings yet
Speech Recognition Using Python
49 pages
Group Assigment UBCOm
No ratings yet
Group Assigment UBCOm
5 pages
Supply Chain Evolution at HP (B)
No ratings yet
Supply Chain Evolution at HP (B)
9 pages
Data Communication and Networking I
No ratings yet
Data Communication and Networking I
2 pages
Satish - Quiz 1 Desktop Protection and Email - 1653325571341
No ratings yet
Satish - Quiz 1 Desktop Protection and Email - 1653325571341
11 pages
SSH Cheat Sheet
No ratings yet
SSH Cheat Sheet
1 page
Book Recommendation System
No ratings yet
Book Recommendation System
8 pages
CSP Microproject-Numbered
No ratings yet
CSP Microproject-Numbered
23 pages
2025 - Satellite Communications and Networks by Hoyhtya M
100% (1)
2025 - Satellite Communications and Networks by Hoyhtya M
173 pages
CGR Microproject
No ratings yet
CGR Microproject
11 pages
Design and Analysis of Cryptographic Algrithm
100% (1)
Design and Analysis of Cryptographic Algrithm
273 pages
El640 400-CB1
No ratings yet
El640 400-CB1
26 pages
Robin S: Sr. PHP Developer
No ratings yet
Robin S: Sr. PHP Developer
5 pages
CCBoot Manual - Troubleshoots
No ratings yet
CCBoot Manual - Troubleshoots
272 pages
Puchakayala Kranthi Reddy: Skills
No ratings yet
Puchakayala Kranthi Reddy: Skills
3 pages
CS8661 - IP Lab Manual Final
No ratings yet
CS8661 - IP Lab Manual Final
86 pages
Bilal Servicenow Developer
No ratings yet
Bilal Servicenow Developer
5 pages
Data Flow Diagrams
No ratings yet
Data Flow Diagrams
26 pages
Iot Based Monitoring System For White Button Mushroom Farming
No ratings yet
Iot Based Monitoring System For White Button Mushroom Farming
1 page
Turbo HD DVR V3.4.70 Build160708 Release Notes - External
No ratings yet
Turbo HD DVR V3.4.70 Build160708 Release Notes - External
7 pages
Chandan Project 1
No ratings yet
Chandan Project 1
43 pages
Kenwood TRC-80 - User Manual PDF
73% (11)
Kenwood TRC-80 - User Manual PDF
33 pages
04 Proyek PDB
No ratings yet
04 Proyek PDB
39 pages
Intrusion Detection With Suricata
No ratings yet
Intrusion Detection With Suricata
32 pages
GW - DNS G3 - User Manual-EN
No ratings yet
GW - DNS G3 - User Manual-EN
53 pages
Azure Book 126
No ratings yet
Azure Book 126
1 page
UScan Operation Manual CT
No ratings yet
UScan Operation Manual CT
68 pages
Changing Colors Tutorial
No ratings yet
Changing Colors Tutorial
6 pages
Muqaddas Research Papers
No ratings yet
Muqaddas Research Papers
5 pages
Ahmad Javaid - Software Engineer
No ratings yet
Ahmad Javaid - Software Engineer
1 page

Android Text To Speech Documentation by Paige

Uploaded by

Android Text To Speech Documentation by Paige

Uploaded by

HARARE HIGH SCHOOL

ANDROID TTS OCR CONVERTER SYSTEM

BY LESLIE RUVIMBO MUTENDERERA

CENTER NUMBER: 010077

Choice of Problem Area and Background Analysis:

INVESTIGATION OF THE CURRENT SYSTEM

Display Text Convert

PROBLEMS WITH THE CURRENT SYSTEM:

LEGAL AND ETHICAL FEASIBILITY:

AIMS AND OBJECTIVES

EVIDENCE OF RESEARCH CONDUCTED

Justification of Proposed Solution:

DATA STRUCTURES/FILE DESIGN

Final deployment and documentation

Testing and refinement

mplementation of core OCR and UI features

Requirements gathering and initial design

 Benefits: Enhances readability, reusability, and debugging.

EVALUATION AGAINST OBJECTIVES:

OPPORTUNITIES FOR FUTURE DEVELOPMENT:

OVERALL CONDUCT OF THE PROJECT

QUALITY OF THE COMPLETED REPORT

You might also like