Saavip-Smart Ai - Assistant For Visually Impaired People
Saavip-Smart Ai - Assistant For Visually Impaired People
SAAVIP-SMART AI -
ASSISTANT FOR VISUALLY
IMPAIRED PEOPLE
COLLEGE OF ENGINEERING PATHANAPURAM
01
Contents
02-11-2024
1. Introduction
2. Existing System
3. Proposed System
4. Problem Statement
5. Objective
6. Scope
7. Literature Review
LITERATURE REVIEW
8. System Design
9. Conclusion
10. Reference
02
LITERATURE REVIEW 02-11-2024
03
Introduction
01
Introduction
02-11-2024
04
LITERATURE REVIEW 02-11-2024
05
Existing System
02
Existing System
02-11-2024
Current systems like Seeing AI and Be My Eyes provide object detection, text reading, and
navigation assistance for the visually impaired.
Limitations
Fragmented features, focusing on individual tasks.
Limited real-time performance and processing.
LITERATURE REVIEW
06
LITERATURE REVIEW 02-11-2024
07
Proposed System
03
Proposed System
02-11-2024
08
LITERATURE REVIEW 08.12.2024
09
Problem Statement
04
Problem Statement
02-11-2024
10
11
LITERATURE REVIEW 02-11-2024
Objectives
05
Objectives
02-11-2024
12
13
LITERATURE REVIEW 02-11-2024
Scope
06
Scope
02-11-2024
14
15
LITERATURE REVIEW 02-11-2024
Literature Review
07
CITATION METHODOLOGY ADVANTAGES DISADVANTAGES IDEA INHERITED
Real-time Object Detection for Employed a Convolutional Neural Provides real-time feedback, enabling Accuracy can be affected by lighting To empower visually impaired individuals with
Visually Challenged People" Network (CNN) architecture for real-time immediate situational awareness. conditions, object occlusion, and complex increased autonomy and independence by
object detection. Utilized a dataset of Can potentially enhance independent backgrounds. leveraging AI-powered object detection for
Sunit Vaidya- 2020 common objects and trained the model navigation and daily living activities. Requires continuous power supply for the environmental understanding
to accurately identify and locate them in device.
real-world scenarios.
A Deep Learning Approach for Developed a deep learning model using a Can potentially improve accessibility Requires a large and diverse dataset To bridge the gap in assistive
Object Recognition System for the combination of Convolutional Neural for visually impaired individuals in with accurate Arabic annotations for technology for visually impaired
Visually Impaired Using Arabic Networks (CNNs) and Recurrent Neural Arabic-speaking regions. effective training. individuals in regions with different
Annotation Networks (RNNs) to recognize objects in Combines the strengths of CNNs and May face challenges in generalizing to linguistic contexts.
By real-time. Trained the model on a dataset RNNs for enhanced feature extraction unseen objects or environments.
Mohammad Hussan et al. of Arabic-annotated images. and classification.
- 2023
A deep learning-based integrated Developing a deep learning-based system integrating deep learning and assistive
voice assistance system for partially that uses a camera and sensors to Enhanced Independence Limited Precision in Complex technology to create a voice
disabled people by capture real-time environmental data, Health Monitoring Environments assistance system specifically
Harshit Garg ,Srishti Jhunthra which is then processed for object Cost-effective Design Latency Issues designed for individuals with partial
,Madhav Kindra ,Vikrant Dixit ,Vedika detection and text recognition. The User Training Required disabilities
Gupta - April 2016 system utilizes a text-to-speech engine to Enhancing their ability to interact with
convert visual data into audio feedback. their surroundings.
Real-Time Object Detection and Utilized a smartphone-based system with Leverages the widespread Accuracy can be affected by
Recognition for Visually Impaired a camera and integrated object detection availability and computational power smartphone camera limitations and It combines a voice recognition module
Persons Using Smartphone by algorithms. Developed a user-friendly of smartphones. varying lighting conditions. with an obstacle detection system to
Hiren Kumar Thakkar et al - 2020 interface for real-time object identification Provides a portable and convenient Battery life of the smartphone can impact improve safety and autonomy for users.
and audio feedback. solution for everyday use. usage duration.
Image processing and machine learning To harness the power of readily available
A facial expression controlled Increased Accessibility Reliance on Facial Movements technology to enhance the quality of life
techniques to recognize specific facial Environmental Limitations
wheelchair for people with expressions, which are then mapped to Intuitive Interface for visually impaired individuals.
disabilities by Yassine Rabhi,Makrem wheelchair control commands (e.g., forward, Real-time Response High Computational Demand
Mrabet,Farhat Fnaiech - February backward, left, right).
2019
17
CITATION METHODOLOGY ADVANTAGES DISADVANTAGES IDEA INHERITED
Faster R-CNN: Towards Real-Time Uses RPNs to generate region proposals, Faster and more accurate region
Object Detection with Region Computationally intensive for real-time Combines Region Proposal Networks
sharing convolutional features with the proposals. applications.
Proposal Networks by Shaoqing detection network for unified training. Enables joint training of RPN and (RPNs) and Fast R-CNN into an end-to-
Ren, Kaiming He, Ross B. Girshick, Struggles with small object detection. end trainable object detection system.
Jian Sun - 2015 detection networks
You Only Look Once (YOLO): Divides the image into grid cells and
directly predicts bounding boxes and Extremely fast, achieving real-time Simplifies object detection as a single
Unified, Real-Time Object Detection Lower accuracy in complex scenes. neural network regression task for
by Joseph Redmon, Santosh class probabilities in a single pass. detection. Struggles with small or closely spaced
Simple, unified architecture. bounding boxes and class probabilities.
Divvala, Ross B. Girshick, Ali objects.
Farhadi-2016
Introduces a loss function that down- Proposes Focal Loss to address class
Focal Loss for Dense Object Boosts accuracy for one-stage Adds a tunable hyperparameter (γ). imbalance in one-stage detectors by
Detection weights easy examples, emphasizing hard- detectors, especially for rare classes. Slightly increases training time.
to-classify ones. focusing training on hard examples.
Tsung-Yi Lin, Priya Goyal, Ross B. Works with various one-stage
Girshick, Kaiming He, Piotr Dollár - architectures.
2017
18
CITATION METHODOLOGY ADVANTAGES DISADVANTAGES IDEA INHERITED
Improved Disability Assistant The project creates an Android app to Provides accessible health and fitness
Android Mobile Application Mulla Lacks features for visually impaired Health guidance through exercises
support disabled users with health resources. users.
Amina Mustaq1, Sapkal Kinjal guidance, communication tools, GPS-based Enhances communication with text-to- and diet plans.
Baliram2, Chaudhary Chinmaya Limited to basic sign language resources. Communication aid with text and
hospital location, reminders for speech and speech-to-text. Reminder notifications could be more
Pravin3, Prof. K.S Charumathi4 - appointments, and sign language Includes reminders for medical speech conversion.
April 2023 customizable. GPS and reminders for easy
resources. appointments.
healthcare access.
A reconfigurable technical Develops a reconfigurable assistance Enhances mobility and independence Complexity in managing multiple users in Flexible and fault-tolerant assistance
assistance for disabled people A. system for disabled users, using intelligent for disabled individuals. shared spaces. in daily activities.
Belabbas, P. Berruet, A. Rossi, J-L. wheelchairs and domotic services, Offers customizable assistance Limited to areas with domotic Intelligent navigation and interaction
Philippe LESTER - 2022 modeled with Petri nets to adapt to through adaptive technology. infrastructure. with the environment.
failures in the environment. Maintains service availability despite High reliance on real-time adjustments Ensures continuous service availability
system breakdowns. and reconfiguration. through reconfiguration
Smartphone-based Accessibility: Smartphone Limitations: Performance The paper inherits the idea of using real-
The methodology of this paper involves Object detection for visually impaired depends on device power, affecting
Real-Time Object Detection And conducting a comparative analysis of time object detection to assist visually
Identification For Visually without extra hardware. older models. challenged individuals by adapting
existing object detection and identification Optimized Algorithms: Compares
Challenged People Using Mobile algorithms, focusing on their performance Detection Range: Effective within 2-5 algorithms to work on mobile platforms. It
Platform Neeraj Joshi, Shubham on low-computation devices, and identifying YOLO and SSD for real-time use. meters, limiting some scenarios. leverages the speed of regression-based
Maurya, Sarika Jain - February 2021 research gaps to propose a feasible model Low-Compute Solutions: Proposes Speed vs. Accuracy: Prioritizing speed algorithms like YOLO and SSD to create
for visually impaired individuals using a lightweight methods for accessible may reduce accuracy in complex an accessible, low-computation solution
use. settings.
19 smartphone-based system. without extra hardware.
LITERATURE REVIEW 02-11-2024
20
SYSTEM DESIGN
08
MODULE DESCRIPTION
02-11-2024
Utilizes a YOLO model trained on a specific dataset to identify objects and scenes in real-time,
helping users understand their surroundings.
21
MODULE DESCRIPTION
02-11-2024
auditory feedback.
21
MODULE DESCRIPTION
02-11-2024
6. Database:
Stores datasets, trained models, and user-specific data (such as recognized faces and
navigation preferences).
Enables efficient data retrieval for face recognition, object detection, and navigation
processing.
LITERATURE REVIEW
21
HARDWARE REQUIREMENTS
02-11-2024
21
SOFTWARE REQUIREMENTS
02-11-2024
1. Operating System:
Android or iOS SDK (e.g., Android Studio for Android or Xcode for iOS development).
2. Programming Languages:
Java/Kotlin for Android, or Swift for iOS development.
Python for developing machine learning models.
3. Machine Learning Libraries:
YOLO Model for object and face detection.
OpenCV for image processing.
TensorFlow Lite or Core ML for on-device model deployment.
4. APIs and Services:
LITERATURE REVIEW
21
LITERATURE REVIEW 02-11-2024
20
STRUCTURE
PROJECT
configuration component src Configuration
Files
postcss.config.js
PostCSS configuration
package.json public
Project dependencies and scripts Static assets
State Management Refs Effects UI Components
Camera State
Detected Objects State
20
MODULE
DETECTION
OBJECT
TECHNOLOGIES USED
02-11-2024
REACT JS
TENSORFLOW JS
COCO-DATASET
LITERATURE REVIEW
21
02-11-2024
MediaDevices API
LABEL
VIDEO FEED
COCO-SSD BOUNDARY BOX
CAMERA CONFIDENCE
ANALYSIS
Canvas API
LITERATURE REVIEW
BOUNDARY
BOX
21
02-11-2024
MediaDevices API - used to access the user's camera feed. This allows you to get a live video stream from the
camera so the app can analyze it in real-time.
COCO-SSD - pre-trained model available from TensorFlow.js. It can detect objects from a list of 90 categories,
such as people, animals, vehicles, and more.
COCO-SSD model processes each frame and returns a list of detected objects, each with:
Label - The name of the detected object (e.g., "person", "dog").
Bounding Box - The location of the detected object in the video frame (given as coordinates: top, left, width,
height).
Confidence - A measure of how confident the model is in its detection.
LITERATURE REVIEW
Canvas API - to draw bounding boxes around the detected objects in the video feed. This provides a visual
indication of where each object is in the frame.
21
21
LITERATURE REVIEW 02-11-2024
OUTPUT
LITERATURE REVIEW 02-11-2024
20
MODULE
OUTPUT
VOICE
FEATURES
02-11-2024
21
PROGRESS
02-11-2024
21
02-11-2024
API
ech
Spe
eb
W
LITERATURE REVIEW
SpeechSynthesis API:
Use the SpeechSynthesis interface to
make the browser speak the text. Here’s
how you can implement it.
API converts the description text into speech
21
LITERATURE REVIEW 02-11-2024
20
Conclusion
09
Conclusion
02-11-2024
21
LITERATURE REVIEW 02-11-2024
22
Reference
10
Reference
02-11-2024
Improved Disability Assistant Android Mobile Application Mulla Amina Mustaq1, Sapkal Kinjal Baliram2, Chaudhary
Chinmaya Pravin3, Prof. K.S Charumathi4 - April 2023
Intelligent Voice Controlled Wheel Chair for Disabled People by M. Joly, Arun Pradeep ,Kavitha S - 25.02.2023
Voice Control Intelligent Wheelchair Movement Using CNNs By Mohammad Shahrul Izham Sharifuddin; Sharifalillah Nordin;
Azliza Mohd Ali - March 2021
Integrated Speaker and Speech Recognition for Wheel Chair Movement using Artificial Intelligence Gurpreet Kaur Research
Scholar, I.K Gujral Punjab Technical University, Kapurthala-144603, India - November 10, 2017
Text to Voice Conversion for Visually Impaired Person by using Camera.1Mr. Sumit Chafale.,2Ms.PriyankaDighore 3 Ms.Dipika
Panditpawar., 4Mr. Khushal Bhagawatkar5Mr.Shrikant Sakhare - March 2021
Integrated Speaker and Speech Recognition for Wheel Chair Movement using Artificial Intelligence Gurpreet Kaur Research
LITERATURE REVIEW
Scholar, I.K Gujral Punjab Technical University, Kapurthala-144603, India - November 2019
21
LITERATURE REVIEW 02-11-2024
24
THANK YOU