Chatbot Paper
Abstract: Smartphones help us with almost every activity and task nowadays. The features and hardware of the phone can be leveraged to build apps for online payment, content consumption and creation, accessibility, and more. These devices can also be used to assist the visually challenged and guide them in their daily activities. As the visually challenged sometimes face difficulty in sensing the objects or humans in their surroundings, they require guidance or help in recognizing objects and human faces, reading text, and other activities. Hence, this Android application has been proposed to help and assist people with partial vision impairment. The application makes use of technologies like face detection, object and text recognition, a barcode scanner, and a basic voice-based chatbot which can be used to execute basic commands, implemented through deep learning, artificial intelligence, and machine learning. The application will be able to detect the number of faces, recognize the object in the camera frame of the application, read out text from newspapers, documents, etc., and open the link detected from a barcode, with all output given to the user in the form of voice.
1 Introduction

A normal person without any disabilities has no issues with daily work in their life. But, on the other hand, it is difficult for a partially blind person to carry out daily tasks. Actions like reading text and identifying objects cannot be performed by them due to their disability. Making Braille versions of every text is an expensive and tedious task. Also, recognizing objects from a distance is not possible for a visually challenged person. Although there are several applications to help and assist the visually challenged, each offers only some features, making the person install a handful of applications. So, to overcome the current issues faced by a visually challenged person, we have developed this application, which offers convenience and assistance to the visually challenged. The application offers text recognition, object recognition, and face detection to identify text, objects, and humans. It also offers a chatbot so that the visually challenged person can interact with the bot for basic information and activities.

2 Literature Review

We studied the following research papers to gather knowledge and ideas about the implementation of our project.

Tosun et al. [1] discussed the process and the algorithms involved in real-time object detection. They also compared various algorithms like YOLOv2, SSD, and Faster R-CNN in terms of accuracy. The paper explained the ML algorithms in brief. YOLOv2 provided better accuracy and ran even at low fps with a GPU processor.

Tembhurne et al. [2] studied the implementation of a voice assistant for the visually challenged. The paper discussed the various modules which can be implemented in the voice assistant, like calls, messages, TTS, and OCR. The paper also talks about using the Maps API for navigation.
*Corresponding author:[email protected]
© The Authors, published by EDP Sciences. This is an open access article distributed under the terms of the Creative Commons Attribution License 4.0
(http://creativecommons.org/licenses/by/4.0/).
ITM Web of Conferences 37, 01019 (2021) https://doi.org/10.1051/itmconf/20213701019
ICITSD-2021
Dahiya et al. [3] elaborate on the R-CNN algorithm in detail and compare the accuracy and computational time of R-CNN and Faster R-CNN combined with ResNet-50. The paper also discusses the data preprocessing steps required for feeding the data into the machine learning model. The framework proposed in the paper claims an accuracy of 92%.

Ahmed et al. [4] discussed using RNN (recurrent neural network) and CNN (convolutional neural network) models for obstacle avoidance and way-finding. Their work using CNN proved helpful for implementing object detection using CNN-based algorithms.

Gianani et al. [5] described real-time object detection implemented using OpenCV, with the position of the object determined using Euclidean distance. The system also guides the user to the objects through voice output. The paper explains object detection using the SSD framework and the MobileNet architecture, which achieves an accuracy of 99.61%. This system is designed to work in an indoor environment.

Kukade et al. [6] focused on Speech-to-Text, Text-to-Speech, Optical Character Recognition, and voice assistance, and proposed a system to implement the same. The paper also discussed ways of implementing it.

Shishir et al. [7] explained object recognition using the TensorFlow ML API along with its implementation. They included informative flowcharts for understanding the process behind it. They also explained the working of OCR and object recognition. This implementation provided an accuracy of over 80%.

Singh et al. [9] proposed an Android application which offers text recognition, speech recognition, image recognition, and a chatbot for the user to interact with the application. The paper proposed using Google Cloud APIs (various APIs which can be used to automate tasks) and Google Dialogflow (a natural language understanding platform on which a chatbot can be implemented) to implement the various modules instead of training deep learning models to perform these activities.

Sharma et al. [10] focus on implementing a system offering face recognition, text-to-speech, and object recognition in a web browser which can be opened on a mobile device. The paper also talks about adding a feature to add unknown faces to the database at the tap of a button for future reference. The proposed system also has a fairly simple and user-friendly UI designed specifically for the visually impaired.

Jakhete et al. [11] discussed using the Single Shot Detector (SSD) algorithm to implement object detection in an Android application. The paper lists other object recognition algorithms and mentions the steps to implement the SSD algorithm on Android.

3 Existing System

In this section, we discuss the features of certain applications available on the Play Store.

Supersense [12] – an application that assists the visually challenged; the features provided by it are object recognition, face recognition, and text recognition.

Sullivan+ [13] – this application serves the same purpose and provides object recognition to describe images, face recognition, and text recognition.

Envision AI [14] – this application also serves the same purpose and provides face recognition and object recognition.

LetSeeApp [15] – this application is also for the same purpose and provides text recognition to read visiting cards as well as credit and debit cards.

The above-mentioned applications provide more or less similar features (the links to these applications are provided in the references section).

This application aims to provide better functionality in a single app that a partially blind user can use for navigation, identification, recognition, and gaining information about the outer world. Some of its features are listed below:

• The app will contain a chatbot to which the user can ask questions about the time, weather, or other topics to obtain information, or ask it to perform certain actions the user desires.
• It will detect objects in real time and provide the necessary information to the user.
• The app will also contain a barcode scanner which will help the user get information about certain products.
• The app can also help the user detect human faces, so that the user can sense human presence in the surroundings and also the number of people in the room.
• This application will have a text reader which will be used to read text out loud to the user.

Using the app, the person can get help and guidance in day-to-day tasks and activities.

APP NAME          FR    OR    TR    CHATBOT
SUPERSENSE        YES   YES   NO    NO
SULLIVAN+         NO    YES   YES   NO
LETSEEAPP         NO    YES   YES   NO
ENVISION AI       YES   YES   NO    NO
OUR APPLICATION   YES   YES   YES   YES

Table 1 Comparison of features provided by each application
NOTE: FR - Face Recognition; OR - Object Recognition; TR - Text Recognition
NOTE: The algorithms used in the other applications are unknown to us. We have contacted their developers but have not received any responses yet, so the comparison is based on the features provided.

5 Methods

Face Detection:

Face detection is a computer technology that is used to detect human faces in images, videos, or real-time video. Face detection is a broad technology that just marks or labels the human faces identified by the application. The key difference between face detection and face recognition is that face detection just identifies the face, whereas face recognition will also label the person's name, gender, age, or other attributes. Face detection can be applied in various fields - security, biometrics, entertainment, law enforcement, etc.

Basic face detection can be achieved through OpenCV, whereas real-time face detection, or face detection under varying conditions, can be achieved using machine learning or deep learning. Face detection algorithms start by searching for human eyes in the frame, as they are the easiest feature to detect. The algorithm then searches for other features like eyebrows, nose, ears, and iris. When the algorithm finds these features in the frame, it applies additional tests and confirms the detection of the face by labelling it with a rectangular box.

Real-time face detection involves motion; hence traditional algorithms cannot be applied. So, advanced machine learning and deep learning algorithms are used to create models which can detect faces in real time in various scenarios.

Object Recognition:

Object recognition is the technique of recognizing and labelling an object detected in an image, video, or real-time feed. Object recognition is achieved using machine learning and deep learning. Object recognition algorithms take the frame from the camera as input, apply a bounding box of a specific size to the image, and check for the object in the image. If the object is found in the image, the algorithm will recognize it. There are two steps to object recognition - image classification and object localization. Image classification predicts the class of the object in an image, whereas object localization identifies one or more objects in the image and draws the bounding boxes. An object detection algorithm combines both tasks and classifies the objects in the image.

Text Recognition:

Text recognition is the technique of detecting and identifying text in printed, handwritten, or digital format. Text recognition technology converts text in these different forms to digital form. It is also called OCR (Optical Character Recognition). Several APIs exist for various platforms which can be used to implement OCR.

For recognizing typed or printed text on objects or books, the user has to open the application on their smartphone and select the required option. The application will identify the text and convert it to digital form. The text will then be read out to the user.

Chatbot:

Chatbots are AI-based computer programs that can simulate a human conversation. They are also called digital assistants, as chatbots can be used to perform actions and commands given by the user. A chatbot can process human conversation, reply to commands and queries, or solve user FAQs as well.

The key modules behind a chatbot are artificial intelligence, natural language processing, user-defined rules, and machine learning, which are required to process the commands or messages sent by the user and deliver the required feedback.

Chatbots are of two types - task-oriented and data-driven. Task-oriented chatbots are designed for a single purpose and only generate automated responses. Their interaction is specific and restricted to FAQs or basic questions. The answers to the queries are already defined in task-oriented chatbots. Hence, they can only handle and process basic queries, and they are the kind most commonly used in websites and apps for user queries.
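A task-oriented chatbot of the kind described above can be sketched as a small keyword-lookup program. This is a minimal illustration only, not the implementation used in our application; the rules and replies below are hypothetical examples.

```python
# Minimal sketch of a task-oriented chatbot: answers are predefined and
# looked up by keyword, so only basic, anticipated queries are handled.
# The rule table and replies are hypothetical examples.

RULES = {
    "hello": "Hello! How can I help you?",
    "hours": "We are open from 9 am to 5 pm.",
    "bye": "Goodbye!",
}

FALLBACK = "I did not understand that. Please rephrase."

def reply(message: str) -> str:
    """Return the predefined answer for the first known keyword found."""
    for word in message.lower().split():
        answer = RULES.get(word.strip("?!.,"))
        if answer is not None:
            return answer
    return FALLBACK

print(reply("Hello there"))              # matches the "hello" rule
print(reply("Explain quantum physics"))  # no rule matches: fallback reply
```

Because every answer must be spelled out in advance, such a bot cannot go beyond its rule table, which is exactly the limitation that motivates the data-driven chatbots discussed next.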
Data-driven chatbots or virtual assistants are more interactive, sophisticated, and advanced than task-oriented ones. These chatbots use NLP, NLU, and ML to learn from the user's queries and responses. They analyze past user interaction data and behavior to provide responses or feedback to the user's queries. Hence, data-driven chatbots become better, more efficient, and more precise over time. Amazon Alexa, Google Assistant, and Apple's Siri are examples of data-driven chatbots.

Implementation

For implementing object recognition and face detection, we have chosen the TensorFlow Lite framework (Google's open-source deep learning framework designed for on-device processing) for our proposed system. TensorFlow Lite was chosen because other frameworks like Keras (an open-source library that provides a Python interface for artificial neural networks) and PyTorch (an open-source ML library designed for NLP and computer vision) do not offer Lite versions for low-end devices like smartphones.

TF Lite (TensorFlow Lite) also offers various pre-trained models with commonly used algorithms and datasets for out-of-the-box usage in projects and applications. Several algorithms, such as You Only Look Once (YOLO) [16], Single Shot Detector (SSD) [17], and Region-based Convolutional Neural Network (R-CNN) [18], among others, can be used to implement real-time object recognition.

We chose the SSD algorithm for our project as it offers a fair trade-off between speed and accuracy over other algorithms, which favour one of these parameters over the other. The following table shows the speed and accuracy comparisons.

Table 2. Speed and Accuracy comparison among object detection algorithms.
Method | mAP | FPS | Size | Boxes | Input

The SSD algorithm also performs better when it comes to detecting objects of different shapes and sizes. This is evident from the comparison graph, which shows the difference.

Fig 1 Algorithm's performance over objects of different sizes.

We have used the TensorFlow Object Detection API model which uses SSD MobileNet v1. This model is trained on the MS-COCO [19] dataset. The COCO dataset is a massive object detection dataset which has 330,000 images, over 200,000 of them labelled, covering 80 object categories.

Real-time face detection can be implemented using algorithms like the Multi-Task Cascaded Convolutional Neural Network (MTCNN) [20] and Google FaceNet [21], or using the OpenCV Haar Cascade [22] and Dlib [23] toolkits.

The FaceNet algorithm performed best among these, with a maximum accuracy of 99.63%. So, for face detection, we chose to implement a FaceNet variant designed for low-power devices, the MobileFaceNet [24] model. The MobileFaceNet model offered better speed than the others, as is evident from the graph below.
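An SSD-style detector such as the one described above returns, for each frame, parallel arrays of bounding boxes, class indices, and confidence scores. The sketch below shows the thresholding step an application applies to such output before announcing detections; the label subset and the 0.5 threshold are illustrative assumptions, not the exact values used in our app.

```python
# Sketch of the post-processing step after an SSD model runs: the model
# emits parallel arrays of boxes, class indices, and confidence scores,
# and the app keeps only detections above a confidence threshold.
# The label list and threshold here are illustrative assumptions.

LABELS = ["person", "bicycle", "car", "dog", "chair"]  # tiny COCO-style subset

def filter_detections(boxes, classes, scores, threshold=0.5):
    """Return (label, box) pairs for detections above the threshold."""
    results = []
    for box, cls, score in zip(boxes, classes, scores):
        if score >= threshold:
            results.append((LABELS[cls], box))
    return results

boxes = [(0.1, 0.2, 0.5, 0.6), (0.4, 0.4, 0.9, 0.9)]
classes = [0, 4]        # indices into LABELS: "person" and "chair"
scores = [0.93, 0.31]   # only the first passes the 0.5 threshold
print(filter_detections(boxes, classes, scores))
```

The surviving labels are what the app converts to speech for the user.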
For implementing the chatbot, we have used an AIML chatbot which uses Python packages like pyttsx3 (an offline Python text-to-speech (TTS) conversion library), NLTK (the Natural Language Toolkit, a package of libraries and programs written in Python for processing natural language), and ChatterBot to provide feedback to the user as per the queries asked. Implementing the chatbot requires natural language processing and artificial intelligence for it to give replies and perform actions. The chatbot will read the command from the user, detect the keywords in the command, and then perform the action as programmed by the developer.

6 Results

Object Recognition

The object recognition module has been implemented successfully:
Accuracy of 90%
Average run time of 1.3 seconds

Fig 3 Detecting objects

Text Recognition

The text recognition module has been implemented successfully:
Accuracy of 90%
Average run time of 1.4 seconds
Face Recognition

The face recognition module has been implemented successfully:
Accuracy of 85%
Average run time of 1.2 seconds

Chatbot

The voice-based chatbot has been implemented successfully.

Barcode Scanner

The barcode scanner has been integrated successfully.
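As an illustration of what the barcode module relies on after decoding, the sketch below validates the check digit of an EAN-13 product code. This is the standard EAN-13 algorithm, shown for illustration; the scanner library integrated in the app performs this internally.

```python
# EAN-13 check-digit validation: the standard integrity test a barcode
# library performs after decoding. Shown for illustration; the scanner
# library used by the app handles this internally.

def ean13_is_valid(code: str) -> bool:
    """True if the 13-digit code's weighted checksum is a multiple of 10."""
    if len(code) != 13 or not code.isdigit():
        return False
    digits = [int(c) for c in code]
    # Odd positions (1st, 3rd, ...) weigh 1; even positions weigh 3.
    total = sum(d * (1 if i % 2 == 0 else 3) for i, d in enumerate(digits))
    return total % 10 == 0

print(ean13_is_valid("4006381333931"))  # True: checksum is a multiple of 10
print(ean13_is_valid("4006381333930"))  # False: wrong check digit
```

Only codes that pass this test are worth looking up, so a scanner can reject misreads before fetching product information for the user.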
B. Face Recognition

1. Average time taken to perform one task (tested 20 times)

Fig 11 Time taken by each application for face recognition (y-axis: time taken in seconds; apps compared: Supersense, Sullivan+, LetSeeApp, Envision AI, and our app)
2. Accuracy - providing correct output (tested 20 times)

Fig 14 Accuracy of each application for text recognition (y-axis: accuracy in percentage; apps compared: Supersense, Sullivan+, LetSeeApp, Envision AI, and our app)

7 Discussion

The working of the Android app and its modules is explained below.

Fig 16 Application working flowchart
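As an illustration of the routing the flowchart implies, the sketch below dispatches a recognized voice command to a module handler by keyword. The handlers are stubs and the keyword table is an assumption for illustration, not the app's actual wiring; in the app, the handlers correspond to modules such as Datetime and Web Browser.

```python
# Sketch of routing a recognized voice command to a module, as the
# application flowchart suggests: the command is scanned for a keyword
# and dispatched to the matching handler. Handlers are stubs here; the
# keyword table is a hypothetical example.

def handle_datetime(cmd):
    return "datetime"       # stub: the real module speaks the date/time

def handle_browser(cmd):
    return "web browser"    # stub: the real module fetches web content

ROUTES = {
    "time": handle_datetime,
    "date": handle_datetime,
    "play": handle_browser,
    "news": handle_browser,
}

def dispatch(command: str) -> str:
    """Route the command to the first module whose keyword it contains."""
    for word in command.lower().split():
        handler = ROUTES.get(word)
        if handler is not None:
            return handler(command)
    return "chatbot fallback"  # unmatched commands go to the chatbot

print(dispatch("what is the time"))  # routed to the datetime handler
```

Commands with no matching keyword fall through to the general chatbot, mirroring the fallback path in the flowchart.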
Datetime – This module is used to provide the date and time to the chatbot. The module works offline, as it works on the data received from the device on which it runs. We have implemented this in our project so that the user can ask the device for the time and date whenever needed; the data from this module is passed to another module (pyttsx3) to be spoken aloud.

Web Browser – The user can browse the web using only voice commands given to the chatbot. We have configured this module in such a way that it can be used to gain information, play music (via an API to access YouTube), provide a weather report (via an API to access The Weather Channel), and get news updates (via an API to access the Times of India); to get information on various topics, we have also linked it with the Wikipedia module.

The application is designed to capture preview frames at a resolution of 800*600px. The preview frame, if horizontal in orientation, is rotated vertically and cropped to 400*300px, which removes the background and retains only the human body. This image is then rescaled to 112*112px to be used as input for the MobileFaceNet model. On feeding the image, the model looks for a face by matching facial features; when it detects a face, it creates a bounding box and highlights it. The number of faces detected in the frame is then output to the user orally using the TTS functionality. This can be useful for the user to know the number of people in a room or a certain place. Face detection requires only the smartphone's camera access.
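The frame preprocessing described above (800*600 preview, 400*300 crop, 112*112 model input) can be sketched as follows. A nearest-neighbour resize on a plain 2-D list stands in for the app's actual image code, and the centred crop offsets are an assumption for illustration.

```python
# Sketch of the frame preprocessing described above: an 800*600 preview
# frame is cropped to 400*300 and rescaled to the 112*112 input that
# MobileFaceNet expects. Nearest-neighbour resize on a plain 2-D list
# stands in for the app's image code; the centred crop is an assumption.

def center_crop(img, out_w, out_h):
    """Crop an out_w x out_h region from the middle of img (rows of pixels)."""
    h, w = len(img), len(img[0])
    top, left = (h - out_h) // 2, (w - out_w) // 2
    return [row[left:left + out_w] for row in img[top:top + out_h]]

def resize_nearest(img, out_w, out_h):
    """Nearest-neighbour rescale of img to out_w x out_h."""
    h, w = len(img), len(img[0])
    return [[img[y * h // out_h][x * w // out_w] for x in range(out_w)]
            for y in range(out_h)]

frame = [[0] * 800 for _ in range(600)]       # 800*600 preview frame
cropped = center_crop(frame, 400, 300)        # 400*300 region of interest
model_in = resize_nearest(cropped, 112, 112)  # 112*112 MobileFaceNet input
print(len(model_in), len(model_in[0]))        # 112 112
```

Cropping before rescaling keeps the subject large in the 112*112 input, which is why the app discards the background first rather than shrinking the whole frame.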