On Sign Language Detection
On Sign Language Detection
Problem Definition
The lack of efficient and accessible tools for real-time sign language
recognition poses a significant communication barrier for individuals with
hearing impairments, hindering their ability to interact seamlessly in
various social and professional settings. Current solutions often lack
accuracy, limited in language support, and are not easily deployable for
widespread use.
To address this issue, this project aims to develop a comprehensive real-
time sign language recognition system capable of accurately interpreting
simple signs, including greetings, in Marathi, Hindi, and English. The
system will leverage advanced deep learning techniques to achieve robust
recognition performance across multiple languages and will be designed
for easy integration into daily communication scenarios.
3
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with year proposed work)
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with year proposed work)
3 • M. Zhang, "Deep Learning- The proposed system However, recent
• S. Yang Based Standard demonstrates superior advancements in hand
• M. Zhao, Sign Language performance compared to gesture recognition
Discrimination,“ existing methods when systems, effectively
2023 evaluated on a challenging addressing the
dataset, indicating its complexities of hand
effectiveness in dynamic segmentation, local
hand gesture recognition. feature representation,
global body
configuration
representation, and
gesture sequence
modeling remains a
challenge.
6
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with proposed work)
year
4 • M. "Exploring The proposed deep learning Even though, there are deep
Boonda LSTM and CNN architecture, combining CNN learning frameworks,
mnoen, Architectures for feature extraction and particularly utilizing
• K. for Sign LSTM for spatio-temporal convolutional neural
Thongsri, Language information capture, networks (CNN) and long
• T. Translation," demonstrates promising short term memory (LSTM),
Sahaban 2023 results for sign language to accurately recognize and
toegnsin recognition, particularly in the interpret sign language
• K. context of the Indian Sign gestures continues to be an
Woraratp Language (ISL) dataset. The open research challenge.
anya, scarcity of research utilizing
deep learning models to
effectively capture temporal
information underscores the
significance of this study in
addressing a pertinent open
problem in the field of sign
language recognition.
7
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with proposed work)
year
5 • B. G. J. "Sinhala Sign The proposed work aims to This gap presents an open
Gamage, Language address the gap in existing problem that motivates the
• R. P. S. Translation literature by developing a current research. The
D. through comprehensive system research aims to integrate
Paranag Immersive 3D specifically tailored for SSL various aspects, including
ama, Avatars and users in Sri Lanka. This learning methodologies,
• R. M. S. Adaptive system will integrate various dynamic sign detection,
H. Learning" 2023 components such as learning audio/video to sign
Ranawee modules, dynamic sign conversion, and vocal
ra, detection, audio/video to sign training, into a unified
• A. V. R. conversion, and vocal platform. Such a platform
Dilshan training into a unified would serve to facilitate
platform. The goal is to effective communication and
facilitate effective language development
communication and language within the SSL community in
development for the SSL Sri Lanka.
community.
8
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with year proposed work)
10 Abhishek Real Time Sign The application predicts Through transfer learning, it
Wahane Language these sentences and words attains an accuracy of
Recognition in real time, with an average 89.91% with Google's
using Deep accuracy of 75% each word. Inception v3. The system's
Learning A dual-model system that effectiveness and accuracy
Techniques operates a machine learning in recognizing and
model using the user's 2D- interpreting ASL sentences
Year -2020 Pose coordinates at runtime and gestures are ensured by
and recognizes gestures the fact that the application's
JOURNAL using the Single Shot machine learning and deep
Multibox Detector (SSD).In learning models are all
addition, the program may trained on a special dataset
generate custom phrases by created specifically for this
real-time ASL alphabet purpose.
recognition.
13
Software/Tools Requirements
Software requirements:-
• Python 3
Hardware requirements:-
Architecture / Design
MODELS
• CNN - Convolutional Neural Network
IMAGE
21
Implementation
REAL-TIME
22
Timeline
Abstract • <00/02/2024>
Submission
Literature • <20/02/2024>
Survey
Design • <10-03-2024>
Implementation • <00-03-2024>
Testing • <00-04-2024>
Document • <Date of
Submission Completion>
23
References
[1] A. S Sushmitha Urs, V. B Raj, P. S, P. Kumar K, M. B R and V. Kumar S, "Action
Detection for Sign Language Using Machine Learning," 2023 International
Conference on Network, Multimedia and Information Technology (NMITCON),
Bengaluru, India, 2023, pp. 1-6, doi: 10.1109/NMITCON58196.2023.10275950.
[2] K. S. Vikash, K. Jayakrishnan, S. Ramanathan, G. Rohith and V. Hanumara, "An
approach to Generation of sentences using Sign Language Detection," 2023
International Conference on Signal Processing, Computation, Electronics, Power and
Telecommunication (IConSCEPT), Karaikal, India, 2023, pp. 1-6, doi:
10.1109/IConSCEPT57958.2023.10170218.
[3] Y. Zhang, L. Long, D. Shi, H. He and X. Liu, "Research and Improvement of
Chinese Sign Language Detection Algorithm Based on YOLOv5s," 2022 2nd
International Conference on Networking, Communications and Information
Technology (NetCIT), Manchester, United Kingdom, 2022, pp. 577-581, doi:
10.1109/NetCIT57419.2022.00137.
24
References
[4] M. Zhang, S. Yang and M. Zhao, "Deep Learning-Based Standard Sign Language
Discrimination," in IEEE Access, vol. 11, pp. 125822-125834, 2023, doi:
10.1109/ACCESS.2023.3330863.
[5] M. Boondamnoen, K. Thongsri, T. Sahabantoegnsin and K. Woraratpanya,
"Exploring LSTM and CNN Architectures for Sign Language Translation," 2023 15th
International Conference on Information Technology and Electrical Engineering
(ICITEE), Chiang Mai, Thailand, 2023, pp. 198-203, doi:
10.1109/ICITEE59582.2023.10317660.
[6] K. Aryasa and A. Rusydi, "Design and Build a Sign Language Detection
Application with Tensorflow Object Detection and SSD Mobilenet V2," 2023 5th
International Conference on Cybernetics and Intelligent System (ICORIS),
Pangkalpinang, Indonesia, 2023, pp. 1-5, doi: 10.1109/ICORIS60118.2023.10352247.
U !
Y O
N K
H A
T