0% found this document useful (0 votes)

116 views25 pages

On Sign Language Detection

Detection of sign language using Deep learning models

Uploaded by

Bhavyasree Nara

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

116 views25 pages

On Sign Language Detection

Detection of sign language using Deep learning models

Uploaded by

Bhavyasree Nara

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 25

1

Problem Definition
The lack of efficient and accessible tools for real-time sign language
recognition poses a significant communication barrier for individuals with
hearing impairments, hindering their ability to interact seamlessly in
various social and professional settings. Current solutions often lack
accuracy, limited in language support, and are not easily deployable for
widespread use.
To address this issue, this project aims to develop a comprehensive real-
time sign language recognition system capable of accurately interpreting
simple signs, including greetings, in Marathi, Hindi, and English. The
system will leverage advanced deep learning techniques to achieve robust
recognition performance across multiple languages and will be designed
for easy integration into daily communication scenarios.
3
Literature Survey

S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with year proposed work)

1 • A. S "Action Detection It acknowledges the Developing robust and

Sushmitha for Sign Language challenges of efficient automated
Urs, Using Machine misinterpretation and systems capable of
• V. B Raj, P. Learning," 2023 language variations. It accurately recognizing
S, proposes a solution and interpreting
• P. Kumar utilizing automation for sign diverse sign signals
• K, M. B R V. recognition, enabling users from images or video
Kumar S to capture hand signals inputs remains an
through a camera and open problem in the
receive accurate field.
identification of the signs,
thereby enhancing
accessibility and inclusivity.
4
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with year proposed work)
2 • K. S. "An approach to By employing real-time Despite
Vikash, Generation of video processing and the advancements in sign
• K. sentences using SSD MobileNet V2 model language recognition
Jayakrishna Sign Language for sign language using computer vision
n, Detection," 2023 detection, coupled with and machine learning,
• S. TensorFlow Object achieving seamless
Ramanatha Detection API, the system real-time
n, demonstrates high communication
• G. Rohith accuracy and efficiency in between sign
• V. recognizing and translating language users and
Hanumara, sign language into text in non-users remains a
real-time scenarios. This challenge. Current
advancement holds systems may lack the
significant potential in ability to accurately
facilitating smoother interpret nuances in
interactions between the sign language, leading
signing and non-signing to potential
communities. miscommunications.
5
Literature Survey

S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with year proposed work)
3 • M. Zhang, "Deep Learning- The proposed system However, recent
• S. Yang Based Standard demonstrates superior advancements in hand
• M. Zhao, Sign Language performance compared to gesture recognition
Discrimination,“ existing methods when systems, effectively
2023 evaluated on a challenging addressing the
dataset, indicating its complexities of hand
effectiveness in dynamic segmentation, local
hand gesture recognition. feature representation,
global body
configuration
representation, and
gesture sequence
modeling remains a
challenge.
6
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with proposed work)
year
4 • M. "Exploring The proposed deep learning Even though, there are deep
Boonda LSTM and CNN architecture, combining CNN learning frameworks,
mnoen, Architectures for feature extraction and particularly utilizing
• K. for Sign LSTM for spatio-temporal convolutional neural
Thongsri, Language information capture, networks (CNN) and long
• T. Translation," demonstrates promising short term memory (LSTM),
Sahaban 2023 results for sign language to accurately recognize and
toegnsin recognition, particularly in the interpret sign language
• K. context of the Indian Sign gestures continues to be an
Woraratp Language (ISL) dataset. The open research challenge.
anya, scarcity of research utilizing
deep learning models to
effectively capture temporal
information underscores the
significance of this study in
addressing a pertinent open
problem in the field of sign
language recognition.
7
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with proposed work)
year

5 • B. G. J. "Sinhala Sign The proposed work aims to This gap presents an open
Gamage, Language address the gap in existing problem that motivates the
• R. P. S. Translation literature by developing a current research. The
D. through comprehensive system research aims to integrate
Paranag Immersive 3D specifically tailored for SSL various aspects, including
ama, Avatars and users in Sri Lanka. This learning methodologies,
• R. M. S. Adaptive system will integrate various dynamic sign detection,
H. Learning" 2023 components such as learning audio/video to sign
Ranawee modules, dynamic sign conversion, and vocal
ra, detection, audio/video to sign training, into a unified
• A. V. R. conversion, and vocal platform. Such a platform
Dilshan training into a unified would serve to facilitate
platform. The goal is to effective communication and
facilitate effective language development
communication and language within the SSL community in
development for the SSL Sri Lanka.
community.
8
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with year proposed work)

6 • S. Patil, Conversion of The paper addresses the The system's effectiveness is

S. Indian Sign communication challenges underscored by its impressive
Gulave, V. Language to faced by individuals with accuracy of up to 98%,
Gawai, P. Speech by Using hearing and speaking demonstrating the potential of
Gode and Deep Neural disabilities by proposing a neural networks in bridging
P. Network system to convert Indian Sign communication gaps for
Mudme, Language (ISL) into speech individuals reliant on sign
Year- 2022 through deep neural networks. language.
Acknowledging the widespread
JOURNAL difficulty in interpreting sign
language among the general
population, the study
emphasizes the necessity of
eliminating the reliance on
interpreters for communication
with mute and deaf individuals.
on steps facilitate the
interpretation of hand signs,
with speech synthesis
achieved through the Google
text-to-speech API.
9
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with proposed work)
year

7 • Z. Deep Learning- The study addresses the Training datasets, sourced

Mustaffa, Based challenges faced by from Kaggle and personal
N. A. Technique for individuals in learning Malay camera captures, are
Farihin Sign Language Sign Language (MSL) by meticulously divided for
Mohd Detection presenting an object robust training, validation,
Zulkifli, detection model tailored to and testing phases. The
M. H. Year- 2023 detect MSL signs in real study's successful
Sulaiman time. Highlighting the development of a real-time
, F. JOURNAL significance of sign and accurate MSL
Ernawan languages for the deaf and recognition system using the
and Y. A. hard-of-hearing community, SSD-Mobilenet model holds
Adam, the research focuses on promising implications for
developing a system capable enhancing communication
of recognizing static signs, access for individuals within
simplifying the learning the deaf and hard-of-hearing
process. Leveraging the community.
Single Shot Detector (SSD)
and Mobilenet architectures,
the model achieves real-time
detection accuracy.
10
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with proposed work)
year

8 B. A. Arabic Sign The paper addresses the Employing logistic

Dabwan and Language communication challenges regression for binary
M. E. Detection Using faced by the global classification and a 2-layer
Jadhav Deep Learning population of individuals who Artificial Neural Network
are deaf and mute, (ANN), the model aims to
Year -2023 numbering approximately accurately predict the
466 million. With meaning behind the
JOURNAL communication being a detected hand signs.
fundamental aspect of
human interaction, the study
highlights the importance of
providing accessible
communication methods for
this demographic.
Leveraging hand gestures
and symbols as their primary
means of communication, the
research proposes a model
for detecting and interpreting
these gestures using a Deep
Neural Network (DNN).
11
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For proposed
name(s) paper with year work)

9 • N. Hand sign The paper addresses the ResNet-34 is a convolutional

Titarmare language communication challenges neural network (CNN)
detection faced by the global population architecture that belongs to the
of individuals who are deaf and ResNet (Residual Network)
Year- 2022 mute, numbering approximately family, introduced by Microsoft
466million.. By utilizing Research in 2015. ResNet-34
JOURNAL advanced machine learning is known for its depth, featuring
techniques, the proposed 34 layers, hence the name. It
system endeavors to bridge builds upon the concept of
communication gaps and residual learning, which
empower individuals who rely enables the training of very
on hand sign language for deep networks without
communication. encountering the vanishing
gradient problem.
12
Literature Survey
S.No. Authors Full title of the Inference from the paper Open Problem (For
name(s) paper with proposed work)
year

10 Abhishek Real Time Sign The application predicts Through transfer learning, it
Wahane Language these sentences and words attains an accuracy of
Recognition in real time, with an average 89.91% with Google's
using Deep accuracy of 75% each word. Inception v3. The system's
Learning A dual-model system that effectiveness and accuracy
Techniques operates a machine learning in recognizing and
model using the user's 2D- interpreting ASL sentences
Year -2020 Pose coordinates at runtime and gestures are ensured by
and recognizes gestures the fact that the application's
JOURNAL using the Single Shot machine learning and deep
Multibox Detector (SSD).In learning models are all
addition, the program may trained on a special dataset
generate custom phrases by created specifically for this
real-time ASL alphabet purpose.
recognition.
13

Justification for the Proposed Problem

➢ The proposed problem addresses the need for inclusive communication
by developing a real-time sign language recognition system capable of
interpreting simple signs in multiple languages. This project aims to
bridge the communication gap for individuals who rely on sign
language by providing a robust and accessible tool that recognizes and
translates signs into text and audio in Marathi, Hindi, English, Telugu,
and Kannada. By leveraging deep learning techniques and real-time
integration, this system offers a practical solution to facilitate seamless
communication for the deaf and hard of hearing community.
14

Software/Tools Requirements
Software requirements:-

• Python 3

Hardware requirements:-

• Computer (CORE - iUU78IU766TY7U8I5)

Architecture / Design
MODELS
• CNN - Convolutional Neural Network

• CNNs are a class of deep neural networks commonly used

for analyzing visual imagery.
• They consist of multiple layers, including convolutional
layers, pooling layers, and fully connected layers.
• CNNs have achieved state-of-the-art performance in various
computer vision tasks, including image classification, object
detection, and image segmentation.
MODELS
• EfficientNet:

• EfficientNet is a family of convolutional neural network

architectures that aim to achieve better accuracy and
efficiency by scaling the network width, depth, and resolution
in a balanced way.
• It was introduced by Tan et al. in the paper "EfficientNet:
Rethinking Model Scaling for Convolutional Neural
Networks" in 2019.
• EfficientNet models are known for their superior performance
on various image classification tasks while being
computationally efficient compared to other architectures.
MODELS
• ResNet50

• ResNet50 is a variant of the ResNet (Residual Network)

architecture introduced by Microsoft Research in the paper
"Deep Residual Learning for Image Recognition" in 2015.
• ResNet50 specifically refers to a ResNet architecture with 50
layers, consisting of residual blocks that help mitigate the
vanishing gradient problem during training.
• It has been widely used for various computer vision tasks,
including image classification, object detection, and image
segmentation.
MODELS
• InceptionV3

• InceptionV3 is a convolutional neural network architecture

introduced by Google researchers in the paper "Rethinking the
Inception Architecture for Computer Vision" in 2016.
• It is an evolution of the original Inception architecture, designed
to improve computational efficiency and performance.
• InceptionV3 incorporates various modules such as convolutional
layers with different kernel sizes (including 1x1, 3x3, and 5x5),
max-pooling layers, and parallel concatenated operations.
• It has been widely used for image classification, object
detection, and image segmentation tasks, particularly in
scenarios where computational resources are limited.
20
Implementation

IMAGE
21

Implementation
REAL-TIME
22

Timeline
Abstract • <00/02/2024>
Submission

Literature • <20/02/2024>
Survey

Design • <10-03-2024>

Implementation • <00-03-2024>

Testing • <00-04-2024>

Document • <Date of
Submission Completion>
23

References
[1] A. S Sushmitha Urs, V. B Raj, P. S, P. Kumar K, M. B R and V. Kumar S, "Action
Detection for Sign Language Using Machine Learning," 2023 International
Conference on Network, Multimedia and Information Technology (NMITCON),
Bengaluru, India, 2023, pp. 1-6, doi: 10.1109/NMITCON58196.2023.10275950.
[2] K. S. Vikash, K. Jayakrishnan, S. Ramanathan, G. Rohith and V. Hanumara, "An
approach to Generation of sentences using Sign Language Detection," 2023
International Conference on Signal Processing, Computation, Electronics, Power and
Telecommunication (IConSCEPT), Karaikal, India, 2023, pp. 1-6, doi:
10.1109/IConSCEPT57958.2023.10170218.
[3] Y. Zhang, L. Long, D. Shi, H. He and X. Liu, "Research and Improvement of
Chinese Sign Language Detection Algorithm Based on YOLOv5s," 2022 2nd
International Conference on Networking, Communications and Information
Technology (NetCIT), Manchester, United Kingdom, 2022, pp. 577-581, doi:
10.1109/NetCIT57419.2022.00137.
24

References
[4] M. Zhang, S. Yang and M. Zhao, "Deep Learning-Based Standard Sign Language
Discrimination," in IEEE Access, vol. 11, pp. 125822-125834, 2023, doi:
10.1109/ACCESS.2023.3330863.
[5] M. Boondamnoen, K. Thongsri, T. Sahabantoegnsin and K. Woraratpanya,
"Exploring LSTM and CNN Architectures for Sign Language Translation," 2023 15th
International Conference on Information Technology and Electrical Engineering
(ICITEE), Chiang Mai, Thailand, 2023, pp. 198-203, doi:
10.1109/ICITEE59582.2023.10317660.
[6] K. Aryasa and A. Rusydi, "Design and Build a Sign Language Detection
Application with Tensorflow Object Detection and SSD Mobilenet V2," 2023 5th
International Conference on Cybernetics and Intelligent System (ICORIS),
Pangkalpinang, Indonesia, 2023, pp. 1-5, doi: 10.1109/ICORIS60118.2023.10352247.
U !
Y O
N K
H A
T

Complete Bundle Weaving It Together 4 4th Edition Broukal
No ratings yet
Complete Bundle Weaving It Together 4 4th Edition Broukal
407 pages
NLP Unit 1 and 2
No ratings yet
NLP Unit 1 and 2
106 pages
SIGNLANGUAGE PPT
100% (1)
SIGNLANGUAGE PPT
15 pages
Compiler Design Note1
No ratings yet
Compiler Design Note1
111 pages
Chapter-1 Compiler Design
100% (1)
Chapter-1 Compiler Design
13 pages
NLP Course File Notes
No ratings yet
NLP Course File Notes
71 pages
2-Regular Expressions, Text Normalization, Edit Distance
No ratings yet
2-Regular Expressions, Text Normalization, Edit Distance
42 pages
CCS369 TEXT AND SPEECH ANALYSIS - Syllabus
No ratings yet
CCS369 TEXT AND SPEECH ANALYSIS - Syllabus
4 pages
2024-25 NLP Question Bank
No ratings yet
2024-25 NLP Question Bank
4 pages
Notes of NLP - Unit-2
No ratings yet
Notes of NLP - Unit-2
23 pages
Bai601 NLP
No ratings yet
Bai601 NLP
5 pages
Project File
No ratings yet
Project File
66 pages
Unit 2 TOC ODL
No ratings yet
Unit 2 TOC ODL
42 pages
Abstract On Face Recognition Technology
60% (5)
Abstract On Face Recognition Technology
1 page
PPT08-Natural Language Processing
100% (1)
PPT08-Natural Language Processing
44 pages
Project Report - Sign Language To Text Conversion..2
No ratings yet
Project Report - Sign Language To Text Conversion..2
37 pages
Tokenization in NLP
No ratings yet
Tokenization in NLP
10 pages
AI in Healthcare Syllabus
No ratings yet
AI in Healthcare Syllabus
7 pages
Multilingual Natural Language Processing
No ratings yet
Multilingual Natural Language Processing
2 pages
HPGD3103 - Instructional Technology
No ratings yet
HPGD3103 - Instructional Technology
16 pages
306 Seminar Report
No ratings yet
306 Seminar Report
39 pages
MLP Mid Sem Merge (Raja)
No ratings yet
MLP Mid Sem Merge (Raja)
351 pages
Ccs349 Iva Record - Final
No ratings yet
Ccs349 Iva Record - Final
49 pages
ECQ 413 Seminar
No ratings yet
ECQ 413 Seminar
3 pages
CSE4022 Natural-Language-Processing ETH 1 AC41
No ratings yet
CSE4022 Natural-Language-Processing ETH 1 AC41
6 pages
Project Report GitHub
No ratings yet
Project Report GitHub
32 pages
Pattern Recognition and Anomaly Detection Lab
No ratings yet
Pattern Recognition and Anomaly Detection Lab
3 pages
A Slot Reallocation Model For Containership Schedule Adjustment
No ratings yet
A Slot Reallocation Model For Containership Schedule Adjustment
10 pages
Text and Speech CCS369-UNIT 5
No ratings yet
Text and Speech CCS369-UNIT 5
9 pages
Pointillist Measuring Customer Experience Beyond NPS Ebook
No ratings yet
Pointillist Measuring Customer Experience Beyond NPS Ebook
23 pages
Tsa Ut III Tsa Notes
No ratings yet
Tsa Ut III Tsa Notes
30 pages
NLP Unit 1
No ratings yet
NLP Unit 1
133 pages
Hand Gesture Recognition and Voice Conversion For Deaf and Dumb
No ratings yet
Hand Gesture Recognition and Voice Conversion For Deaf and Dumb
8 pages
Lesson 1: Structure of A Compiler
No ratings yet
Lesson 1: Structure of A Compiler
20 pages
TOC Unit 4 PDF
100% (1)
TOC Unit 4 PDF
23 pages
Iva Lab Manual
No ratings yet
Iva Lab Manual
34 pages
Internship Report
No ratings yet
Internship Report
13 pages
Minor Project Report Sign Language Detection
No ratings yet
Minor Project Report Sign Language Detection
34 pages
ICT4SD 2024 Paper 436
No ratings yet
ICT4SD 2024 Paper 436
15 pages
6CS4 AI Unit-5
No ratings yet
6CS4 AI Unit-5
65 pages
Sign Language Detection
No ratings yet
Sign Language Detection
5 pages
Sign Language Recognition Using Deep Learning
No ratings yet
Sign Language Recognition Using Deep Learning
6 pages
F1 Visa Interview Questionsdocx - 240506 - 125203
No ratings yet
F1 Visa Interview Questionsdocx - 240506 - 125203
6 pages
AHS 2012-13 (Final Report)
No ratings yet
AHS 2012-13 (Final Report)
88 pages
Image Caption Generator PCL
No ratings yet
Image Caption Generator PCL
19 pages
(IFIP — the International Federation for Information Processing) Asbjørn Rolstadås (Auth.), Norio Okino, Hiroyuki Tamura, Susumu Fujii (Eds.)-Advances in Production Management Systems_ Perspectives An
No ratings yet
(IFIP — the International Federation for Information Processing) Asbjørn Rolstadås (Auth.), Norio Okino, Hiroyuki Tamura, Susumu Fujii (Eds.)-Advances in Production Management Systems_ Perspectives An
483 pages
Frederic Bartlett PDF
No ratings yet
Frederic Bartlett PDF
10 pages
RES 317 Study Guide
No ratings yet
RES 317 Study Guide
34 pages
Lecture-1-Introduction To Natural Language Processing-2021
No ratings yet
Lecture-1-Introduction To Natural Language Processing-2021
46 pages
Concept Learning
No ratings yet
Concept Learning
85 pages
Paulick Cai WhitePaper 2023
No ratings yet
Paulick Cai WhitePaper 2023
76 pages
Sample Letter For Validators
No ratings yet
Sample Letter For Validators
6 pages
Sara
No ratings yet
Sara
40 pages
Ch. 8 Training and Development 2021
No ratings yet
Ch. 8 Training and Development 2021
42 pages
Solutions To NLP I Mid Set A
100% (1)
Solutions To NLP I Mid Set A
8 pages
CS6007 Information Retrieval
No ratings yet
CS6007 Information Retrieval
8 pages
Mefa R19 - Unit-1
No ratings yet
Mefa R19 - Unit-1
30 pages
Organizational Commitment Challenges
No ratings yet
Organizational Commitment Challenges
23 pages
4.theories of Second 15
No ratings yet
4.theories of Second 15
59 pages
Sign Language Recognition Synopsis
No ratings yet
Sign Language Recognition Synopsis
4 pages
Untitled
No ratings yet
Untitled
18 pages
Model Question Paper
0% (1)
Model Question Paper
2 pages
Inbound 1560510999830898000
No ratings yet
Inbound 1560510999830898000
14 pages
Deep Learning r18 Jntuh Lab Manual
No ratings yet
Deep Learning r18 Jntuh Lab Manual
20 pages
Me Chapter 4 Demand Estimation Covid 19 Student Version
No ratings yet
Me Chapter 4 Demand Estimation Covid 19 Student Version
18 pages
Jurnal Keratitis 2
No ratings yet
Jurnal Keratitis 2
12 pages
Staphylococcus Aureus On Patients
No ratings yet
Staphylococcus Aureus On Patients
10 pages
Blackbook
No ratings yet
Blackbook
35 pages
Improving System Performance in Homogeneous Multicore Systems
No ratings yet
Improving System Performance in Homogeneous Multicore Systems
6 pages
NLP End Sem Paper - Evaluation Scheme
No ratings yet
NLP End Sem Paper - Evaluation Scheme
14 pages
Analysis of Impact of Technology On Relationship Marketing
No ratings yet
Analysis of Impact of Technology On Relationship Marketing
10 pages
Foundations of Applied Statistical Methods - 2nd Edition Reference Book Download
100% (19)
Foundations of Applied Statistical Methods - 2nd Edition Reference Book Download
15 pages
Synopsis WS
No ratings yet
Synopsis WS
11 pages
Sumit Barad CV Resume
No ratings yet
Sumit Barad CV Resume
1 page
Synopsis PPT Template
No ratings yet
Synopsis PPT Template
13 pages
Software Requirements Specification: COMSATS University Islamabad, COMSATS Road, Off GT Road, Sahiwal, Pakistan
No ratings yet
Software Requirements Specification: COMSATS University Islamabad, COMSATS Road, Off GT Road, Sahiwal, Pakistan
13 pages
Testbank For Research Methods For The Behavioral Sciences 3rd Edition Privitera SOLUTION MANUAL Solution Manual
No ratings yet
Testbank For Research Methods For The Behavioral Sciences 3rd Edition Privitera SOLUTION MANUAL Solution Manual
19 pages
Pyt02 240624 077807 Appreceipt
No ratings yet
Pyt02 240624 077807 Appreceipt
1 page
Biostatistics (SAMPLING TECHNIQUES)
No ratings yet
Biostatistics (SAMPLING TECHNIQUES)
2 pages
Choice Based Credit System
No ratings yet
Choice Based Credit System
3 pages
Sign Language and Common Gesture Using CNN
0% (1)
Sign Language and Common Gesture Using CNN
7 pages
Miranda Finale 1
No ratings yet
Miranda Finale 1
19 pages
Feasibility Report
No ratings yet
Feasibility Report
12 pages
Chem PSP Here
No ratings yet
Chem PSP Here
6 pages
Deep Learning For Sign Language Recognition
No ratings yet
Deep Learning For Sign Language Recognition
4 pages
Convolution Neural Networks For Hand Gesture Recognation
No ratings yet
Convolution Neural Networks For Hand Gesture Recognation
5 pages
Indian Sign Language Recognition System
No ratings yet
Indian Sign Language Recognition System
3 pages
Sign Language Interpreter Using Computer Vision and LeNet-5 Convolutional Neural Network Architecture
No ratings yet
Sign Language Interpreter Using Computer Vision and LeNet-5 Convolutional Neural Network Architecture
4 pages
Sign Language To Text-Speech Translator Using Machine Learning
No ratings yet
Sign Language To Text-Speech Translator Using Machine Learning
5 pages
NLP Assignment 2
No ratings yet
NLP Assignment 2
2 pages
7 On Narratology
No ratings yet
7 On Narratology
25 pages
Ieee Paper
No ratings yet
Ieee Paper
5 pages
Inquiry Based Lesson Plan Template 0
No ratings yet
Inquiry Based Lesson Plan Template 0
5 pages
Textbook of Engineering Chemistry
From Everand
Textbook of Engineering Chemistry
C. Parameswara Murthy
No ratings yet