
Sign Language Recognition Using Convolutional Neural Networks

Lionel Pigou, Sander Dieleman, Pieter-Jan Kindermans, and Benjamin Schrauwen

ELIS, Ghent University, Ghent, Belgium
[email protected]

Abstract. There is an undeniable communication problem between the
Deaf community and the hearing majority. Innovations in automatic sign
language recognition try to tear down this communication barrier. Our
contribution considers a recognition system using the Microsoft Kinect,
convolutional neural networks (CNNs) and GPU acceleration. Instead
of constructing complex handcrafted features, CNNs are able to auto-
mate the process of feature construction. We are able to recognize 20
Italian gestures with high accuracy. The predictive model is able to gen-
eralize on users and surroundings not occurring during training with a
cross-validation accuracy of 91.7%. Our model achieves a mean Jaccard
Index of 0.789 in the ChaLearn 2014 Looking at People gesture spotting
competition.

Keywords: Convolutional neural network · Deep learning · Gesture recognition · Sign language recognition

1 Introduction
Very few people understand sign language. Moreover, contrary to popular belief,
it is not an international language. Obviously, this further complicates commu-
nication between the Deaf community and the hearing majority. The alternative
of written communication is cumbersome, because the Deaf community is gen-
erally less skilled in writing a spoken language [17]. Furthermore, this type of
communication is impersonal and slow in face-to-face conversations. For exam-
ple, when an accident occurs, it is often necessary to communicate quickly with
the emergency physician where written communication is not always possible.
The purpose of this work is to contribute to the field of automatic sign
language recognition. We focus on the recognition of the signs or gestures. There
are two main steps in building an automated recognition system for human
actions in spatio-temporal data [15]. The first step is to extract features from
the frame sequences. This will result in a representation consisting of one or
more feature vectors, also called descriptors. This representation will aid the
computer to distinguish between the possible classes of actions. The second step
is the classification of the action. A classifier will use these representations to
discriminate between the different actions (or signs). In our work, the feature
extraction is automated by using convolutional neural networks (CNNs). An
artificial neural network (ANN) is used for classification.

© Springer International Publishing Switzerland 2015
L. Agapito et al. (Eds.): ECCV 2014 Workshops, Part I, LNCS 8925, pp. 572–578, 2015.
DOI: 10.1007/978-3-319-16178-5_40

2 Related Work
In our work, we build on the results of Roel Verschaeren [18]. He proposes a CNN
model that recognizes a set of 50 different signs in the Flemish Sign Language
with an error of 2.5%, using the Microsoft Kinect. Unfortunately, this work is
limited in the sense that it considers only a single person in a fixed environment.
In [19], an American Sign Language recognition system is presented with a vocabulary
of 30 words. The authors construct appearance-based representations and a hand
tracking system, which are classified with a hidden Markov model (HMM). An error
rate of 10.91% is achieved on the RWTH-BOSTON-50 database.
The approach in [4] uses the Microsoft Kinect to extract appearance-based
hand features and track the position in 2D and 3D. The classification results are
obtained by comparing a hidden Markov model (HMM) approach with sequential
pattern boosting (SP-boosting). This resulted in an accuracy of 99.9% on 20
different isolated gestures on their specifically constructed data set and 85.1%
on a more realistic one with 40 gestures.
The Microsoft Kinect is also used in [2], which proposes a recognition system
for 239 words of the Chinese Sign Language (CSL). Here, the 3D movement
trajectories of the hands are used, together with a language model, to construct
sentences. Each trajectory is aligned and matched with a gallery of known trajectories.
The top-1 and top-5 recognition rates are 83.51% and 96.32%, respectively.

Fig. 1. Data set for the CLAP14 gesture spotting challenge [5]: (a) RGB, (b) depth map, (c) user index, (d) skeleton

3 Methodology
3.1 Data
We use the data set from the ChaLearn Looking at People 2014 [5] (CLAP14)
challenge in this work. More specifically, Track 3: Gesture Spotting. This dataset
consists of 20 different Italian gestures, performed by 27 users with variations in
surroundings, clothing, lighting and gesture movement. The videos are recorded
with a Microsoft Kinect. As a result, we have access to the depth map, user
index (location of the user in the depth map) and the joint positions (Figure 1).

We use 6600 gestures in the development set of CLAP14 for our experiments:
4600 for the training set and 2000 for the validation set. The test set of CLAP14
is also considered as the test set for this work and consists of 3543 samples. The
users and backgrounds in the validation set are not contained in the training
set. The users and backgrounds in the test set can occur in the training and the
validation set.

3.2 Preprocessing

Our first step in the preprocessing stage is cropping the highest hand and the
upper body using the given joint information. We discovered that the highest
hand is the most interesting. If both hands are used, they perform the same
(mirrored) movement. If one hand is used, it is always the highest one. If the left
hand is used, the videos are mirrored. This way, the model only needs to learn
one side.
The preprocessing results in four video samples (hand and body with depth
and gray-scale) of resolution 64x64x32 (32 frames of size 64x64). Furthermore,
the noise in the depth maps is reduced with thresholding, background removal
using the user index, and median filtering. The outcome is shown in Figure 2.

Fig. 2. Preprocessing: (a) original, (b) noise reduction, (c) the 4 input channels
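
The following is a minimal sketch of the noise-reduction and mirroring steps described above, using NumPy and SciPy. The depth thresholds, the naive border handling, and the function names (clean_depth, crop_and_mirror) are illustrative assumptions, not the authors' implementation.

```python
import numpy as np
from scipy.ndimage import median_filter

def clean_depth(depth, user_index, near=500, far=4000):
    """Noise reduction on one depth frame: thresholding, background removal and
    median filtering. The threshold values are illustrative, not the paper's."""
    cleaned = depth.astype(np.float32).copy()
    cleaned[(cleaned < near) | (cleaned > far)] = 0.0   # discard implausible depth values
    cleaned[~user_index] = 0.0                          # keep only pixels belonging to the user
    return median_filter(cleaned, size=3)               # suppress remaining speckle noise

def crop_and_mirror(frames, hand_xy, size=64, left_hand=False):
    """Crop a size x size window around the highest hand and mirror left-hand
    videos, so the model only has to learn one side (border handling kept naive)."""
    x, y = hand_xy
    half = size // 2
    y0, x0 = max(y - half, 0), max(x - half, 0)
    crop = frames[:, y0:y0 + size, x0:x0 + size]
    return crop[:, :, ::-1] if left_hand else crop      # horizontal mirror for left-hand gestures
```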

3.3 Convolutional Neural Network (CNN)

CNNs (based on [13]) are feature extraction models in deep learning that have
recently proven to be very successful at image recognition [12], [3], [20], [7]. As of
now, the models are in use by various industry leaders like Google, Facebook and
Amazon, and researchers at Google recently applied CNNs to video data [11].
CNNs are inspired by the visual cortex of the human brain. The artificial
neurons in a CNN will connect to a local region of the visual field, called a
receptive field. This is accomplished by performing discrete convolutions on the
image with filter values as trainable weights. Multiple filters are applied for each
channel, and together with the activation functions of the neurons, they form
feature maps. This is followed by a pooling scheme, in which only the most salient
information in each feature map is retained. These techniques are applied in
multiple layers, as shown in Figure 3.
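
As an illustration of these mechanics, the sketch below builds a single feature map by discrete 2D cross-correlation with a filter whose values act as trainable weights, applies a ReLU, and pools with a 2x2 maximum. The filter values and sizes are placeholders; in the actual model the weights are learned by backpropagation.

```python
import numpy as np
from scipy.signal import correlate2d

def feature_map(image, kernel):
    """One feature map: discrete 2D cross-correlation followed by a ReLU."""
    response = correlate2d(image, kernel, mode='valid')
    return np.maximum(response, 0.0)                      # ReLU activation

def max_pool_2x2(fmap):
    """Keep only the maximum value in each non-overlapping 2x2 neighborhood."""
    h, w = fmap.shape[0] // 2 * 2, fmap.shape[1] // 2 * 2
    blocks = fmap[:h, :w].reshape(h // 2, 2, w // 2, 2)
    return blocks.max(axis=(1, 3))

# Toy example: a random 64x64 "frame" and a 5x5 filter with placeholder weights.
frame = np.random.rand(64, 64)
kernel = np.random.randn(5, 5) * 0.04                     # small random initial weights
pooled = max_pool_2x2(feature_map(frame, kernel))         # resulting shape: (30, 30)
```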

3.4 Proposed Architecture


For the pooling method, we use max-pooling: only the maximum value in a local
neighborhood of the feature map remains. To accommodate video data, the
max-pooling is performed in three dimensions. However, using 2D convolutions
resulted in a better validation accuracy than 3D convolutions.
The architecture of the model consists of two CNNs, one for extracting hand
features and one for extracting upper body features. Each CNN is three layers
deep. A classical ANN with one hidden layer provides classification after concate-
nating the outcomes of both CNNs. Also, local contrast normalization (LCN)
as in [10] is applied in the first two layers and all artificial neurons are rectified
linear units (ReLUs [14], [6]). An illustration of the architecture is depicted in
Figure 3.

Fig. 3. The architecture of the deep learning model
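
The sketch below mirrors the structure in Figure 3, using PyTorch as a stand-in for the authors' Theano/Pylearn2 implementation. The filter counts, the hidden-layer size, the frame-wise 2D convolutions followed by 3D max-pooling, and the extra "no gesture" class (from Section 3.6) are assumptions to make the sketch concrete; LCN is omitted for brevity. It is an approximation of the published model, not a reproduction.

```python
import torch
import torch.nn as nn

class VideoCNN(nn.Module):
    """One of the two three-layer CNN branches (hand or upper body); filter counts are assumed."""
    def __init__(self, in_channels=2):                     # depth + gray-scale channels
        super().__init__()
        def block(c_in, c_out):
            return nn.Sequential(
                nn.Conv3d(c_in, c_out, kernel_size=(1, 5, 5), padding=(0, 2, 2)),  # 2D convs applied per frame
                nn.ReLU(),
                nn.MaxPool3d(kernel_size=2),               # 3D max-pooling over time and space
            )
        self.layers = nn.Sequential(block(in_channels, 16), block(16, 32), block(32, 64))

    def forward(self, x):                                  # x: (batch, channels, 32, 64, 64)
        return self.layers(x).flatten(start_dim=1)

class GestureClassifier(nn.Module):
    """Two CNN branches concatenated, followed by an ANN with one hidden layer."""
    def __init__(self, n_classes=21):                      # 20 gestures + 1 "no gesture" class
        super().__init__()
        self.hand, self.body = VideoCNN(), VideoCNN()
        feat = 64 * 4 * 8 * 8 * 2                          # per-branch feature size after pooling, two branches
        self.classifier = nn.Sequential(nn.Linear(feat, 512), nn.ReLU(), nn.Linear(512, n_classes))

    def forward(self, hand_clip, body_clip):
        features = torch.cat([self.hand(hand_clip), self.body(body_clip)], dim=1)
        return self.classifier(features)
```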

3.5 Generalization and Training


During training, dropout [9] and data augmentation are used as the main approaches
to reduce overfitting. The data augmentation is performed in real time on the
CPU during the training phase while the model trains on the GPU, as in [12].
It consists of zooming up to 10%, rotations up to ±3◦, spatial translations up
to ±5 pixels in the x and y direction, and temporal translations up to ±4
frames.
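
A minimal sketch of this augmentation is given below, assuming uniform sampling within the stated ranges; the sampling scheme, interpolation orders, and the function name augment_clip are assumptions.

```python
import numpy as np
from scipy.ndimage import rotate, shift, zoom

def augment_clip(clip, rng=np.random.default_rng()):
    """Randomly zoom, rotate, and translate a (frames, height, width) clip.

    Ranges follow the text (10% zoom, +/-3 degrees, +/-5 px, +/-4 frames);
    uniform sampling within those ranges is an assumption."""
    # Spatial zoom up to 10% (same factor for every frame, spatial axes only).
    factor = rng.uniform(1.0, 1.1)
    clip = zoom(clip, (1, factor, factor), order=1)[:, :64, :64]

    # Rotation up to +/-3 degrees in the spatial plane.
    clip = rotate(clip, rng.uniform(-3, 3), axes=(1, 2), reshape=False, order=1)

    # Spatial translation up to +/-5 pixels, temporal translation up to +/-4 frames.
    dt, dy, dx = rng.integers(-4, 5), rng.integers(-5, 6), rng.integers(-5, 6)
    return shift(clip, (dt, dy, dx), order=0, mode='nearest')
```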
We use Nesterov’s accelerated gradient descent (NAG) [16] with a fixed
momentum-coefficient of 0.9 and mini-batches of size 20. The learning rate is
initialized at 0.003 with a 5% decrease after each epoch. The weights of the
CNNs are randomly initialized with a normal distribution with μ = 0 and
σ = 0.04, and σ = 0.02 for the weights of the ANN. The biases of the CNNs are
initialized at 0.2 and the biases of the ANN at 0.1.
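
The update rule, initialization, and learning rate schedule can be sketched as follows. This is a generic NAG formulation with the stated hyperparameters, not the authors' Theano code.

```python
import numpy as np

rng = np.random.default_rng()

def init_weights(shape, sigma):
    """Weights drawn from N(0, sigma): sigma = 0.04 for the CNNs, 0.02 for the ANN."""
    return rng.normal(0.0, sigma, size=shape)

def init_biases(n, value):
    """Biases initialized at a constant: 0.2 for the CNNs, 0.1 for the ANN."""
    return np.full(n, value)

def nag_step(params, velocity, grad_fn, lr, momentum=0.9):
    """One Nesterov accelerated gradient update; grad_fn returns the gradient at a point."""
    lookahead = params + momentum * velocity          # evaluate the gradient ahead of the momentum step
    velocity = momentum * velocity - lr * grad_fn(lookahead)
    return params + velocity, velocity

def learning_rate(epoch, base_lr=0.003, decay=0.95):
    """Learning rate schedule: start at 0.003 and decrease by 5% after each epoch."""
    return base_lr * decay ** epoch
```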
Experiments are conducted on one machine with a hexa-core processor (Intel
Core i7-3930K), 32 GB of SDRAM and an NVIDIA GeForce GTX 680 GPU with
4096 MB of memory. The models are implemented using the Python libraries
Theano [1] and Pylearn2 [8], the latter providing the fast implementation of 2D
convolutions by Alex Krizhevsky [12].

3.6 Temporal Segmentation


The CLAP14 challenge consists of spotting gestures in video samples. Each video
sample is an unedited recording of a user signing 10 to 20 gestures, including
noise movements that are not part of the 20 Italian gestures. The goal of the
temporal segmentation method is to predict the begin and end frames of every
gesture in the video samples.
We use a sliding-window technique, in which each possible interval of 32
frames is evaluated with the trained model described above. Consecutive
intervals with identical classes and a sufficiently high classification probability
(thresholding) are considered a gesture segment. The validation set of CLAP14
is used to optimize the thresholding parameters. Furthermore, an extra class is
added to the classifier to help identify video intervals without a gesture.
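
A sketch of this spotting procedure is shown below; the threshold value, the handling of segment boundaries, and the function name segment_gestures are assumptions.

```python
import numpy as np

def segment_gestures(frame_probs, threshold=0.8, window=32):
    """Hypothetical sliding-window gesture spotting.

    frame_probs: (n_windows, n_classes) class probabilities, one row per 32-frame
                 interval starting at that frame; the last class is "no gesture".
    Returns (label, start_frame, end_frame) tuples; the threshold value is assumed."""
    labels = frame_probs.argmax(axis=1)
    confident = frame_probs.max(axis=1) >= threshold
    no_gesture = frame_probs.shape[1] - 1

    segments, start = [], None
    for i, (label, ok) in enumerate(zip(labels, confident)):
        if ok and label != no_gesture and start is not None and label == labels[start]:
            continue                                       # extend the current run
        if start is not None:                              # the current run ends here
            segments.append((labels[start], start, i - 1 + window))
            start = None
        if ok and label != no_gesture:
            start = i                                      # begin a new run
    if start is not None:                                  # close a run reaching the end of the video
        segments.append((labels[start], start, len(labels) - 1 + window))
    return segments
```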

Table 1. Validation results

                           Error rate (%)   Improvement over previous (%)
Tanh units                 18.90
ReLU                       14.40            23.8
+ dropout                  11.90            17.4
+ LCN (first 2 layers)     10.30            13.4
+ data augmentation         8.30            19.4

4 Results
Our most notable experiments are the models with ReLUs, dropout, LCN and
data augmentation. The validation results of these experiments are shown in
Table 1. We observe a validation accuracy of 91.70% (8.30% error rate) for our
best model. Furthermore, ReLUs prove to be very effective with an improvement
of 23.8% with respect to tanh units.
The accuracy on the test set is 95.68%, and we observe a 4.13% false positive
rate caused by the noise movements. Note that the test result is higher than the
validation result, because the users and backgrounds in the validation set do not
occur in the training set, whereas those in the test set may.

The final score for the CLAP14 competition is the mean Jaccard Index over all
gestures and video samples. The Jaccard Index measures the overlap in frames
between the predicted and ground-truth gesture intervals. The validation score of
our best model is 0.789675 and the final score is 0.788804, which ranks us fifth
out of the 17 qualified teams.
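
For concreteness, a frame-level Jaccard Index can be computed as below; the example interval is made up and the function name is hypothetical.

```python
def jaccard_index(pred_frames, true_frames):
    """Frame-level Jaccard Index: |intersection| / |union| of the predicted and
    ground-truth frame sets for one gesture in one video sample."""
    pred, true = set(pred_frames), set(true_frames)
    union = pred | true
    return len(pred & true) / len(union) if union else 0.0

# Example: a gesture annotated on frames 100-139, predicted on frames 105-144.
score = jaccard_index(range(105, 145), range(100, 140))    # 35 / 45 ~= 0.78
```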

5 Conclusion

This work shows that convolutional neural networks can be used to accurately
recognize different signs of a sign language, with users and surroundings not
included in the training set. This generalization capacity of CNNs in spatio-
temporal data can contribute to the broader research field of automatic sign
language recognition.

References
1. Bergstra, J., Breuleux, O., Bastien, F., Lamblin, P., Pascanu, R., Desjardins, G.,
Turian, J., Warde-Farley, D., Bengio, Y.: Theano: a CPU and GPU math expres-
sion compiler. In: Proceedings of the Python for Scientific Computing Conference
(SciPy), June 2010. Oral presentation
2. Chai, X., Li, G., Lin, Y., Xu, Z., Tang, Y., Chen, X., Zhou, M.: Sign Language
Recognition and Translation with Kinect (2013). http://vipl.ict.ac.cn/sites/default/
files/papers/files/2013 FG xjchai Sign Language Recognition and Translation with
Kinect.pdf
3. Cireşan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for
image classification. In: IEEE Conference on Computer Vision and Pattern Recog-
nition (CVPR), pp. 3642–3649. IEEE (2012)
4. Cooper, H., Ong, E.J., Pugeault, N., Bowden, R.: Sign language recognition using
sub-units. The Journal of Machine Learning Research 13(1), 2205–2231 (2012)
5. Escalera, S., Baró, X., Gonzàlez, J., Bautista, M.A., Madadi, M., Reyes, M., Ponce,
V., Escalante, H.J., Shotton, J., Guyon, I.: ChaLearn Looking at People Challenge
2014: dataset and results. In: ECCV Workshops (2014)
6. Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In:
Proceedings of the 14th International Conference on Artificial Intelligence and
Statistics (AISTATS), vol. 15, pp. 315–323 (2011). http://eprints.pascal-network.org/archive/00008596/
7. Goodfellow, I.J., Bulatov, Y., Ibarz, J., Arnoud, S., Shet, V.: Multi-digit number
recognition from street view imagery using deep convolutional neural networks
(2013). arXiv preprint arXiv:1312.6082
8. Goodfellow, I.J., Warde-Farley, D., Lamblin, P., Dumoulin, V., Mirza, M., Pascanu,
R., Bergstra, J., Bastien, F., Bengio, Y.: Pylearn2: a machine learning research
library (2013). arXiv preprint arXiv:1308.4214. http://arxiv.org/abs/1308.4214
9. Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R.:
Improving neural networks by preventing co-adaptation of feature detectors (2012).
arXiv preprint arXiv:1207.0580
10. Jarrett, K., Kavukcuoglu, K.: What is the best multi-stage architecture for object
recognition? In: IEEE 12th International Conference on Computer Vision, pp.
2146–2153 (2009). http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5459469
11. Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Large-
scale video classification with convolutional neural networks. In: CVPR (2014)
12. Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep con-
volutional neural networks. In: Advances in Neural Information Processing Systems,
pp. 1–9 (2012). http://books.nips.cc/papers/files/nips25/NIPS2012_0534.pdf
13. Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to
document recognition. Proceedings of the IEEE 86(11) (1998)
14. Nair, V., Hinton, G.E.: Rectified linear units improve restricted boltzmann
machines. In: Proceedings of the 27th International Conference on Machine Learn-
ing (ICML 2010), pp. 807–814 (2010)
15. Poppe, R.: A survey on vision-based human action recognition. Image and Vision
Computing 28(6), 976–990 (2010)
16. Sutskever, I., Martens, J., Dahl, G., Hinton, G.: On the importance of initializa-
tion and momentum in deep learning. In: Proceedings of the 30th International
Conference on Machine Learning (ICML 2013), pp. 1139–1147 (2013)
17. Van Herreweghe, M.: Prelinguaal dove jongeren en Nederlands: een syntactisch
onderzoek [Prelingually deaf youngsters and Dutch: a syntactic study]. Universiteit
Gent, Faculteit Letteren en Wijsbegeerte (1996)
18. Verschaeren, R.: Automatische herkenning van gebaren met de Microsoft Kinect
[Automatic recognition of gestures with the Microsoft Kinect] (2012)
19. Zaki, M.M., Shaheen, S.I.: Sign language recognition using a combination of new
vision based features. Pattern Recognition Letters 32(4), 572–577 (2011)
20. Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional neural net-
works (2013). arXiv preprint arXiv:1311.2901
