ResearchPaper3
ResearchPaper3
978-l-7281-2791-0/20/$31.00©2020IEEE 811
A newalgorithmhas been discussed in [1] to recognize and InthegivenFig 1, theimagegivenastheinput isusedtofind
trackhandgestures for better interactionwith avideo game. It theprimitivefeatureslikehorizontal andvertical lines.
is consisting of four hand gestures and four-hand direction
classesto fulfil requirements that couldhavebeenextendedto
make it more powerful. It uses segmentationandtracking [2]. PRIMITIVE
Theproposedalgorithmwasperformedon40 samplesandthe FEATURES
accuracyturnedout tobequiteimpressive. OBJECT
CLASS
Use of aconvolutional neural network to reducethefeature
extraction process and parameters being used has been
discussed in [2]. The hand gesture recognition is performed
using aconvolutional neural network but the one usedin our BUILDING
paper shows a deep convolutional neural network
implementation. Resultsshownin [3] areveryimpressivewhen
a training set of 50%of the database is used. Max Pooling
Convolutional Neural Network to advance Human- robot
interactions using color segmentation, edge blurring with
morphological digital imageprocessingandthenexperimenting
withmobilerobotsusingARM11533MHz [3]. Fig1. WorkingofaCNNModel
They manage to get anaccuracy score of around 96%. The After the primitive feature extraction phase, the next stage
vocabulary in the proposed project was up to 11 classes. It determines the part of the given object using the features
couldhavebeenusedin amilieuof human-swarminteraction. extracted. Objects parts arethenusedto interpret the class of
Driver’s hand gesture recognition via a 3D convolutional theobject.
neural net was followed [4]. It engages spatial data
augmentation techniques and pre-processing techniques for Interfaces that cannot be touched not only improves the
betterresults. driver's focus andprevent possiblemishapsbut alsomakesthe
devicesmuchmoreuser-friendly duetowhichimplementation
They achieved a score of 77% which could have been of suchtechnologiesinseveral control systemsispreferred.
improved by constructing a deeper neural network. The
challengesof a3DCNNtoperformclassificationanddetection IV. METHODOLOGY
onthegivendataset wasaddressedin [5] whichalsointroduces
a multi-modal dynamic challenging dataset and achieved an Fig2showsaconvolutional Neural Networkmodel and
accuracyof 83.8%. variouslayersinvolved.
III. PRELIMINARIES
A convolutional neural network is a deep learning neural
network class which is most appliedto images andvideos for
its analysis. It is a kind of artificial neural network using
machine learning algorithms for a unit and perceptron for
supervisedproblems.
A CNN is basically a technique, or a machine learning
model applied to images to make them interpretable by
machines. It canhaveoneor morethanoneconvolutional layer
followed by the fully connectedlayers. All types of cognitive CNNisatypeof neural networkthat isempoweredwith
tasks are performed using CNNs like Natural Language certainspecificlayers, suchas:
processing, image processing, etc. The concept of machine
learning is not a contemporary thing, the first Artificial 1. Input layer
Intelligence-based program which came into play with a 2. Hiddenlayer:
learned version of a game in which anArtificial Intelligence
programwasbuilt that understoodnatural languagefinally.
(c)
(a) (b)
(d) (e)
(c) (d) Fig6. Thresholdimages