Improved Estimation of Hand Postures Using Depth Images
Abstract—Hand pose estimation is the task of deriving a hand's articulation from sensory input, here depth images in particular. A novel approach casts pose estimation as an optimization problem: a high-dimensional hypothesis space is constructed from a hand model, in which particle swarms search for the best pose hypothesis. We propose various additions to this approach. Our extended hand model includes anatomical constraints of hand motion by applying principal component analysis (PCA). This allows us to treat pose estimation as a problem with variable dimensionality. The most important benefit becomes visible once our PCA-enhanced model is combined with biased particle swarms. Several experiments show that accuracy and performance of pose estimation improve significantly.

I. INTRODUCTION

The human hand is highly articulated. Humans use hands to manipulate objects in their surroundings and to communicate with other people. Capturing exact hand postures is an important step for Human-Robot Interaction and the development of natural interfaces. Computer vision (CV) can provide cheap and unobtrusive solutions to this problem, especially compared to data gloves.

Solving CV-based hand pose estimation without markers in single-camera setups is a very challenging task, because hands can take on vastly different shapes in images. The number of degrees of freedom (DOFs) contributes to a high-dimensional problem. The problem is further complicated by self-occlusions of the hand, which inevitably occur during the projection onto 2D images.

Following the taxonomy of Erol et al. [1], the approach discussed here belongs to the class of model-based tracking methods that follow a single hypothesis over time. In this context, single hypothesis means that only one satisfying solution is searched for and kept for the initialization of the next frame.

Significant progress in this area was made by Oikonomidis et al. [2]. They formulate pose estimation as an optimization problem. An internal hand model defines the parameters (DOFs) that make up a hand pose. This high-dimensional space is searched by a particle swarm for a suitable solution. As a particle moves, it renders an artificial depth image of its current hand pose hypothesis, which is compared to the actual observation from the Kinect. A target function is used to measure the discrepancy between rendered image and observation.

Oikonomidis et al.'s [2] method deals very well with the high dimensionality and self-occlusions of the human hand. However, their approach is still computationally demanding. They report that their algorithm can run at about 15 FPS on a high-end PC. This is only half the rate at which the Kinect provides images. Our goal was to improve the performance, possibly to the point of running in real time. At the same time we did not want to sacrifice any accuracy. We addressed this by exploiting biases in certain variants of particle swarms. We will show that the optimization behavior of these variants can be aligned with a priori knowledge about how humans perform hand motions. The result was an overall improved convergence behavior, leading to better pose estimation in less time.

The idea to use a priori information has already been applied successfully to hand pose estimation by Bianchi et al. [3]. They determined statistical properties of hand motion and used these to improve the noisy measurements of a low-cost data glove. Our method differs in the way a priori knowledge is used. We use it to transform the search space of all hand postures, such that certain variants of particle swarm optimization (PSO) perform better due to biases in their behavior. We also do not require an existing pose estimation.

This paper is organized as follows: Section II covers our image preprocessing. Its purpose is to segment images into hand and non-hand parts. Section III introduces our new hand model and how its parameter space is altered through principal component analysis. Particle swarm optimization and the target function mentioned above are covered in Section IV. We will also explain our motivation for using a PSO variant with certain biases. In Section V we detail our experiments with the new method and provide an evaluation of the data. A final discussion and an outlook for future research are given in Section VI.

II. HAND DETECTION & TRACKING

Detecting hands in images is a necessary step, because pose estimation is not capable of performing this segmentation by itself. We have separated the task into two steps: first, an initial one-time detection of hands based on depth images and shape recognition, and second, subsequent tracking of the hand region with an adaptive skin color model.

For the first step, we restrict the detection to a specific hand posture that has a distinctive shape. A hand has to be open and face the sensor, with the fingers spread out a
little. We perform foreground segmentation on depth images
to reduce the region of interest. After that, edge detection in the
foreground depth image provides a set of candidate contours.
To support classification and the ability for generalization, we
use Fourier descriptors with 12 complex-valued coefficients to represent contours. These provide desirable invariance properties against common affine transformations (e. g. scale or rotation). Furthermore, the contour information is condensed in these 12 coefficients. Finally, soft-margin support vector machines are used to separate hands from non-hands.

Fig. 1. Hand model in different configurations. It consists of two types of objects: ellipsoids (in blue) and elliptical cylinders (in green, red, orange and yellow).
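The descriptor step can be sketched as follows. This is an illustrative reconstruction, not the authors' implementation; the particular normalization scheme (dropping the DC term, dividing by the first harmonic, taking magnitudes) is one common way to obtain the invariances mentioned above and is our assumption:

```python
import numpy as np

def fourier_descriptor(contour_xy, n_coeffs=12):
    """Translation-, scale- and rotation-invariant Fourier descriptor
    of a closed 2D contour given as an (N, 2) array of points."""
    # Treat contour points as complex numbers x + iy.
    z = contour_xy[:, 0] + 1j * contour_xy[:, 1]
    coeffs = np.fft.fft(z)
    # Dropping the DC term (coeffs[0]) removes translation; keep the
    # n_coeffs lowest non-DC frequencies (positive and negative).
    kept = np.concatenate([coeffs[1:1 + n_coeffs // 2],
                           coeffs[-(n_coeffs // 2):]])
    # Dividing by the first harmonic's magnitude removes scale;
    # taking magnitudes discards phase and hence rotation/start point.
    return np.abs(kept) / np.abs(coeffs[1])

# Example: a circle and a scaled, shifted copy yield the same descriptor.
t = np.linspace(0, 2 * np.pi, 64, endpoint=False)
circle = np.stack([np.cos(t), np.sin(t)], axis=1)
shifted = 3.0 * circle + np.array([5.0, -2.0])
assert np.allclose(fourier_descriptor(circle), fourier_descriptor(shifted))
```

Such fixed-length descriptor vectors are a convenient input for an SVM classifier, since they condense contours of arbitrary length into 12 numbers.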
Based on the color that is enclosed by a hand contour, we
learn the parameters of an elliptical boundary model (EBM)
[4] of the skin color distribution. In all subsequent frames after
successful detection by shape, this model is used to retrieve
the hand.
This two-step scheme can properly distinguish between hands and other skin-colored objects in the scene. It comes at the cost of requiring a specific hand posture for detection, but this restriction is lifted as soon as the distribution parameters of the skin color have been learned.
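The skin-color tracking step can be illustrated with a minimal elliptical decision rule in the spirit of the EBM [4]. This is a sketch under assumptions, not the reference formulation: the chromaticity space, the fitting via sample covariance and the threshold value are illustrative choices of ours.

```python
import numpy as np

class EllipticalBoundaryModel:
    """Minimal sketch of an elliptical skin-color classifier: a pixel's
    chromaticity counts as skin if it lies inside an ellipse fitted to
    the color distribution of the detected hand region."""

    def fit(self, chroma):
        # chroma: (N, 2) chromaticity samples taken from the region
        # enclosed by the hand contour after successful shape detection.
        self.center = chroma.mean(axis=0)
        diff = chroma - self.center
        self.cov_inv = np.linalg.inv(np.cov(diff.T))
        return self

    def predict(self, chroma, threshold=4.0):
        # threshold is an illustrative value trading off precision
        # against recall of the skin mask.
        diff = chroma - self.center
        d2 = np.einsum('ni,ij,nj->n', diff, self.cov_inv, diff)
        return d2 < threshold

# Usage: fit once on hand-region pixels, then mask subsequent frames.
rng = np.random.default_rng(0)
skin = rng.normal([0.45, 0.35], 0.02, size=(500, 2))
model = EllipticalBoundaryModel().fit(skin)
mask = model.predict(skin)
```

Because the model is learned online from the detected hand, it adapts to the current user and lighting, which is what allows the posture restriction of the first step to be dropped afterwards.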
Component   Variance   Ratio    Cum. Ratio
 1          1.324      42.7%    42.7%
 2          0.466      15%      57.7%
 3          0.325      10.5%    68.2%
 4          0.259      8.3%     76.5%
 5          0.178      5.7%     82.2%
 6          0.13       4.2%     86.5%
 7          0.118      3.8%     90.3%
 8          0.09       2.9%     93.2%
 9          0.063      2%       95.2%
10          0.057      1.8%     97%
11          0.035      1.1%     98.1%
12          0.022      0.7%     98.9%
13          0.017      0.5%     99.4%
14          0.012      0.4%     99.8%
15          0.004      0.1%     99.9%
16          0.003      0.08%    100%
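Per-component figures like those in the table follow from an eigendecomposition of the covariance matrix of the joint-angle samples. A generic sketch (the data here is random and purely illustrative, not the hand-motion data used for the table):

```python
import numpy as np

def explained_variance(samples):
    """Return per-component variances, their ratios of the total, and
    the cumulative ratio, sorted in descending order as in the table."""
    centered = samples - samples.mean(axis=0)
    cov = np.cov(centered.T)
    eigvals = np.linalg.eigvalsh(cov)[::-1]   # descending eigenvalues
    ratio = eigvals / eigvals.sum()
    return eigvals, ratio, np.cumsum(ratio)

# Illustrative: 16 correlated dimensions driven by a few latent factors,
# mimicking how few principal components capture most hand motion.
rng = np.random.default_rng(1)
latent = rng.normal(size=(1000, 4))
mixing = rng.normal(size=(4, 16))
samples = latent @ mixing + 0.1 * rng.normal(size=(1000, 16))
var, ratio, cum = explained_variance(samples)
# The cumulative ratio always ends at 100%.
assert np.isclose(cum[-1], 1.0)
```

Truncating the basis after the first k components is what makes the dimensionality of the hand model a tunable parameter.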
Fig. 4. Comparison of our method (blue, magenta) with Oikonomidis et al. [2] (red, green). A single measurement indicates the angle error (equation 6) averaged over all frames in the test video. (Axes: mean error in degrees vs. number of PSO generations.)

Fig. 5. Dependency between DOFs and mean error. (Axes: mean error in degrees vs. number of DOFs after PCA.)
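For the constriction-factor PSO evaluated in this section, χ is commonly derived from φ = φc + φs (for φ > 4) via χ = 2/|2 − φ − √(φ² − 4φ)|. The following sketch checks the parameter values used here; the formula is the standard constriction relation, and identifying it with the text's equation (3) is our assumption:

```python
import math

def constriction_factor(phi_c, phi_s):
    """Standard PSO constriction factor for phi = phi_c + phi_s > 4.
    The velocity update (cf. equation 2 in the text) then scales the
    whole velocity term by this chi."""
    phi = phi_c + phi_s
    return 2.0 / abs(2.0 - phi - math.sqrt(phi * phi - 4.0 * phi))

# Parameter choice used in the experiments: phi_c = 2.8, phi_s = 1.3.
chi = constriction_factor(2.8, 1.3)
print(round(chi, 2))  # 0.73
```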
correspond to the hand location in 3D space, which was not considered for evaluation. In contrast to Oikonomidis et al. [2], we chose to stay close to the actual representation of hand poses as a vector of mostly angles. They derived locations of phalanx endpoints and used them to measure accuracy.

This evaluation did not take into consideration PSO parameters other than the number of particles and generations. In particular, the effect of the cognitive and social factors in the particles' velocity equation (2) was not analyzed. To conduct the experiments, we set the values to φc = 2.8 and φs = 1.3 [2], i. e. the constriction factor χ was 0.73 (equation 3). Most combinations of φc and φs perform well, as long as φc + φs = 4.1 holds true [14].

A. Direct Comparison

The vertical axis in Fig. 4 shows the mean absolute angle error e_a. A single measurement is the mean error over all frames in the entire video for the given PSO parameters. The original method shows a strong dependency on the number of generations. To keep the error below an average of 9°, at least 64 particles and 22 generations had to be used. For our method, on the other hand, 32 particles and 14 generations were already sufficient. In general, we observed much faster convergence after enabling the PCA and biased PSO. The curves for our method are less steep in Fig. 4. This directly translates to improved performance, because less effort was required to achieve a certain maximum error. Using 64 particles and 25 generations has been suggested before [2]. We reached the same error at 32 particles and 18 generations, which is roughly 2.8 times faster.

B. Dimensionality Reduction

The experiments above used our hand model with 22 DOFs. We applied the PCA but did not remove any dimensions afterwards. Figure 5 depicts the same experiment (32 particles, 20 generations), but with a varying number of dimensions. The lowest possible number of dimensions is 7: 6 for the global pose and just one dimension for all joint angles. The data indicate that there was no benefit in removing dimensions. Starting from the right, the mean error first stagnates and then starts rising. Thus, our method performs optimally when all dimensions are left in place. The biased PSO did not seem to be influenced negatively when insignificant dimensions were present.

C. Qualitative Results

When it comes to the visually perceived accuracy of our pose estimation, there were only minor discrepancies compared to the real hand posture. Figure 6 shows eight different postures alongside the model, articulated according to the estimation. Most errors stem from thumb estimation. Particularly in Fig. 6(d), the thumb does not point away from the hand; it is actually inside the other fingers, which is also the case in Fig. 6(f). This happened quite often, because we did not perform collision detection or model any physical constraints. Figures 6(e) and (g) show postures with severe out-of-plane rotations that still resulted in proper estimations. We have found these kinds of postures to be especially problematic, because the hand occludes large parts of itself.

VI. DISCUSSION & FUTURE WORK

In this paper we presented an improved method for the problem of full-DOF hand pose estimation, based on the method by Oikonomidis et al. [2], which has been extended to take a priori knowledge about hand motion into consideration. We achieved this by first applying a common relationship between DIP and PIP joints, followed by a change of basis to eigenvectors. This way, biases in particle swarms can be exploited, leading to much improved convergence behavior. We performed several experiments with partially synthetic data to provide evidence for this claim. Other experiments revealed that our method retains its optimal accuracy when all dimensions are kept.

We discussed our hand model from three different perspectives: shape, DOFs and constraints. Several similar works [2], [15], [16] focus almost exclusively on the shape of the model, while we put more emphasis on the DOFs of the model instead. With the terminology of Lin et al. [6], only level 1 constraints have been imposed on joint angles in the relevant literature [2], [15], [16]. In this paper, constraints of level 2 and 3 were considered; some have been modelled with closed formulas, while the majority is included through PCA. This also introduced the dimensionality of the hand model as a parameter instead of a fixed value. The PCA played a major role in the improved properties of our method.