Statistical Body Height Estimation From A Single I
Statistical Body Height Estimation From A Single I
Statistical Body Height Estimation From A Single I
net/publication/224401122
CITATIONS READS
51 643
2 authors, including:
Chiraz Benabdelkader
Ecole Nationale des Sciences de l'Informatique
27 PUBLICATIONS 1,466 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Chiraz Benabdelkader on 15 May 2014.
978-1-4244-2154-1/08/$25.00 2008 IE
accuracy better than a random guess but less than when ac- has a unique solution, which happens to be the right singular
curate/complete absolute scale information is available. vector, denoted vn , of C that corresponds to its smallest
singular value [19]. Thus, a set of approximate (non-exact)
2. Methods solutions for (2) is:
Neck
Height
Acromial
Height
base of
feet
(Section 3.1). The main challenge we face with this ap- Figure 2. Landmark localization in frontal-view image. The blue
proach lies in that, aside from facial anthropometrics, there dots are points we mark manually; green dotted line is the medial
are pretty much no anthropometrics that are at once: eas- axis of symmetry of the person, which we estimate by fitting a line
ily obtainable from images, strongly correlated with body to some of those dots (see text)
height, and whose statistics are available (see discussion at
the end of Section 2.2). Currently we use the seven facial
anthropometrics from Section 2.2 (4–10), plus the acromion
to top of head distance. Because statistics for the latter an-
thropometric are not available in any well-known anthro- tics, namely the lower and upper bounds of both each an-
pometric surveys, we have instead derived them from the thropometric and each anthropometric ratio, and the mean
statistics of body height and acromial height. and covariance of all anthropometrics. The statistics were
all obtained from the seminal anthropometric survey in [5].
2.4. Implementation Details The details are given in [2]. Table 1 shows these statistics
for some anthropometrics, with α =1e-12. It is important
Localization of the body landmarks associated with the to note, however, that these statistics are categorized by
anthropometrics of concern is achieved as follows (Fig. 2). gender, i.e. separate statistics for males and females. Con-
For frontal-view images, we start by locating these 15 body sequently, the gender of the person in the image is assumed
landmarks in the image: top of the head, forehead, sub- to be known, which is an important consideration for auto-
nasale, stomion, chin, left and right corners of the eyes, left mated implementations of the method.
and right tragions, left and right trapezius points (neck), left
and right acromions, and left and right medial longitudinal
foot arches. This is currently done semi-automatically by
having the user select points in the image via an interac- Table 1. Some anthropometric statistics (in cm), where SD: stan-
tive Matlab interface. We then estimate the person’s medial dard deviation, UB: upper bound, LB: lower bound. Based on the
1988 US Army survey [5] and with α =2.9e-7 (which under the
axis (midline of symmetry) as the line passing through the
assumption of normality means the upper and lower bounds corre-
top of the head, midpoint of the two foot landmarks, and
spond to 5 standard deviations from the mean).
(if known) the vertical vanishing point. We refine the lo-
cations of the first five landmarks (top of head, forehead,
Males Females
subnasale, stomion, chin) by projecting them onto the me- Mean SD LB UB Mean SD LB UB
dial axis. For the left/right landmark pairs of the trapezius, Body height 175.5 6.68 142.2 209.0 162.9 6.36 131.1 194.7
Head length 23.2 .88 18.8 27.6 21.76 .85 17.5 26.0
neck, acromions, and foot arches, we compute the intersec- Bitragion 14.5 .60 11.5 17.5 13.64 .52 11.0 16.2
tion of the medial axis with the line segment joining each Interocular 10.2 .54 7.50 12.9 9.62 .50 7.12 12.1
landmark pair. This way, the anthropometrics 1–8 are all
collinear with the medial axis.
Finally, a word about the required anthropometric statis-
true uncorrupted estimated
randomly generate x compute pairwise y x
head to chin distance, (5) stomion to top of head distance,
Metrology
anthropometric
vector
anthropometric
ratios
Technique (6) subnasale to top of head distance, (7) forehead to chin
distance, (8) sellion to chin distance, (9) biocular distance,
Figure 3. Methodology for (random) generation of synthetic an- i.e. between outer corners of the eyes, (10) bitragion dis-
thropometric data (namely y vectors) to be used in testing our an- tance, (11) acromion to top of head distance.
thropometric estimation method. The body height estimation error is computed as follows:
estimated value - true value. For the method
parameters, we use λ = 0.01 and α =2.9e-7 (i.e. upper and
lower bounds correspond to 5 standard deviations from the
mean). In order to investigate the effect of choice of the an-
thropometric set on estimation error, we tested the method
with various different sets. The results are given below in
Figures 5–8 in two forms: (a) the cumulative distribution
of absolute estimation error, and (b) the distribution of es-
timation error (as a boxplot). Furthermore, for comparison
purposes, we have included in each Figure the performance
of the baseline algorithm, i.e. the one that estimates the un-
known body height as the population mean (175.5cm for
males and 162.9cm for females–See Table 1).
As far as estimation from whole-body images, our
method is significantly better than the baseline algorithm for
Figure 4. Sample images from our inhouse dataset. synthetic data, but only slightly better for real data. Since
the synthetic vectors are uncorrupted by noise, the syn-
thetic estimation error actually corresponds to the method’s
3. Experiments and Results model error. This suggests that the method is quite sensi-
tive to input error in the real data, which mostly comes from
3.1. The Data landmark localization and vanishing point/line estimation.
We tested our anthropometric estimation method using Furthermore, generally speaking, the method’s performance
both synthetic data and real images. The former consists of improves when more anthropometrics are used in the esti-
randomly generated y vectors, each of which is obtained by mation. This especially obvious for synthetic data. As to
first randomly generating a vector x from the multivariate estimation from upper-body images, our method is no bet-
Gaussian distribution of male or female anthropometrics, ter than the baseline algorithm, both for synthetic and real
then computing the corresponding pairwise ratios (Fig. 3). data! This suggests that the main source of error lies in the
The real images consist of a set of high-resolution linear prediction model that we currently use.
(4368x2912) images captured in-house using a Canon 28-
200mm EOS camera (Fig. 4). Each image is a full-body 4. Conclusions and Future Work
shot of one person. The dataset contains a total 108 images,
We presented methods for estimating a person’s body
with 96 frontal-view shots. Also, the dataset comprises 27
height from a single uncalibrated image. The novelty is
different adults, 7 females and 20 males, spanning various
two-fold: (i) we do not require scale information (the so-
ethnicities: caucasian, chinese, indian, and african. The ver-
called reference length); and (ii) we handle images contain-
tical vanishing point needed to compute pairwise ratios of
ing the upper-body part only of the person. The method
anthropometrics 1–8 is computed as the intersection in the
was tested both synthetic data and real images. As far as
image of parallel vertical lines of the background scene.
whole-body images, performance is slightly better than the
baseline algorithm (that randomly guesses the population
3.2. Results
mean), but inferior to that of other well-known single-view
As discussed, we estimate body height by applying our metrology techniques such as [9]. This is expected since
novel metrology technique (Section 2.1) to estimate a set of our method uses less input information. Furthermore, since
anthropometrics. For whole-body images, this set includes quite a bit of estimation error is caused by input error in
body height (Section 2.2), while for upper-body images it the localization of landmarks and computation of vanishing
does not, and a linear model is subsequently applied to pre- points, this error can be reduced by estimating from multi-
dict body height (Section 2.3). To simplify the discussion, ple images or a video sequence. As for upper-body images,
let us assign a number to each anthropometric, as follows: our method is currently no better than the baseline algo-
(1) body height, (2) neck height, (3) acromial height, (4) rithm. However, there is room for improvement by using
1 1
Cumulative distribution of body height estimation error
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
baseline baseline
0.2 1−3 0.2 1−3
1−3,11 1−3,11
0.1 1−8 0.1 1−8
1−10 1−10
0 0
0 5 10 15 20 25 0 5 10 15 20 25 30
absolute body height error (cm) absolute body height error (cm)
(a) (a)
20 15
15 10
10 5
body height error (cm)
0 −5
−5 −10
−10 −15
−15 −20
−20 −25
−25 −30
baseline {1−3} {1−3,11} {1−8} {1−10}
baseline {1−3} {1−3,11} {1−8} {1−10}
(b) (b)
Figure 5. Synthetic data results for body height estimation from a Figure 6. Real data results for body height estimation from a
whole-body image. whole-body image.
a better prediction model than the naive linear model we [3] A. Bovyrin and K. Rodyushkin. Human height prediction
currently use. In future work, we plan to: (i) explore other and roads estimation. In IEEE Conference on Advanced
prediction models, particularly non-linear ones, for estimat- Video and Signal-Based Surveillance, 2005.
ing body height from upper-body images; (ii) extend the [4] S. Bramble, D. Compton, and L. Klasen. Forensic im-
method to estimation from multiple images; and (iii) inves- age analysis. In 13th Interpol Forensic Science Symposium,
tigate the effect of camera viewpoint (i.e. camera angle with 2001.
respect to person) on estimation accuracy. [5] J. Cheverud, C. G. Gordon, R. Walker, C. Jacquish, L. Kohn,
A. Moore, and N. Yamashita. 1988 anthropometric survey of
us army personnel. Technical Report TR-90/032, US Army
References Natick Research Labs, May 1990.
[1] C. BenAbdelkader, R. G. Cutler, and L. S. Davis. View- [6] D. Compton, C. Prance, M. Shears, and C. Champod. A
invariant estimation of height and stride for gait recognition. systematic approach to height interpretation from images. In
In Workshop on Biometric Authentication, 2002. Proc. of SPIE, volume 4232, 2001.
[2] C. BenAbdelkader and Y. Yacoob. Computational Forensics, [7] A. Criminisi, I. Reid, and A. Zisserman. Single view metrol-
volume 1, chapter Statistical Estimation of Human Anthro- ogy. In CVPR, pages 434–442, Sept. 1999.
pometry from a Single Uncalibrated Image, pages 200–220. [8] A. Criminisi, I. Reid, and A. Zisserman. Single view metrol-
Springer Verlag, 2008. ogy. IJCV, 40(2):123–148, Nov. 2000.
1 1
Cumulative distribution of body height estimation error
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
baseline baseline
0.2 4−8 0.2 4−8
9−10 9−10
0.1 4−10 0.1 4−10
4−11 4−11
0 0
0 5 10 15 20 25 0 5 10 15 20 25 30 35
absolute body height error (cm) absolute body height error (cm)
(a) (a)
20 15
10
15
5
10
body height error (cm)
baseline {4−8} {9−10} {4−10} {4−11} baseline {4−8} {9−10} {4−10} {4−11}
(b) (b)
Figure 7. Synthetic data results for body height estimation from an Figure 8. Real data results for body height estimation from an
upper-body image. upper-body image.
[9] A. Criminisi, A. Zisserman, L. Van Gool, S. Bramble, and Computer Vision. Cambridge University Press, second edi-
D. Compton. A new approach to obtain height measurements tion, 2004.
from video. In Proc. of SPIE, volume 3576, 1998. [15] A. Leykin and M. . Tuceryan. A vision system for auto-
[10] D. DeAngelis, R. Sala, A. Cantatore, P. Poppa, M. Dufour, mated customer tracking for marketing analysis: low-level
M. Grandi, and C. Cattaneo. New method for height es- feature extraction. Technical Report TR612, Indiana Univer-
timation of subjects represented in photograms taken from sity, June 2005.
video surveillance systems. International Journal of Legal [16] C. Madden and M. Piccardi. Height measurement as a
Medicine, 121(6):489–492, November 2007. session-based biometric. In Image and Vision Computing
[11] L. G. Farkas, editor. Anthropometry of the Head and Face. New Zealand, 2005.
Raven Press, second edition, 1994. [17] S. Pheasant. Bodyspace: Anthropometry, Ergonomics and
[12] F. Guo. Calibration and Metrology Using Still and Video the Design of Work. Taylor and Francis, second edition,
Images. PhD thesis, University of Maryland, College Park, 1996.
2007. [18] N. Saitoh, K. Kurosawa, and K. Kuroki. A study on height
[13] D. Hansen, B. Mortensen, P. Duizer, J. Andersen, and measurement from a single view. In ICIP, 1999.
T. Moeslund. Automatic annotation of humans in surveil- [19] G. Strang. Introduction to Linear Algebra. Wellesley-
lance video. In Fourth Canadian Conference on Computer Cambridge Press, third edition, 2003.
and Robot Vision, pages 473–480, 2007.
[14] R. Hartley and A. Zisserman. Multiple View Geometry in