Face detection from cluttered images using a polynomial neural network
Neurocomputing 51 (2003) 197–211
www.elsevier.com/locate/neucom
Abstract
Automatic detection of human faces in cluttered images is important for face recognition
and security applications. The problem is challenging due to the multitude of variations and
the confusion between face and background regions. This paper proposes a new face detection
method using a polynomial neural network (PNN). To locate the human faces in an image, the
local regions in multiscale sliding windows are classified by the PNN into two classes, namely, face
and non-face. The PNN takes as inputs the binomials of the projection of the local image onto a
feature subspace learned by principal component analysis (PCA). We investigated the influence of
performing PCA on either the face samples alone or the pooled face and non-face samples. In addition,
we integrate the distance from the feature subspace into the PNN to improve the detection performance.
In experiments on images with complex backgrounds, the proposed method produced promising
results in terms of a high detection rate and a low false positive rate.
© 2002 Elsevier Science B.V. All rights reserved.
Keywords: Face recognition; Face detection; Pattern classification; Polynomial neural network; Feature extraction
1. Introduction
Machine recognition of human faces has wide applications in security and human–
computer interfaces [3]. A complete face recognition system consists of several modules,
of which face detection is the first.
In addition, we integrate into the PNN the distance of the image pattern from the feature
subspace, which has frequently been used in eigenface-based recognition.
Experiments on face detection in images with complex backgrounds demonstrate
the efficiency of the proposed method. In terms of detection rate and false positive
rate, the achieved results are comparable to those reported in the literature, and in
comparison with a multilayer perceptron (MLP), the performance of the PNN is superior.
The proposed method is not complicated to implement and shows potential for further
performance improvement.
The rest of this paper is organized as follows. Section 2 gives an overview of the
face detection method; Section 3 describes the PNN structure and learning algorithm.
The experimental results are presented in Section 4, and finally, Section 5 provides
concluding remarks.
2. System overview
To detect faces of variable sizes and locations, the detector needs to examine the
shifted regions of the test image at multiple scales. A statistical classifier or a neural
network is used to classify the image pattern of each local region into one of two
classes: face or non-face. Alternatively, the classifier assigns the image pattern
a likelihood measure, which should be high for a face region and low for a non-face
region. The likelihood measure is useful for resolving the competition between overlapping
regions within the same scale and across different scales.
Our strategy of image rescaling and pre-processing is similar to those of [21,26]. In
brief, the test image, with faces of unknown size and location, is rescaled to multiple
sizes in the hope that, after scaling, each face becomes nearly a standard size in
one of the rescaled images. Each rescaled image is scanned exhaustively to examine
all shifted regions of standard size (in this work, 20 × 20 pixels). The local image of
each shifted region is assigned a likelihood value by the underlying classifier. A region
with a likelihood value higher than a threshold is classified as a face region. Regions
shifted slightly from the standard face region and/or with a slightly different
scale may also be assigned high likelihood values, so they are also classified as
face regions. The overlapping face regions within a rescaled image or across different
scales compete with each other, so that only the region of the highest face likelihood
is retained.
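This scan-and-compete procedure can be summarized in a short sketch. It is our illustration rather than the authors' implementation: the scan step, the use of `scipy.ndimage.zoom` for rescaling, and the greedy overlap competition are assumptions, and `classify` stands in for the trained classifier of Section 3.

```python
from scipy.ndimage import zoom   # used here as a generic image-resize routine

WIN = 20  # standard window size in pixels

def scan_multiscale(image, classify, scales, threshold=0.5, step=2):
    """Scan all shifted WIN x WIN regions of each rescaled image.

    `classify` maps a WIN x WIN gray patch to a face likelihood in
    [0, 1].  Candidate boxes are returned in original-image
    coordinates as (row, col, size, likelihood) tuples.
    """
    candidates = []
    for s in scales:
        resized = zoom(image, s)
        h, w = resized.shape
        for r in range(0, h - WIN + 1, step):
            for c in range(0, w - WIN + 1, step):
                p = classify(resized[r:r + WIN, c:c + WIN])
                if p > threshold:                # a likely face region
                    candidates.append((int(r / s), int(c / s), int(WIN / s), p))
    # overlapping candidates, within and across scales, compete:
    # only the highest-likelihood box among overlapping ones survives
    candidates.sort(key=lambda b: -b[3])
    kept = []
    for b in candidates:
        if all(not _overlap(b, k) for k in kept):
            kept.append(b)
    return kept

def _overlap(a, b):
    """True if two square boxes (row, col, size, likelihood) intersect."""
    return (a[0] < b[0] + b[2] and b[0] < a[0] + a[2] and
            a[1] < b[1] + b[2] and b[1] < a[1] + a[2])
```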
As in previous works [21,26], we aim to detect frontal faces and define a face
region as a window enclosing the organs of a human face: two eyes, one nose, and one
mouth. To reduce the effect of inhomogeneous lighting conditions, a plane optimally
fitted in the MSE (minimum square error) sense is subtracted from the gray levels of
the local image. Then the gray levels are adjusted by histogram equalization so as to
standardize the contrast. In calculating the fitting plane and the histogram, the pixels in
the four corners of the 20 × 20 window are excluded, because they mostly do not belong
to the face organs and are subject to large variation. Finally, the gray levels of the local
image, excluding the corner pixels, are arranged in a 368-dimensional vector as the
input pattern to the classifier. Fig. 1 shows examples of local image pre-processing.
The upper row is an example of a face region, and the lower row is an example of a
non-face region. The four images, from left to right, show in turn the clipped local
image, the result of lighting correction, the result of histogram equalization,
and the image pattern excluding the corner pixels.
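A minimal sketch of this pre-processing chain follows, under stated assumptions: the text fixes the count of excluded corner pixels (32, leaving 368) but, in this excerpt, not the mask shape, so `corner_mask` below uses a hypothetical triangular mask of matching size; likewise, a rank transform stands in for gray-level histogram equalization.

```python
import numpy as np

def corner_mask(n=20, widths=(4, 3, 1)):
    """Boolean mask keeping the non-corner pixels of an n x n window.

    The paper excludes 32 corner pixels (8 per corner, leaving 368);
    the exact shape is not given in this excerpt, so this triangular
    mask (4 + 3 + 1 = 8 pixels per corner) is one guess of matching size.
    """
    keep = np.ones((n, n), dtype=bool)
    for i, w in enumerate(widths):
        keep[i, :w] = False              # top-left corner
        keep[i, n - w:] = False          # top-right corner
        keep[n - 1 - i, :w] = False      # bottom-left corner
        keep[n - 1 - i, n - w:] = False  # bottom-right corner
    return keep

def preprocess(window):
    """Lighting correction, histogram equalization, corner removal.

    `window` is a 20 x 20 gray-level array; returns the 368-vector
    used as the classifier input pattern.
    """
    keep = corner_mask(window.shape[0])
    ys, xs = np.nonzero(keep)
    # 1. subtract the MSE-optimal plane z = a*x + b*y + c, fitted by
    #    linear least squares on the non-corner pixels only
    A = np.column_stack([xs, ys, np.ones(len(xs))])
    (a, b, c), *_ = np.linalg.lstsq(A, window[ys, xs], rcond=None)
    plane = (a * np.arange(window.shape[1])[None, :]
             + b * np.arange(window.shape[0])[:, None] + c)
    corrected = window - plane
    # 2. equalize: a rank transform over the non-corner pixels, a simple
    #    stand-in for gray-level histogram equalization
    vals = corrected[ys, xs]
    ranks = np.argsort(np.argsort(vals))
    return ranks / (len(vals) - 1.0)   # 368 gray levels spread over [0, 1]
```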
The underlying classifier for image pattern classification is a PNN, which has a
single output unit indicating whether an input pattern is a face or not. The output
value of the unit gives the likelihood of the input pattern being a human face.
The output unit takes as inputs the input features as well as their binomial
expansions. When the number of input features is large, the number of polynomial
terms becomes huge. This would not only increase the computational complexity, but
also deteriorate the generalization performance when training on a small sample. To
overcome this problem, we reduce the dimensionality of the input pattern using PCA:
an eigen-subspace is learned from a set of example patterns, and the projections
of an input pattern onto the principal eigenvectors are used as the input features of the
PNN. The structure and the learning algorithm of the PNN are described in detail in the
following section.
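A compact sketch of this feature-extraction step (ours; the eigendecomposition route and all names are assumptions, not the authors' code):

```python
import numpy as np

def learn_subspace(patterns, m):
    """Learn an m-dimensional eigen-subspace by PCA.

    `patterns` is an (N, 368) array of pre-processed training patterns,
    either face samples only or the pooled face and non-face samples.
    Returns the mean vector and the m leading eigenvectors (as columns).
    """
    mu = patterns.mean(axis=0)
    cov = np.cov(patterns, rowvar=False)     # 368 x 368 sample covariance
    eigval, eigvec = np.linalg.eigh(cov)     # eigenvalues in ascending order
    top = np.argsort(eigval)[::-1][:m]       # indices of the m largest
    return mu, eigvec[:, top]

def project(x, mu, phi):
    """Features z_j: projections of the centered pattern onto the axes."""
    return phi.T @ (x - mu)
```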
The architecture of the PNN combined with feature extraction exhibits great flexibility.
Hence, the PNN shows great potential to improve the detection performance if the PCA
is replaced by, or combined with, other feature extraction techniques. Some feature
extraction techniques used in face detection [12,25] can readily be combined with
the PNN. Other feature extraction and transformation techniques available for this task
include the wavelet transform, the Gabor transform, independent component analysis
[1], and various feature subset selection methods [9].
3. Polynomial neural network

The PNN can be viewed as a generalized linear classifier which uses as inputs not
only the feature measurements of the input pattern but also polynomials of these
measurements. For face detection, the PNN has a single output unit for two-class
classification. The number of polynomial terms, i.e., the number of inputs to the
output unit, increases rapidly with the number of features. Nevertheless, the size of a
second-order (binomial) network is acceptable and the classification performance is
promising. The binomial network is also closely related to the Gaussian quadratic
classifier, since both utilize the second-order statistics of the pattern space [24,28].
However, the PNN (including the binomial network) is not constrained by the Gaussian
density assumption, and its parameters are optimized by discriminative learning so as to
separate the patterns of different classes well. Compared with other neural networks,
such as the MLP [22] and the radial basis function (RBF) network [2], the PNN is faster
in learning and less susceptible to local minima because it has a single-layer structure.
The classification power of the PNN originates from the nonlinear mapping of the
polynomial expansion.
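To make "increases rapidly" concrete (our arithmetic, not a figure from the paper): a binomial network over d features requires d linear weights, d(d + 1)/2 quadratic weights, and a bias, i.e., d + d(d + 1)/2 + 1 parameters in total. For the raw 368-dimensional pattern this amounts to 368 + 67,896 + 1 = 68,265 weights, whereas for m = 80 subspace features it is only 80 + 3,240 + 1 = 3,321, which is why dimensionality reduction precedes the polynomial expansion.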
Denote the input pattern by a feature vector x = (x_1, x_2, \ldots, x_d)^T. The output of the
PNN is computed as

    y(x) = g( \sum_{i=1}^{d} w_i x_i + \sum_{i=1}^{d} \sum_{j=i}^{d} w_{ij} x_i x_j + w_0 ),    (1)

where g(\cdot) is the sigmoid function, g(a) = 1/(1 + e^{-a}). To keep the number of
polynomial terms manageable, the input pattern is first projected onto an m-dimensional
eigen-subspace:
    z_j = (x - \mu)^T \phi_j,    j = 1, 2, \ldots, m,    (2)

where z_j denotes the projection of x onto the j-th axis of the subspace, \phi_j denotes
the eigenvector of that axis, and \mu denotes the mean vector of the pattern space. The
eigenvectors are computed by PCA (the Karhunen–Loève transform) on a sample set, i.e., a
set of face samples or the pooled set of face and non-face samples. The eigenvectors
corresponding to the m largest eigenvalues are selected, such that the error of
reconstructing a pattern from the subspace is minimized.
Using the projections of the image pattern onto the subspace as the features, the output
of the PNN takes the form

    y(x) = g( \sum_{i=1}^{m} w_i z_i + \sum_{i=1}^{m} \sum_{j=i}^{m} w_{ij} z_i z_j + w_0 ).    (3)
In this form, the reconstruction error of the image pattern (the distance from the feature
subspace, DFFS) is ignored entirely. However, the DFFS is an important indicator of
the deviation of a pattern from the subspace: when the subspace is learned from
face samples, a large DFFS indicates that a pattern is dissimilar to a face. Hence,
we integrate the DFFS into the PNN in the hope of improving the detection performance:
    y(x) = g( \sum_{i=1}^{m} w_i z_i + \sum_{i=1}^{m} \sum_{j=i}^{m} w_{ij} z_i z_j + w_D D_f + w_0 ) = g( w^T \tilde{z} + w_0 ),    (4)
where w denotes the vector composed of all connecting weights and \tilde{z} is the vector
composed of all the inputs to the output unit, including the DFFS:

    D_f = \| x - \mu \|^2 - \sum_{j=1}^{m} z_j^2.    (5)
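Eqs. (2)–(5) translate directly into code. The sketch below is ours; `mu` and `phi` are the PCA mean and eigenvectors as in the earlier subspace sketch, and the layout of the weight vector is an assumption.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def pnn_inputs(x, mu, phi):
    """Build the input vector z~ of Eq. (4): the subspace projections z_i,
    the binomial terms z_i z_j (j >= i), and the DFFS D_f of Eq. (5)."""
    z = phi.T @ (x - mu)                          # Eq. (2)
    iu, ju = np.triu_indices(len(z))              # index pairs with j >= i
    d_f = np.sum((x - mu) ** 2) - np.sum(z ** 2)  # Eq. (5)
    return np.concatenate([z, z[iu] * z[ju], [d_f]])

def pnn_output(x, mu, phi, w, w0):
    """Eq. (4): face likelihood y(x) = g(w^T z~ + w_0)."""
    return sigmoid(w @ pnn_inputs(x, mu, phi) + w0)
```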
The connecting weights of the PNN are trained by supervised learning on a set of face
and non-face samples, with the aim of minimizing the empirical mean square error (MSE)
loss with weight decay:

    E = \sum_{n=1}^{N_x} [ y(x^n) - t^n ]^2 + \lambda \| w \|^2 = \sum_{n=1}^{N_x} E^n,    (6)

where t^n denotes the target output for the input pattern x^n, with value 1 for a face
pattern and 0 for a non-face pattern, and \lambda is a weight-decay coefficient, which
helps to improve the generalization performance.
The connecting weights are updated by stochastic gradient descent [20]. The example
patterns are fed into the network repeatedly to update the weights until the empirical
loss reaches a local minimum. On an input pattern z^n = z(x^n), the connecting weights
are updated by gradient descent:

    w(n + 1) = w(n) - \eta(n) \, \partial E^n / \partial w,
    w_0(n + 1) = w_0(n) - \eta(n) \, \partial E^n / \partial w_0,    (7)
where \eta(n) is a learning rate, which is kept small and decreases progressively. The
partial derivatives are computed as

    \partial E^n / \partial w = [ y(x^n) - t^n ] \, y (1 - y) \, \tilde{z} + (\lambda / N_x) \, w,
    \partial E^n / \partial w_0 = [ y(x^n) - t^n ] \, y (1 - y).    (8)
Since the PNN is a single-layer network, the training process is quite fast and the
result is not influenced by the random initialization of the weights.
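A minimal training loop implementing Eqs. (6)–(8) might look as follows. This is our sketch: the epoch count, the schedule eta = eta0/(1 + epoch), and the weight-decay value are illustrative settings, not the authors'.

```python
import numpy as np

def train_pnn(Z, t, epochs=50, eta0=0.1, lam=0.01, seed=0):
    """Stochastic gradient descent for Eqs. (6)-(8).

    `Z` is an (N, D) array of pre-computed PNN input vectors z~ (one
    row per training pattern, cf. pnn_inputs above); `t` holds the
    targets, 1 for face and 0 for non-face.
    """
    rng = np.random.default_rng(seed)
    N, D = Z.shape
    w = np.zeros(D)   # single-layer network: per the text, the result does
    w0 = 0.0          # not hinge on random initialization, so zeros suffice
    for epoch in range(epochs):
        eta = eta0 / (1.0 + epoch)   # progressively decreasing learning rate
        for n in rng.permutation(N):
            y = 1.0 / (1.0 + np.exp(-(w @ Z[n] + w0)))
            delta = (y - t[n]) * y * (1.0 - y)        # common factor, Eq. (8)
            w -= eta * (delta * Z[n] + (lam / N) * w) # Eqs. (7)-(8)
            w0 -= eta * delta
    return w, w0
```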
4. Experimental results
To train the classifier and test the face detection performance, we used two sets of
images. The first set contains 3257 images downloaded from several websites, mostly
of size 384 × 384 and with one face per image. The second set has 130 images
downloaded from the CMU website, referred to as Test Set 1 in [21]. In this paper,
we call the 3257 images type 1 images and the CMU images type 2 images.
We used 2987 type 1 images (containing 2990 faces) to extract face samples. The
face boxes were located manually, and each box was then adjusted into a square.
The local image within the face box is normalized to 20 × 20 size and undergoes
lighting correction, histogram equalization, and corner elimination to give a
368-dimensional face pattern vector. The square face box is also varied in aspect ratio
and size to generate four variations, and the mirror reflection of a face image
about the vertical axis gives another variation. In combination, a face image gives 10
variations of face patterns, and as a result, 29,900 face patterns are available for
training. The remaining 270 type 1 images were used for testing.
We collected non-face samples from three subsets of images in three steps. The
images for non-face collection comprise a subset of 228 images and a subset of 273
images from which the face samples were extracted, plus 14 scene images from the
type 2 dataset. From the first subset of 228 images, the shifted local regions at 10
scales (with scaling factor 1.21, starting from 0.1) were examined to collect the
first-step non-face samples. The local patterns were compared with the center vector of
the face samples, and the patterns whose Euclidean distance falls below a threshold are
considered confusing non-face examples. Since the number of patterns satisfying
this condition is huge, we collected only the patterns of minimum distance within each
20 × 20 range and those lying on a grid of 10 × 10 pixels. As a result, we obtained
56,007 non-face samples in the first step.
In the second step, non-face samples were collected from the subset of 273 images
(10 scales starting from 0.1) using the PNN trained on the face samples and the
first-step non-face samples. This time, all local region patterns for which the output of
the PNN was higher than 0.5 were collected. Depending on the structure of the PNN, the
number of second-step non-face samples ranges from 30,000 to 40,000. The PNN is then
retrained with the face samples and the non-face samples of the two steps, in order to
collect non-face examples from the 14 scene images (10 scales starting from 0.2). The
number of third-step non-face samples is around 10,000.
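This collect-and-retrain procedure is a form of bootstrapping. One round can be sketched as follows (ours, reusing `WIN` and `preprocess` from the earlier sketches; the threshold matches the 0.5 quoted above, while the scan step is an assumption).

```python
from scipy.ndimage import zoom  # WIN and preprocess() as defined earlier

def collect_hard_nonfaces(images, scales, classify, threshold=0.5, step=2):
    """One bootstrap round: scan face-free images and keep every window
    that the current PNN wrongly scores above the threshold.

    `classify` is the PNN trained on the previous round's samples; the
    collected patterns are added to the non-face set before retraining.
    """
    hard = []
    for image in images:
        for s in scales:
            resized = zoom(image, s)
            h, w = resized.shape
            for r in range(0, h - WIN + 1, step):
                for c in range(0, w - WIN + 1, step):
                    patch = preprocess(resized[r:r + WIN, c:c + WIN])
                    if classify(patch) > threshold:
                        hard.append(patch)  # a confusing non-face example
    return hard
```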
From the type 2 (CMU) dataset, the 14 scene images were used for non-face
collection. We further excluded 7 images with extremely big or small faces (face
sizes outside the 10 scales). The remaining 109 images, containing 487 faces, were used
as our type 2 test set. The 10 image scales start at 0.2 and end at 1.11, which implies
that we can detect faces ranging from 18 × 18 pixels to 100 × 100 pixels. For the type
1 set, the 10 scales start from 0.1.
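As a check of these figures (our arithmetic): with scaling factor 1.21, the tenth scale is 0.2 × 1.21^9 ≈ 1.11; a 20 × 20 window at scale s covers a (20/s) × (20/s) square of the original image, i.e., faces from about 20/1.11 ≈ 18 pixels up to 20/0.2 = 100 pixels.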
We first tested three PNN variants with subspace dimension m = 80, which we refer
to as PNN-A80, PNN-B80, and PNN-C80. PNN-A80 and PNN-C80 use the
subspace learned from the face samples; PNN-C80 incorporates the DFFS, whereas
PNN-A80 does not. For PNN-B80, the subspace was learned from the pooled
set of the face samples and the first-step non-face samples; this paradigm aims
to represent the pooled sample in a unified subspace. The distributions of eigenvalues
(sorted in decreasing order) of PCA from the face samples and from the pooled sample
are plotted in Fig. 2.
Fig. 2. Eigenvalues of PCA from face samples and from pooled sample.
Table 1
Detection results of three PNN structures
The detection results of the three PNN structures on the two test sets are listed
in Table 1. The false positive rate is the ratio of false positives to the total number
of examined windows. On the type 1 test set, since the images have high clarity and
the face shape variation is relatively small, all three PNN structures achieve a very
high detection rate and a low false positive rate. Some images in the type 2 test set
have very low clarity and many faces are inherently ambiguous, so the detection rate is
lower than on the type 1 test set. Comparing the three PNN structures, PNN-B80 gives
fewer false positives than PNN-A80, but its detection rate is traded off. PNN-C80
outperforms both PNN-A80 and PNN-B80 in both detection rate and false positive rate,
which confirms that incorporating the DFFS into the PNN is beneficial.
We also tested the performance of the PNN with DFFS for variable subspace
dimensionality (the PNN-C series). Table 2 gives the detection results of the PNN-C
for subspace dimensionality m = 60, 80, and 100. We can see that the performance of
PNN-C60 is evidently inferior to that of PNN-C80, while PNN-C100 trades off detection
rate to decrease the false positive rate. We will see later, from the detection/false-positive
tradeoff curves, that PNN-C80 outperforms PNN-C100.

Table 2
Detection results for variable subspace dimensionality
Some examples of face detection using PNN-C80 are shown in Figs. 3–5.
Fig. 3 shows examples from the type 1 test set, while Figs. 4 and 5 show examples
from the type 2 test set. From the results on the type 2 test set, we can see that the
proposed method is quite robust against low image quality and face shape variation.
The missed faces are incomplete, ambiguous, or excessively rotated. The false positives,
on the other hand, mostly resemble the geometric shape of human faces. In gray-scale
images, resolving such ambiguous faces relies on more contextual information, e.g.,
the other parts of the human body.
To compare the performance of the PNN with other neural networks, we experimented
with the MLP, the most popular neural network for regression and classification.
We use a four-layer MLP (two hidden layers) with the number of hidden units in the
first hidden layer equal to the subspace dimensionality of the PNN (h_1 = m), and the
number of hidden units in the second hidden layer equal to half that of the first
(h_2 = h_1/2). In this configuration, the number of parameters and the computational
cost of the MLP are approximately equal to those of the PNN: the first-hidden-layer
weights correspond to the subspace eigenvectors of the PNN, and the number of weights
in the second hidden layer and the output layer approximately equals the number of
weights of the PNN. The weights of the MLP are learned using the back-propagation (BP)
algorithm [22], which minimizes the same MSE criterion as for the PNN.
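As a rough check of this equality (our arithmetic, not figures from the paper): for m = 80, the PNN output unit has 80 linear weights, 80 · 81/2 = 3,240 binomial weights, one DFFS weight, and a bias, i.e., about 3,322 trainable parameters on top of the fixed 368 × 80 eigenvector projection. The matched MLP-80 has 80 × 40 = 3,200 second-hidden-layer weights plus 40 output weights and the biases, i.e., about 3,281 parameters on top of its 368 × 80 first-layer weights, which play the role of the eigenvectors.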
The MLP was trained and tested on the same images as the PNN. After being trained
on the 2990 face samples and the 56,007 first-step non-face samples, the MLP was used to
collect the second-step non-face samples, and the third-step non-face samples were
collected after retraining the MLP. For face detection on the test images, the MLP was
trained on the face samples and all the non-face samples. To compare the detection
performance of the different networks more fairly, we give the curves of the tradeoff
between correct detection rate and false positive rate for variable decision thresholds.
On the type 2 test images, the tradeoff curves of the MLP and the PNN are plotted in
Fig. 6, where PNN-60, PNN-80, and PNN-100 denote the PNN (with DFFS incorporated)
with m = 60, 80, and 100, respectively, and MLP-60, MLP-80, and MLP-100 denote the
MLP with 60, 80, and 100 hidden units in the first hidden layer, respectively. From the
results, it is evident that the performance of the PNN is superior to that of the MLP.
Among the PNNs, PNN-80 performs best, and among the MLPs, MLP-80 performs best.
In comparison with previous methods, we would particularly like to mention the
results of [21] and [6], which were obtained on the same CMU dataset as ours. As
explained above, this dataset contains many ambiguous faces, so it is difficult to obtain
both a high detection rate and a low false positive rate. Template matching and
model-based methods usually report their results on high-quality images only (e.g.,
[15,16]). The methods of [6,21] achieved very low false positive rates at high detection
rates: at a detection rate of 86%, their false positive rates are 2.8 × 10^{-7} and
9.7 × 10^{-8}, respectively. However, their results were achieved by combining multiple
neural networks, which significantly reduces false positives. We use a single neural
network, and the detection performance (84.6% detection rate at a 3.51 × 10^{-6} false
positive rate) is fairly good.
5. Conclusion
We have proposed a new face detection method using a PNN. The PNN functions as a
classifier to evaluate the face likelihood of the image patterns of multiscale shifted
local regions. PCA is used to reduce the dimensionality of the image patterns and
extract features for the PNN. Using a single network, we have achieved a fairly high
detection rate and a low false positive rate on images with complex backgrounds. Besides
linear PCA, the PNN is flexible enough to combine with other feature extraction
techniques (e.g., wavelet analysis, ICA, nonlinear PCA, feature subset selection) and
shows potential to further improve the detection performance. For applications involving
high-quality images, the current version of the PNN already suffices in terms of
performance.
Acknowledgements
The authors would like to thank the editors and the anonymous reviewers for their
invaluable comments.
References
[23] H. Schneiderman, T. Kanade, Probabilistic modeling of local appearance and spatial relationships for object recognition, in: Proceedings of the CVPR, 1998, pp. 45–51.
[24] J. Schürmann, Pattern Classification: A Unified View of Statistical Pattern Recognition and Neural Networks, Wiley Interscience, New York, 1996.
[25] Q. Song, J. Robinson, A feature space for face image processing, in: Proceedings of the 15th ICPR, 2000, pp. 97–100.
[26] K.-K. Sung, T. Poggio, Example-based learning for view-based human face detection, IEEE Trans. Pattern Anal. Mach. Intell. 20 (1) (1998) 39–50.
[27] G. Yang, T.S. Huang, Human face detection in a complex background, Pattern Recognition 27 (1) (1994) 53–61.
[28] H.-C. Yau, M.T. Manry, Iterative improvement of a Gaussian classifier, Neural Networks 3 (1990) 437–443.
[29] K.C. Yow, R. Cipolla, Feature-based human face detection, Image Vision Comput. 15 (9) (1997) 713–735.
Lin-Lin Huang was born in April 1968. She received the B.S. degree from Wuhan
University and the M.E. degree from Beijing Polytechnic University, China, in 1989 and
1994, respectively. Currently she is a Ph.D. student at Tokyo University of Agriculture
and Technology. From 1994 to 1998, she worked as a lecturer at Northern Jiaotong
University, Beijing, China. Her research interests include pattern recognition, image
processing, and computer vision.
Akinobu Shimizu was born in October 1965. He received his B.E. and Ph.D. degrees
from the Graduate School of Engineering, Nagoya University, in 1989 and 1994,
respectively. He became a research associate at Nagoya University in 1994, and has
been an associate professor in the Graduate School of Bio-Applications and Systems
Engineering, Tokyo University of Agriculture and Technology, since 1998. His research
interests include image processing and analysis. He is a member of the Japanese
Society of Medical Imaging Technology, the Japanese Society for Medical and Biological
Engineering, the Japan Society of Computer Aided Diagnosis of Medical Images, and
the IEEE.
Hidefumi Kobatake was born in November 1943. He received the B.E., M.E., and
Ph.D. degrees from The University of Tokyo, Tokyo, Japan, in 1967, 1969, and 1972,
respectively. He is now a Professor at the Graduate School of Bio-Applications and
Systems Engineering, Tokyo University of Agriculture and Technology, Tokyo, Japan.
His research activities are in the areas of speech processing, image processing, and
applications of digital signal processing. He has received several awards, including a
1987 Society of Instrument and Control Engineers Best Monograph Award and a 1998
Three-Dimensional Image Conference Best Paper Award. He is a member of the IEEE,
the Society of Instrument and Control Engineers, the Acoustical Society of Japan, etc.