An Optical Flow-Based Approach To Robust Face Recognition Under Expression Variations

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 1, JANUARY 2010, p. 233
Chao-Kuei Hsieh, Shang-Hong Lai, Member, IEEE, and Yung-Chang Chen, Fellow, IEEE
Abstract—Face recognition is one of the most intensively studied topics in computer vision and pattern recognition, but few studies focus on how to robustly recognize faces with expressions under the restriction of a single training sample per class. A constrained optical flow algorithm, which combines the advantages of the unambiguous correspondence of feature point labeling and the flexible representation of optical flow computation, has been developed for face recognition from expressional face images. In this paper, we propose an integrated face recognition system that is robust against facial expressions by combining information from the computed intraperson optical flow and the synthesized face image in a probabilistic framework. Our experimental results show that the proposed system improves the accuracy of face recognition from expressional face images.

Index Terms—Constrained optical flow, face recognition.

Manuscript received January 26, 2009; revised July 23, 2009. First published August 28, 2009; current version published December 16, 2009. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Margaret Cheney.
C.-K. Hsieh and Y.-C. Chen are with the Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan, R.O.C. (e-mail: [email protected]; [email protected]).
S.-H. Lai is with the Department of Computer Science, National Tsing Hua University, Hsinchu 30013, Taiwan, R.O.C. (e-mail: [email protected]).
Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TIP.2009.2031233
1057-7149/$26.00 © 2009 IEEE

I. INTRODUCTION

FACE recognition has been studied for several decades. Comprehensive reviews of the related works can be found in [14], [21]. Even though 2-D face recognition methods have been actively studied in the past, there are still some inherent problems to be resolved for practical applications. It was shown that the recognition rate can drop dramatically when the head pose and illumination variations are too large, or when the face images involve expression variations. Pose, illumination, and expression variations are three essential issues to be dealt with in the research of face recognition. To date, little research effort has been devoted to overcoming the expression variation problem in face recognition, though a number of algorithms have been proposed to overcome the pose and illumination variation problems.

To improve the face recognition accuracy, researchers have applied different dimension reduction techniques, including principal component analysis (PCA) [17], linear discriminant analysis (LDA) [10], independent component analysis (ICA) [1], discriminant common vector (DCV) [2], kernel-PCA, kernel-LDA [19], kernel-DCV [7], etc. In addition, several learning techniques, such as SVM, have been used to train the classifiers for face recognition. Although applying an appropriate dimension reduction algorithm or a robust classification technique may yield more accurate recognition results, these methods usually require multiple training images for each subject. However, multiple training images per subject may not be available in practice.

Some authors have proposed different approaches to deal with facial expression variations in face recognition. These algorithms can be roughly divided into two main categories: morphable model based and optical flow based. The basic idea in the former category is to warp images to global face geometries similar to the ones used for training. The concept of separately modeling texture and geometry information has been applied in the active shape model and the active appearance model (ASM/AAM) [3], [4]. Face geometry is defined via a set of feature points in ASM, while face texture can be warped to the mean shape in AAM. Ramachandran et al. [15] presented preprocessing steps to convert a smiling face to a neutral face. Li et al. [9] applied a face mask for face geometry normalization and further calculated the eigenspaces for geometry and texture separately, but not all images can be well warped to a neutral image because of the lack of texture in certain regions, like closed eyes. Moreover, linear warping was usually applied, which is not consistent with the nonlinear characteristics of facial expression movements.

The other category is to use optical flow to compute the face warping transformation. Optical flow has been used in the task of expression recognition [5], [18]. However, it is difficult to learn the local motion in the feature space to determine the expression change for each face, since different persons have expressions in different motion styles. Martinez [12] proposed a weighting method that independently weighs the local areas which are less sensitive to expressional changes. The intensity variations due to expression may mislead the calculation of optical flow. A precise motion estimation method was proposed in [11], which can be further applied for expression recognition. However, the proposed motion estimation did not consider intensity changes due to different expressions.

In this paper, we focus on the problem of face recognition from a single 2-D face image with facial expression. Note that this paper is not about facial expression recognition. For many practical face recognition problem settings, like using a passport photo for face identification at customs security or identifying a person from a photo on an ID card, it is infeasible to gather multiple training images for each subject, especially with different expressions. Therefore, our goal is to solve the expressive face recognition problem under the condition that the training database contains only neutral face images with one neutral face image per subject. In our previous work [8], we combined the
advantages of the above two approaches: the unambiguous correspondence of feature point labeling and the flexible representation of optical flow computation. A constrained optical flow algorithm was proposed, which can deal with position movements and intensity changes at the same time when handling the corresponding feature points. With our proposed constrained optical flow algorithm, we can calculate the expressional motions from each neutral face in the database to the input test image, and estimate the likelihood of such a facial expression movement. Using the optical flow information, neutral images in the database can be further warped to faces with the exact expression of the input image. In this paper, we propose to exploit the two different types of information, i.e., the computed optical flow and the synthesized image, to improve the accuracy of face recognition. Experimental validation on the Binghamton University 3-D Face Expression (BU-3DFE) Database [20] is given to show the improved performance of the proposed face recognition system. Since we do not attempt to solve the automatic facial landmark localization problem in this paper, the facial landmark points in our experiment are labeled manually.

The remainder of this paper is organized as follows. We briefly review the constrained optical flow computational technique in Section II. The proposed expression-invariant face recognition system is presented in Section III. Section IV gives the experimental results obtained by applying the proposed expression-invariant face recognition algorithm. Finally, we conclude the paper in the last section.

II. CONSTRAINED OPTICAL FLOW COMPUTATION

The computational algorithms of traditional optical flow cannot guarantee that the computed optical flow corresponds to the exact pixels in different images, since the intensity variations due to expression may mislead the computation of optical flow. The brightness constancy constraint, however, is not valid in many circumstances. Therefore, with the generalized dynamic image model (GDIM) proposed by Negahdaripour and Yu [13], Teng et al. [16] generalized the optical flow constraint to

(1)

where and denote the multiplier and offset factors of the scene brightness variation field, is the image intensity function, the subscripts , , and denote the spatiotemporal partial derivatives, is a point in the spatiotemporal domain, and is the motion vector at the point . They proposed to minimize the following discrete energy function, which is defined with an adaptive smoothness adjustment scheme to constrain both flow components ( and ) and all the brightness variation multiplier and offset factors ( and )

(2)

where the subscript denotes the location, is the concatenation vector of all the flow components and and all the brightness variation multiplier and offset factors, and are the parameters controlling the degree of smoothness in the motion and brightness fields, is the set of all the discretized locations in the image domain, is the weight for the data constraint, and , , , , , , , and are the weights for the corresponding components of the smoothness constraint along the x- and y-directions. In our application, we regard a neutral face image and an expressional face image as two adjacent time instances in the above formulation.

The above quadratic energy function can be rewritten in a matrix-vector form given by . By setting the first-order derivative to zero, minimizing this quadratic and convex function is equivalent to solving a large linear system , which can be efficiently solved by the incomplete Cholesky preconditioned conjugate gradient (ICPCG) algorithm [16].

However, the motion vectors of the facial feature points, which were used only as references for interpolating computation in ICPCG, cannot be guaranteed to be consistent in the final converged optical flow. In order to guarantee that the computed optical flow is consistent with the motion vectors at these corresponding feature points, we modify the unconstrained optimization problem in the original formulation of the optical flow estimation to a constrained optimization problem [8] given as follows:

(3)

where is the set of feature points and is the specified optical flow vector at the feature point. A modified ICPCG procedure was applied to solve this constrained optimization problem; the details can be found in [8]. By solving this optimization problem with the displacements of the feature points imposed as hard constraints, we can compute the motion vectors at all pixels in the entire image for different expressions or subjects.

III. PROPOSED FACE RECOGNITION SYSTEM

A. Basic Concept

We treat the expression-invariant face recognition system as a probabilistic maximum a posteriori (MAP) classification problem. To do this, we formulate the problem as follows:

(4)

where is the input image, is the neutral face image for the th subject in the training data set, and denotes the expression motion field between and . The optical flow field is not specifically defined yet; it will be discussed later. The direction of could be either from to or the opposite way. Based on the Bayes theorem and the assumption of independence between and , (4) can be rewritten as

(5)
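Because the equations themselves did not survive in this copy, the following Python sketch illustrates only the general shape of a MAP decision of this kind, not the authors' implementation. It assumes that, for every candidate subject, a log-likelihood of the input image and a log-prior of the estimated motion field have already been computed by some other means. The function name `map_classify` and all argument names are hypothetical.

```python
import math


def map_classify(log_image_likelihood, log_flow_prior, log_subject_prior=None):
    """Pick the candidate with the highest log-posterior score.

    All names are illustrative assumptions, not notation from the paper:
      log_image_likelihood[i]: log-probability of the input image given
                               candidate i and its estimated motion field
      log_flow_prior[i]:       log-probability of that motion field
      log_subject_prior[i]:    log-prior of candidate i (uniform if omitted)
    """
    n = len(log_image_likelihood)
    if n == 0 or len(log_flow_prior) != n:
        raise ValueError("need one likelihood and one flow prior per candidate")
    if log_subject_prior is None:
        # A uniform prior adds the same constant to every candidate.
        log_subject_prior = [-math.log(n)] * n
    elif len(log_subject_prior) != n:
        raise ValueError("need one subject prior per candidate")

    # A product of independent probability terms becomes a sum of logs.
    scores = [
        lik + flow + subj
        for lik, flow, subj in zip(
            log_image_likelihood, log_flow_prior, log_subject_prior
        )
    ]
    best = max(range(n), key=lambda i: scores[i])
    return best, scores
```

Working in the log domain avoids numerical underflow when many per-pixel probabilities are multiplied. With a uniform subject prior, the third term shifts all scores equally and does not change which candidate is selected.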
Furthermore, the prior probability for each candidate is assumed equally probable, i.e., is constant for all . The formulation can be simplified as

(6)

There are two parts in (6), i.e., the prior probability of the expression motion , and the conditional probability of the input image given the subject with the expression .

B. Prior Probability of the Expression Motion

We use the proposed constrained optical flow estimation algorithm [8] to calculate the deformation between the input image and the subject , which is defined as a directional operator and depicted in Fig. 1. Note that operator is to apply the proposed constrained optical flow estimation algorithm to estimate the pixel motion vectors from one image to a reference image .

Fig. 1. Symbolizations of OF ( ) operators.

A training procedure is necessary to further calculate the probability of the expression movement . It is necessary that all the optical flow fields used in both training and testing procedures are in the same coordinate system. The traditional expressive optical flow is computed from a neutral face image of person to an expression image with expression of the same subject. However, the computed optical flows are generally not in the same coordinate system, since the geometries of neutral faces differ from each other. Some research only considered motion vectors at certain feature points to overcome this problem, but only limited information about facial movement is used in this case.

We propose a different solution for optical flow normalization, as shown in Fig. 2. Instead of computing the intraperson optical flow directly from the neutral face to an expressive face image for each person, we start from a global neutral face to obtain the interperson optical flow and the overall optical flow . The intraperson optical flow can then be computed by pixel-wise subtraction as follows:

(7)

Fig. 2. Illustration of decomposing input optical flows (OF ) to interperson (OF ) and intraperson (OF ) parts.

The intraperson expression motion fields of the subjects in the training dataset, which are exclusive from the testing data, are collected in the training procedure.

There are two advantages of the above optical flow normalization scheme: (1) all expressive face images of all subjects have the same dimension of motion fields, and (2) all optical flows are computed and represented with the same geometry of .

To further define with preservation of identical geometry and dimensionality for each , we use the same strategy in the testing procedure, i.e., the intraperson optical flow. The motion information in is defined as

(8)

where is the overall optical flow from global neutral face to input image , is the interperson optical flow from to the guessed neutral face , and is the intraperson optical flow from to . Moreover, the symbol “ ” in the subscript denotes the optical flow represented with the geometry of , even though the intraperson optical flow is defined as the pixel-wise motion from to .

We consider the prior probability of expression motion, i.e., intraperson optical flow, as a mixture of Gaussian probability distributions with centers corresponding to the samples in the training optical flow dataset, that is

(9)

where is the intraperson optical flow , is the intraperson optical flow for the th subject in the training data, is the total number of training samples, is the diagonal covariance matrix determined from all training optical flows, is the determinant of , is the dimension of optical flow, and is the weight associated with each training sample.

C. Conditional Probability of the Expression Motion

With the facial motion computed from the proposed constrained optical flow algorithm, we can synthesize a face image. The function denotes the synthesis operator that warps the source image to a new one through the motion vector , as depicted in Fig. 3(a). Note that the motion vector should be in the same coordinate system as the source image . Although the motion vectors and intensity variation coefficients are obtained in our optical flow estimation, this operator only involves geometric warping determined by the optical flow, and the brightness variations are not used in the face synthesis. We define the operation , as an operator, and symbolize it as depicted in Fig. 3(b). Under such
procedure, the source image can be transferred to the expression and geometry of any target image , and the variation between images due to expression can be reduced.

Fig. 3. Symbolizations of (a) Syn and (b) OF-Syn operators.

The conditional probability is assumed to be proportional to the similarity between input image (used as the target image) and the synthesized image, , from neutral face and the computed optical flow field.

Since the optical flow used for synthesizing neutral face to a certain expression must be represented with the same geometry of , the intraperson optical flow in the previous section is not appropriate in this circumstance. An estimated intraperson optical flow under the geometry of is needed, i.e.,

(10)

Considering each pixel as an independent and normally distributed random variable, the conditional probability can be further defined as follows:

(11)

where is the synthesized image , is the index of pixel, is the total number of valid pixels, and is the standard deviation of the image intensities at the th pixel.

Fig. 4 shows the mask definition used for specifying the valid pixels in the face images. We first defined the standard mask image [Fig. 4(b)] from the global neutral face image [Fig. 4(a)]. When there is an input image with expressions [Fig. 4(c)], the mask is then warped according to the three feature points shown in Fig. 4(a). Moreover, the region within the mouth is excluded in the region of interest [as shown in Fig. 4(d)], since it cannot be synthesized due to the lack of texture in the corresponding region of the neutral image.

Fig. 4. Illustration of mask definition and warping: (a) reference image and feature points, (b) initial mask, (c) input image, and (d) warped and sheared mask for the input image.

D. Overall System Flowchart

The overall flowchart of the proposed expression-invariant face recognition system is depicted in Fig. 5. For each candidate in the dataset, two motion fields need to be computed: one is the intraperson optical flow represented with the geometry of global neutral face , i.e., defined in (8), and the other is the intraperson optical flow at the geometry of , which is defined in (10). The former is used for the prior probability of the estimated motion vector, while the latter is used for image synthesis as well as the computation of the conditional probability given in (11).

Fig. 5. Overall flowchart of the proposed system.

IV. EXPERIMENTAL RESULTS

Our experiments were performed on the Binghamton University 3-D Face Expression (BU-3DFE) Database [20]. The BU-3DFE database contains face images and 3-D face models of 100 subjects (56 females and 44 males), each with a neutral face and six different expressions (angry, disgust, fear, happy, sad, and surprised) at different levels, from level 1 (weakest) to 4 (strongest). Note that only the 2-D face images were used in our experiments. All images are normalized according to the procedure described later in Section IV-A and resized to 200 × 200 pixels. Fig. 6 shows the 25 normalized face images of one subject after the normalization procedure.

A. Preprocessing

We manually labeled 21 feature points, including three points for each eyebrow and four points for each eye, one at the nose
tip and the other six around the mouth region. With the labeled points, the distance between the outer corners of both eyes is used as the reference to normalize face images (0.5, 0.5, 0.5, and 1.5 times to left, right, top, and bottom, respectively), which is depicted in Fig. 7.

Fig. 6. Sample images in BU-3DFEDB. The top-left image is the neutral face. The others are the face images with angry, disgust, fear, happy, sad, and surprise expressions in columns from left to right with increasing levels in rows from top to bottom.

Fig. 7. (a) Face region selection. The 21 feature points on (b) a neutral face image and (c) a surprised face image.

B. Benchmark Test

Since our goal is to solve the expressive face recognition problem under the restriction of a single training sample per class, the training database in the benchmark contains only neutral face images with one neutral face image per subject, i.e., 100 training face images for 100 classes in total. There are 24 expression variant images for each subject, and we used a total of 2400 images from all 100 subjects for testing.

In the training phase for the original data, we first use PCA to compute the low-dimensional vectors for the 100 neutral images of all subjects. The PCA subspace is computed from all the neutral images in the training dataset by preserving 95% of the energy, which yields 51 eigenvectors. In the testing phase, the input image is projected to the PCA subspace and classified by the nearest neighbor classifier. The average recognition rate is 56.88%, used as a benchmark, and the recognition results for all different expressions and levels are shown in Fig. 8(a). We can see that the surprise and disgust expressions give the worst results, and the result indicates that higher facial expression levels lead to worse recognition rates. This is consistent with the statement in [6] that the correlation between an image and another image is directly related to the Euclidean distance in the original space, i.e., , if all images are normalized to zero mean and unit variance. Fig. 8(b) shows the result of direct subtraction without PCA preprocessing, which is slightly better than Fig. 8(a). We also implemented Martinez’s method [12] with the optical flow obtained by our proposed method. In the experiment, we adopt the weighting as , where is the magnitude of optical flow at the th pixel and . The average recognition rate is 67.36%, and the recognition results are shown in Fig. 9. Even though the time consumption is much longer, the recognition rate is only slightly improved.

Fig. 8. (a) Recognition rates from the original images after PCA reduction under different expressions and levels (with average recognition rate 56.88%). (b) Recognition rates from the original images by direct subtraction under different expressions and levels (with average recognition rate 60.71%).

Fig. 9. Recognition rates from weighted optical flow result proposed by Martinez [12] (with average recognition rate 67.36%).

C. Face Recognition With the Proposed System

In our proposed system, an extra training dataset is needed for constructing the prior distribution for the expression motion as described in Section III-B. From the BU-3DFE database, 34 subjects are randomly selected for intraperson optical flow training, and the remaining 66 subjects are used as the testing set. In (9), the dimensions of intraperson optical flows are reduced by using PCA with 99% energy preservation and all the Gaussians are equally weighted.

As described in the previous section, we follow the flowchart shown in Fig. 5. Some experimental images are shown in Fig. 10. We apply a mask, shown in Fig. 10(b) and defined from the global neutral face [Fig. 10(a)], to extract the region of interest. Moreover, the region inside the mouth is discarded, as illustrated in Fig. 10(e). Both the optical flow and the grayscales of the synthesized image within the mask are used in the face recognition process. For an input image, as depicted in Fig. 10(d), we first position the corresponding mask [Fig. 10(e)] to obtain the masked image [Fig. 10(f)]. After that,
for each candidate in the database [Fig. 10(g) and 10(j)], the intraperson optical flow, i.e., , is computed and used for virtual image synthesis [Fig. 10(h) and 10(k)]. The masked images [Fig. 10(i) and 10(l)] can finally be applied for similarity comparison.

Fig. 10. Illustration of experimental images: (a) global neutral face, (b) mask image, (c) masked image of (a), (d) input image, (e) warped mask image, (f) masked input image, (g) guessed subject 1, (h) synthesized face from (g) to (d), (i) masked synthesized image (h) using mask image (e), (j) guessed subject 2, (k) synthesized face from (j) to (d), and (l) masked synthesized image (k) using mask image (e).

The face recognition results by using the proposed optical flow approach are shown in Figs. 11–13. Fig. 11 gives the face recognition result by using the synthesized face image only, i.e., the conditional probability in (11). The recognition result by using the intraperson optical flow, i.e., the prior probability of expression motion in (9), is summarized in Fig. 12. The proposed face recognition system is based on the posterior probability given in (6), which integrates the synthesized face image and the intraperson optical flow information. Its recognition result is shown in Fig. 13. From the results, the average face recognition rates based on the synthesized images or the dimension-reduced intraperson optical flows individually are 85.86% and 82.39%, respectively. The average recognition rate of the proposed integrated system is significantly improved to 94.44% in this experiment. If the intraperson optical flows are used directly without computing PCA, the recognition rate of using optical flow information only is 65.40% (Fig. 14), and the rate of our proposed integrated system is 91.41% (Fig. 15). The experiments are implemented on a PC with a 1.86 GHz CPU. The time consumption of an OF-Syn and an OF operator is about 2.01 and 1.43 s, respectively, which would be the critical part in our system.

Fig. 11. Recognition rates using the synthesized face images only (with average recognition rate 85.86%).

Fig. 12. Recognition rates using dimension-reduced intraperson optical flow only (with average recognition rate 82.39%).

Fig. 13. Recognition rates using the integrated information, including the synthesized images and dimension-reduced intraperson optical flow (with average recognition rate 94.44%).

Fig. 14. Recognition rates using original intraperson optical flow only (with average recognition rate 65.40%).

Fig. 15. Recognition rates using the integrated information, including the synthesized images and original intraperson optical flow (with average recognition rate 91.41%).

The performance of the proposed face recognition system under different sizes of intraperson optical flow probability training is examined. The experimental results are summarized in Fig. 16. The recognition results of using only motion information and using the integrated information are improved as
the training data size is increased, while the recognition rates of using only synthesized images are independent of the training data size.

Fig. 16. Recognition rates with different numbers of training data.

The performance of the proposed face recognition system with disturbances of feature point detection for image normalization is examined. To simulate the inaccuracies in feature point extraction during image normalization, we introduce an error in the outward direction for the locations of the outer corners of the eyes. This will generate faces in a smaller size compared to the original normalized images (Fig. 17). With small facial feature misalignment, the recognition rate is slightly reduced as depicted in Fig. 18.

Fig. 17. (a) Original normalized image using feature points without noise; (b) normalized image using feature points with small noises; (c) size difference between normalized images with and without noises.

Fig. 18. Recognition rates using feature points and normalized face images with noise and the integrated information (with average recognition rate 93.69%).

V. CONCLUSION

In this paper, we proposed a 2-D expression-invariant face recognition system based on integrating the optical flow information and image synthesis. Only one neutral image for each candidate subject is needed in our face recognition system. Two kinds of intraperson optical flow fields, and , were computed and used for expression motion likelihood calculation and expressive image synthesis, respectively. The proposed algorithm combines the face image comparison and optical flow prior information in a probabilistic MAP framework. As shown from the experimental results, the proposed face recognition system significantly improves the accuracy of face recognition on expressional face images. However, the proposed integrated system is more computationally costly compared to the previous works, since the optical flow computation, image synthesis, and the probability calculations are needed for all candidates in the database. It takes only one OF-Syn operator in our previous work [8] and no image synthesis is needed in the weighted optical flow algorithm [12]. The comparison of computational complexities is summarized in Table I. In the future, we will aim to reduce the computational complexity of the proposed face recognition approach.

TABLE I
COMPUTATIONAL COMPLEXITIES FOR A TESTING PROCEDURE. NOTE THAT C IS THE TOTAL NUMBER OF CANDIDATES

REFERENCES
[1] M. S. Bartlett, J. R. Movellan, and T. J. Sejnowski, “Face recognition by independent component analysis,” IEEE Trans. Neural Netw., vol. 13, no. 6, pp. 1450–1464, Nov. 2002.
[2] H. Cevikalp, M. Neamtu, M. Wilkes, and A. Barkana, “Discriminative common vectors for face recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 1, pp. 4–13, Jan. 2005.
[3] T. Cootes, C. Taylor, D. Cooper, and J. Graham, “Active shape models—Their training and application,” Comput. Vis. Image Understand., vol. 61, pp. 18–23, 1995.
[4] T. Cootes, G. J. Edwards, and C. Taylor, “Active appearance models,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 23, pp. 681–685, 2001.
[5] I. A. Essa and A. Pentland, “A vision system for observing and extracting facial action parameters,” in Proc. IEEE Conf. Computer Vision Pattern Recognition, 1994, pp. 76–83.
[6] K. Fukunaga, Introduction to Statistical Pattern Recognition, 2nd ed. New York: Academic, 1990.
[7] Y. H. He, L. Zhao, and C. R. Zou, “Kernel discriminative common vectors for face recognition,” in Proc. Int. Conf. Machine Learning and Cybernetics, Guangzhou, China, Aug. 18–21, 2005, pp. 4605–4610.
[8] C.-K. Hsieh, S.-H. Lai, and Y.-C. Chen, “Expression-invariant face recognition with accurate optical flow,” in Proc. PCM, Hong Kong, Dec. 11–14, 2007.
[9] X. Li, G. Mori, and H. Zhang, “Expression-invariant face recognition with expression classification,” presented at the 3rd Canadian Conf. Computer and Robot Vision, Jun. 2006.
[10] A. M. Martinez and A. C. Kak, “PCA versus LDA,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 23, no. 2, pp. 228–233, Feb. 2001.
[11] A. M. Martinez, “Matching expression variant faces,” Vis. Res., vol. 43, pp. 1047–1060, 2003.
[12] A. M. Martinez, “Recognizing expression variant faces from a single sample image per class,” presented at the IEEE Conf. Computer Vision and Pattern Recognition, Jun. 2003.
[13] S. Negahdaripour, “Revised definition of optical flow: Integration of radiometric and geometric cues for dynamic scene analysis,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, no. 9, pp. 961–979, Sep. 1998.
[14] A. Rama and F. Tarres, “Partial LDA VS partial PCA,” presented at the Int. Conf. Multimedia and Expo., Ontario, Canada, Jul. 2006.
[15] M. Ramachandran, S. K. Zhou, D. Jhalani, and R. Chellappa, “A method for converting a smiling face to a neutral face with applications to face recognition,” in Proc. ICASSP, Mar. 2005, pp. 18–23.
[16] C.-H. Teng, S.-H. Lai, Y.-S. Chen, and W.-H. Hsu, “Accurate optical flow computation under non-uniform brightness variations,” Comput. Vis. Image Understand., vol. 97, pp. 315–346, 2005.
[17] M. A. Turk and A. P. Pentland, “Face recognition using Eigenfaces,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, Maui, HI, Jun. 1991, pp. 586–591.
[18] Y. Yacoob and L. S. Davis, “Recognizing human facial expressions from long image sequences using optical flow,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 18, no. 6, pp. 636–642, Jun. 1996.
[19] M.-H. Yang, “Kernel Eigenfaces vs. Kernel Fisherfaces: Face recognition using kernel methods,” in Proc. Int. Conf. Automatic Face and Gesture Recognition, Washington, DC, May 2002, pp. 215–220.
[20] L. Yin, X. Wei, Y. Sun, J. Wang, and M. J. Rosato, “A 3D facial expression database for facial behavior research,” in Proc. Int. Conf. Automatic Face and Gesture Recognition, Apr. 2006, pp. 211–216.
[21] W. Zhao, R. Chellappa, P. J. Phillips, and A. Rosenfeld, “Face recognition: A literature survey,” ACM Comput. Surv., vol. 35, no. 4, pp. 399–458, Dec. 2003.

Shang-Hong Lai (M’95) received the B.S. and M.S. degrees in electrical engineering from National Tsing Hua University, Hsinchu, Taiwan, R.O.C., and the Ph.D. degree in electrical and computer engineering from University of Florida, Gainesville, in 1986, 1988, and 1995, respectively.
He joined Siemens Corporate Research in Princeton, NJ, as a member of technical staff in 1995. Since 1999, he has been a faculty member in the Department of Computer Science, National Tsing Hua University. He is currently a Professor in the same department. In 2004, he was a visiting scholar with Princeton University. His research interests include computer vision, visual computing, pattern recognition, medical imaging, and multimedia signal processing. He has authored more than 130 papers published in the related international journals and conferences. He holds ten U.S. patents for inventions related to computer vision and medical image analysis.
Dr. Lai has been a member of program committee of several international conferences, including CVPR, ICCV, ECCV, ACCV, ICPR, and ICME.

Yung-Chang Chen (M’85–SM’90–F’05) received the B.S. and M.S. degrees in electrical engineering from the National Taiwan University, Taipei, Taiwan, R.O.C., in 1968 and 1970, respectively, and the Ph.D. (Dr.-Ing.) degree from the Technische Universität Berlin, Berlin, Germany, in 1978.
In 1978, he joined the Department of Elec-
trical Engineering, National Tsing Hua University,
Hsinchu, Taiwan. From 1980 to 1983, he was
Chair of the Department of Electrical Engineering,
National Central University, Chungli, Taiwan. From
Chao-Kuei Hsieh received the B.S. degree in elec- 1992 to 1994, he was Chair of the Department of Electrical Engineering,
trical engineering from the National Tsing Hua Uni- National Tsing Hua University. From 2002 to 2004, he was Dean of the College
versity, Hsinchu, Taiwan, R.O.C., in 2001. He is cur- of Engineering and a Professor with the Department of Computer Science and
rently pursuing the Ph.D. degree in the Department Information Engineering, National Chung Cheng University, Chiayi, Taiwan.
of Electrical Engineering, National Tsing Hua Uni- He is now a Professor with the Department of Electrical Engineering, National
versity, Hsinchu, Taiwan. Hsing Hua University. His current research interests include multimedia signal
His research interests include multimedia signal processing, digital video processing, medical imaging, computer vision, and
processing and pattern recognition. pattern recognition.
Dr. Chen serves as chair of the R.O.C. Image Processing and Pattern Recog-
nition Society.
