Gabor Wave Let Transform Based Facial Expression
Gabor Wave Let Transform Based Facial Expression
2265
2014 IEEE 22nd Signal Processing and Communications Applications Conference (SIU 2014)
where (x,y) denote the pixel position in the spatial domain, computed over a region is used as a texture descriptor. Fig.
ϖ is the central frequency of a sinusoidal plane wave, θ is 2 gives an example of the basic LBP operator.
the orientation of the Gabor filter and σ is the standard
deviation along x and y directions. The parameters x' and
y ' can be defined by the following equations: 32 18 25 1 0 0
threshold
12 27 50 0 1
' '
x = x cos θ + y sin θ , y = − y sin θ + y cos θ (2) 81 78 42 1 1 1
binary sequence: 10011110
Gabor filter bank of various frequencies and orientations has Figure 2 Example of LBP operators
been frequently used to extract features of face image.
However in previous work review [14-17] mainly a forty The limitation of the basic LBP operator is that its small 3×3
Gabor filter bank at five scales, v={0,1,…,4} and eight neighborhood cannot capture the dominant features with
orientations, µ={0,1,…,7} with σ=2π , ϖ = 2 were used. large scale structures. As a result, to deal with the texture at
Fig.1 shows the magnitude of the Gabor at four scales and different scales, the operator was later extended to use
the Real Gabor filter bank with 4 different scales and 6 neighborhoods of different sizes [19]. One of such is the
different orientations. uniform patterns: an LBP is ‘uniform’ if it contains at most
one 0-1 and one 1-0 transition when viewed as a circular bit
string. For instance, 00000000, 001110000 and 11100001
are uniform patterns. It is observed that uniform patterns
account for nearly 90% of all patterns in the (8, 1)
neighborhood and for about 70% in the (16, 2)
neighborhood in texture images, where the notation ( P, R )
denotes a neighborhood of P equally spaced sampling
points on a circle of radius of R that form a circularly
symmetric neighbor set [4].
(a) (b)
B. Principal Component Analysis
Figure 1 (a) Magnitude of the Gabor filter bank at four scales (b) PCA [20] can be defined as an orthogonal linear
Real part of the Gabor kernels at four scales and six orientations. transformation that transforms data to a new coordinate
system such that the highest variance by any projection of
The GW feature representation ϕm,n ( x, y ) is obtained by the data comes to sit on the first coordinate (called the first
principal component), the second highest variance on the
convolving the Gabor filter bank ψ ( x, y,ϖ , θ ) with input second coordinate, and so on. Thus PCA finds orthogonal
image. basis for data, sorts dimensions in order of importance and
ϕm,n ( x, y ) = I ( x, y ) ∗ψ ( x, y,ϖ ,θ ) (3) discard low significance dimensions. The main purpose of
PCA is to reduce the dimensionality of the raw data while
retaining as much as possible the variation present in the
In practice usually the magnitude of the convolution output data set [17].
Gabor feature vector representation is used for FER, PCA can also be thought as a way of recognizing patterns in
because they do vary slowly with the displacement while the data, and expressing the data in such a way as to emphasize
phases are very sensitive under small displacement [18]. their similarities and differences. Since pattern in data may
be difficult to find especially in data of high dimension.
III. DIMENTIONALITY REDUCTION PCA is used in many pattern recognition applications such
FER [21] and face recognition [22].
A. Local Binary Patterns
LBP operator takes a local neighborhood around each pixel,
IV. PROPOSED METHOD
thresholds the pixels of the neighborhood at the value of the
central pixel and uses the resulting binary valued image In this section we explain the method used in this paper. It
patch as a local image descriptor. It was originally defined can be summarized using the diagram shown in Fig. 3. The
for 3×3 neighborhoods, giving 8 bit codes based on the 8 FE database used is the Japanese Female Facial Expression
pixels around the central one, the operator labels the pixels (JAFFE) Database, the images where categorized into
of an image by thresholding a 3×3 neighborhood of each training and testing. All expressions in the database where
pixel with the center value and considering the results as a first pre-processed using GWT with 5 scales and 8
binary number, and the 256-bin histogram of the LBP labels orientations and no down sampling were done. After the
2266
2014 IEEE 22nd Signal Processing and Communications Applications Conference (SIU 2014)
pre-processing each image within the database has a Gabor+LBP has better performance with averaged
corresponding 40 images and each image is 256×256. Due recognition rate 90% and promise to be the best result
to this large number of images, dimensionality reduction is compared to the rest, though all the remaining algorithms
used. Two methods; PCA and LBP are used here. A uniform appear to demonstrate a good performance as well when
LBP is used in which each image is divided into a region of compared to previous work reviewed.
64×64, a radius of 1 used with 8 samples within each radius.
K-nearest neighbor is used for similarity measure and in this VI. CONCLUSION
case we used Euclidean distance (L2).
Gabor wavelets were used as a pre-processing stage
followed by dimensionality reducing using PCA/LBP for
Preprocessing (Gabor Dimensionality
Facial Expression
Database
Wavelet Transform) Reduction (PCA/LBP) facial expression recognition in this paper. Experimental
evaluations of the proposed approach were conducted on
JAFFE database. The results obtained showed that pre-
processing with Gabor wavelets improves the performance
of directly applying both PCA and LBP.
Decision Similarity Measure (L2)
Surprise
Disgust
Neutral
Happy
Anger
Fear
Sad
PCA
The database contains 213 images of 10 subjects with 7
facial expressions namely; neutral, anger, disgust, fear,
happy, sad and surprise. Fig. 4 shows example of the images Neutral 90 0 0 0 0 0 10
from the database. 137 images containing all the expressions Anger 9 64 18 9 0 0 0
were used for training while 76 images were used for Disgust 0 8 83 0 0 8 0
testing. Fear 8 0 8 67 0 8 8
Happy 0 0 0 10 80 10 0
Sad 18 0 0 18 18 45 0
Surprise 0 0 0 0 0 10 90
Surprise
Disgust
Neutral
Happy
Anger
Fear
Sad
LBP
Happy
Anger
Fear
Sad
2267
2014 IEEE 22nd Signal Processing and Communications Applications Conference (SIU 2014)
Surprise
Disgust
Neutral Classification of Single Facial Images,” IEEE Transactions on
Happy
Anger
Fear
Sad
Pattern Analysis and Machine Intelligence, Vol. 21, No. 12,
pp. 1357–1362, 1999.
[9] S. Bashyal, and G. Venayagamoorthy, “Recognition of Facial
Neutral 100 0 0 10 0 0 0 Expressions using Gabor Wavelets and Learning Vector
Anger 18 82 0 0 0 0 0 Quantization,” Engineering Applications of Artificial
Disgust 0 0 67 0 0 25 8 Intelligence, Vol. 21, No. 7, pp. 1056-1064, 2008.
Fear 0 0 0 58 17 25 0 [10] W. Gu, C. Xiang, Y. Venkatesh, D. Huang, and H. Lin, “Facial
Expression Recognition using Radial Encoding of Local Gabor
Happy 0 0 0 10 70 10 10
Features and Classifier Synthesis,” Pattern Recognition, Vol.
Sad 0 9 18 0 9 63 0 45, No. 1, pp. 80-91, 2012.
Surprise 0 0 0 10 0 0 90 [11] A. Eleyan, and H. Demirel, “Performance Comparison among
Complex Wavelet Transforms Based Face Recognition
Systems,” Image Processing and Communication Conference,
Table 5 Confusion Matrix for FER using Gabor+LBP AISC84, pp. 201-209, August 2010.
[12] W. K. Kong, D. Zhang and W. Li, “Palm Print Feature
Extraction using 2-D Gabor Filters”, Pattern Recognition
Gabor+LBP
Surprise
Disgust
Neutral
Happy
Fear
Sad
2268