Face Recog Using Wavelet Transformations PDF
Face Recog Using Wavelet Transformations PDF
ABSTRACT
We present a comparison between two major pruning
strategies for face recognition using wavelet packets. The
first approach is top-down, beginning with the image
representation of the human faces and decomposing the
image selectively based on correlation energy. The second
approach begins with the full wavelet decomposition,
designs a correlation filter for each subspace, and prunes
the tree based on classification rate. Each technique has
advantages: top-down is better at verifying whether an
image belongs to a particular class while the full wavelet
decomposition is better at determining which class an
image belongs to out of a set of different classes. We
attempt to generate trees that take advantage of both
strengths to see at what middle ground we maximize
effectiveness at image verification and image
classification, using faces as the test images. Then, we
apply the tree structure to sets of human faces to
determine whether the tree structure is more broadly
applicable.
1.
INTRODUCTION
BACKGROUND
2.1. Biometrics
Biometrics is the use of unique physical characteristics of
people to positively authenticate a user of a system or
recognize a type of a pattern and then identify people
based on that pattern. Examples of biometrics for
authentication include iris pattern, hand vein pattern, and
fingerprints, while biometrics for recognition are
primarily based on facial features. By identifying which
characteristics are particular to a person, that person can
be recognized in the future.
Currently, biometrics is not commonly used to
authenticate users. The most common use, fingerprint
scanners on laptops, is still limited to after-market
peripherals and a few models of IBM laptops [3]. Two of
the problems that prohibit their more widespread use are
the difficulty of obtaining a clean image and the greater
space that it takes to store an image as opposed to an
alphanumeric password. While we can assume that a user
will try to give a clean image in order to maximize
recognition, different lighting conditions or fingerprint
pressure can greatly affect the image that is compared to
1.1. Motivation
While it is a relatively simple matter to either identify a
human face or classify between different ones, it would be
very interesting to see if we could use wavelet transforms
to achieve better and more robust results. One of most
characteristics of wavelet transforms is their ability to
represent a signal into partitions of time-frequency plane.
The popular representation of wavelet transforms is a
multi-resolution wavelet tree where the each sub space
contains information in a time-frequency domain.
Therefore, we want to design figures of merit to take
advantage of those wavelet spaces to end up with different
pruned trees. The main motivation is to try to design a tree
that is optimal for verification and one which is good for
classification of face images. Their performance at
verification and classification will then be measured using
the figures of merit, and the strategies will be refined
based on the results. In the end, we will come up with a
2.3. Classifier
There are many different classifiers out there that have
proved to be very effective in classifying faces. We will
be using advanced correlation filters, specifically the
Minimum Average Correlation Energy filter (MACE).
Correlation filter techniques are attractive candidates for
the matching needed in face verification. According to
Kumar et al. in [4], correlation filters can be used on any
biometrics as long as they are in the form of images.
Advanced correlation filters can offer a very good
matching performance in the presence of variability such
as facial expression and illumination changes.
Furthermore, they are of less complexity and are shift
invariant.
The main idea is to synthesize a filter using a set of
training images that would produce correlation output that
reduces the correlation values at locations other than the
origin and this value at the origin is constrained to a
specific peak value. When the filter is correlated with a
test image that is authentic, the filter will exhibit sharp
correlation peaks in the correlation plane. Otherwise the
filter will output small correlation values. This correlation
plane v can be modeled as
v i = (DFT)1 Ci DFTx
(1)
PCE(v i ) =
Figure 2 - An example of a full wavelet decomposition with two
levels.
max(| v i |) mean(| v i |)
stdev(| v i |)
(2)
E average =
where
1
N
Ei =
i =1
1
N
h
i =1
Di h = h + Dh , (3)
1 N
D = Di
N i=1
(4)
X +h = u
(5)
h = D 1 X ( X + D 1 X ) 1 u
3.
(9)
(6)
PROPOSED WORK
(10)
F = 1/ E
(8)
(11)
f (v i ) = PCE(v i )
l
(12)
3.3. Methodology
We will be using the Cohn-Kanade AU-Coded Facial
Expression database for testing our pruning strategies. For
the purpose of verification and classification, Advanced
Correlation filters, more specifically MACE will be used.
The evaluation process will consist of two phases, namely
training followed by testing phase. For the first stage, the
first tree structure is going to be designed by starting in
the image space and adaptively decomposing using
wavelet transforms. At the same time, pruning will be
done at every sub space using correlation energy based on
inequality (9) as main figure of merit. The second strategy
is that we start with the full decomposition and prune
every single sub space using the second figure of merit
defined in (10). The design of those two tree structures
will be adaptive whereby we can alter the variables in our
figure of merits to get those two representations as close
as possible. In other words, we designed an optimal tree
with the best verification rate and at the same time
requiring the minimum spanning tree. The way to do that
is that we build a combined tree, which has a correlation
filter at a sub space on both trees. Once we have the
combined tree, we can begin the testing phase. We tested
it by applying those wavelet-domain correlation filters on
the expression database to evaluate the effectiveness of
our method. As a control test, standard correlation filters
have been generated in the image-intensity domain only
and their overall performances for verification and
classification have been computed. We will compare our
results to those values.
4.
EXPERIMENTAL RESULTS
4.1. Datasets
The database we used for evaluation is the Cohn-Kanade
AU-Coded Facial Expression database.
[1]
[2]
[3]
[4]
[5]
Original
Wavelet
80
[6]
70
60
[7]
50
40
30
20
[8]
10
0
1
9 10 11 12 13 14 15 16 17 18 19 20
Classes