SOM A Computer Vision Technique For Face Sketch Featured Database
SOM A Computer Vision Technique For Face Sketch Featured Database
SOM A Computer Vision Technique For Face Sketch Featured Database
Volume: 1 Issue: 4
___________________________________________________________________________
I. INTRODUCTION
Human communication has two main aspects auditory
(verbal) and visual (non-verbal). Non-verbal communication
like facial expression, body movements, and physiological
reactions provides significant information regards the state of
the person. Computer vision aims to duplicate the human
vision by electronically perceiving and understanding an
image. Computer vision techniques use the results of
artificial intelligence, pattern recognition, mathematics,
computer science, psycho-physiology and other scientific
areas.
II. KOHONEN SELF-ORGANIZING MAPS
Self-Organizing Maps (SOM) was introduced by a
Finnish Professor, Teuvo Kohonen in 1982, thus SOMs are
also sometimes known as Kohonen Maps. SOM are subtype
of artificial neural network. They are trained using unsupervised learning to produce low dimensional
representation of the training samples while preserving the
topological properties of the input space. Thus SOM are
reasonable for visualizing low-dimensional views of highdimensional data.
Self-Organizing Maps are a single layer feed forward
network where the output syntaxes are arranged in low
dimensional grid i.e. 2D or 3D. Each input is connected to all
output neurons attached to every neuron there is a weight
vector with the same dimensionality as the input vectors. The
number of input dimensions is usually a lot higher than the
output grid dimension. A Self-organizing map is shown in
below Figure 1.
355
IJRITCC | MAR 2013, Available @ http://www.ijritcc.org
___________________________________________________________________________
Volume: 1 Issue: 4
___________________________________________________________________________
input n1. The winners output is 1. The neuron whose weight
vector is closest to the input vector has the least negative net
input and, therefore, wins the competition to output a 1.
Thus the competitive transfer function produces a 1 for
output element a1i corresponding to i* the winning neuron.
All other output elements in a1 are 0.
A self-organizing feature map network identifies a
winning neuron using the same procedure as employed by a
competitive layer. However, instead of updating only the
winning neuron, all neurons within a certain neighborhood
of the winning neuron are updated using the Kohonen rule.
III.
E. 2D-DCT
The two-dimensional discrete cosine transform (2DDCT) used for image processing. The 2D-DCT resembles
the 1D-DCT transform because it is a separable linear
transformation. For example, in an n x m matrix, S, the 2DDCT is computed by applying it to each row of Sand then to
each column of the result. Figure 3 shows the generic 2DDCT architecture of an N x M input image.
A. Overview
The discovery of Discrete Cosine Transform (DCT) in
1974 was an important achievement for research community
working on image compression. It is a lossy data
compression technique virtually used image processing.
Compression standard like JPEG for compression of still
images employ the basics of the DCT.
B. Background
The DCT algorithm similar to Fast Fourier Transform
converts the data pixels into the sets of frequencies. The first
frequencies in the set are the most meaningful than the latter
i.e. least. To compress the data the least meaningful
frequencies are stripped away based on the allowable
resolution loss. DCT operates on a function at a finite
number of discrete data points.
C. Definition
The DCT is regarded as a discrete-time version of the
Fourier cosine series. Hence it is considered as a Fourierrelated transform similar to the Discrete Fourier Transform
(DFT) using only real numbers.
D. 1D-DCT
The Discrete Cosine Transform is a linear invertible
function or an N x N square matrix like:
F: RN -> RN
where, R denotes set of real numbers
Mathematically,1D-DCT
length N can be defined as:
Figure 3: 2D-DCT
IV. DESIGN AND IMPLEMENTATION
In Computer Vision, detecting a Face Sketch in a digital
image involves segmentation, extraction and verification of
faces and possibly facial features like Left-Eye, Right-Eye,
Nose, Lips and frontal face from an uncontrolled
background.
A. Face Sketch Recognition
Face Sketch recognition algorithm can be classified into
two main categories i.e. generative and discriminative
approaches.
1) Genrative Approach: To model a digital image in
terms of sketches and then match it with the query sketch or
vice-versa.
Wang and Tang proposed Eigen transformation based
approach to transform a digital photo into sketch before
matching. In another approach, they presented an algorithm
to separate shape and texture information and applied
Bayesian classier for recognition. Liu et al. proposed nonlinear discriminative classier based approach for
synthesizing sketches by preserving face geometry. Li et al.
matched sketches and photos using a method similar to the
Eigen-transform after converting sketches to photos.
Recently, Wang and Tang proposed Markov Random Fields
based algorithm to automatically synthesize sketches from
digital face images and vice-versa.
___________________________________________________________________________
___________________________________________________________________________
variations in gender, age, ethnicity and inter-artist
variations. They also discussed about the quality of sketches
in terms of artists skills, experience, exposure time, and
distinctiveness of features. Similarly, Nizami et al. analyzed
the effect of matching sketches drawn by different artists.
Klare and Jain proposed a Scale Invariant Feature
Transform (SIFT) based local feature approach where
sketches and digital face images were matched using the
gradient magnitude and orientation within the local region.
Bhatt et al. extended Uniform Local Binary Patterns to
incorporate exact difference of gray level intensities to
encode texture features in sketches and digital face images.
Klare et al. extended their approach using Local Feature
Discriminant Analysis (LFDA) to match forensic sketches.
In their recent approach, Klare and Jain proposed a
framework for heterogeneous face recognition where both
probe and gallery images are represented in terms of
nonlinear kernel similarities. Recently, Zhang et al.
proposed an information theoretic encoding band descriptor
to capture discriminative information and random forest
based matching to maximize the mutual information
between the sketch and the photo. [7]
B. Face Image Pre-processing
The programming language used to design and
implement the forensic face sketch identification system
code is MATLAB. The reason for using MATLAB in this
project is due to its Neural Network and Image Processing
toolbox that helped to obtain an efficient code.
Face image processing consists of the following steps:
Data Gathering
Import Faces Images to MATLAB
Image Resize in MATLAB
Featured Cropping
1.
Data Gathering
357
IJRITCC | MAR 2013, Available @ http://www.ijritcc.org
___________________________________________________________________________
355 358
Volume: 1 Issue: 4
___________________________________________________________________________
1
1
0
0
0
0
0
1
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0 ];
Lips
358
IJRITCC | MAR 2013, Available @ http://www.ijritcc.org
___________________________________________________________________________
___________________________________________________________________________
359
IJRITCC | MAR 2013, Available @ http://www.ijritcc.org
___________________________________________________________________________