A Conic Section Classifier and Its Application To Image Datasets
... features that is sufficient for the purpose of classification.

In this paper, we present a novel concept class that expands the power of the first approach noted above. The concept class, presented in Section 2, is rich and subsumes linear discriminants, and yet is specified with merely twice the number of parameters of a linear discriminant. Each member class in the dataset is represented by a prototype conic section in the feature space, and new data points are classified based on a distance measure to each such representative conic section. In Section 3, we present a tractable algorithm for learning the appropriate conic sections (i.e., their directrices, foci, and eccentricities) for the classes given a labeled dataset. In Section 4, we demonstrate the efficacy of the technique by comparing it to several well known classifiers on multiple artificial as well as public domain datasets.

2. The Concept Class using Conic Sections

A conic section in R^2 is defined as the locus of points whose distance from a given point (the focus) and that from a given line (the directrix) form a constant ratio (the eccentricity). The different kinds of conic sections, ellipse, parabola and hyperbola, are obtained by fixing the value of the eccentricity to < 1, = 1, and > 1, respectively. The concept can be generalized to R^M by making the directrix a hyperplane of codimension 1. Together, the focus and the directrix hyperplane generate an eccentricity function that attributes to each point X ∈ R^M a scalar valued eccentricity defined as:

    ε(X) = √((F − X)^T (F − X)) / (b + D^T X)    (1)

where F ∈ R^M is the focus and (b + D^T X) is the orthogonal distance of X from the directrix represented as {b, D}, where b ∈ R is the offset of the directrix from the origin and D ∈ R^M, D^T D = 1, is the unit normal vector to the directrix. Setting ε(X) = ê yields an axially symmetric conic section in R^M.

We are now in a position to formally define the concept class. To each class k we assign a distinct conic section parameterized by the descriptor set (focus, directrix and eccentricity), Ck = {Fk, {bk, Dk}, êk}. For any given point X, each class attributes an eccentricity εk(X), as defined in Eqn.1, in terms of the descriptor set Ck. The conic sections for a set of K classes induce a mapping ε* : R^M → R^K from the feature space to the eccentricity space (ecc-Space), ε*(X) = (ε1(X), . . . , εK(X)). The point X is assigned to the class whose eccentricity descriptor êk is closest in magnitude to the attributed eccentricity, i.e.,

    class(X) = argmin_k |εk(X) − êk|    (2)

    |ε1(X) − ê1| = |ε2(X) − ê2|,  for K = 2    (3)

With an eye towards simplicity, we restrict the rest of the presentation to the binary classification case. The discriminant boundary (Eqn.3) for this case is the locus of points equidistant, in eccentricity, to the two representative conic sections. This discriminant corresponds to a rich, non-linear surface in R^M.
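For concreteness, the following is a minimal numerical sketch of the eccentricity function of Eqn.1 and the decision rule of Eqn.2; it is not the authors' implementation, and the variable names and the small example descriptors are our own.

    import numpy as np

    def eccentricity(X, F, b, D):
        """Eqn. 1: attributed eccentricity ||F - X|| / (b + D^T X).
        D is assumed to be a unit normal to the directrix {b, D}, so that
        (b + D^T X) is the orthogonal distance of X from the directrix."""
        return np.linalg.norm(F - X) / (b + D @ X)

    def classify(X, descriptors):
        """Eqn. 2: assign X to the class whose class-eccentricity e_hat is
        nearest to the eccentricity attributed to X by that class's descriptor.
        `descriptors` is a list of (F, b, D, e_hat) tuples, one per class."""
        deviations = [abs(eccentricity(X, F, b, D) - e_hat)
                      for (F, b, D, e_hat) in descriptors]
        return int(np.argmin(deviations)) + 1        # class labels 1..K

    # Tiny illustrative example in R^2 with two hypothetical descriptors.
    C1 = (np.array([0.0, 1.0]), 1.0, np.array([0.0, 1.0]), 0.8)
    C2 = (np.array([0.0, -1.0]), 1.0, np.array([0.0, -1.0]), 0.8)
    print(classify(np.array([0.3, 0.5]), [C1, C2]))   # prints 1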
Figure 1. Discriminant boundaries in R^2 for four configurations, (a)-(d), of a pair of conic sections. (See Sec. 2.)

The concept class just described has several notable features. As shown in Fig.1, different configurations of two conic sections (shown in R^2) generate different discriminant boundaries, ranging from simple to complex. Fig.1(a) corresponds to a configuration where the directrices for the two classes are identical, the foci for the two classes lie symmetrically on the two sides of the directrix, and the class eccentricities are equal. If the foci are moved such that the line joining them is bisected by the directrix, the boundary remains linear (Fig.1(b)). When the angle between the normals to the directrices, D1, D2, is non-zero, the boundary becomes non-linear (Fig.1(c)). Further changes in the descriptors produce rich non-linear boundaries (Fig.1(d)).

Regardless of the dimensionality of the feature space, the discriminant is linear when the directrices of the two classes are parallel, the foci are equidistant from the directrices, and the class eccentricities are equal and lie in a particular range. The concept class therefore subsumes linear discriminants. Finally, the number of parameters necessary to specify the conic sections for each class is 2(M + 1), which is far less than the M^2 parameters necessary to specify a generic quadratic surface. We point out in passing that there is no known kernel for the support vector machine which matches this concept class, and therefore the concept class is novel.

3. Learning Algorithm - The Two-Class Case

In this section, we present a novel incremental algorithm (Algorithm 1) for learning the conic section descriptors Ck = {Fk, {bk, Dk}, êk}, k = 1, 2, that minimize the empirical error (Eqn.4). We assume a set of N labeled samples P = {(X1, y1), . . . , (XN, yN)}, where Xi ∈ R^M and the label yi ∈ {1, 2}, and that the data is sparse in a very high dimensional input space, i.e., N ≪ M.

    Data: Labeled Samples P
    Result: Conic Section Descriptors C1, C2
    1: Initialize {F1, b1, D1}, {F2, b2, D2} [Sec. 3.6]
    2: Compute ε1(Xi), ε2(Xi) ∀Xi ∈ P
    3: Find class-eccentricities ê1, ê2 [Sec. 3.1]
    4: Compute the desired ε̃1i, ε̃2i [Sec. 3.2]
    5: Update foci & directrices alternately [Sec. 3.3, 3.5]
    6: Go to (2) until convergence of descriptors.
    Algorithm 1: Learning the descriptors C1, C2
Following initialization of the descriptors (Section 3.6), the learning process comprises two stages. In the first stage, C1 and C2 are held fixed, and each Xi is mapped into ecc-Space by computing its attributed eccentricities ε1(Xi), ε2(Xi). The pair of class eccentricities (ê1, ê2) that minimizes the empirical risk Lerr is then computed.

    Lerr = (1/N) Σ_i I(yi ≠ class(Xi))    (4)

where I is the indicator function. For each misclassified sample, one can find a desired pair of attributed eccentricities (ε̃1i, ε̃2i) that would correctly classify that sample.

In the second stage, the foci {F1, F2} and the directrices {{b1, D1}, {b2, D2}} are updated alternately so as to achieve the desired attributed eccentricities for those misclassified samples, without affecting the attributed eccentricities for those samples that are already correctly classified. The process is repeated until the descriptors converge or there can be no further improvement in classification.

3.1. Finding Class-Eccentricities ê1, ê2

Note that the dimensionality of ecc-Space is the number of classes (2 in our case). For any given choice of class eccentricities, the discriminant boundary (Eqn.3) in ecc-Space is a pair of orthogonal lines with slopes +1 and −1, respectively, as illustrated in Fig.2(a). The lines intersect at (ê1, ê2), referred to hereafter as the cross-hair. The lines divide ecc-Space into four quadrants, with opposite pairs belonging to the same class. It should be noted that this discriminant corresponds to a non-linear decision boundary in the feature space R^M.

We now present an O(N^2) algorithm to find the optimal cross-hair. The method begins by rotating ecc-Space around the origin by 45° so that any choice of the discriminants will now be parallel to the new axes. Each axis is divided into (N + 1) intervals by projecting the points in ecc-Space onto that axis. Consequently, ecc-Space is partitioned into (N + 1)^2 2D intervals. We now make a crucial observation: within the confines of a given 2D interval, any choice of a cross-hair classifies the set of samples identically. We can therefore enumerate just the (N + 1)^2 intervals and choose the one that gives the smallest classification error. The cross-hair is set at the center of this 2D interval. In cases where there are multiple 2D intervals that give the smallest classification error, the larger one is chosen.
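The following sketch illustrates this search; it is our own simplification, not the authors' code. It enumerates one candidate cross-hair per 2D interval of the rotated coordinates but naively re-counts the error for every candidate (O(N^3)) and ignores the tie-breaking rule; the paper's incremental book-keeping is what brings the search down to O(N^2).

    import numpy as np

    def find_crosshair(E, y):
        """Brute-force version of the Sec. 3.1 cross-hair search.
        E is (N, 2) with the attributed eccentricities (eps_1(X_i), eps_2(X_i));
        y holds the labels in {1, 2}.  Returns (e1_hat, e2_hat, error)."""
        R = np.array([[1.0, 1.0], [-1.0, 1.0]]) / np.sqrt(2.0)   # 45-degree rotation
        UV = E @ R.T                                             # rotated points

        def centres(vals):
            # One candidate coordinate per interval of the sorted projections.
            s = np.unique(vals)
            mids = (s[:-1] + s[1:]) / 2.0
            return np.concatenate(([s[0] - 1.0], mids, [s[-1] + 1.0]))

        best = None
        for u0 in centres(UV[:, 0]):
            for v0 in centres(UV[:, 1]):
                # Eqn. 2 with this cross-hair: class 1 iff |e1-e1_hat| < |e2-e2_hat|,
                # which in the rotated coordinates reduces to (u-u0)*(v-v0) > 0.
                pred = np.where((UV[:, 0] - u0) * (UV[:, 1] - v0) > 0, 1, 2)
                err = np.mean(pred != y)
                if best is None or err < best[0]:
                    best = (err, u0, v0)
        # Map the best cross-hair back to ecc-Space (R is orthonormal, so R^-1 = R^T).
        e_hat = np.array([best[1], best[2]]) @ R
        return e_hat[0], e_hat[1], best[0]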
3.2. Learning Misclassified Points

Given the attributed eccentricities (ε1i, ε2i) of a misclassified point, we can compute its desired location (ε̃1i, ε̃2i) in ecc-Space (see Fig.2(a)) by moving it into the nearest quadrant associated with its class label. This movement can be achieved by updating a focus or directrix in Eqn.1. In order to keep the learning process simple, we update only one descriptor of a particular class at each iteration. Hence, we move the misclassified points in ecc-Space by changing ε̃1i or ε̃2i for the class of the descriptor being updated.

The learning task now reduces to alternately updating the foci and directrices of C1 and C2, so that the misclassified points are mapped into the desired quadrants in ecc-Space, while the correctly classified points remain fixed. Note that with such an update, our learning rate is non-decreasing. We also introduce a margin along the discriminant boundary and require the misclassified points to be shifted beyond this margin into the correct quadrant. In most of our experiments the margin was set to 5% of the range of eccentricity values in ecc-Space.
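One plausible realization of this rule, under our reading of the quadrant geometry implied by Eqn.2 (class 1 occupies the quadrants where |ε1 − ê1| < |ε2 − ê2|), is sketched below. The choice of coordinate corresponds to the class of the descriptor being updated; the exact margin handling is an assumption, not the authors' specification.

    import numpy as np

    def desired_eccentricity(e1, e2, e1_hat, e2_hat, label, k, margin):
        """Move a misclassified point into the nearest quadrant of its own
        class by changing only coordinate k (1 or 2) of its ecc-Space
        location, placing it `margin` beyond the discriminant (Sec. 3.2,
        our sketch).  Returns the desired value of that coordinate."""
        u, v = e1 - e1_hat, e2 - e2_hat
        if k == 1:
            # Class 1 needs |e1 - e1_hat| < |e2 - e2_hat|; class 2 the opposite.
            dev = abs(v) - margin if label == 1 else abs(v) + margin
            return e1_hat + np.sign(u if u != 0 else 1.0) * max(dev, 0.0)
        else:
            dev = abs(u) + margin if label == 1 else abs(u) - margin
            return e2_hat + np.sign(v if v != 0 else 1.0) * max(dev, 0.0)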
3.3. Updating The Focus

Our objective here is to achieve the desired attributed eccentricities ε̃ki for all the samples by changing the focus Fk. For each correctly classified sample, the desired eccentricity ε̃ki is simply its previous value εki. From Eqn.1 we can conclude that the εki's for k = 1 depend only on the class descriptor C1, and likewise for k = 2. Since we update only one focus at a time, we shall hereafter deal with the case k = 1. The update problem may be posed formally as follows: find a focus F1 that satisfies the following N quadratic constraints, where ‖·‖2 denotes the Euclidean L2 norm.

    ‖F1 − Xi‖2 = r1i,        ∀Xi ∈ Pc
    ‖F1 − Xi‖2 ≤ or ≥ r1i,   ∀Xi ∈ Pmc    (5)

    where r1i = ε̃1i (b1 + D1^T Xi)    (6)

In effect, each point Xi desires F1 to be at a distance r1i from itself, derived from Eqn.6. Pc and Pmc are the sets of correctly classified and misclassified points, respectively. The inequalities above imply that the desired location ε̃1i can lie in an interval along an axis in ecc-Space (see Fig.2). In order to closely control the learning process, we learn one misclassified point at a time, while holding all the others fixed. This leaves us with only one inequality constraint.

We refer to the set of all feasible solutions to the above quadratic constraints as the Null Space of F1. Further, we have to pick an optimal F1 in this Null Space that maximizes the generalization capacity of the classifier. Although the general Quadratic Programming Problem is known to be NP-hard, the above constraints have a nice geometric structure that can be exploited to construct the Null Space in O(N^2 M) time. Note that, by assumption, the number of constraints N ≪ M. The Null Space of F1 with respect to each equality constraint in Eqn.5 is a hyper-sphere in R^M. Hence, the Null Space for all the constraints combined is simply the intersection of all the corresponding hyper-spheres in R^M with centers {X1, . . . , XN} and radii {r11, . . . , r1N}.
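As a concrete reading of Eqns.5-6 (a sketch with our own names, not the authors' code), the target radii and a feasibility check for a candidate focus can be written as:

    import numpy as np

    def focus_constraint_radii(X, b1, D1, eps1_desired):
        """Eqn. 6: r_1i = eps~_1i * (b1 + D1^T X_i) for every sample.
        X is (N, M); eps1_desired holds the desired class-1 eccentricities.
        Each sample constrains the focus F1 to the hyper-sphere of radius
        r_1i centred at X_i (with an inequality instead of an equality for
        the single misclassified sample being processed)."""
        return eps1_desired * (X @ D1 + b1)

    def satisfies_equality_constraints(F1, X, radii, tol=1e-8):
        """Check a candidate F1 against the equality constraints of Eqn. 5.
        This only tests feasibility; the paper constructs the whole feasible
        set (the Null Space) via hyper-sphere intersections."""
        return np.all(np.abs(np.linalg.norm(X - F1, axis=1) - radii) <= tol)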
Let XN be the single point being updated. [...] the intersection of N hyper-spheres problem is converted into the intersection of (N − 1) hyper-spheres and a hyper-plane H{1,2} problem.
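The omitted derivation (the paper's Section 3.4) is not reproduced here, but the conversion stated above is consistent with a standard identity; the following worked equation is our own illustration. Subtracting the equality constraints of Eqn.5 for two points, say X1 and X2, eliminates the quadratic term in F1:

    ‖F1 − X1‖^2 = r11^2,  ‖F1 − X2‖^2 = r12^2
    ⟹  2 (X2 − X1)^T F1 = (r11^2 − r12^2) + (‖X2‖^2 − ‖X1‖^2)

The result is a hyper-plane constraint on F1 (denoted here H{1,2}); intersecting it with either of the two hyper-spheres recovers the original pair of constraints.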
3.5. Updating The Directrix

We translate the origin to the first classified point, X1, so that b1 = v11. In addition, just as in Section 3.3, we learn a single misclassified point, say XN, in each iteration. With a known b1, we translate and scale all remaining points such that the linear constraints become:

    D1^T X̂i = v̂1i,        2 ≤ i < N
    D1^T X̂i ≤ or ≥ v̂1i,   i = N    (11)

    where X̂i = (Xi − X1) / ‖Xi − X1‖2    (12)
          v̂1i = (v1i − v11) / ‖Xi − X1‖2    (13)

Now the null space of D1 for each constraint in Eqn.11, considered separately, is a hyper-plane Hi in R^M represented as {−v̂1i, X̂i}. The null space corresponding to the quadratic constraint on D1 is a unit hyper-sphere, S1 ⊂ R^M, centered at the new origin. Hence, the final Null Space for D1 is the intersection of all the Hi's and S1.
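A small sketch of Eqns.11-13 follows (our own construction). The desired directrix values v1i are taken here to be ‖F1 − Xi‖2 / ε̃1i, which is what Eqn.1 implies for a desired eccentricity ε̃1i; that definition belongs to a part of the paper not reproduced above, so treat it as an assumption.

    import numpy as np

    def directrix_constraints(X, F1, eps1_desired):
        """Build the normalized linear constraints of Eqns. 11-13 (sketch).
        v_1i is assumed to be ||F1 - X_i|| / eps~_1i, the desired value of
        (b1 + D1^T X_i) implied by Eqn. 1.  Returns b1 (= v_11, after moving
        the origin to X_1) and the pairs (X_hat_i, v_hat_1i) whose
        hyper-planes H_i = {D : D^T X_hat_i = v_hat_1i}, together with the
        unit-sphere constraint ||D1|| = 1, make up the Null Space of D1."""
        v = np.linalg.norm(X - F1, axis=1) / eps1_desired   # desired b1 + D1^T X_i
        b1 = v[0]
        diff = X[1:] - X[0]
        norms = np.linalg.norm(diff, axis=1)
        X_hat = diff / norms[:, None]                       # Eqn. 12
        v_hat = (v[1:] - v[0]) / norms                      # Eqn. 13
        return b1, X_hat, v_hat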
We now make two critical observations. The intersection of a hyper-plane with a hyper-sphere is a lower-dimensional hyper-sphere. The same is true of the intersection of two hyper-spheres. We can therefore convert this hyperplane-hypersphere intersection problem into a hypersphere-hypersphere intersection problem. In effect, we can replace each hyper-plane Hi with a suitable hyper-sphere Si such that Hi ∩ S1 = Si ∩ S1. Owing to the geometry of the problem, we can compute Si from Hi and S1. The Null Space for all the constraints combined is now the intersection of all the hyper-spheres S1, S2, . . . , SN. The problem, now reduced to a hyper-spheres intersection problem, is solved as in Section 3.4.

3.6. Initialization

Given a set of labeled samples, we found that there are several ways of initializing the conic section descriptors that led to a solution. Random initializations converged to different conic descriptors each time, leading to inconsistent performance. We observed that, owing to Eqn.1, the Null Spaces are small or vanishing if the foci or directrices are very close to the samples. We found the following initialization to be consistently effective in our experiments. The foci were first placed at the sample class means and then pushed apart until they were outside the sample clouds. The normals to the directrices were initialized as the line joining the foci. The directrix planes were then positioned at the center of this line or on either side of the data.
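A sketch of this initialization is given below (our own reading): the amount by which the foci are pushed outside the sample clouds is not specified in the text, so the multiple of the class scatter used here, and the choice of a single shared directrix through the midpoint, are assumptions.

    import numpy as np

    def initialize_descriptors(X, y, push=2.0):
        """Sec. 3.6 initialization heuristic (sketch).  Places the foci at the
        class means pushed apart along the line joining them, sets the
        directrix normal to that line, and positions the directrix plane at
        the midpoint of the segment joining the two foci.
        Returns (F1, F2, b, D) for labels y in {1, 2}."""
        mu1, mu2 = X[y == 1].mean(axis=0), X[y == 2].mean(axis=0)
        D = (mu2 - mu1) / np.linalg.norm(mu2 - mu1)   # normal along the foci line
        # Push each focus past its own class's sample cloud (assumed rule).
        s1 = np.linalg.norm(X[y == 1] - mu1, axis=1).max()
        s2 = np.linalg.norm(X[y == 2] - mu2, axis=1).max()
        F1 = mu1 - push * s1 * D
        F2 = mu2 + push * s2 * D
        mid = 0.5 * (F1 + F2)
        b = -D @ mid    # so that b + D^T X = 0 on the plane through the midpoint
        return F1, F2, b, D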
3.7. Discussion

One of the core characteristics of our algorithm is that, after each update, any point that is correctly classified by the earlier descriptors is not subsequently misclassified. This is due to two reasons. First, we begin with an initialization that gives a valid set of assignments for the class attributed eccentricities. This implies that the Null Space for the correctly classified points is non-empty. Second, the search for updates within the Null Space always guarantees a feasible solution for the constraints related to the correctly classified points.

A key contribution of our technique is the tracking of the set of all feasible solutions as a compact geometric object. From this Null Space we pick a solution biased towards a linear discriminant so as to improve generalization. The size of the margin in ecc-Space also gives a modicum of control over generalization. The order in which samples are processed does not affect the final Null Space. The convergence of our learning algorithm depends on the data and the initialization; however, we found that it converged to a local minimum typically within 50 iterations of the focus and directrix updates.

4. Experiments

We evaluated the classifier on two synthetic datasets and four real datasets. The results were compared against several state-of-the-art linear and non-linear classifiers. The classification accuracies based on leave-one-out cross-validation are presented in Table 1.

Support Vector Machines (SVM) [2] and Kernel Fisher Discriminants (KFD) [7] broadly represented the non-linear category. Both employ the kernel trick of replacing inner products with Mercer kernels. Among the linear classifiers, we chose the Linear Fisher Discriminant (LFD) [4] and the linear SVM. We used the OSU SVM toolbox for MATLAB based on libSVM [13]. We considered Polynomial (PLY) and Radial Basis Function (RBF) kernels.

The best parameters were explored empirically. Polynomial kernels gave the best results with degree 1 or 2 and a scale approximately equal to the sample variance. The RBF kernel performed best when the radius was the sample variance or the mean distance between all sample pairs.

4.1. Results

Synthetic dataset-1 was randomly generated from two well separated Gaussian clusters in R^40. The results in Table 1 validate our classifier's effectiveness on simple, linearly separable data. Synthetic dataset-2 was generated by sampling from two intersecting paraboloids (related to the two classes) in R^3 and placing them in R^64. This instance shows that our classifier is well suited to data lying on paraboloids; it clearly out-performed the other classifiers.

Epilepsy data [10] consists of displacement vector fields between the left and right hippocampi for 31 epilepsy patients. The displacement vectors are computed at 762 discrete mesh points on each of the hippocampal surfaces, in 3D. This vector field, representing the non-rigid registration, captures the asymmetry between the left and right hippocampi. Hence, it can be used to categorize different classes of epilepsy based on the localization of the focus of epilepsy to either the left (LATL) or right temporal lobe (RATL).
Dataset            Size (N x M)    CSC     LFD     KFD PLY   KFD RBF   SVM PLY   SVM RBF
Synthetic Data1    20 x 40         100     100     100       100       100       100
Synthetic Data2    32 x 64         93.75   87.5    75        75        81.25     87.5
Epilepsy           31 x 2286       77.42   67.74   67.74     61.29     67.74     74.19
Colon Tumor        62 x 2000       87.1    85.48   75.81     82.26     82.26     85.48
UMIST FaceDB       575 x 10304     97.74   98.72   99.93     99.91     99.3      99.06
Texture Pair1      95 x 601        100     100     100       100       100       100
Texture Pair2      95 x 601        92.63   98.94   100       100       90.52     82.10

Table 1. Classification accuracies (%) for the Conic Section Classifier (CSC), the Linear and Kernel Fisher Discriminants, and the SVM. (See Sec. 4.)
The LATL vs. RATL classification is a hard problem. As seen in Table 1, our classifier out-performed all the others, with a significant margin over all but SVM-RBF. In fact, our result is better than that reported in [10]. The best RBF kernel parameters for the SVM and KFD methods were 600 and 1000, respectively; the best degree for the polynomial kernel was 1 for both of them.

The Colon Tumor data [1] comprises 2000 gene-expression levels for 22 normal and 40 tumor colon tissues. The normals to the directrix descriptors were initialized with the LFD direction in this case. Our classifier yielded 87% accuracy, outperforming the other classifiers. Interestingly, most of the other classifiers could not out-perform LFD, implying that they were learning the noise as well. Furey et al. [5] were able to correctly classify two more samples with a linear SVM, but only after adding a diagonal factor of two to the kernel matrix.

The Sheffield (formerly UMIST) Face Database [6] has 564 pre-cropped face images of 20 individuals with varying pose. Each image has 92 x 112 pixels with 256 gray-levels. Since we only have a binary classifier at present, the average classification performance over all possible pairs of subjects is reported. This turned out to be an easier problem: the conic section classifier achieved a comparable accuracy of about 98%, while the others were near 100%.

The CURET database [3] is a collection of 61 texture classes imaged under 205 illumination and viewing conditions. Varma et al. [9] built a dictionary of 601 textons and computed texton frequencies for a given sample image. The texton frequency histograms obtained from [8] can be used as the sample feature vectors for classification. About 47 images were chosen from each class, without any preferential order, so as to demonstrate the efficacy of our classifier for high-dimensional sparse data. We report the results for an easy pair and a relatively tougher pair of textures: Sand paper vs. Rough paper (Pair1) and Sand paper vs. Polyester (Pair2), respectively. As seen in Table 1, Pair1 indeed turned out to be the easier case. KFD out-performed the others for the second pair, and our classifier fared comparably.

5. Summary and Conclusions

In this paper, we have introduced a novel concept class based on conic section descriptors, provided a tractable supervised learning algorithm, and tested the resultant classifier against several state-of-the-art classifiers on many public domain datasets. Our classifier was able to classify the tougher datasets better than the others in most cases, as validated in Table 1. The classifier in its present form uses axially symmetric conic sections. In future work, we intend to extend this technique to multi-class classification and to conic sections that are not necessarily axially symmetric.

References

[1] U. Alon, et al. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. PNAS, 96:6745–6750, 1999.
[2] C. Cortes and V. Vapnik. Support-vector networks. Machine Learning, 20(3):273–297, 1995.
[3] K. J. Dana, et al. Reflectance and texture of real-world surfaces. ACM Transactions on Graphics, 18(1):1–34, 1999.
[4] R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. Wiley-Interscience, 2001.
[5] T. Furey, et al. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics, 16(10):906–914, 2000.
[6] D. Graham and N. Allinson. Characterizing virtual eigensignatures for general purpose face recognition. NATO ASI Series F, Comp. & Sys. Sci., 163:446–456, 1998.
[7] S. Mika, et al. Fisher discriminant analysis with kernels. Neural Networks for Signal Processing, IX:41–48, 1999.
[8] E. Spellman, B. C. Vemuri, and M. Rao. Using the KL-center for efficient and accurate retrieval of distributions arising from texture images. In CVPR, pages 111–116, 2005.
[9] M. Varma and A. Zisserman. Texture classification: Are filter banks necessary? In CVPR, volume 2, pages 691–698, June 2003.
[10] N. Vohra, et al. Kernel Fisher for shape based classification in epilepsy. In MICCAI, pages 436–443, 2002.
[11] W. Zhao, et al. Face recognition: A literature survey. ACM Comput. Surv., 35(4):399–458, 2003.
[12] V. Vapnik. Statistical Learning Theory. John Wiley and Sons, New York, 1999.
[13] C. Chang and C. Lin. LIBSVM: a Library for Support Vector Machines (Version 2.31).