Solucao Diferente
Solucao Diferente
Authorized licensed use limited to: CENTRO FED DE EDUCACAO TECNOLOGICA DE MINAS GERAIS. Downloaded on May 24,2023 at 20:42:32 UTC from IEEE Xplore. Restrictions apply.
sentative model) is high'. Therefore it is of great interest the example in Figure 1 these are indicated by i1,2 = (0.5,2)
to find representations which enable a visualization of the and i2,3 = (0.75,1.5) which uniquely describe a line in 3 di-
entire structure of a dataset or the corresponding model mensions.
without such a loss of information. The n-dimensional line in Cartesian coordinates can be
An interesting approach to visualize high dimensional represented by (n - 1) linearly independent equations each of
data sets in the same dimension without any loss of in- which results from equating a different pair of the following
formation is parallel coordinates [lo, 12, 131 a technique fractions [12]:
which was recently proposed to find trends in data. Par-
allel coordinates allows one to visualize n dimensional data 51 - a1 - 52 -a2
- -- - ... -
- -X. n - an
~
(1)
points in two dimensions. Essentially, parallel coordinates U1 U2 Un
transforms multi-dimensional patterns into two-dimensional
Now it may be assumed that the n - 1 linearly indepen-
patterns without loss of information. Visualization is facili-
dent equations are obtained from pairing the n - 1 adjacent
tated by viewing the two-dimensional representation of the
fractions, with no loss in generality. This yields
n-dimensional data points as lines crossing n parallel axes,
each of which represents one dimension of the original fea-
ture space. This approach scales well with increasing n and
has already been incorporated in some data analysis tools. where mi = ui+l/u; represents the slope and b, = (ai+l -
Fuzzy models have not received a lot of attention with mia,) the intercept of the zi+l-axis of the projected line on
respect to visualization even though powerful algorithms to the sizi+l-plane. The dual point of the n-dimensional line
construct such models from data exist. In this paper we in parallel coordinates therefore corresponds the the set of
therefore explore whether and how parallel coordinates can n-1 indexed points
be used to visualize fuzzy points in n-dimensional space. In
each dimension a fuzzy point has some fuzzy extent which
in the narrowest case (that is, a singleton) would be a sin-
gle value or point. Since fuzzy rules can also be considered
as a fuzzy point in n dimensional space, this visualization There are other nice results about the parallel coordinate
is extensible to such rules as well. Another highly related representation [13,9,6] that are not germane to this paper.
example of fuzzy points is a fuzzy partition of a data set.
The fuzzy partition can be represented by fuzzy clusters,
which in turn can be viewed as centroids with a correspond- 3 f i z z y data in parallel coordinates
ing fuzzy neighborhood which again is representable in the
framework of fuzzy points. Of course, fuzzy clusters have Our initial example showed several non-fuzzy points on a
also been converted to fuzzy rules [5, 7, 81. line. There are several ways that a fuzzy point might be
represented. In Figure 2a we show that a fuzzy point can
be represented as a single point which would be the core
2 Parallel Coordinates or centroid of the fuzzy rule, cluster, or point. Here it rep-
resents the number of training examples it covers (thicker
Parallel coordinates [lo, 121 allows one to visualize n dimen- lines for more examples). A more intuitive, but perhaps
sional data in 2-D. Essentially, parallel coordinates trans- more difficult to decipher, approach is to model the fuzzy
forms multi-dimensional problems into 2-D patterns without point as a region in each dimension Figure 2b. When there
loss of information. Visualization is facilitated by viewing are many fuzzy points each color or grey-level (as we use
the 2-D representation of the n dimensional data.
If one takes each of n coordinate axes and lines them up
in parallel, one has the basis for parallel coordinates. The
distance between each adjacent axis is assumed to be equal
to 1. A point in n dimensional space becomes a series of
n - 1 connected lines in parallel coordinates which intersect
each axis at the appropriate value for that dimension. A
parallel coordinates example of 3 points in 3-D, a' = (1,3, l ) ,
= (4,0,2), and c'= (2.5,1.5,1.5), from a line is shown in
Figure 1. Figure 1 . A parallel Coordinate depiction of 3 pojnts
The dual of an n-dimensional line in Cartesian coordi- on a line with a' = (1.0,3.0,1.0) (dark line), b =
nates is a set of n-1 points in parallel coordinates [ll,61,for (4.0,0.0,2.0) (gray), c' = (2.5,1.5,1.5) (light gray).
The two intersection points at i1,2 = (0.5,2) and i2,3 =
'In contrast to a model in high dimensions which has a simple
low-dimensional structure and can hence be transformed into a low (0.75,1.5) uniquely describe the line going through all
dimensional representation without a substantial loss of information. three points an the original 3 - 0 space.
Such models, unfortunately, are found rarely for real-world problems.
75
Authorized licensed use limited to: CENTRO FED DE EDUCACAO TECNOLOGICA DE MINAS GERAIS. Downloaded on May 24,2023 at 20:42:32 UTC from IEEE Xplore. Restrictions apply.
Figure 3. A n example of a fuzzy rule represented as
its core support region and in lighter gray its partial
support region.
76
Authorized licensed use limited to: CENTRO FED DE EDUCACAO TECNOLOGICA DE MINAS GERAIS. Downloaded on May 24,2023 at 20:42:32 UTC from IEEE Xplore. Restrictions apply.
4 Fuzzy Data analysis
77
Authorized licensed use limited to: CENTRO FED DE EDUCACAO TECNOLOGICA DE MINAS GERAIS. Downloaded on May 24,2023 at 20:42:32 UTC from IEEE Xplore. Restrictions apply.
fuzzy sets per axis. Not all axes need to be included in ev-
ery rule. Hence, it seems a natural approach to the visual
development of fuzzy sets. In a way analogous to our cur-
rent accuracy percentage for the training set, after a rule
is visually created an accuracy measure can be displayed
for it when it is applied to classification data. Then the
fuzzy sets may be modified to achieve the desired accuracy
on the training or validation set. This approach seems to
hold promise for developing fuzzy classification rules and
interactive data exploration.
Acknowledgements
This research was partially done at BISC while L. Hall was
on sabbatical. Thanks to UC Berkeley’s Div. of CS and
Prof. Zadeh for the use of their facilities. M. Berthold was
supported by the Deutsche Forschungsgemeinschaft through
grant DFG Be1740/7-1.
References
(b) [l] M. Berthold and D. H. (eds.). Intelligent Data Analysis, An
Introduction. Springer-Verlag, 1999.
Figure 5. The parallel coordinate view after using the [2] M. R. Berthold and L. 0. Hall. Visualizing fuzzy points in
slider on petal-length to isolate only the f i z z y points parallel coordinates. Technical Report UCB-CSD-99-1082,
describing the Virginica class (a) and after using the University of California at Berkeley, 1999.
[3] M. R. Berthold and K.-P. Huber. Constructing fuzzy graphs
slider on petal-length to attempt to isolate only the from examples. Intelligent Data Analysis, 3(1):37-54, 1999.
fuzzy points describing the Setosa class (b). [4] C. Blake and C. Merz. UCI repository of machine learning
databases, 1998.
[5] S. Chiu. Fuzzy model identification based on cluster esti-
of fuzzy clusters (by centroids). The visualization was used mation. Journal of Intelligent €4 Fuzzy Systems, 2(3), 1994.
to separate red tide phytoplankton blooms from other types [6] S.-Y. Chou, S.-W. Lin, and C.-S. Yeh. Cluster identifica-
tion with parallel coordinates. Pattern Recognition Letters,
of blooms and water. We were able to recreate the results of
20:565-572, 1999.
a painstaking search through 2-D projections that produced [7] M. Delgado, G.-S. A.F., and M. F. A fuzzy clustering-based
a similar rule for isolating red tide. The re-creation required rapid prototyping for fuzzy rule-based modeling. IEEE
less than 5 minutes [2]. fiansactions on f i z z y Systems, 5(2):223-233, May 1997.
We have applied the concept of viewing fuzzy points in [8] M. Fkiedman and A. Kandel. Introduction to Pattern Recog-
parallel coordinates only to fuzzy points which are rules or nition : Statistical, Structural, Neural and f i z z y Logic Ap-
centroids of potential rules that will be used to place an proaches. World Scientific, 1998.
[9] C. Gennings, K. Dawson, W. Carter, and R. Myers. Inter-
object in a class. The technique may also be useful for
preting plots of a multidimensional dose-responsesurface in
fuzzy control rules in which the consequent of a rule is a a parallel coordinate system. Biometrics, 46:719-735, 1990.
fuzzy set or fuzzy function. The visualization process may [lo] A. Inselberg. The plane with parallel coordinates. Visual
indicate where coverage is not fine enough (light overlap) Computer, 1:69-91, 1985.
and appears likely to make it easy to view uncovered regions. [ll] A. Inselberg. Multidimensional detective. In IEEE Confer-
To apply the idea to fuzzy control rules, an indication of ence on Visualization?, 1997.
goodness of control would need to be displayed after every [12] A. Inselberg and B. Dimsdale. Multidimensional lines i: rep-
change to the fuzzy points/rules. resentation. SIAM J. Applied Math, 54(2):559-577, 1994.
[13] A. Inselberg and B. Dimsdale. Multidimensional lines
One intriguing possibility with parallel coordinates is to ii: proximity and applications. SIAM J. Applied Math,
add features that are, for example, projections onto non-axis 54(2):578-596, 1994.
parallel lines or more complicated features to allow for the [14] N. Q. Jr. and M. Breuer. A force directed component place-
data to be separated. The extra features can still easily be ment procedure for printed circuit boards. IEEE fiansac-
visualized, may separate the data much better even though tions on Circuits and Systems, 26(6):377-388, 1979.
they may be redundant. [15] T. Kohonen. Self-organizing Maps. Springer-Verlag,Berlin,
The tool we have built could be further extended to al- Heidelberg, 1995.
[16] J. Meulman. A distance approach to nonlinear multivariate
low fuzzy sets to be defined on each axis. Then a fuzzy analysis. DSWO Press, Leiden, The Netherlands, 1986.
rule/point could be created by linking a choice of 1 or more
Authorized licensed use limited to: CENTRO FED DE EDUCACAO TECNOLOGICA DE MINAS GERAIS. Downloaded on May 24,2023 at 20:42:32 UTC from IEEE Xplore. Restrictions apply.