About Shape: Jan Koenderink
About Shape: Jan Koenderink
Jan Koenderink
D E C LOOTCRANS P RESS
About Shape
Jan Koenderink
De Clootcrans Press
Utrecht The Netherlands
[email protected]
Jan Koenderink
pax / jan koenderink University of Leuven (KU Leuven)
Laboratory of Experimental Psychology
Tiensestraat 102 – bus 3711
3000 Leuven
First edition, 2012 Belgium
[email protected]
10 9 8 7 6 5 4 3 2 1
i
infinite in itself is formless. We humans endowed with senses and intellect
WHAT IS SHAPE? demand tangible forms. Hence a triangle. The triangle is the beginning of
all forms. Out of it first comes the square. A square is the triangle doubled.
This doubling process goes on infinitely and we have the multitudinosity
This is a valid question. Most people feel they know the answer,
of things, which the Chinese philosopher calls “the ten thousand things”,
although they find it hard to express it. This perhaps indicates that
that is, the universe.
a formal understanding is lacking. Not that it matters much, ex-
cept when you intend to do research on “shape perception”. Then
you’d better know what you’re talking about, at least more than the
participants in your experiments.
For formal definitions you consult a dictionary. Trying a few different ones
is best. You find the usual circular definitions, that is how dictionaries work,
they explain words by words. Some give also some “common uses” though,
and these are of some interest. Here are a few:
The particular physical form or appearance of something:
— clay can be moulded into almost any SHAPE
— these bricks are all different SHAPES
— our table is oval in SHAPE
— my bicycle wheel is bent out of SHAPE
1
S HAPE is an invariant under the group action of a group of “congruences” of different items.
or “movements”. We evidently need more useful distance functions. It is evident that this
is hard. How different is a horse from a goat? No idea? Is the distance
However, mathematicians can get really excited over definitions like this, so from a horse to an octopus larger than the distance from a horse to a goat? I
we should perhaps look a bit further. What does it imply? Well, would say yes! Apparently there “is an answer”, at least in some sense, but
— a configuration space (think of “3 points in the plane”); the problem as stated appears perhaps too hard.4 It seems better to start with
— a group of “movements” (think of “shifts, rotations, scalings”); something simpler. As said, I will look at the “shape of triangles’.
— an equivalence relation “congruency” that partitions the space of configu- You may protest that the shape of triangles is not particularly interesting.
rations into distinct SHAPES; Why consider that at all? Well, you may underestimate the power of the tri-
— SHAPES may be parameterized in a “SHAPE space”. angle. Understand the triangle and you have conquered the (Eucldean) plane,
What you need a shape space for? Well to name shapes (through coordinates for instance. Here are just a few simple examples:
of some kinds), to measure differences between shapes and so forth.
The definition is from the late 19th c. due to people like Felix Klein, who
should count among your heroes. Although a first rate mathematician (his
students were afraid of him because he outsmarted them), he had a sharp
feeling for and appreciation of intuition. The mathematical models you see
displayed in the older math departments are due to him. Klein stated that
the models were not illustrations of the math, but that the math was about
the models. For him SHAPES where qualia in the sense of visual and haptic
phenomenology.
Klein had no problem to conceive of the distance (in shape space!) between
two triangles. Can you? Well, you can cheat! For instance, suppose I define
the distance between two triangles as either one (when they are congruent,
that is to say, the same triangles in different spatial locations and attitudes) or
zero (when their symmetric difference (or disjunctive union) as planar sets is
non-empty in any mutual placement).2 Why do I say this is a cheat? Because
whereas this distance is indeed a good distance function,3 it is not going to The tiger and the lamb are very different animals. For instance, predators
help you much. It simply labels two things as same or different. You cannot have frontally placed eyes and fixate their prey, whereas their prey animals
use it to find a pair of different items that is twice as different as another pair have laterally placed eyes and hardly need any fixations to see what they’re
in.
2
Notice the technical terms “distance”, “set”, “symmetric difference”, “disjunctive
union”, “planar set” and so forth. If you are not familiar with such a term look it up in
Google. In most cases the first hit will do it, but you may have to scout a bit or pursue various Although the tiger and the lamb have very distinct faces, both have eyes,
trails. As a bonus you’re likely to pick up some useful information on the side. Don’t be a nose and so forth. In a formal experiment participants readily provide you
ashamed, many people (including me) do the same.
3 4
Look it up! Cladistics yields one possible answer.
2
with a dozen or so corresponding locations in the two portraits. ing triangulation. The corresponding points automatically yield correspond-
ing triangulations. Now — using the triangulation — the correspondences set
by the participants in the experiment automatically let you relate the full areas
with each other.
Here you have plenty of interesting possibilities, for instance, you can both
interpolate between and extrapolate either way from the triangulations, al-
lowing you to plot sheepish lambs or unnaturally fierce super-tigers. Using
triangles, you really conquered the image plane!
Here is another example of triangle use: Pick three colors and consider a
The tiger in the lamb-geometry and lamb in the tiger-geometry. texture filled with all colors obtainable by interpolation. Each such texture is
equivalent to a triangle.0 If I have a distance for triangles I can immediately
This is interesting, because you can make a few points go a long way by us- use it to define distances between such chromatic textures.
3
It would be easy enough (and fun!) to multiply the triangle examples, but
I’ll leave it at this. By now, you understand that triangles are hardly boring.
It would be appropriate to generate and show a few triangles here. But
consider how to make them. Suppose you need to program a “random triangle
generator” for your experiments, what would you do? Could you be formally
sure all triangles would be “equally likely”?5 Here is an idea: I generate triples
of normally distributed Cartesian coordinates with the same variance. The
convex hull of each triple is a “random triangle” for sure!
Indeed, this looks promising. But is it a unique method? No! For instance,
I could draw triples from a uniform distribution in a square (an area), or from
a circle (a curve!). Are at least the results the same? No! So much is evident Random triangles generated as the convex hulls of triples of points drawn
from a cursory look. In order to make progress we need formal methods. from a uniform distribution on a circle.
The notion we need at this point is that of the “Procrustes” method. Pro-
5
Here you should ask “in what sense?”, for they trivially are in the sense of your algo- crustes was a rather unpleasant person who made sure his guests perfectly
rithm, whatever the algorithm might be like. fitted the bed he generously offered them by either chopping off the surplus
4
or stretching them forcefully to the right size. The Procrustes method does method.7 The best scaling can be easily found after factoring out the trans-
something similar, hence its name. lation by equating the r.m.s.distance of all points from the center of inertia
for the two configurations. After scaling, finding the optimal rotation is per-
haps best done using the singular values decomposition method.8 According
to the methods used, finding the best superposition may involve some itera-
tions. The square root of smallest measure of fit then is unique for the pair of
point configurations. It is their mutual “Procrustes distance”.
5
Triangles are the simplest objects to which the Procrustes method applies representation would reflect the Procrustes method. Kendal figured out how
in a non-trivial sense. to do this, here is how:
There is one additional degree of freedom to consider, namely that of reflec-
tion. Two triangles are often considered congruent when they can be brought
������ ���������
into full superposition after one of them has been reflected. It is a mere matter ��������
I show the analysis of David George Kendall (who should be another one
of your heroes), who figured out the structure of the triangle shape space in
the mid nineteeneighties. The figure shows what happens graphically, but —
of course — Kendall has an algebraic parallel storyline.
6
be able to find such a relation. One needs to grok both the algebra and the ized by two numbers. In practice one uses the three angles and plots this in
geometry, moreover, have a certain empathic feeling for them. That’s why I “barycentric” coordinates. Is this equivalent to the Kendall method? This is
said Kendall should count as one of your heroes. For the rest of us Kendall’s easily checked by plotting triangles drawn from a uniform Kendall distribu-
spherical blackboard is a wonderful tool. If you ever do experiments involv- tion in the barycentric coordinates. The result is clear: These representations
ing triangles (remember that you may actually do that without realizing the are different! So which one is to be preferred? That depends upon your appli-
fact!), you certainly should use the spherical blackboard. cation.
7
By identifying pairs of edges as in-
dicated I construct a closed surface. When triangles ABC, ACB, BAC,
It has the topology of the sphere. BCA, CAB, and CBA are con-
This has the advantage that there is sidered “the same”, each triangle
no boundary, thus a deforming tri- is represented by six point in the
angle will never run into an obstacle barycentric representation.
along its path. Each triangle is rep-
resented fourfold.
The Kendall analysis is specific for the Euclidean plane. What if you leave
this trusty territory? Well, when you change the geometry you change the
C A B C A B C A
game completely! This is very important, so I’ll give some examples. One of
C A B C A B C A B C
the simplest examples is to consider the shape of triangles in the affine plane.
C A B C A B C A
In the affine plane two configurations are reckoned “congruent”when they are
C A B C A B C A B C
equivalent under general linear transformations.
C A B C A B C A
C A B C A B C A B C
C A B C A B C A
8
The figure shows the effect of an affine transformation. “Affine transforma-
tions” preserve lines and parallelity. They are very common in applications, Quadrangles
perhaps even more so than Euclidean movements. The interesting observation
is that there exists a unique affine map for any pair of triangles that will map
the one to the other. Thus, doing affine Procrustes, any pair of triangles has It is simple enough to generate random quadrangles, although most people
(affine) Procrustes distance zero. All triangles are affinely identical. There is tend to be surprised in seeing their initial results.
only one affine triangle.
Although there is nothing wrong with quadrangles that are not convex or
even have intersecting sides, this is not what our common sense expects.
Quadrangles are perhaps best understood as convex hulls of four points. This
tends to make sense to most people.
Although there is only one affine triangle, there are infinitely many affine
quadrangles. What is their shape space like? One simple way to obtain an
idea is to draw a diagonal, yielding two abutting triangles, and affinely force
one triangle to be equilateral. Then the other triangle uniquely parameterizes
the quadrilateral. This implicates that the affine quadrangle space is just the
Euclidean triangle space.
9
at the aspect ratio. It makes “the shape come out right” and users are happy.
Here a number of random quadrilaterals have been transformed such that one Any two quadrilaterals are projectively equivalent.
triangle above a diagonal is equilateral. Notice that the quadrilaterals still differ
by the other triangle. Thus two quadrilaterals are typically affinely different, their
difference being the difference of two Euclidean triangles.
10
Application of affine shape
I could only show the tip of the iceberg here. The point to remember is that This is a Greek stone sculpture that my friend John Willats (died some years
selecting the right geometry for your research is a crucial issue. Euclidean ago, great artist) measured up at the British Museum before the time of 3D
geometry is not the holy grail. Physicists have come to understand that, but scanners. I use it as a test object here. It is convenient because most people
the other sciences run behind. Don’t forget that a change of geometry is a know what faces look like.
dramatic, qualitative change. Using an inappropriate geometry is likely to
require endless fussy patchwork that would be cleared up in one fell swoop
when the right choice were made. Don’t underestimate this!
10
In case you remember the Flower Power period, it is more important than your Third Eye.
Hippies who tried to actually open that generally came to a bad end.
11
This is the “affine flow”. I get it by mapping the triples of indicated points to
equal equilateral triangles in the two views, also centering the results so as to
annull the average displacement. This is the affine Procrustean trick. Notice
Here are two views of the face from different directions and distances. Can that all affine flow vectors are mutually parallel, only their magnitudes count.
you get the true 3D shape of the face from the flow? Hardly. The magnitudes are proportional to their deviation from the plane of the tri-
angle. This allows me to draw the profile at right. Not a bad reconstruction
of the original! All that is lacking is a depth calibration.
12
This makes for a very nice example of the use of triangles in surprising
settings. In this case the problem was solved trivially by moving from a Eu- Shape in the Visual Arts
clidean to an affine perspective. The solution depended on the fact that three
points define a unique plane in three-dimensional Euclidean space, whereas
Who knows best what SHAPE is? Perhaps we should ask visual
projections (of course, planar) triangles in that space yield triangles in the
artists first. For them SHAPE is not a formal definition, it is their
two-dimensional Euclidean plane of projection. Since any two triangles are
meal ticket. Once you start to look at the shape-related methods
affinely equivalent, such triangles can be mapped upon each other. However,
of painters and sculptors you will soon feel to be ready to give up!
any point not coplanar with the triangle in three-dimensional Euclidean space
There are really too many aspects of the matter than that you might
cannot join in the latter transformation. Thus its deviation from the plane
hope to formalize. Our formal methods will never be able to reveal
becomes manifest after the affine map.
all but a few tips of the iceberg and even that with considerable
In dealing with images triangles are natural because they cover part of the
loss. Formalizing intuitive insights necessarily involves hardcore
image plane. A triangulation tiles the image plane with triangular tiles. This
Procrustean methods. I just touch on a few topics, then zoom in on
is very useful because each tile can be taken as small as desired, with the
a useful, fairly general, formalism in the next chapter.
implication that non-linear deformations can be conveniently linearized. This
makes it possible to do complicated transformations by local texture map-
Consider the process of drawing solid shapes for a start. One of its roots is
ping, much cheaper than pixel-based methods. Similar methods apply to
the facility to doodle. Visual awareness turns doodles into solid shapes.
higher dimensions. For instance, in three-dimensional space you would “tile”
with tetrahedra, and so forth. The space need not be “just space”, but could
be parameter spaces of arbitrary kind.13 Such methods generate a need for
“good tilings”, which generally means tilings having triangles that are close
to equilateral. Thus the topic of “the shape of triangles” is bound to pop up in
many—often surprising—contexts.
13
For instance, there may be occasions where it would be advantageous to triangulate Becoming an expert doodler involves endless finger exercises like these.
Kendall’s spherical blackboard itself.
13
There are many ways to doodle, the illustration shows just three common than doodling. It is very effective though. The example I show is really a
styles. Each work great. It lets you conjure up about any solid shape you want. rectangular box (notice the shading!) with a minimum of muscular dressing
By combination you easily build more complicated structures. That’s really up. When fully dressed up the artist proudly displays an expert understanding
all there is to most effective drawing. Facility in doodling involves endless of the male human form.
finger exercises, a bit like keyboard training. Most artists enjoy it. You can
bend, twist, bulge and taper such shapes even easier than when you do it in
clay. They magically become 3D in visual awareness.
It hardly matters which style you adopt. Most artists stick with a first pref-
erence. It does not matter, because it is easy enough to turn such doodles into
anything you want. That is called “developing the drawing”, which is another Here is the “cuirasse esthétique” fully dressed up
facility one needs to acquire. It is of a different kind, you need to dress up the with muscular detail. This drawing is by William
bare doodles with local surface relief. Rimmer a mid-nineteenth century family doctor in
Massachusetts.
The “dressing up” can be applied to many other starting methods, all you
need is a good start. The doodles are great for that, because so general. But
many people prefer less general but more specific methods.14 A well known
example is the cuirasse esthétique. The traditional artist knows it by heart.
It allows you to draw a male torso with your eyes closed, it is much less fun The method of polyhedral approximation. At bottom “ovoid drawing” used in
dressing up.
14
Hence the endles guides on “How to draw cats” and so forth.
14
Many artists abhor of smooth surfaces because they don’t know what to
draw within an outline and if you draw only outline you end up with a flat
silhouette. They prefer polyhedral approximations and root-learn “the planes
of the head” and so forth. This surely works, especially if your dressing up
removes or hides your scaffold. The sharp edges will be replaced with soft
shading, or “ovoid drawing” is used to add a sprinkle of muscular detail.
The Egyptian sculpture strikes you as volumetric, whereas the Greek kouros
is frontally conceived and almost treated like a relief.
So far the methods were based on volumetric intuitions, with the cuirasse
esthéstique perhaps being more like a template that can also be used in an
essentially “flat” manner (although that will show). But, of course, it is just One way to put some order on the chaos is to classify artistic shape con-
as well possible to start with axes, although these are usually “felt” as moving ceptions as primarily linear, planar, or volumetric. The Egyptian sculpture
in space, although drawn on paper. The gestural drawing by Gretchen Kelly first strikes you as a rounded cube, it is clearly conceived as a volumetric
is a nice example. It perfectly captures the pose of the model in a few brush block. The Greek kouros is evidently frontally conceived, it might as well
strokes. Such a method can also be dressed up, although the result is likely to have been a relief. It looks like an articulated plane. The gestural drawing by
differ from that obtained by other methods. Usually you can see (or feel) how Gretchen Kelly is evidently linearly conceived. This can equally well be done
a drawn or painted shape originated. in sculpture, there are numerous examples in contemporary art.
A somewhat degenerated form of gestural drawing is based on the idea For each of these general families of shapes one might develop formalized
of “stick figures”. This usually leads to drawing that indeed look like stick treatments. I had to make a choice for this course because of time limits. I
figures, which seriously offends the eye, but in the right hands it is certainly a decided on articulated planes. It is an apt choice because it leads to a formal-
valid starting method. It is often preferred by amateur artists. ism of relative simplicity, but an enormous range of applications. It is a bit
15
like the case of triangles, you might be surprised at the range of — perhaps
unexpected — applications. The Shape of Surfaces
In order to know what surfaces are you need to know what space is. So
what is space? But what of a question is that? So I will dodge the question
and ask “do you know what the surface of the earth is (I mean shape-wise)?”
Answers are likely to range from geomorphological descriptions to spherical,
oblate ellipsoid and so forth. It apparently depends on the application. There
are hills and dales in the landscape, grains of sand and leaves of grass near to
you, whereas the globe looks an almost perfect sphere as seen from the moon.
Scale is the defining parameter. When I talk of surfaces in this chapter I mean
objects like the sphere or perhaps the hills and dales.
Things get a little easier when I regard a “local” environment. This is tricky,
because it introduces a second scale parameter. Here is an example:
One used to think that the earth was flat like a plane in Euclidian space. Yet
we know that hills and dales are seen from many places where we stand and
that the earth looks like a sphere as seen from the moon. We might consider a
scope large enough to consider the hills and dales as mere local articulations,
yet a scope that is too limited to see the sphericity.
Such decisions are made by our visual awareness on the fly and according
to present situational awareness. If we say someone has a perfect, smooth skin
we do neither imply wrinkles, nor that the skin is extended like a plane. We
accept both bulges and pores as not violating the notion of a “smooth skin”,
albeit for different reasons.
Suppose we accept the flat earth notion for a moment. It probably fits
our perceptions here and now fine anyway. Suppose we describe the shapes
of hills and dales. The first thing to notice is that the height dimension is
qualitatively different from the dimensions of the ground plane. It is measured
16
differently (counting steps works only in the plane) and often expressed in sorts. Some planes are flat, others are hollowed out, and others are swollen
different physical units. For instance, before GPS or radar it was common outward and are spherical. To these a fourth may be added which is com-
to measure height in terms of barometric pressure. Thus the “space” we find posed of any two of the above. The flat plane is that which a straight ruler
ourselves in is not 3D, but rather (2 + 1)D. So when I describe the formal will touch in every part if drawn over it. The SURFACE OF THE WATER
theory of articulated planes, I will be using this (2 + 1)D space. I refer to it as is very similar to this. The spherical plane is similar to the EXTERIOR OF
“relief space”. A SPHERE . We say the sphere is a round body, continuous in every part;
Relief space is very important in visual perception. “Visual space” is an any part on the extremity of that body is equidistant from its centre. The
articulated plane, namely the “visual field” (2D), articulated by “depth” (1D). hollowed plane is within and under the outermost extremities of the spher-
A painter paints “pictorial spaces” where the picture plane roughly accounts ical plane as in the INTERIOR OF AN EGG SHELL. The compound plane
for the 2D (visual field say) and the 1D is the implicit “pictorial depth”. is in one part flat and in another hollowed or spherical like those on the
More generally, any scalar field is such an “articulated plane”. Examples INTERIOR OF REEDS or on the EXTERIOR OF COLUMNS .
are meteorological maps of temperature or barymetric pressure. But scalar
fields over two-fold extended continua abound in the sciences.15 Thus this
type of formal shape has numerous applications. Once up to it, you will
discover novel applications by the day. Their importance almost beats tri-
angles.16
An important final issue is that of smoothness. I will assume that all sur-
faces of interest are smooth on all scales of interest. It is just that I don’t have
the time to go into fractal structures. But smooth structures are nice and im-
portant in their own right. After all, the earth does look like a perfect sphere
as seen from the moon, despite the fact that your easily trip if you don’t watch
your feet.
Alberti was the first person to attempt a complete zoo of surface types. This
is a quote from his famous book on painting dating from the early Renaissance
(emphasis added by me): Alberti, a major intellectual force during the early Italian Renaissance.
17
The other one is the catalogue of shapes. Alberti has these categories, evi- a full categorization of local surface shapes and much more. He founded the
dently based on the notion of surface curvature: field of “differential geometry”, which is local, smooth geometry, in one fell
blow.
— surface of water
— exterior of a sphere
— interior of reeds
— exterior of columns
Notice that both “hollowed out” and “swollen outward” admit of degrees.
They might be measured on non-negative “more–or–less” scales. The flatness
is different though, it does in no way admit of degrees. Intuitively, it is on
the edge between convex and concave, which implies that it is not a bona fide
category. This extends to the“interior of reeds” and the “exterior of columns”.
These are evidently borderline cases, but borderlines of what?
18
As you move in a certain direction the surface tilts in various ways. This is which there is no torsion. In these directions the surface is purely curved.
described as a “torsion” and a “curvature”. They are known as the “principal directions of curvature” of the surface.
Of course, they differ from point to point.
Since curvature has a “sense”, there are two kinds of surfaces, those where
the principal curvatures have the same sense and those where they are of op-
posite senses. Thus, in the Gauss categorization, there are exactly two local
surface types, he called them “hyperbolical and “elliptical” (I’ll explain why
below). The elliptical category may be split into two subcategories “convex”
and “concave” if you care to distinguish between up and down (both hill tops
and pit bottoms are elliptic).
“Flat” is not a shape at all, it is a singular condition. This makes sense when
you think of how hard it is to make a truly flat surface.17 Formally, they occur
with probability zero.
Some examples of surface strips. Likewise, cylindrical forms are the transitions between elliptic and hyper-
bolic (Gauss called them “parabolic”). They occur with probability zero,
but—because there are so many points in a surface (severe understatement!)—
they occur as points on certain curves, the boundaries between elliptic and
hyperbolic regions.
Why the funny names elliptic, parabolic and hyperbolic? There are various
ways to explain this. Gauss used algebraic language that derived from the
structure of a family of planar curves called “conic sections” by the Greeks
that might be said to generalize the circle. Examples are ellipses, hyperbolas
and parabolas. However, you probably prefer a more geometric insight. It is
easily obtained starting from the classical notion of conic sections.
With “cone” one implies a “right circular cone” (rotationally symmetric),
with an infinite axis. That is to say, it looks like two “common” cones (the
ones that end in a sharp vertex) in tandem. If you cut it with a plane you
either obtain a closed curve (an ellipse), or an open curve of two branches (a
hyperbola). In a singular case—dividing the ellipses from the hyperbolas—
you obtain an open, single branched curve (a parabola). These curves are the
outlines of the “wounds” inflicted by the cutting with a planar blade. The
Strips of principal curvature on an elliptic (top) and a hyperbolic (bottom) sur-
curves display the shape of the cone to who has the eye to appreciate such
face.
things.
17
Gauss discovered that there are two, mutually perpendicular, directions in I used to pay top dollars for flat mirrors when I ran a group at a physics laboratory.
19
Charles Dupin had the bright idea to extend this technique of revealing where I live, certainly looks flat to me), except when I interact with people a
shape by cuts to general local surfaces. At a given point he cuts the surface long distance away, or have to travel there. The only way to see that the earth
with a plane parallel to the tangent plane at that point. The notion of “tan- is not flat are NASA photographs taken from outer space.
gent plane” may not be familiar, so I’ll explain it first. I’ll get back to Dupin A formal definition of tangent plane at some fiducial point is as follows:
soon enough. Take any three distinct points in the neighborhood of the fiducial point and
construct the unique plane that contains them. Now take the points closer and
closer to the fiducial point. When very close indeed, taking them any closer
makes no difference any more. The limiting plane—all three points at the
fiducial point!—is the tangent plane. Because I assume everything is smooth,
such a plane certainly exists.
A physical definition is more fun, but in practice works only for convex
elllpitical points. You may use it to construct tangent planes at points on a
boiled egg. Simply let the egg touch a (flat) table top. It will touch at one
point. The table top is the tangent plane at that point. Got it?
This is “the earth” in three distinct views. Both scope and scale differ, al-
though the scale–scope ratio is roughly constant. At left is a view of narrow
scope and high resolution, here we meet with almost infinite complexity. At
center is a far view, at least as judged from the human condition, the horizon
distance depends simply on your eye-height. In your awareness the earth is
flat, except for some ripples on the water. It takes reflective thought to in-
fer that the horizon might be due to otherwise invisible curvature. At right
we (or rather, a NASA robot) see the earth from a distance. In your visual
awareness it appears as a smooth sphere, painted with pretty texture. It is of
some interest to estimate how many orders of magnitude separate such views
(try!). In fields like the neurosciences you are often confronted with pictures
that are easily as far apart (on a log-scale, of course). This should give rise to
some thought. All these views depict realities, but these are hardly the same.
Your actuality here and now can hardly be at all three simultaneously. The
regularities of Nature are also very different in these three worlds.
Remember that the earth was once supposed to be flat. The “flatness” was
a tangent plane at the globe. The flat earth people believed that they were all This Dupin caricature of Dupin is by Daumier. Daumier drew many important
on the same plane, but we now understand that every geographical location persons of his time.
implies a different plane. In practice I’m a flat earth person (the Netherlands,
20
Dupin considered cuts very close to the tangent plane. He understood that maticians conventionally use a plus sign for indicatrix points on the upper cut
when you cut close enough you always obtain a conic section. He actually and negative for points on the lower cut.
advocated a pair of cuts, one below the surface and one above. That way you It is easy enough to define a measure of “curvature”. In (2+1)D-space one
get both branches of the hyperbolas and you differentiate between convexi- uses the value of the second order directional derivative.19 It has the advan-
ties and concavities. The figure you get is called “Dupin’s Indicatrix” at the tage that the sign is automatically supplied.20 Since the Dupin’s indicatrices
fiducial point. It is a very intuitive, because geometrical, way to arrive at an are conic sections, the curvature reaches two extremal values if you follow it
understanding of local surface shape. over all directions. These are known as the “principal directions” and the cur-
vatures as the “principal curvatures”. Thus you deal with a pair of curvatures,
say {κ2 , κ2 }, where I’ll let κ1 > κ2 .
Here the confusion starts! So what is THE curvature of the surface? The so
called intrinsic curvature is of major interest in mathematics. Gauss defined
it as K = κ1 κ2 . It is special because it can be found by measurements that
are purely within the surface (hence “intrinsic”). It is also most confusing to
common sense people because cylindrical and conical surfaces have zero cur-
vature, whereas any fool can see they are not flat. However, Gauss’ argument
is that they can be “developed into the plane” as they certainly can. Indeed,
cylindrical surfaces are often produced by bending flat plates. Moreover, the
Some instances of Dupin’s indicatrix. In (a) the cut is below the tangent plane
intrinsic curvature fails to distinguish between “hollowed out” and “swollen
and it yields an ellipse. In (b) the cut is also below the tangent plane and it
outwards” surfaces, which really goes against the grain of common sense—
yields a degenerate parabola (looks like two parallel lines). In (c) one cut is
Alberti would cringe!
above the tangent plane and it yields one branch of a hyperbola, whereas the
Another common curvature measure is the so called “mean curvature”, not
other cut below the tangent plane yields the other branch.
unexpectedly defined as H = (κ1 + κ2 )/2. This neatly differentiates the
Albertian categories. However, the mean curvature of a symmetric saddle is
In order to put some structure on it one assigns a sign to the surface cur- zero, which again conflicts with common sense.
vatures. Of course, it is not obvious how to do this! I’m sure this was Al- Only in 1890 Felice Casorati, an Italian mathematician, suggested
berti’s problem: he simply failed to see that a surface may be “hollowed out” a surface
p curvature expressly designed to fit common sense, namely
and “bulged outwards” at the same time! The point is that a surface is usu- C = (κ2 + κ22 )/2. The Casorati curvature is indeed only zero for planes.
2
ally curved differently in different directions. This is easily seen in Dupin’s This is indeed what our intuition understands as “surface curvature”. How-
indicatrix, because the distance of a point on the indicatrix is inversely pro- ever, it fails to distinguish between the categories of shape. Thus one needs
portional to the square root of the curvature.18 It evidently varies among the and additional measure for the “quality” of curvature. This is evidently what
curve. Moreover, it makes sense to distinguish between “above the tangent Alberti intended with “qualities which rest like a skin over all the surface of
plane” and “below the tangent plane” branches by assigning a sign. Mathe-
19
In the cortical implementation in V1 this is the activity of a Hubel and Wiesel “line
18 2
Puzzled? Try it in 1D: The curve κxp/2 has curvature
√ κ, the tangent line is the x-axis. detector”.
The cut at separation d is at distance a = 2d/κ ∝ 1/ κ from the origin, for κa2 /2 = d. 20
Yes, the mathematicians have it right, same convention.
21
the plane”. The “quality” is essentially the ratio of principal curvatures. It This is what we do:
turns out that S = arctan ((κ2 + κ1 )/(κ2 − κ1 )) is a more natural parameter — we rotate in the plane, aligning the axes of maximum principal curvature,
in shape space. This “shape index” takes values in (−π/2, +π/2). — we scale the height, setting both Casorati curvatures to one,
then the remaining r.m.s. measure of fit—that is the Procrustes distance—
equals the difference of shape indices. This is a measure of shape quality, for
instance all convex cylinders22 (“outsides of columns”) have the same shape,
they differ only in size (Casorati curvature) and spatial attitude (orientation).
This is Felice Casorati. He had the guts This is a result that perfectly fits what I understand as “common sense”.
(not easy for a mathematician) to pub-
lish “ Mesure de la courbore des surfaces
suivant l’idée commune” in a mathemat-
ical journal in 1890. His confrères killed
him over that. They couldn’t stand the
idée commune, which is considered un-
scientific, because subjective. Casorati
is better known for his work in complex
analysis.
The Casorati curvature and shape index together define the habitus. The
other freedom is the orientation, for which one may use the angle between the
axis of maximal principal curvature and some arbitrary reference in the plane.
The orientation O has period π, I take it to take values in [−π/2, +π/2).
The triple {C, S, 2O} forms a natural parameterization of shape space.21
The Casorati curvature measures the distance from the origin, the shape index
the latitude and the orientation the longitude. It can be shown that distances This shows the intuitive parameterization of local surface shape. Shape in-
in this space represent the natural measure of fit, that is the square root of the dex, Casorati curvature and orientation together form a natural coordinate
average squared deviation. system for shape space. Casorati curvature is distance from the origin, shape
In measuring the distance between two shapes we can go totally Pro- index latitude and orientation longitude. In this space the metric is Pro-
crustean. crustean with respect to a comparison with planarity.
21
Polar coordinates in Euclidean 3-space. The 2O instead of just O is necessary to obtain
22
the 2π periodicity. I use this example because Gauss does not differentiate cylinders from planes.
22
can easily find the probability density functions for a plane that is articulated
The natural (polar) coordinates of shape space. The
through isotropic Gaussian noise. The result is interesting: there is actually
yellow sphere is a locus of constant Casorati curva-
a higher probability to encounter a hyperbolic than there is to encounter an
ture, the blue plane a locus of constant orientation
elliptic shape! So the Bayesian “explanation” does not “save Alberti’s face”.
and the gray cone a locus of constant shape index.
The red circle indicates the rotational symmetry (pe-
riod π, thus each orientation occurs only once!). The
blue vertical axis is the “umbilical axis” the locus of
shapes like the spherical shell with coincident prin-
cipal curvatures. For umbilicals the orientation re- The probability density for the
mains indeterminate. shape index of a plane articulated
with isotropic Gaussian noise. The
probability to encounter a saddle
(the blue range) is about 57%.
Given a metrical shape space, you can calculate many things of consider-
able interest. For instance, the fact that Alberti’s oversight was not noticed
for centuries gives rise to thought. Is our vision unfit to notice saddle shapes? Of course, in practice there will be various kinds of deviations from this.
One fact that appears to fit such a thought is that sculptors generally concen- Since a potato has a smooth surface that is curved in itself, it it likely to have
trate on elliptical parts, especially convex ones. A cursory look at the cover a bias towards convexity, like the sphere, which is all convex umbilical. A
picture will suggest as much. If this were true, is it due to the fact that sad- geographical landscape will lack major concavities due to the inevitable water
dles are rare? (That would be the standard Bayesian argument.) Well, we erosion processes. And so forth. The Gaussian random relief is a mere ideal.
23
Top left a “graph”, which is an articulated line, a relief often known as a
“curve”. Top right, point samples as a discrete representation of the relief. It
is the 0th –order representation. Bottom left, a discrete representation by line
samples, the tangent lines at discrete points. It is the 1rst –order representa-
At left a relief map of the Gaussan relief, the curves are loci of equal height.
tion. Bottom right, a discrete representation by local shape samples. It is the
Notice that you get to see the Dupin indicatrices at the pits, peaks and passes
2nd –order representation.
of the landscape. At right the flow lines of steepest descent, how water would
run down hill (at least initially, I do not consider any dynamics here). This is
the “gradient field”. In applications reliefs are represented through discrete samples. “Samples”
are local views. At the rock bottom one has mere point samples. Then the
scope is nil, for knowing one point sample tells you nothing about the next
The landscapes are interesting because they show that (one might almost one. It is more useful to have sampling of “linelets”. At any sample you
forget after reading Gauss-type differential geometry!) the local concavities, have a finite slope that tells you something about the local trend and lets you
convexities and saddles hardly exhaust the wealth of surface form. Far from predict the height of the next point, at least if that is close. Even better to have
it! Of course, it might be a problem that this richesse is really too much for a sampling of curvelet samples. Why better? Well, you can compute several
analysis. Here we meet the scale and scope issues again. A hill is a local linelets from a single curvelet, but not vice versa. The curvelets imply greater
object, simple to study, a landscape is a multiplicity of hills, complicated to scope. That is why you need more linelets than curvelets (and even more
study. In the divide and conquer strategy one studies just one generic hill and point samples) in order to approximate the relief within a certain tolerance.
is done with. What is left is to study relations between hills and so forth. In
Of course, you can use even higher orders. But usually a reasonable sweet
rare cases one considers multiple hill objects, think of camel-backs or bikini-
spot is order two (curvelets, shapelets). That is because whereas many reliefs
tops, for instance. The most useful scope depends upon context.
are indeed smooth, few are that smooth. If the relief is fractal you are better
In order to get a feel for this, consider the case of curves. In the relief off with point samples even.
view, curves are articulated lines. One description would be to plot height as Human vision is largely focussed on order two, most likely because the
a function of distance along the line. (This is usually called a“graph”, likewise lower orders are not optically specified, or require absolute calibrations not
landscapes are often called graphs of functions of two variables.) available to physiological mechanisms and the world is too unpredictable over
larger scopes. The Hubel and Wiesel “line detectors” are really second or-
24
der directional derivatives, sustaining local differential geometry of the Gauss brain. I really appreciate that.
type.
Reliefs over large ranges are to be treated as multitudes. One way to deal
with them is through maps. Maps are convenient because of the possibilities
of using various scales (atlases) and degrees of “generalization” or specializa-
tion. This is obviously implemented in the various areas of the early visual
system.
Useful maps for articulated planes are representations of the height con-
tours and representations of the field of steepest descent curves. Technically
one uses the gradient, which is often represented as a vector in the direction
of steepest descent with a magnitude equal to the slope. A very useful ab-
straction from this is “gradient space”. In gradient space I simply retain the
gradient vectors, but ignore their location. I simply let all gradient vectors
start at the origin. Why might this be useful? Well, for instance, if you illu-
minate the relief from a certain direction, with a directional beam, then points
that receive the same illumination lie on simple curves23 in gradient space. If
you invert the map from the landscape to gradient space, you immediately get
the shading.
Here you meet with an interesting and important phenomenon: inverting
the map is not trivial because a single point in gradient space may correspond
to several (or many) in the landscape! Thus the gradient plane is a thoroughly Examples of the Whitney “fold” and “cusp”. Notice the degree of covering
wrinkled copy of the articulated plane. Hassler Whitney24 , a mathematician supplied by the wrinkled carpet. At the fold (top) there is a double covering
who is another one of my heroes, figured out (in 1955) what kinds of wrinkles on one side and no covering on the other side. At the cusp (bottom) the
to expect. You can have folds and cusps. covering is either single or triple.
The folds and cusps are where the degree of covering of the map changes.
The fold and cusps are objects that refer to a wider scope than the shapelets,
The study of the wrinkles in gradient space is thus (among more) a cheap
they are loci where the nature of the local shapelets changes qualitatively.
way to study higher order properties of relief. Indeed, in some sense it might
This is the case because folds correspond to parabolic points. Thus the shape
be said that “nothing happens” between the wrinkles, the folds and cusps are
index changes between elliptic and hyperbolic at a fold. That is why they are
really “where the action is”. Even a superficial understanding of the folds
a convenient way to deal with the complexity of reliefs that goes beyond the
allows you to infer important properties of reliefs, even without form deriva-
mere local views. Yet folds and cusps are again “single objects”, thus they
tions. For instance, you can easily show that the stationary points of the shad-
offer a great opportunity to broaden your view without overloading your bird
ing of reliefs under directional illumination (light spots, dark spots and tonal
23
Easy enough to show that these are conic sections. saddles) is such that they occur only on the preimages of the folds. When you
24
Remember that Google also helps you to find facts about people, especially famous ones. vary the direction of the illuminating beam the stationary points travel along
25
the parabolic curves of the relief. You obtain such important, qualitative in- patterns reveal the face—they don’t paint the face but the shading patterns,
sights almost for free! then psychogenesis of visual awareness constructs the face for you. But this
pattern is essentially determined by that of the parabolic curves.
At left examples of landscapes that illustrate the Whitney “fold” and “cusp”.
The landscapes are smooth throughout, the loci of the folds are the blue
curves at left, the locus of the cusp is a point on one of these. At right the The pattern of parabolic curves (and thus the folds and cusps in gradient
corresponding gradient spaces, the pink planes, with the folds indicated in space) is of even wider scope than a single piece of fold or a cusp by itself.
red. The checkered surfaces may help your intuition a bit, they graphically This is one powerful way to expand your vision of relief.
display the “fold” and “cusp”. Apparently the gradient map is a powertool. Might something similar ap-
ply to the map into shape space? One would certainly expect so. However,
Maybe that was the reason why Felix Klein proposed the pattern of perhaps sadly, there is no extensive formal analysis as to date. I will simply
parabolic curves on a surface as important for the “looks” of a surface. More make a few cheap remarks and leave it up to you to become famous!
specifically, he speculated that the beauty of faces might be found in this pat- I have already shown how to map each point of a relief to a point in shape
tern. In order to check this, he had a student draw the parabolic curves on a space. By plotting all points, I map the relief plane into shape space. It will
copy of a bust of the Apollo Belvedere, at the time considered the pinnacle of be some surface embedded in that space.
male beauty. Hard to say whether he succeeded. Check for yourself! In any Of course, it might be degenerate. A plane is simply mapped on the origin,
case, the idea was hardly crazy. Most painters would agree that the shading a mere point. It is simple enough to derive conditions that ensure that a patch
26
of relief will map on a surface patch in shape space. Generally, such condi-
tions are bound to be met with probability one. Exceptions will be the generic So what is SHAPE?
singularities of maps from the plane into three-dimensional space. These were
again classified by Hassler Whitney.25
Now do we finally know “what SHAPE is”? I’ll recapitulate the major
points. Of course, I’m talking only formal matters here, for accounts on artis-
tic shape you should shop somewhere else. It is not that I feel the formal
understanding to be more important than the artistic one, rather the opposite.
But a solid understanding of the formal description is a sine qua non for any-
The Whitney umbrella. Notice that one who proposes to undertake an experiment involving “shape”.
the surface passes through itself! The first point to understand is that the notion of shape presupposes the no-
The singularity is the “pinch point” tion of space and that the notion of space involves a notion of metric, as well
where the self intersection ends. as the notion of a group of transformations whose action on configurations
leaves their internal metrical properties invariant. Such groups of transfor-
mations are often known as the “proper movements”27 of the space and their
actions as congruences. Notice that space, metric and movements come as a
single package, usually called a geometry. This is essentially Felix Klein’s
famous Erlanger Programm.28
The generic singularity is the “Whitney umbrella”.26 The pattern of pinch In most cases we would also consider a wider group of transformations,
points and curves of self-intersection are of obvious relevance, a kind of namely that of “similarities”. The group of similarities allows one to scale
“shape skeleton” for the relief. It is certain to be important. So far, I have configurations. It includes the group of movements, which are scalings by a
been able to arrive at some interesting relations to the singularities of gradient factor of one. Euclidean similarities are needed to be able to say such things
space, but the real work awaits to be done. as that a tennis ball and the earth “have the same shape”. One has some-
thing similar in other spaces, although the notion of “similarity” may differ
widely.29
27
An example of an improper movement would be a reflection.
28
This is Felix Klein’s proposal (wissenschaftliche Programmschrift) as he joined the
University Erlangen-Nürnberg in 1872, entitled Vergleichende Betrachtungen über neuere
geometrische Forschungen (printed in Mathematische Annalen 43, (1893), pp. 63–100).
The work had enormous impact upon geometry. (For a complete English Translation get
http://arxiv.org/abs/0807.3161.) I take a somewhat narrow view here. For in-
stance, my insistence on a metric is required because I aim to define shape as a quantifiable
property.
25 29
In the nineteenforties. For instance, in Euclidean space a similarity is parameterized by a single scaling factor,
26
Why “umbrella”? Well the canonical model is the implicit equation x2 = y 2 z, which but in (2+1)D space it is natural to distinguish two kinds of similarities (which might be com-
also includes the negative z axis—the handle of the umbrella. bined in various ways), one pertaining to the 2D part and one pertaining to the 1D part. It is
27
Given these formal objects, the Procrustes method yields a very general S OME IDEAS FOR FURTHER READING
method that allows one to define a “shape space” and put a metric on it. That
is typically all that is required for applications in vision. For a general introduction to the various geometrical notions I used here,
Notice that this is a very flexible description that allows one to make few books can beat Hilbert and Cohn-Vossen’s Anschauliche Geometrie of
“shape” a useful concept in mutually very different settings. This is the mean- 1932 (Berlin: Springer). It has been reprinted many times. An English trans-
ing of the formal definition I suggested at the outset: lation Geometry and the Imagination is also available (in the AMS Chelsea
Publishing series of the American Mathematical Society, 1999).
S HAPE is an invariant under the group action of a group of “congruences” A good way to quickly pick up details of Kendall’s spherical blackboard is
or “movements”, http://www.cantab.net/users/michael.behrend/ley_
stats/spher_black/pages/spher_black.html
which no doubt sounded somewhat mysterious at the time. Throughout this It also has references to Kendall’s original papers (much harder to read).
booklet I tried to unpack it for you, step by step. Gauss’s famous treatment Disquisitiones generales circa superficies curvas
This formal description is an “ideal one” in the sense that—for instance— is available on the Internet in various translations. You’ll need some true grit
neither Euclidian points, lines and planes, nor curves and surfaces exist as to read it. An easier way to pick up the essential ideas with respect to shape
material manifestations in physical space. Even the smoothest polished sur- is my Solid Shape (Cambridge MA: MIT Press, 1990). Hackers download it
face is not a surface on the molecular scale. Even the electron is not a point.30 for free, but the publisher does not encourage me to tell how.
In cases of real interest the relevant scales and scopes depend on the context The notion of (2+1)D space is explained in more detail in my Graph Spaces
and their choice is an important part of the formulation of your research pro- (Traiectina: de Clootcrans Press, MMXII). It can be downloaded for free (see
gram. Only once these issues have been settled, the formal structures may be following pages).
applied as suitable “generalizations”. Just remember the example of the earth. Details on the shape space are found in my Shadows of Shape (Traiectina:
Indeed, the earth may be taken to be spherical, flat,31 or fractal, it is all up de Clootcrans Press, MMXII). Again, this can be downloaded for free.
to your perspective! Only once you have decided on your perspective can a A tutorial on Scale-Space (the formal background for what I said on scale
certain formal shape description be made to apply. and scope) can be downloaded from
http://www.gris.informatik.tu-darmstadt.de/
˜akuijper/course/TUD/vbc96tutorial.pdf
This has references to the most useful sources. I recommend this tutorial
(Introduction to Scale-Space Theory: Multiscale Geometric Image Analysis,
Tutorial VBC ’96, Hamburg, Germany, Fourth International Conference on
indeed natural, because these parts are not necessarily (or indeed usually) mutually commen- Visualization in Biomedical Computing) because it should appeal to students
surate. of the neurosciences.
30
I’m being very selective here. Consider how you would handle the shape of an ocean There are numerous books on artistic shape. My best sources are used
wave, a swarm of mosquitos, or a galaxy. Problems start on any scale you happen to fancy. bookstores, but that takes time.
31
Yes, I wouldn’t say the flat earth people are wrong, it is just that their perspective ill
fits our contemporary framework. Forcefully pushing their views, is merely an ill-conceived
I recommend:
cultural mismatch. The “flat earth” fits my personal visual awareness quite well though. Are Philip Rawson (1969): Drawing (The Appreciation of the Arts/3, ed. Harold Os-
you ever bothered by the convexity of the floor? Perhaps you are, I’m not. So what? borne), Oxford University Press.
28
and especially
L.R.Rogers (1969): Sculpture (The Appreciation of the Arts/3, ed. Harold Osborne),
Oxford University Press.
Apart from these you should try to quickly skim through numerous technical
treatises on how to sculpt, draw, paint, . . . , you name it. Of course, you will
have to distill the academic messages yourself.
29
“eye measure” proof of the parallelogram of forces.
OTHER E B OOKS FROM T HE C LOOTCRANS P RESS :
The key argument is
1. Awareness (2012) de cloten sullen uyt haer selven een eeuwich roersel maken, t’welck
2. MultipleWorlds (2012) valsch is.
3. ChronoGeometry (2012)
4. Graph Spaces (2012) Simon Stevin was a Dutch genius, not only a mathematician, but also an
5. Pictorial Shape (2012) engineer with remarkable horse sense. I consider his “clootcrans bewijs” one
6. Shadows of Shape (2012) of the jewels of sixteenth century science. It is “natural philosophy” at its
7. Through the Looking Glass: on Viewing Aids (2012) best.
8. Painting to Marble (2012)
9. Experimental Phenomenology: Art & Science (2012)
10. The Spirit of the New Style (2013)
11. World, Environment, Umwelt and Innerworld (2013).
30
31
32