Basics of Affine Geometry
Basics of Affine Geometry
2
Basics of Affine Geometry
ab
a
O
Ox = (x1 , x2 , x3 )
in the frame (O, (e1 , e2 , e3 )) and
Ωx = (x1 − ω1 , x2 − ω2 , x3 − ω3 )
in the frame (Ω, (e1 , e2 , e3 )).
2.1. Affine Spaces 9
This is because
Ox = OΩ + Ωx and OΩ = (ω1 , ω2 , ω3 ).
We note that in the second frame (Ω, (e1 , e2 , e3 )), points and position vec-
tors are no longer identified. This gives us evidence that points are not
vectors. It may be computationally convenient to deal with points using
position vectors, but such a treatment is not frame invariant, which has
undesirable effets.
Inspired by physics, we deem it important to define points and properties
of points that are frame invariant. An undesirable side effect of the present
approach shows up if we attempt to define linear combinations of points.
First, let us review the notion of linear combination of vectors. Given two
vectors u and v of coordinates (u1 , u2 , u3 ) and (v1 , v2 , v3 ) with respect
to the basis (e1 , e2 , e3 ), for any two scalars λ, µ, we can define the linear
combination λu + µv as the vector of coordinates
(λu1 + µv1 , λu2 + µv2 , λu3 + µv3 ).
If we choose a different basis (e01 , e02 , e03 ) and if the matrix P expressing the
vectors (e01 , e02 , e03 ) over the basis (e1 , e2 , e3 ) is
a 1 b1 c 1
P = a 2 b2 c 2 ,
a 3 b3 c 3
which means that the columns of P are the coordinates of the e0j over the
basis (e1 , e2 , e3 ), since
u1 e1 + u2 e2 + u3 e3 = u01 e01 + u02 e02 + u03 e03
and
v1 e1 + v2 e2 + v3 e3 = v10 e01 + v20 e02 + v30 e03 ,
it is easy to see that the coordinates (u1 , u2 , u3 ) and (v1 , v2 , v3 ) of u and v
with respect to the basis (e1 , e2 , e3 ) are given in terms of the coordinates
(u01 , u02 , u03 ) and (v10 , v20 , v30 ) of u and v with respect to the basis (e01 , e02 , e03 )
by the matrix equations
0 0
u1 u1 v1 v1
u2 = P u02 and v2 = P v20 .
u3 u03 v3 v30
From the above, we get
0
v10
u1 u1 v1
u02 = P −1 u2 and v20 = P −1 v2 ,
u03 u3 v30 v3
and by linearity, the coordinates
(λu01 + µv10 , λu02 + µv20 , λu03 + µv30 )
10 2. Basics of Affine Geometry
b=a+u
u
a c=a+w w
−
→
(A3) For any two points a, b ∈ E, there is a unique u ∈ E such that
a + u = b.
−
→
The unique vector u ∈ E such that a + u = b is denoted by ab, or
→
−
sometimes by ab, or even by b − a. Thus, we also write
b = a + ab
→
−
(or b = a + ab, or even b = a + (b − a)).
− → ® −→
The dimension of the affine space E, E , + is the dimension dim( E )
−
→
of the vector space E . For simplicity, it is denoted by dim(E).
−
→
Conditions (A1) and (A2) say that the (abelian) group E acts on E,
−
→
and condition (A3) says that E acts transitively and faithfully on E. Note
that
a(a + v) = v
−
→
for all a ∈ E and all v ∈ E , since a(a + v) is the unique vector such that
a + v = a + a(a + v). Thus, b = a + v is equivalent to ab = v. Figure
2.2 gives an intuitive picture of an affine space. It is natural to think of all
vectors as having the same origin, the null vector.
− → ®
The axioms defining an affine space E, E , + can be interpreted intu-
−
→
itively as saying that E and E are two different ways of looking at the
same object, but wearing different sets of glasses, the second set of glasses
depending on the choice of an “origin” in E. Indeed, we can choose to
look at the points in E, forgetting that every pair (a, b) of points defines
−→
a unique vector ab in E , or we can choose to look at the vectors u in
−
→
E , forgetting the points in E. Furthermore, if we also pick any point a in
E, a point that can be viewed as an origin in E, then we can recover all
2.1. Affine Spaces 13
−
→
the points in E as the translated points a + u for all u ∈ E . This can be
−
→
formalized by defining two maps between E and E .
−→
For every a ∈ E, consider the mapping from E to E given by
u 7→ a + u,
−
→ −
→
where u ∈ E , and consider the mapping from E to E given by
b 7→ ab,
where b ∈ E. The composition of the first mapping with the second is
u 7→ a + u 7→ a(a + u),
which, in view of (A3), yields u. The composition of the second with the
first mapping is
b 7→ ab 7→ a + ab,
which, in view of (A3), yields b. Thus, these compositions are the identity
−→ −→
from E to E and the identity from E to E, and the mappings are both
bijections.
−
→
When we identify E with E via the mapping b 7→ ab, we say that we
consider E as the vector space obtained by taking a as the origin in E, and
− → ®
we denote it by Ea . Thus, an affine space E, E , + is a way of defining a
vector space structure on a set of points E, without making a commitment
to a fixed origin in E. Nevertheless, as soon as we commit to an origin a in
E, we can view E as the vector space Ea . However, we urge the reader to
−
→
think of E as a physical set of points and of E as a set of forces acting on
E, rather than reducing E to some isomorphic copy of Rn . After all, points
are points, and not vectors! For notational simplicity, we will often denote
− → ® −→ −
→
an affine space E, E , + by (E, E ), or even by E. The vector space E is
called the vector space associated with E.
Ä One should be careful about the overloading of the addition sym-
bol +. Addition is well-defined on vectors, as in u+v; the translate
−→
a + u of a point a ∈ E by a vector u ∈ E is also well-defined, but addition
of points a + b does not make sense. In this respect, the notation b − a
for the unique vector u such that b = a + u is somewhat confusing, since it
suggests that points can be subtracted (but not added!). Yet, we will see
in Section 4.1 that it is possible to make sense of linear combinations of
points, and even mixed linear combinations of points and vectors.
−
→
Any vector space E has an affine space structure specified by choosing
−→ −
→
E = E , and letting + be addition in the vector space E . We will refer
−
→ − → ® −
→
to the affine structure E , E , + on a vector space E as the canonical
14 2. Basics of Affine Geometry
−
→
(or natural) affine structure on E . In particular,
® the vector space Rn can
n n n
be viewed as the affine space R ,nR , n+ , ®denoted by A . In general, if
K is any field, the affine space K , K , + is denoted by AnK . In order
to distinguish between the double role played by members of Rn , points
and vectors, we will denote points by row vectors, and vectors by column
vectors. Thus, the action of the vector space Rn over the set Rn simply
viewed as a set of points is given by
u1
..
(a1 , . . . , an ) + . = (a1 + u1 , . . . , an + un ).
un
We will also use the convention that if x = (x1 , . . . , xn ) ∈ Rn , then the
column vector associated with x is denoted by x (in boldface notation).
Abusing the notation slightly, if a ∈ Rn is a point, we also write a ∈ An .
The affine space An is called the real affine space of dimension n. In most
cases, we will consider n = 1, 2, 3.
the action +: H × R2 → H of R 2
µ on¶ H defined such that for every point
u
(x, y, 1 − x − y) on H and any ∈ R2 ,
v
µ ¶
u
(x, y, 1 − x − y) + = (x + u, y + v, 1 − x − u − y − v).
v
x2 + y 2 − z = 0.
The set P is a paraboloid of revolution, with axis Oz. The surface P can
be made into an official affine space by defining the action +: P × R2 → P
of R¶2 on P defined such that for every point (x, y, x2 + y 2 ) on P and any
µ
u
∈ R2 ,
v
µ ¶
2 2 u
(x, y, x + y ) + = (x + u, y + v, (x + u)2 + (y + v)2 ).
v
This should dispell any idea that affine spaces are dull. Affine spaces not
already equipped with an obvious vector space structure arise in projective
geometry. Indeed, we will see in Section 5.1 that the complement of a
hyperplane in a projective space has an affine structure.
16 2. Basics of Affine Geometry
−
→
E E
b
ab
a c ac
bc
risk of boring certain readers, we give another example showing what goes
wrong if we are not careful in defining linear combinations of points.
Consider R2 as an affine space, under µ ¶its natural
µ ¶coordinate system with
1 0
origin O = (0, 0) and basis vectors and . Given any two points
0 1
a = (a1 , a2 ) and b = (b1 , b2 ), it is natural to define the affine combination
λa + µb as the point of coordinates
(λa1 + µb1 , λa2 + µb2 ).
Thus, when a = (−1, −1) and b = (2, 2), the point a + b is the point
c = (1, 1).
Let us now consider the new coordinate system with respect to the origin
c = (1, 1) (and the same basis vectors). This time, the coordinates of a are
(−2, −2), the coordinates of b are (1, 1), and the point a + b is the point d
of coordinates (−1, −1). However, it is clear that the point d is identical to
the origin O = (0, 0) of the first coordinate system.
Thus, a + b corresponds to two different points depending on which
coordinate system is used for its computation!
This shows that some extra condition is needed in order for affine com-
binations to make sense. It turns out that if the scalars sum up to 1, the
definition is intrinsic, as the following lemma shows.
Lemma 2.4.1 Given an affine space E, let (ai )i∈I be a family of points in
E, and let (λi )i∈I be a family of scalars. For any two points a, b ∈ E, the
following properties hold:
P
(1) If i∈I λi = 1, then
X X
a+ λi aai = b + λi bai .
i∈I i∈I
P
(2) If i∈I λi = 0, then
X X
λi aai = λi bai .
i∈I i∈I
Thus, by Lemma 2.4.1, for any P family of points (ai )i∈I in E, for any
family (λi )i∈I of scalars such that i∈I λi = 1, the point
X
x=a+ λi aai
i∈I
Definition 2.4.2 For any P family of points (ai )i∈I in E, for any family
(λi )i∈I of scalars such that i∈I λi = 1, and for any a ∈ E, the point
X
a+ λi aai
i∈I
Remarks:
(2) This result still holds, provided that the field K has at least three
distinct elements, but the proof is trickier!
P P
(3) When i∈I λi = 0, the vector P i∈I λi aai does not depend on the
point a, and we may denote it by i∈I λi ai . This observation will be
used in Section 4.1 to define a vector space in which linear combina-
tions
P of both points and vectors make sense, regardless of the value
of i∈I λi .
Later on, we will see that a polynomial curve can be defined as a set of
barycenters of a fixed number of points. For example, let (a, b, c, d) be a
sequence of points in A2 . Observe that
g1
a b
c
g2
a b
−
→
U
this time without any restriction on the λi , since the right-hand side
−
→ −
→
of the equation is null. Thus, U is a subspace of R2 . In fact, U is one-
dimensional, and it is just a usual line in R2 . This line can be identified
with a line passing through the origin of A2 , a line that is parallel to the
line U of equation ax + by = c, as illustrated in Figure 2.6.
Now, if (x0 , y0 ) is any point in U , we claim that
−
→
U = (x0 , y0 ) + U ,
where
−
→ n −
→o
(x0 , y0 ) + U = (x0 + u1 , y0 + u2 ) | (u1 , u2 ) ∈ U .
−
→
First, (x0 , y0 ) + U ⊆ U , since ax0 + by0 = c and au1 + bu2 = 0 for all
−
→
(u1 , u2 ) ∈ U . Second, if (x, y) ∈ U , then ax + by = c, and since we also
have ax0 + by0 = c, by subtraction, we get
a(x − x0 ) + b(y − y0 ) = 0,
−
→ −
→
which shows that (x − x0 , y − y0 ) ∈ U , and thus (x, y) ∈ (x0 , y0 ) + U .
−
→ −
→
Hence, we also have U ⊆ (x0 , y0 ) + U , and U = (x0 , y0 ) + U .
The above example shows that the affine line U defined by the equation
ax + by = c
2.5. Affine Subspaces 23
−
→
is obtained by “translating” the parallel line U of equation
ax + by = 0
passing through the origin. In fact, given any point (x0 , y0 ) ∈ U ,
−
→
U = (x0 , y0 ) + U .
More generally, it is easy to prove the following fact. Given any m × n
matrix A and any vector b ∈ Rm , the subset U of Rn defined by
U = {x ∈ Rn | Ax = b}
is an affine subspace of An .
Actually, observe that Ax = b should really be written as Ax> = b, to be
consistent with our convention that points are represented by row vectors.
We can also use the boldface notation for column vectors, in which case
the equation is written as Ax = b. For the sake of minimizing the amount
of notation, we stick to the simpler (yet incorrect) notation Ax = b. If we
consider the corresponding homogeneous equation Ax = 0, the set
−
→
U = {x ∈ Rn | Ax = 0}
is a subspace of Rn , and for any x0 ∈ U , we have
−
→
U = x0 + U .
This is a general situation. Affine subspaces can be characterized in terms
−→
of subspaces of E . Let V be a nonempty subset of E. For every family
(a1 , . . . , an ) in V , for any family (λ1 , . . . , λn ) of scalars, and for every point
a ∈ V , observe that for every x ∈ E,
n
X
x=a+ λi aai
i=1
since
n
X ³ n
X ´
λi + 1 − λi = 1.
i=1 i=1
−
→ −
→ −
→
Given any point a ∈ E and any subset V of E , let a + V denote the
following subset of E:
−
→ n −
→o
a+ V = a+v | v ∈ V .
24 2. Basics of Affine Geometry
−
→
E E
−
→
a V
−
→
V =a+ V
−
→
Figure 2.7. An affine subspace V and its direction V
− → ®
Lemma 2.5.2 Let E, E , + be an affine space.
(1) A nonempty subset V of E is an affine subspace iff for every point
a ∈ V , the set
−
→
Va = {ax | x ∈ V }
−
→ −
→
is a subspace of E . Consequently, V = a + Va . Furthermore,
−
→
V = {xy | x, y ∈ V }
−
→ −
→ − → −
→
is a subspace of E and Va = V for all a ∈ E. Thus, V = a + V .
−
→ −
→ −
→
(2) For any subspace V of E and for any a ∈ E, the set V = a + V is
an affine subspace.
Proof . The proof is straightforward, and is omitted. It is also given in
Gallier [70].
Remarks:
(1) Since it can be shown that the barycenter of n weighted points can
be obtained by repeated computations of barycenters of two weighted
points, a nonempty subset V of E is an affine subspace iff for every
two points a, b ∈ V , the set V contains all barycentric combinations of
a and b. If V contains at least two points, then V is an affine subspace
iff for any two distinct points a, b ∈ V , the set V contains the line
26 2. Basics of Affine Geometry
determined by a and b, that is, the set of all points (1 − λ)a + λb,
λ ∈ R.
(2) This result still holds if the field K has at least three distinct elements,
but the proof is trickier!
Since
ak aj = a k ai + a i aj ,
we have
X X X
λ j ak aj = λ j ak ai + λ j ai aj ,
j∈(I−{k}) j∈(I−{k}) j∈(I−{k})
X X
= λ j ak ai + λ j ai aj ,
j∈(I−{k}) j∈(I−{i,k})
X µ X ¶
= λ j ai aj − λ j ai ak ,
j∈(I−{i,k}) j∈(I−{k})
and thus
X µ X ¶
λ j ai aj − λj ai ak = 0.
j∈(I−{i,k}) j∈(I−{k})
of the ui , then the λi are unique. A similar result holds for affinely
independent points.
− → ®
Lemma 2.6.3 Given an affine space E, E , + , let (a0 , .P . . , am ) be a fam-
m
ily
Pm of m + 1 points in E. Let x ∈ E, and assume that x = i=0Pλmi ai , where
λ
i=0 i = 1. Then, the family (λ 0 , . . . , λ m ) such that x = i=0 λi ai is
unique iff the family (a0 a1 , . . . , a0 am ) is linearly independent.
Lemma 2.6.3 suggests the notion of affine frame. Affine frames are the
− → ®
affine analogues of bases in vector spaces. Let E, E , + be a nonempty
affine space, and let (a0 , . . . , am ) be a family of m + 1 points in E. The
family (a0 , . . . , am ) determines the family of m vectors (a0 a1 , . . . , a0 am ) in
−
→
E . Conversely, given a point a0 in E and a family of m vectors (u1 , . . . , um )
−→
in E , we obtain the family of m + 1 points (a0 , . . . , am ) in E, where ai =
a0 + ui , 1 ≤ i ≤ m.
Thus, for any m ≥ 1, it is equivalent to consider a family of m + 1 points
(a0 , . . . , am ) in E, and a pair (a0 , (u1 , . . . , um )), where the ui are vectors
−→
in E . Figure 2.8 illustrates the notion of affine independence.
Remark: The above observation also applies to infinite families (ai )i∈I of
−
→
points in E and families (−
→)
u i i∈I−{0} of vectors in E , provided that the
index set I contains 0.
−
→
When (a0 a1 , . . . , a0 am ) is a basis of E then, for every x ∈ E, since
x = a0 + a0 x, there is a unique family (x1 , . . . , xm ) of scalars such that
x = a 0 + x 1 a0 a1 + · · · + x m a0 am .
28 2. Basics of Affine Geometry
−
→
E E
a2
a0 a2
a0 a1 a0 a1
Even though Lemma 2.6.5 is rather dull, it is one of the key ingredi-
ents in the proof of beautiful and deep theorems about convex sets, such
as Carathéodory’s theorem, Radon’s theorem, and Helly’s theorem (see
Section 3.1).
A family of two points (a, b) in E is affinely independent iff ab 6= 0, iff
a 6= b. If a 6= b, the affine subspace generated by a and b is the set of all
points (1 − λ)a + λb, which is the unique line passing through a and b. A
family of three points (a, b, c) in E is affinely independent iff ab and ac
are linearly independent, which means that a, b, and c are not on the same
line (they are not collinear). In this case, the affine subspace generated by
(a, b, c) is the set of all points (1 − λ − µ)a + λb + µc, which is the unique
plane containing a, b, and c. A family of four points (a, b, c, d) in E is affinely
independent iff ab, ac, and ad are linearly independent, which means that
a, b, c, and d are not in the same plane (they are not coplanar). In this
30 2. Basics of Affine Geometry
a2
a0 a0 a1
a3
a0 a1 a0 a2
a1
Figure 2.9. Examples of affine frames and their convex hulls
case, a, b, c, and d are the vertices of a tetrahedron. Figure 2.9 shows affine
frames and their convex hulls for |I| = 0, 1, 2, 3.
Given n+1 affinely independent points (a0 , . . . , an ) in E, we can consider
the set of points λ0 a0 +· · · +λn an , where λ0 +· · ·+λn = 1 and λi ≥ 0 (λi ∈
R). Such affine combinations are called convex combinations. This set is
called the convex hull of (a0 , . . . , an ) (or n-simplex spanned by (a0 , . . . , an )).
When n = 1, we get the segment between a0 and a1 , including a0 and a1 .
When n = 2, we get the interior of the triangle whose vertices are a0 , a1 , a2 ,
including boundary points (the edges). When n = 3, we get the interior of
the tetrahedron whose vertices are a0 , a1 , a2 , a3 , including boundary points
(faces and edges). The set
{a0 + λ1 a0 a1 + · · · + λn a0 an | where 0 ≤ λi ≤ 1 (λi ∈ R)}
is called the parallelotope spanned by (a0 , . . . , an ). When E has dimension
2, a parallelotope is also called a parallelogram, and when E has dimension
3, a parallelepiped .
More generally, we say that a subset V of E is convex if for any two
points a, b ∈ V , we have c ∈ V for every point c = (1 − λ)a + λb, with
0 ≤ λ ≤ 1 (λ ∈ R).
Ä Points are not vectors! The following example illustrates why
treating points as vectors may cause problems. Let a, b, c be three
2.7. Affine Maps 31
and
X X X
λi (b + h(vi )) = b + λi b(b + h(vi )) = b + λi h(vi ),
i∈I i∈I i∈I
we have
à ! à !
X X
f λi (a + vi ) =f a+ λ i vi
i∈I i∈I
à !
X
=b+h λ i vi
i∈I
X
=b+ λi h(vi )
i∈I
X
= λi (b + h(vi ))
i∈I
X
= λi f (a + vi ).
i∈I
P
Note that the condition i∈I λi = 1 was implicitly used (in a hidden
call to Lemma 2.4.1) in deriving that
X X
λi (a + vi ) = a + λ i vi
i∈I i∈I
and
X X
λi (b + h(vi )) = b + λi h(vi ).
i∈I i∈I
d c
a0 b0
a b
Figure 2.10. The effect of a shear
c0
d0
d c
b0
a b a0
Figure 2.11. The effect of an affine map
−
→ −
→ − → −
→
for every v ∈ E is a linear map f : E → E 0 . Indeed, we can write
a + λv = λ(a + v) + (1 − λ)a,
since a + λv = a + λa(a + v) + (1 − λ)aa, and also
a + u + v = (a + u) + (a + v) − a,
since a+u+v = a+a(a + u)+a(a + v)−aa. Since f preserves barycenters,
we get
f (a + λv) = λf (a + v) + (1 − λ)f (a).
P
If we recall that x = P i∈I λi ai is the barycenter of a family ((ai , λi ))i∈I of
weighted points (with i∈I λi = 1) iff
X
bx = λi bai for every b ∈ E,
i∈I
we get
f (a)f (a + λv) = λf (a)f (a + v) + (1 − λ)f (a)f (a) = λf (a)f (a + v),
−
→ −
→
showing that f (λv) = λ f (v). We also have
f (a + u + v) = f (a + u) + f (a + v) − f (a),
from which we get
f (a)f (a + u + v) = f (a)f (a + u) + f (a)f (a + v),
−
→ −
→ −
→ −
→
showing that f (u + v) = f (u) + f (v). Consequently, f is a linear map.
For any other point b ∈ E, since
b + v = a + ab + v = a + a(a + v) − aa + ab,
b + v = (a + v) − a + b, and since f preserves barycenters, we get
f (b + v) = f (a + v) − f (a) + f (b),
which implies that
f (b)f (b + v) = f (b)f (a + v) − f (b)f (a) + f (b)f (b),
= f (a)f (b) + f (b)f (a + v),
= f (a)f (a + v).
−
→
Thus, f (b)f (b + v) = f (a)f (a + v), which shows that the definition of f
−
→
does not depend on the choice of a ∈ E. The fact that f is unique is
−→
obvious: We must have f (v) = f (a)f (a + v).
−
→ − → −→
The unique linear map f : E → E 0 given by Lemma 2.7.2 is called the
linear map associated with the affine map f .
2.7. Affine Maps 35
so that y = Ax + b is equivalent to
µ ¶ µ ¶µ ¶
y A b x
= .
1 0 1 1
Observe that Ha,λ (a) = a, and when λ 6= 0 and x 6= a, Ha,λ (x) is on the
line defined by a and x, and is obtained by “scaling” ax by λ.
Figure 2.12 shows the effect of a central dilatation of center d. The tri-
angle (a, b, c) is magnified to the triangle (a0 , b0 , c0 ). Note how every line is
mapped to a parallel line.
−−→
When λ = 1, Ha,1 is the identity. Note that Ha,λ = λ id− →
E
. When λ 6= 0,
it is clear that Ha,λ is an affine bijection. It is immediately verified that
Ha,λ ◦ Ha,µ = Ha,λµ .
We have the following useful result.
Lemma 2.8.1 Given any affine space E, for any affine bijection f ∈
−
→
GA(E), if f = λ id− →E
, for some λ ∈ R∗ with λ 6= 1, then there is a
unique point c ∈ E such that f = Hc,λ .
2.8. Affine Groups 39
0
a
b b0
d
c0
−
→
Clearly, if f = id− →E
, the affine map f is a translation. Thus, the group
of affine dilatations DIL(E) is the disjoint union of the translations and
of the dilatations of ratio λ 6= 0, 1. Affine dilatations can be given a purely
geometric characterization.
Another point worth mentioning is that affine bijections preserve the ra-
tio of volumes of parallelotopes. Indeed, given any basis B = (u1 , . . . , um )
−→
of the vector space E associated with the affine space E, given any m + 1
affinely independent points (a0 , . . . , am ), we can compute the determi-
nant detB (a0 a1 , . . . , a0 am ) w.r.t. the basis B. For any bijective affine map
f : E → E, since
³− → −
→ ´ ¡−
→¢
detB f (a0 a1 ), . . . , f (a0 am ) = det f detB (a0 a1 , . . . , a0 am )
−
→
and the determinant of a linear map is intrinsic (i.e., depends only on f ,
and not on the particular basis B), we conclude that the ratio
³−→ −
→ ´
detB f (a0 a1 ), . . . , f (a0 am ) ¡−
→¢
= det f
detB (a0 a1 , . . . , a0 am )
is independent of the basis B. Since detB (a0 a1 , . . . , a0 am ) is the volume of
the parallelotope spanned by (a0 , . . . , am ), where the parallelotope spanned
by any point a and the vectors (u1 , . . . , um ) has unit volume (see Berger
[12], Section 9.12), we see that affine bijections preserve the ratio of volumes
40 2. Basics of Affine Geometry
We also have the following simple lemma, whose proof is left as an easy
exercise.
Lemma 2.9.2 Given any affine space E, given any two distinct points
a, b ∈ E, and for any affine dilatation f different from the identity, if
2.9. Affine Geometry: A Glimpse 41
a1 b1
H1
H2
a2 b2
a3 b3
H3
A B
a0 = f (a), D = ha, bi is the line passing through a and b, and D 0 is the line
parallel to D and passing through a0 , the following are equivalent:
(i) b0 = f (b);
(ii) If f is a translation, then b0 is the intersection of D 0 with the line
parallel to ha, a0 i passing through b;
If f is a dilatation of center c, then b0 = D0 ∩ hc, bi.
The first case is the parallelogram law, and the second case follows easily
from Thales’ theorem.
We are now ready to prove two classical results of affine geometry,
Pappus’s theorem and Desargues’s theorem. Actually, these results are the-
orems of projective geometry, and we are stating affine versions of these
important results. There are stronger versions that are best proved using
projective geometry.
Lemma 2.9.3 Given any affine plane E, any two distinct lines D and D 0 ,
then for any distinct points a, b, c on D and a0 , b0 , c0 on D0 , if a, b, c, a0 , b0 ,
c0 are distinct from the intersection of D and D 0 (if D and D 0 intersect)
42 2. Basics of Affine Geometry
c
D
c0
b0
D0
a0
Figure 2.14. Pappus’s theorem (affine version)
and if the lines ha, b0 i and ha0 , bi are parallel, and the lines hb, c0 i and hb0 , ci
are parallel, then the lines ha, c0 i and ha0 , ci are parallel.
Lemma 2.9.4 Given any affine space E, and given any two triangles
(a, b, c) and (a0 , b0 , c0 ), where a, b, c, a0 , b0 , c0 are all distinct, if ha, bi and
ha0 , b0 i are parallel and hb, ci and hb0 , c0 i are parallel, then ha, ci and ha0 , c0 i
2.9. Affine Geometry: A Glimpse 43
0
a
b b0
d
c0
are parallel iff the lines ha, a0 i, hb, b0 i, and hc, c0 i are either parallel or
concurrent (i.e., intersect in a common point).
Proof . We prove half of the lemma, the direction in which it is assumed
that ha, ci and ha0 , c0 i are parallel, leaving the converse as an exercise. Since
the lines ha, bi and ha0 , b0 i are parallel, the points a, b, a0 , b0 are coplanar.
Thus, either ha, a0 i and hb, b0 i are parallel, or they have some intersection d.
We consider the second case where they intersect, leaving the other case as
an easy exercise. Let f be the dilatation of center d such that f (a) = a0 . By
Lemma 2.9.2, we get f (b) = b0 . If f (c) = c00 , again by Lemma 2.9.2 twice,
the lines hb, ci and hb0 , c00 i are parallel, and the lines ha, ci and ha0 , c00 i are
parallel. From this it follows that c00 = c0 . Indeed, recall that hb, ci and
hb0 , c0 i are parallel, and similarly ha, ci and ha0 , c0 i are parallel. Thus, the
lines hb0 , c00 i and hb0 , c0 i are identical, and similarly the lines ha0 , c00 i and
ha0 , c0 i are identical. Since a0 c0 and b0 c0 are linearly independent, these
lines have a unique intersection, which must be c00 = c0 .
The direction where it is assumed that the lines ha, a0 i, hb, b0 i and hc, c0 i,
are either parallel or concurrent is left as an exercise (in fact, the proof is
quite similar).
Lemma 2.11.2 Given any affine space E, for any two nonempty affine
subspaces M and N , the following facts hold:
−
→ − →
(1) M ∩ N 6= ∅ iff ab ∈ M + N for some a ∈ M and some b ∈ N .
−
→ − →
(2) M ∩ N consists of a single point iff ab ∈ M + N for some a ∈ M
−
→ − →
and some b ∈ N , and M ∩ N = {0}.
−
→
(3) If S is the least affine subspace containing M and N , then S =
−
→ − → −
→
M + N + Kab (the vector space E is defined over the field K).
Proof . (1) Pick any a ∈ M and any b ∈ N , which is possible, since M and
−
→ −
→
N are nonempty. Since M = {ax | x ∈ M } and N = {by | y ∈ N }, if
−
→
M ∩ N 6= ∅, for any c ∈ M ∩ N we have ab = ac − bc, with ac ∈ M and
−
→ −
→ − → −
→ − →
bc ∈ N , and thus, ab ∈ M + N . Conversely, assume that ab ∈ M + N
for some a ∈ M and some b ∈ N . Then ab = ax + by, for some x ∈ M
and some y ∈ N . But we also have
ab = ax + xy + yb,
and thus we get 0 = xy + yb − by, that is, xy = 2by. Thus, b is the middle
of the segment [x, y], and since yx = 2yb, x = 2b − y is the barycenter of
the weighted points (b, 2) and (y, −1). Thus x also belongs to N , since N
being an affine subspace, it is closed under barycenters. Thus, x ∈ M ∩ N ,
and M ∩ N 6= ∅.
(2) Note that in general, if M ∩ N 6= ∅, then
−−−−→ −→ − →
M ∩N = M ∩ N,
2.12. Problems 47
because
−−−−→ −
→ − →
M ∩ N = {ab | a, b ∈ M ∩N } = {ab | a, b ∈ M }∩{ab | a, b ∈ N } = M ∩ N .
−−−−→
Since M ∩ N = c + M ∩ N for any c ∈ M ∩ N , we have
−
→ − →
M ∩N =c+M ∩ N for any c ∈ M ∩ N .
From this it follows that if M ∩ N 6= ∅, then M ∩ N consists of a single
−
→ − →
point iff M ∩ N = {0}. This fact together with what we proved in (1)
proves (2).
(3) This is left as an easy exercise.
Remarks:
(1) The proof of Lemma 2.11.2 shows that if M ∩ N 6= ∅, then ab ∈
−
→ − →
M + N for all a ∈ M and all b ∈ N .
(2) Lemma 2.11.2 implies that for any two nonempty affine subspaces M
−
→ − → − →
and N , if E = M ⊕ N , then M ∩N consists of a single point. Indeed,
−
→ − → − → −→
if E = M ⊕ N , then ab ∈ E for all a ∈ M and all b ∈ N , and since
−
→ − →
M ∩ N = {0}, the result follows from part (2) of the lemma.
We can now state the following lemma.
Lemma 2.11.3 Given an affine space E and any two nonempty affine
subspaces M and N , if S is the least affine subspace containing M and N ,
then the following properties hold:
(1) If M ∩ N = ∅, then
−
→ − →
dim(M ) + dim(N ) < dim(E) + dim(M + N )
and
−→ − →
dim(S) = dim(M ) + dim(N ) + 1 − dim( M ∩ N ).
(2) If M ∩ N 6= ∅, then
dim(S) = dim(M ) + dim(N ) − dim(M ∩ N ).
Proof . The proof is not difficult, using Lemma 2.11.2 and Lemma 2.11.1,
but we leave it as an exercise.
2.12 Problems
Problem 2.1 Given a triangle (a, b, c), give a geometric construction of
the barycenter of the weighted points (a, 14 ), (b, 41 ), and (c, 21 ). Give a geo-
48 2. Basics of Affine Geometry
respect to the standard affine frame for A3 , prove that a, b, c, d are coplanar
iff
¯ a 1 b1 c 1 d 1 ¯
¯ ¯
¯ a 2 b2 c 2 d 2 ¯
¯ ¯
¯ = 0,
¯ a 3 b3 c 3 d 3 ¯
¯
1 1 1 1
¯ ¯
or, equivalently,
(a1 b2 − a2 b1 )x + (a2 b0 − a0 b2 )y + (a0 b1 − a1 b0 )z = 0,
where (x, y, z) are the barycentric coordinates of the generic point on the
line ha, bi.
Prove that the equation of a line in barycentric coordinates is of the form
ux + vy + wz = 0,
where u 6= v or v 6= w or u 6= w. Show that two equations
ux + vy + wz = 0 and u0 x + v 0 y + w 0 z = 0
represent the same line in barycentric coordinates iff (u0 , v 0 , w0 ) = λ(u, v, w)
for some λ ∈ R (with λ 6= 0).
50 2. Basics of Affine Geometry
direction of D0 and D00 . In either case, let (m, m0 , m00 ) be the barycentric
coordinates of M , as explained at the beginning of the problem. We call
M the intersection of D0 and D00 . Similarly, define N = (n, n0 , n00 ) as the
intersection of D1 and D10 , and P = (p, p0 , p00 ) as the intersection of D2 and
D20 .
Prove that
¯ ¯
¯ m
¯ 0 n0 p0 ¯
¯
¯m
¯ 00 n00 p00 ¯ = 0
¯
¯m n p ¯
iff either
(i) (D0 , D00 ), (D1 , D10 ), and (D2 , D20 ) are pairs of parallel lines; or
(ii) the lines of some pair (Di , Di0 ) are parallel, each pair (Dj , Dj0 ) (with
j 6= i) has a unique intersection point, and these two intersection
points are distinct and determine a line parallel to the lines of the
pair (Di , Di0 ); or
(iii) each pair (Di , Di0 ) (i = 0, 1, 2) has a unique intersection point, and
these points M, N, P are distinct and collinear.
and
det(Ox, Ob, Oc)
λ0 = .
det(Oa, Ob, Oc)
(iii) Let (a, b, c, d) be four points in A3 , and assume that (a, b, c, d) are
not coplanar. For any point x ∈ A3 , if x = λ0 a + λ1 b + λ2 c + λ3 d,
where (λ0 , λ1 , λ2 , λ3 ) are the barycentric coordinates of x with respect to
(a, b, c, d), show that
det(ax, ac, ad) det(ab, ax, ad) det(ab, ac, ax)
λ1 = , λ2 = , λ3 = ,
det(ab, ac, ad) det(ab, ac, ad) det(ab, ac, ad)
and
det(xb, bc, bd)
λ0 = .
det(ab, ac, ad)
Conclude that λ0 , λ1 , λ2 , λ3 are certain signed ratios of the volumes of the
five tetrahedra (a, b, c, d), (x, a, b, c), (x, a, b, d), (x, a, c, d), and (x, b, c, d).
(iv) Let (a0 , . . . , am ) be m + 1 points in Am , and assume that they are
affinely independent. For any point x ∈ Am , if x = λ0 a0 + · · · + λm am ,
where (λ0 , . . . , λm ) are the barycentric coordinates of x with respect to
(a0 , . . . , am ), show that
det(a0 a1 , . . . , a0 ai−1 , a0 x, a0 ai+1 , . . . , a0 am )
λi =
det(a0 a1 , . . . , a0 ai−1 , a0 ai , a0 ai+1 , . . . , a0 am )
for every i, 1 ≤ i ≤ m, and
det(xa1 , a1 a2 , . . . , a1 am )
λ0 = .
det(a0 a1 , . . . , a0 ai , . . . , a0 am )
Conclude that λi is the signed ratio of the volumes of the simplexes (a0 , . . .,
x, . . . am ) and (a0 , . . . , ai , . . . am ), where 0 ≤ i ≤ m.
Problem 2.17 With respect to the standard affine frame for the plane
A2 , consider the three geometric transformations f1 , f2 , f3 defined by
√
1 3 3
x0 = − x − y+ ,
√ 4 4 √4
3 1 3
y0 = x− y+ ,
4 4
√ 4
1 3 3
x0 = − x + y− ,
4√ 4 4
√
3 1 3
y0 = − x− y+ ,
4 4 4
1
x0 = x,
2 √
0 1 3
y = y+ .
2 2
56 2. Basics of Affine Geometry
(a) Prove that these maps are affine. Can you describe geometrically
what their action is (rotation, translation, scaling)?
(b) Given any polygonal line L, define the following sequence of polygonal
lines:
S0 = L,
Sn+1 = f1 (Sn ) ∪ f2 (Sn ) ∪ f3 (Sn ).
Construct S1 starting from the line segment L = ((−1, 0), (1, 0)).
Can you figure out what Sn looks like in general? (You may want to
write a computer program.) Do you think that Sn has a limit?
Problem 2.18 In the plane A2 , with respect to the standard affine frame,
a point of coordinates (x, y) can be represented as the complex number
z = x + iy. Consider the set of geometric transformations of the form
z 7→ az + b,
where a, b are complex numbers such that a 6= 0.
(a) Prove that these maps are affine. Describe what these maps do
geometrically.
(b) Prove that the above set of maps is a group under composition.
(c) Consider the set of geometric transformations of the form
z 7→ az + b or z 7→ az + b,
where a, b are complex numbers such that a 6= 0, and where z = x − iy if
z = x + iy. Describe what these maps do geometrically. Prove that these
maps are affine and that this set of maps is a group under composition.
Problem 2.19 Given a group G, a subgroup H of G is called a normal
subgroup of G iff xHx−1 = H for all x ∈ G (where xHx−1 = {xhx−1 | h ∈
H}).
(i) Given any two subgroups H and K of a group G, let
HK = {hk | h ∈ H, k ∈ K}.
Prove that every x ∈ HK can be written in a unique way as x = hk for
h ∈ H and k ∈ K iff H ∩ K = {1}, where 1 is the identity element of G.
(ii) If H and K are subgroups of G, and H is a normal subgroup of
G, prove that HK is a subgroup of G. Furthermore, if G = HK and
H ∩K = {1}, prove that G is isomorphic to H ×K under the multiplication
operation
(h1 , k1 ) · (h2 , k2 ) = (h1 k1 h2 k1−1 , k1 k2 ).
When G = HK, where H, K are subgroups of G, H is a normal subgroup
of G, and H ∩ K = {1}, we say that G is the semidirect product of H and
K.
−
→
(iii) Let (E, E ) be an affine space. Recall that the affine group of E,
denoted by GA(E), is the set of affine bijections of E, and that the linear
2.12. Problems 57
−
→ −→ −
→
group of E , denoted by GL( E ), is the group of bijective linear maps of E .
−
→ −→
The map f 7→ f defines a group homomorphism L: GA(E) → GL( E ),
and the kernel of this map is the set of translations on E, denoted as T (E).
Prove that T (E) is a normal subgroup of GA(E).
(iv) For any a ∈ E, let
GAa (E) = {f ∈ GA(E) | f (a) = a},
the set of affine bijections leaving a fixed. Prove that that GAa (E) is a
−
→
subgroup of GA(E), and that GAa (E) is isomorphic to GL( E ). Prove
that GA(E) is isomorphic to the direct product of T (E) and GAa (E).
Hint. Note that if u = f (a)a and tu is the translation associated with the
vector u, then tu ◦ f ∈ GAa (E) (where the translation tu is defined such
that tu (a) = a + u for every a ∈ E).
(v) Given a group G, let Aut(G) denote the set of homomorphisms
f : G → G. Prove that the set Aut(G) is a group under composition (called
the group of automorphisms of G). Given any two groups H and K and
a homomorphism θ: K → Aut(H), we define H ×θ K as the set H × K
under the multiplication operation
(h1 , k1 ) · (h2 , k2 ) = (h1 θ(k1 )(h2 ), k1 k2 ).
Prove that H ×θ K is a group.
Hint. The inverse of (h, k) is (θ(k −1 )(h−1 ), k −1 ).
Prove that the group H ×θ K is the semidirect product of the subgroups
{(h, 1) | h ∈ H} and {(1, k) | k ∈ K}. The group H ×θ K is also called the
semidirect product of H and K relative to θ.
Note. It is natural to identify {(h, 1) | h ∈ H} with H and {(1, k) | k ∈ K}
with K.
If G is the semidirect product of two subgroups H and K as defined in
(ii), prove that the map γ: K → Aut(H) defined by conjugation such that
γ(k)(h) = khk −1
is a homomorphism, and that G is isomorphic to H ×γ K.
−→ −→
(vi) Define the map θ: GL( E ) → Aut( E ) as follows:
θ(f ) = f,
−→
where f ∈ GL( E ) (note that θ can be viewed as an inclusion map). Prove
−
→ −
→
that GA(E) is isomorphic to the semidirect product E ×θ GL( E ).
−
→ −
→
(vii) Let SL( E ) be the subgroup of GL( E ) consisting of the linear
−
→
maps such that det(f ) = 1 (the special linear group of E ), and let SA(E)
be the subgroup of GA(E) (the special affine group of E) consisting of
−
→ −→
the affine maps f such that f ∈ SL( E ). Prove that SA(E) is isomorphic
58 2. Basics of Affine Geometry
−
→ −
→ −
→ −
→
to the semidirect product E ×θ SL( E ), where θ: SL( E ) → Aut( E ) is
defined as in (vi).
−
→
(viii) Assume that (E, E ) is a Euclidean affine space, as defined in Chap-
−→ −→
ter 6. Let SO( E ) be the special orthogonal group of E , as defined in
Definition 6.4.3 (the isometries with determinant +1), and let SE(E) be
the subgroup of SA(E) (the special Euclidean group of E) consisting of the
−
→ −→
affine isometries f such that f ∈ SO( E ). Prove that SE(E) is isomorphic
−
→ −
→ −→ −
→
to the semidirect product E ×θ SO( E ), where θ: SO( E ) → Aut( E ) is
defined as in (vi).
Problem 2.20 The purpose of this problem is to study certain affine maps
of A2 .
(1) Consider affine maps of the form
µ ¶ µ ¶µ ¶ µ ¶
x1 cos θ − sin θ x1 b1
7→ + .
x2 sin θ cos θ x2 b2
Prove that such maps have a unique fixed point c if θ 6= 2kπ, for all integers
k. Show that these are rotations of center c, which means that with respect
to a frame with origin c (the unique fixed point), these affine maps are
represented by rotation matrices.
(2) Consider affine maps of the form
µ ¶ µ ¶µ ¶ µ ¶
x1 λ cos θ −λ sin θ x1 b1
7→ + .
x2 µ sin θ µ cos θ x2 b2
Prove that such maps have a unique fixed point iff (λ + µ) cos θ 6= 1 + λµ.
Prove that if λµ = 1 and λ > 0, there is some angle θ for which either there
is no fixed point, or there are infinitely many fixed points.
(3) Prove that the affine map
µ ¶ µ ¶µ ¶ µ ¶
x1 8/5 −6/5 x1 1
7→ +
x2 3/10 2/5 x2 1
has no fixed point.
(4) Prove that an arbitrary affine map
µ ¶ µ ¶µ ¶ µ ¶
x1 a1 a2 x1 b1
7→ +
x2 a3 a4 x2 b2
has a unique fixed point iff the matrix
µ ¶
a1 − 1 a2
a3 a4 − 1
is invertible.
−
→
Problem 2.21 Let (E, E ) be any affine space of finite dimension. For
every affine map f : E → E, let Fix(f ) = {a ∈ E | f (a) = a} be the set of
fixed points of f .
2.12. Problems 59
that g ◦ f has the matrix representation (B, c)(A, b) w.r.t. (a0 , . . . , an ) and
(a000 , . . . , a00n ).
(4) Given two affine frames (a0 , . . . , an ) and (a00 , . . . , a0n ) for E, there is
a unique affine map h: E → E such that h(ai ) = a0i for i = 0, . . . , n, and
we let (P, ω) be its associated matrix representation with respect to the
frame (a0 , . . . , an ). Note that ω = a0 a00 , and that pi j is the ith coefficient
of a00 a0j over the basis (a0 a1 , . . . , a0 an ). Observe that (P, ω) is also the ma-
trix representation of idE w.r.t. the frames (a00 , . . . , a0n ) and (a0 , . . . , an ),
in that order. For any affine map f : E → E, if f has the matrix repre-
sentation (A, b) over the frame (a0 , . . . , an ) and the matrix representation
(A0 , b0 ) over the frame (a00 , . . . , a0n ), prove that
(A0 , b0 ) = (P, ω)−1 (A, b)(P, ω).
Given any two affine maps f : E → E and g: E → E, where f is invertible,
for any affine frame (a0 , . . . , an ) for E, if (a00 , . . . , a0n ) is the affine frame
image of (a0 , . . . , an ) under f (i.e., f (ai ) = a0i for i = 0, . . . , n), letting (A, b)
be the matrix representation of f w.r.t. the frame (a0 , . . . , an ) and (B, c) be
the matrix representation of g w.r.t. the frame (a00 , . . . , a0n ) (not the frame
(a0 , . . . , an )), prove that g◦f is represented by the matrix (A, b)(B, c) w.r.t.
the frame (a0 , . . . , an ).
Remark: Note that this is the opposite of what happens if f and g are
both represented by matrices w.r.t. the “fixed” frame (a0 , . . . , an ), where
g ◦ f is represented by the matrix (B, c)(A, b). The frame (a00 , . . . , a0n ) can
be viewed as a “moving” frame. The above has applications in robotics, for
example to rotation matrices expressed in terms of Euler angles, or “roll,
pitch, and yaw.”