3528031379
3528031379
3528031379
ii
iii
iv
v
Für Marlis
vii
Preface
Algebraic geometry is, loosely speaking, concerned with the study of zero sets of
polynomials (over an algebraically closed field). As one often reads in prefaces of intro-
ductory books on algebraic geometry, it is not so easy to develop the basics of algebraic
geometry without a proper knowledge of commutative algebra. On the other hand, the
commutative algebra one needs is quite difficult to understand without the geometric
motivation from which it has often developed.
Local analytic geometry is concerned with germs of zero sets of analytic functions,
that is, the study of such sets in the neighborhood of a point. It is not too big a surprise
that the basic theory of local analytic geometry is, in many respects, similar to the basic
theory of algebraic geometry. It would, therefore, appear to be a sensible idea to develop
the two theories simultaneously. This, in fact, is not what we will do in this book, as the
“commutative algebra” one needs in local analytic geometry is somewhat more difficult:
one has to cope with convergence questions. The most prominent and important example
is the substitution of division with remainder. Its substitution in local analytic geometry
is called the Weierstraß Division Theorem.
The above remarks motivated us to organize the first four chapters of this book
as follows. In Chapter 1 we discuss the algebra we need. Here, we assume the reader
attended courses on linear algebra and abstract algebra, including some Galois theory.
Probably the reader could just start with Chapter 2, referring to Chapter 1 only when
needed. Chapter 2 deals with the basics of affine algebraic geometry, up to say, Hilbert’s
Nullstellensatz and decomposition into irreducible components. In Chapter 3 we tackle
the corresponding basics for local analytic geometry. Here it is assumed that the reader
has knowledge of the theory of holomorphic functions in one variable. Although Chapter 4
is written in the language of local analytic geometry, most statements and proofs make
sense for affine algebraic geometry, too.
As an application of the general theory, in Chapter 5 we study the “simplest” germs
of local analytic spaces: plane curve singularities. Topics here are Puiseux expansion,
semigroups of curves, and resolutions of plane curve singularities.
Many of the topics in the rest of the book cannot usually be found in books on local
analytic geometry. The principle of conservation of number in Chapter 6 is such an exam-
ple. This is about interpretation of invariants in “families”. The simplest example of this
is the intersection number of two plane curve singularities. This intersection number can
be interpreted as the number of intersection points appearing after slightly “perturbing”
the singularities. To put the proofs in their proper context, we discuss and prove the
basic coherence theorems of Oka and Cartan, and the finite mapping theorem of Grauert
and Remmert. Next in line is the theory of standard bases in a power series ring, due to
Hironaka and Grauert. The corresponding notion in a polynomial ring is called Gröbner
basis, and has received much attention lately because of its applications in computer
algebra. We refer to the computer algebra system Singular [Singular 2000].1 One of
the main ideas is that standard bases allow a well-defined representative of an element
f modulo an ideal I, called the normal form of f . In particular, this normal form is zero
1 Our advise to the reader is to compute many examples of the theory in this book by means of a
computer algebra system, which can calculate in localizations of polynomials rings.
viii Preface
Some proofs in this book are new, or at least we have not seen them in this form in
the literature. We mention:
• The proof of the convergence of the Puiseux expansion by using Newton’s Lemma.
• The proof of the inversion theorem for Puiseux pairs.
• Direct construction of factors of a power series in two variables, without using the
Puiseux expansion.
• Certain parts of the proof of Grauert’s Approximation Theorem, in particular the
proof of Cartan’s Lemma by using standard bases.
• The fact that the resolution graph determines the semigroup of the curve.
• Parts of the proof of the Mather-Yau Theorem.
Assuming the reader has the knowledge mentioned above, it was our intention to
give either full proofs of all the statements in the book, or to put them into exercises
with sufficient hints. Except for the descriptive third section of Chapter 8, we hope to
have succeeded in this goal.
Acknowledgment: Various people read parts of the manuscript and gave us useful com-
ments. We mention Holger Cröni, Anne Frühbis Krüger, Tobias Hirsch, Rainer Kaenders
and Dorin Popescu. We thank Thomas Keilen, who also made some of the pictures for
us. We thank Pauline Bitsch for typing a large part of the manuscript. In particular we
thank Olaf Bachmann for helping us with LATEX, and explaining us the use of the cvs
system.
Last but not least, we thank our wives Jeannette and Marlis for being so patient
with us during the three years we were writing this book.
ix
Introductory Guide
2 3
6 5 7
8.1 8.2
8.3
9 10
x
Contents
Preface vii
Introductory Guide ix
Contents x
1 Algebra 1
1.1 Noetherian Rings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Modules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.3 Local Rings and Localization . . . . . . . . . . . . . . . . . . . . . . . 17
1.4 Primary Decomposition . . . . . . . . . . . . . . . . . . . . . . . . . . 22
1.5 Finite and Integral Extensions . . . . . . . . . . . . . . . . . . . . . . 34
Bibliography 361
1
1 Algebra
In the first chapter of the book we collect some facts from algebra which we will use in
this book. We assume that the reader is familiar with the concepts of ring, ideal in a
ring, principal ideal domain, prime ideal, unique factorization domain, etc. Throughout
this book, a ring will mean a commutative ring with 1. All homomorphisms of rings
always take 1 to 1. The first topic we discuss in this chapter is Noetherian rings, that
is, rings in which every ideal is finitely generated. The main thing to prove here is the
Hilbert Basis Theorem. This says that the polynomial ring K[x1 , . . . ,xn ], with K a field,
is Noetherian. Then we will discuss the basic notions of the theory of modules over a
ring, a notion which is analogous to vector spaces over a field. In the third section we
will discuss local rings, that is, rings which have a unique maximal ideal. They play
a very important role in this book. For us, the most important example will appear
in the third chapter, the power series ring in n variables C {x1 , . . . ,xn }. Localization
of rings will be defined. Geometric interpretations of localization will be deferred to
Chapter two. In section four we discuss primary decomposition of ideals in Noetherian
rings. Geometrically, this corresponds to the decomposition of an algebraic or analytic
set into irreducible components: these notions will also be discussed in chapters two and
three. Algebraically, it can be seen as a generalization of unique factorization in Z, or in
K[x1 , . . . ,xn ]. In the final section of this chapter we treat ring extensions. The notions of
finite and algebraic extensions of fields here correspond to finite and integral extensions
of rings. A reduced ring is called normal (or integrally closed), if every element of the
total quotient ring Q(R) which is integral over R is already in R. The most important
examples are K[x1 , . . . ,xn ], and, as we will see in Chapter three, the power series rings
C {x1 , . . . ,xn }. We will prove the Cayley-Hamilton Theorem, and the very important
theorem of the finiteness of integral closure, which is due to Dedekind and Kronecker. It
says that, under some weak hypothesis, the normalization of a ring R is finitely generated
as an R–module. This is a nontrivial statement, and it is only because of this proof that
we need some Galois theory as prerequisite.
I1 ⊂ I2 ⊂ · · · ⊂ Ik ⊂ · · ·
becomes stationary, that is, there exists an n > 0 such that Ik = In for all k ≥ n.
(3) Every nonempty set of ideals in R has a maximal element with respect to inclusion.
Proof.
(1) =⇒ (2) It is easily checked that the union ∪∞ k=1 Ik is an ideal again, which by as-
sumption is finitely generated, say by a1 , . . . ,as . For n big enough, one has ai ∈ In for
i = 1, . . . ,s. Then Ik = In for all k ≥ n.
(2) =⇒ (1) Suppose that there exists an ideal I in R which is not finitely generated.
Then we can inductively define elements as follows:
• Take a1 ∈ I.
• Given a1 , . . . ,ak−1 take an element ak ∈ I \ (a1 , . . . ,ak−1 ).
But then the chain (a1 ) ⊂ (a1 ,a2 ) ⊂ · · · is not stationary, which is a contradiction.
(2) =⇒ (3) If (3) were false, then there would exist a set of ideals such that for any ideal
Ik in this set we can find an ideal Ik+1 with Ik $ Ik+1 . This would lead to a chain of
ideals which is not stationary, a contradiction.
(3) =⇒ (2) Apply (3) to the set of ideals Ik for k ≥ 1. If In is a maximal element, then
Ik = In for all k ≥ n.
Theorem 1.1.3 (Hilbert’s Basis Theorem). Let R be a Noetherian ring. Then the poly-
nomial ring R[x] is also Noetherian.
Proof. By contradiction. Let I ⊂ R[x] be an ideal in R[x] which is not finitely generated.
We then inductively define a sequence of polynomials f1 ,f2 , . . . in I by:
• Let f1 ∈ I be a polynomial of minimal degree.
• Given f1 , . . . ,fk−1 , let fk be a polynomial of minimal degree in I \ (f1 , . . . ,fk−1 ).
Let ak be the leading coefficient of fk . Then we claim that for all k ≥ 2:
which would give us a chain of ideals in R which does not stabilize, in contradiction to the
assumption that R is Noetherian. Suppose on the contrary that ak ∈ (a1 , . . . ,ak−1 ) for
some k. We then can write ak = b1 a1 + . . . + bk−1 ak−1 for some bi ∈ R. The polynomial
is in I \ (f1 , . . . ,fk−1 ), but has degree smaller than fk , in contradiction to the choice of
fk .
The following corollary gives us the first class of examples of Noetherian rings using
the fact that fields are Noetherian.
1.1 Noetherian Rings 3
Corollary 1.1.4. Let R be a Noetherian ring, and I ⊂ R[x1 , . . . ,xn ] be an ideal. Then
the quotient ring R[x1 , . . . ,xn ]/I is Noetherian.
Proof. Using induction and Theorem 1.1.3 we obtain that R[x1 , . . . ,xn ] is Noetherian.
The rest is given as Exercise 1.1.20.
Definition 1.1.5.
From Corollary 1.1.4 we obtain that affine algebras over a Noetherian ring (or in
another language, rings of finite type over a Noetherian ring) are Noetherian. Our most
important case is the case of affine algebras over a field (which is Noetherian).
Now we will investigate another special class of Noetherian rings.
Definition 1.1.6. Let K be a field and R be a K–algebra. R is called an Artinian
K–algebra if R, considered as K–vector space, has dimK (R) < ∞.
Lemma 1.1.7. Let R be an Artinian K–algebra. Then R is Noetherian.
Proof. Let I ⊂ R be an ideal, then I is a finite-dimensional K–vector space because R
is a finite-dimensional K–vector space. Any K–basis of I, in particular, generates I as
ideal and, therefore, I is finitely generated.
Lemma 1.1.8. Let R be an Artinian K–algebra. Then any chain of ideals in R
I1 ⊃ I2 ⊃ . . . ⊃ Ik ⊃ . . .
becomes stationary, that is, there exists an n > 0 such that Ik = In for all k ≥ n.1
Proof. A chain of ideals is especially a chain of K–vector spaces. Such a chain becomes
stationary because R is a finite-dimensional K–vector space.
Examples 1.1.9. Let K be a field.
At the end of this section, we introduce the notion of graded rings and prove some
other useful tools.
1 This is the usual way to define an Artinian ring. We need later on only Artinian K–algebras in the
sense of 1.1.6 which makes proofs a little simpler. Note that C [x]/(x2 ) is a Q–algebra which is an
Artinian ring, but not an Artinian Q–algebra in the sense of our definition.
4 1 Algebra
such that
(1.2) Rg · Rh ⊂ Rg+h for all g,h ∈ G.
The elements of Rg are called homogeneous of degree g. A Z–graded ring is simply called
a graded ring.
Examples 1.1.11.
(1) Let S be a ring (for example a field), and R := S[x1 , . . . ,xn ] be the polynomial ring
in n variables with coefficients in S. Then R is a graded ring, by defining
( )
X
ν1 νn
Rd := cν1 ,...,νn x1 . . . xn ; cν1 ,...,νn ∈ S , and Rd = (0) for d < 0.
ν1 +···+νn =d
(2) Iν + Iµ = R for ν 6= µ.
Then the canonical map
m
M
R −→ R/Iν ,
ν=1
x 7→ (x + I1 , . . . ,x + Im )
is an isomorphism of rings.
1.1 Noetherian Rings 5
Proof. The first condition implies that the map is injective. To prove the surjectivity it is
enough to see that for all j there exists an x ∈ R such that x ∈ Iν for all ν 6= j and x ≡ 1
mod Ij . Without loss of generality, it suffices to prove this for j = 1. By assumption
we have Iν + I1 = R for all ν 6= 1. In particular we can find xν ∈ Iν and yν ∈ I1 for
ν = 2, . . . ,m such that xν + yν = 1. We define
m
Y m
Y
x := xν = (1 − yν ).
ν=2 ν=2
Exercises
1.1.14. Let I, J ⊂ R be ideals. The product I ·J ⊂ R is the ideal in R generated by the elements
x · y, where x runs through I and y runs through J. Show that I · J ⊂ I ∩ J.
1.1.15.
(1) Let I, J ⊂ R be ideals. Show that I : J := {x ∈ R : xJ ⊂ I} is an ideal in R, and that
I ⊂ I : J.
(2) Let a ∈ R. Prove that (I ∩ J) : a = (I : a) ∩ (J : a).
(3) Let I : f ∞ := {x ∈ R : xf s ∈ I for some s}. If R is Noetherian, show that I : f ∞ = I : f k
for some k ∈ N.
(4) Let I ⊂ R be an ideal, and x,y ∈ R. Prove that (I : x) : y = I : xy.
√
1.1.16. Let I ⊂ R be an ideal. The radical I of I is defined by:
√
I = {f ∈ R : f s ∈ I for some s ∈ N}.
√
An ideal is called radical if and only if I = I.
√
(1) Prove that I is an ideal.
√ p√ √
(2) Prove that I + J = I + J.
6 1 Algebra
√ √ √
(3) Find ideals I,J for which I + J % I + J .
√ √ √
(4) Prove that I ∩ J = I ∩ J. Generalize to finitely many ideals.
1.1.17. Let R be a ring. An element x ∈ R is called nilpotent if there exists an n ∈ N with
xn = 0. Show that the set of nilpotent elements is an ideal N .
The ring Rred := R/N is called the reduction of R. A ring is called reduced if it has no
nonzero nilpotent elements. Show that Rred is reduced.
P
1.1.18. Let R be a G–graded ring. An ideal I ⊂ R is called homogeneous if I = g∈G (I ∩ Rg ).
Prove the following.
(1) I is homogeneous if and only if it can be generated by homogeneous elements.
(2) Sums, products and intersections of homogeneous ideals are homogeneous.
1.1.19. An abelian group G is called ordered, if there is a total ordering < on G satisfying:
g < h =⇒ a + g < a + h for all a ∈ G.
For example, the group Z with its natural ordering is an ordered group. Let G be an ordered
group and R be a G–graded ring.
(1) Prove that a homogeneous ideal I is prime if and only if for all homogeneous elements
a,b ∈ R with ab ∈ I it follows that a ∈ I or b ∈ I.
(2) Prove that the radical of a homogeneous ideal is homogeneous.
1.1.20. Let R be a Noetherian ring and I ⊂ R be an ideal. Prove that R/I is Noetherian.
1.1.21. Consider two ideals I ⊃ J in a Noetherian ring R. Suppose that for all f ∈ I there
exists a k (depending on f) with f k ∈ J. Prove that some power of I is contained in J. In
√ k
particular I ⊂ I for some k.
1.2 Modules
The notion of a module over a ring is the analog of the notion of vector space over a
field, and is obtained by simply copying the definition. Here it is.
Definition 1.2.1. Let R be a ring. An R–module M is a set M with two operations:
(1) There is an addition + : M −→ M :
x,y ∈ M 7→ x + y ∈ M
As in the theory of rings, groups, and vector spaces, we want to consider modules
up to isomorphism. The definition should not surprise the reader:
• (x + N ) + (y + N ) := (x + y) + N ,
• a(x + N ) := ax + N .
Here x,y ∈ M and a ∈ R. One checks that addition and scalar multiplication are
well-defined.
(2) Let ϕ : M −→ N be a homomorphism of R–modules. We define
• the kernel of ϕ, notation Ker(ϕ), as
Ker(ϕ) := {x ∈ M : ϕ(x) = 0}.
One checks, exactly as in linear algebra, that Ker(ϕ) = {0} if and only if ϕ is
injective. Moreover, Ker(ϕ) is a submodule of M ,
• the cokernel of ϕ, notation Coker(ϕ), to be the quotient module N/ϕ(M ).
(3) The direct sum M ⊕ N of M and N is defined to be the direct sum of abelian
groups M ⊕ N = {(m,n),m ∈ M,n ∈ N }, with obvious addition, and with scalar
multiplication defined by a(m,n) := (am,an). It is sometimes also called the direct
product M × N . Similarly, one defines the direct sum ⊕i∈I Mi of R–modules. Here
I is any index set. As a set:
M
Mi = {(mi )i∈I with only finitely many mi 6= 0}
i∈I
and we have the obvious addition and scalar multiplication. One also has the direct
product Y
Mi = {(mi )i∈I }.
i∈I
with obvious addition and scalar multiplication. Here it is allowed that infinitely
many mi are nonzero.
(4) An R–module is called a free module if it is isomorphic to a direct sum ⊕i∈I R of
copies of R. Often we will have the case that I is a finite set of cardinality t. This
free module is called the free module of rank t, denoted by Rt . The fact that the
rank is well-defined is not trivial and will be shown in the exercises.
(5) Let M be an R–module, m1 , . . . ,mt be elements of M . The submodule (m1 , . . . ,mt )
generated by m1 , . . . ,mt is by definition the submodule of M given by
{a1 m1 + . . . + at mt : a1 , . . . ,at ∈ R}.
Similarly one defines the submodule generated by an arbitrary number of elements
of M .
1.2 Modules 9
(6) An R–module M is called finitely generated if there exist finitely many elements
m1 , . . . ,mt of M such that M = (m1 , . . . ,mt ). If this is the case, the elements
m1 , . . . ,mt are called generators of M . Finite generation of a module is equivalent
to having a surjective homomorphism
ϕ : Rt −→ M,
(a1 , . . . ,at ) 7→ a1 m1 + . . . + at mt .
Despite the obvious similarity with vector spaces, the general theory of modules over
a ring is much more difficult and much richer. The inability of to divide is just too much
an obstacle. For example, we know from the theory of vector spaces, that any finitely
generated vector space over a field K is isomorphic to K t for some t. This is certainly
not true for rings, as the following trivial example shows:
Example 1.2.5.
(1) The Z–module (that is, abelian group) Z/(2) is not free. Indeed a free group either
has 1 or an infinite number of elements.
(2) More generally, any module M whose torsion submodule Tors(M ) is nontrivial, is
not a free module. Indeed, suppose the converse, that is, let {mi }i∈I be a basis of
M . Let m ∈ Tors(M ), m 6= 0 and assume
10 1 Algebra
m = a1 m 1 + . . . + at m t .
As m 6= 0 we have ai 6= 0 for some i. Because m ∈ Tors(M ) there exists a nonze-
rodivisor a ∈ R with am = 0. We obtain
a · m = aa1 m1 + . . . + aat mt = 0.
By definition of basis aai = 0, which is a contradiction to the fact that a is a
nonzerodivisor.
One could ask the question whether at least finitely generated torsion free modules
are free. But this is also much too much to ask for, as the following simple lemma shows:
Lemma 1.2.6. Let I ⊂ R be an ideal, with R an integral domain. Then I is a free
module if and only if I is principal.
Proof. The case where I = (0) is trivial. Otherwise, suppose that I is principal, that is,
I = (f ) for some f ∈ R. Then the map: R −→ I given by a 7→ af gives an isomorphism of
R–modules. It is injective because R is supposed to be an integral domain. To prove the
converse, let I be a free R-module. If the rank is one, let ϕ : R −→ I be an isomorphism.
It is then immediate that I is the ideal generated by ϕ(1). We are finished if we show
that an ideal cannot be isomorphic to a free module of rank at least two. For this, it
suffices to show that there does not exist an injective map α : Rn −→ I, as soon as
n ≥ 2. Suppose the converse. Without loss of generality we may assume n = 2. Let
f = α(1,0), and g = α(0,1). Since α is injective, both f and g are nonzero. But then
α(g, − f ) = gf − f g = 0, a contradiction.
So for example the ideal (x,y) in the polynomial ring K[x,y] is not a free K[x,y]–
module. From this lemma one might hope that at least for principal ideal domains (that is,
domains where every ideal can be generated by one element), a finitely generated torsion
free module is free. This is indeed the case, in fact one has the following classification
theorem:
Theorem 1.2.7. Let R be a principal ideal domain, and M be a finitely generated R–
module. Then
M∼ = Rs ⊕ Tors(M ).
The torsion module has the following form:
Tors(M ) ∼
= R/(a1 ) ⊕ · · · ⊕ R/(at ) with (a1 ) ⊃ (a2 ) ⊃ . . . ⊃ (at ) 6= (0).
For a proof, we refer to Lang, [Lang 1965], Chapter XV, §2. At one point in the
book we will need the statement that a finitely generated torsion free module over the
power series ring in one variable is free. This is a special case of the theorem, for which
an easier proof is available. We refer to 1.3.9.
Of course, the finitely generated free modules Rp with p ∈ N are the simplest R–
modules. Just as in linear algebra, maps between such modules can be given by matrices.
Consider Rp with basis {e1 , . . . ,ep }, and Rq with basis {f1 , . . . ,fq }. Let ϕ : Rp −→ Rq
be an R–module homomorphism. The matrix of ϕ (with Pq respect to the chosen bases) is
the matrix (aij ) with entries in R given by ϕ(ei ) = j=1 aji fj . All the results in linear
algebra which do not use division in their proofs carry over to the general case of rings. In
particular, the determinant of a map between free modules of the same rank is defined.
On the other hand, because in a general ring we cannot divide, Cramer’s rule does not
carry over. But at least we can save the following statement.
1.2 Modules 11
Theorem 1.2.8 (Cramer’s Rule). Let a ring R, an n × n matrix A = (aij ) with co-
efficients in R and vectors (x1 , . . . ,xn ) and (b1 , . . . ,bn ) with entries in R be given. For
all j let Aj be the matrix obtained from A by replacing its j–th column2 by (b1 , . . . ,bn )t .
Consider the system of equations
n
X
aij xj = bi i = 1, . . . ,n.
j=1
Then
det(A)xj = det(Aj ) for j = 1, . . . ,n..
For a proof, look in any textbook on linear algebra, and forget to do the division at
the end of the proof, or do the proof yourself, Exercise 1.2.28.
Definition 1.2.9. Let A be an n × n matrix with entries in R. The adjoint matrix
Aad = (aad
ij )
is defined by
aad
ij = (−1)
i+j
det(Aij )
where Aij is the matrix obtained from A by deleting the i–th column and j–th row.
For a matrix A with adjoint matrix Aad we have the following theorem, which follows
from Cramer’s rule:
Theorem 1.2.10.
Aad A = AAad = det(A) · Idn ,
where Idn denotes the n × n unit matrix.
So, up to a factor (the determinant), the adjoint matrix describes “the inverse ma-
trix”.
Definition 1.2.11. A (finite or infinite) sequence of R–modules and homomorphisms
αk−1 k α
· · · −→ Mk−1 −−−→ Mk −−→ Mk+1 −→ · · ·
1 f′ 2 f′ 1 δ 2 g g
0 → Ker(h1 ) −→ Ker(h2 ) −→ Ker(h3 ) −→ Coker(h1 ) −→ Coker(h2 ) −→ Coker(h3 ) → 0
is exact.
In this lemma f1′ and f2′ are restrictions of f1 and f2 , and g 1 and g2 are induced
by g1 and g2 . The map δ is defined as follows. Let x ∈ Ker(h3 ), that is, x ∈ M3 with
h3 (x) = 0. As the upper row is exact, it follows that f2 is surjective, hence we can
take y ∈ M2 with f2 (y) = x. Because of the commutativity of the diagram we have
g2 h2 (y) = h3 f2 (y) = h3 (x) = 0. Therefore, h2 (y) ∈ Ker(g2 ). Since the lower row is exact,
we have Ker(g2 ) = Im(g1 ). Hence we can find w ∈ N1 with g1 (w) = h2 (y). Now we put
δ(x) to be the class of w in Coker(h1 ). We have to prove that δ is well-defined and that
this sequence is exact. The proof of the Snake Lemma is left as Exercise 1.2.27.
Definition 1.2.13. Let R be a ring, and M be an R–module. Then M is called Noethe-
rian if every submodule N ⊂ M is finitely generated.
This just means that we can find finitely many elements m1 , . . . ,mt ∈ N such that
N = R · m1 + . . . + R · mt = (m1 , . . . ,mt ).
We could rephrase this by saying that, in a short exact sequence of modules, if any
two are Noetherian, then the third is Noetherian.
Noetherian modules over a Noetherian ring are easy to construct:
Lemma 1.2.15. Let R be a Noetherian ring, and M be a finitely generated R–module.
Then M is a Noetherian R–module.
f
3
M1 −
→ M2
A diagram h1 ↓ is called commutative if h2 f = gh1 , that is it does not matter which route one
h2 ↓
g
N1 −
→ N2
takes along the diagram.
1.2 Modules 13
Proof. As M is a quotient of a free module, we only have to prove the lemma for M = Rt ,
by the previous lemma. This will be done by induction on t. The case t = 1 is the
assumption. Apply the previous lemma to the exact sequence
0 1
1 0 0 1 0 ... 0 1
B0C
B.C @ .. .. .. .. A
@.A
. .. . .
0 0 0 0 ... 1
0 → R −−−−→ Rt −−−−−−−−−→ Rt−1 → 0.
Definition 1.2.16. (1) The tensor product M ⊗R N is an R–module with the following
universal property:
There exists a canonical bilinear map λ : M × N −→ M ⊗R N . If ϕ : M × N −→ P
is any bilinear map, then there exists a unique homomorphism h : M ⊗R N −→ P such
that h ◦ λ = ϕ.
λ /
M × NM M ⊗R N
MMM
MMM h
ϕ MMM
M&
P.
One can give a direct construction of the tensor product as follows. M ⊗R N is the
quotient of the free R–module generated by {m ⊗ n : m ∈ M, n ∈ N } divided out by the
submodule of the tensor product relations generated by
{(am + bn) ⊗ (cp + dq) − ac · m ⊗ p − ad · m ⊗ q − bc · n ⊗ p − bd · n ⊗ q | a,b,c,d ∈ R,
m,n ∈ M, p,q ∈ N }.
The bilinear form λ : M × N −→ M ⊗R N is defined by λ(m,n) = m ⊗ n and has the
property mentioned above (Exercise 1.2.45).
(2) Let S be an R–algebra, then M ⊗R S is an S–module with the multiplication
s · m ⊗ s′ := m ⊗ (ss′ ) (Exercise 1.2.44). For example, if V is a real vector space, then
V ⊗R C is a C –vector space, called the complexification of V .
(3) Let S,T be R–algebras, then S ⊗R T is an R–algebra with the multiplication
(s ⊗ t) · (s′ ⊗ t′ ) := (ss′ ) ⊗ (tt′ ). The simplest example here is the case that R = k is a
field, S = k[X] and T = k[Y ]. Then S ⊗R T = k[X,Y ].
Let i1 : S −→ S ⊗R T , i2 : T −→ S ⊗R T be defined by i1 (s) = s ⊗ 1, i2 (t) = 1 ⊗ t,
then
i1
S ⊗O R T o SO
i2
T o R
is commutative, and universal (cf. Exercise 1.2.45), that is: let A be an R–algebra and
j1 : S −→ A, j2 : T −→ A be homomorphisms such that
j1
AO o SO
j2
T o R
14 1 Algebra
(1) The proof that It (ϕ) is independent on the choice of the bases is left as Exercise
1.2.47.
ϕ ψ
(2) Let Rn −→ Rm −→ M −→ 0 and Rs − → Rt −→ M −→ 0 be exact. Then Im−ν (ϕ) =
It−ν (ψ) for all ν. The proof uses some facts on so-called local rings, to be treated
in the next section. The proof is left as Exercise 1.3.28.
(3) If S is an R–algebra, then Iν (ϕ ⊗ idS ) = Iν (ϕ)S. This is left as Exercise 1.2.47.
Exercises
1.2.20. Prove that the ring of integers Z and the polynomial ring in one variable K[x] over a
field K are principal ideal domains.
(Hint: Use division with remainder.)
1.2.21.
(1) Let ϕ : Rp → Rs be an isomorphism between free R–modules. Show that p = s.
(Hint: Suppose not, for example p > s. Let ψ : Rs → Rp be the inverse of ϕ. Extend ϕ
resp. ψ to maps
ϕ′ : Rp → Rs ⊕ Rp−s
resp.
ψ ′ : Rs ⊕ Rp−s → Rp
in the obvious way. Derive a contradiction by looking at determinants.)
(2) Conclude that the rank of a free module is well-defined.
1.2.23. Let R be a subring of S. Show that one can, in a natural way, view S as an R–module.
Put Nk = Im(αk ) = Ker(αk+1 ). Show that for all k one has short exact sequences:
0 −→ Nk−1 −→ Mk −→ Nk −→ 0.
(So any exact sequence can be split into short exact sequences.)
0 −→ M1 −→ M2 −→ · · · −→ Ms −→ 0
P
of finite-dimensional K–vector spaces. Prove that sk=1 (−1)k dimK (Mk ) = 0.
1.2.30.
(1) Let M be a K[X]–module, which is finite-dimensional as a K–vector space. Show that M
is a torsion module.
(2) Prove the theorem on Jordan normal form of matrices over an algebraically closed field,
using the classification of finitely generated modules over a principal ideal domain, see
Theorem 1.2.7.
1.2.31. Let K be a field, S be a K–algebra, and M be an S–module. Suppose that Ann(M ) = (0)
and that dimK (M ) < ∞. Prove that dimK (S) < ∞.
(Hint: Show that one can embed S into HomS (M,M ).)
1.2.32. Let N be an R–module. Prove that the map φ : HomR (R,N ) −→ N , φ(ϕ) = ϕ(1), is
an isomorphism.
· · · −→ Mk+1 −→ Mk −→ Mk−1 −→ · · ·
of R–modules. Let x ∈ R be a nonzero divisor of Mk for all k. Prove that the induced sequence
is also exact.
(Hint: Split the exact sequence in many short exact sequences and use the Snake Lemma.)
16 1 Algebra
is exact.
is exact.
1.2.40. Prove that the canonical map φ : HomR (M ⊗R N,P ) −→ HomR (M, HomR (N,P )) de-
fined by φ(λ)(m)(n) := λ(m ⊗ n) is an isomorphism.
n n
1.2.41. Let M,N be R–modules and M = ⊕ Mi . Prove that M ⊗R N ∼
= ⊕ (Mi ⊗R N ).
i=1 i=1
1.2.44. Let M be an R–module and S be an R–algebra. Prove that with the multiplication
s · m ⊗ s′ = m ⊗ ss′ , M ⊗R S is an S–module.
commutes, i,π the canonical injection resp. projection. In this case one says that M is a direct
summand of N .
1.3 Local Rings and Localization 17
Definition 1.3.1. A ring R is called local if it has a unique maximal ideal m. One often
says that (R,m) is a local ring, to indicate that m is its unique maximal ideal.
Later on in this section we will see how to construct local rings from general rings by
means of “localization”. In this book, the local rings we are dealing with most frequently
are power series rings and their quotients, see Section 3.1. Here we define those rings for
the case of one variable.
P∞ 1.3.2. The formal power series ring C [[x]] in one variable consists of ele-
Definition
ments k=0 ak xk , with ak ∈ C . We have the obvious addition and multiplication.
P∞ The
convergent power series ring C {x} in one variable consists of elements k=0 ak xk , with
ak ∈ C , that converge in some open neighborhood of 0 in C .
Note that in the formal power series ring as well as in the convergent power series
ring, the elements which do not lie in (x), that is, those with nonzero constant term,
are units. Therefore, the fact that the power series rings are local rings follows from the
following lemma.
Lemma 1.3.3. Let R be a ring, and m ⊂ R be an ideal. Then R is local, with maximal
ideal m, if and only if R \ m is the set of units of R.
Proof. Suppose R is local with maximal ideal m. Take an element a ∈ R. If (a) 6= R,
then (a) is contained in a maximal ideal, which by definition of a local ring must be m.
Therefore, if a ∈
/ m, then (a) = R, that is, a is a unit. Moreover, if a ∈ m, then a is not
a unit, as otherwise (a) = R.
Suppose, on the other hand, that R \ m is the set of units. Take an ideal I 6= R. Then
the ideal I does not contain units, and therefore must be contained in m. This shows
that m is the unique maximal ideal of R.
Theorem 1.3.4 (Nakayama’s Lemma). Let (R,m) be a local ring and M be a finitely
generated R–module with m · M = M . Then M = 0.
Proof. By contradiction. Let m1 , . . . ,mt be a minimal number of generators for M . As
M = m · M there exist a1 , . . . ,at ∈ m such that mt = a1 m1 + . . . + at mt . Therefore
A := {J ideal in R : J ∩ I = mI}.
Our ring R is Noetherian, so that this set has a maximal element which we call J.
Obviously mI ⊂ J. Note that it suffices to prove that mk ⊂ J for certain k. Indeed, as
obviously I ⊂ mk , it follows I ⊂ mk ⊂ J, hence I ⊂ J ∩ I = mI, and we can apply
Nakayama. As m is finitely generated, it suffices to find for each f ∈ m an α with f α ∈ J.
This we will do by showing J + (f α ) = J by using the maximality property of J. Thus it
remains to show that (J +(f α ))∩I = mI. The inclusion ⊃ is obvious. As R is Noetherian,
the chain J : f ⊂ J : f 2 ⊂ · · · stabilizes. Hence there exists an α with J : f α = J : f α+1 ,
and this is the α we need. Now let x ∈ (J + (f α )) ∩ I, say x = y + af α ∈ I for y ∈ J. It
follows that f α+1 a = f x − f y ∈ mI + J = J, as mI ⊂ J. Hence a ∈ J : f α = J : f α+1 .
Thus f α a ∈ J, so that x ∈ J follows. As x ∈ I it follows x ∈ J ∩ I = mI. This holds
for all x ∈ (J + (f α )) ∩ I so that (J + (f α )) ∩ I ⊂ mI follows. This is what we had to
show.
Another corollary is the following lemma on the number of generators of a module
over a local ring. The proof is left as Exercise 1.3.18.
Corollary 1.3.6. Let M be a finitely generated module over a local ring (R,m).
(1) Let f1 , . . . ,fs ∈ M such that the classes of the fi generate M/mM as R/m–vector
space. Then f1 , . . . ,fs generate M .
(2) The minimal number of generators of M is equal to dimR/m (M/mM ).
Definition 1.3.7. Let R be a ring, and M be an R–module. A presentation of M is a
short exact sequence
α β
(1.3) G −→ F −→ M −→ 0
of R–modules, in which F and G are free R–modules. In case R is a local ring, and M
is finitely generated, a presentation as in 1.3 is called minimal if the rank of F is equal
to the minimal number of generators of M , that is, equal to dimR/m (M/mM ), see 1.3.6.
This is equivalent to the statement Im(α) ⊂ m · F .
ϕ
Proposition 1.3.8. Let R be a local ring and Rn −→ Rm −→ M −→ 0 be a presentation
of M . The following conditions are equivalent:
(1) M is a free module of rank m − r,
(2) Im(ϕ) is a free module of rank r and a direct summand of Rm ,
(3) Ir (ϕ) = R and Ir+1 (ϕ) = (0).
Proof.
(3) ⇒ (2) The fact that Ir (ϕ) = R and R is local implies that for a matrix A representing
ϕ there is an r–minor which is not in the maximal ideal of R, hence a unit. By renumbering
we may assume that this minor is obtained by taking the determinant of the first r rows
ad
and columns. We write A = ( B∗ ∗∗ ) with B an r × r–matrix. Then det1 B B0 Idn−r 0 gives
n
an automorphism of R and we multiply A from the right with this matrix. So we may
assume that A is of type ( Id∗r ∗∗ ). As in linear algebra, we can now do row and column
operations, corresponding to a change of basis in target and source. We may then assume
that Ais of type Id0r 0∗ . As by assumption Ir+1 (ϕ) = 0, it then follows that A is of type
Idr 0 . This immediately implies (2).
0 0
1.3 Local Rings and Localization 19
0 / K / F / M / 0
·x ·x ·x
0 / K / F / M / 0
where ·x denotes multiplication by x. By assumption, x is not a zero divisor of M , so
that the kernel of ·x : M → M is zero. It follows from the Snake Lemma that we have
an exact sequence:
From the remark above, we have that F/xF → M/xM is an isomorphism. It follows that
K/xK = 0. From Nakayama, in the version of Exercise 1.3.17 it follows that K = 0 or,
equivalently, that M ∼
= F is free.
There is a procedure to construct local rings from a given ring, called localization.
This is a special case of forming rings (and modules) of fractions, which is itself a gener-
alization of defining the quotient field of an integral domain. In order to define rings of
fractions, we have to introduce the concept of a multiplicatively closed set.
Definition 1.3.11. Let R be a ring. A set S ⊂ R is called multiplicatively closed if and
only if it satisfies the following conditions.
• 1 ∈ S,
• If f,g ∈ S, then f · g ∈ S.
20 1 Algebra
Definition 1.3.12. Let R be a ring and S be a multiplicatively closed set. We will define
the ring of fractions RS , also denoted by S −1 R. As a set it is the following:
RS = { sr : r ∈ R, s ∈ S}
(1) Let R be an integral domain, and S = R \ {0}. Then RS is the quotient field Q(R)
of R. It is indeed a field, as Q(R) = { ab : a,b ∈ R, b 6= 0}. So if ab 6= 0, then a 6= 0,
and ab ∈ Q(R).
(2) More generally, for a ring R, one can take as multiplicatively closed set S the set of
all nonzero divisors. The resulting ring RS is called the total quotient ring, or total
ring of fractions of R, notation Q(R). We will prove in Proposition 1.4.27 that for
a reduced Noetherian ring Q(R) is a direct sum of fields.
(3) Consider the multiplicatively closed set S = {1,f,f 2 ,f 3 , . . .} for f ∈ R. In this case
one usually writes Rf instead of RS .
(4) Let p be a prime ideal of R. According to the definition of a prime ideal the set
R \ p is a multiplicatively closed set. The resulting ring of fractions is denoted by
Rp (instead of the “correct” RR\p ), and is called the localization of R in p. So
Rp = { ab : a ∈ R, b ∈
/ p}.
At first it might be surprising that the following lemma holds generally for prime
ideals, and not only for maximal ideals. Nevertheless, the proof is easy!
Lemma 1.3.14. Let R be a ring and p ⊂ R be a prime ideal. Then Rp is a local ring,
with maximal ideal pRp = { ab : a ∈ p, b ∈
/ p}.
Proof. By 1.3.3, it suffices to show that the elements of pRp are the nonunits of Rp . An
element in Rp \ pRp is of type ab , where a,b ∈
/ p. But then ab ∈ Rp , so that ab is a unit. Of
a
course, every element b , with a ∈ p is not a unit in Rp .
1.3 Local Rings and Localization 21
The correspondence is given by the following rules. Given a prime ideal pS ⊂ RS , the
ideal p := pS ∩ R is a prime ideal in R. Given a prime ideal p in R whose intersection
with S is empty, its localization pS is a prime ideal in RS .
Proof. Suppose p is a prime ideal in R, with p ∩ S = ∅. Then the inclusion pS $ RS is
strict. This is because otherwise 1 = as , with a ∈ p, s ∈ S. Hence there would exist a
t ∈ S such that t(s − a) = 0, that is ta = ts. But ts ∈ S, and ta ∈ p, which by assumption
cannot be the case. Now suppose as bt ∈ pS . It follows that ab ∈ p, and therefore either
a ∈ p or b ∈ p. Therefore either as ∈ pS or bt ∈ pS . The converse is left to the reader.
Exercises
1.3.16. Prove that IRf ∩ R = I : f ∞ (see 1.1.15).
1.3.17 (Nakayama’s Lemma). Let R be a ring, I ⊂ R be an ideal which is contained in all the
maximal ideals of R, and M be a finitely generated R–module.
(1) Suppose that M = I · M . Prove that M = 0.
(2) Let N be a submodule of M such that M = N + I · M . Prove that M = N .
1.3.23. Let R be a ring, and p ⊂ R be a prime ideal. We define the n–th symbolic power p(n)
of p by
p(n) := pn Rp ∩ R.
We obviously have pn ⊂ p(n) .
(1) Let R = k[x,y,z]/(z 2 − xy), and p = (x,z). Show that p is prime, and that p(2) = (z 2 ,x).
In particular p2 6= p(2) .
/ p and f · g ∈ p(n) . Show that g ∈ p(n) .
(2) Let f,g ∈ R be given such that f ∈
1.3.25. Let R be a ring. The Jacobson radical J(R) of R by definition is the intersection of all
maximal ideals of R. Show that x ∈ J(R) implies that 1 + x is a unit.
1.3.27. Let R be a ring and p1 , . . . ,pm be prime ideals. Let (0) 6= M be a finitely generated
R–module such that Mp j 6= (0) for all j. Prove that there exists an x ∈ M such that x 6∈ pj Mp j
for all j.
(Hint: Use the idea of the proof of prime avoidance, see 1.1.13: Use induction on m and assume,
therefore, y 6∈ pj Mp j for j = 1, . . . ,m − 1 and a suitable y. Assume the worst case, that is,
y ∈ pm Mp m . You may also assume that ∩m−1 m−1
j=1 pj 6⊂ pm and choose r ∈ ∩j=1 pj \ pm . Let
′ ′
M = (y,y2 , . . . ,yn ) and M = (y,ry2 , . . . ,ryn ) then Mp m = Mp m and y + ryi 6∈ pj Mp j for i ≥ 2
and j = 1, . . . ,m − 1. One of these elements is the element we are looking for because, otherwise,
y,y2 , . . . ,yn ∈ pm Mp m , that is, Mp m = (0).)
λ µ
ϕ2
Rn2 / Rm2 /M
commutes. Use Remark 1.2.19 (1) for this` case. Now
´ show that for a suitable choice of basis in
Rn2 and Rm2 the map ϕ2 has a matrix ϕ02 Id 0 0
0
with ϕ2 : Rn1 −→ Rm1 and Id the identity
matrix.)
with pairwise different prime numbers pi . The pi and the νi are uniquely determined (up
to permutation). It is easily shown that equivalently
Remarks 1.4.2.
(1) It follows directly from the definition that a prime element √ is irreducible. The
converse is not true in general. For example, in the
√ ring Z[√ −5] the element 2 is
irreducible, but not prime as 2 | 6, and
√ 6 = (1 + √ −5)(1 − −5). However, it can
be shown that 2 does not divide 1 + −5 and 1 − −5. To give a different example,
consider the ring C [x,y,z]/(z 2 − xy). Then we have z | xy, but z does not divide x
or y.
The converse does hold however in a unique factorization domain.
(2) One could, alternatively, define a unique factorization domain as a domain in which
each nonzero element which is not a unit has (up to units and renumbering) a unique
decomposition into irreducibles. This will be shown in the exercises.
(3) The existence of a greatest common divisor will be proved in Exercise 1.4.37.
Example 1.4.3. Any principal ideal domain, especially Z and the polynomial ring K[x]
over a field K in one variable, is a unique factorization domain. This will be proved in
Exercise 1.4.35. Moreover, it follows from the Lemma of Gauß, to be proved next, that
the polynomial ring K[x1 , . . . ,xn ] over any field K is a unique factorization domain.
Lemma 1.4.4 (Lemma of Gauß). Let R be a unique factorization domain. Then R[x]
is a unique factorization domain.
One of the main steps in the proof of this Theorem is the following lemma.
Lemma 1.4.5. Let R be a unique factorization domain, K = Q(R) its quotient field,
and f ∈ R[x] irreducible. Then f is also irreducible considered as element of K[x].
24 1 Algebra
Proof. Suppose the converse. Let f = g · h, g,h ∈ K[x] and deg(h) < deg(f ), deg(g) <
deg(f ). We choose c ∈ R such that cg =: g, ch =: h are in R[x] and obtain c2 f = gh.
Now c2 is not a unit in R. This is because f is by assumption irreducible in R[x]. So
we can write c2 = p1 · · · ps , pi ∈ R prime. Exercise 1.4.39 implies p1 |g or p1 |h. So we
can find g ′ and h′ in R[x] with p2 · · · ps f = g ′ h′ . Repeating this argument, shows that
f = g0 · h0 , deg(g0 ) = deg(g), deg(h0 ) = deg(h). This is a contradiction to the fact that
f is irreducible in R[x].
Proof of the Lemma of Gauß 1.4.4. Let K = Q(R) be the field of fractions ofP R. Let f ∈
m k
R[x]. We have to prove that f = f1 · · · fn , fP
i ∈ R[x] prime. We can write f = k=0 ak x .
m ak k
Define d = gcd(a0 , . . . ,am ). Then f = d · k=0 d x . We know, because R is a unique
factorization domain, that d is a unit or a product of finitely many primes. Because of
Exercise 1.4.39 they are also prime in R[x]. Therefore, we may assume that d = 1.
Let f = f1 · · · fn , fi ∈ R[x] be irreducible.
P Such a decomposition into finitely many
polynomials exists because deg(f ) = deg(fi ). Therefore, we may assume that f is
irreducible and we have to prove that f is prime. From Lemma 1.4.5 it follows that f is
irreducible in K[x]. From Exercise 1.4.33 it follows that f is prime in K[x].
Now we are ready to prove the theorem. Let f | g · h, g,h ∈ R[x]. Then f | g or f | h
in K[x]. Without loss of generality, suppose that f | h. Then it follows that f · b = h
for a suitable b ∈ K[x].Pm Then we choose c ∈ R such that c · b =: b ∈ R[x] and obtain
f b = c · h. Now f = k=0 ak xk and gcd(a0 , . . . ,am ) = 1 implies, using 1.4.39 that c | b,
that is, b ∈ R[x].
Corollary 1.4.6. Let R be a field or Z. Then R[x1 , . . . ,xn ] is a unique factorization
domain.
In the rest of the section, we discuss a generalization of this result for an arbitrary
Noetherian ring. From now on we fix a Noetherian ring R. First we need to find a
substitute of the notion of prime power for a general ring.
Definition 1.4.7. An ideal q $ R is called a primary ideal if ab ∈ q, a 6∈ q implies bn ∈ q
for some n ∈ N.
An ideal p is prime, if R/p is a domain, that is, has no zero divisors. Similarly, an
ideal I is primary, if every zerodivisor of R/I is nilpotent. An ideal I is radical , exactly
if R/I does not have any nilpotent elements, except for the zero element. This means
exactly that R/I is a reduced ring.
Lemma 1.4.8.
Proof.
(1) Let ab ∈ m and a 6∈ m. Then (a) + m = R because of the maximality of m. Let
1 = r · a + m for some r ∈ R, m ∈ m, then b = r · a · b + mb ∈ m because a · b and m · b ∈ m.
This implies that m is a prime ideal.
(2) Now let a · b ∈ q with a 6∈ q. If bs 6∈ q for all s ∈ N, that is, b 6∈ m, then again
(b) + m = R. Thus 1 = rb + m for some r ∈ R and m ∈ m. Hence a = rba + m · a.
Iterating this we obtain a = r(1 + m + m2 + · · · + mk )ba + mk+1 a for all k > 0. As
√
q = m, we have mk+1 ∈ q for some k (R is Noetherian). This is in contradiction to
a∈/ q. This implies b ∈ m and thus bs ∈ q for some s.
√ √
(3) Let ab ∈ q and a 6∈ q. Then (ab)m ∈ q for some m but am 6∈ q. This implies that
m s √
for some s, (b ) ∈ q and, therefore, b ∈ q.
(4) Exercise 1.4.42.
Remarks 1.4.9.
(1) It is not true that any power of a prime ideal is primary. For example, the ideal
p = (x,z) is prime in R := K[x,y,z]/(xy − z 2 ), because R/p ∼ = K[y] does not have
zero divisors. However, p2 is not primary. To see this, note that z 2 = xy ∈ p2 , x 6∈ p2
and no power of y is in p2 .
(2) It is always true from (f ) is a prime ideal it follows that f is an irreducible element.
The converse does not always hold, but it holds if R is a unique factorization
domain. Furthermore (f m ) is primary in this case. This applies for example for
rings like K[x1 , . . . ,xn ] or Z[x1 , . . . ,xn ] by the Lemma of Gauß.
One of the main goals in this section is to show that any ideal in a Noetherian ring is
a finite intersection of primary ideals. Before proving this, we need the following lemma.
Lemma 1.4.10. Let I ⊂ R be an ideal, and f ∈ R such that I : f = I : f 2 . Then
(I : f ) ∩ (I,f ) = I.
Proof. The inclusion I ⊂ (I : f ) ∩ (I,f ) is obvious. For the other inclusion, let g ∈ (I :
f ) ∩ (I,f ). We have to prove g ∈ I. Let g = x + y · f, x ∈ I, y ∈ R. As g ∈ I : f it follows
that f g = xf + y · f 2 ∈ I. Hence y · f 2 ∈ I, so that y ∈ I : f 2 = I : f . It follows that
yf ∈ I, and therefore g ∈ I.
Theorem 1.4.11 (Primary Decomposition). Let R be a Noetherian ring and I $ R
be an ideal. Then there exist finitely many primary ideals q1 , . . . ,qr in R such that I =
q1 ∩ . . . ∩ qr .
Proof. Suppose the converse. Then the set A of ideals for which the theorem is not true,
that is,
A := {I ideal in R : I is not an intersection of finitely many primary ideals},
is nonempty. The ring R is Noetherian by assumption, and therefore by 1.1.2 the set A
has a maximal element with respect to inclusion. Let I be such a maximal element. In
particular, I is not primary! Therefore, there exist a,b ∈ R such that a · b ∈ I, a 6∈ I
and bn 6∈ I for all n. We have a chain of ideals I : b ⊂ I : b2 ⊂ · · · . As our ring R is
Noetherian, we get by 1.1.2 that this chain is stationary. Hence, there exists an n with
the property I : bn = I : bn+1 = I : bn+2 = · · · = I : b2n . We use Lemma 1.4.10 to obtain
I = (I : bn ) ∩ (I,bn ).
26 1 Algebra
I : d = (J ∩ q) : d = (J : d) ∩ (q : d) = q : d.
(1) For a primary ideal q, we have now two definitions of an associated prime of q. We
will prove in Theorem 1.4.20 that the two definitions coincide.
(2) Alternatively, but very similarly, one can define the set of associated primes by
The First Uniqueness Theorem 1.4.16 follows directly from the following theorem.
28 1 Algebra
Let {i1 , . . . ,it } be the set of indices with d ∈ / qij . This set is not empty because d ∈
/I=
∩qi . Then
p = (qi1 : d) ∩ · · · ∩ (qit : d).
p √
By Exercise 1.4.47, qik : d is pik –primary. In particular qik : d = qik = pik . By
Exercise 1.1.16 part 4 it follows that:
√ p p √ √
p = p = qi1 : d ∩ · · · ∩ qit : d = qi1 ∩ · · · ∩ qit = pi1 ∩ · · · ∩ pit
As p is a prime ideal it follows that p ⊃ pik for some k. As the other inclusion is clear it
follows that p = pik .
There is a second uniqueness theorem which we now want to formulate.
Definition 1.4.21.
(1) Let p ∈ Ass(I). Then we define
(2) Suppose that p ∈ Ass(I) with M (I,p) = {p}. Then p is called a minimal associated
prime of I. This means that there does not exist an associated prime q of I with
q $ p.
(3) If p,q ∈ Ass(I), p $ q, then q is called an embedded prime ideal of I. This exactly
means that M (I,q) 6= {q}.
Example 1.4.22. Let I = (x2 ,xy) ⊂ K[x,y]. Then I : x = (x,y), I : y = (x). Ass(I) =
{(x),(x,y)}, M I,(x,y) = Ass(I). The ideal (x,y) is an embedded prime ideal.
Theorem 1.4.23 (Second Uniqueness Theorem of Primary Decomposition).
Let R be a Noetherian ring and I ⊂ R be an ideal, I = q1 ∩ · · · ∩ qr an irredundant
primary decomposition of I. Let p be an associated prime ideal of I, and put
√ √
M (I,p) = { qi1 , . . . , qis }.
Then qi1 ∩ · · · ∩ qis is uniquely determined (that is, independent of the irredundant de-
composition). In fact:
qi1 ∩ · · · ∩ qis = Ip ∩ R.
In particular, the primary ideals belonging to minimal associated primes are uniquely
determined. For such a minimal prime ideal pi one, therefore, has qi = Ipi ∩ R.
1.4 Primary Decomposition 29
(4) If R is reduced, then the set of zerodivisors of R is equal to the union of the minimal
prime ideals of R.5
p
Proof. Let (0) = q1 ∩ · · · ∩ qk be an irredundant primary decomposition. Then (0) =
pp1 ∩· · ·∩pk follows, which is (1). For (2), as p is a prime ideal, it follows that p1 ∩· · ·∩pk =
(0) ⊂ p, so (2) follows.
To prove (3), let x be a zero divisor, that is, there is a y ∈ R, y 6= 0 and xy = 0. Let
√
(0) = q1 ∩ · · · ∩ qk be an irredundant primary decomposition and qi = pi . Since y 6= 0
there exists an i such that y 6∈ qi . This implies x ∈ pi . To prove the other direction, let
x ∈ pi for some i. If x is nilpotent, we are done. Otherwise xm ∈ qi for some m and
xm 6= 0. In particular, k ≥ 2.
But qi ·(q1 ∩· · ·∩qi−1 ∩qi+1 ∩· · · ∩qk ) ⊂ q1 ∩· · ·∩qk = (0) implies that there exists a
y 6= 0 such that xm y = 0. Let n ≥ 1 be minimal such that xn y = 0. Then x · (xn−1 y) = 0
and, consequently, x is a zero divisor. The final statement follows from the third.
As an application, we first want to study the structure of Artinian K–algebras (cf.
1.1.6).
Lemma 1.4.25. Let R be an Artinian K–algebra and p ⊂ R be a prime ideal. Then p
is a maximal ideal.6
Proof. R/p is an Artinian K–algebra, see 1.1.9, and an integral domain. We have to prove
that R/p is a field. Let x ∈ R/p, x 6= 0, then Lemma 1.1.8 implies that (xn ) = (xn+1 )
for some n. Hence, xn = xn+1 y for some y ∈ R/p. Since R/p is an integral domain, this
implies 1 = xy. So x is invertible, and R/p is a field.
Theorem 1.4.26. Let R be an Artinian K–algebra. Then R is uniquely (up to isomor-
phism) a finite direct sum of local Artinian K–algebras.
5 Note that minimal prime ideals of R are the minimal associated prime ideals of (0).
6 This is a very special property, and says that an Artinian K–algebra has Krull dimension zero. For a
discussion on these matters, see Section 4.1.
30 1 Algebra
Proof.
Step 1. We first prove the existence. Let (0) = q1 ∩ · · · ∩ qn be an irredundant primary
√
decomposition of the zero ideal. Then all the associated primes mi := qi are maximal
(1.4.25). By the Second Uniqueness Theorem 1.4.23 the qi are uniquely determined. We
claim that furthermore qi + qj = R if i 6= j. Indeed, otherwise m ⊃ qi + qj for some
√ √
maximal ideal m. But then m ⊃ qi = mi and m ⊃ qj = mj . This implies m = mi = mj
and, therefore, i = j (the decomposition was irredundant).
Therefore, the conditions to apply the Chinese Remainder Theorem 1.1.12 are ful-
filled and the canonical map
Mn
R −→ R/qi
i=1
is an isomorphism.
Because of 1.1.9 the R/qi are Artinian K–algebras. They are local because qi 6= qj
for i 6= j, again because the primary decomposition was irredundant.
Step 2. We now prove the uniqueness of the decomposition. So consider R ∼ = ⊕m
i=1 Ri with
Ri local. In step 1 we proved that an irredundant primary decomposition of the zero ideal
in an Artinian K–algebra is uniquely determined (up to permutation). We will show that
if qi := Ker(R −→ Ri ), then q1 ∩ . . . ∩ qm is an irredundant primary decomposition of
the zero ideal in R. This suffices to show the uniqueness, as Ri ∼
= R/qi .
(1) Consider the zero ideal in Ri . As Ri has only one maximal ideal, there is only one
primary ideal in an irredundant primary decomposition. This ideal has to be (0),
hence (0) is a primary ideal in Ri . As Ri = R/qi , this shows that the ideals qi are
primary.
(2) ∩m ∼ m
i=1 qi = (0) because R = ⊕i=1 Ri . Thus q1 ∩ · · · ∩ qm = (0) is a primary decompo-
sition.
(3) To prove that the primary decomposition is irredundant, it suffices to show that
√ √
qi 6= qj for i 6= j. For this it suffices to show that qi + qj = R for i 6= j.
Now, because R ∼ = ⊕mi=1 Ri we find x ∈ R corresponding to (0, . . . ,0,1,0, . . . ,0). So
x − 1 ∈ qi and x ∈ qj . This implies 1 = x + (1 − x) ∈ qi + qj .
Next we study the structure of quotient rings, as promised in the Examples 1.3.13.
Proposition 1.4.27. Let R = Q(R) be a reduced Noetherian ring and Ass (0) =
{p1 , . . . ,ps }. Then R ∼
= ⊕si=1 R/pi .
Proof. Note that p1 , . . . ,ps are minimal, because R is reduced. By prime avoidance,
Lemma 1.1.13, pi + pj 6⊂ ∪sℓ=1 pℓ if i 6= j.
By Theorem 1.4.24 there is a nonzerodivisor in pi + pj . As R = Q(R), all nonzerodi-
visors are units, so that pi +pj = R. As R is, moreover, reduced, we have (0) = p1 ∩· · ·∩ps
by 1.4.24. The conditions of the Chinese Remainder Theorem (1.1.12) are satisfied and
we get R ∼= ⊕si=1 R/pi .
Finally, we wish to study the rank of a module.
Definition 1.4.28. Let R be a Noetherian ring and M be a finitely generated R–module.
One says that M has rank r if M ⊗R Q(R) is a free Q(R)–module of rank r. We write
rank(M ) = r.
1.4 Primary Decomposition 31
For (2) ⇒ (1), the existence of a nonzerodivisor, implies Ir (ϕ)Q(R) = Q(R). Together
with Proposition 1.3.8 we get that Im(ϕ) has rank r.
Exercises
1.4.33. Let R be a principal ideal domain. Prove that f ∈ R is prime if and only if it is
irreducible.
(Hint: Irreducible elements in a principal ideal domain generate maximal ideals. Maximal ideals
are prime ideals.)
1.4.35. Use Exercise 1.4.33 to prove that principal ideal domains are unique factorization do-
mains.
6 0
( Hint: Use the property that a principal ideal domain is Noetherian to prove that every f =
not being a unit is a finite product of irreducible elements.)
1.4.36. Let R be a unique factorization domain. Prove that R has the unique factorization
Q
n Q
m
property: let p1 , . . . ,pn , q1 , . . . ,qm be primes in R and pi = qi .
i=1 i=1
Prove:
(1) n = m.
(2) There exist a permutation π ∈ Sn and units u1 , . . . ,un ∈ R such that pi = ui qπ(i) .
1.4.37. Let R be a unique factorization domain and a1 , . . . ,an ∈ R be elements different from
zero. Prove that a1 , . . . ,an have a greatest common divisor.
1.4.38. Recall the Euclidean algorithm and prove that it computes in Z and K[x] the greatest
common divisor of two elements.
1.4.39. Let R be a unique factorization domain and p ∈ R prime. Let f,g ∈ R[x] be two
polynomials and assume p|f · g. Prove that p|f or p|g, that is, p is also prime in R[x].
1.4.40. Let a,b ∈ Z. Prove that (a) · (b) = (a) ∩ (b) iff gcd(a,b) = 1.
1.4.43. Let I ⊂ R be an ideal and q ⊃ I be a prime (respectively primary) ideal. Prove that
q = {f : f ∈ q} ⊂ R/I is prime (respectively primary).
1.4.45. Let q be a prime (respectively primary) ideal in a Noetherian ring R. Let I · J ⊂ q and
I 6⊂ q for two ideals I,J ⊂ R. Prove that J (respectively J s for some s) is contained in q.
1.4.47. Let q be a primary ideal and p its associated prime ideal. Let x ∈ R, then
8
>
<R if x ∈ q
q : x = q if x 6∈ p
>
:primary with associated prime p if x 6∈ q.
1.4.53.
(a) In 1.4.24 we proved that the set of nilpotent elements in a Noetherian ring is the inter-
section of all (minimal) prime ideals in the ring. Prove that this holds in fact for all rings
(Noetherian or not) by an application of Zorn’s Lemma.
√
(b) Let R be a ring and I ⊂ R be an ideal. Prove that I = ∩ p⊃I p.
p prime
1.4.54. Let (R,m) be a Noetherian local ring and M be an R–module. Suppose that for all
prime ideals p 6= m the localization Mp is the zero module. Show that there exists a k such the
mk · M = 0.
(Hint: Look at a primary decomposition of Ann(M ).)
1.4.55. Let (R,m) be a Noetherian local ring. Suppose that all x ∈ m \ m2 are zerodivisors.
Show that all x ∈ m are zerodivisors.
(Hint: Use 1.4.24.)
34 1 Algebra
Finite and integral extensions of rings are the ring counterparts of finite and algebraic
extensions of fields, which you (hopefully) know from a course in Galois theory. First we
give the relevant definitions.
Definition 1.5.1. Let R ⊂ S be rings.
• R ⊂ S is called a finite extension if S, considered as an R–module, is finitely
generated.
• An element α ∈ S is called integral over R if and only if α satisfies an equation:
αn + a1 αn−1 + . . . + an−1 α + an = 0
with all ai ∈ R. Such an equation is called an integral equation for α. The extension
R ⊂ S is called integral if every element x ∈ S is integral over R.
• On the other hand, R is called integrally closed in S, if every element in S which is
integral over R already belongs to R.
• R is called integrally closed if R is integrally closed in Q(R), the total quotient ring
of R.
• R is called normal if R is reduced and integrally closed.
We obtain the first lemma by copying the proofs of the corresponding statements in
field theory. As usual we denote by R[α] ⊂ S the smallest ring in S containing R and α.
Lemma 1.5.2. Let R ⊂ S and α ∈ S.
(1) Let R ⊂ S ⊂ T be ring extensions, with R ⊂ S and S ⊂ T finite. Then R ⊂ T is
finite.
(2) α is integral over R ⇐⇒ R[α] is a finitely generated R–module.
In particular, every finite extension R ⊂ S is an integral extension.
(3) Let R ⊂ S ⊂ T be ring extensions, with R ⊂ S and S ⊂ T integral. Then R ⊂ T is
an integral extension.
(4) Let S be a finitely generated R–algebra. Then S is finitely generated as an R–module
if and only if every element of S is integral over R.
Proof.
(1) If u1 , . . . ,us are generators of S as R–module and v1 , . . . ,vt are generators of T as
S–module, then one proves directly that the products ui vj generate T as an R−module.
(2) Let α be integral, with integral equation p(α) = 0, for some monic7 polynomial
p ∈ R[X] of degree n, and g(α) ∈ R[α] for some polynomial g ∈ R[X]. Then we can
perform division with remainder to write g = qp + r, where deg(r) < n (cf. Theorem
2.1.5). Plugging in α, we see that g(α) = r(α). This shows that the elements 1,α, . . . ,αn−1
generate R[α] as R−module. On the other hand, if we have finitely many generators
q1 , . . . ,qt of R[α] as R−module, and if we put n − 1 to be maximum of the degrees with
respect to α occurring in the qi , it follows that the element αn is an R−linear combination
of the qi . This gives an integral equation for α.
7 This means that the leading coefficient of p is 1.
1.5 Finite and Integral Extensions 35
(3) We give here a proof for the case that R is Noetherian. For the general case, see
Exercise 1.5.28. Let α ∈ T . By hypothesis, it satisfies an integral equation:
αn + a1 αn−1 + . . . + an−1 α + an = 0
with all ai ∈ S. So in fact α is integral over R[a1 , . . . ,an ]. Therefore R[a1 , . . . ,an ][α] is a
finitely generated R[a1 , . . . ,an ]–module, by part (2). As all elements ai are integral over
R by hypothesis, it follows from the second part and induction that R[a1 , . . . ,an ] is a
finitely generated R–module. Therefore, R[α] is a submodule of R[a1 , . . . ,an ,α] which is
also a finitely generated R–module by the first part. As R is Noetherian, it follows that
R[α] is finitely generated. Now use (2) again, to conclude that α is integral.
(4) This follows from part (2).
Definition 1.5.3. The integral closure of R in S is the set of all elements in S which
are integral over R. In the case that S is the total quotient ring of R, and R is reduced,
we simply call it the normalization of R. We denote the normalization by R. e
To use the standard proof from field theory that sum, difference and products of
integral elements are integral again we need the assumption that R is Noetherian. The
statement holds without this condition however, see Exercise 1.5.29.
Lemma 1.5.4. Let R ⊂ S be rings. Then the integral closure of R in S is a ring. The
integral closure of R in S is integrally closed in S. In particular, the normalization of a
ring is normal.
Proof. We give the proof here for the case that R is Noetherian. We have to show that if
α and β are integral, then α − β and αβ are integral. But those elements are in R[α,β],
which is a finitely generated R–module by 1.5.2. As we supposed R to be Noetherian, it
follows that R[αβ] and R[α − β] are finitely generated R−modules. By the second part
of 1.5.2 it follows that α − β and αβ are integral. The other statement follows from the
third part of 1.5.2.
To give examples of normal rings, we prove the following theorem:
Theorem 1.5.5. Let R be a unique factorization domain. Then R is normal.
r r
Proof. Suppose s ∈ Q(R) is in lowest terms, and is integral over R. So s satisfies an
equation:
r n
r n−1
s + a1 s + . . . + an = 0
with ai ∈ R. We multiply the equation with sn and conclude:
Hence s divides rn , and thus s divides r. As we assumed that they have no common
factor, it follows that s is a unit in R, and so rs ∈ R.
In particular, polynomial rings over fields are normal! The converse of the theorem
does not hold, see Exercise 1.5.33.
Examples 1.5.6.
36 1 Algebra
(1) Consider the ring R := C [x,y]/(y 2 −x3 ). This ring is not normal. Indeed the element
2 3
t := xy satisfies t2 = xy 2 = xx2 = x. Therefore, t is integral, as it satisfies the integral
equation X 2 −x = 0. The normalization R e is in fact C [x,y,t]/(y 2 −x3 ,x−t2 ,y−t3 ) ∼=
C [t]. To show that t ∈ / R, consider the map ϕ : C [x,y] −→ C [t], which sends x to
t2 and y to t3 . Obviously y 2 − x3 is in the kernel of ϕ, and in fact the kernel is
generated by y 2 − x3 . Indeed, let g ∈ Ker(ϕ). Then write g = q(y 2 − x3 ) + r, where
r = r0 + r1 y, ri ∈ C [x]. Then g(t2 ,t3 ) = r0 (t2 ) + r1 (t2 ) · t3 = 0. The first term only
has terms in even degree in t, whereas the second has only odd terms in t. From
this it follows that r1 = r2 = 0. Hence C [x,y]/(y 2 − x3 ) = C [t2 ,t3 ], and t = xy is
not an element of R.
(2) Consider the ring R := C [x,y]/(x · y). This ring is not normal. Namely, x + y is not
x
a zerodivisor in R because x + y 6∈ (x) ∪ (y) (Theorem 1.4.24). Consider u = x+y .
2 2 3 2 2 2
Then u = u because x (x + y) = x = x(x + y ) = x(x + y) in R. But it can
easily be checked that u ∈/ R.
(3) Consider the ring R := C [x]/(x2 ). Then R = Q(R) and therefore R is integrally
closed. However, R is not normal, as R is not reduced.
In the second example we used quite essentially that R is not an integral domain.
In fact, the following theorem holds.
Theorem 1.5.7. Let R be a Noetherian normal local ring. Then R is an integral domain.
Proof. R is normal and, therefore, reduced. Let (0) = p1 ∩· · ·∩pr be an irredundant prime
decomposition. We have to prove that r = 1. Assuming r ≥ 2, we can find g ∈ p2 ∩· · ·∩pr ,
g 6∈ p1 (the decomposition is irredundant) and f ∈ p1 , f 6∈ p2 ∪ · · · ∪ pr (Prime avoidance
1.1.13). By the choice of f and g we have that f + g 6∈ pi for i = 1, . . . ,r. This implies
that f + g is not a zerodivisor (Theorem 1.4.24). On the other hand, f · g = 0 because
f
f · g ∈ p1 (p2 ∩ · · · ∩ pr ) ⊂ p1 ∩ · · · ∩ pr = (0). Consider u := f +g , then u2 = u because
2 3 2 2 2
f (f + g) = f = f (f + g ) = f (f + g) . This implies u ∈ R because R is normal. Note
f
that u 6= 1, as otherwise f +g = 1, and therefore g = 0. Now u(1 − u) = 0, hence u is a
zerodivisor in R. The element u is therefore contained in the maximal ideal of R. Hence
1 − u is a unit because R is local. This implies u = 0 and, therefore, f = 0 which is a
contradiction to the choice of f .
The assumption on R to be local in the theorem can be weakened to the assumption
that R has no idempotents different from 1.8 The last part of the proof just showed that
a local ring does not contain idempotents different from 1.
For more subtle arguments concerning integral elements given later, we need the
following important theorem:
Theorem 1.5.8 (Cayley-Hamilton). Let R ⊂ S, α ∈ S. Let M be an R[α]-module,
which is finitely generated as an R−module. Suppose moreover that M is a faithful9
R[α]–module. Then α is integral over R.
8 An element x in a ring is called idempotent if x2 = x.
9 An R–module M is called faithful if given a ∈ R such that a · m = 0 for all m ∈ M it follows that
a = 0.
1.5 Finite and Integral Extensions 37
Proof. Let u1 , . . . ,ut be generators of M as an R−module. Then for each i the element
αui is defined and is an P element of M . As the ui ′ s generate M there exist elements
t
ϕij ∈ R, such that αui = j=1 ϕij uj . In matrix notation:
u1
.. = 0, Id the unit matrix.
αId − (ϕij ) .
ut
We multiply on the left with the adjoint (see 1.2.9) of αId − (ϕij ) to conclude that
det(αId − ϕij ) uk = 0 for k = 1, . . . ,t. This is Cramer’s
rule, see 1.2.8. As we assumed
M to be faithful, it follows that det αId − (ϕij ) = 0. Expand the determinant to find
an integral equation for α.
Remarks 1.5.9.
(1) Note that the ring S is “dummy” in some sense. We just need that our element
lives in some ring. To see why this theorem is called the Cayley-Hamilton Theorem,
we refer to Exercise 1.5.30.
(2) If moreover α sends M into I · M for some ideal I, then α satisfies an integral
equation of type
αn + a1 αn−1 + · · · + an = 0, aj ∈ I.
This motivates the following definition.
Definition 1.5.10. Let R ⊂ S be a ring extension, and I ⊂ R be an ideal. An element
x is called integral over I if it satisfies an equation of type10
xn + a1 xn−1 + . . . + an = 0, aj ∈ I.
Proof. Of course, (2) =⇒ (3) is trivial. The equivalence (1) ⇐⇒ (2) can be proved as
in 1.5.2. The implication (3) =⇒ (1) uses the Cayley-Hamilton Theorem. Indeed, take
an m ∈ N such that xm ∈ I · S ′ . Therefore, multiplication with xm sends S ′ to I · S ′ .
It follows from the proof of the Cayley-Hamilton Theorem that xm satisfies an integral
equation with coefficients in I (S ′ is a faithful R[xm ]–module because 1 ∈ S ′ ).
1.5.12. Let R ⊂ S be rings, and I ⊂ R. Let R be the integral closure of R
Corollary √
in S. Then IR is the set of all elements of S which are integral over I.
10 There is a related notion of integrally dependence on I by using the condition aj ∈ I j . This notion
will not be used in this book.
38 1 Algebra
Proof. If x p∈ S and is√integral over I, then by√the second part of the previous lemma
we get x ∈ IR[x] ⊂ IR. Conversely, if x ∈ IR then xm ∈ IR for certain m. Hence
xm ∈ I · R[x1 , . . . ,xn ] for certain xi ∈ R, so that from (3) =⇒ (1) of the previous lemma
it follows that x is integral over I.
Next we will give a criterion for normality.
Theorem
√ 1.5.13 (Criterion for Normality). Let R be a Noetherian reduced ring. Let
I = I ⊂ R be a radical ideal with the following properties:
Then R is normal if and only if the canonical inclusion R ⊂ HomR (I,I) is an equality.
To prove the theorem we need the following lemma.
e its normalization. Let I ⊂ R be
Lemma 1.5.14. Let R be a Noetherian reduced ring, R
an ideal containing a nonzerodivisor of R. Then
e
(1) R ⊂ HomR (I,I) ⊂ R.
√
e ∩ HomR (I,R).
(2) If, moreover, I = I, that is, I is radical, then HomR (I,I) = R
Proof.
(1) The canonical map R −→ HomR (I,I) is defined by the multiplication: to y ∈ R we
associate the map ϕy : I −→ I defined by ϕy (x) = yx. The injectivity of the canonical
map follows from the assumption that I contains a nonzerodivisor.
Now let a ∈ I be a nonzerodivisor and ϕ ∈ HomR (I,I). Then for every b ∈ I
we have ϕ(b) = b · ϕ(a)
a because aϕ(b) = ϕ(ab) = bϕ(a). Especially, ϕ(a)a ∈ Q(R) is
independent of the choice of the nonzerodivisor. The map ϕ 7→ ϕ(a)
a gives the inclusion
HomR (I,I) ⊂ Q(R). In particular, we obtain an identification
We claim that this module is already contained in the normalization Re of R. Let x ∈ Q(R)
and xI ⊂ I then I can be considered as an R[x]–module.
As an ideal containing a nonzerodivisor it is a faithful R[x]–module and we can apply
the Cayley-Hamilton Theorem 1.5.8 to obtain that x is integral over R. This implies that
e
HomR (I,I) ⊂ R.
(2) We just proved that HomR (I,I) ⊂ R. e Moreover, the inclusion HomR (I,I) ⊂ HomR (I,R)
e
is obvious. So we get HomR (I,I) ⊂ R ∩ HomR (I,R). As above, note that HomR (I,R) =
{x ∈ Q(R) : xI ⊂ R}.
To prove the converse inclusion, let z ∈ R e ∩ HomR (I,R). We have to prove that
n n−1
zI ⊂ I. Let z + an−1 z + · · · + a0 = 0 for suitable a0 , . . . ,an−1 ∈ R. Let f ∈ I, then
by multiplying the above equation with f n we obtain
Proof of Theorem 1.5.13. Let R be normal, that is, R = R.e Then Lemma 1.5.14 (1)
implies that R = HomR (I,I).
To prove the converse, assume that R = HomR (I,I). Suppose that the theorem is
e Let h ∈ R
wrong, that is, R 6= R. e \ R. Consider the ideal
J := {u ∈ R : hu ∈ R}
√ √
in R. We claim that I ⊂ J. As J = ∩p⊃J p is the intersection of all prime ideals
containing J, by Exercise 1.4.53, it suffices to show that p ⊃ I for all p ⊃ J. So let p be a
prime ideal with p ⊃ J. Now h ∈ / Rp . Indeed, otherwise ah ∈ R for some a ∈ / p. But then
also a ∈
/ J, in contradiction to the definition of J. So h gives an element of Q(Rp ) which
is integral over Rp (use the same integral equation) and, therefore, Rp is not normal.
This implies p ⊃ I by the second assumption, and proves the claim.
As our ring is Noetherian, we can now choose d ∈ N such that I d ⊂ J. This implies
d
hI ⊂ hJ ⊂ R. As h ∈ / R, we have d > 0. Let d be minimal with hI d ⊂ R. Thus
there exists an x ∈ I d−1
such that hx 6∈ R. But hx ∈ R e and hxI ⊂ hI d ⊂ R implies
e
hx ∈ R ∩ HomR (I,R) = HomR (I,I) = R. This is a contradiction to the choice of x.
Therefore, our original assumption h ∈ / R is wrong. This proves the theorem.
Remark 1.5.15. We will see later that for rings of finite type over a field (cf. Chapter
2) or analytic algebras (cf. Chapter 3) the ideal I describing the singular locus (which
can be computed using the Jacobian Criterion, cf. Chapter 4) has the properties required
in Theorem 1.5.13.
Example 1.5.16 (Continuation of Example 1.5.6).
(1) Let R = C [t2 ,t3 ], I = (t2 ,t3 ). Then
HomR (I,I) = {x ∈ Q(C [t2 ,t3 ]) = Q(C [t]) : xI ⊂ I} = C [t].
x
(2) Let R = C [x,y]/(x · y), I = (x,y) and u = x+y then R[u] = HomR (I,I) because
R[u] ⊂ Hom(I,I) and R[u] = R[u]/(u) × R[u]/(1 − u) is the product of the two
normal rings R[u]/(u) ∼
= C [y] and R[u]/(1 − u) = C [x].
We now come to the very important theorem of the finiteness of the normalization.
This theorem, due to Kronecker and Noether, is of utmost importance for the develop-
ment of the theory in this book, and will be used at several points. For example, it is
used in the proof of the Nullstellensatz in local analytic geometry, see 3.4.4, and in the
proof of the Local Parametrization Theorem, see 3.4.14, which is also very important
in our development of the theory of local analytic geometry. It is also used in Theorem
4.4.8, in order to prove that the “normalization of a space” always exists. Before stat-
ing the theorem of the finiteness of normalization, we recall some facts concerning the
discriminant of a polynomial in one variable.
Definition 1.5.17. Let k be a field (not necessarily algebraically closed), and f =
xr + a1 xr−1 + . . . + ar be a polynomial with coefficients in k. Consider a field extension
k ⊂ K in which f splits:
f = (x − α1 ) · · · (x − αr ).
Then the discriminant ∆ = ∆x (f ) with respect to the variable x is defined by:
Y
∆= (αi − αj ).
i6=j
40 1 Algebra
The discriminant therefore is zero exactly if f has a multiple root. It follows from
the Theorem on Symmetric Functions, and is a standard fact in field theory, that the
discriminant in fact is a polynomial in the coefficients a1 , . . . ,ar of f .
Lemma 1.5.18. Let R be an normal integral domain, K be its quotients field, and L ⊃ K
be a field extension. Let x ∈ L be integral over R. In particular, x is algebraic over K.
Then the minimal polynomial f (X) ∈ K[X] of x has coefficients in R.
Proof. Consider a complete set of conjugates xi of x in an algebraic closure of L. This
means that we can write Y
f (X) = (X − xi ).
(Each xi is counted pe times, where pe is the degree of inseparability. In particular xi
is counted once, if the characteristic is zero.) By a standard result in field theory, there
exists a K–isomorphism σi : K(x) ∼ = K(xi ), with σi (x) = xi for all i. It follows that also
xi is integral over R. Indeed, apply the isomorphism σi to an integral equation of x
xn + a1 xn−1 + . . . , + an = 0
to which we apply our knowledge of linear algebra. First of all, the determinant D :=
det(A), as a sum and product of integral elements itself is integral. But we know more: the
matrix A isQ the well-known Vandermonde matrix and its determinant can be computed
to be D = i>j (αi − αj ). Furthermore, by definition of the discriminant ∆ = ±D2 . It
follows from Cramer’s rule 1.2.8 that
D · qν = Pν (fi ,αj )
for all ν for some polynomial Pν with integer coefficients. It follows that Dqν is integral
over R. As D also was integral we conclude that ∆qν = DDqν is integral over R. Now
∆ ∈ R ⊂ K, and the qν are in K, so that ∆ · qν is an element of K, the quotient ring of
R. We can now use our assumption that R is normal to conclude that ∆ · qν ∈ R for all
ν. This is what we had to prove.
12 Therefore, it is trivially true that for all f there exists a ∆f with ∆f · f ∈ R[α]. The point of the
theorem is, that we can take a universal ∆ which works for all f .
42 1 Algebra
(4) Let β ∈ S.e Then S[β] is a finitely generated R–module with quotient field L and,
therefore, by (3) of the theorem (applied to R ⊂ S[β]) included in R∆ [α]. This implies
that Se ⊂ R∆ [α] and is, therefore, finitely generated.
(5) Note that S∆ = R∆ [α] and, because of (4), Se∆ = R∆ [α].
Theorem 1.5.20 (Splitting of Normalization). Let R be a reduced Noetherian ring and
(0) = p1 ∩ · · · ∩ ps be an irredundant primary decomposition. Then there is a canonical
isomorphism between the normalization R ]i :
e and the direct sum of the normalizations R/p
s
M
e∼
R = ]i .
R/p
i=1
Proof. We have the following statements that are easy to prove, see Exercise 1.5.27.
(1) pi Q(R) is a prime ideal in Q(R).
(2) ∩si=1 pi Q(R) = (0).
(3) Q(R)/pi Q(R) = Q(R/pi ).
Now we use Proposition 1.4.27 and obtain
s
M s
M
Q(R) ∼
= ∼
Q(R)/pi Q(R) = Q(R/pi ).
i=1 i=1
The second isomorphism is the third statement from above. Let ui ∈ Q(R) correspond to
(0, . . . ,0,1,0, . . . ,0), with 1 at the i–th place. It follows immediately from the isomorphism
that u2i = ui . So ui ∈ R. e Note, moreover, that ui ∈ ∩j6=i pj Q(R), ui − 1 ∈ pi Q(R) for
i = 1, . . . ,s. Thus we get ui ∈ ∩j6=i (pj Q(R) ∩ R) e and ui − 1 ∈ pi Q(R) ∩ R.
e The conditions
of the Chinese Remainder Theorem are satisfied. We get an isomorphism
s
M
e∼
R = e i Q(R) ∩ R).
R/(p e
i=1
]i ∼
induces an isomorphism R/p e i Q(R) ∩ R),
= R/(p e see Exercise 1.5.27.
Remark 1.5.21.
(1) Note that we reproved that a Noetherian normal local ring is an integral domain.
(2) Let K be a field, and S = K[x1 , . . . ,xn ]/p with p a prime ideal. We will prove in
2.2.9 (Noether normalization) that there exists R ⊂ S satisfying the assumptions
of the Theorem 1.5.19. This implies that the normalization Se is a finitely generated
S–module. In general this is not necessarily true. A similar statement holds for
so-called analytic algebras, which we will consider in the third chapter.
1.5 Finite and Integral Extensions 43
xn + a1 xn−1 + . . . + an = 0,
Exercises
1.5.27. Show the unproved statements in the proof of Theorem 1.5.20.
]i ∼
(Hint: To prove that R/p e i Q(R) ∩ R)
= R/(p e consider the canonical inclusions
M M M
R⊂ R/pi ⊂ e i Q(R) ∩ R)
R/(p e ⊂ Q(R/pi ).)
1.5.28.
(1) Let R ⊂ S be a ring extension, and α ∈ S. Here R is not necessarily Noetherian. Use the
Cayley-Hamilton Theorem to prove that the following conditions are equivalent.
(a) α is integral over R.
(b) R[α] is a finitely generated R–module.
(c) R[α] is contained in a ring T ⊂ S, such that T is a finitely generated R–module.
(d) There exists a faithful R[α]–module M which is finitely generated as an R–module.
(2) Prove that 1.5.2 (3) holds without the hypothesis that R is Noetherian.
1.5.29. Use the Cayley-Hamilton Theorem to prove that the Noetherian hypothesis in 1.5.4 is
superfluous.
1.5.30.
(1) Let K be a field, V be a finite-dimensional vector space, and ϕ : V → V be a linear
map. Interprete 1.5.8 as the Theorem of Cayley-Hamilton you know from linear algebra.
Note that we could just as well take any ring R instead of the coefficient field K, and any
finitely generated module M instead of V .
(2) Let R be a ring, M be a finitely generated R−module, and ϕ : M → M be a surjective
R−module homomorphism. Prove that ϕ is an isomorphism.
(Hint: Consider M as R[X]–module, via X · m := ϕ(m), and extend by linearity. Then
(X) · M = M. Consider M as R[X,Y ]–module, via Y · m = Id(m) = m. Then apply
Remark 1.5.9 (2) to deduce that Y integral over X.)
fS = R
1.5.31. Let R be a reduced ring, and S ⊂ R be a multiplicatively closed set. Show that R eS .
Informally speaking, one says that taking integral closure commutes with localization.
1.5 Finite and Integral Extensions 45
1.5.32. Let R be a reduced ring. Prove that the following conditions are equivalent.
(1) R is normal.
(2) Rp is normal for all prime ideals p in R.
(3) Rm is normal for all maximal ideals m in R.
The purpose of this chapter is to introduce the reader to the basic definitions, ideas and
notations of affine algebraic geometry. Loosely speaking, an affine algebraic set is the zero
set of finitely many polynomials in an affine space K n , where K is an algebraically closed
field.1 Some of the results of this chapter are true and proved over arbitrary algebraically
closed fields. This is the reason why we do not consider only the case K = C as we will
do from Chapter 3 onwards. Admittedly, our main interest in this book is the behavior
of such zero sets in the neighborhood of a point. We will discuss and study the algebraic
case first, however, simply because the proofs of the statements in this case usually are
easier. This is because one does not have to bother one-selves with convergence questions.
Having grasped the main geometric and algebraic ideas in the affine case, the reader is
hopefully ready to tackle the local case in Chapter 3. A second reason for doing the affine
case, too, is that we need to do some algebraic geometry, in order to classify hypersurface
singularities, see Chapter 9.
Again for reasons of simplicity, we first study the zero set V (f ) of a single polyno-
mial f . These zero sets are called affine hypersurfaces. The main result in this section
is Study’s Lemma. It says that if f and g are polynomials, with f reduced and V (f ) a
subset of V (g), then f divides g. We give two proofs, whose ideas permeate large parts
of the first part of the book. Namely, we consider a “general” linear projection of the
hypersurface V (f ) on a hyperplane. This is called Noether normalization. One shows
that the projection is surjective (this is quite trivial), and the proof is completed by
using some elementary algebra, in particular, division with remainder. A second proof of
Study’s Lemma is given. Here it is not only proved that the projection is surjective, but
also the number of points in the fiber are studied.
Study’s Lemma allows us to interpret certain algebraic operations and statements
geometrically. We mention two of them.
The first is the decomposition of a hypersurface in irreducible components. A hy-
persurface is called reducible, if it can be written in a nontrivial way as the union of two
(or more) hypersurfaces. Now, if f and g are polynomials, the zero set of f · g, obviously,
is the union of the zero sets of f and g. Study’s Lemma allows us to conclude that for
the zero set of an irreducible polynomial f such a decomposition is not possible.
The second is the interpretation of the K–algebra K[x1 , . . . ,xn ]/(f ). On a hyper-
surface V = V (f ), where f is a polynomial, it is natural to consider functions which
are restrictions of polynomials on the ambient space K n . This set of functions is in a
natural way a K–algebra. If f is reduced, this algebra can be identified with the quotient
K[x1 , . . . ,xn ]/(f ) := K[V ]. In the proof, essential use of Study’s Lemma is made.
In the second section, we will study affine algebraic sets in general. That is, we study
zero sets V (I) := {a ∈ K n : f (a) = 0 for all f ∈ I} of ideals I in the polynomial ring
K[x1 , . . . ,xn ]. Hilbert’s Basis Theorem allows us to consider zero sets of only finitely
many polynomials. The main discussion is on a generalization of Study’s Lemma, called
Hilbert’s Nullstellensatz. A weaker form is given by the Weak Nullstellensatz. It says that
1 Just keep in mind the case K = C .
2.1 Affine Hypersurfaces 47
if I is a proper ideal, then V (I) 6= ∅. This statement is in fact equivalent to the structure
of maximal ideals in the polynomial ring: every maximal ideal in K[x1 , . . . ,xn ] is of type
(x1 − a1 , . . . ,xn − an ), for some (a1 , . . . ,an ) ∈ K n . So we have a one-one correspondence
between maximal ideals and points. The proof of the Weak Nullstellensatz is a conse-
quence of the Projection Theorem, which is a statement on how one can describe, under
certain circumstances, the image of an affine space under projection. To prove Hilbert’s
Nullstellensatz, we again use the Projection Theorem and Noether normalization. This
means that we study a general projection π : X = V (I) −→ K r , where r is determined
by X. Morally speaking, r is equal to the dimension of X, a property of X which is not
defined at this point of our study. The proof of the fact that the map π for a Noether
normalization then is surjective, is done by means of the Projection Theorem. Hilbert’s
Nullstellensatz itself is an easy consequence of the surjectivity of π, and is an immediate
generalization of the second part of the proof of Study’s Lemma. A shorter proof of the
Nullstellensatz out of the weak Nullstellensatz is by using the so-called Rabinowitch trick.
This is done in the exercises. We presented the other proof however, as a substitute for
the Rabinowitch trick in the local case does not, to our knowledge, exist. We want to
have similar proofs for both the affine and the analytic case.
In Section 2.3, we consider regular maps between affine algebraic sets. These maps
are restrictions of polynomial maps. In particular, we define isomorphisms between affine
algebraic sets. If ϕ : V −→ W is a regular map between affine algebraic varieties, we get,
by composition, a K–algebra homomorphism ϕ∗ : K[W ] −→ K[V ]. (Arrows reverse!)
Again geometric statements about maps between V and W are interpreted as algebraic
statements on the map ϕ∗ : K[W ] −→ K[V ]. For example, it is shown that ϕ is injective,
if and only if ϕ∗ is surjective.
Definition 2.1.1. Let K be a field, f ∈ K[x1 , . . . ,xn ] be a polynomial. The zero set
V (f ) ⊂ K n of f is defined by:
V (f ) := {(a1 , . . . ,an ) ∈ K n : f (a1 , . . . ,an ) = 0}.
A set V ⊂ K n is called an affine hypersurface if there exists a nonzero f ∈ K[x1 , . . . ,xn ],
such that V = V (f ).
We give some examples. For the examples, we use K = R for two reasons: real
pictures are easier to draw; R is not algebraically closed and almost everything which
fails over a field which is not algebraically closed fails already over R.
Examples 2.1.2.
(1) f = x2 + y 2 − 1 ∈ R[x,y]. The set V (f ) is a circle in the plane.
48 2 Affine Algebraic Geometry
As is well-known, see 1.4.4, the polynomial ring K[x1 , . . . ,xn ] is factorial. It follows
easily from the definition, see Exercise 2.1.18, that f |g (or what is the same (f ) ⊃ (g))
implies that there is an inclusion of zero sets V (f ) ⊂ V (g). The converse does not hold,
as the following simple examples shows.
Examples 2.1.3.
(1) Take f = x2 + 1 ∈ R[x], g = x ∈ R[x]. Then ∅ = V (f ) ⊂ V (g) = {0}. But as the
degree of f is bigger than the degree of g, f cannot divide g.
(2) Let f be a nonconstant polynomial. Then V (f 2 ) = V (f ), so in particular V (f 2 ) ⊂
V (f ). But obviously f 2 does not divide f .
The phenomena occurring in the first example is typical for fields which are not
algebraically closed. In fact, one has the following theorem.
Theorem 2.1.4 (Study’s Lemma). Let K be an algebraically closed field,
f,g ∈ K[x1 , . . . ,xn ]. Suppose that f is irreducible, and that V (f ) ⊂ V (g). Then f |g.
There is a proof of Study’s Lemma using so-called resultants, see for example,
[Brieskorn-Knörrer 1986], page 192. We present two other proofs of Study’s Lemma here.
Both proofs are based on the following two facts.
g = qf + r.
with m ≥ d. Consider
xi = bi yn + yi for 1 ≤ i ≤ n − 1, xn = bn yn
Then g(y1 , . . . ,yn ) := f (x1 (y1 , . . . ,yn ), . . . ,xn (y1 , . . . ,yn )) is such that g(0, . . . ,0,yn ) =
fs (b1 , . . . ,bn )yns + fs−1 (b1 , . . . ,bn )yns−1 + . . ., with fs (b1 , . . . ,bn ) = c 6= 0. Hence g =
c(yns + a1 yns−1 + . . . + as ), for certain ai ∈ K[y1 , . . . ,yn−1 ]. This shows the lemma.
The Noether normalization for hypersurfaces can be reformulated in a more algebraic
manner. The algebraic formulation is the one which is going to be generalized later.
Corollary 2.1.7 (Noether Normalization for Hypersurfaces). For a general coordinate
system,
K[x1 , . . . ,xn ]/(f )
is a finitely generated K[x1 , . . . ,xn−1 ]–module.2
2 In this case it is even a free module.
50 2 Affine Algebraic Geometry
Proof. After a general change of coordinates we may assume that f is of type (2.1). We
claim that K[x1 , . . . ,xn ]/(f ) is as K[x1 , . . . ,xn−1 ]–module generated by {1,xn , . . . ,xns−1 }.
Consider a g ∈ K[x1 , . . . ,xn ]. We do division with remainder
g = qf + r
Examples 2.1.8.
(1) Consider f = xy. Then obviously C [x] ⊂ C [x,y]/(xy), but C [x,y]/(xy) is not
a finitely generated C [x]–module. Indeed, if it was finitely generated, say by the
elements g1 , . . . ,gs ∈ C [x,y], then let d be the maximum of the degree in y in the
polynomials gi . Then y d+1 is not in the module generated by the g1 , . . . ,gs . The
module C [x,y]/(xy) is as C [x]–module generated by {1,y,y 2 , . . .}.
f
2.1 Affine Hypersurfaces 51
These two examples are quite typical. In both cases, we considered the inclusion of
rings C [x] ⊂ C [x]/(f ). Consider the projection π : X = V (f ) −→ C which sends
(a,b) to a. In the first example, this map is surjective, but the fiber π −1 (0) consists of
infinitely many points, in fact the whole y–axis. In the second example, the map π is not
surjective: π −1 (0) = ∅. But, if f is as in formula (2.1), it turns out that the projection
π : V (f ) −→ C n−1 on the first n − 1 coordinates is surjective, and has only finitely many
points in each fiber. This fact, which is very easy to prove, is crucial for our first proof
of Study’s Lemma.
Lemma 2.1.9. Let K be an algebraically closed field, and f = xsn + a1 xns−1 + . . . + as ,
with ai ∈ K[x1 , . . . ,xn−1 ]. Consider the projection
Then π is surjective.
Proof. This is quite obvious. For, let p = (p1 , . . . ,pn−1 ) ∈ K n−1 . Because K is alge-
braically closed, the equation
gk + b1 g k−1 + . . . + bk = 0,
with bi ∈ K[x1 , . . . ,xn−1 ]. This equation is to be considered in the ring K[x1 , . . . ,xn ]/(f ).
Therefore
g k + b1 g k−1 + . . . + bk ∈ (f ).
We take k minimal with this property. Take a point p ∈ V (f ), so that f (p) = 0. From
V (f ) ⊂ V (g) it follows that g(p) = 0. Therefore bk (p) = 0. But bk only depends on
the first n − 1 coordinates. As π : V (f ) −→ K n−1 is surjective by the previous lemma,
it follows that bk (p1 , . . . ,pn−1 ) = 0 for all (p1 , . . . ,pn−1 ) ∈ K n−1 . Hence bk is the zero
polynomial. Therefore
As f is irreducible, either f | g or f | g k−1 +b1 g k−2 +. . .+bk−1 . But the second possibility
is excluded by our choice of k to be minimal.
The second proof of Study’s Lemma we present looks somewhat more complicated. It
gives, however, also some more information on the fibers of the map π : V (f ) −→ K n−1 .
Moreover, the condition irreducible in this proof can be somewhat weakened: we only
need f to be square-free.
52 2 Affine Algebraic Geometry
Let ∆ ∈ K[x1 , . . . ,xn−1 ] be the discriminant of f (with respect to xn ), see 1.5.17. Con-
sider the projection
π : V (f ) −→ K n−1 ,
(p1 , . . . ,pn ) 7→ (p1 , . . . ,pn−1 ) =: q.
Then π −1 (q) has s, the degree of f , elements exactly when ∆(q) 6= 0.
Proof. Consider q = (p1 , . . . ,pn−1 ) ∈ K n−1 . Look at the polynomial
Because K is algebraically closed, this polynomial has s (distinct) roots exactly when its
discriminant, which is ∆(q), is nonzero. If b1 , . . . ,bt , t ≤ s are the roots of the polynomial
(2.2), then the points in the fiber π −1 (q) are (p1 , . . . ,pn−1 ,b1 ), . . . ,(p1 , . . . ,pn−1 ,bt ).
Example 2.1.11. Consider the curve C given by the equation y 2 − x2 (x + 1) = 0. The
discriminant is given by 4x2 (x + 1), so is zero for x = 0 and x = −1. Consider the
projection π from C to the x–axis.
y
We see indeed from the picture (over C !) that π −1 (p) consists of two points for
p 6= 0, − 1, whereas for p = 0 or −1 it consists of one point. For other examples see
Exercise 2.1.21.
Second proof of Study’s Lemma. After, if necessary, performing a linear change of coor-
dinates, we may assume, by Noether normalization for hypersurfaces, that
g = qf + r
r = r0 + r1 xn + . . . + rs−1 xns−1 , ri ∈ K[x1 , . . . ,xn−1 ].
Let (p1 , . . . ,pn−1 ,pn ) ∈ V (f ). As by assumption V (f ) ⊂ V (g), it follows that
2.1 Affine Hypersurfaces 53
has s roots. But the polynomial r(p1 , . . . ,pn−1 ) has degree s − 1, has s roots, so must be
the zero polynomial. Therefore, its coefficients are zero. Hence
As we can take q = (p1 , . . . ,pn−1 ) general, it follows from Exercise 2.1.19 that the poly-
nomials, r0 , . . . ,rs−1 ∈ K[x1 , . . . ,xn−1 ] are zero. Hence r = 0. Therefore, g = qf , that is,
f | g.
Definition 2.1.12. An affine hypersurface C ⊂ K n is called irreducible if from C =
C1 ∪ C2 , with C1 and C2 affine hypersurfaces, it follows that either C1 = C or C2 = C.
A hypersurface is called reducible if it is not irreducible.
An irreducible component of an affine hypersurface C is an irreducible affine hyper-
surface contained in C.
C = C1 ∪ · · · ∪ Cs ,
Examples 2.1.14.
x=0
y=0
2.1.21.
(1) Consider the curve defined by y 2 − xy + x2 = 0, and the projection π on the x–axis.
Compute for which points on the x–axis the inverse image of π consists of less than two
points.
(2) Consider the cone x2 − yz, see example 2.1.2., and the projection π on the y,z–plane.
Compute for which points on the y,z–plane the inverse image of π consists of less than
two points. What about projection on the x,y–plane?
2.1.22. Prove, using only the statement of Study’s Lemma, that V (f ) 6= ∅ for any nonunit
f ∈ K[x1 , . . . ,xn ].
In this rest of this chapter we assume, unless said otherwise explicitly that K is alge-
braically closed. In this section, we will, more generally, consider zero sets of any set of
elements in K[x1 , . . . ,xn ].
Definition 2.2.1. Let F ⊂ K[x1 , . . . ,xn ] be a subset. We put
and call it the zero set of F . A subset V of K n is called an (affine) algebraic set, or an
affine variety if there exists a subset F ⊂ K[x1 , . . . ,xn ] with V = V (F ). Conversely, for
any set A ⊂ K n we define
(8) I (V ∪ W ) = I (V ) ∩ I (W ).
√
(9) I ⊂ I (V (I)).
Proof.
(1) The fact that I (A) is an ideal is easy. Moreover, if f m ∈ I (A), then f m (p) = 0 for
all p ∈ A. Hence f (p) = 0 for all p ∈ A, that is f ∈ I (A).
(2) and (3) are trivial.
(4) As F ⊂ I, Pnit follows that V (F ) ⊃ V (I). Suppose on the other hand that a ∈ V (F ),
and let f = i=1 ai fi , fi ∈ F be an element of I. As a ∈ V (F ), we have that fi (a) = 0
for all i. Hence f (a) = 0. As f is arbitrary it follows that a ∈ V (I).
(5) ∅ = V ({1}), hence is algebraic. Furthermore, K n = V ({0}), hence is algebraic too.
(6) We prove three inclusions.
(a) V (I) ∪ V (J) ⊂ V (I ∩ J): let a in V (I) ∪ V (J). Therefore, either f (a) = 0 for all
f ∈ I or f (a) = 0 for all f ∈ J. In particular, if f ∈ I ∩ J, it follows that f (a) = 0.
(b) V (I ∩ J) ⊂ V (I · J): this follows from the inclusion of ideals I · J ⊂ I ∩ J.
(c) V (I · J) ⊂ V (I) ∪ V (J): let a ∈ V (I · J). In particular f (a) · g(a) = 0 for all f ∈ I
and g ∈ J. Suppose a ∈ / V (J). Then we must show a ∈ V (I). As a ∈ / V (J) there
exists a g ∈ J with g(a) 6= 0. But as f (a)g(a) = 0 for all f ∈ I, it follows that
f (a) = 0 for all f ∈ I. This gives a ∈ V (I).
P
(7)PAs the ideal generated by ∪α∈B Iα is exactly α∈B Iα , it follows that V (∪α∈B Iα ) =
V ( α∈B Iα ). The equality V (∪α∈B Iα ) = ∩α∈B V (Iα ) is easy, and therefore left to the
reader.
(8) If f vanishes on V ∪ W , it must vanish on both. The inclusion I (V ∪ W ) ⊂ I (V ) ∩
I (W ) follows. Suppose on the other hand that f ∈ I (V ) ∩ I (W ), and let a ∈ V ∪ W .
Then either a ∈ V or a ∈ W . In any case, for f ∈ I (V ) ∩ I (W ) it follows that f (a) = 0.
(9) The inclusion I ⊂ I (V (I)) is a√tautology. As I (V (I)) is a radical ideal by the first
part of the lemma, it follows that I ⊂ I (V (I)).
Examples 2.2.3.
(1) Consider I = (xy,xz,yz). It is not too difficult to show, see Exercise 2.2.30, that
I = (x,y) ∩ (x,z) ∩ (y,z). Hence X := V (I) consist of the union of the coordinate
axes.
On the other hand, X is also the zero set of the ideal I ′ := (x,y) · (x,z) · (y,z) =
(x2 y,x2 z,xyz,xz 2 ,xy 2 ,yz 2 ,y 2 z). This ideal is smaller than I and, therefore, is not
radical. This can be seen immediately. For example, xy ∈ / I ′ but (xy)2 = y · x2 y is
′
in I . This is quite typical. Products of radical ideals in general are not radical.
2.2 Affine Varieties 57
(2) Consider I1 = (z,y 2 − x3 ), and I2 = (x,y), and X1 = V (I1 ),X2 = V (I2 ). Then
the union X = X1 ∪ X2 is the zero set of the ideal I = I1 · I2 = xz,yz,x(y 2 −
x3 ),y(y 2 − x3 ) . Again this ideal is not radical. Indeed, y 2 − x3 ∈
/ I, but (y 2 − x3 )2 =
2 3 2 2 3 2 3
√
y · y(y − x ) − x · x(y − x ) ∈ I, so that y − x ∈ I.
We will give several proofs of the Nullstellensatz in this book. For still other proofs,
we refer to the book of Eisenbud [Eisenbud 1995]. Hilbert’s Nullstellensatz is a gener-
alization of Study’s Lemma, see Exercise 2.2.31. Why is it called the Nullstellensatz
(German for Theorem of Zeros)? Well, let I $ K[x1 , . .. ,xn ]. Then it follows from the
Nullstellensatz that V (I) 6= ∅: indeed, otherwise I V (I) = I (∅) = K[x1 , . . . ,xn ], hence
√
I = (1). Therefore for some n we have 1 = 1n ∈ I and, therefore, I = K[x1 , . . . ,xn ], a
contradiction. Hence from the Nullstellensatz it follows:
Theorem 2.2.7 (Weak Nullstellensatz).
Let I $ K[x1 , . . . ,xn ] be an ideal. Then V (I) 6= ∅.
In fact, one is able to show via the so-called Rabinowitch trick, see Exercise 2.2.32,
that conversely, the weak Nullstellensatz implies the Nullstellensatz. In this reduction,
both the number of variables and the number of equations grow by one! In particular
we get an alternative proof of Study’s Lemma, by following the lines in this proof of
the Nullstellensatz, in which one would have to prove the weak Nullstellensatz for two
functions. We will prove the Weak Nullstellensatz by using the Projection Theorem,
which we treat now.
Theorem 2.2.8 (Projection Theorem). Consider a nonzero ideal I ⊂ K[x1 , . . . ,xn ].
Suppose that I contains an element f0 which is regular in xn , that is
f0 = xdn + a1 xd−1
n + . . . + a0
for certain elements ai ∈ K[x1 , . . . ,xn−1 ]. Let p : K n −→ K n−1 be the map which
sends (x1 , . . . ,xn ) to (x1 , . . . ,xn−1 ). Define moreover I ′ := I ∩ K[x1 , . . . ,xn−1 ], and put
X ′ = V (I ′ ). Then
p(X) = X ′ .
Proof. The inclusion p(X) ⊂ X ′ is the easy inclusion. Indeed if a ∈ p(X), then there
exists a b ∈ K with (a,b) ∈ X. Let f ∈ I ′ . Then f ∈ I and thus f (a,b) = 0. But f does
not depend on xn so that f (a) = 0.
For the inclusion X ′ ⊂ p(X) we will use the Nullstellensatz for the case n = 1.
This case is easy, as K[x] is a principal ideal domain, and every f has a root, since by
assumption K is algebraically closed. Assume that a ∈ / p(X). We will show that a ∈/ X ′.
′
To prove this, it suffices to find an g ∈ I with g(a) 6= 0. This g is constructed in two
steps.
Step 1. First we will show that for all f ∈ K[x1 , . . . ,xn ] there exists a representation
Now
∃b : b ∈ V (ϕ(I)) ⇐⇒ ∃b : f (a,b) = 0 for all f ∈ I ⇐⇒ a ∈ p(X),
2.2 Affine Varieties 59
f = gf + h′f ,
d−1
X
gf = qf0 + gj xjn .
j=0
Pd−1
If we plug in (a,xn ) we get 0 = q(a,xn )f0 (a,xn ) + j=0 gj (a)xjn . By assumption f0 (a,xn )
is a polynomial in xn of degree d. By the uniqueness statement in division with remainder
it follows that gj (a) = 0 for all j. Now we put hf := h′f +qf0 and we get the representation
(2.3).
Step 2. We apply (2.3) to the functions 1,xn , . . . ,xd−1
n . We get
with gij (a) = 0 for all i,j and hj ∈ I for all j. In matrix notation
1 h0
(Id − gij ) ... = ... .
xd−1
n hd−1
Let B be the adjoint matrix of (Id − gij ). We multiply with B from the left, and it
follows from Cramer’s rule that
1 h0
.. ..
(2.4) det (Id − gij ) . =B . .
xd−1
n hd−1
We define g := det (Id − gij ) ∈ K[x1 , . . . ,xn−1 ]. From the first row of (2.4) we see that
g ∈ I, so g ∈ I ′ . As moreover gij (a) = 0, it follows that g(a) = 1 6= 0. This was our
goal.
Proof of the weak Nullstellensatz. The proof is by induction on n. The case I = (0) is
obvious. Otherwise there exists an f0 ∈ I, for which we may, by the Noether normaliza-
tion for hypersurfaces 2.1.6, assume to fulfill the conditions of the Projection Theorem.
Furthermore I ′ 6= (1), as otherwise I = (1). By induction V (I ′ ) 6= ∅, and thus by the
Projection Theorem p(X), and in particular X = V (I) is nonempty.
For the following theorem we do not need that K is algebraically closed. The state-
ment of the following so-called Noether normalization is a direct generalization of the
Noether normalization for hypersurfaces, see 2.1.7.
60 2 Affine Algebraic Geometry
Theorem 2.2.9 (Noether Normalization). Let K be a field with infinitely many ele-
ments, A = K[x1 , . . . ,xn ]/I a finitely generated K–algebra. Then after a general linear
coordinate change, there exists a number r ≤ n, and an inclusion
K[x1 , . . . ,xr ] ⊂ A,
such that A is a finitely generated K[x1 , . . . ,xr ]–module. If moreover I 6= (0), then r < n.
If V = V (I), we also say that the projection on the first r coordinates
π : V −→ K r
is a Noether normalization of V .
Proof. If I = (0), we can take x1 , . . . ,xr to be x1 , . . . ,xn . Otherwise the proof goes
by induction on n, the case n = 0 being trivial. Let g ∈ I with g 6= 0. By Noether
normalization for hypersurfaces see 2.1.7, after a general (linear) change of coordinates,
we may assume that K[x1 , . . . ,xn ]/(g) is a finitely generated K[x1 , . . . ,xn−1 ]–module.
Because g ∈ I, then certainly K[x1 , . . . ,xn ]/I is a finitely generated K[x1 , . . . ,xn−1 ]/(I ∩
K[x1 , . . . ,xn−1 ])–module. We apply the induction hypotheses to I ∩K[x1 , . . . ,xn−1 ]. After
a general linear coordinate change in x1 , . . . ,xn−1 we have that
K[x1 , . . . ,xr ] ⊂ K[x1 , . . . ,xn−1 ]/ I ∩ K[x1 , . . . ,xn−1 ]
is finitely generated. Composing with the inclusion
K[x1 , . . . ,xn−1 ]/ I ∩ K[x1 , . . . ,xn−1 ] ⊂ K[x1 , . . . ,xn ]/I,
allows us to deduce that K[x1 , . . . ,xr ] ⊂ K[x1 , . . . ,xn ]/I is finitely generated. This proves
the theorem.
Corollary 2.2.10. Let π : V −→ K r be a Noether normalization. Then π is surjective.
Proof. This follows by induction from the Projection Theorem 2.2.8.
Corollary 2.2.11. Hilbert’s Nullstellensatz holds for prime ideals p ⊂ K[x1 , . . . ,xn ].
Proof. As a prime ideal is radical, the Nullstellensatz says that p = I (V (p)) for a prime
√
ideal p. The inclusion ( p = p ⊂ I (V (p)) is easy, see Lemma 2.2.2 (9). For the other
inclusion, consider a Noether normalization
K[x1 , . . . ,xr ] ֒→ K[x1 , . . . ,xn ]/p.
Now let f ∈ I (V (p)) be given. By looking at its class in K[x1 , . . . ,xn ]/p, we see that
there exists an integral equation
f s + a1 f s−1 + . . . + as = 0 ∈ K[x1 , . . . ,xn ]/p, ai ∈ K[x1 , . . . ,xr ].
We take s minimal with this property. From p ⊂ I (V (p)), it follows as ∈ I (V (p)). This
implies that as (p) = 0 for all p ∈ V (p). If p = (p1 , . . . ,pn ) then as (p) = as (p1 , . . . ,pr ).
Using Corollary 2.2.10 we know that V (p) −→ K r is surjective and, consequently, for
all (p1 , . . . ,pr ) ∈ K r there exists a p = (p1 , . . . ,pr ,pr+1 , . . . ,pn ) ∈ V (p) which implies
as (p1 , . . . ,pr ) = 0 for all (p1 , . . . ,pr ) ∈ K r . This implies as ∈ I (K r ) = 0. Hence
f (f s−1 + a1 f s−2 + . . . + as−1 ) ∈ p.
Because p is a prime ideal, and s was to be taken minimal, it follows that f ∈ p. This is
what we had to show.
2.2 Affine Varieties 61
To complete our proof of the Nullstellensatz it remains to show the following lemma.
Lemma 2.2.12. Suppose the Nullstellensatz holds for all prime ideals p ⊂ K[x1 , . . . ,xn ].
Then the Nullstellensatz holds for all ideals I ⊂ K[x1 , . . . ,xn ].
√
Proof. Let I = q1 ∩ . . . ∩ qr be a primary decomposition of I. Let pi = qi for all i. In
particular V (pi ) = V (qi ). Then V (I) = V (p1 ∩ . . . ∩ pr ) = V (p1 ) ∪ . . . ∪ V (pr ) by Lemma
2.2.2 (6). Hence by (8) of the same lemma
X = X1 ∪ . . . ∪ Xr ,
Proof.
(1) Suppose f · g ∈ I (X). Then V (f ) ∪ V (g) = V (f · g) ⊃ V I (X) = X. The last
equality is by 2.2.5. Hence
X = X ∩ V (f ) ∪ X ∩ V (g) ,
This is the reason why, for an ideal I in a Noetherian ring R, the associated primes
of I which are not minimal are called embedded primes.
2.2 Affine Varieties 63
The next thing we do is to give a geometric interpretation of the the ideal quotient.
Theorem 2.2.19. Consider an ideal I ⊂ K[x1 , . . . ,xn ], and let d be an element in
K[x1 , . . . ,xn ]. Let I = q1 ∩ . . . ∩ qr be an irredundant primary decomposition of I. Then
[
V (I : d) = V (qi ).
i:d6∈qi
Proof. From (the easy) Exercise 1.1.15 it follows that I : d = (q1 : d)∩. . .∩(qr : d) so that
V (I : d) = V (q1 : d) ∪ . . . ∪ V (qr : d). Now if d ∈ qi , then qi : d is the whole polynomial
ring. In√particular V (qi : d) is empty. Suppose, on the other hand, that d ∈ / qi . Then
√
qi = qi : d by 1.4.47, so it follows that V (qi : d) = V (qi ). From this the theorem
follows.
For radical ideals I the theorem means that V (I : d) picks out the irreducible
components of V (I) on which the function d does not vanish identically. In mathematical
terms, we can write V (I : d) = V (I) \ V (d). Here the bar means taking the closure in
the Zariski topology. This will be slightly generalized in Exercise 2.2.38.
Examples 2.2.20.
(1) Let V be an irreducible algebraic set, and p a point of V . Let mp be the maximal
ideal of K[V ] of regular functions of V vanishing at the point p. Then K[V ]mp are
those rational functions f on V which have a well-defined value at p.
(3) Let f 6= 0 be an element of K[V ]. Then elements of K[V ]f have a well defined
value on V \ V (f ).
(a) V (m) 6= ∅.
(b) There exists (a1 , . . . ,an ) ∈ K n such that m = (x1 − a1 , . . . ,xn − an ).
2.2.27.
(1) Show that K[V ] is isomorphic to K[x1 , . . . ,xn ]/I (V ).
(2) For a subset F ⊂ K[V ], define V (F ) := {a ∈ V : f (a) = 0 for all f ∈ F }. V (F ) is called
the zero set of F . Show that the properties of Lemma 2.2.2 also holds in this more general
context. Show that the Nullstellensatz holds in the ring K[V ].
(3) Show that V is irreducible if and only if K[V ] is an integral domain.
2.2 Affine Varieties 65
(4) Let a,b ∈ V . Show that there exists a regular function f ∈ K[V ] with f (a) 6= f (b).
(Hint: Reduce to the case V = K n . )
2.2.28. We proved hat V (I + J) = V (I) ∩ V (J). It is not true in general that I (V ∩ W ) =
I (V ) + I (W ). Give a counterexample.
2.2.29.
(1) Let I ⊂ K[x1 , . . . ,xn ] be an ideal, such that the vector space K[x1 , . . . ,xn ]/I has finite
dimension s. Show that the set V (I) consists of at most s points.
(Hint: Use the fact that an Artinian algebra is the direct sum of local Artinian algebras,
see 1.4.26.)
(2) Suppose moreover that I is radical. Show that V (I) consists of exactly s points.
(3) Show that for a Noether normalization π : X → K k , with s the degree of the corresponding
field extension K(x1 , . . . ,xk ) ⊂ K(X) = Q(K[X]), the number of points in a fiber is at
most s.
2.2.30.
(1) Prove that (x,y)∩(x,z)∩(y,z) is an irredundant primary decomposition of I = (xy,xz,yz).
(2) Prove that (z,y 2 − x3 ) ∩ (x,y) = (yz,xz,y 2 − x3 ).
2.2.31. Prove that Study’s Lemma is a consequence of Hilbert’s Nullstellensatz.
2.2.32. Show that the Nullstellensatz follows from the weak Nullstellensatz by using the so-
called Rabinowitch
` ´ trick which goes as follows.
Take f ∈ I V (I) , and consider the polynomial ring K[x1 , . . . ,xn ,y]. Take the ideal J generated
by I and yf − 1. Show that V (J) = ∅. Apply the weak Nullstellensatz to conclude that 1 ∈ J.
Then use the fact that modulo J, the equality y = f1 holds.
2.2.33. Consider affine varieties,
where f1 , . . . ,fs ∈ K[x1 , . . . ,xn ], g1 , . . . ,gt ∈ K[y1 , . . . ,ym ]. Show that the Cartesian product
X × Y is also an affine variety, as zero set of the ideal
where the fi and gi are interpreted as elements of K[x1 , . . . ,xn ,y1 , . . . ,ym ].
2.2.34. Suppose m,n > 0. Prove that the Zariski topology of K n × K m is not equal to the
product of the Zariski topologies of K n and K m .
2.2.35. Let V be an algebraic set. Let mk = (x1 , . . . ,xk ) ⊂ K[x1 , . . . ,xk ], and let m be a maximal
ideal of K[V ]. A Noether normalization of K[V ]m , by definition is an inclusion
for some k, such that K[V ]m is a finitely generated K[x1 , . . . ,xk ]m k –module. Show that after a
general coordinate change, a Noether normalization of K[V ]m exists.
2.2.36.
(1) Let V be an irreducible topological space, f : V −→ W be continuous. Show that f (V ) is
irreducible.
(2) Let V ⊂ W be topological spaces. Suppose that V is irreducible. Show that the closure V
of V in W is also irreducible.
2.2.37. In this exercise we look at the Going-Up Theorem in the special case that p is a maximal
ideal.
66 2 Affine Algebraic Geometry
(1) Let R ⊂ S be an integral extension of integral domains. Show that R is a field if and only
if S is a field.
(2) Let R ⊂ S be an integral extension of rings. Let m be a maximal ideal in R, and q ⊂ S
be a prime ideal with q ∩ R = m. Prove that q is a maximal ideal.
Give geometric interpretations of these statements.
2.2.38. Let I,J be ideals in the polynomial ring K[x1 , . . . ,xn ]. Suppose I is radical. Prove that
V (I : J) = V (I) \ V (J).
In this section we look at the morphisms we allow between algebraic sets. As an algebraic
set is defined by polynomial equations, it is a natural idea to allow between them only
polynomial mappings. The simple case of regular functions, that is, regular maps V −→ K
has already been considered in 2.2.21.
Definition 2.3.1. Let V ⊂ K n and W ⊂ K m be affine varieties. A map ϕ : V −→ W
is called a regular map between the affine varieties V and W if ϕ is the restriction of a
polynomial map ϕ : K n −→ K m . Regular maps V −→ K are called regular functions.
This means the following: suppose (x1 , . . . ,xn ) are coordinates on K n , and (y1 , . . . ,ym )
are coordinates on K m . Then there exist functions ϕ1 , . . . ,ϕm ∈ K[x1 , . . . ,xn ], such
that the point with coordinates (x1 , . . . ,xn ) ∈ V is sent to the point with coordinates
(y1 , . . . ,ym ) = ϕ1 (x1 , . . . ,xn ), . . . ,ϕm (x1 , . . . ,xn ) ∈ W .
Definition 2.3.2. An isomorphism ϕ : V −→ W is a regular map which has a two-sided
inverse. Two affine varieties V,W are called isomorphic if there exists an isomorphism
ϕ : V −→ W between them. Of course, being isomorphic is an equivalence relation.
It is easy to check that the composition of two polynomial maps is a polynomial map.
This in particular gives the following. Let ϕ : V −→ W be a regular map. Let f ∈ K[W ],
ϕ f
that is, f : W −→ K be a regular function. Then the composition f ◦ ϕ : V −→ W −→ K
is a regular function too. Hence we get a map
ϕ∗ : K[W ] −→ K[V ]; f 7→ ϕ ◦ f
α
The composition K[y1 , . . . ,ym ] −→ K[W ] −→ K[V ] we also denote by α. Consider the
“coordinate” functions y1 , . . . ,ym ∈ K[y1 , . . . ,ym ]/I (W ). Then α(yi ) = ϕi (x1 , . . . ,xn ) ∈
K[V ] for some ϕi . Take lifts ϕi ∈ K[x1 , . . . ,xn ] of ϕi . Then ϕ := (ϕ1 , . . . ,ϕm ) is a
polynomial map ϕ : K n −→ K m . As both α and ϕ∗ are K–algebra maps, and they
agree on yi , it follows that they agree on all polynomials in the yi . Hence α = ϕ∗ :
K[y1 , . . . ,ym ] −→ K[V ]. So, uniqueness is clear. As different lifts differ by elements in
I (V ) the restriction of ϕ to V is well-determined. We need to show
ϕ(V ) ⊂ W.
i∗ j∗
ϕ∗
K[Y ] // K[X].
It follows that ϕ∗ I (Y ) ⊂ I (X).
(2) We apply the first part to X = ϕ−1 (Y ). We get
ϕ∗ I (Y ) ⊂ I ϕ−1 (Y ) ,
68 2 Affine Algebraic Geometry
∗
so that, by taking zero
sets, V ϕ I (Y ) ⊃ ϕ−1 (Y ). Toprove the other
inclusion,
let p ∈ V ϕ I (Y ) . Then mp = I (p) ⊃ I V ϕ∗ I (Y ) ⊃ ϕ∗ I (Y ) . Therefore
∗
ϕ(p) ∈ Y follows from the first part applied to X = {p}. Hence p ∈ ϕ−1 (Y ) as was to be
proved.
(3) From the second part it follows that inverse images of Zariski closed sets are Zariski
closed. Thus, ϕ is continuous.
Example 2.3.6. Let V = V (I), where I = (y 2 − x) ⊂ K[x,y]. Consider the projection
ϕ : V −→ W = K, the projection on the x–axis. Then the map ϕ∗ is just the inclusion
map ϕ∗ : K[x] ⊂ K[x,y]/(y 2 −x). We take the subset Y of W = K defined by x(x−1) = 0,
so consisting of the points 0 and 1. Hence the fiber is defined by the ideal (x(x−1),y 2 −x))
in K 2 so consists of three points, namely (0,0),(1,1) and (1, − 1).
0 1
Lemma 2.3.7. Let a regular map ϕ : V −→ W be given, and let ϕ∗ : K[W ] −→ K[V ]
be the induced map on coordinate rings. Then
ϕ is injective ⇐⇒ ϕ∗ is surjective.
Proof. First suppose that ϕ∗ is surjective. Suppose that a,b ∈ V with ϕ(a) = ϕ(b). Then
for all g ∈ K[W ], it follows that ϕ∗ (g)(a) = g ◦ ϕ(a) = g ◦ ϕ(b) = ϕ∗ (g)(b). Suppose
a 6= b. Choose an f ∈ K[V ] with f (a) 6= f (b). This we can do by Exercise 2.2.27. Because
ϕ∗ is surjective, there is a g ∈ k[W ] with f = ϕ∗ (g). This is a contradiction.
For the converse, suppose ϕ is injective. Hence V can be viewed as a subset of W :
V ⊂ W ⊂ K n . Hence I (V ) ⊃ I (W ), so that K[W ] = K[x1 , . . . ,xn ]/I (W ) −→ k[V ] =
K[x1 , . . . ,xn ]/I (V ) is surjective.
The corresponding statement with injective and surjective interchanged, is not true,
by the following example which we have seen before.
Example 2.3.8. Consider V = {(x,y) ∈ K 2 : xy − 1 = 0}, let W be the x–axis, and
ϕ : V −→ W be the projection. Then ϕ∗ is the inclusion K[W ] = K[x] ⊂ K[V ] =
K[x,y]/(xy − 1) = K[x,x−1 ]. However, ϕ is not surjective: it misses the point 0.
Note that in this example, K[V ] is not a finitely generated K[W ]–module. If we
insist however on this, the converse holds. The following therefore is a generalization of
the fact that for a Noether normalization π : V −→ K r , the map π is surjective, and
that the fibers consist of finitely many points. The proof is left as Exercise 2.3.17.
2.3 Maps between Algebraic Sets 69
Proof.
The implication (2) =⇒ (3) is easy, see Exercise 2.2.36.
(3) =⇒ (1). Take f,g ∈ K[W ]. Then we have the following two equivalences.
f = g ⇐⇒ f|ϕ(V ) = g|ϕ(V ) ⇐⇒ f ◦ ϕ = g ◦ ϕ.
The second equivalence is clear. For the first, the only not obvious implication is ⇐=. As
regular functions are continuous, it follows that (f − g)−1 (0) is closed. By assumption,
this set contains ϕ(V ), and therefore contains ϕ(V ) = W . Hence f = g on the whole of
W.
70 2 Affine Algebraic Geometry
The implication (1) =⇒ (2) is the “difficult” case. We have to prove that W r V (f ) ⊂
ϕ(V ) for some 0 6= f ∈ K[W ]. The idea is to find a nonzero f ∈ K[W ] ⊂ K[V ] and an
r such that there exists an inclusion K[W ]f [x1 , . . . ,xr ] ⊂ K[V ]f , such that K[V ]f is a
finitely generated K[W ]f [x1 , . . . ,xr ]-module. Indeed, this inclusion gives
• a finite map, hence surjective map ϕ e : V \ V (f ) −→ W r V (f ) × K r ,
• a projection map π : W rV (f ) ×K r −→ W rV (f ), which is obviously surjective,
R ֒→ S
↓ ↓
E = Q(R) ֒→ SRr{0} ⊂ Q(S).
R[x1 , . . . ,xr ] ⊂ S.
This need not be a finitely generated extension. But we can do the following. Take gen-
erators α1 , . . . ,αs of SRr{0} as E[x1 , . . . ,xr ]–algebra. Again we may assume that in fact
αi in S. The elements αi satisfy an integral equation with coefficients in E[x1 , . . . ,xr ] =
Q(R)[x1 , . . . ,xr ]. Let f be the product of all the denominators that appear as coefficients
in all such equations. Therefore, over Rf [x1 , . . . ,xr ] every αi is integral. We therefore
forced that the extension
Rf [x1 , . . . ,xr ] ⊂ Sf
is integral. This was our goal.
Theorem 2.3.13. Consider an ideal I ⊂ K[x1 , . . . ,xn ,y1 , . . . ,ym ], and let V = V (I) ⊂
K n+m . Let p : K n+m −→ K n be the projection on the first n factors. Then
K[x1 , . . . ,xn ]/(I ∩ K[x1 , . . . ,xn ]) ⊂ K[x1 , . . . ,xn ,y1 , . . . ,ym ]/I,
so that p(V ) is indeed given by the zero set of the ideal I ∩ K[x1 , . . . ,xn ] by 2.3.12.
Remark 2.3.14. Let ϕ = (ϕ1 , . . . ,ϕm ) : V −→ W ⊂ K m be a regular map, V = V (I).
Consider the graph of ϕ:
Graph(ϕ) = {(x,ϕ(x)) : x ∈ V } ⊂ V × W ⊂ V × K m .
gives the map Graph(ϕ) −→ W , which maps (x,ϕ(x)) to ϕ(x). We come to the conclusion
that every map ϕ : V −→ W can be viewed as the restriction of a projection mapping.
This is a point of view which is sometimes useful. For example its local version, see 3.4.43
will be used in the proof of the Finite Mapping Theorem, see 6.3.5.
Example 2.3.15. Consider the regular map ϕ : K −→ K 2 given by ϕ(t) = (t2 ,t3 ). Let
the coordinates of K 2 be x,y. Then the graph of ϕ lies in K 3 and is the zero set of the
ideal I = (x − t2 ,y − t3 ) So the closure of the image of f is the zero set of the ideal
I ′ = (x − t2 ,y − t3 ) ∩ K[x,y] by 2.3.14 and 2.3.13. One shows that I ′ = (y 2 − x3 ). The
proof of this is contained in Example 1.5.6.
Finally, we give a counterexample to the Going-Down Theorem for nonnormal rings.
Example 2.3.16. Let K be an algebraically closed field. Consider the map
ϕ : K2 −→ K 3
(t,s) 7−→ (x,y,z) = (t2 − 1,t3 − t,s).
p p′
X′
X
72 2 Affine Algebraic Geometry
2.3.18.
(1) Consider the map f : K 2 −→ K 4 given by f (s,t) = (s,st,t2 ,t3 ) = (x,y,z,u). Show that the
ideal of the image is equal to I ′ = (y 2 − zx2 ,yz − ux,z 2 x − uy,u2 − z 3 ).
(Hint: Show that any g is modulo I ′ equivalent to ay + bu + c with a,c ∈ K[x,z] and
b ∈ K[x].)
(2) Consider the map f : K −→ K 3 , f (t) = (t3 ,t4 ,t5 ) = (x,y,z). Show that the image of f is
given by the zero set of the ideal (y 2 − xz,x3 − yz,z 2 − x2 y).
(3) Use a computer algebra system, for example [Singular 2000] to check the results in (1)
and (2).
73
In this chapter we start studying local analytic geometry, that is, the zero sets of analytic
functions in a (small) neighborhood of a point. To see why we want to do so, we look
at the affine hypersurface in C 2 defined by f (x,y) = y 2 − x2 (x + 1) = 0. In a small
neighborhood U of (0,0) we see two “parts”, or components of f (x,y) = 0.
So we would like to say that the “singularity” consists of two components. But how small
should we take U ? The answer to this question is: as small as we need to. To make a
precise notion, we define the notion of a germ of a space. A germ of a topological space
X at a point p by definition is an equivalence class of open neighborhoods of p. But f ,
considered as element of C [x,y] is irreducible (Exercise 1.4.34), but reducible if considered
as an element of C {x,y}. Here C {x,y} is the ring of power series which converge in some
neighborhood of 0.1 Indeed, we have the following factorization in C {x,y}
√ √
f (x,y) = y − x x + 1 y + x x + 1 .
√
So we see that in a neighborhood on which √ x + 1 converges, the zero set √ f = 0 decom-
poses into two parts, namely {(x,y) : y = x x + 1} and {(x,y) : y = −x x + 1}.
As we assume that the reader heard a course on holomorphic functions in one vari-
able, we will, in Section 3.1, treat only the basic facts on holomorphic functions in several
variables. Many of the statements and the proofs are similar or direct generalizations of
the one variable case.
For polynomials f ∈ C [x1 , . . . ,xn ] one of the main invariants is the degree. It can
be interpreted (in case f is reduced) as the number of intersection points of the set {p ∈
C n : f (p) = 0} with a general line. By applying the Noether Normalization Theorem, one
may assume that this line is given by x1 = a1 , . . . ,xn−1 = an−1 . In case of convergent
power series f , its replacement is the multiplicity or order of the power series, and is
defined as the lowest order term of f . Assuming that f converges on the (small enough)
neighborhood U of 0, the multiplicity can be interpreted as the maximal number of
intersection points of f = 0 with a general line L in U , in case f is reduced. This
1 The neighborhood might depend on the power series.
74 3 Basics of Analytic Geometry
statement follows from the so-called continuity of roots. So for example the element
f (x,y) = y 2 − x2 (x + 1) has degree three considered as element of C [x,y], and has
multiplicity two considered as element of C {x,y}. There is a substitute for the Noether
normalization for the case f ∈ C {x1 , . . . ,xn }, so that we can always assume that the line
L from above points in the xn direction. In terms of the power series f of multiplicity b,
this is equivalent to saying that the term xbn occurs with nonzero coefficient in the power
series expansion of f . In this case one says that f is regular of order b in xn .
The replacement for division with remainder, which played an important role in
the proof of Study’s Lemma, or more generally the Nullstellensatz, is the Weierstraß
Division Theorem. The Weierstraß Division Theorem says the following: Let a convergent
power series f ∈ C {x1 , . . . ,xn } be regular of order b in xn , and a further power series
g ∈ C {x1 , . . . ,xn } be given. Then there exist uniquely defined q ∈ C {x1 , . . . ,xn } and
r ∈ C {x1 , . . . ,xn−1 }[xn ], where r has, considered as polynomial in xn , degree less than
b, such that
g = q · f + r.
The similarity with ordinary division with remainder is obvious. It follows from the
Weierstraß Division Theorem, that the C {x1 , . . . ,xn−1 }–module C {x1 , . . . ,xn }/(f ) is
finitely generated.2 As the ring C {x1 , . . . ,xn−1 } is local, it even follows by an application
of Nakayama’s Lemma, that the Weierstraß Division Theorem follows from the fact that
C {x1 , . . . ,xn }/(f ) is a finitely generated C {x1 , . . . ,xn−1 }–module. In Section 3.2 we will
prove a weak generalization of this seemingly weaker statement. If one looks closely at
the proof, we will “only” show the existence of q and r where r has degree less than or
equal to b. It turns out that in order to show the convergence of q and r, it is much easier
to prove this last statement.
In Section 3.3, we give applications of the Weierstraß Theorems. To mention the
most important ones, we show the Implicit and Inverse Function Theorems and New-
ton’s Lemma (a “cooked-up” version of the Implicit Function Theorem). Furthermore, we
show that the (formal) power series ring is Noetherian and factorial, prove the Noether
Normalization Theorem, and Hensel’s Lemma.
With this knowledge at hand we can start the study of germs of analytic spaces in
Section 3.4. The main theorem, and the most difficult in this section is the Nullstellensatz.
In the case of germs of analytic spaces, this theorem cannot be interpreted as a statement
on maximal ideals. We will prove the analogous statement to Corollary 2.2.10. That is,
let a germ of an analytic space (X,x) be given for which the ideal of (X,x) is prime, and
let π : (X,x) −→ (C k ,y) be a Noether normalization. Then π is surjective. As in the
affine case, the Nullstellensatz follows from this surjectivity. An essential ingredient in
the proof of the surjectivity of π is the Theorem of Finiteness of Normalization 1.5.19.
It will be used to describe the properties of a general projection of an irreducible germ
(X,x) onto a hypersurface singularity. The precise properties of this projection of X
onto a hypersurface are gathered in the Local Parametrization Theorem. This is a very
important theorem, as it, in many cases, allows the reduction of general statements
on germs of analytic spaces to the case of hypersurfaces, which are sometimes easier
to handle. The Local Parametrization Theorem will be used several times in the next
chapter.
2 This module is even free.
3.1 Holomorphic Functions of Several Complex Variables 75
Before going on with geometry, we need some basic facts on holomorphic functions of
several complex variables. The facts we mention here are either proved, or the proof can
be easily generalized from the case of a function in one variable. In this case, no proof is
given.
Definition 3.1.1. Let n ∈ N, and νi , for i = 1, . . . ,n be nonnegative integers, x =
(x1 , . . . ,xn ) ∈ C n . We introduce the multi index notation
n
Y n
X
ν := (ν1 , . . . ,νn ); xν := xνi i ; |ν| := νi .
i=1 i=1
The number |ν| is called the degree of ν. Let r = (r1 , . . . ,rn ) ∈ Rn+ , ri > 0 for i = 1 . . . ,n.
The set
P r := {x ∈ C n : |xi | ≤ ri for i = 1, . . . ,n}
is called the closed polydisc (or polycylinder) with center {0} and polyradius r. Similarly
one defines open polydiscs Pr , and polydiscs with center p ∈ C n .
Definition 3.1.2. Let U ⊂ C n be a domain3 , f : U −→ C a function, p ∈ U . Then f is
called complex differentiable in p, if there exists an open neighborhood V ⊂ U of p and
functions ∆1 , . . . ,∆n : V −→ C which are continuous in p such that for all x ∈ V
n
X
f (x) = f (p) + (xi − pi )∆i (x).
i=1
is called a formal power series in the variables x1 , . . . ,xn around p0 . With the
obvious addition and multiplication the set of formal power series is a ring, and
even an C –algebra. The ring of formal power series around p0 = 0 is denoted by
C [[x1 , . . . ,xn ]] or sometimes C [[x]] for short, and is called the formal power series
ring.
3 This means that U is an open and P connected subset.
4 This is an alternative notation for ν∈Nn aν (p − p0 )ν .
76 3 Basics of Analytic Geometry
P∞
(2) Let p ∈ C n , and c ∈ C . One says that the formal power series ν=0 aν (x − p0 )ν
converges in p with limit c if for all ε > 0 there exists a finite set Iε ⊂ Nn such that
for all finite sets I with Iε ⊂ I ⊂ Nn
X
aν (p − p0 )ν − c < ε.
ν∈I
P ν
Note that P ν∈I aν (p − p0 ) makes sense because the summation is over a finite set.
∞
In case that ν=0 aν (x − p0 )ν converges in p the number c is uniquely determined,
and is called the value of the formal power series at p. We, therefore, simply write
P ∞ ν
ν=0 aν (p − p0 ) = c.
P∞ ν
P∞ formal powerν series ν=0 aν (x − p0 ) is called absolutely convergent in p if
The
ν=0 | aν (x − p0 ) | is convergent. An absolutely convergent power series is in fact
convergent.
Let U ⊂ C n , and f : U −→ C be a function. One says that the formal power series
(3) P
∞
aν (x − p0 )ν converges on U with limit f if for all p in U the formal power
ν=0 P
∞
series ν=0 aν (x − p0 )ν converges in p with limit f (p).
n
P∞U ⊂ C , andν f : U −→ C be a function. One says that the formal power
(4) Let again
series ν=0 aν (x − p0 ) converges uniformly to f on U if for all ε > 0 there exists
a finite set Iε ⊂ Nn such that for all finite sets I with Iε ⊂ I ⊂ Nn and for all p ∈ U
X
ν
aν (p − p0 ) − f (p) < ε.
ν∈I
f = fm + fm+1 + . . .
f = fm + fm+1 + · · ·
3.1 Holomorphic Functions of Several Complex Variables 77
The first thing we want to show is that analytic functions are holomorphic. In order
to show this, we need the following lemma.
P∞
Lemma 3.1.4. Consider a formal power series ν=0 aν xν , which converges at p ∈ C n .
Let p = (p1 , . . . ,pn ) and suppose that pi 6=P0∞for all i. Let the polyradius r be defined by
ri = |pi |. Then the formal power series ν=0 aν xν converges absolutely and uniformly
on all compact subsets of the open poly-disc Pr .
P
Proof. From the convergence of the series aν pν it follows that the set {aν pν } is
bounded, that is, there exists an M ∈ R with |aν pν | = |aν |rν < M . Consider a real
number q with 0 < q < 1. By definition of Pqr for all x ∈ Pqr it follows that |xν | < q |ν| rν .
Hence for all x ∈ Pqr
|aν xν | < |aν |q |ν| rν < M · q |ν| .
But n
X 1
|ν|
M q =M ,
ν
1−q
see Exercise 3.1.19. The set Nn is countable, hence there exists
n a bijection α : N −→
n α(k)
P∞ 1
N . Put bk (x) := aα(k) x . Hence, k=0 |bk (x)| ≤ M 1−q , so that this series is
P∞
increasing and bounded. It follows that k=0 bk (x) converges absolutely and uniformly
on Pqr so that
∞
X
∀ε > 0 there exists nε ∈ N with |bk (x)| < ε for all x ∈ Pqr .
k=nε
This shows that the series converges absolutely and uniformly in Pqr . We still have to
show that the series converges on all compact subsets K in Pr . But Pqr for 0 < q < 1
is an open cover of K. As K is compact we have a finite subcover Pq1 r , . . . Pqs r . Let
q = max{q1 , . . . ,qs }. Hence K ⊂ Pqr , and thus the series converges uniformly on K.
Remarks 3.1.5.
The fact that f is analytic is equivalent to this property. We could rephrase P this
as follows. For all δ > 0 we put a seminorm5 on C [[x1 , . . . ,xn ]]. Let f = fα xα ∈
C [[x1 , . . . ,xn ]] be a formal power series. Then we put for δ > 0
X
kf kδ := |fα |δ |α| ∈ R+ ∪ {∞}.
α
It follows from the proof of Lemma 3.1.4 that f is convergent exactly when there
exists a δ > 0 with kf kδ < ∞.
(2) Note that if kf kδ < ∞, and 0 < t < 1, then kf ktδ < ∞. If, moreover, f ∈
(x1 , . . . ,xn ) an easy estimate shows that kf ktδ ≤ t · kf kδ .
P∞ k
(3) Let f = k=0 fk xn be a convergent power series, with fk ∈ C [[x1 , . . . ,xn−1 ]].
From the absolute convergence of f in some domain in C n it follows that fi are
absolute convergent in some domain of C n−1 , see Exercise 3.1.23. It follows that
fk ∈ C {x1 , . . . ,xn−1 }.
Theorem 3.1.6. Let U ⊂ C n be open, f : U −→ C be analytic. Then f is holomorphic.
Proof.
P Without loss of generality, it suffices to show that f is holomorphic in 0. Let
ν
ν a ν x be a power series that converges on an open neighborhood V of 0 to f . We
write
X X
aν xν = a0...0 +x1 aν x1ν1 −1 xν22 · · · xνnn
ν ν1 6=0
X
+x2 aν x2ν2 −1 · · · xνnn
ν,ν2 6=0
ν1 =0
..
.
X
+xn aν xνnn −1
ν,νn 6=0
ν1 =···=νn−1 =0
= a0...0 +x1 ∆1 + . . . + xn ∆n .
P
Take a point p = (p1 , . . . ,pn ) for which ν aν pν is absolutely convergent. Such a point
exists by Lemma 3.1.4. We P may assume that all the pi are nonzero. It follows from
the absolute convergence of ν aν pν that each of the sub series pi ∆i (p) are absolute
convergent. As pi 6= 0, the series ∆i (p) are convergent. Applying Lemma 3.1.4, it follows
that the ∆i for i = 1, . . . ,n converge absolutely and uniformly on small neighborhoods of
0. From a standard theorem in analysis it then follows that the ∆i are continuous.
Theorem 3.1.7 (Osgood’s Lemma). Let U be an open set of C n , and f : U −→ C be
a continuous function. The following conditions are equivalent.
(1) f is analytic.
(2) f is holomorphic.
5 A seminorm is a map C [[x1 , . . . ,xn ]] → R ∪ {∞} satisfying all conditions of a norm (with the obvious
interpretation a + ∞ = ∞).
3.1 Holomorphic Functions of Several Complex Variables 79
Proof. We already showed that (1) implies (2) in Theorem 3.1.6. As (2) implies (3)
is trivial, we only have to show (3) implies (1). Take a point p ∈ U . Without loss of
generality, we may assume that p = 0. Let P r be a closed poly-disc contained in U for
r = (r1 , . . . ,rn ). Take a point (ζ1 , . . . ,ζn−1 ) with |ζi | ≤ ri . The function
f (x1 , . . . ,xn ) =
n Z Z Z
1 1 1 f (ζ)
· · · dζn dζn−1 · · · dζ1 .
2πi ζ1 − x1 ζ2 − x2 ζn − xn
|ζ1 |=r1 |ζ2 |=r2 |ζn |=rn
For fixed (x1 , . . . ,xn ) the integrand is continuous on T := {(ζ1 , . . . ,ζn ) : |ζi | = ri for
i = 1, . . . ,n}, which is a compact set. By Fubini
n Z
1 f (ζ)
f (x1 , . . . ,xn ) = dζ1 · · · dζn .
2πi (ζ1 − x1 ) · · · (ζn − xn )
T
It follows from Exercise 3.1.19 that for fixed (x1 , . . . ,xn ) the series
∞
X xν1 · · · xνn
1 1 n
=
(ζ1 − x1 ) · · · (ζn − xn ) ν=0 ζ1ν1 +1 · · · ζnνn +1
n Z
1 f (ζ)
f (x1 , . . . ,xn ) = dζ1 · · · dζn
2πi (ζ1 − x1 ) · · · (ζn − xn )
T
∞
X n Z
1 f (ζ)
= xν11 · · · xνnn · dζ1 · · · dζn
ν=0
2πi ζ1ν1 +1· · · ζnνn +1
T
∞
X
= aν xν , with
ν=0
n Z
1 f (ζ)
aν = dζ1 · · · dζn .
2πi ζ1ν1 +1 · · · ζnνn +1
T
Remarks 3.1.8.
(1) It follows that the convergent power series is a ring. Indeed, if f,g ∈ C {x1 , . . . ,xn },
then choose an open neighborhood U of 0 on which f converges, and an open
neighborhood V of 0 on which g converges. Then f and g are holomorphic on
U ∩ V , so that f ± g and f · g are holomorphic on V , so define an analytic function.
This fact could also have been proved directly, by using only Lemma 3.1.4. Note
moreover that C {x1 , . . . ,xn } is a local ring with maximal ideal (x1 , . . . ,xn ). Indeed,
for convergent power series f not in (x1 , . . . ,xn ) one has f (0) 6= 0. It follows that
1
f is also holomorphic in a neighborhood of 0, and therefore, by Osgood’s Lemma,
again is a convergent power series. So the elements of the complement of (x1 , . . . ,xn )
are units, and it follows from 1.3.3 that the power series ring is local.
(2) Let U ⊂ C n be open. A map F = (f1 , . . . ,fm ) : U −→ C n is called holomorphic
if each of the fi is holomorphic. If F (U ) ⊂ W ⊂ C m , and G : W −→ C p is
a holomorphic map, then the chain rule, see Exercise 3.1.22 gives that G ◦ F is
holomorphic. This in particular gives the following. Let g1 , . . . ,gn ∈ C {y1 , . . . ,ym },
with gi (0) = 0. Let f ∈ C {x1 , . . . ,xn }. Then f (g1 , . . . ,gn ) ∈ C {y1 , . . . ,ym }.
Theorem 3.1.9 (Identity Theorem). Let U ⊂ C n be a domain. Let f : U −→ C be a
holomorphic function, such that the restriction of f to a nonempty open subset V ⊂ U
is the zero map. Then f is identically zero on U .
Remark 3.1.10. In the case n = 1 we have a stronger version of the Identity Theorem,
as follows.
Let U ⊂ C be a domain and f : U −→ C be a holomorphic function. Let {xk } be
a convergent sequence, xk ∈ U and limk→∞ xk ∈ U . If f (xk ) = 0 for all k, then f is
identically zero on U .
Indeed, suppose lim xk = 0, and f not identically zero in a neighborhood of 0. Then
f = xb · u, where u(0) 6= 0. Therefore, for p with |p| small, it follows that u(p) 6= 0, and
therefore, f (p) 6= 0. The zeros of f therefore lie isolated. This is a contradiction, and
therefore, f vanishes in a neighborhood of 0.
One can reduce the proof of the Identity Theorem to the Identity Theorem in the
one variable case. The idea is the same as in the proof of the maximum principle, which
comes next. The proof of the Identity Theorem 3.1.9 is therefore left as Exercise 3.1.24.
3.1 Holomorphic Functions of Several Complex Variables 81
(1) A set or space X ⊂ C n is called locally analytic, if for any point p ∈ X, there exists
an open subset V of p in C n , and finitely many holomorphic functions f1 , . . . ,fs
defined on V such that
(1) In this definition, we have to be more careful than in the affine case. Polynomials
are always globally defined. But holomorphic functions are usually only defined
in small open neighborhoods, so this is the reason why we have to choose open
neighborhoods V .
(2) The set X := {(x,y) : y = 0, Im(x) ≥ 0} in C 2
82 3 Basics of Analytic Geometry
is not locally analytic. Indeed, let V be a connected open neighborhood of {0}, and
f be a function on V with
Then we can restrict f to the set V ∩ {y = 0}. By the Identity Theorem in one
variable, f must vanish identically on {y = 0} ∩ V . As this holds for all f , it follows
that X is not locally analytic.
(3) In the definition of X to be an analytic subset of U , we need the condition X is
closed in U to avoid phenomena as shown in the following picture.
Morally speaking, one wants an analytic subset of U , except for discrete points, to
“reach” the boundary. Even more severely, without the closedness condition, if X
is the shaded part of
X
U
◦
this it would follow from the connectedness of U that X is equal to U , in contradiction
◦ ◦
to the assumption. So let us prove X is closed. Take a point p in the boundary of X
in U . As X is closed in U , the point p is in X. Hence, there exists an open, connected
neighborhood V of p and holomorphic functions f1 , . . . ,fs defined on V such that
X ∩ V = {x ∈ V : f1 (x) = . . . = fs (x) = 0}.
◦ ◦
As p is in the boundary of X, and p is in the open set V , it follows that V ∩ X 6= ∅.
V
p U
o
X
Therefore, for all i = 1, . . . ,s the function fi is identically zero on the (nonempty) open
◦
subset V ∩ X, hence fi is, by the Identity Theorem, identically zero on V . Hence V ⊂ X,
◦ ◦
and p ∈ X. So we proved that X is closed.
Theorem 3.1.15 (First Riemann Extension Theorem). Let U be an open connected
subset of C n , and X ⊂ U an analytic subset. Consider
f : U \ X −→ C
holomorphic. Suppose f is locally bounded, that is, for all p ∈ X there exists an open
neighborhood V of p such that the restriction of f to V \ (V ∩ X) is bounded. Then there
exists a holomorphic extension of f to U . More precisely, there exists a holomorphic
function fe : U −→ C such that fe|U\X = f .
Proof. Let p ∈ X, and Bp an open ball around p. It suffices to extend f to a holomorphic
function fep on Bp . Namely, if Bp ∩ Bq 6= ∅, and if fep ,feq are extensions of f to Bp ,
respectively Bq , they must agree on Bp ∩ Bq , as on W := Bp ∩ Bq ∩ (U \ X) they coincide
with f , and W is not empty because the closure of U \ X in U is U , see Theorem 3.1.14.
So we can use the Identity Theorem 3.1.9 to deduce that fp and fq coincide on Bp ∩ Bq .
Consider a line L through p. Then L ∩ X is an analytic subset of X. As X contains
no inner points, there exists a line not contained in X. As L ∩ X is an analytic subset
of an open subset of L ∼= C , hence locally given by the zero-set of one analytic function,
the points of L ∩ X lie isolated. By an affine change of coordinates, we may assume that
the line L is the xn –axis, given by x1 = . . . = xn−1 = 0, and p is equal to 0. Therefore,
there exists an rn > 0 such that
X ∩ {x = (x1 , . . . ,xn ) : x1 = . . . = xn−1 = 0, |xn | ≤ rn }
consists of just one point, namely 0. As the set {x = (x1 , . . . ,xn ) : x1 = . . . = xn−1 =
0, |xn | = rn } is compact7 , and is contained in the open set U \ X, it follows without much
difficulty that there exists an open neighborhood W of 0 in C n−1 such that W × {|xn | =
rn } ⊂ U \ X. So
7 It is a circle.
84 3 Basics of Analytic Geometry
(3.1) X ∩ (W × {|xn | = rn }) = ∅.
Lζ L
U
X
ζ
p |xn | ≤ rn
W
We consider a point ζ = (ζ1 , . . . ,ζn−1 ) ∈ W fixed. As X is given by the vanishing of
holomorphic functions, and the restriction of such functions to the line Lζ given by x1 =
ζ1 , . . . ,xn−1 = ζn−1 do not all vanish identically because of (3.1), it follows that the set
X∩{(ζ1 , . . . ,ζn−1 )}×{|xn | ≤ rn } has only isolated points. The function f (ζ1 , . . . ,ζn−1 ,xn )
is a holomorphic function in xn , not defined in the finitely many points of X, and it is
locally bounded. Therefore, by the Riemann Extension Theorem in one variable, it has
a holomorphic extension to {(ζ1 , . . . ,ζn−1 )} × {|xn | ≤ rn } and is, in fact, given by the
integral
Z
f (ζ)
(3.2) f (ζ1 , . . . ,ζn−1 ,xn ) = dζn .
ζn − xn
|ζn |=rn
This holds for all (ζ1 , . . . ,ζn−1 ) ∈ W . Now we define fe for all ζ1 , . . . ,ζn−1 ,xn ∈ W ×{|xn | ≤
rn } by the right hand side of equation (3.2). It follows that fe is holomorphic with respect
to each variable. Therefore, by Osgood’s Lemma 3.1.7, fe is holomorphic.
Theorem 3.1.16 (Second Riemann Extension Theorem). Let U be an open subset of
C n , with n ≥ 2. Let p ∈ U , and
f : U \ {p} −→ C
L U
B
q p
Exercises
3.1.18.
(1) Prove that the values ∆i (p) in the definition of a holomorphic function are well-determined,
by reducing it to a corresponding statement in the one variable case.
P ν
(2) Let f = ν aν x be a convergent power series. Describe all partial derivatives of f in
terms of the aν .
X n
Y 1
qν = .
ν∈Nn k=1
1 − qk
3.1.20. Consider the formal power series ring C [[x1 , . . . ,xn ]] and let m = (x1 , . . . ,xn ).
(1) Show that m is a maximal ideal.
(2) Let f,g ∈ C [[x1 , . . . ,xn ]]. Show that f = g if and only if f ≡ g mod mk for all k ∈ N.
(3) Show that in particular ∩∞ k
k=0 m = 0.
P
(4) Let g ∈ m, and ak ∈ C . Show that ∞ k
k=0 ak g is a well-defined element in C [[x1 , . . . ,xn ]].
(5) More generally, let elements g1 , . . . ,gs in m be given. Show that there is a well-defined
homomorphism of C –algebras
(6) Let ϕ : C [[y1 , . . . ,ys ]] −→ C[[x1 , . . . ,xn ]] be a C –algebra homomorphism. Prove that for
all f we have ϕ(f ) = f (ϕ(y1 ), . . . ,ϕ(ys )).
(7) Show that C [[x1 , . . . ,xn ]] is a local ring.
1
(Hint: Let g ∈ m. What is 1−g ?)
3.1.22.
(1) Let f ∈ C {x1 , . . . ,xn }, and g1 , . . . ,gn ∈ C {y1 , . . . ,ym } with gi (0) = 0 for i = 1, . . . ,n.
Show that f (g1 , . . . ,gm ) ∈ C {y1 , . . . ,ym } by using the seminorm kf kδ , see 3.1.5.
(2) Use Exercise 3.1.20 to prove that for a C –algebra homomorphism ϕ : C {y1 , . . . ,ys } −→
C{x1 , . . . ,xn } for all f ϕ(f ) = f (ϕ(y1 ), . . . ,ϕ(ys )) holds.
P
3.1.23. Let f = fk xkn , fk ∈ C [[x1 , . . . ,xn−1 ]], and f ∈ C {x1 , . . . ,xn }. Prove that fk ∈
C {x1 , . . . ,xn−1 }.
(Hint: Use 3.1.4 to show this for k = 0. Now use induction on k.)
3.1.25. Let U be an open connected subset of C n , and O(U ) the ring of holomorphic functions
on U . Prove that O(U ) is an integral domain.
3.1.27. Consider an ideal I ⊂ C [x1 , . . . ,xn ], and f ∈ C [x1 , . . . ,xn ]. Suppose that
f ∈ I · C {x1 , . . . ,xn }. Show that f ∈ I.
We now come to the famous Weierstraß Division Theorem, and its immediate corollary,
the Weierstraß Preparation Theorem. Before stating it, we need some definitions.
Definition 3.2.1.
(1) An element f ∈ C {x1 , . . . ,xn } is called regular of order b in xn if the power series
in the variable xn , defined by f (0, . . . ,0,xn ) has a zero of order b.
(2) An element
g = qf + r.
xbn = qf + r.
Now q is a unit, as otherwise in the expression qf + r there does not appear the term xbn ,
which would be a contradiction to the equality xbn = qf + r. So now we can put u = q −1
and p = xbn − r and get the equality f = up. It remains to show that p is a Weierstraß
polynomial. But if p were not a Weierstraß polynomial, one of the nonleading coefficients
of r would be a unit, and f would be regular of order smaller than b in xn . The fact that
u and p are uniquely determined follows from the corresponding fact in the Weierstraß
Division Theorem, see Exercise 3.2.16.
Remark 3.2.5. The Weierstraß Division and Preparation Theorem also hold in the
formal power series ring C [[x1 , . . . ,xn ]]. A proof for this case is much easier, as one does
not have to cope with convergence questions. A proof will be given in Exercise 3.2.18.
Corollary 3.2.6. Suppose that f ∈ C {x1 , . . . ,xn } is regular of order b. Then the
C {x1 , . . . ,xn−1 }–module C {x1 , . . . ,xn }/(f ) is finitely generated and free of rank b. A
basis is given by 1,xn , . . . ,xb−1
n .
The proof of this corollary is similar to the proof of 2.1.7 and therefore left as Exercise
3.2.17. It is in fact not so difficult to see that the Corollary is equivalent to the Weierstraß
Division Theorem. We can even do something better by invoking Nakayama’s Lemma.
Lemma 3.2.7. Let f ∈ C {x1 , . . . ,xn } be regular of order b. If C {x1 , . . . ,xn }/(f ) is a
finitely generated C {x1 , . . . ,xn−1 }–module, then the Weierstraß Division Theorem holds
for f .
Proof. Put M := C {x1 , . . . ,xn }/(f ). The maximal ideal ofC {x1 , . . . ,xn−1 } is the ideal
(x1 , . . . ,xn−1 ). Now M/(x1 , . . . ,xn−1 )M = C {xn }/ f (0,xn ) , hence is a finite-dimensional
C –vector space of dimension b. Indeed, a basis of this vector space is given by the classes
of 1,xn , . . . ,xb−1
n . By Corollary 1.3.6 (a corollary of Nakayama’s Lemma), it follows that
1,xn , . . . ,xb−1
n generate M as C {x1 , . . . ,xn−1 }–module. This exactly means that for all
g ∈ C {x1 , . . . ,xn } there exist q and r such that
g = qf + r
88 3 Basics of Analytic Geometry
f = ϕ(a0 ) · 1 + ϕ(a1 ) · t.
We still have to show that a0 and a1 are convergent power series. In this case this
follows easily from the convergence of f , by using 3.1.4.
Proof of the General Weierstraß Division Theorem 3.2.10.
Step 1. We first show that (1) implies (2). This is quite obvious. Any set of generators of S
as an R–module will, by taking classes, be a set of generators for S/mR S as R/mR = C –
module. Hence, S/mR S is a finite-dimensional vector space.
Step 2. We first prove the implication (2) =⇒ (1) for the case that R = C {x1 , . . . ,xn }, S =
C {y1 , . . . ,ym }. We put gi = ϕ(xi ) for i = 1, . . . ,n. The assumption then is that
The claim is that the monomials {y α : |α| < r} generate S as an R-module. This would
prove the finite generatedness of S as R–module in this case.
Step 2a. We first prove this “formally”. Let f ∈ S. We can write
X
(3.5) f = a(f ) + e
hα (f )y α .
α:|α|=r
such that e
hα (f )y α and e
hβ (f )y β have no common monomial if α 6= β. Moreover a(f ) is a
polynomial in the y ′ s of degree smaller than r. Plugging in (3.4) we get
n
X
(3.6) f = a(f ) + hi (f )gi ,
i=1
P
where hi (f ) = pα,i e
hα (f ).We now iterate (3.6). For a multi-index β = (β1 , . . . ,βn )
α:|α|=r
we define inductively
Note that the aβ (f ) are polynomials in the y1 , . . . ,ym of degree smaller than r. We claim
that for all k ∈ N
X X
(3.7) f= aβ (f )g β + hβ (f )g β .
|β|<k |β|=k
For k = 1 this is justP formula (3.6). We now apply (3.6) to the coefficients hβ (f ). We get
hβ (f ) = a(hβ (f )) + ni=1 hi (hβ (f ))gi . Plugging this in (3.7) and using the definition of
aβ (f ) and hβ (f ) gives the induction step.
As (g1 , . . . ,gn ) ∈ m, it follows that for all k ∈ N
X
f≡ aβ (f )g β mod mk .
β
Now for all α with |α| < r look at the formal power series
3.2 Weierstraß Division and Preparation Theorem 91
X
(3.9) Aα (f ) := aβ,α (f )xβ .
β
(1) kf kδ is finite.
(2) Let C be the number of monomials of degree r. Then for the pα,i of formula (3.4)
1
kpα,i kδ ≤ C.
We can find such a δ, because the elements pα,i are in the maximal ideal, see 3.1.5 (2).
We will show for all α with |α| < r
r
(3.10)
Aα (f )
< ∞ for ε = δ .
ε 2n2
This will show the convergence of Aα (f ). We have the following estimates for the terms
in formula (3.5) and (3.6).
(2) ke
hα (f )kδ ≤ δ −r kf kδ ,
(3) khi (f )kδ ≤ δ −r kf kδ .
The first two estimates follow from the definition of the seminorm, and the definition of
a(f ) and e
hα (f ). The final estimate follows because we have chosen δ so that kpα,i kδ ≤ C1 ,
C being the number of monomials of degree r. Inductively we get the estimates
Step 3. The implication (2) =⇒ (1) will be proved for the case that S = C {y1 , . . . ,ym }/J,
where J = (f1 , . . . ,fp ) is a finitely generated ideal. This would be always the case if we
knew that C {y1 , . . . ,ym } is Noetherian. Now, as the Weierstraß Division Theorem is
deduced from the General Weierstraß Division Theorem by using J = (f ), see Example
3.2.11 (1), and only the Weierstraß Division Theorem is used in the proof of the fact that
C {x1 , . . . ,xn } is Noetherian, see Lemma 3.2.7, this restriction is not essential.
Step 4. We now prove the general case, for J finitely generated.
Write R = C {x1 , . . . ,xn }/I,S = C {y1 , . . . ,ym }/J, where J = (f1 , . . . ,fp ). Define
new rings:
h = α1 v1 + . . . + αs vs αi ∈ ϕ′ (R′ ).
Reading this equation modulo (f1 , . . . ,fp ), we see that S is an R–module generated by
v1 , . . . ,vs , so, in particular, finitely generated.
Now we know (under the assumptions of the theorem) that S is a finitely generated
R–module, we can use Nakayama’s Lemma to get the following somewhat stronger form
of the theorem.
Corollary 3.2.12. Keep the notations of the previous Theorem 3.2.10. Suppose we
have elements v1 , . . . ,vs ∈ S whose classes generate S/mR S as a C –vector space. Then
v1 , . . . ,vs generate S as an R–module.
Corollary 3.2.13. Again keep the notation of Theorem 3.2.10.
Proof. The first statement follows from 3.2.12: indeed, it follows that 1 generates S as
an R–module, and thus in particular, ϕ is surjective. For the second statement, suppose
that f1 , . . . ,fs generate mS . Consider the map
ϕ : C {x1 , . . . ,xs } −→ S,
There is a “direct” proof of the second part of this corollary, using only the statement
of the Weierstraß Division Theorem. See Exercise 3.2.21.
Corollary 3.2.14. Let R,S be analytic C –algebras, ϕ : R −→ S a morphism of C –
algebras. Let M be a finitely generated S–module such that dimC M/mR M < ∞. Then
M is a finitely generated R–module.
Proof. As M is an S/ AnnS (M )–module, we may assume that AnnS (M ) = (0). Exercise
1.2.31 applied to M/mR M implies that dimC S/mR S < ∞. By the previous theorem S
is a finitely generated R–module and, therefore, M is a finitely generated R–module.
Exercises
3.2.15. Prove Lemma 3.2.2.
3.2.16. Prove that u and p in the Weierstraß Preparation Theorem are uniquely determined.
3.2.17. Prove that the Weierstraß Division Theorem is equivalent to the statement in Corollary
3.2.6.
3.2.18. Prove the Weierstraß Division Theorem for the formal power series ring.
(Hint: Consider a formal power series f ∈ C [[x1 , . . . ,xn ]] which is regular of order b in xn . Write
(3.13) f = f0 + xbn uf .
3.3 Applications
The first thing to note is that the Weierstraß Preparation Theorem implies the Implicit
Function Theorem!
Theorem 3.3.1 (Implicit Function Theorem). Let f ∈ C {x1 , . . . ,xn ,y} with f (0) = 0
and ∂f
∂y (0) 6= 0. Then there exists a unique ϕ ∈ (x1 , . . . ,xn )C {x1 , . . . ,xn } with
for some unit u ∈ C {x1 , . . . ,xn ,y} and some ϕ ∈ C {x1 , . . . ,xn }. The Implicit Function
Theorem follows from this equation.
Example 3.3.2. Consider f (x,y) = (y + 1)2 − x − 1 = y 2 + 2y −√x. Then f satisfies
the conditions of the Implicit Function Theorem. Hence y = −1 + x + 1 exists as an
analytic function.
From the uniqueness statements in the Weierstraß Preparation Theorem the follow-
ing follows immediately.
Corollary 3.3.3. Let f ∈ C {x1 , . . . ,xn } be regular of order b in xn , and suppose that
f = u · p, where u,p ∈ C [[x1 , . . . ,xn ]], u a unit and p a Weierstraß polynomial. Then u
and p are in C {x1 , . . . ,xn }, that is, they are convergent.
A similar statement holds for the Weierstraß Division Theorem.
Example 3.3.4. Consider f (x,y) = 2y + y 2 − x. Then f is regular of order one in y.
Hence, by the Weierstraß Preparation Theorem f (x,y) = u(y − ϕ(x)) for some unit u. We
can generate ϕ and u as indicated by a proof of the formal Weierstraß Division Theorem
(Exercise 3.2.18).
ϕ(x) = 12 (2y + y 2 − x) + 12 x − 21 y 2
= ( 12 − 41 y)(2y + y 2 − x) + 12 x − 14 xy + 14 y 3
= ( 21 − 41 y − 18 x + 81 y 2 )(2y + y 2 − x) + 12 x − 81 x2 − 81 y 4 + 41 xy 2
= ....
ϕ(x) = 12 x − 18 x2 + . . .
√
beginning of the power series of ϕ(x) = −1 +
which is the √ x + 1. Similarly, we obtain
u = y + 1 + x + 1.
The Implicit Function Theorem holds just as well for the formal case. From the
uniqueness statement in the Implicit Function Theorem the following fact follows.
3.3 Applications 95
Corollary 3.3.5. Let f ∈ C {x1 , . . . ,xn ,y} be regular of order 1 in y. Suppose ϕ(x) ∈
C [[x1 , . . . ,xn ]] is such that f (x1 , . . . ,xn ,ϕ(x)) = 0. Then ϕ(x) ∈ C {x1 , . . . ,xn }.
We might phrase this by saying that a formal solution to f (x1 , . . . ,xn ,y) = 0 (solving
for y), already is analytic.
By induction we get the Implicit Mapping Theorem from the Implicit Function
Theorem.
Theorem 3.3.6 (Implicit Mapping Theorem).
∂fi
Let f1 , . . . ,fm ∈ C {x1 , . . . ,xn , y1 , . . . ,ym } such that fi (0) = 0 and det ∂y j
(0) 6=
0. Then there exist uniquely defined ϕ1 , . . . ,ϕm ∈ (x1 , . . . ,xn )C {x1 , . . . ,xn } such that
fi (x1 , . . . ,xn ,ϕ1 , . . . ,ϕm } = 0 for i = 1, . . . ,m.
∂fi
Proof. First suppose that the m×m matrix ∂y j
(0) is the identity matrix. We will prove
the statement by induction on m. The case m = 1 is the Implicit Function Theorem.
For the induction step, first apply the Implicit Function Theorem to the element fm . We
conclude that there exists a unique holomorphic function ψ = ψ(x1 , . . . ,xn ,y1 , . . . ,ym−1 )
with fm (x1 , . . . ,xn ,y1 , . . . ,ym−1 ,ψ) = 0. It is left as an exercise to show that the functions
is an isomorphism. This again holds if and only if there exist open neighborhoods U,W
of 0 such that
F = (f1 , . . . ,fn )
defines a map F : U −→ W and this map has a holomorphic inverse.
The Inverse Function Theorem is the basis to introduce coordinate functions and
complex submanifolds of C n .
Definition 3.3.8.
96 3 Basics of Analytic Geometry
(1) Let f1 , . . . ,fn be holomorphic on an open subset U . Let p ∈ U , and suppose f1 (p) =
··· =
fn (p)= 0. The set {f1 , . . . ,fn } is called a set of coordinate functions at p if
∂f
det ∂xi (p) 6= 0.
and one checks that these are part of a set of coordinate functions at every point of X.
We will see later in Chapter 4 that for an analytic subset of C n almost all points have
a neighborhood which is a zero set of coordinate functions. More precisely, an analytic
subset contains as an open set a complex submanifold. This gives rise to the following
definition.
Definition 3.3.11. Let U ⊂ C n be an open set and X ⊂ U an analytic subset. A point
x ∈ X is called regular, or X is called smooth at x, if there exists an open subset V in
C n , x ∈ V , such that X ∩ V is a complex submanifold of C n . If x ∈ X is not regular,
then x is called singular, or a singularity. The set of singular points of X is denoted by
Sing(X).
Examples 3.3.12.
ak+1
y =0
ak
f (ak )
(3.14) ak+1 = ak − .
f ′ (ak )
Starting off with an initial approximation a0 we get a sequence a1 ,a2 , . . ., which, under
appropriate assumptions, converges to a number a with f (a) = 0.
We apply similar ideas to the equation f (x,y) = y 2 − 1 − x. We want to find √ a power
series, y = ϕ(x) with ϕ(x)2 = x + 1, that is, we want to find a power series for x + 1.
We start of with the initial approximation ϕ0 (x) = 1, and iterate with the obvious analog
of (3.14):
f x,ϕk (x)
(3.15) ϕk+1 (x) = ϕk (x) − ∂f .
∂y x,ϕk (x)
y = y(x) + t · α
The existence will be guaranteed by applying the Implicit Function Theorem. Indeed
Taylor’s formula gives
∂f
g(x,t) := f x,y(x) + tα = f x,y(x) + tα x,y(x) + α2 R = f x,y(x) + tα2 + α2 R
∂y
(1) Analogous ideas also give a new proof for the Hilbert Basis Theorem for the poly-
nomial ring C [x1 , . . . ,xn ].
(2) An alternative proof for the induction in the proof of Corollary 3.3.14 uses the
Hilbert Basis Theorem. It runs as follows. Let I 6= (0) be an ideal, and 0 6=
f ∈ I. Again, we may assume that C {x1 , . . . ,xn }/(f ) is a finitely generated
C {x1 , . . . ,xn−1 }–module. We may also assume that f is a Weierstraß polynomial
in xn . Hence C {x1 , . . . ,xn }/(f ) = C {x1 , . . . ,xn−1 }[xn ]/(f ). It follows from the
Weierstraß Division Theorem that
By induction and the Hilbert Basis Theorem, the ring C {x1 , . . . ,xn−1 }[xn ] is Noethe-
rian. Hence I is finitely generated as C {x1 , . . . ,xn−1 }[xn ]–ideal, and in particular
finitely generated as C {x1 , . . . ,xn−1 ,xn }–ideal.
The next thing we want to show is that the power series ring is a unique factorization
domain. For the proof we need the following fact, which is interesting in its own right.
3.3 Applications 99
and such that C {x1 , . . . ,xn }/I is a finitely generated C {x1 , . . . ,xk }–module. If we have
such a situation we call
As a composition of finite rings extension is finite, see 1.5.2, we deduce that the ring
extension C {x1 , . . . ,xk } ⊂ C {x1 , . . . ,xn }/I is finitely generated. This proves the corol-
lary.
Due to the fact that for general coordinate changes we get a Noether normalization,
we can get the following situation, which will play an important role in the proof of the
Nullstellensatz.
Corollary 3.3.20. Suppose that p ⊂ C {x1 , . . . ,xn } is a prime ideal. Then we can get a
Noether normalization
C {x1 , . . . ,xk } ⊂ C {x1 , . . . ,xn }/p,
with the property that the class of xk+1 in Q(C {x1 , . . . ,xn }/p) is a primitive element of
the field extension
Q(C {x1 , . . . ,xk }) ⊂ Q(C {x1 , . . . ,xn }/p).
We call a Noether normalization with the above property a primitive Noether normaliza-
tion.
9 More precisely, there exists a Zariski open subset U of Gln (C ) such that the statement holds for all
coordinate transformations in U .
3.3 Applications 101
Proof. We know from the proof of the Theorem of the Primitive Element, see for example
[Van der Waerden 1971], that the class of ck+1 xk+1 + · · · + cn xn in Q(C {x1 , . . . ,xn }/p)
is a primitive element for the field extension Q(C {x1 , . . . ,xn }/p) ⊃ Q(C {x1 , . . . ,xk }) for
general ck+1 , . . . ,cn ∈ C . Doing the coordinate change x′k+1 = ck+1 xk+1 + · · · + cn xn the
corollary follows.
Corollary 3.3.21 (Hensel’s Lemma). Let F ∈ C {x1 , . . . ,xn }[T ] and suppose that F ≡
(T − c1 )s1 · · · (T − ce )se mod (x1 , . . . ,xn ), where the ci ∈ C are pairwise different.
Then F = F1 · · · Fe , Fi ∈ C {x1 , . . . ,xn }[T ], Fi ≡ (T − ci )si mod (x1 , . . . ,xn ), and
the Fi have degree si in T .
Proof. We use induction on e. The case e = 1 is trivial. We may assume that c1 = 0
(otherwise we perform the transformation T 7→ T + c1 ). Then F (0,T ) = T s1 · H(T ) and
H(0) 6= 0. Now we use the Weierstraß Preparation Theorem (3.2.4) and obtain F = E·F1 ,
where F1 is a Weierstraß polynomial of degree s1 with respect to T , and E is a unit in
C {x1 , . . . ,xn ,T }.
On the other hand, E also has to be a polynomial in T : divide F by F1 with remainder
in C {x1 , . . . ,xn }[T ], F = E1 F1 + R1 with E1 ,R1 ∈ C {x1 , . . . ,xn }[T ] and degT R1 < s1 .
By the uniqueness statements in the Weierstraß Division Theorem, we obtain R1 = 0
and E = E1 . Now F1 (0,T ) = T s1 and, consequently, E(0,T ) = (T − c2 )s2 · · · (T − ce )se .
The corollary is proved by induction hypothesis.
Consider an open set U ⊂ C n−1 on which the ai converge, and a point p ∈ U such that
the polynomial
102 3 Basics of Analytic Geometry
Proof. This is in fact Hensel’s Lemma for the case that all si are equal to one.
The following corollary we need in the proof of the Finite Mapping Theorem 6.3.5,
Corollary 3.3.24. Consider a monic polynomial F ∈ C {x1 , . . . ,xn }[T ].PSuppose that
F (0,T ) = (T − c1 )s1 · · · (T − ce )se for pairwise different ci ∈ C . Let s = ei=1 si be the
degree of F . Let pi := (0, . . . ,0,ci ) ∈ C n+1 . Then for all g1 , . . . ,ge with gi ∈ On+1,pi there
exist uniquely determined qi ∈ On+1,pi and a polynomial r ∈ C {x1 , . . . ,xn }[T ] of degree
less than s such that
gi = qi F + r
holds in On+1,pi = C {x1 , . . . ,xn ,T − ci } for i = 1, . . . ,e.
Proof. We first show existence. By Hensel’s Lemma, we may write F = F1 · · · Fe , with
ci )si , that is, the Fi are Weierstraß polynomials with respect to T − ci .
Fi (0,T ) = (T −Q
We write Ei = j6=i Fj , so that F = Ei · Fi . Note that Ei is a unit in On+1,pi . We apply
the Weierstraß Division Theorem in On+1,pi for i = 1, . . . ,e:
Ei−1 gi = qi′ Fi + ri
P
Hence we may take qi = qi′ − j6=i rj Fj−1 .
To show uniqueness, it suffices to show that from qi F − r = 0 for i = 1, . . . ,e with
degree of r in T smaller than s, it follows that r = 0. Thus we may write r = F1 · · · Fe qi
with qi ∈ On+1,pi . We define elements
r r r
P1 := , P2 := , . . . , Pe :=
F1 F1 F2 F1 F2 · · · Fe
Note that as r = F1 · · · Fe qi the Pi are elements of On+1,pi . Note that
Ψ : Ok [y1 , . . . ,yt ] −→ S
yi 7→ ui
and conclude that S ∼ = Ok [y1 , . . . ,yt ]/J for J = Ker(Ψ). But we want to show that
S is a quotient of a power series ring. Therefore, we look at the canonical inclusion
Ok [y1 , . . . ,yt ] ⊂ Ok+t , and put Je := J · Ok+t . We obtain a canonical map:
e
θ : Ok [y1 , . . . ,yt ]/J → Ok+t /J.
p = g1 p1 + . . . + gs ps , gi ∈ Ok+t .
The gi are power series in general, so it does not make sense to talk about ψ(gi ). We
therefore calculate modulo high powers of the yi , that is, for every natural number k we
write gi = gik + hik , where hik ∈ (y1 , . . . ,yt )k+1 and the gik are polynomials. Because
Ψ(yi ) = ui ∈ mS it follows that modulo mk+1 S :
But Ψ(pi ) = 0, because the pi were supposed to be generators of J. It follows Ψ(p) ∈ mk+1
S
for all k. It follows from Krull’s Intersection Theorem 1.3.5 that Ψ(p) = 0, or, what is
the same, p ∈ J.
At the end of the section we look at generalization of this result, namely at the case
S is not necessarily local.
Corollary 3.3.26. Let R be an analytic algebra, and S be an R–algebra, which is finitely
generated as R–module. Then
104 3 Basics of Analytic Geometry
r
M
S∼
= Si
i=1
and the Si are analytic algebras. So S is a finite direct sum of analytic algebras.
The proof uses the case that S is a local ring, see 3.3.25 and the following lemma.
Lemma 3.3.27. Let R = On , and S ⊃ R be an R–algebra, finitely generated as R–
module. Let m be the maximal ideal of R. Let y ∈ S be an idempotent modulo mS, that
is, y 2 − y ∈ mS. Then y can be lifted to an idempotent in S, that is, there exists a z ∈ S
with z 2 = z and z ≡ y mod mS.
√
Proof. As S is a finitely generated R–module, and y 2 − y ∈ mS ⊂ mS it follows from
Lemma 1.5.11 that y 2 − y is integral over m. From this integral equation it follows that
m
X
(y 2 − y)r = ck y k , for some ck ∈ m.
k=0
Define
m
X
F (T ) := (T 2 − T )r − ck T k ∈ R[T ].
k=0
r r
So F (T ) ≡ T (T − 1) mod m. We can now apply Hensel’s Lemma 3.3.21 and get
F = G · H, G(T ) ≡ T r mod m and H(T ) ≡ (T − 1)r mod m. In particular, from y 2 ≡ y
mod mS, we obtain G(y) ≡ y r ≡ y mod mS and H(y) ≡ (y − 1)r ≡ (−1)r (1 − y)
mod mS. Hence
The number r is bigger than one, as S/mS is not a local ring. In particular S/mS contains
an idempotent different from 1. (For example (1,0, . . . ,0) in ⊕ri=1 Ci ).
This idempotent can be lifted by Lemma 3.3.27 to an idempotent y ∈ S, y 6= 1. We
apply the Chinese Remainder Theorem 1.1.12 and get
S∼
= S/(y) × S/(1 − y).
Exercises
√
3.3.28. Compute the power series of y = −1 + x + 1 using the ideas of Example 3.3.4.
3.3.30 (General Noether Normalization). Let I ⊂ C {x1 , . . . ,xn } be an ideal. Then, after a
general linear coordinate change, we find that there exists a k such that
and for all j > k there are monic polynomials fj ∈ I ∩ C {x1 , . . . ,xj−1 }[xj ] (with respect to xj )
with degxj (fj ) = ord(fj ) (considered as a power series in C {x1 , . . . ,xj }).
If, moreover, I is a prime ideal then Q(C {x1 , . . . ,xn }/I) = Q(C {x1 , . . . ,xk })[xk+1 ]/(fk+1 ).
(Hint: Assume in the proof of 3.3.19 that the order of g ∈ I is b and apply 3.2.2 and the
Weierstraß Preparation Theorem 3.2.4. In case of I being a prime ideal, combine this idea with
the proof of 3.3.20.)
3.3.31. Consider a general Noether normalization Ok ⊂ OX,x = On /p in the sense of 3.3.30. Let
f be a power series of order t. Show that modulo p the element f is equivalent to a polynomial
in xk+1 , . . . ,xn which has order at least t.
(Hint: Reduce to the case that p = (Pk+1 , . . . ,Pn ), where the Pj are Weierstraß polynomials
in xj with coefficients in C {x1 , . . . ,xk }. For the case p = (Pn ), use Exercise 3.2.19, and apply
induction to the coefficients.)
3.3.32. Consider a prime ideal p ∈ C {x1 , . . . ,xn }, and a Noether normalization C {x1 , . . . ,xk } ⊂
C {x1 , . . . ,xn }/p. Let Pj (X) be an integral equation of xj for j = k + 1, . . . ,n of minimal degree.
Prove that the Pj are Weierstraß polynomials.
(Hint: Use the Weierstraß Preparation Theorem.)
Generalize this to the case of m variables y1 , . . . ,ym . (Hint: Follow the steps in a proof of the
formal Implicit Function Theorem.)
106 3 Basics of Analytic Geometry
fi x,y(x) ≡ 0 mod J y(x) (x1 , . . . ,xn ) for i = 1, . . . ,m, y(x) ∈ C {x1 , . . . ,xn } and c ≥ 1.
Then there exists a y ∈ C {x1 , . . . ,xn }N such that
` ´
fi x,y(x) = 0,` i =´ 1, . . . ,m
y ≡ y mod J y(x) (x1 , . . . ,xn )c .
c
where the rij is a column vector of size m with entries in (x1 , . . . ,xn ) and hij is a column
vector of size m with entries in (t11 , . . . ,tNs )2 .
(2) Consider an m × N matrix A with entries in a ring R. Let M be an m–minor of A. Show
that there exists a N × m matrix B such that AB = M Idm , where Idm is the m × m
identity matrix.
(Hint: If m = N , one can take for B the adjoint of A, see 1.2.9.)
„ «
∂fi
(3) Use this to replace Mi Mj in (3.16) by Mj (y) Bi for certain matrices Bi .
∂yj
(4) Show that for all j with 1 ≤ j ≤ s it suffices to solve for tij the equation
0 1
s
t1j
X B C
Bi (rij + hij ) + @ ... A = 0.
i=1
tNj
In this section we study germs of analytic spaces. We already know what an analytic
space is, see 3.1.12, but we still have to define the notion of a germ.
Definition 3.4.1.
(1) Let X be a topological space and p ∈ X. Subsets A and B of X are called equivalent
at p if there exists a neighborhood U of p such that A∩U = B∩U . It is easy to check
that this is indeed an equivalence relation, see Exercise 3.4.33. The equivalence class
of A at p we call the germ of A at p. A is called a representative of the germ. We
write (A,p) for the germ of A at p.
(2) Let (X,x) and (Y,x) be germs. We define (X,x) ⊂ (Y,x) if there are representatives
X of (X,x) and Y of (Y,x) such that X ⊂ Y . It is not so difficult to see that
(X,x) = (Y,x) if and only if (X,x) ⊂ (Y,x) and (Y,x) ⊂ (X,x).
(3) Let (X,x) ⊂ (Y,x) and (Z,x) ⊂ (Y,x) be germs. Then we define (X,x) ∩ (Z,x) to
be the germ of X ∩ Z at x for any representative X of (X,x), and Z of (Z,x) at x.
We define (X,x) ∪ (Z,x) to be the germ of X ∪ Z at x for any representative X of
(X,x), and Z of (Z,x) at x.
We now define germs of analytic spaces.
Definition 3.4.2.
Definition 3.4.6. Let (X,p) and (Y,q) be two germs of topological spaces. Agerm of
a continuous map f : (X,p) −→ (Y,q) is defined as an equivalence class of maps f :
U −→ W , with f (p) = q, and where U and W are representatives of (X,p) and (Y,p)
respectively. Two such maps f1 : U1 −→ W and f2 : U1 −→ W are called equivalent if
they agree on an open neighborhood of p contained in U1 ∩ U2 .
The problem with germs of continuous maps is, that it makes in general no sense to
talk about the image. By this we mean that in general it might happen that for every
subgerm (Z,q) ⊂ (Y,q), there does not exist an open neighborhood U of p in X such that
(Z,q) is represented by f (U ). For an example, see Exercise 3.4.34. In particular, it does
not make sense to talk about surjectivity. For special kind of maps, so-called finite maps,
this problem does not occur. Let us define this notion.
Definition 3.4.7. Let f : X → Y be a continuous map between topological spaces.
(1) f is called closed if the image f (A) ⊂ Y is closed for all closed subspaces A ⊂ X.
(2) f is called quasi-finite if for all p ∈ Y the fiber f −1 (p) consists of a finite number
of points.
(3) f is called finite if it is both closed and quasi-finite.
The simplest example of a mapping which is quasi-finite but not finite probably is
the inclusion mapping C \ {0} → C . (Here C has the standard Euclidean topology.)
Lemma 3.4.8. Let X and Y be topological spaces, and f : X −→ Y be closed. Let
p ∈ X and let q = f (p). Assume that f −1 f (p) = {p}. Let A,B ⊂ X with p ∈ A, p ∈ B,
defining the same germ at p. Then (f (A),q) = (f (B),q), that is, f (A) and f (B) define
the same germ in q.
V ∩ f (A) = V ∩ f (B).
Step 1. We first show that there exists a neighborhood basis of p in X such that for
each element U of this basis we have the equality U = f −1 (f (U )). So let W be an open
neighborhood of p. Then X \ W is closed, so that f (X \ W ) is closed by assumption.
Then T := Y \ f (X \ W ) is open. Define U = f −1 (T ). Then U is open and we have the
following properties:
(1) As U = f −1 (T ), it follows f (U ) = f ◦ f −1 (T ) ⊂ T . Hence f −1 f (U ) ⊂ f −1 (T ) =
U . As the converse inclusion is trivial, it follows that f −1 f (U ) = U .
(2) U ⊂ W , as, otherwise, there exists an x ∈ U , x 6∈ W . By definition of T , it
immediately follows from x 6∈ W that f (x) ∈ f (X \ W ), so that f (x) 6∈ T . But we
have f (U ) ⊂ T , so this is a contradiction.
(3) We have p ∈ U . To show this, note that as p ∈ W and f −1 (f (p)) = {p} it follows
f (p) 6∈ f (X \ W ) and then f (p) ∈ T = Y \ f (X \ W ). Thus p ∈ f −1 (T ) = U .
110 3 Basics of Analytic Geometry
(1) Then for every arbitrary small neighborhood V of 0 in C there exists a small neigh-
borhood U of 0 in C k such that the power series a1 , . . . ,ar converge on U and such
that for every p in U the r zeros (counted with multiplicity) of
lie in V .
(2) Let π : C k+1 −→ C k be the projection on the first k coordinates. Then π :
V (P ), 0 −→ (C k , 0) is surjective.
U
Typical picture of the zero set of of a Weierstraß polynomial.
112 3 Basics of Analytic Geometry
Proof.
(1) The proof runs as the proof of the previous lemma. Suppose the converse. Then there
exists a small neighborhood V of 0 in C such that for all neighborhoods U of 0 in C k on
which the power series a1 , . . . ,ar converge, there exists a point p ∈ U and a root of the
polynomial P (p,y) = 0, which is not in V . Take a sequence Ui of open neighborhoods of
0 whose intersection is {0}, and points pi ∈ Ui such that there is a root yi of P (pi ,yi ) = 0
which does not lie in V . So there exist a C > 0, infinitely many yi with |yi | ≥ C and
infinitely many pi ∈ C k such that {pi } converges to 0 and P (pi ,yi ) = 0. But then it
follows as in the proof of 3.4.10 that
1 1
|yi | ≤ C r−1 |ar (pi )| + C r−2 |ar−1 (pi )| + . . . + |a1 (pi )| .
(1) Alternatively, Lemma 3.4.11 could have been proved by using Rouché’s Theorem
R ′
of function theory. Even simpler, one could use the integral ff , which counts
the number of zeroes of a holomorphic function in one variable and a continuity
argument. The lemma in fact is the starting point for an alternative proof of the
Weierstraß Division and Preparation Theorem, see the book of Gunning and Rossi,
[Gunning-Rossi 1965].
(2) We will use the Nullstellensatz to generalize 3.4.11 (3). In fact we will prove that,
if X is a small representative of V (p),0 for a prime ideal p ⊂ C {x1 , . . . ,xn }, and
h ∈ C {x1 , . . . ,xn }, h 6∈ I (X, 0), then X \ {h = 0} = X.
(a) there exist qk+2 , . . . ,qn ∈ C {x1 , . . . ,xk }[xk+1 ] such that Qj := ∆xj − qj ∈ p
for j = k + 2, . . . ,n;
(b) there exist f1 , . . . ,fs ∈ C {x1 , . . . ,xk }[xk+1 , . . . ,xn ] such that
114 3 Basics of Analytic Geometry
(3) There is an open neighborhood U of 0 in C k such that P,∆,Qk+2 , . . . ,Qn ,f1 , . . . ,fs
converge on U × C n−k and for
Before giving the proof of the Local Parametrization Theorem we give an example.
In this example all the power series are in fact polynomials. The neighborhood U can be
taken to be the whole of the complex vector space.
Example 3.4.15. Consider the map C −→ C 3 given by t 7→ (t3 ,t4 ,t5 ) = (x,y,z). Its
image can be calculated, see 2.3.18, to be the zero set of the ideal I = (y 2 − xz,x3 −
yz,z 2 − x2 y). A Noether normalization map is given by the projection on the x–axis.
Indeed modulo I one has y 3 = xyz = x4 , and z 3 = x2 yz = x5 . This gives the integral
equation z 3 − x5 = 0, and y 3 − x4 = 0.12 To show that this is a Noether normalization
one has to show that I ∩ C {x} is zero. For this, see Exercise 3.1.28. We in fact have
a primitive Noether normalization. Here y plays the role of xk+1 . This follows because
2
in the quotient field z = yx . The polynomial P is equal to y 3 − x4 . The discriminant
is ∆ = 27x8 . So the Local Parametrization Theorem says that x8 z is a polynomial in
y with coefficients in C {x}. Indeed, even xz = y 2 . So in this case the polynomial q is
equal to 27x7 y 2 . The space X ′ is given by y 3 − x4 = 0. For each a 6= 0, and each y = b
2
with b3 − a4 , there is a unique point on X, with z–coordinate z = ba . Thus the map π
in the Local Parametrization Theorem is indeed surjective. In fact, it is easily seen that
for x 6= 0, the analytic set is a complex submanifold.
n` qk+2 (p) ´ o
11 The equation implies that X \ (p ◦ π)−1 (D) = p, ∆(p)
, . . . , q∆(p)
n (p)
: p ∈ U × C , P (p) = 0 . This
(together with 3.3.23) is the reason to call 3.4.14 Local Parametrization Theorem.
12 A different way to see this is by applying the general Weierstraß Division Theorem 3.2.10. Consider
the C {x}–module C {x,y,z}/I. Divide out the maximal ideal (x) of C {x}. We get the C –vector space
C {y,z}/(y 2 ,yz,z 2 ), which is finite-dimensional. A basis is given by the classes of 1,y,z.
3.4 Germs of Analytic Spaces 115
Step 1. We first show (1) and (2a). We apply the Finiteness of Normalization Theo-
rem 1.5.19. We obtain that p ∩ C {x1 , . . . ,xk }[xk+1 ] = (P ), where P is the minimal
polynomial of the element xk+1 mod p in the field extension Q(C {x1 , . . . ,xn }/p) ⊃
Q(C {x1 , . . . ,xk }). By Exercise 3.3.32, P is a Weierstraß polynomial. Using the Weier-
straß Division Theorem 3.2.3, we obtain p ∩ C {x1 , . . . ,xk+1 } = (P ), as, otherwise, we
would find a nonzero element in p which is a polynomial in xk+1 of degree smaller than
the degree of P . This is in contradiction to the fact that P is the minimal polynomial
of the field extension. This proves (1). To prove (2a) we apply again 1.5.19. So for all
f ∈ On there exists qf ∈ C {x1 , . . . ,xk }[xk+1 ] with ∆ · f − qf ∈ p. In particular, this holds
for xk+2 , . . . ,xn . Therefore, there exist polynomials qk+2 , . . . ,qn ∈ C {x1 , . . . ,xk }[xk+1 ]
such that Qj := ∆xj − qj ∈ p for j = k + 2, . . . ,n. This shows (2a).
Step 2. We now prove (2b). Consider the minimal polynomials Pj ∈ C {x1 , . . . ,xk }[xj ] of
xj for j = k + 2, . . . ,n. Thus Pk+1 := P .
Step 2a. We first show that p has a system of generators in C {x1 , . . . ,xk }[xk+1 , . . . ,xn ].
Indeed, it follows that the extension C {x1 , . . . ,xk } ⊂ C {x1 , . . . ,xn }/(Pk+1 , . . . ,Pn ) is
finite. We put P i := Pi (0, . . . ,0,xk+1 , . . . ,xn ). It then easily follows that the vector space
C {xk+1 , . . . ,xn }/(P k+1 , . . . ,P n ) is generated by monomials in xk+1 , . . . ,xn . It follows
from Corollary 3.2.12 that every element in C {x1 , . . . ,xn } is modulo (Pk+1 , . . . ,Pn ) equiv-
alent to an element in C {x1 , . . . ,xk }[xk+1 , . . . ,xn ]. This then certainly holds modulo p.
Hence we can choose generators f1 , . . . ,fs of p such that
• f1 = Pk+1 = P, . . . ,fn−k = Pn ;
• fn−k+1 , . . . ,fs ∈ C {x1 , . . . ,xk }[xk+1 , . . . ,xn ];
• Qk+2 , . . . ,Qn ∈ {f1 , . . . ,fs }.
(3.17) C {x1 , . . . ,xk }∆ [xk+1 , . . . ,xn ]/J∆ ։ C {x1 , . . . ,xk }∆ [xk+1 , . . . ,xn ]/I∆ .
q (x ,...,x )
The equation Qj = 0 says that xj = j 1 ∆ k+1 , so we can eliminate the variables
xk+2 , . . . ,xn in C {x1 , . . . ,xk }∆ [xk+1 , . . . ,xn ]/J∆ . After having done that, formula (3.17)
gives a surjection
Step 3. The proof of (3a) now is easy. Let f1 , . . . ,fs ∈ C {x1 , . . . ,xk }[xk+1 , . . . ,xn ] be the
generators of p we constructed in Step 2. By (2b) we have for some b = b(i):
n
X
(3.18) ∆b fi = αj (fi )Qj + α(fi )P
j=k+2
for certain αj (fi ),α(fi ) ∈ C {x1 , . . . ,xk }[xk+1 , . . . ,xn ]. Choose U ⊂ C k an open neigh-
borhood of 0 such that P,∆,Qk+2 , . . . ,Qn , α(fi ),αj (fi ) converge on U × C n−k for all
i,j. Let p ∈ Y with ∆(p) 6= 0 be given. Then from (3.18) and P (p) = Qj (p) = 0 for
j = k + 2, . . . ,n, it follows that fi (p) = 0 for i = 1, . . . ,s. This exactly means that
Y \ (p ◦ π)−1 (D) = X \ (p ◦ π)−1 (D).
Step 4. We now prove (3b) and (3c). Consider a point a ∈ X ′ \ p−1 (D). From 3.3.23 X ′
is in a small neighborhood U ′ of a a complex submanifold of U ′ × W , given by the graph
of a function xk+1 = α(x1 , . . . ,xn ). By definition of Qj = ∆xj − qj . Therefore, there is a
unique point of X \ (p ◦ π)−1 (D) mapping to a. If the coordinates of a are (a1 , . . . ,ak+1 )
the unique point in π −1 (p) has coordinates
qk+2 (a1 , . . . ,ak+1 ) qn (a1 , . . . ,ak+1 )
a1 , . . . ,ak ,ak+1 , ,..., .
∆(a1 , . . . ,ak ) ∆(a1 , . . . ,ak )
So we have an inverse to the map π : X \ (p ◦ π)−1 (D) −→ X ′ \ p−1 (D), which we see to
be biholomorphic. Indeed, locally in U ′ × W × C n−k−1 , the set Y \ (p ◦ π)−1 (D) is given
by the set of equations
qk+2 (x1 , . . . ,xk+1 ) qn (x1 , . . . ,xk+1 )
xk+1 − α(x1 , . . . ,xk ) = xk+2 − = · · · = xn − = 0,
∆(x1 , . . . ,xk ) ∆(x1 , . . . ,xk )
and is therefore a complex submanifold.
Step 5. We prove (3d), (4) and (5). By Lemma 3.4.13 the map p ◦ π : X −→ U is finite.
The map p : X ′ \ p−1 (D) −→ U \ D is, obviously, surjective and because of (3b) the map
p ◦ π : X \ (p ◦ π)−1 (D) −→ U \ D is surjective. As p ◦ π is closed, the closure of U \ D is
in p ◦ π(X). Using Theorem 3.1.14 we obtain that U \ D = U and, therefore, π(X) = U .
Now (4) and (5) are immediate consequences (cf. 3.4.9 and 3.4.13).
Remarks 3.4.16. Let p be a prime ideal, and (X, 0) = V (p), 0 .
(1) Note that in the proof of the Local Parametrization Theorem, we showed that the
number of points in the general fiber of a Noether normalization π : X −→ U is
equal to the degree of P , that is, the degree of the field extension
Q(Ok ) ⊂ Q(OX, 0 ).
In fact, this holds generally for “finite maps between irreducible germs of analytic
spaces”, as will be shown in the exercises.
(2) We showed that locally, at a point a ∈ X \ (p ◦ π)−1 (D), the ideal I (X,a) ⊂
On,a is generated by Qk+2 , . . . ,Qn ,P . Moreover, On,a /I (X,a) is isomorphic to
Ok . It follows that if (f1 , . . . ,fs ) = I (X, 0), the ideal of (X, 0), then there exists a
neighborhood U of 0 in X such that the fi converge on U and for all a ∈ U ∩(X \D)
the ideal I (X,a) is generated by f1 , . . . ,fs .13
13 In fact, this holds for all a ∈ U . This is the Coherence Theorem of Oka-Cartan to be proved in Chapter
6.
3.4 Germs of Analytic Spaces 117
(3) In the notation of the Local Parametrization Theorem, write OX ′ ,0 = Ok+1 /(P ).
The cokernel K := OX,0 /OX ′ ,0 is an Ok –module. It was shown in the proof of the
Local Parametrization Theorem, that there exists an a ∈ N such that ∆a · K = (0).
In particular, we can view K as an Ok /(∆a )–module.
Definition 3.4.17. Let (X,x) be a germ of an analytic space. Then (X,x) is called
irreducible if from (X,x) = (X1 ,x) ∪ (X2 ,x), with (X1 ,x) and (X2 ,x) germs of analytic
spaces it follows that either (X,x) = (X1 ,x) or (X,x) = (X2 ,x).
Corollary 3.4.18. Let (X,x) be a germ of an analytic space.
(1) (X,x) is irreducible if and only if I (X,x) is a prime ideal.
(2) Let (X,x) ⊂ (C n ,x) be irreducible, and h ∈ On , h 6∈ I (X,x). Then for small
enough representatives X \ {h = 0} = X.
(3) There is a, up to permutation, unique decomposition (X,x) = (X1 ,x) ∪ · · · ∪ (Xr ,x),
with (Xi ,x) irreducible and (Xj ,x) 6⊂ (Xi ,x) for i 6= j. This is called the irreducible
decomposition of (X,x). The (Xi ,x) are called irreducible components of (X,x).
Proof. The proof runs as in the algebraic case, see 2.2.15, 2.2.16 and 2.2.17. But note
that in the proof the Nullstellensatz is used!
Definition 3.4.19. Let (X,x) ⊂ (C n ,x) be a germ of an analytic space. A germ of
an analytic function f : (X,x) −→ (C ,y) is a germ of a map f : (X,x) −→ (C ,y)
such that some representative is the restriction to X of an analytic function on an open
neighborhood of x in C n . As is easily shown, the germs of analytic functions on (X,x)
form a C –algebra, which we denote by OX,x . This is called the ring of analytic functions
on (X,x). Sometimes OC n , 0 , which is equal to C {x1 , . . . ,xn } is denoted by On . A germ
of an analytic map or simply map, ϕ = (f1 , . . . ,fm ) : (X,x) −→ (Y,y ⊂ (C m ,y) is
similarly defined. The map ϕ : (X,x) −→ (Y,y) is called an isomorphism if ϕ has a two-
sided inverse ψ : (Y,y) −→ (X,x) which is an analytic map. (X,x) and (Y,y) are called
isomorphic if there exists an isomorphism ϕ : (X,x) −→ (Y,y).
The following lemma is easy, and left to the reader.
Lemma 3.4.20.
(1) Let (X,x) ⊂ (C n ,x) be a germ of an analytic space, and I (X,x) be the ideal of X.
Then OX,x = On,x /I (X,x).
(2) Let (X,x) ⊂ (C n ,x) be a germ of a submanifold. Then OX,x ∼
= C {x1 , . . . ,xk } for
some k.
(3) Let ϕ : (X,x) −→ (Y,y) be a germ of an analytic mapping. Then by composition ϕ
induces a map of C –algebras
ϕ∗ : OY,y −→ OX,x : f 7→ f ◦ ϕ.
Theorem 3.4.21. Let (X,x) ⊂ (C n ,x) and (Y,y) ⊂ (C m ,y) be germs of analytic spaces.
Let α : OY,y −→ OX,x be a C –algebra homomorphism. Then there exists a unique germ
of an analytic mapping ϕ : (X,x) −→ (Y,y) with ϕ∗ = α.
Proof. . Without loss of generality, we may assume that x = 0 and y = 0. In 3.2.9 it was
proved that α(mY,0 ) ⊂ mX,0 . As α is a ring homomorphism, we get
α(mkY,0 ) ⊂ mkX,0
for all k > 0. Take generators w1 , . . . ,wm of the maximal ideal of Om . We have a surjective
map Om −→ OY,0 = Om /I (Y, 0), sending wi to w i . Let
f1 = α(w 1 ), . . . ,fm = α(w m ).
So the fi are elements of OX,0 . Define
ϕ = (f1 , . . . ,fm ) : (X, 0) −→ (C m , 0).
Hence we get a map ϕ∗ : Om −→ OX,0 . Moreover consider the composition α
e : Om −→
α
OY,0 −→ OX,0 . We want to prove the following:
(1) ϕ∗ = α
e.
(2) ϕ (X, 0) ⊂ (Y, 0).
(1) By construction ϕ∗ (wi ) = α e(wi ). As they are both C –algebra homomorphisms, it
follows that ϕ∗ (g) = α
e(g) for all polynomials g ∈ C [w1 , . . . ,wm ]. But, we want to show
that ϕ∗ (g) = α
e(g) for all g ∈ Om . Given such a g, we can, for all k ∈ N write
g = gk + gk′ ,
where gk is a polynomial of degree smaller than k and gk′ ∈ mk . (Here m is the maximal
ideal of Om .) Now
ϕ∗ (gk ) = α
e(gk ).
As gk is a polynomial, and both ϕ∗ and α
e map mk into mkX,0 , it follows that
ϕ∗ (g) − α
e(g) = ϕ∗ (gk′ ) − α
e(gk′ ) ∈ mkX,o .
As this holds for all k it follows that ϕ∗ (g) − αe(g) ∈ ∩k mkX,0 . But as OX,0 is a Noetherian
k
local ring, it follows that ∩k mX,0 = 0, by Krull’s Intersection Theorem 1.3.5. Therefore,
ϕ∗ (g) = α
e(g).
(2) Because (Y, 0) = V I (Y, 0) , 0 , it suffices to show that for all g ∈ I (Y, 0), the
map
g ◦ ϕ : (X, 0) −→ (C , 0),
e is induced by α : OY,0 −→ OX,0 , the ideal I (Y, 0) is in the kernel
is the zero map. As α
of ϕ∗ : On −→ OX,0 . So the claim follows.
The uniqueness of ϕ is clear.
Corollary 3.4.22. Two germs of analytic spaces (X,x) and (Y,y) are isomorphic if
and only if OX,x and OY,y are isomorphic. So the study of germs of analytic spaces is
equivalent to the study of reduced analytic algebras. In particular, a germ of a complex
submanifold is isomorphic to (C k , 0) for some k.
3.4 Germs of Analytic Spaces 119
A1 A2 A3
D4 D5 D6
Definition 3.4.26. Let f ∈ C {x1 , . . . ,xn }.
∂f ∂f
(1) We define the Jacobian ideal by J(f ) := ( ∂x 1
, . . . , ∂x n
).
(2) The Milnor algebra of f by definition is the C –algebra C {x1 , . . . ,xn }/J(f ). The
Milnor number of f , µ(f ), by definition is the C –vector space dimension of the
Milnor algebra:
µ(f ) = dimC C {x1 , . . . ,xn }/J(f ) .
(3) The Tjurina algebra of f is by definition the C –algebra C {x1 , . . . ,xn }/ f,J(f ) .
The Tjurina number of f , τ (f ) by definition is the C –vector space dimension of
the Tjurina algebra.
The Milnor number and Tjurina number might be infinite.
Theorem 3.4.27. Let f and g be in C {x1 , . . . ,xn }. Suppose that V (f ), 0 and V (g), 0
are isomorphic. Then the Tjurina algebras of f and of g are isomorphic.14 In fact, if ϕ
is an automorphism of C {x1 , . . . ,xn } such that ϕ (f ) = (g), then ϕ (f ) + J(f ) =
(g + J(g)). In particular the Tjurina numbers τ (f ) and τ (g) are equal.
Proof. Let ϕ = (ϕ1 , . . . ,ϕn ) : C {x1 , . . . ,xn } −→ C {x1 , . .. ,xn } be an automorphism
such that (ϕ(f )) = (g). It is not difficult to see that g,J(g) = ug,J(ug) for a unit u.
Therefore, we may assume that ϕ(f ) = g. The chain rule says that for all i
∂g X ∂f ∂ϕ
j
X ∂f ∂ϕj
= ◦ϕ · = ϕ · .
∂xi j
∂xj ∂xi j
∂xj ∂xi
So we see that J(g) ⊂ ϕ(J(f )). As ϕ−1 is also an isomorphism, it follows by symmetry
J(f ) ⊂ ϕ−1 (J(g)). Compose thiswith ϕ. We get ϕ(J(f )) ⊂ J(g). As moreover ϕ (f ) =
(g), it follows that ϕ (f ) + J(f ) ⊂ (g + J(g)). The converse inclusion follows because
of symmetry.
14 It is a famous theorem of Mather and Yau that the converse holds in case the Tjurina number is finite,
see 9.1.8.
3.4 Germs of Analytic Spaces 121
Example 3.4.28. The Tjurina numbers of the Ak ,Dk and Ek singularities are equal to k.
The singularities Ak and Al are isomorphic if and only if k = l. Now let (X, 0) = V (f ), 0
for f 6= 0 be a germ of a regular point. Then one calculates directly that τ (f ) = 0.
In particular, all hypersurface singularities V (f ), 0 with 0 < τ (f ) < ∞ are genuine
singularities. So now we proved that the Ak for k ≥ 1 are indeed singularities. Similarly
Dk and Dl are isomorphic if and only if k = l, and these are indeed singularities.
It is also true that the Milnor number is an invariant of the hypersurface singular-
ity (X, 0). A proof however, is surprisingly much more difficult, and will be proved as
Corollary 6.4.10.
A1 –singularities can conveniently be described by the Tjurina, or Milnor number.
Theorem 3.4.29. Let (X, 0) = V (f ), 0 be a hypersurface singularity. Then µ(f ) =
τ (f ) = 1 if and only if (X, 0) is isomorphic to an A1 –singularity.
Proof. Suppose f ∈ m \ m2 . Then µ(f ) = τ (f ) = 0, as one easily checks. Hence we
may assume that f ∈ m2 , so that τ (f ) ≥ 1. A direct calculation shows that for an
A1 –singularity µ(f ) = τ (f ) = 1.
Conversely, as τ (f ) ≤ µ(f ) it follows that either µ(f ) = 0 or 1. But µ(f ) = 0 cannot
occur,Pas f ∈/ m \ m2 . So we may assume that µ(f ) = 1. We may write (nonuniquely)
f = i,j xi xj Hij , and we may assume that Hij is symmetric. We put hij = Hij (0).
Then
∂f X
=2 xj hij modulo m2 .
∂xi j
As µ(f ) = 1, it follows that J(f ) = m = (x1 , . . . ,xn ). So there exist αki with
X ∂f
xk = αki .
i
∂xi
P 2
So it follows that i aki hij = 12 δkj so that hij = ∂x∂i ∂x
f
j
(0) is an invertible matrix.
The theorem therefore follows from the famous Morse Lemma, to be proved next.
Lemma
2 3.4.30
(Morse Lemma). Let f ∈ C {x1 , . . . ,xn }, f ∈ m2 and suppose that
∂ f
∂xi ∂xj (0) has maximal rank n. Then there exists an automorphism ϕ of C {x1 , . . . ,xn }
such that ϕ(f ) = x21 + . . . + x2n .
Proof. We write
n
X
f= xi xj Hij (x1 , . . . ,xn ).
i,j=1
2
We may suppose that Hij = Hji for all i,j. The rank of ∂x∂i ∂x f
j
(0) is equal to the rank
of Hij (0). It follows from linear algebra that this rank is independent of the coordinates.
By induction on s we suppose we found coordinates (y1 , . . . ,yn ) such that
122 3 Basics of Analytic Geometry
X
f (y1 , . . . ,yn ) = y12 + . . . + ys−1
2
+ yi yj Hij (y1 , . . . ,yn ).
i,j≥s
The case s = 0 is trivial. From the assumption that we have maximal rank, it follows
that at least one of the Hij (0) is not equal to zero. After a linear coordinate change
in the ys , . . . ,yn we may suppose that Hss (0) 6= 0. √ One can, therefore, by the Implicit
Function Theorem, consider the square root g := Hss which is in fact a unit. We take
new coordinates
1 X His (w1 , . . . ,wn )
yi = wi for i 6= s; ys (w1 , . . . ,wn ) = ws − 12 wi .
g(w1 , . . . ,wn ) i>s
Hss (w1 , . . . ,wn )
It follows from the Inverse Function Theorem that this is an invertible map. Now one
calculates that f (w1 , . . . ,wn ) is of the desired form.
Exercises
3.4.31. Suppose X ⊂ C n , and Y ⊂ C m are affine varieties. Let ϕ : X −→ Y be a morphism
and suppose that via ϕ∗ the coordinate ring K[X] is a finitely generated K[Y ]–module. Let
x ∈ X, and y = f (x) ∈ Y . Show that for the germs of the analytic spaces (X,x) and (Y,y) the
ring OX,x is a finitely generated OY,y module.
(Hint: Use the general Weierstraß Division Theorem 3.2.10.)
` ´
3.4.32. Let I ⊂ C {x1 , . . . ,xn } be an ideal. Use the Nullstellensatz to prove that V (I), 0 = {0}
if and only if dimC C {x1 , . . . ,xn }/I < ∞. Prove that this is the case if and only if I is an m–
primary ideal. Here m = (x1 , . . . ,xn ).
3.4.33. Prove that the relation in definition 3.4.1 is an equivalence relation.
3.4.34. Let f : (X,p) −→ (Y,q) be a germ of a continuous map. In general, it does not make
sense to talk about the image of the map f . Indeed, look at the map
Show that there exist arbitrary small representatives X of (X,p) and Y of (Y,q) such that for
all U open in R2
U ∩ f (X) 6= U ∩ f (Y ).
So it is impossible to define the image as the class of the image of a small enough representative.
Note that in this example, f is not finite.
3.4.35. Formulate and prove Lemma 3.4.10 and Lemma 3.4.11 for the case of finitely many
Weierstraß polynomials. (For every Weierstraß polynomial one takes a new variable).
3.4.36. Let X,Y be topological spaces, f : X −→ Y be a closed map. Then for each open
neighborhood U in X of a fiber f −1 (y) there exists an open neighborhood V of y in Y such that
f −1 (V ) ⊂ U . Prove this.
3.4.37. Let X be an irreducible affine variety. Formulate and prove a Local Parametrization
Theorem for X.
3.4.38. Let (X,x) be a germ of ` an analytic
´ space, OX,x be its local ring. Define for I ⊂ OX,x
the germ of the analytic space V (I),x ⊂ (X,x). For a subgerm (A,x) ⊂ (X,x), define I (A,x).
Prove the standard properties of these operations, including the Nullstellensatz.
` ´
3.4.39. Let (X, 0) = V (f ), 0 be a hypersurface singularity. One says that (X, 0) has an isolated
singularity, if there exists a representative of X such that X \ {0} is a complex submanifold.
Using the Nullstellensatz, show that (X, 0) has an isolated singularity if and only if τ (f ) < ∞.
3.4 Germs of Analytic Spaces 123
3.4.40. Let ϕ : (X,x) −→ (Y,y) be finite map between germs of analytic spaces. The map ϕ is
called generically s to 1 if there exists a hypersurface (D,y) in (Y,y) such that for all small enough
representatives Y of (Y,y) and D of (D,y), the set π −1 (p) has cardinality s for all p ∈ X \ D.
(1) Show that if ϕ is generically s to 1 for some s ≥ 1, then ϕ is surjective.
(2) Show that ϕ∗ induces an isomorphism of total quotient rings ϕ∗ : Q(OY,y ) ∼= Q(OX,x ) if
and only ϕ is generically 1 − 1.
(Hint: First reduce to the case (X,x) and (Y,y) irreducible. Furthermore, if (Y,y) −→
(C k , 0) is a primitive Noether normalization of (Y,y), then the induced map (X,x) −→
(C k ,0) is a primitive Noether normalization of (X,x). Apply the Local Parametrization
Theorem.)
(3) Suppose that OY,y is a normal ring. Show that ϕ is generically s to 1, where s is equal to
the degree of the field extension Q(OX,x ) ⊃ Q(OY,y ).
(Hint: First recall from 1.5.7 that OY,y is an integral domain. Then check that in the proof
of the Local Parametrization Theorem we can replace C {x1 , . . . ,xk } by OY,y .)
(4) Show that this result can be generalized to general (Y,y) as soon as we know that a
“normalization” exists, see Section 4.4. This remark also holds for the next exercise.
3.4.42.
(1) Let (X,x) and (Y,y) be isomorphic germs of analytic spaces. Show that for all k ∈ N the
C –algebras OX,x /mkX,x and OY,y /mkY,y are isomorphic. In particular they are isomorphic
as C –vector spaces.
(2) Reprove that Ak is isomorphic to Al if and only if k = l. In fact, show that all of the
A-D-E–singularities give different isomorphism classes.
3.4.43. Let (X, 0) ⊂ (C n , 0) and (Y, 0) ⊂ (C m , 0) be germs of analytic spaces, and let ϕ :
(X, 0) −→ (Y, 0) be a germ of an analytic map. Define the graph Graph(ϕ) ⊂ (C n+m , 0), which
is isomorphic to (X, 0). Show that we can view ϕ as the projection on the second factor.
124
In this chapter we will continue with the basics of the theory of germs of analytic spaces
(and of affine varieties). Probably the most important invariant of (irreducible) germs of
analytic spaces is the dimension. There are various ways to introduce it. We decided to
give, in fact, four definitions of dimension. We introduce this many of them, so that we
have a flexible development of the theory.
Three of these definitions are given in Section 4.1, namely the Weierstraß dimension,
the Krull dimension and the Chevalley dimension. They all have a geometric motivation.
The Weierstraß dimension of a germ of an analytic space (X,x) is given by considering
a Noether normalization (X,x) −→ (C k , 0). It turns out that k is independent of the
Noether normalization of (X,x). This map is surjective and has finite fibers, so it should
be geometrically clear to the reader that (X,x) has the same dimension as (C k , 0), which
should be k. The Krull dimension is defined as the maximal length k of a chain of
irreducible germs (X0 ,x) $ (X1 ,x) $ · · · $ (Xk ,x) ⊂ (X,x). The Chevalley dimension
is defined by the minimal number k of hypersurfaces (H1 ,x), . . . ,(Hk ,x) one has to cut
(X,x) with in order to get a point. The main part of the first section is devoted to the
fact that these three definitions coincide. As suggested by the definition of Chevalley
dimension, we will have to take intersections. Now it might happen that, even if one
starts with a reduced space (X,x), the intersection with every hypersurface is nonreduced.
Algebraically, this means it might happen that for a reduced local ring (R,m) and every
f ∈ m the ring R/(f ) might be nonreduced. This is why we have to consider germs of
complex spaces, that is, nonreduced spaces. Algebraically, this just means that we study
analytic algebras.
Our discussion on dimension applies just as well to algebraic sets. One has to keep
in mind however, that dimension is a local property. For example, take X = V (zx,zy)
and the point p on the line as indicated in the following figure.
Then one would like to say that the dimension of X at the point p is one. If, however, p
is as in one of the following two pictures,
125
p
p
In Section 4.3 we will consider tangent spaces, regular and singular points. We al-
ready defined in Chapter 3 what a smooth point of an analytic space is, see Definition
3.3.11. The first result is that the set of singular points is contained in a proper ana-
lytic subset. Essential use is made of the Local Parametrization Theorem. We have to
postpone the proof that the set of singular points in fact is an analytic subset to Chap-
ter 6. However, as a preliminary version, we will prove the Jacobian criterion. This says
that the set of singular points is contained in the analytic subset given by the maximal
minors of the Jacobian matrix. The fact that it is equal, uses the Coherence Theorem
of Oka-Cartan, which is a deep theorem, to be proved in Chapter 6. We will introduce
126 4 Further Development of Analytic Geometry
the Zariski tangent space and prove that singular points can be characterized by those
points for which the embedding dimension, which is equal to the vector space dimension
of the Zariski tangent space, is not equal to the dimension of the space. Finally we will
prove the general Jacobian criterion, which gives a statement on when a general point of
an analytic subset is smooth.
In the final section of this chapter we study the normalization of a germ of an analytic
space. In case (X,x) is irreducible, this is a normal analytic space (X,ee x) together with a
finite degree one map n : (X,e e x) −→ (X,x). For reducible (X,x), we need the notion of a
multi-germ. First of all, we will show that a normalization (X,ee x) of a space (X,x) always
exists. The proof uses the Finiteness of Normalization Theorem 1.5.19, Corollary 3.3.25,
and Corollary 3.3.26. These are used to show that the normalization of OX,x is a finite
direct sum of analytic algebras. We continue to study properties of normal analytic spaces.
It will be shown that the singular points of a normal analytic space are contained in a
subset of codimension at least two. In particular, normal curve singularities are smooth.
As the normalization always exists, this means that for irreducible curve singularities
(X,x) there exists a germ of a finite, generically 1 − 1 map n : (C , 0) −→ (X,x). Hence,
irreducible curve singularities can be parameterized! This property of curve singularities
will be studied further in the next chapter. The final topic in this chapter is a function
theoretic interpretation of normal analytic spaces, which can be phrased by saying that
an analytic space (X,x) is normal, if and only if the “Riemann Extension Theorem”
holds for (X,x). This means that if f is a holomorphic function on X \ Y , where Y
contains the singular locus of X, and such that f is locally bounded, then there exists an
extension of f to a holomorphic function on the whole of X. Again in the proof, Noether
normalization is used to reduce the statement to the case of C k , a case we already know,
see 3.1.15.
We start with a generalization of the notion of the germ of an analytic space. We need
this, because in our development of dimension theory nonreduced analytic algebras occur
naturally.
Definition 4.1.1. A germ of a complex space (X,x) consists of
(1) a germ of an analytic space (Xred ,x), called the reduction (X,x) and,
(2) an analytic algebra OX,x such that (OX,x )red = OXred ,x .
Example 4.1.2. Let I = (x2 ) ⊂ C {x}. Then V (I), 0 = (Xred , 0) is the point 0,
and OXred ,x = C . Now I defines a germ of a complex space (X, 0) using (Xred , 0) and
OX,x = C {x}/I.
Next we give three definitions of dimension.
Definition 4.1.3. Let (X,x) be a germ of a complex space and (OX,x ,m) be its local
ring.
• The Weierstraß dimension of (X,x) is the least number k, such that there exists a
Noether normalization Ok ⊂ OX,x of (X,x). (It will turn out that the number k in
a Noether normalization Ok ⊂ OX,x is independent of the Noether normalization.)
4.1 Dimension Theory 127
p0 $ p1 $ . . . $ pk
in OX,x . Note that this definition also works for all (Noetherian) rings.
Although we will prove in this section that for germs of analytic spaces the three
definitions of dimension coincide, we call the Krull dimension simply the dimension
of (X,x), notation dim(X,x).
For the case OX,x = On , consider the following chain of prime ideals:
from which it follows that the Krull dimension of (C n , 0) is at least n. It needs proof,
and is in fact nontrivial that the dimension of (C n , 0) is indeed equal to n. Of course, it
follows from the fact that the Krull dimension is equal to the Weierstraß dimension, or
alternatively, it is equal to the Chevalley dimension. We now start off proving that the
three definitions coincide.
Lemma 4.1.4. Let R ⊂ S be Noetherian local rings, mR ⊂ mS . Suppose that S is
a finitely generated R–module. Then the Krull dimension of S is equal to the Krull
dimension of R. In particular, if ϕ : (X,x) −→ (Y,y) is a finite surjective map of germs
of complex spaces, then dim(X,x) = dim(Y,y).
128 4 Further Development of Analytic Geometry
Proof.
Step 1. First let p0 $ · · · $ pd be a chain of prime ideals in R. By the Going-Up Theorem
1.5.25 there exists a chain of prime ideals P0 $ · · · $ Pd with Pi ∩ R = pi . This shows
that the Krull dimension of S is at least the Krull dimension of R.
Step 2. Now let P0 $ · · · $ Pd be a chain of prime ideals in S. Let pi := Pi ∩ R. Then
of course, the pi are prime ideals in R. It remains to show that for all i = 0, . . . ,d − 1
we have a proper inclusion pi $ pi+1 . By considering the quotient rings R/pi and S/Pi ,
which are integral domains, it obviously suffices to prove the following statement.
Let R ⊂ S be an integral extension, and (0) 6= J ⊂ S be an ideal containing a
nonzerodivisor. Then J ∩ R 6= (0).
To prove this, let x ∈ J be a nonzerodivisor. It satisfies an integral equation, which
we take of minimal degree
xn + . . . + an−1 x + an = 0, ai ∈ R.
On the other hand, suppose that we have elements f1 , . . . ,fd in R that generate
an m–primary ideal q = (f1 , . . . ,fd ) in Rred . This means that there exists a n ∈ N
with mn + N ⊂ q + N , where N is the set of nilpotent elements of R. However, as R is
Noetherian, there exists an s ∈ N with (q+N )s ⊂ q. It follows that mns ⊂ (mn +N )s ⊂ q.
Thus q is an m–primary ideal.
Step 3. First suppose that the Chevalley dimension is zero. We may now suppose that
the ring is reduced. We will show that (0) is a maximal ideal. Because the Chevalley
dimension is zero, it follows that there exists an m–primary ideal with 0 generators. Thus
the ideal (0) is m–primary. From the definition of primary, it follows that every zero
divisor is nilpotent, and hence zero, since R is reduced. Hence there are no zerodivisors
different from 0, which means that (0) is a prime ideal. As, moreover, (0) is m–primary,
it is equal to m. Hence the Krull dimension is zero.
Step 4. Suppose the Krull dimension is zero. Then m is the only prime ideal of R, as
otherwise we can construct a chain of length at least one. As the nilpotent elements
of R is the intersection of all minimal prime ideals, it follows that all elements of m
are nilpotent. Thus (0) is an m–primary ideal, which exactly says that the Chevalley
dimension of R is zero.
We now prove that the Chevalley dimension is smaller than or equal to the Krull
dimension. The proof works for a general Noetherian local ring.
Lemma 4.1.7. Let (R,m) be a Noetherian local ring. Then the Chevalley dimension is
smaller than or equal to the Krull dimension.
The main step in the proof is given by the following proposition.
Proposition 4.1.8. Let R be a Noetherian ring of Krull dimension d. Suppose that f
is an active element of R, that is, f is not contained in any of the minimal prime ideals
of R 1 . Then the Krull dimension of R/(f ) is at most d − 1.
Proof. Consider the ring R/(f ). Any chain of prime ideals p1 $ · · · $ ps in R/(f ) gives a
chain of prime ideals in R, with (f ) ⊂ p1 $ · · · $ ps . The ideal p1 is not minimal, because
otherwise f would be contained in a minimal prime ideal, contrary to the assumption.
Therefore, we can find a minimal prime ideal p0 contained in p1 . Hence s ≤ d, and it
follows that the Krull dimension of R/(f ) is at most d − 1.
Proof of Lemma 4.1.7. Let d be the Krull dimension of R. The proof goes by induction
on d. The case d = 0 was proved in Lemma 4.1.6.
Therefore, we may suppose d > 0. Then m is not a minimal prime ideal. Take an
element f ∈ m, which is not contained in ∪pi , where the pi are the minimal prime ideals of
R. Such an element exists because of Prime Avoidance, see 1.1.13. Applying the previous
lemma, we get that the Krull dimension of R/(f ) is at most d − 1. By induction we
have elements f1 , . . . ,fs , with s ≤ d − 1 generating an m–primary ideal in R/(f ). Then
f,f1 , . . . ,fs generate an m–primary ideal in R. Thus the Chevalley dimension is smaller
than or equal to s + 1 ≤ d.
Theorem 4.1.9. Let (X,x) be a germ of a complex space. Then the Weierstraß dimension
of (X,x) is equal to the Chevalley dimension of (X,x), and thus all three definitions of
dimension coincide for germs of complex spaces.
1 This exactly means that f is not a zerodivisor of Rred , see 1.4.24.
130 4 Further Development of Analytic Geometry
Proof. By Corollary 4.1.5 and Lemma 4.1.7 it remains to show that the Weierstraß
dimension is less than or equal to the Chevalley dimension of (X,x). This will be proved
in two steps.
Step 1. Let ϕ : Ok −→ OX,x be a C –algebra homomorphism, not necessarily injective.
Suppose that OX,x is, via ϕ, a finitely generated Ok –module. Then we claim that the
Weierstraß dimension of (X,x) is less than or equal to k.
Indeed, consider the injective map Ok / Ker(ϕ) ⊂ OX,x . This is an integral extension.
Now Ok / Ker(ϕ) is an analytic algebra. So there exists an s ≤ k and a Noether normal-
ization Os ⊂ Ok / Ker(ϕ). It follows that Os ⊂ OX,x is a finitely generated extension,
hence a Noether normalization. The claim follows.
Step 2. Now let f1 , . . . fs be generators of an m–primary ideal of OX,x . We consider the
map
By assumption OX,x /(f1 , . . . ,fs ) is a finitely generated C –vector space, so that, by the
general Weierstraß Division Theorem 3.2.10, OX,x is a finitely generated C {y1 , . . . ,ys }–
module. By step 1 it follows that the Weierstraß dimension is less than or equal to s.
Corollary 4.1.10 (Active Lemma). Let f ∈ m ⊂ OX,x be an active element, (Y,x) :=
V (f ),x ⊂ (X,x). Then dim(Y,x) = dim(X,x) − 1.
Proof. We have shown in 4.1.8, dim(Y,x) ≤ dim(X,x) − 1. It cannot be smaller, as
otherwise we would be able to find s generators of a primary ideal with s < dim(X,x).
Example 4.1.11. The condition that f is an active element
is essential. For example,
the germ consisting coordinate
axes (X, 0) = V (xy), 0 has dimension one. Take f = x.
Then (Y,x) = V (x), 0 also has dimension one.
The following corollary is typical for the germ (C n , 0) and certainly does not hold
for general germs of analytic spaces.
Corollary 4.1.12 (Characterization of Hypersurfaces). Let (X, 0) ⊂ (C n , 0) be irre-
ducible, and suppose dim(X, 0) = n − 1. Then there exists an f with I (X, 0) = (f ), that
is (X, 0) is a hypersurface.
More generally, if every irreducible component of (X, 0) has dimension n − 1, it
follows that (X, 0) is a hypersurface.
Proof. It follows that I (X, 0) is a prime ideal, so that we can take an irreducible f ∈
I (X, 0). We claim that (f ) = I (X, 0). Indeed, otherwise there would be a g ∈ I (X, 0) \
(f ). In particular g would be a nonzerodivisor modulo (f ), and by the Active Lemma
dim(X, 0) ≤ dim V (f,g), 0 = n − 2, a contradiction.
To prove the second statement, let (X, 0) = (X1 , 0) ∪ · · · ∪ (Xr , 0) be an irre-
ducible decomposition.
Then we just proved that (Xi , 0) = V (fi ), 0 for some fi . Thus
V (f1 · · · fr ), 0 = V (f1 ), 0 ∪ · · · ∪ V (fr ), 0 = (X, 0).
Theorem 4.1.13. Suppose (X,x) is a germ of a complex space. Let (X,x) = (X1 ,x) ∪
· · · ∪ (Xr ,x) be an irreducible decomposition of (X,x). Then
Proof. We may assume that OX,x is reduced. We use the Krull dimension. The inequal-
ity ≥ is obvious, since a chain of irreducible germs in (Xi ,x) gives one in (X,x). Let
I (Xi ,x) = pi . Then pi is a minimal prime ideal of OX,x . Let (Y,x) be an irreducible
subgerm of (X,x). Then there exists an i with (Y,x) ⊂ (Xi ,x). This is because I (Y,x)
is a prime ideal, and therefore there exists a minimal prime ideal pi ⊂ I (Y,y). Taking
zero sets gives (Y,x) ⊂ (Xi ,x). So a maximal chain of irreducible subgerms is contained
in some irreducible component of (X,x). This shows the theorem.
Now consider an irreducible
germ of an analytic space (X,x), and let 0 6= f ∈ m ⊂
OX,x . Put (Y,x) := V (f ),x ⊂ (X,x). It may very well happen that (Y,x) is reducible.
Note that we proved (Active Lemma) that the dimension of (Y,x) is equal to dim(X,x)−1,
but we did not prove that each irreducible component of (Y,x) has dimension dim(X,x)−
1. We only proved that there is at least one component of this dimension. However,
the statement that for an active element f each irreducible component of V (f ) has
codimension one is in fact true, and holds for general Noetherian rings. This is Krull’s
Principal Ideal Theorem, to be proved in the next section, see 4.2.16. In this section, we
give a proof for germs of analytic spaces which uses the Going-Down Theorem 1.5.26.
First we need a definition.
Definition 4.1.14. Let R be a ring.
(1) Let p be prime ideal. We define the height ht(p) of p, to be the maximal m such
that there exists a chain of prime ideals
p0 $ p1 $ · · · $ pm = p.
as follows directly from the definition. Consider a germ of a complex space (X,x), and
let I ⊂ OX,x be an ideal. Equality in (4.1) would allow us to interpret the height of I as
the codimension of the zero set of I in (X, 0). The codimension of a germ of a complex
(Y,x) in (X,x) is simply defined as dim(X,x) − dim(Y,x). Of course, it suffices to show
this for irreducible germs . This will be shown now.
There exist examples of Noetherian rings, for which we have strict inequality in
(4.1), see Exercise 4.1.30.
Theorem 4.1.16. Let (X,x) ⊂ (C n , 0) be an irreducible germ of a complex space. All
maximal chains of prime ideals in OX,x have the same length, namely dim(X,x). In
particular, formula (4.1) holds for R = OX,x .
Proof. Let Ok ⊂ OX,x be a Noether Normalization, and consider a chain of prime ideals
P0 $ P1 $ · · · $ Ps
in OX,x which cannot be extended to a larger chain. We will show that s = k = dim(X,x).
Obviously s ≤ k.
132 4 Further Development of Analytic Geometry
Step 1. Let p be a prime ideal of height one in Ok . We will first show that the dimension
of Ok /p is k − 1. Now p 6= (0), and there exists an element 0 6= f ∈ p. As p is prime,
there exists an irreducible factor of f which lies in p. So we may assume that we have
an irreducible f which lies in p. As Ok is a unique factorization domain, (f ) is a prime
ideal. Since p has height one, it follows that p = (f ). By the Active Lemma 4.1.10, the
dimension of Ok /p is k − 1.
Step 2. Let pi = Pi ∩ Ok . Then we have seen in the proof of Lemma 4.1.4 that
(4.2) p0 $ p1 $ · · · $ ps
is a chain of prime ideals in Ok . We claim that this chain cannot be extended to a larger
chain of prime ideals in Ok . Suppose the converse, and suppose that there exists a prime
ideal q with say pi $ q $ pi+1 . (Here we take the convention ps+1 = Ok .) Take a Noether
normalization
Ot ⊂ Ok /pi .
We thus have Ot ⊂ Ok /pi ⊂ OX,x /Pi . As Ok /pi ⊂ OX,x /Pi is also a finite extension,
we have a Noether normalization Ot ⊂ OX,x /Pi . The prime ideals q and pi+1 induce
prime ideals in Ok /pi which we denote by the same name. Moreover, (0) $ q $ pi+1 .
Intersecting this chain with Ot , we get a chain of prime ideals in Ot
(0) $ q′ $ p′i+1 .
(1) Let (X, 0) ⊂ (C n , 0) be a germ of an analytic space, and let I (X, 0) be the ideal
of (X, 0), and k be the minimal number of generators of I (X, 0). Then (X, 0) is
called a complete intersection if the dimension of (X, 0) is n − k.
(2) A germ of an analytic
space (X, 0) is called a set-theoretic complete intersection if
(X, 0) = V (I), 0 , and I can be generated by n − dim(X, 0) elements.
(3) Let R be a local ring. A sequence of nonunits {f1 , . . . ,fk } is called a regular sequence
if for all i = 1, . . . ,k (the class of) fi is a nonzero divisor of R/(f1 , . . . ,fi−1 ).
4.1 Dimension Theory 133
Example 4.1.19. We consider (X, 0), the coordinate axes in C 3 , given by the zero set of
the ideal I = (xy,xz,yz). This is a radical ideal as it is the intersection of the prime ideals
(x,y), (x,z) and (y,z). So I = I (X, 0), and by Nakayama’s Lemma it follows that the
minimal number of generators of I is equal to three. It is easy to see (using for example
the Chevalley dimension), that the dimension of (X, 0) is one. Therefore, (X, 0) is not a
complete intersection. It is however
a set-theoretic complete intersection. Indeed as one
easily checks, (X, 0) = V (J), 0 , where J = (xy,zy + zx).
Corollary 4.1.20. Let (X, 0) be a germ of a complex space, and suppose I (X, 0) is
minimally generated by f1 , . . . ,fk ∈ On . Then (X, 0) is a complete intersection, if and
only if {f1 , . . . ,fk } form a regular sequence in On . In this case, dim(X, 0) = n − k.
The proof is left as Exercise 4.1.25.
Definition 4.1.21. Let I ⊂ R be an ideal.
(1) I is called unmixed if all associated primes of I have the same height.
(2) I (or R/I) is called equidimensional, if for all minimal associated primes p we have
dim(Rp /IRp ) = dim(R/I).
In case I ⊂ On is unmixed, I does not have embedded components and is equidimen-
sional. By abuse of language, we also say that (X, 0) is unmixed, if the ideal I defining
(X, 0) is unmixed. Note that it very well might happen that unmixed ideals are not rad-
ical. As a corollary of dimension theory, we can give a nice characterization of unmixed
ideals in On in terms of a Noether normalization.
Theorem 4.1.22. Let I ⊂ On be an ideal and OX,0 = On /I. Suppose we have a Noether
normalization Ok ⊂ OX,0 of (X, 0). Then:
I is unmixed ⇐⇒ Every f ∈ Ok is a nonzero divisor of OX,0 .
Proof. Consider a primary decomposition of I:
I = q1 ∩ . . . ∩ qs .
√
Let pi := qi for i = 1, . . . ,s. By the Weierstraß characterization of dimension the ideal
I is unmixed, if and only if for all i = 1, . . . ,s the map
Ok −→ On /pi ,
Lq
q
V
X
B
The line Lq intersects the boundary {π(q)}×∂V . The restriction of f to Lq ∩({π(q)}×V )
is a function of one complex variable, so attains maximum absolute value on the boundary
{π(q)} × ∂V . Hence
4.2 Hilbert-Samuel Function and Multiplicity 135
4.1.27. Let X ⊂ C n be an affine algebraic set, m = (x1 , . . . ,xn ). Let mk = (x1 , . . . ,xk ) ⊂
C [x1 , . . . ,xk ] and suppose
C [x1 , . . . ,xk ]m k ⊂ C [X]m
is a Noether normalization. Consider the germ of the analytic space (X, 0). Show that
Ok ⊂ OX,0
HSR : N −→ N
d 7→ dimR/m R/md
is called the Hilbert-Samuel function of R. In case R = OX,x is the local ring of a germ
of a complex space (X,x) we also write HSX,x and call it the Hilbert-Samuel function of
(X,x).
136 4 Further Development of Analytic Geometry
Example 4.2.2. We take (X,x) = (C n , 0). Then R/md is equal to the vector space of
all polynomials
of degree less than d in n variables. This is a vector space of dimension
n+d−1
n . This can be proved by inductionP for example, but the easiest way to see this is as
follows. To a monomial xα 1
1
· · · xαn
n with i < d we assign a monomial of degree d − 1
i αP
α1 αn d−1− i αi
in n + 1 variables, namely x1 · · · xn · x0 . So it suffices to count all monomials
of degree d − 1 in n + 1 variables. A monomial xα αn
0 · · · xn we represent by circles and
0
crosses. We first put α0 circles, then a cross, then α1 circles, then a cross, etc.
x x
α0 αn
On the other hand, one can assign a polynomial to such a diagram.
So we have to
choose n crosses out of n + d − 1 possibilities, and we get n+d−1 monomials. It follows
n
n
n+d−1
that the Hilbert-Samuel function HSC ,0 of (C , 0) is given by d 7→
n
n .
The aim of this section is to prove that for d ≫ 0, the Hilbert-Samuel function
behaves like a polynomial, and interpret its degree as the dimension of the space. Finally,
in case R is the local ring of a singularity we will interpret the leading coefficient of the
polynomial as its “multiplicity”.
Theorem 4.2.3. Let (R,m) be a Noetherian local ring.
(1) There exists a polynomial HSPR ∈ Q[t], such that HSPR (d) = HSR (d) for d suffi-
ciently large. We call HSPR the Hilbert-Samuel polynomial of R.
In case R is the local ring of (X,x), we also write HSPX,x for the Hilbert-Samuel
polynomial of R, and call it the Hilbert-Samuel polynomial of (X,x).
(2) deg(HSPR ) = dim(R).
The proof of 4.2.3 will be given on page 141. For the proof, we need to introduce a
much more general notion of Hilbert-Samuel function and polynomial.
Definition 4.2.4. Let (R,m) be a Noetherian local ring, q be an m–primary ideal and
M be a finitely generated R–module. A q–filtration {Mi } = {Mi : i ∈ N} is a sequence
of submodules M = M0 ⊃ M1 ⊃ . . . such that qMd ⊂ Md+1 for all d. It is called a stable
q–filtration (or q–stable filtration) if moreover there exists an n0 such that qMd = Md+1
for all d ≥ n0 .
Let {Mi } be a q–filtration. Then HS{Mi } : N −→ N with
is called the Hilbert-Samuel function of the filtration {Mi }. (We will show in Theorem
4.2.6 that indeed dimR/m M/Md < ∞, so that the definition makes sense.)
If we take the filtration {Mi } with Mi = mi · M , we simply write HSM for HS{Mi } ,
and this is called the Hilbert-Samuel function of M . The Hilbert-Samuel function of
Definition 4.2.1 is the case M = R, q = m, and Mi = mi .
Remarks 4.2.5.
4.2 Hilbert-Samuel Function and Multiplicity 137
(1) We need the general concept of the Hilbert-Samuel function with respect to a filtra-
tion to obtain additivity. Namely, let N ⊂ M be a submodule, then the sequence
The proof needs some more preparations, and will be given on page 140. First note
Pd−1
that, obviously HS{Mi } (d) = k=0 dimR/m Mk /Mk+1 . This leads to the following def-
inition.
Definition 4.2.7. Let (R,m) be a local ring, q an m–primary ideal and {Mi } be a
stable q–filtration. The associated graded ring Grq = Grq (R) by definition is Grq (R) :=
⊕d≥0 qd /qd+1. The associated graded module Gr{Mi } = Gr{Mi } (M ) by definition is
Gr{Mi } := ⊕d≥0 Md /Md+1 . Note that Grq is a graded ring, and that Gr{Mi } is a graded
Grq –module. This follows directly from the definition of a q–filtration.
The following lemma motivates the condition in the definition of a stable q–filtration.
138 4 Further Development of Analytic Geometry
Lemma 4.2.8. Let M be a finitely generated module over the Noetherian local ring
(R,m), q be an m–primary ideal and {Mi } be a stable q–filtration of M . Then Grq is a
Noetherian ring, and Gr{Mi } is a finitely generated Grq –module.
Proof.
Step 1. We first prove that Grq is Noetherian. By assumption R is Noetherian
and, there-
fore, q is finitely generated, say by a1 , . . . ,at . The algebra map R/q [x1 , . . . ,xt ] −→ Grq
which sends xi to the class of ai is surjective. The ring (R/q)[x1 , . . . ,xt ] is Noetherian,
and from the surjectivity it follows that Grq is Noetherian.
Step 2. Now we show that Gr{Mi } is a finitely generated Grq –module. Because {Mi }
is a stable q–filtration, there exists an n0 ∈ N such that Mn0 +ℓ = qℓ Mn0 for all ℓ ≥
0. This says that Gr{Mi } is generated by ⊕k≤n0 Mk /Mk+1 . Furthermore, because R
is Noetherian, the module Mk /Mk+1 is a finitely generated R–module. Itfollows that
Mk /Mk+1 is a finitely generated R/q–module. Therefore, ⊕k≤n0 Mk /Mk+1 is a finitely
generated R/q–module. So indeed Gr{Mi } is a finitely generated Grq –module.
This motivates us to study the graded situation.
Definition 4.2.9. Let K be a field, A = ⊕i≥0 Ai be a graded K–algebra. Suppose
that dimK A0 < ∞ and that A1 = (x1 , . . . ,xr )A0 generates A as an A0 –algebra. This
means that A is the ring A0 [x1 , . . . ,xr ].2 Let M = ⊕i≥0 Mi be a finitely generated graded
A–module. We define the Hilbert function HM : N −→ N of M by
HM (n) = dimK Mn .
0 −→ M1 −→ M2 −→ M3 −→ M4 −→ 0
of finite-dimensional K–vector spaces, then dimK (M1 ) + dimK (M3 ) = dimK (M2 ) +
dimK (M4 ), see Exercise 1.2.26. It follows from the definition of the Hilbert-Poincaré
series that
HPKer(ϕ) (t) + HPM (t) = HPM(−1) (t) + HPCoker(ϕ) (t).
From the definition of ϕ, it follows that Ker(ϕ) and Coker(ϕ) are finitely generated
A/xr –modules. By induction, we therefore have
QKer(ϕ) (t) QCoker(ϕ) (t)
HPKer(ϕ) (t) = , and HPCoker(ϕ) (t) = .
(1 − t)r−1 (1 − t)r−1
From the definitions of M (−1) and HP it follows immediately that HPM(−1) (t) =
t HPM (t), and we obtain
G(t)
HPM (t) = , G(1) 6= 0,
(1 − t)s
Pk
with G(t) = i=0 gi ti ∈ Z[t] and s ≤ r. We expand (1 − t)−s in a power series. We leave
it as an exercise to show that the power series expansion of (1 − t)−s is
X ∞
−s s−1+i i
(1 − t) = t.
i=0
s−1
Xk
s−1+d−i
HM (d) = gi .
i=0
s−1
140 4 Further Development of Analytic Geometry
This is a polynomial in d of degree s− 1, which proves the first statement of the corollary.
(2) The surjectivity implies that HM (d) = dimK Md ≥ dimK Nd = HN (d). Hence
PM (d) ≥ PN (d) ≥ 0 for d sufficiently large. This is only possible if deg PM ≥ deg PN .
(3) Let M be generated by m1 , . . . ,ms . Let di be the degree of mi . Then we have a
surjective map of graded modules ⊕m . By the second part, deg P⊕A(di ) ≥
i=1 A(di ) ։ MP
deg PM . But PA(di ) (d) = PA (d+di ) and P⊕A(di ) = si=1 PA(di ) . Therefore deg P⊕A(di ) =
deg PA .
(4) If Ann(m) = 0, for some homogeneous m ∈ Mk , then there exists an injection of
graded A–modules
A(k) −→ M : 1→
7 m ∈ Mk .
This implies HA (d + k) = HA(k) (d) ≤ HM (d). This proves the corollary.
Example 4.2.12. Let A = C [x 1 , . . . ,xn ], Ai = {f ∈ A : f homogeneous of degree i}.
Then HA (d) = PA (d) = n+d−1
n−1 . Hence, deg(PA ) = n − 1 and
X∞
i+n−1 i 1
HPA (t) = t = .
i=0
n−1 (1 − t)n
(2) Define Ai = qi /qi+1 for all i ≥ 0. Then Grq = ⊕i≥0 Ai , and satisfies the con-
dition of Definition 4.2.9, and Gr{Mi } is a finitely generated Grq –module by 4.2.8.
Let r be the minimal number of generators of q. In the proof of 4.2.8, we saw that
then the minimal number of generators of A1 is at most r. It follows that the the
Hilbert polynomial HPGr{Mi } has degree at most r − 1 by Corollary 4.2.11. Note that
HPGr{Mi } (d) = HS{Mi } (d + 1) − HS{Mi } (d) for large d. It is a nice exercise, see 4.2.25,
that this implies (2).
(3) Let {Mfi } be another stable q–filtration of M . It is not difficult to show, see Exercise
4.2.31, that there exists an n0 such that Mn+n0 ⊂ M fn and M fn+n0 ⊂ Mn for all n. This
implies HS{Mi } (d) ≤ HS{M fi } (d) ≤ HS{Mi } (d + n0 ). As the degree
fi } (d + n0 ) and HS{M
for d large. This implies that the leading coefficients of HSP{Mi } and HSP{M
fi } are equal
and proves the theorem.
Before giving the proof of Theorem 4.2.3 we need two more results.
4.2 Hilbert-Samuel Function and Multiplicity 141
Step 1. We first prove that the degree of HSPR is smaller than or equal to the Chevalley
dimension of R. Let q be any m–primary ideal, and consider the filtration qi R. We claim
that deg(HSP{qi } ) = deg(HSPR ) which was by definition equal to deg(HSP{mi R} ). To
see this, first note that for some s ∈ N we have m ⊃ q ⊃ ms . Thus for all d ≥ 0, we have
md ⊃ qd ⊃ msd . Thus for large d we get
Now HSPR (sd) is also a polynomial in d, and has the same degree as HSPR (d). Thus
the degrees of the polynomials are equal, proving the claim. The second part of Theorem
4.2.6, says that this degree is at most the minimal number of generators of q, which is
the Chevalley dimension of R.
Step 2. We now prove that the Krull dimension of R is less than or equal to deg(HSPR ).
As we already showed that the Chevalley dimension is less than or equal to the Krull
dimension, see 4.1.7, this suffices to prove the theorem. The inequality is proved by
induction on deg(HSPR ).
Step 2a. Suppose that deg(HSPR ) = 0. It follows that dim(R/md ) is independent of
d for d ≫ 0. Hence md = md+1 for d ≫ 0. It follows from Nakayama’s Lemma that
md = 0.Thus the zero ideal is m–primary. So the Chevalley dimension, and hence the
Krull dimension is zero.
Step 2b. We now do the induction step. Let s = dim(R), and p0 $ · · · $ ps be a maximal
chain of prime ideals in R. As the case s = 0 is trivial, we may suppose s > 0. In
particular p1 6= 0. Take an element x ∈ p1 \ p0 , and consider the ring S := R/p0 .3 We
claim that deg(HSPS/(x) ) ≤ deg(HSPR ) − 1. Indeed, x is a nonzerodivisor so that by
4.2.14 deg(HSPS/(x) ) ≤ deg(HSPS ) − 1. Moreover, we have a surjective map R/md ։
S/md . Therefore, deg(HSPS ) ≤ deg(HSPR ). Thus deg(HSPS/(x) ) ≤ deg(HSPR ) − 1. We
apply the induction hypothesis. Any chain in S/(x) has length at most deg(HSPR ) − 1.
But the image of the chain p1 $ · · · $ ps in S/(x) gives a chain of length s − 1. So
dim(R) − 1 = s − 1 ≤ deg(HSPR ) − 1. This proves the theorem.
Remark 4.2.15. Note that in this proof we showed that:
(1) for any Noetherian local ring the Krull dimension is equal to the Chevalley dimension;
(2) the degree of the Hilbert-Samuel polynomial of an R–module M does not depend on
the choice of the filtration on M , and is smaller than or equal to the dimension of R.
(Simply replace in the proof R by M .)
Theorem 4.2.16 (Krull’s Principal Ideal Theorem). Let R be a Noetherian ring, f ∈ R,
and p ⊃ (f ) be a minimal associated prime ideal of (f ). Then ht(p) ≤ 1. If moreover f
is an active element of R, for example a nonzerodivisor, then ht(p) = 1. In particular,
dim(R/(f ) = dim(R) − 1.
Proof. Consider an irredundant primary decomposition
(f ) = q1 ∩ · · · ∩ qr .
3 Note that if R is an integral domain, then S = R, because p0 = (0) is a prime ideal.
4.2 Hilbert-Samuel Function and Multiplicity 143
√ √
So qi = p for some i. Since p is minimal it follows that for all j 6= i we have qj 6⊂ p.
Consider the local ring Rp , with maximal ideal pRp . So we get the primary decomposition
(f )Rp = qi Rp .
Therefore (f )Rp is a primary ideal in Rp , and is even pRp –primary. It follows from the
definition that the (Chevalley) dimension of Rp is at most one. Thus ht(p) ≤ 1, by using
the Krull dimension and the first part of Remark 4.2.15.
Suppose moreover that f is an active element. If ht(p) = 0, then p is a minimal
prime ideal. As f ∈ p, this is in contradiction to the assumption that f is an active
element.
For a different proof of Krull’s Principal Ideal Theorem, which does not use the
Hilbert-Samuel function, see Exercise 4.2.32.
Definition P4.2.17. Let (R,m) be a Noetherian local ring of dimension e. We write
HSPR (t) = ek=0 ak tk . Then m(R) := e! · ae is called the multiplicity of R .
If R = OX,x is the local ring of a singularity (X,x), then we also write m(X,x) for
m(OX,x ) and call this the multiplicity of (X,x), or the multiplicity of X in x.
Remark 4.2.18. The multiplicity m(X,x) is positive, because m(X,x) e! is the coefficient
of the monomial of highest degree in HSPX,x and HSPX,x (d) > 0 for d sufficiently large.
We leave it is as Exercise 4.2.25, to show that m(X,x) is indeed an integer.
Examples 4.2.19.
(n+d−1)···d
(1) Let (X,x) = (C n , 0). Then HSPX,x (d) = n+d−1
n = n! = 1 n
n! d + . . . , which
is a polynomial in d of degree n. Therefore, m(X,x) = 1.
(2) Consider the A2 –singularity with local ring R = C {x,y}/(y 2 − x3 ). Consider an
element f ∈ C {x,y}. Then modulo (y 2 − x3 ) f has a representative Pf which is a
polynomial in y of degree at most one. It is even true that ord(Pf ) ≥ ord(f ), see
Exercise 3.2.19. So we get that R/md = C {x,y}/(y 2 − x3 ,xd−1 y,xd ), and it follows
that HSPR (d) = 2d − 1. So the cusp singularity has dimension one, and multiplicity
2.
We now generalize Example 4.2.19(2).
Lemma 4.2.20. Let f ∈ C {x1 , . . . ,xn }, ord(f ) = m and OX,x = On /(f ). Then
Xm
n+d−j−1
HSPX,x (d) = .
j=1
n−1
In particular the dimension of (X,x) is n−1, and the multiplicity is m. Note that we have
a Noether normalization On−1 ⊂ On /(f ) = OX,x , so that In particular, if f irreducible,
the multiplicity of (X,x) is equal to the degree of the field extension Q(OX,x ) ⊃ Q(On−1 ).
Proof. Without loss of generality we may suppose that f is a Weierstraß polynomial
in xn of degree equal to m = ord(f ). As in the example, any element g ∈ OX,x has a
representative Pg which is a polynomial in xn of degree at most m − 1. Furthermore,
ord(g) ≤ ord(Pg ). Now HSR (d) = dimC C {x1 , . . . ,xn }/((f ) + md ) is equal to
144 4 Further Development of Analytic Geometry
Such a general Noether normalization exists according to Exercise 3.3.30. Let m be the
maximal ideal of OX,x , and mk be the maximal ideal of Ok . Consider the filtration {mi }
of OX,x Then {mi } is a stable mk –filtration.
Proof. Obviously, {mi } is an mk –filtration. Let f ∈ C {x1 , . . . ,xn } be a power series of
order t. It follows from Exercise 3.3.31 that modulo p f is equivalent to a polynomial
X νk+1
Pf = fνk+1 ,...,νn xk+1 · · · xνnn , fνk+1 ,...,νn ∈ Ok .
νj <nj
P
Moreover, Pf has order at least t. So we see that, if t ≥ j≥k+1 nj , we can write every
element of mt+1 as an element of mk · mt . This is what we needed to show.
Example 4.2.22. C {x} ⊂ C {x,y}/(y 2 − x) = OX,x = ∼ C {y} is not a general Noether
normalization (in the sense of 3.3.30) because degy (y 2 − x) = 2 > ord(y 2 − x) = 1. Now,
mk = (x), m = (x,y) and m2i = (y 2i ) = (xi ) implies mk · m2i = (xi+1 ) = m2i+2 6= m2i+1
for all i. This shows that {mi } is not a stable mk –filtration of OX,x .
Proposition 4.2.23. Let (X,x) be an irreducible germ of an analytic space, and Ok ⊂
OX,x be a general Noether normalization. Then
that is, the multiplicity is equal to the degree of the field extension Q(OX,x ) ⊃ Q(Ok ).
Proof. We apply the Local Parametrization Theorem 3.4.14, in order to reduce the
statement to the case of a hypersurface singularity. This case was already considered
in Lemma 4.2.20. Let ∆ be the discriminant of P := Pk+1 . Let OX,x := On /p, OX ′ ,x′ :=
Ok [xk+1 ]/(P ), and K = OX,x /OX ′ ,x′ . It was shown in the first part of the Local
Parametrization Theorem that p ∩ Ok [xk+1 ] = (P ) so that we have an exact sequence of
Ok –modules.
Let mk be the maximal ideal of Ok . We consider the following filtrations on these three
Ok –modules.
4.2 Hilbert-Samuel Function and Multiplicity 145
Let m be the maximal ideal of OX,x , and m′ be the maximal ideal of OX ′ ,x′ . Then the
i
filtrations {mi } and {m′ } are mk –stable by Lemma 4.2.21.
It was noted in 3.4.16, that as a consequence of the Local Parametrization Theorem,
the module K is in fact an Ok /∆a –module for some a ∈ N. The ring Ok /∆a has dimension
k − 1. It follows from the second part of Theorem 4.2.6, and the second part of Remark
4.2.15 that the Hilbert-Samuel polynomial of K has degree at most k − 1. By definition
m(X,x) is determined by the leading term of HSP{mi } , and m(X ′ ,x′ ) is determined by
the leading term of HSP{m′i } , which both have degree k, and those are independent of
the stable mk –filtration by Theorem 4.2.6. From these two facts it follows that m(X,x) =
m(X ′ ,x′ ). So we reduced the problem to the case of an irreducible hypersurface, as
promised.
Proof. Let Ue ⊂ Gln (C ) be the Zariski open subset such that for Mf∈U e the correspond-
ing linear coordinate change defines, via the projection π : C n −→ C k onto the first k
coordinates, a general Noether normalization (X,x) −→ (C k , 0).
Let M be the part of Mf consisting of the first n−k rows. This part of M
f corresponds
f
to the direction of the projection π. Let U = {M | M ∈ U }.e
Choosing M ∈ U restricts the situation to the case that a general primitive Noether
normalization OX,x ⊃ Ok is given and ! we may assume OX,x = C {x1 , . . . ,xn }/p ⊃
1 0 0 ... 0
C {x1 , . . . ,xk } and M = .. .. .. .
. . .
0 1 0 ... 0
x1
!
M .. =0
. e
xn X
p
Ck
Let V = {p | π(p) 6∈ D}, where D is the discriminant, given by V (∆), with ∆ the
discriminant of the minimal polynomial of the field extension Q(OX,x ) ⊃ Q(Ok ). We
! now x1
apply 3.4.16 (1) to obtain that the (n−k)–dimensional plane E defined by M .. = p,
.
xn
( x1 ! )
..
that is E = . : x1 = p1 , . . . ,xk = pk = π −1 π(p) , intersects a representative of
xn
(X,x) in [Q(OX,x ) : Q(Ok )] points. Now [Q(OX,x ) : Q(Ok )] is equal to m(X,x) by
Proposition 4.2.23.
Exercises
` ´
4.2.25. Define for all k ∈ Z the polynomial kt := k! 1
t(t − 1) · · · (t − k + 1) ∈ Q[t]. For f ∈ Q[t]
define ∆f by ∆f (t) := f (t + 1) − f (t) ∈ Q[t].
` ´ ` t ´
(1) Prove that ∆ kt = k−1 .
` ´ ` t ´
(2) For any f ∈ Q[t] we can write f (t) = c0 kt + c1 k−1 + . . . + ck for suitable ci ∈ Q. Show
this.
(3) Suppose moreover that f (n) ∈ Z for n ≫ 0, and k = deg(f ). Prove that ci ∈ Z for
i = 0, . . . ,k.
(Hint: Use induction on k, and consider ∆f .)
(4) Let f : N −→ N be any function. Suppose that d 7→ ∆f (d) = f (d+1)−f (d) is a polynomial
Q(d) for d ≫ 0 of degree k − 1. Prove that d 7→ f (d) is, for d ≫ 0, a polynomial of degree
k.
` t ´ ` t ´ ` ´ ` t ´
(Hint: Write ∆f (t) = c0 k−1 + c1 k−2 + . . . + ck−1 , and consider P = c0 kt + c1 k−1 +
`t´
. . . + ck−1 1 . Prove that ∆(f − P )(d) is a constant for d ≫ 0.)
4.2.28. Let I ⊂ C [x1 , . . . ,xr ] = C [x] be an ideal, define C [x]≤s = {f | deg(f ) ≤ s} and I≤s =
I ∩ C [x]≤s . Define the affine Hilbert function by AHI (n) = dimC C [x]≤n /I≤n .
Pd `´
Prove that for sufficiently large n AHI (n) is a polynomial API (n) = bi si , bi ∈ Z, bd > 0.
i=0
(Hint: Let I h ⊂ C [t,x1 , . . . ,xr ] be the homogenization of I. Prove that (with the canonical
grading of C [t,x1 , . . . ,xr ]) HC [t,x1 ,...,xr ]/I h (n) = AHI (n).)
4.2.29. With the notations of Exercise 4.2.28 prove that deg API = dim C [x1 , . . . ,xr ]/I.
(Hint: Let I h be as in 4.2.28. Prove that
4.2.31. Prove: if {Mn }, {M fn } are stable q–filtrations of M then there exists an n0 such that
fn+n0 ⊂ Mn and M
M fn+n0 ⊂ Mn for all n ∈ N.
fn = qn M for all n.)
(Hint: It suffices to prove this for the special case M
4.2.32. Give a different proof of Krull’s Principal Ideal Theorem 4.2.16 along the following
steps.
(1) Reduce the second statement to the first, by using the characterization of zerodivisors.
(2) By looking at the localization Rp , reduce to the case that R is local, with maximal ideal
m = p. From now one we suppose this.
(3) From now suppose q 6= m is a prime ideal. Show that it suffices to show that height(q) =
dim Rq = 0.
(4) Show that it suffices to show that qRq is nilpotent.
(5) Show that in R/(f ) every descending chain of ideals becomes stationary using the fact
that mρ ⊂ (f ) for some ρ.
(6) Consider the symbolic powers q(n) := qn Rq ∩ R, see Exercise 1.3.23. Show that the chain
of ideals
(f ) + q(1) ⊃ (f ) + q(2) ⊃ (f ) + q(3) ⊃ . . .
becomes stationary, that is, there exists an n ∈ N with (f ) + q(n) = (f ) + q(n+1) .
(7) Prove that for this n we have q(n) = (f )q(n) + q(n+1) . (Use the second part of Exercise
1.3.23.)
(8) Apply Nakayama to see that q(n) = q(n+1) .
(9) Use the definition of symbolic power and Nakayama to see that qn Rq = 0.
(10) Deduce the theorem.
S
4.2.33. Let (X,x) be a germ of an analytic space and (X,x) = n i=1 (Xi ,x) be
Pan irreducible
decomposition. We set I = {i : dim(Xi ,x) = dim(X,x)}. Prove that m(X,x) = i∈I m(Xi ,x).
(Hint: Study the situation (X,x) = (X1 ,x) ∪ (X2 ,x), with (Xi ,x) reduced and defined by the
ideals Ii ⊂ On . Compare the Hilbert-Samuel functions by looking at the exact sequence
One may assume that dim(On /I1 + I2 ) < max{dim(On /I1 ), dim(On /I2 )}.)
148 4 Further Development of Analytic Geometry
We now study regular and singular points of analytic spaces. According to Definition
3.3.11 a point x of an analytic space X is called regular if a neighborhood of this point
is a complex submanifold of C n . In particular, the local ring OX,x of a smooth point
x of X is isomorphic to C {x1 , . . . ,xk } for some k, which is an integral domain. Since
for a reducible germ (X,x) we have that OX,x is not an integral domain the following
proposition follows.
Proposition 4.3.1. Let X be an analytic space, x ∈ X, and (X,x) be a reducible germ.
Then x ∈ Sing(X). In particular, if X =X1 ∪ · · · ∪ Xr with Xi 6⊂ Xj for i 6= j, then
S S
Sing(X) = i Sing(Xi ) ∪i6=j Xi ∩ Xj .
It might not be a big surprise for the reader that “almost all” points of an analytic
space are smooth. The proof however is a little bit tricky. For an irreducible space this is
proved by means of the Local Parametrization Theorem 3.4.14 (cf. Remark 3.4.16 (2)),
which essentially allows us to reduce the statement to the hypersurface case.
Proposition 4.3.2. Let U be an open subset of C n , and X be an analytic subset of U .
Then the set of singular points Sing(X) is contained in a proper analytic subset of X.
Proof. This is a local statement, that is, for all x ∈ X we have to prove it for a neighbor-
hood of x ∈ X. If (X,x) is an irreducible germ, it was proved in the Local Parametrization
Theorem that there exists a hypersurface (D,x) in (X,x), and representatives X and D
such that X \ D is a complex submanifold of an open set in C n .
Now if (X,x) is reducible, we can take an irreducible decomposition (X1 ,x) ∪ · · · ∪
(Xr ,x) of (X,x). For suitable representatives, we have X = X1 ∪ . . . ∪ Xr with Xi 6⊂ Xj
for i 6= j. We therefore have that Xi ∩ Xj for i 6= j is a proper analytic subset of X.
We may assume that Sing(Xi ) ⊂ Vi , Vi a proper analytic subset of Xi . On the other
hand, we have Sing(X) = ∪i Sing(Xi ) ∪ ∪i6=j (Xi ∩ Xj ) by the previous proposition.
Now ∪i6=j (Xi ∩ Xj ) is a proper analytic subset ofX. Therefore, if x is not contained in
the proper analytic subset ∪i Vi ∪ ∪i6=j (Xi ∩ Xj ) then x is a smooth point of X.
Our next goal would be to show that the set of singular points in fact forms an
analytic subset of X. However, as we will explain later, we are not able to prove this
in this section, since we will need the Oka-Cartan Theorem, see 6.3.2, concerning the
coherence of the ideal sheaf. So the fact that the singular locus is an analytic subset is
surprisingly deep. What we will show is, that we characterize the regular points by means
of the Jacobian Criterion.
Definition 4.3.3. Let (R,m) be a Noetherian local ring. The embedding dimension
edim(R) of R is defined by
Then
edim(X, 0) + rank0 (f1 , . . . ,fs ) = n.
Proof. Put e := edim(X, 0) and r = rank0 (f1 , . . . ,fs ).
Step 1. Suppose we have functions f1 , . . . ,fs that generate the maximal ideal m of On .
Then we claim that the Jacobian matrix
∂fi
(0)
∂xj 1≤i≤s,1≤j≤n
has rank n. It follows from Nakayama’s Lemma that s ≥ n, and in fact already n of the
fi generate the Pmaximal ideal m. Without loss of generality, f1 , . . . ,fn are generators.
Now write fi = j Aij xj modulo m2 , for Aij ∈ C . As the fi generate m, we conversely
P
have xi = j Bij fj mod m2 , for some Bij ∈ C . Therefore (Aij ) is an invertible matrix
∂fi
with inverse (Bij ). Now ( ∂x j
(0) = (Aij ), so the Jacobian matrix has maximal rank n.
150 4 Further Development of Analytic Geometry
C ∼
= OX,0 /mX,0 = OC n ,0 /(f1 , . . . ,fs ,g1 , . . . ,ge ).
So f1 , . . . ,fs ,g1 , . . . ,ge generate the maximal ideal of OC n , 0 . Hence the Jacobian matrix
of (f1 , . . . ,fs ,g1 , . . . ,ge ) has rank n by step 1. Deleting a column reduces the rank by at
most one. So deleting the columns corresponding to g1 , . . . ,ge shows that the rank r of
∂fi
the Jacobian matrix ( ∂xj (0) is at least n − e.
We can therefore apply the Implicit Mapping Theorem 3.3.6 and deduce that there exist
functions ϕn−r+1 (x1 , . . . ,xn−r ), . . . ,ϕn (x1 , . . . ,xn−r ) such that
fi x1 , . . . ,xn−r ,ϕn−r+1 (x1 , . . . ,xn−r ), . . . ,ϕn (x1 , . . . ,xn−r ) = 0,
for i = n − r + 1, . . . ,n. This shows that (X, 0) is a subgerm of the graph of the map
ϕ : C n−r −→ C r , that is, a subgerm of
V xn−r+1 − ϕn−r+1 , . . . ,xn − ϕn , 0 .
Proof. Suppose that g1 , . . . ,gm also generate I (X,p). Then we have an equality
s
X
gi = ξik · fk
k=1
for some ξik ∈ C {x1 , . . . ,xn }. We differentiate. By the product rule, and because fi (p) = 0
for all i, it follows
X s
∂gi ∂fk
(p) = ξik (p) (p).
∂xj ∂xj
k=1
P
n
∂fi
Now let (q1 , . . . ,qn ) ∈ C n be such that ∂xj (p)qj = 0 for i = 1, . . . ,s. Then
j=1
n
X s
X X n
∂gi ∂fk
(p)qj = ξik (p) (p)qj = 0.
j=1
∂xj j=1
∂xj
k=1
TX,0
0
TX,(1,1)
Theorem 4.3.12. Let (X,p) ⊂ (C n ,p) and (Y,q) ⊂ (C m ,q) be germs of analytic subsets
and f = (f1 , . . . ,fm ) : (X,p) −→ (Y,q) be a germ of a holomorphic map. Then there
exists a canonical linear map
Xn
∂gi
TX,0 = {a ∈ C n : (0)aj = 0, i = 1, . . . ,s}
j=1
∂xj
m
X ∂hi
TY,0 = {b ∈ C m : (0)bj = 0, i = 1, . . . ,t}.
j=1
∂yj
Now h1 ◦ f, . . . ,ht ◦ f vanish on X, so they are in the ideal generated by g1 , . . . ,gs . Hence
there exist sik ∈ OC n ,0 such that
s
X
hi ◦ f = sik gk .
k=1
Pn ∂gk
Now consider a point a = (a1 , . . . ,an ) ∈ TX,0 . So j=1 (0)aj = 0. It follows that
∂xj
4.3 Regular Local Rings and the Jacobian Criterion 153
m
X Xn
∂hi ∂fk
(0) (0)aj = 0.
∂yk j=1 ∂xj
k=1
∂fk
But this says exactly that (0) · at ∈ TY,0 .
∂xj
The second statement follows from the chain rule, and is left as an exercise.
Examples 4.3.13.
(1) Let π : C n −→ C k be the projection given by π(x1 , . . . ,xn ) = (x1 , . . . ,xk ), and
p = (p1 , . . . ,pn ) ∈ C n . Then
is surjective.
154 4 Further Development of Analytic Geometry
Proof. In the proof of 2.3.12, it was shown that there exists an f ∈ C [Y ] ⊂ C [X], and
an r ∈ N such that ϕ factorizes on X \ V (f ) as follows
e
ϕ π
X \ V (f ) −→ Y \ V (f ) × C r −→ Y \ V (f ).
Moreover C [X]f is a finitely generated C [Y ]f [x1 , . . . ,xr ]–module. Here π is the projection
on the second factor, and thus for all q ′ ∈ Y \ V (f ) and a ∈ C r , the tangent map Tπ,(q′ ,a)
is surjective. So it suffices to show that there exists a point p ∈ X \ V (f ) with Tϕ,p e is
surjective, as an application of 4.3.12. As both X \ V (f ) and Y \ V (f ) × C r are affine,
see 2.3.10, we reduced the problem to the case of a finite map ϕ : X −→ Y of affine
algebraic sets, such that C [X] is a finitely generated C [Y ]–module.
So take a smooth point q ′ ∈ Y . By Theorem 2.3.9 the map ϕ is surjective, so there
exists a point p′ in ϕ−1 (q ′ ). It follows from 3.4.24 that the germ of the analytic map
ϕ : (X,p′ ) −→ (Y,q ′ ) is finite. Thus this is a Noether normalization, as by assumption q ′
is a smooth point of Y . It obviously suffices to prove the statement for an irreducible germ
(X,p′ ). It was proved in the Local Parametrization Theorem, see 3.4.14 that for “almost
all” points p in an open neighborhood of p′ the germ of the map ϕ : (X,p) −→ (Y,q) is
an isomorphism. In particular the tangent map Tϕ,p is an isomorphism. This concludes
the proof of the theorem.
For later applications, for example in the interpretation of normal rings, and in the
proof of Artin’s Approximation Theorem, we need a more general version of the Jacobian
Criterion.
Theorem 4.3.15 (Jacobian Criterion for Smoothness). Let I = (f1 , . . . ,fm ) ⊂
C {x1 , . . . ,xn }, p be a prime ideal associated to I, and q ⊃ p be a prime ideal containing
p. Then5
∂fi
C {x1 , . . . ,xn }q /Iq is a regular local ring ⇐⇒ rank ∂x j
mod q = ht(p).
∂fi
Furthermore, if ht(p) = k and rank ∂x j
mod q 1≤i≤k
= k then f1 , . . . ,fk generate
the ideal Iq .
We first discuss how to interpret this theorem.
∂fi
Examples 4.3.16. (1) For example, take q = m the maximal ideal. Then ∂x mod m
j
∂fi
is just the Jacobian matrix ∂x j
(0) . As all elements in C {x1 , . . . ,xn } \ m are already
invertible in C {x1 , . . . ,xn } we have the equality C {x1 , . . . ,xn }m /Im = C {x1 , . . . ,xn }/I.
It follows that C {x1 , . . . ,xn }/I is a regular local ring if and only the Jacobian matrix
has maximal rank. This we have already seen in 4.3.6.
(2) Suppose that C {x1 , . . . ,xn }q /Iq is a regular local ring. Put k = ht(p). Then k is the
codimension of V (p). Suppose p is a minimal
associated prime of I. Consider
the k–minors
∂fi ∂fi
∆1 , . . . ,∆s of the Jacobian matrix ∂x j
. Then rank ∂x j
mod q = ht(p) exactly
means that there exists a minor ∆i which is not in q. It follows that for all a in an open
subset of V (q) the variety
V (I) is smooth at the point a. Namely V (∆1 , . . . ,∆s ), 0 ∩
V (q), 0 $ V (q), 0 is a proper analytic subset. Now take representatives of all spaces
involved. It follows that for all a ∈ V (q)\V (∆1 , . . . ,∆s ) there is also a ∆i with ∆i (a) 6= 0.
“` ´”
5 ∂fi
Here rank ∂x mod q is the rank of the matrix over the quotient field Q(C {x1 , . . . ,xn }/q).
j
4.3 Regular Local Rings and the Jacobian Criterion 155
(3) Note that in the special case I = q = p, that C {x1 , . . . ,xn }p /pp is a field. As a field
obviously is a regular local ring, it follows that almost all points of V (p) are nonsingular.
This we have already seen in the Local Parametrization Theorem, see 3.4.14. In the proof
of the Jacobian criterion, this special case will be treated first and, in fact, in the proof
the Local Parametrization Theorem is used.
(4) We consider the explicit example of the Whitney umbrella defined by x2 − y 2 z = 0.
So p = I = (x2 − y 2 z). We take q = (x,y). Then q is the ideal of the singular locus. So
we expect that C {x,y,z}q /(x2 − y 2 z)q is not a regular local ring. Indeed, the minimal
number of generators of ideal (x,y)q is two, as one checks by using Nakayama’s Lemma.
However the dimension of C {x,y,z}q /(x2 − y 2 z)q is one.
On the other hand, looking at the y–axis, a “general” point of the y–axis is smooth.
We take the ideal q = (x,z), in which we localize. This is the same as localizing in the
multiplicative set S = {1,y,y 2 , . . .}. So y becomes a unit, and we can eliminate z. Hence
C {x,y,z}q /(x2 − y 2 z) ∼
= C {x,y}y , which one directly checks to be a regular local ring of
dimension one.
Before we can give the proof of the general Jacobian criterion, we need some basic
facts on regular local rings.
Theorem 4.3.17. Let (R,m) be a regular local ring. Then R is an integral domain.
Proof. The proof is by induction on the Krull-dimension of R.
Step 1. If dim(R) = 0, then edim(R) = 0, hence the maximal ideal has zero generators.
Thus m = 0, hence R = R/m is a field, which obviously has no zerodivisors.
Step 2. Now suppose dim(R) > 0. Let p1 , . . . ,ps be the minimal prime ideals of R. If we
prove that pi = (0), for some i, then i = s = 1 (because of minimality), and therefore
R = R/p1 = R/(0) is an integral domain.
As dim(R) > 0, it follows by Nakayama that m 6= m2 . By prime avoidance 1.1.13, we
can find an x ∈ m\m2 with x ∈ / ∪i pi . The last condition means that x is an active element
of R. By Krull’s
Principal Ideal Theorem 4.2.16, dim(R/(x)) = dim(R) − 1. Moreover,
m/ (x) + m2 is a vector space of dimension dim(R) − 1. Hence R/(x) is a regular local
ring, and by induction hypothesis it is an integral domain. Therefore, the ideal (x) is a
prime ideal. It cannot be a minimal prime ideal, since we have chosen x ∈ / ∪i pi . Therefore,
there must be a minimal prime ideal, say pi , which is contained in (x). Let y ∈ pi be
arbitrary. It follows that y = ax for some a ∈ R. As pi is prime, it follows a ∈ pi . Hence
we proved the equality xpi = pi . From Nakayama’s Lemma it follows that pi = 0, which
is what we had to prove.
Corollary 4.3.18. Let (R,m) be an r–dimensional regular local ring and x1 , . . . ,xi ∈ m.
The following conditions are equivalent.
Proof.
Step 1. We first show (1) =⇒ (2). The images of x1 , . . . ,xr generate m/m2 as R/m–vector
space. Since dimR/m (m/m2 ) = r they are linearly independent.
156 4 Further Development of Analytic Geometry
Step 2. (2) =⇒ (3). We choose xi+1 , . . . ,xr ∈ m such that the images of x1 , . . . ,xr form
a basis of m/m2 . Nakayama’s Lemma implies that m = (x1 , . . . ,xr ).
Now x1 is not a zero divisor of R by Theorem 4.3.17 . Thus dim R/(x1 ) = dim(R)−
1 by Krull’s Principal Ideal Theorem. The images of x2 , . . . ,xr generate the maximal ideal
in R/(x1 ). This implies that R/(x1 ) is regular. By induction we get that R/(x1 , . . . ,xi )
is regular.
Step 3. We finally show (3) =⇒ (1). Let m/(x1 , . . . ,xi ) be generated by the images of
xi+1 , . . . ,xr ∈ m. Then m = (x1 , . . . ,xi , xi+1 , . . . ,xr ).
Theorem 4.3.19. Let p be a prime ideal in C {x1 , . . . ,xn }. Then C {x1 , . . . ,xn }p is a
regular local ring.
Proof. The ring C {x1 , . . . ,xn }p has dimension ht(p). Consider a primitive Noether nor-
malization
C {x1 , . . . ,xk } −→ C {x1 , . . . ,xn }/p.
Then ht(p), which is the codimension of V (p), is equal to n − k. Hence we have to find
n − k generators of the maximal ideal pC {x1 , . . . ,xn }p . Let P ∈ C {x1 , . . . ,xk }][xk+1 ]
be the minimal polynomial of xk+1 ∈ C {x1 , . . . ,xn }/p, and ∆ ∈ C {x1 , . . . ,xk } be the
discriminant. In particular ∆ ∈ / p. Moreover, let Qj = xj ∆ − qj (xk+1 ) ∈ p for j =
k + 2, . . . ,n. Here the qj are polynomials with coefficients in C {x1 , . . . ,xk } which exist
according to the Finiteness of Normalization Theorem 1.5.19. Then Qn , . . . ,Qk+2 ,P ∈ p,
and we proved already in 3.4.14 that they generate the ideal pC {x1 , . . . ,xn }∆ . As ∆ ∈ / p,
they in particular generate the ideal pC {x1 , . . . ,xn }p .
Remark 4.3.20. It is quite generally true that localizations of regular local rings in
prime ideals are regular local rings. A proof of this more general result uses homological
methods, see for example [Eisenbud 1995], Corollary 19.14. The proof which we gave
works for the polynomial case too, and is a simple consequence of the Local Parametriza-
tion Theorem.
Proof of Theorem 4.3.15.
Step 1. We first consider the case (f1 , . . . ,fm ) = I = p = q.
The ring C {x1 , . . . ,xn }p /pp is a field and hence a regular local ring. Therefore, in
∂fi
this case we have to prove rank ( ∂x j
mod p)i≤m,j≤n = ht(p). Take a primitive Noether
normalization
C {x1 , . . . ,xk } −→ C {x1 , . . . ,xn }/p.
So we have to prove that the Jacobian matrix has rank n − k. We use the Local
Parametrization Theorem, and keep the notations of the proof of the previous Theo-
rem 4.3.19. We noted in the previous proof that the ideal pC {x1 , . . . ,xn }p is generated
by (P,Qk+2 , . . . ,Qn ). By Exercise 4.3.24 the rank of the Jacobian matrix is independent
of the generators of pC {x1 , . . . ,xn }p . As this ideal is generated by n − k elements, it
follows that the rank of the Jacobian matrix modulo p is at most n − k. Now the final
n − k columns of the Jacobian matrix for the generators (P,Qk+2 , . . . ,Qn ) look like
4.3 Regular Local Rings and the Jacobian Criterion 157
∂P
∂P ∂P 0 ... 0
∂xk+1
...
∂xk+1 ∂xn
∂qk+2
∂Q ∂Qk+2 0 ... 0
k+2 − ∂xk+1 ∆
...
J := ∂xk+1 ∂xn = .. .. .
. .. . .
.
. . .. ..
∂Q
n ∂Qn . .
... ∂qn
∂xk+1 ∂xn − ∆
∂xk+1
∂P
So det(J) = ∆n−k−1 · ∂xk+1 . Because P is the minimal polynomial of the field extension
C {x1 , . . . ,xn }p /pp ⊃ Q(C {x1 , . . . ,xk }) we have ∂x∂P
k+1
6∈ p. So det(J) 6∈ p. This implies
that
∂fi
rank ( mod p)i≤m, j≤n = rank(J mod p) = n − k = ht(p).
∂xj
This shows the theorem in the special case I = p = q.
Step 2. Now suppose that C {x1 , . . . ,xn }q /Iq is a regular local ring. Then we claim that
(4.4) Iq = pq .
which is the maximal possible rank. Deleting the columns corresponding to ua+1 , . . . ,ub
we obtain ∂u
i
rank mod q i≤a, j≤n = a = ht(p),
∂xj
which is again the maximal possible rank. We deduce that the Jacobian matrix for the
f ’s also has maximal rank by applying again Exercise 4.3.24.
∂fi
Step 3. We now prove the converse. Therefore, assume rank ( ∂x j
mod q)i≤m, j≤n =
ht(p) = a. By renumbering, we may assume
∂f
i
rank ( mod q)i≤a, j≤a = a.
∂xj
a
X
ci f i ∈ q 2
i=1
The first aim of this section is to show that every germ of an analytic space has a
normalization. We start with the definition of a normal germ.
Definition 4.4.1. A germ of an analytic space (X,x) is called normal if the local ring
OX,x is normal.
The first property of normal germs is that they are irreducible.
Proposition 4.4.2. Let (X,x) a normal germ of an analytic space. Then (X,x) is irre-
ducible.
Proof. This is an immediate consequence of Theorem 1.5.7.
So we see that normality is a nontrivial condition. The first goal in this section is
to prove that for all germs there exists a normalization. But in order to reach this goal,
we have to allow more general spaces then germs of analytic spaces: we have to consider
multi-germs.
Definition 4.4.3. A multi-germ of analytic spaces (X,x) is a finite disjoint union (X,x) =
(X1 ,x1 ) ∪ . . . ∪ (Xr ,xr ) of germs of analytic spaces. The ring OX,x by definition is equal
to ⊕ri=1 OXi ,xi .
Let (Y,y) = (Y1 ,y1 )∪. . .∪(Ys ,ys ) be a further multi-germ. A map ϕ : (X,x) −→ (Y,y)
is given by a system of maps of ϕi : (Xi ,xi ) −→ (Yα(i) ,yα(i) ) for i = 1, . . . ,r, and
some α(i) ∈ {1, . . . ,s}. Such a map ϕ induces, and is induced by a C –algebra map
ϕ∗ : OY,y −→ OX,x . In the obvious way, one defines such a map to be finite, generic s to
1, etc.
A multi-germ (X,x) is called normal if OX,x is a normal ring.
Remarks 4.4.4.
(1) It is not so difficult to see that OX,x is a semi-local ring, see Exercise 4.4.16. This,
by definition, means that OX,x has only finitely many maximal ideals.
(2) It is an exercise, see 4.4.16, to show that a multi-germ (X,x) = (X1 ,x1 )∪. . .∪(Xr ,xr )
is normal if and only if (Xi ,xi ) is normal for i = 1, . . . ,r.
Now we can define what we mean by a normalization of a germ of an analytic space.
Definition 4.4.5. Let (X,x) be a germ of an analytic space. A normalization of (X,x)
e x) together together with a finite, generically 1 − 1 map
is a normal multi-germ (X,e
e x) −→ (X,x).
n : (X,e
Remark 4.4.6. It was proved in Exercise 3.4.40 that a generically 1 − 1 map ϕ between
irreducible germs of analytic spaces induces an isomorphism of quotient fields. This was a
simple consequence of the Local Parametrization Theorem. This holds more generally for
multi-germs, see Exercise 4.4.16. As a normalization should be generically 1−1, it follows
that the ring OX,e
e x is the normalization of the ring OX,x . So a normalization, assuming it
exists, is uniquely determined. Note that the normalization (X,ee x) has the same dimension
as (X,x). Furthermore, note that if (X,x) is irreducible, then the normalization (X,e e x)
must be a germ of an analytic space. This is because the total quotient ring of OX,x is
equal to the total quotient ring of OX,e
e x , which in case (X,x) is irreducible, is a field.
160 4 Further Development of Analytic Geometry
Examples 4.4.7.
(1) A smooth germ (C n , 0) is normal. This is because C {x1 , . . . ,xn } is a unique factor-
ization domain by Corollary 3.3.17, and unique factorization domains are normal,
see Theorem 1.5.5,6
(2) Consider the A1 –singularity, defined by the equation x2 − y 2 = 0. So OX,x =
C {x,y}/(x2 − y 2 ). The normalization (X,e e x) of (X,x) consists of two copies of
smooth spaces (X1 ,x1 ) with parameter t1 and (X2 ,x2 ) with parameter t2 . The
normalization map n is given by two maps n1 : (X1 ,x1 ) −→ (X,x) which sends t1
to (t1 ,t1 ), and n2 : (X2 ,x2 ) −→ (X,x) which sends t2 to (t2 , − t2 ).
(X1 ,x1 )
(X2 ,x2 )
(X,x)
(3) Let (X,x) be the A2 –singularity, with local ring OX,x = C {x,y}/(y 2 −x3 ). Then the
normalization of OX is equal to C {t}, where t = xy . Compare example 1.5.6. The
e x) is a smooth one-dimensional space with parameter t, and the
normalization (X,e
normalization map is given by t 7→ (x(t),y(t)) = (t2 ,t3 ). Note that in this example
the normalization map is “bijective”, but not an isomorphism.
(4) Let (X,x) be the A1 –singularity in three space, defined by the equation z 2 − xy.
Then X is already normal, cf. Exercise 1.5.33. The normalization of (X,x) is (X,x)
itself, and the normalization map is the identity.
(5) Let (X,x) be the “Whitney umbrella” defined by x2 − zy 2 = 0.
6 More generally, regular local rings are normal because they are unique factorization domains. But we
do not prove this.
4.4 Normalization 161
2 2
The element s := xy is integral. Indeed, s2 = xy2 = zy
y 2 = z. We consider the ring
OX,x [s] generated by OX,x and s in the quotient ring Q(OX,x ) and claim that it is
equal to the normalization. Indeed, by applying the Weierstraß Division Theorem,
it is not so difficult to see that
We can eliminate x and z and see that OX,x [s] is isomorphic to C {y,s}, which is
normal. So the normalization is smooth, with parameters y,s and the normalization
map is given by (y,s) 7→ (ys,y,s2 ).
Note that in this example the normalization map is indeed generically 1–1: for all
points (x,y,z) on the Whitney umbrella not on the z–axis there is only one point
in the preimage, but for all points on the z–axis except the origin there are two
points in the preimage of the normalization map.
In the third example we saw that the normalization of a cusp singularity is smooth.
This turns out to be the case for all curve singularities. So normality is a very strong
condition for curve singularities! It follows that for an irreducible curve singularity (X,x)
there always exists a finite generically 1–1 map from (C , 0) to (X,x). Such a map can
even be given in a particularly simple form, which is then called Puiseux expansion. This
will be studied in chapter five.
More generally the singular locus of a normal space must have at least codimen-
sion two, compare the fifth example. For hypersurface singularities, this condition is also
sufficient as suggested by the fourth example, but we are not able to prove this at the
moment, but only in the section on Cohen-Macaulay spaces, see in particular Corollary
6.5.9. So in higher dimensions the normality condition is not as strong as in the case of
curve singularities.
Proof. Step 1. We first prove “=⇒”. As principal ideal domains are unique factorization
domains, see Example 1.4.3, and those are normal by 1.5.5, it suffices to show that R is
a principal ideal domain. The definition of regularity gives that dimC m/m2 = 1, so m is
a principal ideal. To show that an arbitrary ideal I ⊂ R is principal, first write m = (t)
for some t. Take an element x ∈ I, x 6= 0. We claim that there exists a unit u ∈ R and a
natural number k(x) such that x = utk(x) . This is obvious if x is a unit. Otherwise put
x = x1 t for some x1 ∈ R, and do the same with x1 . Either x1 is a unit, and we are done,
or x1 = x2 t, etc. This process must stop because, otherwise, x = xk tk ∈ mk for all k. But
∩mk = 0 by Krull’s Intersection Theorem, which is in contradiction to the choice of x.
This shows the claim.
Now take k to be the minimum of all k(x) where x runs over the nonzero elements
of I. It follows then I = (tk ), so, in particular, principal. Thus R is a principal ideal
domain, as was to be proved.
Step 2. “⇐=”. As a normal local ring, R is an integral domain by Theorem 1.5.7. Assume
that R is not regular. We consider the R–module m∗ = (R : m) = {x ∈ Q(R) : xm ⊂ R}.
Obviously, R ⊂ m∗ . We claim that this inclusion is strict. p
To prove the claim, take an arbitrary pa ∈ m, a 6= 0. Then (a) = m. Indeed,
by the Primary Decomposition Theorem, (a) is an intersection of prime ideals. But,
by assumption on dim(R) = 1, and because R is an integral domain, there is just one
nontrivial prime ideal, namely m. Because R is Noetherian, it follows that there exists an
s ∈ N with ms ⊂ (a). Let s be minimal with this property. If s = 1, then m = (a), hence
R would be a regular local ring. Take a nonzero element b ∈ ms−1 \ (a). Then ab ∈ / R.
b s b ∗
However, a m ⊂ R, because bm ⊂ m ⊂ (a). Thus a ∈ m \ R. This proves the claim.
From the definition of m∗ it follows that m ⊂ mm∗ ⊂ R. So either mm∗ = m
or mm∗ = R. We claim that the first equality cannot occur. Indeed, it would follow
that ab m ⊂ m. Therefore, m would be an R[ ab ]–module, finitely generated as an R–
module. By the Cayley-Hamilton Theorem 1.5.8, ab would be an integral element, which,
by assumption, would have to be in R. But ab was constructed in such a way that it does
not belong to R.
Therefore mm∗ = R. Now take any element t ∈ m \ m2 , which exists according to
Nakayama’s Lemma. Then tm∗ ⊂ R. It is impossible to have tm∗ ⊂ m, for, otherwise,
tR = tm∗ m ⊂ m2 , in contradiction to the choice of t. Therefore, tm∗ = R, and (t) = tR =
tm∗ m = m. This shows that m is principal.
Corollary 4.4.10. Let R = C {x1 , . . . ,xn }/I be a one-dimensional integral domain.
e is isomorphic to C {t}.
Then its normalization R
(R1) for each prime ideal p of height one, Rp is a regular local ring;
(S2) let f ∈ R be a nonzerodivisor, then the ideal (f ) in R has no embedded primes.
Proof.
Step 1. Suppose first that R is normal. Then Rp , for a prime ideal p, is normal too, by
Exercise 1.5.32. Moreover, Rp has dimension one, so is regular by 4.4.9.
To check the second condition, suppose, on the contrary, that we have an embedded
prime p for some (f ). In particular, p is not a minimal prime ideal with p ⊃ (f ). By
Lemma 1.4.17 we can find an element b ∈ R such that p = (f ) : (b). Let m = pRp .
Considering m∗ := (Rp : m) = {x ∈ Q(R) : x · m ⊂ Rp }, one sees that fb ∈ m∗ . It follows,
as in the proof of 4.4.9, that if mm∗ = Rp , then m is principal. But then it follows that
the dimension of Rp is less than or equal to one. This is in contradiction to the fact that
p is not minimal over (f ), as the dimension of Rp is equal to the height of p.
So we may assume that mm∗ = m. We can now apply the Cayley-Hamilton Theorem
1.5.8: all elements of m∗ are integral over Rp . As we assumed R, and hence Rp , to be
normal, we deduce that m∗ = Rp . In particular, as fb ∈ m∗ . we have fb ∈ Rp . Therefore,
we can write fb = ac for some a ∈ / p. Now
(f ) = q1 ∩ . . . ∩ qt .
√
As (f ) does not have embedded primes by (S2), all the pi := qi are minimal associated
primes. By Krull’s Principal Ideal Theorem, see 4.2.16, all the pi are prime ideals of
height one. By (R1) it follows that the Rpi are regular. Moreover they have dimension
one, so the Rpi are normal by Theorem 4.4.9. It follows that fb ∈ Rpi for all i. To put
it in another way: b ∈ f Rpi for all i. By the Second Uniqueness Theorem of Primary
Decomposition 1.4.23 we have qi = R ∩ f Rpi for all i. Therefore, b ∈ qi for all i. Hence,
b ∈ ∩qi = (f ), and thus fb ∈ R, which had to be proved.
We will now consider a function-theoretic interpretation of normal spaces. For this,
we will show that a space is normal if and only if “Riemann’s Extension Theorem” holds
for X. First we need a definition.
Definition 4.4.12. Let X be an analytic space, and Sing(X) be the singular locus of
X. A function f : X \ Sing(X) −→ C is called weakly holomorphic on X if
(1) f is holomorphic on X \ Sing(X);
(2) f is locally bounded.
Let (X,x) be a germ of an analytic space. A weakly holomorphic function germ f on
(X,x) is an equivalence class of weakly holomorphic functions f : X \ Sing(X) −→ C .
Here X is a representative of (X,x). Two such functions are called equivalent if they
agree on the intersection of the corresponding representatives.
164 4 Further Development of Analytic Geometry
Remarks 4.4.13.
(1) The weakly holomorphic functions on X form a ring. In fact, as Sing(X) is con-
tained in a proper analytic subset of X, it follows by continuity that the ring of
holomorphic functions on X embeds in the ring of weakly holomorphic functions on
X (an analytic function on X is already determined by its values on X \ Sing(X)).
′
(2) The germs of weakly holomorphic functions form a ring which we denote by OX,x .
′
As above one sees that OX,x ⊂ OX,x .
Examples 4.4.14. (1) Consider the coordinate axes, defined by xy = 0. Look at the
function f which is identically 1 on the the x–axis, except at the origin, and identically
0 on the y–axis except at the origin. Then f cannot be extended to the origin (not even
x
as a continuous function), but is bounded. Note that we can write f (x,y) = x+y .
(2) Consider the analytic subset X = {y − x = 0} ⊂ C . The function f = xy is
2 3 2
holomorphic, whenever
√ x 6= 0. So f is a holomorphic function on X \ {(0,0)}. We have
the estimate yx ≤ | x|, hence f defines a germ of a weakly holomorphic function on
(X,x). In this case, f extends to a continuous function on X, by f (0,0) = 0. However f
is not holomorphic.
Theorem 4.4.15.
(1) Let (X,x) be a germ of an analytic space. Then there is a canonical isomorphism
′
OX,x ∼
=O eX,x . To put it in another way, f is a weakly holomorphic function germ
if and only if f is in the integral closure of OX,x .
(2) A germ of an analytic space (X,x) is normal if and only if every weakly holomorphic
function germ on (X,x) can be extended to a holomorphic function on (X,x). To
put it in another way, a space (X,x) is normal, if and only if the First Riemann
Extension Theorem holds for (X,x).
Proof. The second statement follows immediately from the first one. The proof of the
first statement is quite long, and will be divided into several steps. Suppose the germ
(X,x) is in (C n , 0).
eX,x ⊂ O′ . Take f ∈ O
Step 1. First we prove O eX,x . Then f satisfies an equation
X,x
(4.6) F := T s + a1 T s−1 + . . . + as = 0.
Step 3. We now reduce to the case that (X,x) is an irreducible hypersurface. By the Local
Parametrization Theorem we have a surjective map π from (X,x) onto a hypersurface
(X ′ ,x) which has the following properties:
• OX ′ ,x ⊂ OX,x .
• Q(OX,x ) = Q(OX ′ ,x ).
• There exists a representative π : X −→ X ′ and a proper analytic subset D of X ′
such that π : X \ π −1 (D) −→ X ′ \ D is biholomorphic.
• Sing(X ′ ) ⊂ D.
Let f be a weakly holomorphic function on X. Then f gives a bounded holomorphic
function on X ′ \ D. This is a smooth space, so by Riemann’s Extension Theorem 3.1.15
f can extend to a bounded holomorphic function on X ′ \ Sing(X ′ ). We obtain a weakly
holomorphic function on X ′ . So if the theorem is proved for a hypersurface we get f ∈
Q(OX ′ ,x′ ) and an integral equation for f with coefficients in OX ′ ,x . As OX ′ ,x ⊂ OX,x
this shows that f is in OeX,x , since f ∈ Q(OX ′ ,x ) = Q(OX,x ).
g
∆, for a holomorphic function g on X.7 So we have to show that ∆ · f has a holomorphic
extension to X. Consider the projection
π : X −→ U ⊂ C n−1 .
We choose U so small, so that there exists a constant C > 0 such that the absolute value
of the xn –th coordinate of each point is smaller than C. This can be done by Continuity
of Roots, see 3.4.11. Furthermore, we may assume that |f (a)| < C for all a ∈ X \Sing(X),
because by assumption f is locally bounded. Take a point p ∈ U \ D. Locally around p,
there exists an open neighborhood V of p, and s holomorphic functions
α1 , . . . ,αs : V −→ C ,
such that π −1 (V ) is the disjoint union of s complex submanifolds, given by the equations
xn = αi (x1 , . . . ,xn−1 ), see 3.3.23. U was chosen to be so small that the αi are bounded.
Let q ∈ V , and consider the Lagrange interpolation polynomial in xn
s
X Y xn − αj (q)
f (q,αi (q)) .
i=1
αi (q) − αj (q)
j6=i
Thus we see that ∆·f has as coefficients holomorphic functions on V . As one easily checks,
these functions are bounded by a number A(C), which only depends on C. Doing this for
all p ∈ U \ D, we see that ∆ · f is a polynomial in xn whose coefficients are holomorphic
functions on U \ D which are bounded. By the Riemann Extension Theorem for C n−1 ,
see 3.1.15, it follows that we can extend these to holomorphic functions on U . Hence ∆ · f
can be extended to a holomorphic function on all of X.
Step 5. We keep the notations of Step 4, and we finally show that any weakly holomorphic
function is integral over OX,x . For p ∈ U \ D and V an open neighborhood of p, look at
the holomorphic functions α1 , . . . ,αs : V −→ C , as above. The function q 7→ f (q,αi (q))
defines a holomorphic function on V . Moreover on π −1 (V ) the holomorphic function
f − f (x1 , . . . ,xn−1 ,α1 ) · · · f − f (x1 , . . . ,xn−1 ,αs )
is the zero function. Let bi be the i–th symmetric polynomial in the f (x1 , . . . ,xn−1 ,αj ).
Then the bi define holomorphic functions on V , and we have an equation
(4.7) f s + b1 f s−1 + . . . + bs = 0,
7 In view of the fact that we try to prove that f is in the integral closure of OX,x , and the fact that any
element in the integral closure has as universal denominator ∆, see 1.5.19, this is the obvious thing
to do.
4.4 Normalization 167
(4.8) g s + b1 ∆g s−1 + . . . + bs ∆s = 0,
4.4.17. Let ϕ : (X,x) −→ (Y,y) be a finite map of germs of analytic spaces. Prove that the
image of ϕ is a germ of an analytic space.
(Hint: Let I = Ker(ϕ∗ ). Show that the image of ϕ consists of all points a ∈ Y with f (a) = 0 for
all f ∈ I.)
4.4.18. The normalization of a local domain is not necessarily local. The local ring R, obtained
from C [x,y]/(y 2 − x3 − x2 ) by localizing in the maximal ideal (x,y) is an integral domain. Show
that the normalization of R is not local.
e x) −→ (X,x) be a normalization.
4.4.19. Let (X,x) be a germ of an analytic space, and n : (X,e
Let (Y,y) be a normal multi-germ, and ϕ : (Y,y) −→ (X,x) be a map of multi-germs. Show that
e x) making the following
e : (Y,y) −→ (X,e
there exists a uniquely determined map of multi-germs ϕ
diagram commutative.
e
ϕ
(Y,y) e x)
/ (X,e
FF
FF
FF n
ϕ FF
F#
(X,x)
(Hint: Prove a corresponding statement for rings.)
4.4.20. Let (X,x) be a germ of an analytic space, and Sing(X,x) be the singular locus of (X,x).
Let I ⊂ OX,x be the ideal of germs of analytic functions vanishing on the singular locus. Show
that (X,x) is normal if and only if HomOX,x (I,I) = OX,x .
(Hint: Use the fact that regular analytic rings
√ are normal. Follow the proof of 1.5.13, but now
use the Nullstellensatz 3.4.4 to prove I ⊂ J (notations of 1.5.13).)
168 4 Further Development of Analytic Geometry
“ ”
∂f ∂f
4.4.21. Let f ∈ C {x1 , . . . ,xn }. Prove that f k ∈ ∂x 1
, . . . , ∂x n
for some k ≥ 0.
` ∂f ∂f
´ ∂f ∂f
(Hint: Let (X, 0) = V (f, ∂x 1
, . . . , ∂xn
), 0 and (Y, 0) = (V (
∂x1
, . . . , ∂x n
), 0). Show that (X, 0) =
(Y, 0) in the following way. Suppose (X, 0) $ (Y, 0). Find a map n : (C , 0) −→ (Y, 0) with
n((C , 0)) 6⊂ (X, 0) by using the normalization of a curve (C, 0) ⊂ (Y, 0) which is not contained
d
in (X, 0). Compute dt (f ◦ n).)
This chapter is devoted to the study of plane curve singularities. First of all, from the
existence of the normalization, and the fact that one-dimensional normal singularities
are smooth, which are results proved in the previous chapter, it follows that one can
parameterize curve singularities. Thus, if R is the local ring of an irreducible plane
curve singularity, we have an injection R ⊂ C {t}. From the Finiteness of Normalization
Theorem 1.5.19 it follows without much difficulty that the codimension of R in C {t}
(as C –vector space) is finite. This codimension is called the δ–invariant. Here we prove
the converse of this statement, that is, given a subring R of C {t} of finite codimension,
generated by two elements, say x(t) and y(t), then R the local ring of an irreducible plane
curve singularity, say given by f (x,y) = 0. In order to obtain information on the order of
f , we introduce the intersection multiplicity of two (irreducible) plane curve singularities
(C, 0) and (D, 0). This number can be calculated as follows. Consider the normalization
C {t} of OC,0 , and let (D, 0) be given by g = 0. Via the inclusion OC,0 ⊂ C {t}, we
can consider at g as an element of C {t}. Then the intersection multiplicity is given
by the vanishing order of g. From this it easily follows that if R = C {x(t),y(t)}, and
f ∈ C {x,y} is irreducible with f (x(t),y(t)) = 0, then the order of f in y is equal to
ordt (x(t)), and similar for x. As an application we prove that if we have a convergent
power series f ∈ C {x,y}, which is irreducible when considered as a formal power series,
then it is already irreducible as a convergent power series. This in fact holds for power
series in any number of variables, and is an easy consequence of the Artin Approximation
Theorem, which will be proved in Chapter 8. Next we discuss Newton’s method. Here
we start with an f ∈ C {x,y}, and try to find x(t), y(t) with f (x(t),y(t)) = 0. We give
two proofs of the convergence. The first one is a consequence of Corollary 3.3.18, and is,
to our knowledge, new. The second one is standard. Finally, we discuss the irreducibility
of f in terms of its Newton polygon. It is shown that for an irreducible f the Newton
polygon has only one face. Our proof is algorithmic, that is, if the Newton polygon has
more than one face, we will give an algorithm to find a factorization of f .
In Section 5.2 we will study several invariants of an irreducible plane curve singularity
R. As above, we may assume that R ⊂ C {t}, which is the normalization. We already
mentioned the δ–invariant. Moreover, we introduce the semigroup Γ(R) ⊂ N, which
consists of the vanishing orders of the elements in R. Furthermore, the conductor c(R)
is introduced. It is the smallest natural number α with α + N ⊂ Γ(R). Our first aim is
to show the Theorem of Gorenstein, which says that c(R) = 2 · δ(R). The proof uses
duality theory for fractional ideals. Fractional ideals are R–modules in the quotient field
Q(R), which have a universal denominator. Given a fractional ideal I, we can look at
its dual I ∗ := HomR (I,R). For example, the normalization R e and the conductor ideal
e
AnnR (R/R) are fractional ideals, which are dual to each other. We have a canonical map
I −→ I ∗∗ , which one easily sees to be injective. One may ask oneself whether this map
is an isomorphism for all fractional ideals. In general, this turns out not to be the case.
Those curve singularities for which it is true are called Gorenstein, and we will prove
that irreducible plane curve singularities are indeed Gorenstein. We then discuss how
to compute the minimal generators of the semigroup Γ(R) from the parametrization. In
170 5 Plane Curve Singularities
general, this would be much too P difficult, so we first suppose that the parametrization
is of type x(t) = tn , y(t) = i
i≥m ai t with m ≥ n. After a coordinate change in t,
and interchanging x and y we may suppose that our parametrization is of this type.
Characteristic exponents k0 , . . . ,kg are introduced, in terms of n, and the i ∈ N with
ai 6= 0. A priori these are only invariants of the parametrization, but we will show
that they determine, and are determined by, the minimal generators β0 , . . . ,βg of the
semigroup Γ(R). We then discuss the Inversion Theorem, which is of importance in the
next section. This gives a method for computing the characteristic exponents in case
n > m.
In Section 5.3 we consider resolutions of irreducible plane curve singularities. First
of all, the blowing-up π : Bl0 C 2 −→ C 2 of C 2 in a point, say 0, is introduced. The space
Bl0 C 2 is a complex manifold of dimension one, in which the point 0 has been replaced by
all directions through 0. The set π −1 (0) is a projective line, called the exceptional curve
or exceptional divisor. If one now has a plane curve singularity, which has representative
C in some small open neighborhood U of 0, we can look at the strict transform C (1) :=
π −1 (C \ {0}), and iterate this process. That is, we blow up in the singular point of C (1) , if
present. The Theorem of Resolutions of Plane Curve Singularities says that this process
stops after finitely many steps. The main ingredient of the proof is the existence of a
parametrization of the irreducible plane curve singularity, as studied in the first section.
After some more blowing-ups, one can arrange that the strict transform intersects just
one exceptional curve, and intersects this curve transversally. This is then called the
standard resolution. Having such a resolution process, one can introduce two invariants.
First of all, one has the multiplicity sequence. If the standard resolution is obtained
after k blowing-ups, one has a sequence of numbers (m0 , . . . ,mk−1 ), where m0 is the
multiplicity of C, and mi is the multiplicity of the strict transform C (i) after i blowing-
ups. Furthermore, the resolution graph is introduced. It is a weighted graph with one
vertex for each exceptional divisor. The weight of a vertex is i if the curve is created
under the i–th blowing-up. Furthermore, for the strict transform we add a star ∗. Two
vertices are connected by an edge, if the corresponding divisors intersect. Furthermore,
the star is connected to the vertex corresponding to the unique exceptional divisor which
it intersects. The main result of the section is that the multiplicity sequence, the resolution
graph, the characteristic exponents, and the semigroup determine each other. In the
proof, essential use of the Inversion Theorem is made.
In Section 5.4 we study reducible plane curve singularities. Again we can study
a standard resolution. Its existence easily follows from the irreducible case. As a new
invariant the contact number between two irreducible components, also called branches
of the plane curve singularity, is introduced. The contact number of two irreducible
plane curve singularities is the number of blowing-ups one needs to separate the strict
transforms. It turns out that this number is finite. As in the irreducible case, we introduce
a resolution graph, but of course we need a star ∗ for each branch. We then show that
the resolution graph determines the resolution graphs of the irreducible components and
the contact numbers between the branches, and conversely. Finally, as an application of
the standard resolution we give two results. First of all, given irreducible plane curve
singularities, we show how to compute the intersection number from the multiplicity
sequences and the contact numbers. Secondly, for irreducible plane curve singularities,
we will prove a theorem of Max Noether, which gives a formula for the δ–invariant of an
irreducible plane curve singularity in terms of its multiplicity sequence. This result will
be generalized to the reducible case in the exercises.
5.1 Puiseux Expansion 171
From the results of the previous chapter it is easy to see that one can parameterize
irreducible (plane) curve singularities:
Theorem 5.1.1 (Puiseux Expansion). Let 0 6= f ∈ C {x,y} be irreducible. Then there
exist x(t), y(t) ∈ C {t} such that
(1) f x(t),y(t) = 0,
(2) dimC C {t}/C {x(t),y(t)} < ∞.
Proof. Let R be the normalization of C {x,y}/(f ). Because of Corollary 4.4.10 R is iso-
morphic to C {t}. Therefore, we have an injective map
C {x,y}/(f ) −→ C {t},
and C {t} is a finitely generated C {x,y}/(f )–module via this map. Let x(t) respectively
y(t) be the images of x mod (f ) respectively y mod (f ) in C {t}. Then obviously (1)
holds.
To prove (2) we may assume that f is a Weierstraß polynomial in y. Let ∆ be
its discriminant. As f is irreducible, it does not have multiple factors. In particular
∆ ∈ / (f ). It follows from the Theorem of Finiteness of Normalization, see 1.5.19, that
∆C {t} ⊂ C {x(t),y(t)}. We can view ∆ as an element of C {t}. Let k be the vanishing
order of ∆, that is, ∆ = tk u for some unit u. Then it follows that tk C {t} ⊂ C {x(t),y(t)},
so that {1, . . . ,tk−1 } generate C {t}/C {x(t),y(t)} as C –vector space.
Example 5.1.2. For the A2 –singularity y 2 −x3 = 0 we can take x(t) = t2 , and y(t) = t3 .
We have the following converse to 5.1.1.
Theorem 5.1.3. Let two power series x(t), y(t) ∈ tC {t} be given. Define R =
C {x(t),y(t)} and suppose that dimC C {t}/R < ∞. Then there exists an irreducible
f ∈ C {x,y} with R ∼
= C {x,y}/(f ). Moreover, R ⊂ C {t} is the normalization of R.
Proof. Consider the map ϕ : C {x,y} −→ C{t} defined by ϕ(x) = x(t), ϕ(y) = y(t). Then
C {x,y}/ Ker(ϕ) ∼ = R. From dimC C {t}/R < ∞ it follows that R ⊂ C {t} is a finite ring
extension, and as C {t} has dimension one, it follows that R has dimension one, see 4.1.4.
Because R ⊂ C {t}, it is an integral domain. Thus R is the local ring of an irreducible
plane curve singularity. Therefore Ker(ϕ) is a principal ideal by the Characterization of
Hypersurfaces 4.1.12. So Ker(ϕ) = (f ) for some f ∈ C {x,y}. As R is an integral domain,
we get that Ker(ϕ) is a prime ideal, and it follows that f is irreducible.
It remains to prove that C {t} is the normalization of R. As dimC C {t}/R < ∞,
it follows that I := AnnR (C {t}/R) 6= 0 (Exercise 5.1.30). Now I is an ideal in R =
C {x(t),y(t)}, but from the definition it follows immediately that I is also an ideal in
C {t}. As C {t} is a principal ideal domain it follows that I = tc ·C {t} for some c > 0. This
implies that tc C {t} ⊂ C {x(t),y(t)}. Let ψ ∈ tC {t}, then for all k ≥ c we have ψ k ∈ R.
k+1
In particular ψ = ψψk is in the quotient field of C {x(t),y(t)}. As certainly the constants
are in the quotient field of C {x(t),y(t)} too, it follows that the quotient fields of C {t}
and C {x(t),y(t)} are equal. As C {t} is a finitely generated C {x(t),y(t)}–module, every
element in C {t} is integral over C {x(t),y(t)}. Therefore C {t} is in the normalization
of C {x(t),y(t)}. As C {t} is normal itself, it follows that C {t} is the normalization of
C {x(t),y(t)}.
172 5 Plane Curve Singularities
In this theorem we showed that we can find such an f , but we still have no in-
formation on its order. In order to answer this question, we introduce the intersection
multiplicity.
Definition 5.1.4. Let (X, 0) and (Y, 0) be plane curve singularities, not necessarily
reduced, not having a component in common. Let (X, 0) be defined by f = 0, and let
(Y, 0) be defined by g = 0. The intersection multiplicity or intersection number of (X, 0)
and (Y, 0) at 0 is defined by
(X, 0) · (Y, 0) := dimC C {x,y}/(f,g) .
In case (X, 0) and (Y, 0) are reduced, this intersection multiplicity has a geometric
interpretation as the number of intersection points in a small neighborhood of 0 after
a small perturbation of f and g. We are not able to prove this at the moment, but it
will be done in Chapter 6. The following important lemma tells us how to compute the
intersection number with the help of the normalization in case one of the curves, say
(X, 0), is irreducible.
Lemma 5.1.5. Suppose that (X, 0) is an irreducible plane curve singularity. Let OX,0 ⊂
∼
e 0 = C {t} be its normalization. Let (Y, 0) be a plane curve singularity, defined by
OX,e
g = 0. We can consider the class of g in OX,0 and, therefore, we can view g as an
element of C {t}. Then
(X, 0) · (Y, 0) = ordt (g).
Proof. Consider the diagram
·g
OX,e
e 0 / OX,e
e 0
O O
? ·g ?
OX,0 / OX,0 .
·g ·g
e 0 −→ OX,e
Then ordt (g) = dimC Coker(OX,e e 0 ) , and (X, 0)·(Y, 0) = dimC Coker(OX,0 −→
OX,0 ) . The problem is now reduced to a problem in linear algebra, and its solution is
left as Exercise 5.1.31.
Corollary 5.1.6. Under the conditions of Theorem 5.1.3, suppose that ord(x(t)) = n,
and that ord(y(t)) = m. Let f ∈ C {x,y} be irreducible with C {x(t),y(t)} ∼
= C {x,y}/(f ).
Then
(1) f is regular in y of order n, and regular in x of order m.
(2) The multiplicity ord(f ) of f is equal to min{n,m}.
Proof. Let (X, 0) is the singularity defined by f = 0, and (L, 0) the germ of the line
defined by x = 0. The first statement follows because, by Lemma 5.1.5,
ordy (f ) = dimC C {x,y}/(f,x) = (X, 0) · (L, 0) = ordt (x(t)).
Similarly for x.
To prove the second statement we may without loss of generality assume that n ≤ m,
and that f is regular in y of order ord(f ). This is always possible after a linear coordinate
change. It follows from the first part that
ord(f ) = ordy (f ) = (X, 0) · (L, 0) = ordt (x(t)) = n = min{n,m},
as claimed.
5.1 Puiseux Expansion 173
Theorem 5.1.7. Let ε be a primitive n–th root of unity. Consider the ring extension
C {x,y} ⊂ C {t,y}, where x = tn .
(1) Let f = y n + an−1 (x)y n−1 + · · · + a0 (x) be an irreducible Weierstraß polynomial.
Then there exists a y(t) ∈ tC {t} such that
n
Y
f= y − y(εi t) .
i=1
Qn
One also writes this as f = i=1 y − y(εi x1/n ) .
(2) Let y(t) ∈ tC {t}
such that dimC C {t}/C {tn ,y(t)} < ∞ be given. Then f =
Qn i
i=1 y − y(ε t) is an irreducible Weierstraß polynomial in C {x,y}.
the support of f . Sometimes we will identify the pair (i,j) with the monomial xi y j . Let
S
(1) Γ+ (f ) = (i,j) + R2+ .
(i,j)∈ supp(f )
(4) The face ∆ induces a grading on C {x,y} in the following way: if ∆ has slope
w1
−µ = − w 2
, in lowest terms, and w1 , w2 > 0. Then we set w-deg(x) = w1 , and
w-deg(y) = w2 . Note that f∆ is quasi-homogeneous with respect to these weights.
Two faces
Γ+ (f ) Γ(f )
5.1 Puiseux Expansion 175
Let ∆ be the face defined by (1,1) and (5,0). Then f∆ = x5 + xy. The weights corre-
sponding to the grading defined by ∆ are w-deg(x) = 1 and w-deg(y) = 4.
Obviously, different f can have the same Newton polygon. For example, x5 + x3 y 3 +
y 3 + xy and x5 + y 3 + xy have the same Newton polygon. The element (3,3) does not
enlarge the Newton polygon.
Example 5.1.11. f = x3 y 3 + x2 y 4 .
One face
Γ(f )
Definition 5.1.12. f is called convenient if there exist positive integers n,m such that
(n,0),(0,m) ∈ supp(f ), that is, the noncompact parts of Γ(f ) are on the coordinate axis.
Remark 5.1.13. f in Example 5.1.10 is convenient, f in Example 5.1.11 is not conve-
nient. For any f ∈ C {x,y}, we can always write f = xa · y b · f1 , where f1 is convenient.
Therefore, from the point of view of the zeros of f the convenient case is the important
one.
We now come to Newton’s method to find a parametrization of a branch of a plane
curve singularity.
Theorem 5.1.14. Let f ∈ C {x,y}, and suppose that f is convenient. We have the fol-
lowing algorithm for finding a parametrization of an irreducible component (also called
branch) of f = 0.
w1
(1) Let ∆ be the face of Γ(f ) with smallest slope − w2
, which is in lowest terms, and with
w1 , w2 > 0. Then f∆ is quasi-homogeneous of say, degree d0 . Consider a zero of f∆ (1,y)
which we call a0 . Take new variables x1 and y1 and substitute
x = xw
1 ,
1
y = xw
1 (a0 + y1 ).
2
Then
1
f (1) (x1 ,y1 ) := f (xw w2
1 ,x1 (a0 + y1 )) ∈ C {x1 ,y1 }.
1
xd10
(2) Inductively find a parametrization x1 = ta , y1 = ϕ(t) of a branch of f (1) (x1 ,y1 ) in
case f (1) 6= 0. Then
x = taw1 , y = a0 taw2 + taw2 ϕ(t),
gives a parametrization of a branch of f = 0.
Before giving the proof we give an example.
176 5 Plane Curve Singularities
Γ(f )
x = x21 , y = x31 (1 + y1 ).
Of course, we can go on as in the first step, but we can also use some common sense.
Indeed,
f (1) (x1 ,y1 ) = (2y1 + y12 )2 − x21 ,
so that x1 = ±(2y1 + y12 ). We, therefore, see two solutions for y1 :
√ √
y1 = −1 + 1 − x1 ; y1 = −1 + 1 + x1 .
(1)
so that indeed the definition of f makes sense. Obviously, if we can find a parametriza-
tion of a branch of f (1) = 0, we get a parametrization of a branch of f = 0.
Step 2. We now look at f (1) . Its definition was
X
(5.1) f (1) (x1 ,y1 ) = xd−d
1
0
fd (1,a0 + y1 ).
d≥d0
We claim that f (1) is again regular in y1 . In view of formula (5.1) we have to show that
fd0 (1,a0 + y1 ) is regular in y1 . We can factorize fd0 (1,y): (Recall that f is regular of order
b.)
fd0 (1,y) = c · (y − a0 ) · · · (y − ab−1 ),
for some constants c,a0 , . . . ,ab−1 ∈ C . From this we see that
fd0 (1,a0 + y1 ) = c · y1 (y1 + a0 − a1 ) · · · (y1 + a0 − ab−1 ),
so fd0 (1,a0 + y1 ) is regular in y1 . Thus f (1) is also regular in y1 , of order at most b. There
are two cases to consider.
(1) f (1) is also regular of order b. Then it follows that a0 = a1 = · · · = ab−1 , so that
fd0 (1,y) = c · (y − a0 )b . As fd0 is quasi-homogeneous and convenient it follows that
fd0 = c · (y − a0 xm )b . So w1 = 1, and m · b = d0 . Furthermore w2 = m, and
x1 = x, y = xm m
1 (a0 + y1 ) = x (a0 + y1 ).
is a formal solution of f (x,y) = 0. Indeed, f (x,y) = xd0 f (1) (x,y1 ) = · · · = xkd0 f (k) (x,yk ).
Plugging in y = y(x) we get f (x,y(x)) = xkd0 f (k) (x,yk (x)) for some power series yk (x).
Thus f (x,y(x)) is divisible by all powers of x, and hence is zero.
178 5 Plane Curve Singularities
The next thing we study in this section is the irreducibility of a power series f , in
terms of the Newton polygon. Let us look at the simplest case. Suppose h = f · g, f
and g convenient, and suppose that the Newton polygon of both f and g has just one
face. Let the slope of the face of Γ(f ) be given by the endpoints (p1 ,0) and (0,q1 ), and
similarly the slope of the face of Γ(g) by the endpoints (p2 ,0) and (0,q2 ). We assume
that the slope of the face of Γ(f ) is less than or equal to the slope of the face of Γ(g).
Then h = f · g is convenient. The Newton polygon of h has two faces, resp. one face if
the slopes are equal. The first face has endpoints (0,q1 + q2 ), and (p1 ,q2 ), the second has
endpoints (p1 ,q2 ) and (p1 + p2 ,0).
q1 q1 + q2
Γ(f )
q2 Γ(g) q2
p1 p2 p1 p1 + p2
This is easy to prove. One has to show that each monomial xα y β of h lies right to both
faces. The condition that a monomial xα y β lies right to a line whose intersections with
the axes are (p,0) and (0,q) say, is simply αq + βp ≥ pq. Let xα1 y β1 be a monomial of f
and xα2 y β2 be a monomial of g. Then, by assumption on Γ(f ) and Γ(g) we have
q1 q2
(1) p1 ≥ p2 ;
(2) β1 ≥ − pq11 α1 + q1 ;
(3) β2 ≥ − pq22 α2 + q2 .
This implies that β1 +β2 ≥ − pq11 (α1 +α2 )+q1 +q2 and β1 +β2 ≥ − pq22 (α1 +α2 )+ pq22 (p1 +p2 ).
The first inequality follows by adding (2) and (3) and using (1). The second inequality
follows by adding pp21 q2 to (3) and using the inequality α1 q2 + β1 p2 ≥ p1 q2 . Therefore,
xα1 +α2 y β1 +β2 lies right to both faces. Moreover if xα1 +α2 y β1 +β2 lies on both faces, it
follows that β1 = 0, from which α1 = p1 , β2 = q2 and α2 = 0 easily follow. So the
monomial xp1 y q2 indeed occurs in the product f · g.
More generally, we need the following lemma. The proof is just more technical, and
therefore left to the reader.
Lemma 5.1.16. Let f1 ,f2 ∈ C {x,y} be convenient. Assume Γ(f2 ) has one face ∆1
defined by (p2 ,0) and (0,q1 − q2 ). Denote the faces of f1 by ∆2 , . . . ,∆r and let ∆i be
defined by (pi ,q i ) and (pi+1 ,q i+1 ) with p2 = 0, q 2 = q2 and q r+1 = 0. Assume furthermore
that the slope of ∆1 is smaller than the slopes of the ∆i , i ≥ 2.
Then the Newton polygon Γ(f ) of f = f1 · f2 has faces ∆1 , . . . ,∆r , which are given
by:
• ∆1 defined by (0,q1 ) and (p2 ,q2 );
• ∆i defined by (pi + p2 ,q i ) and (pi+1 + p2 ,q i+1 ) for i = 2, . . . ,r.
5.1 Puiseux Expansion 179
q1 − q2 q1
Γ(f1 · f2 )
∆1
∆1
Γ(f2 )
q2
p2
q2 ∆2
∆2
Γ(f1 )
p2
We now come to a converse of this lemma. It follows without much difficulty from
the explicit construction of the Puiseux expansion, that the converse must hold. We want
an explicit construction of the factorization, however.
Theorem 5.1.17. Let f ∈ C {x,y} be convenient, ∆1 , . . . ,∆r be the faces of Γ(f ) and
di the slope of ∆i . Then f = f1 · · · fr , where fi is convenient such that Γ(fi ) has only
one face of slope di , i = 1, . . . ,r.
In the proof of this theorem, we use the following lemma.
Lemma 5.1.18. Consider the polynomial ring C [x,y], with grading defined by w-deg(x) =
w1 > 0 and w-deg(y) = w2 > 0. Let g = xb + . . . be quasi-homogeneous of weighted degree
bw1 . Let h ∈ C [x,y] be quasi-homogeneous with weighted degree at least bw1 + aw2 for
some a ∈ N. Then h ∈ (y a ,g).
Proof. We apply Division with Remainder, see Theorem 2.1.5, and get
h = q · g + r,
(0,q1 )
∆1
(p2 ,q2 )
We define
1
g := · f∆1 , h := f − y q2 g.
y q2
We obviously have f = y q2 g + h.
As in Definition 5.1.9 we introduce the weighted degree on the monomials of C {x,y}
corresponding to ∆1 , that is, we put w-deg(x) = w1 = (q1 − q2 )/c, and w-deg(y) = w2 =
p2 /c, where c = gcd(q1 − q2 ,p2 ). With these weights, g is weighted homogeneous of degree
(q1 − q2 )p2 /c. We consider the elements of C {x,y} to be ordered by the corresponding
weighted order function, w-ord(f ) = min{w-deg(xi y j ) : (i,j) ∈ supp(f )}.
(n) (n)
We will now inductively construct f1 , f2 and h(n) such that
(n) (n)
• f = f1 · f2 + h(n) ,
(n)
• f1 = y q2 + terms of higher w-order not divisible by y q2 ,
(n)
• f2 = g + terms of higher w-order,
• p2 q1 /c < w-ord(h) < w-ord(h(2) ) < · · · < w-ord(h(n) ).
(1) (1)
The initialization of course is f1 = y q2 , f2 = g and h(1) = h. Note that the weighted
order of h is indeed bigger than p2 q1 /c, as w-ord(h) > w-ord(y q2 g) = p2 q1 /c. This is
because all monomials of h lie right to the line through the face ∆1 . Now let k be the
quasi-homogeneous part of h(n) of w–degree equal to w-ord(h(n) ). Because of w-deg(k) >
w-ord(y q2 g) we can find by Lemma 5.1.18 weighted homogeneous ξ,η such that k =
ξy q2 + ηg. Moreover one can arrange that no monomial of η is divisible by y q2 . Now set
(n+1) (n)
• f1 = f1 + η,
(n+1) (n)
• f2 = f2 + ξ,
(n+1) (n+1)
• h(n+1) = f − f1 · f2 .
(n) (n)
It is not difficult to check, using w-ord(f1 − y q2 ) > p2 q2 /c and w-ord(f2 − g) >
p2 (q1 − q2 )/c, that
w-ord(h(n+1) ) > w-ord(h(n) ).
Continuing this way, we get formal power series f1 and f2 with f = f1 ·f2 . In particular, by
construction, f1 is a Weierstraß polynomial with respect to y. Therefore, the convergence
of f1 and f2 follows from Corollary 5.1.8.
The Newton polygon Γ(f2 ) = Γ(g) looks like,
5.1 Puiseux Expansion 181
q1 − q2
p2
∆1
∆2
(1) (1)
We have w1 = 5, w2 = 2, g = f2 = y12 · f∆1 = y 5 + x2 . So with f1 = y 2 and
(1)
h = x5 + x3 y 5 we get
(1) (1)
f = f1 f2 + h(1) .
Now h(1) is weighted homogeneous of degree 25. The monomials of h(1) all are divisible
by x3 . In fact h(1) = (x2 + y 5 ) · x3 = g · x3 , so that
f = (y 2 + x3 )(y 5 + x2 ).
f = (y − x − 12 x2 )(y + x + 12 x2 ) + 14 x4 .
f = (y − x − 12 x2 + 18 x3 ) · (y + x + 21 x2 − 81 x3 ) − 18 x5 + 1 6
64 x ,
f = (y 2 − x3 − x2 y + 12 x4 )(y 2 − x3 + x2 y + 12 x4 ) − 14 x8 .
so that the conditions of Theorem 5.1.20 are satisfied, and we get a second proof that f
is reducible.
Exercises
5.1.23. Use the algorithm for the Puiseux expansion
√ to find a solution of y 2 + 2y − x = 0, that
is, find the power series expansion of y = −1 + 1 + x.
5.1.25. Find the Puiseux expansions of the following (irreducible) power series.
(1) f = y 4 − 2x3 y 2 − 4x5 y + x6 − x7 .
(2) f = x33 − 3x22 y 2 + 6x21 y − x20 + 3x11 y 4 + 2x10 y 3 − y 6 .
5.1.27. Let C ((x)) be the quotient field of C {x} and C ((x)) its algebraic closure. Prove that
C ((x)) = ∪ C ((x1/n )).
n∈N
5.2 Invariants 183
5.1.28. This is an exercise for the reader who knows about resultants. Let (X, 0) and (Y, 0)
be plane curve singularities, which are given by Weierstraß polynomials f and g in y. Thus in
particular f, g ∈ C {x}[y]. Consider the resultant Rf,g ∈ C {x}. Prove that
(Hint: Reduce to the case that f and g are irreducible. Let x = tn , y = ϕ(t) be a Puiseux
expansion of (X, 0), and factorize f in C {t}[y]. Compute the resultant in C {t}.)
5.1.29. This exercise uses the previous exercises. Let (X, 0) be a plane curve singularity given
by a Weierstraß polynomial f in y. Let ∆ ∈ C {x} be the discriminant of f . Prove
where (L, 0) is the line given by y = 0, and µ(f ) is the Milnor number.
∂f
(Hint: Consider (Y, 0) given by ∂x = 0. Compute the intersection number (X, 0) · (Y, 0) in two
ways. One way is by using the previous exercise, the other way is by using Puiseux expansion for
the branches of (Y, 0): if (ϕ1 (t),ϕ2 (t)) is a parametrization of a branch then look at the vanishing
d
order of t 7→ dt (f (ϕ1 (t),ϕ2 (t))), which differs by one from the intersection multiplicity.)
5.1.30. Let (R,m) be an analytic C –algebra and M a finitely generated R–module. Assume
that dimC M < ∞. Prove that AnnR (M ) 6= {0}.
(Hint: Choose x ∈ m, x 6= 0 and consider M ⊃ xM ⊃ . . ..)
5.2 Invariants
Lemma 5.2.2.
(1) δ(R) = # N \ Γ(R) .
184 5 Plane Curve Singularities
e
(2) AnnR (R/R) e
= tc(R) R.
Proof.
e
(1) Suppose that a1 , . . . ,aδ is a basis of R/R. Suppose that ord(ai ) = ord(aj ) = k,
c
so that ai = ci tk + . . . ,aj = cj tk + . . .. Then we may replace aj by aj − cji ai ,
which has higher vanishing order. Using this operation several times we can get
a basis a1 . . . . ,aδ with ord(ai ) 6= ord(aj ) for i 6= j. Similarly, we can assure that
ord(ai ) 6∈ Γ(R), then the ord(a1 ), . . . , ord(aδ ) are different elements of N \ Γ(R).
On the other hand, if we have different elements c1 < . . . < cγ of N \ Γ(R),
then b1 tc1 + . . . + bγ tcγ with bi ∈ C has vanishing order min{ci : bi 6= 0}. So
b1 tc1 + . . . + bγ tcγ ∈ R only when b1 = . . . = bγ = 0, so that δ(R) ≥ γ.
e
(2) In the proof of 5.1.3 we showed already that AnnR (R/R) is an ideal in R and Re
e ce
and, therefore AnnR (R/R) = t R for some c. We have to prove that c − 1 6∈ Γ(R).
But c − 1 ∈ Γ(R) and tc Re ⊂ R implies that tc−1 R
e ⊂ R and, consequently, tc−1 ∈
e
AnnR (R/R), e
which contradicts AnnR (R/R) e
= tc R.
is an isomorphism.
Remarks 5.2.6.
e of R is a fractional ideal,
(1) Any ideal in R is a fractional ideal. The normalization R
which, in general, is not an ideal.
5.2 Invariants 185
(2) For fractional ideals I,J different from (0) one has the equality
In the remaining part of this Chapter we shall use (J : I) for (J : I)K to simplify
the notation.
The map (J : I) −→ HomR (I,J) sends an element a ∈ (J : I) to the multiplication
with this element in K. On the other hand, a homomorphism α ∈ HomR (I,J) can
be extended to an R–linear map α from K to K by putting α( rs ) := 1s α(r) for an
element r ∈ I and s ∈ R. Then one shows that the element α(1) ∈ (J : I), giving
the inverse map. In particular, we have for (0) 6= J ⊂ I always I ∗ ⊂ J ∗ .
e ∗ = (R : R)
(3) (R) e R = {c ∈ R : cR
e ⊂ R} = AnnR (R/R)
e e
= tc(R) R.
(4) For finite dimensional vector spaces V over a field k, one knows that the canonical
map V −→ V ∗∗ is an isomorphism. The corresponding statement for rings is false,
in general, even if one takes only fractional ideals. An example is given in the
Exercises. The canonical map is injective, but need not be surjective.
Step 1. Let (0) 6= J ⊂ I be nonzero fractional ideals. Then we claim that dimC (I/J) ≥
dimC (J ∗ /I ∗ )2 . If J ⊂ L ⊂ I then dimC (I/J) = dimC (I/L)+dimC (L/J) and dimC (J ∗ /I ∗ ) =
dimC (J ∗ /L∗ ) + dimC (L∗ /I ∗ ). We can always find a vector space L with J $ L $ I, and
mL ⊂ J. Indeed, take any vector space L such that J has codimension one in L. By
Nakayama’s Lemma, it follows that mL ⊂ J. It follows as above that L is a fractional
ideal.
Therefore, we can use induction and it suffices to prove the claim for the case
dimC (I/J) = 1. We will construct an injective map
ϕ : J ∗ /I ∗ −→ m∗ /R,
2 Because of R being one–dimensional we have dimC (I/J) < ∞.
186 5 Plane Curve Singularities
which, using the assumption, proves the claim. Take an element a ∈ I with (J,a) =
I. Then m(I/J) $ (I/J) by Nakayama’s Lemma. This implies m(I/J) = 0 because
dimC (I/J) = 1. In particular a · m ⊂ J. We now construct the map ϕ. Let b ∈ J ∗ . Then
ba · m = b · am ⊂ b · J ⊂ R. Thus ba ∈ m∗ . Thus the map ϕ sends b to ba. This gives a
well-defined map, as b ∈ I ∗ sens a ∈ I to an element in R. We now prove ϕ is injective.
So suppose ba ∈ R, for some b ∈ J ∗ . But then it sends all elements of I to R. Thus b ∈ I ∗ .
Step 2. Let I be a nonzero fractional ideal. We may assume that I is an ideal in R. Take
an element x ∈ I. We claim that we have the following equalities:
(∗) dimC (I/(x)) = dimC R/ (x) : I ,
βν = min{a ∈ Γ(R); gcd (a,β0 , . . . ,βν−1 ) < gcd (β0 , . . . ,βν−1 )}, ν ≥ 2.
By this construction we obviously obtain only finitely many βν , say β0 , . . . ,βg′ . Let
G(R) := {β0 , . . . ,βg′ }.
Lemma 5.2.12.
The question is how to calculate generators of the semi-group Γ(R) from the parametriza-
tion. To answer this question, we introduce other invariants, the characteristic exponents.
188 5 Plane Curve Singularities
P
Definition 5.2.14. Let R = C {tn ,y(t)}, y(t) = ai ti and am 6= 0. We define
i≥m
k0 = min{n,m},
min{i|ai 6= 0, gcd (i,k0 ) < k0 } if n ≤ m,
k1 = min{i|ai 6= 0, gcd (i,k0 ) < k0 } + n − m if n > m and m | n, ,
n if n > m and m ∤ n,
(
min{i|ai 6= 0, gcd (i,k0 , . . . ,kν−1 ) < gcd (k0 , . . . ,kν−1 )} if n ≤ m,
kν =
min{i|ai 6= 0, gcd (i,k0 , . . . ,kν−1 ) < gcd (k0 , . . . ,kν−1 )} + n − m if n > m,
ν ≥ 2.
By this construction again, we obviously obtain only finitely many kν , say k0 , . . . ,kg .
Note that in all cases k0 < k1 < . . . < kg and gcd(k0 , . . . ,kg ) = 1 because dimC (C {t}/R) <
∞. The set E(R) := {k0 ,k1 , . . . ,kg } is called the set of characteristic exponents of R.
Examples 5.2.15.
It is left as Exercise 5.2.24 to show that the formulas in (2) and (3) are equivalent.
Corollary 5.2.17. Let ϕ : R −→ R′ be an isomorphism, then E(R) = E(R′ ).
Definition 5.2.18. Let E(R) = {k0 , . . . ,kg } be the characteristic exponents. Define
P (R) = {(m1 ,n1 ), . . . ,(mg ,ng )}, the set of Puiseux pairs as follows: let kk01 = m
n1
1
such
kν nν
that gcd(m1 ,n1 ) = 1. k0 = m1 ·...·mν such that gcd(mν ,nν ) = 1, ν = 2, . . . ,g.
To digest this definition a little bit, let us write the Puiseux expansion as
X
y= ak xk .
k∈Q
5.2 Invariants 189
If the curve is not smooth, there is a smallest k 1 with ak1 6= 0 and k 1 not an integer. We
n1
write k1 = m 1
in lowest terms, and then (m1 ,n1 ) is the first Puiseux pair of the curve. To
find the second Puiseux pair, we look for the smallest k 2 > k 1 which cannot be written
in the form ma1 . We write k 2 = m1n·m
2
2
with gcd(m2 ,n2 ) = 1 and call (m2 ,n2 ) the second
Puiseux pair. To find the third, we look for the smallest k 3 > k 2 which cannot be written
n3
in the form m1a·m2 . Write k 3 = m1 ·m 2 ·m3
with gcd(m3 ,n3 ) = 1. Then (m3 ,n3 ) the third
Puiseux pair, etc.
Q
n
Proof. From f = y − y(εi x1/n ) it follows that
i=1
Xn Y
∂f
= y − y(εj x1/n ) ,
∂y i=1 j6=i
and, hence,
∂f n n−1
Y
t ,y(t) = y(t) − y(εi t) .
∂y i=1
It thus suffices to calculate the vanishing order for the different y(t) − y(εi t), their sum
being the vanishing order of ∂f n
∂y (t ,y(t)). We shall do the calculation for n ≤ m and leave
the other cases to the reader.
In our situation k0 = n, and we may expand y(t) − y(εi t) as a power series:
X
y(t) − y(εi t) = aν (1 − εiν )tν .
ν≥m
n o
k0
Setting Nj = i; 1 ≤ i ≤ n − 1, gcd(k0 ,...,kj−1 )
| i, gcd(kk0 ,...,k
0
j)
∤i
and noting that
Sg
gcd(k0 , . . . ,kg ) = 1, we consider the following partition {1, . . . ,n − 1} = j=1 Nj . We
claim that ord y(t) − y(εi t) = kj for any i ∈ Nj .
If εiν 6= 1, then k0 = n ∤ iν, and hence gcd(k0 , . . . ,kj−1 ) ∤ ν, by assumption on i.
Therefore, if in addition aν 6= 0 then ν ≥ kj , by definition of kj , and thus the order of
y(t) − y(εi t) is at least kj . Conversely, we know that akj 6= 0, and thus it suffices to show
gcd(k ,...,k )
that εikj 6= 1, i. e. k0 = nmidikj . Writing q = gcd(k0 0 ,...,kj−1
j)
we get
kj k0 kj
(∗) gcd q, = gcd ,..., = 1.
gcd(k0 , . . . ,kj ) gcd(k0 , . . . ,kj ) gcd(k0 , . . . ,kj )
190 5 Plane Curve Singularities
qk0 k0
By assumption on i we know that gcd(k0 ,...,kj−1 ) = gcd(k0 ,...,kj ) ∤ i. And hence,
i gcd(k0 ,...,kj−1 )
q ∤ k0 , which is an integer, again by assumption on i. From equation ∗ we
i gcd(k0 ,...,kj−1 )kj
deduce that q ∤ k0 gcd(k0 ,...,kj ) , and finally, k0 ∤ ikj . This proves the claim.
n o
Writing Mj = i; 1 ≤ i ≤ n − 1, gcd(kk0 0,...,kj ) |i we easily see that #Mj =
gcd(k0 , . . . ,kj )−1 and Nj = Mj−1 \Mj . Hence, #Nj = gcd(k0 , . . . ,kj−1 )−gcd(k0 , . . . ,kj ),
which finishes the proof.
P
Proof of Theorem 5.2.16. Let R = C {tn ,y(t)},y(t) = ai ti ,am 6= 0. We define β0 , . . . ,βg
i≥m
by the formulas in 5.2.16(3). So we have to show that β0 , . . . ,βg generate Γ(R). This will
be done by induction on g. The case g = 0 is easy, as we then have a smooth curve.
g=1
In this case it follows from the definition of the ki that gcd(k0 ,k1 ) = 1. We have to
prove that β0 = k0 and β1 = k1 generate Γ(R).
We first claim that we can arrange that R = C {tk0 + . . . , tk1 + . . .}, gcd(k0 ,k1 ) = 1
and k0 < k1 . For this we consider the different cases.
Here k 2 = k2 − max{n − m,0}. This defines a curve with two characteristic exponents
k0 k1 n
K1 , K1 . Now let h ∈ C {x}[y], degy (h) < K1 . Applying the case g = 1 it follows that
β0 β1
ord h x(t), y(t) ∈ N+ N.
K1 K1
Hence
ord h tn , y(tK1 ) ∈ β0 N + β1 N,
∂h n
t , y(tK1 ) ak2 tk2 + . . .
∂y
∂h P
occur, and ∂y = iqi y i−1 . The difference in the vanishing order between
and
qi (tn )tK1 i
is k 2 − K1 which is a positive number. Similar for the higher order derivatives of h.
Therefore, the vanishing order of h(tn ,y(t)) is in β0 N + β1 N.
d > 1 We shall first construct a function with vanishing order βd . To define this
function, we define a new curve R by the following parametrization:
X
x = tn/Kd−1 ; y = ai ti/Kd−1 .
m≤i<kd
192 5 Plane Curve Singularities
Now consider
∂f n
f tn ,y(t) = f tn ,y(tKd−1 ) + t ,y(tKd−1 ) · (akd tkd + . . .) + R.
∂y
The first term is zero, the second term has vanishing order
X
d−1
Kj−1 Kj
X
d−1
Kj−1 Kj
kd + − kj + max{n − m,0} = kd + − kj = βd .
j=1
Kd−1 Kd−1 j=1
Kd−1 Kd−1
The fact that R has higher vanishing order is proved as in the case d = 1. Hence, we
have constructed an f with vanishing order βd .
Let h ∈ C {x}[y], degy (h) < Knd . By successive application of the Weierstraß division
Ps i n
Theorem we can write h = i=0 hi (x,y)f with hi ∈ C {x}[y], degy (hi ) < Kd−1 and
Kd−1
s< .
Kd
d−1
P
By induction we have ord hi tn ,y(t) ∈ βj N. To conclude the Theorem, again
j=0
we have to show that no canceling occurs. Therefore, suppose
i j
ord hi tn ,y(t) f tn ,y(t) = ord hj tn ,y(t) f tn ,y(t) .
Pd−1
Then (i − j)βd = ord hj tn ,y(t) − ord hi tn ,y(t) so that (i − j)βd ∈ j=0 βj N.
By Exercise 5.2.24, gcd(β0 , . . . ,βd−1 ) = Kd−1 | (i − j)βd . From the definition of βd it
Kd−1
then follows that Kd−1 | (i − j)kd , and, hence, | i − j.
Kd
Kd−1
But as i − j ≤ s < Kd , this implies i = j. This proves that {β0 , . . . ,βg } defined
by (3) in the Theorem generate Γ(R). It remains to prove that {β0 , . . . ,βg } is minimal,
which is left as Exercise 5.2.26.
Remark 5.2.21. Usually the k0 , . . . ,kg are introduced as invariants of the parametriza-
tion and not of R as follows:
k0 = n,
kν = min{i|ai 6= 0, gcd (i,k0 , . . . ,kν−1 ) < gcd (k0 , . . . ,kν−1 )}, for 1 ≤ ν ≤ g.
P
If, then, x = bν τ ν , y = τ m , is the parametrization obtained by τ m = y(τ ), and
ν≥n
k0′ = m, kν′ = min{i|bi 6= 0, gcd (i,k0′ , . . . ,kν−1
′
) < gcd (k0′ , . . . ,kν−1
′
)}, 1 ≤ ν ≤ g ′ , then
the so–called inversion Theorem gives a relation between {k0 , . . . ,kg′ ′ } and {k0 , . . . ,kg }
′
as follows:
5.3 Resolutions of Irreducible Plane Curve Singularities 193
α ∈ H ⇐⇒ c(R) − 1 − α ∈
/ H.
5.2.23. Let ϕ : R −→ R′ be an isomorphism, then Γ(R) = Γ(R′ ). /Hint: use the fact that ϕ
extends to an isomorphism ϕ : C {t} −→ C {t} of the normalizations of R and R′ , ϕ(t) = t· unit.)
5.2.24.
(1) Show that (2) ⇐⇒ (3) in 5.2.16.
(2) Show that for d = 1, . . . ,g we have gcd(k0 , . . . ,kd ) = gcd(β0 , . . . ,βd ).
P
5.2.25. Let x = tn ,y = i≥m ai ti , with am = 1. Suppose n > m and m | n. Then there exists a
` ´
polynomial q ∈ C [t] such that tn −q y(t) = tn−m+k1 + terms of higher order, k1 = min{i|ai 6= 0,
gcd(i,k0 ) = 1}. P i
Hint: let n = r · m and assume am = 1, then tn − y(t)r = ci t , and if ci 6= 0 for
i>n
`r´ m(r−1)
i < n − m + k1 = (r − 1)m + k1 , the exponent of 1 t ak1 tk1 in y(t)r ), then m | i. If
ci 6= 0 for i < n − m + k1 , subtract again a suitable power of y(t) and continue up to the order
n − m + k1 .
5.2.26. Prove that G(R) = {β0 , . . . ,βg } for β0 , . . . ,βg defined by (3) of Theorem 5.2.16.
Hint: we know that β0 , . . . ,βg generate Γ(R). Use (3) to prove that gcd(β0 , . . . ,βi ) <
gcd(β0 , . . . ,βi−1 ) for i = 1, . . . ,g, and gcd(β0 , . . . ,βg ) = 1.
Definition 5.3.1. We define P1 = P1C to be the set of lines in C 2 which pass through
the origin.
Similarly one can define PnC for n ≥ 2. Alternatively, but very similarly, one could
define P1 as
P1 = (C 2 \ {0,0})/ ∼
Here (x,y) ∼ (x′ ,y ′ ) if and only if there exists a λ ∈ C \ {0} with (x′ ,y ′ ) = λ(x,y).
The equivalence class of (x,y) 6= (0,0) is denoted by (x : y). An equivalence class is called
a point of P1 . The (x : y) are called homogeneous coordinates of P1 .
As any line through (0,0) has a nonempty intersection with the unit sphere in C 2 =
R4 we also could have defined P1 as
{(x,y) ∈ C 2 :| x |2 + | y |2 = 1}/ ∼= §3 / ∼ ,
where (x,y) ∼ (x′ ,y ′ ) if ∃ λ ∈ C with |λ| = 1 so that (x,y) = λ(x′ ,y ′ ). We give P1 the
quotient topology. As a quotient of a compact topological space, P1 is compact. Moreover,
it is left to the reader to show that P1 is Hausdorff. In fact, P1 is homeomorphic to §2 .
This can be seen as follows. Consider a line with homogeneous coordinates (x : y) and
suppose that y 6= 0. Then there is a representative with y = 1, and x can be an arbitrary
element of C . So this subset of P1 is homeomorphic to C . There is only one point of
P1 with y = 0, as all (x,0) are equivalent to (1,0). Thus, it can be seen that P1C is
homeomorphic to the one point compactification of C = R2 , hence to §2 . Another way
to see this, is by considering an open covering
P1 = U0 ∪ U1 ,
C1 ∩ (C 2 × U1 ) = {(x,ax, 1 : a) | x ∈ C },
E ∩ (C 2 × U1 ) = {(0,0, 1 : v) | v ∈ C }.
v
C1
a
y y
C
x x
C1 ∩ (C 2 × U1 ) = ∅
C1 ∩ (C 2 × U2 ) = {(0,y,0 : 1) | y ∈ C },
(1 : a)
π
E
(1 : b)
Second case: C is an P
irreducible curve. Because of Theorem 5.1.17 we may assume
that f = (y p + αxq )c + aij xi y j defines C with q ≥ p, α 6= 0 and gcd(p,q) = 1.
ip+jq>pqc
Let us assume first that q > p.
y 2 − x3 = 0
C
f (x,xv)
C1 ∩ (C 2 × U1 ) = {(x,xv,1 : v) | f1 (x,v) = 0} with f1 (x,v) = = (v p +
P xord(f )
αxq−p )c + aij xi+j−pc v j .
ip+jq>pqc
E ∩ (C 2 × U1 ) = {(0,0,1 : v) | v ∈ C },
C1 ∩ E(∩C 2 × U1 ) = {(0,0,1 : 0)}.
C (1) x
x − v2 = 0
E v
f (uy,y)
And C1 ∩ (C 2 × U2 ) = {(yu,y,u : 1) | f2 (u,y) = 0} with f2 (u,y) = ord(f ) =
P y
(1 + αy q−p uq )c + ip+jq>pqc aij ui y i+j−pc .
E ∩ (C 2 × U2 ) = {(0,0,u : 1) | u ∈ C },
C1 ∩ E ∩ (C 2 × U2 ) = ∅.
5.3 Resolutions of Irreducible Plane Curve Singularities 197
y
f2 = 1 − yu3
E u
m(C) = 4
m(C1 ) = 2
E1
E2
m(C2 )) = 2
E2
m(C3 ) = 1
E1
5.3 Resolutions of Irreducible Plane Curve Singularities 199
E4
E3
E2
m(C4 ) = 1
E1
E4
E3
m(C5) = 1
E2 E1
5 4
1 3 2
We proved this already for (1), (2), and (3). To prove that (2), (4) and (5) determine
each other we need to P
study the resolution more carefully.
Let x = tk0 , y = ai ti define a singularity with characteristic exponents k0 , . . . ,kg .
i≥k1
Let
X1 = k1 ,
r1,1 = k0 .
with
• • • ... • • •
π(i,j) + 1 π(i,j) + 2 π(i,j) + 3 π(i,j) + µij − 1 π(i,j) + µij
with
π(1,1) = 0,
π(i,1) = π i − 1,w(i − 1) + µi−1,w(i−1) , for i = 2, . . . ,g,
π(i,j) = π(i,j − 1) + µi,j−1 , for 2 ≤ j ≤ w(i) and i = 1, . . . ,g.
The point with the smallest weight (respectively largest weight) is called the start
point (respectively end point) of the graph Sij .
Now we connect the end point of Sik with the start point of Sik+2 if k ≤ w(i) − 2,
w(i)−1 w(i)
the end point of Si with the end point of Si for all i. Then we connect the end
w(i) 1
point of Si with the start point of Si+1 if µi+1,1 6= 0, otherwise with the start point
3 2
of Si+1 if w(i + 1) ≥ 3 or with the end point of Si+1 if w(i + 1) = 2. Because w(i) ≥ 2,
w(i)
the end point of Si is, therefore, connected with three other points and usually called
w(g)
contact point of the graph. The end point of Sg we connect with the star. The result
is a graph of the following type:
∗
Contact points
...
...
..
.
... ...
... ...
We have g − 1 contact points and draw the graph in such a way that each contact
point creates a new level.
On the other hand it is obvious how to obtain the characteristic exponents from
the resolution graph: g − 1 is the number of contact points, points which are connected
to three other points. One can identify the Sij as the maximal connected subsets of the
graph such that the weights are strictly increasing (respectively decreasing). Starting
w(g)
with multiplicity 1 which occurs µg,w(g) –times in Sg we obtain rg,w(g)−1 = µg,w(g) · 1.
This way, going from bottom to top in the Euclidean algorithm we can reconstruct the
ki .
202 5 Plane Curve Singularities
10 11 12 13 14 16 15
9 8
1 2 6 7 5 4 3
From the diagram we can extract the number g of levels and the number ω(i) of
blocks in each level i, as well the blocks Sij themselves and, hence, the µi,j = #Sij . The
remaining data in the following table have to be calculated:
g=3
ω(1) = 3 S11 = {1,2} S12 = {3,4,5} S13 = {6,7}
µ1,1 = 2 µ1,2 = 3 µ1,3 = 2
r1,1 = 28 r1,2 = 8 r1,3 = 4 χ1 = 64
ω(2) = 2 S21 = ∅ S22 = {8,9}
µ2,1 = 0 µ2,2 = 2
r2,1 = 4 r2,2 = 2 χ2 = 2
ω(3) = 2 S31 = {10,11,12,13,14} S32 = {15,16}
µ3,1 = 5 µ3,2 = 2
r3,1 = 2 r3,2 = 1 χ3 = 11
In order to do the calculations we proceed as follows: the initialization step in level three
is done by setting
r3,2 = rg,ω(g) = 1.
We then go on with the following formulas:
This finishes level three, and we may continue with level two:
r2,2 = r3,1 = 2,
r2,1 = µ2,2 · r2,2 = 2 · 2 = 4,
k2 − k1 = χ2 = µ2,1 · r2,1 + r2,2 = 0 · 2 + 2 = 2.
r1,3 = r2,1 = 4,
r1,2 = µ1,3 · r1,3 = 2 · 4 = 8,
k0 = r1,1 = µ1,2 · r1,2 + r1,3 = 3 · 8 + 4 = 28,
k1 = χ1 = µ1,1 · r1,1 + r1,2 = 2 · 28 + 8 = 64.
5.3 Resolutions of Irreducible Plane Curve Singularities 203
(1) The graph described above is the resolution graph of the singularity.
(2) The multiplicity sequence contains exactly µi,j times the multiplicity ri,j , i =
1, . . . ,q,j = 1, . . . ,w(i).
S22
S11 S12
And this finishes the algorithm. One easily sees that starting from the diagram we would
get the same values for the µi,j and for the ri,j and, moreover, the reader immediately
verifies the connection between the (ri,j ,µi,j ) and the multiplicity sequence.
204 5 Plane Curve Singularities
Proof of the Theorem. We prove the Theorem using induction on the length k of the
multiplicity sequence.
If the length k is 1, that is, the curve is smooth, defined by x = t, y = t and
k0 = k1 = 1. Consequently X1 = 1, r1,1 = 1 µ1,1 = 1. Thus, from now on we may
assume k > 1. P
Let x = tk0 , y = ai ti , ak1 = 1 and k0 , . . . ,kg be the characteristic exponents and
i≥k1
(m0 , . . . ,mk−1 ) be the multiplicity sequence.
k1 k0
gcd(k0 ,k1 )
The singularity has an equation f of the following type: f (x,y) = x gcd(k0 ,k1 ) −y gcd(k0 ,k1 ) +
P
bij xi y j .
k1 k0
ik1 +jk0 > gcd(k
0 ,k1 )
µ1,1 = µ1,1 − 1,
µi,j = µi,j if (i,j) 6= (1,1),
r i,j = ri,j ,
X i = Xi i > 1.
In case (2) we have:
k1 = X1 = k0 + r1,2 ,
that is, we start with
X 1 = r1,1 = µ1,2 r1,2 + r1,3 ,
..
.
This implies:
X 1 = r1,1 ,
µ1,j = µ1,j+1 ,
r 1,i = r1,j+1 ,
X i = Xi ,i > 1,
r i,j = ri,j ,i > 1,
µi,j = µi,j ,i > 1.
In case (3) we have:
X1 = k1 ,
µ1,1 = 1, r1,1 = r1,w(1)−1 = k0 , r1,2 = r1,w(1) = k1 − k0 ,
r1,w(1)−1 = k0 = µ1,w(1) (k1 − k0 ),
X2 = k2 − k1 ,
r2,1 = k1 − k0 ,
X2 = µ2,1 r2,1 + r2,2 ,
..
.
We start with:
X 1 = X2 + k0 = X2 + µ1,w(1) (k1 − k0 ),
r 1,1 = k1 − k0 .
This implies:
X 1 = µ1,1 r 1,1 + r1,2 ,
µ1,1 = µ2,1 + µ1,w(1) ,
and we have:
X i = Xi+1 , i ≥ 2,
r i,j = ri+1,j , (i,j) 6= (1,1),
µi,j = µi+1,j , (i,j) 6= (1,1).
Now we use the induction hypothesis. In the first case, after blowing up the mul-
tiplicity, r1,1 = k0 occurs µ1,1 − 1 times and the multiplicities ri,j occur µi,j times if
(i,j) 6= (1,1). This proves the Theorem in this case.
In the second case, the multiplicity changes after blowing up, that is, m0 occurs
once. Then
X1 = k1 = 1 · k0 + (k1 − k0 ),k0 = X 1 , and k1 − k0 = r 1,1
determine the rest, as before.
In the third case, the multiplicity also changes after blowing up, that is, m0 occurs
once. m1 = r1,2 = r1,1 occurs (µ1,w(1) + µ2,1 ) times. This is covered by X 1 = X2 +
µ1,w(1) (k1 − k0 ). The rest remains unchanged.
This proves the second part of the Theorem.
206 5 Plane Curve Singularities
Now, using induction hypothesis, assume that the singularity after one blowing up
has the resolution graph as described before and denote the corresponding building blocks
j
by S i .
∗
2
S11 S1
We have only to study the behavior at the first building block:
S11 S12
P
If we assume, as before, f (x,y) = (xq − y p )c + bij xi y j to be the equation
ip+jq>pq
for our singularity with gcd(p,q) = 1 and q > 1, then there are two possibilities for the
intersection of the exceptional divisor E1 with the curve after one blowing up:
(2) q−p < p means that the intersection has maximal contact:
E1 : x = 0.
The second case occurs if and only if the multiplicity changes.
E1 or E1
In the first case the next blowing–up gives:
E2 E2 .
j 1
This implies S i = Sij (with the weights shifted by one) for (i,j) 6= (1,1) and if S is
1 2 3 ℓ−2 ℓ−1
5.3 Resolutions of Irreducible Plane Curve Singularities 207
then S11
1 2 3 ℓ−1 ℓ
In the second case we have two possibilities, the number of characteristic exponents
remains constant (if q −p > 1) or drops (if q −p = 1) after the blowing–up (corresponding
to case (2) and (3) in the proof of the second part of the Theorem).
The next blowing–up gives:
E1 : x = 0
E2 : y = 0
E1 E1
E3 Eℓ
E2
E2
Eℓ−1
Eℓ+1
Eℓ
Eℓ−1 E2
E1
208 5 Plane Curve Singularities
that is, in the graph the point corresponding to E1 is connected with the end of
1 j−1
S 1 = S12 (weights shifted by 1) and S11 = {•}. Moreover, S1j = S 1 (weights shifted by
j
1) for j > 1 and Sij = S i (weights shifted by 1) for i > 1.
E2 E3 E4 Eℓ+1 E1
1
S1
In the case q − p = 1 we obtain:
Eℓ+1
Eℓ
E1 Eℓ−1 E2
that is, in the graph the point corresponding to E1 is connected with the ℓ-th point
1
inS1. E1 and the points corresponding to E2 , . . . ,Eℓ+1 give the first building block of
the resolution graph:
E2 E3 Eℓ Eℓ+1 E1
We obtain S11 : the point defined by E1 .
1
S12 the first ℓ–points of S 1 (weights shifted by 1), connected with S11 at the ℓ–th
point.
1
S21 the rest of S 1 (weights shifted by 1), which is not in S12 .
j
S2j = S 1 (weights shifted by 1) for j > 1.
j j
Si = Si−1 for i ≥ 3.
This proves the Theorem.
As in the irreducible case, we can define the standard resolution of a reducible plane
curve singularity, as follows:
5.4 Reducible Plane Curve Singularities 209
Definition 5.4.4. With the notations of Definition 5.4.1 we consider the following
weighted graph, the resolution graph of C:
Theorem 5.4.5. Let (C, 0) = ∪ri=1 (Ci , 0) be a plane curve singularity. The following
data determine each other:
Proof.
Step 1. First suppose we know the resolution graph of C. Take a branch (Ci , 0), and delete
all stars from the resolution graph which belong to the branches (Cj , 0) for j 6= i. Call a
point of the remaining graph contractible, if it satisfies one of the following conditions:
(1) It is not connected to the star, and connected only to points of lower weight.
(2) It is connected to a star, and to just one other point of lower weight.
This exactly means that the corresponding curve was created in a blowing-up, which
would not have been necessary for getting the standard resolution of the branch (Ci , 0).
From the resolution process, it follows that such a point is connected to at most two
points in the graph. If it is connected to just one other point, then delete the point of
highest weight, and the edge connected to it. If it is connected to two points A and B
say, then delete the point of highest weight, the connecting edges, and connect A and B
by an edge. This corresponds to “blowing-down” the curve corresponding to this highest
weight. Now iterate this process, until it is no longer possible. We then arrive at the
resolution graph of (Ci , 0). For example, if we have the following resolution graph
* *
* 5 4
2 3 1 3 2
we can get the resolution graph of the curve corresponding to the star on the left by the
following process
delete
* *
* 5 4 * 4
2 3 1 3 2 2 3 1 3 2
* * *
2 3 1 3 2 2 3 1 2 2 3 1
To get the contact number γ(Ci ,Cj ), we remove all the stars corresponding to the
branches Ck with k 6= i,j, and then successively remove points with property (1) and
(2). If this is no longer possible, then we are left with the resolution graph of the union
of (Ci , 0) and (Cj , 0). Consider the following numbers.
(1) The maximal natural number k such that the numbers 1, . . . ,k −1,k occur as weight
just once in the resolution graph of the union of (Ci , 0) and (Cj , 0)}.
(2) The weights of the vertices connected to the stars in the resolution graph of the
union of (Ci , 0) and (Cj , 0).
5.4 Reducible Plane Curve Singularities 211
The contact number of (Ci , 0) and (Cj , 0) is the minimum of these numbers. So for
example in the resolution graph above, the contact number between the curves belonging
to the right two stars is equal to min{5,5,4} = 4, between the curve belonging to the star
on the left and the other two is one.
Step 2. Conversely, suppose we know the resolution graphs of the (Ci , 0) and the contact
numbers. We give an algorithm for finding the resolution graph of (C, 0). The proof
goes by induction on r. The case r = 1 is trivial. So suppose that the resolution graph
Γ of ∪r−1i=1 (Ci , 0) has been constructed out off the resolution graphs of the (Ci , 0) and
the contact numbers. For i = 1, . . . ,r − 1, define ℓ(i) = γ(Cr ,Ci ), and let l(r) be the
maximum of γ(Ci ,Cr ) for i 6= r. Let w(i) be the weight of the vertex in Γ belonging
to the exceptional curve which intersects the strict transform of Ci , for i = 1, . . . r − 1.
Let w(r) be the vertex of highest weight in the resolution graph Γr of (Cr , 0). It is
clear that the point in the resolution graph connected to the star, which corresponds to
(Ci , 0) has weight max{ℓ(i),w(i)}. So if ℓ(i) > w(i), we need, starting from the standard
resolution, ℓ(i) − w(i) extra blowing-ups. Therefore, if ℓ(i) > w(i), we modify the graph
Γ as suggested by the following picture.
∗
ℓ(i)
..
.
∗
Γ Γ′
If ℓ(i) ≤ w(i) we leave Γ unchanged at the star corresponding to (Ci 0). We get a
modified graph Γ′ . Similarly, we get a modified graph Γ′r . Let k = max{ℓ(j) : j < r} be
the maximum of the contact numbers of (Cr , 0) with (Cj , 0) for j < r. Without loss of
generality, we may suppose that this maximum is achieved for j = 1. Let π1 ◦ · · · ◦ πk :
Xk −→ U be the map which occurs in the standard resolution process of ∪ri=1 (Ci , 0) after
(k) (k)
k steps. So by definition of contact number, the strict transforms C1 and Cr intersect
(k)
a common component E1 of E (k) , but do not intersect each other. In fact, by choice of
(k) (k)
k, Cr does not intersect any of the Cj . Let p be the point of Γ′ which corresponds to
(k)
E1 . If k > 1, it follows from the resolution process that there is a unique component
(k)
E ′ of E (k) which intersects E1 and was created under πk−1 . The point of Γ′ which
′
corresponds to E we call q. It has weight k − 1. There are two possible cases.
(k)
(1) k > 1, and the curve Cr intersects E ′ . Note that the points p and q in Γ′ are
connected by an edge. The following is a possible picture of a part of the resolution
graph after k blowing-ups.
212 5 Plane Curve Singularities
(k)
C1
E′
(k)
E1
(k)
Cr
(k)
(2) k = 1, or the curve Cr does not intersect E ′ .
(k)
C1
E′
(k)
E1
(k)
Cr
In the second case, consider the subgraph Γ′′r of Γ′r which one gets by deleting all points
(k−1)
of weight less than k. So this is the resolution graph of Cr with all weights shifted
(k) (k)
by k − 1. As we already noted that none of the Cj intersects Cr , it follows that the
resolution graph of (C, 0) is obtained by identifying the unique point of Γ′′r of weight k
with p, which also has weight k.
In the first case, consider the subgraph Γ′′r of Γ′r which one gets by deleting all points
of weight less than k − 1. The graph Γ′′r looks like
k k−1
and we get the resolution graph (C, 0) by doing surgery: In Γ′ one replaces the
subgraph
p q
k k−1
by Γ′′r . So we see that we can construct the resolution graph (C, 0) from the resolution
graphs of (Ci 0) and the contact numbers.
Example 5.4.6. We consider the reducible plane curve singularity given by f = 0, where
f = f1 f2 f3 with
f1 = (y 2 − x3 )2 − 4x5 y − x7 ,
f2 = y 2 − x3 ,
f3 = y 3 − x2
5.4 Reducible Plane Curve Singularities 213
The second and third branch are cusp singularities, and we know how the resolution for
those looks like, see the fourth part of Example ??. The resolution of the first branch has
been studied in Example 5.3.7. So we have to put the following three resolution graphs
into one graph.
*
5 4 * *
1 3 2 2 3 1 2 3 1
E1
and deduce that the contact number of (C1 , 0) and (C3 , 0) is equal to one. Similarly, the
contact number of (C2 , 0) and (C3 , 0) is equal to one. We still have to determine the
contact number of (C1 , 0) and (C2 , 0). So we look at the resolution of the union of (C1 , 0)
and (C2 , 0). One can calculate that the sequence of blowing-ups continues as suggested
in the following picture.
(3)
C2 (3)
C1
(2)
C2
(2)
C1 E3
E2
E1
E1
E2
214 5 Plane Curve Singularities
(4) (4)
We see that the on the fourth blowing-up the branches C1 and C2 are separated.
Thus the contact number is four. It follows that the resolution graph of the curve (C, 0)
is equal to
* *
* 5 4
2 3 1 3 2
by using the algorithm described in the proof. Of course, we could also have gotten this
resolution graph by simply doing the whole resolution process.
Finally, summarizing all results, we obtain the following theorem:
Theorem 5.4.7. Let (C, 0) be a germ of a plane curve singularity. The following data
determine each other:
(1) the multiplicity sequences of the branches of C and the contact numbers between
the branches;
(2) the resolution graphs of the branches of C and the contact numbers between the
branches;
(3) the resolution graph of C.
Next we want to characterize the contact numbers of the branches by their intersec-
tion multiplicity.
Theorem 5.4.8. Let (C, 0) respectively (D, 0) be irreducible plane curve singularities
with multiplicity sequences (m0 , . . . ,mg ) respectively (n0 , . . . ,nh ). For k ≥ g + 1 define
mk = 1, and similarly, for k ≥ h + 1 define nk = 1. Then
γ(C,D)
X
(C, 0) · (D, 0) = mi · n i .
i=0
Proof. Of course we use Lemma 5.1.5 and induction on γ(C,D). After a general linear
coordinate change we may assume that (D, 0) is given by a Weierstraß polynomial f =
y n + a1 y n−1 + . . . + an of degree n = n0 in y, and that (C, 0) has as parametrization
By Lemma 5.1.5 we have (C, 0)·(D, 0) = ordt (f ). We look at what happens after blowing-
up. Then
x(t) = tm , v(t) = tq−m + . . . .
is a parametrization of the strict transform C (1) . An equation for the strict transform
D(1) of (D, 0) is given by f (x,v) = 0, where f (x,v) is defined by
Therefore,
ordt f x(t),y(t) = ordt f x(t),x(t)v(t)
= ordt x(t)n · ordt (f (x(t),v(t)))
=m0 · n0 + C (1) · D(1)
by induction.
Example 5.4.9. Consider the curve (C, 0) given by (y 2 − x3 )2 − 4x5 y − x7 = 0, and
(D, 0) given by g = y 2 − x3 = 0. The multiplicity sequence of (C, 0) is (4,2,2,1,1), and the
multiplicity sequence of (D, 0) is (2,1,1). We have seen that the contact number is four, see
Example 5.4.6. It follows that the intersection multiplicity is equal to 4·2+2·1+2·1+1·1 =
13. The curve (C, 0) is given by the parametrization x = t4 , y = t6 + t7 . So g is in C {t}
equal to (t6 + t7 )2 − (t4 )3 = 2t13 + t14 , so its order is thirteen. Therefore, conversely, one
can now deduce that the contact number is four.
As in the irreducible case, we define the δ–invariant.
e 0) its normalization. Then
Definition 5.4.10. Let (C, 0) be a curve singularity and (C,e
we define the δ–invariant by δ(C, 0) = dimC (OC,e
e 0 /OC,0 ).
The following easy lemma is useful if one wants to calculate the δ–invariant for
reducible singularities.
Lemma 5.4.11 (Lemma of Hironaka). Let (C, 0), and (D, 0) in (C n , 0) be reduced curve
singularities which do not have a component in common. Then
e 0 ⊕ OD,e
dimC (OC,e e 0 )/(OC,0 ⊕ OD,0 ) + dimC (OC,0 ⊕ OD,0 )/OC∪D,0
We wish to finish this chapter with a proof of Max Noether’s Theorem. This theorem
tells us how to compute the δ–invariant from the multiplicity sequence of an irreducible
plane curve singularity. In the proof of Noether’s Theorem, we need the following lemma.
Lemma 5.4.12. Let (C, 0) = V (f ), 0 be an irreducible plane curve singularity. Let the
multiplicity be m = mult(C, 0), and c = c(C, 0) be the conductor. Suppose g ∈ OC,0 ⊂
e 0 = C {t} with ordt (g) ≥ c. Then the multiplicity of g is at least m − 1.
OC,
216 5 Plane Curve Singularities
Proof. Suppose the converse. Then after a general linear coordinate change, we may
assume that g is regular in y of order k < m − 1 in y. We may also assume that f is
regular of order m in y, so that (C, 0) has parametrization x = tm , y = ts + . . . We now
look at h := xy · g ∈ OC,
e 0 . This is indeed in OC,
y
e 0 , since x = t
s−m
· unit ∈ C {t} = OC,
e 0.
Obviously ordt (h) ≥ c(C, 0), hence h ∈ OC,0 . Thus in OC,0 we have the equality yg = xh.
Therefore,
qf = yg − xh
for some q ∈ C {x,y}. Now on the right-hand side a term y k+1 occurs, as it occurs in yg
but not in xh. But it cannot occur on the left-hand side, as the multiplicity of f is m,
and k + 1 < m. This is a contradiction.
Theorem 5.4.13 (M. Noether). Let (C, 0) be an irreducible plane curve singularity,
(m0 , . . . ,mr ) be the multiplicity sequence of (C, 0). Then
r
X mi (mi − 1)
δ(C, 0) = .
i=0
2
Pr
Proof. As c(C, 0) = 2δ(C, 0) by 5.2.4, it suffices to show c(C, 0) = i=0 mi (mi − 1). The
formula, obviously, holds for the case of a smooth curve, since c(C, 0) = 0 and m0 = 1.
Let C (1) be the strict transform of (C, 0) under blowing-up U in the origin for some
sufficiently small neighborhood U . We claim that if the formula holds for C (1) , then it
holds for (C, 0). By induction on r, this suffices to prove the theorem. Suppose (C, 0) is
given by the parametrization
x = tm , y = ts + . . . , s > m.
x = tm , v = ts−m + . . . .
Let g = g(x,v) ∈ OC (1) . As we are looking at the vanishing order of g in OC, e 0 = C {t},
and C (1) is the zero set of a function which is regular of order ≤ m, we may, by applying
the Weierstraß Division Theorem, assume that degv (g) ≤ m − 1. We write
As (C, 0) and C (1) have the same normalization, it follows that g(x,y) ∈ OC,0 and has
vanishing order ordt (g) + m(m − 1). Hence it follows that
as claimed.
Step 2. To finish the proof we show that
5.4 Reducible Plane Curve Singularities 217
Suppose the converse, that is, let g ∈ OC,0 with ordt (g) = c(C (1) ) + m(m − 1) − 1. It is
left as Exercise 5.4.15 to show that we may take such g which is moreover reduced. By
the first step we know that c(C, 0) ≤ c(C (1) ) + m(m − 1) − 1. By the previous lemma, we
know that the multiplicity of g, say q, is at leastm − 1. Let g(x,v) = x−q g(x,xv). Then
5.4.8 gives that (C, 0) · V (g), 0 = C (1) · V (g), 0 + qm. Hence
C (1) · V (g), 0 = c(C (1) ) + m(m − 1) − 1 − qm,
because (C, 0) · V (g), 0 = ordt (g). Thus, we have found a function g ∈ OC (1) which
has vanishing order ordt (g) = c(C (1) ) + m(m − 1) − qm − 1. Now q ≥ m − 1, so that
xq−m+1 · g ∈ OC (1) . This function has vanishing order c(C (1) ) − 1. This is a contradiction
to the definition of the conductor c(C (1) ).
Example 5.4.14. Consider the plane curve singularity given by (y 2 − x3 )2 − 4x5 y − x7 =
0. The multiplicity sequence is (4,2,2,1,1), so the delta–invariant is equal to 6 + 1 + 1 = 8.
This can also be calculated immediately, with the aid of the characteristic exponents,
which are 4,6 and 7. Thus one can calculate the minimal set of generators of the semigroup
to be 4,6 and 13. The fact that δ = 8 is then easily seen by counting.
Exercises
5.4.15. Prove that one may take g in the proof of 5.4.13 to be reduced.
(Hint: Add elements of high order to some factors which are multiple.)
5.4.16. Let (C, 0) = ∪(Ci , 0) be a curve singularity. Consider the standard resolution of (C, 0),
for some small U :
Xk −→ . . . −→ X1 −→ U.
Let C (j) be the strict transform of C in Xj . An infinitely near point of C is, by definition, a
singular point of C (j) for some j, C (0) = C. Prove that
X mp (mp − 1)
δ(C, 0) =
p infinitely
2
near point
5.4.17. Let (C, 0) be the plane curve singularity defined by f = (x3 − y 4 )(x3 − y 7 )(y 11 − x5 ) = 0.
Compute the multiplicity sequences of the branches of (C, 0), the contact numbers, and the res-
olution graph of (C, 0). Also construct the resolution graph of (C, 0) out off the resolution graphs
of the branches and their contact numbers.
5.4.19. Let (C, 0) be a reduced plane curve singularity given by a Weierstraß polynomial f
in y of degree b and multiplicity m. Let C (1) be the strict transform under blowing-up, E the
exceptional divisor. Prove:
P (1)
(1) p∈C (1) ∩E (E,p) · (C ,p) = b.
218 5 Plane Curve Singularities
P
(2) µ(X, 0) − 1 = p∈C (1) ∩E µ(C (1) ,p) + m(m − 1).
(Hint: Use Exercise 5.1.29, and study the behavior of the terms in that formula under
blowing-up; use part 1 of this exercise and use Lemma 5.4.8.)
(3) Deduce Milnor’s formula
0 s
The picture describes a typical situation. In the fiber above the point 0, we have an
intersection of a line with a cusp and the intersection multiplicity at that point is three.
In a small neighborhood we have in the fiber three different intersection points and a
transversal intersection at each point of multiplicity 1. Altogether we see a conservation
of the intersection number. We will see that other invariants, such as the Milnor number
and the δ–invariant have the same property.
The basis to prove such this property is the theory of sheaves. Therefore, we start
the chapter with an overview of the basic facts of sheaf theory. In short, a presheaf S
of rings (or modules, or . . .) on a topological space gives for all open subsets U a ring
S (U ) (or module, or . . .), with some natural conditions. An element of S (U ) is called
a section over U . One can talk about the germ of a section at any point of U . The
set of germs at x is called the stalk at x, notation Sx . A sheaf is a presheaf, which
is, loosely speaking, determined by its stalks. The application we have in mind is that
these stalks are modules whose vector space dimension gives an invariant of a germ of
an analytic space. Thus, we want to compare the various stalks for a given sheaf. Now
general sheaves are like “sand” but coherent sheaves are not. This is reflected in the
statement that a map of sheaves which induces an isomorphism at the stalk at x, is an
isomorphism in a small neighborhood of x. Loosely speaking, the stalk determines the
sheaf in an open neighborhood. The problem with the theory of coherent sheaves is that
it is difficult to give nontrivial examples. The only examples one gets from the definition
are the zero sheaf, and finitely generated sheaves over a point. However, it is true that
most sheaves we consider are coherent. For example OX , that is, the sheaf of holomorphic
220 6 The Principle of Conservation of Number
functions, the ideal sheaf of an analytic set and the normalization sheaf of an analytic
space are coherent. Moreover, for finite maps coherence is preserved (Finite Mapping
Theorem). These theorems are all proved in Section 6.3. In Section 6.2 we develop the
technical tools. Here we prove the Meta-Theorem, which says that coherence of sheaves
is preserved under all “reasonable” algebraic operations. In Section 6.2 we discuss these
coherent sheaves, but give only two examples.
On this basis, in Section 6.4, the principle of conversion of number is studied. In
particular we study the behavior of the invariants mentioned above. Finally, in Section
6.5, we study Cohen-Macaulay spaces. This notion is closely related to the principle of
conservation of number. Hypersurfaces, more generally, complete intersections are Cohen-
Macaulay.
The space in the picture is not Cohen-Macaulay. There is an algebraic method of de-
ciding whether spaces are Cohen-Macaulay. This method is by using resolutions and the
Auslander-Buchsbaum formula. From this it follows that a space is Cohen-Macaulay if
and only if the resolution has the right length.
For lack of a better site, we will finally discuss the Hilbert-Burch Theorem. It says
that the ideal of Cohen-Macaulay codimension two spaces can be described by the t
minors of a certain t × (t + 1) matrix. It is used in Chapter 10 to give examples of
singularities for which there exists a semi-universal deformation over a smooth space.
6.1 Sheaves
For s ∈ S (U ) one usually writes s|V instead of ρUV (s). For all open subsets V we
get in a natural way a presheaf S|V which we call the restriction of S to V .
Similarly one defines presheaves of abelian groups, of vector spaces, C –algebras, etc.
Examples 6.1.2.
(1) Let X be the topological space consisting of just one point. Then a presheaf of
rings on X can be identified with a ring.
(2) Let X be a topological space. For all open U ⊂ X put
C (U ) := {f : U −→ C , f is continuous}.
For V ⊂ U , let ρUV be the obvious restriction of functions from U to V . Then one
shows without difficulty that this defines a presheaf of rings, or even of C –algebras
on X.
(3) Consider an open set X ⊂ C n . Put for U ⊂ X open:
O(U ) := {f : U −→ C , f is holomorphic},
and again for an open subset V of U , the map ρUV is given by restriction. This is
a presheaf of C –algebras.
(4) Consider an open set X ⊂ C n . For U ⊂ X open we put
O′ (U ) := {f : U −→ C , f is holomorphic},
but we define different restriction maps. For V $ U we put ρUV = 0, and ρUU = id.
Then O′ is a presheaf.
(5) Let A be a ring, X a topological space. The constant presheaf C is given by C (U ) =
A if U 6= ∅, and C (∅) = 0. The maps ρUV is the identity map if V 6= ∅. Moreover,
ρU∅ is the zero map for all open U .
Definition 6.1.3 (Sheaf Conditions). Let X be a topological space, S a presheaf on
X. Then S is called a sheaf if the following two sheaf axioms are satisfied.
(1) Let U be an open set in X, and U = ∪i∈I Vi be an open covering of U . Let s ∈ S (U )
be a section of S over U , such that s|Vi = 0 for all i. Then s = 0.
(2) Let U be an open set in X, and U = ∪i∈I Vi be an open covering of U . Let si ∈ S (Vi )
be given such that si|Vi ∩Vj = sj|Vi ∩Vj for all i,j ∈ I. Then there exists an s ∈ S (U )
such that s|Vi = si .
A presheaf which satisfies the sheaf axioms, so is in fact a sheaf, is also called a canonical
presheaf.
Remark 6.1.4. It follows from the first condition that such an s, existing according to
the second condition, is uniquely determined.
Example 6.1.5. In 6.1.2, the first three examples, in fact, are sheaves. The fourth
example, the presheaf O′ is not a sheaf, as it does not satisfy the first condition. Moreover,
the constant presheaf A is, in general, not a sheaf. In fact, suppose U,V are two open
subsets of X with U ∩ V = ∅. Let a,b ∈ A, with a 6= b. Then the restrictions of a and b
to U ∩ V = ∅ are equal, but there does not exist a section of U ∪ V , restricting to a and
b, respectively.
222 6 The Principle of Conservation of Number
fU : S (U ) −→ T (U ),
for all open subsets U of X, such that for all V ⊂ U ⊂ X open, the diagram
fU
S (U ) / T (U )
ρU V ρ′U V
fV
S (V ) / T (V )
fp : Sp −→ Tp .
∗
f : OX −→ OX , fU : g 7→ e2πig .
∗
Let U be an open subset of X, and OX (U ) = {h ∈ O(U ): h(p) 6= 0 for all p ∈ U }. Then
∗
fp is surjective for all p. Indeed, take hp ∈ OX,p , and a small simply connected open
neighborhood U of p on which h is a representative of hp . By continuity, we may assume
that U is so small that h is nonzero throughout U . From function theory, we know that
log(h) exists, and e2πi log(h) = h. Thus log(h)p maps to hp , showing that the map of
sheaves is surjective. However fp is not injective. With log(h) also log(h) + k for all k ∈ Z
maps to h. Let z be a coordinate on C . Then z ∈ O∗ (X) but z is not in the image of f ,
as log(z) is not a well defined holomorphic function on X \ {0}.
This example also shows that given a map f : G −→ H of sheaves the presheaf
defined by U 7→ H(U )/fU G (U ) is not necessarily a sheaf. The first sheaf axiom is
not satisfied: locally defined logarithms do not necessarily glue to a globally defined
logarithm. Thus we have problems in defining the cokernel of a map as sheaf. These
kinds of problems occur for several other constructions with sheaves as, for example, the
tensor product. This is the reason for the following construction, called sheafifying.
Theorem 6.1.13. Let X be a topological space and F be a presheaf on X. There exists
a sheaf F + and a morphism j : F −→ F + of presheaves with the following properties:
Definition 6.1.14. The (uniquely) determined sheaf F + is called the sheaf associated
to the presheaf F . One says that F + is obtained from F by sheafifying.
Proof of Theorem 6.1.13. Let U be an open set and define F + (U ) to be the set of func-
tions f : U −→ ∪ Fx having the following two properties:
x∈U
Now it is not difficult to verify, see Exercise 6.1.36, that F + together with the canonical
restriction maps is a sheaf. It is also not difficult to check that the canonical map j :
F −→ F + defined by jU (g)(y) = gy has the universal property mentioned, see Exercise
6.1.36.
Note that not all nonzero section of F (U ) give nonzero sections of F + (U ). Namely,
it might happen that all stalks are zero, but F (X) 6= 0. On the other hand, in the
∗
example f : OX −→ OX , fU (g) = e2πig the function z is in Im(f )(U ) as sheaf, but not
as presheaf. So F + (U ) might have more sections. This is because by definition sections
of F + (U ) are not sections of F (U ), but functions which are locally equal to sections of
F.
We are now prepared to give a lot of examples for sheaves.
Definition 6.1.15. Let X be a topological space, F , G be sheaves on X and f : F −→ G
a morphism.
6.1 Sheaves 225
(1) The kernel of f , Ker(f ) is the sheaf defined by Ker(f )(U ) = Ker(fU ).
(2) The cokernel of f , Coker(f
) is the sheaf associated to the presheaf defined by
U 7→ G (U )/fU F (U ) . A cokernel of a map of sheaves is also called a quotient
sheaf.
(3) The image of f , Im(f ) is the sheaf associated to the presheaf defined by U 7→
fU F (U ) .
(Note that Im(f ) is a subsheaf of G , Exercise 6.1.37.)
(4) Let A be a ring, the constant sheaf CA is the sheaf associated to the constant
presheaf defined by U 7→ A if U 6= ∅. In this case if U is the disjoint union of two
connected subsets, we have C (U ) = A ⊕ A. Sections of CA over U are now allowed
to differ on the two connected subsets.
The proof that the presheaf defined in (1) is already a sheaf will be given in Exercise
6.1.37.
We now relate (pre)sheaves which are defined on different topological spaces.
Definition 6.1.16. Let a continuous map f : X −→ Y between topological spaces be
given.
0 −→ S1 −→ S2 −→ S3 −→ 0.
0 −→ f∗ S1 −→ f∗ S2 −→ f∗ S3 −→ 0
is also exact.
One phrases this by saying that for finite f , the functor f∗ is exact. This theorem
will play an important role in the proof of the Finite Mapping Theorem, see 6.3.5. The
theorem follows immediately from the fact that exactness means exactness of the stalks
and the following theorem.
Theorem 6.1.18. Let X,Y be Hausdorff spaces, and f : X −→ Y be a finite continuous
map. Let S be a sheaf on X. Let y ∈ Y , and x1 , . . . ,xt be the points in the fiber f −1 (y).
Then we have the equality of stalks:
t
M
(f∗ S )y = Sxi .
i=1
Proof. As X is Hausdorff, we can choose pairwise disjoint open neighborhoods U1′ , . . . ,Ut′
of x1 , . . . ,xt in X. As f is closed, there exists an open neighborhood V of y in Y , such
that f −1 (V ) ⊂ ∪ti=1 Ui′ , see Exercise 3.4.36.
f
x1 U1′ xt Ut′ y V
∼
Then there exists a unique sheaf F on X and isomorphisms λi : F|Ui −→ Fi such that
for all i,j, ϕij ◦ λi = λj on Ui ∩ Uj .
In fact, for an open subset V of X we define
( )
Y
F (V ) := {si }i∈I ∈ Fi (Ui ∩ V ) : ϕij (si|Ui ∩Uj ∩V ) = sj|Ui ∩Uj ∩V .
i∈I
The proof that this defines a sheaf, and the uniqueness is left as Exercise 6.1.40.
Definition 6.1.20. A ringed space (X,OX ) is a topological space X together with a
sheaf of rings OX . Usually, if it is clear from the context what the sheaf of rings is, we
simply write X for the pair (X,OX ).
A sheaf F on X is called an OX –module if F (U ) is an OX (U )–module for all U ⊂ X
open, such that the restriction mapping F (U ) −→ F (V ) is OX (U )–linear for V ⊂ U .
(Note that F (V ) for V ⊂ U is also an OX (U )–module.)
An OX –module sheaf F which is a subsheaf of OX is called an ideal sheaf.
Examples 6.1.21 (Ringed Spaces).
(1) Let U be an open set of C n and OU be the sheaf of holomorphic functions on U .
(2) Let U be an open set in C n , f1 , . . . ,ft be holomorphic functions defined on U .
Consider X := V (f1 , . . . ,ft ). Let V ⊂ U be open. Consider the presheaf defined by
OX (V ∩ X) := O(V )/(f1 , . . . ,ft )O(V ) on X. This is easily checked to be a sheaf.
The ringed space constructed in this way is called a complex model space.
Consider X = {x ∈ U : f1 (x) = · · · = ft (x) = 0}. Consider the presheaf I (X) on
U defined by I (X)(W ) = {f ∈ OC n (W ) : f (x) = 0 for all x ∈ W ∩ X}. This is
easily checked to be a sheaf and is called the ideal sheaf of X.
(3) Let U ⊂ Rn be an open subset, and OU the sheaf of differentiable functions on U .
Similarly let CU be the sheaf of continuous functions on U .
(4) An affine variety X (with the Zariski topology) can also be seen as a ringed space
by taking as canonical presheaf the presheaf which assigns to each U ⊂ X the ring
g
OX (U ) = {f ∈ K(X) : ∀p ∈ U ∃g,h ∈ K[X] with f = and h(p) 6= 0}.
h
An element f in OX (U ) therefore can be viewed as a rational function f : U −→ K
which have a well-defined value for all p ∈ U .
(5) Spectrum of a ring: Let R be a ring (commutative with 1 as always). Then the
spectrum Spec(R) is defined as follows:
• As a set Spec(R) is the set of prime ideals p ⊂ R.
• One defines a topology on Spec(R) by defining the closed sets to be the sets
of type V (I) := {p ∈ Spec(R) : I ⊂ p}. This topology is called the Zariski
topology on Spec(R).
• We define a sheaf O of rings on Spec(R). Let U = D(f ) := Spec(R) \ V (f )
be an open set of Spec(R). Then O(D(f )) := Rf . We have to show that this
is independent of the chosen f . For this we refer to Exercise 6.1.33. As the
D(f ) form a basis for the topology by Exercise 6.1.34 this induces a presheaf
on Spec R. This presheaf is in fact a sheaf, see Exercise 6.1.34.
228 6 The Principle of Conservation of Number
(1) HomX (F ,G ) is the sheaf defined by HomX (F ,G )(U ) = HomOX|U (F|U ,G|U ).
(2) F ⊗OX G is the sheaf associated to the presheaf defined by U 7→ F (U )⊗OX (U) G (U ).
(3) F ⊕ G is the sheaf defined by (F ⊕ G )(U ) = F (U ) ⊕ G (U ).
(4) Let F ,G ⊂ H, H an OX –module sheaf. Then F ∩ G is the sheaf defined by
(F ∩ G )(U ) = F (U ) ∩ G (U ). F + G is the sheaf associated to the presheaf defined
by U 7→ F (U ) + G (U ). F : G is the sheaf associated to the presheaf U 7→ F (U ) :
G (U ).
The proof that the presheaves defined in (1), (3) and (4) are already sheaves is left
as Exercise 6.1.38.
Note that in all the examples considered above the stalk at each point is a local ring.
We put this phenomenon in a definition.
Definition 6.1.23. A locally ringed space (X,OX ) is a ringed space, such that for all
points p ∈ X the stalk OX,p is a local ring. A morphism between locally ringed spaces is
a pair: (f,f ∗ ) : (X,OX ) → (Y,OY ), where f : X → Y is a continuous map, and f ∗ is a
morphism of sheaves of OY −modules:
f ∗ : OY → f∗ OX
sends the maximal ideal of OY,f (p) into the maximal ideal of OX,p . Usually, if there is no
danger of confusion, we just write f for the pair (f,f ∗ ).
Examples 6.1.24.
(1) Let (X,OX ) be a locally ringed space. (X,OX ) is called a complex manifold if every
x ∈ X has a neighborhood U such that (U,OX|U ) is isomorphic to (V,OV ), V ⊂ C n
a domain.
(2) (X,OX ) is called an analytic space if every x ∈ X has a neighborhood U such that
(U,OX|U ) is isomorphic to (V,OV ), V an analytic subset of an open set W in some
C n , and OV = OW /I (V ) |V .
Exercises
6.1.27. Consider the following presheaf on R. Let U ⊂ X be open and
B(U ) := {f : U −→ R : f bounded},
6.1.28. Let S be a sheaf of rings on a topological space X. Show that for each point x ∈ X
the stalk Sx is in a natural way a ring again.
6.1.30. Prove that for f : X −→ Y continuous, S a sheaf on X, the direct image f∗ S is also
a sheaf.
6.1.31. Let R be a Noetherian ring. Prove that the topology on Spec(R) is indeed a topology
by proving the following statements:
` ´ ` ´
(1) V (0) = Spec(R); V (1) = ∅,
(2) V (I · J) = V (I) ∪ V (J),
P
(3) V ( j Ij ) = ∩j V (Ij ).
230 6 The Principle of Conservation of Number
6.1.32. For a subset A of Spec(R) let I (A) = ∩ p. Prove the analog of Hilbert’s Nullstellen-
p∈A
√
satz: I = I (V (I)).
(Hint: Use primary decomposition of an ideal).
6.1.33. Let R be a ring and f,g ∈ R such that D(f ) = D(g) (equivalently V (f ) = V (g)). Prove
that Rf = Rg .
6.1.34. Let R be a ring. Show that the open sets D(f ) form a basis for the topology of Spec(R).
Show that the presheaf O on Spec(R), defined in 6.1.21, is in fact a sheaf.
6.1.36. Prove that F + defined in the proof of Theorem 6.1.13 is a sheaf and the canonical map
j : F −→ F + has the universal property.
6.1.41. One can view X = C 2 \ {0} as a locally ringed space (X,OX ) just as one can see an
affine variety as a locally ringed space.
(1) Show that OX (X) = C [x,y].
(2) Show that X, as locally ringed space, is not isomorphic to an affine variety.
In this section, unless otherwise stated explicitly, X will be a complex space, with struc-
ture sheaf OX , mX ⊂ OX the maximal ideal. S ,T , etc., will be sheaves of OX modules.
If (X,OX ) is fixed we shall often use O and m instead of OX and mX .
What is a coherent sheaf? Well, before defining it, let us mention some fundamental
properties of coherent sheaves, give some examples, and deduce some properties which
show that coherence is indeed a very useful property a sheaf can have. The properties we
shall mention will also justify the name “coherence”.
6.2 Fundamental Properties of Coherent Sheaves 231
αx : Sx −→ Tx
is an isomorphism. Then there exists an open neighborhood U of x, such that for all
points p ∈ U the maps of stalks
αp : Sp −→ Tp
are isomorphisms, that is, the sheaves S and T are isomorphic, when restricted to U .
The Meta Theorem will be proved in Lemma 6.2.11, Corollary 6.2.12, Lemma 6.2.13
and Lemma 6.2.14. Theorem 6.2.2 is an immediate consequence of 6.2.12 (2) and 6.2.9.
So, to show that two coherent sheaves S and T are isomorphic on some open set
containing a point x, one has to find a map of sheaves α : S −→ T , and then show that
αx is an isomorphism. The last condition usually is pure algebra! As a sheaf is determined
by its stalks, the above property gives an easy method to give examples of sheaves which
are not coherent.
Example 6.2.3. Consider the ringed space consisting of the line C and the sheaf OC
of holomorphic functions.
For S we take the zero-sheaf (this sheaf will be coherent). To define T , consider a
sequence of points p1 ,p2 , . . . , converging to 0. Let T be the subsheaf of OC consisting of
all holomorphic functions vanishing at each point pi . We let α be the zero map. By the
identity Theorem for holomorphic functions in one variable, every holomorphic function
defined on some open neighborhood of U of 0 and vanishing at all points pi which lie
in U actually is zero. This shows that the stalk T0 is zero, hence α0 is an isomorphism.
However, T is not the zero sheaf (there exist nonzero sections over any open set not
containing zero). Therefore T is not a coherent sheaf. In fact, for all points p 6= pi and
p 6= 0, we have Tp = Op . For the pi we have Tpi = mpi , the germs of holomorphic
functions vanishing at the pi . Notice that T is an ideal sheaf but not finitely generated.
To prove this we show that for every domain U ⊂ C containing 0, every OC –module
homomorphism ϕ : OC |U −→ T|U has to be the zero map. Namely, because U contains
0 it contains almost all of the pi , then T (U ) = {0} because of the Identity Theorem
(Remark 3.1.10).
Now let W ⊂ U be an open subset and consider the diagram
ϕU
OC (U ) / T (U ) = {0}
ρU W ρ′U W
ϕW
OC (W ) / T (W ).
As ϕU (1) = 0 we have ϕW (1) = ϕW ρUW (1) = ρ′UW ϕU (1) = 0. This implies that
ϕW (g) = g · ϕW (1) = 0 for all g ∈ OC (W ). Hence ϕ = 0.
232 6 The Principle of Conservation of Number
So for coherent sheaves such an ugly thing does not happen. The problem with the
theory of coherent sheaves (in complex analysis, not in algebra, see Exercise 6.2.19.)
is that one has to work hard to produce examples. It follows relatively easy from the
definition that sheaves behave well under most algebraic operations on sheaves (the Meta–
Theorem for coherent sheaves): kernels, cokernels of maps between coherent sheaves are
coherent. Direct sums of coherent sheaves are coherent. Finitely generated subsheaves
of coherent sheaves are coherent. Taking Hom, tensor product etc. preserves coherence.
These things will be discussed in this section. It is much harder to prove the four basic
coherence Theorems:
(1) The structure sheaf On is coherent. More generally, the structure sheaf of any
complex space is coherent.
(2) Coherence is preserved under finite mappings of complex spaces.
(3) The ideal sheaf of any analytic space is coherent.
(4) The normalization sheaf of any analytic space is coherent.
These Theorems will be proved in Section 6.3. Assuming these, we give some applications.
Corollary 6.2.4. Let J ⊂ OX be a coherent ideal sheaf then V (J) := {x ∈ X : Jx ⊂
mX,x } is an analytic space.
Proof. Let x ∈ X and assume that Jx ⊂ mX,x . Since OX,x is Noetherian, we have Jx =
(f1 , . . . ,fm ). Let U ⊂ X be an open set such that f1 , . . . ,fm ∈ OX (U ). Let J ⊂ OX|U
be the ideal sheaf generated by f1 , . . . ,fm . This is a coherent sheaf (Theorem 6.2.1) and
we have a canonical map α : J −→ J such that αx : J x −→ Jx is the identity. This
implies that J |W = JW for an open set W ⊂ U , x ∈ W (Theorem 6.2.2). Therefore,
V (J) ∩ W = {x ∈ W : f1 (x) = · · · = fm (x) = 0}.
Corollary 6.2.5. Let M be a coherent OX –module. Then the support of M :
supp(M ) := {x ∈ X : Mx 6= 0} is an analytic space.
Proof. Along with M , the annihilator is coherent (cf. Theorem 6.2.1 or Lemma 6.2.14),
and the annihilator is an ideal sheaf. Now the support
of M is given by the zero set of
the annihilator of M : Supp(M ) = V Ann(M ) .
Indeed, let x ∈
/ Supp(M ). Then Mx = 0, so that Ann(Mx ) = Ann(M )x = OX,x .
Therefore, x does not lie in the zero set of Ann(M ). Suppose on the other hand that
x∈ / V Ann(M ) . Then there exists an f ∈ Ann(Mx ) with f (x) 6= 0 and fx m = 0 for
all m ∈ Mx . As f (x) 6= 0 it follows that the germ fx is a unit, so that m = 0 for all
m ∈ Mx . Hence Mx = 0, and thus x ∈ / Supp(M ). Hence the support of M is an analytic
subset.
Corollary 6.2.6. Let F be a coherent sheaf and supp(F ) = {y : Fy 6= 0} = {x} then
(1) F (X) = Fx ;
(2) dimC Fx < ∞.
6.2 Fundamental Properties of Coherent Sheaves 233
Proof. The first property holds without using the coherence. Consider the map F (X) −→
Fx . If for some s ∈ F (X) sx = 0 then there exists a neighborhood U of x such that
s|U = 0 because sy = 0 for all y 6= x. Let W := X \ {x} then X = U ∪ W and s|W = 0
implies s = 0. To prove the surjectivity let f ∈ Fx and choose a neighborhood U of x
such that f ∈ F (U ). On W := X \ {x} we have f|U∩W = 0. So the section 0 on W and
f on U glue to a section on X.
To prove (2) let I = Ann(F ) and assume that, in a neighborhood U of x, I is
generated by f1 , . . . ,fr . In OX,x we consider the primary decomposition of Ix = q1 ∩· · ·∩qs
and claim that s = 1 and q1 is mX,x –primary. If this is true, then mcX,x annihilates
Fx for a suitable c and, therefore, Fx is a finitely generated OX,x /mcX,x –module. But
dimC OX,x /mcX,x < ∞ implies dimC Fx < ∞.
√
To prove the claim assume Ix ⊂ q primary but q 6= mX,x . In this case V (I) ⊃
V (q) % {x}. This is a contradiction to the fact that Iy = OX,y for all y 6= x.
Definition 6.2.7.
(1) The sheaf S is said to be of finite type if for every point x ∈ X there is an open
neighborhood U of X and a surjective morphism of sheaves:
q
OX|U −→ S|U −→ 0.
(2) The sheaf S is said to be of relation finite type if for any open set U and any
q
morphism of sheaves: α : OX|U −→ S|U the kernel Ker(α) is a sheaf of finite type.
Lemma 6.2.8.
q (s1 ,...,sq )
OX|V −−−−−−→ S|V −→ 0.
In particular, if a sheaf of finite type has zero stalk in x, then the sheaf is identically zero
in some open neighborhood V of x.
234 6 The Principle of Conservation of Number
Proof. Because S is of finite type there exist an open neighborhood W of x, and sections
t1 , . . . ,ts generating S|V . But, because the
Psix generate the stalk at x, we can find germs
of holomorphic functions such that tix = j fijx sjx . Take an open neighborhood V ⊂ W
of x such that all these functions converge on V . Because the t′ s generate S , it follows
that the si also generate S on V .
The following Lemma is the basic Lemma, from which almost everything in this
section follows.
Proposition 6.2.10. Consider an exact sequence of sheaves on X:
ϕ1 ϕ2 ϕ3 ϕ4
S1 −→ S2 −→ S3 −→ S4 −→ S5 .
Os D
DD ψ
DD
DD
D!
ω Op C
CC
CC
β CC
C
ϕ1 ϕ2 ϕ3 !
Oq / S2 / S3 / S4
Thus ω(b) ∈ Ker(ϕ2 ) = Im(ϕ1 ). Hence there exists a c ∈ Os with ϕ1 (c) = ω(b).
But then (−c,b) ∈ K , and thus a = ψπ(−c,b) ∈ ψπ(K).
2 The proof shows that we only need that S2 , S3 are coherent, that S1 is of finite type, and that S5
is of relation finite typ.
6.2 Fundamental Properties of Coherent Sheaves 235
Then all sheaves S1 ,S2 ,S3 are coherent if two of them are coherent.
Corollary 6.2.12.
(2) Let α : S −→ T be a morphism of coherent sheaves. Then the sheaves Im(α), Ker(α)
and Coker(α) are coherent.
(3) Let S and T be coherent subsheaves of a coherent sheaf U . Then the sum S + T
and the intersection S ∩ T are coherent sheaves.
(4) Suppose that the structure sheaf OX is coherent. Then S is coherent, if and only
if for every x ∈ X there exists a small neighborhood U of x and an exact sequence:
q p
OX|U −→ OX|U −→ S|U −→ 0.
(1) The injectivity is easy: let ϕ : S|U −→ T|U be a map and assume that Ker(ϕx ) =
Sx then Theorem 6.2.2 implies that Ker(ϕ)|V = S|V for an open neighborhood of
x. This implies that ϕ is the zero map in HomO (S ,T )x . To show that the map is
surjective, we have to lift any map φx between germs to some open neighborhood
p
of x. This is easy for the case that S = OX , because we then just have to take
a lift in T of the images of the basis elements. Because S is finitely generated,
p
we can find a map ψ : O|U −→ S|U . By the remark made above, we can also find
p
a lift of the map ψx φx to χ : O|U −→ T|U , after possibly shrinking U . We have
Ker ψx ⊂ Ker χx . Because S is of relation finite type, Ker ψ is finitely generated,
and hence Ker ψU ⊂ Ker χU after possibly shrinking U . Therefore, χ induces the
p
desired map χ : S|U = O|U / Ker ψU −→ T|U . Note that in this proof we only made
use of the coherence of S .
236 6 The Principle of Conservation of Number
(2) It follows directly from the definition of coherence, that if S is coherent we (locally)
have an exact sequence Oq −→ Op −→ S −→ 0. We want to show that
0 // HomOx (Sx ,Tx ) // HomOx (Oxp ,Tx ) // HomOx (Oxq ,Tx ).
The sequence at the bottom is exact, because that is just a statement about rings
and modules. The left vertical arrow is an isomorphism by the first part. Because,
as already remarked, HomO (Op ,T ) is isomorphic to T p , the fact that the other
vertical arrows are isomorphisms is trivial. Therefore, the upper row is exact. As
this is true for all x ∈ X, the claim follows.
O −→ H omO (S ,S )
ϕ : OVp −→ S|V .
Step 3. Now suppose that S is OY –coherent. It follows that we can write S locally as
a cokernel. Thus for all points p ∈ Y there exists an open neighborhood V of p in Y and
an exact sequence
OVq −→ OVp −→ S −→ 0.
Again we write V = U ∩Y for an open set U in X. Then i∗ OV is OU –coherent, and there-
fore i∗ OVq and i∗ OVp are OU –coherent. It follows that i∗ S , as cokernel of two coherent
OU –modules, itself is OU –coherent.
Lemma 6.2.16. Suppose the structure sheaf OX is coherent and let S be a coherent
OX –module. Then the set A = {x ∈ X : Sx is not a free OX,x -module} is a proper
analytic subset of X.
Proof. Let x ∈ X and U be an open neighborhood such that S|U has a representation
q ϕ P
OX|U −→ OX|U −→ S|U −→ 0.
6.2.19. Let X be an affine variety, and M be a finitely generated K[X]–module. Show that the
` ´
f, defined by M
associated sheaf M f D(f ) = Mf , is a coherent OX –module.
6.2.20. Assume that OX is coherent. Prove that the support of a coherent sheaf S is an analytic
set, by using the exact sequence
q M p
O|U −→ O|U −→ S|U −→ 0
In this section we prove the four basic Coherence Theorems in complex analysis.
Theorem 6.3.1 (Oka). The sheaf OC n is coherent. More generally, for any complex
space X, the sheaf OX is coherent.
Proof.
Step 1. A sheaf S is coherent, if for every point there exists an open neighborhood of this
point such that S|U is coherent. Thus to show that OX is coherent, it suffices to show this
for a complex model space (D,OD ). Such a complex model space injects, by definition,
into an open subset of C n . By the extension principle 6.2.15 it suffices therefore to prove
the coherence of OC n . This will be done by induction on n. The case n = 0 is trivial, see
the second example of ??.
Step 2. For a short notation we write On instead of OC n and, since there is no possible
confusion, OU instead of OC n |U for an open subset U ⊂ C n . So assume that On−1
is coherent. The statement is local, so we look at a neighborhood of a point p ∈ C n .
Without loss of generality we may assume p = 0. As the sheaf On is obviously of finite
type, we only have to show that it is of relation finite type. Therefore, we have to show
that for any map of sheaves:
F =(f1 ,...,fs )
Ons −−−−−−−−→ On
the kernel Ker(F ) is ofP finite type. Finding elements in Ker(F ) means finding relations,
s
that is, r1 , . . . ,rs with i=1 ri fi = 0. After a linear change of coordinates (if necessary),
we may assume that f1 , . . . ,fs are regular in xn . As it is easy to relate the relations
between f1 , . . . ,fs and u1 f1 , . . . ,us fs for units ui , we may, by applying the Weierstraß
Preparation Theorem, assume that the fi are Weierstraß polynomials in xn . Let di be
the degree of fi in xn , and put d = max{di }. We may assume, without loss of generality,
that the degree of fs is d.
For any k we denote by On−1 [xn ]≤k the free On−1 –module of rank k +1 of polynomi-
als in xn of degree less than or equal to k. By induction, On−1 is coherent and, therefore,
the free On−1 –module On−1 [xn ]≤k is also coherent for all k by 6.2.12. By definition of
coherence, the kernel of the following map of On−1 –modules:
is finitely generated. This exactly means that Ker(F ) ∩ (On−1 [xn ]≤d )s is a finitely gen-
erated On−1 –module.
Step 3. It remains to show that in some small open neighborhood of 0, the sheaf of
s
relations Ker(F ) is as an On –module generated by Ker(F ) ∩ (On−1 [xn ]≤d ) . This means
s
the following: Take generators h1 , . . . ,hl of Ker(F ) ∩ On−1 [xn ]≤d . Take a small open
neighborhood U of 0 on which the h1 , . . . ,hl and the f1 , . . . ,fs converge. Then the germs
h1,p , . . . hl,p of the hi in p generate the kernel of the map Fp for all p ∈ U . To prove this,
first note that elements of the following type (so-called Koszul relations):
are in Ker(F ) ∩ On−1 [xn ]s≤d . Indeed, they are in Ker(F ), and also of degree less than or
equal to d in xn . We take an arbitrary point p = (p1 , . . . ,pn ) ∈ U . At the point p the
function fs,p is regular of order d′ ≤ d in xn − pn . Consider any relation between the fi
at the point p:
a1 f1,p + . . . + as fs,p = 0.
For i = 1, . . . ,s − 1 we perform Weierstraß division with respect to fs at p:
(a1 , . . . ,as ) = (c1 , . . . ,cs ) − (−fs,p ,0, . . . ,0,f1,p )q1 − . . . − (0, . . . ,0, − fs,p ,fs−1,p )qs−1 .
From this it follows that c1 f1,p + . . . + cs fs,p = 0 and, therefore, we expressed our relation
as an On –linear combination of elements of Ker(F ) ∩ On−1 [xn ]s≤d , as soon as we have
proved that cs is a polynomial of P degree less than or equal to d. This is easy, because we
s−1
have just proved that cs fs,p = − i=1 ci fi,p . The right-hand side is a polynomial in xn
′
of degree less than d + d . As we have assumed that the degree of fs is d, it follows that
cs has degree at most d′ ≤ d. This concludes the proof of Oka’s Theorem.
Theorem 6.3.2 (Oka-Cartan). Let X ⊂ U ⊂ C n be an analytic set. Then the ideal
sheaf I := I (X) of X is coherent.
Remark 6.3.3. Let the ideal I = (f1 , . . . ,fs ) ⊂ C {x1 , . . . ,xn } be a radical ideal , and
let (X, 0) = V (I), 0 . Take small enough representatives. Then it is a tautology that for
all a ∈ X
V (f1,a , . . . fs,a ) = (X,a)
but it is not clear that the ideal of (X,a) is generated by the fi,a . To put it in another
way, it is not clear a priori that the ideal (f1,a , . . . fs,a ) is a radical ideal in OX,a . That
this is indeed the case is the content of the Coherence Theorem of Cartan and Oka.
Proof of Theorem 6.3.2.
Step 1. Consider a point x ∈ X. Without loss of generality, we may assume that x =
0. Consider the ideal I0 ⊂ On,0 = C {x1 , . . . ,xn } of the germ (X, 0). As this ring is
Noetherian, this ideal is finitely generated. We take finitely many generators:
I0 = (f1 , . . . ,fs )
p
V
On V , the sheaf
S := (f1 , . . . ,fs ) : (g)
is coherent by 6.3.1, 6.2.8 and 6.2.14. Hence, S has finitely many generators, say h1 , . . . ,hu ,
for holomorphic functions hi defined on V . Since (6.1) was true for all points p ∈ X \ D,
it follows that gp ∈ (f1,p , . . . ,fs,p ) for all p ∈ V \ D, so that we deduce Sp = On,p for all
p ∈ V \ D. Thus, the zero set V (h1 , . . . ,hu ) ⊂ D ∩ V . As ∆ also vanishes on D, it follows
from the Nullstellensatz that there exists an r ≥ 0 such that ∆r ∈ (h1,p , . . . ,hu,p ) for all
p ∈ D ∩ V . From the defining properties of the hi it follows that ∆r g ∈ (f1,p , . . . ,fs,p ).
Let r ≥ 0 be minimal with this property. To prove the theorem, we have to show that
r = 0. Suppose the converse, that is, r > 0. But then we have ∆r−1 g ∈ (f1,p , . . . ,fs,p ) :
(∆) = (f1,p , . . . ,fs,p ), by (6.2), giving a contradiction.
Corollary 6.3.4. Let X ⊂ C n be an analytic space. The set of singular points Sing(X)
of X is an analytic subspace of X.
6.3 The Four Basic Coherence Theorems 241
Proof. Let x ∈ X. By applying Proposition 4.3.1, we may assume that (X,x) is irre-
ducible. Now we may apply the Jacobian Criterion, 4.3.6, and the coherence of ideal
sheaves, 6.3.2.
Theorem 6.3.5 (Finite Mapping Theorem). Consider a finite mapping π : X −→ Y of
complex spaces, and let S be a coherent OX –sheaf. Then π∗ S is a coherent OY –sheaf.
Proof. Using the Extension principle (6.2.15) it is enough to prove the theorem for ana-
lytic spaces, that is, both X and Y are reduced. Let y ∈ Y be a point. Since coherence
is a local property it is enough to prove that (π∗ S )|U is coherent for a sufficiently small
neighborhood U of y. We may assume that y = 0. The theorem is proved in several steps.
Step 1. We first consider the case that X is a small representative of (X, 0) = V (f ), 0 ,
where f ∈ C {x1 , . . . xn } is a Weierstraß polynomial of degree b in xn , Y an open subset
of C n−1 , π : X −→ Y the projection on the first n − 1 coordinates, and S = OX .
Let V ⊂ C be a small open neighborhood of 0. We choose an open neighborhood
U ⊂ C n−1 of 0 such that for all p ∈ U the b zeros (counted with multiplicity) of f (p,xn )
lie in V (continuity of roots, see 3.4.11). We may assume that
X = {(p,x) : p ∈ U, f (p,x) = 0} ⊂ U × V
b
OU (W ) −→ OU (W )[xn ] ⊂ OU×V (W × V ) −→ OX (W × V ) = π∗ OX (W )
b−1
X
(s0 , . . . sb−1 ) 7→ sj xjn .
j=0
We will prove that for all p ∈ U the induced map of the stalks
b
(6.3) OU,p −→ (π∗ OX )p
b
is an isomorphism. This will prove this step. Namely, as On−1|U is coherent by Oka’s
Theorem 6.3.1 , it follows that π∗ OX|(U×V ) is a coherent On−1|U –module.
Now let p ∈ U , x1 , . . . ,xt be the zeros of f (p,x
Ptn ) and b1 , . . . ,bt their multiplicities.
Put pi := (p,xi ) for i = 1, . . . ,t. In particular, i=1 bi = b. Using 6.1.18, we obtain
(π∗ OX )p = ⊕ti=1 OX,pi . To prove that (6.3) is surjective, consider an element (g1 , . . . ,gt ) ∈
Pb−1
(π∗ OX )p = ⊕ti=1 OX,pi . By Corollary 3.3.24 there exists an r = j=0 ri xjn , such that
r = gi in OX,pi for i = 1, . . . ,t. This shows the surjectivity. The injectivity of (6.3)
follows from the uniqueness of the decomposition of 3.3.24.
Step 2. We consider now the situation as in Step 1 but now S is an arbitrary coherent
sheaf. Because S is coherent, we have locally an exact sequence:
q p
OX −→ OX −→ S → 0.
q p
We proved in Step 1 that π∗ OX is coherent. Hence π∗ OX = (π∗ OX )q and π∗ OX are
coherent. Therefore, π∗ S as a cokernel is coherent.
Step 3. We now consider the case that i : X ⊂ V (f ) is a closed subspace, f regular in xn ,
Y an open subset of C n−1 , π ′ : V (f ) −→ Y the projection on the first n − 1 coordinates
and π = π ′ ◦ i.
Then i∗ S is coherent by the extension principle, and π∗ S = (π ′ ◦ i)∗ S = π∗′ (i∗ S )
is coherent by the previous step.
Step 4. We now consider the case that X is a small representative of (V (I), 0) ⊂ (C n , 0),
where C {x1 , . . . ,xn }/I is a finitely generated C {x1 , . . . ,xk }–module, Y is a small open
neighborhood of 0 in C k , and π : X −→ C k the obvious projection on the first k
coordinates. This case follows from Step 3 and induction on n − k.
Step 5. We now consider the general case. We can (locally) view any finite mapping as a
projection:
i / n
X C
π π′
j
Y / C k.
This is the graph construction, see 2.3.14 and Exercise 3.4.43. Here π ′ is the projection
on the first k coordinates. By the extension principle π∗ S is coherent if and only if
j∗ (π∗ S ) = π∗′ (i∗ S ) is coherent. But the last sheaf is coherent by Step 4.
For the proof of the final coherence theorem in this chapter, the following theorem
is of crucial importance.
Theorem 6.3.6 (Oka). Let X be an analytic space. Then the set of nonnormal points
of X forms an analytic subspace of X. In particular, the set of normal points is open in
X.
Proof. Consider a point 0 ∈ X, and let I ⊂ R := OX,0 be the ideal of the singular locus
of X at the point 0. Then Exercise 4.4.20, which is the local version of Theorem 1.5.13,
gives:
=
R ֒→ HomR (I,I) ⇐⇒ R is normal.
We now sheafify. We have proved in 6.3.2 that the ideal sheaf I of the singular locus (it
is an analytic space) is coherent. It follows from 6.2.13 that the sheaf HomOX (I ,I ) is
coherent, and that
HomOX (I ,I )p = HomOX,p (Ip ,Ip )
for all p ∈ X. It follows that the set of nonnormal points is given by the support of
the sheaf HomOX (I ,I )/OX . As this is the quotient of two coherent sheaves, it is a
coherent sheaf and, therefore, its support is an analytic set, see 6.2.5. This proves the
theorem.
Theorem 6.3.7 (Oka). Let (X,OX ) be an analytic space. The normalization sheaf O eX
eX,x at x by definition is the normalization of OX,x . Then the
is a sheaf whose stalk O
normalization sheaf OeX is OX –coherent.
6.4 The Principle of Conservation of Number 243
Here (Cs ,p) · (Ds ,p) denotes the intersection number of Cs and Ds in the point p, see
5.1.4.
This theorem gives a geometric interpretation of the intersection number. Suppose
that Cs and Ds intersect transversally, that is, the intersection number (Cs · Ds )p = 1
for all p ∈ Cs ∩ Ds . The theorem then says, that the intersection number of C and D is
equal to the number of intersection points appearing after a small perturbation of C and
D.
Example 6.4.2. Let C be given by f (x,y) = y 2 − x3 = 0, and D be given by g(x,y) =
y = 0. Then
(x(ti ,s),y(ti ,s)) : W × S −→ C 2 , x(ti ,0) = x(ti ), y(ti ,0) = y(ti ) for i = 1, . . . ,r.
Then we can take W and S so small that the image of (x(ti ,s),y(ti ,s)) is Q
an analytic space
r
in W × S given by Fi (x,y,s) = 0 with Fi (x,y,0)Q= fi . Then F (x,y,s) = i=1 Fi (x,y,s) is
r
the induced one-parameter deformation of f = i=1 fi .
Moreover for all sufficiently small neighborhoods U of 0 in W , there exists an open
neighborhood V of 0 in S such that:
(1) F converges on U × V .
(2) The only singular point of f on U is 0.
Denote for fixed s by Cs the curve {F (x,y,s) = 0} ∩ U , then for all s ∈ V we have
X
δ(C) = δ(Cs ,p),
p∈Cs
(check this). As the δ–invariant of an A1 –singularity is one, we see from the picture
that δ(C ∪ D, 0) = 4.
Theorem 6.4.5. Let f = f (x1 , . . . ,xn ) : (C n , 0) −→ (C , 0) be a germ of a holomorphic
function with isolated singularity, µ(f ) its Milnor number. Consider a deformation of f :
Then for all sufficiently small open neighborhoods U of 0 there exists an open neighborhood
V of 0 such that for all s ∈ V :
X
µ(f ) = µ(fs ,p)
p∈U
where µ(fs ,p) is the Milnor number of the germ of the holomorphic function:
Example 6.4.6. We consider the E7 –singularity given by f (x,y) = y(y 2 − x3 ), and the
deformation F (x,y,s) = (y − s)(y 2 − x2 (x − s)). The zero sets of f and fs are given
in the previous figures. In the picture of the zero set we see four A1 –singularities. But
since a function on a compact set must have a maximum and a minimum, we have that
in each compact region in the real plane bounded by the zero set of fs there must be
a singularity, too. One can in fact calculate that fs has seven A1 –singularities, so that
µ(f ) = 7.
How to prove that “an invariant is constant under deformation”, or, to put it in
another way, when do we have conservation of number? Usually, a numerical invariant
is defined by the vector space dimension of a certain module M over the local ring of a
singularity OX,0 . In the above theorems, we have respectively as local ring and as module:
∂f ∂f
(3) The local ring is On , and the module M is On /J(f ), where J(f ) = ( ∂x 1
, . . . , ∂x n
)
is the Jacobian ideal.
When deforming, the local ring OX,0 will be a quotient of the local ring OXS ,0 , and
similarly M is a factor module of MS . Both the local ring and the module are in fact
also C {s}–modules. In the above examples:
(1) OXS ,0 = C {x,y,s}/(F,G), MS = OXS ,0 .
eCS,0 /OCS ,0 .
(2) The local ring is OCS ,0 = C {x,y,s}/F , the module MS is O
(3) The local ring is C {x1 , . . . ,xn ,s}, and the module is MS = C {x1 , . . . ,xn ,s}/J(F ),
∂F ∂F
where J(F ) = ( ∂x 1
, . . . , ∂x n
) is the “relative” Jacobian ideal.
In order to be able to compare the invariants at different points in the parameter space
S, we need to sheafify.
Let the singularity (X, 0) be embedded in (C n , 0), and let U be a sufficiently small
open neighborhood of 0. Then we consider a representative X of (X, 0) in U . Furthermore,
we choose a small open neighborhood V of 0 in the parameter space S = C . Let π :
U × V −→ V be the canonical projection and XS ⊂ U × V be a complex space such that
XS ∩ π −1 (0) = X. For s ∈ V we put Xs := XS ∩ π −1 (s).
6.4 The Principle of Conservation of Number 247
U
X0 Xs
V
0 s
C {s}t ∼
= (f∗ M )s0 = ⊕ℓi=1 Mqi .
Restricting this to the point s0 ∈ V ′ and applying Exercise 6.1.35 implies that
Ct ∼
= ⊕ℓi=1 (M|Xs0 )qi .
s · Q = (A − LG)F + (B + LF )G.
6.4 The Principle of Conservation of Number 249
s · Q = G(x(t,s),y(t,s),s)
for some G ∈ C {x,y,s}. Plugging in s = 0 yields that G(x(t),y(t),0) = 0, so that G(x,y,0)
is divisible by f , say G(x,y,0) = Lf . We may now, without loss of generality, assume
that F (x,y,0) is equal to f . Now add 0 = −LF to the expression for s · Q. Then
s · Q = (G − LF )(x(t,s),y(t,s),s).
In the ring C {x,y,s} we can divide G − LF by s, so that Q = G−LF s (x(t,s),y(t,s),s).
Hence Q is already an element of OCS,0 , so that s is a nonzerodivisor, as was to be shown.
It remains to show that (M|Cs )p = O eC ,e /OCs ,p for s ∈ S and p ∈ Cs . Looking at the
s p
integral equations for x(t,s) and y(t,s) for fixed s it follows that the map n : W × {s} −→
Cs is finite. As we just proved that in particular OW ×{s} /OCs is a finite-dimensional
vector space it follows from Exercise 4.4.22 that n : W × {s} −→ Cs is the normalization
for all points in Cs . This is what we needed to show.
Proof of 6.4.5. This is analogous to the proof of 6.4.1, and left as Exercise 6.4.11.
250 6 The Principle of Conservation of Number
We may ask ourselves whether, for example, in the case of the Milnor number, a
deformation with only A1 –singularities always exists. This question is answered in the
following theorem.
Theorem 6.4.8. Consider a function f ∈ C {x1 , . . . ,xn } with an isolated singularity.
Let F : (C n × C n , 0) −→ (C , 0) be defined by
Then there exists a hypersurface (D, 0) ⊂ (C n , 0), such that for all small enough repre-
sentatives D and T of (D, 0) and (C n , 0), there exists a small neighborhood U of 0 in C n
such that for all t ∈ T \ D
has only A1 –singularities. The number of these A1 –singularities is equal to the Milnor
number µ(f ) of f .
Proof. Let U be a neighborhood of 0 in C n such that f converges on U and let t ∈ C n .
A point p ∈ U is a singular point of Ft exactly if all partial derivatives with respect to
the xi vanish. In short, if p is in the zero set of ∂F ∂Ft
∂x1 , . . . , ∂xn . We therefore consider the
t
n n
space (Σ, 0) in (C × C , 0) defined by the ideal
∂F ∂F ∂f ∂f
,..., = + t1 , . . . , + tn .
∂x1 ∂xn ∂x1 ∂xn
Let Σ be a representative of (Σ, 0), We want a condition for a point (p,t) in Σ to have an
A1 –singularity. For this, we look at the projection to the second factor
π : Σ −→ C n .
Suppose that given (p,t) ∈ Σ, there exists a neighborhood of (p,t) which is mapped
biholomorphically under π onto an open subset T of C n . Then we claim that Ft has an
A1 –singularity at p. Indeed, by assumption we have that the inclusion of stalks
OT,t ⊂ OΣ,(p,t)
6.4.12. For plane curve singularities (X, 0) and (Y, 0), prove Hironaka’s formula, see 5.4.11,using
the principle of conservation of number, in particular Theorems 6.4.1 and 6.4.3.
252 6 The Principle of Conservation of Number
6.4.13. In the Theorem of the Principle of Conservation of Number, replace the second con-
dition by the condition that M0 is a finitely generated C {s}–module, or, what is the same by
Nakayama, that (M|X0 )0 is a finitely generated C –vector space. Prove that
` ´ X ` ´
dimC (M|X0 )0 ≥ dimC (M|Xs )p .
p∈Xs′
6.4.14. An ordinary m–tuple point is by definition a plane curve singularity (X, 0) = ∪m i=1 (Xi , 0)
such that all (Xi , 0) are smooth, and the intersection number (X i , 0) · (Xj , 0) = 1 for all i 6= j.
` ´
Prove that the δ–invariant of an ordinary m–tuple point is m 2
using
(1) Hironaka’s formula 5.4.11 or
(2) the principle of conservation of number.
Find a C –basis of OX,
e 0 /OX,0 .
` ´
6.4.15. Consider an irreducible plane curve singularity (X, 0) = V (f ), 0 in (C 2 , 0) of multi-
plicity m. Without loss of generality we may suppose that (X, 0) is given by the parametrization
(3) Show that for small s 6= 0 the zero set of fs (x,y) := F (x,y,s) has two singular points:
1. an ordinary m–tuple point.
2. the strict transform of the blowing-up of the origin in C 2 .
For which values of t do these occur?
(4) Use the principle of conservation of number, in particular 6.4.3, to reprove Noether’s
formula 5.4.13.
(5) Generalize these results to reducible curves.
The Cohen-Macaulay property (CM for short) of a space is a local property, related to
the principle of conservation of number. In this section we deal with the first properties of
Cohen-Macaulay spaces, hopefully showing that it is a good (and, therefore, quite strong)
property a space can have. Nevertheless, we will prove that complete intersections (in
particular hypersurface singularities) are Cohen-Macaulay. First we introduce the notion
of depth, needed in order to define Cohen-Macaulay. We introduce these matters directly
for modules over a local ring, in order to get a more flexible development of the theory.
6.5 Cohen-Macaulay Spaces 253
C {x,y,z,w}/(xz,xw,yz,yw,y − w) ∼
= C {x,y,z}/(xz,xy,yz,y 2).
The examples four and five are quite typical. In fact, a Cohen-Macaulay space is
equidimensional, even unmixed. For the definition of unmixed, see 4.1.22. In the fifth
example, the intersection of the two components has dimension zero. A necessary con-
dition for the union to be Cohen-Macaulay is the condition that the intersection has
codimension one, as we will see in 6.5.19. At least from these examples we see that the
Cohen-Macaulay property is indeed a nontrivial one.
Proposition 6.5.7. Let (R,mR ) and (S,mS ) be local rings, and R ⊂ S be a finite
ring extension of local rings. Let M be an S–module. Suppose depthR (M ) = 0. Then
depthS (M ) = 0.
Proof. As depthR (M ) = 0, we have that m is an associated prime of M . Thus there
exists an x ∈ M with x · m = 0. Take an element g ∈ mS . We have to show that g is a
zerodivisor of M . By 1.5.11 g is integral over mR , so we have an integral equation:
g n + a1 g n−1 + . . . + an = 0, ai ∈ mR .
The second part of this corollary, applied to the case of a complete intersection is
usually called Macaulay’s Unmixedness Theorem.
Corollary 6.5.9. Let (X, 0) be an irreducible complete intersection singularity, and sup-
pose the singular locus of (X, 0) has codimension at least two. Then (X, 0) is normal.
Proof. By assumption, OX,0 satisfies (R1). We only need to show that it satisfies (S2)
by 4.4.11. The condition (S2) means that every hypersurface in (X, 0) has no embed-
ded primes. Because of Corollary 6.5.5 this hypersurface is also Cohen-Macaulay, and
therefore unmixed. Thus (S2) holds for Cohen-Macaulay singularities.
Theorem 6.5.10. Let (X, 0) be Cohen-Macaulay germ of an analytic space and π :
(X, 0) −→ (C k , 0) be a Noether normalization. Then there exist representatives X of
(X, 0) and U of (C k , 0) and a hypersurface D ⊂ U such that for all t ∈ U \ D the fiber
π −1 (t) consists of dimC OX,0 /mC k ,0 OX,0 many points.
6.5 Cohen-Macaulay Spaces 257
As in the proof of Theorem 6.4.8, we use the Local Parametrization Theorem 3.4.14
to find a hypersurface D ⊂ U with the following property: every point p ∈ U \ D has
a neighborhood V such that π −1 (V ) is a union of open subsets of X each of which
is mapped biholomorphically to V . This implies OX,qi ∼
= OU,p for all i and, therefore,
r = s.
Remark 6.5.11. The previous theorem does not hold if (X, 0) is not Cohen-Macaulay,
see the following examples. We can apply the theorem to the theory of intersection num-
ber. We consider f,g such that dimC C {x,y}/(f,g) < ∞, and deformations FS ,GS of f
and g, with OS,0 = Ok . Let (X, 0) = V (FS ,GS ), 0 . Then (X, 0) −→ (C k , 0) is a Noether
normalization. (X, 0) is a complete intersection, hence Cohen-Macaulay. The number of
points in Xs = V (Fs ,Gs ) for general s, therefore, is equal to dimC C {x,y}/(f,g).
Examples 6.5.12.
(1) Let I = (xy,x(x − z)). Then (X, 0) = V (I), 0 is the union of a plane and a line. The
projection (X, 0) −→ (C 2 , 0) which sends (a,b,c) to (b,c) is a Noether normalization. On
the level of rings we have a map
C {x,z} ֒→ C {x,y,z,u}/I.
There is also a completely algebraic method by which one can decide whether a
ring (or module) is Cohen-Macaulay. It is based on resolutions of modules. This method
is often used in the literature for proving that for a certain invariant the principle of
conservation of number holds.
Definition 6.5.13. Let (R,m) be a Noetherian local ring and M be a finitely generated
R–module. A free resolution of M is an exact sequence (infinite of finite)
αk+1 1 0 α α
(6.5) · · · −→ Fk+1 −−−→ Fk −→ · · · −→ F0 −→ M −→ 0,
with finitely generated free R–modules Fi for i ≥ 0. One says that M has finite projective
dimension if there exists an exact sequence
n α 1 0 α α
(6.6) 0 −→ Fn −→ Fn−1 −→ · · · −→ F0 −→ M −→ 0,
with finitely generated free R–modules Fi . The minimal n with Fn 6= 0 is called the
projective dimension of M , denoted by pd(M ) = pdR (M ). If M does not have finite
projective dimension, we write pdR (M ) = ∞. The resolution (6.5) is called minimal, if
for all k ≥ 1, the map αk maps Fk into mFk−1 .
Theorem 6.5.14. (Minimal) free resolutions of finitely generated modules M over a
Noetherian local ring (R,m) exist. If a module M has a finite minimal resolution
n α 1 0 α α
0 −→ Fn −→ Fn−1 −→ · · · −→ F0 −→ M −→ 0,
with Fn 6= (0), then n = pdR (M ). To put it in other words, two minimal resolutions have
the same length. Moreover, the rank of Fk is independent of the minimal resolution, and
is called the k–th Betti number of M .
Proof. Consider a minimal set of generators b1 , . . . ,bs0 of M . We get a surjective map
α
F0 := Rs0 −→
0
M
with α0 (a1 , . . . ,as0 ) = a1 b1 + . . . + as0 bs0 . The map α0 is surjective, because b1 , . . . ,bs0
generate M . By Nakayama, α0 induces an isomorphism
α0 : F0 /mF0 ∼
= M/mM.
Now put K2 = Ker(α1 ), take minimal number of generators of K2 etc. Continuing this
way, we get a minimal free resolution of M .
To show the invariance of the Betti-numbers, consider two minimal resolutions of
M:
6.5 Cohen-Macaulay Spaces 259
αk+1 1 0 α α
· · · −→ Fk+1 −−−→ Fk −→ · · · −→ F0 −→ M −→ 0,
′ α′k+1 α′ α′
· · · −→ Fk+1 −−−→ Fk′ −→ · · · −→
1
F0′ −→
0
M −→ 0.
As both α1 and α′1 both have entries in m, we get isomorphisms F0 /mF0 ∼ = M/mM ∼ =
F0′ /mF0′ . Therefore rank(F0 ) = rank(F0′ ).
′ ′
Pf1 , . . .′ ,fs0 be a basis of F0 . As the α0 (fj ) generate
Let e1 , . . . ,es0 be a basis of F0 , and
M , there exist βij ∈ R with α0 (ei ) = βij α0 (fj ). Consider the map h1 = (βij ) : F0 −→
F0′ . So we get a commutative diagram
α0
F0 /M
h1
α′0
F0′ / M.
Modulo m the map h1 : F0 /mF0 −→ F0′ /mF0′ is an isomorphism, that is, det(βij )
mod m 6= 0. Then det(βij ) is a unit in R which implies that h1 is an isomorphism.
Hence h1 induces an isomorphism
h1 : Ker(α0 ) = K1 −→ K1′ = Ker(α′0 ).
As α2 and α′2 have entries in m, and we have surjections F1 −→ K1 and F1′ −→ K1′ , it
follows that rank(F1 ) = rank(F1′ ), etc.
Examples 6.5.15.
(1) An R–module M has projective dimension zero if and only if M is a free R–module.
(2) Let R = C {x,y}, I = (x,y), and M = R/I. We will show that
(−y
x ) (x y) res
0 −→ R −−−→ R2 −−−→ R −−→ R/I −→ 0
is a free resolution of R/I as R–module. Here res is just taking residue classes, and is
obviously surjective. The kernel of res is generated by x and y. We have to compute the
(x y)
kernel of R2 −−−→ R. So let (α,β) be in the kernel. Then x · α + y · β = 0. As (x,y) is a
regular sequence in C {x,y} it follows that α = y · m, for a certain m. Similarly β = x · ℓ
for a certain ℓ. Then xy · m + xy · ℓ = 0, hence ℓ = −m. Therefore, (α,β) = ℓ(−y,x). So
(−y
x ) (x y)
R −−−→ R2 −−−→ R
(−y
x )
is exact. Moreover, it is easy to see that R −−−→ R2 is injective.
(3) We consider the ring R = C {x,y}/(xy), and M = C {y} ⊕ C {x} = R/(x) ⊕ R/(y).
Then M = R,e the normalization of R. M can be described as the cokernel of the map
0
α1 =(x
0 y)
R2 −−−−−−→ R2 −→ M −→ 0.
The kernel of α1 is easily seen to be generated by (y,0) and (0,x). Hence we get an exact
sequence
0
α2 =(0y x0) α1 =(x
0 y)
R2 −−−−−−→ R2 −−−−−−→ R2 −→ M −→ 0.
Going on like this, we see that M has a minimal free resolution which is 2-periodic. In
particular we see that M does not have finite projective dimension.
260 6 The Principle of Conservation of Number
So one sees that a finitely generated module over a local ring does not, in general,
have finite projective dimension. But it is true in the case that R is a regular local ring.
The analogous statement for graded modules over a polynomial ring is a famous theorem
of Hilbert.
Theorem 6.5.16 (Hilbert’s Syzygy Theorem). Let (R,m) be a regular local ring of di-
mension n, and M be a finitely generated R–module. Then pdR (M ) ≤ n. In particular,
the projective dimension of M is finite.
For the proof we need the following lemma.
Lemma 6.5.17. Let (R,m) be a Noetherian local ring, let M be a finitely generated R–
module, and x ∈ m be a nonzerodivisor of R and M . Consider a minimal free resolution
of M
0 −→ Fn −→Fn−1 −→ · · · −→F0 −→M −→ 0.
Then the induced sequence
· · · −→ Fk −→ Fk−1 −→ · · · −→ F1 −→ F0 −→ M −→ 0
· · · −→ Fk −→ Fk−1 −→ · · · −→ F1 −→ K −→ 0,
0 −→ K −→ F0 −→ M −→ 0,
with K = Ker(F0 → M ). Take any element x ∈ m \ m2 . As R is an integral domain, see
4.3.17, x is a nonzerodivisor of F0 . As K is a submodule of F0 , x is a nonzerodivisor of
K. Therefore x is a nonzerodivisor of Fi for all i ≥ 1 and of K, so that we get an exact
sequence
0 −→ M1 −→ M2 −→ M3 −→ 0.
6.5 Cohen-Macaulay Spaces 261
Then
depth(M2 ) ≥ min depth(M1 ), depth(M3 ) .
In case this inequality is strict we have, furthermore,
depth(M1 ) = depth(M3 ) + 1.
Proof. Suppose that all three modules have positive depth. From Prime Avoidance, see
Exercise 6.5.30, we can find an f ∈ R which is a nonzerodivisor of M1 ,M2 and M3 . By
the Snake Lemma 1.2.12, it follows that
0 −→ M1 /f M1 −→ M2 /f M2 −→ M3 /f M3 −→ 0
is also exact. Since the depth drops by one if one divides out a nonzerodivisor (Corollary
6.5.5), we can immediately reduce to the case that the depth of one of the three modules
M1 ,M2 or M3 is zero. So we distinguish three cases.
Case 1. Suppose depth(M1 ) = 0. We will prove that depth(M2 ) = 0. But this is obvious,
as any nonzerodivisor of M2 can be viewed as a nonzerodivisor of M1 , as M1 may be
viewed as a submodule of M2 .
Case 2. Suppose depth(M2 ) = 0. Then we have to show that either depth(M1 ) = 0 or
depth(M3 ) = 0. Suppose not, and let (by prime avoidance) f ∈ R be a nonzerodivisor of
M1 and M3 . Therefore, the multiplication by f on M1 and M3 has a trivial kernel. From
the Snake Lemma it follows that multiplication with f on M2 has trivial kernel, so f is
a nonzerodivisor of M2 . But as depth(M2 ) = 0 this cannot be the case, contradiction.
Case 3. Suppose depth(M3 ) = 0. We have to show that if, moreover, depth(M2 ) > 0,
then depth(M1 ) = 1. Take a nonzerodivisor f of M2 . As M1 is a submodule of M2 it is
also a nonzerodivisor of M1 . Hence, depth(M1 ) ≥ 1. By the Snake Lemma we obtain an
inclusion
(6.8) 0 −→ M ′ −→ M1 /f M1 ,
·f
where M ′ := Ker(M3 −→ M3 ). As depth(M3 ) = 0, f is a zerodivisor. Thus M ′ 6= 0. One
even has that there exists an element g ∈ M3 \ {0} with mg = 0. Hence, in particular, g ∈
M ′ . Any nonzerodivisor of M1 /f M1 would give a nonzerodivisor of M ′ by the inclusion
(6.8). But M ′ cannot have one by definition of g ∈ M3 . Therefore, depth(M1 /f M1) = 0,
and depth(M1 ) = 1.
Corollary 6.5.19. Let I,J ⊂ On be radical ideals. Suppose that (X, 0) = V (I), 0 and
(Y, 0) = V (J), 0 both have dimension k. Suppose (X, 0) intersects (Y, 0) properly, that
is, dim (X ∩ Y ), 0 ≤ k − 1. Suppose (X, 0) and (Y, 0) are Cohen-Macaulay. Then
(X ∪ Y, 0) is Cohen-Macaulay ⇐⇒ (X ∩ Y, 0) is Cohen-Macaulay of dimension k − 1.
Proof. Look at the exact sequence:
0 −→ On /I ∩ J −→ On /I ⊕ On /J −→ On /(I + J) −→ 0.
From the assumption dim(X ∩ Y, 0) ≤ k − 1 it follows that depth(On /(I + J)) ≤
k − 1. Because (X, 0) and (Y, 0) are Cohen-Macaulay of dimension k, we know that
depth(On /I ⊕ On /J) = k. The Depth Lemma gives that
depth(On /(I ∩ J)) = depth(On /(I + J)) + 1,
which can be interpreted as the statement of the theorem.
262 6 The Principle of Conservation of Number
is exact. Because the projective dimension stays the same if one divides out a nonzero-
divisor, see 6.5.17, and the depth drops by one if one divides out a nonzerodivisor, the
theorem follows by induction on depth(R).
Case 2. Finally, suppose that depth(M ) = 0 and depth(R) > 0. The case pdR (M ) = 0 is
trivial, as M is then a free R–module. Otherwise, consider a minimal a presentation
0 −→ K −→ F −→ M −→ 0
with F free and K 6= 0. Then pdR (K) = pdR (M )−1. Since depth(M ) = 0, the inequality
in the Depth Lemma is strict, so that depth(K) = depth(M ) + 1 = 1. As depth(R) −
depth(K) = depth ∗R) − depth(M ) − 1, the Auslander Buchsbaum Formula holds for
K by induction, so depth(K) + pdR (K) = depth(R). Hence depth(M ) + pdR (M ) =
depth(R).
Corollary 6.5.21. Let I ⊂ C {x1 , . . . ,xn } = R be an ideal, (X, 0) = V (I), 0 . Consider
a minimal free resolution of R/I as R–module
which shows that R as a O–module has projective dimension three. It follows from the
Auslander-Buchsbaum Formula that, as O–module, R has depth one. It follows that R
(as R–module) has depth one. We see again that R is not Cohen–Macaulay.
The following corollary of the Auslander-Buchsbaum formula will be used in the
proof of the Hilbert-Burch Theorem 6.5.26.
Corollary 6.5.23. Let (R,m) be a local ring and M be an R–module. Then
pdR (M ) ≥ depth Ann(M ),R .
Proof. We may assume that pdR (M ) < ∞. The proof is by induction on depth(M ). If
depth(M ) = 0, then, by the Auslander Buchsbaum Formula, pdR (M ) = depth(R) ≥
depth Ann(M ),R .
We now assume that depth(M ) > 0. In particular, from the Auslander-Buchsbaum
Formula, it follows that d := depth(R) > 0. Let 0 −→ Fn −→ · · · −→ F0 −→ M −→ 0
be a minimal free resolution of M . Choose a regular sequence x1 , . . . ,xd in R such that
xr+1 , . . . ,xd is a maximal regular sequence in Ann(M ). Since depth(M ) > 0, there exists
a nonzerodivisor of M . Thus in particular x1 is a nonzerodivisor of R/ Ann(M ). Hence
we have that r ≥ 1 and, by renumbering, we may suppose x1 is a nonzerodivisor of M .
We obtain, using 1.2.35, the exact sequence
Proof.
Step 1. Assume the complex is exact. We have to prove that depth Irν (ϕν ),R ≥ ν.
We use induction on d = depth(R).
The case d = 0 is not difficult. The exactness of the
complex gives that pdR Im(ϕν ) < ∞ for ν = 1, . . . ,n. This implies that pdR Im(ϕν ) =
0 (Auslander-Buchsbaum 6.5.20), that is, Im(ϕν ) is free and of rank rν . By 1.4.32 Irν (ϕν )
contains a nonzerodivisor. As depth(R) = 0, the only nonzerodivisors
are the units.
Therefore Irν (ϕν ) = R and, consequently, depth Irν (ϕν ),R = ∞.
Now assume that d > 0. By Exercise 1.3.22 the sequence 0 −→ Fn ⊗Q(R) −→ · · · −→
F0 ⊗ Q(R) is exact. By the previous lemma depth Irν (ϕν ),R ≥ 1 for all ν. Therefore we
can find a nonzerodivisor x ∈ ∩nν=1Irν (ϕν ). If x is a unit then Irν (ϕν ) = R for all ν and
thus by definition depth Irν (ϕν ),R = ∞. Otherwise, x ∈ m is a nonzerodivisor, so that
by 1.2.35 we get that
0 −→ Fn /xFn −→ · · · −→ F1 /xF1
is exact. The induction hypothesis implies that depth Irν (ϕν ⊗ idR/(x) ),R/(x) ≥ ν − 1
for ν = 2, . . . ,n. This implies depth Irν (ϕν ),R ≥ ν because of 6.5.5 and the definition
of the Fitting ideals, see 1.2.18.
Step 2. For the converse we consider three cases.
Case 1. This is the case that d = depth(R) = 0. Then the only nonzerodivisors in R are
units. Thus R = Q(R) and the statement follows from the previous lemma.
Case 2. The case that d = depth(R) ≥ 2. This is proved by induction on the dimension
of R and on n. The case n = 0 is trivial, as obviously the sequence 0 −→ F0 is exact.
The case dim(R) ≤ 1 follows because then depth(R) ≤ 1, hence we are in the first case
or in the third case.
Now assume that dim(R) > 0 and n > 0. By induction hypothesis we may suppose
that
0 −→ Fn −→ · · · −→ F1
is exact. Let M = Im(ϕ2 ) and K = Ker(ϕ1 ). It remains to prove that K/M = 0. Let
p 6= m be a prime ideal. By Exercise 1.3.22 the localization of the sequence
is exact. As dim(Rp ) ≤ dim(R) − 1, it follows from the induction hypothesis with respect
to the dimension of the ring that (K/M )p = 0 for all prime ideals p ⊂ R different from
the maximal ideal m. By Exercise 1.4.54 this implies that a certain power of m annihilates
K/M . Thus K/M is a module of dimension zero. Hence depth(K/M ) = 0.
We consider the exact sequences
0 −→ M −→ F1 −→ F1 /M −→ 0,
gives that depth(F1 /M ) ≥ 1. Note that also depth(F1 /K) ≥ 1. This holds because we
have an embedding F1 /K ⊂ F0 . Now assume that K/M 6= 0. We apply the Depth Lemma
to the exact sequence
0 −→ K/M −→ F1 /M −→ F1 /K −→ 0.
F1 ∼
= Im(ϕ2 ) ⊕ F1′
for a suitable free module F1′ (1.3.8). It remains to prove that the restriction of ϕ1 to F1′
is injective. This is trivial if F1′ = 0. By 6.5.24 we have that rank(F1′ ) = r1 . As in the
second case we have that Ker(ϕ1 |F1′ ) is annihilated by a power of the maximal ideal m
and thus, if Ker(ϕ1 |F1′ ) 6= 0, it follows that depth(Ker(ϕ1 |F1′ )) = 0, and we have an exact
sequence
0 −→ Ker(ϕ1 |F1′ ) −→ F1′ −→ Im(ϕ1 ) −→ 0
As F1′ has depth one, it follows from the Depth Lemma that depth(Ker(ϕ1 |F1′ )) > 0. This
is a contradiction. Therefore, ϕ1 |F1′ is injective and 0 −→ Fn −→ · · · −→ F0 is exact.
α∗ α∗
1
0 −→ R −→ F1∗ −→
2
F2∗ ,
using that I1 (α∗1 ) = I1 (α1 ) = Irank(F2 ) (α2 ), to obtain that this complex is exact. Now
consider the commutative diagram
α∗ α∗
R
1
/ F1∗ 2
/ F2∗
σ
α∗ α∗
0 /R 1
/ F1∗ 2
/ F2∗
The map σ with α∗1 ◦σ = α∗1 exists because of the exactness of the second row. Because σ ∈
HomR (R,R) there exists an a ∈ R such that σ(x) = ax for all x ∈ R. It remains to prove
that a is a nonzerodivisor. So suppose x · a = 0 for some x ∈ R. This implies xI1 (α1 ) = 0.
Therefore, I1 (α1 ) consists of zerodivisors. This is not possible, as the sequence 0 −→
α2 α1
F2 −→ F1 −→ F0 is exact and thus depth I1 (α1 ),R ≥ 1 by Theorem 6.5.25.
To prove (1.1) note that we have now proved that I contains a nonzerodivisor. Hence
I ⊗ Q(R) = Q(R) and R/I ⊗ Q(R) = 0. We tensorize the exact sequence with Q(R) and
obtain by Exercise 1.3.22 an exact sequence
This implies rank(F1 ) = rank(F2 ) + 1. It remains to prove (1.2). Put I ′ = Irank(F2 ) (α2 ).
By dividing by a we get an exact sequence
α1
α
2
0 −→ F2 −→ a
F1 −→ R −→ R/I ′ −→ 0.
Obviously I ′ = Ann(R/I ′ ), so that from 6.5.23 we obtain 2 ≥ pdR (R/I ′ ) ≥ depth(I ′ ,R).
On the other hand, by 6.5.25, we have depth(I ′ ,R) ≥ 2. This implies depth(I ′ ,R) = 2.
Exercises
6.5.27. Suppose M is a free R–module. Then depth(M ) = depth(R).
6.5.29. Let R ⊂ S be a finite ring extension of local rings, and M be an S–module. Show that
depthR (M ) = depthS (M ).
6.5.30. Let M,N be R–modules of positive depth. Show that there exists an f ∈ R which is a
nonzerodivisor for both M and N . Generalize this statement to finitely many modules.
(Hint: Use Exercise 1.4.52 and Prime Avoidance 1.1.13.)
6.5.31. The assumption on R to be local in Lemma 6.5.3 is an essential assumption! Show that
in the ring C [x,y,z]/(xz − z) the sequence x,xy − y is a regular sequence, but xy − y,x is not a
regular sequence.
6.5.32. Consider the map f (s,t) = (s,st,t2 ,t3 ) = (x,y,z,u) of Exercise 2.3.18. The image is
given by the ideal I = (yz − ux,y 2 − zx2 ,z 2 x − uy,z 3 − u2 ), as shown in that exercise. Let
R = C {x,y,z,u}. Show that
268 6 The Principle of Conservation of Number
0 1
0 u1 y u xz z 2
B −z 0 −u 0 C
@ −y A @ A
−z −x z −y u
x 4 0 −x 0 −y (yz−ux,y 2 −zx2 ,z 2 x−uy,z 3 −u2 )
−−−−−−−−−−−−−−→ R4 −−−−−−−−−−−−−−−−−−−−−−−→ R −→ R/I −→ 0
0 −→ R −−−−−→ R −
6.5.34. Let R = C `{x,y,z},I´ = (xy,xz). Compute a free resolution of R/I as R–module. Con-
clude that (X, 0) = V (I), 0 is not Cohen-Macaulay.
7 Standard Bases
Let an ideal I ⊂ C {x} be given. In this chapter we will define and study standard bases
of the ideal I. The main idea here is that one wants to have a “good” representative for
any coset of I. In fact, the Weierstraß Division Theorem provides us with the simplest
example of a standard basis. Consider f ∈ C {x} which is regular of order b in xn . For any
g ∈ C {x} we have a unique representation g = qf + r with r a polynomial of degree less
than b. From the uniqueness statement in the Weierstraß Division Theorem, it follows
without difficulty that the remainder r only depends on the coset g + (f ) of (f ). Of
course, the remainder r depends on the choice of xn .
In order to generalize, we would like to have division with remainder through an
ideal. The “choice” of xn is now replaced by the choice of a so-called monomial ordering
on the monomials of C {x}. Having such a monomial ordering, one can talk about the
leading term L(f ) of f , and the ideal L(I) = (L(f ) : f ∈ I \ {0}) of leading terms of
I. A standard basis of I is a set of elements f1 , . . . ,fm of I such that L(f1 ), . . . ,L(fm )
generate L(I). It is quite trivially true that a standard basis exists. Here one uses only
the fact that C {x} is Noetherian. It also turns out that a standard basis of I generates
the ideal I. Standard bases of ideals in C {x} generated by polynomials can be computed
effectively. The algorithms are explained in the exercises. For example, the computer
algebra system SINGULAR can be used for computation (cf. [Singular 2000]).
The first thing one would have is a kind of division with remainder through the fi ,
that is, one would like for any f ∈ C {x} a representation
m
X
(7.1) f= qi fi + r
i=1
where no term of r is divisible by the leading term of any fi . This replaces the condition
that r is polynomial of degree less than b in the Weierstraß Division Theorem. In fact,
we will give alternative proofs of the Weierstraß Division and Preparation Theorems. In
order to control the convergence of the qi and r, we will define
P seminorms1 on C {x}
n α
or even C [[x]] depending on ρ ∈ R+ defined by kf kρ := α |fα |ρ . This is also the
norm which is used in the books of Grauert and Remmert, see [Grauert-Remmert 1971],
and [Grauert-Remmert 1984] to give proofs of the Weierstraß Division and Preparation
Theorem. It turns out that if f1 , . . . ,fm is a standard basis, then the r is uniquely
determined, and is called the normal form NF(f ) = NF(f |I) of f . It depends, of course,
on the monomial ordering. In particular, it follows that f ∈ I if and only if NF(f ) = 0.
This will be used in an essential way in the proof of Grauert’s Approximation Theorem
in Chapter 8. We will give some applications in Section 7.3.
More generally, there is a theory of standard bases of modules. As it is not more
difficult, we will treat the general case from the beginning.
1 Here semi means that the norm might take infinite value.
270 7 Standard Bases
Definition 7.1.1. Let M = {xα |α ∈ Zn+ } be the set of monomials in C [x]. A monomial
ordering is a total ordering on M (respectively Zn+ ) compatible with the semi–group
structure, that is, xα > xβ implies xα+γ > xβ+γ .
We assume furthermore that the orderings considered here are well–orderings, that
is, every set of monomials has a minimal element with respect to the ordering. This is
equivalent to the property that 1 is the smallest monomial (cf. Exercise 7.1.9).
Examples 7.1.2.
Let w = (w1 , . . . ,wn ) ∈ Rn and wi > 0, i = 1, . . . ,n. For α ∈ Zn+ define |α|w =
P n
i=1 αi wi (we shall also simply use |α| = |α|w if w is fixed).
xα > xβ if and only if |α| > |β| or |α| = |β| and xα > xβ
deglex lex
Definition 7.1.3. Now let > be a monomial ordering. Let f ∈ C {x} be a power series.
Then we may write X
f= ai xαi
i≥1
We wish to define orderings also for the monomials of the free module C {x}N . For a
compatible notation we choose, as a basis, the vectors (0, . . . ,1,0 . . . 0) and identify them,
i
as before, with xi−1
n+1 , xn+1 a new variable.
7.1 The Division Theorem 271
PN −1
Definition 7.1.4. Let < be a monomial ordering on C {x}. Let C {x}N = i=0 C {x}xin+1
and M = {xα |α ∈ Zn+1 N
+ ,0 ≤ αn+1 ≤ N −1} be the set of monomials of C {x} . A module
ordering <m is a total ordering of M compatible with the ordering < on C {x}, that is
(1) f,f ′ ∈ M and f <m f ′ implies xα f <m xα f ′ .
(2) f ∈ M and xα < xβ implies xα f <m xβ f .
Exercise 7.1.11 gives several examples for module orderings.
From now on in this chapter we use the following ordering:
Let w = (w1 , . . . ,wn+1 ) ∈ Rn+1
+ be fixed and < be the degree lexicographical ordering.
For xα and xβ ∈ M we define xα < xβ if and only if |α| < |β| or if |α| = |β| then α < β
with respect to the lexicographical ordering.
As in Definition 7.1.3 we define for f ∈ C {x}N the leading monomial L(f ), the
leading coefficient C(f ), the tail, tail(f ) and the leading module L(I) for a submodule
I ⊂ C {x}N .
Let m,m′ ∈ M be two monomials, m = xα , m′ = xβ . We define m | m′ if αn+1 =
βn+1 and xα | xβ .
The aim of this Chapter is to obtain information about ideals by studying their lead-
ing ideals (respectively modules) which are generated by monomials and are, therefore,
simpler.
Definition 7.1.5. Let I ⊂ C {x}N be a submodule, S = {f1 , . . . ,fm } ⊂ I is a standard
basis of I if L(I) = L(f1 ), . . . ,L(fm ) .
A standard basis is reduced if C(fi ) = 1 for all i and L(fi ) does not divide any
monomial occurring in fj for i 6= j and occurring in tail(fi ).
Example:. Let I = (x3 + y 2 ,y), w = (1,2), then {x3 + y 2 ,y,y 5 } is a standard basis of I
in C {x} but not a reduced standard basis of I, {x3 ,y} is a reduced standard basis.
Remark:. Standard bases were introduced by Hironaka (cf. [Hironaka 1964]), Grauert
(cf. [Grauert 1972]) for the power series case and Buchberger (cf. [Buchberger 1965]) for
polynomial rings. In case of polynomial rings, they are called Gröbner bases.
Proposition 7.1.6. Let I ⊂ C {x}N be a submodule, then I has a standard basis.
Proof. L(I) is, by definition, the submodule in C {x}N generated by the leading monomi-
als of the elements of I. C {x} is Noetherian and, therefore L(I) is generated by finitely
many such leading monomials, say xα1 , . . . ,xαm . Now choose f1 , . . . ,fm ∈ I such that
L(fi ) = xαi then {f1 , . . . ,fm } is a standard basis of I.
Later we shall see that a standard basis of I generates the module I.
We shall also prove that reduced standard bases exist and are uniquely determined.
Therefore, we need the notion of a normal form of an element f ∈ C {x}N with respect
to a set S = {f1 , . . . ,fm } of elements of C {x}N , that is, a representation in C {x}N
m
X
f= qi fi + r
i=1
Proof.
Special case: fi = xαi , i ∈ {1, . . . ,m}, let ∆ = hα1 , . . . ,αm i ⊂ Zn+1
+ be the Zn+ –semi–
module generated by α1 , . . . ,αm . We define inductively a partition of ∆ by
∆1 = hα1 i
∆i = hαi i ∩ (∆ r hα1 , . . . ,αi−1 i).
P
Let f = fα xα , we define
X
r := fα xα
α∈∆
/
1 X
qi = fα xα
xαi
α∈∆i
P
then f = qi fi + r. Let kf kρ < ∞, we choose ρ = ρ, then krkρ ≤ kf kρ and kqi kρ kfi kρ ≤
kf kρ . We shall use the construction of this step in the next one. Therefore, we define for
S = {xα1 , . . . ,xαm } being fixed maps
qi : C {x}N −→ C {x}N
r : C {x}N −→ C {x}N
1 P P P
by qi (f ) = x αi α∈∆i fα xα resp. r(f ) = α∈∆
/ fα xα and obtain f = qi (f )xαi + r(f ).
General case: Let L(fi ) = xαi and assume C(fi ) = 1. Let S = {xα1 , . . . ,xαm }. We define
a sequence νi ∈ C {x}N by
2 This condition is a kind of minimality condition. Note that in a sum one can always add two terms
which cancel. The condition written here, avoids this phenomenon.
7.1 The Division Theorem 273
ν0 = f
m
X
νj+1 = νj − qν (νj )fν − r(νj ).
ν=1
P
∞
The idea of the proof is now to define ν = νj and prove that
j=0
P
• f= qj (ν)fj + r(ν).
• ν ∈ C {x}N .
Lemma 7.1.8. Let g1 , . . . ,gm ∈ C {xs+1 , . . . ,xn }N for some s ≥ 0. Then, for every ε > 0
and ρ ∈ Rn+1 , there exist ρ ∈ Rn+1
+ such that ρ ≤ ρ and ρi = ρi for i = 1, . . . ,s and for
all λ, 0 < λ < 1,
k tail(gi )kλ◦ρ < εkL(gi )kλ◦ρ , i = 1, . . . ,r.
We shall prove the Lemma later.
ε
Now let 1 > ε > 0 be given. We apply the Lemma to f1 , . . . ,fm and m and choose
ε αi
ρ such that ktail (fP)k
i λ◦ρ < m (λ ◦ ρ) and ρ ≤ ρ and ρ i = ρ i for i =
Pm1, . . . ,s.
m
Now, kνj+1 kλ◦ρ ≤ i=1 k tail(fi )kλ◦ρ kqi (νj )kλ◦ρ because νj+1 = − i=0 qi (νj ) tail(fi ).
1
By definition of the qi we have kqi (νj )kλ◦ρ ≤ (λ◦ρ)αi kνj kλ◦ρ and, therefore, kνj+1 kλ◦ρ ≤
εkνj kλ◦ρ .
Pi
Let si := j=0 νj then
i
X i
X
ksi kλ◦ρ = k νj kλ◦ρ ≤ kνj kλ◦ρ
j=0 j=0
i
X
≤ εj kν0 kλ◦ρ
j=0
1
≤ kf kλ◦ρ .
1−ε
This implies
1
kνkλ◦ρ ≤ kf kλ◦ρ .
1−ε
Especially,
and
kr(ν)kλ◦ρ ≤ kνkλ◦ρ
implies
1
kL(fi )kλ◦ρ kqi (ν)kλ◦ρ ≤ kf kλ◦ρ
1−ε
1
kr(ν)kλ◦ρ ≤ kf kλ◦ρ .
1−ε
274 7 Standard Bases
Now
∞
X
f= (νj − νj+1 )
j=0
X∞ m
X
= νj − νj + qℓ (νj )fℓ + r(νj )
j=0 ℓ=1
Xm X∞ ∞
X
= qℓ (νj )fℓ + r(νj ).
ℓ=1 j=0 j=0
P∞ P∞
But j=0 qℓ (νj ) = qℓ (ν) and j=0 r(νj ) = r(ν) and, therefore, the first part is proved.
P∞
Now L qi (ν)fi = L j=0 qi (νj )fi ≥ L qi (ν0 )fi = L qi (f )fi because L(νj+1 ) >
L(νj ). Therefore,
L qi (ν)fi ≥ L qi (f )fi ≥ L(f ).
This proves the Theorem.
Proof of the Lemma. Because the g1 , . . . ,gm do not depend on x1 , . . . ,xs we have a free
choice for ρ1 , . . . ,ρs . So we may assume that s = 0.
P (i)
Let ε > 0 be given and assume that C(gi ) = 1 for all i. Let gi = α gα xα and
(i)
L(gi ) = xα . We have to prove that there exist ρ ∈ Rn+1 + such that for all 0 < λ < 1
(i)
sup{(2|α| + 2)n+2 | gα(i) | (λ ◦ ρ)α } < ε(2|α(i) | + 2)n+2 (λ ◦ ρ)α .
(i)
We choose k ∈ {1, . . . ,n + 1} maximal such that for at least one i and one α with gα 6= 0
(*) holds.
We choose any ρk+1 , . . . ,ρn+1 ∈ R+ smaller than 1 and choose ρk such that
(i) (i) (i)
(i) α α α α α
|gα | ρα n+1 n+1 k+1 n+1 (i)
k ρk+1 · . . . · ρn+1 < ερk ρk+1 · . . . · ρn+1 for all i and α such that |α| = |α |,
k k
(i)
gα 6= 0 and (*) holds, that is,
α −αk
(i)
ε α
(i)
−αk+1 α
(i)
−αn+1
(∗∗) ρk k < (i)
k+1
· ρk+1 n+1
· . . . · ρn+1 .
|gα |
7.2 Characterizations and Properties of Standard Bases 275
Prove that <m is a module ordering. Notice that the following module orderings are often used:
(1) degree orderings w1 > 0, . . . ,wn+1 > 0;
(2) elimination orderings w1 = · · · = wn = 0.
7.1.12. For xα and xβ ∈ M we define xα <m xβ if and only if
• x1α1 · . . . · xα β1
n < x1 · . . . · xn or
n βn
Definition 7.2.1. Let S = {f1 , . . . ,fm } ⊂ C {x}N be Pan ordered set. For f ∈ C {x}
N
We will show that if S is a standard basis however, then the normal form does not
depend on the ordering. First the definition.
Definition 7.2.3. Let I ⊂ C {x}N be a submodule. Then S = {f1 , . . . ,fm } ⊂ I is called
a standard basis of I if L(I) = L(f1 ), . . . ,L(fm ) . A standard basis is called reduced if
C(fi ) = 1 for all i and L(fi ) does not divide any monomial occurring in fj for i 6= j and
occurring in Tail(fi ).
Example 7.2.4. Let I = (x3 + y 2 ,y), w = (1,2). Then {x3 + y 2 ,y,y 5 } is a standard
basis of I in C {x} but not a reduced standard basis of I. The standard basis {x3 ,y} is
reduced.
Remark 7.2.5. Standard bases were introduced by Hironaka (cf. [Hironaka 1964]),
Grauert (cf. [Grauert 1972]) for the power series case and, for polynomial rings, by
Buchberger (cf. [Buchberger 1965]). In case of polynomial rings, they are also called
Gröbner bases. The idea of Gröbner bases is already contained in the work of Gordan,
see [Gordan 1899].
We now give some basic results.
Proposition 7.2.6. Let I ⊂ C {x}N be a submodule. Then I has a standard basis.
Proof. The module L(I) is, by definition, the submodule in C {x}N generated by the
leading monomials of the elements of I. The ring C {x} is Noetherian and, therefore
L(I) is generated by finitely many such leading monomials, say xα1 , . . . ,xαm . Choose
f1 , . . . ,fm ∈ I such that L(fi ) = xαi . Then {f1 , . . . ,fm } is a standard basis of I.
Corollary 7.2.7. Let S = {f1 , . . . ,fm } be a standard basis of I. Then I = (f1 , . . . ,fm ).
P
Proof. For f ∈ I we have f = qi fi + NF(f |S). This implies
that NF(f |S) ∈ I. If
NF(f |S) 6= 0, then L NF(f |S) ∈ L(I) = L(f1 ), . . . ,L(fm ) , which is a contradiction to
the definition of NF(f |S).
Corollary 7.2.8. Let S = {f1 , . . . ,fm } and T = {g1 , . . . ,gs } be standard bases of I.
Then for all f ∈ C {x}N we have NF(f |S) = NF(f |T ).
P P
Proof. f = qν fν + NF(f |S) = qν gν + NF(f |T
) implies NF(f |S) − NF(f |T ) ∈ I. If
NF(f |S) 6= NF(f |T ), then L NF(f |S) − NF(f |T ) ∈ L(I) is a monomial which appears
in NF(f |S) or in NF(f |T ) which is impossible by definition of NF.
This result makes the following definition possible.
Definition 7.2.9. Let I ⊂ C {x}N be a submodule and S a standard basis of I. Then for
any f ∈ C {x}N we define NF(f |I) = NF(f |S) to be the normal form of f with respect
to I.
Corollary 7.2.10. Let I be a submodule of C {x}N , and S = {f1 , . . . ,fm } be a standard
basis of I. Let f ∈ C {x}N . Then
(1) f ∈ I if and only if NF(f |S) = 0,
(2) f ∈ I if and only if NF(f |I) = 0,
(3) f − N F (f |I) ∈ I.
7.2 Characterizations and Properties of Standard Bases 277
P
Proof. We know that f = qν fν +NF(f |S) and I = f1 , . . . ,fm . Therefore, NF(f |S) =
0 implies f∈ I. On the other hand, f ∈ I implies NF(f |S) ∈ I. If NF(f |S) 6= 0 then
L NF(f |S) ∈ L(I) which is not possible by the definition of NF.
Corollary 7.2.11. Let I ⊂ C {x}N be a submodule. Then I has a reduced standard
basis S = {f1 , . . . ,fm }, with the property L(f1 ) < · · · < L(fm ). Such a standard basis is
uniquely determined.
Proof.
Step 1. We first prove the existence. Let T = {g1 , . . . ,gm } be a standard basis of I,
and NF the normal form with respect to T . After reordering, and leaving out some of
the gi , we may assume L(g1 ) < · · · < L(gm ). Suppose that L(gi )|L(gj ) for some i 6= j.
Then {g1 , . . . ,gj−1 , gj+1 , . . . ,gm } is also a standard basis for I. Thus we may assume that
L(gi ) ∤ L(gj ) for i 6= j. Furthermore, dividing gi by C(gi ) we may assume that C(gi ) = 1
for all i. We define
fi := L(gi ) + NF Tail(gi )|I , i = 1, . . . ,m.
(2) Let m = lcm(L(f ),L(g). Then we define the syzygy polynomial of f and g by
m C(f ) m
spoly(f,g) = ·f − · · g.
L(f ) C(g) L(g)
Examples 7.2.13.
spoly(f,g) = x5 − 25 y 5 .
(2) Consider the free C {x1 ,x2 }–module C {x1 ,x2 }2 = C {x1 ,x2 } ⊕ C {x1 ,x2 }x3 , and
w = (1,1,1) Let f = (x21 + 1,x2 ) = 1 + x21 + x2 x3 and g = (x2 ,x31 ) = x2 + x31 x3 . Then
L(f ) = 1, and L(g) = x2 , so that
(2) Let syz(f1 , . . . ,fm ) be the submodule of C [x]m of all syzygies of the leading terms
of f1 , . . . ,fm .
(3) Write xα(i,j) = lcm(L(fi ),L(fj )). Then
(Note that s(i,j) · (f1 , . . . ,fm ) = spoly(fi ,fj ), where the dot denotes the dot prod-
uct.)
Lemma 7.2.15. Consider f1 , . . . ,fm ∈ C {x}N , and keep the notations of the previous
definition.
(1) The C [x]–module syz(f1 , . . . ,fm ) admits a Zn+1 –grading, that is, we can write
syz(f1 , . . . ,fm ) = ⊕ν∈Nn+1 Sν with
m
X
Sν := ci xα(i) ei ∈ syz(f1 , . . . ,fm ) : ci ∈ C , xα(i) L(fi ) = xν .
i=1
(3) Let G be a subset of {s(i,j) } generating syz(f1 , . . . ,fm ). Then NF spoly(fi ,fj )|S) =
0 for all i,j with s(i,j) ∈ G.
Proof. The implication (1) =⇒ (2) follows from Corollary 7.2.10, and (2) =⇒ (3) is
obvious.
It remains to show (3) =⇒ (1). Without loss of generality, we may assume that C(fi ) = 1
for all i. Let L(fi ) = xαi for i = 1, . . . ,m. Consider f ∈ I. We have
Pmto prove that L(f )
is divisible by L(fi ) for some i. As f ∈ I we can write f = i=1 gi fi . To such a
representation we associate the monomial xδ = min{L(gi fi )}. Assume that the repre-
sentation is chosen with maximal xδ between all possible representations. We will prove
that L(f ) = xδ . This would imply L(f ) = L(gi fi ) = L(gi ) · L(fi ) for some i, and thus
the theorem. Suppose the converse, that is L(f ) > xδ . We define
I1 := {i : L(gi fi ) = xδ }, I2 := {i : L(gi fi ) 6= xδ }.
Now consider a nonzero term of the right hand side. Then there exist i,j ∈ I1 , with
s(i,j) ∈ G and cij spoly(fi ,fj ) 6= 0. As the syzygy on the leading terms is in Sδ we have
xαij lcm(L(fi ),L(fj )) = xδ . It was noted in the second part of Definition 7.2.12 that
L(xαij spoly(fi ,fj )) > xδ .
Now the third assumption implies NF spoly(fi ,fj )|S = 0 for s(i,j) ∈ G and, there-
Pm ij
fore, spoly(fi ,fj ) = ν=1 qν fν and L(qνij fν ) ≥ L spoly(fi ,fj ) . Thus in particular we
get xαij L(qνij fν ) > xδ . We get
X X m
X
C(gi )L(gi )fi = cij xαij qνij fν
i∈I1 i,j∈I1 ν=1
s(i,j) ∈G
Xm X
= cij xαij qνij fν .
ν=1 i,j∈I1
s(i,j) ∈G
Now we define g ν by
(
X gν , for ν ∈ I2
gν = cij xαij qνij +
i,j∈I1
Tail(gν ), for ν ∈ I1 .
s(i,j) ∈G
Pm
Then f = ν=1 g ν fν and L(gν fν ) > xδ for all ν. This is a contradiction to the choice of
δ and proves the theorem.
Example 7.2.17. Let f1 = x4 + x3 y 2 , f2 = y 4 + x2 y 3 , w = (1,1). Then {f1 ,f2 } is a
standard basis:
spoly(f1 ,f2 ) = x3 y 6 − x6 y 3 = −x2 y 3 f1 + x3 y 2 f2 .
The following lemma gives the possibility to find proper subsets of {s(i,j) } which
generate syz(f1 , . . . ,fm ). This is very useful for the application of 7.2.16.
Lemma 7.2.18 (Chain Criterion). Let f1 , . . . ,fm ∈ C {x}N and assume that a sub-
set G of {s(i,j) } generates
syz(f1 , . . . ,fm ). Suppose s(i,j) , s(i,k) and s(k,j) are in G and
L fi )| lcm(L(fj ),L(fk ) . Then G r {s(j,k) } generates syz(f1 , . . . ,fm ).
Proof. We may assume that C(fi ) = 1 for all i, then one computes that
lcm L(fj ),L(fk ) (i,j) lcm L(fj ),L(fk ) (i,k)
s(k,j) = s − s .
lcm L(fi ),L(fj ) lcm L(fi ),L(fk )
Thus the lemma follows from 7.2.16.
7.2 Characterizations and Properties of Standard Bases 281
Exercises
7.2.19 (Product Criterion). Let f1 ,f2 ∈ C {x}, and assume that lcm(L(f1 ),L(f2 )) = L(f1 ) ·
L(f2 ). Prove that NF(spoly(f1 ,f2 )|{f1 ,f2 }) = 0.
7.2.20. Let <m be an elimination ordering (see 7.1.11 (2)) with w1 = · · · = wn = 0, wn+1 = 1.
Let I ⊂ C {x}N P
be a submodule and S a standard basis ofP I.
Prove that S ∩ ti=0 C {x}xin+1 is a standard basis of I ∩ ti=0 C {x}xin+1 for all t ≤ N − 1.
7.2.21 (Gröbner bases). In this exercise we will develop the theory of standard bases for poly-
nomial rings rewriting some passages of the Sections 7.1 and 7.2.
• Replace C {x} by C [x].
• Replace in Definition 7.1.3 xαi < xαi+1 by xαi > xαi+1 .
• Rewrite the examples.
• Replace (2) by L(qν fν ) ≤ L(f ), for ν = 1, . . . ,m in Theorem ??, and cancel Remark 7.1.7.
• Replace the proof of Theorem ?? by the following NF–algorithm (normal form algorithm):
r = NF(f |{f1 , . . . ,fm })
r=f
S = {f1 , . . . ,fm }
while r 6= 0 and L(fi ) | L(r) for some i
choose i minimal with L(fi ) | L(r)
C(r) L(r)
r = r − C(f i ) L(fi )
fi
6 0
if r =
r = C(r)L(r) + NF(Tail(r)|S)
return(r).
Prove that this algorithm
P terminates. Use the fact that < is a well-ordering and Definition
7.1.3. Prove that f = qν fν + r, r = NF(f |{f1 , . . . ,fm }) for suitable qν ∈ C [x] and (1)
and (2) of Theorem ?? are true.
• Check the proofs of 7.2.7, 7.2.8, 7.2.11 and 7.2.10, 7.2.15, and 7.2.16 (here we have to
change the definition of xδ = max{L(gi fi )} to min{L(gi fi )} and replace several times >
by <), and 7.2.18.
7.2.22 (Buchberger’s algorithm).
S = standard({f1 , . . . ,fm })
S = {f1 , . . . ,fm }
P = {(fi ,fj )}i<j
while P 6= ∅
choose (f,g) ∈ P
P = P r {(f,g)}
h = NF(spoly(f,g)|S)
if h 6= 0
P = P ∪ {(f,h)|h ∈ S}
S = S ∪ {h}
return(S).
• Prove that the algorithm terminates.
• Use 7.2.16 to prove the S is a standard basis of (f1 , . . . ,fm ).
7.2.23 (Standard bases for formal power series). Standard bases for formal power series can be
defined as in 7.2.3.
• Prove that the theory developed in this section holds in the formal case.
• Prove that {f1 , . . . ,fm } ⊂ C {x}N is a standard basis if and only if {f1 , . . . ,fm } ⊂ C [[x]]N
is a standard basis.
282 7 Standard Bases
7.2.24 (Computation of standard bases in the power series ring). Let < be a well ordering
on the set of monomials of C [x]. We choose a new variable t and extend this ordering in the
following way to C [x,t]:
x α ta > x β tb
if deg(xα ) + a > deg(xβ ) + b or in case of equality xα < xβ . This extension has the following
property: P
Let f ∈ C [x], f = i≥1 ai xαi , ai 6= 0 and xαi < xαi+1 , that is, the monomials in f are ordered
` ´
in the sense of a power series ring (cf. Definition 7.1.3). Let F = f h = tdeg(f ) f xt1 , . . . , xtn be
P αi αi
the homogenization of f , then F = i≥1 ai xαi tdeg(f )−deg(x ) and we have xαi tdeg(f )−deg(x ) >
αi+1
xαi+1 tdeg(f )−deg(x )
, that is, the monomials of F are ordered in the sense of a polynomial
ring (cf. Exercise 7.2.21).
Prove the following: Let f1 , . . . ,fm ∈ C [x] and I = (f1 , . . . ,fm )C {x}, respectively I =
(f1 , . . . ,fm )C [[x]]. Let < be a well-ordering on C [x] extended as before to C [x,t]. Let J =
(f1h , . . . ,fm
h
)C [x,t] and {G1 , . . . ,Gr } be a standard basis of J. Then {G1 (t = 1), . . . ,Gr (t = 1)}
is standard basis of I.3
7.3 Applications
in L(I) then NF(xβ |S) is a linear combination of the {xα |xα 6∈ L(I)} and, therefore, this
set is also a basis.
Now we want to prove that for w = (1, . . . ,1) the Hilbert–Samuel function of an ideal
I ⊂ C {x} coincides with the corresponding function of the leading ideals (cf. Chapter
4.3).
Corollary 7.3.2. Let I ⊂ C {x} be an ideal, then HSC {x}/I = HSC {x}/L(I) .
3 In practice this way to compute a standard basis is not very efficient. In the computer algebra sys-
tem Singular for this purpose Mora’s algorithm, a modified version of Buchberger’s algorithm, see
[Mora 1982] is implemented.
7.3 Applications 283
Proof. HSC {x}/I (c) = dimC C {x}/I + (x)c
= dimC C {x}/L I + (x)c
by 7.3.1
= dimC C {x}/L(I) + (x)c
= HSC {x}/L(I) (c).
Finally, we shall prove Lemma 5.1.21 from Chapter 5.
Let f ,g ∈ C [x,y] quasi-homogeneous deg(x) = q, deg(y) = p and (x,y)–primary,
let f = f + terms of higher degree, g = g+ terms of higher degree. Then xi y j ∈ (f,g) if
iq + jp > w-ord(f · g).
Proof. We choose w = (p,q) and compute a standard basis for (f,g). Because of the
fact that f ,g are quasi-homogeneous and (f ,g) is (x,y)–primary, we may assume that
L(f ) = xc0 and define g0 = f and g1 = NF(g|{g0 }), L(g1 ) = xc1 y b1 with c0 > c1 and
b1 > 0. Assume we have already constructed {g0 , . . . ,gi−1 } with the following properties:
(1) gj ∈ (f,g).
(2) L(gj ) = xcj y bj , cj > cj+1 ,bj+1 > bj .
(3) gj = NF spoly(gj−2 ,gj−1 )|{g0 , . . . ,gj−1 } .
We define gi := NF spoly(gi−2 ,gi−1 )|{g0 , . . . ,gi−1 } .
Let L(gi ) = xci y bi then bi > bi−1 because L spoly(gi−2 ,gi−1 ) < xci−2 y bi−1 . Simi-
larly, ci < ci−1 .
Because the ci are strictly decreasing and (f ,g) is (x,y)–primary there is an m such
that L(gm ) = y bm and gm+1 = 0. Namely, gm+1 = NF spoly(gm−1 ,g m ) | {g 0 , . . . ,g m } . If
cm−1 bm
gm+1 6= 0 then L(gm+1 ) > x y and, therefore, deg L(gm+1 ) > deg spoly(gm−1 ,gm ) =
pbm + qcm−1 . We shall prove now that in this situation L(gm+1 ) ∈ L(g0 ), . . . ,L(gm ) ,
which is a contradiction for gm+1 being in normal form. At the same time we prove
that {g0 , . . . ,gm } is a standard basis of (f,g). We use Theorem 7.2.16 and Lemma
7.2.18. The property cj > cj+1 and bj < bj+1 implies that L(gj−1 ) = xcj−1 y bj−1 |
lcm L(gj ),L(gk ) = x y for k ≤ j − 2. This implies that already G = {s(i,i−1) }i=1,...,m
ck bj
generates syz(g0 , . . . ,gm ). (Lemma 7.2.18.) But by definition gi =NF spoly(gi−2 ,gi−1 )
| {g0 , . . . ,gi−1 } and, therefore, NF spoly(gi−2 ,gi−1 )|{g0 , . . . ,gm } = 0. (4) of Theorem
7.2.16 implies that {g0 , . . . ,gm } is a standard basis of (f,g).
Because of (f ,g) being (x,y)–primary, L(gm ) = y bm . Now deg(g0 ) = c0 p (the
weighted degree) and
bm xcm−1 y bm
·xi y j
b1
x
cm−1 c0
The monomial xcm−1 y bm has, therefore, degree deg(f · g). It is the biggest monomial
which is not in (f,g). This implies that xi y j ∈ (f,g) if iq + jp > deg(f · g) and proves the
Lemma.
As a totally different application, we can study the ideal of the product of germs of
analytic spaces.
Definition 7.3.3. Let (X, 0) ⊂ (C n , 0) and (Y, 0) ⊂ (C m , 0) be germs of analytic spaces.
Let X and Y be representatives We define the product (X × Y, 0) to be the germ of
(X × Y ) in 0.
One checks without difficulty that this definition is independent on the choice of
the representatives. Let I ⊂ C {x} be the ideal of (X, 0), and J ⊂ C {y} be the ideal of
(Y, 0). Then it is a direct check that the product (X × Y, 0) is given as the zero set of
(I + J) · C {x,y}. That this are all functions in the ideal of (X × Y, 0) is not at all trivial:
Theorem 7.3.4. With the above notation one has
I (X × Y, 0) = (I + J) · C {x,y}.
Proof. The inclusion ⊃ is easy. For the converse, let f ∈ I (X × Y, 0). We have to show
that f ∈ (I + J) · C {x,y}. Take a standard basis of J. By Grauert’s division Theorem
7.1.7, we may assume that no monomial of f is in L(J). We write
X
f= fα y α , fα ∈ C {x}.
α
Therefore, for all α with fα 6= 0 we have that y α is not in the leading ideal of J. We
claim that all fα are in I, which suffices to prove the Theorem (Exercise 7.3.7). We choose
representatives X of (X, 0) and Y of (Y, 0) such that f is defined on X × Y and f (a,b) = 0
for all (a,b) ∈ X × Y . Suppose the converse. Then there exists an a ∈ X with fα (a) 6= 0
for some α. This is beause I = I (X, 0). For all b ∈ Y we have
7.3 Applications 285
X
f (a,b) = fα (a)bα = 0,
α
that is, X
f (a,y) = fα (a)y α ∈ I (Y, 0) = J.
α
This is a contradiction, as the leading term of f (a,y) does not lie in L(J).
This result motivates the following definition.
Definition 7.3.5. Let A = C {x}/I and B = C {y}/J be analytic algebras. Then we
b by
define the analytic tensor product A⊗B
b := C {x,y}/(I + J).
A⊗B
Thus in case that A and B are reduced, and are the local rings of (X, 0) and (Y, 0)
respectively, the analytic tensor product is the local ring of the product (X × Y, 0).
Remark 7.3.6. We have obvious maps inclusions i : A −→ A⊗B, b b
and j : B −→ A⊗B,
which corresponds geometrically to the projection on the factors, if A and B are reduced.
On the other hand, we also have a natural surjective map A⊗B b −→ A. This corresponds
geometrically with the inclusion of one of the factors in the product.
The analytic tensor product satisfies the following universal property. Let C be an
analytic algebra, and γ : A −→ C and δ : B −→ C be analytic maps. Then there exist a
uniquely determined map θ : A⊗B b −→ C such that γ = θ ◦ i and δ = θ ◦ j.
In the case that A and B are reduced, this is easy to prove and follows from the fact
that analytic maps (Z, 0) −→ (X, 0) and (Z, 0) −→ (Y, 0) induce obviously an analytic
map (Z, 0) −→ (X × Y, 0).
For the proof of the universal property and more facts on the analytic tensor product
we refer to the book Analytische Stellenalgebren, see [Grauert-Remmert 1971].
Exercises:
P
7.3.7. Let I ⊂ C {x1 , . . . ,xn } be an ideal and f ∈ C {x1 , . . . ,xn , y1 , . . . ,ym }, f = fα y α such
that fα ∈ I for all α. Prove that f ∈ IC {x1 , . . . ,xn ,y1 , . . . ,ym }.
(Hint: Use the fact that the normal form with respect to IC {x1 , . . . ,xn ,y1 , . . . ,ym } and the
degree ordering with w = (1, . . . ,1) is additive and maps (x1 , . . . ,ym )c to (x1 , . . . ,ym )c .)
7.3.8. Let I ⊂ C {x0 , . . . ,x4 } be the ideal generated by the 2–minors of the matrix ( xx01 xx12 xx23 xx34 ).
Prove that I is a prime ideal.
(Hint: Consider the map ϕ : C {x0 , . . . ,x4 } −→ C {x0 ,x1 }x0 defined by ϕ(x2 ) = x10 x21 , ϕ(x3 ) =
1 3
x and ϕ(x4 ) = x13 x41 . Prove that Ker(ϕ) = I : (x0 )∞ . Then prove that I = I : (x0 ). To
x2 1
0 0
see this, prove that (x1 ,x2 ,x3 )2 = L(I) the leading ideal of I with respect to the degree reverse
lexicographical ordering. The six 2–minors of the matrix above are already a standard basis of
I with respect to this ordering.)
286
8 Approximation Theorems
Here it comes:
Theorem 8.1.1 (Artin’s Approximation Theorem). Consider two sets of variables x =
(x1 , . . . ,xn ) and y = (y1 , . . . ,yN ). Let f1 , . . . ,fm ∈ C {x,y} and y ∈ (x)C [[x]]N such that
fi (y) = 0 for i = 1, . . . ,m. Let c be a positive integer. Then there exists a y ∈ C {x}N
such that:
• fi (y) = 0 for i = 1, . . . ,m.
• y ≡ y modulo (x)c .
The proof is by induction on n. The case n = 0 is trivial. The proof of the induction
step is quite long, and will be divided in several lemmas. The first two lemmas will
prepare us to prove the Artin Approximation Theorem for a special case. The general
case will then be reduced to this special case.
Lemma 8.1.2. Suppose that Artin’s Approximation Theorem holds for n − 1 variables
x1 , . . . ,xn−1 . Consider the following data:
• c, a positive integer.
• f1 , . . . ,fm ∈ C {x,y}.
• A formal Weierstraß polynomial P = xsn + a1 xns−1 + . . . + as ,
ai ∈ (x1 , . . . ,xn−1 )C [[x1 , . . . ,xn−1 ]].
8.1 Artin’s Approximation Theorem 287
Suppose that
fi (r(x)) ≡ 0 modulo P for i = 1, . . . ,m.
Then there exists an analytic Weierstraß polynomial P of degree s in xn , and elements
r1 , . . . ,rN ∈ C {x1 , . . . ,xn−1 }[xn ] of degree smaller than s with respect to xn such that
fi (r(x)) ≡ 0 modulo P
A1 , . . . ,As ; Rij , 1 ≤ i ≤ N, 0 ≤ j ≤ s − 1
and define
Pe := xsn + A1 xns−1 + . . . + As
s−1
X
ei :=
R (Rij + cij )xjn 1 ≤ i ≤ N.
j=0
for some Fij ∈ C {x1 , . . . ,xn−1 ,A1 , . . . ,As ,{Rij }}. Plugging in a1 , . . . ,as for A1 , . . . ,As
and, similarly, r ij for Rij for 1 ≤ i ≤ N, 0 ≤ j ≤ s − 1, we see that we have formal
solutions to the equations Fij = 0. We may apply the induction hypothesis to find the
P
convergent solutions ai and the rij . Then clearly P = Pe (a) and ri = (rij + cij )xjn
satisfy the statement of the lemma.
Lemma 8.1.3. Suppose that Artin’s Approximation Theorem holds for n − 1 variables
x1 , . . . ,xn−1 . Let g,f1 , . . . ,fm ∈ C {x,y}, c ∈ N and formal power series y ∈ (x)C [[x]]N
be given such that g(y) 6= 0 and
Then there exists a y ∈ C {x}N with y ≡ y modulo (x)c such that g(y) 6= 0 and
Proof. Consider g(y) ∈ C [[x]]. After a linear coordinate change in the x1 , . . . ,xn , we may
assume that g(y) is regular of order s in xn . We may enlarge c so that c > s. We apply
the Weierstraß Preparation Theorem:
g(y) = U · P ,
The first equivalence follows from the Taylor expansion, the second equivalence holds by
construction of P . Similarly
By the previous lemma, applied to the functions g,f1 , . . . ,fm , we can find convergent
ri ∈ C {x}, and a convergent Weierstraß polynomial P with ri ≡ r i and P ≡ P modulo
(x)c such that
To define the yi , take any Qi ∈ C {x} such that Qi ≡ Qi modulo (x)c and put:
yi := Qi P + ri .
By construction yi ≡ y i modulo (x)c . Using Taylor’s formula again, as above, we see that
(8.1) g(y) = U · P
for a unit U ∈ C {x}. It then follows immediately that fi (y) ≡ 0 modulo g(y) for
i = 1, . . . ,m, and g(y) 6= 0 because P 6= 0. To show (8.1), observe that by the Weierstraß
Division Theorem and the fact that g(y) ≡ 0 modulo P there exists a U ∈ C {x} with
g(y) = U · P . We only have to show that U is a unit. By assumption, g(y) is regular of
order s in xn . Because y ≡ y modulo xc and c > s it follows that g(y) is regular of order
s in xn . As P is a Weierstraß polynomial of degree s in xn , it follows that U is a unit.
This finishes the proof of the lemma.
Lemma 8.1.4. Suppose that Artin’s Approximation Theorem holds for n − 1 variables
x1 , . . . ,xn−1 . Then Artin’s Approximation Theorem for n variables x1 , . . . ,xn holds under
the following assumptions:
(1) m ≤ N .
8.1 Artin’s Approximation Theorem 289
∂fi
(2) There exists an m–minor ∆ of the Jacobian matrix ∂yj such that ∆(y) 6= 0.
Proof. This we will prove as an application of Newton’s Lemma, which itself follows
from the Implicit Mapping Theorem. Take any nonzero function h ∈ (x)c . Then because
fi (y) = 0, we certainly have:
We can apply the previous lemma to find a y ′ ∈ C {x}N , y ′ ≡ y modulo (x)c , such that
fi (y ′ ) ≡ 0 modulo ∆2 (y ′ )h.
Putting J to be the ideal generated by the m–minors of the Jacobian matrix we certainly
have
fi (y ′ ) ≡ 0 modulo J 2 (y ′ )(x)c ,
so that Newton’s Lemma 3.3.34 applies: there exists a y ∈ C {x}N , y ≡ y ′ modulo
J(y ′ )(x)c , such that fi (y) = 0 for all i = 1, . . . ,m. (Note that we needed m ≤ N in order
to apply Newton’s Lemma.)
Proof of Artin’s Approximation Theorem 8.1.1. Define the ideal p to be the kernel of the
map:
α : C {x,y} −→ C [[x]]
defined by x 7→ x, and y 7→ y(x). Note that as we have an injective map C {x,y}/p ֒→
C [[x]], it follows that p is a prime ideal. As f1 , . . . ,fm ∈ p it suffices to find a y(x) ∈
C {x}N such that g(x,y(x)) = 0 for all g ∈ p. So we may have assumed p = (f1 , . . . ,fm )
from the beginning.
Let ht(p) = k. By the Jacobian Criterion 4.3.15 applied to the case I = p = q, there
exist k variables z1 , . . . ,zk among the xi and the yi , such that (after renumbering the fi )
∂fi
∆′ := det ∈/ p.
∂zj 1≤i,j≤k
∂fi (x,y(x))
In particular ∆′ (x,y(x)) 6= 0. As 0 = fi (x,y(x)) ∈ C [[x]], it follows that ∂xj =0
for all j. From the chain rule it follows that
In particular, it follows that one of those minors is nonzero. We may, therefore, apply
the previous lemma to find a convergent y ∈ C {x}N with y ≡ y modulo (x)c and
f1 (x,y(x)) = . . . = fk (x,y(x)) = 0.
Now C {x,y}p /p is a regular local ring. (It is even a field.) In the Jacobian Criterion
4.3.15 it was shown that f1 , . . . ,fk generate pC {x,y}p , that is,
290 8 Approximation Theorems
8.1.6. Let the conditions be as in the previous exercise. Let I ⊂ C {x} be an ideal. Suppose we
have a formal solution y(x) whose components lie in I · C [[x]]. Then we can find a convergent
solution in I. Prove this.
8.1.7. Let p ⊂ C {x} be a prime ideal. Show that p · C [[x]] is a prime ideal.
(Hint: Consider f,g ∈ C [[x]] with f · g ∈ p · C [[x]]. Use Exercise 8.1.5 to find for all c ∈ N
fc ,gc ∈ C {x}, fc ≡ f modulo (x)c and gc ≡ g modulo (x)c , and fc · gc ∈ p. Deduce that either
f = limc→∞ fc ∈ p · C [[x]] or g = limc→∞ gc ∈ p · C [[x]].)
8.1.8. Let p ⊂ C {x} be a prime ideal. Suppose that C {x}/p is normal. Use Exercise 8.1.5 to
see that C [[x]]/p · C [[x]] is normal.
F ≡ 0 modulo I
has a solution (φ(e0 ) ,ψ (e0 ) ) of order e0 . Suppose, moreover, that for all e ≥ e0 every
solution (φ(e) ,ψ (e) ) of order e with φ(e) ≡ φ(e0 ) modulo me0 +1 , and ψ (e) ≡ ψ (e0 ) modulo
me0 +1 , extends to a solution (φ(e) + δ (e) ,ψ (e) + γ (e) ) of order e + 1, with δ (e) ∈ C [s]p and
γ (e) ∈ C {x}[s]q homogeneous of degree e + 1 in s. Then the system of equations F ≡ 0
modulo I has an analytic solution (φ,ψ), with φ ≡ φ(e0 ) modulo me0 +1 , and ψ ≡ ψ (e0 )
modulo me0 +1 .
Before giving the proof, we need some preparations. As in Chapter 7, we choose as
a basis for the free module C {x,s,Φ,Ψ}k the vectors ei := (0, . . . ,0,1,0, . . . ,0) (the 1 is on
the i–th spot) and identify ei with xi−1 n+1 by adding a new variable xn+1 and enlarging x
to x = (x1 , . . . ,xn+1 ).
We choose on C {x,s,Φ,Ψ}k the degree lexicographical ordering as in Chapter 7 with
weights 1 for all variables. For an element F ∈ C {x,s,Φ,Ψ}k there exists a polyradius
n+1+l+p+q
λ ∈ R+ such that F ∈ Bλk . We will write, in order to distinguish between the
p+q
variables x,s and φ,ψ, the polyradius as λ = (ρ,τ,σ), ρ ∈ Rn+1 l
+ , τ ∈ R+ , σ ∈ R+ .
This we can view as system of linear equations. More precisely, consider the linear sub-
space in C t spanned by the vectors
which is nonzero. After renumbering the bi we may assume that M = det(bi,αj )1≤i,j≤r .
We define
P (i,β)
(1) for i = 1, . . . ,r, yi := |β|=e yi,β sβ with yi,β = MM ,
(2) yr+1 := · · · := yt := 0.
Here M (i,β) is obtained from M by replacing Pt the i–th column of M by the column vector
(Cα1 ,β , . . . ,Cαr ,β )t . We want to show j=1 bj,α yj,β = Cα,β for all α. By Cramer’s rule,
we know this is true for α = α1 , . . . ,αr . Now
X X X X X
Cα,β = bj,α y j,β = ki,α bj,αi y j,β = ki,α Cαi ,β = ki,α bj,αi yj,β = bj,α yj,β ,
j i,j i i,j j
as was to be shown.
Step 2. It remains to define the K = K(ρ) and to show the estimates. We take
|Mij |
K = max .
i,j ραj |M |
Here Mij is the determinant obtained from M by deleting the i–th row and the j–th
column. As yr+1 = · · · = yt = 0, we only have to show the estimates for the y1 , . . . ,yr .
By definition
X M (i,β) β
X |M (i,β) |
kyi kτ =
s
τ = τβ.
M |M |
|β|=e |β|=e
8.2 Grauert’s Approximation Theorem 293
Pr
Moreover, by linear algebra we have M (i,β) = j=1 Cαj ,β Mij and, therefore,
r
|M (i,β) | X |Mij |
≤ |Cαj ,β | .
|M | j=1
|M |
This implies
r
1 X X
kyi kτ ≤ |Cαj ,β | |Mij |τ β .
|M | j=1
|β|=e
kyi kτ ≤ K · kCk(ρ,τ )
holds.
We need the following generalization of this proposition.
Lemma 8.2.4 (Cartan’s Lemma). Let a1 , . . . ,ar , b1 , . . . ,bt ∈ C {x}k and ρ ∈ Rn+1 + ,
τ ∈ Rl+ . Suppose that a1 , . . . ,ar , b1 , . . . ,bt ∈ Bρk . There exists a ρ ∈ Rn+1
+ , ρ ≤ ρ, and
K = K(ρ) ∈ R+ with the following property.
P
For all τ ∈ Rl+ with τ ≤ τ and for all C ∈ B(ρ,τ k
), C = |α|=e C
(α) α
s with C (α) ∈ C {x}
and for all y1 , . . . ,y t ∈ C [s] and z 1 , . . . z r ∈ C {x}[s] such that
(3) kzi k(ρ,τ ) ≤ KkCk(ρ,τ ) and kyj kτ ≤ KkCk(ρ,τ ) for all i and j.
The important thing to note here, is that K does not depend on C, e, and τ . However,
it depends (of course) on the ai , the bi and ρ.
294 8 Approximation Theorems
Proof. Step 1. The case r = 0 is dealt with in the previous proposition. We consider the
normal form NF with respect to the submodule (a1 , . . . ,ar ). Note that as the y j do not
Pr Pt
depend on x, we have NF(yj ) = y j . We take the normal form of i=1 ai z i + j=1 bj yj =
C and get
t
X
(8.3) NF(bj )y j = NF(C).
j=1
We apply the previous proposition. There exists a K0 (ρ) depending on the NF(bj ) and
P
ρ, and y1 , . . . ,yt ∈ C [s] homogeneous of degree e such that tj=1 NF(bj )yj = NF(C) and
To define the zi , note that from 7.2.10 it follows that we can write
r
X r
X
(8.5) bj = αij ai + NF(bj ), C= βi ai + NF(C).
i=1 i=1
The precise αij and βi we take will be determined in the second step. Plugging (8.5) in
Pr Pt
i=1 ai z i + j=1 bj y j − C = 0 and using (8.3) we get
r
X t
X
zi + αij y j − βi ai = 0.
i=1 j=1
Now we define
t
X
zi := − αij yj + βi .
j=1
Obviously,
r
X t
X
zi + αij yj − βi ai = 0.
i=1 j=1
Pt
Working backwards, that is, using j=1 NF(bj )yj = NF(C) we get
r
X t
X
ai z i + bj yj = C.
i=1 j=1
Step 2. We still have to tell which αij and βi we take that fulfill (8.5), and show the
estimates. To do this, we take a standard basis f1 , . . . ,fm of the submodule (a1 , . . . ,ar ).
We can write
Xr
fν = hνi ai
i=1
for some hνi ∈ C {x}. Choose ρ ≤ ρ so small that a1 , . . . ,ar ,b1 , . . . ,bt ,f1 , . . . ,fm ,
h11 , . . . ,hmr ∈ Bρk . We apply Grauert’s Division Theorem, see Remark 7.1.7 with ε = 21 .
Therefore, by making ρ smaller we get equations
8.2 Grauert’s Approximation Theorem 295
X
bj = bjν fν + NF(bj )
ν
X
C= Cν fν + NF(C),
ν
Let K ′ = L maxj {2K0 kbj kρ + 1} and K = max{2K0,K ′ }. With this K the desired
estimates hold.
We need one more result before we can give the proof of Grauert’s Approximation
Theorem.
Lemma 8.2.5. Let F (x,s,Θ) ∈ C {x,s,Θ}k be a vector of power series in three sets of
variables x = (x1 , . . . ,xn ), s = (s1 , . . . ,sl ) and Θ = (Θ1 , . . . ,Θt ). Let I ⊂ C {s} be an
k n+1+l+t
ideal. Assume that F ∈ B(ρ,τ ,σ) for a suitable polyradius (ρ,τ ,σ) ∈ R . Then for
l
all ε > 0, there exists a K = K(ε) > 0 and a τ ∈ R+ , τ ≤ τ , with the following property.
(e) (e)
Let e ∈ N and θ(e) (x,s) = θ1 (x,s), . . . ,θt (x,s) such that
296 8 Approximation Theorems
Let Fe ∈ C {x,s}k be the degree e + 1 part in s of the normal form of F (x,s,θ(e) (x,s))
with respect to I, then:
kFe k(ρ,λτ ) ≤ Kε.
Proof. Let 0 < α < 1 be given. Then there exists a polyradius τ ∈ Rl+ such that for all
f ∈ B(ρ,τ )
1
(8.7) k NF(f |I)k(ρ,τ ) ≤ kf k(ρ,τ ).
1−α
Given α and (ρ,τ ), this inequality still holds if we replace τ by λτ for any λ with 0 < λ ≤ 1,
see Remark 7.1.7. We develop F :
X
F (x,s,Θ) = Fν (x,s)Θν .
ν∈Nt
As this series is convergent, there exists an A > 0 such that kFν (x,s)k(ρ,τ ) ≤ A|ν| . (cf.
Exercise 8.2.6). We choose K so small that AK < 1. It is an exercise to show that we
can choose K = K(ε) so small that even
X 1 ε(1 − α)
(AK)|ν| = t
− 1 − tAK < K ,
(1 − AK) 3
|ν|≥2
By assumption the θ(e) (x,s) are in normal form with respect to P I. Moreover the Fν (x,0)
for all ν with |ν| ≤ 1 do not depend on s. It follows that F0 (x,0)+ |ν|=1 Fν (x,0)θ(e) (x,s)ν
P
is in normal form with respect to I. Furthermore F0 (x,0) + |ν|=1 Fν (x,0)θ(e) (x,s)ν does
not have terms of degree e + 1 in the variables s. It follows that Fe , which by definition
is the degree e + 1 part in s of the normal form of F (x,s,θ(e) (x,s)), is equal to the degree
e + 1 part of the normal form of
X X
F0′ (x,s) + Fν′ (x,s)θ(e) (x,s)ν + Fν (x,s)θ(e) (x,s)ν .
|ν|=1 |ν|≥2
By choice of K and δ each term on the right hand side is smaller than or equal to
(1 − α) εK
3 . Hence kFe k(ρ,λτ ) < εK, as was to be shown.
(e )′ (e )′ (e0 )′ (e )′
Here c,d,cν and dν are in C {x}. Moreover φν 0 = {φi,ν0 }i=1,...,p , ψν = {ψi,ν0 }i=1,...,q
(e )′ (e )′
with φi,ν0 ∈ m, ψi,ν0 ∈ m · C {x,s}. Define new variables Φ′ = {Φ′iν },Ψ′ = {Ψ′iν }, and
define X X
F ′ (x,s,Φ′ ,Ψ′ ) := F (x,s,c + sν (cν + Φ′ν ),d + sν (dν + Ψ′ν )).
′ ′ ′ ′
Now F ′ has a solution φ(e0 ) ,ψ (e0 ) of order e0 . Replacing F,φ(e0 ) , ψ (e0 ) by F ′ ,φ(e0 ) ,ψ (e0 )
proves the claim.
Step 2. We can decide whether F (φ,ψ) ∈ I by looking at the normal form NF(F (φ,ψ)|I).
In fact, in 7.2.10 it is proved that:
The normal form NF(F (φ,ψ)|I) is a power series, all of whose terms are not in L(I), the
ideal of leading terms of I. Moreover, NF(F (φ,ψ)|I) is congruent to F (φ,ψ) modulo I. If
(φ,ψ) is a solution of F ≡ 0 modulo I, then it follows immediately that (NF(φ|I), NF(ψ|I))
is also a solution
P of F ≡ 0 modulo I. Similar remarks
P hold for solutions of order e, as
obviously aα sα being in normal form implies |α|≤e aα sα is in normal form. So we
will and may assume that all our solutions are in normal form.
n+1+l+p+q k
Step 3. We now choose the polyradius. Let (ρ,τ ,σ) ∈ R+ such that F ∈ B(ρ,τ ,σ) .
We consider the elements
∂F ∂F
s=0
∈ C {x}k and ∈ C {x}k , for i = 1, . . . ,p and j = 1, . . . ,q.
∂Φ i ∂Ψ j s=0
n+1
By Cartan’s
∂F
Lemma 8.2.4 we can choose a ρ ∈ R+ , ρ ≤ ρ and M
∂F k
= M (ρ) ≥ 1 depending
on ∂Ψ s=0 and ∂Φ s=0 with the following property. Let G ∈ B(ρ,τ ) , and δ 1 , . . . ,δ p ∈ C [s]
and γ 1 , . . . ,γ q ∈ C {x}[s] such that
p
X Xq
∂F ∂F
G+ δi s=0
+ γi = 0,
i=0
∂Φi i=0
∂Ψi s=0
can choose, by scaling, the polyradius τ so small that, moreover, kφ(e0 ) kτ < K and
kψ (e0 ) k(ρ,τ ) < K.
Step 4. We will for all e ≥ e0 +1 construct solutions (φ(e) ,ψ (e) ) of order e with (φ(e) ,ψ (e) ) ≡
(φ(e−1) ,ψ (e−1) ) modulo me and with the estimates:
Here φe and ψe are the degree e parts in s of φ(e) and ψ (e) . Let (φ(e) ,ψ (e) ) be a solution
of order e, such that (φ(e) ,ψ (e) ) are polynomials in s of degree ≤ e, in normal form with
respect to I and kφe kτ ≤ K, kψe k(ρ,τ ) ≤ K. Let Fe be the degree e + 1 part in s of
K
the normal form of F (x,s,φ(e) ,ψ (e) ). By the previous lemma, kFe k(ρ,τ ) < εK = M . By
(e)
assumption, there exist δ and γ (e) homogeneous of degree e + 1 in s and in normal
(e)
form with respect to I such that (φ(e) + δ ,ψ (e) + γ (e) ) is a solution of order e + 1. By
Taylor expansion and taking the degree e + 1 part of the normal form we obtain
Xp Xq
∂F (e) ∂F (e)
Fe + |s=0 δ i + |s=0 γ i = 0.
i=1
∂Φ i i=1
∂Ψ i
(e) (e)
By choice of M there exist such δi and γi , having moreover the estimates:
(e) (e) K
kδi kτ , kγi k(ρ,τ ) < M kFe k(ρ,τ ) < M = K.
M
Now φ(e+1) := φ(e) + δ (e) and ψ (e+1) := ψ (e) + γ (e) . Then φe+1 = δ (e) and ψe+1 = γ (e) ,
and the desired estimate holds.
P P∞
Step 5. We now look at the formal solution φ = ∞ e=0 φe , ψ = e=0 ψe . Then, because
φe and ψe are homogenous of degree e with respect to s, and kφe kτ < K, kψe k(ρ,τ ) < K
we obtain
∞
X ∞
X
1 e
1 e
kφkτ /2 < 2 · K = 2K, kψk(ρ,τ /2) < 2 · K = 2K,
e=0 e=0
showing the convergence of φ and ψ. This concludes the proof of Grauert’s Approximation
Theorem.
Exercises
P
8.2.6. Let f (x,Θ) = ν∈Nt fν Θν . Let (ρ,τ ) be a polyradius with kf k(ρ,τ ) < ∞. Show that there
exists an A with kfν kρ ≤ A|ν| .
1
8.2.7. Let c > 0, prove that for all x with |x| sufficiently small (1−x) t < (c + t)x + 1.
(Hint: Check the monotony behavior of the real function corresponding to the above equality.)
8.3 Some other Approximation Theorems 299
f = aY1 + bY2 + cY3 + d ∈ C {x1 ,x2 ,x3 , Y1 ,Y2 ,Y3 }, y = (y 1 ,y2 ,y3 ) ∈ C [[x1 ,x2 ,x3 ]]
such that f (y) = 0, y2 ,y3 ∈ C [[x1 ,x2 ]] and there are no y = (y1 ,y2 ,y3 ) ∈ C {x1 ,x2 ,x3 }
with f (y) = 0 and y2 ,y3 ∈ C {x1 ,x2 }.
Remark 8.3.6. In the following special situation the problem of Remark 8.3.5 can be
solved.
Let K be a field and f1 , . . . ,fm ∈ Khx,Y i, assume fi (y) = 0 for y ∈ (x)K[[x]].
Assume yi ∈ K[[x1 , . . . ,xvi ]] 1 ≤ v1 ≤ · · · ≤ vN ≤ n. Let c > 0 be an integer, then there
exists a y ∈ KhxiN such that
y ≡ y mod (x)c
yi ∈ Khx1 , . . . ,xvi i
fi (y) = 0, i = 1, . . . ,m.
As we will not explain here, this theorem is a consequence of Popescu’s general
theorem (cf. [Popescu 1985], [Popescu 1986, 1990]). We obtain the following corollary:
Corollary 8.3.7 (Grauert’s Theorem for Algebraic Power Series). We take the
notations of and assumptions of 8.2.1 and 8.2.2. Assume furthermore that I ⊂ C hsi
and F ∈ C hx,s,φ,ψik . Then the system of equations F ≡ 0 modulo I has an algebraic
solution (φ,ψ) with φ ≡ φ(e0 ) modulo me0 +1 , and ψ ≡ ψ (e0 ) modulo me0 +1 .
Artin’s Approximation Theorem and its generalizations have many applications. As
an application we give Artin’s answer to a question of Grauert.
Proposition 8.3.8. Let A,B,C be analytic C –algebras and w : C −→ B a homomor-
phism.
(1) Assume that for a surjective homomorphism ϕ : A −→ B the following diagram is
commutative:
B /B b
G OO OO
ϕ ϕ̂
w
A b
7/ A
ooooo
ooo
ooooo u
o
C
Then, for a given positive integer c, there exists a u : C −→ A such that ϕ ◦ u = w
and u ≡ u mod mcAb.
(2) Assume the following diagram is commutative:
Ab _?
O ?
?? u
??
??
AO ?B
~~~
v
~~w
~~
C
Then for a given positive integer c there exists a u : B −→ A such that v = u◦w and
b −→ A
u ≡ u mod mcAb. If û : B b is an isomorphism, then u is also an isomorphism.
8.3 Some other Approximation Theorems 301
in the variables y,{Aij },{Bij }. It has the formal solution (y,{αij },{β ij }). Using Artin’s
Approximation Theorem, we obtain for a given c an analytic solution y,{αij },{βij }
such that (y,{αij },{βij }) ≡ (y,{αij },{β ij }) mod (x1 , . . . ,xn )c .
We define u : C −→ A by u(zi ) = yi (x) mod (f1 , . . . ,fa ). Because of hi (y) ≡ 0
mod (f1 , . . . ,fa ) it is well-defined and because of zi (x) ≡ yi (x) mod (g1 , . . . ,gb ) we have
ϕ ◦ u = w.
To prove (2), let zi′ (x) ∈ C {x} (respectively zi′′ (y) ∈ C {y}) be representatives of v(zi )
(respectively w(zi )). Furthermore, let yi (x) ∈ C [[x]] be representatives of u(yi ). Similar
to case (1), we obtain
a
X
zi′′ y(x) − zi′ (x) = αij fj
j=1
Xa
gi y(x) = β ij fj
j=1
for suitable αij ,β ij ∈ C [[x]]. Similar to the first case, the approximation of this formal
solution defines the morphism u : B −→ A with the required properties.
We leave it as an exercise to prove that u is an isomorphism if û : B b −→ A b was
already an isomorphism.
302
This chapter is devoted to the study of hypersurface singularities. There are various
equivalence relations one can consider on the set of all functions f ∈ C {x1 , . . . ,xn }. The
two most important ones are right equivalence and contact equivalence. Two functions
f and g are called right equivalent if there exists an automorphism ϕ of C {x1 , . . . ,xn }
such that ϕ(f ) = g. They are called contact equivalent if there exists an automorphism
ϕ and a unit u in C {x1 , . . . ,xn } such that ϕ(f ) = u · g. Until now we considered contact
equivalence in this book (without using this name), but in this chapter we will mainly
consider right equivalence, because in this case the theorems are somewhat easier to
formulate and to prove. (One does not have to worry about the unit.) The aim of this
chapter is to give a beginning of the classification of functions up to right equivalence.
The first thing is to consider Finite Determinacy Theorems. Let m be the maximal ideal
of C {x1 , . . . ,xn }. A function f is called (right) k–determined if for all elements g with
f − g ∈ mk+1 the functions f and g are right equivalent. In particular, a k–determined
f is right equivalent to a polynomial of degree at most k. A function is called finitely
determined, if there exists a k ∈ N such that f is k–determined. It is maybe a little
surprising that all functions which have an isolated singularity are finitely determined.
The proof, however, is not difficult at all: it is a direct application of Newton’s Lemma,
see 3.3.34. This result is not strong enough for our purposes, however. We want to find
a small k such that f is k–determined. So we prove that, if mk+1 ⊂ m2 J(f ), then f
is k–determined. (Here J(f ) is the Jacobian ideal, that is, the ideal generated by the
partial derivatives of f .) This is what is usually called the Finite Determinacy Theorem,
and is more difficult to prove. The idea of the proof is to connect f and g via the path
f + a(g − f ). So for a = 0 we get f , and for a = 1 we get g. For all a, we want to prove
that there exists a small neighborhood of a in C such that for all b ∈ U the functions
f + b(g − f ) and f + a(g − f ) are right equivalent. In order to reach this goal, we need a
criterion of local triviality. So, consider an element F ∈ C {x1 , . . . ,xn ,t} which we view
as a family of functions with parameter t. Then
we prove that this family is trivial if
∂F ∂F ∂F
and only if ∂t ∈ (x1 , . . . ,xn ) ∂x1 , . . . , ∂xn . In order to prove this, we construct an
automorphism in the formal power series ring. The existence of a convergent one then
follows from Artin’s Approximation Theorem. We will show that this condition holds in
the family f + ag, so that compactness of [0,1] then completes the proof of the Finite
Determinacy Theorem.
As an application of the local triviality we will prove the Mather-Yau Theorem.
It says that f and g are contact equivalent, if and only if their Tjurina algebras are
isomorphic. We will also prove a corresponding statement for right equivalence. The first
reduction in the proof of the Mather-Yau Theorem is to the case that (f ) + J(f ) =
(g) + J(g). Again we connect the functions by a path f + a(g − f ). But now one has to be
more careful, because it is certainly not true that for all a ∈ C the functions f + a(g − f )
and f are contact equivalent. For example, the functions f and −f are contact equivalent,
but for a = 12 we get that f + 21 (−f − f ) = 0, which is certainly not contact equivalent to
f . But at least we will show that for all a ∈ C , except for a discrete subset, the functions
9.1 Finite Determinacy of Hypersurface Singularities 303
f and f + a(g − f ) are contact equivalent. As the complement of a discrete subset is still
path connected, this is sufficient for the proof of the Mather-Yau Theorem.
In Section 9.2 we start with the classification of functions up to right equivalence. We
first define what a simple function is. This is done by putting a topology on C {x1 , . . . ,xn }.
A function f is then called simple, if there are only finitely many equivalence classes in a
sufficiently small neighborhood of f . The goal of this chapter is to prove that the simple
singularities are given by the A-D-E–singularities. We first prove the Splitting Lemma.
This allows us to study functions which lie in m3 . Then, via a sequence of arguments, it
is shown that the A-D-E–singularities are simple, and that for nonsimple singularities
f we may assume that either n ≥ 3 and f ∈ m3 , or n = 2 and f ∈ m4 , or n = 2 and
f ∈ (x,y 2 )3 or f is nonisolated.
In Section 9.3, we then show that functions with one of these properties are not
simple. The idea is to use the Finite Determinacy Theorem, to reduce the study of right
equivalence classes to the study of orbits under a group action in C N for some N ≫ 0.
In fact, we study the group action on the k–jets, which are the Taylor series of f up to
order k + 1. These orbits are orbits under the action of an algebraic group. This means
that all the actions are given by polynomial functions. We then apply some algebraic
geometry to prove that orbits of algebraic group actions are locally closed in the Zariski
topology. (That is, the intersection of an open and closed subset.) As all orbits contain
a smooth point, we can use the group action to show that all points of the orbit are
smooth. The next thing to do is to compute the tangent space. We will see that it is
equal to mJ(f ) + mk+1 /mk+1 . In particular, we get the dimension of the orbit of f . We
deduce that if g ∈ / mJ(f ), then in the family f + t · g there are only finitely many t such
that f + tg is right equivalent to f . The knowledge of the dimension of the orbits is our
main tool for showing that certain functions are not simple.
Definition 9.1.1. Let f, g ∈ m ⊂ C {x1 , . . . ,xn }, where m is, as usual, the maximal
ideal.
(1) One says that f is right equivalent to g, f ∼ g, if there exists an automorphism ϕ
R
of C {x1 , . . . ,xn } such that ϕ(f ) = g.
(2) One says that f is contact equivalent to g, f ∼ g, if there exists an automorphism
C
ϕ of C {x1 , . . . ,xn } such that (ϕ(f )) = (g), that is, there exists a unit u such
that ϕ(f ) = ug. Thus f and g are contact equivalent exactly if the C –algebras
C {x1 , . . . ,xn }/(f ) and C {x1 , . . . ,xn }/(g) are isomorphic. This is exactly the same
as saying that the germs of the analytic hypersurfaces defined by f and g are
isomorphic.
In this chapter, we will mainly work with right equivalence. Similar results hold for
contact equivalence, but the formulation and proofs will be left to the reader.
Definition 9.1.2. Let f ∈ m ⊂ C {x1 , . . . ,xn }. Then f is called k–determined if all
g ∈ C {x1 , . . . ,xn } with f − g ∈ mk+1 are right equivalent to f . If f is k–determined for
some k ∈ N, then f is called finitely determined. In particular, a k–determined f is right
equivalent to a polynomial.
304 9 Simple Hypersurface Singularities
The fact that functions which have an isolated singularity are finitely determined
follows quite easily from Newton’s Lemma 3.3.34, as we will show now.
Theorem 9.1.3. Let f ∈ m2 ⊂ C {x1 , . . . ,xn }. Suppose mk+1 ⊂ mJ(f )2 . Then f is
k–determined. In particular, functions with isolated singularities are finitely determined.
Proof. Let g ∈ m such that f − g ∈ mk+1 . Consider the following system of equations,
which we want to solve for y1 , . . . ,yn ∈ C {x1 , . . . ,xn }:
(2) There exists a ϕ = (ϕ1 , . . . ,ϕn ) ∈ C {x1 , . . . ,xn ,t}n such that
(a) ϕi (x1 , . . . ,xn ,0) = xi ,
(b) ϕi − xi ∈ (x1 , . . . ,xn )c ,
(c) F (ϕ1 , . . . ,ϕn ,0) = F (x1 , . . . ,xn ,t).
∂F ∂F ∂F
Corollary 9.1.6. Suppose that ∂t ∈ (x1 , . . . ,xn ) ∂x1 , . . . , ∂xn for some
F ∈ C {x1 , . . . ,xn ,t}. Then there exists a small neighborhood U of 0 in C such that
for all fixed a ∈ U the function Fa := F (x1 , . . . ,xn ,a) is right equivalent to F0 .
9.1 Finite Determinacy of Hypersurface Singularities 305
Proof. Consider the elements ϕ1 , . . . ,ϕn ∈ C {x1 , . . . ,xn ,t} which exist according to 9.1.5.
There exists a smallopen neighborhood
U of 0 in C , so that for all a ∈ U , the ϕi (x,a) are
∂ϕi (x,a)
convergent and det ∂xj (0) 6= 0. Thus for fixed a ∈ U , (ϕ1 , . . . ,ϕn ) is an automor-
phism of C {x1 , . . . ,xn }. Moreover, formula (c) in Theorem 9.1.5 says that it transforms
F (x1 , . . . ,xn ,0) into F (x1 , . . . ,xn ,a).
Proof of Theorem 9.1.5.
Step 1. We first prove the easy implication (2) =⇒ (1). As ϕi (x1 , . . . ,xn ,0) = xi it
follows easily from the Inverse Function Theorem that (ϕ1 , . . . ,ϕn ,t) is an automorphism
of C {x1 , . . . ,xn ,t}. Let ψ = (ψ1 , . . . ,ψn ,t) be its inverse. Then ψi − xi ∈ (x1 , . . . ,xn )c , as
one easily checks.
By assumption, we have F (ϕ,0) = F (x,t). Compose this with ψ. Then F (x,0) =
F (ψ,t). Hence the derivative of F (ψ1 , . . . ,ψn ,t) with respect to t is zero. Thus
n
X ∂F ∂ψi ∂F
(9.1) (ψ1 , . . . ,ψn ,t) + (ψ1 , . . . ,ψn ,t) = 0.
i=1
∂xi ∂t ∂t
Xn
∂F ∂ψi ∂F
(9.2) · (ϕ1 , . . . ,ϕn ,t) + = 0.
i=1
∂xi ∂t ∂t
We already noted that ψi −xi ∈ (x1 , . . . ,xn )c . But then also ∂ψ ∂t (ϕ1 , . . . ,ϕn ,t) ∈ (x1 , . . . ,xn )
i c
∂
Pn ∂
Consider the differential operator δ := − ∂t + i=1 ξi ∂xi . Let s be a new variable and
consider the map
Put Φi := Φ(xi ) for i = 1, . . . ,n, and Φn+1 = Φ(t). Note that Φn+1 = Φ(t) = t − s. Now
define ϕi := Φi (x1 , . . . ,xn ,t,t) for i = 1, . . . ,n. We have to show that these ϕi satisfy our
conditions.
(1) The first condition is easy. Indeed, ϕi (x1 , . . . ,xn ,0) by definition is equal to δ 0 (xi ) =
xi .
(2) The second condition is also not too difficult. Indeed from the first part we have
X
1 k k
Φ(xi ) = xi + k! δ (xi )s .
k≥1
From the definition of δ, and the fact that the ξi ∈ (x1 , . . . ,xn )c it follows immediately
that Φ(xi ) − xi ∈ (x1 , . . . ,xn )c . The second condition now follows.
(3) The third condition is the most difficult one. The crucial thing to show is
∂F (Φ1 , . . . ,Φn+1 )
(9.3) = 0,
∂s
that is, F (Φ1 , . . . ,Φn ,Φn+1 ) does not depend on s. Indeed, assuming (9.3) it follows that
F (ϕ1 , . . . ,ϕn ,0) = F (Φ1 , . . . ,Φn ,Φn+1 )s=t =
F (Φ1 , . . . ,Φn ,Φn+1 )s=0 = F (x1 , . . . ,xn ,t).
The first equality holds by definition of the ϕi , the second equality because of (9.3). The
third equality is because Φi |s=0 = xi for i ≤ n, and Φn+1 |s=0 = t.
It therefore remains to show (9.3). We have the following formulae:
∂Φi
(9.4) = ξi (Φ1 , . . . ,Φn ,Φn+1 ),
∂s
∂Φn+1
(9.5) = −1,
∂s
(9.6) δ(F ) = 0.
The second and the third formula follow immediately from the fact that Φn+1 = t − s,
and the definition of δ. To prove formula (9.4), consider an arbitrary a ∈ C [[x1 , . . . ,xn ,t]],
and differentiate:
∞ ∞
∂Φ(a) X 1 X 1
= δ k (a)sk−1 = δ k−1 (δ(a))sk−1 = Φ(δ(a)).
∂s (k − 1)! (k − 1)!
k=1 k=1
For all a we will show that there exists an open neighborhood U of a such that for all
a+t∈ U
(9.7) f + (a + t) · g ∼ f + a · g.
R
Because the unit interval [0,1] is compact we can cover the unit interval with finitely
many U . As right equivalence is an equivalence relation, it then follows that f = F0 is
right equivalent to F1 = f + g.
Step 2. To prove (9.7) we will apply Corollary 9.1.6, and it therefore suffices to prove
∂Fa 2 ∂Fa ∂Fa
(9.8) ∈m ,..., .
∂t ∂x1 ∂xn
2
By applying Artin’s Approximation Theorem, it suffices to find ξP i ∈ m C [[x1 , . . . ,xn ]]
k+1 n ∂f
satisfying (9.9). Given h ∈ m , we can by assumption write h = i=1 ξi ∂x i
, ξi ∈ m2 .
But then
Xn Xn
∂F ∂g
(9.10) h= ξi − ξi .
i=1
∂xi i=1
∂xi
∂g
As g ∈ mk+1 and ξi ∈ m2 it follows that ξi ∂x i
in mk+2 . By assumption mk+1 ⊂ m2 J(f ),
so that we can write
n
X n
X (1) ∂f
∂g (1)
(9.11) ξi = ξi , ξi ∈ m3 .
i=1
∂xi i=1
∂xi
(This is the reason we need mk+1 ⊂ m2 J(f ), as the condition mk+1 ⊂ mJ(f ) does not
suffice.) Plugging (9.11) in (9.10) we get
n
X n
X (1) ∂g n
X
(1) ∂F (1) ∂g
h= (ξi + ξi ) − ξ with ξi ∈ mk+3 .
i=1
∂xi i=1 i ∂xi i=1
∂xi
308 9 Simple Hypersurface Singularities
Xn
∂(f + ag)
g= ξi , for some ξi ∈ m2 .
i=1
∂xi
This implies
X ∂Fa n X ∂g n
∂Fa
(9.12) =g= ξi −t· ξi .
∂t i=1
∂xi i=1
∂xi
∂g
As g ∈ mk+1 , it follows that ξi ∂x i
∈ mk+2 . Thus we can write
n
X n
X (1) ∂(f + ag)
∂g (1)
(9.13) ξi = ξi , ξi ∈ m3 .
i=1
∂xi i=1
∂xi
X n
∂Fa (1) ∂Fa
g= = (ξi − tξi ) mod t2 mk+3 .
∂t i=1
∂xi
Step 5. For the final statement we apply Exercise 1.3.20. It follows that mk−1 ⊂ J(f ),
from which mk+1 ⊂ m2 J(f ) follows. So we can apply the first part of the theorem.
Theorem 9.1.5 can be generalized as follows:
Theorem 9.1.7 (Characterization of Local Analytic Triviality).
Let f ∈ C {x1 , . . . ,xn ,y1 , . . . ,ym } and c ≥ 0 be an integer. The following conditions are
equivalent:
∂f c ∂f ∂f
(1) ∂y i
∈ (x1 , . . . ,xn ) ∂x1 , . . . , ∂xn + (f ) for i = 1, . . . ,m.
(2) There exist ϕ1 , . . . ,ϕn ,u ∈ C {x1 , . . . ,xn ,y1 , . . . ,ym } such that
The proof is similar to the proof of 9.1.5 and will be left as Exercise 9.1.15. We now
come to the Theorem of Mather and Yau.
Theorem 9.1.8 (Mather-Yau). Let f and g ∈ m = (x1 , . . . ,xn ) ⊂ C {x1 , . . . ,xn }. Then
the following conditions are equivalent:
(1) C {x1 , . . . ,xn }/(f ) ∼
= C {x1 , . . . ,xn }/(g), that is, f and g are contact equivalent.
(2) C {x1 , . . . ,xn }/ (f ) + mJ(f ) ∼ = C {x1 , . . . ,xn }/ (g) + mJ(g) as C –algebras.
If f and g define isolated singularities these conditions are equivalent to
(3) C {x1 , . . . ,xn }/ (f ) + J(f ) =∼ C {x1 , . . . ,xn }/ (g) + J(g) as C –algebras.
To put it in another way, the function f and g are contact equivalent if and only if the
Tjurina algebras of f and g are isomorphic.
Remark 9.1.9. It is not true that functions are right equivalent if and only if their Milnor
algebras are isomorphic. Indeed, consider the family of functions ft = x4 + y 5 + t · x2 y 3 .
Then one can show, see Exercise 9.1.16:
(1) ft ∼ f1 ⇐⇒ t = a2 b2 , a4 = 1, b5 = 1.
R
5
(2) Consider the automorphism ϕα of C
{x,y} defined by ϕα (x) = α x, and ϕα (y) =
4 2 −1
α y, with α = t . Then ϕα J(ft ) = J(f1 ).
To find the correct theorem for the case of right equivalence, we consider the Milnor
algebra C {x1 , . . . ,xn }/J(f ) as C {t}–algebra with multiplication t · h := f h for h ∈
C {x1 , . . . ,xn }/J(f ).
Theorem 9.1.10 (Mather-Yau for Right Equivalence).
Let f and g ∈ m = (x1 , . . . ,xn ) ⊂ C {x1 , . . . ,xn }. Then the following conditions are
equivalent:
(1) f ∼ g.
R
ψ
C {x1 , . . . ,xn } / C {x1 , . . . ,xn }
ϕ
C {x1 , . . . ,xn }/ (f ) + J(f ) / C {x1 , . . . ,xn }/ (g) + J(g)
commutes, that is, ψ (f ) + J(f ) = (g) + J(g). Now let h
:= ψ(f ). Then (h) + J(h) =
(g) + J(g), because ψ (f ) + J(f ) = (ψ(f )) + J(ψ(f )) , see 3.4.27. This is what we
wanted to show.
Step 2. We may now assume that (f ) + J(f ) = (g) + J(g). Let h := f − g and F :=
g + th ∈ C {x1 , . . . ,xn ,t}. So we are connecting f and g by a (complex) path: for t = 0 we
have g, and for t = 1 we have f . Unlike in the proof of the Finite Determinacy Theorem,
it is not true however that for all a, the functions g and g + ah are contact equivalent.
In this step we will show that at least for t small the functions g and g + th are contact
equivalent. From (f ) + J(f ) = (g) + J(g), it follows that (h) + J(h) ⊂ (g) + J(g). Hence
Xn
∂F ∂g
= h = ug + ξi
∂t i=1
∂xi
n n
!
X ∂F X ∂h
= uF + ξi − t uh − ξi
i=1
∂xi i=1
∂xi
n n
!
X ∂F X (1) ∂g
= uF + ξi + t u(1) g + ξi
i=1
∂xi i=1
∂xi
for suitable u,ξ i ∈ C [[x1 , . . . ,xn ,t]]. Using Artin’s Approximation Theorem 8.1.1 we ob-
tain
∂F ∂F ∂F
∈ F, ,..., .
∂t ∂x1 ∂xn
Now we can apply Theorem 9.1.7 to obtain
ϕ F (x1 , . . . ,xn ,0) = wF (x1 , . . . ,xn ,t)
for a suitable map ϕ = (ϕ1 , . . . ,ϕn ,t) : C {x1 , . . . ,xn ,t} −→ C {x1 , . . . ,xn ,t} and w a unit
in C {x1 , . . . ,xn ,t}. Now we want to prove that for small t fixed the map (ϕ1 , . . . ,ϕn ,t)
is an automorphism of C {x1 , . . . ,xn }. This is the place where we need the assumption
that f and g define isolated singularities.1
At least the map (ϕ1 , . . . ,ϕn ,t) is an automorphism of C {x1 , . . . ,xn ,t}, so has an
inverse (ψ1 , . . . ,ψn ,t), so that
Indeed, if this were true, then (h) + J(h) ⊂ (g) + J(g) = (g + ah) + J(g + ah) and we can
redo the argument of Step 2. Now (9.14) does not hold for all a ∈ C , but we will show
that it holds for almost all a ∈ C . To prove (9.14) for almost all a ∈ C , we use the theory
of coherent sheaves. Let U ⊂ C n be an open subset such that f and g converge on U and
∂G ∂G
0 ∈ U . Define G := g + th, and J(G) = ∂x 1
, . . . , ∂xn . We have (G) + J(G) ⊂ (g) + J(g).
The ideal sheaves (G) + J(G) and (g) + J(g) in OU×C are coherent sheaves. Therefore,
the quotient M := (g) + J(g) /((G) + J(G)) is a coherent sheaf. Supports of coherent
analytic sheaves are analytic sets by 6.2.5, so that Z := supp(M ) is an analytic subset of
U × C . Therefore, the intersection Ze := Z ∩ ({0} × C ) is an analytic subset of {0} × C .
Now Ze is the set of all points (0,a) such that M(0,a) 6= (0), and therefore includes the
set of all points for which the inclusion of stalks (G) + J(G) ⊂ (g) + J(g) is strict. As
an analytic subset of {0} × C , it is either {0} × C or a discrete set (Identity Theorem
3.1.10). But obviously (0,0) 6∈ Z, e therefore, Z e is discrete. This proves (9.14) for all a
except for a discrete subset of C . This discrete subset we call D.
Step 4. Because of (f ) + J(f ) = (g) + J(g), we see that 1 ∈
/ D. Choose a path γ : [0,1] −→
C \ D from 0 to 1. For all a ∈ γ ([0,1]), we have that there exists an open neighborhood
U of a such that for all b ∈ U , the function f + bh is contact equivalent to f + ah by
Step 3. As γ([0,1]) is compact, we can cover γ([0,1]) by a finite number of U ′ s. It follows
that f and f + g − f = g are contact equivalent.
Remark 9.1.11. The Morse Lemma follows from the Mather-Yau Theorem. As for
g = x21 + · · · + x2n we have J(g) = (x1 , . . . ,xn ) we have that for all f with J(f ) = m are
right equivalent to x21 + · · · + x2n . The C {t} module structure can only be trivial, that
is, multiplication with t is the zero map. Of course, this is a very difficult proof of the
Morse Lemma.
312 9 Simple Hypersurface Singularities
Exercises
9.1.12. Suppose that f and g are right equivalent. Let k ∈ N. Suppose that f is k–determined.
Prove that g is k–determined.
9.1.13. Show by counterexample that the statement mk+2 ⊂ m3 J(f ) does not imply that f is
k–determined in general.
9.1.14. Let R be a commutative ring and u ∈ R[[t]]. Prove that for given a ∈ R the differential
equation
dy
=u·y
dt
P y ∈ R[[t]]
has a unique solution Pwith y(0) = a.
(Hint: Write u = uk tk , y = yk tk and compare both sides of the equation.)
k k
Use Exercise 9.1.14 to choose s ∈ C [[x1 , . . . ,xn ,y1 , . . . ,ym−k ,t]] such that
∂s
= v(ϕ1 , . . . ,ϕn ,y1 , . . . ,ym−k−1 ,ym−k − t,0, . . . ,0) · s,
∂t
and s(t = 0) = 1. Use Exercise 9.1.14 again to prove that
f (ϕ1 , . . . ,ϕn ,y1 , . . . ,ym−k−1 ,ym−k − t,0, . . . ,0) = s · f (x1 , . . . ,xn ,y1 , . . . ,ym−k ,0, . . . ,0).
f (ϕ1 , . . . ,ϕn ,y1 , . . . ,ym−k−1 ,0, . . . ,0) = s · f (x1 , . . . ,xn ,y1 , . . . ,ym−k ,0, . . . ,0).
(k+1)
Use the ϕi , s to define ϕi ,u(k+1) .
9.1.18. Let g and f in m ⊂ C {x1 , . . . ,xn } be given such that J(g) ⊂ m2 J(f ). Prove that f and
g are right equivalent.
(Hint: Follow the steps in the proof of the Finite Determinacy Theorem 9.1.4.)
G × On −→ On ; (G,f ) 7→ G(f ).
(3) We can view G as a subset of ⊕ni=1 m. Indeed if G ∈ G then put gi := G(xi ). Then
G is uniquely determined by the element (g1 , . . . ,gn ), see Exercise 3.1.22. We now
define
j k g ∈ Gk (j k f ) ⇐⇒ g is right equivalent to f.
We define a topology on C {x1 , . . . ,xn } as the coarsest topology such that all πk are
continuous.
Thus a basis for the topology for On is given by πk−1 (V ) for V ⊂ Jk . Informally
speaking, a power series is near zero when “some of its coefficients are small”, and the
coefficients of monomials which are not small are in a high power of the maximal ideal m.
On G we similarly define a topology. It is the topology induced by the product topology
of mn . The action of G on On is continuous. We omit the boring proof, as we do not
need the result.
Definition 9.2.5. We consider the action of G on On . Let f ∈ m ⊂ On . Then f is called
simple, if there exists an open neighborhood U of f , such that the number of orbits which
intersect U is finite.
Remark 9.2.6. There is a different way to phrase the definition of simple. Namely,
consider families
F (x,t) ∈ C {x1 , . . . ,xn ,t}
such that F (x,t) ∈ (x1 , . . . ,xn ), and F (x,0) = f . Families with this property are also
called deformations. Then f is called simple, if there exist finitely many right equivalence
classes G1 , . . . ,Gs such that for all families F (x,t) as above and t small and fixed the
germ Ft ∈ C {x1 , . . . ,xn } is right equivalent to one of the germs in G1 , . . . ,Gs . In case
that f is finitely determined, the proof that these two definitions are equivalent is left as
Exercise 9.2.16.
We will see later on, that simple germs have an isolated singularity. In fact, we
have the following Classification Theorem, due to Arnold, whose proof will occupy the
remainder of this chapter.
Theorem 9.2.7 (Classification of Simple Singularities). Suppose that f ∈ On is simple.
Then f is right equivalent to one of the singularities in the following list.
9.2 The A-D-E–singularities are simple. 315
In this section we will show that the A-D-E–singularities of Arnold’s list are simple.
In the next section we will show that there are no other simple singularities.
We first define the corank of a function. It is the coarsest invariant of right equiva-
lence classes.
Definition 9.2.8. Let f ∈ m2 ⊂ C {x1 , . . . ,xn }. The Hesse matrix of f is defined by
2
∂ f
H(f ) = (0) .
∂xi ∂xj 1≤i,j≤n
(1) Let f ∈ C {x1 , . . . ,xn }, with f ∈ m2 and Corank(f ) equal to s. Suppose that f is
finitely determined. Then f is right equivalent to an element of type
As the Corank is s, it follows that none of the Hij for 1 ≤ i,j ≤ s is a unit, hence Hij ∈ m.
We may rewrite (9.15) as
n
X
x2s+1 + . . . + x2n + g(x1 , . . . ,xs ) + xi Gi (x1 , . . . ,xn ) =
i=s+1
n
X
g+ (xi + 12 Gi (x1 , . . . ,xn ))2 − 41 Gi (x1 , . . . ,xn )2 .
i=s+1
As Hij ∈ m, it follows that Gi ∈ m2 for all i, and g(x1 , . . . ,xs ) ∈ m3 . We can therefore
do the following coordinate change:
After this coordinate change we have that f is right equivalent to a germ of type
n
X
(9.16) x2s+1 + . . . + x2n + g(x1 , . . . ,xs ) + xi Gi (x1 , . . . ,xn ).
i=s+1
with Gi ∈ m3 . (Both the g and the Gi may have changed in this process). So we im-
proved the situation. Now we have Gi ∈ m3 , which is a stronger statement than Gi ∈ m2 .
We can continue like this, and we obtain that f is right equivalent to a germ of type
(9.16) with Gi ∈ mk . As f is k–determined it follows from Exercise 9.1.12 that (9.16) is
also k–determined. Thus (9.16) is right equivalent to its k–jet, and the first part of the
theorem follows.
9.2 The A-D-E–singularities are simple. 317
(2) If g1 and g2 are right equivalent, then obviously g1 + x2s+1 + . . . + x2n and g2 +
x2s+1 + . . . + x2n are right equivalent. The converse follows without difficulty from the
Mather-Yau Theorem for right equivalence, see 9.1.10, but let us give a direct argument.
Suppose therefore that f1 = g1 (x1 , . . . ,xs ) + x2s+1 + . . . + x2n is right equivalent to f2 =
g2 (x1 , . . . ,xs ) + x2s+1 + . . . + x2n . Hence there exist
ϕ1 , . . . ,ϕn ∈ C {x1 , . . . ,xn }, with
∂li
ϕi = li + hi , hi ∈ m2 and li linear forms with det ∂xj 6= 0 such that
n
X n
X
(9.17) g1 (ϕ1 , . . . ,ϕs ) + ϕ2i = g2 (x1 , . . . ,xs ) + x2i .
i=s+1 i=s+1
Pn 2
Pn Pn
Note that i=s+1 li + i=s+1 hi (2li + hi ) = i=s+1 ϕ2i . By assumption, g1 ,g2 ∈ m3 , so
that
n
X n
X
(9.18) li2 = x2i .
i=s+1 i=s+1
Now suppose we can find ψs+1 , . . . ,ψn ∈ C {x1 , . . . ,xs } such that
ϕ′1 := ϕ1 (x1 , . . . ,xs ,ψs+1 , . . . ,ψn ), . . . , ϕ′s := ϕs (x1 , . . . ,xs ,ψs+1 , . . . ,ψn ).
This is in fact an automorphism. By the Inverse Function Theorem it suffices to show that
(ϕ′1 , . . . ,ϕ′s ) = (x1 , . . . ,xs ). Now (ϕ1 , . . . ,ϕs ,ls+1 + 21 hs+1 , . . . ,ln + 12 hn ) is an automor-
phism of C {x1 , . . . ,xn }, again by the Inverse Function Theorem. Hence (ϕ1 , . . . ,ϕs ,ls+1 +
1 1
2 hs+1 , . . . ,ln + 2 hn ) is the the maximal ideal of On . It follows that for all k with 1 ≤ k ≤ s
we can write
X s Xn
1
xk = ξν ϕν + ξν lν + hν , k = 1, . . . ,s.
ν=1 ν=s+1
2
follows from formulae (9.17), (9.18) and (9.19) so that we have an automorphism of
C {x1 , . . . ,xs } transforming g1 into g2 .
In order to construct the ψi , we therefore have to solve the system of equations
Fi = 0 with
Fi := 2li + hi , for i = s + 1, . . . ,n.
Now
∂Fi ∂li
det (0) = 2 det .
∂xj i,j≥s+1 ∂xj i,j≥s+1
318 9 Simple Hypersurface Singularities
Therefore, if we show that the right hand side is nonzero, we can apply
the Implicit
∂li
Mapping Theorem to solve this system of equations. The fact that det ∂x j
is
i,j≥s+1
nonzero, is linear algebra, and treated in Exercise 9.2.21.
(3) First suppose that f is simple. We have to prove that f P + x2s+1 + . . .+ x2n is simple too.
n
Consider Ft ∈ (x1 , . . . ,xn )C {x1 , . . . ,xn ,t} with F0 = f + i=s+1 x2i . Let k ≥ µ(f ) + 1.
For t small we have µ(Ft ) ≤ µ(f ). Then for all t small, Ft is k–determined by the Finite
Determinacy Theorem 9.1.4. We redo the argument of the first step simultaneously for
the whole family Ft . For t = 0, we get a coordinate change (in fact the identity), so that
n
for all small t we have a coordinate Pnchange2of (C , 0). As Ft is k–determined for Pall small
n
t we get a family Gt (x1 , . . . ,xs )+ i=s+1 xi , with G0 = f and Gt (x1 , . . . ,xs )+ i=s+1 x2i
is right equivalent to Ft for t small. As f is simple, there exist only finitely many right
equivalence classes in which Gt (x1 , . . . ,xs ) can lie. Hence there exist only finitely many
equivalence classes in which Ft can lie.
The converse statement is left to the reader.
Corollary 9.2.11. Let f ∈ m2 with µ(f ) < ∞, and Corank(f ) = 1. Then there exists a
k ≥ 2 such that f is right equivalent to Ak , that is, xk+1
1 + x22 + . . . + x2n . Moreover, the
Ak singularities are simple.
Proof. By the Splitting Lemma, we only have to consider a function in one variable
f (x) ∈ C {x}. If f (x) = 0 then µ(f ) = ∞. As this is not the case, there exists a unique
k with f (x)√= xk+1 u, where u is a unit in C {x}. By the Implicit Function Theorem, the
function k+1 u exists, is holomorphic and a unit in C {x}. We do the coordinate change
√ k+1
x′ = k+1 ux. In this coordinate f (x′ ) = x′ . This proves that f is right equivalent
to Ak . If Ft is a deformation of f , then µ(Ft ) ≤ µ(f ) for all t small. It follows that
an Ak singularity only deforms into an Al singularity with l ≤ k. In particular, the Ak
singularities are simple.
Now we turn our attention to functions of corank two. By the Splitting Lemma, we
only have to consider f (x,y) ∈ (x,y)3 ⊂ C {x,y}.
Proposition 9.2.12. Let f ∈ C{x,y} and f ∈ m3 . After a linear coordinate change we
may assume that j 3 f is of one of the following types.
1. j 3 f = xy(x + y), the zero set of j 3 f consists of three different lines.
2. j 3 f = x2 y, two lines coincide.
3. j 3 f = x3 , the three lines coincide.
4. j 3 f = 0.
Suppose Ft is a holomorphic family with F0 = f and Ft ∈ m2 for all small t. Then for
all t 6= 0 and small we have for the types (1), (2), resp (3):
Proof. Write
j 3 f = ax3 + bx2 y + cxy 2 + dy 3 .
Case 1. a = d = 0. Then j 3 f = xy(bx + cy).
(i) If b = c = 0 we get 4.
(ii) Suppose b = 0 or c = 0, but not both. Without loss of generality, let c = 0. Then
j 3 f = xy(bx). As b 6= 0, we can do the coordinate change y ′ = by. Then j 3 f = x2 y ′
which is 2.
(iii) Both b and c are nonzero. After the√coordinate change
√ x′ = bx, y ′ = cy we may
′
assume that b = c. By putting x = b · x, y = b · y we get j 3 f = x′ y ′ (x′ + y ′ )
3 ′ 3
which is 1.
Case 2. Suppose a 6= 0 or d 6= 0. Without
√ loss of generality, we may assume that a 6= 0
and, after the coordinate change x′ = 3 a · x, even a = 1. We factorize j 3 f :
j 3 f = x · (x − βy)(x − γy).
(i) If β = γ = 0 we get 3.
(ii) Suppose that β 6= 0 or γ 6= 0. Without loss of generality β 6= 0. Take y ′ = x − βy.
Then j 3 f = x′ y ′ (x′ − δy ′ ) for some δ, and we are back in Case 1.
This proves the first part of the proposition. The second part of the proposition follows
from the fact that the corank can only drop, see 9.2.9, and the fact that the zeros of the
polynomial j 3 (Ft (x,1)) depend continuously on t.
Proposition 9.2.13. Let f ∈ C {x,y}, µ(f ) < ∞.
(1) Suppose j 3 f = xy(x + y). Then f is right equivalent to D4 .
(2) Suppose j 3 f = x2 y. Then there exists a k ≥ 5 such that f is right equivalent to Dk .
(3) The germs Dk for k ≥ 4 are simple.
Proof. (1) Suppose that j 3 f = xy(x + y). It is an exercise to show that from the Finite
Determinacy Theorem it follows that xy(x + y) is 3–determined. It follows without dif-
ficulty that f is right equivalent to D4 .
j s f = x2 y + ay s + bxy s−1 + x2 ϕ,
for a suitable ϕ ∈ ms−2 , and a,b ∈ C . This we certainly have for s = 4. We do the
coordinate transformation x′ = x + 21 by s−2 , y ′ = y + ϕ. Then a simple calculation shows
2 s
that j s f = x′ y ′ + ay ′ . Renaming x′ to x and y ′ to y we may therefore suppose that
s 2 s
j f = x y + ay . There are two cases to consider.
320 9 Simple Hypersurface Singularities
√ 1
Case 1. We have a 6= 0. Do the coordinate change y ′ = s a · y, x′ = 2s√ a
x. Then
s 2 s 2 s
j f = x y + y . As x y + y is s–determined (exercise), it follows that f is a Ds+1 sin-
gularity.
Case 2. We have a = 0. In this case we write down j s+1 f . Iterate the above coordinate
changes until we come in Case 1, or until s ≥ k. We will show that s ≥ k leads to a
contradiction. As f is k–determined it follows that f is right equivalent to j s f = x2 y+ay s .
If a = 0, then f does not have an isolated singularity, contrary to our assumption.
(3) Consider D4 , and a holomorphic family Ft with F0 = f = xy(x + y). We have that
µ(Ft ) < ∞. If for fixed t small, the corank of Ft is zero, then Ft has an A1 –singularity by
the Morse Lemma. If the corank is one, then Ft is right equivalent to Ak for some k by
9.2.11. As µ(Ft ) ≤ µ(D4 ), it follows that k ≤ 4.5 Hence D4 can only deform in finitely
many germs. If Corank(Ft ) = 2, it follows that j 3 Ft = 0 consists of three different lines
by 9.2.12. Hence there exist coordinates such that j 3 Ft = xy(x + y) for fixed t. Hence,
by what we just proved, j 3 Ft is right equivalent to D4 . This shows that D4 can deform
only in finitely many germs.
The proof that the Dk for k ≥ 5 are simple, is similar.
So in view of Proposition 9.2.12, the case to consider now are functions f ∈ C {x,y}
whose 3–jet is x3 .
Proposition 9.2.14. Let f ∈ C {x,y} with µ(f ) < ∞. Suppose j 3 f = x3 . Then either
(1) f is right equivalent to E6 , E7 or E8 or
(2) f is right equivalent to a function which lies in the ideal (x,y 2 )3 = (x3 ,x2 y 2 ,xy 4 ,y 6 ).
Moreover, E6 , E7 and E8 are simple.
Proof. Suppose j 3 f = x3 . We write down the four jet.
j 4 f = x3 + ay 4 + bxy 3 + x2 ϕ,
j 5 f = x3 + xy 3 + cy 5 + xϕ,
9.2.19.
(1) Let f be a germ of corank s. Prove that µ(f ) ≥ 2s .
(2) Let f ∈ m4 ⊂ C {x,y}. Show that µ(f ) ≥ 9.
(3) Let f ∈ (y,x2 )3 ⊂ C {x,y}. Prove that µ(f ) ≥ 10.
Conclude that germs with µ(f ) < 8 are simple.
9.2.21. In this exercise we complete proof of the second part of the Splitting Lemma 9.2.10.
n
X n
X
(9.20) li2 = x2i .
i=s+1 i=s+1
To complete the proof of the Splitting Lemma it suffices to show det(C) = 1. Prove this.
(Hint: Use (9.20) to show that C is orthogonal.)
9.2.22. Let f ∈ C {x,y} with j 3 f = xy(x + y). Prove, without using the Finite Determinacy
Theorem, that f is right equivalent to D4 .
9.3 Orbits
In order to show that functions with one of the properties mentioned in Remark 9.2.15
are not simple, we need to study orbits of group actions more carefully. First a definition.
Definition 9.3.1. An affine algebraic group G is an affine variety G, together with
regular maps
• µ : G × G −→ G,
• ι : G −→ G,
which satisfy the groups axioms.
Examples 9.3.2.
(3) Consider the group of GLn (k) of invertible n×n matrices. This is an algebraic group.
2
Indeed it is an affine algebraic set, because it is equal to k n \ {det(aij ) = 0}, so as
a complement of an affine hypersurface it is itself affine.
(4) The group Gk is also an algebraic group because, as in the previous example, it is
the complement of an affine hypersurface.
9.3 Orbits 323
Definition 9.3.3. An affine algebraic group acts on an affine variety X, if there exists
a regular map
ϕ : G × X −→ X : (g,x) 7→ g · x
such that (g1 g2 ) · x = g1 · (g2 · x).
Example 9.3.4. The action of Gk on Jk .
The main result we need concerning group actions is the following theorem.
Theorem 9.3.5. Let G be an affine algebraic group, acting on X. Then
(1) Every orbit G · x is locally closed in the Zariski topology.6
(2) Every orbit is smooth.
(3) The boundary of each orbit is a union of orbits of smaller dimension.
The proof will occupy the first part of this section. In fact we need to do some more
algebraic geometry, in particular, we need to study so-called constructible sets.
Definition 9.3.6. Let X be a topological space, Z ⊂ X a subset. Z is called constructible
if it is a finite union of locally closed subsets: Z = ∪ni=1 Ui ∩ Fi , with Ui open and Fi
closed.
Example 9.3.7. Consider C 2 with the Zariski topology, and let Z = U1 ∪ F2 , where U1
is defined by U1 := {(x,y) : x 6= 0} and F2 = {(x,y) : y = 0}.
Proof of Lemma 9.3.9. Consider Y = X \ U . Then Y is Zariski closed. Consider the ideal
I (Y ) ⊂ K[X]. Write I (Y ) = (f1 , . . . ,fs ) ⊂ K[X] and define Ui := X \ V (fi ). Then Ui
is open and affine. We claim that U = ∪ni=1 Ui . As Ui ⊂ U , the inclusion ∪ni=1 Ui ⊂ U is
obvious.
For the other inclusion, suppose we have an element a ∈ U \ ∪ni=1 Ui . Then for all i,
we have a ∈/ Ui . Therefore fi (a) = 0 for all i. Hence a ∈ V (f1 , . . . ,fs ) = V (I (Y )) = Y .
Hence a ∈/ U , a contradiction.
Theorem 9.3.11 (Chevalley). Let X,Y be affine, and ϕ : X −→ Y be a morphism.
Then ϕ maps constructible sets in X to constructible sets in Y .
Proof. It suffices to prove that a locally closed subset of X is mapped to a constructible
set of Y . Let U ∩ F , with U open and F closed, be a locally closed subset. By Lemma
9.3.9, we can write U as a finite union of open affines. Therefore, it suffices to prove the
theorem for the case X = U , that is we have to show that the image of a closed subset
F of X is constructible. As F now is a finite union of irreducible varieties, it suffices to
prove the theorem for the case X = F is irreducible. So we have to show that the image
of X is constructible. As ϕ(X) is closed, it suffices to show that ϕ(X) is a finite union
of locally closed subsets of ϕ(X). Thus we may suppose Y = ϕ(X), and Y is irreducible.
The claim now follows by induction on dim(Y ). The case dim(Y ) = 0 is trivial. So assume
that the theorem has been proved for all Y ′ with Y ′ $ Y . As ϕ(X) = Y , the image ϕ(X)
contains a nonempty open subset U , see Theorem 2.3.12. Consider Y \ U = Z1 ∪ · · · ∪ Zs .
The set ϕ−1 (Zi ) is a closed subset of X, and has finitely many irreducible components.
Let Wij be such a component. By restriction we get a map ϕ : Wij −→ Zi . By induction,
the image is constructible. Therefore ϕ(X) is the union of U and a constructible set, and
therefore constructible itself.
We can now prove the important Theorem 9.3.5.
Proof of Theorem 9.3.5. (1) Let x ∈ X. Then G · x is constructible. Indeed, A := G · x
is the image of the map
G −→ X; g 7→ g · x.
Thus the orbit A = G · x is constructible by the previous Theorem 9.3.11. Write A =
∪si=1 Ui ∩ Fi . We may suppose that U1 ∩ F1 cannot be removed from this
union. Then
F1 \ (F2 ∪· · · ∪Fs ) is an open subset of F1 ∪· · · ∪Fs . Hence U := U1 ∩(F1 \ F2 ∪· · · ∪Fs )
is an open subset of F1 ∪ . . . ∪ Fs . As A is a closed subset of F1 ∪ . . . ∪ Fs , it follows that
U ⊂ A ⊂ A is open in A. Furthermore, by continuity we have that for all g ∈ G the set
g −1 (U ) := {g −1 · x : x ∈ U } is open in A. Applying this to g −1 ∈ G we see that g(U )
is open in A. As A is an orbit, it follows that A = ∪g∈G g(U ). Thus A is open in A, or
what is the same, A is locally closed.
(2) Let g ∈ G be fixed. The morphism g : X −→ X which sends x to g · x is an
isomorphism. Now let x ∈ X. Then G · x contains a smooth point. But if g · x = y, it
follows that y is also a smooth point. Therefore, all points of the orbit are smooth.
(3) Let A = G·x. Consider A, the closure of A. As multiplication with g ∈ G is continuous,
it follows that g −1 (A) is closed. As g −1 (A) = A, it follows by definition of closure that
g −1 (A) ⊃ A. Similarly g(A) ⊃ A. Now
A = g ◦ g −1 (A) ⊃ g(A) ⊃ A.
9.3 Orbits 325
f (x1 + εh1 , . . . ,xn εhn ) − f = ε h1 ∂x1 + . . . + hn ∂xn , as a simple calculation shows. (Use
Exercise 4.3.22.)
326 9 Simple Hypersurface Singularities
Proof. Indeed, applying the previous theorem shows that this codimension is equal to
the vector space dimension of On /mJ(f ). Now we have the exact sequence
The vector space dimension of J(f )/mJ(f ) is by Nakayama’s Lemma the minimal number
of generators of J(f ). As f has an isolated singularity, V (J(f )) = {0}, or better, J(f ) is
an m–primary ideal. As the (Chevalley) dimension of (C n , 0) is n, it follows that number
of generators of J(f ) is at least n. But then the number of generators of J(f ) is exactly
∂f ∂f
n, because ∂x 1
, . . . , ∂x n
are n generators of J(f ). The statement therefore follows from
the exact sequence (9.21).
Theorem 9.3.16. Let f ∈ m2 ⊂ C {x1 , . . . ,xn }, with finite Milnor number. Let g ∈ /
m · J(f ). Then there exist only finitely many t ∈ C such that f + t · g is right equivalent
to f .
Proof. Choose k such that mk+1 ⊂ mJ(f ). Hence g does not lie in the tangent space to
the orbit through f in Jk under the action of Gk . This implies that the line f +tg in the jet
space Jk does not lie entirely in the orbit of f . Therefore, the line f + tg in Jk intersects
the orbit in only finitely many points, as the intersection of the orbit with the line is a
constructible subspace of the line. This is exactly the statement of the theorem.
Proposition 9.3.17. Let f ∈ C {x1 , . . . ,xn } be simple. Then f has an isolated singular-
ity.
Proof. Suppose not. Then the Milnor number µ(f ) is not finite. As mJ(f ) ⊂ J(f ), it
follows that in particular dimC C {x}/mJ(f ) = ∞, that is, for all k > 0, there exists a
gk ∈ mk \mJ(f )+mk+1 , as otherwise mk ⊂ mJ(f ) by Nakayama. Hence for all sufficiently
small t 6= 0 the germ fk := f + tgk is not right equivalent to f . This is because otherwise,
gk had to be in the tangent space of the orbit of f under Gk , but by assumption it is not.
Thus even fk and f are not in the same orbit under Gk . Moreover for k < s also fk is not
right equivalent to fs . Indeed otherwise the classes of fs and fk in the jet space Jk would
be Gk equivalent. But they are not, as in Jk the class of fs is equal to the class of f .
Hence there are infinitely many nonequivalent function germs in any small neighborhood
of f , and therefore f is not simple.
Examples 9.3.18. (1) Let a complex number a with a2 6= 4 be given. Then
fa (x,y) = x4 + y 4 + ax2 y 2
has an isolated singularity, hence a finite Milnor number. One calculates that m5 ⊂
m2 J(fa ). Hence fa is 4–determined. Now J(fa ) has two generators, and mJ(fa ) has 4
generators. Therefore, not all of the degree four monomials in x,y can be in mJ(fa ).
Indeed, mJ(fa ) looks like,
9.3 Orbits 327
Here dots denote nonzero monomials in O/mJ(fa ), that are however equal up to a nonzero
constant modulo mJ(fa ). We see that x2 y 2 ∈ / mJ(fa ). As this holds for all a ∈ C with
a2 6= 4, it follows that all germs fa are not simple. Indeed, suppose the converse. Then
there exists a (small) t0 such that fa+t = x4 + y 4 + (a + t)x2 y 2 are mutually right equiv-
alent for infinitely many t with 0 < t < t0 . This is impossible as for t small x2 y 2 is
not in mJ(fa+t ), so there can only be finitely many t’s in the family fa+t that are right
equivalent.
(2) Consider the family fa := x3 + y 6 + axy 4 , such that a is a complex number with
4a3 + 27 6= 0. The zero set of fa consists of three (different) parabolas. One calcu-
lates that fa then has an isolated singularity. Moreover, one shows that the monomial
xy 4 ∈
/ mJ(fa ), see Exercise 9.3.21 so that all germs fa are not simple as in the first
example.
union of subvarieties of dimension four never can fill up an open subset of a vector space
of dimension 5, it follows that all neighborhoods of j 4 f intersect infinitely many orbits.
In particular f is not simple.
(4) The argument is similar, but now we calculate modulo m4 . We know that mJ(f )/m4
has dimension at most n2 , but m3 /m4 has dimension n+2
3 , which is bigger.
(3) In this case the argument is somewhat more subtle. We calculate modulo m7 , so we
consider the action of G6 on the jet space J6 . We take the weighted order by defining
the weight of x to be equal to two, and the weight of y to be equal to one. So all
terms of elements in (x,y 2 )3 have weighted order at least six. Let f ∈ (x,y 2 )3 . The
four generators x ∂f ∂f ∂f ∂f
∂x ,y ∂x ,x ∂y ,y ∂y of mJ(f ) have weighted order at least six, five, seven
and six respectively. So we have at most three linearly independent elements in mJ(f ) of
weighted order six, namely those given by x ∂f 2 ∂f ∂f
∂x , y ∂x , and y ∂y . There are however four of
them in (x,y 2 )3 . We draw those weighted order six elements for the example f = x3 + y 6 .
The box with the cross means that that monomial is not in mJ(f ).
y
A counting argument shows that (x,y 2 )3 + m7 /m7 is a vector space of dimension sixteen.
It follows that the orbit G6 · f has dimension at most sixteen, as its tangent space has at
most dimension 16. However, G6 · f does not contain the whole of (y,x2 )3 modulo m7 , as
otherwise the tangent space of G6 ·f would contain (y,x2 )3 modulo m7 . So the intersection
of G6 · f with (y,x2 )3 modulo m7 has dimension at most fifteen. It now follows as in the
other cases that f is not simple.
Exercises
9.3.20. Prove Lemma 9.3.8.
9.3.21.
(1) Consider the family fa := x3 +y 6 +axy 4 for a ∈ C . Show that fa has an isolated singularity
exactly when 4a3 + 27 6= 0. Show that xy 4 ∈ / mJ(fa ).
(2) Do the same for the family fa = x3 + y 3 + z 3 + axyz.
9.3.22. Let Ft be a holomorphic family. Suppose that for all sufficiently small nonzero t,s
the germs Ft and Fs are right equivalent. Suppose that F0 is not right equivalent to Ft for t
sufficiently small and nonzero. Prove that µ(Ft ) < µ(F0 ).
(Hint: Use the third part of Theorem 9.3.5 and Remark 9.3.15.)
329
10 Deformations of Singularities
In the final chapter of this book, we study deformations of germs of complex spaces.
The ultimate goal is to prove the existence of a semi-universal deformation of (X, 0), in
case it has an isolated singularity. As the proof of this theorem is quite involved, we will
first treat some special cases. So, in Section 10.1, we will consider isolated hypersurface
singularities (X, 0) given by f = 0. A deformation (XS , 0) −→ (S, 0) over some germ of a
complex space (S, 0) is simply given by FS = 0 for some element FS ∈ C {x}⊗O b S,0 , with
FS (x,0) = f . There is a natural notion of isomorphism of deformations. Moreover, there
ϕ
is the notion of induced family, also called pull-back. If (T, 0) −→ (S, 0) is a holomorphic
map, we get a map ϕ : C {x}⊗O b S,0 −→ C {x}⊗O b T,0 , which we denote by the same name.
Then ϕ(FS ) is an element of C {x}⊗O b T,0 , which gives a deformation (XT , 0) −→ (T, 0) of
(X, 0). A very important special case is that (T, 0) ֒→ (S, 0), in which case one just restricts
the family to the subspace (T, 0). A versal deformation (XS , 0) −→ (S, 0) is a deformation
from which every other deformation (XT , 0) −→ (T, 0) can be induced. If, moreover, the
map on tangent spaces (mT,0 /m2T,0 )∗ −→ (mS,0 /m2S,0)∗ is uniquely determined, it is called
semi-universal. We will show in Remark ?? , that it is too much to ask for uniqueness of
the map (T, 0) −→ (S, 0). The main tool in the proof of the existence of a semi-universal
deformation of an isolated hypersurface singularity is Grauert’s Approximation Theorem.
In Section 10.2 we will consider deformations of germs of complex spaces, defined by
say f1 = · · · = fk = 0. In general, defining a deformation by simply deforming f1 , . . . ,fk
to F1S , . . . ,FkS as in the hypersurface case, does not lead to a good notion. This is because
the dimension might change, or components might disappear. In case (S, 0) is smooth
and one-dimensional, so that OS,0 = C {s}, we will show in Example 10.2.1 that it is a
good idea to take the condition that s is a nonzerodivisor of C {x}⊗O b S,0 /(F1S , . . . ,FkS ).
We will show that this property is equivalent to the fact that one can lift all the relations
(r1 , . . . ,rk ), that is f1 r1 + . . . + fk rk = 0, to a relation between the F1S , . . . ,FkS . This
is called flatness, and can be defined for general OS,0 , for example, C [ε]/(ε2 ), the local
ring of the “double point” (T, 0). As ε is always a zerodivisor, the flatness definition by
the lifting property of the relations makes more sense. We then classify all deformations
of (X, 0) over the double point (T, 0). These deformations are classified by the so-called
1
TX, 0
, which is a finite-dimensional vector space in case (X, 0) has an isolated singularity.
If one has a deformation over a germ of a complex space (S, 0), one can ∗ define the
Kodaira-Spencer map. To every element of the tangent space mS,0 /m2S,0 , one assigns
1
a deformation over (T, 0), hence an element of TX, 0
. We then prove the existence of a
versal deformation, again by applying Grauert’s Approximation Theorem, in case there
exists a deformation over a smooth space (S, 0) such that the Kodaira-Spencer map is
surjective. This case applies, for example, to isolated complete intersection singularities,
see Exercise 10.2.33, and to isolated Cohen-Macaulay codimension two singularities. An
essential ingredient is the Hilbert-Burch Theorem.
In the final section, we will prove the existence of a semi-universal deformation in
1
a more general setting, namely under the assumption that TX, 0
is a finite-dimensional
vector space. The basic idea is to do this order by order. Let (X, 0) be defined by f ∈
330 10 Deformations of Singularities
C {x}k . On first order, one does the following. Take a basis g1 , . . . ,gτ of TX,
1
0
, and take
a new ring C [[s1 , . . . ,sτ ]] with maximal ideal m = (s1 , . . . ,sτ ). We then have a flat
deformation
(10.1) f + s1 g1 + . . . ,sτ gτ
over the space with local ring C [[s1 , . . . ,sτ ]]/m2 . In general, it is not possible to lift this
family to a flat deformation over C [[s1 , . . . ,sτ ]]/m3 . We will show however, that there is
a minimal I with m3 ⊂ I ⊂ m2 such that there exists a deformation over C [[s1 , . . . ,sτ ]]/I
lifting (10.1). In general, suppose we have a deformation of (X, 0) over a space with
local ring O/J. We will show that there exists a minimal I with mJ ⊂ I ⊂ J, such
that there exists an extension of the deformation over the space with local ring O/I.
The existence of this minimal I will be shown by obstruction theory. We will construct
2 2
an element in TX, 0
⊗ I for all I with mJ ⊂ I ⊂ J, TX, 0
a vector space, the so called
obstruction space. It is zero if and only if the family can be lifted over the space with
local ring O/I. Then we will successively extend the family in a maximal way order by
order. This leads to a formal deformation. The fact that it is formally semi-universal
is not very difficult. In fact, by applying the ideas of Section 10.2, one can show that
it is semi-universal, by applying Grauert’s Approximation Theorem, in case the formal
deformation is in fact convergent. The final subtle point is to prove that there exists
a convergent deformation which is formally semi-universal. Also in this proof Grauert’s
Approximation Theorem is used. We moreover prove the following criterion for (formal)
semi-universality. Suppose a formal deformation of (X, 0) over a formal algebra C [[s]]/J
2
is given. Consider the obstruction element in TX, ⊗ J/mJ, which leads to the obstruction
2
∗
0
map TX,0 −→ J/mJ. Then the formal deformation is semi-universal if and only if the
Kodaira-Spencer map is an isomorphism, and the obstruction map is surjective.
Example 10.1.2. Let f,g ∈ C {x1 , . . . ,xn }. Then FS = f + s · g ∈ C {x1 , . . . ,xn ,s} =
b {s} is a deformation of f .
C {x1 , . . . ,xn }⊗C
Definition 10.1.3. Let FS be a deformation of f over S, and T −→ S be an analytic
map between germs of complex spaces. Then we have an induced map OS −→ OT on
the ring level, and hence a map:
is a versal deformation of f .
Before giving the proof of this Theorem, we give some examples.
Example 10.1.8. We take the function f = x3 ∈ C {x}. A basis for the Milnor algebra is
given by {1,x}. Therefore, by the Theorem, a versal deformation is given by F (x,s1 ,s2 ) =
x3 + s1 x + s2 . Look at the deformation G(x,t) = x3 + tx2 of f . Its zero set looks like
Versality says that we can find a map T −→ S, that is, dually a map of rings
or to put it in another way, two power series s1 (t),s2 (t) such that
• Φ = (Φ(x,t),t),
• Φ(0,t) = Id,
• F (x,s1 (t),s2 (t)) = G(Φ(x,t),t).
We rewrite:
1 1 1
x3 + tx2 = (x + t)3 − t2 x − t3 .
3 3 27
With Φ(x,t) = x − 31 t we get
1 2
G(Φ(x,t),t) = x3 − t2 x + t3 .
3 27
We can therefore take
1 2 3
s1 (t) = − t2 ; s2 (t) = t .
3 27
therefore the map T −→ S is given by “the picture”
t
s2
s1
Φ(xi ,ε) = xi − hi · ε.
Proof of Theorem 10.1.7 The proof uses Grauert’s approximation Theorem and
the ideas of the last example. Let GT be a deformation of f over T , where OT =
C {t1 , . . . ,tq }/J for some ideal J. We have to find
• A map S −→ T , that is µ power series s(t) = s1 (t1 , . . . ,tq ), . . . ,sµ (t1 , . . . ,tq ) with
s ≡ 0 modulo t, and
• an automorphism Φ(x,t) with Φ(x,0) = Id,
such that
This we can view as a system of equations, which we have to solve for Φ and s. Here Φ
may depend on x and t, but the µ functions si are only allowed to depend on t1 , . . . ,tq .
We show that this equation has a solution by applying Grauert’s approximation Theorem
8.2.2. To apply this Theorem, we have to show that every solution (Φ,s) of (∗) of order
i in t can be extended to a solution of (∗) of order i + 1. Having this solution (Φ,s) of
order i, we look at the homogeneous part of degree i + 1 of G(Φ(x,t),t) − F (x,s(t)) and
write it as X
tν h ν
|ν|=i+1
with hν ∈ C {x1 , . . . ,xn }. Just as in the second example we can write for all ν;
µ
X n
X ∂f
hν = bjν gj + hνj
j=1 j=1
∂xj
As G ≡ f modulo t, it follows from Taylor expansion that (Φ′ ,s′ ) = (Φ′1 , . . . ,Φ′n ,s′1 , . . . ,s′µ )
is a solution of (∗) of order i + 1. This proves the Theorem.
f1 (x) = . . . = fk (x) = 0,
for fi ∈ C {x1 , . . . ,xn } = C {x}. Naively, one probably would like to say that a deforma-
tion of (X, 0) over a space (S, 0) is given by the zero set of functions,
F1 (x,s) = . . . = Fk (x,s) = 0
with Fi ∈ C {x}⊗Ob S,0 and Fi (x,0) = fi (x) for i = 1, . . . k. However, this kind of defor-
mation is too general, because a “general fiber” of (XS , 0) −→ (S, 0) does not sufficiently
reflect the properties of (X, 0). To see this, look at the following example.
334 10 Deformations of Singularities
Example 10.2.1. Let (XS , 0) −→ (S, 0) be an analytic map. Here (XS , 0) is defined by
F1 = xy − s = 0
F2 = xz − s = 0
F3 = yz − s = 0
Although this is a space defined in 4-space, we would like to imagine the situation by the
following picture:
s=0 s 6= 0 s
The problem here is that s is zerodivisor of C {x,y,z,s}/(F1 ,F2 ,F3 ). The total space
(XS , 0) has four components, of which three are in the fiber s = 0. In fact, a primary
decomposition of (F1 ,F2 ,F3 ) is (cf. 1.4.12):
We therefore have a decomposition (XS , 0) = (X0 , 0) ∪ (X1 , 0), where (X0 , 0) is the union
of three coordinate axes, and (X1 , 0) is the parabola. Take a function f , vanishing on
(X1 , 0), but not on (X0 , 0). Then obviously s · f = 0 as elements of OXS ,0 , but both s
and f are nonzero in OXS ,0 . This expresses the fact that s is a zerodivisor. In fact it can
be proved by using the Active Lemma 4.1.10, see Exercise 10.2.28, that
Lemma 10.2.2. s is a zerodivisor ⇐⇒ there exists a component of (XS , 0) in the fiber
above zero.
The above example hopefully makes clear to the reader that for a ”nice” one parame-
ter family (with parameter s), one should impose the condition that s is a nonzerodivisor.
This we call flatness:
Definition 10.2.3. A C {s}-module M is called flat if and only if s is a nonzerodivisor
of M .
For example a finitely generated C {s}-module M is flat if and only if M is free, see
1.3.9.
It turns out that it is not so easy at all to construct (nontrivial) one parameter flat
deformations of singularities. Therefore, it is a natural idea to construct deformations
by “power-series expansion”. That is, construct deformations over (T, 0), then try to lift
over the germ of the complex space with local ring C [s]/(s3 ), etc. But then one has
the problem of defining flatness of a C [s]/(s2 )-module, as s is always a zerodivisor of
C [s]/(s2 ). So we need to give a different definition of flatness of a C {s}-module, which
gives a good generalization to rings with nilpotent elements. The following lemma is of
importance for solving this problem.
10.2 Deformations of Singularities 335
Lemma 10.2.4. Let (S, 0) = (C , 0), and consider a deformation of (X, 0) as above. Then
the deformation is flat (that is, s is a nonzerodivisor) if and only if for every relation
between the fi :
f1 r1 + . . . + fk rk = 0,
we can find a lift Ri (x,s), that is, Ri (x,0) = ri (x) and
F1 R1 + . . . + Fk Rk = 0.
P
Proof. Suppose that s is a nonzerodivisor, and take a relation fi ri = 0. Take any lift
Ri′ (x,s) of the ri , and look at F1 R1′ + . . . + Fk Rk′ . This might not be zero, but we know
it is if we plug in s = 0. This shows that this expression is divisible by s:
sΦ = F1 R1 + . . . + Fk Rk .
P
Putting Ps = 0 we get a relation fi ri = 0, which by assumption can be lifted to a
relation Fi Ri′ = 0. Then X
sΦ = Fi (Ri − Ri′ ).
As both Ri and Ri′ are lifts of the ri , it follows that Ri − Ri′ is divisible by s. Hence
R −R′
the power-series i s i exists. Because in the power-series ring s is a nonzerodivisor it
follows that
X Ri − R′
i
Φ= Fi ,
s
expressing the fact that Φ = 0 as element of OXS ,0 . This is what we had to show.
This result motivates the following definition.
Definition 10.2.5. Let (X, 0) ⊂ (C n , 0) be a germ of a complex space. Let I (X, 0) =
(f1 , . . . ,fk ). Let (S, 0) be a further germ of a complex space. Let (XS , 0) ⊂ (C n × S, 0)
be defined by F1 , . . . ,Fk , Fi ∈ C {x}⊗O b S,0 . Then (XS , 0) −→ (S, 0) is called a (flat)2
deformation of (X, 0) over (S, 0) if
(1) Fi (x,0) = fi (x) for i = 1, . . . k,
(2) for every relation r = (r1 , . . . ,rk ) ∈ C {x}k between the fi , that is f · r :=
Pk
b k
i=1 fi ri = 0, there exists a lift R = (R1 , . . . ,Rk ) ∈ (C {x}⊗OS,0 ) , that is
Ri (x,0) = ri (x), which is a relation between the Fi . This means that F · R :=
Pk 3
i=1 Fi Ri = 0.
2 Later on we will often write deformation instead of flat deformation.
3 b S,0 /(F1 , . . . ,Fk ) is a flat OS,0 –module in the general
This is, in fact, equivalent to saying that C {x}⊗O
set-up of flatness, see [Matsumura 1979], Theorem 49.
336 10 Deformations of Singularities
A priori the definition depends on the choice of the generators of (XS , 0) (and of
(X, 0)), but it is not so difficult to show that it in fact does not. See Exercise 10.2.29.
Proposition 10.2.6. Let (X, 0) be a germ of a complex space, defined by the ideal I =
(f1 , . . . ,fk ) in C {x}. Let g1 , . . . ,gk be further elements in C {x} =: O. Consider (T, 0),
the double point. Then f1 +εg1 , . . . ,fk +εgk defines a (flat) deformation (XT , 0) −→ (T, 0)
if and only if
f1 7→ g1 , . . . ,fk 7→ gk
gives a well-defined element of HomO (I,OX,0 ) =: NX,0 .4 The module NX,0 is called the
normal module of (X, 0).
Proof. Suppose first that f + εg := (f1 + εg1 , . . . ,fk + εgk ) defines a (flat) deformation
(XT , 0) −→ (T, 0). We have to show that P the map fi −→ gP i induces a well defined map
in HomO (I,OX,0 ). Thus an element h = rP
i fi is sent to ri gi . In order
P to show that
this
P is well-defined, we have to show that if ri f i = 0, it follows that ri gi ∈ I. But
ri fi = 0 means that r = (r1 , . . . ,rk ) is a relation, which by assumption can be lifted
to a relation r + εs between f + εg, hence
Step 2. The fact that equality of the ideals (f1 +εg1 , . . . ,fk +εgk ) and (f1 +εg1′ , . . . ,fk +εgk′ )
implies that g1 , . . . ,gk and g1′ , . . . ,gk′ give the same element in NX,0 = HomO (I,OX ) is
proved similarly.
Example 10.2.8. We consider the germ of the complex space (X, 0) defined by the
2–minors of the matrix5
4 Note that NX, 0 is not an invariant of the isomorphism class of (X, 0), as it might depend on the
embedding.
5 Readers knowing some algebraic geometry will recognize that this singularity is the cone over the
rational normal curve of degree four.
10.2 Deformations of Singularities 337
x0 x1 x2 x3
.
x1 x2 x3 x4
We will denote by fij the minor obtained by taking the i–th and j–th column. So for
example f13 = x0 x3 −x1 x2 . It has been shown in Exercise 7.3.8 that the ideal I defined by
the minors is a prime ideal. Thus (X, 0) is irreducible. We determine the normal module
NX,0 .
Step 1. We have to determine the relations between the fij . We get eight relations in the
following way: take three columns, double one of the rows, and compute the determinant.
As two rows are the same, the determinant is zero, so that we indeed get a relation. We
get relations rijk if we double the first row, and sijk if we double the second row. As an
example, we compute r123 . So we take the first three columns, and put the first row on
top. We get the following matrix
x0 x1 x2
x0 x1 x2 ,
x1 x2 x3
and hence the relation x0 f23 −x1 f13 +x2 f12 = 0. We claim that all relations are generated
by the relations constructed in this way. We will only indicate how to prove this, leaving
the reader to fill in the details. First of all, we have that C {x0 ,x4 } ⊂ OX,0 is a Noether
normalization. Because OX,0 is a free C {x0 ,x4 }–module, see Exercise 7.3.8, it follows
that x0 ,x4 is a regular sequence in OX,0 . Thus all relations between the elements
can be lifted to relations between the fij . The relations between these monomials are
easily determined, and there are indeed eight of them.
Step 2. We can easily generate 8 elements n1 , . . . ,n8 of the normal module NX,0 . These
we get by putting an extra ε at one of the eight spots of the matrix. The fact that the
relations can be lifted, follows because we can lift the relations rijk and sijk by again
taking three columns and double a row! The n1 , . . . ,n4 correspond to the four entries in
the top row, and the n5 , . . . ,n8 correspond to the four entries in the bottom row. To give
an example we determine n1 . We get the following matrix
x0 + ε x1 x2 x3
.
x1 x2 x3 x4
We take the first minor, and get x0 x2 + εx2 − x21 . Thus f12 → x2 etc. It turns out that
the normal module is generated by nine elements, namely by n1 , . . . ,n8 and one further
element n9 . We put these in the following table. The rows give the elements of NX,0 by
giving the value on fij in the columns.
338 10 Deformations of Singularities
is commutative.
Theorem 10.2.10. Let Φ : (XS , 0) −→ (XS′ , 0) be a morphism of deformations of (X, 0)
over (S, 0). Then Φ is an isomorphism.
Proof. This is a consequence of the Inverse Function Theorem. A morphism of deforma-
tions of (X, 0) induces the identity on (X, 0), and preserve fibers. Therefore, Φ∗ is induced
by a morphism which, by abuse of notation, we also call Φ∗
b S,0
C {x1 , . . . ,xn }⊗O b S,0 ,
−→ C {x1 , . . . ,xn }⊗O
xi 7→ xi + hi , i = 1, . . . n,
s ∈ OS,0 7→ s.
So we can define:
1
Definition 10.2.12. TX, 0
:= Coker(α).6
Theorem 10.2.13. Let (X, 0) ⊂ (C n , 0) be a germ of a complex space. Isomorphism
classes of deformations of (X, 0) over (T, 0) are in one-one correspondence with elements
1
of TX, 0
.
6 1
It is, in fact true that TX, is an invariant of the isomorphism class of (X, 0), see Exercise 10.2.26.
0
340 10 Deformations of Singularities
This holds because ε2 = 0. Hence we see that the map I −→ OX,0 given by fi 7→ gi − gi′
is in the image of Θn . Similarly, one shows that elements of Θn induce isomorphisms.
This is what we needed to show.
Examples 10.2.14.
1
(1) Let (X, 0) be a germ of an isolated hypersurface, defined by f ∈ C {x}. Then TX, 0
=
C {x}/(f,J(f )). This follows from Example ??.
(2) We take the singularity of Example 10.2.8. Recall that the normal module elements
are determined by their values on f12 , f23 and f34 . A direct calculation shows that
∂ ∂ ∂ ∂ ∂ 1
∂x0 = n1 , ∂x1 = n2 + n5 , ∂x2 = n3 + n6 , ∂x3 = n7 + n4 , ∂x4 = n8 . Thus TX,0 is generated
1
by four element n2 , n3 , n4 , and n9 . We claim that (x0 ,x1 ,x2 ,x3 ,x4 )TX, 0
= 0. Thus we
have to do twenty checks!! The reader hopefully forgives us for just doing one of those.
1
We will show that the class of x0 n4 is the zero element in TX, 0
.We look at the values on
f12 , f13 and f34 . They are 0, 0 and −x0 x3 which modulo I is equal to 0, 0 and −x2 x1 .
∂ 1
Thus it is equal to −x1 n8 = −x1 ∂x 4
, hence the trivial element in TX, 0
. Altogether we see
1
that dimC (TX,0 ) = 4, with a basis given by the classes of the normal module elements
n2 ,n3 ,n4 and n9 .
Theorem 10.2.15. Let (X, 0) be a germ of a complex space with an isolated singularity.
1
Then TX, 0
is a finite-dimensional C –vector space.
Proof. Let X ⊂ U be a small representative of (X, 0) such that 0 is the only singular
point of X. Consider the sheaf TX1 := HomOU (I ,OX )/ΘU . Here ΘU is the sheaf of
derivations on U , I is the sheaf of ideals defining X. The map ΘU −→ HomOU (I ,OX )
1
is defined as in the definition of TX, 0
. By the basic coherence theorems TX1 is a coherent
1
sheaf on X, whose stalks in p ∈ X is equal to TX,p . As X has only one singularity, these
are zero for p 6= 0, by Exercise 10.2.27. Hence TX1 is a coherent sheaf concentrated in the
point 0, and therefore the set of global sections Γ(TX1 ,U ) = TX, 1
0
is a finite-dimensional
C –vector space by Theorem 6.2.6.
The converse of this theorem does not hold, see Exercise 10.2.30. We now come to
the definition of induced family and of versal deformations.
Definition 10.2.16. Let (X, 0) be a germ of a complex space.
(1) Suppose (XS , 0) −→ (S, 0) is a deformation of (X, 0), defined by F1 , . . . ,Fk ∈
b S,0 . Let p : (T, 0) −→ (S, 0) be an analytic map. We define the pull-back,
C {x}⊗O
or the induced deformation
(X ×S T, 0) −→ (T, 0)
10.2 Deformations of Singularities 341
as follows. The map p induces a map p∗ : OS,0 −→ OT,0 , which in turns induces a
map p∗ : C {x}⊗Ob S,0 −→ C {x}⊗O b T,0 . Then (XT , 0) is given by p∗ (F1 ), . . . ,p∗ (Fk ).
It is proved in Exercise 10.2.32 that the map (X ×S T, 0) −→ (T, 0) is indeed flat.
(2) A deformation (XS , 0) −→ (S, 0) is called versal if for every deformation (XT , 0) −→
(T, 0) there exists a map p : (T, 0) −→ (S, 0) such that the deformations (XT , 0) −→
(T, 0) and (X ×S T, 0) −→ (T, 0) are isomorphic. The versal deformation is called
semi-universal if moreover the map on tangent spaces is uniquely determined.
Definition 10.2.17. Let (XS , 0) −→ (S, 0) be a deformation of (X, 0). Suppose (X, 0)
k
is defined by f ∈ C {x}k , and that the deformation is given by F ∈ C {x} b
P⊗OS,0 .
Suppose mS,0 has a minimal system of generators s1 , . . . ,sp . Write F = f + si gi . We
1
denote by [gi ] the class of gi mod mS,0 in TX, 0
. Then the map
∗
TS,0 = mS,0 /m2S,0 1
−→ TX, 0
X X
ai s∗i 7→ ai [gi ]
i i
is called the Kodaira-Spencer map. It is easily shown that the definition of the Kodaira-
Spencer map is independent of the choices made.
The main goal in this chapter is to prove that for (X, 0) with an isolated singularity,
a semi-universal deformation exists. In general, the proof of this theorem needs many
additional preparations, but we proved it already in the first section for hypersurface sin-
gularities. In the remainder of this chapter, we will prove a special case of this theorem,
which includes the case of hypersurface singularities and complete intersection singular-
ities (see Exercise 10.2.33). Moreover we treat the case of Cohen-Macaulay singularities
of codimension two by applying the Hilbert-Burch Theorem 6.5.26.
Theorem 10.2.18. Let (X, 0) be a germ of a complex space, defined by f ∈ C {x}k .
1
Suppose dimC (TX, 0
) < ∞. Let (S, 0) = (C τ , 0). Suppose that
Let r be a relation, that is, r · f = 0. We will see that r · (gi − gi′ ) ∈ (f1 , . . . ,fk ), that is,
gi − gi′ defines an element of NX,0 . Consider lifts RB and RB ′
of r, which are relations
′
between the FB and the FB respectively. By Lemma 10.2.22, we may assume that the
′
restrictions of RB and RB to C {x}k ⊗ A are equal. Therefore we can write:
X X
′
RB = RA + ci · αi ; RB = RA + c′i · αi .
10.2 Deformations of Singularities 343
for complex numbers ci and c′i . As B −→ A is a small extension, it follows that mαi = 0
for all i. Hence
X X
0 = RB · FB = RA · FA + (r · gi )αi + (ci · f )αi .
Similarly X X
′
0 = RB · FB′ = RA · FA + (r · gi′ )αi + (c′i · f )αi .
By subtracting and looking at the coefficients of αi we see that for all i = 1, . . . ,p:
Here (t) is the maximal ideal of C {t}. We will change the order e + 1 terms in t of s,φ
and Λ in such a way that (10.3) holds modulo (J + (t)e+2 ).
Consider the small surjection
Take a basis α1 , . . . ,αp of Ker(σ). We can factorize σ into p simple surjections σi , for
i = 1, . . . ,p. Thus, using induction, we may suppose that σ : B −→ A is a small surjection,
the kernel of σ is of dimension one, and Ker(σ) is generated by a monomial α of degree
e + 1.
Step 1. Change of s(t): We apply 10.2.23 to σ : B −→ A, FB = FS (x,s(t)) and FB′ =
FT (φ,t) ◦ Λ. The s(t), φ(x,t) and Λ(x,t) are given by induction. It follows that
for an element n ∈ C {x}k such that f 7→ n is an element of the normal module NX,0 .
By assumption the Kodaira-Spencer map is surjective, that is, the elements ai : f −→ gi
1 1
mod (s) for i = 1, . . . ,τ project to generators of TX, 0
. Thus as elements of TX, 0
, we can
write
n = b 1 a1 + . . . + b τ aτ
for suitable bi ∈ C . Now we can redefine the s(t):
Step 2. Change of φ(x,t). After the change of s(t) we have just done, we have reached
1
the stage that the n = (n1 , . . . ,nk ) ∈ NX,0 in (10.4) defines a trivial element in TX, 0
.
−1
Then, obviously, nΛ|t=0 is a trivial element too. Therefore, there exist c1 , . . . ,cn ∈ C {x}
such that θ(f ) = nΛ−1 ∂ ∂
|t=0 , with θ := c1 ∂x1 + . . . + cn ∂xn . We redefine the automorphism
(φ1 , . . . ,φn ,t) in the following way.
φ1 := φ1 − c1 α, . . . , φn := φn − cn α.
It follows from Taylor’s expansion that after this redefinition, the n defines the trivial
element of NX,0 , that is, ni is the zero element in OX,0 for i = 1, . . . ,k.
Step 3. Change of Λ. After these redefinitions the elements n in (10.4) have entries ni
P in OX,0 = C {x}/(f1 , . . . ,fk ). Thus we can find elements
which all give the zero element
cji ∈ C {x} such that ni = j cji fj . Redefine the matrix Λ by putting Λ := Λ − (cji )α.
After this redefinition, we reached the stage that n = 0 ∈ C {x}k . This is what we had
to show.
To finish this section, we will prove the existence of a versal deformation for (X, 0) ⊂
(C n , 0) that have an isolated singularity, are Cohen-Macaulay, and have dimension n −
2. The Auslander-Buchsbaum formula 6.5.20 implies that the projective dimension of
OX,0 is equal to two. The main ingredient is now the Hilbert-Burch Theorem 6.5.26
and Theorem 10.2.18. Indeed, by the Hilbert-Burch Theorem, one has the following free
R := On – resolution of OX,0 = R/I, where I = I (X, 0):
2 α 1 α
0 −→ F2 −→ F1 −→ R −→ R/I −→ 0.
Here α2 is an (t − 1) × t matrix for some t. Generators f1 , . . . ,ft for the ideal I are
obtained by taking the (t−1)–minors of the matrix α2 . Equivalently, one has the following
resolution of the ideal I:
α2 α1
0 −→ F2 −→ F1 −→ I −→ 0.
Proposition 10.2.24. With the above notation, we get deformations of (X, 0) over any
germ of a complex space (S, 0) in the following way. Let E be a matrix with entries in
b S,0 . Consider the matrix α2 + E, and consider It−1 (α2 + E). This is an ideal in
On ⊗m
b S,0 , and defines a space (XS , 0) such that (XS , 0) −→ (S, 0) is a flat deformation
On ⊗O
of (X, 0). Moreover, all deformations of (X, 0) are obtained this way.
Proof. Step 1. Consider α1 + E ′ , the 1 × t matrix where the i–th entry equals (−1)i times
the minor obtained from α2 + E by leaving out the i–th row. Then the sequence
α +E α +E ′
b S,0 )t−1 −−2−−→ (On ⊗O
(On ⊗O b S,0 )t −−1−−→ It−1 (α2 + E) −→ 0
10.2 Deformations of Singularities 345
is certainly a complex, that is, (α1 + E ′ ) ◦ (α2 + E) = 0. This implies that we can lift
all the relations between the minors of α2 , which are generators of the ideal I. So we get
indeed a flat deformation.
Step 2. Conversely, suppose we have a deformation (XS , 0) of (X, 0) over (S, 0). The
flatness says that we can lift the relations. Hence we get a complex
α′ α′
(10.5) b S,0 )t−1 −→
(On ⊗O 2
b S,0 )t −→
(On ⊗O 1
b S, 0 −→ OXS ,0 −→ 0.
On ⊗O
where α′1 = α1 modulo mS,0 and α′2 = α2 modulo mS,0 . We leave it as Exercise 10.2.37
to prove that the sequence (10.5) is exact, that is, the kernel of α′1 is generated by the
lifts of the set of t − 1 generators of the kernel of α. We claim that the map α′2 is in
fact injective. To show this suppose α′2 (a1 , . . . ,at−1 ) = 0. Calculating modulo mS,0 , we
see that (a1 , . . . at−1 ) is zero modulo mS,0 . This is because
P α2 is injective. Take a basis
b1 , . . . ,bp of mS,0 /m2S,0 , and write (a1 , . . . ,at−1 ) = (ai1 , . . . ,ait−1 )bi modulo m2S,0 for
aij ∈ On . From α′2 (a1 , . . . ,at−1 ) = 0 it follows by calculating modulo m2S,0 that aij = 0.
Thus (a1 , . . . at−1 ) is zero modulo m2S,0 . Going on like this, we see that (a1 , . . . at−1 ) is
zero modulo mkS,0 for all k, hence 0 by Krull’s Intersection Theorem. So we get an exact
sequence
α′ α′
b S,0 )t−1 −→
0 −→ (On ⊗O 2
b S,0 )t −→
(On ⊗O 1
b S, 0 −→ OXS ,0 −→ 0.
On ⊗O
1
10.2.27. Let (X, 0) be a germ of a smooth space. Show that TX, 0 = 0.
10.2.29. Prove that the definition of flat deformation is independent of the choice of generators
of (XS , 0).
346 10 Deformations of Singularities
10.2.30.
(1) Consider the germ of the complex space (X, 0) defined by the ideal I = (xz,xw,yz,yw) ⊂
C {x,y,z,w}. Thus V (I) is the union of two planes in C 4 intersecting in a point. Show
1
that TX, 0 = 0. Singularities with this property are called rigid.
1
(2) Give an example of a nonisolated singularity (X, 0) with finite-dimensional TX, 0.
` ´
10.2.31. Consider the curve singularity V (xy,xz,yz), 0 . Compute a semi-universal deforma-
tion of this curve singularity.
10.2.32. Prove that for a (flat) deformation (XS , 0) −→ (S, 0), and an analytic map p : (T, 0) −→
(S, 0) the map (X ×S T, 0) −→ (T, 0) is flat.
10.2.33. Let (X, 0) be a complete intersection with isolated singularity. Describe a versal de-
formation of (X, 0).
10.2.34. Finish the proof of Lemma 10.2.23.
10.2.35. Consider t(t − 1) variables xij for 1 ≤ i ≤ t − 1 and 1 ≤ j ≤ t. Consider the generic
determinantal variety (X, 0) ⊂ (C t(t−1) , 0) defined by the ((t − 1)–minors of the matrix (xij ).
Prove that (X, 0) is rigid.
10.2.36. Use a computer algebra system (for example SINGULAR) to compute Example 10.2.8
(Hint: Try in SINGULAR the following:
ring r =0,(x(0..4)),(c,dp);
matrix m[2][4] = x(0),x(1),x(2),x(3),x(1),x(2),x(3),x(4);
ideal i = minor(m,2);
module s = syz(i);
qring q = std(i);
module s = imap(r,s);
syz(transpose(s));)
10.2.37. Prove that the sequence (10.5) is exact.
(Hint: Use the same method as in the proof of the injectivity of α′2 .)
In this section we will finally give the proof that for an isolated singularity a versal
deformation exists. We need many preparations.
Definition 10.3.1. Let (X, 0) be a germ of a complex space, A a local Artinian C –
algebra with maximal ideal mA . Suppose me+1
A = 0. Then a deformation (XA , 0) −→ (A, 0)
is called a deformation of order e.
Definition 10.3.2. Let (X, 0) be a germ of a complex space with local ring OX,0 =
C {x}/(f1 , . . . ,fk ). Let A = C [[s]]/J be a local formal C –algebra in the variables s1 , . . . ,sp ,
with maximal ideal mA . By abuse of notation, we denote by J also the ideal J · C {x}[[s]].
(1) A formal deformation (XA , 0) of (X, 0) over A is given by a quotient of C {x}[[s]]/J
by an ideal (F1 , . . . Fk ) such that
(a) Fi (x,0) = fi for i = 1, . . . ,k.
(b) For every relation r ∈ C {x}k , that is r · f = 0, there exists R ∈ C {x}[[s]]k
such that R · F = 0 as elements of C {x}[[s]]/J and R(x,0) = r.
10.3 Existence of a Semi-Universal Deformation 347
(2) For all e ∈ N, consider the local Artinian C –algebra Ae defined by Ae = C [[s]]/(J +
me+1
A ). Note that a formal deformation induces a deformation (XAe , 0) −→ (Ae , 0)
of order e for all e.
(3) A formal deformation (XA , 0) is called versal if for all local Artinian C –algebras
B, and all deformations (XB , 0) −→ (B, 0) there exists a homomorphism Ae −→ B
for some e ≫ 0 and an isomorphism of deformations (XAe ×Ae B, 0) ∼ = (XB , 0).
It is called semi-universal, if the induced homomorphism on the tangent spaces
(mA /m2A )∗ −→ (mB /m2B )∗ is uniquely determined.
Remark 10.3.3. Let (XA , 0) and (XB , 0) be two formal semi-universal deformations
of a germ of a complex space (X, 0). Then the formal analytic algebras A and B are
isomorphic.
To see this, note that, because (XA , 0) is semi-universal, we have a unique map α :
A1 −→ B1 , inducing (XB1 , 0) −→ (B1 , 0). Similarly, we get a map β : B1 −→ A1 . By semi-
universality, the composition β ◦ α is the identity on A1 , and α ◦ β is the identity on B1 .
Hence A1 and B1 are isomorphic. Now let e ∈ N be given and consider (XBe , 0) −→ (Be , 0).
By semi-universality, for some q there exists a map Aq −→ Be , so that (XBe , 0) −→ (Be , 0)
is induced by (XAq , 0) −→ (Aq , 0). As me+1
Be = 0 by definition, we can take q = e. Thus
we get a map φ : Ae −→ Be . Conversely, we get a map ψ : Be −→ Ae . The composition
ψ ◦ φ : Ae −→ Ae is such that the pull-back of the deformation (XAe , 0) −→ (Ae , 0)
is isomorphic to (XAe , 0) −→ (Ae , 0). By semi-universality, this map is uniquely defined
up to first order, and is the identity modulo the square of the maximal ideal. By the
Inverse Function Theorem8 , the map ψ ◦ φ is an isomorphism, and similarly φ ◦ ψ is an
isomorphism. Note that in particular dimC (Ae ) = dimC (Be ) for all e ∈ N.
Remark 10.3.4. Suppose that (XA , 0) is a formal semi-universal deformation of (X, 0).
Then the Kodaira-Spencer map (mA /m2A )∗ −→ TX, 1
0
is an isomorphism.
1
Namely, to construct the inverse mapping, consider an element of TX, 0
, that is,
2
a deformation over the space defined by the local ring C [[ε]]/ε . Because the formal
deformation is semi-universal, we get a unique map A1 −→ C [[ε]]/ε2 . By looking at the
coefficients of ε of the values of all elements in mA /m2A we get an element of (mA /mA )∗ .
One checks that this gives the inverse of the Kodaira-Spencer map.
Thus, as already said, we first want to show the existence of a formal semi-universal
deformation. The idea is to do this order by order, that is, define deformations (XAe , 0) →
(Ae , 0) of order e for all e ∈ N. For e = 1, we define a deformation over (A1 , 0), with
A1 = C [[s1 , . . . ,sτ ]]/m2 , and τ is the dimension of TX,
1
0
. Given Ae , it might be quite
obvious that we would like to have a deformation of order e + 1 over an as big as possible
space Ae+1 . In order to see that such a space exists, we will develop obstruction theory.
2
To formulate this, we need to define a further OX,0 –module, called TX, 0
.
Definition 10.3.5. Let a singularity (X, 0), with local ring OX,0 = On /I be given.
For simplicity, we write On = O. Take generators f1 , . . . ,fk of I. We consider the map
F := Ok −→ O, which sends the ei = (0, . . . ,0,1,0, . . . ,0) to fi . We let R be the kernel
of this map. Thus, we have an exact sequence
0 −→ R −→ F −→ O −→ OX,0 −→ 0.
8 This theorem holds in the formal case too, just ignore the convergence.
348 10 Deformations of Singularities
0 −→ J −→ B −→ A −→ 0.
This obstruction element ob(ξ)B→A is zero, if and only if there exists a flat defor-
mation of (XB , 0) −→ (B, 0) extending the given deformation ξ,
(2) Suppose we have a commutative diagram
0 // J // B // A // 0
0 // J ′ // B ′ // A′ // 0
such that the rows are small extensions. Let ξ ′ be the deformation over A′ induced
2
by ξ. Let β : TX, 0
2
⊗ J −→ TX, 0
⊗ J ′ be the induced map. Then
ob(ξ ′ )B ′ →A′ = β ob(ξ)B→A .
10.3 Existence of a Semi-Universal Deformation 349
Proof. We give the proof for local Artinian C –algebras, the remaining cases we leave
as Exercise 10.3.22. We first construct the obstruction element ob(ξ)B→A . Suppose the
k
deformation ξ, that is (XA , 0) −→ (A, 0), is given by the elements FA ∈ C {x} ⊗ A . Let
r ∈ R be a relation. By flatness we get a relation rA between the FA . Now take arbitrary
lifts FB ∈ C {x} ⊗ B of FA and a lift rB of rA . Then we consider
rB · FB ∈ OX,0 ⊗ B.
Step 5. On the other hand, suppose that ob(ξ)B→A = 0. Take an arbitrary lift FB′ =
′ ′
(FB1 , . . . ,FBk ) of FA . Because ob(ξ)B→A = 0, the element in HomO (R/R0 ,OX,0 ) ⊗ J
given by
r 7→ rB · FB′ , rB any lift of r,
is, in fact, in HomO (F ,OX,0 ) ⊗ J. Hence, there exist h1 , . . . ,hk ∈ C {x} ⊗ J such that
the map is given by
k
X
r = (r1 , . . . ,rk ) 7→ ri hi .
i=1
Now we define
′ ′
FB = (FB1 , . . . ,FBk ) := (FB1 , . . . ,FBk ) − (h1 , . . . ,hk ).
r 7→ rB · FB , rB any lift of r,
is the zero map. We claim that the FB defines a deformation which extends FA . To show
′ ′
this, for every relation r ∈ R a lift rB such that rB · FB = 0 has to be found. Now by
construction rB · FB = 0 as element of OX,0 ⊗ J. Therefore, there exist a1 , . . . ,ak in J
with rB · FB = a1 f1 + . . . + ak fk . Put a = (a1 , . . . ,ak ). As mB J = 0, and FB ≡ f modulo
′ ′
mB , it follows that a · FB = a · f . Now define rB := rB − a. Then rB is a lift of r with
′
rB · FB = 0. As we can do this for all r, it follows that FB gives a (flat) deformation of
(X, 0).
Step 6. The second statement of the theorem is immediate from the construction of the
obstruction element.
Corollary 10.3.9. Let (X, 0) be a germ of a complex space, and A = O/J a local Ar-
tinian (resp. formal, resp. analytic) C –algebra, and A′ := O/mJ. Let ξ be a deformation
(XA , 0) −→ (A, 0), and consider the small extension
Let I mod mJ be the image of the map ob(ξ)A′ →A . Then I is the unique minimal ideal
(with respect to inclusion) with the properties:
(1) mJ ⊂ I ⊂ J
(2) Let B = O/I. Then there exists a deformation (XB , 0) −→ (B, 0) of (X, 0) inducing
the deformation ξ.
The small surjection B −→ A we call a maximal extension with respect to the deformation
(XA , 0) −→ (A, 0).
10.3 Existence of a Semi-Universal Deformation 351
ob(ξ)A′ −→A = α1 ⊗ h1 + . . . + αp ⊗ hp ,
2
for some linear independent elements α1 , . . . ,αp of TX, 0
and h1 , . . . hp of J. Let I =
(h1 , . . . ,hp ) + mJ. To show that a deformation (XB , 0) −→ (B, 0) exists, we consider the
local Artinian C –algebra O/I. Then ob(ξ)O/I→A is equal to α1 ⊗ h1 + . . . + αp ⊗ hp ,
2
but this time considered as element of TX, 0
⊗ (J/I), thus ob(ξ)O/I→A is zero. (See the
second part of Theorem 10.3.8.) Hence we forced the obstruction element to be zero, and
therefore there exists an extension of ξ over the Artinian C –algebra O/I. To show that I
is minimal and unique with respect to the two properties, let K be another ideal satisfying
mJ ⊂ K ⊂ J, such that there exists an extension of ξ over O/K. The obstruction element
is α1 ⊗ h1 + . . . + αp ⊗ hp , and therefore is zero considered as element of J/K. Hence
I = (h1 , . . . ,hp ) + mJ ⊂ K, which is what we had to prove.
Lemma 10.3.10. Let A = C [[s]]/J be a formal algebra, (XA , 0) be a formal deformation
of (X, 0) over A, and (XAe , 0) be the induced deformation of order e for all e. Consider
the small surjection
Assume that
(1) C [[s]]/J + me −→ C [[s]]/J + me−1 is a maximal extension for the deformation
(XAe−1 , 0) −→ (Ae−1 , 0),
J+me+1 J+me
(2) dimC mJ+m e+1 = dim C mJ+me .
1
The modules NX,0 and TX, 0
have been described in 10.2.8 and 10.2.14. Thus we have the
following deformation ξ over C [[s1 ,s2 ,s3 ,t]]/m2 =: A, with m = (s1 ,s2 ,s3 ,t):
F12 = f12 − s1 x1 + tx0
F23 = f23 + s1 x3 − s2 x2 − tx2
F34 = f34 + s2 x4 − s3 x3 + tx4
F13 = f13 − s2 x1
F24 = f24 + s1 x4 − s3 x2
F14 = f14 − s3 x1 + tx2
The fij are the 2–minors of the above matrix as in Example 10.2.8. The Fij give a (flat)
deformation over C [[s1 ,s2 ,s3 ,t]]/(s1 ,s2 ,s3 ,t)2 , so we must be able to lift the relations.
Now we give the lift Rijk and Sijk of the relations rijk and sijk .
R123 = x0 F23 − (x1 + s1 )F13 + (x2 + s2 )F12
R124 = x0 F24 − (x1 + s1 )F14 + (x3 + s3 )F12 − tF13
R134 = x0 F34 − (x2 + s2 )F14 + (x3 + s3 )F13 − tF14 − tF23
R234 = (x1 + s1 )F34 − (x2 + s2 + t)F24 + (x3 + s3 )F23
S123 = x1 F23 − (x2 + t)F13 + x3 F12
S234 = x2 F34 − x3 F24 + x4 F23
S124 = x1 F24 − (x2 + t)F14 + x4 F12 − tF23
S134 = x1 F34 − x3 F14 + x4 F13 − tF24
These Rijk and Sijk are all zero modulo (s1 ,s2 ,s3 ,t)2 expressing the fact that we have a
(flat) deformation. We obtain the obstruction element by looking at the quadratic terms
in s and t, which one gets by simply expanding the terms:
r123 →
7 R123 = s2 tx0
r124 →7 R124 = s3 tx0 − s1 tx2 + s2 tx1
r134 →
7 R134 = s3 tx1 − s1 tx3
r234 →7 R234 = 0
s123 7→ S123 = s2 tx1
s124 7→ S124 = s3 tx1 − s1 tx3 + s2 tx2
s134 7→ S134 = −s1 tx4 + s3 tx2
s234 7→ S234 = 0
This map of R −→ OX,0 ⊗ J gives the element ob(ξ)A′ →A in TX, 2
0
⊗ J, where A′ =
3 2 3
C [[s1 ,s2 ,s3 ,t]]/m and J = m /m . Now, according ∗ to Corollary 10.3.9, we need a basis
2 2
of TX, 0
to describe the map ob(ξ) ′
A →A : T X,0 −→ m2 /m3 , and compute I, the image
of this map. We will see that we already computed part of a basis, which is enough for
our purpose. We have in particular that the elements α1 , α2 and α3 given in the first
three columns of the following table, (obtained by looking at the coefficients of s1 t, s2 t
and s3 t) give three elements in HomO (R/R0 ,OX,0 ). The last six columns correspond to
the images of the relations with respect to the canonical basis of HomO (F ,OX,0 ). We will
see that α1 , α2 and α3 are linearly independent modulo HomO (F ,OX,0 ), and therefore
2
induce linear independent elements of TX, 0
.
10.3 Existence of a Semi-Universal Deformation 353
r123 7→ 0 x0 0 x2 x0 0 −x1 0 0
r124 7 → −x2 x1 x0 x3 0 0 0 x0 −x1
r134 7→ −x3 0 x1 0 0 x0 x3 0 −x2
r234 7 → 0 0 0 0 x3 x1 0 −x2 0
(10.7)
s123 7→ 0 x1 0 x3 x1 0 −x2 0 0
s124 7 → −x3 x2 x1 x4 0 0 0 x1 −x2
s134 7 → −x4 0 x2 0 0 x1 x4 0 −x3
s234 7 → 0 0 0 0 x4 x2 0 −x3 0
2
To see whether these elements are linearly independent in TX, 0
we have to look at the
2
“trivial” elements in TX,0 , that is, elements of Hom(F ,OX,0 ) which are given by the final
six columns of the above table (10.7). These six columns are the duals of f12 , f23 , f34 ,
f13 , f24 , and f14 respectively. Thus, for example, the first row and the final six columns
says that r123 is the relation x2 f12 +x0 f23 +0f34 −x1 f13 +0f24 +0f14 = 0. An elementary
but boring check now shows that α1 , α2 and α3 represent linearly independent elements
2 2
in TX, 0
. They can therefore be considered as part of a basis of TX, 0
and ob(ξ)A′ →A =
α1 ⊗ s1 t + α2 ⊗ s2 t + α3 ⊗ s3 t. Hence the minimal ideal I of the corollary is given by
the ideal (s1 t,s2 t,s3 t) + (s1 ,s2 ,s3 ,t)3 . Note, however, that the family considered above, in
fact, defines a deformation over C {s1 ,s2 ,s3 ,t}/(s1 t,s2 t,s3 t), as all the Rijk · F and Sijk · F
vanish modulo (s1 t,s2 t,s3 t) and not only modulo (s1 t,s2 t,s3 t) + (s1 ,s2 ,s3 ,t)3 .
Theorem 10.3.12. Let A = C [[s]]/J be a formal local C –algebra, and (X, 0) be a
germ of a complex space. Let (XA , 0) be a formal deformation of (X, 0). For all k put
Ak := A/mk+1 and Jk = J + mk+1 . Suppose that
(1) the Kodaira-Spencer map of (XA , 0) is surjective (resp. an isomorphism),
(2) for all e ≥ 2, the small extension Ae+1 −→ Ae is a maximal extension with respect
to the deformation (XAe , 0) −→ (Ae , 0).
Then (XA , 0) is a formal versal (resp. semi-universal) deformation.
Example 10.3.13. The family written down in the previous example is formally versal.
It is a deformation over A = C [[s1 ,s2 ,s3 ,t]]/(s1 t,s2 t,s3 t). Details are left as an exercise.
The basic idea of the proof is as the proof of Theorem 10.2.18, but there is a subtle
difficulty popping up, which is taken care of by Proposition 10.3.15. In its proof we need
the following lemma.
Lemma 10.3.14. Suppose that ϕ : B −→ A is a simple surjection,and let ψ : C −→ B
be a map of local Artinian C –algebras such that
(1) ϕ ◦ ψ : C −→ A is surjective,
(2) ϕ does not have a section.
Then ψ is surjective.
Proof. Suppose the converse. Let J = Ker(ψ). Then we have a injective map ψ : C/J −→
B. Moreover ϕ ◦ ψ : C/J −→ A is surjective. As ψ : C/J −→ B is not surjective, and
dimC (B) = dimC (A) + 1, it follows that dimC (C/J) = dimC (A). In Exercise 10.3.23
you will prove that a surjective algebra map of Artinian C –algebras of the same (vector
space) dimension is an isomorphism. Thus ϕ ◦ ψ is an isomorphism. Denote the inverse
by β : A −→ C/J. Then the map ψ ◦ β is a section of ϕ. This is a contradiction.
354 10 Deformations of Singularities
Proposition 10.3.15. Let local Artinian C –algebras A = C [[s1 , . . . ,sp ]]/I, and A′ =
C [[s1 , . . . ,sp ]]/J be given, for some ideals I, J with mJ ⊂ I ⊂ J. Let two further Artinian
C –algebras B and B ′ be given. Let ψ : A −→ A′ be the canonical surjection. Let ξ :
(XA′ , 0) −→ (A′ , 0) be a deformation of (X, 0). Suppose that
(1) the small extension A −→ A′ is a maximal extension with respect to the deformation
ξ : (XA′ , 0) −→ (A′ , 0);
(2) we have a simple surjection π : B −→ B ′ ;
(3) we have a map φ′ : A′ −→ B ′ and a deformation (XB , 0) −→ (B, 0) inducing
Cartesian diagrams
(XB′ , 0) −→ (XA′ , 0) (XB′ , 0) −→ (XB , 0)
↓ ↓ ↓ ↓
φ′
(B′ , 0) −→ (A′ , 0) (B′ , 0) −→ (B, 0)
A′ ×B ′ B ∼
= O/K ։ O/J = A′ .
(XD , 0) −→ (D, 0)
inducing ξ. Suppose ξ is given by functions FA′ ∈ C {x}⊗Ab ′ k . Moreover, by assumption
we have deformations (XB , 0) given by say FB ∈ C {x}⊗B b k , and (XB′ , 0) given by
say FB ′ ∈ C {x}⊗Bb ′ k . As the deformation over (A′ , 0) and (B, 0) induces the same
deformation over B ′ , it can be easily seen that we may assume that
φ′ (FA′ ) = FB ′ = π(FB ).
Thus the pair (FA′ ,FB ) defines an element in C {x}⊗D b k . To show that it defines a (flat)
deformation, we have to lift relations. So let r ∈ C {x}k with r · f = 0 be an arbitrary
relation. By flatness we can find RA′ ∈ C {x}k ⊗ A′ with RA′ · FA′ = 0. Let RB ′ be the
image of RA′ under the natural map C {x}k ⊗ A′ −→ C {x}k ⊗ B ′ . By Lemma 10.2.22 we
can find an RB ∈ C {x}k ⊗ B lifting RB ′ with RB · FB = 0. The pair (RA′ ,RB ) therefore
is an element of C {x}k ⊗Db = C {x}k ⊗(A b ′ ×B ′ B), and (RA′ ,RB ) · (FA′ ,FB ) = 0. Hence
the relation r lifts. This finishes the proof of the proposition.
Proof of Theorem 10.3.12. Consider a local Artinian C –algebra B, and suppose that
we have a deformation (XB , 0) −→ (B, 0) of (X, 0). Let Ae := A/me+1 = C [[s]]/(J +
me+1 ). We have an induced deformation (XAe , 0) −→ (Ae , 0). Consider the following
small surjection
By assumption, for all e the ideal J mod me+2 is minimal among those ideals for which
there exists an extension of (XAe , 0) −→ (Ae , 0) over some small extension of Ae .
356 10 Deformations of Singularities
To show the versality, we have to show that there exists an e ∈ N and a mor-
phism of C –algebras Ae −→ B such that the deformation (XB , 0) −→ (B, 0) is iso-
morphic to (XAe ×Ae B, 0) −→ (B, 0). This we will prove by induction on the vector
space dimension dimC (B). We suppose that XAe is defined by elements F (e) (x,s) =
(e) (e)
F1 (x,s), . . . ,Fk (x,s) ∈ C {x}k ⊗ Ae = C {x}k ⊗ C [[s]]/Je . (Recall that Je = J +
me+1 .)
Step 1. Consider the induced deformation over B/m2B . As the Kodaira Spencer map is
surjective, we can find a map A1 −→ B inducing the deformation over B/m2B . This map
is uniquely determined in case the Kodaira-Spencer map is an isomorphism.
Step 2. Now take a nonzero element α ∈ B, such that mB α = 0. Put B ′ = B/(α), and
denote by π : B −→ B ′ the canonical projection. Then dimC (B ′ ) = dimC (B) − 1, and
therefore, by induction, we have for suitable e ∈ N a morphism of C –algebras
σ′
Ae −→ B ′
commutative.
Step 3. Now that we have our map σ we get two different deformations over (B, 0): The
given deformation (XB , 0) −→ (B, 0), and the deformation which is induced by σ. By
assumption the restriction of the two deformations to (B′ , 0) are isomorphic. Hence there
exist
(1) a k × k matrix Λ′ with entries in C {x} ⊗ B ′ , lifting the identity matrix and
(2) a B ′ – automorphism Φ′ defined by φ′1 = Φ′ (x1 ), . . . ,φ′n = Φ′ (xn ) ∈ C {x} ⊗ B ′
lifting the identity,
such that
F (e) (x,σ ′ (s1 ), . . . ,σ ′ (sp )) = FB′ (φ′1 , . . . ,φ′n ,t) ◦ Λ′ .
We now take any lift Λ of Λ′ , and φ1 , . . . ,φn of φ′1 , . . . ,φ′n . We will change σ, Λ, and
φ1 , . . . ,φn in such a way that:
(1) Consider new variables s1 , . . . ,sp , and consider A := C [[s1 , . . . ,sp ]], with maximal
ideal m = (s1 , . . . ,sp ). We set A1 := A/m2 and define a first order deformation:
The inclusion Je+1 + me+1 ⊂ Je−1 is easy, as Je+1 ⊂ Je ⊂ Je−1 , and me+1 ⊂ me ⊂ Je−1
by construction. For the other inclusion, we use mJe−1 = mJe +me+1 ⊂ Je+1 +me+1 . The
equality follows from the induction hypothesis, the inclusion holds because mJe ⊂ Je+1
by definition.
Proof of Theorem 10.3.16. The fact that the constructed deformation is semi-universal
is an immediate consequence of Theorem 10.3.12, Lemma 10.3.17 and Corollary 10.3.9.
It remains to show that all formal versal deformations come from a construction as
in Theorem 10.3.16. This we do by contradiction. Thus let (XA , 0) be a formal semi-
universal deformation. Now certainly, because of the versality, the Kodaira-Spencer map
is surjective. Furthermore dimC (mA /m2A ) is equal to the dimension of TX,
1
0
, because the
deformation is semi-universal. This implies that the Kodaira-Spencer map is an isomor-
phism. Now suppose that for some e the small extension Ae+1 −→ Ae is not maximal
with respect to the deformation (XAe , 0) −→ (Ae , 0). We can follow the construction of
Theorem 10.3.16 to find a formal semi-universal deformation over A′ with A′e = Ae , and
A′e+1 ։ Ae+1 . But we have formal semi-universal deformations over both A and A′ it
follows that dimC (A′e+1 ) = dimC (Ae+1 ), see Remark 10.3.3. Thus Ae+1 was maximal
after all, proving the claim.
358 10 Deformations of Singularities
Remark 10.3.18. Note that in fact we proved something stronger. Indeed, under the
assumptions of Theorem 10.3.16, suppose we have a deformation (XB , 0) −→ (B, 0) of
order q. We have an induced deformation of order q − 1 over the space defined by the
Artinian ring Bq−1 := B + mq /mq . Suppose we have a map ϕ : Ae −→ Bq−1 inducing
this deformation. Then there exists a lift φe : Af −→ B for some f ≥ e, that is, we have
a commutative diagram
e
φ
Af // B
φ
Ae // Bq−1 ,
such that φe induces the given deformation (XB , 0) −→ (B, 0). This property of the con-
structed versal deformation is of great use in the proof of the existence of a convergent
semi-universal deformation.
Theorem 10.3.19. Let (XS , 0) and (S, 0) be germs of complex spaces, and suppose
(XS , 0) −→ (S, 0) is formally semi-universal. Then (XS , 0) is a semi-universal defor-
mation.
Proof. Let a deformation (XT , 0) −→ (T, 0) be given. Suppose that this deformation is
b T,0 k . We have to find a map (T, 0) −→ (S, 0) such
given by functions FT (x,t) ∈ C {x}⊗O
that the deformation (XT , 0) −→ (T, 0) is isomorphic to (XS ×S T, 0) −→ (T, 0). Write
OS,0 = C {s}/J, and OT,0 = C {t}/J ′ . To give a map (T, 0) −→ (S, 0) means that dually
we have to find a map φ : C {s} −→ C {t} with φ(J) ⊂ J ′ . The induced deformation
(XS ×S T, 0) −→ (T, 0) is given by FS (x,φ(s)) ∈ C {x,t}k . This has to be isomorphic
to the given deformation defined by FT . Therefore, we have to find an automorphism
Φ : C {x,t}n −→ C {x,t}n , and a matrix Λ ∈ C {x,t}k such that
(J + me+1 )/(mJ + me+2 ) // C [[s]]/mJ + me+2 // C [[s]]/J + me+1 .
We obtain, using the second part of Theorem 10.3.8, for the corresponding obstruction
2 ∗
map TX, 0
−→ J + me+1 /mJ + me+2 a commutative diagram
2
∗
TX, // J/mJ
0
QQQ
QQQ
QQQ
QQQ
Q((
J + me+1 /mJ + me+2 .
2
∗
As by the second assumption, the obstruction map TX, 0
−→ J/mJ is surjective, the
image of the obstruction map of the second extension is thus (J + me+2 )/(mJ + me+2 ).
By Corollary 10.3.9 we see that for all e the extension C [[s]]/J + me+2 = Ae+1 −→ Ae =
C [[s]]/J + me+1 is maximal with respect to the deformation (XAe , 0) −→ (Ae , 0). Thus
we have a formal versal deformation by Theorem 10.3.12.
Step 2. Now suppose (XA , 0) is formally semi-universal. Consider the formal deforma-
tion constructed in Theorem 10.3.16. By construction, for all e, the obstruction element
associated to the extension
maps surjectively onto Je+1 /mJe+1 = J + me+2 /mJ + me+2 . Now for some e ≫ 0 the
minimal number of generators of Je+1 mod me+2 is equal to the minimal number of
generators of J by Exercise 1.3.19. Thus the obstruction map is surjective. The Kodaira-
Spencer map is an isomorphism by definition of the first order deformation.
Now we come to the proof of the main theorem.
Theorem 10.3.21 (Grauert). Let (X, 0) be a germ of a complex space. Suppose that
1
dimC (TX, 0
) < ∞. Then there exists an analytic semi-universal deformation of (X, 0).
Proof. The previous theorem says that it remains to give a convergent deformation which
is formally semi-universal. Let (X, 0) be defined by f = (f1 , . . . ,fk ) ∈ C {x}k , and let
r1 , . . . ,rt ∈ C {x}k generate the module of relations between the f1 , . . . ,fk . Take a formal
semi-universal deformation of (X, 0) over A = C [[s]]/J, and let m = (s). Let e be a
natural number, such that the minimal number of generators of J is equal to the minimal
number of generators of J modulo me+1 . As C [[s]] is Noetherian local ring, such an e
exists according to Exercise 1.3.19. Let g1 , . . . ,gp be the generators of J +me+1 /me+1 . We
may assume that the gi are polynomials in s. In order to apply Grauert’s Approximation
Theorem, we will have to interpret the flatness condition in terms of equations. Now
flatness means that we can find lifts F,R1 , . . . ,Rt ∈ C {x,s} of f,r1 , . . . ,rt and vectors
M1 , . . . ,Mt ∈ C {x,s}p with
(10.11) Ri · F − g · Mi = 0 for i = 1, . . . t.
360 10 Deformations of Singularities
We want to apply Grauert’s Approximation Theorem 8.2.2 to this set of equations. Our
formal semi-universal deformation of (X, 0) over A = C [[s]]/J gives solutions modulo
(q) (q)
mq+1 of (10.11), F (q) , Ri , g (q) , and Mi for all q ≥ e, lifting those for q = e. Now
′ ′
′ (q) ′ (q)
suppose F (q) , Ri , g (q) and Mi is a solution modulo mq+1 of (10.11) lifting F (e) ,
(e) (e) (e)
Ri , g , and Mi . In order to apply Grauert’s Approximation Theorem, we have to
′ (q)′ ′ (q)′
prove that the F (q) , Ri , g (q) and Mi can be lifted to solutions modulo mq+2 . The
′ ′
(q) (q)
ideal Jq′ = (g1 , . . . ,gp ) has p generators, just as many as Je . From Lemma 10.3.10 it
follows that the surjection C [[s]]/Jq′ −→ C [[s]]/Je is induced by a sequence of maximal
small extensions. Therefore Jq′ can by 10.3.16 be extended to an ideal J ′ , so that there
exists a formal semi-universal deformation over C [[s]]/J ′ . Hence from Remark 10.3.3 it
follows that the formal algebras C [[s]]/J ′ and C [[s]]/J are isomorphic, and thus that
C [[s]]/Jq′ and C [[s]]/Jq are isomorphic for all q. Thus for dimension reasons the minimal
ideal I with
′
(1) mJq+1 ⊂ I ⊂ Jq′ and,
(2) there exists a lift of the deformation over (A′q , 0) to one over (A′q+1 , 0) with A′q+1 =
C [[s]]/I,
′
is the ideal Jq+1 , as this is also the case for Jq+1 . Thus there does indeed exists a lift
′ (q+1)′ ′ (q+1)′ ′ (q)′ ′ (q)′
F (q+1) , Ri , g (q+1) and Mi of F (q) , Ri , g (q) and Mi . Thus Grauert’s
Approximation Theorem 8.2.2 applies, and the theorem is proved.
Exercises
10.3.22. Prove Theorem 10.3.8 for the formal and the analytic case.
(Hint: Use Exercise 1.3.19 to show that there exists an n ∈ N with OX,0 ⊗J ∼
= OX,0 ⊗(J +mn /mn ),
and use the theorem already proved for the Artinian case.)
10.3.23. Let ϕ : B −→ A be a surjective map of Artinian C –algebras. Suppose that dimC (A) =
dimC (B). Show that ϕ is an isomorphism.
10.3.27. Prove Theorem 10.3.21 for the algebraic case: Let (X, 0) be an isolated singularity,
1
defined by algebraic power series. Suppose dimC (TX, 0 ) < ∞. Then there exists a semi-universal
deformation of (X, 0) which is defined by algebraic power series.
(Hint: Use Corollary 8.3.7.)
361
Bibliography
[Artin 1968] Artin, M.: On the solution of analytic equations. Inv. math.
5, 277–291.
[Brieskorn-Knörrer 1986] Brieskorn, E., Knörrer, H.: Plane Algebraic Curves.
Birkhäuser, Basel, Boston, Stuttgart.
[Buchberger 1965] Buchberger, B.: Ein Algorithmus zum Auffinden der Ba-
siselemente des Restklassenringes nach einem nulldimen-
sionalen Polynomideal. Dissertation Innsbruck.