Polynomials (Lecture Notes)
Polynomials (Lecture Notes)
1 Polynomials in n variables
x 4x + 5 and g (f (x)) = (x + 1) 2 = x 1.
2 2 2
1
(ii) Evaluation Given a polynomial f (x) 2 R[x℄ and an element a 2 R, we
an repla
e every o
urren
e of x in f (x) by a to obtain a new element of
R, whi
h we denote by f (a). In this way, the polynomial f (x) denes a
fun
tion from R to R. This fun
tion is
alled a polynomial fun
tion.
Although there is a logi
al distin
tion between polynomials and polynomial fun
-
tions, it really does no harm to identify these two
on
epts. However, you should
be aware that two polynomials are equal if and only if they have identi
al
o-
eÆ
ients, whereas two fun
tions are equal if and only if they have the same
graphs. The dieren
e
an be seen by taking the eld F to be nite; for ex-
ample if p is a prime number and F = Fp, the eld with p elements, then the
polynomials xp and x dene the same fun
tion. But if F is an innite eld su
h
as Q (the rational numbers), R (the real numbers) or C (the
omplex numbers),
two polynomials are equal if and only if the
orresponding polynomial fun
tions
are equal. Espe
ially for F = R, polynomial fun
tions give a geometri
way to
visualise polynomials via their graphs.
1.2 Polynomials in two variables
When we try to write down a denition like (1) for a general polynomial in two
variables x and y with
oeÆ
ients in R, we have a diÆ
ulty: whi
h order do
we write the terms in? For example should the `standard form' of a quadrati
polynomial in x and y be a ; + a ; x + a ; y + a ; x + a ; xy + a ; y , or should
00 10 01 20
2
11 02
2
the term in x
ome before the term in y? Noti
e that we
an
olle
t terms in
2
or as (a ; + a ; y + a ; y ) + (a ; + a ; y)x + a ; x .
00 01 02
2
10 11 20
2
where ai;j 2 R and the sum is nite, i.e. ai;j = 0 for all but a nite number of
indexing pairs (i; j ). Although the problem resurfa
es when we try to do pra
ti
al
al
ulations, we adopt (2) as the formal denition. We
all an expression xiyj a
monomial in x and y , and if ai;j 6= 0 we
all ai;j xi y j a term of the polynomial.
The (total) degree of this monomial is i + j , and the (total) degree of a
nonzero polynomial is the maximum of the degrees of its terms. A polynomial is
alled homogeneous of degree d if it has no term of degree 6= d. (Thus the
zero polynomial is `homogeneous of degree d' for all d 0, although its degree is
not dened!)
The set of all polynomials (2) forms a ring R[x; y℄ with the usual addition and
multipli
ation operations,
alled the ring of polynomials in two variables over
R. By
olle
ting terms with the same power of x, we
an regard a polynomial in
R[x; y ℄ as a polynomial in x with
oeÆ
ients whi
h are polynomials in y , i.e. as
2
an element of S [x℄ where S = R[y℄. Thus we have a natural way to identify the
rings R[x; y℄ and (R[x℄)[y℄. Similarly, we
an identify R[x; y℄ and (R[y℄)[x℄.
As for the one variable
ase, we shall often use letters su
h as f; g; h or p; q; r
to denote polynomials; if it is ne
essary to draw attention to the variables being
used, then we write f (x; y) et
instead of f . If R = F is a eld, then a polynomial
f in F [x; y ℄ is always divisible by any nonzero
onstant polynomial, i.e. any
polynomial of degree 0.
It is easy to see that the division theorem fails for polynomials in two variables.
Suppose, for example, that we try to divide the polynomial x by the polynomial
y . This means we are looking for polynomials q and r su
h that x = qy + r and
deg r < 1 or r = 0. Thus r must be a
onstant polynomial, and
learly there is
then no polynomial q 2 R[x; y℄ su
h that x = qy + r.
1.3 Polynomials in n variables
The denitions above are easily extended to the
ase of n > 2 variables. A
monomial in n variables x ; : : : ; xn is an expression of the form
1
m = x xn ; 1
1 n
(0 0)
In multi-index notation, a polynomial
an be written as follows.
f = a x + : : : + a r x r
(1)
(1)
( ) (3)
( )
where the
oeÆ
ients a ; : : : ; a r are elements of R. The set of all polynomials
(1) ( )
of the form (3) forms a ring R[x ; : : : ; xn℄ with the usual addition and multipli
a-
1
tion operations,
alled the ring of polynomials in n variables over R. Noti
e
that by
olle
ting powers of one of the variables, xn say, we
an identify the rings
R[x ; : : : ; xn ℄ and R[x ; : : : ; xn ℄[xn ℄.
1 1 1
The (total) degree of a polynomial f 6= 0 is the maximum of the degrees of
the terms appearing in f . If all terms appearing in f have the same degree d,
f is
alled a homogeneous polynomial of degree d or a form of degree d.
In the
ase d = 1 we speak of a linear form, in the
ase d = 2 of a quadrati
form, in the
ase d = 3 of a
ubi
form, et
.
3
2 Ideals in polynomial rings
Thus hx ; yi is the set of all polynomials su
h that the
onstant term and the
2
ideal in R = F [x; y℄. For if I = hf i, then the polynomial f would have to divide
both x and y. But the only
ommon divisors of x and y are nonzero
onstant
2 2
4
Proof Let I F [x℄ be an ideal. If I = f0g then it is prin
ipal, so we may
assume that I
ontains a nonzero polynomial g of minimum degree. Clearly
hgi I . We shall prove that I hgi, so that I = hgi and hen
e I is prin
ipal.
Thus let f 2 I . By the Division Theorem 1.1, there exist polynomials q; r 2
F [x℄ su
h that f = qg + r and deg r < deg g or r = 0. Sin
e r = f qg and
f; g 2 I , it follows from the denition of an ideal that r 2 I . By our
hoi
e of g ,
it is impossible that deg r < deg g, and hen
e r = 0. But then f = qg, and so
f 2 hg i.
2.3 Finitely generated ideals
We see from Theorem 2.2 that the failure of the Division Theorem in the polyno-
mial ring F [x; y℄ in two variables over the eld F is
losely related to the existen
e
of ideals whi
h are not prin
ipal. From Example 2.1, we might be tempted to
guess that every ideal in F [x; y℄ is generated by at most two elements. The
following example shows that this is far too optimisti
.
Example 2.3 Let F be a eld, let n 1 and let I = hxn ; xn y; : : : ; xy n ; y ni
1 1
be the ideal in F [x; y℄ generated by the set of all monomials of degree n. We shall
prove that I
annot be generated by fewer than n + 1 polynomials.
First noti
e that the polynomials in I are the polynomials in whi
h no term
of degree < n appears. Let f ; : : : ; fm be a generating set for I : we must prove
1
that m P n +1. Ea
h of the n +1 monomials xi y n i
an be expressed in the form
xi y n i = mj gi;j fj where gi;j 2 F [x; y ℄. Sin
e the fj have no terms of degree
< n, equating terms of degree n gives x y = j ai;j hj , where hj is the degree
=1 Pm
i n i
n homogeneous part of fj and ai;j 2 F is the
onstant term of gi;j . However, the
=1
5
Does every ideal in F [x ; : : : ; xn℄ have a nite generating set?
1
This is a good way to onstru t lots of interesting rings, as we shall see later.
6
3 Monomial Ideals and Di
kson's Lemma
have zero onstant term and in whi h the monomials y, y and all xn (n 1) do 2
not appear.
7
3.1 Sums of ideals
Let I and J be ideals in R. Then their sum I + J is dened by
I + J = fa + b j a 2 I; b 2 J g: (7)
This is an ideal in R. If A and B are subsets of R su
h that A generates I and
B generates J , then I + J is generated by the union A [ B of these generating
sets. It follows that the sum of two monomial ideals is again a monomial ideal.
Example 3.3 If I = hx y; y i and J = hx ; xy i, then I + J = hx y; y ; x ; xy i.
2 3 4 2 2 3 4 2
Noti
e that we
an throw out x y from this list of generators, be
ause it is di-
4 3
8
Example 3.5 If I = hx y; y i and J = hx ; xy i, I \ J = hx y; x y ; x y ; xy i.
2 3 4 2 4 2 2 4 3 3
Noti
e that we
an throw out the element x y from the list of generators, be
ause
4 3
set M of monomials.
Clearly all these monomials are in I . Let xyk be a monomial in I . If k m,
then sin
e x 2 J and J is the monomial ideal generated by fx ; : : : ; x s g, (1) ( )
regard x y as the remainder and hen
e `smaller' than xy . To es
ape from this
2 2
Denition 4.1 Let F be a eld and let M be the set of all monomials x in
P = F [x1 ; : : : ; xn ℄. A monomial ordering on P is a total ordering < on M
whi
h is
ompatible with multipli
ation, i.e. the following four axioms hold.
(tri
hotomy rule) Given x ; x 2 M , exa
tly one of the three statements
x < x , x < x and x = x is true.
(transitive rule) If x ; x ; x
2 M satisfy x < x and x < x
, then
x < x
.
(initialisation rule) If x 2 M and x 6= 1, then 1 < x .
(multipli
ation rule) If x ; x ; x
2 M and x < x , then xx
< x x
.
Along with the < symbol, we shall use the symbols ; >; with the obvious
meanings.
We give three examples. In these examples, we shall assume (as part of
the ordering) that the variables are ordered so that x > x > > xn. Let 1 2
x = x xn , x = x xn .
1
1 n
1
1 n
11
In multi-index notation, the standard form of a typi
al nonzero polynomial f 2 P
(with respe
t to a given monomial ordering)
an be written as
f = a x + : : : + a r x r
(1)
(1)
( )
( )
(12)
where x > : : : > x r and the
oeÆ
ients a ; : : : ; a r are nonzero elements
(1) ( )
(1) ( )
of F .
We
all x the leading monomial of f , LM(f ), a the leading
oeÆ-
(1)
(1)
ient of f , LC(f ), and a x the leading term of f , LT(f ).
(1)
(1)
12
5 A division algorithm in n variables
1 2
More generally, if I is a monomial ideal, then we
an obtain a uniquely dened
remainder r by deleting from f all terms involving monomials in I . Note that
the qi will not be uniquely determined by (13) when s > 1.
We must rst x a monomial ordering in P . To see the need for this,
onsider
the
ase f = 2x +3y +4z, g = x + y + z in F [x; y; z℄. With the ordering x > y > z,
the quotient is 2 and the remainder is y + 2z, but with the ordering z > y > x,
the quotient is 4 and the remainder is 2x y. In ea
h
ase, all monomials whi
h
appear in the remainder are lower than the leading monomial in g, and this is a
ondition we wish to use to
hara
terise the remainder r in (13).
Here is an example to show what is involved. The basi
idea is the same as
for division in the
ase n = 1, i.e. at ea
h step we
an
el the leading term of f
with respe
t to the given monomial order, by multiplying some divisor gi by a
suitable monomial and subtra
ting a s
alar multiple of this produ
t from f .
Example 5.1 Using Lex order in F [x; y ℄ with x > y , we will divide f = xy + 1 2
13
Example 5.2 Again using Lex order in F [x; y ℄ with x > y , we will divide f =
x2 y + xy 2 + y 2 by g1 = xy 1 and g2 = y 2 1. The rst two steps go as before
(remember that we use g1 when both leading terms divide):
f x(xy 1) = xy + x + y = f
2 2
1
f1 y (xy 1) = x + y + y = f 2
2
Input: g ; : : : ; gs; f
1
Output: q ; : : : ; qs; r
1
q := 0; : : : ; qs := 0; r := 0
1
p := f
WHILE p 6= 0 DO
i := 1
div := false
WHILE i s AND div = false DO
IF LT(gi) divides LT(p) THEN
qi := qi + (LT(p)=LT(gi ))
p := p (LT(p)=LT(gi ))gi
div := true
ELSE
14
i := i + 1
IF div = false THEN
r := r + LT(p)
p := p LT(p)
The rest of the proof
onsists of
he
king that this algorithm operates
orre
tly.
The variable p represents the intermediate polynomial still to be divided, and the
algorithm terminates when p = 0. The Boolean variable `div' tells us whether
the leading term of some gi divides the leading term of p.
To prove that the algorithm works, we rst
he
k that the equation f =
q g + : : : + qs gs + p + r holds at ea
h stage. There are two
ases, depending
1 1
on whether a `division step' (div = true) or a remainder step (div = false) has
o
urred. It follows that f = q g + : : : + qsgs + r when the algorithm stops. Sin
e
1 1
the leading monomial of p de
reases in the given monomial order at ea
h step,
and sin
e every stri
tly de
reasing sequen
e in the monomial order is nite, the
algorithm does always stop. You should
he
k that the polynomials output by
the algorithm have the required properties.
The next example shows that this algorithm does not have all the properties
we would like.
Example 5.4 We repeat Example 5.2, inter
hanging the divisors. That is, using
Lex in F [x; y℄ with x > y, we will divide f = x y + xy + y by g = y 1 and
2 2 2
1
2
g = xy 1.
2
This time the algorithm produ
es q = x + 1, q = x, r = 2x + 1. So the
1 2
output depends on the order of the input polynomials g ; : : : ; gs; in parti
ular,
1
the remainder is not uniquely spe
ied by the properties stated in Theorem 5.3.
The next example shows that this problem
an also ae
t the question of
whether the remainder is zero.
Example 5.5 Using Lex in F [x; y ℄ with x > y , we will divide f = xy x by 2
6 Gr
obner bases
monomial ideal hLT(f ); LT(f )i = hx ; x yi is stri
tly smaller than hLT(I )i.
1 2
3 2
16
Proof Sin
e hLT(I )i is a monomial ideal, this follows immediately from Di
k-
son's Lemma (Theorem 3.6).
Theorem 6.5 (Hilbert Basis Theorem) Let F be a eld and let I be an ideal
in F [x1 ; : : : ; xn ℄. Then I is nitely generated, i.e. I = hg1 ; : : : ; gsi for some
g1 ; : : : ; gs 2 I .
Proof If I = 0, we take f0g as the generating set. Otherwise we take
g1 ; : : : ; gs as in Proposition 6.4. Sin
e ea
h gi 2 I , it is
lear that hg1; : : : ; gsi I .
We
laim that I hg1; : : : ; gsi, so that in fa
t I = hg1; : : : ; gsi, and so g1; : : : ; gs
is the required nite generating set for I .
Thus let f 2 I . By the division algorithm (Theorem 5.3), we
an write f in
the form f = q1g1 + : : : + qsgs + r, where no monomial appearing in the remainder
r is divisible by any of the leading monomials LM(g1 ); : : : ; LM(gs). We shall
prove by
ontradi
tion that r = 0.
If r 6= 0, then LT(r) 2 hLT(I )i = hLM(g1); : : : ; LM(gs)i. Sin
e this is a
monomial ideal, it follows from Theorem 3.2 that LT(r) is divisible by LM(gi)
for some i. This is a
ontradi
tion. Hen
e r = 0, and so f = q1g1 + : : : + qsgs.
Thus f 2 hg1; : : : ; gsi.
6.3 Noetherian rings
To
omplete our dis
ussion of the Hilbert Basis Theorem, we shall relate it to the
general stru
ture theory of
ommutative rings.
Theorem 6.6 (Noetherian rings) Let R be a
ommutative ring with 1. Then
the following
onditions are equivalent.
(i) Every ideal I in R has a nite generating set.
(ii) There is no innite stri
tly in
reasing sequen
e of ideals in R; more pre
isely,
given a sequen
e fIn gn1 of ideals in R su
h that I1 I2 I3 : : :, there
exists a positive integer n0 su
h that In = In0 for all n n0 .
Condition (ii) is often
alled the as
ending
hain
ondition (ACC). A ring
R satisfying either, and hen
e both, of these
onditions is
alled a Noetherian
ring.
Proof (i) ) (ii): Let I I I : : : be an as
ending
hain. Dene
1 2 3
I= 1 I . It is easy to
he
k that I is an ideal. By (i), I has a nite generating
S
n=1 n
set f1; : : : ; fs. Choose ni su
h that fi 2 In for 1 i s, and dene n0 = max ni .
Then f1; : : : ; fs are all in In0 . Hen
e I In0 . But In0 I , hen
e In0 = I . Hen
e
i
18
7 Gr
obner bases
Denition 7.1 Let I be an ideal in P , and let < be a monomial order. A nite
set of polynomials g1 ; : : : ; gs in I is
alled a Gr
obner basis (or standard basis)
of I if and only if
hLT(g ); : : : ; LT(gs)i = hLT(I )i:
1 (14)
Our proof of Theorem 6.5 shows that
a Grobner basis for I is indeed a generating set for I , and
every nonzero ideal I in P = F [x ; : : : ; xn℄ has a Grobner basis.
1
19
Proof Let g ; : : : ; gs be a Grobner basis for I . By the division algorithm,
1
Theorem 5.3, we
an write f in the form f = q g + : : : + qsgs + r where the
1 1
quotients q ; : : : ; qs and the remainder r are in P . Thus f = g +r where g = q g +
1 1 1
: : : + qs gs 2 I , so that (i) is satised. Theorem 5.3 also tells us that no term of r
is divisible by any of LT(g ); : : : ; LT(gs). Sin
e hLT(I )i = hLT(g ); : : : ; LT(gs)i,
1 1
a monomial x is in hLT(I )i if and only if it is divisible by some LT(gi). Hen
e
if x 2 hLT(I )i, then x does not divide any term of r.
Finally we must show that if r0 also satises (i) and (ii) then r = r0. By (i),
we have f = g0 + r0 where g0 2 I , so r0 r = g g0 2 I . If r0 r 6= 0, then
LT(r0 r) 2 hLT(I )i. But no monomial in r or r0 is divisible by any monomial
in hLT(I )i. This is a
ontradi
tion, hen
e r = r0.
The remainder r of f is sometimes written as f . It is parti
ularly
onvenient
for
al
ulations in the quotient ring P=I (Se
tion 2.4).
There is an important familiar
ase of the quotient ring
onstru
tion, when
F = R and I = hx + 1i. The ring R[x℄=hx + 1i is isomorphi
to the eld of
2 2
7.2, the generator of a prin
ipal ideal is a Grobner basis for the ideal.) Thus
hLT(I )i = hx i. Now the usual method of
al
ulating with
omplex numbers is
2
The following result shows how to generalise this type of
al
ulation in quo-
tient rings.
Proposition 7.4 Fix a monomial ordering in P = F [x ; : : : ; xn ℄, and let I be
1
an ideal in P . Then
(i) the
orresponden
e r ! r + I is a bije
tion between the set of all remainders
r of polynomials in P and the set P=I of all
osets of I in P ;
(ii) the set of
osets whi
h
orrespond to monomials x 2P su
h that x 62
hLT(I )i is a basis for P=I as a ve
tor spa
e over F ;
(iii) the quotient ring P=I is isomorphi
to the ring whose elements are the
remainders of polynomials in P , with the same addition operation as in P
but with the produ
t of r1 and r2 taken as the remainder of r1 r2 .
Proof By taking r = 0, Proposition 7.3 shows that f 2 I if and only if
f = 0; and by taking g = 0, Proposition 7.3 shows that f = f if and only if no
term of f is in hLT(I )i. It is also easy to see from Proposition 7.3 again that
f + f = f + f and that the remainder of f f is f f . The details are left as
1 2 1 2 1 2 1 2
an exer
ise.
20
8 S -polynomials
In this se
tion we shall see how to
he
k that a given set of polynomials in an
ideal I of P is a Grobner basis. This in turn will lead to an algorithm for the
onstru
tion of Grobner bases.
Denition 8.1 Let f , g be nonzero polynomials and let
x
= LCM(LM(f ); LM(g ))
be the least
ommon multiple of their leading monomials. The S-polynomial
S (f; g ) is dened by
x
x
S (f; g ) = (15)
LT(f ) f LT(g) g:
Note that S (f; f ) = 0 and S (g; f ) = S (f; g) for all f; g 2 P .
Example 8.2 Let f = x y + y + x, g = 2xy x , where the ordering is DegLex
2 2 2 2
= xx yy (x y + y + x) 2xxyy (2xy x );
2 2 2 2
S (f; g ) 2
2 2
2
2 2
= 21 x + y + xy:
3 3
The important thing to noti
e about this example is that the leading terms of f
and g have
an
elled, and that although S (f; g) = yf xg 2 I where I = hf; gi, 1
the leading term LT(S (f; g)) = x 62 hx y; xy i, the monomial ideal generated
2
1 3 2 2
Sin
e S (f; g) 2 I and LT(S (f; g)) 62 hLT(f ); LT(g)i, it follows that f and g
do not form a Grobner basis for the ideal I . It is natural to add h = S (f; g) to
the generating set ff; gg for I , and to ask whether ff; g; hg is a Grobner basis
for I .
We shall see that this method of adding S -polynomials to a given generating
set for an ideal I in P always su
eeds in produ
ing a Grobner basis for I in a
nite number of steps. Somehow, S -polynomials of pairs of polynomials in the
set fg ; : : : ; gr g a
ount for all
an
ellations of leading terms in forming elements
1
of the ideal hg ; : : : ; gr i. The next result is a key property of S -polynomials.
1
21
Proof First we do the spe
ial
ase where the fi are moni
, i.e. LC(fi ) = 1
for 1 i s. Sin
e LT(fi) = LT(fj ) = xÆ in this
ase, S (fi; fj ) = fi fj .
Clearly LM(S (fi; fj )) < xÆ sin
e the leading terms
an
el.
Consider the `teles
oping' sum
s
X
(
+ : : : +
i)(fi
1 fi+1 );
i=1
If this holds, then the last term in the teles
oping sum vanishes, and we have
=1 =1
g = is (
+ : : : +
i )S (fi ; fi ).
P 1
1 +1
In the general
ase, let LC(fi) = ai for 1 i s, so that g = Psi
iai (fi=ai)
=1
and LC(g) = i
iai. Sin
e fi=ai is moni
and S (fi; fj ) = S (fi=ai; fj =aj ), the
Ps =1
general
ase follows by applying the spe
ial
ase to the polynomials fi =ai.
=1
x xÆ
= ax f
bx
g
= x S (f; g)
where
i = Æi i.
We must
he
k that x
really is a monomial, i.e.
i 0 for all i. For this,
note that the inequalities i + i i , i + i i imply that Æi i and Æi i ,
and hen
e Æi i as required.
22
9 Bu
hberger's
riterion
t
X
f =
i fi + f0 ; (17)
i=1
where f = i (hi LT(hi ))gi + i t higi. Thus LM(fi) = xÆ forP1 i t,
0
Pt
=1
Ps
= +1
while LM(f ) < xÆ . It follows from (17) that LM(g) < xÆ , where g = ti
i fi.
0
Thus the leading terms of the fi
an
el when we form the linear
ombination
=1
t 1
X
g= bi S (fi ; fi+1 )
i=1
23
where bi 2 F . Sin
e fi = x i gi, Proposition 8.4 shows that S (fi; fi ) =
( )
+1
x
i S (gi ; gi ) for some monomial x
i . Hen
e
( )
+1
( )
t 1
X
g= bi x
(i) S (gi; gi+1 ):
i=1
We are now ready to use the hypothesis that all the S -polynomials of pairs of
polynomials in the set G have remainder 0. Thus by the division algorithm
(Theorem 5.3) we
an write S (gi; gi ) in the form +1
s
X
S (gi; gi+1 ) = si;j gj
j =1
for some polynomials si;j 2 P su
h that LM(si;j gj ) LM(S (gi; gi )) for all i; j . +1
Hen
e t s 1 s
X X X
g= bi x
i si;j gj = kj gj ;
( )
i=1 j =1 j =1
where kj = Pt 1
i=1 bi x . Now noti
e that
(i) s
i;j
LM(kj gj ) max
it
LM(x
i si;j gj )
1 1
( )
max
it
LM(x
i S (gi; gi ))
1 1
( )
+1
max
it
LM(S (fi; fi ))
1 1
+1
< xÆ ;
sin
e the terms in xPÆ int fi and fPi s
an
el in S (fi; fi ).
+1 +1
Now substitute i
ifi = j kj gj in (17), and noti
e that f = Psi h0igi
=1 =1 0 =1
where LM(h0igi)P<s xÆ for all i. After
olle
ting terms in g ; : : : ; gs, we get an 1
expression f = i h00i gi where h00i 2 P and LM(h00i gi) < xÆ for 1 i s. But
this
ontradi
ts the denition of xÆ . Thus Bu
hberger's Criterion is proved.
=1
24
10 Bu
hberger's algorithm
G := (f1 ; : : : ; ft )
REPEAT
G0 := G
FOR ea
h pair fp; q g, p 6= q , in G0 DO
r := remainder of S (p; q ) on division by G0
IF r 6= 0 THEN G := G [ frg
UNTIL G = G0
Proof First note that if p; q 2 I then S (p; q) 2 I . Initially, the list G is the
given list of generators (f ; : : : ; ft ) for I . As the algorithm pro
eeds, we add to
1
G all the remainders r of S -polynomials of pairs of elements in the
urrent list
G0 on division by polynomials whi
h are already in G0 . These remainders must
also be in I , and so, at every stage of the algorithm, G is a list of generators for
the same ideal I .
The algorithm terminates when all these remainders are 0. It follows imme-
diately from Theorem 9.1 that at this point the list G is a Grobner basis for
I.
It remains to prove that the algorithm does terminate for all possible inputs.
Consider the monomial ideal hLT(G)i generated by the leading monomials of
all the elements of G. The new remainder r
ontains no monomials whi
h are
divisible by the leading terms of elements of G, so in parti
ular LT(r) 62 hLT(G)i.
Thus the sequen
e of ideals hLT(G)i whi
h arise as the algorithm pro
eeds is an
as
ending
hain. By Theorem 6.6, su
h a
hain must stop after a nite number
of steps. Hen
e the algorithm terminates.
Example 10.2 In F [x; y ℄ with Lex and x < y , let f = xy x, f = y + x ,
1 2
2
I = hf ; f i.
1 2
25
We start with g = f , g = f , G = (g ; g ). Then S (g ; g ) = (xy x) +
1 1 2 2 1 2 1 2
x( y + x ) = x + x . Sin
e LM( x + x ) = x is not divisible by LM(g ) = xy
2 3 3 3
1
or LM(g ) = y, r = S (g ; g ) = x x in this
ase.
2 1 2
3
S (f ; f ) = x (x y 2y + x) y (x 2xy ) = x ;
2 1
2 2 3 2
and we see that the remainder is still x , so, as expe ted from Example 6.3, we 2
g =x .
3
2
through the REPEAT loop shows that we nally have a Grobner basis
fx 2xy; x y 2y + x; x ; 2xy; 2y + xg:
3 2 2 2 2
The algorithm in its most basi
form is not very eÆ
ient. In pra
ti
e it is better
to add the new remainders to G one at a time. Then we only have to
he
k the
S -polynomials involving the
urrent set G and the new remainder to be added
to it. In Example 10.3, this gives the following sequen
e of
al
ulations.
S (g ; g ), getting g :
1 2 3
S (g ; g ), getting g ; S (g ; g ), getting g :
1 3 4 2 3 5
DONE!
26
11 Minimal and redu
ed Gr
obner bases
A nonzero polynomial ideal I has innitely many Grobner bases, sin
e we
an add
any element of I to an existing Grobner basis to get a new one, and we
an also
multiply any of the polynomials in the Grobner basis by a nonzero s
alar. Thus
we are interested in making our Grobner bases as small (and tidy) as possible. In
parti
ular, the Grobner basis G output by Bu
hberger's algorithm may
ontain
`redundant' generators gi whose leading term LT(gi) is
ontained in the ideal
generated by the leading terms of the other elements of G. Thus in Example
10.3 we
an drop out g and g from G, sin
e their leading terms x and x y
1 2
3 2
are divisible by the leading term x of g . This gives a smaller Grobner basis
2
3
fx ; 2xy; 2y + xg.
2 2
to the minimal Grobner basis fx ; xy; y xg, whi
h is in fa
t redu
ed. In the
2 2 1
27
To see this, noti
e that this redu
tion pro
ess does not
hange the leading
terms, i.e. LT(hi) = LT(gi). Hen
e we have a minimal Grobner basis at ea
h
stage of the redu
tion. Finally note that, in the division algorithm, any term
whi
h appears in the remainder is not divisible by the leading monomials of any
of the divisors.
Next we prove uniqueness. Thus let G = fg ; : : : ; gsg and H = fh ; : : : ; htg
1 1
be two redu
ed Grobner bases for I . We rst show that s = t and that the
leading terms of the two bases are the same, i.e. we
an renumber the h's so that
LM(gi) = LM(hi) for all i. Consider g . Sin
e g 2 I and H is a Grobner basis
1 1
for I , there is an i su
h that LM(hi) divides LM(g ). But sin
e hi 2 I and G
1
is a Grobner basis for I , there is a j su
h that LM(gj ) divides LM(hi). Hen
e
LM(gj ) divides LM(g ). Sin
e G is minimal, it follows that j = 1 and hen
e
1
LM(hi ) = LM(g ). Renumber H so that hi be
omes h . Now
onsider g . In
1 1 2
the same way there is an i su
h that LM(hi ) divides LM(g ). This
annot be h ,
2 1
sin
e LM(h ) = LM(g ) and LM(g ) does not divide LM(g ), by minimality of G.
1 1 1 2
Now LM(gj ) divides LM(hi) for some j , and again minimality of G implies that
j = 2. Hen
e LM(g ) = LM(hi ), and we renumber Hi as h . It is
lear that this
2 2
pro
ess
an be
ontinued until all the elements of G and H are paired o.
Finally we must prove that gi = hi for 1 i s. Let f = gi hi , and assume
f 6= 0 so that LM(f ) is a monomial. Sin
e we have proved that LT(gi ) = LT(hi ),
we have LM(f ) < LM(gi). Sin
e f 2 I and G is a Grobner basis for I , LM(gj )
divides LT(f ) for some j . Clearly j 6= i sin
e LM(f ) < LM(gi). But LM(gj ) =
LM(hj ) does not divide any term of gi or hi, sin
e the Grobner bases G and H
are both redu
ed. Hen
e LM(gj ) does not divide any term of f , and in parti
ular
it does not divide LT(f ). This is a
ontradi
tion. Hen
e f = 0.
This result gives a way to test whether two sets of polynomials are generators
for the same ideal. We need only
ompute a redu
ed Grobner basis starting from
ea
h set in turn. The ideals will be equal if and only if the redu
ed Grobner bases
are the same set.
Many
omputer algebra systems (e.g. Maple, Mathemati
a)
an be used
to
ompute Grobner bases. These always return the redu
ed Grobner basis with
respe
t to a given monomial ordering (ex
ept that, to avoid fra
tions, the basis
may not be moni
). Thus it is easy to
ompare results obtained using dierent
systems. These systems use various implementations of (renements of) Bu
h-
berger's algorithm. Be
ause of the wide appli
ations of Grobner bases in algebra
and geometry, these pa
kages are useful to s
ientists and engineers as well as to
mathemati
ians.
It is worth mentioning that when the given generating set for I
onsists of
linear polynomials, the redu
ed Grobner basis
orresponds to the `redu
ed row-
e
helon form' of the
oeÆ
ient matrix of the
orresponding system of linear equa-
tions.
28
12 Appli
ations of Gr
obner bases
well use it. (But the method works with any Grobner basis.)
Using the division algorithm, we nd that
x y x = (x y + x y + x y + x )(y x ) + (x + x )(x x):
2 4 6 2 3 4 2 6 8 2 7 5 3
Noti
e that in both
ases we have arranged the terms of the quotients q and 1
q in de
reasing Lex order.
2
There is another way to think about the ideal membership problem. If we have a
set of equations f = 0; : : : ; fs = 0
orresponding to the generators of the ideal I ,
1
we
an ask whether the equation f = 0 is a
onsequen
e of these. If f 2 I , then
f = q f + : : : + qs fs = 0 when all fi = 0. In Example 12.1, the equations xy = x
1 1
and x = y imply x y x = (xy) y x = x (x ) x = 0. Grobner bases
2 2 4 6 2 2 6 2 2 2 6
provide a way of doing
al
ulations of this kind systemati
ally, so that they
an
be done by a
omputer.
29
12.2 The elimination problem
Problem: Given a set of polynomial equations f1 = 0, . . . , ft = 0 in
F [x1 ; : : : ; xr ; y1; : : : ; ys℄, how
an we eliminate the variables x1 ; : : : ; xr from
these equations so as to obtain equations involving only y1 ; : : : ; ys?
To get an idea of what is involved here, we
onsider a simple example.
Example 12.2 Consider the equations x = t , y = t , z = t . We
an think of
4 3 2
equation x = y .
3 4
From the point of view of ideal theory, the elimination problem is equivalent
to looking for polynomials in the ideal hf ; : : : ; ft i involving only the variables
1
y ; : : : ; ys. In Example 12.2, it is easy to
he
k that x z , y x and x y
1
2 2 3 3 4
y z = (y + t )(y t ) (z + zt + t )(z t ).
2 3 3 3 2 2 4 2
If we
ompute the redu
ed Grobner basis of I with respe
t to Lex order with
t > x > y > z , we obtain
ft z; ty z ; tz y; x z ; y z g:
2 2 2 2 3
Noti e that the polynomials here whi h do not involve t are x z and y z . 2 2 3
Sin
e t was
hosen as the largest variable in the ordering, t is eliminated rst in
omputing the Grobner basis.
Theorem 12.3 Let I be an ideal in F [x ; : : : ; xr ; y ; : : : ; ys℄, let G be a Grobner
1 1
basis of I with respe
t to Lex order with xi > yj for all i; j , and let G0 = G \
F [y ; : : : ; ys℄. Then G0 is a Grobner basis of I 0 = I \ F [y ; : : : ; ys℄.
1 1
30
12.3 The ideal interse
tion problem
Problem: Given ideals I = hg ; : : : ; gk i and J = hh ; : : : ; hl i in F [x ; : : : ; xn ℄,
1 1 1
nd a generating set for the ideal I \ J.
We saw in Se
tion 3.3 how to solve this problem in the
ase where I and J
are monomial ideals. The elimination theorem (Theorem 12.3) provides a tri
k
to solve this problem in general, by introdu
ing a dummy variable whi
h we then
eliminate.
Theorem 12.4 Given ideals I and J in P = F [x ; : : : ; xn ℄, let t be a new vari-
1
able. Then I \ J = K \ P , where K is the ideal htI; (1 t)J i in P [t℄.
Proof Given f 2 I \ J , by writing f in the form tf + (1 t)f we see that
f 2 tI +(1 t)J . Hen
e f 2 K \ P . Conversely, if f 2 K \ P , then we
an write
f in the form
Xk l
X
f= tgi pi + (1 t)hj qj ;
i=1 j =1
where I = hg ; : : : ; gk i and J = hh ; : : : ; hl i, and the pi's and qj 's are polynomials
1 1
in P [t℄.
This is a polynomial identity and so it remains true when we spe
ialise the
variables to elements of F . In parti
ular, setting t = 1 we obtain
k
X
f = gi pi (x1 ; : : : ; xn ; 1) 2 I;
i=1
and t = 0 gives
l
X
f = hj qj (x1 ; : : : ; xn ; 0) 2 J;
j =1
so that f 2 I \ J .
This method in fa
t gives a Grobner basis for I \ J , starting from any gener-
ating sets for I and J .
Example 12.5 Find a Grobner basis for I \ J in F [x; y ℄, where I = hx + y ; xy i 2 2
and J = hx y i.
2 2
31
MT30512 Polynomials Semester 2, 2006{07
i1 <i2 <:::<ir
1 +2 +:::+n =r
; ;
, then
x + ax + bx +
= (x )(x )(x
)
3 2
32
Thus the
oeÆ
ients of f (x)
an be regarded as polynomials in the `variables'
; ;
. If we permute the roots in any way, e.g. ex
hange and , we do not
hange the polynomial f (x) and so the polynomials + +
, +
+
,
are also un
hanged. In general, if the moni
polynomial
f (x) = xn + an xn + : : : + a x + a 1
1
1 0
then the relations an r = ( 1)r er hold for 1 r n, and express ea
h
oeÆ
ient
an r of the polynomial f as (plus or minus) a
orresponding elementary symmetri
fun
tion er of its roots x ; : : : ; xn.
1
1 2 +1
by swit
hing the variables xi and xi . Sin
e f is symmetri
, this monomial will
+1
also be a term of f.
Now let g = e e en be a `monomial' in the `variables' e ; e ; : : : ; en.
1 2 n
1 2
Sin
e LM(er ) = x x xr ,
1 2
1 2
= x
1
x 1
1+ 2+ +
xn : n
2
2+ + n n
1
hen
e they are linearly independent over the eld of
oeÆ
ients F .
33
14 Families of Symmetri
Fun
tions
m ; ; = x yz + xy z + xyz ;
(3 1 1)
3 3 3
m ; ; = x y z + x yz + xy z :
(2 2 1)
2 2 2 2 2 2
in the elementary symmetri
fun
tions form another ve
tor spa
e basis for S .
1
Then LM(f e e ) = LM(m ; ; ) = x yz, so the new = (3; 1; 1), the new
2
1 2 (3 1 1)
3
e e = (x + y + z ) xyz = m ; ; + 2m ; ; :
2
1 3
2
(3 1 1) (2 2 1)
f = e e 2(e e 2e e ) 5e e = e e 2e e e e :
2
1 2
2
1 3 2 3 2 3
2
1 2
2
1 3 2 3
34
14.2 Complete symmetri
fun
tions
We shall prove that the
omplete symmetri
fun
tions h ; : : : ; hn provide another 1
set of polynomial generators for the ring of symmetri
fun
tions S . The following
identities provide a re
ursive pro
edure for expressing ea
h hk as a polynomial in
e ; : : : ; en . For this we extend the denition of er by dening e = 1 and er = 0
1 0
for r > n, where n is the number of variables.
Proposition 14.4 For 1 r n and i 0, let hi be the ith
omplete symmetri
fun
tion in the variables x1 ; : : : ; xr and let ei be the ith elementary symmetri
fun
tion in the variables x1 ; : : : ; xn . Then
X
( 1)ieihj = 0; for k > n r:
i+j =k
Note that E (t) is a polynomial in P [t℄, but H (t) is a formal power series in t
(with
oeÆ
ients in P ).
As we noti
ed at the beginning of Se
tion 13, the elementary symmetri
fun
-
tions e ; : : : ; en (with alternating signs) are the
oeÆ
ients of the polynomial with
1
roots x ; : : : ; xn. Hen
e
1
= (1 + x t + x t + : : :)(1 + x t + x t + : : :) (1 + xr t + xr t + : : :)
2 2 2 2 2 2
= (1 1x t) (1 1x t) (1 1x t) :
1 1 2 2
1 2 r
35
Proof We
an solve the system of equations in Proposition 14.4 with r = n to
express ei as a polynomial in h ; : : : ; hi for 1 i n. Hen
e we
an express any
1
polynomial in e ; : : : ; en as a polynomial in h ; : : : ; hn. Hen
e by Theorem 13.2
1 1
every symmetri
polynomial
an be written as a polynomial in h ; : : : ; hn. 1
To prove uniqueness, we use the fa
t that the monomials e in e ; : : : ; en of 1
total degree d in x ; : : : ; xn form an F -ve
tor spa
e basis for the homogeneous
1
symmetri
polynomials S d of total degree d in x ; : : : ; xn. Sin
e we have seen that
1
the monomials h = h hn in S d form a spanning set, and sin
e there are
1 n
the same number of h 's as e 's in S d, the h 's are also a F -ve
tor spa
e basis.
1
14.3 Power sums and Newton's identities
The rth power sum pr = xr + : : : + xrn was introdu
ed in Se
tion 13. For some
purposes, it turns out to be useful to express symmetri
fun
tions as polynomials
1
in the pr 's rather than the er 's or the hr 's. However, sin
e denominators arise
in the
oeÆ
ients, we must work over a eld F of
hara
teristi
zero. First we
obtain relations between the pr 's and the er 's.
Proposition 14.6 (Newton's identities) For r 1,
r
X
rer = ( 1)k 1
pk er k :
k=1
Xn
= (xi + xi t + : : : + xri tr + : : :)
2 1
i=1
n
X
= xi (1 + xi t + : : : + (xi t)r 1
+ : : :)
i=1
n
X xi
= 1 xi t
:
i=1
E 0 ( t) Xn
xi
E (t)
= 1 x t = P ( t) :
i=1 i
Hen
e P (t)E (t) = E 0 (t) = e 2e t + : : : + ( 1)n nen tn . Equating the
1 2
1 1
oeÆ
ients of tr in this equation gives Newton's identities: note that these are
1
en = 0 1 h hn ; hn = 0 1 e en :
. ... . . . . . . ... . ... . . . . . . ...
1 2 1 2
.. ..
0 0 0 h 0 0 0 e
1 1
.. ..
0 0 0 e 0 0 n 1 p
1 1
The equivalen
e with Propositions 14.4 and 14.6
an be seen by using suitable
row or
olumn expansions of the determinants and indu
tion. There are similar
formulae relating the h's and p's (Examples 7, Question 6.) Although these
expli
it formulae are elegant, in pra
ti
e it is easier to use the re
ursive formulae
of Propositions 14.4 and 14.6 dire
tly in
al
ulations.
37
15 Invariants and Coinvariants
Theorem 15.1 For the monomial order DegLex with xn > > x > x , 2 1
H = fh (x ; : : : ; xn ); h (x ; : : : ; xn ); : : : ; hn (x ; x ); hn (x )g
1 1 2 1 1 1 1 2 1
so that the leading monomials are all
oprime. Thus if we
an show that H is
a basis of I , it will follow from our work on Bu
hberger's algorithm that all S -
polynomials of pairs of elements of H redu
e to zero, and hen
e H is a Grobner
basis for S . It is also easy to see that H is redu
ed.
Note that hn(x ) = xn is in I by equation (18), and h (x ; : : : ; xn ) = x +
1 1 1 1
: : : + xn is
learly in I sin
e it is symmetri
. To prove that the other elements of
1
38
By putting k = n r + 1 in (19), we see that hn r (x ; : : : ; xr ) is an ele-
+1 1
ment of the ideal he ; e ; : : : ; en r i. Hen
e hH i I . Conversely, by solving
1 2 +1
the equations (19) re
ursively, we
an express en r as an element of the ideal
+1
hh (x ; : : : ; xn); h (x ; : : : ; xn ); : : : ; hn r (x ; : : : ; xr )i. Sin
e e ; e ; : : : ; en gen-
1 1 2 1 1 +1 1 1 2
erate I , it follows that I hH i. Hen
e hH i = I .
Theorem 15.1 allows us to nd a normal form for any polynomial in P in the
ring of
oinvariants P=I , and to perform
al
ulations in this ring. Thus we obtain
a basis for P=I as a ve
tor spa
e over F , as follows.
Theorem 15.2 (Artin basis for the ring of
oinvariants) The
osets of the
monomials x = x1 1 : : : xn su
h that r n r form a F -ve
tor spa
e basis for
n
Proof Sin
e the leading terms of the elements of the Grobner basis H of
Theorem 15.1 are xn ; xn ; : : : ; xn, a monomial x 2 P is divisible by one of
2
these leading terms if and only if r n r + 1 for some r. Thus the result
1 1
The ring P=I has further interesting stru
ture whi
h we des
ribe brie
y. By
onsidering the
oinvariants of a xed degree d, we obtain a ve
tor spa
e on
whi
h the symmetri
group Sn a
ts by permutation of the variables x ; : : : ; xn. 1
These give important examples of matrix representations of the group Sn:
with ea
h permutation in Sn we asso
iate the matrix whi
h des
ribes its a
tion
on the Artin basis elements. In parti
ular, the a
tion of Sn on the whole ring P=I
gives the regular representation of Sn. These ideas are developed in group
representation theory.
39
16 Alternating Polynomials
..
...
. . . ...
x
n x 1 xn
1 n
2
1 n 1
For the base of indu
tion, let n = 0. Then m = 0 and f and g are
onstant
polynomials, i.e. nonzero elements of R. We have f = a , g = b and f = qg 0 0
where q = b a 2 R sin
e b is invertible in R.
1
0 0
Now let n > 0, and assume as indu
tion hypothesis that the result holds for
0
40
sin
e
learly deg f n and the
oeÆ
ient of xn in f is bm an anbm = 0,
1 1
so that in fa
t deg f < n. Hen
e the indu
tion hypothesis applies to f , to
1 1
give f = q g + r where q ; r 2 R[x℄ and deg r < m or r = 0. Hen
e bm f =
1 1 1 1 1 1 1
an xn m g + f = (an xn m + q )g + r . Hen
e f = qg + r, where q = bm (an xn m + q )
1 1 1
1
1
and r = bm r , and deg r = deg r < m or r = 0, as required.
1
1 1
Uniqueness is proved by the same argument as when R is a eld (exer
ise!).
Theorem 16.2 (Remainder Theorem) Let R be a
ommutative ring with 1
and let a 2 R. Then a polynomial f (x) 2 R[x℄ is divisible by x a if and only if
f (a) = 0, i.e. if and only if a is a zero (or root) of f (x).
Proof By Theorem 16.1 we
an divide an arbitrary polynomial f (x) 2 R[x℄
by the moni
polynomial g(x) = x a.
We now use a standard argument: we have f (x) = q(x)(x a) + r where the
remainder r is
onstant, i.e r 2 R. Substitute x = a to get r = f (a). Thus f (x)
is divisible by x a i f (a) = 0.
As a rst appli
ation, we obtain the well-known fa
torisation of the Vander-
monde determinant n. If xi = xj , then n = 0 sin
e the determinant has two
equal
olumns. By Theorem 16.2 it follows that n is divisible by xi xj for all
i; j with 1 i < j n. Hen
e
Y
n = (xj xi):
i<j n
1
41
It is easy to show that the ee
t of a permutation 2 Sn on an alternating
polynomial f is given by (f ) = f if is even, (f ) = f if is odd. Thus
alternating polynomials, as well as symmetri
polynomials, are invariant under
the a
tion of the alternating group An.
Proposition 16.4 Let f 2 F [x ; : : : ; xn ℄ be invariant under An , i.e. (f ) = f
1
for all 2 An . Then f
an be uniquely expressed as the sum of a symmetri
polynomial and an alternating polynomial.
Proof Clearly a polynomial f of the given form is An-invariant: we prove
the
onverse. Let f be invariant under An, and let be an odd permutation.
Then 2 An, and hen
e (f ) = f . Let g = (f + (f )), h = (f (f )).
2 2 1 1
Sin
e f is An-invariant, so also is (f ), and hen
e so are g and h. Now any
2 2
The set A of all An-invariant polynomials is
losed under addition and mul-
tipli
ation, i.e. it is a subring of F [x ; : : : ; xn℄. Clearly A
ontains the Vander-
1
monde determinant n, and also all symmetri
polynomials. Proposition 16.3
now allows us to give a
omplete des
ription of the ring A.
Theorem 16.5 If
har(F ) 6= 2, then
(i) every An -invariant polynomial f 2 F [x ; : : : ; xn ℄
an be uniquely expressed as
1
f = g + n h, where g and h are symmetri
polynomials, and
(ii) the ring A of all An -invariant polynomials is given by
A
= F [e1; : : : ; en; n℄=h2n Æ (e1 ; : : : ; en )i
where Æ (e1 ; : : : ; en ) is the polynomial obtained by writing n 2
in terms of
e1 ; : : : ; en .
Proof Part (i) follows from Propositions 16.4 and 16.3, noting that the
quotient of the `alternating part' of f by n is a symmetri
polynomial h. Part
(ii) follows from (i) if we observe that sin
e hn Æ (e ; : : : ; en )i is a prin
ipal
2
1
ideal, every element of the quotient ring has a unique representative of the form
g + n h, where g and h are polynomials in e ; : : : ; en . (Re
all that the moni
1
generator of a prin
ipal ideal is the redu
ed Grobner basis, for any monomial
ordering.)
42
16.2 The dis
riminant of a polynomial
We
an apply these ideas to obtain a ne
essary and suÆ
ient
ondition for a
polynomial in one variable to have equal roots. Thus let f (x) = xn + an xn + 1
1
formidable beast:
= a b 4a
4b + 18ab
27
:
2 2 2 3 3 2
This takes a more familiar form if we assume a = 0 (we
an always redu
e to this
ase by a linear
hange of variable). Thus the equation x + bx +
= 0 has at least
3
43