Abstract Algebra
Abstract Algebra
Samir Siksek
Mathematics Institute
University of Warwick
Contents
Chapter I. Prologue 1
I.1. Who Am I? 1
I.2. A Jolly Good Read! 1
I.3. Proofs 2
I.4. Acknowledgements and Corrections 2
Chapter X. Isomorphisms 67
Prologue
I.1. Who Am I?
I Samir Siksek have the immense pleasure of introducing you to three
heroes of abstract algebra: groups, rings and fields. I am not an alge-
braist, but I have nothing but love, admiration and enthusiasm for the
subject. Some of my best friends are algebraists.
three tips Abstract algebra is incredibly useful, but to get any benefit from it you
need to develop three essential habits:
(i) Study as many different examples as you can. The examples are
as important as the theorems and definitions. There is abso-
lutely no use in knowing the definition of a group if you’re not
familiar with the standard examples.
(2) Do calculations. Use calculations with matrices, permutations,
symmetries, etc. to test your ideas. Calculations will lead you to
counterexamples that can correct any erroneous ideas that you
have. But also with practice, you will find that calculations often
contain the germ of the proof you’re looking for.
(c) Think geometrically and draw pictures. The true meaning of
most mathematical concepts is geometric. If you spend all your
time manipulating symbols (i.e. doing algebra) without under-
standing the relation to the geometric meaning, then you will
have very little in terms of mathematical insight.
The three habits will not only help you learn the subject and apply it, you
will develop great mathematical taste.
1
2 I. PROLOGUE
I.3. Proofs
When I was a student I found it very hard to follow proofs in books and
lectures. So when I read a theorem, I would put down the book and try out
a few examples. After that I would try to prove the theorem myself. After I
finished (or if I failed) I would look at the proof in the book and compare.
I heartily recommend this strategy. You’ll gain a great understanding of
the subject. You’ll also get really good practice for the exam, where you
may asked to prove statements that you haven’t seen before.
Algebraic Reorientation
II.1. Sets
Sets are a basic notation for most of modern pure mathematics, but
life is too short to spend too much time on them. A set is simply a col-
lection of objects. We use curly brackets to denote sets. For example, if I
write
A = {2, 5, 13},
then I’m saying that the set A consists of the elements 2, 5, 13. This is
one way of specifying a set; we simply list all its elements between curly
brackets. The notation x ∈ S means x is a member of the set S and the
notation x ∉ S means x is not a member of the set S. For the set A above,
we know 13 ∈ A but 11 ∉ A.
We can also specify some infinite sets in this fashion; for example, the
set of all integers
Z = {. . . , −3, −2, −1, 0, 1, 2, 3, . . . }.
This is absolutely standard notation: when you see Z, you’re expected to
know that it’s the set of integers. The set of natural numbers is
N = {0, 1, 2, 3, 4 . . . }.
Again this is standard notation (but not all mathematicians include 0 in
the natural numbers).
Here is an example of another way of specifying a set:
B = {x ∈ Z : x 2 = 16}.
This is saying that B is the set of all integers x satisfying the equation
x 2 = 16. Of course, another way of specifying the same set would be to
write B = {−4, 4}. If we write
C = {x ∈ N : x 2 = 16},
then C = {4}.
If we write
D = {u ∈ Z : u 3 = 2},
then D is the set of integers u satisfying u 3 = 2. There are no integers
satisfying this equation, so D is the empty set. We denote the empty set
by ;, so we can write D = ;. Here are a couple more examples of empty
sets:
{w ∈ N : w ≤ −1} = ;, {v ∈ Z : 3.01 ≤ v ≤ 3.99} = ;.
3
4 II. ALGEBRAIC REORIENTATION
To get more practice with this notation, observe that another way of
specifying the natural numbers is to write
N = {x ∈ Z : x ≥ 0}.
Yet another correct—although admittedly stupid way—is to write
N = {x ∈ Z : x ≥ −0.5}.
Here are some other sets that you’re meant to know:
(1) Q is the set of rational numbers. We can write this as
na o
Q= : a, b ∈ Z, b 6= 0 .
b
Examples of elements of Q are 0, 5, −7/11, 3/2, 6/4 (the last two
being pthe same element). From Foundations you should know
that
p 2 is irrational. You can write this statement in set notation:
2 ∉ Q. Other examples of irrational numbers are e and π.
(2) R is the set of real numbers. It isn’t possible to write R in straight-
forward way as for the sets above, but you can think of the ele-
ments of R as pointsp on the real line. Examples of elements of R
are −7, 3/5, 3.85, 7, (π + 1)/2, sin 5.
(3) C is the set of complex numbers. You have seen complex num-
bers in your Further Mathematics A-Level. Recall that i is a sym-
bol that satisfies i 2 = −1. We can write the set of complex num-
bers as
C = { a + bi : a, b ∈ R }.
While we’re on the subject of notation, compare the following two state- Tip
ments:
• Some positive real numbers are irrational.
• ∃x ∈ R s.t. x > 0 ∧ x ∉ Q.
The two statements say exactly the same thing. A professional mathe-
matician prefers the first, and an amateur prefers the second. Use only as
much mathematical notation as needed to make your ideas transparent
and precise, but no more 1.
R∗ = { x ∈ R : x 6= 0 }.
Rn = {(x 1 , x 2 , . . . , x n ) : x 1 , x 2 , . . . , x n ∈ R}.
Thus R2 is the set of vectors in the plane, and R3 is the set of vectors
in 3-space. Addition is a binary operation on Rn , and so is subtraction.
What about multiplication by a scalar? If λ is a scalar (i.e. λ ∈ R) and
x = (x 1 , x 2 , . . . , x n ) ∈ Rn is a vector, we define
λx = (λx 1 , λx 2 , . . . , λx n ).
x · y = x1 y 1 + x2 y 2 + · · · + xn y n .
Notice that the result is in R, not Rn , so the dot product is not a binary
operation. The cross product is defined on R3 only. If x, y ∈ R3 the x × y is
again in R3 . So the cross product is a binary operation on R3 .
f : S1 → S2, g : S2 → S3.
and
g : R → R, g (x) = 3x + 2.
Then
Then f ◦ (g ◦ h) = ( f ◦ g ) ◦ h.
Also
(( f ◦ g ) ◦ h)(x) = (` ◦ h)(x) = `(h(x)) = f (g (h(x))).
So f ◦ (g ◦ h) = ( f ◦ g ) ◦ h.
II.7. COMMUTATIVITY AND ASSOCIATIVITY 7
Example II.8. Are there binary operations that are commutative but not
associative? Yes but it isn’t easy to come up with ‘natural’ examples. How-
ever it is easy to invent a finite set and a composition table that is commu-
tative but not associative. Let S = {a, b, c}. Let ◦ be the binary operation
on S with the following composition table:
◦ a b c
a b c a
b c c a
c a a c
Note that ◦ is commutative; you can see this by noting that the table
is symmetric about the diagonal from the top left corner to the bottom
right corner. But it isn’t associative. For example,
(b ◦ c) ◦ a = a ◦ a = b, b ◦ (c ◦ a) = b ◦ a = c.
♦
Exercise II.9. In the following, is ◦ a binary operation on A? If so, is it
commutative? Is it associative? In each case justify your answer.
(a) A = R is the set of real numbers and a ◦ b = a/b.
(b) A = {1, 2, 3, 4, . . . } is the set of positive integers and a ◦ b = a b .
II.8. WHERE ARE THE PROOFS? 9
You almost certainly met matrices during A-Levels, and you’ll see them
again in Linear Algebra. In any case you need to know about matrices for
this module. In this chapter I summarize what you need to know. We’ll
not cover this chapter in the lectures; I expect you to read it on your own.
Even if you think you know all about matrices I advise you to read this
chapter: do you know why matrix multiplication is defined the way it is?
Do you know why matrix multiplication is associative?
A, B , C are matrices. The matrix A has size 2×3 because it has 2 rows and
3 columns. Likewise B has size 3 × 2 and C has size 3 × 3. ♦
Displaying a matrix A by writing
a 11 a 12 a 13 . . . a 1n
a 21 a 22 a 23 . . . a 2n
A = .. .. .
.. ..
. . . .
a m1 a m2 a m3 . . . a mn
♦
Definition. The zero matrix of size m × n is the unique m × n matrix
whose entries are all 0. This is denoted by 0m×n , or simply 0 if no con-
fusion is feared.
Definition. Let A = (a i j )m×n and B = (b i j )n×p . We define the product
AB to be the matrix C = (c i j )m×p such that
ci j = ai 1 b1 j + ai 2 b2 j + ai 3 b3 j + · · · + ai n bn j .
Note the following points:
• For the product AB to be defined we demand that the number
of columns of A is equal to the number of rows of B .
• The i j -th element of AB is obtained by taking the dot product of
the i -th row of A with the j -th column of B .
Example III.5. Let
µ ¶ µ ¶
1 2 5 −3
A= , B= .
−1 3 0 −2
Both A and B are 2 × 2. From the definition we know that A × B will be a
2 × 2 matrix. We see that
µ ¶ µ ¶
1×5+2×0 1 × −3 + 2 × −2 5 −7
AB = = .
−1 × 5 + 3 × 0 −1 × −3 + 3 × −2 −5 −3
Likewise
µ ¶ µ ¶
5 × 1 + −3 × −1 5 × 2 − 3 × 3 8 1
BA= = .
0 × 1 − 2 × −1 0 × 2 + −2 × 3 2 −6
We make a very important observation: AB 6= B A in this example. So
matrix multiplication is not commutative. ♦
Example III.6. Let A be as in the previous example, and let
µ ¶
2 1 3
C= .
3 −4 0
Then µ ¶
8 −7 3
AC = .
7 −13 −3
However, C A is not defined because the number of columns of C is not
equal to the number of rows of A. ♦
Remark. If m 6= n, then we can’t multiply two matrices in M m×n (R). How-
ever, matrix multiplication is defined on M n×n (R) and the result is again
in M n×n (R). In other words, multiplication is a binary operation on M n×n (R).
Exercise III.7. Commutativity—What can go wrong?
• Give a pair of matrices A, B , such that AB is defined but B A isn’t.
• Give a pair of matrices A, B , such that both AB and B A are de-
fined but they have different sizes.
16 III. MATRICES—READ ON YOUR OWN
1Algebraists have many idiosyncrasies that distinguish them from other mathe-
maticians. I find most of these bewildering. However, they do have a very good point
in the way they write functions. We said that B A means do A first and then B , because
(B A)u = B (Au). However if you use row vectors then u(B A) = (uB )A, so B A means do B
first then A, which seems entirely natural.
III.6. MULTIPLICATIVE IDENTITY AND MULTIPLICATIVE INVERSE 21
y
projection onto the y-axis
L
S1 S2 S3
x
P ROOF. The proof is mostly left as an exercise for the reader. Parts (a), (b)
follow from the definition and effortless calculations (make sure you do
them). For (c) note that
det(A −1 A) = det(I 2 ) = 1.
Now applying (ii) we have 1 = det(A −1 ) det(A). We see that det(A) 6= 0 and
det(A −1 ) = 1/ det(A).
µ ¶ µ ¶
a b
Exercise III.14. Suppose u = and v = are non-zero vectors, and
c d
µ ¶
a b
let A be the matrix with columns u and v; i.e. A = . Show (al-
c d
gebraically) that det(A) = 0 if and only if u, v are parallel. Explain this
geometrically.
26 III. MATRICES—READ ON YOUR OWN
III.7. Rotations
We saw above some examples of transformations in the plane: reflec-
tion, stretching, projection. In this
µ ¶section we take a closer look at rota-
x
tions about the origin. Let P = be a point in R2 . Suppose that this
y
point is rotated anticlockwise about the µorigin
0
¶ through an angle of θ. We
x
want to write down the new point P 0 = 0 in terms of x, y and θ. The
y
easiest way to do this to use polar coordinates. Let the distance of P from
−−→
the origin O be r and let the angle OP makes with the positive x-axis be
φ; in other words the polar coordinates for P are (r, φ). Thus
x = r cos φ, y = r sin φ.
Since we rotated P anticlockwise about the origin through an angle θ to
obtain P 0 , the polar coordinates for P 0 are (r, φ + θ). Thus
x 0 = r cos(φ + θ), y 0 = r sin(φ + θ).
We expand cos(φ + θ) to obtain
x 0 = r cos(φ + θ)
= r cos φ cos θ − r sin φ sin θ
= x cos θ − y sin θ.
Similarly
y 0 = x sin θ + y cos θ.
We can rewrite the two relations
x 0 = x cos θ − y sin θ, y 0 = x sin θ + y cos θ,
in matrix notation as follows
cos θ − sin θ x
µ 0¶ µ ¶µ ¶
x
= .
y0 sin θ cos θ y
Thus anticlockwise rotation about the origin through an angle θ can be
achieved by multiplying by the matrix 1
cos θ − sin θ
µ ¶
Rθ = .
sin θ cos θ
cos π4 − sin π4
x ¶
µ ¶µ ¶
1Is this clever . . . or lame: x
µ
= ?
sin π4 cos π4 y
y
CHAPTER IV
Groups
σ1
σ2 σ0
2 1
σ3
O
3 4
D 4 = {ρ 0 , ρ 1 , ρ 2 , ρ 3 , σ0 , σ1 , σ2 , σ3 }.
It is not worth your while to check every entry in the table, but make
sure you check four or five entries at random to get an idea of how to
compose symmetries, and let me know if I’ve made any mistakes!
I want to convince you that (D 4 , ◦) is a group. The first thing we should
ask about is closure. This is clear from the table; when you compose two
elements of D 4 you get an element of D 4 . Let’s skip associativity for now
and talk about the existence of the identity element. For this we don’t
need the table. It is clear that ρ 0 (=do nothing) is an identity element.
It is also (geometrically) clear that every element has an inverse which
does belong to D 4 . If you reflect twice in the same line you end up where
you started, so σi ◦ σi = ρ 0 ; in other words, σi is its own inverse for i =
0, 1, 2, 3. The inverse of an anticlockwise rotation around O by 90◦ is an
anticlockwise rotation around O by 270◦ . We find that the inverses of ρ 0 ,
ρ 1 , ρ 2 and ρ 3 respectively are ρ 0 , ρ 3 , ρ 2 and ρ 1 .
What’s left is to prove associativity. So we have to prove that ( f ◦g )◦h =
f ◦(g ◦h) for all f , g , h ∈ D 4 . But there are 512 triples f , g , h, so that’s a lot
of checking. Is there a clever way? Yes there is, and it relies on thinking
about the elements of D 4 as functions 1. Remember that we labelled the
vertices of the square with 1, 2, 3, 4 as in Figure IV.1. Now ρ 1 takes vertex
1 to where vertex 2 was and vertex 2 to where vertex 3 was and vertex 3 to
where vertex 4 was and vertex 4 to where vertex 1 was. We can think of ρ 1
as a function from {1, 2, 3, 4} to itself as in Figure IV.2.
ρ1 σ0
1 1 1 1
2 2 2 2
3 3 3 3
4 4 4 4
In fact, this way every element of D 4 can be thought of as a function a key trick
from {1, 2, 3, 4} to itself. If we think like this, ◦ simply becomes composi-
tion of functions from D 4 to itself. We know that composition of func-
tions is associative by Lemma II.2. Thus the binary operation ◦ on D 4 is
1You’ve seen this idea before: in the proof of Theorem III.9 we showed that matrix
multiplication is associative by thinking of matrices as functions and matrix multiplica-
tion as composition of functions.
IV.4. SYMMETRIES OF A SQUARE 33
associative. We have now checked all the conditions (i)–(iv) for a group,
so (D 4 , ◦) is a group!
Remarks.
Moral • Notice that by changing the way we looked at the elements of D 4 ,
we saved ourselves from excruciatingly checking 512 laborious
cases. This is a recurring theme in algebra, where a conceptual
outlook saves you from much trouble.
• You might have found our definition of composition in D 4 strange:
α◦β means apply β first then α. The reason for this choice is that
we want to sometimes think of the elements of D 4 as functions,
and when we do that we want composition in D 4 to agree with
the usual composition of functions. Recall that f ◦g means apply
g first then f .
non-abelian group • D 4 is our first example of a non-abelian group. To check that it
isn’t abelian all we have to do is give a pair of symmetries that
don’t commute. For example,
σ0 ◦ ρ 1 = σ3 , ρ 1 ◦ σ0 = σ1 .
IV.4.1. Subgroups of D 4 . The set D 4 contains rotations and reflec-
tions. Let us now look at the rotations on their own and the reflections on
their own:
R = {ρ 0 , ρ 1 , ρ 2 , ρ 3 }, S = {σ0 , σ1 , σ2 , σ3 }.
For now let us look at the part of the composition table that involves only
rotations:
◦ ρ0 ρ1 ρ2 ρ3
ρ0 ρ0 ρ1 ρ2 ρ3
ρ1 ρ1 ρ2 ρ3 ρ0
ρ2 ρ2 ρ3 ρ0 ρ1
ρ3 ρ3 ρ0 ρ1 ρ2
Notice from the table that if we compose two rotations we obtain a ro-
tation. We didn’t really need the table for this; it’s geometrically obvious.
Thus ◦ is a binary operation on R (as well as being a binary operation
on D 4 ). We can ask whether (R, ◦) is a group, and it is easy to see that
the answer is yes (with the same reasoning as before). We have an inter-
esting phenomenon, which is a group (R, ◦) contained in another group
(D 4 , ◦). We say that (R, ◦) is a subgroup of (D 4 , ◦). We will discuss sub-
groups at length later. It is also interesting to note that (R, ◦) is abelian.
An algebraic way of seeing the (R, ◦) is abelian is to note that its compo-
sition table is symmetric about the leading diagonal. But you should also
see geometrically that if you compose two rotations (centred at the same
point) then the order does not matter. So (R, ◦) is an abelian subgroup of
the non-abelian group (D 4 , ◦).
What about (S, ◦)? Do the reflections of the square form a group? By
looking at the composition table the first thing we notice is that S is not
34 IV. GROUPS
closed under composition. So (S, ◦) is not a group. Are there any other
subgroups inside (D 4 , ◦) besides (R, ◦)? Yes. See Figure IV.3 for a complete
list.
(D 4 , ◦)
({ρ 0 }, ◦)
Again, check that a couple of these are subgroups. Don’t waste time
checking there aren’t other subgroups of (D 4 , ◦); when you know a lot
more about groups and subgroups you can come back to this question,
but even then it will still be a little tedious!
Exercise IV.13. In this exercise you will write out the composition table
for the group D 3 which is the group of symmetries of an equilateral tri-
angle. Sketch an equilateral triangle and label the vertices 1, 2, 3 in an-
ticlockwise order. Label the centre of the triangle with O. Let ρ 0 , ρ 1 , ρ 2
denote anticlockwise rotations about O through angles 0, 2π/3 and 4π/3.
Let σ1 , σ2 , σ3 denote reflections about the lines respectively joining ver-
tices 1, 2, 3 to O. Let
D 3 = {ρ 0 , ρ 1 , ρ 2 , σ1 , σ2 , σ3 }.
Write down a composition table for D 3 and explain why it is a group 1. Is
it abelian? It has six subgroups; write them down.
Exercise IV.14. Write down the symmetries of a triangle that is isoceles
but not equilateral and a composition table for them. Do they form a
group?
First Theorems
P ROOF. Suppose that e and e 0 are identity elements. Thus, for all a ∈ G
we have
(V.15) a ◦ e = e ◦ a = a,
and
(V.16) a ◦ e 0 = e 0 ◦ a = a.
e ◦ e 0 = e.
e ◦ e 0 = e 0.
P ROOF. Our proof follows the same pattern as the proof of Theorem V.1,
and you’ll see this pattern again and again during your undergraduate
key trick! career. Almost all uniqueness proofs follow the same pattern: suppose
that there are two of the thing that we want to prove unique; show that
these two must be equal; therefore it is unique.
For our proof we suppose that b and c are both inverses of a. We want
to show that b = c. By definition of inverse (property (iv) in the definition
of a group) we know that
a ◦ b = b ◦ a = e, a ◦ c = c ◦ a = e,
35
36 V. FIRST THEOREMS
Exercise VI.4. Write down the addition and multiplication tables for Z/4Z
and Z/5Z.
VI.2. CONGRUENCE CLASSES 41
+ 0 1 2 3 4 5 × 0 1 2 3 4 5
0 0 1 2 3 4 5 0 0 0 0 0 0 0
1 1 2 3 4 5 0 1 0 1 2 3 4 5
2 2 3 4 5 0 1 2 0 2 4 0 2 4
3 3 4 5 0 1 2 3 0 3 0 3 0 3
4 4 5 0 1 2 3 4 0 4 2 0 4 2
5 5 0 1 2 3 4 5 0 5 4 3 2 1
TABLE VI.1. The addition and multiplication tables for Z/6Z.
1Perhaps you prefer the inverse of a where 0 ≤ a < m to be of the form b where b
also satisfies 0 ≤ b < m. In this case, if 0 < a < m, then observe that 0 < m − a < m, and
a + m − a = 0, since a + (m − a) ≡ 0 (mod m). Moreover −0 ≡ 0 (mod m), thus −0 = 0.
CHAPTER VII
. . . , a −4 , a −3 , a −2 , a −1 , 1, a, a 2 , a 3 , a 4 , a 5 , . . . .
. . . , 1, a, a 2 , a 3 , 1, a, a 2 , a 3 , 1, . . . .
i
ζ
1 −1 1
ζ2
−i
Exercise VII.4. Write down and sketch the sixth roots of unity. What are
their orders? Repeat with the eighth roots of unity.
Exercise VII.5. C∗ has plenty of elements of infinite order. Write down a
few.
Exercise VII.6. Let G = GL2 (R). Show that
µ ¶ µ ¶
0 1 1 1
A= , B=
−1 0 0 1
belong to G. Determine their orders.
Whilst reading the above examples and working out your own, the fol-
lowing observations will have dawned on you (given here in multiplica-
tive notation).
Lemma VII.7. Let G be a group and g be an element of G.
(i) g has order 1 if and only if g is the identity element.
(ii) Let m be a non-zero integer. Then g m = 1 if and only if g has
finite order d with d | m.
VII.1. THE ORDER OF AN ELEMENT 45
m = qd + r, q, r ∈ Z and 0 ≤ r < d .
Subgroups
Excruciating pain It will be a long time before you come to appreciate and enjoy groups.
precedes orgasmic Abstract algebra goes from being mind-numbingly boring to being an ac-
pleasure! quired taste and then an exhilarating experience and finally—if you’re not
careful—a hopeless addiction. We’re still in the mind-numbingly boring
part of the journey; you should see this part as an initiation rite that can’t
be skipped.
Example VIII.6. Let’s take G = R∗ and H the subset of positive real num-
bers:
H = {a ∈ R∗ : a > 0}.
Let’s show that H is a subgroup of G. First, 1 is positive, so 1 ∈ H . Hence
condition (a) is satisfied.
To check (b), suppose that a, b are in H . Thus a and b are positive,
and so their product ab is also positive. Hence ab ∈ H and we know that
(b) is satisfied.
Finally, we want to check condition (c). Suppose a is an element of H .
Then a is positive, and so a −1 is positive. Hence a −1 is also an element of
H . It follows that condition (c) is satisfied.
By Theorem VIII.5, H is a subgroup of R∗ . ♦
Example VIII.7. Let
2Z = {2a : a ∈ Z} = {. . . , −6, −4, −2, 0, 2, 4, 6, . . . }.
In other words, 2Z is the set of even integers. Now 2Z is a subset of Z,
but is it a subgroup of Z? We should check the three conditions in the
theorem, where G = Z and H = 2Z. Condition (a) is “1 ∈ H ”. What does
that mean in our context? 1 is not the number 1. The 1 in the theorem is
the identity element for the group operation on Z. The group operation
on Z is addition. The identity element is 0. As 0 is an even number (after
all 0 = 2 × 0) we have 0 ∈ 2Z. Thus condition (a) is satisfied.
Let’s move on to condition (b). This says “if a, b ∈ H then ab ∈ H ”.
Again ab doesn’t always mean the product of a and b; it is shorthand for
a ◦ b where ◦ is the binary operation on G. Here G = Z and the binary
operation on Z is +. So to check (b) what we must check is the following
“if a, b ∈ 2Z then a + b ∈ 2Z”. In words this just says “the sum of two even
integers is even”, which is true so (b) holds.
Finally we have to interpret (c) in our context. Here a −1 is the inverse
of a with respect to addition, so it just means −a. Thus to check (c) we
want to check that “if a is an even integer then −a is also even”. Again this
is true, so (c) holds.
It follows from Theorem VIII.5 that 2Z is a subgroup of Z.
By contrast, the set of odd integers
{. . . , −5, −3, −1, 1, 3, 5, . . . }
is not a subgroup of Z. For example, it does not contain the identity ele-
ment 0, so does not satisfy (a). ♦
Example VIII.8. In Subsection IV.4.1, we listed the ten subgroups of D 4 .
Go back to that list, and use Theorem VIII.5 to verify that a couple of them
are indeed subgroups. ♦
Example VIII.9. Let
V = {(a, a) : a ∈ R}.
VIII.2. CRITERION FOR A SUBGROUP 49
y
V
y W
(1, 1)
(−1, −1)
(1, 1) and (−1, 1) are in U but their sum (0, 2) is not in U . So U does not
satisfy (b), and is therefore not a subgroup of R2 . See Figure VIII.3.
On the other hand, the intersection V ∩ V 0 = {(0, 0)} is a subgroup of
R2 . ♦
y
0
V (0, 2) V
(−1, 1) (1, 1)
y
C
(a, a3 )
(−a, −a3 )
(VIII.17) Π : n · (x − u) = 0.
Here n is any (non-zero) vector normal to the plane, and u is the position
vector of any point on the plane.
The plane Π in (VIII.17) defines a set
VΠ = {x ∈ R3 : n · (x − u) = 0}.
VΠ = {x ∈ R3 : n · x = 0}.
52 VIII. SUBGROUPS
Q x−u
P
u x
i
S
1 −1 1
−i
Sr = {α ∈ C∗ : |α| = r }.
P ROOF OF T HEOREM VIII.5. The theorem has an “if and only if” state-
ment. It usual when proving an “if and only if” statement to break it up if and only if
into an “if” part, and an “only if” part, and prove each part separately.
This is what we will do here. The “if” part says: “if H is a subset of G that
satisfies (a),(b),(c) then it is a subgroup of G”. The “only if” part says: “if
H is a subgroup of G then H satisfies (a), (b), (c)”.
Let us do the “if” part of the proof first. We have a group G and a
subset H of G. All we have been told is that H satisfies conditions (a), (b),
(c) in the statement of the theorem. We want to show that H is a group,
where the binary operation on H is the same as the binary operation on
G. This means that we have to show that H satisfies properties (i), (ii),
(iii), (iv) in the definition of a group.
Property (i) is ‘closure’: we want that if a, b ∈ H then ab ∈ H . But this
is what (b) is saying. So (i) is satisfied.
Property (ii) is associativity. We want to show that for all a, b, c ∈ H ,
we have (ab)c = a(bc). But if a, b, c are elements of H then they are also
addition that make the set of real numbers into a group. But if I wanted anything other
than the usual or obvious operation I’d have told you so.
1When answering a maths question, you should always be careful about what is
being asked. Here you’re being asked to show two things. The first is that the three listed
sets are indeed subgroups. The second is that there aren’t any other subgroups.
VIII.3. ROOTS OF UNITY 55
Still hurting?
CHAPTER IX
〈g 〉 = {g n : n ∈ Z} = {. . . , g −2 , g −1 , 1, g , g 2 , g 3 , . . . }.
Then 〈g 〉 is a subgroup of G.
P ROOF. This is very easy to prove using Theorem VIII.5. Have a go!
ζ0 = 1, ζ, ζ2 , . . . , ζn−1 .
h 4 = h, h 5 = h 2 , h 6 = 1, h 7 = h, . . . .
h −2 = h, h −3 = 1, h −4 = h 2 , h −5 = h, . . . .
g 2 = ζ10 = ζ6 ζ4 = ζ4 .
g 〈g 〉
1 {1}
ζ {1, ζ, ζ2 , ζ3 , ζ4 , ζ5 }
ζ2 {1, ζ2 , ζ4 }
ζ3 {1, ζ3 }
ζ4 {1, ζ2 , ζ4 }
ζ5 {1, ζ, ζ2 , ζ3 , ζ4 , ζ5 }
TABLE IX.1. The six elements of U6 and the cyclic sub-
groups they generate.
Example IX.3. For each element of the group Z/mZ, we write down the
cyclic group it generates. Note that since Z/mZ is an additive group, the
subgroup generated by g is 〈g 〉 = {ng : n ∈ Z}. That is, it is the set of
multiples of g rather than the set of powers of g . See Table IX.2. ♦
a 〈a〉
0 {0}
1 {0, 1, 2, 3, 4, 5}
2 {0, 2, 4}
3 {0, 3}
4 {0, 2, 4}
5 {0, 1, 2, 3, 4, 5}
TABLE IX.2. The six elements of Z/6Z and the cyclic sub-
groups that they generate.
Whilst working through the above examples, you will have noticed a
pattern about 〈g 〉, which we state in the following theorem.
Theorem IX.6. Let G be a group and let g be an element of finite order n.
Then
〈g 〉 = {1, g , g 2 , . . . , g n−1 }.
In particular, the order of the subgroup 〈g 〉 is equal to the order of g .
P ROOF. Observe that 〈g 〉 is a set, and {1, g , . . . , g n−1 } is a set. We want to
a fundamental show that these sets are the same. Whenever you have two sets, A and
principle B , and you want to prove that they’re equal, one way to do this is to show
that every element of A belongs to B and every element of B belongs to A.
You will see this principle again and again throughout your undergradu-
ate career.
Let’s apply this principle in our situation. By definition,
〈g 〉 = {g n : n ∈ Z} = {. . . , g −2 , g −1 , 1, g , g 2 , g 3 , . . . }.
That is 〈g 〉 is the set of all powers of g . It is obvious that every element
of {1, g , . . . , g n−1 } belongs to 〈g 〉. What about the other way round. Sup-
pose that h is an element of 〈g 〉. We want to show that h is an element
of {1, g , . . . , g n−1 }. We can write h = g m where m is an integer (positive or
negative). We want to show that h = g r where r is one of 0, 1, 2, . . . , n − 1.
The division For this we will use the division algorithm which you met in Foundations.
algorithm is one of We can write
the most powerful m = qn + r, q, r ∈ Z, 0 ≤ r < n.
ideas in algebra. Here we simply divided m by n; the integers q, r are respectively the quo-
tient and the remainder. Thus
h = g m = g qn+r = (g n )q · g r .
However, g n = 1 since g has order n. So h = g r . Since 0 ≤ r < n, we
see that r is one of 0, 1, . . . , n − 1. Therefore h is in {1, g , . . . , g n−1 }. By our
principle, we see that 〈g 〉 = {1, g , . . . , g n−1 }.
Exercise IX.7. In each of the following groups G, write down the cyclic
subgroup generated by g .
(a) G = S, g = exp(2πi /7).
(b) G = Z/12Z, g = 8.
¡ 0 1¢
(c) G = GL2 (R), g = −1 0 .
64 IX. CYCLIC GROUPS AND CYCLIC SUBGROUPS
Exercise IX.8. Which of the following groups G are cyclic? Justify your
answer for each, and if G is cyclic then write down a generator.
IX.2. Subgroups of Z
Trust me, I’m a I’m feeling particularly inarticulate at the moment, so I can’t explain
doctor. why subgroups of Z are important. But nevertheless they are important
and so we’ll do them.
The first thing to note about Z is that it is cyclic. Does that mean
that all elements of Z are powers of some element. No, because it is an
additive group. If G is an additive group, and g is an element of G then
〈g 〉 = {ng : n ∈ Z} = {. . . , −2g , −g , 0, g , 2g , 3g , . . . }.
Thus Z = 〈1〉 and so it is cyclic. In fact, it is infinite and cyclic, unlike for
example, Un .
Lemma IX.16. Let k be an integer. Write
kZ = {ka : a ∈ Z}.
Then kZ is a subgroup of Z.
P ROOF. You can prove this in a similar way to Example VIII.7. However,
it is quicker to note that kZ = 〈k〉, and so is a subgroup by Theorem IX.1.
Note that 0Z = {0} has only the identity element. Also
(−k)Z = {. . . , −2(−k), −(−k), 0, −k, 2(−k), . . . }
= {. . . , 2k, k, 0, −k, −2k, . . . }
= {. . . , −2k, −k, 0, k, 2k, . . . }
= kZ
because the order of elements in a set does not matter. In other words,
−Z = Z, −2Z = 2Z, −3Z = 3Z, . . . .
So we have an infinite list of subgroups
{0}, Z, 2Z, 3Z, 4Z, . . .
and we want to know if they’re all the subgroups of Z. The following the-
orem tells us that they are.
Theorem IX.17. Any subgroup of Z has the form kZ for some non-negative 1
integer k. In particular, all subgroups of Z are cyclic.
P ROOF. Let H be a subgroup of Z. We want to show that there is a non-
negative integer k such that H = kZ. We divide the proof into two cases.
The first case is when H is the subgroup {0}. Then H = 0Z and we’ve done
what we wanted.
So let’s look at the second case where H has non-zero elements. If a is
a non-zero element of H then because H is a(n additive) group, −a is also
a non-zero element of H but it has a different sign. So we know for sure
1The non-negative integers are 0, 1, 2, 3, . . . .
66 IX. CYCLIC GROUPS AND CYCLIC SUBGROUPS
that H has some positive elements. Let k be the smallest positive element
of H . We will prove that H = kZ.
Whenever you have two sets, A and B , and you want to prove that Is this familiar?
they’re equal, one way to do it is to show that every element of A belongs
to B and every element of B belongs to A.
As k is in H , we know by Theorem V.9 that all the multiples of k be-
long to H . Thus every element of kZ belongs to H . We must show the
converse: every element of H is a multiple of k.
Let a be an element of H . By the division algorithm which you have
met in Foundations, we can write
a = qk + r, q, r are integers and 0 ≤ r < k.
To remind: here q is the quotient of dividing a by k and r is the remainder.
Now a is in H ; qk is in H because it is a multiple of k ∈ H . So r = a − qk punchline
is also in H . But 0 ≤ r < k, and k is the smallest positive element of H . If
r > 0 then it would be an even smaller positive element of H giving us a
contradiction. So r = 0. Hence a = qk is a multiple of k.
Thus we’ve also shown that every element of H is a multiple of k and
so belongs to kZ. Hence H = kZ, as required.
Exercise IX.18. The subgroups of Z2 are harder to describe. Write down
a few.
Thrilled? Aren’t you glad you trusted me?
CHAPTER X
Isomorphisms
You must’ve noticed that there’s a lot in common between the group
of m-th roots of unity Um , and the group Z/mZ. If not, take another look
Tables IX.1 and IX.2. In fact the groups Um and Z/mZ are isomorphic.
What does this mean?
Definition. Let (G, ◦) and (H , ∗) be groups. We say that the function φ :
G → H is an isomorphism if it is a bijection and it satisfies
φ(g 1 ◦ g 2 ) = φ(g 1 ) ∗ φ(g 2 )
for all g 1 , g 2 in G. In this case we say that (G, ◦) and (H , ∗) are isomorphic.
Isomorphic groups may look different, but in essence are the same.
An isomorphism is a way of relabeling the elements of one group to ob-
tain another group, as the following examples will make clear.
Example X.1. Define φ : Z/mZ → Um by the simple rule
φ(a) = ζa , a = 0, 1, . . . , m − 1.
Then φ is a bijection and satisfies the magical property
e i (θ+φ) = e i θ e i φ .
The reason is because multiplying a complex number by e i θ rotates it
about the origin anticlockwise through the angle θ (prove this using the
exponential form for complex numbers).
Now that you know that R θ and e i θ are analogues, you will expect that
the groups SO2 (R) and S are isomorphic. Recall that SO2 (R) is the special
67
68 X. ISOMORPHISMS
Cosets
1.5S
0.5S
1
Summary: S is the circle centred at the origin of radius 1, and its cosets
in C∗ are the circles centred at the origin (of positive radius). ♦
Example XI.7. In Exercise VIII.16 I asked you the following question: which
lines in R2 define a subgroup? Let’s go back to this question and answer
it again, and this time for lines that define a subgroup we want to deter-
mine the cosets too.
One convenient way of a specifying a line L in R2 is as follows. Let Q
be a point on L, with position vector w. Let v be a vector parallel to L.
Then L has the parametric form
L : x = w + t v.
This is a (slightly clumsy) school way of saying things. What it means
is that the points with position vector w + t v are on the line, where t is
any ‘scalar’ (i.e. real number). A much better way is to just write L in set
notation:
L = {w + t v : t ∈ R}.
72 XI. COSETS
w+L
y L
v
w
Example XI.8. If you did matrices at school, then you will probably know
that a system of m linear equations in n variables can be written as a
single matrix equation
(XI.19) Ax = b
Ax = b, Ax0 = b.
Subtracting we find
A(x − x0 ) = 0.
So the difference x − x0 belongs to the subgroup K . Thus x belongs to the
coset x0 + K . In fact, the set of solutions to (XI.19) is precisely the coset
x0 + K . ♦
Example XI.9. In the Differential Equations module, one of things you’ll
look at are linear second order differential equations. For example, you’ll
see equations of the form
d 2x dx
(XI.20) a 2
+b + c x = f (t ),
dt dt
with a, b, c constants (again, it is likely that you’ve seen these at school).
To solve this you look at the corresponding homogeneous equation
d 2x dx
(XI.21) a 2
+b + c x = 0.
dt dt
Convince yourself that the solutions to the homogeneous equation (XI.21)
form a group K with respect to addition (revise Section VIII.5 if you need
to). In some textbooks on differential equations (and some old A-Level
maths textbooks), K is called the kernel. Now we ask the same question
as in the previous example: what is the relation between the solutions to
(XI.20) and K ? Again, if (XI.20) does not have a solution then there is no
relation. Suppose it has solutions, and let x 0 (t ) be one of them. In your
Differential Equations module, x 0 (t ) is called ‘a particular integral’. If x(t )
is any other solution to (XI.20), then you can check that x(t ) − x 0 (t ) is a
solution to the homogeneous equation (XI.21), and so is an element of K .
It follows that the set of solutions to (XI.20) is the coset x 0 (t ) + K . ♦
74 XI. COSETS
XI.3. Index
Definition. Let G be a group and H be a subgroup. We shall define the
index of H in G, denoted by [G : H ], to be the number of left cosets of H
in G.
Example XI.10. In Example XI.1, we computed the left cosets of R = {1, ρ 1 , ρ 2 , ρ 3 }
in D 4 and found exactly two of them: namely
{1, ρ 1 , ρ 2 , ρ 3 } and {σ0 , σ1 , σ2 , σ3 }.
So the index [D 4 : R] = 2. ♦
Example XI.11. In Example XI.3 we found that the cosets of 2Z in Z are
2Z itself, and 1 + 2Z, so the index [Z : 2Z] = 2. If you’ve done Exercise XI.4
then you’ll know that [Z2 : 2Z2 ] = 4. ♦
Example XI.12. In Example XI.6, we found that the cosets of the circle
group S in C∗ are the circles centred at the origin. So the index [C∗ : S] =
∞.
Example XI.13. Now let’s look at the index of the trivial group {0} as a
subgroup of Z. Note that
a + {0} = {a}.
So the cosets of {0} in Z are
. . . , {−2}, {−1}, {0}, {1}, {2}, . . .
Clearly [Z : {0}] = ∞. ♦
Exercise XI.14. Let G be a finite group. Let {1} be the trivial subgroup
consisting only of the identity element. Explain why [G : {1}] = |G|.
A priceless tip! finite. The best way to show that two finite sets have the same number
of elements is to set up a bijection between them. Let
φ : H → g H, h 7→ g h.
g −1 (g h 1 ) = g −1 (g h 2 ).
Thus h 1 = h 2 .
Surjectiveness: Suppose k is an element of the coset g H . We want to
show that k is of the form φ(h) for some element h of the subgroup H . But
by definition, g H = {g h : h ∈ H }, so k = g h = φ(h) for some h in H .
A Highbrow Remark for the Cognoscenti. Note that the proof that φ :
n
H → g H is a bijection did not assume the finiteness of H ; it is true for any
subgroup H whether finite or infinite. The finiteness is used to conclude
w
that the number of elements of H and the number of elements of g H are
, a
the same. What happens if H is infinite? Mathematicians still think of H
y
and g H as having the same number of elements, even though they are
infinite, simply because there is a bijection between them. Thus |2Z| =
w n
|1 + 2Z|, and |S| = |2S|. However, |2Z| 6= |S|, because 2Z is countable and
S is uncountable. If you find this interesting, have a look at cardinalities
ya
on Wikipedia. But only a brief look; trust me, set theory is as boring as
hell. In any case, feel free to ignore this remark.
Example XI.16. Now is a good time to revisit the examples at the begin-
ning of the chapter and make sure that Lemma XI.15 holds for them.
1Two sets A, B are disjoint if they have no members in common. Another way of
saying the same thing is: two sets A, B are disjoint if A ∩B = ;. I’m now confused—have
I said it in another way, or in the same way but with more notation?
76 XI. COSETS
Example XI.18. Look again at Example XI.6 and in particular Figure XI.1.
There we looked the cosets of the circle subgroup S inside C∗ . We found Geometric Epiphany
that the cosets are the circles centred at the origin of positive radius. It is I
obvious that two such circles are either equal or disjoint. ♦
Example XI.19. In Example XI.7, we saw that a line L in R2 passing through Geometric Epiphany
the origin defines a subgroup. The cosets of L are the lines parallel to it. II
Again it is clear that two lines parallel to L are either equal or disjoint. ♦
Quotient Groups
ie
qu
ot
!
! I love qu
o
tients! I lov
nts
eq
tie
uo o
tie qu
nt ve
s!
Il I lo
ov s!
ent
ti
e
qu
o
qu
ot
ien
ve
I lo
!ts
(mod mZ) if and only if a ≡ b (mod m). The concept of congruence mod-
ulo subgroups is a generalization of the earlier concept of congruence mod-
ulo integers. ♦
Example XII.2. Z is a subgroup of R. Two real numbers a, b are con-
gruent modulo Z if and only if a − b ∈ Z. This means that their differ-
ence is an integer. So, for example 1437.14 ≡ 0.14 (mod Z). It may seem
that congruence modulo Z is a stupid idea. After all, we’re concentrating
on the small fractional part of number and ignoring the bigger integer
part. However in some situations, the fractional part is the important
one. Let’s see one of those situations. In Example VIII.17 we defined the
circle group
S = {α ∈ C : |α| = 1}.
Let
f : R → S, f (θ) = e 2πi θ .
What happens to f (θ) as θ changes? If we start with θ = 0 ∈ R and increase
the value of θ, then f (θ) starts at 1 ∈ S and moves anticlockwise. When θ
reaches 1 ∈ R then f (θ) will have done a complete circle and returned to
1 ∈ S. By the time θ reaches 2 ∈ R, f (θ) will have done another complete
circle and returned again to 1 ∈ S. Of course, you want me to be less
clumsy and say that f is periodic with period 1. Indeed f (φ) = f (θ) if and
only if φ = θ + n where n is an integer. Now Z is a subgroup of R. So we
can rewrite that fact as f (φ) = f (θ) if and only if φ ≡ θ (mod Z). ♦
Example XII.3. Let X = {(a, 0) : a ∈ R}. It’s easy to show that X is a sub-
group of R2 , which is simply the x-axis. What does it mean for two points
to be congruent modulo X ? Suppose (a 1 , b 1 ) and (a 2 , b 2 ) are in R2 . Then
(a 1 , b 1 ) ≡ (a 2 , b 2 ) (mod X ) if and only if (a 1 −a 2 , b 1 −b 2 ) belongs to X . This
happens if and only if b 1 − b 2 = 0. So two points are congruent modulo X
if and only if they have the same y-coordinate. ♦
Example XII.4. Let G = R[x]. Let H = { f ∈ R[x] : f (0) = 0}. It is an easy
exercise to show that H is a subgroup of R[x]. Now let’s understand what
it means for two polynomials to be congruent modulo H . Suppose g ,
h ∈ R[x]. Write 1
g = a0 + a1 x + · · · + an x n , h = b0 + b1 x + · · · + bn x n ,
1It seems that we’re writing f and g both as polynomials of the same degree n; this
looks wrong as there is no reason to suppose that g and h have the same degree. But
looks can be misleading. Here we’re in fact writing g and h as polynomials of degree at
most n. For example, if g = 2 + 7x and h = 4 − 3x + 2x 3 then we can take n = 3 and let
a 0 = 2, a 1 = 7, a 2 = a 3 = 0, and b 0 = 4, b 1 = −3, b 2 = 0, b 3 = 2
XII.2. CONGRUENCE CLASSES AND COSETS 81
Exercise XII.5. Let (G, +) be an abelian group. We know that {0} and G
are subgroups of G. What does it mean for a and b to congruent modulo
{0}? What does it mean for a and b to be congruent modulo G?
a = a + H.
(XII.23) a + b = a + b.
XII.3. R/Z
In Example XII.2 we looked at congruences in R modulo Z. We now
want to understand the group R/Z. Note that every real number is con-
gruent modulo Z to a unique number in the half-open interval
Therefore
R/Z = {a : a ∈ [0, 1)}.
So when we add a + b, we take the result of a + b, and and simplify by
subtracting an integer if necessary to obtain c ∈ [0, 1), and then letting
a + b = c. For example,
f : R → S, f (θ) = e 2πi θ ,
In essence what this is saying is that the two groups (R/Z, +) and (S, ·) are
‘essentially the same’. Indeed, they’re isomorphic. Now is a good time to
look again at Chapter X.
Here is how you should think about R/Z. We identified it with the
interval [0, 1). Think of starting at 0.95 and moving up in small steps of
0.01:
0.95, 0.96, 0.97, 0.98, 0.99, 0.00, 0.01, 0.02, 0.03, . . .
So we should really think of R/Z as the interval [0, 1) with one end joined
to the other. If you take a string and join one end to the other you ob-
tain a loop, or a ‘circle’. This is what fˆ is doing. It is showing that R/Z is
isomorphic to the unit circle S. Indeed, the 2π in the formula for fˆ is a
‘stretching factor’, since the interval [0, 1) of length 1 has to be ‘stretched’
around the unit circle of perimeter 2π.
Exercise XII.8. R/Z has four elements of order 5. Find them.
Exercise XII.9. Let α ∈ [0, 1). Show that α has finite order in R/Z if and
only if α is rational.
Exercise XII.10. Show that fˆ takes rational numbers to roots of unity.
XII.4. R2 /Z2 83
XII.4. R2 /Z2
After R/Z you shouldn’t have any trouble imagining R2 /Z2 . You’re al-
lowed to shift any point in R2 by an integer multiple of i and an integer
multiple of j. So you end up in the unit square:
(XII.24) {(x, y) : 0 ≤ x < 1, 0 ≤ y < 1}.
But you should think of this square as having the top side glued to the
bottom side, and the right side glued to the left side! See Figure XII.2
(0, 1) (1, 1)
Peb
(0, 0) (1, 0)
F IGURE XII.2. R2 /Z2 is really just the unit square with the
top side glued to the bottom side, and the right side glued
to the left side.
♦
Exercise XII.12. Find all elements of order 3 in R2 /Z2 (there are eight of
them).
Exercise XII.13. In Z2 we let i = (1, 0) and j = (0, 1) as usual. Write 2Z2 =
{(2a, 2b) : a, b ∈ Z}. Convince yourself that 2Z2 is a subgroup of Z2 of
index 4 and that
Z2 /2Z2 = {0, i, j, i + j}.
Write down an addition table for Z2 /2Z2 .
Exercise XII.14. How would you describe C/Z[i ]? Is it really different
from R2 /Z2 ?
Exercise XII.15. How would you describe C/Z? Find all elements of order
2.
XII.5. R/Q
In this section, we shall briefly think about R/Q. In R/Z, we treat the
integers as ‘zero’. In R/Q, we treat the rationals as ‘zero’. This is a much
trickier quotient group. The trickiness does not come from the defini-
tion; there is no difficulty there. We can add in R/Q using the defini-
tion (XII.23). The problem is with ‘simplifying’ the result. Let’s try some
p
numerical examples
p so that you see what I mean. If we take a = 1 + 2
and b = 2/3 − 2, then
a + b = a + b = 5/3 = 0,
because 5/3 − 0 = 5/3 ∈ Q. However, if we take a = π and b = e (where
these have their usual values) then
a + b = π + e.
Can we simplify this? For example, is this equal to 0? It is if and only if
π + e is a rational number. No one knows if the number π + e is rational
or not (but we know that both π and e are irrational). So we don’t know if
the result of the above calculation is equal to 0 or not.
To wrap up this chapter, we need to check one thing: that G/H is in-
deed a group.
Theorem XII.18. Let (G, +) be an additive abelian group and H a sub-
group. Then (G/H , +) is an abelian group.
P ROOF. All we have to do is check the defining properties for abelian
groups. I’ll just check that addition is commutative and leave the rest
to you. Suppose a, b ∈ G. Then
b+a =b+a from the definition of addition in G/H
(XII.26) = a +b b + a = a + b as G is abelian
= a +b from the definition of addition in G/H .
We’ve only looked at quotients of abelian additive groups. For general
groups, things are more tricky. At the heart of the trickiness is that in the
non-abelian setting the binary operation on cosets might not be well-
defined. For now, if you’ve got to grips with Z/mZ, R/Z and R2 /Z2 then
you’ve made an excellent start with quotients.
CHAPTER XIII
Symmetric Groups
The coolest groups have elements that are functions. Matrix groups
are examples, and symmetric groups are other examples. It turns out that
every finite group is a subgroup of one of the symmetric groups. So if we
understand symmetric groups completely, then we’ll understand finite
groups completely. Have I given you hope that you’ll have a complete
understanding of finite groups by the end of the chapter? Sorry, I was
saying ‘if . . . ’
XIII.1. Motivation
Let A be a set, and let f , g be functions from A to itself. We know that
we can compose f , g to obtain f ◦ g which is also a function from A to
itself. We shall write Map(A) for the set of functions from A to itself. Then
◦ is a binary operation on Map(A). And it’s natural to ask if this makes
Map(A) into a group. After all, we know by Lemma II.2 that composition
of functions is associative. The following example will help clarify these
ideas.
Example XIII.1. Let A = {1, 2}. You will quickly convince yourself that
there are only four functions from A to itself, which are given in Fig-
ure XIII.1.
1 1 1 1
2 2 2 2
f1 f2
1 1 1 1
2 2 2 2
f3 f4
87
88 XIII. SYMMETRIC GROUPS
1 a 1 a
b b
2 c 2 c
f1 f2
B C
a 1 a u
b b v
c 2 c w
f3 f4
Example XIII.5. Look back at the functions in Figure XIII.1 for a very ba-
sic illustration of Theorem XIII.4. ♦
Example XIII.6. Theorem XIII.4 is true for finite sets only. For infinite
sets it might or might not hold. Let f 1 : N → N be given by f 1 (x) = x +
1. Then f 1 is not surjective since 0 is not in the image. However, f 1 is
injective. See Figure XIII.3. By contrast, let f 2 : Z → Z be also given by
N N Z Z
0 0 .. ..
. .
1 1 −2 −2
2 2 −1 −1
3 3 0 0
4 4 1 1
5 5 2 2
.. .. .. ..
. . . .
f1 f2
x 7→ x + 1. Then f 2 is a bijection. ♦
The following theorems collect together key results regarding bijec-
tions. Again you know all of this from Foundations.
Theorem XIII.7. Let f : A → B and g : B → C be bijections. Then g ◦ f is a
bijection A → C .
Definition. Let A be a set. The identity map on A is the map id A : A → A
satisfying id A (x) = x for all x ∈ A.
Let f : A → A be a function on A. We say that f is invertible if there
exists a function g : A → A such that f ◦ g = g ◦ f = id A . If such a g exists
then we call it the inverse of f and denote it by f −1 .
Of course you know from Foundations that if f has an inverse then it
is unique.
Theorem XIII.8. Let f : A → A be a function. Then f is invertible if and
only if f is a bijection. If f is invertible then f −1 is also a bijection.
XIII.4. S n 91
XIII.4. S n
We define S n to be the group Sym({1, 2, . . . , n}). We call S n the n-th
symmetric group. In Example XIII.9 we found that S 2 is a group of order
2.
92 XIII. SYMMETRIC GROUPS
Likewise,
µ(1) = 1, µ(2) = 3, µ(3) = 2.
Now let us compute ρµ. As always, this means apply µ first then ρ. So
Thus
µ ¶
1 2 3
ρµ = .
3 2 1
Similarly,
µ ¶
1 2 3
µρ = .
2 1 3
We rearrange this:
Hence
µ ¶
−1 1 2 3
ρ = .
2 3 1
♦
Exercise XIII.17. Write down a multiplication table for S 3 and determine
the orders of all six elements checking that your answers are consistent
with Lagrange’s Theorem.
Exercise XIII.18. Let ρ and τ be the following permutations:
µ ¶ µ ¶
1 2 3 4 5 1 2 3 4 5
ρ= , τ= .
2 3 5 1 4 3 1 2 5 4
Compute ρ −1 , ρτ, τ2 .
Exercise XIII.19. Show that S n is non-abelian for n ≥ 3.
Exercise XIII.20. Recall (page 32) that we interpreted elements of D 4 as
functions from {1, 2, 3, 4} to itself. Go back and check that these are bijec-
tions. Thus D 4 is a subgroup of S 4 .
94 XIII. SYMMETRIC GROUPS
and so on. Until we reach σ(m + 1). This can be any element of B , and so
there are n − m choices for σ(m + 1). etc. You see that the order of H is
m(m − 1) · · · 1 · (n − m)(n − m − 1) · · · 1 = m!(n − m)!
Don’t you just love maths?
Exercise XIII.21. To make sure you’ve understood the argument above,
let m = 2 and n = 4, so that
A = {1, 2}, B = {3, 4}.
Now write down all permutations σ in S 4 that satisfy σ(a) ∈ A for all a ∈ A
and σ(b) ∈ B for all b ∈ B , and convince yourself that these form a group.
2 3
5
5 1
2 3 4
Finally (1) is the cycle that takes 1 to itself and fixes all the other el-
ements. Clearly (1) = (2) = (3) = (4) = (5) = id is nothing other than the
identity permutation. ♦
I hope that the above example has convinced you that cycle notation
is simultaneously more concise and and more transparent than the old
notation. If so, the following theorem will come as a pleasant surprize.
Theorem XIII.23. Every permutation can be written as a product of dis-
joint cycles.
What does disjoint mean? Two cycles (a 1 , a 2 , . . . , a n ) and (b 1 , b 2 , . . . , b m )
are said to be disjoint if a i 6= b j for all i , j . What does product mean? The
product of two permutations is of course their composition as functions.
Before we prove the theorem, let’s see an example where we write down
a permutation as a product of cycles.
Example XIII.24. Let
µ ¶
1 2 3 4 5 6 7 8
ρ= .
5 7 1 4 8 2 6 3
Write ρ as a product of disjoint cycles.
Answer: We start with 1 are repeatedly apply ρ to it:
1 7→ 5 7→ 8 7→ 3 7→ 1.
Therefore ρ contains the cycle (1, 5, 8, 3). Now we start with an element of
the set {1, 2, . . . , 8} that is not contained in the cycle (1, 5, 8, 3). For example
start with 2 and repeatedly apply ρ to it:
2 7→ 7 7→ 6 7→ 2.
So ρ also contains the cycle (2, 7, 6). Note that the cycles (1, 5, 8, 3) and
(2, 7, 6) are disjoint, and ρ contains the product (or composition) (1, 5, 8, 3)(2, 7, 6).
There still remains one element of the set {1, 2, . . . , 8} that does not appear
as either of the two cycles (1, 5, 8, 3) and (2, 7, 6) and this is 4. Applying ρ
to 4 we find:
4 7→ 4.
So
ρ = (1, 5, 8, 3)(2, 7, 6)(4)
XIII.6. CYCLE NOTATION 97
It is clear that µ1 has the same effect as ρ on the elements 1, ρ1, . . . , ρ u−1 1.
Now let a be the first element of the set {1, 2, . . . , n} not appearing in
the list 1, ρ1, . . . , ρ u−1 1. Repeat the above argument with the sequence
a, ρa, ρ 2 a, ρ 3 a, . . . .
We deduce the existence of a cycle
µ2 = a, ρa, . . . , ρ v−1 a
¡ ¢
XIII.7. PERMUTATIONS AND TRANSPOSITIONS 99
such that µ2 and ρ have the same effect on the elements a, ρa, . . . , ρ v−1 a.
Let us show that µ1 and µ2 are disjoint. Suppose otherwise. Then ρ i 1 =
ρ j a for some 0 ≤ i < u and 0 ≤ j < v. Now apply ρ v− j to both sides to
obtain ρ k 1 = a where k = i +v − j . This contradicts our assumption that a
does not appear in the list 1, ρ1, . . . , ρ u−1 1. Hence the cycles µ1 and µ2 are
disjoint. Now the product µ1 µ2 has the same effect as ρ on the elements
1, ρ1, . . . , ρ u−1 1, a, ρa, . . . , ρ v−1 a.
We repeat the process, starting with the first element of {1, 2, . . . , n} not
appearing in either cycle µ1 , µ2 to construct a µ3 that is disjoint from both
µ1 and µ2 , etc. As the set {1, 2, . . . , n} is finite, this process must terminate
eventually with some µr . The product of disjoint cycles µ1 µ2 . . . µr has the
same effect on {1, . . . , n} as ρ. Therefore
ρ = µ1 µ2 · · · µr .
Exercise XIII.30. We will shortly meet the Alternating Groups, one of which
is
A 3 = {id, (1, 2, 3), (1, 3, 2)}.
Verify that A 3 is a subgroup of S 3 , and write down its left cosets.
Exercise XIII.31. Verify that H = {id, (1, 2)} is a subgroup of S 3 , and write
down its left cosets.
Exercise XIII.32. (i) Use Lagrange’s Theorem to show that S 4 does
not have an element of order 5.
(ii) Let σ = (a 1 , a 2 , . . . , a m ) be a cycle of length m in S n . Explain why
σ has order m.
(iii) Now let ρ = σ1 σ2 . . . σk where the σi are disjoint cycles of lengths
m i in S n . Explain carefully why ρ has order lcm(m 1 , m 2 , . . . , m k ).
(iv) Show that S 4 does not have elements of order 6. Could you have
shown this using Lagrange’s Theorem?
(XIII.29) (a 1 , a 2 , . . . , a m ) = (a 1 , a m ) · · · (a 1 , a 3 )(a 1 , a 2 ).
100 XIII. SYMMETRIC GROUPS
Example XIII.34. Equation (XIII.29) gives a recipe for writing any cycle
as a product of transpositions. For example,
(1, 5, 9) = (1, 9)(1, 5).
Note that these transpositions are not disjoint and so they don’t have to
commute. Check that
(1, 9)(1, 5) 6= (1, 5)(1, 9).
One thing to be careful about is that decomposition of a permutation as
a product of transpositions is not in any way unique. For example, using
(XIII.29) we have
(1, 2, 3, 4) = (1, 4)(1, 3)(1, 2).
However, you can also check that
(1, 2, 3, 4) = (2, 3)(1, 3)(3, 5)(3, 4)(4, 5).
So we can write (1, 2, 3, 4) as a product of 3 transpositions and as a prod-
uct of 5 transpositions. Can we write it as a product of 4 transpositions?
Spend no more and no less than five minutes thinking about this. ♦
So the one factor in group (IV) simply changes sign. We see that τ(P 5 ) has
the same factors as P 5 with three sign changes: τ(P 5 ) = (−1)3 P 5 = −P 5 . ♦
P ROOF OF L EMMA XIII.36. Let τ = (`, m). The transposition (`, m) swaps
` and m, and keeps everything else fixed. In particular (`, m) = (m, `).
So we can suppose that ` < m. Any factor x i − x j where neither i nor j
is equal to ` nor m, is unaffected by τ. We pair off the other factors as
follows:
x1 − x` , x1 − xm ,
x 2 − x ` ,
x2 − xm ,
(I) . .
.. ..
x `−1 − x ` , x `−1 − x m ,
x ` − x `+1 , x `+1 − x m ,
x ` − x `+2 , x `+2 − x m ,
(II) .. ..
. .
x ` − x m−1 , x m−1 − x m ,
x ` − x m+1 , x m − x m+1 ,
x ` − x m+2 , x m − x m+2 ,
(III) .. ..
. .
x` − xn , xm − xn ,
n
(IV) x` − xm .
Now τ swaps each pair in (I), keeping the signs the same; it swaps each
pair in (II) and changes the sign of each; it swaps each pair in (III), keep-
ing the signs the same; it changes the sign of x ` −x m . So τ(P n ) has exactly
the same factors as P n , up to a certain number of sign changes. How
many sign changes? The number of sign changes is:
2(m − ` − 1) + 1.
The 1 is for changing the sign of x ` −x m . There are 2 sign changes coming
from each pair in (II). The number of such pairs is m − ` − 1. Since the
number of sign changes is odd, we see that τ(P n ) = −P n .
Lemma XIII.38. If σ ∈ S n then σ(P n ) = ±P n . More precisely, if σ is a prod-
uct of an even number of transpositions then σ(P n ) = P n and if σ is a prod-
uct of an odd number of transpositions then σ(P n ) = −P n .
P ROOF. Recall, by Lemma XIII.33, that we can write every permutation as
a product of transpositions. Every transposition changes the sign of P n .
The lemma follows.
Example XIII.39. We have noted in Example XIII.34 that the way we ex-
press a permutation as a product of transpositions is not unique. Indeed
XIII.8. EVEN AND ODD PERMUTATIONS 103
we saw that
(1, 2, 3, 4) = (1, 4)(1, 3)(1, 2), (1, 2, 3, 4) = (2, 3)(1, 3)(3, 5)(3, 4)(4, 5).
So we can write (1, 2, 3, 4) as a product of 3 transpositions and as a prod-
uct of 5 transpositions. We asked the question of whether (1, 2, 3, 4) can
be written as a product of 4 transpositions? Write σ = (1, 2, 3, 4). From the
above lemma, we see that σ(P n ) = −P n . If we’re able to write σ as a prod-
uct of an even number of transpositions then σ(P n ) = P n . We would then
have P n = −P n which is a contradiction. Therefore we cannot write σ as
a product of 4 transpositions. ♦
You should now have no trouble in proving the following theorem.
Theorem XIII.40. Every permutation in S n can be written as a product of
either an even number of transpositions, or an odd number of transposi-
tions but not both.
We shall call a permutation even if we can write it as a product of an
even number of transpositions, and we shall call it odd if we can write it
as a product of an odd number of transpositions.
Example XIII.41. (1, 2, 3, 4) is an odd permutation because we can write
it as the product of 3 transpositions:
(1, 2, 3, 4) = (1, 4)(1, 3)(1, 2).
Indeed, a cycle of length n can be written as product of n − 1 transposi-
Beware! tions by (XIII.29). So a cycle of length n is even if n is odd, and it is odd if
n is even!
The permutation (1, 2, 3)(4, 5) is the product of an even permutation
which is (1, 2, 3) and an odd permutation which is the transposition (4, 5).
Thus (1, 2, 3)(4, 5) is an odd permutation.
What about the identity element id? Note that id(P n ) = P n , so id must
be even. We must be able to write it as a product of an even number of
transpositions. A mathematician would say that the identity element is
the product of zero transpositions, so it is even. If you find that kind of
reasoning disturbing, you have my sympathy. Instead, note that
id = (1, 2)(1, 2),
which does allow us to check that id is indeed even.
We now come to define a very important group. Let n ≥ 2. We define
the n-th alternating group to be
A n = {σ ∈ S n : σ is even}.
As usual, all we’ve done is specify a subset of S n which we’ve denoted by
A n and we must indeed show that A n is a group.
Theorem XIII.42. A n is a subgroup of S n .
P ROOF. We’ve already seen that the identity element id is even, so id ∈ A n .
If σ, ρ ∈ A n then we can write each as an even number of transpositions.
104 XIII. SYMMETRIC GROUPS
τA n , and we know that cosets are either disjoint or equal (Lemma XI.17).
So there aren’t any other cosets and the proof is complete.
Exercise XIII.46. Let ρ and τ be as given in Exercise XIII.18. Write ρ and
τ as products of transpositions and state if they’re even or odd.
Exercise XIII.47. Write down the elements of A 3 and check that it is cyclic
(and hence abelian). Show that A n is non-abelian for n ≥ 4.
Exercise XIII.48. Let f be a polynomial in variables x 1 , x 2 , . . . , x n . Let σ be
a permutation in S n . We define σ( f ) to be the polynomial f (x σ1 , x σ2 , . . . , x σn ).
For example, if f = x 1 + x 22 + x 3 x 4 and σ = (1, 4)(2, 3) then σ swaps x 1 and
x 4 , and swaps x 2 and x 3 ; thus σ( f ) = x 4 + x 32 + x 2 x 1 . Compute σ( f ) for the
following pairs f , σ:
(i) f = x 12 − x 2 x 3 , σ = (1, 2, 3).
(ii) f = x 1 x 2 + x 3 x 4 , σ = (1, 3)(2, 4).
Exercise XIII.49. Let f be a polynomial in variables x 1 , . . . , x n .
(a) Let H be a subgroup of S n . We say that f is H -invariant if it
satisfies the property that σ( f ) = f for all σ ∈ H . We say that f is
symmetric if it is S n -invariant. Find a polynomial in x 1 , x 2 , x 3 , x 4
that is D 4 -invariant but not symmetric.
1 2 3 4
5 6 7 8
9 10 11 12
13 15 14
You can slide any tile adjacent to the blank into the position of the
blank. So starting from the initial arrangement there are two possible
moves:
106 XIII. SYMMETRIC GROUPS
1 2 3 4 1 2 3 4
5 6 7 8 5 6 7 8
9 10 11 9 10 11 12
13 15 14 12 13 15 14
1 2 3 4
5 6 7 8
9 10 11 12
13 14 15
Show that this is impossible! You might want to follow these hints and a mathematical scam
tips:
(i) Think of the blank as a tile numbered 16. This way every re-
arrangement is a permutation on 1, . . . , 16 and so an element of
S 16 .
(ii) Observe that every move is a transposition involving 16.
(iii) Observe that to go from the initial arrangement to the desired
final arrangement, tile 16 must make the same number of moves
down as up, and the same number of moves right as left.
Can you use your knowledge of maths to think of other ways of ripping applied maths!
people off? This of course is a purely intellectual exercise. As citizens of
Warwick plc you are fine upright human beings who would not dream
of putting such ideas into practice. But DON’T share these ideas with
friends from less scrupulous universities. I don’t have a particular two
universities in mind.
Riveted?
CHAPTER XIV
Rings
If groups took your breath away, wait till you meet rings.
XIV.1. Definition
A ring is a triple (R, +, ·), where R is a set and +, · are binary operations
on R such that the following properties hold
(i) (closure) for all a, b ∈ R, a + b ∈ R and a · b ∈ R;
(ii) (associativity of addition) for all a, b, c ∈ R
(a + b) + c = a + (b + c);
a + 0 = 0 + a = a.
a + (−a) = (−a) + a = 0;
a + b = b + a;
a · (b · c) = (a · b) · c;
a · (b + c) = a · b + a · c; (b + c) · a = b · a + c · a;
1 · a = a · 1 = a.
a · b = b · a.
107
108 XIV. RINGS
Note that the word ‘commutative’ in the phrase ‘commutative ring’ refers
to multiplication. Commutativity of addition is part of the definition of
ring. Some textbooks omit property (viii) from the definition of a ring.
Those textbooks call a ring satisfying (viii) a ring with unity. We shall
always assume that our rings satisfy (viii).
Observe, from properties (i)–(v), if (R, +, ·) is a ring, then (R, +) is an
abelian group.
XIV.2. Examples
Example XIV.1. You know lots of examples of rings: Z, Q, R, C, R[x], etc.
All these examples are commutative rings.
Example XIV.2. Let
½µ ¶ ¾
a b
M 2×2 (R) = : a, b, c, d ∈ R .
c d
This is the set of 2 × 2 matrices with real entries. From the properties of
matrices it is easy to see that M 2×2 (R) is a ring with the usual addition
and multiplication of matrices. The additive identity is the zero matrix,
and the multiplicative identity is I 2 . The ring M 2×2 (R) is an example of a
non-commutative ring, as matrix multiplication is non-commutative.
Similarly we define M 2×2 (C), M 2×2 (Z), M 2×2 (Q). These are all non-
commutative rings. ♦
Theorem XIV.3. Let m be an integer satisfying m ≥ 2. Then Z/mZ is a
ring.
P ROOF. We really mean that (Z/mZ, +, ·) is a commutative ring. We’ve al-
ready seen that Z/mZ is closed under addition and multiplication, and
that (Z/mZ, +) is an abelian group. I leave you to ponder why the re-
maining properties (vi)–(ix) must be true.
Example XIV.4. You’re familiar with the following two binary operations
on R3 : addition and the cross product (also known as the vector product).
Is (R3 , +, ×) a ring? No. First the cross product is not associative. For
example,
i × (j × j) = 0, (i × j) × j = −i.
We only need one of the properties (i)–(viii) to fail for us to conclude that
(R3 , +, ×) is not a ring. We know that (vi) fails. It is interesting to note that
(viii) fails too, as we now show. Indeed,
(XIV.30) a × b = −b × a.
Suppose 1 is a vector in R that satisfies
3
a×1 = 1×a = a
for all a ∈ R3 . From (XIV.30) we see that a = −a for all a ∈ R3 . This gives a
contradiction. Therefore (viii) fails too. ♦
XIV.2. EXAMPLES 109
XIV.3. Subrings
Just as we have subgroups, so we have subrings.
Definition. Let (R, +, ·) be a ring. Let S be a subset of R and suppose that
(S, +, ·) is also a ring with the same multiplicative identity. Then we say
that S is a subring of R (or more formally (S, +, ·) is a subring of (R, +, ·)).
For S to be a subring of R, we want S to a ring with respect to the same
two binary operations that makes R a ring, and 1R ∈ S where 1R is the
multiplicative identity of R.
Example XIV.8. Z is a subring of R; Q is a subring of R; Z is a subring of
Q; R is a subring of R[x]. ♦
Theorem VIII.5 gave a criterion for a subset of a group to be a sub-
group. As you’d expect we have a similar criterion for a subset of a ring to
be a subring.
Theorem XIV.9. Let R be a ring. A subset S of R is a subring if and only if
it satisfies the following conditions
(a) 0, 1 ∈ S (that is S contains the additive and multiplicative identity
elements of R);
(b) if a, b ∈ S then a + b ∈ S;
(c) if a ∈ S then −a ∈ S;
(d) if a, b ∈ S then ab ∈ S.
P ROOF. First re-read the proof of Theorem VIII.5. Then prove this theo-
rem on your own. It won’t take you long.
Example XIV.10. In Example VIII.7, we saw that the set of even integers
2Z is a subgroup of Z. Strictly speaking, (2Z, +) is a subgroup of (Z, +).
Now we know that (Z, +, ·) is a ring. Is (2Z, +, ·) a subring? From Theo-
rem XIV.9 we see that it isn’t because 1 ∉ 2Z. ♦
Example XIV.11. In view of the previous example, let’s try to discover if Z
has any subrings other than itself. Let S be a subring of Z. We know that
0, 1 ∈ S. Also, by (b) we know that 2 = 1 + 1 ∈ S. Repeating the argument,
3 = 2 + 1 ∈ S and so on. By induction we know that 0, 1, 2, . . . are all in S.
But by (c), if a ∈ S then −a ∈ S. So . . . , −3, −2, −1 are also in S. Hence Z is
contained in S. But S is a subset of Z. So they must be equal: S = Z.
Therefore, the only subring of Z is Z itself. By contrast, in Section IX.2
we saw that Z has infinitely many subgroups. ♦
Exercise XIV.12. Let m be an integer satisfying m ≥ 2. Show that the only
subring of Z/mZ is Z/mZ itself.
Remark. The easiest way to show that a set is a ring is to show that it is A very important
a subring of a known ring. If you do this, you only have four properties tip!
to check (a),(b),(c),(d). If you don’t do this, you’ll have eight properties to
check (i)–(viii). The following two examples will help you appreciate this
principle.
XIV.3. SUBRINGS 111
(u 1 u 2 )(v 1 v 2 ) = (u 1 v 1 )(u 2 v 2 ) = 1 · 1 = 1.
You will easily see that this is consistent with the earlier definitions of
GL2 (R), GL2 (C), GL2 (Q) and GL2 (Z), and that moreover, (M 2×2 (R))∗ = GL2 (R).
Example XIV.26. Let
½µ ¶ ¾
a b
S= : a, b, c ∈ Z .
0 c
Show that S is a ring under the usual addition and multiplication of ma-
trices. Compute S ∗ .
Answer: To show that S is a ring it is enough to show that it is a subring
of M 2×2 (Z). We leave that as an easy exercise.
Let us compute the unit group. Suppose A = a0 bc is in S. To be unit
¡ ¢
it is not enough for this matrix to be invertible, we also want the inverse
to belong to S. So we require the determinant ac to be non-zero and we
want µ ¶ µ ¶
−1 1 c −b 1/a −b/ac
A = =
ac 0 a 0 1/c
to belong to S. Thus we want the integers a, b, c to satisfy
1 1 b
ac 6= 0, , ,− ∈ Z.
a c ac
This happens precisely when a = ±1 and c = ±1. Thus
½µ ¶ ¾
∗ ±1 b
S = :b∈Z .
0 ±1
♦
Exercise XIV.27. In Example XIV.15, we showed that
na o
S = r : a, r ∈ Z, r ≥ 0
2
is a ring. Find its unit group.
group of Z[i ]. The most elegant way of doing this is via the norm map.
We define the norm map N : Z[i ] → Z by
N (a + bi ) = a 2 + b 2 , a, b ∈ Z.
The norm map is multiplicative:
Lemma XIV.28. Let α, β ∈ Z[i ]. Then N (αβ) = N (α)N (β).
P ROOF. α and β are complex numbers, and you can see that N (α) = |α|2 .
From the properties of the absolute value you know that |αβ| = |α| · |β|.
The lemma follows.
Theorem XIV.29. The unit group of Z[i ] is {1, −1, i , −i }.
In other words, (Z[i ])∗ = U4 , the group of fourth-roots of unity.
P ROOF. We want the units of Z[i ]. Let α be a unit. Then there is some
β ∈ Z[i ] such that 1 αβ = 1. Applying the norm map, and recalling that it
is multiplicative, we see that
N (α)N (β) = N (αβ) = N (1) = 1.
Now N (α) and N (β) are in Z (go back to the definition of the norm map
to see this), and they multiply to give 1. So
N (α) = N (β) = 1, or N (α) = N (β) = −1.
Write α = a + bi where a, b are in Z. Then a 2 + b 2 = N (α) = ±1. Of course
−1 is impossible, so a 2 + b 2 = 1. But a, b are integers. So (a, b) = (±1, 0) or
(0, ±1). Hence α = a + bi = ±1 or ±i . Clearly ±1, ±i are units. So the unit
group is
Z[i ]∗ = {1, −1, i , −i }.
Remark. Compare the above proof to our determination of the unit group
of M 2×2 (Z) in Example XIV.25. I hope you agree that the similarities are
striking!
Exercise XIV.30. In Exercise XIV.16 you met the ring Z[2i ]. Find its unit
group. (Hint: Show first that any unit in Z[2i ] is a unit in Z[i ].)
p p p
Exercise XIV.31. pLet Z[ 2] = {a + b 2 : a, b ∈ Z}. Show that Z[ 2] is a
ring
p and that 1 + 2 is a unit. What is its order as an element of the group
Z[ 2]∗ ?
Exercise XIV.32. Let ζ = e 2πi /3 (this is a cube root of unity). Check that
ζ = ζ2 . Let Z[ζ] = {a + bζ : a, b ∈ Z}.
(i) Show that ζ2 ∈ Z[ζ] (Hint: the sum of the cube roots of unity is
. . . ).
(ii) Show that Z[ζ] is a ring.
1We could have written αβ = βα = 1. But Z[i ] is a commutative ring, so writing
αβ = 1 is enough.
XIV.5. THE UNIT GROUP OF THE GAUSSIAN INTEGERS 117
Fields
r u k m
α= + i, β = + i,
s v ` n
where r , s, u, v, k, `, m, n are integers and s, v, `, n are non-zero. This would’ve worked,
but why do it? Get used to thinking of rational numbers as numbers in their own right!
119
120 XV. FIELDS
is not a field.
CHAPTER XVI
Congruences Revisited
We saw that there are two binary operations defined on Z/mZ, ad-
dition and multiplication. These make (Z/mZ, +, ·) a commutative ring,
and (Z/mZ, +) a cyclic group of order m. We want to know about the unit
group of Z/mZ.
(Z/4Z)∗ = 1, 3 , (Z/5Z)∗ = 1, 2, 3, 4 .
© ª © ª
In particular, Z/2Z, Z/3Z and Z/5Z are fields and Z/4Z, Z/6Z are not
fields. Can you make a general guess as to which Z/mZ are fields and
which aren’t? Can you prove your guess? ♦
Theorem XVI.2. Let a ∈ Z/mZ. Then a is a unit in Z/mZ if and only if
gcd(a, m) = 1. Thus
(Z/mZ)∗ = a : 0 ≤ a ≤ m − 1 and gcd(a, m) = 1 .
© ª
1
gcd(19, 256) = 1, but let’s use Euclid’s Algorithm to write 1 as a linear
combination of 19 and 256:
256 = 13 × 19 + 9
19 = 2 × 9 + 1.
Thus
1 = 19 − 2 × 9 = 19 − 2 × (256 − 13 × 19) = (1 − 2 × −13) × 19 − 2 × 256,
so
1 = 27 × 19 − 2 × 256.
Hence 27 × 19 ≡ 1 (mod 256), so 27 is the inverse of 19 in Z/256Z. ♦
Vale Dicere
XVI.4.
With tear-filled eyes I say goodbye, and begin to suffer the heart-rending a sincere outpouring
pangs of separation . . . of grief
Exercise XVI.17. Write a 5000 word essay on how abstract algebra has
changed your outlook on life, detailing the insights you have gained into
the great challenges facing/menacing humanity 1.
Exercise XVI.18. “Veneration of abstract algebra is at the root of contem-
porary mathematical decline.” Discuss.
1Here’s a good way to start your essay: “Unbeknownst to me, I misspent the first 18
years of my life wallowing in a cesspool of intellectual stagnation, until week six of term
1 when the Abstract Algebra lectures started . . . ”.