Number Theory PDF
Number Theory PDF
Andrew Kobin
Contents
3 Linear Congruence 21
3.1 Modular Arithmetic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3.2 Linear Congruence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
7 Reciprocity 50
7.1 Quadratic Residues . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
7.2 Quadratic Reciprocity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
7.3 Applications of Quadratic Reciprocity . . . . . . . . . . . . . . . . . . . . . . 59
i
Contents Contents
9 Preliminaries 65
9.1 Basic Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
9.2 Euler-Maclaurin Summation . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
9.3 The Bernoulli Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
10 Euler’s Work 79
10.1 On the Sums of Series of Reciprocals . . . . . . . . . . . . . . . . . . . . . . 80
10.2 Newton’s Identities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
10.3 Euler’s Product Form . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
10.4 The Prime Number Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 97
ii
Contents Contents
iii
Contents Contents
22 Curves 390
22.1 Divisors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 392
22.2 Morphisms Between Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . 395
22.3 Linear Equivalence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 398
22.4 Differentials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 400
22.5 The Riemann-Hurwitz Formula . . . . . . . . . . . . . . . . . . . . . . . . . 402
22.6 The Riemann-Roch Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 404
22.7 The Canonical Map . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 406
22.8 Bézout’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 407
22.9 Rational Points of Conics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409
iv
Contents Contents
VI L-Functions 496
28 Introduction 498
30 Duality 518
30.1 Functions of Positive Type . . . . . . . . . . . . . . . . . . . . . . . . . . . . 519
30.2 Fourier Inversion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 524
30.3 Pontrjagin Duality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 531
v
Contents Contents
vi
Part I
vii
Chapter 1
Introduction
The notes in Part I were compiled from a series of student-led lectures at Wake Forest
University under the advisory of Dr. Jeremy Rouse. The main source is Number Theory
Through Inquiry (Marshall, Odell and Starbird). The main topics include:
Divisibility
Quadratic reciprocity
1
1.1. Divisibility Chapter 1. Introduction
1.1 Divisibility
Definition. The natural numbers are the counting numbers 1, 2, 3, . . ., denoted N.
Definition. The number 0 and the negative numbers extend the natural numbers to the
integers, denoted Z.
Definition. For two integers a and d, d divides a (or d | a) if there is an integer k such
that a = kd.
Definition. For two integers a and b, a and b are congruent modulo n if for some natural
number n, n | (a − b), denoted a ≡ b (mod n).
Proof. Let 6 | n. Then there exists k ∈ Z such that n = 6k. By associativity, n = 6k = 3(2k)
and 2k ∈ Z so 3 | n.
Example 1.1.2. Let k ∈ Z such that k ≡ 7 (mod 2). Show that k ≡ 3 (mod 2).
Proof. Let k ≡ 7 (mod 2). Then 2 | (k − 7) so there exists j ∈ Z such that k − 7 = 2j. Then
k − 3 = 2j + 4 = 2(j + 2), so 2 | (k − 3) =⇒ k ≡ 3 (mod 2).
Proof. Let a | b and a | c. Then there exist integers j and k such that b = aj and c = ak.
Consider b + c. By substitution, b + c = aj + ak = a(j + k), so a | (b + c).
Proof. Let a | b and a | c. Then there exist integers j and k such that b = aj and c = ak.
Consider b − c. By substitution, b − c = aj − ak = a(j − k), so a | (b − c).
Proof. Let a | b and a | c. Then there exist integers j and k such that b = aj and c = ak.
Consider bc = (aj)(ak) = a(jak) by association. So a | (bc).
Theorem 1.1.8. Every integer is congruent to itself. In other words, for all a, n ∈ Z with
n > 0, a ≡ a (mod n).
2
1.1. Divisibility Chapter 1. Introduction
3
1.1. Divisibility Chapter 1. Introduction
Does divisibility work with congruence in the same way? Counterexample: let n = 10,
a = 0, b = 2 and c = 5. Then ac ≡ bc (mod n) but a ≡ 6 b (mod n). Divisibility must be
handled differently.
Fact: If gcd(a, n) = 1 then there is some k ∈ Z+ such that ak ≡ 1 (mod n). (This makes no
claims as to what we have to choose for particular a and n.)
Proof. Since a ≡ b (mod n), then n | (a − b) so there exists k ∈ Z such that a − b = nk.
Multiplying by a + b we get
(a + b)(a − b) = nk(a + b)
a2 − b2 = nk(a + b).
Proof. Let a ≡ b (mod n). The base case is proven in the example above. Now suppose that
ak−1 ≡ bk−1 (mod n). Then by Theorem 1.1.13, a(ak−1 ) ≡ b(bk−1 ) (mod n) which implies
ak ≡ bk (mod n). Hence by induction on k ∈ N, if a ≡ b (mod n) then ak ≡ bk (mod n).
Proof. The base case is 3 | 9. Suppose 3 | (10k−1 − 1). Then there is some x ∈ Z such that
10k−1 − 1 = 3x. Multiplying by 10 gives us
10k − 10 = 30x
10k − 1 − 9 = 30x
10k − 1 = 30x + 9 = 3(10x + 3).
Theorem 1.1.17. Let n ∈ N such that n = ak ak−1 · · · a1 a0 where ai is the ith digit of n (as
opposed to a factor of n). If m = ak + ak−1 + . . . + a1 + a0 then n ≡ m (mod 3).
Proof. Let n and m be as described. We can write n = 10k ak + 10k−1 ak−1 + . . . + 10a1 + a0 .
Consider n − m = (10k − 1)ak + (10k−1 − 1)ak−1 + . . . + (10 − 1)a1 . And by the Lemma,
3 | (10i − 1) for each 1 ≤ i ≤ k. So 3 | (n − m), by which n ≡ m (mod 3).
4
1.2. The Division Algorithm Chapter 1. Introduction
Proof. (1) Let m, n ∈ N and let S = {ni ∈ N | ni ≥ m}. Then by the Well-Ordering Axiom,
S has a smallest element, say nj. Then nj ≥ m but n(j − 1) < m because n(j − 1) 6∈ S. Let
q = j − 1 and r = m − nq. Since 0 ≤ r < n and m = nq + r, the existence portion holds.
5
1.3. Greatest Common Divisors Chapter 1. Introduction
(36, 22) = 2
(45, −15) = 15
(−296, −88) = 8
(0, 256) = 256
(15, 28) = 1, relatively prime
(1, −2436) = 1, relatively prime
Theorem 1.3.2. Let a, n, b, r, k ∈ Z. If a = nb + r, k | a and k | b then k | r.
Proof. Let a = nb + r and suppose k | a and k | b. Then there exist s, t ∈ Z such that a = sk
and b = tk. Then
sk = ntk + r
r = sk − ntk
= k(s − nt).
And s − nt ∈ Z by closure, so k | r.
Theorem 1.3.3. Let a, b, n1 , r1 ∈ Z with a or b nonzero. If a = n1 b + r1 then gcd(a, b) =
gcd(b, r1 ).
Proof. Let a = n1 b + r1 and let d = gcd(a, b). Then a = jd and b = kd for appropriate
j, k ∈ Z. So
jd = n1 kd + r1
r1 = jd − n1 kd
= (j − n1 k)d.
And j − n1 k ∈ Z by closure, so d | r1 . Now take c, a common divisor of b and r1 and suppose
c 6= d. Then b = cs and r1 = ct for appropriate s, t ∈ Z. So
a = n1 cs + ct
= c(n1 s + t)
which implies c | a. But since d = gcd(a, b), c < d. Hence d is the greatest common divisor
of b and r1 .
6
1.3. Greatest Common Divisors Chapter 1. Introduction
The Euclidean Algorithm: Let a and b be any two integers. To find gcd(a, b),
(2) By the Division Algorithm, there exist q1 , r1 ∈ Z such that a = bq1 + r1 , with 0 ≤ r1 < b.
(4) Eventually we will obtain qk , rk such that rk−2 = rk−1 qk +rk and rk = 0. Then rk−1 | rk−2
and by Theorem 1.3.3, rk−1 is the gcd of a and b.
Note: This algorithm must terminate because there are a finite number of integers between
0 and r1 .
Example 1.3.4. Use the Euclidean Algorithm to compute the gcd of the following pairs:
7
1.3. Greatest Common Divisors Chapter 1. Introduction
8
1.3. Greatest Common Divisors Chapter 1. Introduction
1 = rN −1 − rN qN +1
= rN −1 − axqN +1 − byqN +1 .
Hence for all k ∈ Z, this process yields integers x and y such that ax + by = 1.
Theorem 1.3.7. For any integers a and b, not both zero, there exist x, y ∈ Z such that
ax + by = gcd(a, b).
Proof. Let a, b ∈ Z with at least one nonzero. Let k = gcd(a, b). By the Euclidean Algorithm,
we have the following:
Then
Proof omitted.
9
1.3. Greatest Common Divisors Chapter 1. Introduction
10
1.3. Greatest Common Divisors Chapter 1. Introduction
Theorem 1.3.13. Given a, b, c ∈ Z with a and b not both zero, there exist x, y ∈ Z such
that ax + by = c if and only if gcd(a, b) | c.
Proof omitted.
for some k ∈ Z.
Proof omitted.
11
Chapter 2
Using multiplication, any natural number can be obtained from some prime number(s).
Definition. A natural number p > 1 is prime if p is not the product of natural numbers
less than p.
Theorem 2.0.1. If n > 1 is a natural number then there exists a prime p dividing n.
Proof. Suppose there are some natural numbers that do not have any prime factors. Let
S = {n > 1 | @p such that p | n}. By the Well-Ordering Axiom, S has a smallest element,
say n. If n were prime, n | n by which n 6∈ S. So n must be composite. Then by definition
there is some natural number k < n such that k | n. And because n is the smallest element
of S, k 6∈ S so there exists a prime p dividing k. Then p | k and k | n imply p | n,
contradicting n ∈ S. Hence all composite numbers (and thus all natural numbers) have a
prime divisor.
√
Theorem 2.0.2. A natural number n > 1 is prime if and only if for all p ≤ n, p does not
divide n.
Proof. ( =⇒ ) Suppose
√ n > 1 is prime. By definition√n is not the product of natural numbers
less than n. Since n < n there are no primes p ≤ n < n that divide n.
√ √
( ⇒= ) Now suppose that for all p ≤ n, p - n. Let S = {q ∈ N | √ q prime and n < q < n}.
By the Well-Ordering Axiom, S has a least element q. Consider n < q < n. In particular,
√
n < q. Squaring this inequality, we obtain n < q 2 . Since q is the smallest element of S,
for any r ∈ S with q < r, n < q 2 < qr. Therefore there is no prime less than n that divides
n, so n is prime.
12
Chapter 2. The Prime Numbers
√
Proof. Let n = 101. Note that 10 < 101 < 11 and
2 - 101
3 - 101
5 - 101
7 - 101.
Definition. Let π(n) be the number of primes less than or equal to n. This is known as
Euler’s totient function, or simply Euler’s function.
π(n)
The Prime Number Theorem: lim n = 1.
n→∞
log n
One of the most important results in Number Theory, this statement originally appeared
in various forms in papers by Euler, Legendre, Gauss and others in the late 18th Century.
In Riemann’s landmark 1859 manuscript On the Number of Primes Less Than a Given
Magnitude, Riemann outlined a method by complex analysis to obtain a proof of the theorem.
The Prime Number Theorem was proven independently by Hadamard and de la Vallé-Poissin
in 1896, using Riemann’s methods from forty years earlier.
13
2.1. The Fundamental Theorem of Arithmetic Chapter 2. The Prime Numbers
pk = pl1 q1 = q1 q2 · · · qn
pq1 = q2 q3 · · · qn .
where the pi and qi are all distinct primes, and the ri and si are natural numbers. Setting
these expressions equal, we have pr11 pr22 · · · prmm = q1s1 q2s2 · · · qm
sm
. Then by Lemma 2.1.1, p1 = qi
for some i. And since the q factors are distinct primes, repeated application of Lemma 2.1.1
yields pr11 = qisi . Then we have
s r
pr11 · · · prmm = q1s1 · · · qi−1
i−1 r1 i+1
p1 qi+1 · · · qksk .
14
2.1. The Fundamental Theorem of Arithmetic Chapter 2. The Prime Numbers
s
Again by Lemma 2.1.1, p2 = qj for some j. By the same reasoning, pr22 = qj j , so
s r s s
i−1 r1
pr11 pr22 · · · prmm = q1s1 · · · qi−1 i+1
p1 qi+1 j−1 r2
· · · qj−1 j+1
p2 qj+1 · · · qksk .
Repeating this process, we eventually replace each qlsl with prt t . Thus for every natural
number expressed as a product of powers of primes, the factorization is unique up to the
order of the factors.
Example 2.1.3. 12! can be expressed as
12! = 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12
= 2 · 3 · 22 · 5 · (2 · 3) · 7 · 23 · 32 · (2 · 5) · 11 · (22 · 3)
= 28 · 35 · 5 · 7 · 11.
Then a | b if and only if for all i ≤ m there exists j ≤ k such that pi = qj and ri ≤ sj .
Proof. ( =⇒ ) Suppose a | b. Then there exists some n ∈ N such that b = an. By the
above prime factorizations, q1s1 q2s2 · · · qksk = pr11 pr22 · · · prmm n. But by the Fundamental Theorem
(2.1.2), n also has a prime factorization, so we can write
where the tk are distinct primes. And since prime factorizations are unique up to order,
k = l and for each i ≤ l there exists some j ≤ k such that ti = qj and vi = sj . But
since every tvi i is a product of some pri i and a power of a prime factor of n, then pi = qj
and ri ≤ vi = sj . Therefore for every i ≤ m there is some j ≤ k such that pi = qj and ri ≤ sj .
( ⇒= ) Now suppose that for all i ≤ m there is some j ≤ k such that pi = qj and ri ≤ sj .
Then we can write
b = q1s1 · · · qksk
sm+1
= ps11 · · · psmm qm+1 · · · qksk
where qm+1 , . . . , qk are the leftover prime factors of b (if m < k). Then we can write
sm+1
b = pr11 pr22 · · · prmm · ps11 −r1 · · · psmm −rm qm+1 · · · qksk .
15
2.1. The Fundamental Theorem of Arithmetic Chapter 2. The Prime Numbers
Proof omitted.
Example 2.1.6. Find gcd(314 · 722 · 115 · 173 , 52 · 114 · 138 · 17).
Factors in common are 114 and 171 . So gcd = 114 · 17 = 248, 897.
Definition. A rational number is a real number of the form ab , where a and b are integers
and b is nonzero. The set of rational numbers is denoted Q.
2r 2r 2r
= p12r1 · · · pi−1i−1 72ri +1 pi+1i+1 · · · pj j .
But in the factorization of n2 every factor has even exponent. And since prime factorizations
are unique up to order of factors, 7m2 6= n2 .
√
Example 2.1.8. Show that 7 is irrational.
√ √
Proof. Suppose that 7 is rational. Then there exist integers a and b such that 7 = ab and
b 6= 0. Then
a2
7=
b2
=⇒ 7b = a2 .
2
But by Theorem
√ 2.1.7 there are no natural numbers such that 7b2 = a2 , a contradiction.
Therefore 7 is irrational.
First note that 1000 = 23 · 53 . Theorem 2.1.4 says that d must be composed of some subset
of the prime divisors of 1000, with exponents less than or equal to the exponents of 2 and
5. So d = 2a · 5b where a = 0, 1, 2, 3 and b = 0, 1, 2, 3. Therefore there are 16 choices in total
for d.
16
2.2. The Infinitude of Primes Chapter 2. The Prime Numbers
17
2.2. The Infinitude of Primes Chapter 2. The Prime Numbers
Theorem 2.2.6. There are infinitely many primes congruent to 3 (mod 4).
Proof. Suppose there is an na ≡ 3 (mod 4) such that it is the largest prime congruent
to 3 (mod 4). Then S = {ni | ni is prime and ni = 4ki + 3, ki ∈ Z} is finite. Define
t = 4n2 · · · na + 3 and take some ni ∈ S r {3}. Then ni | (n2 n3 · · · na + ni ). But since 3 is
the smallest prime of the form 4k + 3, ni - t. Therefore t has no prime factors of the form
4k + 3. Furthermore, by the Lemma, the product of primes of the form 4k + 1 is also of the
form 4k + 1; but t is not of this form, so t must be prime, a contradiction. Hence there are
infinitely primes of the form 4k + 3.
Dirichlet’s Theorem: For any a and b such that (a, b) = 1, there are infinitely many
primes of the form ak + b.
A proof will be given in Section 17.6.
18
2.3. Special Primes Chapter 2. The Prime Numbers
19
2.3. Special Primes Chapter 2. The Prime Numbers
Theorem 2.3.5. For any natural number n, there is a string of n consecutive composite
numbers.
2 | (n + 1)! + 2
3 | (n + 1)! + 3
etc.
Modern number theory has many famous questions related to the distribution of primes
among the natural numbers.
The Twin Primes Question: Are there infinitely many pairs of primes that differ from
one another by two?
Examples include 11 and 13, 29 and 31, 41 and 43, etc. The percentage of prime numbers
among the first n naturals seems to slowly decrease as n gets larger. Gauss and Legendre
n
conjectured that π(n) ≈ .
log(n)
π(n)
The Prime Number Theorem: lim n = 1.
n→∞
log(n)
Proof omitted.
The Goldbach Conjecture: Every positive, even natural number n > 2 can be written as
n = p + q, where p and q are prime.
20
Chapter 3
Linear Congruence
21
3.1. Modular Arithmetic Chapter 3. Linear Congruence
Example 3.1.2. Find the smallest nonnegative k such that 39453 ≡ k (mod 12).
3453 ≡ 3243+2·81+27+2·9+3
5 4 3 2
≡ 33 (33 )2 33 (33 )2 33
≡ 3 · 32 · 3 · 32 · 3
≡ 33 · 33 · 3
≡ 3 · 3 · 3 ≡ 3 (mod 12).
Step 1: If possible, find the smallest b > 0 such that a ≡ b (mod n) and 0 ≤ b < a. Then
by Theorem 1.1.15, ar ≡ br (mod n).
Step 2: r can be written as the sum of powers of 2. To find this sum, find the greatest power
of 2 less than r, say 2k1 , then add this to the greatest power of 2 less than r − 2k1 ,
say 2k2 . Continue adding greatest powers of 2. This will terminate since the ki are
positive integers (well-ordered). The process will yield r = 2k1 + 2k2 + . . . + 2kt ,
where k1 > k2 > · · · > kt . Note that if r is odd, kt = 0.
Step 5: Use the value of b2 found in Step 3 and t from Step 4 to find ar mod n.
Example 3.1.3. Let f (x) = 13x49 − 27x27 + x14 − 6. Show that f (98) ≡ f (−100) (mod 99).
22
3.1. Modular Arithmetic Chapter 3. Linear Congruence
Theorem 3.1.4. Let f (x) = an xn + . . . + a1 x + a0 with n > 0 and ai ∈ Z for all i. Let x1 ,
x2 and m > 0 be integers. If x1 ≡ x2 (mod m) then f (x1 ) ≡ f (x2 ) (mod m).
Corollary 3.1.5. If n = ak ak−1 · · · a1 a0 (in base 10, i.e. digits are the ai ) and m = ak +
ak−1 + . . . + a1 + a0 then 9 | n if and only if 9 | m.
Proof omitted.
Proof omitted.
Theorem 3.1.7. Suppose f (x) is a polynomial of degree n > 0 and an > 0. Then there is
an integer k such that if x > k then f (x) > 0.
Proof. Let f (x) = an xn + . . . + a0 for n > 0, an > 0. Then f (x) > an−1 xn−1 + . . . + a0 . Let k
be the greatest solution to an−1 xn−1 +. . .+a0 ; then an−1 k n−1 +. . .+a0 = 0. If there is no such
k, f (x) > 0 for all x. If k exists and if x > k, then f (x) > f (k) > an−1 k n−1 + . . . + a0 = 0.
Letting k 0 = dke + 1, we have an integer k 0 such that if x > k 0 , f (x) > 0.
Theorem 3.1.8. Suppose f (x) is a polynomial of degree n > 0 and an > 0. Then for any
M there is an integer k such that if x > k then f (x) > M .
Theorem 3.1.9. If f (x) is a polynomial of degree n > 0 with integer coefficients, then f (x)
is a composite number for infinitely many integers x.
Proof. Let f (x) = an xn + an−1 xn−1 + . . . a1 x + a0 with ai ∈ Z for all i between 0 and n. Let
g(x) be a divisor of f (x), where coefficients bi are integers. If bn > 0 then by Theorem 3.1.8,
for any M there is a k such that if x > k then g(x) > M . Letting M = 1, we have that for
an infinite number of integers x > k, g(x) > 1 which means f (x) is composite for an infinite
number of integers x > k. On the other hand, if bn < 0 then let h(x) = −g(x). Then h(x)
is also a factor of f (x), with positive leading coefficient, so the result follows.
23
3.1. Modular Arithmetic Chapter 3. Linear Congruence
Theorem 3.1.10. Given a ∈ Z and n ∈ N, there exists a unique t in the set {0, 1, 2, . . . , n −
1} such that a ≡ t (mod n).
Proof. Let a ∈ Z and n ∈ N. Then by the division algorithm there exist unique q, t ∈ Z
such that a = qn + t, where 0 ≤ t ≤ n − 1. Thus t ∈ {0, 1, 2, . . . n − 1}. And a − t = qn so
a ≡ t (mod n).
Definition. The set {0, 1, 2, . . . , n − 1} is called the canonical complete residue system
modulo n.
canonical: {0, 1, 2, 3}
other examples of complete residue systems: {−4, −3, −2, −1} and {0, 5, 10, 15}
Proof omitted.
Definition. A complete residue system mod n has one representative of each equivalence
class.
Theorem 3.1.13. For n ∈ N, any set {a1 , a2 , . . . , an } of integers for which no two are
congruent mod n is a complete residue system mod n.
Proof. Let A = {a1 , a2 , . . . , an } with ai 6≡ aj (mod n) for all i 6= j. By the Division Algo-
rithm, there exist qi , ri and qj , rj such that ai = qi n + ri , aj = qj n + rj , 0 ≤ ri , rj ≤ n − 1
and ri 6= rj . Since the n elements of A must each have n distinct corresponding rk , then
for each ak ∈ A there is some rk ∈ {0, 1, . . . , n − 1} such that ak ≡ rk (mod n). And by
Theorem 3.1.10, all integers are congruent modulo n to one of {0, 1, . . . , n − 1}. So by tran-
sitivity, every integer is congruent to some ak ∈ A. Hence A is a complete residue system
mod n.
24
3.2. Linear Congruence Chapter 3. Linear Congruence
(2) 2x ≡ 3 (mod 5)
x=4
(3) 4x ≡ 7 (mod 8)
there are no solutions
Theorem 3.2.2. Let a, b, n ∈ Z with n > 0. Then ax ≡ b (mod n) has a solution if and
only if there exist integers x, y such that ax + ny = b.
Proof. (implies) Suppose x is an integer satisfying ax ≡ b (mod n). Then ax − b = nk for
some k ∈ Z. Thus ax + (−k)n = b and by letting y = −k, we have x, y ∈ Z such that
ax + ny = b.
ax + ny = g
ax − g = −ny.
Multiplying by d, we get
axd − gd = −nyd
a(xd) − b = n(−yd).
25
3.2. Linear Congruence Chapter 3. Linear Congruence
(3) If ax ≡ b (mod n) has a solution, then there are exactly (a, n) solutions in the canonical
complete residue system mod n.
Proof omitted.
Example 3.2.5. Find a solution to the following system of congruences
x ≡ 3 (mod 17)
x ≡ 10 (mod 16)
x ≡ 0 (mod 15).
First, 15 | x so x = 15c for some c ∈ Z.
Then 15c ≡ 3 (mod 17) =⇒ c = 7, 24, 41, 58, 75, 92, 109, . . . , 262
And 15c ≡ 10 (mod 16) =⇒ c = 6, 22, 38, 54, 70, 86, 102, . . . , 262
Then a solution to the above system is 15 · 262 = 3930.
(3) All solutions to 24x ≡ 123 (mod 213) are then given by
213 24
41 24 x + k + 213 y − k = 123
3 3
213 24
24 41 x + k + 213 41 y − k = 123.
3 3
Example 3.2.7. Find a solution to the following system of congruences
x ≡ 1 (mod 2)
x ≡ 2 (mod 3)
x ≡ 3 (mod 4)
x ≡ 4 (mod 5)
x ≡ 5 (mod 6)
x ≡ 0 (mod 7).
26
3.2. Linear Congruence Chapter 3. Linear Congruence
A solution is x = 19.
x ≡ a (mod n)
x ≡ b (mod m)
nj + a = mk + b
a − b = mk − nj.
a − b = gk 0 k − gj 0 j
= g(k 0 k − j 0 j).
So g | (a − b).
a − b = (ny + mz)d
= nyd + mzd
a + n(−yd) = b + m(zd).
x ≡ a (mod n)
x ≡ b (mod m).
x ≡ a (mod n)
x ≡ b (mod m)
x0 ≡ x ≡ a (mod n)
x0 ≡ x ≡ b (mod m).
27
3.2. Linear Congruence Chapter 3. Linear Congruence
So
x0 − x ≡ 0 (mod n)
x0 − x ≡ 0 (mod m).
Then we have n | (x0 − x), m | (x0 − x) and (n, m) = 1 so by Theorem 1.3.10, nm | (x0 − x).
Thus x0 ≡ x (mod nm) so x is unique mod nm.
x ≡ a1 (mod n1 )
x ≡ a2 (mod n2 )
..
.
x ≡ aL (mod nL )
Proof. The base case L = 2 is given by Theorem 3.2.9. Assume for all l ≤ L, the system has
a unique solution mod n1 · · · nl , say x. Then consider the system
x ≡ a1 (mod n1 )
..
.
x ≡ aL+1 (mod nL+1 ).
Since gcd(a1 , . . . , aL+1 ) = 1 and 1 | (ai − aL+1 ) for all 1 ≤ i ≤ L, x is also a solution to
this new system of congruences. Suppose x0 is another solution to the L + 1 system. Then
as shown in the proof of Theorem 3.2.9, ni | (x0 − x) for all 1 ≤ i ≤ L + 1. And since
gcd(n1 , . . . , nL+1 ) = 1, Theorem 1.3.10 implies that n1 n2 · · · nL nL+1 | (x0 − x). Therefore
x0 ≡ x (mod n1 n2 · · · nL nL+1 ) which means x is unique modulo n1 n2 · · · nL nL+1 .
Example 3.2.11. To solve 3x ≡ 79 (mod 163), find a number f (3) so that 3 · f (3) ≡ 1
(mod 163).
28
Chapter 4
29
4.1. Fermat’s Little Theorem Chapter 4. Fermat’s and Euler’s Theorems
30
4.1. Fermat’s Little Theorem Chapter 4. Fermat’s and Euler’s Theorems
Proof. Let am ≡ 1 (mod n). By the Division Algorithm, there exist q, r ∈ Z such that
m = qk + r, where 0 ≤ r < k. Then
am ≡ aqk+r
≡ aqk ar
≡ (ak )q ar
≡ 1q ar
≡ ar (mod n).
So ar ≡ 1 (mod n). But r < k and k is the smallest natural number such that ak ≡ 1
(mod n), so r = 0. Hence we have that k | m. The entire argument is reversible.
Theorem 4.1.8. Let p be prime and a ∈ Z with (a, p) = 1. Then {a, 2a, . . . , pa} is a
complete residue system modulo p.
Proof. Since p is prime, for all 1 ≤ i ≤ p − 1, (i, p) = 1. And by Theorem 1.3.11, (ia, p) = 1.
Consider ia and ja for i 6= j, 1 ≤ i, j ≤ p − 1. Since i 6≡ j (mod p), ia 6≡ ja (mod p) by
the contrapositive of Theorem 1.3.12. Thus {a, 2a, . . . , (p − 1)a, pa} are pairwise incongruent
mod p. So by Theorem 3.1.13, the set is a complete residue system.
Proof. By Theorem 4.1.8, {a, 2a, . . . , (p − 1)a, pa} is a complete residue system mod p. Thus
{a, 2a, . . . , (p − 1)a, pa} ∼
= {1, 2, . . . , p − 1, 0} by congruence mod p. Since pa ≡ 0 (mod p),
for the remaining {a, 2a, . . . , (p − 1)a} we have that for each i, 1 ≤ i ≤ p − 1, there is some
j, 1 ≤ j ≤ p − 1, such that ia ≡ j (mod p). Thus by properties of congruence,
Theorem 4.1.10 (Fermat’s Little Theorem I). If p is prime and a ∈ Z with (a, p) = 1, then
ap−1 ≡ 1 (mod p).
1 · 2 · · · (p − 1) · ap−1 ≡ 1 · 2 · · · (p − 1) · 1 (mod p)
ap−1 ≡ 1 (mod p).
Theorem 4.1.11 (Fermat’s Little Theorem II). If p is prime and a is any integer, then
ap ≡ a (mod p).
31
4.1. Fermat’s Little Theorem Chapter 4. Fermat’s and Euler’s Theorems
Proof. Suppose (a, p) = 1. Then by FLT (I), ap−1 ≡ 1 (mod p) and left multiplication by a
gives us ap ≡ a (mod p). Now suppose p | a. Then a ≡ 0 (mod p) and ap ≡ 0 (mod p), so
ap ≡ a (mod p).
Theorem 4.1.12. Let p be prime and a ∈ Z. If (a, p) = 1 and k = ordp (a) then k | (p − 1).
Proof. By Fermat’s Little Theorem I (4.1.10), ap−1 ≡ 1 (mod p). And by Thm. 4.1.7,
k | (p − 1).
Primality Test: For large N , test primality by computing aN −1 mod N .
It has been shown that aN −1 ≡ 1 (mod N ) for a = 2, 3, 5, 7, 11, 13, . . . so it is likely that
N is prime. However, this is not a proof.
341 is the smallest number such that 2N ≡ 2 (mod N ). But 3341 6≡ 3 (mod 341) so 341
can be shown composite anyway.
Example 4.1.16. For all a ∈ Z, a561 ≡ a (mod 561). But 561 = 3 · 11 · 17.
Theorem 4.1.17. Let m, n ∈ N with (m, n) = 1 and a any integer. If x ≡ a (mod m) and
x ≡ a (mod n) then x ≡ a (mod mn).
Proof. Let x ≡ a (mod m) and x ≡ a (mod n). Then m | (x − a) and n | (x − a). And since
(m, n) = 1, by Theorem 1.3.10, mn | (x − a). Hence x ≡ a (mod mn).
32
4.2. Euler’s and Wilson’s Theorems Chapter 4. Fermat’s and Euler’s Theorems
Example 4.2.1.
n φ(n)
1 1
2 1
3 2
4 2
5 4
6 2
7 6
8 4
9 6
10 4
12 4
15 8
21 12
35 24
Theorem 4.2.2. Let a, b, n ∈ Z with n > 0. If a ≡ b (mod n) and (a, n) = 1 then (b, n) = 1.
Proof. Since a ≡ b (mod n), a = nk + b for some k ∈ Z. And since (a, n) = 1, by Theo-
rem 1.3.6 there exist x, y ∈ Z such that ax + ny = 1. Substituting for a, we get
1 = ax + ny
= (nk + b)x + ny
= nkx + bx + ny
= bx + n(kx + y).
Theorem 4.2.3. Let n ∈ N and x1 , x2 , . . . , xφ(n) be the distinct natural numbers ≤ n such
that (xi , n) = 1. Let a be a nonzero integer with (a, n) = 1 and let i and j be distinct natural
numbers ≤ φ(n). Then axi 6≡ axj (mod n).
Proof. Suppose axi ≡ axj (mod n). Then since (a, n) = 1, by Theorem 4.1.1 xi ≡ xj
(mod n). Assume without loss of generality that xi > xj . Then xi − xj is a natural number
< n. But xi − xj ≡ 0 (mod n), so n | (xi − xj ), a contradiction. Therefore axi 6≡ axj
(mod n).
Theorem 4.2.4 (Euler). If a, n ∈ Z with n > 0 and (a, n) = 1, then aφ(n) ≡ 1 (mod n).
33
4.2. Euler’s and Wilson’s Theorems Chapter 4. Fermat’s and Euler’s Theorems
Proof. Let X = {x1 , x2 , . . . , xφ(n) } be the set of distinct natural numbers ≤ n such that for
all i, 1 ≤ i ≤ φ(n), (xi , n) = 1. Consider the quantity ax1 ·ax2 · · · axφ(n) . By Theorem 1.3.11,
for every i, 1 ≤ i ≤ φ(n), (axi , n) = 1. Thus for each i, there is some j, 1 ≤ j ≤ φ(n), such
that axi ≡ xj (mod n). And so we have (by commuting)
φ(15) = 8
1249 ≡ 449 · 349 (mod 15)
449 ≡ 448 · 4
≡ (48 )6 · 4
≡ 16 · 4 by Euler’s Theorem (4.2.4)
≡ 4 (mod 15)
349 ≡ (37 )7
≡ (34 · 33 )7
≡ (81 · 33 )7 ≡ (6 · 33 )7
≡ (2 · 34 )7 ≡ (2 · 6)7 ≡ 127
≡ 37 · 47
≡ 12 · 47 by same steps above
≡ 3 · 48
≡ 3 · 1 by Euler’s Theorem (4.2.4)
≡ 3 (mod 15)
φ(27) = 18
139112 ≡ 4112
≡ 4108 · 44
≡ (418 )6 · 44
≡ 16 · 44 by Euler’s Theorem (4.2.4)
≡ 28 ≡ 25 · 23
≡ 32 · 8
≡ 5 · 8 ≡ 40 ≡ 13 (mod 27).
34
4.2. Euler’s and Wilson’s Theorems Chapter 4. Fermat’s and Euler’s Theorems
Euler’s Theorem has an important connection to abstract algebra in the proof of La-
grange’s Theorem. Let U (n) = {x1 , x2 , . . . , xφ(n) }. U (n) is closed under multiplication, has
association, an identity and inverses, so in fact U (n) is a group. Let H = {a, a2 , . . . , aordn (a) }.
It turns out that H is a subgroup of U (n), denoted H ≤ U (n). Lagrange’s Theorem says
that the order (size) of H divides the order of U (n). If we note that |H| = ordn (a) and
|U (n)| = φ(n), and recall that aφ(n) ≡ 1 (mod n) by Euler’s Theorem (4.2.4), then the result
follows from an application of Theorem 4.1.7.
Theorem 4.2.7. Let p be prime and a ∈ Z such that 1 ≤ a < p. Then there exists a unique
b ∈ N, 1 ≤ b < p, such that ab ≡ 1 (mod p).
Proof. Since a < p, (a, p) = 1. So by Fermat’s Little Theorem I (4.1.10), ap−1 ≡ 1 (mod p).
Let b = ap−2 . Then ab = ap−1 ≡ 1 (mod p). Now suppose c is another inverse of a modulo p.
Then ac ≡ 1 (mod p) so ab ≡ ac (mod p). And since (a, p) = 1, by Theorem 1.3.12, b ≡ c
(mod p). Hence the inverse of a mod p is unique.
Definition. Let p be prime and ab ≡ 1 (mod p). Then a and b are inverses modulo p.
Note that 1 and p − 1 are their own inverses mod p.
Theorem 4.2.8. Let p be prime and a, b be inverses mod p with 1 < a, b < p − 1. Then
a 6= b.
Proof. Let 1 ≤ a, b ≤ p − 1. Suppose ab ≡ 1 (mod p) and a = b. Then a2 ≡ 1 (mod p). So
p | (a2 − 1) which means either p | (a − 1) or p | (a + 1). This is equivalent to a ≡ 1 (mod p)
or a ≡ −1 (mod p), so a must be either 1 or p − 1.
Theorem 4.2.9. If p > 2 is prime then (p − 2)! ≡ 1 (mod p).
Proof. Let S = {2, 3, 4, . . . , p − 2}. By p > 2, |S| is even. And by Theorems 4.2.7 and 4.2.8,
for each a ∈ S there exists a unique b ∈ S such that ab ≡ 1 (mod p) and a 6= b. Then
(p − 2)! ≡ 2 · 3 · · · (p − 2)
≡ (2 · 2−1 ) · (3 · 3−1 ) · · · (p − 2)(p − 2)−1
≡ 1 · 1 · · · 1 ≡ 1 (mod p).
35
4.2. Euler’s and Wilson’s Theorems Chapter 4. Fermat’s and Euler’s Theorems
2p−1 ≡ 1 (mod p2 )
(p − 1)! ≡ −1 (mod p2 )
The only known primes that satisfy these are p = 1093, 3511.
36
Chapter 5
Definition. Public key codes are codes where the encoding method is publicly known, but
the decryption method is unknown.
The most prominent example of a public key code is RSA encryption, which is based on
the idea that factoring large numbers is difficult.
Definition. RSA encryption is the public key encryption developed by Rivest, Shamir and
Adleman that consists of a large product of primes (also large), where the product is known
but the factorization is not.
The following theorems are the basis for the RSA encryption system.
Theorem 5.0.1. If p and q are distinct primes and W ∈ N with (W, pq) = 1, then
Proof. Since p, q are prime, (W, p) = (W, q) = 1. Then Fermat’s Little Theorem I (4.1.10)
gives us
W p−1 ≡ 1 (mod p)
W q−1 ≡ 1 (mod q).
And if the W terms are raised to any integer power, this remains 1:
Theorem 5.0.2. Let p, q be distinct primes and k, W ∈ N with W < pq. Then
37
Chapter 5. Public Key Cryptography
Proof. Suppose (W, pq) = 1. Then by Theorem 5.0.1, W (p−1)(q−1) ≡ 1 (mod pq), so (W (p−1)(q−1) )k ≡
1 (mod pq) and W 1+k(p−1)(q−1) ≡ W (mod pq). Now suppose without loss of generality that
p | W . Then W ≡ 0 (mod p) and W 1+k(p−1)(q−1) ≡ 0 (mod p), so W 1+k(p−1)(q−1) ≡ W
(mod p). Note that W and p are still relatively prime, so by Fermat’s Little Theorem I
(4.1.10),
W 1+k(p−1)(q−1) ≡ W · W k(p−1)(q−1)
≡ W · (W q−1 )k(p−1)
≡ W · 1 ≡ W (mod q).
Proof omitted. This provides a much simpler route to proving the previous two theorems
using Euler’s function.
Theorem 5.0.4. Let p, q be distinct primes and E ∈ N such that (E, (p − 1)(q − 1)) = 1.
Then there exists some D, y ∈ N such that ED = 1 + y(p − 1)(q − 1).
Proof. Since (E, (p − 1)(q − 1)) = 1, by Theorem 1.3.6 there exist natural numbers j, k
such that Ej + (p − 1)(q − 1)k = 1. Rearranging and letting D = j, y = −k, we have
ED = 1 + y(p − 1)(q − 1).
Theorem 5.0.5. Let p, q be distinct primes, W ∈ N with W < pq, and E, D, y ∈ N such
that ED = 1 + y(p − 1)(q − 1). Then W ED ≡ W (mod pq).
(3) Choose natural numbers E and D such that (E, (p − 1)(q − 1)) = 1 and ED ≡ 1
(mod (p − 1)(q − 1)), which is possible by Theorem 5.0.4
(4) Let W be the natural number to be encrypted/decrypted, where W < pq. To encrypt,
raise W to the power E mod pq. W E is the encrypted message
(5) To decrypt, raise W E to the power D mod pq. By Theorem 5.0.5, W ED ≡ W (mod pq),
so we obtain the cleartext W .
Example 5.0.6.
38
Chapter 5. Public Key Cryptography
1456 = 47(30) + 46 1 = 47 - 46
47 = 46(1) + 1 = 47 - (1456 - 47(30))
= 47(31) - 1456
so D = 31. Suppose the encrypted message is W E = 570.
Then (W E )D = 57031 ≡ W (mod 1537) by Theorem 5.0.5
Note that 570 = 2 · 3 · 5 · 19. Then
So W = 131.
In 2009, a team factored a 768-bit number N = pq, effectively “cracking” 768-bit en-
cryption. They used the Number Field Sieve, a factoring algorithm with runtime dependent
39
Chapter 5. Public Key Cryptography
on the size of N . Another factoring algorithm is the Elliptic Curve Factorization Method,
whose runtime depends only on the size of the 2nd largest prime factor of N .
RSA encryption is generally slower than other public key systems. Computer encryptions
generally utilize AES encryption, but the initial encryption key is encoded and sent with
RSA.
40
Chapter 6
In this chapter, we begin studying the general form of polynomial congruences, that is,
equations of the form f (x) ≡ 0 (mod n) for f (x) a polynomial with integer coefficients. We
saw an example in Theorem 5.0.5: manipulating the equation xQ − x ≡ 0 (mod n) is key to
wielding the RSA encryption system.
41
6.1. Finding Roots Chapter 6. Higher Order Congruence
42
6.1. Finding Roots Chapter 6. Higher Order Congruence
for some polynomial g(x) with integer coefficients with deg g = n − 1. Suppose a ∈ Z such
that f (a) ≡ 0 (mod p) and a 6≡ c (mod p). Then
but since p - (a − c) and p is prime, Theorem 1.3.9 implies p | g(a). That is, g(a) ≡ 0
(mod p). Now since deg g = n − 1, by the inductive hypothesis g(x) has at most n − 1 roots
mod p. Therefore there are only n − 1 choices for such roots a 6≡ c (mod p), and so there
are at most n roots of f (x) mod p.
Corollary 6.1.5. Let p be prime and k | (p−1). Then xk ≡ 1 (mod p) has exactly k distinct
roots mod p.
xp−1 − 1 = xkq − 1
= (xk − 1)(xk(q−1) + xk(q−2) + . . . + 1).
Lemma 6.1.6. Let a, r ∈ Z such that a, r ≥ 2 and let q be prime. Then there exists a prime
p such that ordp (a) = q r .
Lemma 6.1.7. Let n be an integer with prime factorization n = pr11 · · · prss . Then for any
integer a > 1 there exist infinitely many squarefree integers m such that n | ordm (a). Fur-
thermore, there exists an integer b > 1 such that a 6≡ b (mod m) and n | ordm (b).
43
6.2. Primitive Roots Chapter 6. Higher Order Congruence
44
6.2. Primitive Roots Chapter 6. Higher Order Congruence
Thus ` | rk but since (k, `) = 1, so Theorem 1.3.9 gives ` | r. Repeating the argument with
a, we get k | r, so Theorem 1.3.10 implies k` | r. Hence r = k` as claimed.
Theorem 6.2.5. Let p be a prime. Then there exists a primitive root modulo p.
Proof. When p = 2, a = 1 is a primitive root mod 2 so we may assume p is odd. This allows
us to write p − 1 as a product of prime powers:
p − 1 = q1n1 · · · qknk .
ni ni −1
By Corollary 6.1.5, for each 1 ≤ i ≤ k, xqi −1 has exactly qini roots and xqi −1 has exactly
qini −1 roots, so it follows that there are qini − qini −1 = qini −1 (qi − 1) distinct elements mod p
ni ni −1
satisfying aqi ≡ 1 (mod p) and aqi 6≡ 1 (mod p). Thus, each of these a has ordp (a) = qini .
For each 1 ≤ i ≤ k, choose such an ai having order qini . Set a = a1 · · · ak . Then since the
primes qi are pairwise relatively prime, induction with Lemma 6.2.4 shows that
ordp (a) = ordp (a1 · · · ak ) = ordp (a1 ) · · · ordp (ak ) = q1n1 · · · qknk = p − 1.
Theorem 6.2.6. For a prime p, there are precisely φ(p − 1) primitive roots modulo p.
Proof. We showed in Theorem 6.2.5 that primitive roots exist so now it remains to count
them. Let g be a primitive root mod p. By Theorem 6.2.3, {0, g, g 2 , . . . , g p−1 } is a complete
p−1
residue system mod p. Moreover, it follows from Theorem 4.1.7 that ordp (g j ) = (j,p−1) , so
j
for 1 ≤ j ≤ p − 1, g is a primitive root mod p precisely when (j, p − 1) = 1. By definition
of the φ-function, there are exactly φ(p − 1) such exponents j.
Corollary 6.2.7. For any number n, if there exists a primitive root modulo n then there are
precisely φ(φ(n)) primitive roots modulo n.
Example 6.2.8. For n = 8, the set {1, 3, 5, 7} is a complete residue system mod 8. Notice
that for each a ∈ {1, 3, 5, 7}, a2 ≡ 1 (mod 8) so the order of any element in a complete
residue system mod 8 is at most 2. Therefore none can have order φ(8) = 4, so no primitive
roots mod 8 exist.
Lemma 6.2.9. Let n be odd. Then there exists a primitive root modulo n if and only if there
exists a primitive root modulo 2n.
45
6.2. Primitive Roots Chapter 6. Higher Order Congruence
Proof. Since n is odd, φ(2n) = φ(n). The proof breaks into even and odd cases. If g is odd,
g k ≡ 1 (mod 2) holds trivially for all k ≥ 1, so by the Chinese Remainder Theorem (3.2.10),
g k ≡ 1 (mod 2n) if and only if g k ≡ 1 (mod n). In particular, g is a primitive root mod 2n
exactly when g is a primitive root mod n.
On the other hand, note that ak ≡ 1 (mod 2n) is only possible if a is odd. So a primitive
root mod 2n determines a primitive root mod n, but a primitive root mod n may be even.
If a is a primitive root mod n and odd, a is also a primitive root mod 2n, while if a is even,
a + n is a primitive root mod 2n.
Lemma 6.2.10. Suppose p | n for an odd prime p. Then if a primitive root modulo n exists,
either n = pk or n = 2pk for some k ≥ 1.
Proof. Write n = pk m for k ≥ 1 and m ∈ Z such that p - m. Assume m ≥ 3. By Euler’s
Theorem (4.2.4), for any a ∈ Z such that (a, n) = 1 we have
φ(n) φ(pk ) φ(pk )
a 2 ≡ (aφ(m) ) 2 ≡1 2 =1 (mod m)
φ(n) k φ(m) φ(m)
and a 2 ≡ (aφ(p ) ) 2 ≡1 2 =1 (mod pk ).
φ(n)
So by the Chinese Remainder Theorem (3.2.10), a 2 ≡ 1 (mod n). This shows that ordn (a)
divides φ(n)
2
, so in particular a cannot have order φ(n). Therefore if a primitive root mod n
exists, m is at most 2.
Lemma 6.2.11. For k ≥ 3, there are no primitive roots modulo 2k .
Proof. For k = 3, this was shown in Example 6.2.8. We claim that for all k ≥ 3 and odd a,
k−2
a2 ≡1 (mod 2k ).
We will show this by induction. Again, the k = 3 case follows from Example 6.2.8. Now
k−2
assuming the statement holds for k, write a2 = 1 + 2k x for some x ∈ Z. Then
k−1 k−2
a2 = (a2 )2 = (1 + 2k x)2 = 1 + 2k+1 x + 22k x2 ≡ 1 (mod 2k+1 ).
Thus the claim holds, but 2k−2 is always less than φ(2k ) = 2k−1 when k ≥ 3, so no primitive
roots mod 2k can exist.
Lemma 6.2.12. Let p > 2 be prime and k ≥ 1. Then there exist a primitive root modulo
pk .
Proof. By Theorem 6.2.5, there exists a primitive root mod p; call it g. First suppose
k
g p−1 6≡ 1 (mod p2 ). We claim that g φ(p ) 6≡ 1 (mod pk+1 ) for all k ≥ 1. By Euler’s Theorem
k
(4.2.4), write g φ(p ) = 1 + pk x where by induction we may assume p - x. Then
k+1 ) k )p
g φ(p = g φ(p = (1 + pk x)p ≡ 1 + pk+1 x 6≡ 1 (mod pk+2 ).
46
6.2. Primitive Roots Chapter 6. Higher Order Congruence
so either ` = φ(pk+1 ) or ` = φ(pk ). However, the previous paragraph shows that ` = φ(pk )
is impossible, so we must have ` = ordpk+1 (g) = φ(pk+1 ). Hence g is a primitive root mod
pk+1 .
Now suppose g p−1 ≡ 1 (mod p2 ). Notice that in this case, g + p is a primitive root mod
p and satisfies
But p does not divide g, so 1−g p−2 g 6≡ p (mod p2 ). Therefore the argument in the proceding
paragraph can be repeated to show g + p is a primitive root mod pk for all k ≥ 1.
These lemmas imply the following characterization of numbers n for which there exist
primitive roots modulo n. Combined with Corollary 6.2.7, this fully describes primitive
roots.
Theorem 6.2.13. Let n ≥ 2. Then there exists a primitive root modulo n if and only if n
has one of the following forms:
(iii) n = 2k for k = 1, 2.
Surprisingly, Artin’s Conjecture is not known to hold for a single integer a, but it is known
that there are at most two primes for which the conjecture does not hold. For example, at
least one of 3, 5 or 7 is a primitive root modulo every other prime, but it is currently unknown
for which the statement holds.
47
6.3. Power Residues Chapter 6. Higher Order Congruence
Example 6.3.1. Square residues, i.e. solutions to x2 ≡ a (mod m), are called quadratic
residues. These will be fully characterized by Gauss’s beautiful quadratic reciprocity laws in
the next chapter.
In general, the discrete logarithm problem is very difficult to solve, and especially difficult
to solve quickly. It is an open problem in computer science to find a fast algorithm for solving
the discrete logarithm problem mod m. However, when m = p is prime, the problem becomes
simpler.
Example 6.3.3. Let p be prime. Then by Theorem 6.2.5 there exist a primitive root mod
p, say g, and {0, g, g 2 , . . . , g p−1 } is a complete residue system mod p. Thus any x ∈ Z can be
written x ≡ g k (mod p) for some k ≥ 1 and any power xn can be written xn ≡ (g k )n = g kn
(mod p). Similarly, a ∈ Z can be written a ≡ g b (mod p) for some b ≥ 1. Therefore the
discrete logarithm problem mod p is of the form
g kn ≡ g b (mod p).
kn ≡ b (mod p − 1)
Theorem 6.3.4. Suppose p is prime and (a, p) = 1. Then xn ≡ a (mod p) has (n, p − 1)
solutions if
p−1
a (n,p−1) ≡ 1 (mod p)
and no solutions otherwise.
48
6.3. Power Residues Chapter 6. Higher Order Congruence
Example 6.3.5. Consider the discrete logarithm problem x5 ≡ 6 (mod 101). Since p = 101
is prime, Theorem 6.3.4 applies so we should first see if
100
6 5 = 620 ≡ 1 (mod 101).
Note that
Therefore Theorem 6.3.4 says that x5 ≡ 6 (mod 101) has 5 solutions. One can work out
that the five solutions are:
Theorem 6.3.6. Suppose m = 2, 4, pk or 2pk for p an odd prime and k ≥ 1. Then for a ∈ Z
with (a, m) = 1, the equation xn ≡ a (mod m) has (n, φ(m)) solutions if
φ(m)
a (n,φ(m)) ≡ 1 (mod m)
49
Chapter 7
Reciprocity
50
7.1. Quadratic Residues Chapter 7. Reciprocity
Theorem 7.1.1. For an odd prime p, half of the numbers not congruent to 0 (mod p) in
any complete residue system are quadratic residues mod p and half are not.
Proof. Let A = {a1 , a2 , . . . , ap } be a complete residue system mod p with 0 ≡ a1 < a2 <
· · · ap ≡ p − 1 (mod p). By Theorem 3.1.10, A ∼ = C = {0, 1, . . . , p − 1}. Observe that
12 ≡ (p − 1)2 ≡ 1 (mod p)
22 ≡ (p − 2)2 ≡ 4 (mod p)
etc.
Proof. For each i ∈ {1, . . . , p − 1}, let i−1 denote its inverse mod p. In other words, for
each i, ii−1 ≡ 1 (mod p). So i(i−1 a) ≡ a (mod p). But since a is a quadratic non-residue,
i 6= i−1 a. Thus (p − 1)! = 1 · 2 · · · p − 1, which, as we have shown, divides up into pairs
p−1
i(i−1 a), each of which is congruent to a (mod p). And there are p−12
pairs, so (p − 1)! ≡ a 2
(mod p). Finally, by Wilson’s Theorem (4.2.10),
p−1
−1 ≡ (p − 1)! ≡ a 2 (mod p).
51
7.1. Quadratic Residues Chapter 7. Reciprocity
52
7.1. Quadratic Residues Chapter 7. Reciprocity
Corollary 7.1.6. Suppose p is an odd prime and a, b ∈ Z with (a, p) = (b, p) = 1. Then
if both a and b are quadratic residues or both non-residues, then ab is a quadratic residue.
Otherwise, ab is a quadratic non-residue.
Proof omitted.
The Legendre symbol can be stated in algebraic terms. Let G = (Zp , ·) and let H
be the set of quadratic residues
modulo p. Then H is a subgroup of G. Define the map
χ : G → {1, −1} by χ(a) = ap Then χ is a group homomorphism with ker(χ) = H.
Theorem 7.1.8. Let p be an odd prime, a ∈ Z with p - a and r1 , . . . , r p−1 be the represen-
2
tatives of a, 2a, . . . , p−1
2
a in the complete residue system
p−1 p−1
− , . . . , −1, 0, 1, . . . , .
2 2
ia ≡ ja
ia − ja ≡ 0 (mod p)
and since (a, p) = 1, i − j ≡ 0 which is a contradiction since i 6= j and n they are in the same
o
p−1
residue system. Thus for all 1 ≤ i < j ≤ 2 , ri 6≡ rj (mod p). So |r1 |, |r2 |, . . . , |r | =
p−1
2
1, 2, . . . , p−1
2
up to order of elements. Let g = the number of negative r i . Then we have
g p−1
that r1 r2 · · · r p−1 ≡ (−1) 2
! (mod p).
2
53
7.2. Quadratic Reciprocity Chapter 7. Reciprocity
p−1 p−1 g p−1
a 2 ! ≡ (−1) ! (mod p).
2 2
p−1
And since none of 1, 2, . . . , p−1
2
are congruent
to p, Theorem 1.3.12 says that a 2 ≡ (−1)g
(mod p). Thus by Euler’s Criterion, ap ≡ (−1)g (mod p).
54
7.2. Quadratic Reciprocity Chapter 7. Reciprocity
Lemma 7.2.4 (Gauss’s Lemma II). Suppose that p is an odd prime, a ∈ Z and (a, p) = 1.
p−1
2
a S
X 2ka
Then = (−1) , where S = .
p k=1
p
p−1 p−1
2 2
2ka j k
(−1)b c . By Theorem 7.2.3,
X Y 2ka
2ka
Proof. Let S = . Then (−1)S = p
p
is
k=1
p k=1
even if and only if rk is positive. So for each positive rk , (−1)b p c = 1. Thus (−1)S =
2ka
(−1)b p c = (−1)g , where g is the number of negative r’s. Hence by Gauss’s Lemma
Y 2ka
k:rk even
S g a
I (7.2.1), (−1) = (−1) = p
.
3
Example 7.2.5. Find a formula for .
p
By Gauss’s Lemma II (7.2.4),
P p−1
3
= (−1) k=1 b p c .
2 6k
p
Since 1 ≤ k ≤ p−1 2
, 6 ≤ 6k < 3p. Suppose p ≡ 1 (mod 12) =⇒ p = 12j + 1. Then
6k
= 1 when p ≤ 6k < 2p
p
p p
≤k<
6 3
12j + 1 12j + 1
≤k<
6 3
12j + 1 12j + 1 + 5 12j + 1 − 1 12j + 1
< ≤k≤ < .
6 6 3 3
j k
12j+1−1 12j+1+5
So g = 3
− = 4j − (2j + 1) + 1 = 2j, which is even. And if 6k = 0, 2, it is
6 p
3
even. Hence = 1 if p ≡ 1 (mod 12) (this is a partial solution; a full solution is given
p
in Section 7.3).
55
7.2. Quadratic Reciprocity Chapter 7. Reciprocity
Lemma 7.2.6 (Gauss’s Lemma III). Suppose a ∈ Z, (a, p) = 1 and a is odd. Then
P p−1
a
= (−1) k=1 b p c .
2 ka
p
Proof. By Theorem 7.1.5, ap is multiplicative, so
a+p
a a+p 2 2
= =
p p p p
( a+p
$ %
a+p P p−1
2
2k 2 )
k=1 p
2
= (−1)
p
P p−1
= (−1)
2
k=1 b ka+kp
p c
P p−1
= (−1)
2
k=1 b kap + kpp c
P p−1
= (−1)
2
k=1 b kap c+k
P p−1 P p−1
= (−1)
2
k=1 b kap c (−1) 2
k=1 k
p−1
P p−1
2
b kap c (−1) ( 2 )( p−1
2 +1)
= (−1) k=1 2
P p−1 2
= (−1)
2
k=1 b kap c (−1) p 8−1 .
p−1
2
P jkk
Take a = 1. Then 1 ≤ k < p implies p
< 1. So
k=1
P p−1
a 2 p2 −1
(−1) k=1 b p c
2 k
1= = (−1) 8
p p
2 p2 −1
1= (−1) 8 .
p
P p−1
a
= (−1) k=1 b p c .
2 ka
Hence
p
Definition. A lattice point is a point (x, y) ∈ R2 with x, y ∈ Z.
p−1
Theorem 7.2.7. Let p, q be distinct odd primes and 1 ≤ j ≤ 2
. Then jthe knumber of
q
lattice points (j, y) that lie above the x-axis and below the line y = px equals jqp .
j k
jq jq
Proof. Substituting j = x, we have a line y = p
and p
simply represents the number of
jq
positive integers less than p
. Hence this is the number of lattice points.
56
7.2. Quadratic Reciprocity Chapter 7. Reciprocity
q−1
Theorem 7.2.8. Let p, q be distinct odd primes and 1 ≤ k ≤ 2
. Then the
j number
k of
lattice points (x, k) that lie to the right of the y-axis and left of y = pq x equals kp
q
.
Proof. Switching the order of x and y and following the previous proof gives the desired
result.
p−1 q−1
2 2
X jq X kp p−1 q−1
Theorem 7.2.9. + = .
j=1
p k=1
q 2 2
There are p−1 choices for x and q−1 choices for y, so there are p−1
q−1
2 2 2 2
total lattice points
q
in the rectangle. And if we draw the line y = p x through the rectangle, we can count the
total number of lattice points on either side of the line by the previous two theorems:
p−1 q−1
2 2
X jq X kp p−1 q−1
+ = .
j=1
p k=1
q 2 2
Theorem 7.2.10 (Law of Quadratic Reciprocity I). If p and q are distinct primes, then
p q p−1 q−1
= (−1) 2 · 2 .
q p
q p
P p−1 P q−1
= (−1)
2
j=1 b jqp c + 2
k=1 b kpq c
p−1 q−1
· 2
which by Theorem 7.2.9 is (−1) 2 .
Theorem 7.2.11 (Law of Quadratic Reciprocity II). If p and q are distinct primes, then
q
if p ≡ 1 (mod 4) or q ≡ 1 (mod 4)
p
p
=
q q
− if p ≡ q ≡ 3 (mod 4).
p
57
7.2. Quadratic Reciprocity Chapter 7. Reciprocity
Proof. Suppose p ≡ 1 (mod 4), or p = 4k + 1, and note that no generality is lost (case is
the same for q ≡ 1 (mod 4)). By Quadratic Reciprocity I (7.2.10),
p q p−1 q−1 4k q−1
= (−1) 2 · 2 = (−1) 2 · 2
q p
q−1
= (−1)2· 2
= 1 since q − 1 is even.
So pq = pq . Now suppose p ≡ q ≡ 3 (mod 4), or p = 4k + 3, q = 4l + 3. By Quadratic
Reciprocity I (7.2.10),
p q p−1 q−1 4k+2 4l+2
= (−1) 2 · 2 = (−1) 2 · 2 = (−1)(2k+1)(2l+1)
q p
= (−1)4kl+2k+2l+1
= −1 since 4kl + 2k + 2l + 1 is odd.
So q = − pq .
p
58
7.3. Applications of Quadratic Reciprocity Chapter 7. Reciprocity
59
7.3. Applications of Quadratic Reciprocity Chapter 7. Reciprocity
Theorem 7.3.2. (
3 1 if p ≡ 1 or 11 (mod 12)
=
p −1 if p ≡ 5 or 7 (mod 12).
Proof. Suppose
p= 12k + 1. Then clearly p ≡ 1 (mod 4) so by Quadratic Reciprocity II
(7.2.11), p3 = 3 , and since 12k + 1 ≡ 1 (mod 3), p3 = 31 = 1.
p
x2 −3
Theorem 7.3.3. Suppose x > 1 is odd with 3 - x and let N = 2
. Then there is a prime
p | N so that p ≡ 11 (mod 12).
2
Proof. Let N = x 2−3 . Note that x odd implies x ≡ 1, 5, 7, 11, 13, 17, 19 or 23 (mod 24).
So x2 ≡ 1 (mod 24). Then x2 − 3 ≡ −2 ≡ 22 (mod 24) and we can divide through by
(2, 24) = 2, giving
x2 − 3
≡ 11 (mod 12).
2
2
Now suppose p| N is prime. Then N ≡ 0 (mod p) =⇒ x 2−3 ≡ 0 (mod p). So x2 ≡ 3
(mod p), or p3 = 1. By Theorem 7.3.2, either p ≡ 1 or 11 (mod 12). Suppose q is another
prime such that q | N , and both p and q are congruent to 1 mod 12. Then p = 12j + 1 and
q = 12k + 1 for integers j, k and we have
But N ≡ 11 (mod 12), so at least one prime divisor of N must be congruent to 11 mod
12.
60
7.3. Applications of Quadratic Reciprocity Chapter 7. Reciprocity
p−1 p−1
= (−1) 2 a 2
−1 a
≡ by Euler’s Criterion
p p
≡ −1 · 1 by Theorem 7.1.7
= −1.
61
7.3. Applications of Quadratic Reciprocity Chapter 7. Reciprocity
Theorem 7.3.8 (Miller). Let q be an odd Sophie Germain prime. Then for p = 2q + 1, the
complete set of primitive roots modulo p is
{−22 , −32 , . . . , −q 2 }.
Notice that Miller’s Theorem would imply Artin’s Conjecture is true if we knew there
are infinitely many Sophie Germain primes. Sadly, this is not the case.
62
Part II
63
Chapter 8
Introduction
These notes were compiled from a semester of lectures at Wake Forest University by Dr.
∞
X 1
John Webb. The primary focus is the Riemann Zeta Function: ζ(s) =
n=1
ns
∞
X 1
Example 8.0.1. ζ(1) = , the harmonic series, is a divergent series.
n=1
n
∞
X 1 π2
Example 8.0.2. We know ζ(2) = 2
converges to , but how?
n=1
n 6
Euler’s Results
π2
Proved that ζ(2) =
6
900+ papers over his lifetime
Riemann Hypothesis
L-functions
64
Chapter 9
Preliminaries
65
9.1. Basic Analysis Chapter 9. Preliminaries
Definition. A function f (x) diverges (to ∞), denoted lim f (x) = ∞, if for all M > 0
x→∞
there exists some N > 0 such that for all x > N , f (x) > M .
Definition. Given functions f (x) and g(x) defined on R (or Z) ≥ a, with g(x) > 0 and
monotonic on [a, ∞), we say that f (x) = O(g(x)) if for all x ≥ a there exists some constant
M > 0 such that |f (x)| ≤ M g(x), also denoted f (x) << g(x).
Definition. Given functions f and g, f (x) >> g(x) if there exists some constant m > 0
such that |f (x)| ≥ mg(x) for all x > a.
Definition. If f (x) >> g(x) and f (x) << g(x) then f and g are said to have the same
order, denoted f (x) g(x).
X 1
Example 9.1.1. = O(log(x))
p prime
p
p≤x
x x ∞ Z ∞
X 1 X1 X
We know ≥ . Recall the integral test: f (n) and f (x) dx both converge
n=1
n p=2
p n=1 1
or both diverge (if f > 0 and f is monotone on [1, ∞)). So we have
x Z x
X 1 1
≤ dt = log(x).
n=2
n 1 t
In fact,
x
X 1
log(x) < < 2 log(x).
n=1
n
x x x
X 1 X 1 X 1
Therefore ≤ < 2 log(x) for all x > 2. Hence = O(log(x)).
p=2
p n=1 n p=2
p
x x
X 1 X 1
But is way bigger than so this is a bad approximation tool.
n=1
n p=2
p
66
9.1. Basic Analysis Chapter 9. Preliminaries
Let g(x) = 1, then | sin(x)| ≤ g(x) for all x ∈ R. So sin(x) << 1. But is sin(x) 1? No,
since sin(x) = 0 at infinitely many points.
First, f (x) << x because if M = 2, |f (x)| ≤ 2x for x > 0. And f (x) >> x because if
m = 1/2, |f (x)| ≥ 1/2x for x > 0. Thus f (x) x.
Definition. Two functions f and g are asymptotic to each other, denoted f (x) ∼ g(x), if
f (x)
lim = 1.
x→∞ g(x)
67
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
f (x)
4
2 dn
1 2 4 6 8
n
n−1
X
dn < (f (k) − f (k + 1)) = f (1) − f (n)
k=1
by telescoping series. And since f (n) > 0, f (1) − f (n) < f (1). Hence dn < f (1).
Let C(f ) = lim dn . We know C(f ) exists because dn is increasing but bounded. Then
n→∞
we can write n Z k+1
X
C(f ) = lim [f (k) − f (x)] dx.
n→∞ k
k=1
∞ Z
X k+1
Let Ef (n) = f (n) + dn − C(f ). Then Ef (n) > 0 since dn − C(f ) = [f (k) − f (x)] dx.
k=n k
Together, this gives us
∞
X Z ∞
f (k) = f (x) dx + C(f ) + Ef (n).
k=1 1
∞
X 1
Goal: Approximate 2
to at least 3 decimal places.
n=1
n
68
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
1
Definition. γ = C x
is called the Euler constant.
Z n
1
We know that dx = log(n). To approximate the remaining terms, we will first prove
1 x
the following theorem for the general case.
Xn Z n Z n
Theorem 9.2.3. f (k) = f (x) dx + (x − bxc) f 0 (x) dx + f (1), where f has a con-
k=1 1 1
tinuous first derivative on [1, n].
To find dn ,
n−1 Z
X k+1
dn = [f (k) − f (x)] dx
k=1 k
u = f (k) − f (x) dv = dx
and integrate:
Z k+1 k+1 Z k+1
[f (k) − f (x)] dx = [f (k) − f (x)](x − (k + 1)) + (x − (k + 1))f 0 (x) dx
k k k
Z k+1
= [(f (k) − f (k + 1)) · 0 − 0(−1)] + (x − (k + 1))f 0 (x) dx
k
Z k+1 Z k+1
=0+ (x − (k + 1))f 0 (x) dx = (x − (k + 1))f 0 (x) dx.
k k
69
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
Thus
n−1 Z
X k+1 n−1 Z
X k+1
[f (k) − f (x)] dx = (x − (k + 1))f 0 (x) dx
k=1 k k=1 k
n−1
X Z k+1
= (x − bxc − 1)f 0 (x) dx
k=1 k
n−1
X Z k+1 n−1 Z
X k+1
0
= (x − bxc)f (x) dx − f 0 (x) dx
k=1 k k=1 k
n−1
X Z k+1 Z n
= (x − bxc)f 0 (x) dx − f 0 (x) dx
k=1 k 1
n−1 Z k+1
X
= (x − bxc)f 0 (x) dx − (f (n) − f (1)).
k=1 k
This is a formula we can work with. Plugging it back into the series formula, we obtain
n
X Z n Z n
f (k) = f (x) dx + (x − bxc)f 0 (x) dx + f (1).
k=1 1 1
Note that the (x − bxc) part above bounds the integral, but we can do a little better.
The function x − bxc is a 1-periodic function. By selecting x − bxc − 1/2 instead, we still
have a 1-periodic function but one that will integrate to 0 over integer periods.
x − bxc
1
1 2 3 4
1 x − bxc − 1/2
1 2 3 4
−1
70
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
(
x − bxc − 1/2 x 6∈ Z
So let P1 (x) =
0 x ∈ Z.
Now consider
Z n Z n
0
(x − bxc)f (x) dx = (x − bxc − 1/2 + 1/2)f 0 (x) dx
1
Z1 n Z n
0 1/2f 0 (x) dx
= P1 (x)f (x) dx +
Z1 n 1
Putting this into the formula from Theorem 9.2.3 gives us the following theorem:
n Z n 0
Z n
X 1 1 1 1 1
= P1 (x)
dx + dx + +1
k=1
k 1 1x x 2 n
1 n 1
Z
1 1
≤ log(x) + dx + +1
2 1 x2 2 n
≤ log(x) + 1.
Strategy:
71
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
Proof. Since f is a monotone decreasing function (from the integral test), f 0 is always neg-
ative. So
Z ∞ Z ∞
0
|f (x)| dx = − f 0 (x) dx
i i
Z n
= − lim f 0 (x) dx
n→∞ i
= lim [f (i) − f (n)]
n→∞
= f (i) − 0 since f (n) → 0
= f (i).
Z ∞
Thus |f 0 (x)| dx converges to f (i).
i
1
Example 9.2.6. Approximate f (x) = to 3 decimals.
x2
1 0 −2
Let f (x) = ; then f (x) = . We will find an i such that
x2 x3
Z ∞
2P 1 (x)
≤ 0.0005
dx
i x3
1
k3
k + 1/2 k+1
k
−1
(k+1)3
2
For [k, k + 1/2] take the max value of |f 0 (x)| = .
k3
1
k3
P1 (x) max |f 0 (x)|
k k + 1/2
72
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
Z k+1/2 Z k+1/2
2P1 (x) 2
Then − dx ≤ P1 (x) · dx.
k x3 k k3
1
area of triangle =
4k3
k + 1/2 k+1
−1
(k+1)3
Z k+1 Z k+1
2P1 (x) 2
Then − dx ≥ P1 (x) · dx.
k+1/2 x3 k+1/2 (k + 1)3
1
area of triangle =
4(k+1)3
Z k+1
1 1
This gives us an estimate for the error term: P1 (x)f 0 (x) dx ≤ − . So we
k 4k 3 4(k + 1)3
have ∞
Z ∞
0
X 1 1 1
P1 (x)f (x) dx ≤ 3
− =
i k=i
4k 4(k + 1)3 4i3
1 √3
by telescoping sum. We want 3 ≤ 0.0005 ⇒ i ≥ 500 = 7.9 . . . So choose i = 8 and we
∞
4i
X 1
can estimate f (n) for f (x) = 2 to within 3 decimals:
n=1
x
∞ 7 Z ∞
X 1 X 1 1 1
= + dx + f (1)
n=1
n2 n=1
n2 8 x2 2
7
X 1 1 1 1
= + +
n=1
n2 8 2 64
1 1 1 1 1 1 1 1
=1+ + + + + + + +
4 9 16 25 36 49 8 128
≈ 1.6446.
73
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
u = f 0 (x) dv = P1 (x) dx
du = f 00 (x) dx v = 1/2(x2 − x) + c
Z 1
where we get to pick c. If c = 1/12 then 1/2(x2 − x) + c dx = 0. So in order to make the
0
integral periodic and have good cancellation, let v = 1/2((x − bxc)2 − (x − bxc)) + 1/12. Note
that since P1 (x) was piecewise continuous, 1/2((x − bxc)2 − (x − bxc)) + 1/12 is continuous as
well (on [0, ∞)). Now to integrate,
Z n
n
P1 (x)f 0 (x) dx = f 0 (x) 1/2((x − bxc)2 − (x − bxc)) + 1/12 1
1
Z n
1/2((x − bxc)2 − (x − bxc)) + 1/12 f 00 (x) dx
−
1
Z n
0 0 1/2P (x)f 00 (x) dx
= /12f (n) − /12f (1) −
1 1
2
1
Z n
2 00
where P2 (x) = (x−bxc) −(x−bxc)+ 1/6. This gives us a new error term, 1/2P
2 (x)f (x) dx.
1
where P2 (x) = (x − bxc)2 − (x − bxc) + 1/6 and f has continuous first and second derivatives
on [1, n].
74
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
Z 1
where the coefficient 3 is chosen so P3 (x) is monic, and c is chosen such that P3 (x) dx = 0.
0
This give us P3 (x) = x3 − 3/2x2 + 1/2x + c, so
1
1
x4 x3 x2
Z
3
(x − 3/2x2 + 1/2 + c) dx = − + + cx
0 4 2 4 0
1 1 1
= − + +c=0
4 2 4
⇒ c = 0!
Z x
Thus we set P3 (x) = 3 P2 (t) dt = x3 − 3/2x2 + 1/2x. To integrate by parts again, let
0
u = f 00 (x) dv = P2 (x) dx
Then Z n Z n
1 00 1 00
P2 (x)f (x) dx = 1/6P3 (x)f (x) − P3 (x)f 000 (x) dx.
2 1 6 1
where P3 (x) = x3 − 3/2x2 + 1/2x and f has continuous first, second and third derivatives on
[1, n].
1
Example 9.2.10. Let’s apply this to f (x) =
x2
Z k+1
0 2 00 6 000 24 000
Note that f (x) = − 3 , f (x) = 4 , f (x) = − 5 . Let’s look at
P3 (x)f (x) dx.
x x x k
k k+1
75
9.2. Euler-Maclaurin Summation Chapter 9. Preliminaries
Then we have
Z
k+1/2
Z k+1 Z k+1
000
24
24
P3 (x)f (x) dx ≤ 5 P3 (x) dx − P3 (x) dx
k x k (k + 1)5
k+1/2
and Z 1/2
1/2 4
x x3 x2 1 1 1 1
P3 (x) dx = − + = − + = .
4 2 4 0 64 16 16 64
0
Thus
k+1
Z
000
24 1 24 1 since the integral from 0 to 1/2
P3 (x)f (x) dx ≤ 5
− 5 is the same as from 1/2 to 1
k k 64 (k + 1) 64
3 1 1
= − .
8 k 5 (k + 1)5
This becomes
Z ∞
∞
000
3 X 1 1 3 1
P3 (x)f (x) dx ≤ 5
− 5
=
i 8 k=i k (k + 1) 8 i5
76
9.3. The Bernoulli Numbers Chapter 9. Preliminaries
B0 (x) = 1
B1 (x) = x − 1/2
B2 (x) = x2 − x + 1/6
B3 (x) = x3 − 3/2x2 + 1/2.
Bk (x) is known as the kth Bernoulli polynomial, and the sequence of bk terms are called
the Bernoulli numbers.
Proof. If k = 1, then
B1 (1 − x) = (1 − x) − 1/2
= 1/2 − x
= −B1 (x)
so the base case holds. Now assume Bk−1 (x) = (−1)k−1 Bk−1 (1 − x). We have that
Z x
Bk (x) = k Bk−1 (t) dt + bk
0
Z x
= k (−1)k−1 Bk−1 (1 − t) dt + bk .
0
77
9.3. The Bernoulli Numbers Chapter 9. Preliminaries
Note that Z 1
Bk (1) = k Bk−1 (t) dt + bk = k(0) + bk = bk = Bk (0)
0
so Bk (1) = Bk (0). And if k is odd, then Bk (1) = −Bk (0) = −bk , but these also equal bk ,
hence bk = 0 if k is odd.
Proposition 9.3.2. For k ≥ 2, if k is even then Bk (x) = 0 for exactly one value in [0, 1/2].
And if k is odd, Bk (x) = 0 iff x = 0, 1/2 or 1.
Proof. Let k = 2, then see graph. Now suppose k is odd and the above holds for k − 1, which
is even. We know that Bk (0) = Bk (1/2) = 0. Suppose that Bk (c) = 0 for some c ∈ (0, 1/2).
By Rolle’s Theorem, since Bk (0) = Bk (c) = Bk (1/2) there must be an a and b such that
0 < a < c < b < 1/2 and Bk0 (a) = Bk0 (b) = 0. But Bk0 (x) = kBk−1 (x) which is even. By
inductive hypothesis, there’s only value in [0, 1/2] such that Bk−1 (x) = 0, a contradiction.
Thus for k odd, Bk (x) = 0 iff x = 0, 1/2 (or 1 by extension). Now suppose k is even and the
hypothesis holds for k − 1. Suppose Bk (t1 ) = Bk (t2 ) = 0 for t1 , t2 ∈ [0, 1/2] with t1 6= t2 . By
Rolle’s Theorem, Bk0 (x) = kBk−1 (x) has a zero in (t1 , t2 ). But since k −1 is odd, Bk−1 (x) 6= 0
on the interval (0, 1/2), contradiction our choice of t1 , t2 . Hence if k is even, Bk (x) = 0 for
exactly one value between 0 and 1/2.
Properties of Bernoulli Numbers
(1) bk = 0 if k is odd
(2) The critical points of Bk (x) are x = 0, 1/2, 1 if k is even, so bk is either a max or in on
[0, 1], and Bk (1/2) is the opposite
k k
X k X k r
(3) bk = bk−r — in fact, Bk (x) = x bk−r
r=0
r r=0
r
(4) |bk | ≥ |Bk (x)| on the interval [0, 1] if k is even, and Bk (1/2) = −(1 − 21−k )bk for k
even, so |bk | − |Bk (1/2) | is very small
Theorem 9.3.3 (General Form for Euler-Maclaurin Summation). If f has 2m+1 derivatives
on [i, n],
n Z n m
X X b2r
f (2r−1) (n) − f (2r−1) (1)
f (k) = f (x) dx + /2(f (1) + f (n)) −
1
k=i i r=1
(2r)!
Z n
1
+ P2m+1 (x)f (2m+1) (x) dx
(2m + 1)! i
where P2m+1 (x) = B2m+1 (x − bxc).
78
Chapter 10
Euler’s Work
79
10.1. On the Sums of Series of Reciprocals Chapter 10. Euler’s Work
P2 (x) = 1 + a1 x + a2 x2 + . . . + an xn
with roots b1 , b2 , . . . , bn .
Claim. Then we can write P2 as
x x x
P2 (x) = 1 − 1− ··· 1 − .
b1 b2 bn
Proof. If we evaluate P2 at one of its roots, bi , we have
bi bi bi
P2 (bi ) = 1 − 1− ··· 1 − · · · = 0.
b1 b2 bi
=0
So the bi ’s indeed satisfy their definition as roots. Thus our two expressions are degree n
polynomials with the same roots, so they can only differ by a factor of k. Plugging in x = 0,
we can solve for k = 1. Thus our two expressions are equivalent:
P2 (x) = 1 + a1 x + . . . + an xn
x x x
= 1− 1− ··· 1 − .
b1 b2 bn
80
10.1. On the Sums of Series of Reciprocals Chapter 10. Euler’s Work
s s3
Now consider f (s) = 1 − + − . . . Then the roots of f (s) are all the angles
y 3!y
A, B, C, D, . . . such that
81
10.1. On the Sums of Series of Reciprocals Chapter 10. Euler’s Work
Now define
α = a + b + c + d + e + ...
β = ab + ac + ad + bc + bd + . . .
γ = abc + abd + acd + bcd + . . . .
So alpha is the sum of single terms, β is the sum of all possible products of two terms, and
γ is the sum of products of three terms.
Claim. a2 + b2 + c2 + . . . = α2 − 2β.
a2 + b2 = a2 + b2 + 2ab − 2ab
= (a + b)2 − 2ab
= α2 − 2β.
Claim. a3 + b3 + c3 + . . . = α3 − 3αβ + 3γ
Proof omitted.
Proof omitted.
Let P = a + b + c + . . . = α
then Q = a2 + b2 + c2 + . . . = α2 − 2β = P α − 2β
R = a3 + b3 + c3 + . . . = α3 − 3αβ + 3γ = Qα − P β + 3γ
S,T, etc. follow from here.
82
10.1. On the Sums of Series of Reciprocals Chapter 10. Euler’s Work
where A is the least angle such that y = sin(A). But this just gives us
1
=α
y
0=β
−1
=γ
3!y
0=δ
etc.
Q 1
Since β = 0, Q = P α − 2β = P α and R = Qα + 3γ = − , and this holds for all values
y 2y
of y = sin(A). We will now choose y = 1, so A = π/2. All of our roots now come in equal
1 1 1 1
pairs: π , π , 5π , 5π , . . . Then
/2 /2 /2 /2
1 1 1 1 1
α= = + + + + ...
1 A π − A A − 2π −π − A
2 2 2 2 2 2
= + − − + + − ...
π π 3π 3π 5π 5π
4 1 1 1 1
= 1 − + − + − ... .
π 3 5 7 9
1 1 1 1 π
So 1 − + − + − . . . = . Note that this looks like a case of the Taylor series for
3 5 7 9 4
tan−1 (x):
∞
−1
X (−1)n x2n+1
tan (x) =
n=0
2n + 1
∞
X (−1)n π
tan−1 (1) = = .
n=0
2n + 1 4
Q = a2 + b 2 + c 2 + d 2 + . . .
2 2 2 2 2 2
2 2 −2 −2 2 2
= + + + + + + ...
π π 3π 3π 5π 5π
8 1 1 1
= 2 1+ + + + ... .
π 9 25 49
83
10.1. On the Sums of Series of Reciprocals Chapter 10. Euler’s Work
∞
X 1 1 1 1
Now let z = 2
= 1+ + + + . . . Then to produce all the even terms, divide by 4:
n=1
n 4 9 16
z 1 1 1 1 1
= + + + + + ...
4 4 16 36 64 100
π2
So z − z/4 just gives us back the odd terms, which we have shown equal 8
:
z π2
⇒z− =
4 8
π2
⇒z= .
6
For an alternate proof, set y = 0 at the beginning. Then the roots of our equation will be
±π, ±2π, ±3π, . . ., giving us α = 0 and β = − 1/6. Thus Q = −2β = 1/3, and we can proceed
to solve for ζ(2) as before:
2 2 2 2
1 1 1 −1 1
Q= = + + + + ...
3 π π 2π 2π
2 1 1 1
= 2 1+ + + + ...
π 4 9 16
∞
π2 X 1
⇒ = .
6 n=1
n2
∞
X 1
We can solve for other identities in the same way. For example, we find that if y =
n=1
n4
∞
y X 1
then = , implying
16 n=1 (2n)4
∞
y X 1 π4
y− = =
16 n=1 (2n − 1)4 32 · 3
π4
⇒y= .
90
Although Euler did not provide a general formula for ζ(2n) in this paper, his methods here
can be extended to show that for all n,
84
10.2. Newton’s Identities Chapter 10. Euler’s Work
X
sl = xi 1 xi 2 · · · xi l
i1 ,...,il distinct
X
u(m, r) = xm
j 0 xj 1 xj 2 · · · xj r
j0 ,j1 ,...,jr distinct
= xk+1 k+1
1 x2 · · · xl + x1 x2 · · · xl + . . . + x1 x2 · · · xk+1
l + . . . + xk+1
1 x2 · · · xl−1 xl+1
+ x1 xk+1
2 · · · xl−1 xl+1 + . . . + x1 x2 · · · xl−1 xkl xl+1 + . . .
= xk+1 k+1
· · · xl + . . . + xk1 x2 · · · xl xl+1 + . . .
1 x2 · · · xl + x1 x2
!
X X
= xk+1
j0 xj1 · · · xjl−1
+ xki1 xi2 · · · xil
j0 ,...,jl−1 distinct i1 ,...il distinct
85
10.2. Newton’s Identities Chapter 10. Euler’s Work
Proof. Consider
∞
! !
X X
t1 sl = xn xi l xi 2 · · · xi l
n=1 i1 ,...,il
= u(2, l − 1) + (l + 1)sl+1 .
These lemmas are used to prove the main theorem in this section, Newton’s Identities.
= −u(k − 1, 1).
Next,
86
10.2. Newton’s Identities Chapter 10. Euler’s Work
And
87
10.3. Euler’s Product Form Chapter 10. Euler’s Work
In Euler’s paper Various Observations about Infinite Series (1737), he made use of the
following notation
1 1 1 2n 3n 5n 7n
1+ + + + . . . = · · · ··· ,
2n 3n 4n 2n − 1 3n − 1 5n − 1 7n − 1
which of course is equivalent to our more modern notation for Euler’s Product Form. We
will follow Euler’s proof below.
Theorem 10.3.1 (Euler’s Product Form).
1 1 1 2n 3n 5n 7n
1+ + + + . . . = · · · ···
2n 3n 4n 2n − 1 3n − 1 5n − 1 7n − 1
In other words,
∞ Y −1
X 1 1
s
= 1− s .
m=1
m p prime
p
88
10.3. Euler’s Product Form Chapter 10. Euler’s Work
An alternate proof of Euler’s Product Form is given here. The proof utilizes the Fundamental
Theorem of Arithmetic (2.1.2), which states that every natural number factors uniquely into
the product of some primes.
Proof.
−1 !
Y 1 Y 1
1− s =
p prime
p p prime
1 − p1s
Y 1 1 1
= 1 + s + 2s + 3s + . . . by geometric sum
p prime
p p p
∞
X 1
= by Fundamental Theorem of Arithmetic (2.1.2)
m=1
ms
= ζ(s).
X 1 X 1
In the next sequence, we will prove that diverges by showing >> log log(x).
p prime
p p prime
p
p≤x p≤x
Steps:
(a) f (x) ∼ g(x) ⇒ log(f (x)) ∼ log(g(x))
X1 Y −1
1
(b) < 1−
n≤x
n p prime p
f (x)
lim =1
x→∞ g(x)
f (x)
lim log = log(1) = 0.
x→∞ g(x)
So lim [log(f (x)) − log(g(x))] = 0 ⇒ lim log(f (x)) = lim log(g(x)) and we conclude
x→∞ x→∞ x→∞
log(f (x))
lim = 1.
x→∞ log(g(x))
89
10.3. Euler’s Product Form Chapter 10. Euler’s Work
X1 ∞ n
1 1 Y X 1
Since = 1 + + . . . + , then contains all of the terms of the former,
n≤x
n 2 x p≤x n=0
p
plus the product of the reciprocals of all primes less than p. Therefore it must be that
X 1 Y −1
1
< 1− .
n≤x
n p≤x p
X 1 Y −1
1
Proof of (d): By (b), < 1− . Thus
n≤x
n p≤x p
! −1 !
X1 Y 1
log < log 1−
n≤x
n p≤x
p
X 1
= − log 1 −
p≤x
p
X2 1
≤ by (c) with t =
p≤x
p p
X1
=2 .
p≤x
p
!
X1 X1 X1
So log << . And since ∼ log(x), (a) gives us
n≤x
n p≤x
p n≤x
n
!
X1
log ∼ log log(x).
n≤x
n
X1 X 1
Hence log log(x) << which is sufficient to prove that diverges.
p≤x
p p prime
p
90
10.3. Euler’s Product Form Chapter 10. Euler’s Work
Next, we provide another proof of the divergence of the sum of reciprocals of primes. The
terminology loosely follows another paper by Euler, but we also employ series and product
notation.
X 1
Theorem 10.3.2. diverges.
p prime
p
X 1 ∞
X 1
Proof. Since s
< s
which converges, then
p prime
p n=1
n
1 1 1 X 1
A= + + + . . . =
2s 3s 5s p prime
ps
1 1 1 X 1
B = 2s + 2s + 2s + . . . =
2 3 5 p prime
p2s
X
− log 1 − p−s
=
p prime
∞
X X p−sn
= by power series for − log(1 − x)
p prime n=1
n
X∞ X p−sn
= we can switch order of summation since
n series is absolutely convergent
n=1 p prime
∞
!
X 1 X
= p−sn
n=1
n p prime
ps
X
= log .
p prime
ps − 1
91
10.3. Euler’s Product Form Chapter 10. Euler’s Work
Z ∞
1 1
≤ dx
2 1 x2s
∞
1 1 −1
= ·
2 x2s−1 2s − 1 1
1 1
= .
2 2s − 1
1 X −ks 1
So 1/2B converges. Likewise, p ≤ by replacing 2 with k above. Also
k p prime k(ks − 1)
1 1
note that ≤ . Then we have
k(ks − 1) (k − 1)2
∞
X X
1/2B + 1/3C + 1/4D + ... = p−ns
n=2 p prime
∞
X 1
≤
n=2
n(ns − 1)
∞
X 1
≤
n=2
(n − 1)2
∞
X 1 π2
= 2
= .
n=1
n 6
92
10.3. Euler’s Product Form Chapter 10. Euler’s Work
1
Proof. Let f (x) = . Then since f is monotone decreasing, the integral test gives us
xs
Z ∞ ∞
dx X 1
s
< s
.
1 x n=1
n
We know ∞
∞
x−s+1 −1
Z
dx 1
s
= = = .
1 x −s + 1 1
−s + 1 s−1
Likewise, we can split off the first term of the series (since we can’t integrate → 0) to obtain
∞ Z ∞
X 1 dx 1
s
< s
+1 = + 1.
n=1
n 1 x s − 1
Thus we have the desired bound:
∞
1 X 1 1
< s
< + 1.
s−1 n=1
n s − 1
93
10.3. Euler’s Product Form Chapter 10. Euler’s Work
The proof of the following theorem results from Euler and the Prime Harmonic Function,
by Paul Pollack at UGA.
X 1
Theorem 10.3.6. − log log(x) < 15 for sufficiently large x.
p p≤x
Proof. Let λ(t) be a bounded function on [0, 1], x > e4 ≈ 80. Define
!
X 1 1 −1
F (λ; x) = 1 λ p log(x) .
p prime
p p log(x)
! !
X 1 1 1
X 1 1
= 1 p log(x) + 1 ·0
p≤x
p p log(x) p>x
p p log(x)
X1
= .
p≤x
p
94
10.3. Euler’s Product Form Chapter 10. Euler’s Work
λL
1
λ0 λU
t
1
e 1
But first we need to bound F (λ; x) when λ is linear. Suppose λ(t) = a + bt. Then
!
X 1 1 −1
F (λ; x) = 1 λ p log(x)
p prime
p p log(x)
! !
X 1 b
= 1 a+ 1
p prime p1+ log(x) p log(x)
X 1 X 1
=a 1
1+ log(x)
+b 1+ log(x) 2 .
p prime p p prime p
1 1 2
Notice that since x > e4 , log(x)
< 4
and log(x)
< 12 . So Lemma 10.3.5 gives us
X 1 1
− log log(x) < 2 letting s =
1
log(x)
1+
p prime p
log(x)
and
X 1 log(x) 2
− log <2 letting s = .
2
2 log(x)
1+ log(x)
p prime p
−2(|a| + |b|) < F (λ; x) − (a + b) log log(x) + b log(2) < 2(|a| + |b|).
95
10.3. Euler’s Product Form Chapter 10. Euler’s Work
Thus |F (λ; x) − (a + b) log log(x)| < 2(|a| + |b|) + |b| log(2) if λ is linear. Now we will pick
e 1
λU (t) = −et + (e + 1) and λL (t) = t− by our graph on the previous page. Notice
e−1 e−1
that both lines pass through (1, 1), so λU (1) = aU + bU = 1 and λL (1) = aL + bL = 1. Thus
by the bound on linear λ’s,
Hence F (λL ; x) > log log(x) − 15. Putting this together, we have
log log(x) − 15 < F (λL ; x) ≤ F (λ0 ; x) ≤ F (λU ; x) < log log(x) + 15,
X1
so |F (λ0 ; x) − log log(x)| < 15. But we determined that F (λ0 ; x) = so we can conclude
p≤x
p
that
X 1
− log log(x) < 15.
p
p≤x
96
10.4. The Prime Number Theorem Chapter 10. Euler’s Work
A natural question we can ask is: How often is γ(n) = 1? (i.e. How often is n prime?)
Consider Z x
log log(x) = f (t) dt
2
1 γ(n) 1
This suggests that EV (γ(n)) = , and the ratio will converge to as n
log(n) n n log(n)
gets big.
X
Define π(x) to be number of primes p ≤ x. Then π(x) = γ(n). Furthermore, define
Z x n≤x
dt
the logarithmic integral by Li(x) = . Gauss conjectured that π(x) = Li(x), and
2 log(t)
while this is not stricly true, his intuition was correct that π(x) grows at about the same
rate as the logarithmic integral (i.e. they are asymptotic).
x
Lemma 10.4.1. Li(x) ∼ .
log(x)
97
10.4. The Prime Number Theorem Chapter 10. Euler’s Work
−1
du = dv = dt.
t(log(t))2
Then
Z x x Z x Z x
dt t t x 2 dt
Li(x) = = + dt = − + .
2 log(t) log(t) 2 2 t(log(t))2 log(x) log(2) 2 (log(t))2
To integrate by parts again, let
1
u = v=t
(log(t))2
−2
du = dv = dt.
t(log(t))3
Then
x Z x
x 2 t 2t
Li(x) = − + + dt
log(x) log(2) (log(t))2 2 2 t(log(t))3
Z x
x x dt
= + +c+2
log(x) (log(x))2 2 (log(t))3
x 3x
≤ + + c.
log(x) (log(x))2
x
And clearly ≤ Li(x) so we have
log(x)
x x 3x
≤ Li(x) ≤ + +c
log(x) log(x) (log(x))2
Li(x) 3 c log(x)
⇒ 1 ≤ x ≤ 1+ + .
log(x)
log(x) x
The Prime Number Theorem is perhaps the most important results from analytic number
theory. The theorem was proven separately by Hadamard and de LaValle-Poussin using
Riemann’s work on the zeta function.
98
10.4. The Prime Number Theorem Chapter 10. Euler’s Work
x
Theorem 10.4.2 (Prime Number Theorem). π(x) ∼ .
log(x)
On the previous page, we showed that
x
Li(x) ∼
log(x)
so one route to proving the Prime Number Theorem would be to show that π(x) ∼ Li(x),
which is of course much harder. Simple calculus shows that
x
≤ π(x) ≤ Li(x)
log(x) ↑
closer
Notice that 1+π1 (x)+π3 (x) = π(x), so at least one of these functions must diverge as x → ∞.
Questions:
Calculations show that π1 (x) < π3 (x) for many x. Does this hold for all x?
Dirichlet’s Theorem: Take any n, a ∈ Z with gcd(a, n) = 1. Then there are an infinite
number of primes p ≡ a (mod n). (See Section 17.6.)
Corollary 10.4.3. For any relatively prime integers a, n, πa,n (x) equals the number of primes
p ≤ x such that p ≡ a (mod n) diverges to infinity as x gets big.
Corollary 10.4.4. If (a, n) = 1 and (b, n) = 1, then πa,n (x) ∼ πb,n (x).
99
Chapter 11
Complex Analysis
In this chapter we survey the basic results in complex analysis that will √
be useful in number
theory applications. Recall the definition of the imaginary number i = −1.
In this way we can view the real part x and the imaginary part y of x + iy separately.
The set of all complex numbers is denoted C, and they form an algebraic field under the
operations
100
11.1. Arithmetic Chapter 11. Complex Analysis
11.1 Arithmetic
For a complex number z = x + iy we will denote the real and imaginary parts by x = Re(z)
and y = Im(z). As a vector space, C has the following special attributes for each vector
(complex number).
Definition.
p For a complex number z = x + iy, the modulus or absolute value of z is
|z| = x + y 2 and the complex conjugate of z is z̄ = x − iy.
2
Note that |z| and |z̄| are always equal. Geometrically, the modulus represents the distance
in the complex plane from the origin (0, 0) to (x, y).
(ii) zw = z̄ w̄.
Since C is a field, there is also a notion of divisibility for complex numbers. In particular
if x + iy, u + iv ∈ C and u + iv 6= 0, we define
x + iy xu + yv + i(yu − xv)
= .
u + iv u2 + v 2
x+iy
One can check that this is the appropriate formula by multiplying and dividing u+iv by the
conjugate u − iv.
As in the xy-plane, there is a polar coordinate system for complex
numbers: if z = x + iy
then we set r = |z|, x = r cos θ and y = r sin θ where θ = tan−1 xy . This gives us
Theorem 11.1.2 (De Moivre’s Theorem). For all integers n, (cos θ + i sin θ)n = cos(nθ) +
i sin(nθ).
101
11.1. Arithmetic Chapter 11. Complex Analysis
Proof. We prove this using induction on n. For the base case n = 1, we simply have
(cos θ + i sin θ)1 = cos θ + i sin θ.
Now assume De Moivre’s Theorem holds for n. Then we have
(cos θ + i sin θ)n+1 = (cos θ + i sin θ)n (cos θ + i sin θ)
= (cos(nθ) + i sin(nθ))(cos θ + i sin θ)
= (cos(nθ) cos θ − sin(nθ) sin θ) + i(sin θ cos(nθ) + cos θ sin(nθ))
= cos((n + 1)θ) + i sin((n + 1)θ).
Definition. When we write z = |z|(cos θ + i sin θ), the angle θ is called the argument of z,
denoted arg z.
We often want to restrict our attention to a single, canonical value of θ for any z. Thus
we define the principal argument θ = Arg z, where −π ≤ θ ≤ π.
Proposition 11.1.3. Arg(zw) = Arg z + Arg w, where these may differ by a multiple of 2π.
Example 11.1.4. Let z = −1 + i and w = i. Then zw = −1 − i, Arg(zw) = − 3π
4
and
3π π 5π 3π
Arg z + Arg w = + = ≡− mod 2π.
4 2 4 4
Continuing with the geometric parallels between Euclidean space and the complex plane,
we have the important triangle inequality for complex numbers:
|z + w| ≤ |z| + |w|.
There is also a related inequality, sometimes called the reverse triangle inequality:
||z| − |w|| ≤ |z − w|.
The original purpose of complex numbers was to compute roots of all polynomials, so
it will be desirable to be able to compute roots of complex numbers. In other words, if
w = |w|(cos ψ + i sin ψ), what is w1/n ? Let z = w1/n , so that z n = w. Then using De
Moivre’s Theorem (11.1.2) we have
|w|(cos ψ + i sin ψ) = (|z|(cos θ + i sin θ))n = |z|n (cos(nθ) + i sin(nθ)).
Solving for θ, we see that
ψ + 2πk
cos ψ = cos(nθ) =⇒ nθ = ψ + 2πk =⇒ θ =
n
for some integer k. Hence our expression for w1/n is
1/n 1/n ψ + 2πk ψ + 2πk
z = w = |w| cos + i sin .
n n
For the nth root of w, that is w1/n , this formula gives all possible roots. In fact there are n
distinct roots; all others are repeated values. p
Recall that the equation of a circle in R2 is (x − x0 )2 + (y − y0 )2 = r for r > 0. In the
complex plane, this is expressed by |z − z0 | = r.
102
11.2. Functions and Limits Chapter 11. Complex Analysis
f
x x
1
Example 11.2.2. f (z) = z−1
has domain D = {z ∈ C | z 6= 1} and range f (D) = {z ∈ C |
z 6= 0}.
Definition. A sequence is a complex-valued function whose domain is the set of positive
integers, written (zn ) = (z1 , z2 , z3 , . . .) where each zi is a complex number.
Definition. A sequence (zn ) is said to have a limit L if, given any ε > 0 there is some
N ∈ N such that |zn − L| < ε for all n ≥ N . In this case we write lim zn = L and say that
n→∞
(zn ) converges to L. If no such L exists, then (zn ) is said to diverge.
The definitions of sequence and limit are nearly identical to their counterparts in real
analysis. However, in the complex plane every number has a real and an imaginary part.
The following proposition helps us relate the definition of a complex limit to its real and
imaginary parts.
Proposition 11.2.3. Let zn = xn + iyn and z = x + iy. Then lim zn = z ⇐⇒ lim xn = x
n→∞ n→∞
and lim yn = y.
n→∞
Proof. ( =⇒ ) If lim zn = z then the inequalities |xn − x| ≤ |zn − z| and |yn − y| ≤ |zn − z|
n→∞
directly imply that (xn ) and (yn ) converge to x and y, respectively.
( ⇒= ) On the other hand, suppose (xn ) → x and (yn ) → y. If ε > 0 is given, we may
choose N1 and N2 such that |xn − x| < 2ε for all n ≥ N1 and |yn − y| < 2ε for all n ≥ N2 . Let
N = max{N1 , N2 }. Then for all n ≥ N the triangle inequality gives us
ε ε
|zn − z| ≤ |xn − x| + |yn − y| < + = ε.
2 2
Hence (zn ) converges to z = x + iy.
103
11.2. Functions and Limits Chapter 11. Complex Analysis
As a result, we have
Corollary 11.2.4. If zn → z then |zn | → |z|.
The converse to this is generally false. For example, the sequence |in | converges to 1
since |in | = |i|n = 1n = 1 for all n; however, in = (i, −1, −i, 1, i, −1, . . .) and this fluctuates
infinitely often between these four values, so the sequence diverges.
Proposition 11.2.5. Suppose lim zn = z. Then
n→∞
1 1
(ii) If zn 6= 0 for any n and z 6= 0, then lim = .
n→∞ zn z
Proof. (i) Let ε > 0 be given. By convergence of (zn ) there exists a positive integer N such
ε
that |zn − z| < |k| . Then for all n ≥ N ,
ε
|kzn − kz| = |k| |zn − z| < |k| = ε.
|k|
This shows that limits of complex sequences behave as expected (by which we mean they
behave as their counterparts do in the real case). We also have
Theorem 11.2.6. If (zn ) converges to z and (wn ) converges to w, then the sequence (zn wn )
converges to zw.
Definition. Given a function f (z) with domain D and a point z0 either in D or in the
boundary ∂D of D, we say f has a limit at z0 if
lim f (z) = L
z→z0
for some L ∈ C. Explicitly, f (z) has limit L at z0 if for every ε > 0 there exists a δ > 0
such that 0 < |z − z0 | < δ implies |f (z) − L| < ε.
104
11.2. Functions and Limits Chapter 11. Complex Analysis
Definition. f (z) is continuous at a point z0 in its domain if lim f (z) exists and it equals
z→z0
f (z0 ). In particular, f (z) is continuous if for every ε > 0 there exists a δ > 0 such that if
|z − z0 | < δ then |f (z) − f (z0 )| < ε.
Example 11.2.7. The function f (z) = |z|2 is continuous on its domain C. For example, f (z)
has limit 4 at z0 = 2i. To see this, let ε > 0 and define δ1 = 1, δ2 = 5ε and δ = min{δ1 , δ2 }.
Note that by the reverse triangle inequality, |z| ≤ |z − 2i| + |2i| < 1 + 2 = 3; we will use this
below. Then if 0 < |z − 2i| < δ we have
z
Example 11.2.8. Consider the function f (z) = where z = x + iy 6= 0 and z̄ = x − iy, its
z̄
complex conjugate. Does lim f (z) exist? Well consider this limit along two different paths
z→0
in the complex plane:
0 + iy
lim f (z) = = −1
(x,y)→(0,y) 0 − iy
x + i0
lim f (z) = = 1.
(x,y)→(x,0) x − i0
z
Since these limits are different, the limit of the function must not exist. Hence is not
z̄
continuous at z0 = 0.
Definition. A function f (z) has a limit at infinity, denoted lim f (z) = L, if for any
z→∞
ε > 0 there is a (large) number M such that |f (z) − L| < ε whenever |z| ≥ M . Note that
there is no restriction on arg z; only |z| is required to be large.
Example 11.2.9. The family of functions f (z) = z1m has a limit L = 0 as z → ∞ for all
1
m = 1, 2, 3, . . .. To see this, let ε > 0 and choose M = ε1/m . Then if |z| ≥ M ,
m m
1
= 1 ≥
1
= (ε1/m )m = ε.
zm |z| M
By properties of limits, we have
Proposition 11.2.10.
1) Every polynomial p(z) = a0 + a1 z + . . . + an z n is continuous on the complex plane.
p(z)
2) If p(z) and q(z) are polynomials, then their quotient q(z)
is continuous at all points such
that q(z) 6= 0.
105
11.2. Functions and Limits Chapter 11. Complex Analysis
Every complex-valued function f (z) can be written as f (z) = u(z) + iv(z), where u and
v are each real-valued functions. This allows us to view every complex function by its real
and imaginary parts. It is easy to see that all of the results on continuity for functions of
the real numbers now apply for complex-valued functions. In particular,
Proposition 11.2.11. Let f = u + iv be a complex-valued function. Then f is continuous
at z0 if and only if u and v are both continuous at z0 .
n
X
Definition. For complex numbers z1 , z2 , . . . their nth partial sum is zj = z1 + . . . + zn .
j=1
Recall from single-variable calculus the exponential function ex . This function has many
definitions, with the two most important being
x t
ex = lim 1 +
t→∞ t
Xx∞ n
and ex = .
n=1
n!
In complex analysis, we define
106
11.2. Functions and Limits Chapter 11. Complex Analysis
The special case eit = cos t + i sin t is called Euler’s formula. Euler was the first to
realize the connection between the exponential function and sine and cosine. This amazing
identity, called “the most remarkable formula in mathematics” by Feynman, has been around
since 1748 and has far-reaching implications in many branches of mathematics and physics.
The following proposition shows that this definition captures all of the nice properties of
ex from the real case. We will see in a moment that in the complex plane, the exponential
function has even deeper properties and an essential connection to the geometry of C.
(a) ez+w = ez ew .
(b) 1
ez
= e−z .
(c) ez+2πi = ez , that is, the complex exponential function is periodic with period 2πi.
(the last part uses a trick similar to the one used in the proof of De Moivre’s Theorem
(11.1.2)).
(b) follows from (a) and trig properties.
(c) follows directly from the definition of ez .
(d) follows from the fact that for any θ, | cos θ + i sin θ| = 1.
(e) By part (d), |ex+iy | = ex , and x is real so ex is always nonzero. Therefore |ez | 6= 0
which implies ez 6= 0.
Note that part (c) of Proposition 11.2.12 implies that f (z) = ez is not a one-to-one
function on the complex plane. This is unfortunate, since that was one of the nice attributes
of ex in the real case, as it allowed us to define an inverse, the logarithm log x. We next
show how to construct a partial solution to this problem.
Let w = ex+iy . We seek a function F such that F (w) = x + iy and eF (x+iy) = x + iy.
Note that since |w| = ex and these are real numbers, we have x = ln |w|. This allows us to
define
107
11.2. Functions and Limits Chapter 11. Complex Analysis
This is not a function (meaning it is not well-defined), since arg z represents a set of
values which differ by 2kπ for integers k.
We remedy this by making branch cuts of the complex plane. This is done by taking
a ray from the origin, say with angle θ and defining the branch (θ, θ + 2π] so that log z is
well-defined on this domain. The most important branch is
Definition. Let Arg z denote the argument of z in the branch (−π, π]; this is called the
principal branch. Then we define the principal logarithm by
eLog z = eln |z|+i Arg z = eln |z| (cos θ + i sin θ) = |z|(cos θ + i sin θ) = z.
Note that these require that we restrict our attention to a single branch (it may not even be
the principal branch) for the expressions to be well-defined.
Recall that f (z) = u(z) + iv(z) is continuous if and only if u and v are continuous. Well
Arg z has no limit at values along the negative real axis. Therefore Log z is not continuous at
any point Re(z) ≤ 0. However, making a different branch cut allows us to define a function
with different continuity.
As in the real case, exponentials for bases other than e are permitted. They relate to the
logarithm by
az = ez log a
where log a is defined on a fixed branch of the logarithm.
The complex trigonometric functions are defined in terms of ez .
Definition. The complex cosine and complex sine functions are defined by
Note that the complex trig functions coincide with their real counterparts, for if x ∈ R
we have
1 ix
2
(e + e−ix ) = 12 (cos x + i sin x + cos(−x) + i sin(−x))
= 21 (cos x + i sin x + cos x − i sin x) = cos x
and 1
2i
(eix − e−ix ) = 1
2i
(cos x + i sin x − (cos(−x) + i sin(−x)))
1
= 2i
(cos x + i sin x − cos x + i sin x) = sin x.
108
11.2. Functions and Limits Chapter 11. Complex Analysis
The complex cosine and sine functions are also periodic, with period 2π like the real-valued
cosine and sine. Using the fact that ez is periodic, we can write
Many other properties of the real trig functions carry over the complex case. Just to name
a few,
(g) When we define the derivative of a complex-valued function in Section 11.4, we will
see that the derivatives of cos z and sin z are similar to the real case.
109
11.3. Line Integrals Chapter 11. Complex Analysis
For functions that take on values over some region in the complex plane, we integrate over
curves.
a
γ(t)
Remember that a curve is smooth if its first derivative γ 0 (t) exists and is continuous on
[a, b]. Since the curves are all functions on a real interval [a, b], we need not worry about
complex derivatives yet; γ 0 (t) is just the first derivative in the normal sense. Some important
examples of parametrizations in the complex plane are
Example 11.3.1. A curve γ is simple if γ(t1 ) 6= γ(t2 ) whenever a < t1 < t2 < b. In plain
language, a simple curve does not intersect itself; it is an embedding of the interval [a, b] into
C. The easiest simple curve to parametrize is a line:
z1
z0 γ
110
11.3. Line Integrals Chapter 11. Complex Analysis
Example 11.3.2. A curve γ is closed if γ(a) = γ(b), i.e. it starts and ends in the same
location. The canonical example of a simple closed curve is a circle:
γ
r
z0
z1 = 2 + 3i
γ
z0 = 0 + 0i
We parametrize the curve by γ(t) = 2t + 3it, 0 ≤ t ≤ 1. Then using the formula above, we
compute
Z Z 1 Z 1
2 2 0
z dz = γ(t) γ (t) dt = (2t + 3it)2 (2 + 3i) dt
γ 0 0
Z 1 Z 1
2 2 2
= (4t − 9t + 12it )(2 + 3i) dt = (−5t2 + 12it2 )(2 + 3i) dt
0 0
Z 1 1
46 1 46
= (−46t2 + 9it2 ) dt = − t3 + 3it3 0 = − + 3i.
0 3 0 3
Example 11.3.4. Just as reversing the order of a and b in a real integral changes the integral
by −1, one can reverse the orientation of a smooth curve γ to switch the sign of the line
integral along γ. Let −γ denote the curve γ with orientation reversed. Then
Z Z
f (z) dz = − f (z) dz.
−γ γ
111
11.3. Line Integrals Chapter 11. Complex Analysis
Example 11.3.5. Let γ be the unit circle, which has the parametrization γ(t) = eit , 0 ≤
t ≤ 2π. Let’s verify the circumference of the circle with the formula for the length of γ:
Z 2π Z 2π Z 2π
0 it
|γ (t)| dt = |ie | dt = dt = 2π.
0 0 0
The next proposition contains some useful properties of the line integral.
Proposition 11.3.6. Suppose γ is a smooth curve and f and g are continuous, complex-
valued functions on a domain containing γ.
Z Z Z
(a) (f (z) + g(z)) dz = f (z) dz + g(z) dz.
γ γ γ
Z Z
(b) For any c ∈ C, cf (z) dz = c f (z) dz.
γ γ
(c) If τ is a curve whose initial point is the terminal point of γ, then γτ is defined to be
the curve obtained by following γ and then τ . The integral over γτ is given by
Z Z Z
f (z) dz = f (z) dz + f (z) dz.
γτ γ τ
Z
(d) f (z) dz ≤ max |f (z)| · length(γ).
γ z∈γ
112
11.4. Differentiability Chapter 11. Complex Analysis
11.4 Differentiability
z
Recall that the function f (z) = is not continuous at z0 = 0. This points to the fact
z̄
that complex functions are somehow different than their real brethren, and in particular the
convergence of a function in C is much stronger than convergence in R.
This definition is the same as in the real case, although as discussed above the notion of
a limit is much stronger in C. In the complex world, we have a further notion of differentia-
bility:
Example 11.4.1. Many familiar functions from real analysis have the same derivative in
the complex plane. For example, f (z) = z 2 has derivative 2z which may be confirmed by
computing either of the above limits. In fact this holds for all z ∈ C so z 2 is an entire
function.
z̄ − z̄0 z − z0 z̄
lim = lim = lim
z→z0 z − z0 z→z 0 z − z0 z→0 z
does not exist as we have seen.
Most of the nice properties of real derivatives carry over to the complex place.
113
11.4. Differentiability Chapter 11. Complex Analysis
The fundamental property in this section is a pair of equations called the Cauchy-
Riemann Equations, which relate the derivative f 0 (z) to the partial derivatives with respect
to the real and imaginary parts of z.
Theorem 11.4.4 (Cauchy-Riemann Equations). Let f (z) = u(x, y) + iv(x, y) be a complex
function which is continuous at z0 = x0 + iy0 . Then f (z) is differentiable at z0 if and only
if the partial derivatives ∂u , ∂u , ∂v and ∂y
∂x ∂y ∂x
∂v
exist, are continuous and satisfy
∂u ∂v ∂u ∂v
= and =−
∂x ∂y ∂y ∂x
on some neighborhood of z0 .
Proof. ( =⇒ ) If f (z) is differentiable at z0 = x0 + iy0 then
f (z0 + h) − f (z0 )
f 0 (z0 ) = lim .
h→0 h
First consider approaching z along the line (x0 + h) + iy0 :
114
11.4. Differentiability Chapter 11. Complex Analysis
Elsewhere, we have
∂f hy ∂f hx ∂f
(z0 ) = · (z0 ) + · (z0 ).
∂x h ∂y h ∂x
Now we subtract these two expressions and take a limit, which gives
f (z0 + h) − f (z0 ) ∂f hy f (z0 + hx + ihy ) − f (z0 + hx ) ∂f
lim − (z0 ) = lim − (z0 )
h→0 h ∂x h→0 h hy ∂y
hx f (z0 + hx ) − f (z0 ) ∂f
+ lim − (z0 ) .
h→0 h hx ∂x
If we can show that the limits on the right are both 0, then we’re done. The ratios hhx and
hy
h
are both bounded by the triangle inequality, so it suffices to prove the the expressions in
parentheses tend to 0. The second term goes to 0 since by definition,
∂f f (z0 + hx ) − f (z0 )
(z0 ) = lim .
∂x hx →0 hx
The other expression is more problematic, since it involves both hx and hy . However, the
Mean Value Theorem from real analysis gives us real numbers 0 < a, b < 1 such that
u(x0 + hx , y0 + hy ) − u(x0 + hx , y0 )
= uy (x0 + hx , y0 + ahy )
hy
v(x0 + hx , y0 + hy ) − v(x0 + hx , y0 )
and = vy (x0 + hx , y0 + bhy ).
hy
Substituting these expressions into the first term above gives us
f (z0 + hx + ihy ) − f (z0 + hx ) ∂f
− (z0 ) = uy (x0 + hx , y0 + ahy ) + ivy (x0 + hx , y0 + bhy )
hy ∂y
− uy (x0 , y0 ) − ivy (x0 , y0 )
= (uy (x0 + hx , y0 + ahy ) − uy (x0 , y0 ))
+ i(vy (x0 + hx , y0 + bhy ) − vy (x0 , y0 )).
Finally, these two pieces each tend to 0 since uy and vy are assumed to be continuous at
z0 = x0 + iy0 . This finishes the proof.
Example 11.4.5. Consider f (z) = Log z using the principal branch D as its domain. We
may write this as
So one sees that u(x, y) = 12 ln(x2 + y 2 ) and v(x, y) = arctan xy . We calculate the partials:
x y 1 −y
ux = vx = − 2 = 2
x2 + y2 2
x 1+ y x + y2
x
y 1 1 x
uy = vy = 2 = .
x2 + y 2 x 1+ y x2 + y 2
x
115
11.4. Differentiability Chapter 11. Complex Analysis
Re(s) = 1/2
1
centered about z0 = 0, where all the coefficients are 1. This series converges to exactly
1−r
when |z| < 1. We will see that power series behave in similar ways, and when they converge,
they converge to complex functions that we may be interested in.
116
11.4. Differentiability Chapter 11. Complex Analysis
∞
X
For a power series an (z − z0 )n we have three cases for convergence:
n=0
(1) The series only converges at z = z0 . In this case, the radius of convergence of the
series is 0.
(2) The series converges for all z in a disc of finite radius R centered at z0 .
(3) The series converges for all z ∈ C, in which case we say the series has an infinite radius
of convergence.
A power series with positive or infinite radius of convergence represents a function that is
holomorphic within the disc of convergence of the series. This is one of the most important
facts in complex analysis, so we take a moment to formalize it here.
∞
X
Theorem 11.4.9. Suppose an (z − z0 )n has a positive or infinite radius of convergence
n=0
R. Then it represents a function f (z) which is holomorphic on D = {z ∈ C : |z − z0 | < R}.
Now that we know that power series are holomorphic (differentiable) on their discs of
convergence, we can take derivatives.
∞
X
Theorem 11.4.10. Suppose an (z − z0 )n has a positive or infinite radius of convergence
n=0
R. Then its derivative is also a power series:
∞
X
0
f (z) = nan (z − z0 )n−1
n=1
Using the formulas for cos z and sin z from Section 11.2, we can derive their Taylor series as
well:
∞
X (−1)n
cos z = (z − z0 )2n
n=0
(2n)!
∞
X (−1)n
sin z = (z − z0 )2n+1 .
n=0
(2n + 1)!
117
11.5. Integration in the Complex Plane Chapter 11. Complex Analysis
= (u dx − v dx) + i (v dx + u dy)
ZγZ γ
ZZ
= (−vx − uy ) dxdy + i (ux − vy ) dxdy by Green’s Theorem
ZZ Ω ZZ Ω
Corollary 11.5.3 (Deformation of Path). Suppose γ1 and γ2 are two simple, closed curves
with the same orientation, with γ2 lying on the interior of γ1 .
γ2
γ1
118
11.5. Integration in the Complex Plane Chapter 11. Complex Analysis
C D
z0
Ω
γ
Proof. Fix z ∈ Ω and let C be a circle with center z contained in Ω. Note that for any
f (ζ)
z ∈ D, is holomorphic on D r {z}. By deformation of path,
ζ −z
Z Z
1 f (ζ) 1 f (ζ)
dζ = dζ.
2πi γ ζ − z 2πi C ζ − z
Now take the limit as r → 0. Since f (z) is continuous, we can bring the limit inside the
integral: Z 2π Z 2π
1 it 1
lim f (z + re ) dt = f (z) dt.
r→0 2π 0 2π 0
Notice that f (z) doesn’t depend on t, so we can integrate this easily and see that it equals
f (z). This proves the theorem.
119
11.5. Integration in the Complex Plane Chapter 11. Complex Analysis
The next theorem shows that Cauchy’s Integral Formula is intimately related to complex
power series.
Theorem 11.5.6. Let f be holomorphic on a domain D and suppose z0 is a point in D such
that the circle |z − z0 | < R for some real R lies in D. Let γ be a simple closed curve lying
within this circle and containing z0 on its interior. Then
∞ Z
X
k 1 f (ζ)
f (z) = ak (z − z0 ) where ak = dζ
k=0
2πi γ (ζ − z0 )k+1
|z − z0 | s
Note that = < 1. This allows us to introduce the series as a convergent geometric
|ζ − z0 | r
series: ∞ k
1 1 X z − z0
= .
ζ −z ζ − z0 k=0 ζ − z0
Using this and the expression given by Cauchy’s integral formula above, we are able to write
Z
1 f (ζ)
f (z) = dζ
2πi γ ζ − z
∞ k
f (ζ) X z − z0
Z
1
= dζ
2πi γ ζ − z0 k=0 ζ − z0
∞ Z
1 X k f (ζ)
= (z − z0 ) k+1
dζ.
2πi k=0 γ (ζ − z0 )
for some γ about z0 . We will see below that we can differentiate (and antidifferentiate) power
series, so f (z) is infinitely differentiable on the region of convergence of the power series.
120
11.5. Integration in the Complex Plane Chapter 11. Complex Analysis
Theorem 11.5.6 suggests a powerful connection between power series and holomorphic
functions in the complex plane. In this section we prove that every power series represents
a holomorphic function on its region of convergence and every holomorphic function has a
power series representation on its domain. First, we need a converse to Cauchy’s Theorem
(11.5.1).
Theorem 11.5.8 (Morera’s Theorem). Suppose f (z) is continuous on a domain D and
Z
f (z) dz = 0
γ
by continuity of the power series on its region of convergence. Then Morera’s Theorem says
that f (z) is holomorphic on D.
Now we know that power series are differentiable on their region of convergence. The
next result says that we can differentiate power series term-by-term, just as in the real case.
∞
X
Theorem 11.5.10. Suppose f (z) = ak (z − z0 )k has positive radius of convergence R.
k=0
Then f (z) is differentiable with
∞
X
0
f (z) = kak (z − z0 )k−1
k=1
121
11.5. Integration in the Complex Plane Chapter 11. Complex Analysis
∞
X
Theorem 11.5.11. Suppose f (z) = ak (z − z0 )k has a positive radius of convergence.
k=0
Then
f (k) (z0 )
ak = .
k!
We now turn to the other connection between holomorphic functions and power series.
Well actually, we have already proven (Corollary 11.5.7) that holomorphic functions have
power series representations, which we recall here.
where z0 ∈ D and γ is a simple closed curve lying in D and containing z0 on its interior.
We now define what it means for a function to be analytic on a certain region in the
complex plane.
The following theorem summarizes everything we have learned so far about holomorphic
functions in the complex plane.
Theorem 11.5.14. For a complex function f (z) which is continuous on a domain D, the
following are equivalent:
(1) f (z) is differentiable on some open disk centered at z0 ∈ D, that is, f is holomorphic
at z0 .
(2) The Taylor series expansion of f (z) about z0 converges to f (z) with positive radius of
convergence, i.e. f is analytic.
122
11.5. Integration in the Complex Plane Chapter 11. Complex Analysis
Theorem 11.5.15 (Liouville’s Theorem). If f (z) is entire and there exists a constant M
such that |f (z)| ≤ M for all z ∈ C, then f is a constant function.
Proof. Let z0 ∈ C and take Cr to be the circle centered at z0 with radius r > 0. By
Corollary 11.5.13, Z
0 1 f (ζ)
f (z0 ) = dζ.
2πi Cr (ζ − z0 )2
Parametrize the circle by Cr : z0 + reit , 0 ≤ t ≤ 2π. Then
Z 2π
0 1 f (z0 + reit ) it
f (z0 ) = ire dt
2πi 0 r2 e2it
Z 2π
1 f (z0 + reit )
= dt.
2πr 0 eit
Taking the modulus of both sides and applying the triangle inequality for integrals, we have
Z 2π
f (z0 + reit )
0 1
|f (z0 )| ≤ dt
2πr 0 eit
Z 2π
1 |f (z0 + reit )|
= dt
2πr 0 |eit |
Z 2π
1
≤ M dt.
2πr 0
123
11.6. Singularities and the Residue Theorem Chapter 11. Complex Analysis
for all z in its domain D. This is highly useful, but when f (z) is not analytic on a domain
D we still want a way of representing f as a series. This motivates the introduction and
application of Laurent series:
Definition. A Laurent series is a series expansion of a function f (z) about a point z0 not
in the domain of f in terms of two infinite power series, a positive and negative one:
∞
X ∞
X X
f (z) = ak (z − z0 )k + bk (z − z0 )−k = ck (z − z0 )k .
k=0 k=1 k∈Z
Remark. A Laurent series converges if and only if both the positive and negative series
converge. Absolute and uniform convergence are defined analagously. Notice that any Taylor
series is a Laurent series whose negative part vanishes.
The positive series has some radius convergence R1 , that is, the series converges on the region
1
{z ∈ C : |z − z0 | < R1 }. Similarly, the negative series is just a power series in z−z 0
so it
1 1 1
has radius of convergence R2 , i.e. it converges when |z−z0 | < R2 . This can be written as the
complement of a closed disk, {z ∈ C : |z − z0 | > R2 }. Thus we see that the Laurent series
is convergent on an annular region {z ∈ C : R2 < |z − z0 | < R1 } (as long as R2 < R1 ).
By Theorem 11.5.9, the Laurent series represents an analytic function f (z) on the region
D = {z ∈ C : R2 < |z − z0 | < R1 }. This is made explicit in the next theorem.
Proof. Apply Cauchy’s Theorem (11.5.1) and related results to both series.
124
11.6. Singularities and the Residue Theorem Chapter 11. Complex Analysis
Remark. By the definition of their coefficients in terms of the integrals above, Laurent
series expansions are unique.
Laurent series give us a way to deal with ‘holes’ in the domain of a function which is
otherwise holomorphic on the region. Such functions have a special name:
A singularity is the name we give to a ‘hole’ in the domain of a complex function. Below
we describe the three different types of singularities a function may have.
Proposition 11.6.2. Let z0 be an isolated singularity of f (z) and suppose f (z) has a Laurent
series expansion
X∞ X∞
f (z) = n
an (z − z0 ) + bn (z − z0 )−n
n=0 n=1
(a) z0 is a removable singularity if and only if bn = 0 for all n and there is a function g,
(
f (z) z = 6 z0
g(z) =
a0 z = z0 ,
(b) z0 is a pole of f (z) if and only if all but a finite number of the bn vanish. Specifically,
if bn = 0 for all n > m then z0 is a pole of order m and f can be written
∞
bm bm−1 b1 X
f (z) = + + . . . + + an (z − z0 )n .
(z − z0 )m (z − z0 )m−1 z − z0 n=0
125
11.6. Singularities and the Residue Theorem Chapter 11. Complex Analysis
(c) z0 is an essential singularity if and only if infinitely many of the bn are nonzero.
We saw there is a connection between the coefficients of the negative part of the Lau-
rent series of a function and contour integrals of the function about its singularities. The
coefficient b1 in a Laurent series is of particular importance, so much so that it has a special
name.
where C : |z − z0 | = r for some 0 < r < R, the radius of convergence of the Laurent series
for f . This is in turn equal to the b1 coefficient of the Laurent series.
There is a nice formula for the residues of removable singularities and poles.
1 dm−1
Res(f ; z0 ) = lim m−1 (z − z0 )m f (z).
(m − 1)! z→z0 dz
Proof. (a) follows from Cauchy’s Theorem (11.5.1), and (b) is a simple application of Taylor’s
Theorem to the series ∞
X
(z − z0 )m f (z) = cn (z − z0 )n+m .
n=−m
The formula for Res(f ; z0 ) follows from the identification of the residue and b1 .
Proposition 11.6.4. Suppose f and g are analytic on |z − z0 | < r for some z0 ∈ C and
r > 0, and suppose g(z0 ) = 0 but g 0 (z0 ) 6= 0. Then
f f (z0 )
Res ; z0 = 0 .
g g (z0 )
Proof. Let g(z) have the following power series centered at z0 (by assumption the series has
no c0 coefficient):
∞
X ∞
X
k
g(z) = ck (z − z0 ) = (z − z0 ) ak (z − z0 )k
k=1 k=0
where ak = ck−1 ; call the analytic function represented by this new series h(z). Note that
h(z0 ) = c1 6= 0, so
f (z) f (z)
=
g(z) (z − z0 )h(z)
126
11.6. Singularities and the Residue Theorem Chapter 11. Complex Analysis
and fh is analytic at z0 . Using the definition of residue in terms of the Laurent series coeffi-
cients, the residue of fg is equal to the constant term of the series for fh (the n = −1 term of
the series for fg ). This is computed to be fh(z
(z0 )
0)
, but by the way we defined h, h(z0 ) = g 0 (z0 ).
Hence
f f (z0 )
Res ; z0 = 0 .
g g (z0 )
We finally arrive at the central theorem in basic complex analysis: the Residue Theorem.
Theorem 11.6.5 (The Residue Theorem). Suppose f (z) is meromorphic on a region D; let
z1 , . . . , zn be the isolated singularties of f inside D. If γ is a piecewise smooth, positively
oriented, simple closed curve lying in D that does not pass through any of the zi then
Z n
X
f (z) dz = 2πi Res(f ; zi ).
γ i=1
Proof. Draw a positively-oriented circle Ci around each singularity zi such that zi is the only
singularity of f on its interior. The case where n = 3 is illustrated below.
z2 z1
z3
Then γ is contractible to a curve γ 0 which connects the Ci together and otherwise contains
no singularities on its interior. Such a contraction is shown in the next figure.
z2 z1
γ0
z3
127
11.6. Singularities and the Residue Theorem Chapter 11. Complex Analysis
Z Z n Z
X
Then f (z) dz = f (z) dz + f (z) dz but by construction, f (z) is holomorphic on
γ γ0 i=1 Ci
the interior of γ 0 , so by Cauchy’s Theorem (11.5.1) this part equals 0. Evaluate the remaining
terms using the definition of residue to produce the main summation formula:
Z n Z
X n
X
f (z) dz = f (z) dz = 2πi Res(f ; zi ).
γ i=1 Ci i=1
128
Chapter 12
129
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
1 1
≤ ζ(s) ≤ 1 +
s−1 s−1
1 ≤ (s − 1)ζ(s) ≤ s,
by the characterization of simple poles, evaluated using the formula above. So the residue
of the zeta function at s = 1 is 1.
which converges absolutely for Re(s) > −1. And I(s) = P(s), which is analytic everywhere
on C except negative integers. Define the Gamma function Γ(s) = I(s − 1). Then we
substitute t = nx to obtain
Z ∞
Γ(s) = e−nx (nx)s−1 n dx
Z0 ∞
Γ(s)
= e−nx xs−1 dx.
ns 0
130
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
Note here that ns appears, so we want to sum over all n to get our hands on the zeta function.
Doing so yields
∞ ∞ Z
X Γ(s) X ∞ −nx s−1
= e x dx.
n=1
ns n=1 0
By Fubini’s Theorem, we can switch the summation and integral if the absolute value of the
right side is finite. So consider
XN Z ∞ Z ∞
−nx s−1
|e x | dx = |e−nx xs−1 | dx since finite sums swap order
n=1 0 0
Z ∞
∞X
≤ |e−nx xs−1 | dx,
0 n=1
which we want to show exists. Look at |e−nx xs−1 | where s ∈ C with Re(s) > 0, which
becomes
|e−nx xs−1 | = |e−nx | |xs−1 |
= e−nx |xRe(s)−1 |
= e−nx xRe(s)−1 .
Then we have
∞
X ∞
X
|e−nx xs−1 | = e−nx xRe(s)−1
n=1 n=1
∞
X
=x Re(s)−1
e−nx
n=1
e−x
Re(s)−1
=x by geometric series
1 − e−x
xRe(s)−1
= .
ex − 1
Finally,
∞
∞X ∞
xRe(s)−1
Z Z
−nx s−1
|e x | dx = dx
0 n=1 0 ex − 1
1 ∞
xRe(s)−1 xRe(s)−1
Z Z
= dx + dx
0 ex − 1 1 ex − 1
1 ∞
xRe(s)−1
Z Z
1 Re(s)−1 −x
≤ dx + x e dx.
0 x 1 2
131
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
ex
(The first integral is due to ex − 1 ≥ 1 for 0 ≤ x ≤ 1, and the second is because ex − 1 >
2
for x ≥ 1.) Note that this is integrable, so we can swap the integral and summation above,
giving:
Z ∞X ∞ Z ∞ s−1
−nx s−1 x
ζ(s)Γ(s) = e x dx = x−1
dx
0 n=1 0 e
by geometric series.
(−z)s dz
Z
Next, define F(s) = lim , where γε,δ is the given contour:
ε,δ→0 γε,δ ez − 1 z
II
I
δ ε
III
For part I, the parametrization z = x + iε, δ 0 ≤ x < ∞, x → ∞ makes the integral become
Z δ0 −πi Z ∞ Z ∞ s−1
(e (x + iε))s dx −πis (x + iε)s−1 ε→0 −πis x
x+iε
= −e x+iε
dx −−→ −e x
dx.
∞ e −1 x + iε δ0 e −1 δ0 e − 1
Similarly, the parametrization z = x − iε, δ 0 ≤ x < ∞, x → ∞ makes part III look like
Z ∞ πi Z ∞ Z ∞ s−1
(e (x − iε))s dx πis (x − iε)s−1 ε→0 πis x
x−iε
= e x−iε
dx −−→ e x
dx.
δ0 e − 1 x − iε δ0 e −1 δ0 e − 1
Now for part II, we want the integral to vanishe as δ → 0. The parametrization z = δeiθ ,
τ ≤ θ ≤ 2π − τ , where τ is arbitrarily small, gives us
Z 2π−τ
(−z)s dz (−δeiθ )s iδeiθ dθ
Z
=
z
II e − 1 z τ eδeiθ − 1 δeiθ
2π−τ
(−δeiθ )s
Z
=i dθ.
τ eδeiθ − 1
On the whole path, we have the following bounds:
132
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
iθ
|eδe − 1| = |eσ+iκ − 1| where δeiθ = σ + iκ
≥ |e−δ − 1| since eσ+iκ is closest to 1 when θ = π, giving σ = −δ, κ = 0
δ
≥ since δ is small.
2
Z 2π−τ
s−1
= 2iδ dθ
τ
∞
xs−1
Z
πis −πis
= (e −e ) dx + 0
0 ex − 1
= 2i sin(πs)Γ(s − 1)ζ(s).
This holds for all Re(s) > 1. Next, solve for the zeta function:
(−z)s dz
Z
1
ζ(s) = .
2i sin(πs)Γ(s) γ ez − 1 z
πs
We proved for homework that sin(πs) = , so
P(s)P(−s)
1 P(s)P(−s) P(−s)
= =
2i sin(πs)P(s − 1) 2iπsP(s − 1) 2πi
since P(s) = sP(s − 1). Thus the functional equation for the zeta function is:
Γ(1 − s) (−z)s dz
Z
ζ(s) = z
Re(s) > 1
2πi γ e −1 z
which is an analytic continuation to the entire complex plane minus s = 1. Note that P(−s)
is defined everywhere except positive integers. But ζ(s) is defined at these points. More-
over, the functional equation for ζ(s) covers the rest of the complex plane, namely Re(s) ≤ 1,
133
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
s 6= 1, so we have values for ζ(s) everywhere except s = 1. Since the functional equation is
analytic around s = 1, we see that s = 1 is a simple pole.
What happens to the functional equation for Re(s) < 0? P(−s) (and Γ(1 − s)) are
well-defined, so we will examine the integral part. Consider the contour Dn :
Dn
2π (n + 1/2)
poles
Since we cut out z = 0, the only poles occur when ez − 1 = 0 ⇒ ez = 1 ⇒ z = 2πik for
integers 0 < |k| ≤ n. We calculate the residue at z = 2πik by
(−z)s
Res ; 2πik = g(2πik)
(ez − 1)z
(−z)s g(z)
where z
= . Then apply L’Hôpital’s Rule:
(e − 1)z z − 2πik
z − 2πik 1
lim = lim = 1.
z→2πik ez − 1 z→2πik ez
Thus we obtain
(−z)s
(z − 2πik) z
z 6= 2πik
g(z) = (e − 1)z
s
(−2πik)
= −(−2πik)s−1 z = 2πik.
−2πik
134
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
Hence the residue at z = 2πik is −(−2πik)s−1 . We can plug this into the integration formula,
which gives us
n
(−z)s
X X
s−1 s−1
2πi Res ; α = −2πi (2πik) + (−2πik)
α∈D
(ez − 1)z k=1
n
n
s−1 s−1 s−1
X
= −2πi(2π) i + (−i) k s−1
k=1
π Xn
1
= −2πi(2π)s−1 2 sin s 1−s
.
2 k=1
k
Consider as n → ∞,
(−z)s dz (−z)s dz (−z)s dz
Z Z Z
z
= − z
+ z
.
Dn e − 1 z γn e − 1 z |z|=2π(n+1/2) e − 1 z
(−z)s dz
Z
Claim. As n → ∞, z
−→ 0.
|z|=2π(n+1/2) e − 1 z
z z 1
1
Proof. Consider e − 1 on |z| = 2π(n + 1/2). By work in class, |e − 1| ≥ 2
⇒ z
≤ 2.
e − 1
Also,
(−z)s
Re(s−1)
1
z = (2π(n + /2))
−→ 0 as n → ∞.
This gives us
(−z)s dz
Z
≤ 2π(n + 1/2)2π ·2 · (2π(n + 1/2))Re(s−1)
z
e −1 z
|z|=2π(n+1/2)
length of path
= 4π (2π(n + 1/2))Re(s) .
(−z)s dz
π Z
s−1
Hence as n → ∞, 2πi(2π) · 2 sin s ζ(1 − s) = z
. Then the functional
2 γ e −1 z
equation for Re(s) < 0 looks like
π
ζ(s) = P(−s)(2π)s−1 · 2 sin s ζ(1 − s).
2
Both ζ(s) and its functional equation are analytic everywhere except s = 1. Since they are
analytic continuations of each other, the functional equation will continue to match ζ(s)
everywhere (except s = 1).
To rephrase things slightly with an eye towards the functional equations derived in
Part VI, define s
−s/2
ξ(s) = π Γ ζ(s),
2
which is sometimes called the completed zeta function. We will prove:
135
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
1
Theorem 12.1.2. ξ(s) = + g(s) for some g(s) which is analytic on C.
s(s − 1)
As above, the analytic continuation of ξ(s) to the whole complex plane follows easily.
The key idea in the proof of Theorem 12.1.2 is to study ξ(s) and Γ(s) using the Fourier
transform. Recall that for a complex-valued function f ∈ L1 (R), the Fourier transform of f
is defined by Z
ˆ
f (y) = f (x)e−2πixy dx.
R
These ideas will be critical in Part VI.
2
So it’s enough to show that R e−π(x+iy) dx = 1. Now the change of variables u = x + iy
R
gives us Z Z
−π(x+iy)2 2
e dx = e−πu du.
R iy+R
−πu2
Since e is an entire function and decays rapidly as | Re(u)| gets large, the contour integral
along the vertical pieces in the contour
iy + R
tend to 0 as they move outward, and thus the integrals along R and along iy + R are equal.
Then by a standard computation,
Z Z
−πu2 2
e du = e−πu du = 1.
iy+R R
136
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
2
In other words, the function f (x) = e−πx is a fixed point of the Fourier transform
operator. By the same proof, we also have:
Proposition 12.1.4. For any a > 0, fa (x) = e−πx a satisfies fˆ(y) = √1a f a1 .
2
Since f is Schwartz, Fubini’s theorem allows us to swap the order of integration and sum-
mation:
XZ 1 X Z n+1 Z
−2πikx −2πikx
ak = f (x + n)e dx = f (x)e dx = f (x)e−2πikx dx.
n∈Z 0 n∈Z n R
(In the last step we use periodicity.) Thus ak = fˆ(k) where fˆ is the Fourier transform of f .
Now since F is analytic (it is even Schwartz), it equals its Fourier series on R:
X X
F (x) = ak e2πikx = fˆ(k)e2πikx .
k∈Z k∈Z
s Z ∞ 2
Γ = xs/2−1 e−πn x (πn2 )s/2 dx
2 0
Z ∞
2
= (πn )2 s/2
xs/2−1 e−πn x dx.
0
137
12.1. The Functional Equation Chapter 12. Zeta Functions and L-Series
s Z ∞
−s/2 −s 2
So π Γ n = xs/2−1 e−πn x dx. We next sum over n ∈ N to get the zeta function
2 0
involved on the left side:
s ∞ Z ∞
2
X
−s/2
π Γ ζ(s) = xs/2−1 e−πn x dx
2 n=1 0
Z ∞X ∞
2 2
= xs/2−1 e−πn x dx by Fubini’s theorem, since e−πn x is Schwartz
0 n=1
Z ∞ ∞
2
X
= x s/2−1
ω(x) dx where ω(x) = e−πn x .
0 n=1
2
Let θ(x) = n∈Z e−πn x . Then since e\ −πn2 x = √1 e−πn2 /x by Proposition 12.1.4, we have
P
x
1 1
θ(x) = x θ x by Poisson summation. Now θ(x) = 1 + 2ω(x) so we get a similar functional
√
√ √
equation for ω: for all x 6= 0, ω x1 = − 12 + 12 x + xω(x). We use this to evaluate the
above integral:
Z ∞ Z 1 Z ∞
s/2−1 s/2−1
x ω(x) dx = x ω(x) dx + xs/2−1 ω(x) dx
0
Z0 1 1
Z ∞
1−s/2 1 −1 1
= x ω 2
dx + xs/2−1 ω(x) dx after x 7→
x x x
Z∞∞ Z ∞1
1
= x−1−s/2 ω dx + xs/2−1 ω(x) dx
x
Z1 ∞ 1
√ √
−1−s/2 1 1 s/2−1
= x − + x + xω(x) + x ω(x) dx
1 2 2
Z ∞
1 1 s−1
=− + + x− 2 ω(x) + xs/2−1 ω(x) dx
s s−1 1
Z ∞
1 1−s
dx
= + x 2 + xs/2 ω(x) .
s(s − 1) 1 x
Corollary 12.1.6. ξ(s) = ξ(1 − s). Moreover, ξ(s) is meromorphic with only simple poles
at s = 0, 1.
138
12.2. Finding the Zeros Chapter 12. Zeta Functions and L-Series
When n = 0, both sides have zero factors so we can’t get any information about ζ(1 + n)
(which is good). On the other hand, we can write
n+1
(−1) 2(2π)n+1 bn+1
ζ(1 + n) =
2(n + 1)!
n+1
π
since P(n) = n! and (−1) 2 will give us the correct values of sin 2
(−n) by above.
139
12.2. Finding the Zeros Chapter 12. Zeta Functions and L-Series
The Riemann Hypothesis. Every nontrivial zero of ζ(s) lies on the critical line Re(s) = 12 .
(4) Consider
1 Y
1 − p−s
=
ζ(s) p prime
X (−1)k
= where k = # primes dividing n
n square-free
ns
∞
X µ(n)
=
n=1
ns
140
12.3. Sketch of the Prime Number Theorem Chapter 12. Zeta Functions and L-Series
Then we have
X log(x) X log(x) X
ψ(x) = log(p) ≤ log(p) = log(x) = π(x) log(x).
p≤x
log(p) p≤x
log(p) p≤x
ψ(x)
Therefore if lim exists and is equal to 1, we have
x→∞ x
141
12.3. Sketch of the Prime Number Theorem Chapter 12. Zeta Functions and L-Series
∞
−ζ 0 (s)
Z
ψ(x)
Claim. =s dx.
ζ(s) 1 xs+1
Proof. Consider Λ(n) = ψ(n) − ψ(n − 1). Then
N N
X Λ(n) X ψ(n) − ψ(n − 1) ψ(1)
= +
n=1
ns n=2
ns 1
goes to 0
N −1
ψ(N ) X 1 1
= − − s ψ(n)
Ns n=2
(n + 1) s n
N −1 Z n+1
ψ(N ) X s
= + ψ(n) dx
Ns n=2 n xs+1
Z N
ψ(N ) ψ(x)
= +s dx.
Ns 2 xs+1
ψ(N )
Chebyshev showed that ψ(x) = O(x) so −→ 0 as N → ∞ (if Re(s) > 1), and ψ(x) = 0
Ns
for 1 ≤ x ≤ 2. Thus we have proven the claim.
By Mellin Inversion,
a+i∞
−ζ 0 (s) xs
Z
1
ψ(x) = ds for some a ∈ R, a > 1
2πi a−i∞ ζ(s) s
where ρ are all the zeros of ζ(s). This is as far as Von Mangoldt got.
Thus if the Riemann Hypothesis holds, it would give the smallest possible error term for our
ψ(x) approximation above. Once we have ψ(x) ∼ x, the PNT follows.
142
12.4. Dirichlet Series Chapter 12. Zeta Functions and L-Series
Note that since (Z/mZ)× is a finite group for all m ∈ Z+ , χ([n]) is a root of unity for all
congruence classes [n] ∈ (Z/mZ)× . In other words, a Dirichlet character is a multiplicative
homomorphism from (Z/mZ)× to the circle group S 1 ⊂ C.
Example 12.4.1. The trivial character mod m, which takes every [n] ∈ (Z/mZ)× to 1 (and
every other integer to 0), is called the principal Dirichlet character, denoted χ0 . For
instance, the principal character mod 3 maps
1 →
7 1 4 →
7 1
2 →7 1 5 →7 1
3 → 7 0 6 → 7 0
Definition. For a Dirichlet character χ, we define a complex-valued function
∞
X χ(n)
L(s, χ) =
n=1
ns
Notice that for any m > 1, L(s, χ0 ) differs from ζ(s) only by factors 1−p1 −s for p | m. Recall
from Section 12.1 that ζ(s) extends to a meromorphic function on the half-plane Re(s) > 0
and satisfies
1
ζ(s) = + g(s)
1−s
143
12.4. Dirichlet Series Chapter 12. Zeta Functions and L-Series
and suppose there exist real numbers a, b > 0 such that |s(x)| ≤ axb for all x ≥ 1. Then
s(x)
lim(s − 1)L(s, χ) = lim .
s→1 x→∞ x
Generalized Riemann Hypothesis. For any Dirichlet L-function, L(s, χ) = 0 if and only
if Re(s) = 21 or s = −2n for n ∈ Z+ .
Shows that Z∗p has a primitive root of size c(log(p))6 for some uniform constant c.
144
Part III
145
Chapter 13
Introduction
Part III follows a course on algebraic number theory taught by Dr. Andrew Obus at the
University of Virginia in Spring 2016. The main topics covered are:
Hensel’s Lemma
Ramification theory
The main companion for the course is Neukirch’s Algebraic Number Theory. Other great ref-
erences include Cassels and Frohlich’s Algebraic Number Theory, Janusz’s Algebraic Number
Fields, Lang’s Algebraic Number Theory, Marcus’s Number Fields and Weil’s Basic Number
Theory.
146
13.1. Attempting Fermat’s Last Theorem Chapter 13. Introduction
In attempting to prove the theorem, we first remark that the n = 4 case is elementary;
it’s just a matter of parametrizing the Pythagorean triples (x, y, z) that solve x2 + y 2 = z 2
and noticing that not all three can be perfect squares. With this, we can reduce to the case
when n = p, an odd prime. There are two cases:
We will show a proof for the first few primes in Case 1; the other case uses similar tech-
niques. Let ζ be a primitive pth root of unity (e.g. ζ = e2πi/p ) and assume Z[ζ] is a unique
factorization domain (UFD). This was the classical approach, but number theorists quickly
realized that Z[ζ] is not always a UFD. In fact, it is an open question whether there are an
infinite number of primes p for which Z[e2πi/p ] is a UFD.
In any case, the assumption that Z[ζ] is a UFD holds for p < 23 so we will have proven
a number of cases of Fermat’s Last Theorem with the following proof.
Proof. Suppose x, y, z are positive integers satisfying xp + y p = z p . We may assume x, y, z
are relatively prime in Z. The equation above may be factored as
p
Y
(x + ζ i y) = z p (∗)
i=1
For p = 3, the only cubes mod 9 are ±1 and 0 so there are no solutions for (*) where 3 - xyz.
So we may assume p ≥ 5. We need the following lemmas:
p−1
Y
Lemma 13.1.1. p = (1 − ζ i ).
i=1
tp −1
Proof. Consider expanding t−1
in two ways:
tp − 1
(t − ζ) · (t − ζ p−1 ) = = tp−1 + . . . + t + 1.
t−1
Then plugging in t = 1 gives the result.
Lemma 13.1.2. For any 0 ≤ i < j ≤ p − 1, the elements x + ζ i y and x + ζ j y are coprime
in Z[ζ].
147
13.1. Attempting Fermat’s Last Theorem Chapter 13. Introduction
Proof. Suppose that π ∈ Z[ζ] is a prime which divides x + ζ i y and x + ζ j y. Then π divides
ζ i y(1 − ζ j−i ). Notice that ζ i is a unit and p - y by assumption, but 1 − ζ j−i | p. So in
particular, π | y and thus π | yp. Since π is a prime, π | y or π | p. Repeating the argument
for x shows that π | x or π | p. Since x and y are coprime in Z, we cannot have π | x and
π | y simultaneously, so π | p. By assumption we have that π divides xp + y p and therefore
also z p in Z, but (p, z) = 1 so the Euclidean algorithm implies that π | 1. Therefore x + ζ i y
and x + ζ j y are relatively prime in Z[ζ].
Now, each factor x + ζ i y must be a pth power in Z[ζ], possibly multiplied by a unit.
Write x + ζy = utp for u ∈ Z[ζ]∗ and t ∈ Z[ζ].
Lemma 13.1.3. u/ū is a pth root of unity.
Proof. It is simple to show that u/ū and all of its Galois conjugates have modulus 1 in C;
this is then true for all powers of u/ū as well. Then the degree of u/ū and all of its powers
is bounded. Since all of these are algebraic integers, there are only finitely many possible
choices for their minimal polynomials. Hence the set {(u/ū)k : k ∈ N} is finite. This proves
u/ū is a root of unity in Z[ζ]. In particular, (u/ū)2p = 1 but we want to show it is a pth root
of unity. Suppose (u/ū)p = −1. Then up = −ūp . Since u ∈ Z[ζ] we may write
u = a0 + a1 ζ + a2 ζ 2 + . . . + ap−2 ζ p−2
In particular, up is conjugate to a real number mod p. Likewise, we can write −ū as −ū =
−(a0 + a1 ζ p−1 + . . . + ap−2 ζ 2 ) so
148
13.1. Attempting Fermat’s Last Theorem Chapter 13. Introduction
Thus p−1 i
Q
i=2 (1 − ζ ) divides x − y but since x − y ∈ Z, it must be that p | (x − y). Rearranging
the equation xp + y p = z p to read xp + (−z)p = y p and repeating the argument so far shows
that p | (x + z) as well. Thus y ≡ x ≡ −z (mod p). But then
0 = xp + y p − z p ≡ 3xp (mod p)
149
Chapter 14
150
14.1. Integral Extensions of Rings Chapter 14. Algebraic Number Fields
Definition. The integral closure of A in B is the set of all x ∈ B which are integral over
A. If A is equal to its integral closure in B then we say A is integrally closed in B. In
particular, if A is a domain and B is the fraction field of A then we simply say that A is
integrally closed.
Lemma 14.1.1. x ∈ B is integral over A if and only if A[x] is a finitely generated A-module.
Pn−1 i
Proof. ( =⇒ ) If xn + an−1 xn−1 + . . . + a0 for ai ∈ A then xn ∈ M := i=1 Ax which is a
m
finitely generated A-module. By induction, for all m ≥ n, x ∈ M . This implies A[x] = M ,
so in particular A[x] is finitely generated.
( ⇒= ) Suppose A[x] is generated by f1 (x), . . . , fn (x) where fi are polynomials in a single
variable over A. Let d ≥ max{deg fi }ni=1 . Then
n
X
d
x = ai fi (x)
i=1
Pn
for some choice of ai ∈ A. This shows that x is a root of the polynomial td − i=1 ai fi (t) so
x is integral over A.
Proof. It suffices to prove that the integral closure Ā is closed under the addition and mul-
tiplication of B. If x, y ∈ Ā, Lemma 14.1.1 shows A[x, y] is finitely generated. This implies
that the submodules A[x + y] and A[xy] are also finitely generated, so x + y, xy ∈ Ā. Hence
Ā is a ring.
Let A ⊂ B be a subring. We will make use of the following facts about integral extensions
of rings:
Suppose C ⊇ B ⊇ A are all rings. If C is integral over B and B is integral over A then
C is integral over A.
The two most important objects in global algebraic number theory are defined next.
151
14.1. Integral Extensions of Rings Chapter 14. Algebraic Number Fields
Definition. For a number field K ⊃ Q, the integral closure of Z in K is called the ring of
integers of K, written OK .
Examples.
3 For a prime p, the cyclotomic field K = Q(ζp ) = Q(e2πi/p ) has ring of integers OK =
Z[ζp ].
It turns out that OK is always a free Z-module of rank [K : Q]. Thus we can think of
OK as a lattice embedded in the vector space K.
152
14.2. Norm and Trace Chapter 14. Algebraic Number Fields
Definition. The norm of x is the element NL/K (x) = det Tx ∈ K, where Tx : L → L is the
K-linear map Tx (`) = x`.
Note that the norm and trace are defined for any finite extension L/K, not just number
fields. We will often drop the subscript and write N (x) and Tr(x) when the extension is
understood.
Proof. Assume σi (x) 6= σj (x) when i 6= j. A basis of L/K is 1, x, . . . , xn−1 and the matrix
for Tx in this basis is
0 ··· −a0
0 0
1 0 ··· 0 −a1
.
0 1 .. 0 −a2
. .. . . .. ..
.. . . . .
0 0 ··· 1 −an−1
where f (x) = a0 + a1 x + . . . + an xn is the minimal polynomial of x over K. In this case f is
also the characteristic polynomial of x, so by linear algebra, Tr(x) is equal to the sum of the
roots of f and N (x) is equal to the product of the roots of f . This implies the result.
√
Example 14.2.3. Let K = Q( d) for d a squarefree√integer √ (this means d = ±p1 p2 · · · pr in
its prime factorization). Then an element x = a + b d ∈ Q( d) has norm N (x) = a2 − b2 d
and trace Tr(x) = 2a.
153
14.3. The Discriminant Chapter 14. Algebraic Number Fields
Proposition 14.3.1. Let A = [TrL/K (αi αj )]. Then dL/K (α1 , . . . , αn ) = det A. In particular,
dL/K (α1 , . . . , αn ) lies in K.
Proof. By Theorem 14.2.2, TrL/K (αi αj ) = nk=1 σk (αi )σk (αj ). Thus A = BC, where
P
Taking the determinant gives us det A = (det B)(det C) = (det C)2 = dL/K (α1 , . . . , αn ).
One case of interest is when L = K(α) is a simple extension and {1, α, α2 , . . . , αn−1 } is a
basis for L as a K-vector space. Then the discriminant of α is defined to be
Llet f be the minimal polynomial of β over K, setting deg f = m. Then the discriminant
of f is
D(f ) = (−1)m(m−1)/2 NL/K (f 0 (β)).
Lemma 14.3.2. For any algebraic element α over K, dL/K (α) equals the discriminant of
the minimal polynomial of α.
Proof. Set L = K(α) and let αi = σi (α) for each embedding σi : L ,→ K. Then
1 α1 · · · α1n−1
1 α2 · · · αn−1
2
dL/K (α) = det .. .. . .
..
. . . .
n−1
1 αn · · · αn
Since K(α)/K is separable, dL/K (α) 6= 0. In fact, the product formula above is precisely the
discriminant of f , the minimal polynomial of α over K.
154
14.3. The Discriminant Chapter 14. Algebraic Number Fields
γ = f 0 (β) = nβ n−1 + a.
nβ −1 (β n + aβ + b) = 0
nβ n−1 + na + nbβ −1 = 0
or γ = −(n − 1)a − nbβ −1 .
is monic with degree n, this must be the minimal polynomial of γ. Moreover, N (γ) is just
(−1)n times the constant term of g, so N (γ) = nn bn−1 + (−1)n−1 (n − 1)n−1 an .
This suggests the following.
Proposition 14.3.4. Let f (x) = xn + ax + b with f (β) = 0 for some β ∈ K. Then
Both determinants on the right are nonzero, so det(σi (αj )) 6= 0 which implies finally that
dL/K (α1 , . . . , αn ) 6= 0 by the definition of disciminant.
155
14.3. The Discriminant Chapter 14. Algebraic Number Fields
The proof of Proposition 14.3.5 gives us the following useful formula: If A, B are two
K-bases for L with change of basis matrix A, then
dL/K (A) = (det A)2 dL/K (B).
√
Example 14.3.6.√Take our favourite example, K = Q( d) over Q, where d is a squarefree
integer. Then {1, d} is a basis for L, and its discriminant is
√ 2
√ √
1 √d
dK/Q (1, d) = det = (−2 d)2 = 4d.
1 − d
This matches the fact that the discriminant of x2 − d is 4d.
Suppose A ⊆ K is integrally closed with fraction field K. Let B be the integral closure
of A in L. Observe that if x ∈ B then all conjugates of x in K are integral over K. Thus
NL/K (x) ∈ A and TrL/K (x) ∈ A since A is integrally closed.
Lemma 14.3.7. If x ∈ B × then NL/K (x) ∈ A× .
Proof. By Lemma 14.2.1, NL/K is a homomorphism of groups.
Lemma 14.3.8. Suppose α1 , . . . , αn ∈ B form a K-basis of L. Let d = dL/K (α1 , . . . , αn ).
Then dB ⊆ Aα1 + . . . + Aαn .
Proof. Let a1 , . . . , an ∈ K such that α := ni=1 ai αi ∈ B. Then (a1 , . . . , an ) is a solution to
P
the system of linear equations
n
X
TrL/K (αi α) = TrL/K (αi αj )xj , 1 ≤ i ≤ n.
j=1
The matrix corresponding to this system has determinant d by Proposition 14.3.1. Thus
each aj can be written as d1 times an A-linear combination of Tr(αi α). Since αi , α ∈ B,
Tr(αi α) ∈ A so dαj ∈ A for each j. Thus
n
X
dα = daj αj ∈ Aα1 + . . . + Aαn .
j=1
156
14.3. The Discriminant Chapter 14. Algebraic Number Fields
Example 14.3.12. Let ζ be a primitive pr th root of unity and let K = Q(ζ). We know
that [K : Q] = ϕ(pr ) = pr−1 (p − 1). Set n = ϕ(pr ). We will show that OK = Z[ζ] for every
prime power pr . First, we compute the discriminant dK = dK/Q (ζ). Let f (x) be the minimal
polynomial of ζ over Q. We may write this in two ways:
r
xp − 1 r−1 r
f (x) = pr−1 or (xp − 1)f (x) = xp − 1.
x −1
r−1 r−1 −1 r −1
Differentiating the second expression gives us f 0 (x)(xp − 1) + f (x)(pr−1 xp ) = p r xp .
Then plugging in ζ and solving for f 0 (ζ) produces
pr ζ r−1
f 0 (ζ) = .
ζ pr−1 − 1
Take the norm of this expression:
N (pr )
N (f 0 (ζ)) = ± p r−1 = ±pa for some a ∈ Z.
N (ζ − 1)
Thus by Lemma 14.3.2, dK/Q (ζ) = ±pa for some a ∈ Z, where a ≤ ϕ(pr )pr .
It turns out that it’s easier to work with 1 − ζ in this example. In general this creates
no obstacles, since dK/Q (1 − ζ) = dK/Q (ζ). In our case, we observe that
Y Y
dK/Q (1 − ζ) = (1 − σi (ζ) − (1 − σj (ζ)))2 = (σi (ζ) − σj (ζ))2 = dK/Q (ζ).
1≤i<j≤n 1≤i<j≤n
Thus dK/Q (1 − ζ) = ±pa . To proceed, we need the following generalization of Lemma 13.1.1.
157
14.3. The Discriminant Chapter 14. Algebraic Number Fields
Y
Lemma 14.3.13. (1 − ζ k ) = p.
p-k
1≤k≤pr
Proof. Consider
r
xp − 1 r−1 r−1 r−1
f (x) = pr−1 = 1 + xp + x2p + . . . + x(p−1)p .
x −1
Plugging in x = 1 gives the result.
Observe that for any two k1 , k2 ∈ N not divisible by p,
1 − ζ k1
∈ Z[ζ].
1 − ζ k2
1−ζ k1
Then by symmetry, 1−ζ k2 is a unit in Z[ζ] for all such k1 , k2 . We will now show OK = Z[1−ζ].
Consider the basis {1, 1 − ζ, (1 − ζ)2 , . . . , (1 − ζ)n−1 } for K/Q. If x ∈ OK , we can write x in
the following manner by Lemma 14.3.8:
n−1
X bi
x= a
(1 − ζ)i for bi ∈ Z,
i=0
p
using the fact that dK/Q (1 − ζ) = ±pa . If pbai ∈ Z for each i, then we’re done. If not, multiply
by some pc so that all pbai pc ∈ p1 Z but not all of them lie in Z. Note that pc x ∈ OK , so we
may replace x with pc x and write
n−1
X bi
x= (1 − ζ)i , bi ∈ Z.
i=0
p
Suppose x 6∈ Z[1 − ζ]. Subtracting off the terms where p | bi if necessary, we may assume
bi = 0 whenever p | bi . Let j be the smallest index with p - bj . Then
n−1
X bi
x= (1 − ζ)j , p - bj .
i=j
p
p n
The element (1−ζ) j+1 lies in Z[1 − ζ] since j + 1 ≤ n and (1 − ζ) | p by Lemma 14.3.13.
p
Therefore we may multiply the expression for x by (1−ζ)j+1 to obtain
bj
x= + (terms in Z[1 − ζ]).
1−ζ
bn bn
bj j bj
Note that N 1−ζ
= N (1−ζ)
= pj is not divisible by p, Thus N 1−ζ 6∈ Z but this con-
bj
tradicts the fact that 1−ζ
∈ OK . Hence x ∈ Z[1 − ζ] which finally proves the claim that
OK = Z[1 − ζ] = Z[ζ].
158
14.3. The Discriminant Chapter 14. Algebraic Number Fields
The following theorem allows us to generalize Example 14.3.12 to all Q(ζ) where ζ is a
primitive nth root of unity.
Theorem 14.3.14. Let A be an integrally closed integral domain with field of fractions K
and suppose L/K and M/K are finite separable extensions with ω1 , . . . , ωn an integral basis
for L with respect to A and α1 , . . . , αm an integral basis for M with respect to A. Further
suppose dL/K (ω1 , . . . , ωn ) and dM/K (α1 , . . . , αm ) are relatively prime in A. Then {ωi αj } is
an integral basis for the compositum LM over A and
Proof. Factor m = pa11 · · · par r . Then Q(ζm ) = Q(ζpa1 1 ) · · · Q(ζpar r ). Moreover, for distinct
Q
primes p 6= q, dQ(ζp )/Q and dQ(ζq )/Q are relatively prime. Therefore by Theorem 14.3.14, the
ring of integers of Q(ζm ) is
159
14.4. Factorization of Ideals Chapter 14. Algebraic Number Fields
In fact, the exact objects we are looking for are prime ideals in OK . In order to describe a
unique factorization into prime ideals, recall that for ideals I, J ⊂ A, their ideal product is
( n )
X
IJ = xi yi : xi ∈ I, yi ∈ J .
i=1
160
14.4. Factorization of Ideals Chapter 14. Algebraic Number Fields
J −1 := {x ∈ K | xJ ⊆ A}.
161
14.4. Factorization of Ideals Chapter 14. Algebraic Number Fields
xIp−1 −1 −1 −1
1 · · · pr ⊆ p1 · · · pr = J.
162
14.4. Factorization of Ideals Chapter 14. Algebraic Number Fields
Definition. For a Dedekind domain A, let JA denote the group of fractional ideals of
A. The ideal class group of A is defined as the quotient group
CA = JA /PA
Clearly |CA | = 1 if and only if A is a PID (and therefore a UFD), so the ideal class group
is a direct measure of the failure of unique factorization in A. Moreover, the ideal class group
corresponds to an exact sequence of groups
1 → A× → K × → JA → CA → 1.
We will study this further when we characterize the unit group K × in Section 14.10.
One of the most important results in algebraic number theory is the following theorem,
which we will prove in Section 14.9.
163
14.5. Ramification Chapter 14. Algebraic Number Fields
14.5 Ramification
In this section let L/K be a finite separable field extension, let OK be a Dedekind domain
with field of fractions K and let OL be the integral closure of OK in L. Put n = [L : K].
Lemma 14.5.1. OL is a Dedekind domain.
Proof. This is the same proof as for Theorem 14.4.3.
Lemma 14.5.2. If p ⊂ OK is a prime ideal then pOL 6= OL .
Proof. Take x ∈ p−1 r OK , which exists by Lemma 14.4.6. Then xp ⊆ OK so xpOL ⊆ OL .
If pOL = OL then we have xpOL = xOL ( OL , a contradiction. Therefore pOL 6= OL .
Now fix a nonzero prime ideal p ⊂ OK . By Theorem 14.4.2, p considered as an ideal of
OL has a unique factorization
pOL = Pe11 · · · Pegg
where the Pi ⊂ OL are distinct primes and each ei > 1. Note that for each i, OL /Pi is a
finite dimensional OK /p-vector space. (This follows from the fact that Pi ∩ OK = p.) We
say the Pi are the primes of OL lying over p. By unique factorization, these are the only
primes lying over p.
Definition. For a prime Pi in the factorization of pOL , the index fi = [OL /Pi : OK /p] is
called the inertial degree of Pi (over p) and the exponent ei is called the ramification
index of Pi (over p). We say the prime p is totally split if ei = fi = 1 for all 1 ≤ i ≤ g;
p is totally ramified if g = 1 and f1 = 1; and p is inert if g = 1 and e1 = 1.
Definition. If any ei > 1 or (OL /Pi )/(OK /p) is inseparable, we say the prime p is ramified
(in OL ). Otherwise p is unramified.
Example 14.5.3. In Z[i], (2) = (1 + i)2 so (2) ramifies with e1 = 2. By contrast, (3) is inert
in Q(i) with residue field Z[i]/(3) ∼
= F9 , and (5) = (2 + i)(2 − i) is unramified.
Qg ei
Theorem
Pg 14.5.4. For any prime p ⊂ O K with prime factorization pO L = i=1 Pi , we
have i=1 ei fi = n = [L : K].
Proof. By the Chinese remainder theorem (the version of Theorem 3.2.10 for rings), we can
write g g
ei ∼
Y M
OL /pOL = OL / Pi = OL /Pei i .
i=1 i=1
To prove the theorem, we show that [OL /pOL : OK /p] = n and [OL /Pei i : OK /p] = ei fi for
each 1 ≤ i ≤ g. For the first equality, take {ω 1 , . . . , ω m } to be a basis for OL /pOL as an
OK /p-vector space. Lift these elements to ω1 , . . . , ωm ∈ OL . Suppose
a1 ω1 + . . . + am ωm = 0 for coefficients ai ∈ OK .
Let a = (a1 , . . . , am ) ⊂ OK and let x ∈ a−1 r ap; such an element exists by Lemma 14.4.7.
Then xai ∈ OK for all i, but xai 6∈ p for some i. Replacing ai with xai and reducing mod
164
14.5. Ramification Chapter 14. Algebraic Number Fields
Let θ ∈ OL be a primitive element of L/K, that is, L = K(θ). It is not always guaranteed
that OK [θ] = OL . However, we have a way of measuring how far off from the whole ring OL
the submodule OK [θ] really is.
Definition. The conductor of the extension L/K is the ideal
f := {α ∈ OK | αOK ⊆ OK [θ]} ⊂ OK
where L = K(θ).
Example 14.5.5. If OK [θ] = OL , then f = (1).
√ √
Example 14.5.6. For K = Q and L = Q( −3), the conductor is f = (2, 1 + −3).
Note that f is always nonzero.
Theorem 14.5.7. Let L/K be a finite separable extension with L = K(θ). Suppose p ⊂ OK
is prime and pOL + f = (1), where f is the conductor of the extension L/K. Let ϕ(x) be the
minimal polynomial of θ over K. If ϕ(x) factors completely in (OK /p)[x] as
ϕ(x) = ϕ1 (x)e1 · · · ϕg (x)eg mod p
with deg ϕi = fi , then the factorization of p in OL is pOL = gi=1 Pei i where for each
Q
i, Pi is a prime ideal with ramification index ei and inertia degree fi , given explicitly by
Pi = ϕi (θ)OL + pOL for any lift ϕi (x) of ϕi (x) in OK [x].
165
14.5. Ramification Chapter 14. Algebraic Number Fields
= O0 /pO0 ∼
OL /pOL ∼ = (OK /p)[x]/ϕ(x)
where again ϕ(x) is the minimal polynomial of θ over K. Clearly O0 ⊆ OL so we have a map
O0 /pO0 → OL /pOL . By assumption, pOL + f = OL but f ⊆ OL so we have pOL + O0 = OL .
Hence the map is surjective. On the other hand, pO0 ⊆ pOL ∩ O0 and
This proves injectivity, so the first isomorphism is proven. The second isomorphism is im-
mediate from the fact that
O0 /pO0 ∼
= OK [x]/(ϕ(x), p) ∼
= (OK /p)[x]/ϕ(x).
The prime ideals on the right are just the idealsQ(ϕi (x)). Set R = (OK /p)[x]/ϕ(x) and
notice that [R/(ϕi (x)) : OK /p] = fi = deg ϕi and gi=1 ϕi (x)ei = 0. The primes in OL /pOL
corresponding
Qg under the above isomorphism to the ϕi (θ) are Pi := ϕi (θ)OL + pOL . Notice
ei
that i=1 Pi ⊆ pOL , but since
g g
X Y
dimOK /p OL /pOL = ei fi = dimOK /p OL / Pei i ,
i=1 i=1
Qg
we have i=1 Pei i = pOL . This proves the theorem.
Example 14.5.8. Let OK = Z[i] be the Gaussian integers. Here the conductor is f = (1).
Consider how x2 + 1 splits mod 13:
Then by Theorem 14.5.7, the ideal (13) splits in Z[i] in the following way:
Proof. We know dL/K (θ) = i<j (θi − θj )2 where θi are all the K-embeddings of θ in K.
Q
Since p + f = (1), p splits in OL based on how ϕ splits mod p, where ϕ is the minimal
polynomial of θ over K. Explicitly, Theorem 14.5.7 tells us that p is unramified if and only
166
14.5. Ramification Chapter 14. Algebraic Number Fields
if there are no repeated factors in the factorization of ϕ mod p and ϕ mod p is separable.
This is equivalent to all the roots of ϕ mod p having multiplicity 1 in k̄p , which in turn is
equivalent to θi 6= θj mod p for all i 6= j. Now consider
Y
θi 6= θj for all i 6= j ⇐⇒ (θi − θj ) is relatively prime to p in OK
i6=j
167
14.5. Ramification Chapter 14. Algebraic Number Fields
Note that this solves Fermat’s theorem characterizing primes of the form x2 + y 2 . It will
be a continuing theme in these notes to fully characterize primes of the form x2 + ny 2 for all
integers n.
the factorization we obtain from Theorem 14.5.7 is 23OK = pq2 where p 6= q and both are
prime. In general, how do we know that q ramifies but p doesn’t? This type of information
cannot be detected by the discriminant alone. However, the different will provide an answer
(see Section 15.10).
We now discuss Hilbert’s program for ramification theory. Assume that L/K is Galois
G = Gal(L/K). Note that σ(OL ) = OL for all σ ∈ G. If p ⊂ OK is a prime and
and let Q
pOL = gi=1 Pei i , then each σ ∈ G acts on the primes lying over p: σ(Pi ) = Pj for some
1 ≤ j ≤ g. The key observation is that this action is transitive.
Proposition 14.5.13. For any prime p ⊂ OK , G = Gal(L/K) acts transitively on the set
of primes of OL lying over p.
Proof. Suppose not. Then there is some pair of primes Pi , Pj lying over p such that σPj 6=
Pi for all σ ∈ G. By the Chinese remainder theorem, pick x ∈ Pj such that x ≡ 1 (mod
Q σPi )
for all σ ∈ G. Then NL/K (x) ∈ Pj ∩ OK = p. On the other hand, NL/K (x) = σ∈G σ(x)
but σ(x) 6∈ Pi for any σ, so NL/K (x) 6∈ p. This is impossible, so there is some σ ∈ G such
that σPj = Pi .
Corollary 14.5.14. When L/K is Galois, for any prime p ⊂ OK , all ramification indices
ei and all inertia degrees fi for primes over p are equal, and therefore [L : K] = ef g, where
e = ei and f = fi for any prime Pi | p.
Proof. An ideal Pνi divides pOL if and only if σPνi divides pOL for all σ ∈ G, which by
Proposition 14.5.13 is equivalent to Pνj dividing pOL for all 1 ≤ j ≤ g. Therefore the
ramification indices are all equal; let e denote any one of them. Now given 1 ≤ i, j ≤ g,
suppose σ ∈ G is a permutation taking Pj to Pi , that is, Pi = σPj . Then σ determines an
isomorphism OL /Pj → OL /Pi . Therefore fi = fj . Let
Pg f denote any of the inertial degrees.
Then finally, by Theorem 14.5.4 we have [L : K] = i=1 ef = ef g.
Fix a prime P ⊂ OL lying over p.
Clearly by the orbit-stabilizer theorem, |DP | = ef where e and f are the ramification
index and inertia degree of p, respectively. By Galois theory, there is a field extension ZP /K
corresponding to the subgroup DP ≤ G, which is explicitly the fixed field ZP = LDP .
168
14.5. Ramification Chapter 14. Algebraic Number Fields
L
DP
ZP G
Lemma 14.5.15. If σP = P0 for two primes P, P0 lying over p, then DP0 = σDP σ −1 for
some σ ∈ G.
Proof. This is a more general fact about the stabilizers of a transitive group action. Note
that for σ, τ ∈ Gal(L/K),
τ −1 στ ∈ DP ⇐⇒ τ −1 στ P = P ⇐⇒ στ P = τ P ⇐⇒ σ ∈ Dτ P
L P
e f
ZP a
1 1
K p
169
14.5. Ramification Chapter 14. Algebraic Number Fields
ϕ : DP −→ Aut(`P /kp )
σ 7−→ ϕσ .
Proof. By Proposition 14.5.17, ka = kp for any prime ideal a in the ring of integers of the
decomposition field, so we can replace K with Z = ZP and G with DP . Thus P is the
only prime lying over p. Take θ̄ ∈ `P and let θ ∈ OL be any lift, with minimal polynomials
ḡ(x) ∈ kp [x] and f (x) ∈ K[x], respectively. Certainly f¯(θ̄) = 0 mod p so ḡ | f¯ in kp [x].
Since L/K is normal (it is a Galois extension), f splits over L. This implies f¯ splits over `P ,
so ḡ splits as well. This proves `P /kp is a normal extension.
Now choose θ̄ generating the separable closure of kp in `P . Let σ̄ ∈ Aut(`P /kp ). Then
σ̄ θ̄ is a root of ḡ and thus of f¯. Since f splits in L, there exists a root α ∈ L of f such
that ᾱ = σ̄ θ̄ in `P . Choose σ ∈ G = DP such that σθ = α, which is possible since L/K is
normal. Then ϕ(σ) = σ̄ because θ̄ generates the separable closure of kp in `P . This proves
ϕ is surjective.
Definition. The fixed field TP = LIP is called the inertia field of P (over p).
1 → IP → DP → Gal(`P /kp ) → 1
is exact. Moreover, |IP | = e and |DP | = ef , where e and f are the ramification index and
inertial degree, respectively, for L/K.
170
14.5. Ramification Chapter 14. Algebraic Number Fields
L P
e 1
TP b
1 f
ZP a
1 1
K p
Therefore it suffices to prove the former statement, that is, `P = OT /b. The decomposi-
tion/inertia group exact sequence for the extension L/T is
171
14.6. Cyclotomic Fields and Quadratic Reciprocity Chapter 14. Algebraic Number Fields
172
14.6. Cyclotomic Fields and Quadratic Reciprocity Chapter 14. Algebraic Number Fields
Proof. Fix a prime p and set m = n/pν(p) so that n = pν(p) m. Consider the number field
K = Q(ζn ). We know the conductor of ζn in OK is f = 1. Let γn be the nth cyclotomic
polynomial and let {αi } be the primitive pν(p) th roots of unity and {βj } be the primitive
mth roots of unity. Then by the Chinese remainder theorem,
(Z/nZ)× ∼
= (Z/pν(p) Z)× × (Z/mZ)×
so we can write Y
γn (x) = (x − αi βj ).
i,j
Note that all the αi are 1 in any field of characteristic p. Thus, modulo p,
ν(p) ν(p)
Y
γn (x) ≡ (x − βj )ϕ(p ) = γm (x)ϕ(p ) .
j
This allows us to reduce to the case when m = n, that is, the case when pν(p) = 1. Let γ̄m (x)
denote the factorization of γm (x) mod p. Since xm − 1 is separable over Fp (m is relatively
prime to p) and γ̄m (x) | xm − 1, we have that γ̄m (x) is also separable over Fp . The smallest
extension of Fp containing a primitive mth root of unity (and thus all of them) is Fpfp . Thus
γ̄m splits over Fpfp and each irreducible factor of γ̄m over Fp is the minimal polynomial of
some primitive mth root of unity, each of which having degree fp . This implies γ̄m is a
product of degree fp irreducible polynomials over Fp . By Theorem 14.5.7, we have
ν(p) )
pR = (p1 · · · pr )ϕ(p .
ϕ(n)
Remark. In general, Theorem 14.5.14 implies that r = .
ϕ(pν(p) )fp
Corollary 14.6.5. An odd prime integer p is ramified in Q(ζn )/Q if and only if p | n, and
p = 2 is ramified if and only if 4 | n.
p
Lemma 14.6.6. If q is an odd prime integer, then (−1)(q−1)/2 q ∈ Q(ζq ).
Proof. Set
X a
τ= ζqa .
×
q
a∈(Z/qZ)
173
14.6. Cyclotomic Fields and Quadratic Reciprocity Chapter 14. Algebraic Number Fields
Proof. Let p and q be distinct odd primes and set q ∗ = (−1)(q−1)/2 q. Consider the tower of
√
number fields Q(ζq ) ⊃ Q( q ∗ ) ⊃ Q, with Galois groups as shown:
Q(ζq )
Z/((q − 1)/2)Z
√ ∗
Z/(q − 1)Z Q( q )
Z/2Z
Q
∗
q
Then we determine the reciprocity law for p
as follows:
q∗ √
= 1 ⇐⇒ (p) splits in Q( q ∗ ) by Proposition 14.6.1
p
√
⇐⇒ Q( q ∗ ) ⊆ ZP , the decomposition field for any prime P over (p)
⇐⇒ there exist an even number of primes in Z[ζq ] lying over (p)
q−1
⇐⇒ is even, where fp is the multiplicative order of p mod q
fp
q−1
⇐⇒ fp divides
2
q−1
⇐⇒ p 2 ≡ 1 (mod q)
p
⇐⇒ = 1.
q
174
14.7. Lattices Chapter 14. Algebraic Number Fields
14.7 Lattices
One perspective on rings of algebraic integers is to view them as lattices. For example, Z[i]
is very clearly a lattice in C spanned by the vectors 1 and i. We will show that any ring
of integers OK in a number field K/Q is a lattice in some Rn . This is the beginning of
Minkowski’s so-called theory of geometry of numbers.
Definition. A Z-module Γ ⊆ Rn is a lattice of rank m if Γ = Zv1 + ldots + Zvm for
R-linearly independent vectors v1 , . . . , vm . If m = n then we say Γ is a complete lattice,
or has full rank in Rn .
Definition. For a lattice Γ ⊆ Rn , the set
Φ = {x1 v1 + . . . + xm vm | 0 ≤ xi < 1}
is called the fundamental domain of Γ, also sometimes called the fundamental paral-
lelopiped.
Observe that Γ is a complete lattice in Rn if and only if Γ + Φ = Rn .
Definition. A subgroup W ⊆ Rn is said to be discrete if every point in W is open in
the topology on Rn , that is, if every point x ∈ W has a neighborhood U in Rn such that
U ∩ W = {x}.
Proposition 14.7.1. If Γ ⊆ Rn is a subgroup, then Γ is discrete if and only if Γ is a lattice.
Proposition 14.7.2. If Γ ⊆ Rn is a lattice, then Γ is complete if and only if there exists a
bounded set M such that Γ + M = Rn .
Proof. ( =⇒ ) When Γ is complete, M = Φ works.
( ⇒= ) If Γ is not complete, let V ( Rn be the R-span of Γ. Then V lies in some
hyperplane H in Rn . Choose d > 0. Then for any bounded set of diameter diam(M ) < d,
all points further than d from H do not lie in Γ + M ⊆ H + M . Hence Γ + M 6= Rn .
Definition. If Γ = Zv1 + . . . Zvn is a complete lattice in Rn , we define the volume of Γ to
be the volume of the parallelopiped spanned by v1 , . . . , vn :
| |
vol(Γ) := vol(Φ) = | det A| where A = v1 · · · vn .
| |
Note that since det(AT A) = (det A)2 , we can write the volume formula as
q
vol(Γ) = det(vi vj ).
175
14.7. Lattices Chapter 14. Algebraic Number Fields
1 = vol(Γ)
X
1
≥ vol 2
X +γ ∩Φ
γ∈Γ
X
vol (Φ − γ) ∩ 21 X
=
γ∈Γ
1
since Φ + Γ = Rn
= vol 2
X
> 1,
1 1
a contradiction. Therefore there exist such γ1 , γ2 ∈ Γ. Now take x ∈ 2
X + γ1 ∩ 2
X + γ2 .
Then for some x1 , x2 ∈ X, we have
x = 12 x1 + γ1 = 12 x2 + γ2 =⇒ γ1 − γ2 = 21 (x2 − x1 ),
which is just the midpoint of the line between x2 and −x1 . By convexity and central-
symmetry, this implies γ1 − γ2 ∈ X, but since γ1 =
6 γ2 we have found a nonzero lattice point
in X.
Remark. Note that the inequality in Minkowski’s theorem must be sharp, for if Γ = Zn ,
then vol(Γ) = 1, whereas the centrally-symmetric, convex set
The four squares theorem is a famous result in number theory which was proven by
Lagrange in 1770, well over 100 years before Minkowski’s theorem was discovered. Here we
provide a neat proof of the four squares theorem using Minkowski’s geometry of numbers
arguments.
Theorem 14.7.4 (Four Squares). Every positive integer is the sum of the squares of four
integers.
176
14.7. Lattices Chapter 14. Algebraic Number Fields
x2 + y 2 + 1 ≡ 0 (mod p).
implies one of them must be shared. This shows x2 + y 2 + 1 ≡ 0 (mod p) has a solution in
integers.
Fix one of these solutions, say (x, y), and consider the lattice Λ ⊂ Z4 consisting of
(a, b, c, d) such that
c ≡ ax + by and d ≡ bx − ay (mod p).
Then Z4 ⊃ Λ ⊃ pZ4 and Λ/pZ4 is a two-dimensional subspace of F4p since once we pick
a and b, the c and d are determined. Thus Λ has index p2 in Z4 so µ(D) = p2 for D a
fundamental parallelopiped for Λ. Let T be a closed ball about the origin with radius r.
Then µ(T ) = 21 π 2 r4 so we may choose r such that
This gives us µ(T ) > 16µ(D) so by Minkowski’s theorem there exists a nonzero point
(a, b, c, d) in T ∩ (Λ r {0}). This means
a2 + b2 + c2 + d2 < 2p.
177
14.8. Norms of Ideals Chapter 14. Algebraic Number Fields
N (P) = pf
To distinguish this norm from a similar norm to be defined shortly, we will sometimes
refer to N as the ideal norm. If the norm is taken with respective to an extension L/K, we
write NL/K but when the context is clear we will often drop the decoration.
Remark. By the properties of inertial degree f , it is easy to see that for a tower M ⊃ L ⊃ K,
Next we check that the properties discussed above hold for the norm we have defined.
(b) If L/K is Galois and P ⊂ OL is any nonzero prime ideal with p = P ∩ OK and
pOL = (P1 · · · Pg )e , then
Y
N (P) = (P1 · · · Pg )ef = σ(P).
σ∈Gal(L/K)
178
14.8. Norms of Ideals Chapter 14. Algebraic Number Fields
(c) For any nonzero element β ∈ OL , N (β)OK = N (βOL ), where N denotes the regular
field norm.
Proof. (a) It suffices to prove this for prime ideals, for which we have
Y P
N (pOL ) = N Pei i = p ei fi = pm
using Theorem 14.5.4.
(b) Since N (Pi ) = pf for any prime Pi in the prime factorization of pOL , the left equality
is clear. Recall that G = Gal(L/K) acts transitively on the set Spec(p) = {P1 , . . . , Pg }.
Then by the Orbit-Stabilizer Theorem, each Pi occurs
| Gal(L/K)| m
= = ef
|Spec(p)| g
times in the collection {σ(P) | σ ∈ G}, which implies the right equality.
(c) First suppose L/K is Galois. Denote βOL by b. The map IK → IL given by
a 7→ aOL is injective since IK and IL are free on nonzero prime ideals, so it suffices to show
that N (β)OL = N (b). But by (b),
!
Y Y Y
N (b) = σ(b) = (σ(β)OL ) = σ(β) OL = N (β)OL .
σ∈G σ∈G σ∈G
179
14.8. Norms of Ideals Chapter 14. Algebraic Number Fields
Notice that the ideal norm is defined for any extension L/K and outputs an ideal of OK .
On the other hand, the numerical norm is defined on K/Q and outputs an integer in Z. The
connection between the two norms is described in the next proposition.
(a) For any ideal a ⊂ OK , NK/Q (a) = (N(a)) and therefore N(ab) = N(a)N(b).
Y
[OK : a] = [OK : pei i ].
Y ef
We previously proved that [OK : pei i ] = piei fi , thus [OK : a] = (pi i i ) = NK/Q (a). When
we identify the set of nonzero ideals of Z with the set of positive integer generators, N and
N are seen to coincide, and multiplicativity of N follows from the same property of the ideal
norm.
(b) We can multiply by some integer d to make a and b integral ideals. Then part (a)
gives us
[OK : db] N(db)
[a : b] = [da : db] = = = N(a−1 b).
[OK : da] N(da)
180
14.9. The Class Group Chapter 14. Algebraic Number Fields
x x⊗1 (τ (x))τ .
F F
∼
=
K ⊗Q C KC
Observe that Tr ◦j : K → R is just equal to the field trace, TrK/Q , as defined in Section 14.2.
Likewise, the norm map
N : KC −→ C
KR −→ R
Y
(xτ )τ 7−→ xτ
τ ∈T
181
14.9. The Class Group Chapter 14. Algebraic Number Fields
f : KR −→ Rr+2s = Rn
(x1 , . . . , xr , y1 , ȳ1 , . . . , ys , ȳs ) 7−→ (x1 , . . . , xr , Re(y1 ), Im(y1 ), . . . , Re(ys ), Im(ys )).
It is sometimes more useful to think of KR as Rr+2s in this way. There is a standard Hermitian
inner product on KC , which restricts to an inner product on KR called the Minkowski inner
product. In Rn , this corresponds to the canonical real inner product:
For K a number field with ring of integers OK , let JK = JOK be the group of fractional
ideals, PK = POK the subgroup of principal fractional ideals and let CK = JK /PK be the
class group. Our goal is to prove that CK is a finite group.
Lemma 14.9.1. Ideal norm is multiplicative. That is, for any nonzero ideals a, b ⊂ OK ,
N (ab) = N (a)N (b).
Proof. If a and b are relatively prime, this follows from the Chinese remainder theorem.
Thus it suffices to show that N (pa ) = N (p)a for every prime p ⊂ OK and exponent a ≥ 0.
By considering the filtration of OK by powers of p, we have
Lemma 14.9.2. Given any constant c > 0, there exist only finitely many ideals a ⊂ OK
with norm N (a) < c.
Proof. By Lemma 14.9.1, it suffices to prove this statement for prime ideals. For each prime
integer p ∈ Z, Theorem 14.4.2 implies that there are only finitely many prime ideals p ⊂ OK
lying over (p). For each of these p, we have N (p) = pf for some f – in fact, this f is the
residue degree of p/(p) as defined in Section 14.5. Therefore any prime ideal p with N (p) < c
must lie above a prime p ∈ Z such that pf < c. There are only finitely many of these, so we
are done.
182
14.9. The Class Group Chapter 14. Algebraic Number Fields
Proof. It is routine to prove that j(a) is a lattice – in fact, it suffices to show OK is a lattice
since a is a discrete subgroup. Now if α1 , . . . , αn is a Z-basis for a and T = {τ1 , . . . , τn } is
the set of embeddings K ,→ C, then
This implies the formula and in particular vol(Γ) > 0 so j(a) must be a complete lattice.
Lemma 14.9.4. For any nonzero ideal a ⊂ OK , let cτ > 0 for each τ ∈ T be such that
s
Y 2 p
cτ > |dK |N (a).
τ ∈T
π
Then there exists some α ∈ a r {0} such that |τ (α)| < cτ for all τ ∈ T .
Proof. Define X = {(zτ )τ ∈ KR : |zτ | < cτ for each τ ∈ T }. Then it is easy to verify that X
is centrally-symmetric and convex. Viewing X in Rn via the isomorphism f : KR → Rn , we
see that its image is
f (X) = {(xτ )τ ∈ Rn : |xτ | < cτ for τ ∈ TR and x2τ + x2τ̄ < c2τ for τ ∈ TC }
Therefore by Minkowski’s theorem (14.7.3), f (X) contains a nonzero lattice point of j(a).
Let α be the corresponding nonzero point in a. Then it is clear α satisfies the desired
condition.
183
14.9. The Class Group Chapter 14. Algebraic Number Fields
Theorem 14.9.5. For any nonzero ideal a ⊂ OK , there exists a nonzero element α ∈ a
such that s
2 p
|NK/Q (α)| ≤ |dK |N (a).
π
Proof. By Theorem 14.2.2, for any α ∈ OK we have
Y
|NK/Q (α)| = |τ (α)|.
τ ∈T
then by Lemma 14.9.4, there exists a nonzero α ∈ a such that |τ (α)| < cτ for all τ . That is,
s
Y 2 p
|τ (α)| < |dK |N (a) + ε.
τ ∈T
π
Letting ε → 0, the fact that |NK/Q (α)| ∈ N0 implies that α ∈ a may be chosen such that
s
2 p
|NK/Q (α)| ≤ |dK |N (a).
π
Corollary 14.9.6. For any number field K/Q, the class group CK is finite.
s p
Proof. It suffices to show every ideal class in CK contains an ideal of norm at most π2 |dK |,
since then Lemma 14.9.2 says there are a finite number of these. Fix a class C ∈ CK and
pick fractional ideal a ∈ C such that a−1 ⊂ OK is an ideal. By Theorem 14.9.5, there exists
α ∈ a−1 such that s
2 p
N ((α)) = |NK/Q (α)| < |dK |N (a−1 ).
π
Note that αa−1 ⊆ OK . Since norm is multiplicative (Lemma 14.9.1), we have
184
14.9. The Class Group Chapter 14. Algebraic Number Fields
Proof. First note that if K/Q has discriminant dK satisfying |dK | ≤ d and n = [K : Q] ≤ N ,
then K(i)/Q has discriminant |dK(i) | ≤ (4d)n and [K(i) : Q] ≤ 2N , so we are free to assume
i ∈ K. In particular, we may assume all embeddings of K into C are complex. Fix one
of these, τ0 : K ,→ C. Let X ⊆ KR be the set of all (zτ ) ∈ KR satisfying the following
conditions:
√
Im(zτ0 ) < C d for some constant C;
Re(zτ0 ) < 1;
Corollary 14.9.9. For any d > 0, there are finitely many number fields K/Q of discriminant
|dK | ≤ d.
n n/2
Proof. Define the sequence an = nn! π4 . Then
n
an+1 π 1/2 1 π 1/2
= 1+ −→ e > 1 as n → ∞
an 4 n 4
so the sequence (an ) increases geometrically. But by Proposition 14.9.8, |dK | ≥ an so there
can only be finitely many number fields K of bounded discriminant.
185
14.9. The Class Group Chapter 14. Algebraic Number Fields
Proof. Let (an ) be the sequence defined in the proof of Corollary 14.9.9. For all n ≥ 2,
an > 1 so |dK | > 1 by Proposition 14.9.8.
Definition. For a number field K, the finite number hK = |CK | is called the class number
of K.
The preferred setting for algebraic number theory is obviously when the class number is
1, since then OK is a PID and thus a UFD. However, having class number 1 is a substantial
restriction on number fields. For example,
√ Heegner (and others later) proved that the only
imaginary quadratic number fields Q( d), where d < 0 is squarefree, with class number 1
are for
d = −1, −2, −3, −7, −11, −19, −43, −67, −163.
For real quadratic number fields, the situation is wide open. It is conjectured that there are
infinitely many real quadratic fields of class number 1, but this remains unsolved.
Example 14.9.12. Let K = Q(i). Then n = 2, s = 1 and |dK | = 4 so the Minkowski bound
is 1
2! 4 √ 4
2
4 = < 2.
2 π π
Thus every fractional ideal is equivalent to an ideal of norm 1. Since the only ideal of norm
1 is (1), every ideal is principal. Hence hK = 1, which reflects the fact that Z[i] is a PID.
√
Example 14.9.13. We will compute the class group of K = Q( −5). Here, dK = −20
since −5 ≡ −1 (mod 4) so the Minkowski bound in Corollary 14.9.6 becomes
1
2 √
N (a) ≤ 20 ≈ 2.84 < 3.
π
In particular every ideal class in CK has an ideal with norm 1 or 2. Thus any nonprincipal
class contains some ideal lying over (2). Notice that x2 + 5 ≡ (x + 1)2 (mod 2), so by
Theorem 14.5.7, √
(2) = (2, 1 + −5)2 = p2 .
√ √
Further, p = (2, 1 + −5) is not principal because there is no element α = a + b −5 with
norm N (α) = a2 + 5b2 = 2. So we deduce that CK = h[p]i ∼ = Z/2Z.
√ √
Example 14.9.14. Let K = Q( 10) with OK = Z[ 10]. Then n = 2, s = 0 and |dK | = 40,
so the Minkowski bound is
0
2! 4 √ 1 √ √
40 = · 2 10 = 10 < 4.
22 π 2
The proof of Corollary 14.9.6 implies that every ideal class has an integral representative
with norm 1, 2 or 3. We will use the techniques in Section 14.5 to compute the class group.
186
14.9. The Class Group Chapter 14. Algebraic Number Fields
√
The ideal 2OK is ramified in OK and we see that 2OK √ = (2, 10)2 . If this were a
principal ideal, we would have 2OK = (α) for some α = a + b 10 which would have norm
±2. Equivalently the equation a2 − 10b2 = ±2 would have an integer solution. However, √ 0
2 2
and ±1 are the only squares mod 5 so a −10b = ±2 has no integer solutions. Thus (2, 10)
is a nontrivial element in the class group and has order 2 since its square is the principal
ideal 2OK . This shows that 2 | hK .
Next we find integral ideals with norm 3. By Proposition 14.5.9, 3OK splits and we
compute its factorization to be
√ √
3OK = (3, 2 + 10)(3, 4 + 10).
If either of these prime divisors were principal, then x2 − 10y 2 √
= ±3 would have √ integer
solutions. Since it doesn’t for the same reasons as above, (3, 2 + 10) and (3, 4 + 10) are
both nontrivial elements of the class group.
Finally we must√ decide if any√ of these prime ideals belong to the same ideal class in
4+√10 1
C(OK ). Let u = 2+ 10 = 3 (1 + 10). Then
√ √ √ √ √
(3, 2 + 10) · u = (3u, 4 + 10) = (1 + 10, 4 + 10) = (3, 4 + 10)
so the classes with norm 3 are equal. We have shown that everything in C(OK ) is equivalent
to one of √ √
(1) (2, 10) or (3, 2 + 10).
Thus the class group has order ≤ 3 and contains an element of order 2. This implies
|C(OK )| = 2.
√
Example 14.9.15. Let K = Q( −6). Note that n = 2, r = 0, s = 1 and dK = −24 so
1
2! 4 √
BK = 2 24 ≈ 3.1.
2 π
√
Thus C(OK ) is generated by the√prime ideals lying over 2 and 3. Note that OK = Z[ −6]
and the minimal polynomial of −6 over Q is x2 + 6. Factoring this mod 2 and 3, we see
that √ √
p2 = (2, −6) and p3 = (3, −6)
generate the class group. Also, 2 and 3 ramify so 2OK = p22 and 3OK = p23 so each of these
prime ideals has order at most 2 in C(O√K ).
Suppose p2 = (α) for some α = a + b −6 ∈ OK . Then
2 = N(p2 ) = |N (α)| = a2 + 6b2 ,
but a2 + 6b2 = 2 has no integer solutions. Thus p2 is not principal. By a similar argument,
p3 is not principal either. Hence p2 and p3 both belong to classes of order 2 in C(OK ).
Furthermore, observe that
√ √ √ √ √
p2 p3 = (2, −6)(3, −6) = (6, 2 −6, 3 −6) ⊂ ( −6)
√ √ √
but the norms of (6, 2 −6, 3 −6) and ( −6) are both 6, so they must be the same ideal.
Hence p2 p3 is principal so C(OK ) = hp2 i and hK = 2.
187
14.9. The Class Group Chapter 14. Algebraic Number Fields
√ √
Example 14.9.16. Let K = Q( −19) with ring of integers OK = Z[(1 + −19)/2]. Since
n = 2, r = 0, s = 1 and dK = −19, the Minkowski bound for K is
1
2! 4 √
BK = 2 19 ≈ 2.775.
2 π
Since a and b are relatively prime, each pi appears in exactly one of the factorizations for a
and b. So by the above equality, a and b each factor into products of primes whose exponents
are all 3ei . We have not worried about
√ the unit γ yet, but that is because the units in K are
±1, each of√which is a cube in Z[ −2] anyways. Thus we conclude that a and b are both
cubes in Z[ −2]. √ √
Now suppose √ (x, y) is an integer
√ solution to x3 = y 2 + 2 = (y + −2)(y − −2). If d
divides both y + −2 and y − −2, then it divides their difference:
√ √ √
(y + −2) − (y − −2) = 2 −2.
√ √
However −2 is prime in Z[ −2] (norm is multiplicative), so d must divide 2. Suppose x
were even. Then we would have y 2 + 2 ≡ x3 ≡ 0 (mod 8), or y 2 ≡ −2 (mod 8). Of course
−2 is not a square mod 8, so x must√be odd. This √ forces y to be odd as well, so d | y 2 + 2
implies that d must be 1. Hence y + √−2 and y − √−2 are relatively prime. √
By the first part of the proof, y + −2 and y − −2 are both cubes in Z[ −2]. Write
√ √ √
y + −2 = (a + b −2)3 = (a3 − 6ab2 ) + (3a2 b − 2b3 ) −2.
188
14.9. The Class Group Chapter 14. Algebraic Number Fields
We now solve for a and b to show that (3, ±5) are the only valid choices for (x, y). From the
above, we see that 1 = 3a2 b − 2b3 = b(3a2 − 2b2 ). Since a and b are integers, this implies
b = ±1. If b = −1, the other factor is 3a2 + 2 = 1, which can be written 3a2 = −1. This
of course is impossible. So b = 1 and this means 3a2 − 2 = 1 which has solutions a = ±1.
Plugging these values in above, we see that y = ±5 and x = 3.
189
14.10. The Unit Theorem Chapter 14. Algebraic Number Fields
Also set KC× = τ ∈T C× and KR× = KC× ∩ KR . In fact we have an embedding j : K × ,→ KR× .
Q
Let µ(K) be the set of roots of unity in K, i.e. µ(K) = {x ∈ K | xa = 1 for some a ∈ N}.
Define the map
L : KR× −→ Rr+s
(xτ1 , . . . , xτr , xσ1 , x̄σ1 , . . . , xσs , x̄σs ) 7−→ (log |xτ1 |, . . . , log |xτr |, log |xσ1 |2 , . . . , log |xσs |2 ).
j L
K× KR× Rr+s
NK/Q N Tr
log | · |
Q× R× R
commutes.
Proof. This follows from the definitions of the norm and trace maps in Section 14.2 and their
extensions to KC (and KR ) in Section 14.9.
We will prove:
Theorem 14.10.2 (Dirichlet’s Unit Theorem). Let K be a number field of degree n = r +2s.
× ∼ r+s−1
Then OK =Z × µ(K).
Our strategy for proving the unit theorem is to show that Γ is a complete lattice in the
hyperplane H with ker(L ◦ j) = µ(K). The unit theorem will then follow from the theory of
finitely generated modules over Z.
190
14.10. The Unit Theorem Chapter 14. Algebraic Number Fields
× L◦j
1 → µ(K) → OK −−→ Γ → 1.
Proof. Clearly µ(K) ⊆ ker(L ◦ j). Thus it suffices to show that if |τ (x)| = 1 for all τ ∈ T ,
then x ∈ µ(K). First, there exists a bounded domain in KR× containing all the j(x) for
x ∈ OK for which |τ (x)| = 1 for all τ ∈ T . From Proposition 14.9.3, we know that j(OK ) is
a lattice in KR so there can only be finitely many x ∈ OK with |τ (x)| = 1 for all τ . Further,
since for any such x ∈ OK , x, x2 , x3 , . . . all have this property as well, there must be some
m ∈ N such that xm = 1. Therefore x ∈ µ(K).
The proof of Dirichlet’s unit theorem now comes down to showing that Γ ∼
= Zr+s−1 . To
do this, we show that Γ is a complete lattice inside H ∼
= Rr+s−1 .
×
Lemma 14.10.4. Given a ∈ Z, up to multiplication by elements of OK , there are only
finitely many α ∈ OK with NK/Q (α) = a.
Proof. An equivalent statement is that each coset of OK /aOK has at most one element of
of norm a, up to a unit. Suppose α, β ∈ OK are two such elements; that is, β = α + aγ for
some γ ∈ OK . Then
β a N (α)
=1+ γ =1+ γ ∈ OK
α α α
× ×
since N (α)/α ∈ OK . Similarly, αβ ∈ OK so αβ ∈ OK . Thus for some u ∈ OK , α = uβ,
proving the lemma.
Now we prove Theorem 14.10.2.
Proof. We first demonstrate that Γ is a lattice. By Proposition 14.7.1, it’s equivalent to
show that Γ is discrete and to do this, we show the point 0 ∈ Γ is an isolated point, i.e.
every bounded set in H containing 0 contains only finitely many points in Γ. Let X ⊆ H
be such a bounded set. Then L−1 (X) ⊆ S is also bounded, so L−1 (X) is bounded in K R .
Since j(OK ) is a lattice in KR (follows from Proposition 14.9.3), j(OK ) ∩ L−1 (X) is finite.
Applying L, we get that Γ ∩ X is finite, which implies 0 is isolated and hence Γ is a discrete
subgroup. S
To prove Γ is complete, we exhibit a bounded set M ⊆ H such that H = γ∈Γ (M + γ)
and apply Proposition 14.7.2. Since L : SS→ H is surjective, it will be enough to construct
a bounded set B ⊆ S such that S = ε∈O× Bj(ε), where Bj(ε) the translate of B by
K
j(ε). There is a subtlety here: if B ⊆ S is bounded, so is L(B) ⊆ H but only because the
logarithms of the elements in B stay away from 0. Now S ⊆ KR , so for all τ ∈ T , pick cτ > 0
such that cτ̄ = cτ and s
Y 2
C := cτ > sqrt|dK |,
τ ∈T
π
Q
Note that for all y ∈ S, τ ∈T |τ (y)|cτ = C by definition of S. This means that if y = (yτ ) ∈ S
and
Xy = {xy | x ∈ X} = {(zτ )τ ∈ KR : |zτ | < cτ |yτ |}
191
14.10. The Unit Theorem Chapter 14. Algebraic Number Fields
then Xy contains some j(α) for α ∈ OK r {0} by Lemma 14.9.4. Now by Lemma 14.10.4,
there exist elements α1 , . . . , αN ∈ OK such that any α ∈ OK with j(α) ∈ Xy is of the form
×
αi ε for some 1 ≤ i ≤ N, ε ∈ OK . Define
N
[
B=S∩ Xj(αi )−1 .
i=1
It is immediate from the definition of the αi that |NK/Q (αi )| < C, and since X is bounded,
we get that B is bounded. Moreover, if y ∈ S the above shows that Xy −1 contains some j(α)
for α ∈ OK such that |NK/Q (α)| < C. Thus there exists x ∈ X such that xy −1 = j(α), and
hence y = xj(α)−1 so S is covered by these bounded sets B. Hence by the initial comments,
Proposition 14.7.2 implies Γ is a complete lattice.
Finally, by the theory of finitely generated modules over Z, we have OK ∼ = Zr+s−1 ×
(OK )tors , but it is clear by Proposition 14.10.3 that the torsion part of OK is precisely µ(K).
Hence OK ∼ = Zr+s−1 × µ(K) as required.
√ √
Example 14.10.5. Let d > 0 be a squarefree integer, K = Q( d) and take α = a + b d ∈
OK . That is, a, b ∈ Z when d 6≡ 1 (mod 4) and a, b ∈ 21 Z when d ≡ 1 (mod 4). Then
√ ×
√
a + b d ∈ OK ⇐⇒ NK/Q (a + b d) = ±1 ⇐⇒ a2 − b2 d = ±1.
In a real quadratic number field, r = 2, s = 0 and µ(K) = {±1} so Theorem 14.10.2 gives us
×
OK = {±εm | m ∈ Z}
×
for some εOK . (Such an ε is called a fundamental unit of K.) The equation a2 − b2 = ±1 is
known as Pell’s equation, so the unit theorem says that the solutions to Pell’s equation over
Z form a rank 1 abelian group. √ √ √
×
For example, when d = 6 and OK = Z[ 6], 5 + 2 6 is a unit √ in O K with inverse 5 − 2 6.
2 2
√ that 5 − 6 · 2 · 6 = 1 and one can check that 5 + 2 6 is a√fundamental unit for
Notice
Q( 6). Therefore all solutions to a2 − 6b2 = 1 are of the form (5 + 2 6)k for k ∈ Z.
×
Definition. A set of units ε1 , . . . , εr+s−1 ∈ OK such that all units in OK are of the form
ν1 ν r+s−1
ζε1 · · · εr+s−1 for ζ ∈ µ(K) and νi ∈ Z is called a system of fundamental units in K.
×
Definition. For Γ ⊆ H, the complete lattice image of OK under L ◦ j, the volume vol(Γ) is
called the regulator of K.
×
Corollary 14.10.6. If ε1 , . . . , εr+s−1 is a system of fundamental units in OK , then the
regulator of K is √
vol(Γ) = r + s det((L ◦ j(εi ))k )ik .
We next work out an example with cubic fields of negative discriminant, combining
techniques from the last few sections to fully describe the class group of such a field. First
note that since the sign of dK is (−1)s , which implies in this case that r = s = 1, the unit
group consists of all elements of the form ±εm for some fundamental unit ε.
192
14.10. The Unit Theorem Chapter 14. Algebraic Number Fields
Lemma 14.10.7. Let K be a cubic number field with dK < 0 and let ε be the fundamental
unit in UK with ε > 1. Then |dK | < 4ε3 + 24.
Proof. Since ε 6∈ Q we must have K = Q(ε). The two other conjugates must be complex
conjugates, and the product of all three must be +1. Write ε = u2 for u ∈ R, u > 1. Then
the other conjugates of ε can be written as
Set g(x) = −ξx + 2x2 − 1. We are thus seeking a root of3 g with |x| < 1. Note that since
3 −3
−6
u > 1 and ξ = u +u 2
, g(1) = 1 − ξ < 0 and g − 1
2u3
= 4
(u − 1) < 0. Then it appears
that g(x) has one root greater than 1, and that the desired root is less than 2u1 3 .
If x0 is this root, consider
1
x20 > =⇒ u−6 − 4x20 < 0 =⇒ u−6 − 4x−2 −4
0 − 4x0 < 0.
4u6
This yields |D| ≤ 16(ξ 2 − 2ξx0 + x20 )(1 − x20 ). Also note that by the above, we may write
Then
193
14.10. The Unit Theorem Chapter 14. Algebraic Number Fields
Once we know ε it’s easier to compute the class group. It turns out that p = (2, 1 + α)
3
generates the class group, and it’s easy to check that p6 is generated by (α−1)
α+2
so it suffices
2 3
to show that p and p are not principal.
3
First suppose p3 = (γ) for some γ ∈ OK . Then γ 2 = ±αm (α−1)α+2
for some m ∈ Z. This
implies that at least one of the numbers below is a square:
α−1 α−1 α−1 α−1
− α −α .
α+2 α+2 α+2 α+2
Let β be the one that’s a square. If β ∈ OK /q for some prime ideal q, then we should find
that β is still a square mod q. First let q = (29, α − 2). We have
The residue field is OK /q = F29 and under the evaluation homomorphism Z[α] → F29 , α 7→ 2
(mod 29), we see that
α − 1 7→ 1 (α + 2)−1 7→ 22
α + 2 7→ 4 − 1 7→ −1.
α−1
Now 1, 4 and −1 are all squares mod 29, but 22 is not; hence m must be 0. Since α+2 < 0 it
α−1
can’t be a square (in fact it’s non-real) so the only possibility is β = − α+2 . However, if we
look at r = (7, α + 3) and the residue field OK /r = F7 , under the map Z[α] → F7 we have
α 7→ −3 ≡ 4 (mod 7)
α−1 3 1
− 7−→ − = − ≡ −4 ≡ 3 (mod 7).
α+2 6 2
Then 3 is not a square mod 7, so we have eliminated all choices for β and shown that p3 in
fact cannot be principal. By a similar argument, p2 is not principal. After establishing this,
it follows that C(OK ) = Z/6Z.
√
Example 14.10.9. Let K = Q(θ) where θ = 3 11. Then Z[θ] ⊆ OK – in fact Z[θ] is the
whole ring of integers but we won’t need that here. We can compute the discriminant to be
3! 4 √
BK = 3267 ≈ 16.17.
27 π
Thus C(OK ) is generated by the ideal classes with representatives p such that N(p) < 17;
then it suffices to consider the primes lying over p = 2, 3, 5, 7, 11 and 13. Using the techniques
from Section 14.5, we see that
x3 − 11 ≡ (x − 1)(x2 + x + 1) mod 2 so 2OK = p2 p02 with N(p2 ) = 2 and N(p02 ) = 4.
194
14.10. The Unit Theorem Chapter 14. Algebraic Number Fields
x3 − 11 is also irreducible mod 13, so 13OK = p13 is prime as well and N(p13 ) = 2197.
11 is ramified since it divides the discriminant. Then N (θ) = 11 so 11OK = p311 , where
p11 = θOK .
where β is a square mod p, the sign is fixed and d = 0, 1 (since w = ±um ). First consider
p3 = (θ − 2)OK . The map OK → OK /p3 is given by θ 7→ 2. Then
Since −1 is not a square mod 3, the sign must be negative. Next, the trick allows us to
calculate
N (θ + 9) = 740 = 22 · 5 · 37
so (θ + 9)OK is divisible by a prime p37 with norm 37 and residue degree 1. In OK /p37 ∼
= F37 ,
we map θ 7→ −9 and compute
195
Chapter 15
Local Fields
196
15.1. Discrete Valuation Rings Chapter 15. Local Fields
The following definition and proposition explain the where the term discrete valuation
ring comes from.
(1) A is a DVR.
Proof. (i) =⇒ (ii) Since A is a DVR, it is a PID by commutative algebra so each x ∈ A can
be written uniquely as x = uπ n for π generating the maximal ideal m ⊂ A. Define v(x) = n.
Then one verifies v is a discrete valuation on A.
(ii) =⇒ (i) The maximal ideal is m = {x ∈ A | v(x) > 0}. It’s easy to check that A is
local, integrally closed and therefore a DVR.
It is common to extend a valuation v on A to the field of fractions K of A by setting
v(0) = ∞ and v ab = v(a) − v(b) to get a function v : K → Z ∪ {∞}.
Example 15.1.2. Let p be a prime and consider the localization of Z at the prime ideal
(p):
Z(p) = ab ∈ Q : a, b ∈ Z, p - b .
0
Then Z(p) is a DVR with valuation v ab = r if we can write ab = pr ab0 for integers a0 , b0 not
divisible by p.
Example 15.1.3. Let k be a field and consider the polynomial ring k[t]. Localizing at the
maximal ideal (t), we get a discrete valuation ring
n o
C[t](t) = pq ∈ k(t) : p, q ∈ k[t], t - q ,
0
p p
where, much like Example 15.1.2, the valuation is v q
= r if we can write q
= tr pq0 for
polynomials p0 , q 0 ∈ k[t] not divisible by t.
197
15.1. Discrete Valuation Rings Chapter 15. Local Fields
Example 15.1.4. Let k be a field and consider the power series ring k[[t]] with maximal
ideal (t). Then the local ring k[[t]](t) is a DVR with valuation
∞
!
X
v ai ti = min{i ≥ 0 | ai 6= 0}.
i=0
Example 15.1.5. Let Fq be a finite field with q elements and consider the function field
k = Fq (t) in one variable. Then the discrete valuations on k are parametrized by the set of
irreducible monic polynomials f ∈ Fq [t], together with a point at ∞ which corresponds to
the degree valuation:
v∞ hg = deg h − deg g.
Lemma 15.1.6. Let QrA bevi a Dedekind domain and take a nonzero element α ∈ A with
factorization (α) = i=1 pi , with pj prime ideals and vi ≥ 1. Then for any pj ,
v
xApj = pj j Apj
Proof. A prime ideal p determines a local ring Ap which is a discrete valuation ring with
valuation (
vj , if p = pj for (x) = ri=1 pvi i
Q
v(x) =
0, if p - (x).
It follows from Lemma 15.1.6 that v is a discrete valuation. Proposition 15.1.1 shows that
DVRs and discrete valuations are in bijection. Finally, if s : A → K is the canonical
embedding and R ⊂ K is a DVR with maximal ideal mR , then s−1 (mR ) is a nonzero prime
of A.
Definition. Let A be a Dedekind domain with field of fractionsS K and suppose S ⊆ Spec A
contains all but finitely many prime ideals of A. Set U = p∈S p and define the “semi-
localization” n o
AS = U −1 A = fg ∈ K : f, g ∈ A, g 6∈ p for any p ∈ S .
198
15.1. Discrete Valuation Rings Chapter 15. Local Fields
Proposition 15.1.9. Let A be a Dedekind domain and S ⊆ Spec A a set of primes excluding
only finitely many of the primes of A. Then there is an exact sequence
M
1 → A× → A× S → K × /A×p → CA → CAS → 1.
p6∈S
The map CA → CAS is given by [I] 7→ [IAS ], and the middle map comes from
M
Z −→ CA
p6∈S
" #
Y
(ap )p6∈S 7−→ pap .
p6∈S
199
15.1. Discrete Valuation Rings Chapter 15. Local Fields
contradicting IAS =L AS . Therefore none of the p in the factorization of IAS lie in S, so IAS
lies in the image of p6∈S Z. Hence the entire sequence is exact.
Corollary 15.1.10. Let A be a Dedekind domain and S ⊆ Spec A a set of primes excluding
finitely many primes of A. Then if the class number |CA | is finite, so is |CAS |.
×
Definition. For a number field K and a cofinite set of primes S of OK , OK,S is called the
group of S-units of K and CK,S = COK,S the S-class group of K.
Corollary 15.1.12. For any number field K, there exists a cofinite set S of prime ideals of
OK such that CK,S = 1.
200
15.2. The p-adic Numbers Chapter 15. Local Fields
Observe that the coefficients ai are related to derivatives f (i) (a), as in Taylor’s theorem. If
g(x)
instead we have a rational function f (x) = h(x) ∈ K[x](x−a) for g, h ∈ K[x] where h(a) 6= 0,
then we can still write a formal power series expansion of f (x) about x = a:
∞
f (x) X
≈ ai (x − a)i for ai ∈ K.
g(x) i=0
This is the beginning of a fruitful dictionary between the integers Z and polynomial rings
over a field:
Z K[x]
prime ideal (p) maximal ideal (x − a)
reduction of a mod p evaluation f (a)
reduction of a mod pn+1 nth derivative f (n) (a)
If x ∈ Z(p) , the
Plocalization at (p) (see Example 15.1.2), then we would like to write a formal
power series ∞ a
i=0 i p i
with ai ∈ {0, 1, . . . , p − 1} that represents x.
Example 15.2.1. Take p = 5 and x = 233. Then the 5-adic expansion gives a “power
series” for 233:
233 = 3 · 1 + 1 · 5 + 4 · 52 + 1 · 53 + 0 · 54 + . . .
Notice that every p-adic integer has a well-defined residue class modulo pn for each n ≥ 0.
On the other hand, every element ofPthe local ring Z(p) has a well-defined residue class mod
pn . For x ∈ Z(p) , we will write x = ∞ i
i=0 ai p if both of these objects have the same residue
mod pn for all n ≥ 0. In other words, we Phave a map Z(p) → Zp . To see that the map is
∞
injective, suppose x, y ∈ Z(p) with x = i=0 ai pi = y. Then x − y ≡ 0 (mod pn ) for all
n ≥ 0, so we must have x = y.
201
15.2. The p-adic Numbers Chapter 15. Local Fields
Example 15.2.2. Beware that these “p-power series” expansions do not always behave as
they do in the analytic case. For example, take x = −1. Then for each n ≥ 0,
n−1
X
(p − 1)pi = pn − 1 ≡ −1 (mod pn ).
i=0
P∞
Thus −1 has p-adic expansion i=0 (p − 1)pi for any prime p. When p = 2, this gives the
famous “identity”
−1 = 1 + 2 + 4 + 8 + 16 + . . .
In ordinary integers, such a sum does not converge, but in 2-adic land it does! Alternatively,
the power series
1
= 1 + x + x2 + x3 + . . .
1−x
does not converge for x = 2, but it does converge in 2-adic numbers! In general, the above
shows that
1
= 1 + p + p2 + p3 + . . .
1−p
is valid in Zp .
In the polynomial ring case, we have strict containments of rings K[x] ( K[x](x−a) (
K[[x − a]]. Similarly, we have containments of sets Z ( Z(p) ( Zp for any prime p. Our next
goal is to give Zp the structure of a ring.
Informally, we can think of a p-adic integer as a sequence of residue classes in Z/pZ, Z/p2 Z, Z/p3 Z, . . .
which are compatible with the sequence of homomorphisms
λ λ λ
3
· · · −→ Z/p3 Z −→
2
Z/p2 Z −→
1
Z/pZ.
(In commutative algebra, this system of abelian groups and homomorphisms is called an
inverse system and such a sequence of residue classes is called a coherent sequence.) Then
we can view Zp as a (proper) subset of Z/pZ × Z/p2 Z × Z/p3 Z × · · · :
Zp = {x = (xi )∞ i
i=1 | xi ∈ Z/p Z and λi (xi+1 ) = xi for all i ∈ N}.
Lemma 15.2.3. If x = (xi ) and y = (yi ) are coherent sequences of residue classes in
i
(Z/p
Q∞ Z)i∈N then so are x + y = (xi + yi ) and xy = (xi yi ). That is, Zp is a subring of
i
i=1 Z/p Z.
Further, Zp is the completion of the DVR Z(p) with respect to a certain metric topology
called the p-adic topology, which we will discuss further in Section 15.3. One important fact
is that Z(p) is a dense subring of Zp .
202
15.2. The p-adic Numbers Chapter 15. Local Fields
Definition. The field of fractions of Zp is called the field of p-adic numbers, written Qp .
By definition any element of Qp can be written as p−m x for some x ∈ Zp and m ≥ 0:
m m m
X 1 X
m−i −m −m
X
bi = b i p p = p bi pm−i .
i=0
pi i=1 i=0
Q Qp
Z(p) Zp
where the product is over all primes p plus the “infinite prime” p = ∞.
203
15.2. The p-adic Numbers Chapter 15. Local Fields
Proof. Since norms are multiplicative, it’s enough to check the product formula when x is
prime and x = −1. When x = −1, | − 1|p = 1 for all primes p and | − 1|∞ = 1 so the product
formula holds trivially. If x = q is prime, we have
q, p = ∞
|q|p = 1q , p = q
1, p 6= q, ∞.
Then every point b ∈ Bp (a, r) is in fact the center of the ball. The same holds for any closed
ball B p (a, r).
Proof. Suppose c ∈ Bp (a, r) is any other point in the ball, so that |a − c|p < r. Since
b ∈ Bp (a, r), we have
Hence c ∈ Bp (b, r), so Bp (a, r) ⊆ Bp (b, r). Reversing the roles of a and b gives Bp (a, r) =
Bp (b, r).
It is not hard to show Q is not complete with respect to | · |p for any prime p, and we
know from real analysis that | · |∞ does not define a complete topology on Q either. Thus we
can complete Q with respect to any of these topologies by constructing the ring of Cauchy
sequences and taking the quotient by the ideal of sequences whose limit is 0.
Lemma 15.2.9. The completion of Q with respect to any valuation | · |p , for p prime or
p = ∞, is a topological field. Moreover, this completion is precisely Qp if p is prime and R
if p = ∞. Finally, when p is prime, Zp = {x ∈ Qp : |x|p ≤ 1}.
Proof. (Sketch) The p = ∞ case is dealt with in a basic
P∞real analysis course, so assume p is
i
a finite prime. We may identify any p-adic number i=−m ai p with the Cauchy sequence
(sn ) defined by
Xn
sn = ai pi ∈ Q.
i=−m
On the other hand, for any n, any Cauchy sequence is eventually constant mod pn . Thus we
may associate such a sequence (sn ) to a sum
n−1
X
ai p i
i=−m
204
15.2. The p-adic Numbers Chapter 15. Local Fields
P∞
for each n ∈ N. Given this identification, we can treat i=−m ai pi as a convergent power
series in Qp . We know that
X∞
ai p i = p m
i=−m p
ϕ : Z[[x]] −→ Zp
X∞ ∞
X
ai xi 7−→ ai p i ,
i=0 i=0
where the power series on the right is treated as a convergent power series per previous
remarks. Clearly ϕ is surjective by the definition of Zp . Moreover,
P∞ iti is a ring homomorphism
by construction and (x−p) ⊆ ker ϕ. If y ∈ ker ϕ, then y = i=0 ai x such that ni=0 ai pi ≡ 0
P
1
(mod pn+1 ) for all n ≥ 0. For each n, let bn = − pn+1 (a0 + a1 p + . . . + an pn ). Then
so y ∈ (x − p) and hence ker ϕ = (x − p). Now apply the first isomorphism theorem.
205
15.3. Absolute Values Chapter 15. Local Fields
206
15.3. Absolute Values Chapter 15. Local Fields
√
So |x + y| ≤ n n + 1|x|. Taking n → ∞, (n + 1)1/n approaches 1 so we get |x + y| ≤ |x|.
Hence | · | is nonarchimedean.
207
15.3. Absolute Values Chapter 15. Local Fields
s
This shows that | · | = where s is the unique positive number satisfying |p| = p1 . Thus
| · |sp
all nonarchimedean absolute values on Q are equivalent to a p-adic absolute value. (We call
the absolute value with s = 1 above the normalized p-adic absolute value, as in Section 15.2.)
Now assume |·| is archimedean. Suppose that for all m, n ∈ Z with m, n > 1, the absolute
value satisfies the following property: |m|1/ log m = |n|1/ log n (∗). Then for s > 0 such that
es = |n|1/ log n (for any n > 1), we have
log m
|m| = |n|1/ log n = es log m = ms = |m|s .
Therefore |m| = |m|s∞ and this holds for all m ∈ Q by multiplicativity. Thus it suffices to
check that any archimedean absolute value satisfies property (∗).
Fix m, n ∈ Z with m, n > 1 and write m in base n:
m = a0 + a1 n + . . . + ar n r for 0 ≤ ai < n.
log m
Note that r ≤ log n
. Then
|m| = |a0 + a1 n + . . . + ar nr |
X r
≤ |ai | |n|i by the triangle inequality
i=0
log m
≤ 1+ |n| · |n|log m/ log n
log n
log m
= 1+ |n|1+log m/ log n .
log n
Letting k → ∞, we then obtain |m| ≤ |n|log m/ log n , or |m|1 log m ≤ |n|1/ log n . Reversing the
roles of m and n gives the other inequality, establishing property (∗) and completing the
proof.
The following theorem may be seen as a certain generalization of the Chinese remainder
theorem.
Proof. For n = 1 this is trivial, so assume n ≥ 2. Since | · |1 and | · |n are not equivalent,
we know there exists α ∈ K such that |α|1 < 1 but |α|n ≥ 1. Likewise, there exists β ∈ K
208
15.3. Absolute Values Chapter 15. Local Fields
such that |β|1 ≥ 1 and |β|n < 1. Let y = αβ so that |y|1 > 1 and |y|n < 1. We will show that
there exists some z ∈ K such that |z|1 > 1 but |z|j < 1 for all 2 ≤ j ≤ n. The base case
of this statement was just proven, so to induct, pick z ∈ K such that |z|1 > 1 and |z|j < 1
for 2 ≤ j ≤ n − 1. If |z|n < 1 then we are done. If |z|n = 1 then z m y will work for some
zm
sufficiently large m. Finally, if |z|n > 1 then let tm = 1+z m so that as m → ∞, |tm |1 → 1,
|tm |n → 1 and |tm |j → 0 for all 2 ≤ j ≤ n − 1. Then tm y will work for sufficiently large m.
Now given z ∈ K such that |z|1 > 1 and |z|j < 1 for 2 ≤ j ≤ n, consider the same
zm
sequence tm = 1+z m . As m → ∞, we have
m
z 1
|tm |1 = m
= 1 −
m
−→ 1
1+z 1 1 + z 1
m
z
|tm |j = ≤ |z|m
j −→ 0 for all 2 ≤ j ≤ n.
m
1 + z j
Therefore one can find z1 such that |z1 − 1|1 < ε and |z1 |j < ε for 2 ≤ j ≤ n. Repeat
the process to pick z2 , . . . , zn with |zj − 1|j < ε and |zj |` < ε for ` 6= j. Then setting
x = a1 z1 + . . . an zn gives an element satisfying the desired norm conditions.
There exists a generalization, naturally called the strong approximation theorem, which
we will prove in Chapter 16.
Proposition 15.3.11. The only fields that are complete with respect to an archimedean
absolute value are (R, | · |∞ ) and (C, | · |∞ ).
We now connect the theory of nonarchimedean absolute values with discrete valuations
on K (Section 15.1).
Proof. For all x, y ∈ K, we have |xy| = |x| |y| which implies v(xy) = v(x) + v(y). Likewise,
|x + y| = max{|x|, |y|} implies v(x + y) ≥ min{v(x), v(y)}.
called respectively the valuation ring, group of units, valuation ideal and residue field
of | · |.
209
15.3. Absolute Values Chapter 15. Local Fields
Example 15.3.13. The analogy between p-adic numbers and power series is borne out by
these concepts:
(K, | · |) (Qp , | · |p ) (C((t)), | · |t
O Zp C[[t]]
O× Zp ×
C[[t]]×
m pZp (t)
κ Fp C
Definition. If K is a field with a nonarchimedean absolute value and associated discrete
valuation, we will call the triple (K, | · |, v) a discretely valued field.
O ⊃ m ⊇ m2 ⊇ m3 ⊇ · · · (of ideals)
O× ⊇ U (1) ⊇ U (2) ⊇ U (3) ⊇ · · · (of subgroups)
(1) O× /U (n) ∼
= (O/mn )× .
(2) U (n) /U (n+1) ∼
= O/m = κ.
Proof. (1) It is clear that the natural map O× → (O/mn )× is surjective with kernel U (n) .
(2) Pick a generator π of m. Then the map
U (n) −→ O/m
1 + π n a 7−→ a mod m
O b × : |x| ≤ 1} ∪ {0}
b = {x ∈ K
b × : |x| < 1}.
b = {x ∈ K
m
210
15.3. Absolute Values Chapter 15. Local Fields
π m (a0 + a1 π + a2 π 2 + . . .)
Proposition 15.3.18. For any discretely valued field (K, | · |, v), the completions of the
valuation ring and group of units are inverse limits:
b = lim O/mn
O
←−
b× = lim(O/mn )× = lim O× /U (n) .
O
←− ←−
For the rest of the section, assume K is a field which is complete with respect to a
discrete, nonarchimedean absolute value | · |.
Theorem 15.3.19 (Hensel’s Lemma). Suppose f (x) ∈ O[x] is a monic polynomial of degree
n and f¯(x) ∈ κ[x] admits a factorization
f¯(x) = ḡ(x)h̄(x)
for ḡ, h̄ relatively prime, monic polynomials over κ of degrees r and n − r, respectively. Then
f (x) = g(x)h(x)
for g(x), h(x) ∈ O[x] with deg g = r, deg h = n − r, ḡ(x) = g(x) mod m and h̄(x) = h(x)
mod m.
Proof. The idea is to find gk , hk ∈ O[x] inductively such that gk hk − f ∈ mk for all k ∈ N,
satisfying the conditions deg gk = r, deg hk = n − r, ḡ ≡ gk mod m and h̄ ≡ hk mod m.
For k = 1, let g1 and h1 be any monic lifts of ḡ, h̄ to O[x] with the correct degrees. To
induct, assume gk , hk have been constructed. By hypothesis, (ḡ) + (h̄) = (1) in κ[x] so for
all q̄ ∈ κ[x], there exist ā, b̄ ∈ κ[x] such that āḡ + b̄h̄ = q̄. If deg q̄ < n, then we can take
deg ā < n − r and deg b̄ < r. Let m = (π) and write gk hk − f = qπ k for some q ∈ O[x]
with deg q < n. Now let ā, b̄ ∈ κ[x] be as above for q̄, the reduction of this q mod m. Let
a, b ∈ O[x] be lifts of ā, b̄ with the same degrees and set
211
15.3. Absolute Values Chapter 15. Local Fields
Then we have
212
15.3. Absolute Values Chapter 15. Local Fields
Proof. Scale f so that it is primitive in O[x]. Let r be the minimal integer such that ar ∈ O× .
Then
f¯(x) ≡ xr (ar + . . . + xn−r ) mod m.
If 0 < r < n, this contradicts Theorem 15.3.23 and the irreducibility of f . If r = 0, then a0
is a unit after scaling, or in other words, no scaling took place. Likewise, if r = n, no scaling
took place. In all cases, f must be primitive to begin with, so all coefficients lie in O.
213
15.4. Local Fields Chapter 15. Local Fields
Example 15.4.1. For any prime integer p, the p-adic field Qp and the field of Laurent series
Fp ((t)) are both local fields.
Remark. Elsewhere in the literature, it is sometimes required that a discretely valued field
has a perfect residue field to be local. Other times, the residue field is allowed to be arbitrary.
Many times R and C are included in the definition of local field, as they bear similarities to
the prototypical examples of local fields Qp and Fp ((t)).
Lemma 15.4.2. A field K is a local field if and only if K admits a discrete, nonarchimedean
valuation with respect to which K is locally compact.
Proof. ( =⇒ ) Since K is a topological field, it’s enough to show that K has a compact open
neighborhood of 0. Notice that OK is an open neighborhood of 0. If mK is the maximal
ideal of OK , then Proposition 15.3.18 gives us
OK ∼
= lim OK /mn
←−
K = OK ∪ m−1 −2
K OK ∪ mK OK ∪ · · ·
Theorem 15.4.3. Every local field is a finite extension of Qp or Fp ((t)) for some prime
integer p.
214
15.4. Local Fields Chapter 15. Local Fields
Let K be a local field with residue field κ. Then char κ = p > 0 for some prime p. When
char K = 0, we call this the mixed characteristic case, whereas char K = p is called the equal
characteristic case.
Corollary 15.4.4. The only locally compact fields are R, C and finite extensions of Qp and
Fp ((t)) for p prime.
Let K be a local field, with OK , mK , πK , κ and v as usual and set q = |OK /mK | = |κ|.
We now describe the group structure of K × .
K× ∼
= Z × Z/(q − 1)Z × U (1)
(1) log(p) = 0.
x2 x3
(2) For all 1 + x ∈ U (1) , log(1 + x) = x − + − ...
2 3
2 3
Proof. If v(x) > 0 then the infinite sum x − x2 + x3 − . . . converges so this power series
is well-defined on U (1) . Note also that if such a log function is defined, it must necessarily
satisfy log(ω) = 0 for any root of unity ω, since
215
15.4. Local Fields Chapter 15. Local Fields
This defines the formal logarithm on K. Next, define the exponential function
x2 x3
exp(x) = 1 + x + + + ...
2! 3!
e
Lemma 15.4.7. exp(x) converges on mnK whenever n > p−1
, where e = eK/Qp = v(p).
22 23
Example 15.4.8. In K = Q2 , exp(2) = 1 + 2 + 2!
+ 2!
+ . . . does not converge. This is
1
reflected by the fact that v(2) = 1 6> 2−1 = 1.
Lemma 15.4.10. The functions exp and log are continuous on their domains.
e
Theorem 15.4.11. When n > p−1 , exp : mn → U (n) and log : U (n) → mn are inverse
isomorphisms of topological groups.
Now K × ∼= Z × Z/(q − 1)Z × U (1) (Proposition 15.4.6) and one can show that U (1) is a
Zp -module via the action x · u = ux for all u ∈ U (1) and x ∈ Zp . One also computes the
torsion part of U (1) to be U (1) ∩ µ∞ , where µ∞ is the set of all roots of unity in K. For any
n ≥ 1, the rank of the Zp -submodule U (n) is rankZp U (n) = rankZp mn = rankZp OK . Putting
everything together, we get:
216
15.5. Henselian Fields Chapter 15. Local Fields
Example 15.5.1. By Hensel’s Lemma, complete, discretely valued fields are Henselian.
K h = {α ∈ K
b | α is separable over K}.
Proof. Factoring a monic polynomial f (x) ∈ K[x] can be done over the algebraic closure K
of K if it can be done over any extension of K. Thus Hensel’s Lemma holds for K ∩ K b =
sep h
K ∩K =K .
b
Theorem 15.5.3. If (K, | · |) is a Henselian field and L/K is an algebraic extension, then
there is a unique absolute value | · |L on L extending | · |. Further, if L/K is finite of degree
n then q
|x|L = n |NL/K (x)|
and L is complete with respect to | · |L if K is complete with respect to | · |.
p
Proof. (Sketch) Let |x|L = n0 |NL0 /K (x)| for some finite extension L0 /K containng x, where
n0 = [L0 : K]. One can show that |x|L is independent of the choice of L0 , so it’s enough
to prove the theorem when L/K itself is finite. We now demonstrate that | · |L is a nonar-
chimedean absolute value on L.
For any x, y ∈ L, |xy|L = |x|L |y|L follows from multiplicativity of the norm (Lemma 14.2.1).
Moreover, |x|L = 0 if and only if NL/K (x) = 0 if and only if x = 0. Finally, for α, β ∈ L with
|α| ≤ |β|, we have
α
+ 1 ≤ max α , 1 = 1 if and only if |x| ≤ 1 implies |x + 1| ≤ 1 for all x ∈ L.
β β
217
15.5. Henselian Fields Chapter 15. Local Fields
Thus it’s enough to show that OL = {x ∈ L : |x|L ≤ 1} is a ring and is the integral closure
of O in L. For x ∈ L, we have that
Example 15.5.4. Theorem 15.5.3 need not hold if K is not Henselian. For instance, K = Q
with the 5-adic absolute value | · | = | · |5 is not Henselian. If L = Q(i) then one can define
two distinct absolute values on L:
a a
|x|1 = 5−m if x = (1 + 2i)m and |x|2 = 5−m if x = (1 − 2i)m .
b b
Both of these extend | · |5 to L, but they are clearly inequivalent.
The converse of Theorem 15.5.3 is true, that is, the property of unique extension of
absolute values characterizes Henselian fields.
Theorem 15.5.5. Suppose (K, |·|, v) is a nonarchimedean field such that |·| extends uniquely
to any algebraic extension L/K. Then K is Henselian.
Proof. We will prove that K satisfies the first version of Hensel’s Lemma (Theorem 15.3.19)
for monic polynomials. Let f ∈ O[x] be monic with nonzero constant term, i.e. f (x) =
a0 + a1 x + . . . + xn . (If a0 = 0, we may divide out by x and apply the proof to the remaining
factor.) First, if f is irreducible, let L/K be a splitting field of f . By hypothesis, | · | extends
uniquely to L so OL , mL , πL and λ := OL /mL are all defined for this field. Observe that any
σ ∈ Gal(L/K) preserve | · |, since otherwise |x|0 = |σ(x)| is a distinct absolute value on L
218
15.5. Henselian Fields Chapter 15. Local Fields
for some µ. Since |a0 | ≤ 1, we must also have |α| ≤ 1, so α ∈ OL . Thus α has an image ᾱ
in λ = OL /mL . Since each σ(α) lies in OL and as σ ranges over Gal(L/K) these constitute
all roots of f , all roots of f¯ in λ must be of the form σ̄(ᾱ) where σ ∈ Gal(L/K) and σ̄ is the
automorphism in Gal(λ/κ) induced by σ (as in Proposition 14.5.18). Then all roots of f¯ in
λ are Galois conjugate in λ/κ. The only possibility is that f¯(x) = ϕ(x)m for some m ∈ N
and some irreducible polynomial ϕ ∈ κ[x]. (In fact, it’s not too hard to see that ϕ must be
equal to the minimal polynomial of ᾱ over κ.)
Now let f ∈ O[x] be monic but not necessarily irreducible. Write f = f1 · · · fr for monic,
irreducible polynomials fj ∈ O[x]. Then f¯ = f¯1 · · · f¯r in κ[x] so by the irreducible case above,
each f¯j is a power of an irreducible polynomial. If f¯ = ḡ h̄ is a coprime, monic factorization
in κ[x], then Y Y
ḡ = f¯j and h̄ = f¯j
j∈J j6∈J
Q Q
for some subset J ⊆ {1, . . . , r}. Letting g = j∈J fj and h = j6∈J fj , we get that f = gh
in O. So K is Henselian.
Corollary 15.5.7. Let (K, | · |) be a complete nonarchimedean field and L/K an algebraic
extension. Then
p there is a unique absolute value | · |L on L which extends | · | and is of the
form |x|L = |NL/K (x)| if L/K is finite of degree [L : K] = n. Moreover, L is complete
n
219
15.6. Ramification Theory Chapter 15. Local Fields
w : L× −→ R
α 7−→ v(NL/K (α)).
Definition. For a Henselian field (K, | · |, v) and an algebraic extension (L, | · |L , w), the
ramification index is e = eL/K = [w(L× ) : v(K × )] and the inertial degree is f = fL/K =
[λ : κ].
so (πLe ) = (πK ) in OL , i.e. meL = mK OL . In particular, this is consistent with the ramification
theory in the global case (à la Section 14.5; after all, a DVR is a Dedekind domain). In fact,
in the local case, it turns out that ramification behavior is much nicer: a prime only ramifies
or remains inert, never splits.
Proposition 15.6.1. Let K be Henselian, L/K a finite extension and e = eL/K and f =
fL/K the ramification index and inertial degree, respectively. Then [L : K] ≥ ef with equality
if and only if v is a discrete valuation and L/K is separable.
Proof. Pick elements ω1 , . . . , ωf ∈ OL which reduce modulo mK to a basis of λ/κ. Also pick
π0 , π1 , . . . , πe−1 ∈ L× such that w(π0 ), w(π1 ), . . . , w(πe−1 ) are representatives of w(L× )/v(K × ).
It then suffices to prove the products ωi πj are linearly independent over K. Suppose
P
i,j aij ωi πj = 0 where aij ∈ K are not all 0. Collecting the terms of minimal valuation
in this sum, it will be enough to show that the sum of these lowest-valuation terms has the
same valuation as each individually. Observe that all these terms must share the same index
j, because
w(aij ωi πj ) = w(aij ) + w(πj ) ≡ w(πj ) mod w(K × ),
so different j correspond to different valuations. Fix this j and consider
X
aij ωj πj
i∈I
where I ⊆ {1, . . . , f } corresponds to the subset of terms of minimal valuation. Then w(aij )
is constant over i ∈ I, say w(aij ) = a, so aij = εbij for some ε ∈ K × and bij satisfying
w(bij ) = 0. Thus X
επj bij ωj 6≡ 0 mod mL
i∈I
220
15.6. Ramification Theory Chapter 15. Local Fields
OL = N + πL OL
= N + πL (N + πL OL )
= N + πL (N + πL (N + πL OL ))
= N + πL N + πL2 N + . . . + πLe−1 N + πLe OL after e expansions
= M + πLe OL = M + πK OL .
Now OK is a local ring (it’s a DVR) and since L/K is separable, OL is a finitely generated
OK -module. Therefore by Nakayama’s Lemma, OL = M . Hence [L : K] = ef .
Remark. For complete fields with discrete valuations, the ‘fundamental equality’ in Propo-
sition 15.6.1 holds even without the separable assumption.
Let K be a Henselian field with OK , mK , κ and v as usual, and let L/K be an algebraic
extension with extensions OL , mL , λ and w of the objects for the corresponding objects for
K.
Definition. We say a finite extension L/K is unramified if fL/K = [L : K] and λ/κ is
separable. If L/K is infinite, we say the extension is unramified if it is the union of finite
unramified extensions. In all other cases L/K is ramified.
Notice that for a finite extension, fL/K = [L : K] implies eL/K = 1.
Proposition 15.6.2. Suppose L/K is an unramified extension, K 0 /K is an algebraic ex-
tension and L0 = LK 0 is the compositum inside a fixed algebraic closure K/K. Then L0 /K 0
is an unramified extension.
L L0
ur
K K0
alg.
221
15.6. Ramification Theory Chapter 15. Local Fields
Proof. We may assume L/K and K 0 /K are finite. By hypothesis, λ/κ is separable so
λ = κ(ᾱ) for some ᾱ ∈ λ by the primitive element theorem. Lift ᾱ to some α ∈ L. Then
implies deg(α) = [L : K], so L = K(α). This means L0 = K 0 (α). Let g be the minimal
polynomial of α over K 0 and f be the minimal polynomial of α over K. Since f¯ is separable
and g divides f , ḡ is also separable. If ḡ were reducible, g would be reducible by Hensel’s
Lemma (Theorem 15.3.19), but this is impossible since g is a minimal polynomial. Thus ḡ
is irreducible over κ0 = OK 0 /mK 0 and separable. If λ0 is the residue field of L0 , then
On the other hand, Proposition 15.6.1 gives us [λ0 : κ0 ] ≤ [L0 : K 0 ] so we have equality.
Further, λ0 is the splitting field over κ0 of ḡ, so λ0 /κ0 is separable and hence L0 /K 0 is unram-
ified.
LL0
L ur L0
ur ur
K
Proof. Assume all extensions are finite. By Proposition 15.6.2, LL0 /L and LL0 /L0 are un-
ramified. Further, towers of separable extensions are separable and f is multiplicative in
towers (Lemma 14.5.16), so it follows that
Proof. By Corollary 15.6.3, we may take T to be the compositum inside an algebraic closure
K/K of all unramified extensions L/K.
222
15.6. Ramification Theory Chapter 15. Local Fields
Lemma 15.6.5. For an algebraic extension L/K with maximal unramified subextension
K ⊆ T ⊆ L, the residue field τ of T is equal to the separable closure of κ in λ.
Proof. Let κsep be the separable closure of κ in λ and let τ be the residue field of T . Clearly
τ ⊆ κsep ∩ λ. On the other hand, given ᾱ ∈ κsep ∩ λ with minimal polynomial f¯ over
κ, we know f¯ is separable. Lift f¯ to a monic polynomial f in L[x]. By Hensel’s Lemma
(Theorem 15.3.19), f has a root α ∈ L lifting ᾱ. Then K(α)/K is unramified since
If K is any discretely valued field of characteristic 0 with perfect residue field κ of char-
acteristic 0, then saying L/K is tamely ramified is equivalent to saying p - eL/K .
Proof. Suppose α ∈ L r K. Let m = deg(α) and note that p - m because L/K is tame. Set
β = α − m1 TrL/K (α). Then
1
Tr(β) = Tr(α) − m Tr(α) = 0.
m
Since eL/K = 1, there exists b ∈ K × with v(b) = w(β). Set ε = β/b. Thus Tr(ε) = 0 = w(ε).
Further, fL/K = 1 implies TrL/K (ε) = mε̄ because all conjugates of ε in a normal closure of
L/K have the same image in λ = κ. But Tr(ε) = 0 implies mε̄ = 0, but this contradicts
w(ε) = 0. Hence L = K as claimed.
We have the following characterization of tame extensions (tamely ramified extensions)
of a Henselian field.
Theorem 15.6.8. Suppose L/K is a finite extension, with maximal unramified subfield T .
Then L/K is tame if and only if L/T is generated by prime-to-p roots of elements of T .
Proof. (Sketch) By definition of T , L/K is tamely ramified if and only if L/T is tamely
ramified so we may assume K = T .
( ⇒=√ ) Adjoining one prime-to-p root at a time and applying induction, we may assume
L = K( m a) for a ∈ K and p - m. If m - v(a) in v(K × ), then eL/K = m so [L : K] = m.
Since p - m, this means fL/K = 1 so L/K is tame. On the other hand, if m | v(a) then we
can multiply a by
√ an mth power of an element of K to get v(a) = 0. Then ā is an mth power
in κ, or else κ( ā) is an inseparable extension of κ, contradicting K = T . But ā ∈ (κ× )m
m
223
15.6. Ramification Theory Chapter 15. Local Fields
√
implies a ∈ (κ× )m by Hensel’s Lemma (Corollary 15.3.20). Hence L = K( m a) = K, so in
all cases L/K is tame.
( =⇒ ) Suppose L/K is tame and set n = [L : K]. Then p - n. Since for any α ∈ L,
w(α) = n1 v(NL/K (α)) by Theorem 15.5.3, we have p - [w(L× ) : v(K × )] = eL/K . Pick γ ∈ L
such that w(γ) 6∈ v(K × ). (If w(L× ) = v(K × ), skip this step.) Let m be the order of w(γ
in w(L× )/v(K × ). Then p - m so we can write γ m = cε for c ∈ K and ε ∈ L such that
w(ε) = 0. Since λ = κ, we can assume ε̄ = 1 in λ. By Hensel’s Lemma (Theorem 15.3.19),
m
ε is then an mth power in L; √write ε = (ε0 )m for ε0 ∈ L. Hence εγ0 = c ∈ K × . Now
replace K with K εγ0 = K( m c) and repeat the procedure until w(L× ) = v(K × ). This
Corollary 15.6.9. The fundamental equality [L : K] = ef holds for all finite tame extensions
L/K.
Corollary 15.6.10. Given a tame extension L/K and algebraic extension K 0 /K and their
compositum L0 = LK 0 ⊆ K, L0 /K is also tame.
L L0
tame
K K0
alg.
T0
T T K0
ur ur
K K0
alg.
224
15.6. Ramification Theory Chapter 15. Local Fields
Corollary 15.6.11. Let L, L0 be two tamely ramified, algebraic extensions of K. Then their
compositum LL0 ⊆ K is tamely ramified.
Corollary 15.6.12. If L/K is an algebraic extension, there exists a maximal tamely ramified
subfield K ⊆ V ⊆ L.
Definition. The maximal tame extension of a Henselian field K is the maximal tamely
ramified extension of K/K, denoted K tame .
In analogy with the decomposition/inertia field tower in the global case (Proposition 14.5.20),
we have the following tower of Henselian fields, along with corresponding residue fields and
value groups.
L λ w(L× )
T τ = κsep ∩ λ w(T × )
K κ v(K × )
Definition. Let L/K be an algebraic extension of Henselian fields with maximal unramified
and maximal tame extensions K ⊆ T ⊆ V ⊆ L. We say L/K is totally ramified if T = K
and wildly ramified if V 6= L.
Remark. When L/K is a finite extension, we can write eL/K = pa e for some p - e, which is
in fact the ramification indices of V /K and V /T : eV /K = e = eV /T . Therefore [V : T ] = e.
Example 15.6.13. Let K be a local field and consider the cyclotomic extension K(ζn )/K
for ζn a primitive nth root of unity. By Theorem 15.4.3, K is a finite extension of either
Qp or Fp ((t)) for some prime p. Suppose that p - n; set κ = Fq where p | q. If f = ordn q,
i.e. q f ≡ 1 (mod n), then we will show K(ζn )/K is uramified of degree f . Note that Fqf /Fq
is the smallest extension of Fq containing an nth root of unity. Let g(x) be the minimal
polynomial of ζn over K. Then g is separable and ḡ is irreducible in Fq [x] – if not, g has
multiple roots, but all nth roots of unity have distinct reductions in Fqf , so this is impossible.
Thus deg ḡ = f so deg g = f and hence K(ζn )/K is unramified of degree f .
225
15.6. Ramification Theory Chapter 15. Local Fields
Proof. Let L = K(ζn ). Then OL = OK [ζn ] + mL OL but since OL and OK are local rings,
Nakayama’s Lemma implies OL = OK [ζn ].
(Compare this to the global case in Corollary 14.3.15.)
Now suppose p | n. To simplify things, we will assume now that K = Qp and n = pm for
some m ≥ 1.
Lemma 15.6.15. The extension Qp (ζn )/Qp is totally ramified, with Gal(Qp (ζn )/Qp ) ∼
=
m ×
(Z/p Z) , OQp (ζn ) = Zp [ζn ] and mQp (ζn ) = (1 − ζn ), where |N (1 − ζn )| = p.
Proof. Let
m
(x + 1)n − 1 (x + 1)p − 1
h(x) = =
(x + 1)n/p − 1 (x + 1)pm−1 − 1
m−1 m−1
= 1 + (x + 1)p + . . . + (x + 1)(p−1)p
This implies Gal(Qp (ζn )/Qp ) ,→ (Z/pm Z)× but both groups have order ϕ(pm ) = (p−1)pm−1 ,
so the map is an isomorphism. Next, 1−ζn is a prime element of Qp (ζn ), so it is a uniformizer.
Moreover, Y
N (1 − ζn ) = (1 − σ(ζn )) = h(1) = ±p.
σ∈(Z/pm Z)×
1 1 1 1
w(1 − ζn ) = v(N (1 − ζn )) = · v(p) = = .
ϕ(n) ϕ(n) ϕ(n) [Qp (ζn ) : Qp ]
It follows that eQp (ζn )/Qp = [Qp (ζn ) : Qp ] so this extension is totally ramified.
For the general case, let n = pm n0 where p - n0 . Then we still have OQp (ζn ) = Zp [ζn ] by
Lemma 15.6.14, and the following tower gives the full ramification theory for Qp (ζn )/Qp :
226
15.6. Ramification Theory Chapter 15. Local Fields
L = Qp (ζn )
V = Qp (ζpn0 ) = T (ζp )
T = Qp (ζn0 )
K = Qp
227
15.7. Extensions of Valuations Chapter 15. Local Fields
Define w on L by |x|w = |τ (x)|v̄ for this fixed embedding τ . We will write w | v, read “w
extends v”. Now let Lw be the closure of τ (L) in K v with respect to the topology induced
by w. Abstractly, assuming L/K is finite, Lw = Lw , the completion of L with respect to | · |w
in K v . If L/K is infinite, then Lw is the union of the completions of all finite intermediate
extensions of L/K with respect to | · |w . Note that | · |w extends to Lw by restricting | · |v̄ to
Lw ⊆ K v .
Proof. Suppose L/K is finite. Then τ (L)Kv ⊆ Lw . On the other hand, Theorem 15.5.3
implies τ (L)Kv is complete with respect to | · |w and therefore Lw ⊆ τ (L)Kv . Generalizing
to the infinite case is straightforward.
From now on we will write Lw = LKv = τ (L)Kv . There is a diagram of field extensions
in K v
L Lw
K Kv
sometimes called the “local-to-global principle” for algebraic extensions. This terminology is
reflected in the example of a function field K = k(t): one may pass from extensions L/k(t) of
function fields to extensions Lw /k((t)) of fields of power series, that is, from global functions
to local functions.
Proof. Define Lw ⊆ K v as above. Then Lw /Kv is algebraic and w is the unique extension of
v on Kv to Lw . Thus for any embedding τ̄ : Lw ,→ K v , we must have v̄ ◦ τ̄ = w. Restricting
τ̄ to L defines an embedding τ : L ,→ K v satisfying v̄ ◦ τ = w.
228
15.7. Extensions of Valuations Chapter 15. Local Fields
Lemma 15.7.3. Two embeddings τ1 , τ2 : L ,→ K v give rise to the same absolute value on L
if and only if τ2 = σ ◦ τ1 for some σ ∈ Aut(L/K).
Theorem 15.7.4. For any absolutely valued field (K, | · |v , v), there is a one-to-one corre-
spondence
extensions of valuations Galois orbits of embeddings
←→ .
w | v to L L ,→ K v
Corollary 15.7.5. For a simple extension L = K(α) with minimal polynomial f ∈ K[x],
the embeddings L ,→ K v are in one-to-one correspondence with the irreducible factors of f .
so the valuation theory completely captures the ramification theory in Section 14.5.
229
15.7. Extensions of Valuations Chapter 15. Local Fields
More generally, suppose L/K is a finite extension of number fields and fix a prime ideal
p ⊂ OK with factorization
pOL = Pe11 · · · Perr
for distinct prime ideals Pi ⊂ OL and ei > 0. Let v be the p-adic valuation on K, i.e.
v(x) = n if and only if x ∈ pn r pn+1 . In this case, we get r different extensions of v to
L: v1 , . . . , vr , where vi = e1i vPi , the normalization of the Pi -adic valuation on L by the
ramification index ei . To see this, assume OL = OK [α] and p is unramified in OL (there
are only finitely many ramified primes anyway). Then each ei = 1, so we have the following
equivalences:
a ⊗ b 7−→ (ab)w
Proof. Write L = K(α) and let f be the minimal polynomial of α over K. Then f factors
over Kv as Y
f= fw
w|v
with no repeated factors since f is separable. For each w | v, view Lw inside K v and let αw
be the image of α in K v under an embedding corresponding to w. Then Lw = Kv (αw ) and
fw is the minimal polynomial of αw over Kv . This corresponds to the commutative diagram
Y
Kv [x]/f Kv [x]/fw
w|v
∼
= ∼
=
ϕ
Y
L ⊗K Kv Lw
w|v
where the top row is by the Chinese remainder theorem, the left isomorphism is x 7→ α ⊗ 1
and the right isomorphism is x 7→ (αw )w . Therefore ϕ is an isomorphism.
230
15.7. Extensions of Valuations Chapter 15. Local Fields
P
Therefore [L : K] = w|v e(w | v)f (w | v) as claimed.
Example 15.7.9. Let K = Q and let L be any number field. Then the archimedean
absolute value | · |∞ completes to the reals: Q∞ = R, and the corresponding base change
from Proposition 15.7.7 is
L ⊗Q R ∼
Y
= Lw
w|∞
231
15.8. Galois Theory of Valuations Chapter 15. Local Fields
Proposition 15.8.1. For L/K finite Galois, G acts transitively on the set of extensions of
| · |v to L.
Proof. If not, there exist disjoint G-orbits of absolute value extensions. Since all extensions
of | · |v agree on K, any nonequivalent extensions must be distinct. Thus there exists some
x ∈ L with |σ(x)|w < 1 but |σ(x)|w0 > 1 for some w, w0 from distinct G-orbits and for all
σ ∈ G, by the weak approximation theorem (15.3.9). Let
Y
α= σ(x).
σ∈G
Then α ∈ K but |α|v < 1 and |α|v > 1 simultaneously, a contradiction. Hence G acts
transitively.
Let L/K be a Galois extension, w | v an extension of valuations and set
Proof. We prove Gw ≤ G is closed and remark that the proofs for Iw and Rw are similar.
Let σ ∈ G be in the closure of Gw and let K ⊆ M ⊆ L such that M/K is finite Galois. Then
there exists σM ∈ Gw ∩ σ Gal(L/M ), so σM |M = σ|M . Further, σM ∈ Gw implies w ◦ σM = w
and so w ◦ σ|M = w ◦ σM |M = w, or σ ∈ Gw . Therefore Gw is closed in G.
Suppose L/K and K 0 /K are Galois extensions and set L0 = LK 0 ⊆ K:
232
15.8. Galois Theory of Valuations Chapter 15. Local Fields
τ
L L0
τ
K K0
τ ∗ : G0 −→ G
σ 7−→ τ ∗ (σ)(x) := τ −1 στ (x).
Therefore τ ∗ (σ 0 ) = σ ∈ Gw . The proof is similar for the maps on inertia and ramification
groups.
The most important case of this proposition is for the “local-to-global principle” of
Section 15.7, i.e. when K 0 = Kv is the completion of K at v and L0 = Lw = LKv by
Lemma 15.7.1.
τ
L Lw
K Kv
233
15.8. Galois Theory of Valuations Chapter 15. Local Fields
v wZ wT wV w
(2) Zw = L ∩ Kv .
1 → Iw → Gw → Gal(λ/κ) → 1
234
15.8. Galois Theory of Valuations Chapter 15. Local Fields
ψ : Iw −→ χ(L/K)
σ(x)
σ 7−→ δ 7→ mod PL,w .
x
|σ(x)|w
Note that σ(x) = = 1 so indeed δ ∈ χ(L/K). Also, if x0 = xau for |u|w = 1 and
x |x|w
w
a ∈ K, then
σ(xau) σ(x) σ(u) σ(x)
ψ(σ)(x0 ) = = · ≡ mod PL,w
xau x u x
since σ ∈ Iw . Thus the homomorphism ψ is well-defined. It is now clear that Rw = ker ψ by
the definition of the ramification group.
Proposition 15.8.7. Let char κ = p. If p > 0 then Rw is the unique Sylow p-subgroup of
Iw , and if p = 0, then Rw = 1.
235
15.8. Galois Theory of Valuations Chapter 15. Local Fields
This contradicts σ ∈ Rw , so every element in Rw has p-power order. Combined with the
first paragraph, this says that Rw is itself a Sylow p-subgroup and since it is the kernel of ψ
and thus normal, Rw is the unique one.
1 → Rw → Iw → χ(L/K) → 1.
236
15.9. Higher Ramification Groups Chapter 15. Local Fields
237
15.9. Higher Ramification Groups Chapter 15. Local Fields
σ(πL )
Proof. If σ ∈ Gs+1 then vL (σ(πL ) − πL ) ≥ s + 2 which implies vL πL
− 1 ≥ s + 1, i.e.
σ(πL ) (s+1)
πL
∈ UL . Therefore the map is well-defined. To see that it is a homomorphism, take
σ, τ ∈ Gs and consider:
στ (πL ) στ (πL ) τ (πL )
= ·
πL τ (πL ) πL
σ(uπL ) τ (πL )
= · for some u ∈ OL×
uπL πL
σ(u) σ(πL ) τ (πL )
= · cdot .
u πL πL
(s+1)
Since σ ∈ Gs , vL (σ(u) − u) ≥ s + 1, so vL σ(u) u
− 1 ≥ s + 1 and thus σ(u)
u
≡ 1 in UL .
στ (πL ) σ(πL ) τ (πL ) (s) (s+1)
Hence πL
= πL
· πL
in UL /UL .
σ(πL )
Finally, suppose σ ∈ Gs+1 . Then vL (σ(πL ) − πL ) = s + 1 so vL πL
− 1 = s and in
σ(πL ) (s) (s+1)
particular πL
6= 1 in UL /UL . Hence the map is injective.
Corollary 15.9.4. For any L/K with Galois group G,
(1) There is an embedding G0 /G1 ,→ λ× . In particular, G0 /G1 ∼
= µ` , the group of `th
roots of unity in λ, for some p - `.
(2) For each s ≥ 1, there is an embedding Gs /Gs+1 ,→ (λ, +). In particular, Gs /Gs+1 ∼
=
a
(Z/pZ) for some a.
Proof. Apply Proposition 15.3.14.
Example 15.9.5. The corollary implies G1 is the unique Sylow p-subgroup of G0 = I, so
by Proposition 15.8.7, G1 = R, the ramification group. This confirms Example 15.9.1.
Higher ramification groups give us an idea about the general shape of the Galois group
of an extension L/K.
Lemma 15.9.6. G0 is isomorphic to a semidirect product P o Z/mZ where P is a p-group
and m ∈ Z, p - m.
Proof. Apply the Schur-Zassenhaus theorem.
Corollary 15.9.7. G0 is solvable.
Corollary 15.9.8. If L/K is totally ramified and Galois, then Gal(L/K) is solvable.
Example 15.9.9. Consider the local function field K = Fp ((t)). Then any finite Galois
extension L/K is totally ramified and hence has solvable Galois group. In particular, the
inverse Galois problem does not hold for K.
Example 15.9.10. Let K = C((t)) be the global function field over k = C. Then one can
prove GK := Gal(C((t))/C((t))) ∼ = Z,b the profinite completion of the integers. Since C is
algebraically closed of characteristic zero, for any finite Galois extension L/C((t)) we get
G0 = G and G1 = {1}.
238
15.9. Higher Ramification Groups Chapter 15. Local Fields
G−1 ⊇ G0 ⊇ G1 ⊇ G2 ⊇ · · ·
and H−1 ⊇ H0 ⊇ H1 ⊇ H2 ⊇ · · ·
One can see that by the definitions of these higher ramification groups, for each s ≥ −1,
Hs = Gs ∩ H. On the other hand, if G0 = Gal(L0 /K) ∼ = G/H, it is not clear that the
filtrations
G−1 ⊇ G0 ⊇ G1 ⊇ G2 ⊇ · · ·
and G0−1 ⊇ G00 ⊇ G01 ⊇ G02 ⊇ · · ·
Lemma 15.9.11. If L/K is Galois and the residue extension λ/κ is separable, there exists
x ∈ OL such that OL = OK [x].
Proof. By the fundamental equality (Proposition 15.6.1), eL/K fL/K = [L : K]. Since we are
assuming λ/κ is separable, we may choose x̄ ∈ λ such that λ = κ(x̄). Let f¯(t) be the minimal
polynomial of x̄ over κ. Then by Hensel’s Lemma, there is a lift f (t) ∈ OK [t] of f¯(t). Lift
x̄ to an element x ∈ OL . We know vL (f (x)) > 0. If vL (f (x)) = 1, the elements f (x)i xj for
0 ≤ i < eL/K and 0 ≤ j < fL/K generate OL as an OK -module since the number of these
is [L : K]. In this case, it is clear that OK [x] = OL . On the other hand, if vL (f (x)) > 1,
replace x with x + πL , so that
Then f 0 (x) ∈ OL× since f¯ is separable and f¯0 (x̄) 6= 0. Also, vL (f (x)) > 1 implies that
vL (f (x + πL )) = 1. So in all cases, OL is generated by some x ∈ OL as an OK -module.
Let x ∈ OL such that OL = OK [x]. For each nontrivial σ ∈ G = Gal(L/K), write
iL/K (σ) = vL (σ(x) − x) and also set iL/K (1) = ∞. In fact, iL/K (σ) = miny∈OL {vL (σ(y) − y)}
since for any y ∈ OL , we may write
y = a0 + a1 x + . . . + an xn
239
15.9. Higher Ramification Groups Chapter 15. Local Fields
Proof. If σ 0 = 1 then both sides are infinite so the equality holds. Assume σ 0 6= 1. By
Lemma 15.9.11, OL0 = OK [y] for some y ∈ OL0 ; as above, let OL = OK [x]. Then
1
iL0 /K (σ 0 ) = vL0 (σ 0 (y) − y) = vL (σ 0 (y) − y)
eL/L0
which we will rewrite as eL0 /L iL0 /K (σ 0 ) = vL (σ 0 (y) − y). It therefore suffices to show
X
vL (σ 0 (y) − y) = iL/K (σ).
σ|L0 =σ 0
Set a = σ 0 (y) − y and b = τ ∈H (στ (x) − x). If f (t) ∈ OK [t] is the minimal polynomial of x
Q
over K, then
Y Y
f (t) = (t − τ x) =⇒ (σf )(t) = (t − στ x)
τ ∈H τ ∈H
Y
=⇒ (σf )(x) = (x − στ x)
τ ∈H
240
15.9. Higher Ramification Groups Chapter 15. Local Fields
where formally we set [G0 : G−1 ] = [G : G0 ]−1 . Then ϕL/K is piecewise-linear, nondecreasing
and if gs = |Gs |, then we can explicitly write
1
ϕL/K (s) = (g1 + . . . + gm + (s − m)gm+1 )
g0
for any m ∈ N such that 0 < m ≤ s ≤ m + 1. Also, ϕL/K (s) = s for −1 ≤ s ≤ 0. By this
reformulation, we can see that the slope of ϕL/K (s) is gm+1
g0
for all s, where m < s < m + 1,
gs−1
but when s ∈ Z, the slope is g0 . This implies:
1 X
Lemma 15.9.13. For any s ≥ −1, ϕL/K (s) = min{iL/K (σ), s + 1} − 1.
g0 σ∈G
But by maximality, this implies vL (στ (x) − x) = m. On the other hand, if τ ∈ Hm−1 , then
iL/K (τ ) < m so vL (στ (x)−x) = iL/K (τ ). Thus iL/K (στ ) = vL (στ (x)−x) = min{m, iL/K (τ )}.
By Lemma 15.9.12,
1 X
iL0 /K (σ 0 ) = iL/K (σ)
eL/L0 τ ∈H
1 X
= min{m, iL/K (τ )}
h0 τ ∈H
= ϕL/L0 (iL/K (σ) − 1) + 1 by Lemma 15.9.13.
241
15.9. Higher Ramification Groups Chapter 15. Local Fields
Definition. Let L/K be a Galois extension. Then the subgroups Gt := Gs for t = ϕL/K (s)
are called the higher ramification groups for the upper numbering of G.
Since ϕL/K (s) is monotone in s, it has an inverse function ψL/K : [−1, ∞) → [−1, ∞).
Proof. We prove the statement for the ϕ maps; the other statement follows from the fact
that each ψ = ϕ−1 . By Theorem 15.9.14, we know that if t = ϕL/L0 (s) then Gs (L/K)/Hs =
Gs H/H ∼ = (G/H)t . Thus |Gs | = |Hs | |(G/H)t | and comparing the derivatives of ϕL/K (s)
and ϕL0 /K ◦ ϕL/L0 (s), we see that
1
ϕ0L/K (s) = |Gs |
eL/K
1
= |Hs | |(G/H)t | as in Lemma 14.5.16
eL/L0 eL0 /K
1 1
= |Hs | |(G/H)t |
eL/L0 eL0 /K
= ϕ0L/L0 (s)ϕ0L0 /K (t) for s 6∈ Z
= (ϕL0 /K ◦ ϕL/L0 )0 (s) by the chain rule.
Thus ϕL/K (s) and ϕL0 /K ◦ ϕL/L0 (s) differ by a constant away from s ∈ Z, but since both are
continuous and equal to 0 at s = 0, they must be equal.
This shows the advantage of the ramification groups of upper numbering: they are in-
variant under passage to a Galois subextension L0 /K of L/K. By construction, the “jumps”
in the filtration Gs can only occur at integers. However, this is not necessarily true of the
ramification groups of upper numbering Gt . However, we have:
242
15.10. Discriminant and Different Chapter 15. Local Fields
⇐⇒ D−1 −1 −1
B/A TrM/L (I) ⊆ DB/A B = DB/A
⇐⇒ TrL/K (D−1
B/A TrM/L (I)) ⊆ A
⇐⇒ TrM/K (D−1
B/A I) ⊆ OK by transitivity of trace
⇐⇒ D−1 −1
B/A I ⊆ DC/A
⇐⇒ I ⊆ D−1
C/A DB/A .
243
15.10. Discriminant and Different Chapter 15. Local Fields
Corollary 15.10.4. For any A, K, L, B, p, q as above, the different may be computed locally:
Y
DB/A = (DBbq /Abp ∩ B)
q|p
where the product is taken over all primes p ⊂ A and all q ⊂ B lying over p.
Let L/K be an extension of number fields, with rings of integers OK and OL . We will
write DL/K to denote the different DOL /OK . We may assume OL = OK [α] for α ∈ L with
minimal polynomial f (x) over K.
Example 15.10.5. For K = Q(i) with OK = Z[i], T r(a + bi) ∈ Z precisely when 2a ∈ Z,
so we see that Z[i]∨ = 12 Z[i]. Thus the different of K is 2Z[i]. This can be verified with the
next lemma.
Lemma 15.10.6. If L/K is a field extension with OL = OK [α], then DL/K = (f 0 (α)).
f (x)
= b0 + b1 x + . . . + bn−1 xn−1
x−α
. We show the odual basis of {1, α, α2 , . . . , αn−1 } with respect to the trace form
for bi ∈ OK n
is precisely f 0b(α)
0
, . . . , fbn−1
0 (α) . To see this, let α1 , . . . , αn be the distinct roots of f (x). Then
the polynomial
r
X f (x) αr
r
g(x) = x − · 0 i
i=1
x − αi f (αi )
is monic of degree strictly less than n, but α1 , . . . , αn are all roots of g. This implies g = 0,
so n
X f (x) αr
· 0 i = xr
i=1
x − αi f (αi )
f (x) αri f (x)
for each 0 ≤ r ≤ n − 1. Thus TrL/K x−α i
· f 0 (αi )
= xr for 0 ≤ r ≤ n − 1, but x−α =
b0 + b1 x + . . . + bn−1 xn−1 so comparing degrees, we get
bi α j
TrL/K = δij .
f 0 (α)
244
15.10. Discriminant and Different Chapter 15. Local Fields
Now notice that the bi satisfy recursive equations: bn−1 = 1, bn−2 − αbn−1 = an−1 , and so
on. Solving this yields the identity
Since all ai ∈ OK , e | vL (ai ) for each ai and vL (α) = 1, so each term in f 0 (α) has a different
valuation. Thus vL (f 0 (α)) = e − 1 when p - e (the tame case) and vL (f 0 (α)) ≤ vL (e) + e − 1
(the wild case) since OL is a DVR.
(iii) Now suppose L/K is Galois. Then
Y
f 0 (α) = (α − σ(α)).
σ∈Gr{1}
245
15.10. Discriminant and Different Chapter 15. Local Fields
= #{(σ, i) | σ ∈ Gi r {1}, i ≥ 0}
∞
X
= (|Gi | − 1).
i=0
Let L/K be an extension of number fields. Recall from Section 14.3 the definition of the
discriminant dL/K (α1 , . . . , αn ) for a K-basis {α1 , . . . , αn } of L:
As in Proposition 14.5.9, define the discriminant ideal DL/K = (dL/K (α1 , . . . , αn )) for any
such basis.
Theorem 15.10.8. For an extension L/K, the discriminant ideal is the ideal norm of the
different:
DL/K = NL/K (DL/K ).
Proof. Again, we may assume OK and OL are DVRs by Proposition 15.10.3(iii). In par-
ticular, OK is a PID (Proposition 15.1.1) so OL admits an integral basis α1 , . . . , αn by
Proposition 14.3.9. Then DL/K = (dL/K (α1 , . . . , αn )) by definition. On the other hand,
OL is also a PID so D−1 −1 ∗
L/K = βOL for some β ∈ L. By definition, DL/K = (α1 , . . . , αn )
∗
where {α1∗ , . . . , αn∗ } is the dual basis to {α1 , . . . , αn } with respect to the trace form. Then
D−1
L/K = (α1 β, . . . , αn β), so we have
dL/K (α1∗ , . . . , αn∗ ) = dL/K (βα1 , . . . , βαn ) = NL/K (β)2 dL/K (α1 , . . . , αn ).
we obtain [σi (αj )]T [σi (αj∗ )] = In so dL/K (α1 , . . . , αn ) = dL/K (α1∗ , . . . , αn∗ )−1 . It follows that
2
DL/K = NL/K (D2L/K ) but since the norm is multiplicative, we obtain the desired expression.
where the product is taken over all primes p ⊂ OK and all q ⊂ OL lying over p.
We also obtain a strengthening of Proposition 14.5.9:
Corollary 15.10.10. Let L/K be a finite separable extension. Then a prime p ⊂ OK is
ramified in OL if and only if p divides the discriminant DL/K .
Proof. This is immediate from Theorems 15.10.7(i) and 15.10.8.
246
Chapter 16
In order to study harmonic analysis on a global field K in Part VI, we introduce two locally
compact abelian groups:
In ordinary harmonic analysis, recall that Z is a discrete group with dual Hom(Z, R/Z) =
R/Z. Then Z embeds into its universal cover R as a discrete subspace. Moreover, the circle
R/Z ∼= S 1 is compact; thus we say the embedding Z ,→ R is co-compact.
The adèle group will play the role of R here, and we will construct a discrete, co-compact
embedding K ,→ AK . In the case of K = Q, there will be a canonical surjection AQ → R
that induces a cover AQ /Q → R/Z. This mimics the role of the universal cover R → S 1 in
the ordinary version of the theory. Concretely, AK will be a certain ‘restricted’ product of
the completions Kv of K at its places v; likewise, IK will be the ‘restricted’ product of the
unit groups Kv× . We give the construction of this restricted product in the next section, as
well as topological motivation for why we prefer to work with it for analysis.
247
16.1. Restricted Direct Products Chapter 16. Adèlic Number Theory
Definition. The restricted direct product of the collection {Gv }v∈J with respect to
{Hv }v6∈J∞ is defined by
Y
0
Gv := {(xv ) : xv ∈ Gv and xv ∈ Hv for all but finitely many v} .
v∈J
Proof. It’s clear that 0v∈J Gv is closed under the product group operation on v∈J Gv .
Q Q
Remark. The topology on the Q restricted direct product is not the subspace topology in-
herited from the direct product v∈J Gv . In fact, the restricted direct product topology is
strictly finer than the product topology on the given product. We note that the restricted
direct product topology is preferred because it makes G into a locally compact group.
Lemma 16.1.2.
Q0 For any such S, GS is a locally compact subgroup of the restricted direct
product G = v∈J Gv .
Proof. That GS is a subgroup is clear. Note that in the product topology, GS is a product
of finitely many locally compact groups along withQ a product of compact groups, so it is
locally compact (again, in the product topology) in v∈J Gv . However, by definition of the
restricted direct topology on G, it is clear that the subspace topology of GS inherited from
G is precisely the product topology. Hence GS is locally compact in G.
Theorem 16.1.3. Let G = 0v∈J Gv be the restricted direct product of a collection of locally
Q
compact groups {Gv }v∈J with respect to {Hv }v6∈J∞ . Then
248
16.1. Restricted Direct Products Chapter 16. Adèlic Number Theory
Proof. (1) Each x ∈ G lies in GS for some finite set S ⊆ J, so the GS cover G. It follows
from Lemma 16.1.2 that G is locally compact. Q
(2) Suppose Y is contained in such a product v∈J Cv . Then Y is as well, and this
product is compact by Tychonoff’s theorem, so Y is a closed subset of a compact set, hence
compact. Conversely, suppose Y is compact. Since the subgroups GS form an open cover of
G, finitely many of the GS cover Y . But the union of this finite subcover is contained in some
GS0 , so Y ⊆ GS0 . Now note that since the topology on G is finer than the direct product
topology, all of the projections ρv : G → Gv are continuous. Thus since Y ⊆ G is compact,
each ρv (Y ) is compact in Gv . Further, since Y ⊆ GS0 , we have that ρv (Y ) ⊆ Hv for all but
finitely many v. Hence Y is contained in the product of these Hv together with ρv (Y ) for
the remaining v, so indeed Y ⊆ Y is contained in a product of the desired form.
We next construct measures on restricted direct products. Since G is locally compact by
Theorem 16.1.3, there exist Haar measures on G. The trick will be to choose the right one
to agree with the normalized Haar measures on each locally compact group Gv .
Proposition 16.1.4. Let {Gv }v∈J be a collection of locally compact groups, {Hv }v6∈J∞ a
collection of subgroups for almost all v ∈ J and suppose dgv is a Haar measure on Gv which
is normalized so that Z
dgv = 1
Hv
for almost all v 6∈ J∞ . Then there exists a unique Haar measure dg on G such that for every
finite subset S ⊆ J containing J∞ , the restriction of dg to GS coincides with the product
measure on GS .
Q
Proof. For such a set S, let dgS = v∈J dgv be the product measure, restricted to S. Since
the dgv have been normalized Q so that finitely many of the volumes of the Hv are different
from
Q 1, the infinite
Q product v6∈S Hv has finite volume (with respect to the product measure
v6∈S dgv on v6∈S Gv ⊆ GS ). One can then show that dgS is a Haar measure on GS (using
the preceding statement to show that compact sets have finite measure).
Now since G is locally compact (Theorem 16.1.3), there is a Haar measure dg on G and
it restricts to a Haar measure on any GS , so dg is equal to dgS up to a constant. We declare
that dg is the unique Haar measure on G that restricts to dgS on some finite set S ⊆ J
containing J∞ , and proceed to show that this definition of dg does not depend on S.
Suppose S ⊆ T are finite subsets containing J∞ . Consider the set E ⊆ GT defined by
Y Y Y
E= Gv × Hv × Hv .
v∈S v∈T rS v6∈T
is finite by the first paragraph, and GS ⊆ E ⊆ GT , so dgS coincides with the restriction of
dgT to GS . Finally, since the GS cover G, any two GS , GS 0 are contained in a common GT
where T = S ∪ S 0 and our normalized Haar measure is compatible on all of these.
249
16.1. Restricted Direct Products Chapter 16. Adèlic Number Theory
Proposition 16.1.5. Let G be the restricted direct product of {Gv }v∈J with respect to
{Hv }v6∈J∞ . Then
Moreover, Z YZ
f dg = f dgv
G v∈J Gv
1
Q R
and f ∈ L (G) if v Gv
|fv | dgv is finite.
250
16.2. Adèles and Idèles Chapter 16. Adèlic Number Theory
with respect to the subgroups Ov . Here J is the set of all places v of K and J∞ is the set of
all infinite/archimedean places.
Let R× denote the multiplicative group of units in any ring R. Then Kv× is a locally
compact group for each place v of K and for every finite place, Ov× ⊂ Kv× .
Proposition 16.2.3. IK ∼
= A×
K , the group of units in the adèle ring.
Fix a global field K and let S∞ be the set of infinite places of K. Using the notation of
the subgroups in Lemma 16.1.2, define
Y Y
A∞ := (AK )S∞ = Kv × Ov .
v∈S∞ v6∈S∞
251
16.2. Adèles and Idèles Chapter 16. Adèlic Number Theory
v – that is,
Qnm clears the denominators of x. By the Chinese remainder theorem (3.2.10),
∼ rj
Z/mZ = j=1 Z/pj Z so there exists some λ ∈ Z so that
r
mxj ≡ λ mod pj j for each 1 ≤ j ≤ n,
λ
where xj is the component of x at vpj . Set u = m . Then x − u = m−1 (mx − λ). At the places
corresponding to the primes pj , we have |x − u|pj ≤ 1. At any other place v, |m−1 |v = 0 so
again |x − u|v = |mx − λ|v ≤ 1 since (mx − λ)v ∈ Ov . Hence xv − u ∈ Ov for all finite v.
For the second statement, note that all elements of K ∩ AK have the form (x, x, x, . . .)
for x ∈ K, so x ∈ Ov for every place v and hence x ∈ OK .
Corollary 16.2.5. AQ = Q + (R × Z) b and Q ∩ A∞ = Z.
Q b = Q Zp .
Proof. Follows from the identifications A∞ = R × p Zp and Z p
Next, we investigate the geometry of the quotient AK /K for any global field K.
Lemma 16.2.6. Let E/K be a finite extension of global fields and fix a K-basis {u1 , . . . , un }
of E. Then the map
n
Y
AK −→ AE
j=1
n
X
((xv,1 )v , (xv,2 )v , . . .) 7−→ uj (xv,j )v
j=1
∼ ∼
Y Y
Kv −
→ Ev and Ov −
→ OEv .
v v
Suppose S is a finite set of places of K containing the infinite places and consider the
associated subgroup ASK := (AK )S . Set
Y Y
ASE := Ev × OEv .
v∈S v6∈S
Qn
Then the ASE cover AE (just as in the proof of Theorem 16.1.3), so the isomorphism j=1 AK →
AE can be defined locally using the above isomorphisms.
Theorem 16.2.7. K is a discrete, cocompact subgroup of AK .
Proof. Let K0 denote Q or Fp (t) according to whether char K = 0 or p, respectively. Put
n = [K : K0 ]. Then by Lemma 16.2.6, we have a commutative diagram with isomorphisms
along the rows:
252
16.2. Adèles and Idèles Chapter 16. Adèlic Number Theory
n
AK0 ∼ AK
Y
j=1
n
Y
K0 ∼
K
j=1
Therefore it suffices to show K0 is discrete in AK0 and AK0 /K0 is compact. So we may reduce
to K = K0 . For simplicity, we take K = K0 = Q, but the proof is even easier in the Fp (t)
case.
Define the subset
1
C = x ∈ AK : |x∞ |∞ ≤ and |xv |v ≤ 1 for all finite v ⊆ AK .
2
Q
Notice that C lies in A∞ = R × v6=∞ Ov and as we observed in Lemma 16.1.2, this set has
the product topology so it follows that C, being the product of compact sets, is compact in
AK . We claim that AK = K + C and K ∩ C = {0}. In fact, the latter is obvious since we
are taking K = K0 . For the former claim, take y = (yv ) ∈ AK . By the strong approximation
theorem (16.2.4), there exist some δ ∈ K such that yv − δ ∈ Ov for all finite places v. At
v = ∞, let δ 0 be the nearest integer to y∞ − δ (in the Fp (t), one may just wipe out the
constant term of the polynomial y − δ). Then |y∞ − δ − δ 0 |∞ ≤ 12 and for any finite place v,
δ 0 ∈ Ov which implies |yv − δ − δ 0 |v ≤ 1. Hence AK ⊆ K + C as required.
This proves the existence of a surjective, continuous map C → AK/K, so because C is
compact, AK /K is compact as well. Further, since 0 lies in the open set x ∈ C : |x∞ |∞ < 21 ,
0 is an isolated point of K ⊆ AK . Then since K ,→ AK is a group homomorphism, this
implies every point of K is isolated. Hence K is discrete.
AQ /Q −→ lim R/nZ
←−
Cn = {x ∈ AQ | x∞ = 0, xp ∈ pordp (n) Zp }
lim AQ /C n −→ AQ
←−
((xp,n )p )n 7−→ lim xp,n .
n→∞ p
253
16.2. Adèles and Idèles Chapter 16. Adèlic Number Theory
∼
In turn, this gives an isomorphism AQ /Q −
→ lim AQ /(Q + Cn ). Consider the map
←−
R/nZ −→ AQ /(Q + Cn )
x 7−→ (x, 0, 0, . . .)
where x∞ = x and xv = 0 for all finite places v of Q. This map is well-defined, since for any
a ∈ Z, na maps to (na, 0, 0, . . .) = (na, na, na, . . .) + (0, −na, −na, . . .) ∈ Q + Cn . The
Q map
is also injective by observation. Finally, Corollary 16.2.5 gives us AQ = Q + (R × Zp ) so
any adèle x ∈ AQ can be written x = (a + s, a + x2 , a + x3 , . . .) for some a ∈ Q, s ∈ R and
xp ∈ Zp . Then the approximation theorem allows us to write x = (r, 0, 0, . . .) + (b, b, b, . . .) +
(0, y2 , y3 , . . .) for b ∈ Q, r ∈ R and certain yp ∈ Zp for each prime p. Then r 7→ (r, 0, 0, . . .)
which is the image of x in the quotient AQ /(Q + Cn ). Putting these maps together for each
n ≥ 1, we get the desired isomorphism.
Remark. One should regard lim R/nZ as the profinite completion of the universal cover of
←−
the circle R/Z, so Theorem 16.2.8 says that AQ /Q is the ‘algebraic universal cover’ of R/Z.
The Galois group of this cover is Z,
b which is in fact the algebraic fundamental group of
R/Z ∼= S 1.
254
16.3. Idèle Class Group Chapter 16. Adèlic Number Theory
Proof. In the archimedean cases, this is clear from the above definitions. So suppose k and
` are nonarchimedean, π` is a uniformizer of ` and n = [` : k]. If e is the ramification index
of `/k, then πk = π`e is a uniformizer of k and by algebraic number theory, n = ef where f
is the degree of the residue field extensions, so |O` /π` O` | = q f . Now consider
e
e n 1 1 1
|N`/k (π` )|k = |N`/k (πk )|k = |πk |k = n = ef = = |π` |e` .
q q qf
Since norm is multiplicative, take the eth root to get |N`/k (π` )|k = |π` |` . Since π` is a
uniformizer, this also holds for any x ∈ `.
Now let K be a global field and for each place v of K, let Kv be the complete local field
at v.
Definition. The absolute value of the idèle group IK is the map
| · |K : IK −→ R>0
Y
(xv ) 7−→ |xv |v .
v
The following generalizes the product formula for completions of Q (Lemma 15.2.7).
Theorem 16.3.2. Let K be a global field with group of idèles IK . Then
(1) (Artin’s Product Formula) For all x ∈ K × , |x|K = 1.
255
16.3. Idèle Class Group Chapter 16. Adèlic Number Theory
(2) | · |K is surjective onto R>0 when char K = 0 and has image {pm0 n | n ∈ Z} for some
m0 ∈ Z when char K = p > 0.
Proof. First suppose E/K is a finite, separable extension. Let PK (resp. PE ) denote the set
of places of K (resp. E). Then for any x ∈ E × ,
Y Y
|x|E = |x|v
u∈PK v∈PE
v|u
Y Y
= |NEv /Ku (x)|u
u∈PK v∈PE
v|u
by the isomorphism E ⊗K Ku ∼
Y Y
= |NE/K (x)|u = Ev
u∈PK v|u
= |NE/K (x)|K .
Therefore if (1) and (2) hold for K, they also hold for E so we may reduce to the case when
K = Q or K = Fp (t).
(1) If K = Q and p ∈ Z is prime, for each place v we have
p, v = ∞
|p|Qv = p1 , v = p
1, otherwise.
This implies that |p|Q = 1 and since norm is multiplicative, this shows |x|Q = 1 for all
x ∈ Q× . The proof is similar for K = Fp (t).
(2) For K = Q, this is obvious. When K = Fp (t), suppose v is the place where the
residue field is Fp and πv is the uniformizer. Then |πv |v = p1 and taking powers shows that
the image of | · |v is pm0 Z .
Definition. The group of norm 1 idèles of K is the kernel of the normalized absolute
value on K, written
I1K = {x ∈ IK : |x|K = 1}.
1
We also define the norm 1 class group to be CK = I1K /K × .
256
16.3. Idèle Class Group Chapter 16. Adèlic Number Theory
Z1 = {z1 − z2 | z1 , z2 ∈ Z}
and Z2 = {z1 z2 | z1 , z2 ∈ Z1 }.
Since addition and multiplication are continuous on AK , we see that Z1 and Z2 are compact
subsets of AK . By Theorem 16.2.7, K embeds as a discrete subgroup of AK , so K × Z2 is
finite, say K × ∩ Z2 = {y1 , y2 , . . . , yr }. Let δ : IK ,→ AK × AK be the natural inclusion
x 7→ (x, x−1 ). Define the set
r
[
δ −1 {(u, yj−1 v) : u, v ∈ Z1 } .
Ψ=
j=1
Proposition 16.3.4. For any finite set S containing the infinite primes of K (if they exist),
IK,S is an open subgroup of IK which is compact if and only if S = ∅.
Proof. Lemma 16.1.2 gives us that IK,S is a (locally compact) subgroup, and it is clear that it
is an open subgroup since the topology induced on IK,S is equivalent to the product topology.
Second, the fact that IK,S is compact if and only if S is empty follows from the observation
that for any place v, Kv× is not compact in Kv .
Definition. For any finite set S containing the infinite primes of K, define the norm 1
S-idèles by I1K,S = I1K ∩ IK,S and the ring of S-integers of K by RS = K ∩ ASK .
257
16.3. Idèle Class Group Chapter 16. Adèlic Number Theory
Remark. If K is a number field and S∞ is the set of infinite primes of K, then RS∞ = OK ,
the ring of algebraic integers in K. If K is a function field and S∞ denotes the archimedean
places of K, then RS∞ = OK is the algebraic closure of Fq [t] in K.
Lemma 16.3.5. An element x ∈ K × is a root of unity in K if and only if |x|v = 1 for every
place v of K.
The following generalizes Dirichlet’s unit theorem for number fields (Corollary 15.1.11).
Proposition 16.3.6. For any global field K,
(1) I1K,S /RS× is compact.
(2) There is an isomorphism
RS× ∼
= µ(K) × Zr(S)
where µ(K) is the set of roots of unity in K and r(S) = |S| − 1.
Proof. (1) By Proposition 16.2.3, we have RS× = K × ∩ IK,S = K × ∩ I1K,S . Then since I1K,S is
an open subgroup of I1K , I1K,S /RS× is both an open and closed subgroup in I1K /K × , which is
compact by Theorem 16.3.3. Therefore I1K,S /RS× is compact. Q
(2) For each place v of K, let Cv = {xv ∈ Kv : |xv |v = 1} and put C = v Cv . Then since
each Cv is compact in Kv× and the subspace topology on IK,S ⊆ IK is the product topology,
we see that C is compact. Consider the short exact sequence
Y
1 → C → IK,S → (Kv× /Cv ) → 1.
v∈S
For each place v, Kv× /Cv is isomorphic to the value group of v, so in particular by Theo-
rem 16.3.2, (
R>0 ∼
= R, v is archimedean
Kv× /Cv ∼= mZ ∼
p = Z, v is nonarchimedean.
Write |S| = r = r1 + r2 where r1 , r2 are the numbers of archimedean and nonarchimedean
valuations in S, respectively. Then the above short exact sequence yields
1 → C → I1K,S → Rr1 × Zr2 → 1.
Next, Lemma 16.3.5 implies that C ∩ K × = µ(K). Given this and the fact that I1K,S ∩ K × =
RS× , the short exact sequence becomes
1 → µ(K) → RS× → L → 1
and one can show that L ∼
= Zr .
Definition. For a finite set S containing J∞ , the S-class group of K is CK,S = IK /K × IK,S .
Note that I1K ,→ IK induces an inclusion
I1K /K × I1K,S ,−→ IK /K × IK,S
which is an isomorphism whenever S 6= ∅ and has cokernel Z by (2) of Theorem 16.3.2 when
S = ∅ (because in this case char K > 0).
258
16.3. Idèle Class Group Chapter 16. Adèlic Number Theory
Proof. We know I1K,S is open in I1K and by Theorem 16.3.3, I1K /K × is compact. Thus there
is a finite open cover of I1K,S in I1K , so I1K /K × I1K,S is finite. This proves (1). In the S = ∅
case, char K > 0 and the cokernel of the injection
α : IK −→ Cl(K)
" #
Y
x 7−→ pvp (xp )
p∈Spec R
259
Part IV
260
Chapter 17
The contents of Chapters 17 and 18 are a product of research in class field theory as part of
my Master’s thesis at Wake Forest University. The main topics covered are:
Dirichlet L-series, Dirichlet density and the proof of Dirichlet’s theorem on primes in
arithmetic progression
– Artin reciprocity
– The Conductor Theorem
– The fundamental equality
– The Existence and Classification Theorems
A primary motivation for studying these topics is to fully answer the question, described
in Cox’s Primes of the Form x2 + ny 2 , “Given a positive integer n, when can a prime number
be written in the form x2 + ny 2 ?” The reader will see that although the question has a
rather elementary statement, it requires the depth and power of class field theory to fully
understand. After describing the answer to this first question, we will turn our attention to
the much more difficult, and unanswered question, “Given a positive integer n, if x2 + ny 2
is prime, when is y 2 + nx2 also prime?”
261
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
262
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
P ⊂
Definition. For a given prime OL , the unique element σ ∈ DP described above is called
L/K
the Artin symbol, denoted . It satisfies
P
L/K
(α) ≡ αN(p) mod P
P
L/K
for all α ∈ OL , where p = P ∩ OK . If p = OK ∩ P then is called a Frobenius
P
element for p.
We will describe Frobenius automorphisms in greater detail in Section 17.3 but for now
we will focus on their relation to the Hilbert class field.
Proposition 17.1.3. For a Galois extension L/K, an unramified prime p ⊂ OK and a
prime P ⊃ p, the Artin symbol has the following properties.
L/K L/K
(i) For all σ ∈ Gal(L/K), =σ σ −1 .
σ(P) P
L/K
(ii) The order of in DP is the inertial degree f = f (P | p).
P
L/K
(iii) p splits completely in L ⇐⇒ = 1.
P
L/K
Proof. (i) follows from the uniqueness of and Proposition 14.5.13.
P
(ii) From Lemma 17.1.2, DP ∼ =G e = Gal(`/k) and the order of G e is [OL /P : OK /p] = f .
L/K
By definition, the Artin symbol maps to a generator of G so the order of
e is f .
P
(iii) Recall that p splits completely if and only
if e = f = 1. Then e = 1 since we are
L/K
assuming p is unramified in L, and f = 1 ⇐⇒ = 1 follows from part (ii).
P
Since L/K is abelian, the Artin symbol only depends on the underlying prime p: if P
and P0 are both primes of OL containing p, then P0 = σ(P) for some σ ∈ Gal(L/K) as we
have already shown. Thus (i) of the proposition implies
L/K L/K L/K −1 −1 L/K L/K
= =σ σ = σσ = .
P0 σ(P) P P P
L/K
We will write the Artin symbol as to indicate that it is determined by the underlying
p
prime p ⊂ OK .
The Artin symbol is the first step in establishing a powerful tool in class field theory called
Artin reciprocity (Section 17.8). The name comes from the fact that it is a generalization
of more elementary reciprocity laws, such as quadratic, cubic and biquadratic reciprocities
established by Euler, Legendre and Gauss. The next example shows that the Artin symbol
properly encapsulates cubic reciprocity.
263
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
√ √
Example
√
17.1.4. Let K = Q( −3) and L = K( 3 2). Here OK = Z[ω] where ω = e2πi/3 =
−1+ −3
2
. Note that for the extension K/Q, we have n = 2, r = 0, s = 1 and dK = −3 so the
Minkowski bound for K is 1
2! 4 √
BK = 2 3 ≈ 1.103.
2 π
As we have seen before, this shows that K has class number 1, which is equivalent to Z[ω]
being a PID.
Knowing that the ring of integers is a PID is important, since any prime ideal can be
written πZ[ω] for some prime element π ∈ Z[ω]. One can calculate
that Gal(L/K) ∼ = Z/3Z
L/K
but the important part is that Gal(L/K) is abelian, so is defined. In fact the entire
√ π
automorphism is determined by its action on 3 2:
L/K √ 2 √
3 3
( 2) = 2
π π 3
2
where is the cubic Legendre symbol, defined to be the unique cubic root of unity to
π 3
which 2(N(π)−1)/3 is congruent mod π. Specifically, let P be a prime of OL lying over π. Then
by definition,
L/K √ √ 2 √
3 (N(π)−1)/3 3 3
( 2) ≡ 2 · 2≡ 2 mod P.
π π 3
Hence the Artin symbol generalizes the cubic Legendre symbol!
When L/K is an unramified abelian extension, things are especially nice. Let
Y IK be the
group of fractional ideals of OK . For any a ∈ IK with prime factorization a = pri i we can
define the Artin symbol on a by
Y r
L/K L/K i
= .
a pi
Definition. The Artin map for an extension L/K is the homomorphism
L/K
: IK −→ Gal(L/K).
·
Notice that if L/K is ramified at any primes, the Artin map is not defined for all of IK .
Likewise if Gal(L/K) is not abelian, the Artin symbol may not be uniquely defined for all
p ∈ IK . For this reason many of the main theorems in class field theory are complicated to
state, as we will see in subsequent sections. However when L is the Hilbert class field of K
we have the following characterization of the Artin map.
Theorem 17.1.5 (Artin Reciprocity for the Hilbert Class Field). If L is the Hilbert class
field of a number field K, the Artin map
L/K
: IK −→ Gal(L/K)
·
is surjective and its kernel is PK . Therefore the Artin map induces an isomorphism C(OK ) ∼
=
Gal(L/K) where C(OK ) = IK /PK is the ideal class group.
264
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
Proof. This will follow from the full Artin reciprocity theorem in Section 17.8.
Using Galois theory, we have the following classification of unramified abelian extensions
of K.
Furthermore, if the extension M/K corresponds to the subgroup H, then the Artin map
induces an isomorphism C(OK )/H ∼= Gal(M/K).
Proof. This too will be proven in a more general setting in Section 17.10.
This is a good example of the general strategy employed in class field theory: describe a
certain type of extensions of K – in this case unramified abelian extensions – using informa-
tion encoded in K itself, e.g. subgroups of the class group.
Corollary 17.1.7. Let L be the Hilbert class field of a number field K and let p ⊂ OK be a
prime ideal. Then p splits completely in L ⇐⇒ p is a principal ideal.
L/K
Proof. By (iii) of Proposition 17.1.3, p splits completely if and only if = 1. Since the
p
Artin map ∼
induces C(OK ) = Gal(L/K) by the Artin reciprocity theorem (Theorem 17.1.5),
L/K
= 1 ⇐⇒ [p] is trivial in the class group, which is equivalent to p being a principal
p
ideal.
The Hilbert class field has an important application to the study of primes of the form
p = x2 + ny 2 .
Theorem 17.1.8. Let n > 0 be a squarefree integer such that n 6≡ 3 (mod 4). Then there
√ irreducible polynomial fn (x) ∈ Z[x] of degree h(−4n) – the class number of
is a monic
K = Q( −n) – such that if p is an odd prime that does not divide n or the discriminant of
fn , then
2 2 −n
p = x + ny ⇐⇒ = 1 and fn (x) ≡ 0 (mod p) has an integer solution.
p
Furthermore, any choice of fn (x) will be the minimal polynomial of a real algebraic integer
α for which L = K(α) is the Hilbert class field of K.
We devote the rest of this section to the proof of Theorem 17.1.8 and its applications.
The first step is to relate p = x2 + ny 2 to the splitting behavior of p in the Hilbert class field.
√
Theorem 17.1.9. Let L be the Hilbert √ class field of K = Q( −n), where n > 0 is squarefree
and n 6≡ 3 (mod 4), so that OK = Z[ −n]. If p is an odd prime not dividing n, then
p = x2 + ny 2 ⇐⇒ p splits completely in L.
265
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
266
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
We may assume p splits completely in K, so that Z/pZ ∼ = OK /p. Since f (x) is separable
over Z/pZ, it is separable over OK /p. Then Theorem 14.5.7 gives us
p splits completely in L ⇐⇒ f (x) ≡ 0 mod p is solvable in OK
⇐⇒ f (x) ≡ 0 mod p is solvable in Z.
Finally (2) is proven using (iii) ⇐⇒ (iv) from the previous proof.
We are now ready to prove Theorem 17.1.8.
√
Proof. Since the Hilbert class field L of K = Q( −n) is Galois over Q, Theorem 17.1.10 says
there is a real algebraic integer α which is a primitive element of the extension L/K. Let fn
be its minimal polynomial and let p be a prime that does not divide n or the discriminant
of fn . Then the previous two theorems show that
p = x2 + ny 2 ⇐⇒ p splits completely in L
−n
⇐⇒ = 1 and fn (x) ≡ 0 mod p is solvable in Z.
p
As discussed in the proof of Theorem 17.1.9, the hypotheses imply that dK = −4n so
dK −n
= .
p p
It remains to show that deg fn = h(−4n), but by√Artin reciprocity, [L : K] = | Gal(L/K)| =
|C(OK )|, and h(−4n) = |C(OK )| when K = Q( −n), so the theorem is proved.
The polynomial fn (x) is not unique since L/K has infinitely many primitive elements.
We can at least use this theorem to predict deg fn , and later we will see that fn (x) completely
describes the Hilbert class field – quite an amazing result indeed!
The Hilbert class field also allows us to relate the ideal class group C(OK ) to the form
class group C(dK ) for binary quadratic forms. In Section 18.2 we prove
Theorem. Let K be an imaginary quadratic field of discriminant dK = −4n, n ≥ 1.
(1) If f (x, y) = ax2 + bxy + cy 2 is a primitive positive definite quadratic form of discrim-
inant dK , then
p p
[a, (−b + dK )/2] = {ma + n(−b + dK )/2 | m, n ∈ Z}
is an ideal of OK .
√
(2) The map f (x, y) 7→ [a, (−b + dK )/2] is an isomorphism between C(OK ) ∼ = C(dK )
and hence |C(OK )| = h(dK ) which is the number of reduced forms of discriminant dK .
√
Example 17.1.11. Let K = Q( −14). Here dK = −56 and the reduced forms of discrimi-
nant −56 are:
x2 + 14y 2
2x2 + 7y 2
3x2 ± 2xy + 5y 2 .
267
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
Moreover, only x2 + 14y 2 and 2x2 + 7y 2 belong to classes of order at most 2. Thus C(−56) ∼
=
∼
Z/4Z and by the above theorem C(OK ) = Z/4Z.
We know from Theorem 17.1.8 that there is a polynomial f14 (x) such that
2 2 −14
p = x + 14y ⇐⇒ = 1 and f14 (x) ≡ 0 mod p has an integer solution.
p
We determined above that h(−56) = sopdeg f14 = 4, but we don’t yet know how to find this
√
polynomial. Let L = K(α) where α = 2 2 − 1. We claim that L is the Hilbert class field
of K. To check this, we need the following lemma.
√
Lemma 17.1.12. Let L = K( β) for some β ∈ OK and let p ⊂ OK be a prime ideal. Then
p is unramified in L if either of the following two conditions are met:
(i) 2β 6∈ p, or
268
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
Thus the class group C(OK ) is generated by prime ideals with norm ≤ 5. These correspond
to ideals pOK for p = 2, 3 and 5. Corollary 15.10.10 tells us that of these, only 2 ramifies,
so we have the following factorizations:
2OK = p22 where p2 is prime.
Thus by Proposition 14.6.1, 3 splits in K and we write 3OK = p3 p03 for prime ideals
p3 6= p03 .
269
17.1. The Hilbert Class Field Chapter 17. Global Class Field Theory
q √
We claim that the Hilbert class field of K is L = K(α), where α = (1 + 17)/2,
following a suggestion in Cox. The work above shows the Hilbert class field is a degree 4
extension of K, so it suffices to show that L = K(α) is an unramified abelian extension of
degree 4 over K, from which it will follow from the uniqueness of the Hilbert class
√ field.
2 2
It’s easy to verify, using the minimal polynomial x − x − 4 for α = (1 + 17)/2, that
the minimal polynomial for α is f (x) = x4 − x2 − 4 which splits in L. This shows that L/K
is Galois, so [L : K] = 4. Of course every group of order 4 is abelian, so L/K is an abelian
extension. It remains to check that L/K is ramified at every √ prime of OK .
Of course any infinite prime is unramified since K = Q( −17) is imaginary quadratic and
thus has√no real embeddings. We will use Lemma 17.1.12 to show that E/K and L/E, where
E = K( 17), are both unramified √ extensions and it√will follow that L/K
√ is unramified. As
a sidenote, observe that α2 = (1 + 17)/2 implies 17 ∈ L, so K ⊂ K( 17) ⊂ L and thus
it makes sense to define the extensions E/K and L/E.
Let p be a prime ideal of OK . Since (i) of Lemma 17.1.12 tells us that p is unramified in
E whenever 2 6∈ p, let us assume 2 ∈ p. Note that 17 6∈ p and 17 can be written
17 = 12 − 4(−4)
270
17.2. Orders Chapter 17. Global Class Field Theory
17.2 Orders
In the previous section we were able to prove a full characterization of when a prime is of
the form p = x2 + ny 2 given certain restrictions on n. We have thus described the main
question for infinitely many
√ n, but what about the rest?
In general, if K = Q( n) we have the following characterization of the ring of integers:
( √
Z[h n] i if n 6≡ 1 (mod 4)
OK = √
Z 1+2 n if n ≡ 1 (mod 4).
The
√ important thing is that when n does not satisfy
√ the criteria in Section 17.1, i.e. when
Z[ −n] is √ not the full ring of integers for Q( −n), we still have a characterization that
involves Z[ −n]. We will make some headway on the x2 + ny 2 question towards the end of
this section, but a full characterization of primes of the form x2 + ny 2 will not be possible
until we have the
√ theorems of class field theory at our disposal.
The ring Z[ −n] is an example of an order.
1K ∈ O
O contains a Q-basis of K.
There is a more general notion of an order in an arbitrary ring R, but the behavior is
quite different even when R is not a field. We will primarily make use of orders in quadratic
fields.
(3) OK is an order in K containing every other order. In other words OK is the maximal
order in K.
271
17.2. Orders Chapter 17. Global Class Field Theory
Proof. (1) Clearly O is torsion free, so since it is a Z-module it is free. Also, since O contains
a Q-basis of a quadratic field, O is at least rank 2, so it must be exactly rank 2.
(2) follows from the fact that O contains a Q-basis for K.
(3) Since 1K ∈ OK and OK is a Z-module of rank [K : Q] = 2 by Proposition 14.3.9,
it suffices to show that OK contains a√ basis for K/Q. But this follows from the discussion
above: OK is generated by 1 and dK +2 dK .
Now let O be any order in K. Since O is a free Z-module, it is noetherian. Let α ∈ O
and consider the chain of Z-submodules I0 ⊂ I1 ⊂ I2 ⊂ · · · where I0 = Z and for n ≥ 1,
In = Z + αZ + α2 Z + . . . + αn Z.
By the noetherian condition, there is some n such that for all m ≥ n, Im = In . So for all
such m we have Z + αZ + . . . + αm Z = Z + αZ + . . . + αn Z. This implies αm = αi for some
1 ≤ i ≤ n and thus the powers of α are finite. This shows that Z[α] is finitely generated as
a Z-module, so Lemma 14.1.1 shows α ∈ OK . Thus O ⊂ OK .
Example 17.2.2. For K = Q(α) where α is an algebraic integer, Z[α] is an order in OK
but in general Z[α] 6= OK .
Example 17.2.3. For K = Q(i), the subring Z + niZ ⊂ Z[i] is an order for every nonzero
n ∈ Z. However, Z ⊂ Z[i] is not an order since Z does not have finite index in Z[i].
The next lemma shows that this is essentially the form of every order in a quadratic field.
Lemma 17.2.4. Let O be an order in a quadratic field K with discriminant dK and ring of
integers OK . Then f = [OK : O] is finite and O = Z + f OK .
Proof. The finiteness of f is a result of the fact that O and OK are both free Z-modules of
rank 2. On one hand, since f = [OK : O] we have
f OK ⊂ O =⇒ Z + f OK ⊂ O.
On the other hand, our description of OK at the beginning of the section allows us to write
Z + f OK = [1, f wK ], where √
dK + dK
wK = .
2
Clearly [1, f wK ] has index f in [1, wK ] = OK , which proves the result.
Definition. The index f = [OK : O] is called the conductor of the order.
This is not to be confused with the conductor of an extension in class field theory, which
will be discussed in Section 17.9. To add to the clutter, each order has an associated value
called the discriminant which is distinct from, although related to, the field discriminant.
Definition. For an order [α, β], its discriminant is defined to be
2
α β
D = det 0 0
α β
where α0 and β 0 denote the respective images of α and β under the nontrivial automorphism
of K/Q.
272
17.2. Orders Chapter 17. Global Class Field Theory
α β
The discriminant of an order is independent of the basis chosen, since if A = 0 0 then
α β
changing basis is done by conjugating A by some invertible matrix B, but this doesn’t change
the determinant calculation above. Therefore we can let O = [1, f wK ] as in Lemma 17.2.4
and have D = f 2 dK . This shows that an order is determined by its conductor. Moreover,
the maximal order OK has conductor 1 which shows that the discriminant of OK is dK .
√By our description of dK for quadratic
√ fields, we see that D ≡ 0, 1 (mod 4). Let K =
Q( −n) for any integer n. Then Z[ −n] is an order in K with discriminant −4n. By the
comments
√ above, −4n = f 2 dK which makes it relatively easy to compute the conductor of
Z[ −n].
In fact, if D ≡ 0 or 1 (mod 4) there will be an in order in a quadratic field whose
discriminant is D. For D ≡ 0 (mod √ 4), we may write D = 4n and see that the maximal
order OK = [1, wK ] in K = Q( n) has discriminant dK =h 4n =i D. On the other hand,
√ √
if D ≡ 1 (mod 4), Q( D) has ring of integers OK = Z 1+2 D which has discriminant
dK = D.
Recall that OK is a Dedekind domain and has unique factorization of ideals. Unfortu-
nately this is not true in general for an order O ( OK so our description of the ideals of O
requires a bit more care. It turns out that we can still define a class group C(O) by restrict-
ing to certain types of ideals. One should view the subsequent construction as a precursor
to the types of constructions used in class field theory in the following sections.
Proposition 17.2.5. Let a be a nonzero ideal in an order O of K. Then the quotient O/a
is finite.
Proof. By Proposition 14.8.2, every nonzero ideal a of the maximal order OK has finite index
in OK . If b is a nonzero ideal in an order O of K, Proposition 17.2.1 tells us that O ⊂ OK
so that b ⊂ OK . Then [OK : b] = [OK : O][O : b] and the left side is finite, so [O : b] must
also be finite.
This allows us to define
For any nonzero ideal a ⊂ O, O ⊆ {β ∈ K : βa ⊂ a}, but equality may not always hold.
The ideals for which equality does hold have a special name.
Notice that principal ideals are always proper. Also, every ideal of the maximal order
OK is proper. From this definition we proceed with our construction of a class group for O
by defining an analog of fractional ideals.
Proposition 17.2.6. Every fractional O-ideal is of the form αa for some nonzero α ∈ K
and ideal a ⊂ O.
273
17.2. Orders Chapter 17. Global Class Field Theory
Proof. This is identical to the property for fractional ideals of a Dedekind domain.
Lemma 17.2.7. Let K = Q(α) be a quadratic field and suppose ax2 + bx + c is the minimal
polynomial for α – we may assume (a, b, c) = 1. Then [1, α] is a proper fractional ideal of
the order [1, aα] in K.
Proof. First, [1, aα] is an order by Lemma 17.2.4 since [1, aα] = Z + aαOK and aα is an
algebraic integer. Now suppose β ∈ K such that β[1, α] ⊂ [1, α]. This is equivalent to
β · 1 ∈ [1, α] and β · α ∈ [1, α].
The first of these gives us β = j + kα for j, k ∈ Z, so we can write the second as
2 k ck bk
β · α = (j + kα)α = jα + kα = jα + (−bα − c) = − + − + j α.
a a a
By hypothesis (a, b, c) = 1 so the above shows β · α ∈ [1, α] if and only if a | k. This implies
{β ∈ K : β[1, α] ⊂ [1, α]} = [1, aα]
proving [1, α] is a proper fractional ideal of [1, aα].
For orders in a quadratic field, we have a nice characterization of their fractional ideals.
Proposition 17.2.8. A fractional O-ideal a is proper if and only if a is invertible.
Proof. ( ⇒= ) If a is invertible, there exists some fractional O-ideal b such that ab = O.
Suppose β ∈ K such that βa ⊂ a. Then
βO = β(ab) = (βa)b ⊂ ab = O.
This implies β ∈ O so a is a proper fractional O-ideal.
( =⇒ ) Suppose a ⊂ O is a proper fractional ideal. Since K is quadratic, a is a free
Z-module of rank 2, so a = [β, γ] for some β, γ ∈ K. Let α = βγ ; then a = β[1, α] and
Lemma 17.2.7 implies that O = [1, aα] where ax2 + bx + c is the minimal polynomial of α
over Q. Let z 7→ z 0 be the nontrivial automorphism in Gal(K/Q). Since α0 is also a root
of ax2 + bx + c, Lemma 17.2.7 also shows that a0 = β 0 [1, α0 ] is a fractional O-ideal. We will
show that aaa0 = N(β)O. Note that
aaa0 = aββ 0 [1, α][1, α0 ] = N(β)[a, aα, aα0 , aαα0 ].
Also observe that α + α0 = − ab and αα0 = ac , so
aaa0 = N(β)[a, aα, −b, c] = N(β)[1, aα] = N(β)O
since (a, b, c) = 1. This proves the claim, and it follows that a is invertible.
√ √
Example 17.2.9.√ O = Z[ −3] is an order of conductor 2 in K = Q( −3). Consider the
ideal [2, 1 + −3] in O. It’s easy to see that
√ √
O ( {β ∈ K : β[2, 1 + −3] ⊂ [2, 1 + −3]} = OK .
√ √ √ √
Further, 2, 1+ −3 and 1− −3 are all irreducible in O, but 4 = 2·2 = (1+ −3)(1− −3)
showing that unique factorization fails in O.
274
17.2. Orders Chapter 17. Global Class Field Theory
In the next theorem we construct a class group C(O) for an order in a quadratic number
field. As with the class group in Section 14.9, we take a quotient of a fractional ideal group
by some principal fractional ideals, but in this context we must restrict our consideration to
proper fractional ideals in O.
Theorem 17.2.10. Given an order O in a quadratic number field, the set I(O) of proper
fractional O-ideals forms a group under ideal multiplication. Moreover, the set P (O) of
principal O-ideals is a subgroup of I(O) and hence the ideal class group C(O) = I(O)/P (O)
is defined.
Proof. Let a and b be proper fractional ideals of the order O. By Proposition 17.2.8, it is
equivalent to consider invertible ideals. First note that O is clearly the identity in I(O).
Since a is invertible, there is some fractional O-ideal which we will denote a−1 , such that
aa−1 = O. This shows that a−1 is also invertible and hence proper, so I(O) has inverses.
Now consider the product (ab)c, where we set c = b−1 a−1 . Then
so we see that ab is invertible and hence proper. This proves that I(O) is a group. Clearly
P (O) is a subgroup of I(O) since every principal ideal is proper, and the product of principal
ideals is again principal. C(O) = I(O)/P (O) is a quotient of abelian groups, so it is a group.
This completes the proof of the theorem.
In order to make our work on orders in quadratic fields more compatible with the rest of
class field theory, it will be advantageous to translate O-ideals into the language of OK -ideals.
Proof. (1) Define the map ϕf : O/a → O/a to be multiplication by f . Note that
a + f O = O ⇐⇒ ϕf is surjective
⇐⇒ ϕf is an isomorphism
⇐⇒ f and |O/a| are relatively prime
where the last equivalence comes from the fundamental theorem of finite abelian groups.
Then by definition of numerical norm, |O/a| = N(a) so (1) is proved.
(2) Suppose a is prime to the conductor. Let β ∈ K and suppose βa ⊂ a. Then
βO = β(a + f O) = βa + βf O ⊂ a + f OK .
275
17.2. Orders Chapter 17. Global Class Field Theory
Note that since norm is multiplicative, (1) can be used to show that the set of O-ideals
prime to the conductor forms a subgroup I(O, f ) ≤ I(O). Moreover, the set
is a subgroup of I(O, f ). The next proposition describes the class group C(O) in terms of
O-ideals prime to the conductor.
Proof. A result in Section 18.2 will imply that every ideal class in C(O) contains a proper
O-ideal whose norm is prime to a fixed M ∈ Z. Thus the map I(O, f ) → C(O) is surjective
with kernel I(O, f ) ∩ P (O), so it suffices to show P (O, f ) = I(O, f ) ∩ P (O).
On one hand, P (O, f ) ⊂ I(O, f ) ∩ P (O) is clear from the definitions of these subgroups.
On the other hand, every element of I(O, f ) ∩ P (O) is a fractional ideal of the form αO =
ab−1 , where α ∈ K and a, b are O-ideals prime to f . Let m = N(b). Then mO = bb̄ ∈
P (O, f ) and mb−1 = b̄ which implies
So mαO ∈ P (O, f ). It follows that αO = (mαO)(mO)−1 ∈ P (O, f ) and hence the kernel is
equal to P (O, f ).
Given any positive integer m, an OK -ideal a is prime to m provided that a + mOK = OK .
By Lemma 17.2.11, this is equivalent to (N(a), m) = 1. This implies that for every ring of
integers OK , inside the group of fractional OK -ideals we have a subgroup IK (m) ≤ IK . In
Section 17.4 we will generalize this construction using class field theory, but for now we have
(2) If b is an O-ideal prime to f , then bOK is an OK -ideal prime to f with the same
norm.
(3) IK (f ) ∼
= I(O, f ).
Proof. (1) Let a be an OK -ideal prime to f . By the natural injection ν : O/(a∩O) ,→ OK /a,
(N(a), f ) = 1 implies (N(a ∩ O), f ) = 1 as well. This shows a ∩ O is prime to f . As in
Lemma 17.2.11, the map ϕf is an automorphism of OK /a, but f OK ⊂ O so the injection ν
is also a surjection. Hence the norms are equal.
(2) and (3) Let b be an O-ideal prime to f . Then
276
17.2. Orders Chapter 17. Global Class Field Theory
which shows that bOK is an OK -ideal prime to f . In a moment we will show the norms are
equal, but first consider
a = aO = a(a ∩ O + f O) ⊂ (a ∩ O)OK + f a,
IK (f ) ←→ I(O, f )
a 7−→ a ∩ O
bOK →−7 b.
Corollary 17.2.14. Every O-ideal prime to the conductor has a unique decomposition as a
product of prime O-ideals which are prime to the conductor.
Then C(O) ∼
= IK (f )/PK,Z (f ).
Proof. We have proven that C(O) ∼ = I(O, f )/P (O, f ). In the proof of Theorem 17.2.13
we saw that I(O, f ) ∼
= IK (f ), so it suffices to show that the image of P (O, f ) under this
isomorphism is PK,Z (f ). To do so, we will prove that for α ∈ OK ,
277
17.2. Orders Chapter 17. Global Class Field Theory
278
17.3. Frobenius Automorphisms Chapter 17. Global Class Field Theory
OL /P ⊃ OE /pE ⊃ OK /p
279
17.3. Frobenius Automorphisms Chapter 17. Global Class Field Theory
FrobL/K (·)
m
IK Gal(L/K)
σ
σ|E
IEmE Gal(E/K)
FrobE/K (·)
Proof. Let P ∈ OL and set pE = P ∩ E. Since E/K is normal, FrobE/K (pE ) is defined. To
show the diagram commutes, it suffices to prove that the restriction of FrobL/K (P) to E is
exactly FrobE/K (pE ). For any α ∈ OE , σ(α) ≡ αq mod P if and only if σ(α) ≡ αq mod pE
since pE = P ∩ E is fixed by all of G when E/K is normal. Therefore
FrobL/K (P)E = FrobE/K (pE ).
To prove this, first let p ≡ 3 (mod 4). Then p remains prime in Q(i) and the residue fields
are given by
` = Z[i]/pZ[i] = Fp2 and k = Z/pZ = Fp .
The Frobenius element for p in `/k must be x 7→ xp :
280
17.3. Frobenius Automorphisms Chapter 17. Global Class Field Theory
In particular, this implies that (p) splits completely in Q(ζn ) if and only if p ≡ 1 (mod n).
For the rest of the section, we focus on setting up the right conditions for a generalization
of the Artin map. The definition is simpler when it is a map on unramified primes of OK so
we need a way to restrict to these primes.
Definition. For a number field K, let IK be the group of fractional OK -ideals and let S be
S
a finite set of primes in OK . Then IK is defined to be the subgroup of IK generated by those
prime ideals which are not in S.
In practice we will take S to be the set of primes that ramify in an extension L/K. For
this choice of S, we define
Definition. Suppose L/K is abelian and let S = {primes p ⊂ OK | p ramifies in L} so
S
that IK is generated by the unramified primes in OK . Define the Artin map to be the
homomorphism
S
ϕL/K : IK −→ G = Gal(L/K)
Y L/K ei
a 7−→
p
pi
i
pei i .
Q
where a is a fractional ideal with prime factorization a =
Since L/K is abelian, this map is well-defined. We will later (Section 17.11) generalize the
Artin map to non-abelian extensions.
Suppose E is a finite extension of K. Then EL/E is an abelian extension whose Galois
group, say H, is a subgroup of Gal(L/K) when we restrict elements of H to L. Let IES denote
the subgroup of IE generated by primes in OE that do not lie over any prime in S. Note
that this is equivalent to saying IES is generated by the primes of OE which have norm in IK
S
.
Proposition 17.3.6. Let G = Gal(L/K) and H = Gal(EL/E). Then restricting H to L
gives us ϕEL/E = ϕL/K NE/K on IES .
281
17.3. Frobenius Automorphisms Chapter 17. Global Class Field Theory
Since the Frobenius automorphism is unique, τ f = σ on L. This proves the property for all
primes in IES and since they generate IES we’re done.
Corollary 17.3.7. Let ϕ be the Artin map in an extension L/K. Then NL/K (ILS ) ⊆ ker ϕ.
Proof. Let E = L and apply Proposition 17.3.6 to obtain ϕL/K NL/K = ϕL/L = 1.
From this we obtain a nice description of ϕ for any abelian extension K of Q.
Theorem 17.3.8. Let K/Q and let S be the set of prime ideals containing (m) for some
positive integer m. Then the Artin map ϕ : IQS → Gal(K/Q) is surjective with
n a o
ker ϕ = fractional ideals : a ≡ b (mod m) .
b
Proof. See III.3.3 of Janusz. Surjectivity of ϕ will follow from the Frobenius Density Theorem
in Section 17.6.
When L/K is not an abelian extension, a description of the Artin map becomes more
difficult. For this reason many theorems in class field theory are complicated to state. It is
our goal in the next few sections to provide a glimpse of some of the constructions required
to prove a more general description of the Artin map.
282
17.4. Ray Class Groups Chapter 17. Global Class Field Theory
This product is taken over all places of K, and the n(p) are nonnegative integers subject to
the following conditions:
(1) If p is finite then n(p) ≥ 0 and only finitely many of these are nonzero.
283
17.4. Ray Class Groups Chapter 17. Global Class Field Theory
Example 17.4.3. Let m = (2)3 (17)2 (19) · ∞, a modulus of Q. Then m0 = (2)3 (17)2 (19) so
Qm,1 consists of all x ∈ Q satisfying
x>0
x≡1 mod 23
x≡1 mod 172
x≡1 mod 19.
For example, if x = ab for a, b ∈ Z and b 6= 0 then the condition at the place 2 tells us a
and b are odd and ab−1 ≡ 1 mod 8. This looks similar to the Chinese remainder theorem
(3.2.10), but in fact we’ve seen this before in the weak approximation theorem (15.3.9).
Remark. When p is an infinite place of K, the statement |α − β|p < ε for small ε > 0 is
equivalent to αβ > 0, i.e. α ≡ β mod p. When p is a finite place, recall that |α|p = cv(α)
p
for some real number c, 0 < c < 1. Then we see that |α − β|p < ε is equivalent to
ε
=: ε0 .
α
β − 1 <
p |β|p
In turn when ε0 is small, say ε0 < cn for some n, then v αβ − 1 > 1 which means αβ − 1
is in the valuation ring for p. Recall that this is the same as saying α ≡ β mod pn . So in
general we see that |α − β|p < ε is equivalent to α ≡ β mod pn for a sufficiently large n. As
suggested in Example 17.4.3, the reformulation of the weak approximation theorem in terms
of congruences allows us to view it as a generalization of the Chinese remainder theorem.
The weak approximation theorem and this remark allow us to prove
Theorem 17.4.4. For every modulus m of K, there is an exact sequence
0 → UK /Um,1 → Km /Km,1 → CK (m) → C(OK ) → 0
and isomorphisms
284
17.4. Ray Class Groups Chapter 17. Global Class Field Theory
but then a−1 a ∈ I m and this belongs to the same ideal class as a. Hence I m → C(OK ) is
surjective. Next, if a ∈ I m maps to the trivial class in C(OK ) then a = (α) for some α ∈ Km
and this α is uniquely determined up to multiplication by a unit u ∈ UK . This implies
exactness of the rest of the sequence.
f g
Now consider the maps Km,1 → − Km → − I m . By the work above, ker g = UK and coker g =
C(OK ). By definition, coker(g ◦ f ) = CK (m) and ker(g ◦ f ) = Km,1 ∩ UK = Um,1 . Finally,
f is injective by the definitions of Km and Km,1 . Hence by the Snake Lemma, we have an
exact sequence
0 → Um,1 → UK → Km /Km,1 → CK (m) → C(OK ) → 0.
Next we prove the isomorphisms. Let p | m. If p is an infinite prime we map α ∈ Km to
the sign (+ or −) of the image of α under the embedding (·)p : K ,→ C. If p is finite, we
map α to [a][b]−1 ∈ (OK /pn(p) )× where a, b ∈ OK such that a ≡ b ≡ 1 mod m0 . Since a and
b are in particular relatively prime to p, it makes sense to define their equivalence classes
and take inverses in (OK /pn(p) )× . Consider the map we have defined:
Y Y
ϕ : Km −→ {±} × (OK /pn(p) )× .
p real p|m0
By the weak approximation theorem and the above remark, ϕ is surjective. Moreover, its
kernel is Km,1 by the way this subroup is defined. This shows the first isomorphism, and the
second is easily concluded from the Chinese remainder theorem.
Corollary 17.4.5. The ray class group CK (m) for any modulus m is a finite group of order
hK 2r0 N(m0 ) Y
1
hm = 1−
[UK : Um,1 ] N(p)
p|m0
285
17.4. Ray Class Groups Chapter 17. Global Class Field Theory
Furthermore, this expression is equal to the desired one when we factor out N(m0 ) from the
product on the right, using that N is multiplicative.
The most important implication of Corollary 17.4.5 is that every ray class group CK (m)
is finite. Let’s take a look at some examples.
Also note that N(ε) = −1 if and only if ε and ε̄ have different signs. For the first few values
of n we have
n hK ε√ N(ε)
2 1 1 + √2 −1
3 1 2 +√ 3 1
5 1 (1 + √5)/2 −1
6 1 5+2 6 1
√ √
so we see that the narrow class numbers for Q( 3) and Q( 6) are 2, whereas the others
have narrow class number 1.
Example 17.4.8. Let’s look at the important example of cyclotomic extensions. Let L =
Q(ζm ) where ζm = e2πi/m for m > 2. Define the modulus m = (m)∞ on L. We claim that
all ramified primes of L divide m. The minimal polynomial of ζm over Q is well known:
it is the mth cyclotomic polynomial Φm (x). These polynomials are constructed by setting
Φ1 (x) = x − 1 and recursively defining
xm − 1
Φm (x) = Y .
Φd (x)
d|m
d<m
The relevant property we will use is that Φm (x) is a factor of xm − 1. For a prime p,
consider xm − 1 over the finite field Fp . Since the formal derivative of xm − 1 is mxm−1 , these
polynomials are relatively prime unless m = 0 in Fp , i.e. p | m. In particular this shows that
if p - m, xm − 1 is separable mod p and so are all of its irreducible factors, namely Φm (x).
Hence by Theorem 14.5.7, p is unramified in L. This allows us to consider the Artin map
ϕL/Q : IQm → Gal(L/Q) ∼ = (Z/mZ)× .
We know from Section 17.3 that in any abelian extension L/K, the Artin map takes a
prime p ∈ IK m
to the Frobenius automorphism x 7→ xq where q = |OK /p|. In this example
K = Q and L = Q(ζm ) so OK = Z and p = (p) for a prime integer p. The isomorphism
286
17.4. Ray Class Groups Chapter 17. Global Class Field Theory
p|a r|b
−1
Y Y
= |Z/pZ|sp |Z/rZ|tr
p|a r|b
−1
Y Y
= ps p rtr = [a][b]−1 .
p|a r|b
It’s easy to see that the kernel of the Artin map is precisely PQ (m, 1) since by definition,
Moreover, the Artin map in this case is clearly surjective (this will be proven in general
in Section 17.6). This implies that the ray class group for m = (m)∞ is isomorphic to
(Z/mZ)× .
We can use Corollary 17.4.5 to get even more information out of this example. For
m = (m)∞, the above shows that |CQ (m)| = φ(m). Plugging this into the ray class formula,
we have
hK 2r0 N(m0 ) Y
1
φ(m) = 1− .
[UK : Um,1 ] N(p)
p|m0
Notice that the numerical norm on Q just evaluates to the integer itself, so we can multiply
N(m0 ) = m back into the product on the right to obtain
hK 2r0 Y n(p)−1
φ(m) = p (p − 1)
[UK : Um,1 ]
p|m
where n(p) is the exponent of p in the prime factorization of m. This product is now easily
recognized as φ(m), so we can cancel this from both sides and rearrange:
In general it is a very hard problem to compute the class number of a cyclotomic field so
we end the discussion here. The study of the cyclotomic fields is closely related to 20th
Century pursuits of a proof of Fermat’s Last Theorem. For example, in Section 13.1 unique
factorization was used to prove FLT when Q(ζm ) has class number 1 but this fails for
m as small as 23. To worsen matters, the class number of Q(ζm ) is not even known for
sure for m > 70, and even assuming the Generalized Riemann Hypothesis only allows for
computations up to m = 163.
287
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
The trivial character mod m, which takes every [n] ∈ (Z/mZ)× to 1 (and every other
integer to 0), is called the principal Dirichlet character, denoted χ0 .
Definition. For a Dirichlet character χ, the complex-valued function
∞
X χ(n)
L(s, χ) =
n=1
ns
Recall that both expressions for L(s, χ) converge when Re(s) > 1.
We can extend the idea of Riemann’s zeta function to an arbitrary algebraic number field
in the following way.
Definition. Let K be an algebraic number field and for any nonzero ideal a ⊂ OK , let N(a)
denote its numerical norm. Then the Dedekind zeta function for K is the complex-valued
function
X 1
ζK (s) = s
.
a⊂O
N(a)
K
Notice that when K = Q, the zeta function is simply the Riemann zeta function. An
even further generalization of ζK (s) is obtained by taking a modulus m of K and letting k
be a class in the ray class group CK (m), and defining
X 1
ζ(s, k) = .
a∈k
N(a)s
X
In particular when m = 1, ζK (s) = ζ(s, k).
k∈C(OK )
288
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
where χ(a) = 1 if a ∈ k and 0 otherwise, then s(x) simply counts the number of ideals of
OK with norm less than or equal to x. By Proposition 12.4.4,
s(x)
lim(s − 1)ζ(s, k) = lim .
s→1 x→∞ x
M (t)
vol(D) = lim .
t→∞ tn
s(x) M (t)
The plan is to identify x
with tn
for suitably chosen L, D and M (t). First we observe
the following.
Proof. Since CK (m) is finite, each prime not dividing m has some power in the trivial class.
If a = a1 a2−1 is an ideal in the class k, where a1 and a2 are integral ideals, then at2 is trivial
for some t > 1. Thus aat2 is an integral ideal in k = kat2 .
Now suppose a is an integral ideal in k with N(a) ≤ n for a fixed n ∈ N. Then for
any integral ideal b ∈ k −1 , ab = 0 in CK (m) so ab = (α) for some α ∈ b ∩ Km,1 with
N(α) ≤ nN(b). On the other hand, if we have such an α, then a = (α)b−1 ∈ k has norm
less than or equal to n. We summarize this in the following lemma.
Lemma 17.5.2. For any n, the value s(n) is the number of principal ideals (α) such that
α ∈ b ∩ Km,1 and N(α) ≤ nN(b). Furthermore, there is some α0 ∈ K satisfying
289
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
P
Moreover, α0 = hi βi for some hi ∈ Q. To connect ideals with lattices once again, let L be
the lattice in R of points with integer coordinates, i.e. L = Zn . Take v = (hi ) and recall
n
Lv −→ K ∗
X
(xi ) 7−→ xi βi
Lemma 17.5.3. Let wm denote the number of roots of unity in Um,1 . Then there are exactly
wm · s(n) points (x1 , . . . , xn ) ∈ Lv which satisfy
n
X
(1) α = xi βi .
i=1
(2) α ≡ 1 mod m∞ .
Proof sketch. We know there are s(n) principal ideals (α) satisfying (2) and (3) by Lemma 17.5.2.
Each ideal (α) may be generated by any α0 = uα, where u ∈ Um,1 . Out of all these elements,
exactly wm satisfy (4). Finally, the map L : UK → Rr+s restricted to Um,1 provides the
connection between these ideals and points in Lv .
Now let D be the set of all points (x1 , . . . , xn ) ∈ Rn satisfying Lemma 17.5.3 such that
each xi ≥ 0. We skip straight to the statement of the volume; see section IV.2 of Janusz to
see how it is derived.
Proposition 17.5.4. As before, let r0 be the number of real primes dividing a modulus m.
For D defined above,
2r−r0 reg(m)(2π)s
vol(D) = p
N(m0 b) |dK |
where reg(m) is the regulator for Um,1 .
Recall (Section 14.10) that reg(m) is the determinant of the matrix whose ith row is
L(ui ). Above we defined r0 to be the number of real primes dividing m∞ . We can extend
the norm to any modulus by setting N(m∞ ) = 2r0 , so that N(m) = 2r0 N(m0 ). This leads to
the main result.
290
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
Corollary 17.5.6. Let ζK (s) be the Dedekind zeta function for a number field K. Then
2r (2π)s reg(K)
lim(s − 1)ζK (s) = p hK
s→1 wK |dK |
Proof. Remember that ζK (s) coincides with the sum of all the ζ(s, k) for m = 1, i.e. k are
the distinct ideal classes in C(OK ). Taking the sum of the formula in Theorem 17.5.5 over
all k ∈ C(OK ) gives the result.
Example 17.5.7. In the case when K = Q, the Riemann zeta function has a simple pole
at s = 1 since by Corollary 17.5.6,
lim(s − 1)ζ(s) = 1.
s→1
We proved this in Section 12.1; however our work on ζK (s) gives us a much simpler proof.
What’s more, the Dedekind zeta function for any number field can be analytically continued
to the whole complex plane except for a simple pole at s = 1.
Next we extend L-series to arbitrary number fields in a similar fashion to what we did
with zeta functions. Let m be a modulus of K and let χ be any multiplicative function
χ : CK (m) → C× . We extend χ to a character on all of I m be defining χ(a) for an ideal
a ∈ I m to be the value of χ at the ideal class [a] in CK (m).
where the sum is taken over all a ∈ I m , i.e. all integral ideals relatively prime to m.
Note that since χ(a) only depends on k = [a], we may express L(s, χ) in terms of zeta
functions as we did with the Dedekind zeta function:
X
L(s, χ) = χ(k)ζ(s, k).
k∈CK (m)
291
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
Proposition 17.5.8 (Product Formula). Fix a modulus m of a number field K. For all
s ∈ C with Re(s) > 1 and for any character χ : I m → C× , L(s, χ) may be expressed as the
uniform limit of the product
Y −1
χ(p)
L(s, χ) = 1− .
N(p)s
p-m
converges absolutely. Suppose p1 , . . . , pr are all the primes in I m with norm at most n – by
Lemma 14.9.2 there are finitely many of these. Then
r −1 X
Y χ(pi ) χ(pa11 · · · par r ) X χ(a)
1− = = .
i=1
N(pi )s N(pa11 · · · par r )s m
N(a)s
a∈I
N(a)≤n
L(s, χ) converges for all Re(s) > 1 (in fact for all Re(s) > 0 as with L-series over Q) so the
remainder term on the right must tend to 0 as n → ∞. Hence for all Re(s) > 1,
Y −1
χ(p)
L(s, χ) = 1− .
N(p)s
p-m
2r (2π)s reg(m)
gm = p
N(m)wm |dK |
292
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
Recall the function log z from complex analysis (Section 11.2). One typically restricts its
π π
domain to − 2 , 2 for Re(z) > 0 – called the principal branch of the logarithm – and writes
its series expansion as
∞
z2 z3 X zn
− log(1 − z) = z + + + ... = .
2 3 n=1
n
Example 17.5.10. Suppose there are only a finite number of primes p ∈ Z. Then ζ(s) =
ζQ (s) would have to be bounded near s = 1. Recall that lim(s−1)ζ(s) = 1 by Example 17.5.7.
s→1
Then (s − 1)ζ(s) is also bounded near s = 1. This means
Definition. Let K be an algebraic number field and S a set of prime ideals in OK . If there
exists a real number δ such that
X 1
s
∼ −δ log(s − 1)
p∈S
N(p)
Example 17.5.10 shows that the set of rational primes has Dirichlet density δ = 1. In
general, establishing that a set has nonzero density is important for the following reason.
Proposition 17.5.11. For any set S whose Dirichlet density δ(S) is defined, 0 ≤ δ(S) ≤ 1,
and if δ(S) 6= 0 then S is an infinite set.
293
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
Proof. The first statement comes from the more general fact that if T ⊆ S then δ(T ) ≤ δ(S).
X 1
This in turn is a result of the fact that s
cannot be negative for s ∈ R sufficiently
p∈S
N(p)
close to s = 1. The prove the second statement, consider the contrapositive: if S is finite
then X 1
s
∼ 0.
p∈S
N(p)
Lemma 17.5.12. The set S of degree 1 primes of a number field K is an infinite set.
Proof. Since there are only a finite number of primes that ramify in K, we may assume S
excludes these. Then S consists of precisely those primes p ∈ OK whose norm N(p) is a
prime integer. Then
X 1
log ζK (s) ∼
p⊂O
N(p)s
K
where the p are all primes in OK . For p 6∈ S (again excluding ramified primes, since the
sum above is bounded at s = 1 for finite sums), N(p) = pf ≥ p2 , where p = p ∩ Z. At most
[K : Q] of these p have their norms equal to a power of the same prime. Therefore we bound
the sum by
X 1 X 1
≤ [K : .
s
Q] 2s
N(p) p
p6∈S p prime
Lemma 17.5.6 now tells us that log(s − 1)ζK (s) is bounded at s = 1, but since log(s − 1) is
clearly not bounded at s = 1, we must have
X 1
∼ log ζK (s) ∼ − log(s − 1).
p∈S
N(p)s
This shows that S is an infinite set; in fact, we have shown that δ(S) = 1. This will be
important in Section 17.6.
We will need the next theorem in the course of proving Dirichlet’s theorem on arithmetic
progressions in Section 17.6.
294
17.5. L-series and Dirichlet Density Chapter 17. Global Class Field Theory
Proof. First note that Corollary 17.4.5 ensures that the index h will be finite. Let χ be
a character defined on I m /H; we may view χ as a homomorphism I m → C whose kernel
contains H. Then by previous remarks,
X χ(p)
log L(s, χ) = + gχ (s)
N(p)s
p-m
X
for gχ (s) convergent on Re(s) > 0 and bounded at s = 1. For any p ∈ I m , the sum χ(p)
χ
taken over all characters χ of I m /H is either h if p ∈ H or 0 otherwise. Then we see that
X h X
= (log L(s, χ) − gχ (s)) + log(s − 1)L(s, χ0 ) − log(s − 1) − gχ0 (s).
p∈H
N(p)s χ6=χ
0
Each of the log L(s, χ) terms are bounded at s = 1 unless L(1, χ) = 0, in which case the
terms become negatively infinite at s = 1. However since we are assuming that s is real and
s > 1, log(s − 1) is negative near s = 1. Hence for the above expression to be positive, we
must have h1 − δ(S) ≥ 0, which impies δ(S) ≤ h1 as claimed.
Our proof implies that if δ(S) = h1 then L(1, χ) 6= 0 for any nonprincipal character χ of
I /H. In Section 17.11 we will see that the condition δ(S) = [I m1:H] holds when S is the set
m
of splitting primes and use this to prove a generalization of the Frobenius density theorem
for non-abelian extensions.
295
17.6. The Frobenius Density Theorem Chapter 17. Global Class Field Theory
Definition. Let σ ∈ G be an element of order n. The division of σ is the set of all elements
of G which are conjugate to some σ m where m ∈ Z is relatively prime to n. Equivalently, the
division of σ is the union of conjugacy classes of all generators of the cyclic subgroup hσi.
Lemma 17.6.1. Let σ ∈ G, H = hσi and t the number of elements in the division of σ.
Then t = φ(n)[G : NG (H)] where φ is Euler’s function and NG (H) denotes the normalizer
of H.
Proof. For all m relatively prime to n = |σ|, ZG (σ m ) = ZG (σ), where ZG denotes the
conjugacy class of an element. Thus as m ranges over the integers relatively prime to n,
we count φ(n)[G : ZG (σ)] conjugates. However, some of these need not be distinct. An
element is counted q times if it is conjugate to q distinct powers of q. Equivalently, q counts
the number of conjugates of σ m which are also powers of σ, i.e. q is the number of distinct
automorphisms of H induced under the conjugation action of G. Thus q = [NG (H) : ZG (σ)].
Putting this together,
φ(n)[G : ZG (σ)]
t= = φ(n)[G : NG (H)].
[NG (H) : ZG (H)]
Theorem 17.6.2 (Frobenius Density). Let σ ∈ G = Gal(L/K), let t denote the number of
elements in the division of σ and let S be the set of primes p ⊂ OK such that there is some
prime P ⊂ OL whose Frobenius automorphism FrobL/K (P) is in the division of σ. Then
t
δ(S) = .
|G|
Proof. We induct on n = |hσi|. For the base case, n = 1 means σ is the identity and S is the
set of primes of K which split completely in L. Let S ∗ denote the set of primes of p ⊂ OL
dividing some prime in S. For each p ∈ S, there are exactly |G| = [L : K] primes in S ∗
dividing p, each of which has norm equal to p. Then
X 1 X 1 X 1
s
= s
= |G| .
P∈S ∗
NL/Q (P) P∈S ∗
NK/Q (NL/K (P)) p∈S
NK/Q (p)s
296
17.6. The Frobenius Density Theorem Chapter 17. Global Class Field Theory
Let T be the set of degree 1 primes of L (those having inertial degree f = 1 over Q). Recall
that in the proof Lemma 17.5.12 we showed that δ(T ) = 1. By properties of Dirichlet density,
T ⊆ S ∗ implies that δ(S ∗ ) ≥ δ(T ) = 1, so δ(S ∗ ) = 1. This combines with the above work to
give us
X 1 1
s
∼ (− log(s − 1))
p∈S
N (p) |G|
1
and hence δ(S) = |G| , proving the base case.
Now assume that n = |hσi| > 1. Let H = hσi and E = LH , the subfield of L fixed by
H. The primes p ⊂ OK which have at least one degree 1 prime factor in OE are exactly
those divisible by a prime P ⊂ OL such that FrobL/K (P) is conjugate to some power of σ.
In other words p ∈ Sd for some d | n.
For each d | n, let td denote the size of the division of σ d . Let Sd denote the set of
OK -primes containing an OL -prime whose Frobenius automorphism lies in the division of
td
σ d . By induction, we have δ(Sd ) = |G| when d 6= 1.
Let SE denote the primes of E having inertial degree 1 over K. For each p ∈ Sd let n(p)
denote the number of primes in SE dividing p. Then each p ∈ Sd is the norm of exactly n(p)
distinct primes in SE . As in the base case, SE contains all the degree 1 primes of E (over
Q), so δ(SE ) = 1. Therefore
X 1 X X n(p)
− log(s − 1) ∼ s
= s
.
P∈S
NK/Q (NE/K (P)) p∈S
N(p)
E d|n d
Note that for any p ∈ Sd , n(p) is exactly the number of distinct cosets Hτi such that
Hτi σ d = Hτi . This coset equivalence occurs if and only if τi σ d τi−1 ∈ H, but since H is
cyclic, this can only happen if τi ∈ NG (hσ d i). Thus n(p) = [NG (hσ d i) : H] and using the
inductive hypothesis, we write
X 1 X [NG (hσ d i) : H]td
[NG (H) : H] ∼ −1 + log(s − 1).
p∈S
N(p)s |G|
d|n
d6=1
297
17.6. The Frobenius Density Theorem Chapter 17. Global Class Field Theory
φ(n)
so the whole coefficient is −1 − n
+ 1
n
· n = − φ(n)
n
. Finally, this implies
X 1 φ(n) t
s
∼− log(s − 1) = − log(s − 1)
p∈S
N(p) [NG (H) : H] n |G|
t
using Lemma 17.6.1 again. Hence δ(S) = |G|
.
Now we can prove an important property of the Artin map that we have thus far neglected.
Corollary 17.6.3. Let L/K be an abelian extension of number fields and suppose S is a
finite set of primes of K that contains all the primes that ramify in L. Then the Artin map
S
ϕL/K : IK −→ Gal(L/K)
is surjective.
Proof. Let G = Gal(L/K) and take σ ∈ G. Since G is abelian, the division of σ is precisely
the set of generators of the cyclic group hσi. By the Frobenius density theorem, there exist
infinitely many primes P ⊂ OL such that FrobL/K (P) generates hσi and so one can certainly
be found outside the finite set S. Recall that when L/K is abelian, ϕL/K is well-defined on
the ideals of OK . Thus we can find p ⊂ OK such that ϕL/K (p) = σ 0 , a generator of hσi.
Since σ ∈ G was arbitrary, ϕL/K is onto.
Corollary 17.6.4. Let L1 and L2 be Galois extensions of a number field K and let S1 and
S2 be the sets of primes of K which split completely in L1 and L2 , respectively. Then S1 ⊆ S2
if and only if L2 ⊆ L1 .
Another important result we can prove now that we have the Frobenius density the-
orem is known as the first fundamental inequality of class field theory. Recall the map
i : K ∗ → IK that takes α 7→ (α). In Section 17.4 we denoted the image of Km,1 under this
map by PK (m, 1); it is also common in the literature to write i(Km,1 ) so we will use them
interchangeably.
Theorem 17.6.5 (First Inequality). Let L/K be a Galois extension of number fields, let m
be a modulus of K and let ILm denote the subgroup of IL generated by all primes P ⊂ OL for
m
which P ∩ K lies in IK . Then
m
[IK : NL/K (ILm )i(Km,1 )] ≤ [L : K].
Proof. With finitely many exceptions, the primes that split completely in L lie in NL/K (ILm ).
By the Frobenius density theorem, the density of the set of these primes is
1 1
=
|G| [L : K]
since it is the set of primes p such that FrobL/K (pOL ) = 1 ∈ G. Then by properties of
Frobenius density,
1 1
≤ m
[L : K] [IK : NL/K (ILm )i(Km,1 )]
which implies the first fundamental inequality.
298
17.6. The Frobenius Density Theorem Chapter 17. Global Class Field Theory
Under certain conditions the reverse inequality holds. This is called, as one might expect,
the second fundamental inequality of class field theory and will be discussed in the next
section.
We conclude the section with a proof of Dirichlet’s famous theorem on the infinitude of
primes in arithmetic progression. We first use the Frobenius density theorem to prove a nice
fact that is often hard to come by: the cyclotomic polynomials are irreducible.
Proposition 17.6.6. Let ζm denote a primitive mth root of unity. Then [Q(ζm ) : Q] = φ(m).
Proof. For m ∈ Z+ , let m = (m)∞ which is a modulus of Q. Set H = i(Qm,1 ) ≤ IQm . Then
by Example 17.4.8, the set of primes in Q that split completely in K = Q(ζm ) is precisely
1
the primes in H. The Frobenius density theorem says that the density of this set is [K:Q] .
Therefore by properties of Dirichlet density, this is at most
1 1
=
[IQm : H] φ(m)
which implies [K : Q] ≥ φ(m). On the other hand, the minimal polynomial of ζm over
Q, which is by definition the mth cyclotomic polynomial, has degree ≤ φ(m) since |G| =
|(Z/mZ)× | = φ(m). Hence we conclude that [K : Q] = φ(m).
Corollary 17.6.7. For any nonprincipal character χ of the ray class group CQ (m), where
m = (m)∞ as above, L(1, χ) 6= 0.
Proof. Apply Theorem 17.5.13 and Proposition 17.6.6 to see that
X
(log L(s, χ) − gχ (s)) + log(s − 1)L(s, χ0 ) − gχ0 (s) − g(s) > 0
χ6=χ0
since the log(s−1) term from the proof of Theorem 17.5.13 vanishes. The terms in the expres-
sion above are either all bounded at s = 1, or become negatively infinite when L(1, χ) = 0.
Since the expression must be positive, L(1, χ) must be nonzero.
The next result is the main step towards proving Dirichlet’s theorem. It is an interesting
result in its own right, since it unites the theories of L-series, Dirichlet density and ray class
groups we have studied so far.
Theorem 17.6.8. Let k0 be any ray class in CQ (m), where m = (m)∞. The set of primes
1
in k0 has density φ(m) .
Proof. For any character χ of CQ (m) we have
X χ(p) X X 1
log(s, χ) ∼ = χ(k) .
p prime
ps p∈k
ps
k∈CQ (m)
299
17.6. The Frobenius Density Theorem Chapter 17. Global Class Field Theory
where the sum is over all characters χ of CQ (m). Moreover, Corollary 17.6.7 implies that
the sum over nonprincipal characters is bounded at s = 1 since L(1, χ) 6= 0 for χ 6= χ0 .
Therefore X 1
log L(s, χ0 ) ∼ φ(m) s
.
p∈k
p
0
Recall from Section 12.4 that L(s, χ0 ) differs from the Riemann zeta function ζ(s) only by
finitely many terms, so log L(s, χ0 ) ∼ log ζ(s) ∼ − log(s − 1). Finally this shows that
X 1 1
s
∼− log(s − 1).
p∈k
p φ(m)
0
By definition this means the Dirichlet density of the set of primes in any k0 in the ray class
1
group CQ (m) is φ(m) .
Now we are prepared to state and prove the famous result.
Theorem 17.6.9 (Dirichlet). For each positive integer m and each integer a relatively prime
to m, there are infinitely many primes p = mb + a.
Proof. To access our work with the Dirichlet density, we turn the problem into one involving
ray classes. Suppose p is a prime in the arithmetic progression mb + a, where b ∈ Z. Then
mb + a ≡ a (mod m) implies mb+a a
∈ Qm,1 , where m = (m)∞ as before. This means p lies
in the coset aQm,1 . On the other hand, if p ∈ aQm,1 then p = ax y
with x ≡ y (mod m). It
follows that x ≡ mq + y and so p = mb + a for some b. Hence the primes congruent to a
mod m generate a prime ideal in a fixed coset of i(Qm,1 ), which is a ray class in the ray class
1
group CQ (m). By Theorem 17.6.8, the density of such primes is φ(m) so in particular there
are infinitely many of these primes.
Remarkably, Dirichlet proved his theorem several years before Frobenius had a proof of
the density theorem. We discuss the history of these theorems at greater length in Sec-
tion 17.11 and relate everything to Čebotarev’s density theorem.
300
17.6. The Frobenius Density Theorem Chapter 17. Global Class Field Theory
301
17.7. The Second Fundamental Inequality Chapter 17. Global Class Field Theory
In his formulation of the main theorems of class field theory, Takagi proved the general
form of the fundamental equality. Since our approach to the Artin reciprocity theorem in
Section 17.8 requires and later generalizes the cyclic case, it will suffice the prove the second
fundamental inequality for cyclic extensions L/K.
Let L/K be a Galois extension with cyclic Galois group G = hσi. Suppose m is a modulus
of K divisible by all primes that ramify in L. We first compute some cohomology groups,
for which we recall the following results (these hold for any cyclic group G).
Definition. For a left G-module A, we define the nth group cohomology of A by
H 0 (G; A) H 0 (G; B)
H 1 (G; C) H 0 (G; C)
H 1 (G; B) H 1 (G; A)
Proof. The exact hexagon is just the long exact sequence in cohomology when G is cyclic
and the cohomologies are 2-periodic after the 0th homological degree.
Definition. Let A be a G-module. The Herbrand quotient of A is
|H 1 (A)|
q(A) =
|H 0 (A)|
which is defined whenever the cohomology groups of A are finite.
302
17.7. The Second Fundamental Inequality Chapter 17. Global Class Field Theory
Corollary 17.7.4. If A ⊂ B are G-modules and C = B/A is a finite quotient, then q(A) =
q(B) whenever either of these are defined.
Theorem 17.7.5 (Hilbert’s Theorem 90). If G = Gal(L/K) is the Galois group for L/K, a
finite, Galois extension of number fields then H 1 (G; L∗ ) = 1 where L∗ denotes the invertible
elements of L.
(i) H 0 (ILm ) = IK
m
/N (ILm ).
(ii) H 1 (ILm ) = 1.
(iv) H 1 (L∗ ) = 1.
Proof. (i) Let a = Pai i be a fractional ideal in ILm which is fixed by σ, i.e. a ∈ ker(σ − 1).
Q
Since σ(a) = a, the distinct conjugates σ j (Pi ) of the primes over a appear with the same
exponent. If we denote p = Pi ∩ K, then
g−1
Y
pOL = σ j (Pi )
j=0
where g is the smallest positive integer such that σ j (Pi ) = Pi . This demonstrates that the
Pi contribute precisely the factor pai to the decomposition of a, and since Pi was arbitrary,
m m
we conclude that a ∈ IK . Therefore IK is the subgroup of ILm fixed by G, so
H 0 (ILm ) = (ILm )G = IK
m
/N (ILm ).
303
17.7. The Second Fundamental Inequality Chapter 17. Global Class Field Theory
a −c
Then we have (σ − 1)B = Pa00 Pa11 · · · Pg−2
g−2
Pg−1g−2 . Let pf = N (P0 ). Since N (a) = 1, we
see that
g−1
!
Y
N Pai i = pf (a0 +...+ag−1 ) = 1.
i=0
Since f ≥ 1, this shows that a0 + . . . + ag−1 = 0, i.e. −cg−2 = ag−1 . Thus (σ − 1)B is
precisely the part of a contributed by the Pi . Since Pi was arbitrary, a ∈ im(σ − 1) so
ker N = im(σ − 1). By definition, this proves H 1 (ILm ) = 1.
(iii) comes from the fact that ker(σ − 1) L∗ = K ∗ .
(iv) is just Hilbert’s Theorem 90 (Theorem 17.7.5).
Definition. For a modulus m of K divisible by the primes ramifying in L, we define a
G-module homomorphism jm : IL → ILm by
(
P if P - m
jm (P) =
1 if P | m.
1 → i(LS ) → ker jm → C → 1
This lemma shows that computing q(LS ) comes down to finding q(UL ) and q(ker jm ).
One can obtain the following results using local class field theory (see Janusz) or ideles (see
Milne).
304
17.7. The Second Fundamental Inequality Chapter 17. Global Class Field Theory
Theorem 17.7.8. Let r0 be the number of infinite primes ramifying in the extension L/K.
[L : K]
Then q(UL ) = .
2r0
Theorem 17.7.9. Let jm : IL → ILm be the homomorphism defined above for a modulus m
of K containing every prime that ramifies in L. Then
1
q(ker jm ) = Q
p|m0 e p fp
where the product is over all primes p dividing m0 the finite part of m, and ep and fp denote
respectively the ramification index and inertial degree of p.
Corollary 17.7.10. Let S be the set of primes which divide m, a modulus of K containing
all ramified primes of the extension L/K. Then the Herbrand quotient of LS is
[L : K]
q(LS ) = Q .
p|m ep fp
for some group V . Looking closer, this sequence contains two short exact sequences:
γ α
1 → LS → → fm (L∗ ) → 1
− L∗ − (17.1)
β
and 1 → fm (L∗ ) →
− ILm → V → 1. (17.2)
It is from these two sequences (and their cohomologies) that we derive the ingredients for
the second fundamental inequality. Define
P = {α ∈ K ∗ | fm (α) ∈ N (ILm )}
and Q = {α ∈ K ∗ | jm (α) ∈ N (ILm )i(Km,1 )}.
Consider the following commutative diagram, which is constructed using the sequences (16.1)
and (16.2) above.
305
17.7. The Second Fundamental Inequality Chapter 17. Global Class Field Theory
P K∗ f0 IKm p
coker f0 1
N (L∗ ) N (L∗ ) N (ILm )
Q K∗ g IKm p0
coker g 1
N (L∗ )Km,1 N (L∗ )Km,1 N (ILm )i(Km,1 )
1 1 1
Set n(m) = [Km ∩ i−1 (N (ILm )) : Km,1 ∩ N (L∗ )]. A standard diagram chase (cf section V.4 in
Janusz) shows that coker f0 ∼= coker g and | ker f0 | = | ker g| · n(m). Note that
P Q
ker f0 = and ker g = .
N (L∗ ) N (L∗ )Km,1
Next we relate ker f0 and coker f0 to q(LS ). Recall from Proposition 17.7.6 that H 1 (L∗ ) and
H 1 (ILm ) are trivial. Then the exact sequences (1) and (2) from above give us exact hexagons
(see Lemma 17.7.2) which may be laid flat:
δ1 γ0 α0 δ2
1 H 1 (fm (L∗ )) H 0 (LS ) H 0 (L∗ ) H 0 (fm (L∗ )) H 1 (LS ) 1
f0
δ3 β0 γ0 δ4
1 H 1 (V ) H 0 (fm (L∗ )) H 0 (ILm ) H 0 (V ) H 1 (fm (L∗ )) 1
The dashed arrow is the identity map on H 0 (fm (L∗ )), and correspondingly the vertical arrow
is f0 = β0 α0 . Then
| coker f0 | = [H 0 (ILm ) : im β0 α0 ] = [H 0 (ILm ) : im β0 ] [im β0 : im β0 α0 ]
[H 0 (fm (L∗ )) : im α0 ]
= [H 0 (ILm ) : im β0 ] by isomorphism theorems
[ker β0 : ker β0 ∩ im α0 ]
| coker α0 |
= | coker β0 |
[ker β0 : ker β0 ∩ im α0 ]
| im δ2 |
= | im γ0 | by exactness
[ker β0 : ker β0 ∩ im α0 ]
|H 1 (LS )|
= | im γ0 | .
[ker β0 : ker β0 ∩ im α0 ]
306
17.7. The Second Fundamental Inequality Chapter 17. Global Class Field Theory
Also note that |H 0 (V )| = | im γ0 | |H 1 (fm (L∗ ))| by the second exact hexagon, so
|H 0 (V )| |H 1 (LS )|
| coker f0 | = .
|H 1 (fm (L∗ ))| [ker β0 : ker β0 ∩ im α0 ]
| ker f0 | = | ker β0 α0 |
= | ker β0 ∩ im α0 | | ker α0 |
= | ker β0 ∩ im α0 | | im γ0 |
|H 0 (LS )|
= | ker β0 ∩ im α0 | .
|H 1 (fm (LS ))|
| coker f0 |
Lemma 17.7.12. q(LS ) = .
| ker f0 |
Proof. By the computations above,
Now, notice that since V is a quotient of the class group of L, which by Corollary 17.4.5
is finite, V is also finite. Then applying Corollary 17.7.4 shows that q(V ) = 1. The result
follows.
We now focus on the bottom row of the big commutative diagram from above,
K∗ g IKm
p0
1 −→ ker g −→ −
− − → −−−→ coker g −→ 1.
N (L∗ )Km,1 N (ILm )i(Km,1 )
Using this and Theorem 17.7.11, we know that when m is divisible by sufficiently high powers
of the ramified primes in L/K,
| im g| | coker g|
hm (L/K) = = a(m) .
| coker g| | ker g|
| coker f0 |
hm (L/K) = a(m)n(m) = a(m)n(m)q(LS ).
| ker f0 |
We are now ready to prove the second inequality for cyclic extensions.
307
17.7. The Second Fundamental Inequality Chapter 17. Global Class Field Theory
Theorem 17.7.13 (Second Inequality for Cyclic Extensions). For L/K a cyclic extension
of number fields and m a modulus of K divisible by sufficiently high powers of the ramified
primes of the extension,
m
hm (L/K) = [IK : N (ILm )i(Km,1 )] ≥ [L : K].
Proof. By the work directly preceding the theorem, hm (L/K) = a(m)n(m)q(LS ). The hy-
potheses allow us to apply Corollary 17.7.10 and Theorem 17.7.11, which say
[L : K] Y
q(LS ) = Q and a(m) = e p fp .
p|m ep fp p|m
hm (L/K) = n(m)[L : K]
Corollary 17.7.14 (Fundamental Equality for Cyclic Extensions). Let L/K be a Galois
extension of number fields such that Gal(L/K) is cyclic. If m is a modulus of K that is
divisible by sufficiently high powers of every prime ramifying in L, then
m
[IK : N (ILm )i(Km,1 )] = [L : K].
308
17.8. The Artin Reciprocity Theorem Chapter 17. Global Class Field Theory
PK (f OK , 1) ≤ PK,Z (f ) ≤ IK (f )
It turns out that the generalized ideal class groups are exactly the Galois groups of all
abelian extensions of K. This correspondence is encoded in the Artin map
m
ϕL/K : IK −→ Gal(L/K)
where m is chosen so that it is divisible by every ramified prime of K. We have seen (courtesy
of Corollary 17.6.3) that the Artin map is surjective onto Gal(L/K), so ker ϕL/K has index
m
[L : K] in IK .
The main result in this section is one of central importance in class field theory:
Theorem (Artin Reciprocity). Let L/K be an abelian extension of number fields with G =
Gal(L/K). If m is a modulus divisible by sufficiently high powers of every prime in K that
ramifies in L, then the Artin map
m
ϕL/K : IK −→ G
is surjective and ker ϕL/K = NL/K (ILm )i(Km,1 ). In particular, G is a generalized ideal class
group for m.
Definition. Let L/K be an abelian extension of number fields and take m a modulus of K.
We say the reciprocity law holds for the triple (L, K, m) provided i(Km,1 ) ⊆ ker ϕL/K .
309
17.8. The Artin Reciprocity Theorem Chapter 17. Global Class Field Theory
The reciprocity law is important to the proof of Artin reciprocity for the following reason.
Lemma 17.8.3. If m is divisible by all primes ramifying in L and the reciprocity law holds
for (L, K, m) then ker ϕL/K = NL/K (ILm )i(Km,1 ).
Proof. By Corollary 17.3.7 we know NL/K (ILm ) ⊆ ker ϕL/K and so NL/K (ILm )i(Km,1 ) ⊆
ker ϕL/K as long as the reciprocity law holds. The first fundamental inequality says that
m
[IK : NL/K (ILm )i(Km,1 )] ≤ [L : K],
m
but since [IK : ker ϕL/K ] = | Gal(L/K)| = [L : K] by surjectivity, we must have
Example 17.8.4. We have previously shown (Example 17.4.8) that for a primitive mth root
of unity ζm and the modulus m = (m)∞, the reciprocity law holds for (Q(ζm ), Q, m) – in
fact we proved that i(Qm,1 ) = ker ϕQ(ζm )/Q .
Remark. By properties of the Artin map (Section 17.3), one can easily prove that
If the reciprocity law holds for (L, K, m) and E is any finite extension of K, then the
reciprocity law holds for (LE, E, m).
If the reciprocity law holds for (L, K, m), then it holds for (L, K, mn) where n is any
modulus of K.
Combining these with the previous example, we see that for any primitive mth root of
unity ζm and any modulus m of K divisible by (m)∞, reciprocity holds for (K(ζm ), K, m).
It is clear that creating certain cyclotomic extensions of number fields is critical to pre-
serving the reciprocity law. This connection runs deep throughout this section, culminating
in the Kronecker-Weber Theorem at the end.
Let L/K be an abelian extension of number fields.
(i) L ∩ E = K.
(iii) There is some element σ ∈ Gal(E/K) whose order is divisible by n that satisfies
hσi ∩ hϕE/K (p)i = {1}.
310
17.8. The Artin Reciprocity Theorem Chapter 17. Global Class Field Theory
Proof. (i) We apply Lemma 6.1.7 to a = N(p). Since L only has finitely many subfields,
there is some M such that Q(e2πi/M ) contains every cyclotomic subfield of L. Lemma 6.1.7
allows us to select m with no prime divisors less than M · s. Then Q(e2πi/M ) ∩ Q(ζm ) = Q
and L ∩ Q(ζm ) = Q. Taking E = K(ζm ) it follows that L ∩ E = K.
(ii) Let τ = ϕE/K (p) ∈ Gal(E/K). By definition ϕE/K (p) is a Frobenius automorphism
N(p) a
satisfying τ (ζm ) = ζm = ζm . Thus τ has order divisible by n.
(iii) Finally, choose b ∈ Z according to Lemma 6.1.7 and define σ ∈ Gal(E/K) on the
b
primitive element of E/K by σ(ζm ) = ζm . Then σ has order divisible by n. Since (a, b) = 1,
it is clear that hσi ∩ hτ i = {1} as desired.
Lemma 17.8.6 (Artin). Let L/K be a cyclic extension and p ⊂ OK a prime that is unram-
ified in L. Then there exists an mth root of unity ζm and an extension F/K such that
(1) L ∩ F = K.
(2) L ∩ K(ζm ) = K.
311
17.8. The Artin Reciprocity Theorem Chapter 17. Global Class Field Theory
assumed to contain all the ramified primes. For each pi we may use Artin’s Lemma to select
a root of unity ζmi such that (mi , mj ) = 1 for all i 6= j, i, j = 1, . . . , r. By Proposition 17.8.5,
we can also force K ∩ Q(ζmi ) = Q for each i. Define Gi := Gal(K(ζmi )/K). Then Gi ∼ =
Gal(Q(ζmi )/Q) and the automorphism group of L(ζm1 , . . . , ζmr )/K is G × G1 × · · · × Gr .
Suppose G = hσi. For each i let τi be the element in Gi chosen via (iii) of Proposi-
tion 17.8.5. Let Hi be the subgroup of G × Gi generated by the elements
(σ, τi ) (ϕL/K (pi ), ϕK(ζmi )/K (pi )).
and
Y
Furthermore, let Fi be the fixed field of Hi × Gj and set F = F1 · · · Fr . We take a moment
j6=i
to verify that L ∩ F = K and Gal(L/K) = Gal(LF/F ). Note that the intersection of all the
Gal(LF/Fi ) fixes F and contains (σ, τ1 , . . . , τr ). The field L ∩ F is also fixed by this element
and by (1, τ1 , . . . , τr ) so L ∩ F is fixed by σ and therefore L ∩ F = K.
Now let ϕL/K (pai i ) = σ di where di ≥ 0. Then 1 = ϕL/K (a) = σ d where d = d1 + . . . + dr
and [L : K] | d. For a sufficiently large modulus m0 , the Artin map
0
ϕLF/F : IFm −→ Gal(LF/F )
is surjective so there is an ideal b0 relatively prime to m and all the mi such that ϕLF/F (b0 ) =
m
σ. Let b = NF/K (b0 ) ∈ IK . By properties of the Artin map in extensions (Proposi-
tion 17.3.6), we see that ϕL/K (b) = σ. For each i, pi splits completely so there exists
an ideal ci relatively prime to m and each mj such that NFi /K (ci ) = pai i b−di . By our choice
of di ,
ϕLFi /Fi (ci ) = ϕL/K (NFi /K (ci )) = 1.
By properties of the reciprocity law, Fi ⊂ LFi ⊂ Fi (ζmi ) and so the reciprocity law holds for
(LFi , Fi , m0 ) as long as m0 is divisible by (mi )∞.
0
We chose ci prime to the mi so we may select m0 so that ci ∈ IFmi . Then there exist
m0
γi ∈ Fi , γi ≡ 1 mod m0 and an ideal di ∈ ILF i
such that ci = (γi )NLFi /Fi (di ). Taking
K-norms yields
pai i b−di = (NFi /K (γi ))NLFi /K (di ).
Selecting m0 so that m | m0 ensures that αi := NFi /K (γi ) lies in Km,1 . Now taking products
of the above pieces over all i gives us
r
Y r
Y r
Y
−d
ab = pai i b−di = αi NLFi /K (di ).
i=1 i=1 i=1
Write d0i = NLFi /L (di ). Then a = bd (α1 · · · αr )NL/K (d01 · · · d0r ). Above we saw that [L : K]
divides d, so bd is a norm on L/K. Hence we have shown that a ∈ NL/K (ILm )i(Km,1 ) and the
theorem is proved.
A small bit of work remains to prove the main result, which we restate here.
Theorem 17.8.8 (Artin Reciprocity). Let L/K be an abelian extension with G = Gal(L/K).
Suppose m is a modulus of K divisible by all primes in K which ramify in L and assume
their exponents are sufficiently large. Then the Artin map
m
ϕL/K : IK −→ G
312
17.8. The Artin Reciprocity Theorem Chapter 17. Global Class Field Theory
313
17.8. The Artin Reciprocity Theorem Chapter 17. Global Class Field Theory
This completes our discussion of Artin reciprocity and the Kronecker-Weber Theorem for
now, although these concepts continue to crop up in future discussions as they are integral
to class field theory as a whole.
314
17.9. The Conductor Theorem Chapter 17. Global Class Field Theory
where ϕL/K is the Artin map for m. There is a smallest integer f (p) ≤ m(p) such that this
sequence factors through (OK /pf (p) )× .
Proposition 17.9.1. If the reciprocity law holds for (L, K, m) then f(L/K) | m.
Proof. Obvious.
So far we do not know if the reciprocity law holds for f(L/K); of particular concern is
that some ramified primes might not divide the conductor. The Conductor Theorem states
that this does not happen.
Theorem 17.9.2 (Conductor Theorem). Let L/K be abelian with conductor f = f(L/K).
Then a prime of K (finite or infinite) ramifies in L if and only if it divides f. Moreover, a
modulus m is divisible by f if and only if ker ϕL/K is a congruence subgroup for m.
The proof of the conductor theorem is rather interesting, as it makes extensive use of
the local Artin map and thus establishes one of the powerful local-global connections in class
field theory. For details, consult sections V.11–12 of Janusz.
Proposition 17.9.3. Let L = Q(ζm ) where ζm is a primitive mth root of unity. The
conductor of L/Q is determined by
1
m≤2
f(L/Q) = (n)∞ m = 2n where n > 1 is odd
(m)∞ otherwise.
Proof. The conductor theorem says that f(L/Q) is the modulus of L divisible by exactly
those primes, finite and infinite, which ramify in L. Every modulus of L/Q is of the form
(n)∞ for some integer n, so write f = (n)∞. When m = 1, 2 the conductor is clearly 1 since
Q(ζm ) = Q in both cases. When m > 2, Example 17.4.8 tells us that all ramified primes
divide the modulus m = (m)∞, so by definition the conductor divides (n)∞, that is, n | m.
315
17.9. The Conductor Theorem Chapter 17. Global Class Field Theory
316
17.10. The Existence and Classification Theorems Chapter 17. Global Class Field Theory
The goal of class field theory is then to classify all abelian extensions by their class groups.
We will prove
Theorem. Let m be a modulus of K and let H be a congruence subgroup for m. Then there
exists an abelian extension L ⊃ K, all of whose ramified primes divide m, such that H is the
m
kernel of the Artin map ϕL/K : IK −→ Gal(L/K), that is, L is a class field of H.
Constructing a class field for H is hard to do directly, so the usual approach in class field
theory texts is to construct enough extensions to force the existence of L.
Lemma 17.10.1. Let m be divisible by all primes of K ramifying in L and suppose there is
a chain of subgroups
i(Km,1 ) ≤ H0 ≤ H1 ≤ I m
such that H0 is a congruence subgroup for an abelian extension L/K. Then H1 is a congru-
ence subgroup for the subfield of L fixed by the subgroup ϕL/K (H1 ) ≤ Gal(L/K).
Proof. Let G1 = ϕL/K (H1 ) and let E be the subfield of L fixed by G1 . Let r : Gal(L/K) →
Gal(E/K) be the natural restriction, so that r(G1 ) = 1. For any a ∈ I m , ϕE/K (a) =
(r ◦ ϕL/K )(a) so in particular ϕE/K (a) = 1 when a ∈ H1 . Thus H1 ⊂ ker ϕE/K .
On the other hand, since H1 is a congruence subgroup the reciprocity law holds for
(E, K, m) and so
[I m : ker ϕE/K ] = [Gal(L/K) : G1 ] = [I m : H1 ].
This proves H1 = ker ϕE/K and the Artin reciprocity theorem (19.2.2) implies the rest.
Lemma 17.10.2. Let H be a congruence subgroup of K for the modulus m. To show there
exists a class field L of H, it suffices to prove this when K contains a primitive nth root of
unity, where n = [I m : H].
where each subextension K (i+1) /K (i) is cyclic. Now apply Lemma 17.10.1 and Proposition
V.7.2 from Janusz.
This allows us to assume K contains the nth roots of unity. Let S1 be a finite set of
primes of K and let Y
m1 = pm1 (p)
p∈S1
for sufficiently high powers m1 (p). Define S2 and m2 in the same way and suppose S1 ∩S2 = ∅
and that S1 ∪ S2 contains all primes p satisfying
317
17.10. The Existence and Classification Theorems Chapter 17. Global Class Field Theory
(i) p | n;
(ii) p | ∞;
(iii) and p | ai where {ai } is a finite set of OK -ideals whose images cover C(OK ).
Then any ideal a can be expressed as a = ai (α) for some α ∈ K and ai only divisible by
primes in S := S1 ∪ S2 . Define the congruence subgroups
where I(Sj ) denotes the group generated by finite primes in Sj . (These are congruence
subgroups since S1 ∩ S2 = ∅ implies H1 ⊆ I m1 and H2 ⊆ I m2 .) Next we define two subgroups
of K ∗ :
W1 = K S K n ∩ Km2 ,1
and W2 = K S K n ∩ Km1 ,1 .
√ √
We claim that L1 = K( n W1 ) and L2 = K( n W2 ) are the respective class fields over K for
H1 and H2 . This is proven in detail in section V.9 of Janusz. We will end the discussion
here, since our goal is to explore the consequences of the existence theorem. In any case, the
construction of such a class field L1 for H1 allows us to prove
318
17.10. The Existence and Classification Theorems Chapter 17. Global Class Field Theory
Proof. The existence theorem shows that every congruence subgroup corresponds to an
abelian extension. Conversely, let L and M be abelian extensions of K. Consider the
f(L/K) f(M/K)
Artin maps ϕL/K : IK → Gal(L/K) and ϕM/K : IK → Gal(M/K), where f de-
notes the conductor of each extension. By the conductor theorem (17.9.2), ker ϕL/K and
ker ϕM/K are both congruence subgroups for K and by Lemma 17.10.4 it suffices to prove
the correspondence for these congruence subgroups. On one hand, Corollary 17.8.9 shows
that if ker ϕL/K ⊆ ker ϕM/K then M ⊂ L. On the other hand, M ⊂ L implies that
ker ϕL/K ⊂ ker ϕM/K and so the correspondence is indeed one-to-one.
At this point we return to the defining property of the Hilbert class field which we have
so far neglected to justify. Take the modulus m = 1 on K and the congruence subgroup
m
PK = PK (m, 1) ≤ IK = IK . By the existence theorem, there is a unique abelian extension
L/K such that the Artin map induces the isomorphism
C(OK ) = IK /PK ∼
= Gal(L/K).
Theorem 17.10.6. For a number field K, the Hilbert class field L/K is the maximal un-
ramified abelian extension of K.
Theorem 17.10.7 (Principal Ideal Theorem). If L is the Hilbert class field of K, then every
ideal a ⊂ OK becomes principal in OL .
319
17.11. The Čebotarev Density Theorem Chapter 17. Global Class Field Theory
(These factorizations are easy to produce with MAGMA.) It turns out that f factors into the
different decompositions (partitions of n = 4) with the following approximate frequencies:
For example, the prime 7 falls into the set C1,3 = {p prime | f = gh3 (mod p)}, while
47 ∈ C4 and 101 ∈ C2,2 . Correspondingly, Frobenius’ theorem says that the number of
automorphisms σ ∈ G = Gal(K/Q) with cycle type 4 is |G|4
; likewise, the number of σ with
|G| |G|
cycle type 1,3 is 3 ; the number with cycle type 2,2 is 8 ; and so forth. In every case,
the identity automorphism is the only element of G with cycle type 1,1,1,1, which tells us
that |G| = 24 and we can go back and compute the number of elements of each cycle type
accordingly.
So far we have seen that for a field K/Q, classes of primes are in a certain correspondence
with the various cycle types of elements of the Galois group of this extension. The natural
320
17.11. The Čebotarev Density Theorem Chapter 17. Global Class Field Theory
question arising from this discussion is: given a polynomial f and a prime p that doesn’t
divide D(f ), is it possible to find, in some canonical way, an element in G with the same cycle
type as the decomposition of f over Fp ? This would successfully generalize both Dirichlet’s
and Frobenius’ results, and indeed Frobenius conjectured that it was possible. The solution
was finally found by Čebotarev after 42 years in the form of his density theorem.
For the next few theorems, we will assume K is a number field and E is a normal, not
necessarily abelian, extension of K, with Galois group G = Gal(E/K).
Let m be a modulus divisible by sufficiently high powers of all the primes of K which
ramify in E. Then the group H m (E/K) := NE/K (IEm )i(Km,1 ) is a congruence subgroup for
m and so the Existence Theorem tells us there is a (unique) abelian extension L/K that is
class field for H m (E/K). We may ‘enlarge’ m by forming a modulus n such that m | n and
NE/K (IEn ) ⊆ H n (L/K). By Corollary 17.8.9, L ⊂ E so we may as well use m after all. This
tells us that H m (E/K) = H m (L/K) and moreover,
m
IK /H m (E/K) = IK
m
/H m (L/K) ∼
= Gal(L/K).
To identify H m (E/K) with Gal(E/K), we prove the following theorem which also serves
to generalize the Artin map to the non-abelian case.
but we showed that the first and last are equal, so it follows that L = M since both are
abelian. Now this tells us by the classification theorem (17.10.5) that Gal(L/K) is the
largest possible quotient of G that is abelian. By definition this is the abelianization of G,
so Gal(L/K) ∼ = G/G0 .
To describe the isomorphism, let P be a prime in IEm and let p = P ∩ K. By Propo-
sition 14.5.13, the primes lying over p are Galois conjugates under the action of G and
therefore p determines a conjugacy class of the Frobenius automorphism FrobE/K (P). This
means that p determines a single element in G/G0 . We define the Artin map for non-abelian
extensions to be
E/K
ϕE/K (p) := G0 .
P
By the work above, this extends to a homomorphism IK m
→ G/G0 .
To complete the description of ϕE/K , we compute its kernel. By Proposition 17.3.2,
E/K L/K
= PL where PL = P ∩ L.
P L
Thus ϕE/K (p) = ϕL/K (p)G0 so ker ϕL/K ≤ ker ϕE/K . But ker ϕL/K = H m (E/K) which was
shown to have index [G : G0 ] in IK
m
. Hence ker ϕE/K = H m (E/K) and our description is
complete.
321
17.11. The Čebotarev Density Theorem Chapter 17. Global Class Field Theory
322
17.11. The Čebotarev Density Theorem Chapter 17. Global Class Field Theory
It follows that Pj ∈ T ⇐⇒ τj στj−1 = σ. Since the τj and therefore the Pj are distinct
(remember that {τj } is a transversal of hσi), the number of primes in T dividing p is equal
to [ZG (σ) : hσi] where ZG (σ) is the centralizer of σ in G = Gal(L/K).
Now let S denote the set of OK -primes divisible by a prime in T and choose some p ∈ S.
There are precisely [ZG (σ) : hσi] primes P ∈ T for which NE/K (P) = p. This implies that
1
[ZG (σ) : hσi] · δ(S) = δ(T ) = |hσi| . Finally, we conclude that
1 1 |C| |C|
δ(S) = = = = .
|hσi| · [ZG (σ) : hσi] |ZG (σ)| |G| [L : K]
The Čebotarev density theorem immediately gives us the following result for abelian
extensions.
Corollary 17.11.6. Let L/K be abelian, m a modulus of K divisible by all primes
that
L/K
ramify in L, and σ ∈ Gal(L/K). Then the set S of primes p - m such that = σ has
p
density
1
δ(S) =
[L : K]
and in particular S is infinite.
This corollary is similar to the conclusion in the proof of Theorem 17.6.5, and both
density theorems imply the surjectivity of the Artin map (this was originally proven in
Corollary 17.6.3). However, Čebotarev’s result implies surjectivity in a much stronger sense,
in that the density of primes in L is uniformly distributed across the collection of sets S
corresponding to conjugacy classes in G. Recall that with Frobenius’ theorem, this density
was only uniformly distributed across divisions, a much less intuitive object to work with in
the group-theoretic sense.
The Čebotarev density theorem is undoubtedly one of the most useful tools in modern
algebraic number theory, and is beginning to have practical application in algebraic geometry.
One important result for our purposes answers a question posed back in Section 14.5.
Proposition 17.11.7. For any Galois extension L/K, there are infinitely many primes of
K that split completely in L.
323
17.11. The Čebotarev Density Theorem Chapter 17. Global Class Field Theory
Q(ζ117 )
Q(ζ9 ) Q(ζ13 )
M = Q(α, β)
3 3
K = Q(α) 9 L = Q(β)
3 3
Example 17.11.8. To illustrate the differences between conjugacy class, division and cycle
type and their associated densities, consider the group G = Z/3Z×Z/3Z. The reason is that
these three types of partitions are all distinct for G, as we will see in a moment. To apply
the density theorems to G we must find a Galois extension M/Q such that G = Gal(M/Q).
We provide two computational methods of constructing such an extension below.
The hard way is to find two extensions K/Q and L/Q of degree 3 and take their com-
positum. By field theory, if K and L are Galois extensions of Q and K ∩ L = Q then the
Galois group of their compositum is a direct product Gal(KL/Q) ∼ = Gal(K/Q) × Gal(L/Q).
There are two concerns: we want M/Q to be Galois with Gal(M/Q) ∼ = Z/3Z × Z/3Z and
we also want K and L to be normal subfields of M .
By the Kronecker-Weber Theorem (17.8.10), we can find all of these abelian exten-
sions within cyclotomic fields. It is a fact that if gcd(m, n) = 1 then Gal(Q(ζmn )/Z) ∼ =
Gal(Q(ζm )/Q) × Gal(Q(ζn )/Z) where ζj denotes a primitive jth root of unity. For our pur-
poses we want an integer k = mn such that gcd(m, n) = 1 and 3 divides ϕ(m) and ϕ(n);
this way we can find subfields of degree 3.
Along these lines, we chose m = 9 and n = 13. We found subfields K = Q(α) and
L = Q(β), where α = ζ9 + ζ98 and β = ζ13 + ζ13 5 8
+ ζ13 12
+ ζ13 . The previous paragraphs ensure
that M = Q(α, β) is a Galois extension of Q with Galois group Gal(M/Q) ∼ = Z/3Z × Z/3Z.
The minimal polynomial of M/Q is
h(x) = x9 + 3x8 − 18x7 − 38x6 + 93x5 + 147x4 − 161x3 − 201x2 + 57x + 53.
324
17.11. The Čebotarev Density Theorem Chapter 17. Global Class Field Theory
Notice that the distribution is essentially uniform across each of the three types of partitions
of G; that is, the distribution of primes in an element of a given partition is proportional to
the size of the element of the partition.
325
17.12. Ring Class Fields Chapter 17. Global Class Field Theory
so C(O) is a generalized ideal class group for K corresponding to the modulus f OK . The
existence theorem (Section 17.10) then says that there is a unique abelian extension L/K
such that Gal(L/K) ∼= C(O).
Definition. For an order O in a number field K, the unique abelian extension L ⊃ K
satisfying Gal(L/K) ∼
= C(O) is called the ring class field of the order O.
Some authors denote a ring class field by KO . It is clear from the classification theorem
that the ring class field of the maximal order OK is precisely the Hilbert class field of K.
We will see that ring class fields are a useful generalization of the Hilbert class field in many
ways.
On the group theory side of things, we have the following characterization of the Galois
group of a ring class field.
Lemma 17.12.1. Let L be the ring class field of the order O in an imaginary quadratic field
K. Then L/Q is Galois and its Galois group can be written as a semidirect product
Gal(L/Q) ∼
= Gal(L/K) o (Z/2Z),
As we did with the Hilbert class field, we√begin by relating a prime p = x2 + ny 2 to its
splitting behavior in the ring class field of Z[ −n].
√
Theorem √ 17.12.2. Fix n ∈ N, let K = Q( −n) and let L be the ring class field of the
order Z[ −n] in K. If p is an odd prime not dividing n, then
p = x2 + ny 2 ⇐⇒ p splits completely in L.
326
17.12. Ring Class Fields Chapter 17. Global Class Field Theory
√
Proof. Let O = Z[ −n] and denote its conductor by f . The discriminant of O is D = −4n,
so we know from Section 17.2 that −4n = f 2 dK , where dK is the discriminant of K. If p - n
is an odd prime, then of course p - f 2 dK and so by Corollary 15.10.10, p is unramified in K.
As with the analogous Theorem 17.1.9, we prove the equivalence of the following statements:
IK (f )/PK,Z (f ) = C(O) ∼
= Gal(L/K)
where
the
isomorphism is the Artin map ϕL/K . This shows thatp ∈ PK,Z (f ) if and only if
L/K L/K
= 1, and Proposition 17.1.3 further implies that = 1 if and only if p splits
p p
completely in L.
(v) ⇐⇒ (vi) Finally, Lemma 17.12.1 shows that L is Galois over Q and so as in the
proof of Theorem 17.1.9, p splits in L if and only p splits in K and some prime lying over p
(e.g. p) splits in L. This proves all equivalences and hence the theorem.
We finally arrive at the main characterization theorem for primes of the form x2 + ny 2 .
Theorem 17.12.3. For every integer n > 0, there is a monic irreducible polynomial fn (x)
of degree h(−4n) with integer coefficients such that for all odd primes dividing neither n nor
the discriminant of fn ,
2 2 −n
p = x + ny ⇐⇒ = 1 and fn (x) ≡ 0 (mod p) for some x ∈ Z.
p
Furthermore, any such choice of fn (x) will be the minimal polynomial
√ of a real algebraic
√
integer α for which L = K(α) is the ring class field of the order Z[ −n] in K = Q( −n).
Proof. As in the proof of Theorem 17.1.8, knowing L/Q is Galois allows us to pick a real
algebraic integer α that generates L/K, that is L = K(α). Let fn (x) be the minimal
polynomial of α over K. By definition such a polynomial is monic, irreducible and has
integer coefficients. Moreover, fn must have degree [L : K] = h(O) = h(−4n).
Let p be a prime not dividing n or the
discriminant
of fn . Then fn is separable mod p, so
−n
p splits completely in K if and only if p = 1. We may assume p splits completely in K,
327
17.12. Ring Class Fields Chapter 17. Global Class Field Theory
Therefore if L ⊂ M then we would be able to conclude that L = K(α) and f is the minimal
polynomial of α over K. To verify L ⊂ M we need the next lemma which, once established,
will allows us to finish the proof of Theorem 17.12.3.
·
Given two sets S and T , we will write S ⊂ T if S is contained in T except for a finite
number of elements. We will apply this in the next lemma to the set
Lemma 17.12.4. Let L and M be Galois extensions of a number field K and define
by definition of the Artin map (and the fact that σ ∈ Gal(N/M )). This shows that OM /Q ∼
=
OK /p so f (Q | p) = 1, which further implies that p ∈ T . In fact, the Čebotarev density
328
17.12. Ring Class Fields Chapter 17. Global Class Field Theory
theorem guarantees that there are infinitely many of these primes p and since we assumed
·
T
⊂ S,we may therefore assume p is one of the primes of T which lies in S. Now this means
L/K
= 1 and by Proposition 17.3.2,
p
L/K N/K
1= = = σ|L .
p P L
Let’s pause for a moment to see how far we have come. Beginning with Example 14.5.11,
where we proved Fermat’s theorem on primes of the form x2 + y 2 , we utilized a number of
tools in algebraic number theory to characterize primes of the form x2 + ny 2 for infinitely
many n – this was Theorem 17.1.8. In order to answer the x2 + ny 2 question for all integer
n, we needed the full force of class field theory, notably Čebotarev’s density theorem, and
this resulted in the characterization proven above. However, both theorems have the same
weakness: they do √ not provide a method for producing the primitive element α of the ring
class field L for Q( −n).
It turns out that there is an element j(O), called the j-invariant of the order O, that
generates L/K where L is the ring class field of K. Its defining characteristics are described
in the so-called First Fundamental Theorem of Complex Multiplication:
Theorem 17.12.5. Let O be an order in an imaginary quadratic field K.
(1) For any proper fractional O-ideal a, j(a) is an algebraic integer.
(2) For any proper fractional O-ideal a, K(j(a)) is the ring class field of K.
(3) For any two proper fractional ideals a, b ⊂ O, j(a) and j(b) are conjugate and therefore
they are all roots of a single irreducible polynomial HO (x) ∈ Q[x] which satisfies
h(O)
Y
HO (x) = (x − j(ai )),
i=1
329
17.12. Ring Class Fields Chapter 17. Global Class Field Theory
where h(O) is the class number of O and ai are distinct representatives of the class
group for O.
(4) The equation HO (x) = 0 is called the class equation for O and there exists an algorithm
for computing the class equation.
The First Fundamental Theorem of CM usually refers to (1) and (2). We will prove
this in Chapter 27. In practice, it is rather difficult to compute HO (x) but there have been
significant results in recent years that make it easier to compute in special cases.
330
Chapter 18
The main focus in the previous chapter was on developing the tools necessary for answering
the question “Given a natural number n and a prime p, when does p = x2 + ny 2 have a
solution in integers x and y?” The object x2 + ny 2 is an example of a quadratic form. In this
chapter we will further explore the theory of quadratic forms and then prove several results
about the special case x2 + ny 2 . Finally, in Section 18.3 we define a symmetric n-Fermat
prime to be a prime x2 + ny 2 such that y 2 + nx2 is also prime and describe the distribution
of such primes for various values of n.
331
18.1. Binary Quadratic Forms Chapter 18. Quadratic Forms and n-Fermat Primes
332
18.1. Binary Quadratic Forms Chapter 18. Quadratic Forms and n-Fermat Primes
Lemma 18.1.3. A form f (x, y) properly represents k ∈ Z if and only if f (x, y) is properly
equivalent to kx2 + b0 xy + c0 y 2 for some b0 , c0 ∈ Z.
Proof. ( =⇒ ) Let f (x, y) = ax2 + bxy + cy 2 and suppose k = f (p, q) for relatively
prime
p q
integers p, q. Then there exist integers r, s such that ps − qr = 1. Set P = and notice
r s
that det P = ps − qr = 1 so P ∈ SL2 (Z). Then writing x̄T = (x y) we have
(ii) For every integer M , f (x, y) properly represents an integer relatively prime to M .
Proof. (i) If p divides f (1, 0) and f (0, 1), this implies p | a and p | c, so f (1, 1) = pa0 + b + pc0
where a = pa0 and c = pc0 . Since f (x, y) is primitive, gcd(a, b, c) = 1 so p cannot divide b
and therefore p - f (1, 1). Similarly, if p divides f (1, 0) and f (1, 1), p must divide a and a + b
which implies p | b as well. Then f (0, 1) = c but since gcd(a, b, c) = 1, p cannot divide c.
Thus p - f (0, 1). The third case is identical to the second.
(ii) Let M be given. For each prime pi in the prime factorization of M , part (i) says
that one of f (1, 0), f (0, 1), f (1, 1) represents a number that is relatively prime to pi . We will
prove the case where M = p1 p2 and then induction on the number of prime factors will finish
the proof of (ii).
333
18.1. Binary Quadratic Forms Chapter 18. Quadratic Forms and n-Fermat Primes
Let k1 and k2 be integers such that p1 - k1 and p2 - k2 . By (i), we may suppose f (x, y)
represents k1 (mod p1 ) via f (x1 , y1 ) and it represents k2 (mod p2 ) via f (x2 , y2 ) for some
x1 , x2 , y1 , y2 ∈ Z. By the Chinese remainder theorem (3.2.10), let K be the unique integer
modulo p1 p2 satisfying
K ≡ k1 (mod p1 )
K ≡ k2 (mod p2 ).
Also using the Chinese remainder theorem (3.2.10), define A and B to be the unique solutions,
modulo p1 p2 , to
Then we can write K = Ak1 + Bk2 . In other words, K is the inverse image of (k1 , k2 ) under
the isomorphism given by the primary decomposition of M :
Z/(M ) ∼
= Z/(p1 ) × Z/(p2 )
Ai + Bj →−7 (i, j).
We use these ingredients to show that f (x, y) properly represents K modulo p1 p2 . Consider
By our choice of K, this shows that f (Ax1 + Bx2 , Ay1 + By2 ) is congruent to K (mod p1 p2 ).
Therefore f (x, y) represents K, which is relatively prime to M by construction.
Example 18.1.5. To illustrate Proposition 18.1.4, consider f (x, y) = 2x2 + 3xy + 6y 2 . Let
p1 = 11 and p2 = 13, whereby M = p1 p2 = 143. By (i) of the proposition, we can represent
k1 = 2 using f (1, 0) and k2 = 6 using f (0, 1). Calculations show that A = 78 and B = 66
(e.g. using a computer algorithm for the Chinese remainder theorem) which gives us
Note that K and M are coprime, so we can show that f (x, y) represents K in order to
demonstrate the conclusion in Proposition 18.1.4(ii). Letting (x1 , y1 ) = (1, 0) and (x2 , y2 ) =
334
18.1. Binary Quadratic Forms Chapter 18. Quadratic Forms and n-Fermat Primes
335
18.1. Binary Quadratic Forms Chapter 18. Quadratic Forms and n-Fermat Primes
Definition. For a fixed D < 0, the number h(D) of equivalence classes of primitive, positive
definite forms of discriminant D is called the class number of D.
Theorem 18.1.11. For every D < 0, the class number h(D) is finite.
Proof. By Theorem 18.1.8, h(D) is the number of distinct reduced forms of discriminant
D. For a reduced form ax2 + bxy + cy 2 of discriminant D, there are only a finite number of
choices for a and b since |b| ≤ a ≤ −D
3
by Lemma 18.1.10. Moreover, D = b2 −4ac shows that
the choices of D, a and b determine c. Therefore there are only a finite number of reduced
forms of discriminant D, so h(D) is finite.
336
18.2. The Form Class Group Chapter 18. Quadratic Forms and n-Fermat Primes
337
18.2. The Form Class Group Chapter 18. Quadratic Forms and n-Fermat Primes
So the product f (x, y)g(x, y) is properly equivalent to F (x, y). This means m divides every
number represented by f (x, y)g(x, y) but by Proposition 18.1.4, f and g represent some
numbers relatively prime to m. Therefore m = 1 so F (x, y) is primitive.
Notice that when D = −4n for an integer n ≥ 1, the principal form is x2 + ny 2 . We now
prove the main theorem for the form class group.
Theorem 18.2.3. Let D ≡ 0, 1 (mod 4) be a negative integer. The set C(D) is a finite
abelian group under Dirichlet composition. Moreover, the identity element is the class con-
taining the principal form and the inverse of the class containing ax2 + bxy + cy 2 is the class
containing ax2 − bxy + cy 2 .
Proof. First, Theorem 18.1.11 says that |C(D)| = h(D) is finite. If f (x, y) = ax2 + bxy + cy 2
and g(x, y) are p.p.d. forms of discriminant D then Proposition 18.1.4(ii) shows we can
replace g with a properly equivalent form g 0 (x, y) = a0 x2 + b0 xy + c0 y 2 with gcd(a, a0 ) =
1. Therefore Dirichlet composition is well-defined on classes of p.p.d. quadratic forms.
Moreover, Dirichlet composition is clearly abelian, so it suffices to check the identity and
inverses.
Let f (x, y) = ax2 + bxy + cy 2 ∈ C(D). Note that for the principal form FD (x, y), a0 = 1
so gcd(a, a0 ) = 1 and Dirichlet composition is well-defined for f and FD . The integer B that
satisfies Lemma 18.2.1 is precisely b, so
b2 − D 2
FD ∗ f (x, y) = aa0 x2 + bxy + y
4aa0
4ac 2
= ax2 + bxy + y
4a
= ax2 + bxy + cy 2 = f (x, y).
b2 − D 2
f ∗ g(x, y) = acx2 + bxy + y = acx2 + bxy + y 2 .
4ac
To finish, we show that F (x, y) = acx2 + bxy + y 2 is properly equivalent to FD (x, y). Using
the matrix S again, F (x, y) is properly equivalent to F (−y, x) and by Example 18.1.2 we
338
18.2. The Form Class Group Chapter 18. Quadratic Forms and n-Fermat Primes
can replace F (−y, x) = x2 − bxy + acy 2 with x2 + (−b + 2n)xy + (n2 − bn + ac)y 2 for any
n ∈ Z. If D ≡ 0 (mod 4), b must be even so let n = 2b . Then
2
b2
2 2 2 2 b
x + (−b + 2n)xy + (n − bn + ac)y = x + (−b + b)xy + − + ac y 2
4 2
−b + 4ac
= x2 + y2
4
D
= x2 − y 2 = FD (x, y).
4
b+1
On the other hand, if D ≡ 1 (mod 4), b is odd so let n = 2
. Then
(3) A positive integer m is represented by a form f (x, y) ∈ C(D) if and only if m = N(a)
for some proper ideal a ∈ Ψ(f (x, y)).
Proof. We will prove (1) and (2). The details of (3) can be found in Cox. √
(1) Let f (x, y) = ax2 + bxy + cy 2 be p.p.d. of discriminant D. Then α = −b+2a D is a root
of the polynomial f (x, 1) = ax2 + bx + c so byhLemma 17.2.7, a[1, α] is a proper ideal of the
√ i
−b+ D
order [1, aα]. Notice that a[1, α] = [a, aα] = a, 2 so it suffices to show [1, aα] = O.
339
18.2. The Form Class Group Chapter 18. Quadratic Forms and n-Fermat Primes
aα + b
f (x, y) and g(x, y) are properly equivalent ⇐⇒ β = for a, b, c, d ∈ Z, ad − bc = 1
cα + d
⇐⇒ [1, α] = λ[1, β] for some λ ∈ K ∗ .
a b
Suppose f (x̄) = g(Ax̄) where A = ∈ SL2 (Z). Then since α is a root of f (x, 1),
c d
2 aα + b
0 = f (α, 1) = g(aα + b, cα + d) = (cα + d) g ,1 .
cα + d
aα+b
Thus cα+d
is a root of g(x, 1) and it is easy to verify that it has positive imaginary part, so
aα+b aα+b a b
β = cα+d . On the other hand, the equation above shows that if β = cα+d for A = in
c d
SL2 (Z) then f (x, 1) and g(A(x, 1)) have the same root. It follows that f (x̄) = g(Ax̄) so the
forms are properly equivalent. This proves the first of the equivalences above.
Next, suppose β = aα+b
cα+d
where ad − bc = 1. Then cα + d ∈ K ∗ so set λ = cα + d. This
implies
aα + b
λ[1, β] = (cα + d) 1, = [cα + d, aα + b]
cα + d
but since ad − bc = 1, [cα + d, aα + b] = [1, α]. On the other hand, if [1, α] = λ[1, β] = [λ, λβ]
for some λ ∈ K ∗ then
λβ = eα + f
and λ = gα + h
e f eα+f eα+f
for some e, f, g, h such that ∈ GL2 (Z). Then β = λ
=and since α and β
gα+h
g h
e f
both have positive imaginary parts, we must have eh − f g = 1, that is ∈ SL2 (Z).
g h
340
18.2. The Form Class Group Chapter 18. Quadratic Forms and n-Fermat Primes
Therefore f and g are properly equivalent if and only if [1, α] = λ[1, β] for some λ ∈ K ∗ .
This establishes an injection
Ψ : C(D) −→ C(O)
" √ #
−b + D
f (x, y) 7−→ a[1, α] = a, .
2
We next show that Ψ is surjective. Let a be a fractional O-ideal which, by the proof of
Proposition 17.2.8, can be written a = [α, β] for some α, β ∈ K. Without loss of generality
assume αβ has positive imaginary part. Set γ = αβ and let ax2 + bx + c be the minimal
polynomial of γ over Q – we may rescale the coefficients to ensure gcd(a, b, c) = 1 and a > 0.
Let f (x, y) = ax2 + bxy + cy 2 which is then a p.p.d. quadratic form. We next check that
f (x, y) has discriminant D = disc(O). Writing O = [1, aγ] we compute the discriminant by
1 aγ 2
D = = a2 (γ̄ − γ)2 = 4a2 im(γ)2 .
1 aγ̄
The roots of ax2 + bx + c are γ and γ̄ which are solutions to the quadratic formula:
√ √
−b + b2 − 4ac −b − b2 − 4ac
γ= and γ̄ = .
2a 2a
√ √ 2
2 −4ac 2 −4ac
So im(γ) = b 2a and hence D = 4a2 b 2a = b2 − 4ac. This is precisely the discrim-
inant of f (x, y). Therefore f (x, y) is a primitive, positive definite form of discriminant D
which maps to a[1, γ] ∼ α[1, γ] = a in C(O). Hence Ψ is surjective.
Now we show that Ψ preserves the group structure of C(D). If f and g are p.p.d.
forms of discriminant D, denote their Dirichlet composition by F (x, y). In the proof of
Theorem 18.2.3, we saw that B = b satisfies the conditions of Lemma 18.2.1 for f and g, so
we can write the images of f, g and F under Ψ as:
√
−b + f dK
Ψ([f ]) = a, = [a, ∆];
2
0
√
0 −b + f dK
Ψ([g]) = a , = [a0 , ∆];
2
√ √
0 −B + f dK 0 −b + f dK
and Ψ([F ]) = aa , = [aa , ∆] where ∆ = .
2 2
We want to show [a, ∆][a0 , ∆] = [aa0 , ∆] in C(O). Note that the conditions on B from
Lemma 18.2.1 give us ∆2 ≡ −B∆ mod aa0 so we have
Since f, g and F are all primitive, the conditions on B also force gcd(a, a0 , B) = 1 so
[a, ∆][a0 , ∆] = [aa0 , a∆, a0 ∆, −B∆] = [aa0 , ∆] as desired. Hence Ψ : C(D) → C(O) is an
isomorphism.
341
18.3. n-Fermat Primes Chapter 18. Quadratic Forms and n-Fermat Primes
The following definitions are not standard in the literature. We have introduced them in
order to facilitate our discussion of Theorem 17.12.3 and Question 18.3.
Question 18.3 can therefore be restated: When is an n-Fermat prime symmetric? The
question is stated rather broadly for a reason, as there are several ways we could answer this.
In this language, Theorems 17.12.3 and 17.12.5 together say the√following: √Let f (x) be
the minimal polynomial of the j-invariant j(O) for the order O = Z[ −n] in Q( −n). Then
−n
a prime p not dividing disc(f ) is an n-Fermat prime if and only if p = 1 and f (x) ≡ 0
(mod p) has an integer solution. In other words, n-Fermat primes are characterized by
congruence conditions in all but finitely many cases. The best possible situation would
therefore be a positive answer to the following question:
Question. For an integer n ≥ 1, are there congruence conditions that determine when an
n-Fermat prime is a symmetric n-Fermat prime?
There is fortunately a case when the answer to Question 18.3 is quite trivial. When n = 1,
an n-Fermat prime is always symmetric. This is certainly the only case when the ratio of
symmetric n-Fermat primes to total n-Fermat primes is 1, as the next example shows.
Example 18.3.1. Let n = 2. The first few symmetric 2-Fermat primes are: p = 3, 11,
19, 43, 59, 67, 83, 107, 139, 163, 179, . . . For small primes it appears that p is a symmetric
2-Fermat prime if and only if p ≡ 3 (mod 8). However, 131 is a 2-Fermat prime since it can
be written 131 = 92 + 2 · 52 , but 52 + 2 · 92 = 187 = 11 · 17 is not prime. Therefore the
condition p ≡ 3 (mod 8) breaks early on.
Using Magma, we generated data to estimate the proportion of symmetric 2-Fermat
primes to the expected number of symmetric 2-Fermat primes with x, y ≤ 1, 000. Empirically,
it appears that the ratio of symmetric 2-Fermat primes to total 2-Fermat primes is about
0.1143; that is, about 11.43% of 2-Fermat primes are symmetric. On the other hand, the
data shows that the ratio of the number of symmetric 2-Fermat primes to the expected
number of 2-Fermat primes, under the assumptions of our Prime Number Theorem heuristic
below, is about 0.9587. That is, there are slightly less symmetric 2-Fermat primes than we
expect. Something interesting is going on here.
For an integer n ≥ 1, let πsym,n (M ) denote the number of primes y 2 + nx2 such that
x2 + ny 2 is prime and x, y ≤ M . Notice that if x2 + ny 2 is prime and x and y are both
342
18.3. n-Fermat Primes Chapter 18. Quadratic Forms and n-Fermat Primes
relatively prime to n, then y 2 + nx2 is necessarily odd. Of course a number has twice the
probability of being prime given that it is odd so the Prime Number Theorem (10.4.2)
heuristically says that for each n ≥ 1, there is a nonnegative real number αn such that
X 1
πsym,n (M ) ∼ 2αn ,
q≤M
log q
where log is the natural logarithm and the sum is over n-Fermat numbers q = y 2 + nx2 ,
x, y ≤ M , for which x2 + ny 2 is prime. For example, the data in Example 18.3.1 shows that
α2 is close to 0.9328. We posit several conjectures related to αn and the asymptotic behavior
of πsym,n (M ) below, along with empirical results that lead us to believe they might hold.
Conjecture. For all n ≥ 1, αn > 0.
Theorem 17.12.3 characterizes primes of the form x2 + ny 2 up to solvability conditions
of fn (x) ≡ 0 (mod p). Moreover, Cox gives a general formula for the Dirichlet density δ(f )
of primes represented by a p.p.d. quadratic form f of discriminant D < 0:
(
1
if f is properly equivalent to its opposite
δ(f ) = h(D)
1
2h(D)
otherwise.
Therefore there are infinitely many n-Fermat primes for any n ≥ 1. In other words, the sum
1
P
q≤M log q over n-Fermat numbers q obtained by switching solutions for n-Fermat primes
diverges as M → ∞, so Conjecture 18.3 would imply that there are infinitely many symmetric
n-Fermat primes for every n ≥ 1. To test this conjecture, we turned Magma loose on some
computations with large search spaces. Through the first 40,000 values for n, and with
search parameters x, y ≤ 1, 000, Conjecture 18.3 is seen to hold. There were several other
interesting observations made, which are discussed via the next two conjectures.
Conjecture. The average value of αn over all n ≥ 1 is equal to 1.
Informally, Conjecture 18.3 means that, on average, n-Fermat primes are about as likely
to be symmetric as the Prime Number Theorem predicts. This is supported by the statistical
analysis of the data we generated.
This describes a global property of the natural numbers, which reinforces the predictions
of the Prime Number Theorem. This shouldn’t be a surprise, as the PNT makes a strong,
global statement about the natural numbers and subsets thereof. However, we know from
experience that the integers often behave more erratically from a local perspective. To this
end, we used Magma to locate the values of n such that αn exceeds a certain threshold r. For
example, there are a handful of numbers n in the first 40,000 such that αn > 2, including:
2277, 12699, 13629, 14540, 15091, 16615, 22576, 24089,
27250, 29127, 29798, 31927, 33060, 34159, 35814.
These n have the apparent property that there are more than twice the number of symmetric
n-Fermat primes than expected. We studied similar data for n values such that αn is less
than a threshold r. In the future we hope to be able to discern why certain numbers have
higher or lower densities of symmetric n-Fermat primes than predicted, but if one is to
believe that the values of αn follow any sort of recognizable distribution, then such outliers
are to be expected in larger and larger data sets.
343
18.3. n-Fermat Primes Chapter 18. Quadratic Forms and n-Fermat Primes
Conjecture. The set of αn is bounded. That is, there are positive constants ε and M such
that for all n, ε ≤ αn ≤ M .
This conjecture is offered solely based on the observations made for large parameter
searches for symmetric n-Fermat primes. It appears so far that 0.4 ≤ αn ≤ 2.1.
Finally, a question lingering on the edge of this discussion is
Question 18.3 is unsolved and it would be difficult at this time to implement a method of
solving p = x2 + ny 2 even for small n. However, there is clear motivation for answering such
a question, as there are important implications to the theory of quadratic partitions and
cryptography.
In a related sense, the characterization (Example 14.5.11) of primes of the form x2 + y 2 ,
that is 1-Fermat primes, forms the basis of a primality test discovered by Euler: m = x2 + y 2
has a single solution (x, y) in positive integers when m is prime. In the future, the complexity
of n-Fermat primes and symmetric n-Fermat primes may contribute to the rise of more secure
cryptosystems and faster primality test algorithms.
344
Chapter 19
In this chapter we reframe the main results in global class field theory using the adèlic
language of Chapter 16, including:
We will give proofs of most results, excluding the difficult Artin reciprocity theorem.
345
19.1. Frobenius Elements Chapter 19. Adèlic Class Field Theory
L P |OL /P| = q d
L
e P
e e = qd
|OLe /P|
K
e p
e |OKe /e
p| = q
K p |OK /p| = q
p) : x + P 7−→ xq + P on OL /P.
FrobL/Ke (P | e
346
19.1. Frobenius Elements Chapter 19. Adèlic Class Field Theory
Gal(L1 L2 /K) ∼
= Gal(L1 /K) × Gal(L2 /K)
and under this isomorphism, FrobL1 L2 /K (p) corresponds to (FrobL1 /K (p), FrobL2 /K (p)).
(2) Suppose M ⊇ L ⊇ K are abelian extensions. Then FrobM/K (p)|L = FrobL/K (p).
Remark. In the abelian case, Frobenius elements may also be written as Artin symbols:
When L/K is not necessarily an abelian extension, FrobK/F (p) is only a conjugacy class in
Gal(K/F ). One may ask whether every conjugacy class in Gal(K/F ) arises as the Frobenius
class for some prime p. The answer was provided by Čebotarev’s theorem (17.11.5).
347
19.2. Artin Reciprocity Chapter 19. Adèlic Class Field Theory
θK : CK −→ Gab
K
called the Artin map, where GK := Gal(F /K). When F is a global field and K is unramified
at p, θK will be defined on classes [(xv )] ∈ CK , where xv = πp if v = p and xv = 1 otherwise,
by θK [(xv )] = (p, K/F ). When F is a local field, θK will be given by a certain power of a
generator of the Galois group Gal(K/F ).
The celebrated Artin reciprocity theorem shows that each of these θK is an isomorphism
and moreover, if L/K is unramified at p, then there is a commutative diagram
θL
CL Gab
L
NL/K
θK
CK Gab
K
h : G × H\G −→ G
(x, Hy) 7−→ s(Hy)xs(Hyx)−1 .
Informally, we might regard h as measuring how far s is from being a homomorphism. Define
for each x ∈ G an element in the abelianization of H, Ve (x) ∈ H ab = H/[H, H] by
Y
Ve (x) = h(x, Hy) mod [H, H].
Hy∈H\G
(b) Ve : G → H ab is a homomorphism.
348
19.2. Artin Reciprocity Chapter 19. Adèlic Class Field Theory
Ve
G H ab
VG/H
Gab
Definition. The homomorphism VG/H : Gab → H ab is called the transfer map for H ≤ G,
or in German, the Verlagerung.
Theorem 19.2.2 (Artin Reciprocity). Let F be a local or global field. Then there exists a
map
θF : CF −→ Gab F
such that
F θ
(1) For every finite abelian extension K/F , the map θK/F : CF −→ Gab
F → Gal(K/F ) is
surjective with kernel ker θK/F = NK/F (CK ).
(2) Conversely, for any finite index open subgroup N ⊆ CF , there exists a finite abelian
extension K/F for which N = ker θK/F . In this case, CF /N ∼
= Gal(K/F ).
(3) If K/F is a finite unramified abelian extension of local fields, then the map θK/F is
given by θK/F (x) = ϕv(x) for any x ∈ F × , where v is the nonarchimedean valuation
on F and ϕ generates Gal(K/F ).
(4) If K/F is a finite abelian extension of global fields and p is a prime of F which is
unramified in K, then
θK/F (x) = (p, K/F )
where x ∈ CF is the class represented by (xv ), with xv = 1 if v 6= p and xp = πp is a
uniformizer at p.
Moreover, the Artin map is functorial in the following sense. If K/F is any finite separable
extension, then there are commutative diagrams
θK θK
CK Gab
K CK Gab
K
349
19.3. Kronecker-Weber Theorem Chapter 19. Adèlic Class Field Theory
Proof. It suffices to prove this for F = Q and Fq . For F = Q, this follows from the
identification Gal(Q(ζpr )/Q) ∼
= (Z/pr Z)× for any prime power pr and the fact that every
cyclotomic extension can be written as a compositum of prime power cyclotomic extensions.
The proof for F = Fq is similar.
F ⊆ F nr ⊆ F cyc ⊆ F ab .
We will prove that the converse holds, i.e. F cyc = F ab ; when F = Q, this is the famous
Kronecker-Weber theorem.
Theorem 19.3.3. Let K/Qp be a cyclic extension of degree pr . Then K ⊆ Qp (ζm ) for some
root of unity ζm .
Proof. We show that K is contained in one of the following: (1) Qp (ζppr −1 ); (2) the index
p − 1 subfield of Qp (ζpr+1 ); and (3) K = Qp (ζmr ) := Qp (ζppr −1 )(ζpr+1 ); all three of which are
cyclotomic, so this would prove the theorem. We now show any K is contained in one of
350
19.3. Kronecker-Weber Theorem Chapter 19. Adèlic Class Field Theory
these extensions. Suppose the contrary. Then for any m = mr as above, K(ζm ) is Galois
over Qp with Galois group
with H ⊆ Z/pr Z × (Z/pr Z × Z/(p − 1)Z) × Z/pr Z. Since we assumed K 6⊆ Qp (ζm ), the final
factor of Z/pr is nontrivial, so that Gal(K(ζm )/Qp ) has a (Z/pZ)3 -quotient. It is a fact that
if p > 2, no extension of Qp has Galois group (Z/pZ)3 , a contradiction. (There is a similar
proof when p = 2, though we will not show it.)
The global case is given by the Kronecker-Weber theorem.
Theorem 19.3.4 (Kronecker-Weber). For any abelian extension K/Q, K ⊆ Q(ζm ) for some
root of unity ζm .
If L/K is an abelian extension of number fields, we have defined the Artin map θL/K :
IL → GL where IL is the idèle group of L and GL is the absolute Galois group of L. As
in Artin’s reciprocity theorem (19.2.2), consider the composition ϕL/K : IL → IK → GK →
Gal(L/K). Then ker ϕL/K = NL/K (IL ).
Definition. For a finite extension L/K, we define Spl(L/K) to be the set of primes of K
that split completely in L with relative degree 1 over Q.
Theorem 19.3.5. Let K be a global field with finite extensions L/K and M/K, where M/K
is Galois. Then L ⊆ M if and only if Spl(M/K) ⊆ Spl(L/K) ∪ Σ for some finite set of
primes Σ.
Proof. ( =⇒ ) is straightforward.
( ⇒= ) Let F ⊇ LM and take σ ∈ Gal(F/K) with F ) F σ ⊇ M . We may choose a ∈ OL
with σ(a) − a 6= 0. Then by Čebotarev’s density theorem (17.11.5), for every σ ∈ Gal(L/K),
there exist infinitely many primes p ⊂ OK with FrobL/K (p) = σ. Choose any of these p and
also pick P ⊂ OL with FrobL/K (P | p) = σ, so taht P - σ(a) − a and p 6∈ Σ. Since σ fixes
M , p splits completely in M but σ does not fix L, so p does not split completely in L, a
contradiction.
In general, Theorem 19.3.5 implies that identifying an abelian extension K/Q comes
down to identifying Spl(K/Q). Let ϕK/Q : IQ → Gal(K/Q) Q be the Artin map and let
U = ker ϕK/Q . By Proposition 19.3.6 below, IQ ∼ = Q× × R+ × p Q Z×p . Since U is an open
subgroup of IQ , we may identify it with an open subgroup of R+ × p Z× p but since R+ has
no nontrivial open subgroups, we must have U = R+ × U for an open subgroup U ⊆ p Z×
Q
p.
By the Chinese remainder theorem (3.2.10) and Artin reciprocity (19.2.2), we can find an
integer m such that U ⊇ Um where Um is the unique open subgroup of CQ corresponding to
Gal(K(ζm )/K). Then if some prime p is equivalent to 1 mod m, it must be in Spl(K/Q).
Hence by Theorem 19.3.5, K ⊆ Q(ζm ). This proves the Kronecker-Weber theorem.
Proposition 19.3.6. IQ ∼ = Q× × R+ × p Z×
Q
p.
351
19.3. Kronecker-Weber Theorem Chapter 19. Adèlic Class Field Theory
0 → N → IQ → Q× → 0
352
Part V
Elliptic Curves
353
Chapter 20
Introduction
The notes in Part V come from a course in algebraic geometry and elliptic curves taught by
Dr. Lloyd West at the University of Virginia in Fall 2016. The first part of the notes are a
survey of the main concepts in algebraic geometry, with an emphasis on curves (i.e. varieties
of dimension 1). Key topics include:
Dimension
Intersection theory
Divisors
Genus
Jacobian of a curve
The main algebraic geometry reference used is Shafarevich’s Basic Algebraic Geometry 1.
The second part of the course covers the basic results in the arithmetic geometry of
elliptic curves, including:
Moduli
Reduction mod p
Zeta functions
354
Chapter 20. Introduction
Heights
Additional topics include the application of elliptic curves to cryptography, higher genus
curves and L-functions. The main text used is Silverman’s Arithmetic of Elliptic Curves.
355
20.1. Geometry and Number Theory Chapter 20. Introduction
Question. Describe the set of all right triangles with integer sides.
Question. A rational number n is said to be congruent if there exists a rational right triangle
with area n. Which rational numbers n are congruent?
We will see that Question 1 is easy to answer, while Question 2 is still unsolved. The
fundamental difference lies in the geometry of each situation.
For example, (3, 4, 5) and (5, 12, 13) are pythagorean triples. Notice that multiplying
any pythagorean triple by an integer n ∈ Z yields another pythagorean triple (in particular,
there are infinitely many pythagorean triples), so we may assume a, b, c are coprime. Such a
triple is called a primitive pythagorean triple.
Theorem 20.1.1. Denote the set of all primitive pythagorean triples by Π. Then there is a
bijection Π ↔ {(x, y) ∈ Q2 | x2 + y 2 = 1}.
Definition. Let k be a field and fix a polynomial f ∈ k[x, y] which is irreducible over the
algebraic closure k̄. Then the curve associated to f is a functor C = Cf given by
C : Fieldsk −→ Sets
K/k 7−→ Ck (K) := {(x, y) ∈ K 2 | f (x, y) = 0}.
For a field extension K/k, the set C(K) is called the K-rational points of the curve C.
356
20.1. Geometry and Number Theory Chapter 20. Introduction
Also note that since f ∈ Q[x, y], we can view f as a polynomial with coefficients in any
finite field Fq , and consequently the Fq -rational points C(Fq ) are defined.
Next, fix the point (−1, 0) on C(K) for any field K and consider the line L : x = 0.
slope = t
(−1, 0)
Theorem 20.1.3. Let k be any field and C = Cf the curve defined by f = x2 + y 2 − 1. Then
there is a bijection
y 1 − t2 2t
χ(x, y) = , ψ(t) = and φ(t) =
x+1 1 + t2 1 + t2
exhibit the bijection.
Theorems 20.1.1 and 20.1.3 answer Question 1: the set of all primitive pythagorean triples
is completely described by the line L given by x = 0, and this description holds over any
field k.
For Question 2, we must understand the set of congruent numbers over a field k. For
n ∈ Q, define the set
357
20.1. Geometry and Number Theory Chapter 20. Introduction
Example 20.1.4. The elliptic curve defined by y 2 = x3 − 25x over R is shown below, with
some points of En (Q) highlighted.
358
20.2. Rational Curves Chapter 20. Introduction
359
20.2. Rational Curves Chapter 20. Introduction
Then Lüroth’s theorem is proven using the fact that tr degk k(C) = 1 when C is a curve,
which means k(C) ⊆ k(t) if and only if k(C) = k(t).
The situation for S 1 , i.e. that existence of rational points is determined by rational
parametrization by a line, in fact holds for all curves defined by a degree 2 polynomial.
(Such a curve is called a quadratic curve or conic.)
Proposition 20.2.7. Let f ∈ k[x, y] be an irreducible quadratic polynomial. Then the curve
C = Cf is rational over k if and only if C(k) is nonempty.
Proof. (Sketch) Fix a point (x0 , y0 ) ∈ C(k) and construct the line ` of slope t through (x0 , y0 )
in the plane k 2 , calling the intersection with C(k)r{(x0 , y0 )} (x, y). Then f (x, t(x−x0 )+y0 )
is the quadratic polynomial defining x coordinates of ` ∩ C, and the polynomial
f (x, t(x − x0 ) + y0 )
ψ(t) =
x − x0
is linear with coefficients in k. A similar parametrization of y coordinates gives a rational
function φ(t) which, together with ψ(t), shows that C is unirational over k. Hence by
Lüroth’s theorem, C is rational over k.
Thus the theory of conics reduces to the problem of finding if a conic curve has a rational
point over a given field.
The strategy of studying roots mod primes p to understand the structure of solutions
in Z illustrates Hasse’s so-called ‘local-global principle’. In Section 22.9, we will use p-adic
analysis (introduced in Section 15.2) to prove:
Theorem. For an irreducible quadratic polynomial f ∈ Q[x, y], if Cf (Qp ) 6= ∅ for all primes
p and Cf (R) 6= ∅, then Cf (Q) 6= ∅.
360
20.2. Rational Curves Chapter 20. Introduction
361
Chapter 21
Algebraic Geometry
362
21.1. Affine and Projective Space Chapter 21. Algebraic Geometry
As sets, An = k n , but the new notation carries with it the implication that An is viewed
geometrically.
Remark. Alternatively, for any field k ⊆ K ⊆ k̄, one can define Ank (K) to be the fixed points
of Ank under the action of the Galois group Gal(k̄/K). In particular, Ank = Ank (k) = (k̄ n )Gk
where Gk = Gal(k̄/k) is the absolute Galois group of the field k.
We will let A denote the polynomial ring k[t1 , . . . , tn ].
Definition. For a polynomial f ∈ A, define its zero set (or zero locus) to be
Z(f ) = {P ∈ An | f (P ) = 0}.
Notice that if I = (F) is the ideal of A generated by F, then Z(F) = Z(I). By Hilbert’s
basis theorem, there exists a finite subset {f1 , . . . , fr } ⊆ F such that Z(F) = Z(f1 , . . . , fr ).
Definition. A subset X ⊆ An is called an algebraic set if X = Z(F) for a set F ⊆ A,
that is, X is algebraic if it is the zero set of some collection of polynomials in k[t1 , . . . , tn ].
By the remark, it is equivalent to say X is a zero set if X = Z(I) for some ideal I ⊂ A.
Thus the operation Z(·) takes a subset of a ring and assigns to it a geometric space. There
is a dual notion:
Definition. For any subset X ⊆ An , we define the vanishing ideal of X to be
363
21.1. Affine and Projective Space Chapter 21. Algebraic Geometry
Examples.
5 The algebraic set Z(y, y − x2 ) = Z(x, y) consists of just the point (0, 0) in A2k :
Z(y − x2 )
Z(y)
Z(y, y − x2 )
Definition. If X = Z(S) ⊆ Ank (k̄) is an algebraic set and K is a field such that k ⊆ K ⊆ k̄,
define the K-points of X by X(K) := X ∩ Ank (K) = X GK , where GK = Gal(k̄/K).
Moreover, we say X is defined over K if J(X) has a generating set consisting of elements
of K[t1 , . . . , tn ].
364
21.1. Affine and Projective Space Chapter 21. Algebraic Geometry
(c) Z(J(X)) ⊇ X.
(d) J(Z(I)) ⊇ I.
(f ) J(Z(J(X))) = J(X).
In particular, these properties demonstrate that the algebraic subsets of An form the
closed sets of a topology on An .
Definition. The topology on An having as its closed sets all algebraic subsets of An is called
the Zariski topology on An .
In (c) and (d), we see that Z and J are not quite inverse operations.
(b) Z(J(X)) = X.
365
21.1. Affine and Projective Space Chapter 21. Algebraic Geometry
Proof. (a) Since X ⊆ X, we immediately get J(X) ⊇ J(X) by Lemma 21.1.2(a). On the
other hand, if f ∈ J(X), f (P ) = 0 for all P ∈ X. In other words, X ⊆ Z(f ) but Z(f ) is
closed by definition, so Z(f ) ⊇ X. Thus f ∈ J(X).
(b) X is algebraic by definition so there exists some ideal I ⊂ A such that Z(I) = X. Now
by (a), Z(J(X)) = Z(J(X)) = Z(J(Z(I))) which by Lemma 21.1.2(e) equals Z(I) = X. So
Z(J(X)) = X as required.
The key development so far is that J and Z establish a correspondence, though not always
bijective, between the ideals of A and the closed subsets of An . Hilbert’s Nullstellensatz says
that when k is algebraically closed, there is a bijective correspondence between algebraic sets
in Ank and radical ideals of A = k[t1 , . . . , tn ].
Theorem 21.1.4 (Hilbert’s Nullstellensatz). If k is algebraically closed, then J(Z(I)) = r(I)
for every ideal I ⊂ A.
Next, we introduce projective space and projective algebraic setes in a manner parallel
to the presentation of affine algebraic sets.
Definition. For n ∈ N, we define projective n-space over k to be the quotient space
Pn = Pnk = An+1 r{0}/ ∼ where (a0 , . . . , an ) ∼ (b0 , . . . , bn ) if and only if there is some λ ∈ k ∗
such that (b0 , . . . , bn ) = (λa0 , . . . , λan ). The coordinates of Pn are written [a0 , . . . , an ], called
homogeneous coordinates.
As in the affine case, for k ⊆ K ⊆ k̄ we can define Pnk (K) = {[a0 , . . . , an ] : ai ∈ K}.
Lemma 21.1.5. For any k ⊆ K ⊆ k̄, Pnk (K) = (Pnk (k̄))GK , where GK = Gal(k̄/K).
Proof. Apply Hilbert’s Theorem 90 (Theorem 17.7.5).
Definition. For a point P = [a0 , . . . , an] ∈ Pnk (k̄), the minimal field of definition for P
over k is the field k(P ) = k aa0i , . . . , aani where ai 6= 0. Alternatively, k(P ) = k̄ G(P ) where
G(P ) = {σ ∈ Gk | σ(P ) = P } ≤ Gk .
√ √ √
Example 21.1.6. The point P = ( 2, 2, 2) ∈ P3Q (Q) has minimal field of definition
Q(P ) = Q since scaling by √12 gives (1, 1, 1) ∈ A3Q .
Let S = k[t0 , . . . , tn ] be the polynomial ring in n + 1 indeterminates. Recall that S is a
graded ring with graded pieces given by total degree:
∞
M
S= Sd where Sd = {f ∈ S | deg f = d}.
d=0
366
21.1. Affine and Projective Space Chapter 21. Algebraic Geometry
Definition. A nonempty topological space X is said to be irreducible if for any two closed
subsets X1 , X2 ⊆ X such that X1 ∪ X2 = X, we have X = X1 or X = X2 .
Definition. If X is an irreducible algebraic set in Ank (k̄) or Pnk (k̄), then X is called geo-
metrically irreducible.
Proof. Obvious.
367
21.1. Affine and Projective Space Chapter 21. Algebraic Geometry
y A2
Example 21.1.12. Take an irreducible polynomial f ∈ k[x, y]. Since k[x, y] is a UFD, (f ) is
a prime ideal so C := Z(f ) is irreducible by Lemma 21.1.10. C is called the (affine) algebraic
curve defined by f , sometimes written f (x, y) = 0. In general, an irreducible polynomial in
k[x1 , . . . , xn ] corresponds to an affine variety Y = Z(f ) ⊆ An , called an (affine) algebraic
hypersurface.
Proposition 21.1.13. If X is a nonempty S algebraic set, then it has finitely many irreducible
components X1 , . . . , Xm such that X = m
i=1 Xi .
368
21.1. Affine and Projective Space Chapter 21. Algebraic Geometry
Ih = {fh | f ∈ I} ⊆ k[t0 , . . . , tn ].
For a projective algebraic set Y ⊂ Pnk , where Y = Z(J) for an ideal J ⊆ k[t0 , . . . , tn ],
we get n + 1 affine algebraic sets Yi = ϕ−1i (Y ∩ Ui ) = Z(J(i) ). These are called the deho-
mogenizations of Y . Conversely, for an affine algebraic set X ⊆ Akn , with X = Z(I), the
projective closure of X in Pnk is the Zariski closure in Pnk of ϕ0 (X), denoted X. Note that
X = Z(I(ϕ0 (X))) = Z(Ih ).
369
21.2. Morphisms of Affine Varieties Chapter 21. Algebraic Geometry
ϕ∗ f : x 7→ f ◦ ϕ(x)
ϕ : X −→ Y
t 7−→ (t2 , t3 )
is both invertible and a morphism, but its inverse is not a morphism so ϕ is not an isomor-
phism of ringed spaces.
Definition. For an algebraic set X ⊆ Ank , we define the coordinate ring of X to be the
quotient ring k[X] := k[t1 , . . . , tn ]/J(X). For any intermediate field k ⊆ K ⊆ k̄, if X is
defined over K we also set K[X] = K[t1 , . . . , tn ]/JK (X). The coordinate ring is defined
similarly for X ⊆ Pnk .
Proposition 21.2.2. Suppose k is algebraically closed and X is an affine variety over k.
Then
(a) OX (X) = k[X], that is, the coordinate ring of X consists of regular k-valued functions
X → k.
(b) For any f ∈ k[X] r {0}, OX (D(f )) = k[X]f , the localization of k[X] at the element
f.
Notice that by Lemma 21.1.10, X ⊆ Ank is a variety if and only if k[X] is an integral
domain.
370
21.2. Morphisms of Affine Varieties Chapter 21. Algebraic Geometry
Definition. For an affine algebraic variety X ⊆ Ank , the function field of X over k is the
fraction field k(X) := Frac k[X]. An element of k(X) is called a rational function on X.
If X is defined over some k ⊆ K ⊆ k̄, then the field K(X) := Frac K[X] is called the field
of K-rational functions on X.
Lemma 21.2.3. For any tower k ⊆ K ⊆ k̄ over which X is defined, K[X] = k̄[X]GK and
K(X) = k̄(X)GK .
For any field k we call elements of MaxSpec k[X] the closed points of X over k.
Theorem 21.2.4. The closed points of X over a field k are in bijective correspondence with
the orbits of Gk on MaxSpec k̄[X].
Example 21.2.5. Let X ⊆ A1Q be the algebraic variety defined by the irreducible polynomial
√
f = x3701 − 2. Then Q[X] = Q[x]/(x3701 − 2) ∼
= Q(
3701
2) is a field, so MaxSpec Q[X] consists
of a single point. On the other hand, MaxSpec Q[X] contains 3701 points.
Fix a variety X over k. The embedding i : k[X] ,→ k̄[X] induces a map on maximal
ideals
i∗ : MaxSpec k̄[X] −→ MaxSpec k[X]
with the following properties:
For every maximal ideal m ∈ MaxSpec k[X], the fibre α(m) := (i∗ )−1 (m) is finite and
nonempty.
The absolute Galois group Gk acts transitively on each fibre α(m), and
The k-points of X are in correspondence with the orbits of size one of this action.
Elements of MaxSpec k[X] are called irreducible 1-cycles. For curves, these irreducible
1-cycles are also called irreducible divisors.
371
21.2. Morphisms of Affine Varieties Chapter 21. Algebraic Geometry
Let X/k̄ be an affine algebraic set. Then X is a ringed space whose structure sheaf
OX : U 7→ OX (U ) is defined on open sets U ⊆ X by
there exists a cover U = Uα such that f |Uα = gα
S
OX (U ) = f : U → k̄ hα
for gα , hα ∈ k̄[X] with hα (P ) 6= 0 for all P ∈ Uα
Proposition 21.2.6. Let X be an affine algebraic set defined over k̄. Then
(a) OX (X) = k̄[X].
h i
1
(b) For any f ∈ k̄[X], OX (D(f )) = k̄[X]f = k̄[X] f
, the localization of k̄[X] at powers
of f .
(c) For any prime ideal p ⊆ k̄[X], OX (X r Z(p)) = k̄[X]p .
Definition. For a point P ∈ X, the local ring of X at P is
n o
OX,P = fg : f, g ∈ k̄x, g(P ) 6= 0 .
Then indeed OX,P is a local ring with maximal ideal mP k̄[X]mP ; by abuse of notation, we
will also denote this maximal ideal by mP . Also note that the residue field κ(P ) := OX,P /mP
is isomorphic to k̄. We will prove that when X is a curve,
\
k̄[X] = OX,P .
P ∈X
372
21.2. Morphisms of Affine Varieties Chapter 21. Algebraic Geometry
phism of k-algebras
ϕ∗ : k[Y ] −→ k(X)
f 7−→ f ◦ ϕ.
Note that if ϕ(U ) is dense in Y , the induced homomorphism extends to an inclusion of
function fields:
ϕ∗ : k(Y ) ,−→ k(X)
f ϕ∗ (f )
7−→ ∗ .
g ϕ (g)
This property is so important that such morphisms are given a name.
Definition. A morphism ϕ : X → Y is said to be dominant if ϕ(X) is dense in Y .
Definition. Let X and Y be affine varieties over k. If there exists a rational ϕ : X 99K Y
which has a rational inverse, that is a rational map ψ : Y 99K X such that ϕ ◦ ψ and ψ ◦ ϕ
are equal to the identity where they are defined, then X and Y are said to be birationally
equivalent over k.
Lemma 21.2.9. X and Y are birationally equivalent over k if and only if k(X) ∼
= k(Y ) as
k-algebras.
A major area of interest in algebraic geometry is the classification of varieties up to
birational equivalence. For curves, there is a canonical invariant called the genus which
completely classifices curves up to birational equivalence over the algebraic closure k̄ of a
field k.
Definition. A rational variety is a variety X over k which is birationally equivalent to
An for some n.
373
21.3. Morphisms of Projective Varieties Chapter 21. Algebraic Geometry
We can now define morphisms between projective varieties using this ringed space struc-
ture.
Definition. A morphism of (quasi-)projective varieties is a map ϕ : X → Y that is a
morphism of the ringed spaces.
The definition of rational maps between affine varieties extends to projective varieties in
the following way.
Definition. For projective varieties X ⊆ Pnk and Y ⊆ Pm k , a rational map ϕ : X 99K Y is
a pair of open sets U ⊆ X and V ⊆ Y and a morphism ϕ = (ϕ0 , . . . , ϕm ) : U → V , such
that each ϕi ∈ k[t0 , . . . , tm ] is a homogeneous polynomial, ϕ(P ) ∈ Y for each P ∈ X and
some ϕi 6∈ J(X).
Definition. A map ϕ : X → Y is regular at a point P ∈ X if at least one ϕi (P ) 6= 0.
We say ϕ is a regular map if it is regular at every P ∈ X.
374
21.3. Morphisms of Projective Varieties Chapter 21. Algebraic Geometry
Note that a quasi-projective set with the Zariski topology is not Hausdorff in general.
Indeed, if X is irreducible, then any nonempty open set is dense. Thus we need a notion to
replace the Hausdorff condition for algebraic sets.
Definition. For a function f ∈ k[X], define the principal open subset of f by D(f ) :=
{P ∈ X | f (P ) 6= 0}.
375
21.4. Products of Varieties Chapter 21. Algebraic Geometry
We thus make X × Y into a ringed space with OX×Y (U × V ) defined for all open sets
U ⊆ X, V ⊆ Y by stipulating that anything of the form
X
∗
f= (πX gi )(πY∗ hi ), for gi ∈ OX (U ) and hi ∈ OY (V ),
∗
is regular on U × V . If g ∈ OX (U ), we must have πX g ∈ OX×Y (U × V ) and likewise, if
∗
h ∈ OY (V ), then πY h ∈ OX×Y (U × V ). Thus for such an f as above, D(f ) is an open subset
of X × Y that would not be open in the usual product topology.
Example 21.4.1. Under the above description of products of affine varieties, An × Am ∼
=
n+m 2
A for any n, m ∈ N. Note that even for n = m = 1, the Zariski topology on A is not
equivalent to the product topology on A1 × A1 .
Lemma 21.4.2. If X and Y are affine varieties, then
(a) X × Y is an affine variety.
σn,m : Pn × Pm −→ P(n+1)(m+1)−1
([x0 , . . . , xn ], [y0 , . . . , ym ]) 7−→ [xi yj ]i,j
such that the image Σn,m := σn,m (Pn × Pm ) has the structure of an algebraic subset that
coincides with the Zariski topology of the product Pn × Pm .
Proof. (Sketch) Viewing P(n+1)(m+1)−1 as a space of (n + 1) × (m + 1) matrices, we have that
Then clearly Σn,m = Z((zij zk` − zkj zi` )i,j,k,` ), so Σn,m is an algebraic set. The fact that σn,m
is a bijection is obvious. One can now verify that the induced topology corresponds to the
topology on Pn × Pm .
376
21.4. Products of Varieties Chapter 21. Algebraic Geometry
Definition. Let V be a vector space over k. The set of lines, i.e. 1-dimensional subspaces,
of V is called the projective space over V , denoted P(V ).
Example 21.4.4. If V = k n is finite dimensional, then P(V ) can be identified with Pnk .
gives X × Y the structure of a ringed space which coincides with the previous description of
the product of two varieties.
G(x0 , . . . , xn , y0 , . . . , ym ),
Example 21.4.7. Consider the Segre embedding P1k × P1k ,→ P3k and set Q = Σ1,1 =
Z(z00 z11 − z01 z10 ). The polynomial z00 z11 − z01 z10 is called a quadric and the embedded
image Q is called a quadric surface. For each α, β ∈ P1k , one gets lines on the quadric surface
realized by {α} × P1k ,→ Q and P1k × {β} ,→ Q. Note that lines of these forms cover Q, for
which reason Q is called a ruled surface.
377
21.5. Blowing Up Chapter 21. Algebraic Geometry
21.5 Blowing Up
We now have a working notion of products of varieties, so consider the space An × Pn−1 .
Coordinates in this space are (P, [`]), where P ∈ An is a point and [`] ∈ Pn−1 is the class of
some line through the origin ` in An . Consider the set B ⊆ An × Pn−1 defined by
On the other hand, if P = 0, the set π −1 (0) = {(0, [`]) ∈ An × Pn−1 } is isomorphic to Pn−1 .
In the dimension 2 case, B is covered by the following affine patches:
Definition. The set B is called the blowup of An at the point 0, denoted B = Bl0 An . The
set E0 An := π −1 (0) ∼
= Pn−1 is called the exceptional divisor of the blowup.
Bl0 X := π −1 (X r {0}).
As the notation suggests, this set is also called the blowup of X at 0. The set
E0 X := Bl0 X ∩ E0 An
378
21.5. Blowing Up Chapter 21. Algebraic Geometry
Remark. More generally, for any subvariety Z ⊆ X, one can define the blowup of X along
Z, a variety BlZ X that is birationally equivalent to X, such that Z is a codimension 1
subvariety of BlZ X.
Note that this variety has a singularity at the point (0, 0). Using the blowup of A2 defined
above, Bl0 A2 , we can blowup X to ‘remove the singularity’ at 0. Let U1 be the first affine
patch and ϕ : U1 → A2 the standard isomorphism. We make the substitution y = xt, so
that ϕ(π −1 (X)) = Z(x2 (t2 − x − 1)). The x2 factor of this polynomial corresponds to the
exceptional divisor E0 X under this blowup, so the proper transform of X at 0 looks like
Blowing up allows us to replace singular curves (or more generally, varieties) with non-
singular curves by a sequence of blowups, such that in each step the birational equivalence
class of the curve is preserved. The problem of finding such a nonsingular blowup is known
as resolution of singularities. Much progress has been made on this problem (e.g. Hironaka’s
theorem says that nonsingular blowups exist for any finite dimensional variety over a field
of characteristic zero), but there is still much to be done (e.g. in finite characteristic cases).
379
21.6. Dimension of Varieties Chapter 21. Algebraic Geometry
Proof. It is well known from commutative algebra that dim k[X] = tr degk k(X). Apply
Proposition 21.6.1.
Proposition 21.6.3. For an affine variety X with projective closure X, k(X) = k(X).
380
21.6. Dimension of Varieties Chapter 21. Algebraic Geometry
The following is a classic result due to Krull, which is proven by an algebraic statement
about height of prime ideals in k[t1 , . . . , tn ].
Corollary 21.6.5. A variety X ⊆ Ank̄ has codimension 1 if and only if X = Z(f ) for a
nonconstant, irreducible polynomial f ∈ k[t1 , . . . , tn ].
Corollary 21.6.8. If X is an affine variety with dimension n and r ≤ n, then any polyno-
mials f1 , . . . , fr ∈ k[X] have a common zero.
Corollary 21.6.9. In P2 , for any forms F and G defining curves C1 = Z(F ) and C2 =
Z(G), we have C1 ∩ C2 6= ∅.
381
21.7. Complete Varieties Chapter 21. Algebraic Geometry
π0 π
Then the Z ∗ (I) are the closed subsets of Pn × Y so it suffices to prove π(Z ∗ (I)) is closed for
all proper homogeneous ideals I ⊂ B. We may assume Z ∗ (I) is irreducible, i.e. I is prime.
382
21.7. Complete Varieties Chapter 21. Algebraic Geometry
We may also assume π|Z ∗ (I) is dominant (changing the target to π(Z ∗ (I)) if necessary). Then
we must show for every y ∈ Y , there exists x∗ ∈ Pn so that (x∗ , y) ∈ Z ∗ (I), since then we
will have π(Z ∗ (I)) = π(Z ∗ (I)).
Take M ⊂ A to be the maximal ideal that vanishes at y. Then J = M B + I is a
homogeneous ideal so Z ∗ (J) is defined, and if we show Z ∗ (J) is nonempty, we’ll be done.
Assume to the contrary that Z ∗ (J) = ∅. Then there is a k > 0 such that Tik ∈ J for each Ti .
Equivalently, there is an m > 0 so that Bm , the set of all degree m homogeneous polynomials
in B, is contained in J. Set N = Bm /(Bm ∩ I). This is a finitely generated A-module in the
obvious way. Moreover, notice that M N = N . Then by Nakayama’s Lemma, this implies
N = 0. But then Bm = Bm ∩ I so it follows that Z ∗ (I) = ∅, which is impossible for a proper
ideal I ⊂ B. Hence Z ∗ (J) 6= ∅ so the theorem is proved.
Example 21.7.3. Consider the variety X = Z(xy − 1) ⊆ A2 . Then under the projection
A2 → A1 , the image of X is A1 r {0} which is not a closed set, so X is not complete. We
will see below that affine varieties are not complete in general.
Corollary 21.7.4. Let X be a connected complete variety. Then OX (X) = k. That is,
every regular k-valued function on X is constant.
g : X → A1 ,→ P1 ,
Proof. Let X be an affine variety of dimension at least 1. View X as a proper subset of affine
n-space An , which has coordinate algebra k[T1 , . . . , Tn ]. Then some coordinate function Ti
does not vanish on X, so Ti ∈ OX (X) is a nonconstant regular function on X.
383
21.8. Tangent Space Chapter 21. Algebraic Geometry
Lα
P
(Formally, we say that the multiplicity of any t as a root of the zero polynomial is ∞.) Then
L is tangent to X at P if the multiplicity of X ∩ L at P is at least 2.
(`) (0)
where fi is the homogeneous part of fi of degree `. If P ∈ X, then fi (P ) = 0. Thus
(1) (2) (1)
fi (αt) = tfi (α) + t2 fi (α) + . . . This shows that Lα ⊆ TP X if and only if fi (α) which is
a linear condition. Thus TP X is a linear subspace of AN as claimed.
Examples.
1 For any P ∈ AN , TP AN = AN .
384
21.8. Tangent Space Chapter 21. Algebraic Geometry
We can use this notion of tangency to formalize the property of “singularity” at a point
of a variety.
385
21.8. Tangent Space Chapter 21. Algebraic Geometry
(a) dP (f + g) = dP f + dP g.
TP X = Z(dP f1 , . . . , dP fm ).
dP g = dP G|TP X
The differential dP induces a map into the dual of the tangent space:
386
21.8. Tangent Space Chapter 21. Algebraic Geometry
m
X
0
G := G − ai f i
i=1
has no linear term by construction and thus G0 ∈ (t1 − α1 , . . . , tN − αN ). On the other hand,
G0 |X = G|X = g so if G0 ∈ (t1 − α1 , . . . , tN − αN )2 then we must have g ∈ m2P . This shows
that ker dP ⊆ m2P . The reverse inclusion is shown similarly, so by the first isomorphism
theorem, mP /m2P ∼= (TP X)∗ .
Corollary 21.8.7. For any affine variety X over an algebraically closed field k, dim X =
dimk mP /m2P for any nonsingular point P ∈ X.
Definition. The vector space mP /m2P is called the cotangent space to X at P . It is the
dual of the tangent space by Theorem 21.8.6.
dP ϕ : TP X −→ Tϕ(P ) Y,
Remark. The above description shows that TP X is an ‘intrinsic object’ to X; that is, it
only depends on the isomorphism class of X. The next result says that the tangent space is
also a local object.
Then the proof of Theorem 21.8.6 goes through with appropriate modifications.
Definition. For any quasi-projective variety X and point P ∈ X, we define the tangent
space to X at P by
TP X = (mP OX,P /(mP OX,P )2 )∗ .
387
21.8. Tangent Space Chapter 21. Algebraic Geometry
By Theorem 21.8.9, this description agrees with TP (X ∩ Ui ) for any affine patch Ui (i.e.
the tangent spaces are isomorphic).
Definition. For a projective variety X ⊆ PN such that J(X) = (F1 , . . . , Fm ), and a point
P ∈ X ∩ Ui , we define the projective tangent space to X at P to be
TP X = Tϕ−1
i (P )
(ϕ−1
i (X ∩ Ui )).
388
21.9. Local Parameters Chapter 21. Algebraic Geometry
Definition. Let A be a local ring with maximal ideal m and residue field k = A/m. Then A
is said to be a regular ring if dim A = dimk m/m2 .
Proposition 21.9.1 shows that P ∈ X is nonsingular if and only if the local ring OX,P is
a regular ring.
389
Chapter 22
Curves
For the rest of the chapter, X will denote an algebraic curve. The first important result
is that the local rings OX,P of a nonsingular curve are discrete valuation rings.
Theorem 22.0.1. Let X be an algebraic curve and P ∈ X a nonsingular point. Then OX,P
is a DVR.
Proof. Fix P ∈ X and let OP = OX,P be the local ring at P , with maximal ideal mP and
residue field κ(P ) = OP /mP . Then by Proposition 21.9.1, OP is a regular local ring. Thus
Corollary 21.8.7 gives us dimκ(P ) (mP /m2P ) = dim X = 1. Let t ∈ mP such that dP t 6= 0; that
is, t is a local parameter at P . Then for f ∈ k̄(X) with f (P ) = 0, we have f = tr u in OP ,
for some u ∈ OP× . Define a map
ordP : OP −→ Z
f 7−→ ordP (f ) = max{d ∈ Z | f ∈ mdP }.
Corollary 22.0.2. For any nonsingular point P ∈ X, OP is a PID and therefore a UFD.
Proof. By the above, every ideal of OP is of the form (tr ) where t ∈ mP is a local parameter.
Definition. Fix a rational function f ∈ k(X) and an integer r > 0. We say f has a pole
of order r at P if ordP (f ) = −r, and a zero of order r at P if ordP (f ) = r.
390
Chapter 22. Curves
Proposition 22.0.3. Every nonconstant, rational function f ∈ k̄(X) has at least one pole.
Proof. A rational function f ∈ k̄(X) with no poles is regular everywhere on X, and therefore
constant by Corollary 21.7.4, since X is projective.
Remark. Each f ∈ k̄(X) has only finitely many zeroes and poles, or none at all.
391
22.1. Divisors Chapter 22. Curves
22.1 Divisors
Definition. Let X be a variety. An irreducible divisor on X is a closed, irreducible
k-subvariety x of X of codimension 1.
When X is a curve over k, an irreducible divisor is a closed point of MaxSpec k[X ∩ Ui ]
for some affine patch Ui , or alternatively, a Gk -orbit of points in X(k̄).
Definition. The degree of an irreducible divisor x on X is the size of the Gk -orbit in X(k̄)
corresponding to x, i.e. deg(x) = [κ(P ) : k] for any P ∈ x.
Example 22.1.1. Let X = P1 . On an affine patch A1 ,→ Ui ⊆ P1 , the irreducible divisors
correspond to irreducible polynomials in k[A1 ] = k[t].
Definition. Let X be a curve over k. The divisor group on X, Div(X), is the free abelian
group on the set of irreducible divisors on X:
( )
X
Div(X) = D = nx x : nx ∈ Z, nx 6= 0 for finitely many x .
x∈X
P
The elements of Div(X) are called
P divisors on X. For a divisor D = x∈X nx x ∈ Div(X),
the degree of D is deg(D) = x∈X nx deg(x).
Example 22.1.2. If k is algebraically closed, then the irreducibleP
divisors are the points of
X, so each D ∈ Div(X) is a weighted sum of points of P X: D = x∈X nx x. The degree of
such a divisor is just the sum of the weights: deg(D) = x∈X nx .
∗
P Now assume X is a nonsingular curve. For f ∈ k(X) , we can define a divisor D(f ) =
x∈X ordx (f )x, called the principal divisor of f . This defines a map
D : k(X)∗ −→ Div(X)
whose image is denoted PDiv(X), the group of principal divisors on X.
Definition. The Picard group, or divisor class group, of X is the quotient group
Pic(X) = Div(X)/ PDiv(X).
This defines an equivalence relation on divisors: D1 ∼ D2 if D1 = D2 + D(f ) for some
f ∈ k(X)∗ .
Example 22.1.3. Consider the variety E = Z(y 2 − x3 − 3x2 − 2x). This is the elliptic curve
defined by y 2 = f (x) where f = x3 + 3x2 + 2x = x(x + 1)(x + 2). The projective closure of
E is E = Z(fh ), where
fh = ZY 2 − X 3 − 3X 2 Z − 2XZ 2 .
Y
Setting y = Z
, we can compute its divisor on E:
X
D(y) = ordP (y)P.
P ∈X
On the affine part, there are only zeroes of y, and they occur precisely at P = (−2, 0), (−1, 0)
and (0, 0).
392
22.1. Divisors Chapter 22. Curves
Note that t ∈ OE,P is a uniformizer whenever dP t 6= 0. Viewing t ∈ k[x, y], i.e. as a lift of
[t] ∈ OE,P , we have that
∂f
t = x is a uniformizer as long as dP x = x|TP E 6= 0, which is equivalent to (P ) 6= 0.
∂y
∂f
t = y is a uniformizer as long as dP y = y|TP E 6= 0, that is, (P ) 6= 0.
∂x
In particular, we can always find a uniformizer! For P = (−2, 0), (−1, 0) and (0, 0), t = y is
a uniformizer. It follows that ordP (y) = 1 at each of these points, and ordQ (y) = 0 for any
other point Q ∈ E. Thus the divisor for y is
g = ζ − (ξ 3 + 3ξ 2 ζ + 2ξζ 2 ).
∂g
Notice that ∂ζ
(0, 0) = 1, so ξ is a uniformizer on this patch. Now
ordP (ζ) = ordP (ξ 3 + 3ξ 2 ζ + 2ξζ 2 ) ≥ min{ordP (ξ 3 ), ordP (3ξ 2 ζ), ordP (2ξζ 2 )}.
393
22.1. Divisors Chapter 22. Curves
We have ordP (ξ 3 ) = 3 and ordP (3ξ 2 ζ), ordP (2ξζ 2 ) ≥ 3. If all three orders are equal to 3, then
by the ultrametric inequality ordP (ζ) must be strictly greater than the minimum, which is
3 in this case. But then ordP (3ξ 2 ζ) = ordP (ξ 2 ) + ordP (ζ) > 2 + 3 > 3, so in fact we cannot
have all three orders equal to 3. Hence ordP (ζ) = 3. We have thus calculated the divisor of
y on the elliptic curve E:
394
22.2. Morphisms Between Curves Chapter 22. Curves
Proof. A more general result is that if Y is a normal variety, i.e. the local rings OY,P are
normal rings, then the locus of nondeterminacy of such a rational map ϕ : Y 99K X is a
subvariety of codimension at least 2. For Y = C a curve, this means there are no points
where ϕ fails to be regular.
A nonconstant rational map ϕ : C1 99K C2 between curves induces a field extension
k(C2 ) ,→ k(C1 ). Since both function fields have transcendence degree 1, this is in fact a
finite field extension.
Definition. For curves C1 and C2 and a rational map ϕ : C1 99K C2 , define the degree
of ϕ by deg ϕ = [k(C1 ) : k(C2 )]; the separable degree of ϕ by degs ϕ = [k(C1 ) : k(C2 )]s ;
and the inseparable degree of ϕ by degi ϕ = [k(C1 ) : k(C2 )]i . We say ϕ is separable if
k(C1 ) ⊇ k(C2 ) is a separable extension.
Definition. Any finitely generated field extension of k with transcendence degree 1 over k
is called a function field of degree 1 over k.
Proof. (Sketch) The assignment X 7→ k(X) determines one direction: we have seen that
k(X) is indeed a function field over k. Conversely, for a function field K/k, we associate
an abstract algebraic curve XK to K by putting a Zariski topology on theTmaximal ideals
of the valuation rings O ⊂ K. The structure sheaf is given by OXK (U ) = P ∈U OP where
U ⊆ XK is open and OP is the valuation ring corresponding to P . This determines the
reverse assignment K 7→ XK . One now checks that these assignments are inverse and
preserve categorical structure.
Now fix nonsingular curves X and Y over k and a morphism ϕ : X → Y defined over k.
Then an irreducible divisor y ∈ Div(Y ) corresponds to a maximal ideal mY (on some affine
patch) with uniformizer ty ∈ k(Y ).
Example 22.2.3. Let X be the plane curve defined by y 2 − x and Y = P1 the projective
line, and let ϕ : X → Y be the x-coordinate projection.
395
22.2. Morphisms Between Curves Chapter 22. Curves
x2
X
x0
x1
Y
y0 y1
Then ϕ∗ y0 = 2x0 + ord∞ (ϕ∗ ty0 )∞ and ϕ∗ y1 = x1 + x2 + ord∞ (ϕ∗ ty1 )∞.
ϕ∗ x = [κ(x) : κ(ϕ(x))]ϕ(x)
396
22.2. Morphisms Between Curves Chapter 22. Curves
If X is defined over the algebraic closure k̄, write Pic(X/k̄) for Pic(X(k̄)). Consider Div(X/k̄)Gk .
Then we have an embedding
397
22.3. Linear Equivalence Chapter 22. Curves
deg : Pic(X) −→ Z.
P
Definition. A divisor D = nx x on X is called effective if nx ≥ 0 for all x ∈ X. In this
case we will write D ≥ 0. Also, if D1 , D2 ∈ Div(X) and D1 − D2 is an effective divisor, we
write D1 ≥ D2 . This defines an ordering on Div(X).
Definition. Let D be an effective divisor on X. Then the Riemann-Roch space associated
to D is the k-vector space
is called the complete linear system of D on X. Any projective subspace of |D| is called
a linear system of D on X.
Note that D is linearly equivalent to an effective divisor if and only if L(D) 6= 0.
Theorem 22.3.3. For any D ∈ Div(X), L(D) is finite dimensional.
Lemma 22.3.4. If D1 , D2 ∈ Div(X) are linearly equivalent, say D1 − D2 = (g) for some
g ∈ k(X)× , then there is an isomorphism
L(D1 ) −→ L(D2 )
f 7−→ gf.
398
22.3. Linear Equivalence Chapter 22. Curves
399
22.4. Differentials Chapter 22. Curves
22.4 Differentials
Definition. For a curve X, the space of meromorphic differentials on X is the k(X)-
vector space ΩX consisting of formal differentials df for each f ∈ k(X)× satisfying
d(f + g) = df + dg,
dα = 0 if α ∈ k,
d(f g) = f dg + g df .
ϕ∗ : ΩY −→ ΩX
X X
ϕ∗ fi dti 7−→ ϕ∗ fi d(ϕ∗ ti ).
(i) df 6= 0.
Definition. Define the order of ω at P ∈ X to be ordP (ω) = ordP (g), where ω = g dt. The
principal divisor associated to ω is then defined to be
X
(ω) = ordP (ω)P.
P ∈X
(2) For any s ∈ k(X) such that s(P ) = 0, ordP (f ds) = ordP (f ) + ordP (s) − 1 if p -
ordP (s), and ordP (f ds) ≥ ordP (f ) + ordP (s) if p | ordP (s).
400
22.4. Differentials Chapter 22. Curves
Definition. The canonical class on a curve X is the class KX = [(ω)] in Pic(X) for any
nonzero differential ω ∈ ΩX .
Lemma 22.4.5. The canonical class is well-defined, i.e. does not depend on the choice of
ω ∈ ΩX .
Note that Ω[X] is a k-vector space but need not be a k(X)-vector space.
Definition. The geometric genus of X is defined as g(X) := `(KX ), the dimension of the
Riemann-Roch space L(KX ) of the canonical class.
Proof. The map is f 7→ f ω for any fixed ω ∈ Ω[X] defining the canonical class.
Remark. For any divisor D ∈ Div(X), `k (D) = `k̄ (D) implies g(X(k)) = g(X(k̄)), so the
geometric genus is unchanged when passing to the algebraic closure k̄. Moreover, g(X) is a
birational invariant of X.
Example 22.4.8. Let X = P1 and let t be a coordinate function on some affine patch U of
P1 . We claim that (dt) = −2∞. Indeed, for any α ∈ U ∼ = A1 , t − α is a local uniformizer at
α. Thus ordα(dt) = ordα (d(t − α)) = 0. At infinity, 1t is a local uniformizer so we can write
dt = −t2 d 1t . Hence
So (dt) = −2∞ as claimed. Now for any ω ∈ ΩP1 , deg(ω) = −2 so we see that `(KP1 ) =
`(−2∞) = 0. Hence the genus of the projective line is g(P1 ) = 0.
Proof. By Corollary 22.4.7, g(P1 ) = dimk Ω[P1 ] but by the calculations above, the genus of
P1 is zero.
401
22.5. The Riemann-Hurwitz Formula Chapter 22. Curves
d(ϕ∗ t) = se g ds + euse−1 ds
=⇒ ordP (d(ϕ∗ t)) = ordP (se g + euse−1 )
= min{ordP (se g), ordP (euse−1 )}.
If char k - e, then this minimum is e − 1; otherwise, when char k | e the minimum is at least
e.
Definition. If ϕ is ramified and char k - eϕ (P ) for all P ∈ X, we say ϕ is tamely ramified.
Otherwise ϕ is wildly ramified.
Remark. If ϕ is tamely ramified, then ordP (d(ϕ∗ t)) = eϕ(P ) − 1 for each P . If ϕ is wildly
ramified at P , then ordP (d(ϕ∗ t)) ≥ eϕ (P ).
Definition. For a morphism ϕ : X → Y , define the ramification divisor
X
Rϕ = ordP (d(ϕ∗ t))P.
P ∈X
Now for ω ∈ ΩY , the canonical classes on X and Y can be defined by KY = [(ω)] and
KX = [(ϕ∗ ω)]. On the other hand, the pullback defines a divisor ϕ∗ KY ∈ Div(X). We want
to determine the relation between these three divisors.
Lemma 22.5.1. If ϕ : X → Y is a morphism of curves, then KX = ϕ∗ KY + [Rϕ ], where
Rϕ is the ramification divisor of ϕ.
Proof. If ω = f dt ∈ ΩY , then
ordP (ϕ∗ ω) = ordP (ϕ∗ f d(ϕ∗ t)) = ordP (ϕ∗ f ) + ordP (d(ϕ∗ t)),
so we see that ordP (ϕ∗ ω) gives the coefficient in KX , ordP (ϕ∗ f ) gives the coefficient in
ϕ∗ KY and ordP (d(ϕ∗ t)) gives the coefficient in Rϕ . Summing over P ∈ X gives the desired
equality.
P
Taking ϕ to be tamely ramified, Rϕ = P ∈X (eϕ (P ) − 1)P so the degree function applied
to the equation in Lemma 22.5.1 gives
X
deg(KX ) = deg(ϕ∗ KY ) + (eϕ (P ) − 1).
P ∈X
402
22.5. The Riemann-Hurwitz Formula Chapter 22. Curves
403
22.6. The Riemann-Roch Theorem Chapter 22. Curves
but `(K) = g by definition and `(0) = 1. Solving for deg(K) we get deg(K) = 2g − 2.
Corollary 22.6.3. Suppose deg(D) > 2g − 2 for some divisor D ∈ Div(X). Then `(D) =
deg(D) + 1 − g.
The genus is a discrete invariant of nonsingular curves. There are two natural questions
that arise:
(1) What are the curves with genus g for a particular g ∈ N0 ?
(2) How do we describe the structure of the collection of all genus g curves?
We will see that one can put the structure of a variety on the collection of genus g curves.
Lemma 22.6.4. Let X be an algebraic curve. Then X ∼ = P1 if and only if there is some
divisor D ∈ Div(X) such that deg(D) = 1 and `(D) ≥ 2.
Proof. ( =⇒ ) If X ∼
= P1 then g(X) = g(P1 ) = 0 by Example 22.4.8. Take a point P ∈ X
and set D = P ∈ Div(X); of course deg(D) = 1. Then by the Riemann-Roch theorem,
404
22.6. The Riemann-Roch Theorem Chapter 22. Curves
(1) X ∼
= P1 .
(2) g = 0 and there exists a divisor D ∈ Div(X) with deg(D) = 1.
405
22.7. The Canonical Map Chapter 22. Curves
Proof. This follows from the Riemann-Roch theorem and Theorem 22.3.9, taking D = KX .
Definition. The canonical map of a genus g ≥ 1 curve X is the map ϕ|KX | : X → Pg−1 .
Example 22.7.2. When char k 6= 2, a hyperelliptic curve is of the form X = Z(y 2 − f (x))
for a polynomial f ∈ k[x]. More generally, the minimal degree of a nonconstant morphism
X → P1 is called the gonality of X. Thus, a hyperelliptic curve is a curve of gonality 2.
We have for the most part dealt completely with the cases of curves of genus g = 0 and
g ≥ 2, so the most interesting work remains to be done for curves of genus g = 1.
406
22.8. Bézout’s Theorem Chapter 22. Curves
Q
P
If L is the linear form representing the line in the figure, then (X · L)P = 1, (X · L)Q = 2 and the
intersection number is (X · L) = 1 + 2 = 3.
Proof. Set f = F/F1 ∈ k(X). Then divX (F ) ∼ divX (F1 ), so deg(divX (F )) = deg(divX (F1 )),
and thus the intersection number is well-defined.
Corollary 22.8.2. If deg F = m and L is any linear form such that L 6∈ J(X), then
(X · F ) = m(X · L).
Lemma 22.8.3. For any form F 6∈ J(X) and any point P ∈ X ∩ Z(F ), (X · F )P = 1 if
and only if F (P ) = 0 and TP X 6⊂ TP Z(F ).
Stated another way, Lemma 22.8.3 says that the intersection multiplicity at P is 1 if and
only if X and Z(F ) meet transversely.
Lemma 22.8.4. For any smooth curve X, there exists a linear form L such that (X ·L)P ≤ 1
for all P ∈ X ∩ Z(L).
407
22.8. Bézout’s Theorem Chapter 22. Curves
(X · F ) = (deg G)(deg F )
408
22.9. Rational Points of Conics Chapter 22. Curves
(b) The pairing bq (v, w) = 12 (q(v + w) − q(v) − q(w)) is symmetric and k-bilinear.
such that F (X, Y, Z) = (X Y Z)MF (X Y Z)t . The determinant deg MF is called the
discriminant of F .
The crucial Hasse-Minkowski theorem says that a plane conic having a Q-rational point
is equivalent to the conic having a rational point over every completion of Q.
409
22.9. Rational Points of Conics Chapter 22. Curves
This theorem is the classic example of Hasse’s “local-to-global principle”: points over the
local fields Qv determine points over Q. Note that the Hasse-Minkowski theorem does not
hold for general varieties X, nor for general fields k.
Example 22.9.3. For a conic X, X(R) 6= ∅ if and only if there is a change of sign among
the coefficients ai in the form F defining X. This condition is easily checked as long as one
can diagonalize MF .
Thus to find rational points of a conic, we need only ask if there is an algorithm for
checking whether X has points over each p-adic field Qp .
Example 22.9.4. Let X = Pn . Then Pn (Q) = Pn (Z) and for any prime p, Pn (Qp ) = Pn (Zp ),
so it’s enough to look for integer solutions. If P = [α0 , . . . , αn ] ∈ Pn (Qp ), then we can clear
denominators so that P = [β0 , . . . , βn ] for βi ∈ Zp and some βj ∈ Z× p . The reduction mod p
n
of P is then given by P = [β̄0 , . . . , β̄n ] ∈ P (Fp ).
e
It turns out that quadratic forms always have points over finite fields. To prove this, we
will need the following counting lemma.
Lemma 22.9.5. For a sum s = α∈Fnq α1k1 · · · αnkn , where α = (α1 , . . . , αn ) and ki ∈ Z≥0 , if
P
Proof. Write
X n
Y X
s= α1k1 · · · αnkn = aki .
α∈Fn
q i=1 a∈Fq
q−2
X 1 − ψ q−1 1−1
= ψm = ≡ = 0.
m=0
1−ψ 1−ψ
Therefore s = 0.
410
22.9. Rational Points of Conics Chapter 22. Curves
Now we have r n
X X
deg P = deg fi (q − 1) < n(q − 1)
i=1 i=1
Corollary 22.9.7. Every quadratic form in at least three variables has a point over each
finite field.
The theory of Hasse-Minkowski extends more generally to number fields K/Q, with
similar local-global principles at work.
We next determine when solutions to quadratic equations F = 0 over finite fields lift to
solutions in Zp , similar to Hensel’s Lemma. To do so, we introduce the notion of an integral
model for a variety over Q.
Note that we may assume the set of all coefficients of an integral model X = {F1 , . . . , Fm }
is coprime.
XFp = Z(F 1 , . . . , F m ) ⊆ PN
Fp ,
411
22.9. Rational Points of Conics Chapter 22. Curves
Definition. We say an integral model X has good reduction mod p if XFp is geometrically
reduced and nonsingular, and bad reduction mod p otherwise.
Lemma 22.9.8. An integral model X = {F1 , . . . , Fm } has good reduction mod p if and only
if Z[X0 , . . . , XN ]/(F1 , . . . , Fm ) ⊗ Fp is a regular ring.
Example 22.9.9. If X ⊆ P2Q is a plane conic and X is an integral model of X over Q given
by a primitive quadratic form F ∈ Z[X0 , X1 , X2 ], then X has bad reduction at a prime p if
and only if p divides the discriminant ∆(F ).
Corollary 22.9.10. A primitive quadratic form F ∈ Z[X0 , X1 , X2 ] has bad reduction at only
finitely many primes.
The following is a stronger version of Hensel’s Lemma (Theorem 15.3.19) that we will
need for lifting solutions of quadratic forms.
Theorem 22.9.11. Let (R, v) be a complete DVR, f ∈ R[x1 , . . . , xN ] and suppose (a1 , . . . , aN ) ∈
RN such that
∂f
v(f (a1 , . . . , aN )) > 2v (a1 , . . . , aN )
∂xi
for some 1 ≤ i ≤ N . Then f has a root in RN .
Then there exists a nontrivial root of F in Zp , that is, α = (α0 , . . . , αn ) ∈ Znp , with α` 6= 0
for some 0 ≤ ` ≤ n, and F (α0 , . . . , αn ) = 0.
412
22.9. Rational Points of Conics Chapter 22. Curves
Theorem 22.9.14. Let F be a nondegenerate, primitive quadratic form over Z and let
X = Z(F ) be the corresponding conic over Q. Then X(Q) 6= ∅ if and only if
(3) F = 0 has a primitive solution mod p2 for all primes p > 2 – i.e. X(Qp ) 6= ∅.
In practice, one need only check (2) and (3) for primes at which X has bad reduction,
and by Corollary 22.9.10 there are only finitely many of these.
413
Chapter 23
Elliptic Curves
y 2 + a1 xy + a3 y = x3 + a2 x2 + a4 x + a6 .
y 2 + a1 xy + a3 y = x3 + a2 x2 + a4 x + a6
414
Chapter 23. Elliptic Curves
and s = us22 . Then the transformation of coordinates between the two Weierstrass equations
has the form
x = u2 x0 + r, y = u3 y 0 + su2 x0 + t.
Since every elliptic curve has a Weierstrass equation, the above can be taken as the general
form of an isomorphism between elliptic curves.
415
23.1. Weierstrass Equations Chapter 23. Elliptic Curves
416
23.1. Weierstrass Equations Chapter 23. Elliptic Curves
nodal cuspidal
417
23.2. Moduli Spaces Chapter 23. Elliptic Curves
E1 : y 2 = x3 + Ax + B and E2 : y 2 = x3 + A0 x + B 0 .
418
23.2. Moduli Spaces Chapter 23. Elliptic Curves
This bijection does not hold in general with classes of elliptic curves over a non-algebraically
closed field. However, Proposition 23.2.2 shows that j is a surjection in general; that is, it is
possible to construct an elliptic curve of any prescribed j-invariant.
Definition. Let C be a collection of objects in a category. If there is a space M such that the
isomorphism classes of objects in C are in bijection with the points of M , then M is called a
moduli space for C.
Example 23.2.5. The projective space Pnk is a moduli space for the collection of lines
through the origin in k n+1 . Likewise, the Grassmannian Gr(k, n) is a moduli space for the
k-dimensional subspaces of a vector space V .
Corollary 23.2.3 says that M1 (k̄) = A1 (k̄) is a moduli space for the collection of elliptic
curves E defined over the algebraic closure k̄. There are more complicated moduli spaces
Mg (k̄) that parametrize the curves of genus g up to isomorphism, for g ≥ 2.
419
23.3. The Group Law Chapter 23. Elliptic Curves
Q
P
P +Q
(d) For all P ∈ E(k), there exists a point −P ∈ E(k) satisfying P + (−P ) = O.
Together, (b) – (e) say that chord-and-tangent addition of points defines an associative,
commutative group law on E(k). The proofs of (a) – (d) are rather routine using the
definition of this addition law, whereas verifying associativity is notoriously difficult. We
will obtain all of these facts as a consequence of the relation between E(k) and Pic0 (X) in
Section 23.4.
420
23.3. The Group Law Chapter 23. Elliptic Curves
y 2 + a1 xy + a3 y = x3 + a2 x2 + a4 x + a6 .
421
23.4. The Jacobian Chapter 23. Elliptic Curves
Definition. An algebraic group over a field k is a variety G over k together with mor-
phisms µ : G × G → G and i : G → G such that µ(a, b) = ab and i(a) = a−1 define a group
structure on G, with identity element e ∈ G(k).
Remark. For any extension K ⊃ k, the variety G(K) is also an algebraic group. The termi-
nology from Chapter 21 carries over to algebraic groups with appropriate modifications, e.g.
an algebraic group is defined over k if it is defined over k as a variety and the multiplication
and inversion morphisms are defined over k.
Example 23.4.1. For any field k, the additive group Ga = A1k is an algebraic group under
addition µ(a, b) = a + b. The multiplicative group Gm = A1k r {0} is also an algebraic group
under multiplication µ(a, b) = ab.
We will prove that the k-rational points on an elliptic curve form an algebraic group.
One can show that these are essentially all of the dimension 1 algebraic groups:
Example 23.4.3. For any n ≥ 1, GLn (k) is an algebraic group defined as a variety by
the nonvanishing of the polynomial det(xij ). Thus GLn (k) is an affine – not a projective –
variety.
When E is an elliptic curve, we will prove that J(E) ∼= E as curves. To do this, we first
0
construct a bijection Pic (E) ↔ E(k) to get a group structure on E(k). We then show that
this determines the structure of an abelian variety on E.
422
23.4. The Jacobian Chapter 23. Elliptic Curves
Lemma 23.4.6. Suppose X is a curve of genus g = 1. Then for any P, Q ∈ X(k), [P ] ∼ [Q]
if and only if P = Q.
Lemma 23.4.7. Let E be an elliptic curve with fixed point O ∈ E(k). For all D ∈ Div0 (E),
there exists a unique point P ∈ E such that D ∼ P − O. Moreover, the map
ξO (D1 ) = ξO (D2 ) ⇐⇒ P1 = P2
⇐⇒ P1 − P2 ∼ O by Lemma 23.4.6
⇐⇒ D1 − D2 ∼ O
⇐⇒ D1 ∼ D2 .
Definition. The inverse of ξO is the map κ : E(k) → Pic0 (E), P 7→ [P − O], called the
Abel-Jacobi map.
For points P, Q ∈ E(k), the Abel-Jacobi map defines an abelian group law by P + Q :=
ξO (κ(P ) + κ(Q)), with κ(P ) + κ(Q) taking place in Pic0 (E). We now show that this group
law matches the chord-and-tangent operation from Section 23.3.
Lemma 23.4.9. The chord-and-tangent and Abel-Jacobi operations on E(k) are the same.
423
23.4. The Jacobian Chapter 23. Elliptic Curves
Proof. Fix the points P, Q, R, R0 ∈ E(k) and lines L, L0 be as in Section 23.3. Then L is
a line given by some linear form f (X1 , X2 , X3 ) = αX1 + βX2 + γX3 . Note that Xf3 defines
a rational function on E, and divE (f ) = Xf3 = P + Q + R − 3O – we can deduce that
ordO (f ) = 3 since the divisor Xf3 must have degree 0. On the other hand, L0 is given by
0
some other linear form f 0 (X1 , X2 , X3 ), for which we have divE (f 0 ) = Xf 3 = R+O+R0 −3O.
Subtracting these equations gives:
R0 − P − Q + O = divE (f ) − divE (f 0 ) = ff0 ∼ 0.
Adding and subtracting O, we get
(R − O0 ) − ((P − O) + (Q − O)) ∼ O =⇒ κ(R0 ) − (κ(P ) + κ(Q)) = 0 in Pic0 (E)
=⇒ κ(R0 ) = κ(P ) + κ(Q).
Finally, since ξO is a bijection, ξO (κ(P ) + κ(Q)) = R0 = P + Q as required.
Corollary 23.4.10. The chord-and-tangent law is an associative group law on E(k).
Theorem 23.4.11. The operation µ : (P, Q) 7→ P + Q is a morphism on E(k).
Proof. Suppose E is given by a short Weierstrass form y 2 = x3 + Ax + B and fix points
P = (x1 , y1 ), Q = (x2 , y2 ) ∈ E(k). Then −P = (x1 , −y1 ). The line L through P and Q is
explicitly given by the linear form
y2 − y1
f : y − y1 = λ(x − x1 ) where λ = .
x2 − x1
Substituting this into the the Weierstrass equation, we get
(y1 + λ(x − x1 ))2 = x3 + Ax + B
=⇒ 0 = x3 − λ2 x2 + (2λy1 − A)x + (y12 − 2λy1 x1 − 2λx1 + λ2 x21 − B).
This cubic equation has three solutions, two of which are known already: x1 and x2 . Further,
if P + Q + R = 0 for R = (x3 , y3 ), then the trace of the cubic polynomial is given by
λ2 = x1 + x2 + x3 when P and Q are distinct. Therefore we get the following formula for R:
R = (x3 , y3 ) = (λ2 − x1 − x2 , λ(x3 − x1 ) + y1 ).
(Compare this to the formulas in Proposition 23.3.2.) Similarly, for P = Q we get
2 !
3x1 + A2
2
−x31 + Ax1 + 2B
3x1 + A
R = (x3 , y3 ) = − 2x1 , − x3 − .
2y1 2y1 2y1
In both cases, the map (P, Q) 7→ −R = P + Q is given by rational functions on the affine
patch of E(k) away from the point at ∞, and the argument at ∞ is similar.
Corollary 23.4.12. E(k) is an abelian variety, and therefore so is the Jacobian J(E)
Remark. In cryptography, it is vital to be able to compute nP quickly, say over a finite
field Fq . To do this efficiently, one writes n as a binary sequence and employs a fast adding-
and-doubling formula for the coordinates of a point. For example, 10P = 2(2(2P ) + P ) can
be computed in a small number of steps. An alternative is to use different coordinates for
an elliptic curve, such as the Jacobian-Edwards coordinates.
424
Chapter 24
Let E be an elliptic curve defined over a field k with point O ∈ E(k). We saw in Chapter 23
that the rational points E(k) form an abelian group, and in fact an abelian variety over k.
In this chapter we will describe the structure of this group.
En (k) = {P ∈ E(k) | nP = O}
of E(k). The torsion subgroup of E(k) is the union of all of these subgroups:
∞
[
Etors (k) = En (k).
n=0
Proof. A consequence of Theorem 23.4.11 is that for any n, the map [n] : E → E, P 7→ nP
is regular. Clearly the kernel of this map is En (k).
We will prove:
(2) deg[n] = n2 and for every d | n, the set of d-torsion points of En (k) has size
#En (k)[d] = d2 .
(3) En (k) ∼
= Z/nZ × Z/nZ.
Ultimately, our goal is to characterize Q-rational points of an elliptic curve. The classic
result in this direction is the Mordell-Weil theorem:
Theorem 24.0.3 (Mordell-Weil). For any elliptic curve E, E(Q) is finitely generated.
425
Chapter 24. Rational Points on Elliptic Curves
As a consequence, we can write E(Q) = Etors (Q) ⊕ Zr where r is called the rank of
E. Then Theorem 24.0.2 and its analogues in characteristic p give a characterization of
the torsion part of E(k). It turns out that Etors (Q) can be effectively computed from the
Weierstrass equation for E. There are countless other interesting results about this group of
rational points, such as Mazur’s suprising theorem:
Theorem 24.0.4 (Mazur). For any elliptic curve E, #Etors (Q) ≤ 16.
Thus the mystery lies in the rank of E. There is a method for finding the generators of
the free part of E(k), known as descent. To understand this here and in Chapter 25, we will
study isogenies, height functions and the Selmer and Tate-Shafarevich groups.
426
24.1. Isogenies Chapter 24. Rational Points on Elliptic Curves
24.1 Isogenies
The class of elliptic curves E over k with specified point O ∈ E(k) form a category, and the
morphisms in this category are called isogenies.
Definition. An isogeny between two elliptic curves (E1 , O1 ) and (E2 , O2 ) is a nonconstant
morphism ϕ : E1 → E2 such that ϕ(O1 ) = O2 .
Example 24.1.1. For the purpose of studying the group E(k), an important isogeny is the
multiplication map [n] : E → E, P 7→ nP . This is regular by Theorem 23.4.11.
Proposition 24.1.2. An isogeny is a morphism of algebraic groups.
Proof. The pushforward map ϕ∗ : Div(E1 ) → Div(E2 ) descends to the Picard group, induc-
ing a commutative diagram
ϕ∗
Pic0 (E1 ) Pic0 (E2 )
κ κ
E1 E2
ϕ
Here, the vertical arrows are the Abel-Jacobi maps, which are isomorphisms by Theo-
rem 23.4.8. Assuming ϕ(O1 ) = O2 , the diagram shows ϕ(P + Q) = ϕ(P ) + ϕ(Q) so the
group structure is preserved.
Remark. Let P ∈ E be a point on an elliptic curve and define a morphism τP : E → E
by Q 7→ Q + P . Then for any regular map α : E1 → E2 , the composition τ−α(O1 ) ◦ α is an
isogeny. That is, every regular map is an isogeny up to translation.
Definition. For two elliptic curves E1 , E2 over k, define the k-morphisms
For any elliptic curve E over k, we also define the endomorphisms and automorphisms
of E by:
Endk (E) = Homk (E, E) and Aut(E) = Endk (E)× .
Lemma 24.1.3. Homk (E1 , E2 ) is an abelian group under pointwise addition: (ϕ + ψ)(P ) =
ϕ(P ) + ψ(P ). Further, Endk (E) is a ring under function composition.
Proof. Obvious.
Proposition 24.1.4. (a) For any elliptic curve E, the multiplication map [m] : E → E
is an isogeny for all nonzero m ∈ Z.
427
24.1. Isogenies Chapter 24. Rational Points on Elliptic Curves
is an isomorphism.
Proof. (1) degs ϕ = #ϕ−1 (Q) for all but finitely many Q ∈ E2 . Fix such a Q and let Q0 ∈ E2
and R ∈ E1 such that ϕ(R) = Q0 − Q. Then τR : ϕ−1 (Q) → ϕ−1 (Q0 ) is a bijection, so all
points in E2 have the same number of preimages.
It is clear that τP∗ induces an automorphism of k(E1 ) so we need only check it fixes ϕ∗ k(E2 ).
For P ∈ ker ϕ, ϕ ◦ τP = ϕ since ϕ(P ) = O. Thus for f ∈ k(E2 ),
Thus it’s enough to show the map is injective. If τP∗ is the identity field automorphism, then
τP∗ fixes k(E1 ), so f ◦ τP = f for all f ∈ k(E1 ). In particular, f (P ) = f (O1 ) for all f ∈ k(E1 ),
but by Corollary 21.3.5, this implies P = O1 .
428
24.1. Isogenies Chapter 24. Rational Points on Elliptic Curves
If k is not algebraically closed, then each P ∈ ker ϕ may not be defined over k. However,
if this condition is satisfied, we would still have ker ϕ ∼
= Aut(k(E1 )/ϕ∗ k(E2 )).
Remark. In the language of Grothendieck’s algebraic geometry, (1) says that “separable
isogenies are étale covers”, while (2) says that “separable isogenies are Galois covers”. Thus
we see the connections between Galois theory, covering space theory and isogenies between
elliptic curves begin to emerge.
ψ
λ
E3
Proof. Set G = Gal(k(E1 )/ϕ∗ k(E2 )); we may use this notation since by hypothesis the field
extension is Galois. Then G ∼ = ker ϕ ⊆ ψ ∼ = Aut(k(E1 )/ψ ∗ k(E3 )), so in particular G fixes
ψ k(E3 ). Since k(E1 )/ϕ k(E2 ) is Galois, we have inclusions of fields ψ ∗ k(E3 ) ⊆ ϕ∗ k(E2 ) ⊆
∗ ∗
Remark. Suppose E1 and E2 are elliptic curves in short Weierstrass form. Then for any
isogeny ϕ : E1 → E2 over k, we can write
u(x) s(x)
ϕ(x, y) = , y for u, v, s, t ∈ k[x].
v(x) t(x)
In this case deg ϕ = max{deg u, deg v}, and ϕ is inseparable if and only if u = f (xp ) and
v = g(xp ) for f, g ∈ k[x], where p = char k.
429
24.1. Isogenies Chapter 24. Rational Points on Elliptic Curves
Recall that the invariant differential of an elliptic curve in Weierstrass form is the mero-
morphic differential ω = 2y+adx1 x+a3
∈ ΩE . By Lemma 22.4.1, dimk(E) ΩE = 1 so ω is a
generator. The following proposition explains the name of the invariant differential.
Proof. The first property is clear for m = 0, 1. Now induct on m, using Theorem 24.1.12 on
[m + 1]∗ ω = [m]∗ ω + ω.
Example 24.1.15. An important application is that the map [1] − π is always separable.
Notice that [1] − π : E(Fq ) → E(F q ) has kernel E(Fq ).
430
24.2. The Dual Isogeny Chapter 24. Rational Points on Elliptic Curves
Proof. For the construction, recall the Abel-Jacobi map and its inverse from Theorem 23.4.8:
κ ϕ∗ ξO
− Div0 (E2 ) −→ Div0 (E1 ) −−→
b : E2 →
ϕ 1
E1 .
(5) deg ϕ
b = deg ϕ.
(6) ϕ
b = ϕ.
b
Proposition 24.2.3. For any pair of elliptic curves E1 , E2 , degree map deg : Hom(E1 , E2 ) →
Z is a positive definite quadratic form, meaning for all ϕ, ψ ∈ Hom(E1 , E2 ),
Lemma 24.2.4. For any endomorphism ψ ∈ Endk (E), the trace is equal to
431
24.2. The Dual Isogeny Chapter 24. Rational Points on Elliptic Curves
= (m + nψ) ◦ (m\
+ nψ)
= deg(m + nψ) ≥ 0.
432
24.2. The Dual Isogeny Chapter 24. Rational Points on Elliptic Curves
(2) If char k = p > 0, then for any e ≥ 1, either Epe (k) = 0 or Epe (k) = Z/pe Z.
π e ) = 1 when π
since π is inseparable. Now degs (b b is inseparable and pe when π
b is separable,
so the two cases follow.
433
24.3. The Weil Conjectures Chapter 24. Rational Points on Elliptic Curves
The zeta functions of curves have many parallels to Dedekind zeta functions of number
fields in algebraic number theory (see Section 17.5).
The following statements were conjectured by Weil and proven in the 20th century by
Weil (for curves), Artin, Grothendieck and Deligne.
Theorem 24.3.2 (Weil Conjectures). Let X be a smooth projective variety over Fq of di-
mension n. Then
(b) (Functional Equation) There is an integer e = e(X), called the Euler characteristic of
X, for which the zeta function satisfies
434
24.3. The Weil Conjectures Chapter 24. Rational Points on Elliptic Curves
However, Artin, Grothendieck and others were able to devise a cohomology theory called
étale cohomology for which the following fixed point property holds:
∞
X
r
#{fixed points of π } = (−1)i tr((π r )∗ : H i (X, Q` ) → H i (X, Q` )),
i=0
where H i (X, Q` ) is the `th étale cohomology group of X. As a sidenote, the étale cohomology
groups satisfy H i (X, Q` ) ⊗ C ∼ = H i (X(C); C), where the latter is the topological (singular)
cohomology of X with coefficients in C.
Then the functional equation has a nice form: ζX/Fq (1 − s) = ζX/Fq (s), as with Dedekind
zeta functions (see Sections 12.1, 12.4 and 17.5). Also, the Riemann hypothesis says that
√
ζX/Fq (s) = 0 for s ∈ C satisfying |q s | = q, i.e. Re(s) = 12 .
Example 24.3.3. For an elliptic curve E/Fq , one can prove that
Then by the Hasse bound (Corollary 24.2.6), (tr π)2 − 4q ≥ 0, so the roots t = α1 and β1 are
complex conjugates. Thus |α| = |β|, but since αβ = q, we get |α| = q 1/2 . Thus the Riemann
hypothesis holds for elliptic curves.
435
24.4. Elliptic Curves over Local Fields Chapter 24. Rational Points on Elliptic Curves
Then Ee ns is called the nonsingular locus of the reduction; E (0) (K) the points of non-
singular reduction; and E (1) (K) the kernel of reduction.
436
24.4. Elliptic Curves over Local Fields Chapter 24. Rational Points on Elliptic Curves
Proposition 24.4.4. Let E be an elliptic curve over a local field K with reduction E.
e Then
(a) E
e is a curve over k with at most one singular point.
(c) If ∆ 6 0, then E
e = e is nonsingular, and hence an elliptic curve over k.
(d) If ∆
e = 0 and A e 6= 0, then E
e has a nodal singular point. Moreover, if y = a1 x + β1
and y = a2 x + β2 are the equations of the two tangent lines at the nodal point of E,
e
then there is an isomorphism of algebraic groups
e ns −→ Gm = A1k r {0}
E
y − α1 x − β1
(x, y) 7−→ .
y − a2 x − β 2
(e) If ∆
e = 0 and A e = 0, then Ee has a cuspidal singular point. Moreover, if y = αx + β is
the tangent line at this cusp (x0 , y0 ), then there is an isomorphism of algebraic groups
e ns −→ Ga = A1
E k
x − x0
(x, y) 7−→ .
y − αx − β
If ∆
e 6= 0, E is said to have good reduction. Otherwise, E has bad reduction.
If ∆
e = 0 and A
e = 0, then E is said to have additive reduction.
e ns (k).
This gives us the beginning of a filtration of E
Lemma 24.4.6. Suppose P = [X, Y, Z] ∈ E(K). Then P ∈ E (0) (K) if and only if for some
N ≥ 1, v(X) = 2N , v(Y ) = 0 and v(Z) = 3N .
Definition. For a point P = [X, Y, Z] ∈ E(K), the N satisfying Lemma 24.4.6 is called the
level of P . We formally define the level of O to be ∞. For each N ≥ 1, define
437
24.4. Elliptic Curves over Local Fields Chapter 24. Rational Points on Elliptic Curves
E : Y 2 Z = X 3 + AXZ 2 + BZ 3
eN is given by Y 2 Z N = X 3 which is a
is also a curve over K. Moreover, the reduction E N N
(N ) (0)
cuspidal curve, so EN has additive reduction. Also observe that E (K) = EN (K) and
e
(1)
E (N +1) (K) = EN (K) for any N ≥ 1. Applying the short exact sequence from Proposi-
tion 24.4.5 to these groups gives isomorphisms
Corollary 24.4.8. Suppose the residue field k has characteristic p > 0. If P ∈ E (1) (K) is
a torsion point then its order is pr for some r ≥ 1.
which means pQ ∈ E (N +1) (K). Thus mQ, pQ ∈ E (N +1) (K), but p and m are relatively
prime, so it follows that Q ∈ E (N +1) (K), a contradiction. Hence pr P = Q = O.
438
24.4. Elliptic Curves over Local Fields Chapter 24. Rational Points on Elliptic Curves
Then |u(P )| = p−N where N is the level of P . To prove Theorem 24.4.10, we need two
lemmas.
Lemma 24.4.11. Take P1 , P2 ∈ E (1) (K) and suppose none of P1 , P2 , P1 + P2 are O. Then
|u(P1 + P2 ) − u(P1 ) − u(P2 )| ≤ max{|u(P1 )|5 , |u(P2 )|5 }.
Proof. Without loss of generality we may assume |u(P1 )| ≥ |u(P2 )|. Let N be the level of
P1 , and set XN = p2N X, YN = Y and ZN = p3N Z, defining the curve EN as in the proof of
Theorem 24.4.7. Then EN has additive reduction with singular point (0, 0). Further, since
P1 , P2 ∈ E (1) (K) ⊆ E (0) (K), neither of these reduces to the singular point. Now the line
between Pe1 and Pe2 does not pass through (0, 0), so before reduction, the line between P1
and P2 has the form
ZN = `XN + mYN for l, m ∈ Z, |`| ≤ 1, |m| ≤ 1.
The third point of intersection between this line and EN is calculated by:
0 = −YN (`XN + mYN ) + XN3 + p4N AXN (`XN + mYN )2 + p6N B(`XN + mYN )3
= c3 XN3 + c2 XN2 YN + c1 XN YN2 + c0 YN3 . (∗)
Rearranging, we get the following relations:
c3 = 1 + p4N A`3 + p6N B`3 (24.1)
4N 6N 2
c2 = 2p A`m + 3p Bm` . (24.2)
Then (1) implies |c3 | = 1, while (2) implies |c2 | ≤ p−4N . On the other hand, dehomogenizing
(∗), we find that the roots of the equation are p−N u(P1 ), p−N u(P2 ) and p−N u(P1 + P2 ). The
sum of the roots must be −c c3
2
, so combining all of this information gives us
439
24.4. Elliptic Curves over Local Fields Chapter 24. Rational Points on Elliptic Curves
Lemma 24.4.12. For all P ∈ E (1) (Qp ) and m ∈ Z, |u([m]P )| = |m| |u(P )|.
Proof. This is trivial when m = 0. For m > 0, Lemma 24.4.11 implies |u(mP ) − mu(P )| ≤
|u(P )|5 . When p - m, |u(mP )| = p−N and |mu(P )| = p−L for some N ≥ L > 1. If L 6= N ,
then |u(mP ) − mu(P )| = p−L > |u(P )|5 by the ultrametric inequality, but this contradicts
Lemma 24.4.11. Thus L = N , so |u(mP )| = |m| |u(P )|. A similar proof works for the case
p = m. Finally, if p | m, the equality is verified by induction on the power of p dividing
m.
We now give the proof of Theorem 24.4.10.
Proof. If P ∈ E (1) (Qp ) is a nontrivial torsion point, then [m]P = O for some m ∈ Z.
However, by Lemma 24.4.12, 0 = |u(O)| = |u([m]P )| = |m| |u(P )| = 6 0, a contradiction.
(1)
Hence E (Qp ) has no nontrivial torsion.
Remark. If E is not in short Weierstrass form, e.g. if p = 2, the theorem may be false.
However, in that case the same proof shows that E (2) (Qp ) is torsion-free.
Corollary 24.4.13. If E is an elliptic curve with good reduction over K, then there is an
embedding Etors (K) ,→ E(k).
e
Corollary 24.4.14. If E is an elliptic curve with good reduction over K, then Etors (K) is
a finite group.
Suppose E is an elliptic curve over Q with good reduction mod p. Then there are
embeddings Etors (Q) ,→ Etors (Qp ) ,→ E(F
e p ). This proves:
Corollary 24.4.15. For any elliptic curve E/Q, Etors (Q) is finite.
Example 24.4.16. Consider the elliptic curve
E : y 2 + y = x3 − x + 1.
Then ∆E = −611 = −13·47 so E has good reduction mod 2. One can see that E(F
e 2 ) = {O},
so it follows that E(Q) is torsion-free.
Example 24.4.17. Consider the elliptic curve
E : y 2 = x3 + 3.
Here ∆E = −3888 = −24 · 35 , so E has good reduction mod p for all primes p ≥ 5. Using the
methods described, one can check that #E(Fe 5 ) = 6, while #E(F
e 7 ) = 13, so it follows that
E(Q) has no torsion. Notice that (1, 2) ∈ E(Q) is a rational point. Then (1, 2) has infinite
order, a completely nontrivial fact.
440
24.4. Elliptic Curves over Local Fields Chapter 24. Rational Points on Elliptic Curves
E : y 2 = x3 + x.
Then its discriminant is ∆E = −64. One checks that (0, 0) is a point of order 2 in E(Q), and
that #E(F
e 3 ) = 4, #E(F
e 5 ) = 4 and #E(F
e 7 ) = 8. So the trick in the previous two examples
will not work here. However, one can further show that
e 3 ) = {O, (0, 0), (2, 1), (2, 2)} ∼
E(F = Z/4Z,
e 5 ) = {O, (0, 0), (2, 0), (3, 0)} ∼
while E(F = Z/2Z × Z/2Z.
Theorem 24.4.19. Let (K, R) be an arbitrary local field whose residue field k has char-
acteristic p > 0. Consider an elliptic curve E over K and a point P = (x, y) ∈ E(K).
Then
Proof. For any prime p at which E has good reduction, there is an embedding Etors (Q) ,→
Etors (Qp ), but we know by Theorem 24.4.10 that x, y ∈ Zp . Since Zp ∩ Q = Z, it follows
that x, y ∈ Z.
Next, it is clear that [2]P = O if and only if y = 0, so suppose [2]P = (x2 , y2 ). Since P is
torsion, [2]P is also torsion, so x2 , y2 ∈ Z by the first paragraph. From the addition formula
(Proposition 23.3.2), we see that
2
3x2 + A
x2 = + 2x,
2y
and y 2 = x3 + Ax + B, so we see that 4A3 + 27B 2 ≡ 0 mod y 2 . This proves the result.
Theorem 24.4.21. A point P ∈ E(Q) is non-torsion if and only if there exists some n ∈ Z
such that [n]P has non-integral coordinates.
This statement is proven by Siegel’s result that an elliptic curve over Q has at most
finitely many integral points.
441
24.5. Jacobians of Hyperelliptic Curves Chapter 24. Rational Points on Elliptic Curves
αD : C −→ E = J(C)
P 7−→ [n]P − D.
This endows C with the structure of an [n]-cover of E (again, see Section 25.3). For example,
a divisor D ∈ Div(C) of degree n = 2 determines a map ϕD : C → P1 whose image is a
variety given by the equation Y 2 Z 2 = U (X, Z), where U is a quartic in X, Z. There is an
SL2 (k) action on the set of all quartic forms:
α β
· U (X, Z) = U (αX + βZ, γX + δZ).
γ δ
In particular, SL2 (k) acts on k[a1 , . . . , a5 ], and it turns out that the invariant subring is of
the form k[a1 , . . . , a5 ]SL2 (k) ∼
= k[I, J] for two invariant generators I, J. If V is the space of
all quartic forms, these define maps I, J : V → k which are equivariant:
E : y 2 = 4x3 − Ix − J,
J2
with j-invariant j(E) = .
I3
442
Chapter 25
Now that we understand Etors (Q), our goal is to prove Mordell’s theorem that E(Q) is finitely
generated. Our strategy is as follows, and will take the entirety of Chapter 5 to describe.
(1) (Weak Mordell-Weil Theorem) Show that E(Q)/mE(Q) is finitely generated for m > 1.
This is achieved by constructing a certain short exact sequence
where Sel(m) (E/Q) is a finite group called the Selmer group and X(E/Q) is the Tate-
Shafarevich group.
(2) Use height functions to construct a function ĥ : E(Q) → R≥0 which satisfies
(i) For all B > 0, the set {P ∈ E(Q) : ĥ(P ) < B} is finite.
(ii) ĥ([m]P ) = m2 ĥ(P ) for all m ∈ Z.
(iii) ĥ is a quadratic form, and thus there is a pairing
(3) Combining the weak Mordell-Weil theorem and height functions gives a proof that E(Q)
is finitely generated.
443
25.1. Some Galois Cohomology Chapter 25. The Mordell-Weil Theorem
where the inverse limit is over all finite extensions L/k. Let A be an abelian group with the
discrete topology and suppose G acts on A continuously. Specifically, for each σ ∈ G there
is a map A → A, a 7→ aσ , which satisfies
(i) a1 = a for all a ∈ A.
Example 25.1.1. The key situation for our purposes is when G = Gk is the absolute Galois
group of a field k and A = E(k̄) is the points of an elliptic curve over the algebraic closure,
with the continuous action described in Section 21.1 (for any variety). In particular, for any
P ∈ E(k̄), StabG (P ) = Gal(k̄/k(P )) is a finite index subgroup, where k(P ) is the field of
definition of P . In this situation, the fixed points of the Galois action are just the k-rational
points of E: E(k̄)G = E(k).
In general, the assignment A 7→ AG is a functor from the category of G-modules to the
category of abelian groups, called the invariant functor.
Lemma 25.1.2. A 7→ AG is a left exact functor, meaning for every short exact sequence of
G-modules 0 → A → B → C → 0, there is an exact sequence 0 → AG → B G → C G .
[m]
Example 25.1.3. Consider the short exact sequence 0 → E[m] → E −−→ E → 0. Then
applying the invariant functor (−)G , where G = Gk , fails to preserve exactness on the right.
Definition. The ith group cohomology of G with coefficients in a G-module A is the ith
right derived functor of the invariant functor:
444
25.1. Some Galois Cohomology Chapter 25. The Mordell-Weil Theorem
(2) For any short exact sequence of G-modules 0 → A0 → A → A00 → 0, there is a long
exact sequence in cohomology
where Z i (G, A) are the i-cocycles, or maps G×· · ·×G → A satisfying a certain combinatorial
condition (e.g. for ξ : G → A, the cocycle condition is that ξστ = (ξσ )τ +ξτ for any σ, τ ∈ G),
and B i (G, A) are the i-coboundaries, i.e. the cocycles of the form ξ : σ 7→ aσ − a for some
a ∈ A.
For a closed subgroup H ≤ G, any G-module A is also an H-module by restricting the
G-action to H. This determines a map called restriction:
On 0th cohomology, this is just given by AG ,→ AH . On the other hand, for a normal,
finite-index subgroup H ≤ G, the quotient G/H is a finite group and AH has the structure
of a G/H-module. This allows one to define an induced map called inflation:
445
25.1. Some Galois Cohomology Chapter 25. The Mordell-Weil Theorem
Proof. The first statement is Hilbert’s Theorem 90 (Theorem 17.7.5). For the second state-
ment, consider the short exact sequence
[m]
1 → µm → Gm −−→ Gm → 0.
446
25.2. Selmer and Tate-Shafarevich Groups Chapter 25. The Mordell-Weil Theorem
We will construct the Selmer group as a subgroup of H 1 (K, A[ϕ]), avoiding the obstacles of
working with the infinite group H 1 (K, A[ϕ]). Notice that when A = B = E and ϕ = [m],
the first term in this sequence is E(K)/mE(K), sometimes called the weak Mordell-Weil
group.
If P ∈ B(K), choose Q ∈ A(K) with ϕ(Q) = P . Then the image of P under δ :
B(K)/ϕA(K) → H 1 (K, A[ϕ]) is the cocycle ξ = δ(P ) : σ 7→ ξσ = Qσ − Q.
Example 25.2.1. In the case A = B = E, suppose ϕ = [m] where E[m] ⊆ E(K). Then by
Proposition 24.2.7 and Proposition 25.1.8,
Lemma 25.2.2. Let K be a number field and v a place of K. Then for any isogeny of
elliptic curves ϕ : A → B over K, there is a commutative diagram
Resv
Since we have such a diagram for every place of K, we can take the product over all
places of K to obtain a commutative diagram
δ
0 B(K)/ϕA(K) H 1 (K, A[ϕ]) H 1 (K, A)[ϕ] 0
α
Y Y Y
0 B(Kv )/ϕA(Kv ) H 1 (Kv , A[ϕ]) H 1 (Kv , A)[ϕ] 0
v v v
447
25.2. Selmer and Tate-Shafarevich Groups Chapter 25. The Mordell-Weil Theorem
Here the vertical arrow in the middle is given by a product of local restrictions: ξ 7→ (ξv )v .
Let ξ ∈ δ(B(K)). Then ξv must lie in δ(B(Kv )) for each place v. This puts a condition on
the cocycles in the image of δ; define
and set HL1 (K, A[ϕ]) = v Lv . Then we see that δ(B(Kv )) ⊆ HL1 (K, A[α]).
T
where α : H 1 (K, A[α]) → v H 1 (Kv , A)[ϕ] is the product of the local restriction maps.
Q
The key observation is that im δ ⊆ Sel(ϕ) (A/K), so in order to prove the weak Mordell-
Weil theorem, it will be enough to show that the Selmer group is finite. The cokernel of the
map δ : B(K)/ϕA(K) → Sel(ϕ) (A/K) has an important role as well.
Definition. The Tate-Shafarevich group of ϕ : A → B is the group
!
Y
1 1
X(A/K) := ker Res : H (K, A) → H (Kv , A) .
v
Applying the Snake Lemma gives the desired short exact sequence.
Fix a place v of K and let Kvur be the maximal unramified extension of the completion
Kv , so that Gal(K v /Kvur ) = Iv , the inertia group of Kv . Set Gv = Gal(K v /Kv ). For any
Gv -module A, we have a map
Res
H 1 (Kv , A) −−→
v
H 1 (Kvur , A) ∼
= H 1 (Iv , A).
1 1
Denote by Hur (Kv , A) the kernel of this map. Elements of Hur (Kv , A) are called unramified
1 1
cocycles; for an element ξ ∈ H (K, A), we say ξ is unramified at v if ξv ∈ Hur (Kv , A).
448
25.2. Selmer and Tate-Shafarevich Groups Chapter 25. The Mordell-Weil Theorem
Proof. Let ξ ∈ Sel(ϕ) (A/K) and fix a place v 6∈ S. By definition of the Selmer group,
ξv = 1 in H 1 (Kv , A)[ϕ], so by the exact sequence in Lemma 25.2.2, ξv = δ(P ) for some
point P ∈ B(Kv ). Explicitly, δ(P ) = ξ, where ξ : σ 7→ Qσ − Q for some Q ∈ A(Kv ) with
ϕ(Q) = P . Since v 6∈ S, A has good reduction at v, so in the residue field kv = Ov /mv , the
reduction of ξσ = Qσ − Q for any σ ∈ Iv is give by
ξ¯σ = Qσ − Q = Qσ − Q = (Q)σ − Q = Q − Q = 0
since σ ∈ Iv acts trivially on kv . This shows that ξσ ∈ A(1) (Kv )[ϕ] ⊆ A(1) (Kv )[m], where
deg ϕ = m. Further, since A has good reduction at v and v - m, then by Theorem 24.4.9,
e v ) is an injection. Hence ξ¯v = 0 in A(k
A(Kv )[m] ,→ A(k e v ) implies ξσ = 0 in A(Kv ). Thus
we have shown ξσ is trivial for all σ ∈ Iv , i.e. ξ is unramified at every v 6∈ S. Hence
Sel(ϕ) (A/K) ⊆ HS1 (K, A[ϕ]).
Proposition 25.2.5. Let S be a finite set of places of K and let M be any finite abelian
GK -module. Then HS1 (K, M ) is finite.
Proof. Since M is finite and GK acts continuously on m, there exists an open subgroup of
finite index in GK that fixes every element of M . Such a subgroup corresponds, by infinite
Galois theory, to an extension K 0 /K. For this extension, we have an inflation-restriction
sequence (Theorem 25.1.6):
Since M is finite, HS1 (K 0 , M GK 0 ) is finite, so it’s enough to show HS1 (K 0 , M ) is finite to imply
that HS1 (K, M ) is finite.
By definition, K 0 is the extension of K for which GK 0 acts trivially on M , so after replacing
K with K 0 , we may assume M is in fact a trivial GK -module. Also assume µn ⊆ K for some
n. Since GK acts trivially on M , we have that HS1 (K, M ) = HomScts (GK , M ). However,
such homomorphisms are in correspondence with abelian extensions of K of exponent m
which are unramified outside S. By Lemma 25.2.6 below, there are finitely many of these,
so HS1 (K, M ) is finite.
Lemma 25.2.6. Let K be a number field and M a finite abelian GK -module. If m is the
exponent of M (i.e. the smallest integer such that mx = 0 for all x ∈ M ), and L/K is
the maximal abelian extension of exponent m which is unramified outside S, then [L : K] is
finite.
449
25.2. Selmer and Tate-Shafarevich Groups Chapter 25. The Mordell-Weil Theorem
n
Proof. Assume µn ⊆ K. By Kummer theory, the short exact sequence 1 → µn → K × →
−
×
K → 0 induces a long exact sequence
n
0 → µn → K × →
− K × → H 1 (K, µn ) → H 1 (K, K × ) = 0
δ : K × /(K × )n −→ H 1 (K, µn )
σ(β)
α 7−→ ξ : σ 7→ β where β n = α.
By algebraic number theory, there are finitely many degree d extensions L/K unramified
outside S for any given d > 0. Further, by Dirichlet’s S-unit theorem, OS× is a finitely
generated abelian group of rank r(S) = r + s − 1 + #S, where r and s are, respectively, the
numbers of real and complex embeddings of Q in K. By class field theory, the class group
C(OS ) is finite and generated by some fractional ideals a1 , . . . , an . Adding all the primes
dividing the aj to S, we get a finite set of places S 0 for which C(OS 0 ) = 1. Therefore we may
assume from the start that OS is a PID.
With these reductions, we will now prove L/K is finite. In fact, we will show
(1) L = K(α1/m | α ∈ OS× )
(2) Gal(L/K) ∼
= (Z/mZ)r(S)+1 .
By Kummer theory, the maximal abelian extension of K with exponent m is K(α1/m | α ∈
K × ). Thus L ⊆ K(α1/m | α ∈ K × ). Let L0 = K(α1/m | α ∈ OS× ). We want to show L0 = L.
First, for any α ∈ K × and place v for which v(m) = 0, we claim v is unramified in K(α1/m )
if and only if ordv (α) ≡ 0 mod m. Indeed, if ordv (α) ≡ 0 mod m, then α = uπvrm for
some u ∈ Ov× and r ∈ Z. Then Kv (α1/m ) = Kv (u1/m ) so u1/m satisfies xm − u = 0. This
polynomial has discriminant ∆ = mm um−1 , so in particular v(∆) = 0 and thus v is unramified
in K(α1/m ). Conversely, if v is unramified in K(α1/m ) then v(K(α1/m )× ) = v(K × ) = Z. So
if α = uπvr then m | r and hence ordv (α) = r ≡ 0 (mod m). Thus the claim holds.
The paragraph above shows that L is the compositum of all K(α1/m ) for α ∈ K × with
ordv (α) ≡ 0 mod m for all v 6∈ S. That is, for all v 6∈ S, ordv (α) = rv m for some rv ∈ Z.
Take such an α ∈ K × and v 6∈ S and let pv be the corresponding prime of OS . By our
reductions, OS is a PID, so Y
prvv = (β)
v6∈S
450
25.2. Selmer and Tate-Shafarevich Groups Chapter 25. The Mordell-Weil Theorem
for some β ∈ K. Then α0 = αβ −m ∈ OS× and K(α1/m ) = K((α0 )1/m ) ⊆ L0 . This holds for
all α ∈ K × , so L ⊆ L0 . On the other hand, L0 ⊆ L is obvious so we get L0 = L and (1) is
proven.
For (2), apply Dirichlet’s S-unit theorem to get
where the extra copy of Z/mZ comes from the torsion part since µm ⊆ K.
Remark. Consider the situation when M = A[m] and A[m] ⊆ A(K). As in the proof of
Lemma 25.2.6, we may assume µm ⊆ K and that OS is a PID. Then
so |HS1 (K, A[m])| = m2(1+r(S)) . On the other hand, #A(K)/[m]A(K) = m2(1+r(A)) and since
there is an embedding A(K)/[m]A(K) ,→ HS1 (K, A[m]), we get a bound on the rank of the
elliptic curve A:
r(A) ≤ 2r(S) = 2(r + s − 1 + #S).
Corollary 25.2.7. For any isogeny of elliptic curves ϕ : A → B over a number field K, the
Selmer group Sel(ϕ) (A/K) is a finite group.
Corollary 25.2.8 (Weak Mordell-Weil Theorem). For any elliptic curve E over Q, E(Q)/mE(Q)
is finite for all m ≥ 2.
called Cassel’s pairing, whose kernel consists of divisible elements. As a result, one obtains
the following useful fact:
Theorem 25.2.9. For any elliptic curve E, the order of X(E/Q) is divisible by 2.
451
25.3. Twists, Covers and Homogeneous Spaces Chapter 25. The Mordell-Weil Theorem
Example 25.3.1. By Proposition 20.2.7 (or 22.6.5), every conic in P2 is isomorphic over k̄
to P1 , but is only isomorphic over k if it has a k-point. Therefore Twist(P1 /k) is the set of
conics in P2 .
The next result is a sort of “meta-proposition” about twists of algebro-geometric objects.
One can repeat the proof in any specific category of algebro-geometric objects to obtain a
bijection between the twists and the given cohomology set.
Proposition 25.3.2. Let X be an algebro-geometric object over a field k. Then there is a
bijection H 1 (k, Autk̄ (X)) ∼
= Twist(X/k).
Proof. Given Y ∈ Twist(X/k), there is an isomorphism ϕ : Y → X defined over k̄. Then
each σ ∈ Gk acts on ϕ in the natural way, and
Then the coset space Y := X(L)/ Gal(L/k) is an object defined over k of the same type as
X that is isomorphic to X over k̄, hence a twist of X over k. It is easy to check that the
assignments are inverses of each other.
Definition. Let A be an algebraic group over a field k. A principal homogeneous space
(or PHS) for A is a variety X over k equipped with a simply transitive action of A as an
algebraic group action over k. In other words, there is a morphism
µ : X × A −→ X, (x, P ) 7→ x P
satisfying
452
25.3. Twists, Covers and Homogeneous Spaces Chapter 25. The Mordell-Weil Theorem
θx0 : A −→ X, P 7→ x0 P
Proof. Let X be a twist of A, with isomorphism θ = θx0 : A → X defined over k̄. Then for
any x ∈ X and P ∈ A,
Lemma 25.3.4. Given an isomorphism θ = θx0 : A → X over k̄, there is a subtraction map
Definition. Two principal homogeneous spaces (X, µ) and (X 0 , µ0 ) of A over k are isomor-
phic over k if there exists an isomorphism i : X → X 0 defined over k such that the following
diagram commutes:
µ
X ×A X
i×1 i
X0 × A X0
µ0
There is a related notion of a “torsor” for A, which turns out to be equivalent to the
definition of a PHS of A.
Definition. A torsor for A over k is a pair (X, θ) where X is an algebraic variety over k
and θ : A → X is an isomorphism defined over k̄.
Definition. Two torsors (X, θ) and (X 0 , θ0 ) for A over k are isomorphic as torsors if
there exists an isomorphism of varieties i : X → X 0 defined over k and a point P ∈ A such
that the following diagram commutes:
453
25.3. Twists, Covers and Homogeneous Spaces Chapter 25. The Mordell-Weil Theorem
θ
A X
τP i
A X0
θ0
τP id
A X
θy0
W C(A/k) −→ H 1 (k, A)
(X, µ) 7−→ (ξ : σ 7→ xσ0 x0 )
Recall that when A is an elliptic curve over a number field K, X(A/K) ⊆ H 1 (K, A) and
elements of X(A/K) are those cocycles ξ ∈ H 1 (K, A) such that ξv ∈ H 1 (Kv , A) is trivial
454
25.3. Twists, Covers and Homogeneous Spaces Chapter 25. The Mordell-Weil Theorem
for each place v of K. Interpreting each H 1 (Kv , A) as W C(A/Kv ), the restriction map is
given by
Y
W C(A/K) −→ W C(A/Kv )
v
Y
X/K −
7 → (X/Kv ).
v
On the other hand, recall that Sel(ϕ) (A/K) ⊆ H 1 (K, A[ϕ]). By Proposition 25.3.2,
H 1 (K, A[ϕ]) can be viewed as the set of twists of A with automorphism group isomorphic
to A[ϕ]. This naturally leads to the idea of twists of an isogeny, also known as ϕ-covers.
α id
A ϕ B
i id
C0 B
π0
455
25.3. Twists, Covers and Homogeneous Spaces Chapter 25. The Mordell-Weil Theorem
δ
0 B(K)/ϕA(K) H 1 (K, A[ϕ]) W C(A/K) 0
π
[C →
− B] [C]
π π
Proposition 25.3.9. If C → − B is a ϕ-cover and there is a point x ∈ C(K), then [C →
−
B] = δ(P ) for P = π(x) ∈ B(K).
456
25.4. Descent Chapter 25. The Mordell-Weil Theorem
25.4 Descent
The goal of descent is to construct torsion elements of the Tate-Shafarevich group X(A/K)
and lift them to generators of B(K)/ϕA(K). We will describe this construction in the
relatively tractable case of 2-torsion elements of an elliptic curve. The general procedure can
be found in Silverman and in Cremona’s “Higher Descent on Elliptic Curves”.
Let E be an elliptic curve with a rational 2-torsion point P ∈ E(K); then hP i is a
subgroup of order 2 in E(K). We can construct a 2-isogeny of E as follows. Change
coordinates of E to move P to the point (0, 0). Then E is given by the Weierstrass form
E : y 2 = x(x2 + ax + b).
E 0 : y 2 = x(x2 + a0 x + b0 )
ϕ : E −→ E 0
2
y y(b − x2 )
(x, y) 7−→ , .
x2 x2
Lemma 25.4.1. If ϕ : E → E 0 is an isogeny, then E and E 0 have good/bad reduction at the
same primes.
Proof. (Move?) Silverman VII.7.2.
Let S be the set of primes of bad reduction for E and E 0 ; that is,
Set
K(S, 2) = {β ∈ K × /(K × )2 : ordv (β) ≡ 0 mod 2 for all v 6∈ S}.
Then E[ϕ] = {(0, 0), O} ∼
= µ2 as a Galois module, so by Kummer theory, there is a bijection
Z : K(E) −→ K(E)ξ
f 7−→ Z(f )
action of GK defined above, is a function field. Let Cβ be the corresponding curve (by
457
25.4. Descent Chapter 25. The Mordell-Weil Theorem
Proposition 22.2.2). Looking at the addition formula (Proposition 23.3.2) for E, one can
compute the translation map τP = τ(0,0) to be
b by
τP (x, y) = ,− .
x x2
√ √ √
Let L = K( β), so that GL/K = hσi where σ : β 7→ − β. Then L(E)ξ = L(x, y)/(y 2 −
x(x2 + ax + b)) with
p p b by
( β)σ = − β, xσ = and y σ = − .
x x2
√
βx √ 2
Observe that z = y
and w = β x − xb xy are GL/K -invariant and satisfy the equation
θ : E r {(0, 0), O} −→ Cβ
√ 2 !
βx p b x
(x, y) 7−→ (z, w) = , β x− .
y x y
x xy y
Since y
= y2
= x2 +ax+b
,
this can be extended to all points Q ∈ E by
√ √
β(x2 − b)
βy
x2 + ax + b , x2 + ax + b , Q 6= (0, 0), O
θ(Q) = √
(0, − β), Q = (0, 0)
(0, √β),
Q = O.
α : Cβ −→ E
√ √
βw − az 2 + β βw − a βz 2 + β β
√
(z, w) 7−→ , .
2z 2 2z 3
Thus θ and α are isomorphisms.
Now consider the diagram
π
Cβ E0
α id
E ϕ E0
458
25.4. Descent Chapter 25. The Mordell-Weil Theorem
β βw
where π is given by (z, w) 7→ 2
, − 3 . Then π = ϕ ◦ α so π : Cβ → E 0 is a ϕ-cover.
z z
π
− E 0 is ξ(β).
Lemma 25.4.2. The cocycle associated to Cβ →
Now recall that the connecting morphism δ : E 0 (K)/ϕE(K) → Sel(ϕ) (E/K) is given by
δ(P 0 ) : σ 7→ Qσ − Q where ϕ(Q) = P 0 . Note that ϕ(O) = O, so when P 0 = O, δ(O) : σ 7→ O
and thus 1 ∈ K(S, 2). If P 0 = P= (0, 0), the2-torsion point, then Q must have y = 0 and
√
2
x a root of x2 + ax + b, so Q = −a± 2a −4b , 0 . This implies
( √
σ O, if σ acts trivially on a2 − 4b
ξ(β)ξ = Q − Q =
(0, 0), otherwise.
From this, we see that β = a2 − 4b, so δ(P ) = β ∈ K(S, 2). Finally, for P 0 = (x, y) 6= (0, 0),
π
one can show that δ(P 0 ) = δ(x, y) = x. These explicit ϕ-covers Cβ → − E 0 allow us to pull
back to generators of E 0 (K)/ϕE(K), as demonstrated in the next examples.
E : y 2 = x3 − 6x2 + 17x.
Our goal is to compute E(Q)/2E(Q). First, ∆ = −147968 = −29 · 172 , so S = {∞, 2, 17}
and Q(S, 2) = {±1, ±2, ±17, ±34}. The above formulas for E 0 and the ϕ-covers Cβ give the
following curves:
E 0 : y 2 = x3 + 12x2 − 32x
Cβ : βw2 = β 2 + 12βz 2 − 32z 4 , β ∈ Q(S, 2).
Notice that δ(0, 0) = a2 − 4b = −32 ≡ −2 mod (Q× )2 so the ϕ-cover C−2 is the image
under δ of (0, 0). Hence [C−2 ] is trivial in X(E/Q)[ϕ]. (In particular, this shows that E has
a point over Q!)
For β = 2, we get the ϕ-cover
C2 : w2 = 2 + 3t2 − t4 .
1
Notice that (t, w) = (1, 2) is a point on C 2 , corresponds to a point (z, w) = 2
, 2 on E, and
1 0
hence π 2 , 2 = (8, −32) ∈ E (Q). Once again, by Proposition 25.3.9, [C2 ] is trivial in the
Tate-Shafarevich group.
Next, let β = 17. The corresponding ϕ-cover is
459
25.4. Descent Chapter 25. The Mordell-Weil Theorem
Here we show that [C17 ] 6∈ Sel(ϕ) (E/Q). Suppose to the contrary that there exists a point
(z, w) ∈ C17 (Q17 ). Then ord17 (17w2 ) is odd and ord17 (32z 4 ) is even, which implies that
ord17 (172 + 12 · 17z 2 − 32z 4 ) 6= ord17 (32z 4 ) = 4 ord17 (z). On the other hand,
and the only way this is possible is if ord17 (z) > 0. However, this contradicts the defining
equation for C17 . Hence C17 (Q17 ) = ∅, so by Theorem 25.3.7, [C17 ] 6∈ Sel(ϕ) (E/Q). Further,
since Sel(ϕ) (E/Q) is a group, we must have [C−17 ], [C34 ], [C−34 ] 6∈ Sel(ϕ) (E/Q) as well. We
have therefore shown that
Observe that if β < 0, Cβ0 (R) = ∅ since the signs don’t alternate. Also, δ(0, 0) = 272 =
24 · 17 ≡ 17 mod (Q× )2 so C170
is the image of (0, 0) ∈ E 0 (Q)/ϕE(Q)
b under δ. Lastly, for
β = 2, we have
C20 : 2w2 = 4 − 12t + 17t4
(with t = 2z). A similar proof as above shows that C20 (Q2 ) = ∅, so [C20 ] 6∈ Sel(ϕ)
b
(E 0 /Q). In
all, this shows that
Sel(ϕ)
b
(E 0 /Q) = {C1 , C17 } ∼
= {1, 17},
but C1 and C17 are images under δ of the points O and (0, 0), respectively, so X(E 0 /Q)[ϕ]b =0
in this case.
Let’s put this together to determine the weak Mordell-Weil group E(Q)/2E(Q). From
above, E 0 (Q)/ϕE(Q) ∼ = Z/2Z × Z/2Z, where the generators are (0, 0) and (8, −32). On
the other hand, the previous paragraph implies that E(Q)/ϕE b 0 (Q) ∼
= Sel(ϕ)b
(E 0 /Q) ∼
= Z/2Z,
with explicit generator (0, 0). The composition ϕ ◦ ϕ
b = [2] gives us an exact sequence
460
25.4. Descent Chapter 25. The Mordell-Weil Theorem
In the above example, we were able to determine X(E/Q)[ϕ] = 0 and X(E 0 /Q)[ϕ] b =0
and use this to deduce E(Q)/2E(Q), and ultimately E(Q). However, sometimes one may
discover a ϕ-cover Cβ not mapping to the trivial class in X(E/Q)[ϕ]. In such a situation,
one may require a method known as ‘second descent’ (cf. Cremona’s paper entitled “Higher
Descents on Elliptic Curves”). Let ϕ : A → B and ϕ b : B → A be dual isogenies such that
ϕ◦ϕ b = [m]. Then we have a commutative diagram with exact rows and columns:
0 0 0 0
0 H A(Q)/ϕB(Q)
b B(Q)/mB(Q) B(Q)/ϕA(Q) 0
0 H Sel(ϕ)
b
(B/Q) Sel(m) (B/Q) Sel(ϕ) (A/Q) 0
0 X(B/Q)[ϕ]
b X(B/Q)[m] X(A/Q)[ϕ] 0
0 0 0
(Here, H = B(Q)[ϕ]/ϕ(A(Q)[ϕ]).)
b Take C ∈ Sel(ϕ) (A/Q) and use exactness of the middle
row to find a lift D ∈ Sel(m) (B/Q); then these are ϕ- and ϕ-covers,
b respectively:
$ π
D C B
∼
= ∼
= id
B A ϕ B
ϕ
b
Such a D is called a descendant of C. The key insight is that a point on D (over any field,
but in particular over local fields) gives a point on C via $. In general, points on D will
have smaller height than those on C (see Section 25.5), so it will be easier in theory to find
points on D.
If points cannot be found on D, replace ϕ with [m], ϕ b with [m] and m = deg ϕ with
m2 = deg[m] and repeat the argument. In principle, this can be repeated indefinitely.
However, each step yields an exact sequence:
461
25.4. Descent Chapter 25. The Mordell-Weil Theorem
X(A/Q) is not infinitely m-divisible. It is conjectured that this is true for all elliptic curves,
but has not been proven. Thus it is believed that the descent procedure always terminates
in a finite number of steps. (In fact, the Birch-Swinnerton-Dyer Conjecture would imply
that the Tate-Shafarevich group is always finite, in which case descent always terminates.)
E : y 2 = x3 + Dx
be the congruent number elliptic curve (see Section 20.1). For simplicity, we will assume
D = p, a prime number congruent to 1 mod 8. Then ∆E = −4p3 and S = {∞, 2, p}, so
Q(S, 2) = {±1, ±2, ±p, ±2p}. One can show using normal means that Etors (Q) = h(0, 0)i ∼ =
Z/2Z. Further, we have the following formulas for the ϕ- and ϕ-covers
b in the Selmer groups:
For the 2-torsion point P = (0, 0), notice that δ(P ) = −4p3 ≡ −p mod (Q× )2 and δ(P
b )≡p
× 2 (ϕ) 0 (ϕ) 0
mod (Q ) . So C−p ∈ Sel (E/Q) and Cp ∈ Sel (E /Q). Also, if β < 0, the coefficients in
b
the second equation above fail to alternate, so Cβ0 (R) = ∅. Consider the ϕ-cover
b for β = 2:
C20 : 2w2 = 4 + pz 4 .
Over Q2 , any point (z, w) must then satisfy 1 + 2 ord2 (w) ≥ min{2, 4 ord2 (z)}, but 2 and
4 ord2 (z) are both even and never equal, so the inequality is an equality. However, 1 +
2 ord2 (w) is odd, so this is impossible. Hence C20 (Q2 ) = ∅, and thus C20 6∈ Sel(ϕ)
b
(E 0 /Q). We
have now shown that Sel(ϕ) b
(E 0 /Q) = {1, p}.
(ϕ)
To finish computing Sel (E/Q), we have
C−1 : w2 + 64 = pz 4 .
Then (1, 1) is a solution mod 8 and satisfies Hensel’s criterion, so C−1 (Q2 ) 6= ∅. This proves
C−1 ∈ Sel(ϕ) (E/Q).
Now for β = −2, the cover is given by
Over Fp , the equation becomes w2 + 2 = 0 which again has a solution since p ≡ 1 (mod 8).
As above, one can check that the point is nonsingular and then lift it to a point of C−2 (Qp ).
Likewise, the proof that C−2 (Q2 ) is nonempty is similar.
462
25.4. Descent Chapter 25. The Mordell-Weil Theorem
The above work shows that Sel(ϕ) (E/Q) = {±1, ±2, ±p, ±2p}. Now consider the se-
quences
E 0 (Q)[ϕ] E 0 (Q) E(Q) E(Q)
0→ → → → →0
b
(A)
ϕ(E(Q)[ϕ]) ϕE(Q) 2E(Q) b 0 (Q)
ϕE
E 0 (Q)
0→ → Sel(ϕ) (E/Q) → X(E/Q)[ϕ] → 0 (B)
ϕE(Q)
0 → X(E/Q)[ϕ] → X(E/Q)[2] → X(E 0 /Q)[ϕ] b → 0. (C)
The terms in all three sequences are F2 -vector spaces, so we can add dimensions as follows:
0 0
E (Q)[ϕ]b E(Q) E (Q) E(Q)
dim + dim = dim + dim
ϕ(E(Q)[2]) 2E(Q) ϕE(Q) b 0 (Q)
ϕE
= dim Sel(ϕ) (E/Q) − dim X(E/Q)[ϕ]
+ dim Sel(ϕ)
b
(E 0 /Q) − dim X(E 0 /Q)[ϕ]
b
(where dim = dimF2 ). On the other hand, E(Q)/2E(Q) = (Z/2Z)1+rank(E) , by the proof
of Lemma 25.2.6, and since Etors (Q) = Z/2Z, we must have Z/2Z ⊆ E(Q)/ϕE b 0 (Q). By
(ϕ)
b 0 (Q) injects into Sel (E 0 /Q) = {1, p}, so we must have
sequence (B) however, E(Q)/ϕE b
463
25.4. Descent Chapter 25. The Mordell-Weil Theorem
These imply u and v are both even, so r is even and therefore so is t. But this contradicts
the assumption that r and t are coprime. Hence C−1 (Q) = ∅.
464
25.5. Heights Chapter 25. The Mordell-Weil Theorem
25.5 Heights
Fix an elliptic curve in short Weierstrass form
E : y 2 = x3 + Ax + B, A, B ∈ Z.
H(P ) = max{|xi | : 0 ≤ i ≤ N }
h : PN
Q −→ R≥0 .
for any P = [x0 , . . . , xN ] ∈ PN (K), where the sum is over all valuations v on K and |x|v =
(#OK /pv )− ordv (x) is the normalized pv -adic valuation on K.
Proposition 25.5.1. Let E be an elliptic curve over Q and fix P0 ∈ E(Q). Then
(1) There is some constant C1 , which depends on P0 , A and B, such that h(P + P0 ) ≤
2h(P ) + C1 for all P ∈ E(Q).
(2) There is some constant C2 , which depends only on A and B, such that h([2]P ) ≥
4h(P ) − C2 for all P ∈ E(Q).
Proof. Silverman.
Definition. The canonical height function for any extension K/Q is defined for a point
P ∈ PN
Q (K) by
1
ĥ(P ) := lim n h([2n ]P ).
n→∞ 4
Proposition 25.5.2. The canonical height function for any elliptic curve E satisfies
465
25.5. Heights Chapter 25. The Mordell-Weil Theorem
(i) For all B > 0, the set {P ∈ E(Q) : ĥ(P ) < B} is finite.
(iii) The pairing hP, Qi = 21 (ĥ(P + Q) − ĥ(P ) − ĥ(Q)) is symmetric and bilinear.
Proof. Silverman.
We are now prepared to give the proof the full Mordell-Weil theorem using the weak
version (Corollary 25.2.8) and heights.
Theorem 25.5.3 (Mordell-Weil). For every elliptic curve E over Q, the group E(Q) is
finitely generated.
So |Q1 | ≤ |Q0 |. Now repeat: either Q1 ∈ S or |Q1 | > c0 . In the latter case, Q1 = Pi2 + mQ2
for Pi2 , Q2 ∈ E(Q) satisfying |Q2 | ≤ |Q1 | ≤ |Q0 |. Now, by Proposition 25.5.2(i), the set
{P ∈ E(Q) : |P | ≤ |Q0 |} is finite, so this descent process must terminate. This shows that
Q0 is a sum of elements of S, so S generates E(Q) and the theorem is proven.
466
Chapter 26
In this chapter we review the classical theory of complex algebraic curves, starting with the
construction and basic properties of elliptic functions, their connection to elliptic curves and
their Jacobians, and then describing the construction in arbitrary dimension.
467
26.1. Elliptic Functions Chapter 26. Elliptic Curves and Complex Analysis
ω1
ω2
Π
Lemma 26.1.1. For any choice of basis [ω1 , ω2 ] of Λ, Π(ω1 , ω2 ) is fundamental for Λ.
Lemma 26.1.2. Let Λ be a lattice. Then
(a) If Π is the fundamental domain of Λ, then for any α ∈ C, Πα := Π + α is fundamental
for Λ.
[
(b) If Φ is fundamental for Λ, then C = Φ + `.
`∈Λ
Corollary 26.1.3. Suppose f is an elliptic function with lattice of periods Λ and Φ funda-
mental for Λ. Then f (C) = f (Φ).
468
26.1. Elliptic Functions Chapter 26. Elliptic Curves and Complex Analysis
γ4 Πα γ2
γ1
α
R R R R
We show that γ1 f (z) dz+ γ3 f (z) dz = 0 and leave the proof that γ2 f (z) dz+ γ2 f (z) dz = 0
for exercise. Consider
Z Z Z 1 Z 1
f (z) dz + f (z) dz = f (α + tω1 )(ω1 dt) + f (α + (1 − t)ω1 + ω2 )(−ω1 dt)
γ1 γ3 0 0
Z 1 Z 0
= ω1 f (α + tω1 ) dt + ω1 f (α + sω1 ) ds since f is elliptic
0 1
Z 1 Z 1
= ω1 f (α + tω1 ) dt − f (α + sω1 ) ds = 0.
0 0
469
26.1. Elliptic Functions Chapter 26. Elliptic Curves and Complex Analysis
Corollary 26.1.6. Any elliptic function has either a pole of order at least 2 or two poles on
the fundamental domain of its lattice of periods.
Proposition 26.1.7. Suppose f is an elliptic function with fundamental domain Π and
n
α ∈ C such that ∆ = ∂Πα does not contain any zeroes or poles of f . LetPn {aj }j=1 be a finite
set of zeroes and poles in Πα , with mj the order of the pole aj . Then j=1 mj = 0.
Proof. For a pole z0 , we can write f (z) = (z − z0 )m g(z) for some holomorphic function g(z),
with g(z0 ) 6= 0. Then
f 0 (z) g 0 (z)
−1
= (z − z0 ) m + (z − z0 ) .
f (z) g(z)
0
Hence Res ff ; z0 = m. Then the statement follows from Proposition 26.1.5.
There are obvious problems of convergence and (in a related sense) the order of summation.
It turns out we can do this construction with f (z) = z1m , m ≥ 3 though. First, we need the
following result from complex analysis, which can be proven using Cauchy’s integral formula
and Morera’s theorem.
Lemma 26.1.8. Let U ⊆ C be an open set and suppose (fn ) is a sequence of holomorphic
functions on U such that fn → f uniformly on every compact subset of U . Then f is
holomorphic on U and fn0 → f 0 uniformly on every compact subset of U .
470
26.1. Elliptic Functions Chapter 26. Elliptic Curves and Complex Analysis
Proposition 26.1.9. Let Λ be a lattice with basis [ω1 , ω2 ]. Then the sum
X 1
|ω|s
ω∈Λr{0}
Λ Λ
∆
Λ Λ
Then ∆ is compact, so there exists c > 0 such that |z| ≥ c for all z ∈ ∆. We claim that for
all m, n ∈ Z,
|mω1 + nω2 | ≥ c · max{|m|, |n|}.
The cases when m = 0 or n = 0 are trivial, so without loss of generality assume m ≥ n > 0.
Then n
|mω1 + nω2 | = |m| ω1 + ω2 ≥ |m|c.
m
Hence the claim holds. Set M = max{|m|, |n|} and arrange the sum in question so that the
1
|ω|s
are added in order of increasing M values. Then the sum can be estimated by
∞ ∞
X 1 X 8M X 1
s
≤ s s
∼ .
|ω| M =1
cM M =1
M s−1
ω∈Λr{0}
Then Fn (z) is holomorphic on C r Λ and has poles of order n at the points of Λ. Moreover,
Fn is doubly periodic and hence elliptic.
Proof. Fix r > 0 and let Br = Br (0) be the open complex r-ball centered at the origin in C.
Let Λr = Λ ∩ B r be the lattice points contained in the closed r-ball. Then the function
X 1
Fn,r (z) =
ω∈ΛrΛr
(z − ω)n
471
26.1. Elliptic Functions Chapter 26. Elliptic Curves and Complex Analysis
1 C
is holomorphic on Br . To see this, one has |z−ω| n ≤ |ω|n for some constant C and for all
since the series is absolutely convergent and we can rearrange the terms.
This shows that elliptic functions exist and more specifically that for each n ≥ 3, there
is at least one elliptic function of order n. Unfortunately the previous proof won’t work
to construct an elliptic function of order 3. However, Weierstrass discovered the following
elliptic function.
Definition. The Weierstrass ℘-function for a lattice Λ is defined by
1 X 1 1
℘(z) = 2 + − .
z (z − w)2 ω 2
ω∈Λr{0}
Theorem 26.1.11. For any lattice Λ, ℘(z) is an elliptic function with poles of order 2 at
the points of Λ and no other poles. Moreover, ℘(−z) = ℘(z) and ℘0 (z) = −2F3 (z).
Proof. (Sketch) To show ℘(z) is meromorphic, one estimates the summands by
1 1 D
(z − ω)2 − ω 2 ≤ |ω|3
472
26.1. Elliptic Functions Chapter 26. Elliptic Curves and Complex Analysis
Lemma 26.1.12. Let ℘(z) be the Weierstrass ℘-function for a lattice Λ ⊆ C and let Π be
the fundamental domain of Λ. Then
(1) For any u ∈ C, the function ℘(z) − u has either two simple roots or one double root
in Π.
(2) The zeroes of ℘0 (z) in Π are simple and they only occur at ω21 , ω22 and ω1 +ω
2
2
.
(3) The double roots occur exactly when ℘0 (u) = 0, so use (2).
We now prove that any elliptic function can be written in terms of ℘(z) and ℘0 (z).
Theorem 26.1.13. Fix a lattice Λ ⊆ C and let E(Λ) be the field of all elliptic functions with
lattice of periods Λ. Then E(Λ) = C(℘, ℘0 ).
Proof. Take f (z) ∈ E(Λ). Then f (−z) ∈ E(Λ) as well and thus we can write f (z) as the
sum of an even and an odd elliptic function:
(Notice that since f is even, f (a) = 0 implies f (a∗ ) = 0 as well.) Moreover, if orda f = 0 then
orda∗ f = m. Note that a = a∗ holds precisely when a is in the set Θ := 0, ω21 , ω22 , ω1 +ω
2
2
.
473
26.1. Elliptic Functions Chapter 26. Elliptic Curves and Complex Analysis
Let Z (resp. P ) be the set of zeroes (resp. poles) of f (z) in Π. Then the assignment
a 7→ a∗ is in fact an involution on Z and P , so we can write
where the Zi0 and Pi0 are the 2-element orbits of the involution and the Zj00 and Pj00 are the
1-element orbits. Of course then s, v ≤ 3. For a0i ∈ Zi0 , set orda0i f = m0i and for a00j ∈ Zj00 ,
set orda00i f = m00i , which is even. Likewise, for b0i ∈ Pi0 , set ordb0i f = n0i and for b00j ∈ Pj00 , set
ordb00i f = n00i which is even. Then we define ϕ(℘(z)) by
0 m0i 00 m00
Qr Qs
j /2
i=1 (℘(z) − ℘(ai )) j=1 (℘(z) − ℘(aj ))
ϕ(℘(z)) = Qu 0 n0i
Qv 00 nj
.
i=1 (℘(z) − ℘(bi )) j=1 (℘(z) − ℘(bj ))
474
26.2. Elliptic Curves Chapter 26. Elliptic Curves and Complex Analysis
Let ℘(z) be the Weierstrass ℘-function for Λ. Then ℘0 (z)2 is an even elliptic function, so
by Theorem 26.1.13, ℘0 (z)2 ∈ C(℘). On a small enough neighborhood around z0 = 0,
1 X 1 1
℘(z) − 2 = −
z (z − ω)2 ω 2
ω∈Λr{0}
1 1 2z 3z 2
= + + 4 + ...
(z − ω)2 ω2 ω3 ω
2
1 1 2z 3z
=⇒ − = + 4 + ...
(z − ω)2 ω 2 ω2 ω
1
P
where Gm = Gm (Λ) := ω∈Λr{0} ω m . These Gm are examples of modular forms (see Sec-
tion 32.2).
1
P
Definition. The series Gm (Λ) = ω∈Λr{0} ω m is called the Eisenstein series for Λ of
weight m.
475
26.2. Elliptic Curves Chapter 26. Elliptic Curves and Complex Analysis
u3 = ℘ ω1 +ω
2
2
are distinct roots.
Thus (x, y) = (℘(z), ℘0 (z)) determine an equation y 2 = 4x3 −g2 x−g3 which is the defining
equation for an elliptic curve E0 over C. Let E = E0 ∪ {[0, 1, 0]} ⊆ P2 be the projective
closure of E0 . Denote the point [0, 1, 0] by ∞.
Theorem 26.2.4. The map
ϕ : C/Λ −→ E(C)
(
[℘(z), ℘0 (z), 1], z ∈
6 Λ
z + Λ 7−→ ϕ(z + Λ) =
[0, 1, 0], z∈Λ
is a bijective, biholomorphic map.
Proof. Assume z1 , z2 ∈ C are such that z1 + Λ 6= z2 + Λ. Without loss of generality we may
assume z1 , z2 ∈ Π, the fundamental domain of Λ (otherwise, translate). If ℘(z1 ) = ℘(z2 ) and
℘0 (z1 ) = ℘0 (z2 ), then
with the notation of Theorem 26.1.13, we must have z2 = z1∗ 6= z1 and
thus z1 , z2 6∈ Θ = 0, ω21 , ω22 , ω1 +ω . Since ℘0 (z) is odd, we get ℘0 (z1 ) = ℘0 (z2 ) = −℘0 (−z2 ) =
2
2
−℘0 (z1 ), but this implies ℘(z1 ) = 0, contradicting z1 6∈ Θ. Therefore ϕ is one-to-one.
Next, we must show that for any (x0 , y0 ) ∈ E(C), x0 = ℘(z) and y0 = ℘0 (z) for some
z ∈ C. If ℘(z1 ) = x0 , then it’s clear that ℘0 (z1 ) = y0 or −y0 . Now one shows as in the
previous paragraph that we must have ℘0 (z1 ) = y0 .
Now consider F (x, y) = y 2 − p(x), where p(x) = 4x3 − g2 x − g3 . If (x0 , y0 ) satisfies
F (x0 , y0 ) = 0 and y0 6= 0, then ∂F ∂y
(x0 , y0 ) 6= 0 and thus the assignment (x, y) 7→ x is a local
chart about (x0 , y0 ). Likewise, (x, y) 7→ y defines a local chart about (x0 , y0 ) when x0 6= 0.
Finally, we conclude by observing that a locally biholomorphic map is biholomorphic.
476
26.2. Elliptic Curves Chapter 26. Elliptic Curves and Complex Analysis
Recall from Chapter 23 that an elliptic curve can be defined by a Weierstrass equation
E : y 2 = f (x) = ax3 + bx2 + cx + d.
X Y
This embeds into projective space via (x, y) 7→ [x, y, 1]. Setting x = Z
and y = Z
, we also
obtain a homogeneous equation for the curve:
E : ZY 2 = aX 3 + bX 2 Z + cXZ 2 + dZ 3 .
The single point at infinity, [0, 1, 0], can be studied by dehomogenizing via the coordinates
z̃ = YZ and x̃ = X
Y
, which yield
E : z̃ = ax̃3 + bx̃2 z̃ + ax̃z̃ 2 + dz̃ 3 .
We have shown that a lattice Λ ⊆ C determines elliptic functions ℘(z) and ℘0 (z) that satisfy
℘0 (z)2 = 4℘(z)3 − g2 ℘(z) − g3 and that this polynomial expression has no multiple roots.
Therefore the mapping z 7→ (℘(z), ℘0 (z)) determines a bijective correspondence C/Λr{0} →
E(C) r {∞} which can be extended to all of C/Λ → E(C) (this is Theorem 26.2.4). There
is a natural group structure on C/Λ induced from C, but what is not so obvious is that this
coincides precisely with the “chord-and-tangent” group law on E(C) from Section 23.3.
Theorem 26.2.5. The map ϕ : C/Λ → E(C) is an isomorphism of abelian groups.
Proof. Consider the diagram
ϕ×ϕ
C/Λ × C/Λ E(C) × E(C)
α β
C/Λ E(C)
ϕ
where α and β are the respective group operations. Since C/Λ × C/Λ is a topological group,
it’s enough to show the diagram commutes on a dense subset of C/Λ × C/Λ. Consider
e = {(u1 , u2 ) ∈ C2 | u1 , u2 , u1 ± u2 , 2u1 + u2 , u1 + 2u2 6∈ Λ}.
X
Then X e ∼
= C2 so X = X e mod Λ × Λ is dense in C/Λ × C/Λ. Take (u1 + Λ, u2 + Λ) ∈ X
and set u3 = −(u1 + u2 ). Then u1 + u2 + u3 = 0 in C/Λ. Set P = ϕ(u1 ), Q = ϕ(u2 ) and
R = ϕ(u3 ) ∈ E(C). By the assumptions on X, the points P, Q, R are distinct. We want to
show ϕ(u1 + u2 ) = ϕ(u1 ) + ϕ(u2 ) = P + Q. Since ℘(z) is even and ℘0 (z) is odd, we see that
ϕ(−z) = −ϕ(z) for all z ∈ C/Λ. Thus ϕ(u1 + u2 ) = −ϕ(−(u1 + u2 )) = −R so we need to
show P + Q + R = O, i.e. P, Q, R are colinear. Since u1 6= u2 , the line P Q is not vertical,
so there exist a, b such that ℘0 (ui ) = a℘(ui ) + b for i = 1, 2. Consider the elliptic function
f (z) = ℘0 (z) − (a℘(z) + b).
Then on the fundamental domain Π, f only has a pole at 0, so ord0 f = −3. Also, u1 and u2
are distinct zeroes of f , so there is a third point ω ∈ Π such that deg(f ) = u1 +u2 +ω−3·0 = 0,
i.e. u1 + u2 + ω = 0. Solving for ω, we get ω = −(u1 + u2 ) = u3 . It follows that R = ϕ(u3 )
is on the same line as P and Q, so we are done.
477
26.2. Elliptic Curves Chapter 26. Elliptic Curves and Complex Analysis
The compatibility of the group operations of C/Λ and E(C) is highly useful. For example,
fix N ∈ N and let
E[N ] = {P ∈ E(C) | [N ]P = O},
be the N -torsion points of E. For N = 2, the points P such that P = −P are exactly the
intersection points of E with the x-axis along with O = [0, 1, 0]:
Theorem 24.0.2 said that #E[N ] = N 2 . This is hard to see from the geometric picture,
but working with the isomorphism E(C) ∼ = C/Λ from Theorem 26.2.5, we see that since
C/Λ = R/Z×R/Z as an abelian group, the N -torsion is given by (C/Λ)[N ] = N1 Z/Z× N1 Z/Z.
This is a group of order N 2 , so we have proven (3) of Theorem 24.0.2. The other statements
of the theorem are straightforward to prove.
Recall that morphism in the category of elliptic curves is called an isogeny. Explicitly,
ϕ : E1 → E2 is an isogeny between two elliptic curves if it is a (nonconstant) morphism of
schemes that takes the basepoint O1 ∈ E1 to the basepoint O2 ∈ E2 .
f (z mod Λ1 ) = az + b mod Λ2 .
Proof. As topological spaces, C/Λ1 and C/Λ2 are complex tori with the same universal
covering space C, so any f : C/Λ1 → C/Λ2 lifts to F : C → C making the diagram
commute:
F
C C
π1 π2
C/Λ1 C/Λ2
f
478
26.2. Elliptic Curves Chapter 26. Elliptic Curves and Complex Analysis
Since covers are local homeomorphisms, it follows that F is holomorphic as well. Thus for
any z ∈ C, ` ∈ Λ1 ,
Corollary 26.2.7. For two lattices Λ1 , Λ2 , the elliptic curves C/Λ1 and C/Λ2 are isomorphic
if and only if there exists an a ∈ C such that Λ1 = aΛ2 .
Definition. Two lattices Λ1 and Λ2 are said to be homothetic if Λ1 = aΛ2 for some a ∈ C.
Thus the set of homothety classes of lattices is naturally identified with the set of iso-
morphisms of complex elliptic curves.
Corollary 26.2.8. Any holomorphic map f : C/Λ1 → C/Λ2 is, up to translation, a group
homomorphism. In particular, if f (0) = 0 then f is a homomorphism.
Corollary 26.2.9. For any elliptic curve E, the group of endomorphisms End(E) has rank
at most 2.
End(E) = {f : E → E | f is an isogeny}
= {f : C/Λ → C/Λ | f is holomorphic and f (0) = 0} by Corollary 26.2.8
= {z ∈ C | zΛ ⊆ Λ}
= {z ∈ C | z(Z + Zτ ) ⊆ (Z + Zτ )}
⊆ Z + Zτ.
End(E) = Z.
End(E) is an order O in some imaginary quadratic number field K/Q (for the defini-
tion, see Section 17.2). In this case, E is said to have complex multiplication.
479
26.3. The Classical Jacobian Chapter 26. Elliptic Curves and Complex Analysis
For an elliptic curve E defined by the equation y 2 = f (x), fix a holomorphic differential
form ω on E(C). (In general, the space of holomorphic differential forms on a curve has
dimension equal to the genus of the curve, so in the elliptic curve case, there is exactly one
such ω, up to scaling.)
Example 26.3.3. Under the map ϕ : C/Λ → E(C), z 7→ (x, y) = (℘(z), ℘0 (z)), we see that
dx = ℘0 (z) dz = y dz
so ω = dx
y
is a differential form on E(C). In fact, ω = f dx 2
0 (x) , where E is defined by y = f (x),
is holomorphic because f 0 (x) 6≡ 0. This differential form is also holomorphic at O = [0, 1, 0],
so up to scaling, this is the unique holomorphic form on E.
480
26.3. The Classical Jacobian Chapter 26. Elliptic Curves and Complex Analysis
For a more functorial description, let VE = Γ(E, ΩE ) be the space of all holomorphic
differential forms on E. If γ is a curve in E(C), there is an associated linear functional
ϕγ ∈ VE∗ defined by
ϕγ : VE −→ C
Z
ω 7−→ ω.
γ
Fixing the basepoint O ∈ E(C), the lattice of periods for E can be written
Definition. The Jacobian of an elliptic curve E is the quotient J(E) = VE∗ /Λ.
For each point P ∈ E(C), the coset ϕγ + Λ is an element of the Jacobian, where γ is a
path from O to P . This defines an injective map i : E ,→ J(E).
i1 i2
J(E1 ) J(E2 )
τ
Thus σ ∗∗ ϕγ1 = ϕσ(γ1 ) . If γ1 is a closed curve through O1 , then σ(γ1 ) is a closed curve passing
through O2 = σ(O1 ). Hence if ΛE1 , ΛE2 are the lattices of periods for E1 , E2 , respectively,
we have σ ∗∗ (λE1 ) ⊆ ΛE2 . So σ ∗∗ factors through the quotients, defining τ :
Lemma 26.3.5. For any elliptic curve E, the inclusion i : E ,→ J(E) induces an isomor-
phism
i∗ : π1 (E, O) −→ π1 (J(E), i(O)).
481
26.3. The Classical Jacobian Chapter 26. Elliptic Curves and Complex Analysis
Unfortunately, the construction of the Jacobian given so far is not algebraic so it would
be hard to carry over to curves over an arbitrary ground field. To construct Jacobians
algebraically, we will prove Abel’s theorem:
Theorem 26.3.6 (Abel). Suppose Λ ⊆ C is a lattice with fundamentalP domainPΠ and take
any set {ai } ⊂ Π such that there are integers mi ∈ Z satisfying mi = 0 and mi ai ∈ Λ.
Then there exists an elliptic function f (z) whose set of zeroes and poles is {ai } and whose
orders of vanishing/poles are ordai f = mi .
2 2
One has |eπi(n τ +2nz) | = e−π(n im τ +2n im z)
for any z ∈ C, which implies that the above
series converges absolutely.
Properties (2) and (3) together say that θ(z) is what’s known as a semielliptic function.
For our purposes, this will be good enough. Notice that for z = 1+τ 2
, we have
1+τ 1+τ
θ =θ − + (1 + τ )
2 2
πi(τ +2(− 1+τ )) 1+τ
=e 2 θ −
2
πi 1+τ 1+τ
=e θ − = −θ .
2 2
1+τ
Thus z = 2
is a zero of θ(z).
Lemma 26.3.8. All zeroes of θ(z, t) are simple and are of the form 1+τ
2
+ ` for ` ∈ Λ.
482
26.3. The Classical Jacobian Chapter 26. Elliptic Curves and Complex Analysis
Proof. Given such a set {ai } ⊂ Π, let x1 , . . . , xn be the list of all ai with mi > 0, listed with
repetitions corresponding to the number mi . For example, if m1 = 2 then x1 = x2 = a1 .
Likewise, letPy1 , . . . , yn be the list of all ai with mi < 0, once again with repetitions. By the
hypothesis mi = 0, there are indeed an equal number of each. Set
Qn (xi )
θ (z)
f (z) = Qi=1
n (yi ) (z)
.
i=1 θ
Then by Lemma 26.3.9, f (z + 1) = f (z). On the other hand, the lemma also gives
Qn (xi )
i=1 θ (z + τ )
f (z + τ ) = Q n (y i ) (z)
i=1 θ
Pn Pn
= e2πi( i=1 xi − i=1 yi ) f (z)
P
= e2πi mi ai
f (z)
X
= f (z) since mi ai = 0.
Therefore f (z) is elliptic.
Note that θ(z) is a meromorphic function, so by complex analysis, the integral
θ0 (z)
Z
1
dz
2πi ∂Π θ(z)
counts the number of zeroes of θ(z) in the fundamental domain Π, up to multiplicity. To
ensure no zeroes lying on ∂Π are missed, we may shift Π → Πα for an appropriate α ∈ C.
Parametrize ∂Π as in Proposition 26.1.5. Then once again the integrals along γ2 and γ4
cancel since θ(z + 1) = θ(z). On the other hand,
θ(z + τ ) = e−πi(τ +2z) θ(z)
=⇒ θ0 (z + τ ) = e−πi(τ +2z) (−2πiθ(z) + θ0 (z))
θ0 (z + τ ) θ0 (z)
=⇒ = −2πi + .
θ(z + τ ) θ(z)
This implies
θ0 (z) θ0 (z) θ0 (z) θ0 (z) θ0 (z)
Z Z Z Z Z
dz = dz + dz + dz + dz
∂Π θ(z) γ1 θ(z) γ2 θ(z) γ3 θ(z) γ4 θ(z)
θ0 (z) θ0 (z) θ0 (z) θ0 (z)
Z Z Z Z
= dz + dz + dz + dz
γ1 θ(z) γ3 θ(z) γ2 θ(z) γ4 θ(z)
θ0 (z) θ0 (z)
Z Z
= dz − dz + 2πi + 0
γ1 θ(z) γ1 θ(z)
= 2πi.
It follows that θ(z) has exactly one zero in Π, and it must be z = 1+τ
2
.
The inverse map ψ : E → C/Λ extends to the group of divisors on E:
Ψ : Div(E) −→ C/Λ
X X
nP P 7−→ nP ψ(P ).
483
26.3. The Classical Jacobian Chapter 26. Elliptic Curves and Complex Analysis
Corollary 26.3.10. The map Ψ : Div0 (E) → C/Λ induces an isomorphism Pic0 (E) ∼
= C/Λ.
Proof. One can prove that Ψ is a surjective group homomorphism. Moreover, Abel’s theorem
(26.3.6) implies that ker Ψ = PDiv(E).
Consider the map iO : E → Div0 (E) that sends P 7→ P −O. This fits into a commutative
diagram:
Div0 (E)
Ψ
iO C/Λ
ψO
E
Pic0 (E)
Ψ
iO C/Λ
ψO
E
484
26.4. Jacobians of Higher Genus CurvesChapter 26. Elliptic Curves and Complex Analysis
Pic0 (C)
Ψ
iO J(C)
ψO
C
However, this time not every map is a bijection. In particular, dim C = 1 < g = dim J(C).
To remedy this, let C g be the g-fold product of C and consider the map
ψ g : C g −→ J(C)
(P1 , . . . , Pg ) 7−→ ψ(P1 ) + . . . + ψ(Pg )
485
26.4. Jacobians of Higher Genus CurvesChapter 26. Elliptic Curves and Complex Analysis
There is still work to do to show that the natural map C g → Pic0 (C) is surjective.
It turns out that J(C) is birationally equivalent to the symmetric power C (g) = C g / ∼,
where (P1 , . . . , Pg ) ∼ (Pσ(1) , . . . , Pσ(g) ) for any permutation σ ∈ Sg . Jacobi proved that this
birational equivalence is enough to endow Pic0 (C) ∼ = J(C) with the structure of an algebraic
group.
486
Chapter 27
Complex Multiplication
We saw in Section 24.1 that many endomorphisms of an elliptic curve are of the form [m] :
P 7→ mP for m ∈ Z. In fact, for most elliptic curves, these are the only endomorphisms,
but a special class of curves admit extra endomorphisms which are the starting place for a
beautiful theory of complex multiplication in number theory.
In class field theory (Part IV), we classified all abelian extensions of a number field K by
studying complex roots of unity, i.e. torsion points of the group scheme Gm (C), and using
them to construct cyclotomic extensions of K – by the Kronecker-Weber theorem (17.8.10),
all abelian extensions are subfields of such cyclotomic fields. In a completely analogous way,
the theory of complex multiplication allows one to construct, for an elliptic curve E for which
End(E) has extra elements coming from a number field K, abelian extensions of K. Namely,
torsion points of E along with the j-invariant will generate all such fields.
487
27.1. Classical Complex Multiplication Chapter 27. Complex Multiplication
ϕ ϕ
[α]
E E
488
27.1. Classical Complex Multiplication Chapter 27. Complex Multiplication
Therefore (ϕ ◦ [α]1 )∗ = ([α]2 ◦ ϕ)∗ , but since ϕ∗ is nonzero by Theorem 24.1.10, we must have
(ϕ ◦ [α]1 = [α]2 ◦ ϕ.
For an order O in an imaginary quadratic field K, let Ell(O) denote the set of isomorphism
classes of elliptic curves E/C with End(E) ∼= O.
Theorem 27.1.4. Let K be a number field with ring of integers OK , class group CK and
nonzero fractional ideals a, b ⊂ K. Then for any lattice Λ ⊂ C with associated elliptic curve
E = C/Λ,
Proof. (a) follows from the proof of Proposition 14.8.2, with OK replaced by Λ.
(b) For all α ∈ C, αaΛ ⊆ Λ is equivalent to αΛ ⊆ Λ, after multiplying through by a−1 .
This shows that
by Corollary 26.2.9.
(c) By Corollary 26.2.7, Ea ∼
= Eb if and only if the lattices aΛ and bΛ are homothetic,
i.e. aΛ = cbΛ for some c ∈ C. So
Ea ∼
= Eb ⇐⇒ aΛ = cbΛ for some c ∈ C
⇐⇒ Λ = ca−1 bΛ and Λ = c−1 ab−1 Λ for some c ∈ C
⇐⇒ ca−1 b, c−1 ab−1 ⊆ OK for some c ∈ C
⇐⇒ ca−1 b = OK = c−1 ab−1 for some c ∈ C
⇐⇒ a = cb for some c ∈ K
⇐⇒ [a] = [b] in CK .
(d) Define the action of CK on Ell(OK ) by [a] · E = Ea−1 . Fix E1 , E2 ∈ Ell(OK ) with
E1 = C/Λ1 and E2 = C/Λ2 . For j = 1, 2, choose λj ∈ Λj and set aj = λ−1 j Λj . By the proof
489
27.1. Classical Complex Multiplication Chapter 27. Complex Multiplication
Thus the action is transitive. To see that it is simply transitive, note that by (c), if [a] · E =
[b] · E then [a] = [b]. Then (e) follows immediately.
Example 27.1.5. For the lattice Λ = Z[i], the Gaussian integers, set E = C/Λ. Then
End(E) ∼ = Z[i] so E has complex multiplication. Moreover, Aut(E) = {±1, ±i} ∼ = Z/4Z
and j(E) = 1728 by analysis of the Weierstrass equation, so E is isomorphic to the elliptic
curve given by y 2 = x3 + x. To see this explicitly, note that iΛ = Λ implies g3 (Λ) = g3 (iΛ) =
i6 g3 (Λ) = −g3 (Λ), where g3 (Λ) is the normalized Eisenstein series for Λ (see Section 26.2).
Thus g3 (Λ) = 0 so by Theorem 26.2.4, E has Weierstrass equation
E : y 2 = 4x3 − g2 (Λ)x.
This also confirms that j(E) = 1728. Note that although E is isomorphic to a rational
elliptic curve, e.g. y 2 = x3 + x, the above Weierstrass equation is not rational. In fact,
Z 1 4
dt
g2 (Λ) = 64 √ .
0 1 − t4
Example 27.1.6. Similarly, consider the lattice Λ = Z[ρ] where ρ = e2πi/3 is a primitive
third root of unity. Then for E = C/Λ, we have End(E) = Z[ρ] so once again, E has complex
multiplication. Let us describe E explicitly as in the previous example. First, ρΛ = Λ implies
g2 (Λ) = g2 (ρΛ) = ρ4 g2 (Λ) = ρg2 (Λ), so g2 (Λ) = 0. By Theorem 26.2.4, E is given by the
Weierstrass equation
E : y 2 = 4x3 − g3 (Λ)
so j(E) = 0. Moreover, Aut(E) = {±1, ±ρ, ±ρ2 } ∼ = Z/6Z and E is isomorphic to the
2 3
rational elliptic curve y = x + 1.
490
27.2. Torsion and Rational Points Chapter 27. Complex Multiplication
Proposition 27.2.1. For any OK -ideal a, there is an isogeny ϕa : E → [a] · E such that
Corollary 27.2.2. Let N = NK/Q be the ideal norm of the extension K/Q. Then
(a) For any ideal a ⊂ OK , the isogeny ϕa : E → [a] · E has degree Na.
(b) In particular, for all α ∈ OK , the isogeny [α] : E → E has degree |N α| where
N = NK/Q is the field norm.
(a) For any elliptic curve E/C with complex multiplication by OK , j(E) ∈ Q.
(b) Ell(OK ) is equal to the set of Q-isomorphism classes of elliptic curves defined over Q
with End(E) ∼ = OK .
Proof. (a) Set L = Q(j(E)); we must show that [L : Q] < ∞. For any σ ∈ Aut(C), E σ
is the curve obtained by letting σ act on the Weierstrass equation for E, so by definition
j(E σ ) = j(E)σ . Since End(E σ ) ∼
= OK for each σ, there are only finitely many C-isomorphism
σ
classes that E can take on. By Proposition 23.2.1, elliptic curves over C are in bijective
491
27.2. Torsion and Rational Points Chapter 27. Complex Multiplication
Hence [σα]∗E σ = ([α]∗E )σ so Theorem 24.1.10 implies [σα]E σ = [α]σE since we are in character-
istic 0.
(2) Take σ ∈ Aut(C/L). Then E σ = E so by (a), we have [α]σE = [σα]E σ = [σα]E for
all α ∈ O. Given that O ⊆ K, if σ also fixes K then σα = α. Thus [α]σE = [α]E for all
σ ∈ Aut(C/LK), meaning [α] = [α]E is defined over LK. But by Proposition 27.1.2, these
are all the elements of End(E).
(3) Fix an isogeny ϕ : E → E 0 and suppose σ ∈ Aut(C/L). Then ϕσ is an isogeny E → E 0
as well, since the Weierstrass equations of E, E 0 are fixed under σ. By Proposition 24.1.9, ϕ
is determined by its kernel which is a finite subgroup of E(C). There are only finitely many
finite subgroups of E(C), so we see that there are only finitely many isogenies E → E 0 of a
given degree. Therefore {ϕσ | σ ∈ Aut(C), σ fixes L} is a finite set (noting that deg ϕσ =
deg ϕ) which implies ϕ is defined over a finite extension of L. Repeating the argument for
any ϕ gives an extension M/L, but since Hom(E, E 0 ) is finitely generated, we may take M/L
to be a finite extension.
Corollary 27.2.5. If E is an elliptic curve with complex multiplication via OK where K is
an imaginary quadratic field, then [Q(j(E)) : Q] ≤ hK , the class number of K.
We will later show that [Q(j(E)) : Q] = hK , so in particular j(E) is rational if and only if
K is an imaginary quadratic field with class number 1. As there are only a finite number of
such number fields, it follows that only a finite number of Q-isomorphism classes of elliptic
curves have complex multiplication.
492
27.2. Torsion and Rational Points Chapter 27. Complex Multiplication
([i](x, y))τ = (−x, iy)τ = (−τ · x, τ · (iy)) = (−τ · x, −i(τ · x)) = [−i](τ · x, τ · y) = [i]τ (x, y)τ .
Theorem 27.2.7. Let E be a complex elliptic curve with complex multiplication by OK and
let L = K(j(E), Etors ) be the field extension generated by j(E) along with all torsion points
of E. Then L is an abelian extension of K(j(E)).
Proof. Set L0 = K(j(E)) and for each m ≥ 1, let LmS= L0 (E[m]) be the extension of L0
generated by the m-torsion points of E. Then L = m≥1 Lm so it suffices to show each
Lm /L0 is abelian. For each σ ∈ Gal(Lm /L0 ), P ∈ E[m] and α ∈ OK , Theorem 27.2.4 gives
us
([α]P )σ = [α](P σ )
so the actions of Gal(Lm /L0 ) and OK on E[m] commute. This induces a group homomor-
phism
ρ : Gal(K/L0 ) −→ AutOK /mOK (E[m])
where K is an algebraic closure of K, which descends to an injective homomorphism
but by Proposition 27.2.1(b), E[m] is a free OK /mOK -module of rank 1. Thus AutOK /mOK (E[m]) ∼
=
(OK /mOK )× which is abelian, so Gal(Lm /L0 ) is abelian as required.
Let K be an imaginary quadratic field with ring of integers OK and define
F : Gal(K/K) −→ CK
by sending σ to the unique element F (σ) = [a] ∈ CK such that [a] · E = E σ for all elliptic
curves E ∈ Ell(OK ). (Existence and uniqueness of this element follow from Theorem ??).
The following results highlight an interesting fact: F converts the algebraic information of
the absolute Galois group of K into the analytic information of elliptic curves over Q, via
their j-invariants.
Lemma 27.2.8. For all σ ∈ Gal(K/K) and all elliptic curves E ∈ Ell(OK ),
493
27.2. Torsion and Rational Points Chapter 27. Complex Multiplication
Proposition 27.2.10. For all elliptic curves E ∈ Ell(OK ), classes [a] ∈ CK and automor-
phisms σ ∈ Gal(Q/Q), ([a] · E)σ = [a]σ · E σ .
Proof. By Proposition 27.2.3 we may assume E is defined over Q, so E σ makes sense. Choose
a lattice Λ ⊂ C so that E ∼
= C/Λ. Also, since a is a finitely generated OK -module, we have
an exact sequence
m n
OK → OK →a→0
for some m, n ∈ N. Note that for any OK -module M , the map
0→Λ→C→E→0
0 a−1 Λ C Hom(a, E)
0 Λn Cn En 0
0 Λm Cm Em 0
m n
(The other rows come from applying Hom(OK , −) and Hom(OK , −) to the same sequence.)
Applying the Snake Lemma to the bottom rows gives an exact sequence
This identifies the C-points of the variety [a] · E = C/a−1 Λ with the identity component of
ker(E n → E m ). The same argument shows that the C-points of [a]σ · E σ may be identified
with the identity component of ker((E σ )n → (E σ )m ), but the latter is precisely ker(E n →
E m )σ , so we conclude that [a]σ · E σ = ([a] · E)σ .
494
27.3. Class Field Theory with Elliptic Curves Chapter 27. Complex Multiplication
495
Part VI
L-Functions
496
These notes in Part VI come from the 2017-2018 Galois-Grothendieck Seminar at the
University of Virginia. The topic for most of the year was Tate’s thesis on Fourier analysis
over number fields. Tate’s work is a natural jumping off point for the study of L-functions,
modular forms and the beginnings of the Langlands program.
497
Chapter 28
Introduction
The Riemann zeta function is a very basic example of an L-function, an analytic object with
important ties to many branches of mathematics. (In Section 12.4, we saw an example of
an L-function associated to a Dirichlet character.) Many L-functions have Euler products
and functional equations, among other amazing properties, but for certain L-functions of
interest these properties remain conjectures. A particular class of L-functions called Hecke
L-functions have fundamental ties to number theory. In his doctoral thesis, Tate established
a remarkably useful and general framework for studying functional equations for these Hecke
L-functions.
In this introduction we give an overview of some of the types of L-functions that are out
there, as well as their properties and how they connect to each other. By an L-series, we
mean a particular series representation of a function on a subset of C, and when such a series
has meromorphic continuation to C, this continuation is called an L-function. Often we will
use the terms interchangeably though.
(1) The first example of an L-function is the Riemann zeta function, given in series form
∞
X 1
by ζ(s) = . By Theorem 10.3.1, ζ(s) has an Euler product
n=1
ns
Y
ζ(s) = (1 − p−s )−1
p
(with the product being taken over all prime integers p). The zeta function also has mero-
morphic continuation and
−s/2 s
a functional equation coming from the expression ξ(s) = ξ(1 − s),
where ξ(s) = π Γ 2 ζ(s) (this was Theorem 12.1.2). The zeta function has many im-
portant connections to number theory; for instance, certain special values of ζ(s) encode
number-theoretic properties of Z (see Section 12.2):
π4
ζ(2) = π
6
and ζ(4) = 90
were proven by Euler. More generally,
(2π)2n b2n (−1)n+1
ζ(2n) =
2(2n)!
where bk is the kth Bernoulli number (the odd Bernoulli numbers are zero). In contrast,
the values of ζ(s) at odd positive integers are rather mysterious: while ζ(3) is known
498
Chapter 28. Introduction
The functional equation implies ζ(−2n) = 0 for all integer n ≥ 1. The negative even
integers are known as the trivial zeroes of ζ(s).
The Riemann hypothesis states that all nontrivial zeroes of ζ(s) lie on the line Re(s) =
1
2
in the complex plane. This statement, while still not proven (or disproven), encodes
amazing information about the distribution of prime numbers, among other things.
Since χ is fully multiplicative, that is χ(ab) = χ(a)χ(b) for all a, b ∈ Z, there is an Euler
product for its Dirichlet L-function:
Y
L(s, χ) = (1 − χ(p)p−s )−1
p
where, as usual, the product is taken over all prime integers p. There is also a functional
equation and meromorphic continuation for L(s, χ) in terms of the Γ function and a certain
Xm
analogue χ(n)e2πin/m of the Fourier transform, called a Gauss sum. The Riemann hy-
n=1
pothesis also makes sense to state for Dirichlet L-functions, though it too remains unsolved.
(3) Let K/Q be a number field. As in Section 17.5, the Dedekind zeta function for K is
X 1
ζK (s) =
a⊂OK
N (a)s
where the sum is over all nonzero ideals a of the ring of integers OK and N = NK/Q is the
ideal norm of the extension. Since norm is a multiplicative function and every ideal factors
uniquely in OK into a product of prime ideals, there is an Euler product
Y
ζK (s) = (1 − N (p)−s )−1
p∈Spec OK
where the product is over all nonzero prime ideals p ⊂ OK . Perhaps not surprisingly at
this point, there is a functional equation for ζK (s). Let ΓR (s) = π −s/2 Γ 2s and ΓC (s) =
2(2π)−s Γ(s). Then the completed Dedekind zeta function for K is
499
Chapter 28. Introduction
= ζQ (s)L(s, χ),
where ζQ (s) = ζ(s) is the ordinary Riemann zeta function and χ is the nontrivial Dirichlet
character mod 4:
1,
r ≡ 1 (mod 4)
χ(r) = −1, r ≡ 3 (mod 4)
0, r is even.
(4) Let K/Q be a number field with absolute Galois group GK = Gal(K/K), where K
is a fixed algebraic closure of K. A Galois representation of K is a continuous morphism of
topological groups ρ : GK → GL(V ) for V a finite dimensional C-vector space. Note that
ρ being continuous is equivalent to it having finite image in GL(V ). Therefore any such
morphism factors through Gal(L/K) → GL(V ) for some finite extension L/K; we will also
denote this by ρ. For each (nonzero) prime ideal p ⊂ OK , there is a decomposition subgroup
DL/K,p and an inertia subgroup IL/K,p C DL/K,p ⊆ Gal(L/K), each of which is well-defined
up to conjugacy. Further, there exists a Frobenius element FrobL/K (p) ∈ DL/K,p /IL/K,p which
is also well-defined up to conjugacy. For each p, let Vp = V IL/K,p be the subspace of V fixed
by the action of the inertia group. Set np = dim Vp and write ρp = ρ|Vp . Then the local Artin
L-function at p is defined by
Lp (s, ρ) = det[Inp − ρp (FrobL/K (p))NK/Q (p)−s ]−1
where Inp is the identity operator on Vp . Stitching these together, we also define the (global)
Artin L-function for K: Y
L(s, ρ) = Lp (s, ρ).
p∈Spec OK
×
Observe that if ρ : GK → GL1 (C) = C is the trivial representation, then L(s, ρ) = ζK (s)
is the Dedekind zeta function for K. Moreover, one can prove that if ρ factors through
the regular representation Gal(L/K) → GL(V ) for some finite L/K, then L(s, ρ) is the
Dedekind zeta function for L. As with previous L-functions, there is a functional equation
for Artin L-functions that relates L(s, ρ) to L(1 − s, ρ̄), where ρ̄ is the complex conjugate
representation.
500
Chapter 28. Introduction
Example 28.0.2. Let L = Q(i) and K = Q and consider the nontrivial Galois representa-
tion
ρ : Gal(Q(i)/Q) −→ C× , (z 7→ z̄) −
7 → −1.
Then as in Example 28.0.1, we can compute the Artin L-function using the splitting behavior
of primes in Z[i]. If p ∈ Z splits or ramifies in Z[i], then for any p | p, N (p) = p, the Frobenius
element is trivial, and so Lp (s, ρ) = (1 − p−s )−1 . However, when p is inert, the Frobenius
element is complex conjugation, so we get Lp(s,ρ) = (1 + p−s )−1 . Putting this together, we
have
Y Y ζQ(i) (s)
L(s, ρ) = (1 − 2−s )−1 (1 − p−s )−1 (1 + p−s )−1 = (1 − 2−s )−1 .
ζQ (s)
p≡1 (mod 4) p≡3 (mod 4)
but they are better understood as sums of idèle class characters. Tate’s thesis focused on
proving a functional equation for Hecke L-functions, which we will outline in Chapter 31.
(6) Let X be a smooth projective algebraic variety over a finite field Fq . Then the
Hasse-Weil zeta function of X is defined as
∞
!
X sr
Z(X/Fq , s) = exp Nr
r=1
r
where Nr = #X(Fqr ) for each r ≥ 1. The Weil conjectures are a set of essential statements
about Z(X/Fq , s) that were formulated in the 1940s by André Weil and later proven by
Dwork, Grothendieck and Deligne. They assert that:
p1 (s) · · · p2n−1 (s)
(Rationality) Z(X/Fq , s) is rational: Z(X/Fq , s) = for polynomials
p0 (s) · · · p2n (s)
pi (s) over Z.
501
Chapter 28. Introduction
(7) Let K/Q be a number field and X a smooth projective variety over K. The (global)
zeta function for X is the following product of the Hasse-Weil zeta functions for X/Fq :
Y
Z(X, s) = Z(X/Fp , p−s )
p∗
502
Chapter 29
503
29.1. Topological Vector Spaces Chapter 29. Locally Compact Groups
Example 29.1.1. Let k be a topological field. Then any abstract k-vector space V is
∼ L
isomorphic to a direct sum of copies of k, ϕ : V −→ Ω k, indexed
L byQsome set Ω. Then V
inherits a topology by pulling back the subspace topology on Ω k ⊆ Ω k along ϕ and this
makes V into a topological vector space.
We will assume for the rest of these notes that all topological vector spaces are T1
(and therefore Hausdorff by homogeneity). For a topological vector space V /k, let Aut(V )
denote the k-automorphisms of V and let Auttop (V ) denote the subspace of continuous k-
automorphisms of V having continuous inverses.
For a real or complex vector space V and a subset S ⊆ V , we say S is convex if for all
x, y ∈ S, tx + (1 − t)y ∈ S for every value t ∈ [0, 1]. We say V is locally convex if there exists
a topological basis of V consisting of convex sets.
Example 29.1.3. When V is a Banach space, the metric balls {B(0, ε) | ε > 0} form a
system of convex neighborhoods around 0, so by homogeneity V is locally convex.
Definition. Suppose G is a locally compact topological group and V is a locally convex topo-
logical vector space over C. A topological representation of G is a group representation
ρ : G → Aut(V ) such that the associated map
G × V −→ V
(g, v) 7−→ ρg (v)
Definition. Let X be a topological space, V a topological vector space and let Map(X, V ) be
the space of set maps X → V . A set F ∈ Map(X, V ) is said to be equicontinuous if for
all x ∈ X and every neighborhood U ⊆ V of 0, there exists a neighborhood W ⊆ X such that
f (y) ∈ U + f (x) for every y ∈ W and f ∈ F .
504
29.1. Topological Vector Spaces Chapter 29. Locally Compact Groups
Corollary 29.1.6. Suppose V is a Banach space and G is a locally compact group. Then
a group representation ρ : G → Aut(V ) is a topological representation if and only if for all
x ∈ V , the map G → V, g 7→ ρg (x) is continuous.
505
29.1. Topological Vector Spaces Chapter 29. Locally Compact Groups
T
V V0
ρg ρ0g
T
V V0
506
29.2. Banach Algebras Chapter 29. Locally Compact Groups
ρa : A −→ A
b 7−→ ab.
Then ρa ∈ End(A) and it follows from ||1A || = 1 that ||ρa ||op = ||a|| for all a ∈ A. This
determines an embedding ρ : A ,→ End(A). Let A× be the units of A and observe that, by
submultiplicativity,
P∞ n if a ∈ A such that ||a|| < 1, then 1 − a ∈ A× (this follows from the fact
that n=1 a converges in A).
Proposition 29.2.1. Let A be a Banach algebra. Then A× ⊆ A is an open subset and
A× → A× , a 7→ a−1 is a homeomorphism.
Proof. Let a ∈ A× and take b ∈ B(a, ||a−1 ||−1 ). (Since || · || is only submultiplicative,
||a−1 ||−1 ≤ ||a|| but not necessarily equal.) Then ||a − b|| < ||a−1 ||−1 so multiplying by a−1 ,
we get
||a−1 (a − b)|| ≤ ||a−1 || ||a − b|| < 1
which by the remark above implies 1 − a−1 (a − b) ∈ A× . Multiplying by a gives b =
a − (a − b) ∈ A× , so we have an open neighborhood around a in A× . The second statement
is an easy consequence.
Definition. Let A be a Banach algebra and a ∈ A. The spectrum of a is
sp(a) = {λ ∈ C | λ1A − a 6∈ A× }.
The spectral radius of a is r(a) = sup{|λ| : λ ∈ sp(a)} and the complement C r sp(a) is
called the resolvent set of a.
Lemma 29.2.2. For all a ∈ A, r(a) ≤ ||a||.
Proof. Suppose λ ∈ C r {0} such that |λ| > ||a||. Then
so λ 6∈ sp(A).
507
29.2. Banach Algebras Chapter 29. Locally Compact Groups
||a + J|| ||b + J|| = inf ||a − x|| inf ||b − y||
x∈J y∈J
Hence || · || is a norm.
(2) is straightforward.
Remark. It is useful to note that for any two-sided ideal of A, the topological closure J is
also a two-sided ideal of A, by submultiplicativity.
508
29.3. The Gelfand Transform Chapter 29. Locally Compact Groups
b −→ MaxSpec(A)
A
χ 7−→ ker χ
is a bijection.
A/ ker χ
Since χ is surjective, χ is surjective, so A/ ker χ is a field and thus ker χ is a maximal ideal.
On the other hand, for any m ∈ MaxSpec(A), the Gelfand-Mazur theorem (Corollary 29.2.4)
implies
χm : A/m −→ C
λ1A 7−→ λ
A/m
509
29.3. The Gelfand Transform Chapter 29. Locally Compact Groups
p χ
(3) Any χ ∈ A − A/ ker χ →
b factors as χ : A → − C as above, and both maps are continuous.
(4) Let a ∈ A. Then
λ ∈ sp(a) ⇐⇒ λ1A − a 6∈ A×
⇐⇒ λ1A − a ∈ m for some maximal ideal m
⇐⇒ χ(λ1A − a) = 0 for some χ ∈ A
b by (2)
⇐⇒ λ = χ(a) for some χ ∈ A.
b
eva : A∗ −→ C
ϕ 7−→ ϕ(a).
We endow A b with the subspace topology induced by the weak∗ topology on A∗ ; this is
called the Gelfand topology on A.
b
Lemma 29.3.2. The weak∗ topology makes A∗ into a locally convex topological vector space.
Theorem 29.3.3 (Alaoglu). Let B ∗ = {f ∈ A∗ : ||f ||op ≤ 1} be the unit ball in A∗ . Then
B ∗ is compact in the weak∗ topology.
(2) A
b is compact and Hausdorff in the Gelfand topology.
510
29.3. The Gelfand Transform Chapter 29. Locally Compact Groups
Γ : A −→ C(A) b
a 7−→ â.
Proof. The proofs of all five properties are straightforward from the definitions.
511
29.4. Spectral Theorems Chapter 29. Locally Compact Groups
Now suppose X is a compact Hausdorff space. Set C(X) = C(X, C) to distinguish from
C(X, R). The Stone-Weierstrass theorem is an important result from functional analysis
which in some ways gives a function space analogue of Hilbert’s Nullstellensatz.
Corollary 29.4.2. Let A be a self-adjoint, unital subalgebra of C(X) that separates points
in X. Then A is dense in C(X).
Proof. By the remark, we may write A = AR + iAR . Since A separates points, so does AR , so
by the Stone-Weierstrass theorem for this real function space, we get AR = C(X, R). Hence
A = AR + iAR = C(X, R) + iC(X, R) = C(X).
Let H be a Hilbert space and consider End(H), the space of continuous (bounded) linear
maps H → H. Then End(H) is a Banach algebra. For each T ∈ End(H), there is a unique
adjoint operator T ∗ ∈ End(H) satisfying
(i) T ∗∗ = T .
512
29.4. Spectral Theorems Chapter 29. Locally Compact Groups
(iii) (ST )∗ = T ∗ S ∗ .
self-adjoint if T = T ∗ ;
unitary if T −1 = T ∗ ;
normal if T T ∗ = T ∗ T .
Proposition 29.4.4. If T ∈ End(H) is normal then ||T || = r(T ), the spectral radius of T .
Proof. On one hand, we have r(T ) ≤ ||T || by Lemma 29.2.2. Note that when T is normal,
the operator T T ∗ is self-adjoint. This allows us to write the following for any n ≥ 1:
n 2n−1
||T ||2 = ||T ||2
n−1
= ||T T ∗ ||2 by Lemma 29.4.3(v)
2n
= ||(T T ∗ ) ||1/2 since T T ∗ is self-adjoint
n n
= ||T 2 (T ∗ )2 ||1/2 since T is normal
2n 2n ∗ 1/2
= ||T (T ) ||
n 1/2
= ||T 2 ||2 by Lemma 29.4.3(v) again
n
= ||T 2 ||.
Recall from (2) of Theorem 29.2.3 that r(T ) = limn→∞ ||T n ||1/n . Then the above shows that
n −n
r(T ) ≥ limn→∞ ||T 2 ||2 = limn→∞ ||T || = ||T || so we conclude that r(T ) = ||T ||.
Proof. (a) Note that in general, λ ∈ sp(T ) if and only if λ−1 ∈ sp(T −1 ). So if T is unitary,
meaning T T ∗ = 1, then it follows from Lemma 29.4.3(iv) that ||T || = ||T −1 || = 1. Thus
if λ ∈ sp(T ), then |λ| ≤ 1, but at the same time λ−1 ∈ sp(T −1 ) implies |λ−1 | ≤ 1. Hence
|λ| = 1, or λ ∈ S 1 .
(b) The operator
∞
X (iT )n
exp(iT ) =
n=0
n!
513
29.4. Spectral Theorems Chapter 29. Locally Compact Groups
For a normal operator T ∈ End(H), let AT denote the smallest subalgebra of End(H)
containing T which is self-adjoint, unital, closed and commutative. Equivalently, AT is the
subalgebra of End(H) generated by {1, T, T ∗ }.
Theorem 29.4.7 (First Spectral Theorem). Let T ∈ End(H) be a normal operator. Then
there is a map
Φ : C(sp(T )) −→ AT
which is an isometry and a ∗-isomorphism of unitary C-algebras. Further, if iT : sp(T ) ,→ C
is the natural inclusion, then Φ(iT ) = T .
514
29.4. Spectral Theorems Chapter 29. Locally Compact Groups
Proof. Consider the map Ψ : C(sp(T )) → C(A bT ) which sends f 7→ f ◦ Tb, which is well-defined
since im Tb = spAT (T ), the spectrum of T in the subalgebra AT . Then to prove the theorem,
we will show Ψ is an isometry and a ∗-isomorphism and spAT (T ) = sp(T ), so that we can
define Φ by
Ψ
C(sp(T )) C(A
bT )
Γ
Φ
AT
since for any µ, ((i − λ)f )(µ) = (µ − λ)f (µ). If T − λ1H had an inverse in End(H), we would
have
1 = ||P || = ||(T − λ1H )−1 (T − λ1H )P || ≤ ||(T − λ1H )−1 ||ε
by submultiplicativity of || · ||, but this would imply
1
≤ ||(T − λ1H )−1 ||
ε
for all ε > 0, which is impossible. Hence T − λ1H is not a unit in End(H), so λ ∈ sp(T ),
which proves spAT (T ) ⊆ sp(T ).
Now to show Ψ is an isometry and a ∗-isomorphism, note that Tb : A bT → spA (T ) =
T
sp(T ) is surjective and continuous by Proposition 29.3.1. Moreover, if Tb(γ1 ) = Tb(γ2 ) for
γ1 , γ2 ∈ A
bT , then γ1 (T ) = γ2 (T ), which is equivalent to
γ1 (T ∗ ) = γ1 (T ) = γ2 (T ) = γ2 (T ∗ )
515
29.5. Unitary Representations Chapter 29. Locally Compact Groups
Notice that when ρ is a unitary representation, we have hx, yi = hρg (x), ρg (y)i for all
g ∈ G and x, y ∈ H.
Proposition 29.5.1. Let H be a Hilbert space and T ∈ End(H) be a normal operator. Then
the following are equivalent:
(2) AT ∼
= C as C ∗ -algebras.
(3) T = λ1H for some λ ∈ C.
Theorem 29.5.2 (Schur’s Lemma). Let G be an abstract group and suppose ρ : G → Aut(V )
and ρ0 : G → Aut(V 0 ) are irreducible representations. Then any T ∈ HomG (V, V 0 ) is either
trivial or a k-vector space isomorphism.
Proof. For any T ∈ EndG (H), let T ∗ be the adjoint. Then for all g ∈ G and x, y ∈ H,
In particular, for x = 1H , this gives h1, T ∗ ρg (y)i = h1, ρg (T y)i, but h1, ·i is injective, so this
implies T ∗ ρg = ρg T ∗ for all g ∈ G. Hence T ∗ is G-equivariant. Since AT is generated as a
516
29.5. Unitary Representations Chapter 29. Locally Compact Groups
subalgebra of End(H) by {1, T, T ∗ } and all of these are now G-equivariant, it follows that
AT ⊆ EndG (H).
Now take T to be normal and suppose λ1 , λ2 ∈ sp(T ) are distinct. Since sp(T ) is Haus-
dorff, there are disjoint neighborhoods U1 , U2 ⊆ sp(T ) of λ1 and λ2 , respectively. Choose
functions f1 , f2 ∈ C(sp(T )) such that for i = 1, 2, fi (sp(T ) r {Ui }) = 0 and fi (λi ) = 1,
again using the Hahn-Banach theorem for example. Then f1 , f2 6= 0 but since U1 ∩ U2 = ∅,
f1 f2 = 0. Let Φ : C(sp(T )) → AT be the isomorphism from the spectral theorem (29.4.7).
Then since f1 6= 0, Φ(f1 )(H) is nonzero. On the other hand, Φ(f1 ) ∈ AT ⊆ EndG (H)
by the first paragraph, so Φ(f1 )(H) is a nonzero, G-equivariant subspace of EndG (H) and
by the same argument, so is its closure. Since ρ is topologically irreducible, this means
Φ(f1 )(H) = H. Applying this again for Φ(f2 ), we conclude that Φ(f2 )Φ(f1 )(H) = H, but
Φ(f2 f1 )(H) = Φ(0)(H) = {0}, contradicting the fact that Φ is an algebra homomorphism.
Hence sp(T ) can only consist of one point, so Proposition 29.5.1 shows that T = λ1H for
some λ ∈ C.
Corollary 29.5.4. Suppose G is a locally compact abelian group, H is a Hilbert space and
ρ : G → Aut(H) is a unitary, irreducible topological representation. Then dimC (H) = 1.
517
Chapter 30
Duality
Let G be a topological abelian group and let S 1 be the unit circle in C. The multiplicative
group of characters
b = {f : G → S 1 | f is a continuous homomorphism}
G
(1) If G is discrete, G
b is compact.
(2) If G is compact, G
b is discrete.
The Pontrjagin dual is the key ingredient in establishing the Fourier transform and prov-
ing the Pontrjagin duality theorem for locally compact groups.
518
30.1. Functions of Positive Type Chapter 30. Duality
Lemma 30.1.1. For all functions ϕ of positive type on G, Wϕ is a vector subspace of Cc (G)
and h·, ·iϕ descends to a positive definite, Hermitian form on the the quotient Cc (G)/Wϕ .
Let Vϕ be the completion of the normed space (Cc (G)/Wϕ , h·, ·iϕ ). By abuse of notation
we will also denote the extension of h·, ·iϕ to this completion by h·, ·iϕ .
519
30.1. Functions of Positive Type Chapter 30. Duality
Theorem 30.1.4. Let G be a locally compact group and ϕ a function of positive type on G.
Then s 7→ Ls induces a unitary representation of G on Vϕ .
(i) If f ∗ g(x) exists for some x ∈ G, then g ∗ f (x) exists and f ∗ g(x) = g ∗ f (x).
(ii) If f, g ∈ L1 (G) then f ∗ g(x) exists for almost all x ∈ G. Moreover, ||f ∗ g||1 ≤
||f ||1 ||g||1 so in particular f ∗ g ∈ L1 (G).
Theorem 30.1.7. Let ϕ be a function of positive type on G. Then there exists xϕ ∈ Vϕ such
that ϕ(s) = hxϕ , Ls xϕ iϕ for almost all s ∈ G.
T
Proof. Let {Uα } be a system of open neighborhoods of e ∈ G. Since G is Hausdorff, α Uα =
{e}. The index set {α} is a directed set under the partial ordering defined by α ≤ β
if Uβ ⊆ Uα . By Urysohn’s lemma for locally compact spaces, for each α there exists a
continuous
R function gα : G → R+ such that the support of gα is a compact subset of Uα and
g (s) ds = 1. This defines a net {gα ds}α of positive linear functionals on Cc (G); explicitly,
G αR
f 7→ G f (s)gα (s) ds. These functionals weakly converge to the Dirac measure δe : f 7→ f (e).
Let f ∈ Cc (G). Then for any α, Fubini’s theorem gives
ZZ Z
−1
ϕ(s t)f (s) ds gα (t) dt = (f ∗ ϕ)(t)gα (t) dt
G×G G
520
30.1. Functions of Positive Type Chapter 30. Duality
This determines a linear form Φ on Vϕ which, after replacing f ∗ ϕ by (f ∗ ϕ)h for a function
h with compact support and such that h ≡ 1 on a neighborhood eventually containing the
support of gα , is of the form
Z
Φ(f ) = (f ∗ ϕ)(e) = ϕ(s−1 )f (s) ds. (30.1)
G
521
30.1. Functions of Positive Type Chapter 30. Duality
Corollary 30.1.8. Let ϕ be a function of positive type on G. Then ϕ is equal almost every-
where to a continuous function of positive type on G. If, moreover, ϕ is itself continuous,
then
Proof. By Theorem 30.1.7, ϕ(s) = hxϕ , Ls xϕ iϕ a.e. for some xϕ ∈ Vϕ , but the latter is
continuous by real analysis. Now assume ϕ is continuous.
(i) Since h·, ·iϕ is positive definite on Vϕ , ϕ(e) = hxϕ , Le xϕ iϕ = hxϕ , xϕ iϕ ≥ 0.
(ii) For any s ∈ G, consider
|ϕ(s)|2 = |hxϕ , Ls xϕ iϕ |2
≤ |hxϕ , xϕ iϕ | |hLs xϕ , Ls xϕ iϕ | by Cauchy-Schwarz
= hxϕ , xϕ iϕ hxϕ , xϕ iϕ by Lemma 30.1.3(b)
= (hxϕ , xϕ iϕ )2 = ϕ(e)2 .
Taking the square root of both sides, we get ϕ(e) = sup |ϕ(s)|.
s∈G
(iii) For s ∈ G,
Definition. We say a function ϕ ∈ P(G) is elementary if ϕ(e) = 1 and for any decomposi-
tion ϕ = ϕ1 + ϕ2 , with ϕ1 , ϕ2 ∈ P(G), there exist scalars λ1 , λ2 ∈ R≥0 satisfying λ1 + λ2 = 1,
ϕ1 = λ1 ϕ and ϕ2 = λ2 ϕ. Let E(G) be the set of all elementary functions on G, together with
the zero map.
Theorem 30.1.10. Let G be a locally compact abelian group. Then the elementary functions
of positive type on G are precisely the continuous characters of G, i.e. E(G) r {0} = G.
b
Proof. Given ϕ of positive type on G such that ϕ(e) = 1, consider the following two condi-
tions:
522
30.1. Functions of Positive Type Chapter 30. Duality
(ii) ϕ is a character of G.
By Theorem 30.1.10, showing that (i) and (ii) are equivalent will imply the statement of this
theorem.
(ii) =⇒ (i) Take ϕ ∈ G
b and f ∈ Cc (G). Then
ZZ
hf, f iϕ = ϕ(s−1 t)f (s) ds f (t) dt
G×G
Z 2
= ϕ(s)f (s) ds
G
by Fubini’s theorem, which shows that Wϕ has codimension 1 in Cc (G) and hence dim Vϕ = 1.
Since G is abelian, Vϕ is an irreducible G-module.
(i) =⇒ (ii) By Corollary 29.5.4, if the unitary representation s 7→ Ls is irreducible, it is
one-dimensional. So for all ξ ∈ Vϕ , Ls (ξ) = λ(s)ξ for λ a continuous function of s. Since Ls
is unitary, Proposition 29.4.5 shows that ||Ls || = 1, which implies |λ(s)| = 1, and thus λ is
a character of G. Finally, for all s ∈ G,
ϕ(s) = hxϕ , Ls xϕ iϕ
= λ(s)hxϕ , xϕ iϕ
= λ(s)ϕ(e) = λ(s).
523
30.2. Fourier Inversion Chapter 30. Duality
for all χ ∈ G.
b
Example 30.2.1. Let G = R. Then each t ∈ R may be identified with a group character
s 7→ eist . Then the Fourier transform of any f ∈ L1 (R) is the standard Fourier transform:
Z
fˆ(t) = f (s)e−ist ds.
R
Let V (G) denote the space of continuous functions of positive type in Cc (G) and set
V 1 (G) = V (G) ∩ L1 (G). The goal of this section is to prove the Fourier inversion formula:
Theorem 30.2.2 (Fourier Inversion Formula). Let G be a locally compact abelian group with
Haar measure dx. Then there exists a Haar measure dχ on G b which satisfies
Z
f (y) = fˆ(χ)χ(y) dχ
G
b
To prepare for the proof of the Fourier inversion formula, we relate the Fourier and
Gelfand transforms by the following result. Let B = L1 (G) and let B b = HomC (B, C)× be
the space of complex characters of B. For χ ∈ G b and f ∈ L1 (G), define
Z
ˆ
ν̂χ (f ) := f (χ) = f (y)χ(y) dy.
G
b −→ B
G b
χ 7−→ ν̂χ
is a bijection.
524
30.2. Fourier Inversion Chapter 30. Duality
Let Gb have the transform topology induced by A, b i.e. the weakest topology with respect to
which each fˆ ∈ Ab is continuous. Also, let C0 (G)
b denote the C-algebra of rapidly-decaying
maps on G, or equivalently, the space of continuous functions on the one-point compactifi-
cation of G
b which are 0 at the point at infinity.
Moving towards the proof of Theorem 30.2.2, we now discuss Fourier transforms of char-
acter measures. For a locally compact group G with character group G, b let µ̂ be a Radon
measure on G b < ∞. A standard analysis result is:
b with finite total mass, that is, µ̂(G)
Definition. For a finite Radon measure µ̂ on G,b the Fourier transform of µ̂ is the function
Tµ̂ : G → C defined for each y ∈ G by
Z
Tµ̂ (y) := χ(y) dµ̂(χ).
G
b
Proof. (a) Continuity is clear. Boundedness follows from the fact that Tµ̂ (y) ≤ µ̂(G)
b for all
y ∈ G.
(b) By Fubini’s theorem and the definitions of fˆ and Tµ̂ ,
Z ZZ
ˆ
f (χ) dµ̂(χ) = f (y)χ(y) dy dµ̂(χ)
G
b G×Gb
ZZ
= f (y)χ(y) dµ̂(χ) dy
G×G
b
Z
= f (y)Tµ̂ (y) dy.
G
Proposition 30.2.7. Let µ̂ be a finite Radon measure on G. b If Tµ̂ (y) = 0 for all y ∈ G,
then µ̂ = 0. That is, µ̂ is completely determined by its Fourier transform.
525
30.2. Fourier Inversion Chapter 30. Duality
Theorem 30.2.8 (Bochner). Let G be a locally compact abelian group. The functions in
P(G) are precisely the Fourier transforms of Radon measures µ̂ on G
b with finite total mass
b ≤ 1.
µ̂(G)
Thus the Fourier transform of µ̂ is the character χ which is a function of positive type on
G such that ||χ||∞ ≤ 1, by Theorem 30.1.10. The general case is obtained by taking weakly
convergent limits of point-measures of total mass 1.
Conversely, by Lemma 30.2.6(a), the Fourier transform is a continuous map M c → P(G).
Then the same argument using weakly convergent limits of point-measures can be used to
show that the image of M c is (weakly) compact, hence closed in P(G). Finally, one observes
c in P(G) is convex and contains G∪{0},
that the image of M b and then the characterization of
elementary functions as extreme points of P(G), together with Theorem 30.1.10, will imply
that this image is all of P(G).
Let G be a locally compact abelian group and set V = V (G), the complex vector space of
continuous functions of positive type on G. Then Corollary 30.1.8(ii) implies the functions
of V are bounded. Put V 1 = V 1 (G) = V ∩ L1 (G).
Corollary 30.2.9. Each function f ∈ V uniquely determines a Radon measure µ̂f of finite
total mass on G
b such that f is the Fourier transform of µ̂f .
526
30.2. Fourier Inversion Chapter 30. Duality
Z
As a result, we may view any function f ∈ V as f (y) = χ(y) dµ̂f (χ).
G
b
Lemma 30.2.10. There exists a net of functions {f } on V 1 = V 1 (G) such that the associated
sequence of Fourier transforms {fˆ} converges uniformly to the constant function 1 on all
compact subsets of G.
b
Proof. By Proposition 30.2.7, it’s enough to show the equality on the corresponding Fourier
transforms. For any y ∈ G, consider
Z ZZ
Tĝ dµ̂f (y) = χ(y)ĝ(χ) dµ̂f (χ) = χ(y)g(z)χ(z) dz dµ̂f (χ) by definition of ĝ
Gb G×Gb
ZZ
= χ(y)g(z)χ(z) dµ̂f (χ) dz by Fubini’s theorem
G×G
b
ZZ
= χ(z −1 y)g(z) dµ̂f (χ) dz after a change of variables
Z G×G
b
but this equals f ∗ g, the convolution of f and g. Since f ∗ g is symmetric with respect to f
and g, this implies Tĝ dµ̂f = Tfˆdµ̂g .
Let F be the set of bounded continuous functions ϕ : G b → C for which there exists a
Radon measure ν̂ϕ on Gb with finite total mass that satisfies ϕ dµ̂f = fˆ dν̂ϕ for all f ∈ V 1 .
Then Lemma 30.2.11 shows that the Fourier transforms of the functions in V 1 lie in F. In
particular, F is nonempty.
Lemma 30.2.12. Let ϕ ∈ F. Then
(i) The associated measure ν̂ϕ is unique.
(ii) If ϕ = fˆ for some f ∈ L1 (G), then ν̂ϕ = µ̂f , where µ̂f is the unique Radon measure
corresponding to f in Corollary 30.2.9.
and the µ̂f are unique by Corollary 30.2.9, so this implies ν̂ϕ is unique.
527
30.2. Fourier Inversion Chapter 30. Duality
(ii) This already holds for f ∈ V 1 by the paragraph proceeding this lemma, and now (i)
implies the property for all f ∈ L1 (G).
(iii) This uses the same argument as in (i).
(iv) Again, use Lemma 30.2.10 and the fact that limits are linear.
(v) For any measure µ, element z ∈ G b and subset E ⊆ G, b set µz (E) = µ(z −1 E). To
prove the statement fix χ0 ∈ G b and suppose ψ(χ) = ϕ(χ−1 0 χ). Then for all h ∈ Cc (G) and
1
f ∈ L (G),
Z Z Z
−1 χ−1
h(χ)ψ(χ) dµ̂f (χ) = h(χ)ϕ(χ0 χ) dµ̂f (χ) = h(χ0 χ)ϕ(χ) dµ̂f 0 (χ)
G
b G
b G
b
χ−1
by a change of variables. We claim that dµ̂f 0 = dµ̂χ−1 0 f
. Indeed, by Bochner’s theorem
(30.2.8),
Z Z
f (y) = χ(y) dµ̂f (χ) = (χ0 χ)(y) dµ̂f (χ0 χ)
ZGb Gb
−1
χ
so χ−1 0 f (y) = χ(y) dµ̂f 0 (χ)
G
b
χ−1
but by uniqueness of µ̂χ−1
0 f
, this proves dµ̂χ−10 f
= dµ̂f 0 . Now continuing with the above
computation, we have
Z Z
h(χ)ψ(χ) dµ̂f (χ) = h(χ0 χ)ϕ(χ) dµ̂χ−1 0 f
(χ)
Gb
ZG
b
Hence ψ dµ̂f = fˆ dν̂ϕχ0 for all f ∈ L1 (G), but dν̂ϕχ0 = dν̂ψ , so we get ψ ∈ F as desired.
We now prove the main statement in the Fourier inversion formula (Theorem 30.2.2).
Theorem 30.2.13. Let G be a locally compact abelian group. Then there exists a Haar
b such that for all f ∈ V 1 (G),
measure dχ on G
Z
f (y) = fˆ(y)χ(y) dχ.
G
b
528
30.2. Fourier Inversion Chapter 30. Duality
= Lχ−1
0
dν̂ψ (χ) by a change of variables
ZGb
= η(ψ).
Hence η is left-invariant, so it follows that dχ is a Haar measure. Explicitly, this satisfies
Z Z
ψ(χ) dχ = 1 dν̂ψ (χ)
G
b G
b
Hence ϕ dχ = dν̂ϕ for all ϕ ∈ F. In particular, for f ∈ V we know fˆ ∈ F from before, and
1
529
30.2. Fourier Inversion Chapter 30. Duality
Finally, we obtain half of the second statement in Theorem 30.2.2, namely, that any
function in V 1 can be recovered from its Fourier transform.
b f 7→ fˆ, is injective.
Corollary 30.2.15. The map V 1 (G) → V 1 (G),
530
30.3. Pontrjagin Duality Chapter 30. Duality
Let G be a topological abelian group, S 1 ⊆ C the complex unit circle and Gb = Homcts (G, S 1 )
the Pontrjagin dual of G. An element χ ∈ G b is called a (complex) character of G. We endow
G
b with the compact-open topology, namely the topology generated by open sets of the form
W (K, V ) where K ⊆ G is compact, V ⊆ S 1 is open and W (K, V ) contains the trivial
character 1 : G → S 1 , g 7→ 1.
Lemma 30.3.1. G
b is a topological abelian group with respect to the compact-open topology.
α : G −→ G
bb
Proof. This amounts to saying that G b separates points in G. Suppose z ∈ G r {e}. We must
produce a character χ ∈ Gb for which χ(z) 6= χ(e). Assume to the contrary that χ(z) = 1 for
b Then for all f ∈ L1 (G),
all χ ∈ G.
Z Z
Lz f (χ) =
d f (zy)χ(y) dy = f (zy)χ(zy) dy = fˆ(χ)
G G
531
30.3. Pontrjagin Duality Chapter 30. Duality
Let 1 ∈ G
b be the trivial character. Then the sets
b V ) = {ψ ∈ G
W (K,
bb
| ψ(χ) ∈ V for all χ ∈ K},
b
532
30.3. Pontrjagin Duality Chapter 30. Duality
Proof. Since α(G) is a locally compact, dense subset of α(G), general topology says that it
is also open in α(G). But in a topological group, open subgroups are also closed, so this
implies α(G) is closed in α(G), hence α(G) = α(G).
Thus to prove Pontrjagin duality, we only need to show that α(G) is dense in G. This
bb
requires an important sequence of results culminating in Plancherel’s theorem.
For f ∈ L1 (G), let f˜(y) = f (y −1 ) as in Lemma 30.3.3.
b fˆ˜(χ) = fˆ(χ).
Lemma 30.3.8. For any f ∈ L1 (G) and χ ∈ G,
Proof. For any f ∈ L1 (G)∩L2 (G), set g = f ∗f˜. Then by the same logic as in Lemma 30.3.3(i),
g is of positive type. Consider
Z Z Z
2
|f (y)| dy = f (y) f (y) dy = f (y −1 ) f (y −1 ) dy by a change of variables
G
ZG G
Z
−1 ˜
= f (y )f (y) dy = g(e) = ĝ(χ) dχ by Fourier inversion
ZG Z G
b
= ˆ ˜
f (χ)f (χ) dχ = fˆ(χ)fˆ˜(χ) dχ by Lemma 30.3.8
ZG ZG
b b
Taking the square root of both sides, we get ||f ||2 = ||fˆ||2 .
Let A
b = A(G)
b be the ring of Fourier transforms of L1 (G) and set
Lemma 30.3.11. A
b1 is an α(G)-invariant subspace of A.
b
533
30.3. Pontrjagin Duality Chapter 30. Duality
= f (y)χ(y0−1 ) χ(y) dy
ZG
= f (y)χ(y0−1 y) dy since χ is a character
ZG
= f (y0 y)χ(y) dy by a change of variables
G
[
=Ly0 f (χ).
b1 is dense in L2 (G).
Lemma 30.3.12. A b
Thus the Fourier transform of the measure g f¯ dχ is trivial. Moreover, g f¯ ∈ L1 (G) b and
dχ is a finite Radon measure, which means g f¯ dχ is also a finite Radon measure, so that
g f¯ dχ = 0 implies g f¯ = 0 a.e. by Proposition 30.2.7. Note that for any χ ∈ G b and h ∈ L1 (G),
c = Lχ ĥ. Therefore if f ∈ A
χh b1 is nonzero and continuous, then for every χ ∈ G,
b there exist a
continuous element of A b1 , namely a translate of f , that is nonzero at χ. By Lemma 30.3.11,
such an f is guaranteed to exist, so g f¯ = 0 a.e. then implies that g = 0 a.e., that is, g = 0
in L2 (G).
b This contradicts our initial assumption, so A b1 is dense in L2 (G).
b
This proves:
Theorem 30.3.13 (Plancherel). Let G be a locally compact, Hausdorff abelian group. Then
b f 7→ fˆ extends by continuity to a map
the Fourier transform L1 (G) ∩ L2 (G) → L2 (G),
F : L2 (G) −→ L2 (G)
b
The map F is called the Plancherel transform of G. We will denote the Plancherel
transform of a function f ∈ L2 (G) by fˆ, even though technically this is an extension of the
Fourier transform.
534
30.3. Pontrjagin Duality Chapter 30. Duality
The Plancherel transform also gives us a converse to the reciprocity formula of Lemma 30.2.11.
Corollary 30.3.15. Let f, g ∈ L2 (G), h ∈ L1 (G) and suppose h = f g pointwise. Then
ĥ = fˆ ∗ ĝ.
Proof. For any χ0 ∈ G,
b we have
Z
ĥ(χ0 ) = f (y)g(y)χ0 (y) dy
ZG
= f (y)g(y)χ0 (y) dy
ZG
= fˆ(χ)(ḡd
χ0 )(χ) dχ
ZGb
= fˆ(χ)ĝ(χ−1 χ0 ) dχ
G
b
= (fˆ ∗ ĝ)(χ0 ).
Therefore ĥ = fˆ ∗ ĝ.
b = {f ∗ g | f, g ∈ L2 (G)}.
Corollary 30.3.16. Set C2 (G) b b = C2 (G).
Then A b
Proof. Take h ∈ L1 (G). Then h can be written as a product of L2 (G) functions, e.g. as
h = r · |r| where
ph(y) , h(y) 6= 0
r(y) = |h(y)|
0, h(y) = 0.
Then ĥ = fˆ ∗ ĝ by Corollary 30.3.15, so A
b ⊆ C2 (G).
b Conversely, Plancherel’s theorem gives
a bijection L2 (G) ↔ L2 (G) b corresponds to fˆ ∗ ĝ = fˆg ∈ A.
b so any element f ∗ g ∈ C2 (G) b
This shows that C2 (G)b ⊆ A.b
Proposition 30.3.17. If U ⊆ G b is a nonempty open set, then there exists a nonzero Fourier
transform fˆ ∈ A
b with support contained in U .
Proof. Since U is nonempty and open, it has (finite) positive measure so by inner regularity,
there exists a compact set K ⊆ U with vol(K) > 0. For all x ∈ K, we can find an open
neighborhood Vx ⊆ G b containing 1 and an open neighborhood Ux ⊆ G b containing x such
that Ux Vx ⊆ U . Since K is compact, there is a compact neighborhood V ⊆ G b containing
1 such that vol(V ) > 0 and KV ⊆ U . Define fˆ = χK ∗ χV where χK , χV ∈ L2 (G) b are the
characteristic functions on K, V , respectively. Then by Corollary 30.3.16, fˆ ∈ A.
b Finally,
ˆ
the support of f by definition is KV ⊆ U , and we have
Z
fˆ(χ) dχ = vol(K) vol(V ) > 0,
G
b
so fˆ is nonzero.
535
30.3. Pontrjagin Duality Chapter 30. Duality
Proof of Theorem 30.3.2. In light of Corollaries 30.3.6 and 30.3.7, it remains to show that
c
α(G) is dense in G. Suppose to the contrary that α(G) is not dense. Then α(G) is a
bb
nonempty open set in G, so by Proposition 30.3.17, there exists a nonzero function ϕ ∈ L1 (G)
bb b
such that ϕ̂|α(G) = 0. This implies that for any y ∈ G,
Z
ϕ(χ)χ(y −1 ) dχ = ϕ̂(α(y)) = 0,
G
b
Corollary 30.3.18. For any locally compact abelian group G, the Fourier transform induces
a bijection V 1 (G) ↔ V 1 (G).
b
Proof. By Corollary 30.2.15, the map is injective so it remains to show surjectivity. Take
F ∈ V 1 (G)
b and define a function f : G → C by
Z Z
f (y) = F (χ)χ(y) dχ = F (χ)α(y −1 )(χ) dχ = Fb(α(y −1 )).
G
b G
b
= f (y)χ(y) dy = fˆ(χ).
G
536
Chapter 31
Functional Equations
Recall the main ingredients in our proof of the analytic continuation and functional equation
∞
X 1
of ζ(s) = in Section 12.1. We defined the extended zeta function
n=1
ns
s
ξ(s) = π −s/2 Γ ζ(s)
2
and showed (Theorem 12.1.2) that ξ(s) has meromorphic continuation to all of C. Further,
the Poisson summation formula (Proposition 12.1.5)
X X
f (n) = fˆ(n)
n∈Z n∈Z
where R is a topological ring (in our case Qp , R, C or the adèle ring AK for a global field K),
χ is a character on R× and f is an Panalogue of a Schwartz function. When R = Fp is a finite
field, this also makes sense for f = cψ ψ where ψ runs over the elements of Homcts (Fp , C× ),
and in this case the zeta function is
p−1
X X
Z(χ, f ) = cψ χ(a)e2πab/p for some b = b(ψ) ∈ Z.
a=1
537
31.1. Local ζ-Functions Chapter 31. Functional Equations
Lemma 31.1.1. Every χ ∈ X(F × ) is of the form χ(x) = µ(x)|x|s for some unitary character
µ on UF and some s ∈ C.
Proof. Write F × = UF × VF . Since UF is compact, its characters are all unitary. On the
other hand, by the above, VF is either R×
>0 or q , and in both cases the characters are of the
Z
s
form t 7→ t for some s ∈ C.
For χ = µ| · |s ∈ X(F × ), call Re(s) the exponent of χ; it is uniquely determined by χ.
Also call χ unramified if χ|UF = 1, that is, if µ is the trivial character. Otherwise, χ is said
to be ramified.
Definition. Define the local L-factor L(χ) for a character χ ∈ X(F × ) as follows. If F is
nonarchimedean with uniformizer πF , set
(
(1 − χ(πF ))−1 , χ is unramified
L(χ) =
1, χ is ramified.
538
31.1. Local ζ-Functions Chapter 31. Functional Equations
Note that for any local field F and character χ ∈ X(F × ), the assignment s 7→ χ| · |s
determines a local L-function L(s, χ) := L(χ| · |s ). Setting χ∨ = χ−1 | · |, called the dual of
χ, we have that
L((χ| · |s )∨ ) = L(1 − s, χ−1 ).
So we start to see a form of duality on the L-factors arise.
Proposition 31.1.2. For a local field F with additive characters Fb = Homcts (F, S 1 ), fix a
nontrivial ψ ∈ Fb and for each a ∈ F , write ψa (x) = ψ(ax). Then every character in Fb is of
the form ψa for some a ∈ F , and the map
Φ : F −→ Fb
a 7−→ ψa
Proof. It’s easy to see that each ψa is a character F → S 1 . Moreover, for any a, b, x ∈ F ,
observe that
ψ((a + b)x) = ψ(ax + bx) = ψ(ax)ψ(bx)
so ψa+b = ψa ψb and hence Φ is a group homomorphism. Since ψ is nontrivial, ψa is also
nontrivial unless a = 0, meaning Φ is injective. We next show im Φ is dense in Fb. Set
H = im Φ and suppose H 6= Fb, so that Fb/H 6= {1} and there exists a nontrivial ψb ∈ Fb
b
such that ψ|b = 0. By Pontrjagin duality (Theorem 30.3.2), ψb is of the form ψ(χ)
H
b = χ(a)
for some a ∈ F . Since ψ|H = 0, ψ(ax) = ψa (x) = 0, but by injectivity, this means a = 0.
b
However, χ(0) = 1 for any χ ∈ Fb, so ψb = 0 on Fb, contradicting nontriviality. Hence H = Fb.
To finish, we show that Φ and Φ−1 are continuous, which will imply that H = im Φ is
closed and hence im Φ = Fb. Consider a closed set Ar = {x ∈ F : |x| ≤ r} for some r > 0.
If a sequence a → 0 in F , then the sequence aAr converges to {0} and ψa (Ar ) converges
to {1} in C. This shows that ψa converges to the trivial character 1 ∈ Fb in the compact-
open topology. On the other hand, suppose a0 ∈ F × such that ψ(a0 ) 6= 1. As ψa → 1
in Fb, eventually ψa (Ar ) must be closer to 1 ∈ C than ψ(a0 ), so a0 6∈ aAr , which can only
happen when a is small. In other words, as ψa → 1 in Fb, a → 0 in F . Hence Φ and Φ−1
are continuous, so H = im Φ is locally compact and in particular closed. This finishes the
proof.
Definition. A Haar measure dx on F is self-dual if dx identifies with its dual measure dχ,
in the sense of Fourier inversion, via the isomorphism F ∼
= Fb.
Definition. A function f : F → C is smooth if F is archimedean and f is analytic, or if
F is nonarchimedean and f is locally constant. A smooth function f is called a Schwartz-
Bruhat function if either:
(1) F is archimedean and p(x)f (x) → 0 pointwise on F for all polynomials p ∈ F [x] (that
is, f decays rapidly); or
539
31.1. Local ζ-Functions Chapter 31. Functional Equations
Definition. Fix an (additive) character ψ ∈ Fb. For each f ∈ SB(F ), define the Fourier
transform of f by Z
ˆ
f (y) = f (x)ψ(xy) dx.
F
Definition. For f ∈ SB(F ) and χ ∈ X(F × ), the local ζ-function for (f, χ) is:
Z
Z(f, χ) := f (x)χ(x) d∗ x.
F×
(ii) If 0 < σ < 1, there exists a meromorphic function of s, γ = γ(χ, ψ, dx), such that
ˆ χ∨ ) = γZ(f, χ).
Z(f,
γL(χ) = εL(χ∨ ).
(b) The poles of Z(f, χ) are of no higher order than the poles of L(χ). Consequently, for
each χ ∈ Fb, the poles of any Z(f, χ) are uniformly bounded.
ˆ χ∨ ) converges absolutely for σ < 1, so Theorem 31.1.3(iii) gives a meromor-
Proof. (a) Z(f,
phic continuation for Z(f, χ).
(b) By Theorem 31.1.3(ii), we can write
ˆ χ∨ ) = εL(χ∨ )Z(f, χ),
L(χ)Z(f,
with ε nonzero and Z(f, ˆ χ∨ ) absolutely convergent for σ < 1. Therefore since the L-factors
have no zeroes, the orders of the poles of Z(f, χ) are bounded by those of L(χ) as claimed.
Proof of Theorem 31.1.3(i). Let χ = µ|·|s with σ = Re(s). The cases when F is archimedean
are routine computations in Fourier analysis, so we will focus on the nonarchimedean case.
Thus our f ∈ SB(F ) is locally constant with compact support. If m = (πF ) is the unique
prime ideal of OF , this means that f factors through mm /mn for some integers m ≤ n. Thus
540
31.1. Local ζ-Functions Chapter 31. Functional Equations
where in the last step, vol(mk r {0}, d∗ x) is invariant for all k, so we may choose k = 0 to
get vol(OF× , d∗ x), and the rational expression is the limit of a convergent geometric series (it
is convergent since σ < 1). Since this number is finite, the integral converges.
Lemma 31.1.5. For all χ ∈ Fb with 0 < σ < 1 and all f, g ∈ SB(F ),
ˆ χ∨ )Z(g, χ).
Z(f, χ)Z(ĝ, χ∨ ) = Z(f,
Proof. Consider
ZZ
∨
Z(f, χ)Z(ĝ, χ ) = f (x)ĝ(y)χ(xy −1 )|y| d∗ x d∗ y
F × ×F ×
ZZ
= f (x)ĝ(xy)χ(y −1 )|xy| d∗ x d∗ y by translation
F × ×F × ×F ×
Z Z
−1
= χ(y )|y| f (x)ĝ(xy)|x| d∗ x d∗ y.
F× F×
So this integral is symmetric with respect to f and g, and hence the lemma follows.
Fix f ∈ SB(F ) and define
ˆ χ∨ )
Z(f,
γ = γ(χ, ψ, dx) := .
Z(f, χ)
541
31.1. Local ζ-Functions Chapter 31. Functional Equations
Proof of Theorem 31.1.3(iii). For different F , we will choose f ∈ SB(F ) and construct an
entire function h = h(f, χ, ψ, dx) such that
542
31.1. Local ζ-Functions Chapter 31. Functional Equations
(Recall that Ur = 1 + mr .)
for some unitary character ω of F × . Let mm be the conductor of the standard character ψ,
mn be the conductor of this unitary character ω and define f : F → C by
(
ψ(x), x ∈ mm−n
f (x) =
0, otherwise.
When n = 0, we have
Z Z
∗
Z(f, χs,0 ) = f (x)χs,0 (x) d x = ψ(x)|x|s d∗ x
F× mm r{0}
Z
= |x|s d∗ x since ψ has conductor mm
mm r{0}
∞
X
= vol(OF× , d∗ x) q −js as in the proof of (i)
j=m
−ms
q
= vol(OF× , d∗ x)
1 − q −s
= q −ms vol(OF× , d∗ x)L(χs,0 )
The following result is easy to verify from the definition of the Gauss sum.
543
31.1. Local ζ-Functions Chapter 31. Functional Equations
and since ω and ψπm−n both have conductor mn , Lemma 31.1.6 also implies that g(ω, ψπm−n ) 6=
0. Moreover, L(χs,n ) = 1 for n > 0 (since χs,n is ramified in this case) so setting g(ω, λ) =
vol(OF× , d∗ x) when both ω, λ have conductor OF = m0 , we can summarize our zeta function
calculations as:
Z(f, χs,n ) = q −(m−n)s g(ω, ψπm−n )L(χs,n ).
ˆ χ∨ ). To do so, we need the following calculation.
Now we compute Z(f, s,n
Lemma 31.1.7. For f = 1mm−n ψ defined above, f ∈ SB(F ) and its Fourier transform is
given by fˆ(y) = vol(mm−n , dx)1mn −1 .
Proof. When n = 0 and m0 = OF , since the conductor of ψ is mm , we have fˆ|F rOF = 0 (by
orthogonality of characters) and fˆ|OF = vol(mm , dx). When n > 0, first suppose y 6∈ mn − 1.
Then vF (y + 1) ≤ n − 1 so x(y + 1) 6∈ mm for any x ∈ mm−n . Hence ψy+1 is a nontrivial
character on mm−n , so we get
Z Z Z
fˆ(y) = f (x)ψ(xy) dx = ψ(x(y + 1)) dx = ψy+1 (x) dx = 0.
F mm−n mm−n
544
31.1. Local ζ-Functions Chapter 31. Functional Equations
1
= vol(mm , dx) vol(OF× , d∗ x)
1 − q −(1−s)
= vol(mm , dx) vol(OF× , d∗ x)L(χ∨s,0 ).
Putting this together with the computation of Z(f, χs,0 ) from above, we get
ˆ χ∨ )
Z(f, vol(mm , dx) vol(OF× , d∗ x)L(χ∨s,0 ) L(χ∨s,0 )
s,0 ms m
γ= = = q vol(m , dx) .
Z(f, χs,0 ) q −ms vol(OF× , d∗ x)L(χs,0 ) L(χs,0 )
This implies h(f, χ, ψ, dx) = q −ms vol(OF× , d∗ x), h(f, ˆ χ∨ , ψ, dx) = vol(mm , dx) vol(O× , d∗ x)
F
and therefore ε(χ, ψ, dx) = q ms vol(mm , dx). As these functions are entire, we have proven
all of (iii) in the case that n = 0.
When n > 0, we similarly compute
Z
ˆ ∨
Z(f, χs,n ) = fˆ(y)χ∨s,n (y)d∗ y
×
ZF
|y|
= vol(m m−n
, dx)ω d∗ y by Lemma 31.1.7
n
m −1 y
Z
= vol(m m−n
, dx) ω(u) d∗ u
n
Zm −1
= vol(mm−n , dx) ω(−u) d∗ u
1+mn
= vol(m m−n
, dx) vol(Un , d∗ x)ω(−1)
using the fact that ω̄ also has conductor mn . Consider the conjugate of the Gauss sum
g(ω, ψπm−n ):
Z
g(ω, ψπm−n ) = ω(u)ψ(π m−n u) d∗ u
×
O
Z F
= ω(u)ψ(−π m−n u) d∗ u
×
OF
Z
= ω(−1) ω(u)ψ(π m−n u) d∗ u
×
OF
= ω(−1)g(ω̄, ψπm−n ).
545
31.1. Local ζ-Functions Chapter 31. Functional Equations
Notice that ω and ψπm−n have the same conductor mn . Now we have
Remark. Let D be the different of the extension F/Qp . Then D = m−d for some d ∈ Z and
if ψ is the standard character on F constructed above, one can show that the conductor of
ψ is md . (This is at least believable since ψ = ψp ◦ tr and the different is defined in terms of
the trace!)
546
31.2. Adèlic and Idèlic Characters Chapter 31. Functional Equations
which has the product topology, and the restricted direct product topology on G is the
weakest topology such that every inclusion GS ,→ G is an open embedding of topological
groups.
In particular, when K is a global field, we defined the adèle ring AK of K and the idèle
group IK of K as restricted direct products for J = {v}, the set of places of K with infinite
places J∞ , by
Y
0
AK = Kv with respect to Hv = Ov for finite v
v
Y
0
IK = Kv× with respect to Hv = Ov× for finite v.
v
Gv ,→ G{v} ⊆ G
Y
g 7−→ {g} × 1.
u6=v
b = Homcts (G, S 1 ) be the set of continuous characters of G, that is, the Pontrjagin dual.
Let G
Lemma 31.2.1. Let χ ∈ G b be a character. Then χ|Hv = 1 for all but finitely many v ∈ J
Q
and for all y = (yv ) ∈ G, χ(y) = v χ(yv ).
547
31.2. Adèlic and Idèlic Characters Chapter 31. Functional Equations
Proof. Let S ⊆ J be the finite set such that χv |Hv = 1 for all v 6∈ S and set m = |S|. Let
U ⊆ S 1 be an open neighborhood of 1 and choose a neighborhood V ⊆ S 1 containing 1 such
that V m ⊆ U . For v ∈ S, choose a neighborhood Nv of 1 ∈ Gv such that χv (Nv ) ⊆ V . Then
Y Y
N := Nv × Hv
v∈S v6∈S
Hv∗ = {χv ∈ G
bv | χv |Hv = 1}.
Y
0 b
ϕ: Gv −→ G b
v
Y
(χv ) 7−→ χv
v
0
Q
is an isomorphism of topological groups, where the restricted direct product v G
bv is with
respect to the compact subgroups Hv∗ , v 6∈ J∞ .
Proof. Lemmas 31.2.1 and 31.2.2 show that ϕ is a bijection, so it remains to show ϕ and
ϕ−1 are continuous. For ϕ, let W (K, U ) Q
be a neighborhood of the trivial character 1 ∈ G.
b
Without loss of generality, assume K = v Kv for Kv ⊆ Gv compact Q and Kv = Hv for all
but finitely many v. Then χ ∈ W (K, U ) is equivalent to χ(K) = v χ(Kv ) ⊆ U . As above,
choose a finite set S ⊆ J such that χ|Kv = 1 for all v 6∈ S and set m = |S|. There is a
neighborhood V ⊆ S 1 containing 1 such that m m
Q 0 V ⊆ U , and we have χ(N ) ⊆ V ⊆ U where
N is the neighborhood of the identity in v Gv given by
Y
N= W (Kv , V ).
v
548
31.2. Adèlic and Idèlic Characters Chapter 31. Functional Equations
Q
On the other hand, sets of the form N = v W (Kv , U ), with Kv = Hv for all but finitely
many v, form a neighborhood basis of the identity in v 0 G
Q b
v , so for any such N ,
!
Y
W Kv , U ⊆ ϕ(N ).
v
then there is a unique Haar measure dg on G = v 0 Gv such that for any finite set J∞ ⊆
Q
S ⊆ J, Y Y
dgS := dg|GS = dgv × dgv |Hv .
v∈S v6∈S
b∼
Then by Proposition 31.2.3, we can consider a dual measure on G = v0 G
Q b
v.
Now for each v ∈ J, let dχv be the dual measure to dgv on G bv . By orthogonality of
characters on a compact group, we have
(
vol(Hv ), χv |Hv = 1
Z
1
bHv (χv ) = χv dgv =
Hv 0, otherwise.
Thus by Fourier inversion (Theorem 30.2.2),
Z
1 = 1Hv (1) = bHv dχv
1
Gbv
Z
= bHv dχv = vol(Hv ) vol(Hv∗ )
1
Hv∗
which implies Hv∗ has volume 1 with respect to dχv for all but finitely many v. Hence
Y
dχ := dχv
v
is a well-defined Haar measure on Gb which restricts to the product measure on each G bS for
J∞ ⊆ S ⊆ J finite.
Corollary 31.2.5. dχ is equal to the dual measure of dg on G, b that is, for all f ∈ V 1 (G),
Z
f (g) = fˆ(χ)χ(g) dχ.
G
b
549
31.3. Schwartz-Bruhat Functions and Riemann-Roch Chapter 31. Functional Equations
Proof. This can be found in Wawrzynczyk’s “On tempered distributions and Bochner-
Schwartz theorem on arbitrary locally compact abelian groups”.
Lemma 31.3.2. For any global field K, there exists a nontrivial unitary character ψ : AK →
S 1 satisfying ψ|K = 1.
Q
Proof. For K/Q, let ψ(x) = v ψv (xv ) where ψv is the standard nontrivial character on Kv
from Section 31.1. Then by construction ψ ∈ AcK and ψ|K = 1 since ψv |Ov = 1 for all finite
places v.
Proof. For any f = (fv ) ∈ S(AK ), fˆ = (fˆv ) and each fˆv ∈ SB(Kv ). By the remark at the
end of Section 31.1, the conductor of ψv is Dv−1 = mdv , where Dv is the different of Kv , and
for all but finitely many v, Dv−1 = Ov . Also, fv |Ov = 1 for all but finitely many v, and for
those v we have
Z Z
fˆv (yv ) = fv (xv )ψv (xv yv ) dxv = ψv (xv yv ) dxv
Kv Ov
(
1, yv ∈ Ov
=
0, yv 6∈ Ov .
Thus fˆv |Ov = 1 for all but finitely many v, so it follows that fˆ = v fˆv ∈ S(AK ).
Q
550
31.3. Schwartz-Bruhat Functions and Riemann-Roch Chapter 31. Functional Equations
= f (x) h(y)ψ(xy) dy dx
AK AK
Z Z
= h(y) f (x)ψ(xy) dx dy by Fubini’s theorem
AK AK
Z Z
= ˆ
h(y)f (y) dy = fˆ(y)fˆ(y) dy
ZAK AK
= |fˆ(x)| dx.
AK
where dx ¯ is the measure on AK /K induced by dx. For y ∈ K, ψ(xy) = ψ((a + x)y) holds
for all x ∈ AK /K, a ∈ K. Thus
Z Z X
ˆ
f (y) = f (x)ψ(xy) dx = f (a + x)ψ((a + x)z) dx = Fb(y)
AK AK /K a∈K
P
where F (y) = a∈K f (a + x). Note that F is defined for all y ∈ AK and by Pontrjagin
duality (Theorem 30.3.2) applied to AK /K and K, we get
X
Fb(y) = Fb(a)ψ(ay)
a∈K
551
31.3. Schwartz-Bruhat Functions and Riemann-Roch Chapter 31. Functional Equations
for any x ∈ IK .
Proof. Fix x ∈ IK and consider the function h(y) = f (xy) defined for y ∈ AK . Then by
Poisson summation,
X X
h(a) = ĥ(a)
a∈K a∈K
XZ
= f (xy)ψ(ay) dy
a∈K AK
X 1 Z
= f (y)ψ(ayx−1 ) dx by y 7→ yx−1
a∈K
|x| AK
1 X
= fˆ(ax−1 ).
|x| a∈K
552
31.3. Schwartz-Bruhat Functions and Riemann-Roch Chapter 31. Functional Equations
div : IK −→ Div(K)
X
(xv ) 7−→ v(xv )v.
v
553
31.3. Schwartz-Bruhat Functions and Riemann-Roch Chapter 31. Functional Equations
Since mdvv may be identified with the inverse different Dv−1 of Kv and Dv−1 = Ov for all but
finitely many v, we have dv = 0 for all but finitely many v. Thus K ∈ Div(K). Since ψ
Q (see Proposition 31.4.1(i) below), the class [K] ∈ Pic(K) is uniquely
is unique up to scaling
defined. Take f = v 1Ov ∈ S(AK ) so that by the proof of Proposition 31.3.7,
X
q `(D) = f (axD )
a∈K
P
for any D = v mv v with mv = v((xD )v ). On the other hand,
1 Y P
= qvmv = q v mv deg(v) = q deg(D) .
|xD | v
n /2
For all places v, we have fˆv = (1mnv v )1/2 = qv v = q deg(v)nv /2 . Taking the product over all v,
we get Y Y
fˆv =
P
q deg(v)nv /2 = q v deg(v)nv /2 = q − deg(K)/2 = q 1−g .
v v
Thus (
q 1−g , v(a) ≥ mv + nv
fˆ(ax−1
D ) =
0, v(a) < mv + nv .
These of course are the conditions defining membership in L(D), so we conclude that
X X
fˆ(ax−1
D ) = q −g+1 = q `(K−D)−g+1 .
a∈K a∈K
v(a)≥mv +nv
554
31.4. Global Zeta Functions and Functional Equations Chapter 31. Functional Equations
555
31.4. Global Zeta Functions and Functional Equations Chapter 31. Functional Equations
(iv) The only poles of Z(f, χ) occur when χ = | · |σ+iτ for τ ∈ R, in which case the poles
are at s = iτ, 1 + iτ and have residues
1 1 ˆ
Res(Z(f, χ); iτ ) = − vol(CK )f (0) and Res(Z(f, χ); 1 + iτ ) = vol(CK )f (0).
Proof of (i). Let S be the finite set of places v for which fv |Ov 6= 1. We may write f as a
linear combination of characteristic functions on these finitely many Ov . Consider
Z Y Z
∗
|f (x)| |χ(x)| d x = cv |fv (xv )| |xv |σ−1
v dxv
IK v Kv×
Y Z Y Z Y Z
= cv |xv |σ−1
v dxv × |fv (xv )| |xv |σ−1
v dxv × cv |xv |σ−1
v dxv .
v∈S mm v
v r{0} v∈S∞ Kv× v6∈S∪S∞ Ov×
R
For v ∈ S∞ , we remarked in the proof of Theorem 31.1.3(i) that Kv× |fv (xv )| |xv |σ−1 v dxv
converges for σ > 0 by routine calculations. Since S∞ is a finite set, the second factor above
is finite for σ > 0. For v 6∈ S∞ , our computations in the proof of Theorem 31.1.3(iii) showed
that for each v, f |Ov = 1mm v
v for some mv ≥ 0, and we have
q −mv σ
Z
cv |xv |σ−1
v dxv = vol(Ov× , d∗ xv ) v −σ
mm v
v r{0}
1 − qv
for σ > 0. Thus the first factor above corresponding to v ∈ S is a finite product of finite
integrals, so it too converges. Finally, since cv = qvq−1
v
, the third factor becomes
Y 1
vol(Ov× , d∗ xv ).
v6∈S∪S∞
1 − qv−σ
The product v6∈S∪S∞ 1−q1−σ now converges for σ > 1 by a similar proof to that of Theo-
Q
v
rem 10.3.1. Hence all three factors converge, so Z(f, χ) converges absolutely when σ > 1.
Now assume K is a number field. For f ∈ S(AK ) and χ ∈ Ch(IK ) with σ > 1, the
decomposition IK ∼
= I1K × R>0 (from Theorem 16.3.2) allows us to write
Z Z Z
∗ dt
Z(f, χ) = f (x)χ(x) d x = f (tx)χ(tx) d∗ x .
IK R>0 I1K t
Proposition 31.4.3. For all t ∈ R>0 , f ∈ S(AK ) and χ ∈ Ch(IK ) with σ > 1,
Z Z
Zt (f, χ) + f (0) ∗ ˆ ∨ ˆ
χ(tx) d x = Zt−1 (f, χ ) + f (0) χ∨ (t−1 x) d∗ x.
1
CK 1
CK
556
31.4. Global Zeta Functions and Functional Equations Chapter 31. Functional Equations
1
Proof. By definition, CK = I1K /K × so we can write
Z Z ! Z
X
∗ ∗
Zt (f, χ) + f (0) χ(tx) d x = χ(tx) f (atx) d x + f (0) χ(tx) d∗ x
1
CK 1
CK 1
CK
a∈K ×
Z !
X
= χ(tx) f (atx) d∗ x
1
CK a∈K
Z !
1 X ˆ −1 −1
= χ(tx) f (at x ) d∗ x by Riemann-Roch (31.3.5)
1
CK |tx| a∈K
Z !
X
= |t−1 x|χ(t−1 x) fˆ(at−1 x) d∗ x by x 7→ x−1
1
CK a∈K
Z ! Z
X
= −1
|t x|χ(t x) −1
fˆ(at−1 x) d x + fˆ(0)
∗
χ∨ (t−1 x) d∗ x
1
CK 1
CK
×
Z a∈K
ˆ χ∨ ) + fˆ(0)
= Zt−1 (f, χ∨ (t−1 x) d∗ x.
1
CK
Proof of Theorem 31.4.2(ii) – (iv). We give the proof when K is a number field and leave
the function field case as an exercise. In this case, IK = I1K × R>0 and we have
Z ∞
dt
Z(f, χ) = Zt (f, χ)
t
Z0 1 Z ∞
dt dt
= Zt (f, χ) + Zt (f, χ)
t t
Z0 1 Z1
dt
= Zt (f, χ) + f (x)χ(x) d∗ x
0 t C
where C = {x ∈ IK : |x| ≥ 1}. Note that since f ∈ S(AK ), the second integral converges for
all s. For the first term, Proposition 31.4.3 allows us to write
Z 1 Z 1
dt ˆ χ∨ ) dt + E
Zt (f, χ) = Zt−1 (f,
0 t 0 t
where
!
Z 1 Z Z
dt
E = E(f, χ) = fˆ(0)χ∨ (t−1 ) χ∨ (x) d∗ x − f (0)χ(t) χ(x) d∗ x .
0 1
CK 1
CK t
557
31.4. Global Zeta Functions and Functional Equations Chapter 31. Functional Equations
so since fˆ ∈ S(AK ) and χ∨ ∈ Ch(IK ), this integral converges by the work above. Thus the
meromorphic continuation of Z(f, χ) is proven once we show that E is meromorphic. We
analyze two cases below.
If χ is nontrivial on I1K , then by orthogonality of characters, the integrals
Z Z
∗
χ(x) d x and χ∨ (x) d∗ x
1
CK 1
CK
are both zero, so E = 0 (which is holomorphic). The interesting case is when χ is trivial on
I1K . In this case, it must be of the form χ = | · |s = | · |σ+iτ for τ ∈ R, and E looks like
Z 1
E= fˆ(0)tσ−1 vol(CK
1
) − f (0)tσ vol(CK
1
)
0
!
ˆ(0)
f f (0)
1
= vol(CK ) −
σ−1 σ
which is meromorphic. Therefore E is meromorphic in all cases, proving (ii). Also notice
that E(f, χ) = E(f,ˆ χ∨ ). Moreover, the only poles of Z(f, χ) occur when χ = | · |σ+iτ and
these occur at σ = 0, 1 and the residues of Z(f, χ) at s = iτ, 1 + iτ are as claimed in (iv).
Finally, our computations above give us
Z ∞ Z ∞
dt ˆ χ∨ ) dt + E(f, χ)
Z(f, χ) = Zt (f, χ) + Zt (f,
t t
Z1 ∞ Z 1
Z ∞Z
dt dt
= f (tx)χ(tx) d∗ x + fˆ(tx)χ∨ (tx) d∗ x + E(f, χ).
1 IK t 1 IK t
Meanwhile,
Z ∞ Z ∞
ˆ χ∨ ) = ˆ ∨ dt ˆ χ∨ ) dt + E(f,ˆ χ∨ )
Z(f, Zt (f, χ ) + Zt (f,
t t
Z1 ∞ Z 1
Z ∞Z
ˆ ∨ ∗ dt dt ˆ χ∨ )
= f (tx)χ (tx) d x + f (−tx)χ(tx) d∗ x + E(f,
t t
Z1 ∞ ZIK Z1 ∞ ZIK
dt dt
= fˆ(tx)χ∨ (tx) d∗ x + f (tx)χ(tx) d∗ x + E(f, χ) by x 7→ −x.
1 IK t 1 IK t
558
31.5. Hecke L-Functions Chapter 31. Functional Equations
qv−ms
Since each µv is unitary, the entire sum is dominated by the sum of the m
terms, which
we analyze as follows:
∞ ∞
XX qv−ms X X X qv−ms
=
v∈S m=1
m p m=1
m
v|p
where p runs over all prime integers and v runs over the places of K lying over p. Since the
number of v lying over p is bounded by n = [K : Q] and qv is a pth power for each of these
v, we get
∞ ∞
XXX qv−ms XX p−mσ
≤n
p m=1
m p m=1
m
v|p
!
Y 1
= n log .
p
1 − ps
This converges for σ > 1 since it is the Euler product of the Riemann zeta function (see
Theorem 10.3.1), so L(χ) converges for σ > 1.
Definition. For χ ∈ Ch(IK ), the function
L(s, χ) := L(χ| · |s )
559
31.5. Hecke L-Functions Chapter 31. Functional Equations
When v was archimedean, we even had Z(fv , χv | · |sv ) = L(s, χv ). For v nonarchimedean with
standard character ψv on Kv , we set fv ≡ ψv on mvmv −nv where mm v was the conductor of ψv
v
is meromorphic and satisfies Z(f, χ| · |s ) = h(f, χ)L(s, χ), proving the meromorphic contin-
uation.
560
31.5. Hecke L-Functions Chapter 31. Functional Equations
(iii) Let f = v fv be as above. From Theorem 31.4.2(iv), we know the poles of Z(f, χ|·|s )
Q
occur exactly when χ = | · |−iτ for τ ∈ R, and the poles are s = iτ, 1 + iτ with residues
1 1 ˆ
− vol(CK )f (0) and vol(CK )f (0), respectively. By construction, f (0) = 1 and
Y Y Y
fˆ(0) = fˆv (0) = vol(Ov× , d∗ xv ) = |N (Dv )|−1/2 = |N (DK )|−1/2 .
v v6∈S∞ v6∈S∞
of norm 1 S-idèles. Let CK,S = I1K /K × I1K,S , which is a finite group of order hS when S 6= ∅
(called the S-class number) by Theorem 16.3.7. Then from the same theorem, we get an
exact sequence
1 → I1K,S /(K × ∩ I1K,S ) → CK
1
→ CK,S → 1.
This implies
1
vol(CK ) = hS vol(I1K,S /(K × ∩ I1K,S )).
Assume K is now a number field and S = S∞ , the set of archimedean places of K. We
may write |S∞ | = r1 + r2 where r1 is the number of real embeddings of K and r2 is the
number of pairs of complex conjugate embeddings of K. Define the logarithmic map
Also let H be the hyperplane in Rr1 +r2 = {(tv )v∈S∞ } defined by the equation
X X
tv + 2 tv = 0.
v real v complex
Lemma 31.5.3. For every number field K, im λ = H and ker λ = I1K,∅ = IK,∅ .
Q
Proof. Since v∈S∞ |xv |v = 1, we have
!
Y X X X
0 = log |xv |v = log |xv |v = log |xv | + 2 log |xv |.
v∈S∞ v∈S∞ v real v complex
Thus im λ ⊆ H. On the other hand, for (tv )v∈S∞ ∈ H, we may choose an idèle (xv ) ∈ I1K,S∞
with xv = 1 for all finite v and |xv |v = etv for all infinite v. This shows that im λ ⊇ H, so
they are equal. The identification ker λ = I1K,∅ is trivial.
Definition. The restriction of λ to K × ∩ I1K,S∞ is called the regulator map of K, written
reg : I1K,S∞ → Rr1 +r2 .
By Lemma 31.5.3, ker(reg) = K × ∩I1K,∅ = µ(K), the set of roots of unity in K. Moreover,
× ×
since OK = K × ∩ I1K,∅ by definition, it makes sense to define L = reg(OK ) ⊂ H. Since
1 1 ×
CK = IK /K is compact (Theorem 16.3.3), the quotient H/L is compact, or in other words,
L is a complete lattice in H.
561
31.5. Hecke L-Functions Chapter 31. Functional Equations
This allows us to define a Haar measure ν = νreal × νcomplex × νf inite on I1K,∅ by:
for v real, Uv = {±1} so we take νreal to be the counting measure;
for v complex, Uv = S 1 so we take νcomplex to be the standard Lebesgue measure on
S 1 ⊂ C;
for finite v, we take νf inite to be the product of the d∗ xv over all finite v.
Thus the volume of each Uv is given by
2,
v real
vol(Uv , ν) = 2π, v complex
−1/2
N (Dv ) , v finite.
×
reg
1 µ(K) OK L 0
λ
1 I1K,∅ I1K,S∞ H 0
×
1 I1K,∅ /µ(K) I1K,S∞ /OK H/L 0
1 1 0
562
31.5. Hecke L-Functions Chapter 31. Functional Equations
On the other hand, using the bottom row and the induced measures on each quotient, we
get
1
vol(CK ) × 2r1 (2π)r2 RK
= vol(I1K,S∞ /OK ) = vol(I1K,∅ /µ(K)) vol(H/L) = p .
hK |µ(K)| |dK |
1
Solving for vol(CK ) gives the desired formula.
Corollary 31.5.5 (Class Number Formula). Let ζK (s) be the Dedekind zeta function of a
number field K. Then
2r1 (2π)r2 hK RK
Res(ζK ; 1) = p .
|µ(K)| |dK
1
Proof. By the proof of Theorem 31.5.2, ζK (s) = L(s, 1f ) = h(s) Z(f, 1f ) where h(s) =
Q
v hv (s) is the product of the local hv functions
(
m (s−1/2)
qv v , nv = 0
hv (s) = −(mv −nv )s
qv g(χv , ψmvmv −nv ), nv 6= 0.
Example 31.5.6. Let ζ(s) = L(s, 1f ) be the Riemann zeta function (the L-function for
the finite part of the trivial character χ = 1 on K = Q). The only archimedean place of
Q corresponds to the usual absolute value with completion R, so by the definition of the
L-factors in Section 31.1,
s
L(s, 1) = L(s, χ∞ )L(s, 1f ) = π −s/2 Γ ζ(s),
2
which is the completed zeta function ξ(s) defined in Section 12.1. On the other hand,
∨ ∨ ∨ −(1−s)/2 1−s
L(1 − s, 1 ) = L(1 − s, χ∞ )L(1 − s, 1f ) = π Γ ζ(1 − s),
2
and by Theorem 31.5.2, L(s, 1) = L(1 − s, 1∨ ) so this proves ξ(s) = ξ(1 − s), as we saw in
Corollary 12.1.6. Moreover, Theorem 31.5.2 also shows that the only poles of L(s, 1) are at
s = 0, 1. Theorem 31.5.4 shows that L(s, 1) in fact has simple poles at s = 0, 1. At s = 0,
s
we know Γ 2 has a simple pole, so this implies ζ(s) is holomorphic at s = 0. On the other
hand, Γ 2s is holomorphic at s = 1, so it follows that ζ(s) has a simple pole at s = 1.
563
31.5. Hecke L-Functions Chapter 31. Functional Equations
Proposition 31.5.7. Fix m ≥ 3 and let Fm = Q(e2π/m ) be the mth cyclotomic extension of
Q. Then Y
ζFm (s) = L(s, χ)
χ
for all p. Fix p and set t = p−s . Since the factor 1 − χ(p)t is trivial when χ is ramified, we
may assume the product on the right is taken over all unramified characters χ. Now Fm /Q
is Galois, so by Corollary 14.5.14, f = fv is constant on the set of places v | p. Let g be the
number of such places, so that
Y Y
(1 − tfv ) = (1 − tf )g = (1 − zt)g .
v|p z f =1
Thus it suffices to show that for each f th root of unity z, there are exactly g characters χ
\ × of Dirichlet
with χ(p) = z. To see this is true, define a homomorphism on the set (Z/mZ)
characters mod m by
\ × −→ µf
evp : (Z/mZ)
χ 7−→ χ(p).
where hm , Rm , µm and dm are, respectively, the class number, regulator, group of roots of
unity and discriminant of Fm .
Now for any finite abelian extension K/Q, the Kronecker-Weber theorem (17.8.10) says
that K ⊆ Fm for some m. Set G = Gal(K/Q) and Gm = Gal(Fm /Q) so that G is a quotient
of Gm . Taking duals, we then have that Gb is a subgroup of G
bm .
Proposition 31.5.8. For any finite abelian extension K/Q with Galois group G,
Y
ζK (s) = L(s, χ).
χ∈G
b
564
31.5. Hecke L-Functions Chapter 31. Functional Equations
Corollary 31.5.9. For any finite abelian extension K/Q with Galois group G,
Y 2r1 (2π)r2 hK RK
L(1, χ) = p .
|µ(K)| |dK |
χ∈Gr{1}
b
−g(χ) X
L(1, χ) = χ(a) log(1 − e−2πia/m )
m a mod m
565
Part VII
Modular Forms
566
Chapter 32
Modular Forms
This chapter gives an overview of the theory of modular forms from a number theoretic
perspective. Good resources for this introductory material are Serre’s A Course in Arith-
metic, Diamond-Shurman’s A First Course in Modular Forms and Koblitz’s Introduction to
Elliptic Curves and Modular Forms. More advanced material, including the connections be-
tween modular forms and algebraic geometry, can be found in Iwaniec’s Topics in Classical
Automorphic Forms and Milne’s Modular Functions and Modular Forms.
Modular forms are functions on the upper half-plane in C which have certain delicate
properties allowing for the systematic study of the complex structure on this upper half-
plane. It turns out that a large class of Riemann surfaces have as their universal cover the
upper half-plane, and modular functions then allow one to describe all holomorphic functions
on these Riemann surfaces. In particular, modular forms have such a name because they
arise as sections of line bundles on various moduli spaces of these Riemann surfaces. In our
setting, we will examine the forms coming from moduli spaces of elliptic curves.
567
32.1. The Upper Half-Plane Chapter 32. Modular Forms
ρ −ρ̄
i
Re(z)
−1 − 12 1
2 1
Im(z)
568
32.1. The Upper Half-Plane Chapter 32. Modular Forms
We specify three points on the boundary of D: the fourth root of unity i = eiπ , the third
root of unity ρ = e2πi/3 and its negative conjugate, the sixth root of unity −ρ̄ = eπi/3 . Define
two matrices S, T ∈ SL2 (Z) by
0 −1 1 1
S= and T =
1 0 0 1
which act on z ∈ h by S(z) = − z1 and T (z) = z + 1. Also let S, T denote the images of these
matrices in Γ = P SL2 (Z).
Theorem 32.1.1. For Γ = P SL2 (Z), D ⊂ h and S, T ∈ Γ as above,
(1) D is the fundamental domain for the action of Γ on h.
hS, T | S 2 , (ST )3 i,
569
32.2. Modular Functions and Modular Forms Chapter 32. Modular Forms
570
32.2. Modular Functions and Modular Forms Chapter 32. Modular Forms
For each k, let Sk be the space of cusp forms of weight 2k. We will show that M ∼
= C[G4 , G6 ]
is a polynomial ring in two distinguished modular forms G4 , G6 .
We next discuss the connection between lattices and modular forms. Set
M = {(ω1 , ω2 ) ∈ C× | Im(ω1 /ω2 ) > 0}.
Then each point (ω1 , ω2 ) ∈ M defines a lattice Λ = Zω1 ⊕ Zω2 ⊂ C, though not a unique
one. Each element of SL2 (Z) acts on M by
a b
(ω1 , ω2 ) = (aω1 + bω2 , cω1 + dω2 ).
c d
Then the quotient M/SL2 (Z) naturally identifies with the set of all lattices in C. Further,
two lattices (ω1 , ω2 ) and (η1 , η2 ) are called homothetic if there exists some λ ∈ C× such that
ωi = ληi for i = 1, 2. The set of homothety classes of lattices in C naturally identifies with
the set of complex elliptic curves E/C via
Λ = (ω1 , ω2 ) ←→ E = C/Λ.
(See Section 26.2.) Let R be the set of all lattices in C.
Definition. A function F : R → C is called a modular lattice function of weight 2k if
for all Λ ∈ R and λ ∈ C× , we have
F (λΛ) = λ−2k F (Λ).
For short, we will write F (ω1 , ω2 ) = F (Zω1 ⊕ Zω2 ). Notice that for any lattice function
F of weight 2k, the value ω22k F (ω1 , ω2 ) depends only on the ratio ωω21 . We can use this to
build modular functions out of lattice functions.
Lemma 32.2.2. For a lattice function F : R → C of weight 2k, the function f : h → C
defined by
ω1
f = ω22k F (ω1 , ω2 )
ω2
−2k a b
satisfies the weight 2k modular condition f (z) = (cz + d) f (gz) for any g = ∈
c d
SL2 (Z). Thus if f is holomorphic, it is a weakly modular function of weight 2k.
Proof. Take ω1 , ω2 ∈ C× and consider
ω1 aω1 + bω2
f g· =f = (cω1 + dω2 )2k F (aω1 + bω2 , cω1 + dω2 )
ω2 cω1 + dω2
2k ω1 ω1
= (cω1 + dω2 ) F ω2 a + b , ω2 c + d
ω ω2
2
2k −2k ω1 ω1
= (cω1 + dω2 ) ω2 F a + b, c + d since F has weight 2k
ω2 ω2
2k
ω1 ω1
= c +d F ,1 since F is SL2 (Z)-invariant
ω2 ω2
2k
ω1 ω1
= c +d f .
ω2 ω2
571
32.2. Modular Functions and Modular Forms Chapter 32. Modular Forms
Conversely, the formula f ωω12 = ω22k F (ω1 , ω2 ) defines a modular lattice function F
of weight 2k for every (weakly) modular function of the same weight, so we can identify
all (weakly) modular functions with some subset of the set of modular lattice functions.
In particular, this identification is useful for producing examples of modular functions and
forms.
converges for all σ ∈ C with Re(σ) > 2. Using this, for each k ≥ 2 we can define a lattice
function Gk : R → C by
X 1
Gk (Λ) = .
γ 2k
γ∈Λr{0}
is a weakly modular function of weight 2k. (To see that Gk converges uniformly on h, first
observe that it converges uniformly on the fundamental domain D since for any z ∈ D,
|mz + n| is bounded below by |mρ − n|. Now extend this convergence to all of h by applying
the action of SL2 (Z) and the modular condition on the lattice function Gk .)
What happens at infinity? Viewing ∞ = i∞, it is enough to consider the limit of Gk (z)
as z → ∞ within D, but since the series Gk converges uniformly on D, we may take the
limit term-by-term to get
X 1
Gk (∞) = lim Gk (z) = = 2ζ(2k).
Im(z)→∞ n2k
n∈Zr{0}
572
32.2. Modular Functions and Modular Forms Chapter 32. Modular Forms
∆ = (60G2 )3 − 27(140G3 )2
called the modular discriminant. (Note the resemblance to the discriminant formula for an
elliptic curve; see Section 23.1.) It is common to write g2 = 60G2 and g3 = 140G3 , so that
∆ = g23 − 27g32 .
π4 2π 6
By Example 32.2.3, G2 (∞) = 2ζ(4) = 45
and G3 (∞) = 2ζ(6) = 945
, so we have
3 2
4π 4 8π 6 64π 12 64π 12
∆(∞) = − 27 = − = 0.
3 27 27 27
C/Λ −→ C2 ∪ {∞}
u 6= 0 7−→ (℘(u), ℘0 (u))
0 7−→ ∞
which realizes the elliptic curve E = C/Λ as a complex planar curve. This illustrates one of
the important connections between modular forms and elliptic curves.
573
32.3. Modular Functions as Sections Chapter 32. Modular Forms
pn : B −→ B
z 7−→ z n
for n ≥ 1. Let k ≥ 1 and let Ω⊗k be the kth tensor power of the sheaf of meromorphic
differentials Ω = ΩB/C on B (see Section 22.4). Then for a section ω ∈ Γ(B, Ω⊗k ), we may
write ω = f dz k for some rational function f on B. Pulling this back under pn , we have
which shows that ord0 (p∗n ω) = k(n − 1) + n ord0 (ω). This construction passes to the curve
X as follows.
Note that the map p : h∗ → h∗ /Γ = X is a local homeomorphism except at the Γ-orbits of
the points i, ρ and ∞, so p is a branched cover with these points as branch points. Locally at
i, the cover is given by z 7→ z 2 , while at ρ, it is given by z 7→ z 3 . Thus the orders of vanishing
at these points of any differential form may be computed using the computations above. At
∞, we know q = e2πiz is a local parameter, so if ω = g(q) dq k for some q holomorphic at ∞,
then
p∗ ω = (2πi)k g(q)q k dz k
and so ord∞ (p∗ ω) = k + ordq=0 (ω). Now let f be a modular function of weight 2k and set
ω = f dz k ∈ Γ(X, Ω⊗k ), where Ω = ΩX/C . Then we have proven:
Lemma 32.3.2. For any f ∈ M2k , ω = f dz k ∈ Γ(X, Ω⊗k ) and the principal divisor (ω)
has degree −2k.
Proof. Since f is modular, ω descends to the quotient X = h∗ /Γ. Since the total degree of
each form in ΩP1C is −2 by Corollary 22.6.2, it follows that (ω) has degree −2k.
574
32.3. Modular Functions as Sections Chapter 32. Modular Forms
Corollary 32.3.4. Let k ∈ Z and let ∆ ∈ S6 be the cusp form ∆ = (60G2 )3 − 27(140G3 )2 .
Then
(b) ∆ 6= 0.
Proof. (a) Every f ∈ Mk is holomorphic, so for k < 0 there is no way for the order formula
in Lemma 32.3.3 to be satisfied unless f ≡ 0. Likewise, when k = 1 the right-hand side of
the formula is 16 and there are no positive integers a, b, c, d satisfying a + 12 b + 13 c + d = 61 .
Hence M1 = 0.
(b) Since G2 ∈ M2 , the formula in Lemma 32.3.3 has 13 on the right, so ordi (G2 ) =
0, ordρ (G2 ) = 1 and hence G2 (i) 6= 0 and G2 (ρ) = 0. Similarly for G3 ∈ M3 , we have
ordi (G3 ) = 1, ordρ (G3 ) = 0 and therefore G3 (i) = 0 and G3 (ρ) 6= 0. Since ∆ is a linear
combination of G32 and G23 , this shows that ∆(i) and ∆(ρ) are both nonzero. In particular,
∆ is nontrivial.
(c) The order formula also shows that ∆ has a simple zero at ∞. If f ∈ Sk+6 is a cusp
f f
form, then f (∞) = 0 so ∆ is holomorphic and hence ∆ ∈ Mk . As ∆ 6= 0, this clearly
establishes the isomorphism Mk → Sk+6 , g 7→ g∆.
(d) In general, if k − 6 < 0 then by (a), Mk−6 = 0. By (c), this implies Sk = 0, so there
are no cusp forms in Mk . In other words, the map Mk → C sending f 7→ f (∞) is injective,
so it follows that for k < 6, dim Mk ≤ 1. Since Eisenstein series exist and are nontrivial for
k = 2, 3, 4, 5, we therefore have dim Mk = 1 for each of these k and dim Mk = C[Gk ].
575
32.3. Modular Functions as Sections Chapter 32. Modular Forms
576
32.4. q-Expansions Chapter 32. Modular Forms
32.4 q-Expansions
In this section we further study the coefficients an in the q-expansion
∞
X
f (q) = an q n
n=0
of a modular form f ∈ M. First, recall that the kth even Bernoulli number Bk can be
defined as the kth coefficient in the Laurent series expansion
∞ 2k
x x X k+1 Bk x
= 1 − + (−1) .
ex − 1 2 k=1 (2k)!
Comparing the coefficients of z 2k in both expressions of z cot z gives the desired formula.
577
32.4. q-Expansions Chapter 32. Modular Forms
Example 32.4.3. Proposition 32.4.2 gives the well-known values of the zeta function ζ(2) =
π2 4
6
and ζ(4) = π90 .
Note that in the proof of Proposition 32.4.2, z cot z may alternatively be expressed as
∞
X z2
z cot z = 1 + 2 .
n=1
z 2 − n2 π 2
Evaluating at πz and dividing out by z, we have two equivalent expressions for π cot(πz):
∞
1 X 1 1
π cot(πz) = + +
z m=1 z + m z − m
∞
2πi X
and π cot(πz) = πi − = πi − 2πi qn.
1 − e2πiz n=0
Equating these two expressions and taking the kth derivative with respect to z yields the
following formula:
Lemma 32.4.4. For all k ≥ 2,
∞
X 1 (−2πi)k X k−1 d
= d q .
m∈Z
(m + z)k (k − 1)! d=1
Then the q-expansion of the kth Eisenstein series Gk (z) (Example 32.2.3) may be written in
terms of these σk (n) as follows.
Proposition 32.4.5. For all k ≥ 2,
∞
2(2πi)2k X
Gk (z) = 2ζ(2k) + σ2k−1 (n)q n .
(2k − 1)! n=1
Proof. By definition,
∞ X
X 1 X 1
Gk (z) = = 2ζ(2k) + 2 .
(mz + n)2k m=1 n∈Z
(mz + n)2k
(m,n)6=(0,0)
578
32.4. q-Expansions Chapter 32. Modular Forms
Example 32.4.6. The weight 2 Eisenstein series G1 (z) (see Example 32.2.3) may not be a
modular form, but the proof of Proposition 32.4.5 still goes through, so we have
∞
X
G1 (z) = 2ζ(2) − 8π 2 σ1 (n)q n .
n=1
By Corollary 32.3.4, Mk has dimension 1 for k = 2, 3, 4, 5 so there are relations among the
Ek (z) for small values of k. In fact, we have
E22 = E4 , E2 E3 = E5 , E2 E5 = E7 , E3 E4 = E7 .
Comparing the q-expansions of these identities, we obtain the following interesting relations
among the generalized divisor sum functions:
n−1
X
σ7 (n) = σ3 (n) + 120 σ3 (n)σ3 (n − m)
m=1
n−1
X
11σ9 (n) = 21σ5 (n) − 10σ3 (n) + 5040 σ3 (n)σ5 (n − m).
m=1
Let f (z) = ∞ n
P
n=0 an q be any modular form of weight 2k. In the next few results, we
give bounds on the growth of an .
Proposition 32.4.8. For the Eisenstein series f (z) = Gk (z), there exist constants A, B > 0
such that
An2k−1 ≤ |an | ≤ Bn2k−1
for all n ≥ 0. That is, |an | grows at the same rate as n2k−1 .
Proof. By Proposition 32.4.5, there is a positive number A such that an = (−1)k Aσ2k−1 (n),
so we certainly have
|an | = Aσ2k−1 (n) ≥ An2k−1 .
on the other hand,
∞
|an | Aσ2k−1 (n) X 1 X 1
= = A ≤ A = Aζ(2k − 1).
n2k−1 n2k−1 d2k−1 d=1
d 2k−1
d|n
579
32.4. q-Expansions Chapter 32. Modular Forms
Theorem 32.4.9 (Hecke). If f (z) is a cusp form of weight 2k, then an = O(nk ), i.e. |annk | is
bounded as n → ∞.
This shows that |an | ≤ M y −k e2πny which works for all y > 0, so we can pick y = 1
n
to get
|an | ≤ M e2π nk .
Corollary 32.4.10. If f (z) is any noncuspidal modular form of weight 2k, then an =
O(n2k−1 ).
Proof. By Corollary 32.3.4(b), we may write f (z) as a linear combination of cusp forms and
Eisenstein series, so Proposition 32.4.8 and Theorem 32.4.9 give the result.
Remark. Deligne showed the following improved bound on an for cusp forms:
an = O(nk−1/2 σ0 (n)).
Further, we can show that σ0 (n) = O(nε ) for any ε > 0, so as a result, we get
an = O(nk−1/2+ε )
Theorem 32.4.11 (Jacobi). The cusp form ∆(z) ∈ S12 has q-expansion
∞
Y
12
∆(z) = (2π) q (1 − q n )24 .
n=1
Q∞
Proof. Set f (z) = q n=1 (1 − q n )24 . It then suffices to show f ∈ S6 since by Corollary 32.3.4,
dim S6 = 1 and clearly ∆(z) and (2π)12 f (z) match in degree 1. Since f is given in terms of
a q-expansion, f (z + 1) = f (z) is guaranteed. Moreover, f is holomorphic and f (∞) = 0
580
32.4. q-Expansions Chapter 32. Modular Forms
We will suppress the indices on each summation from now on, but they are understood to
be the sums over all (m, n) ∈ Z2 in a prescribed order, with (m, n) = (0, 0) or (m, n) =
(0, 0), (1, 0) omitted as appropriate. For fixed (m, n) 6= (0, 0), (1, 0), we have
1 1 1
= −
(m − 1 + nz)(m + nz) m − 1 + nz m + nz
so by telescoping series, the terms in H1 (z) become
X 1 1
− = 0 when n 6= 0
m∈Z
m − 1 + nz m + nz
X 1 1
and − = 2 when n = 0.
m6=0,1
m−1 m
581
32.4. q-Expansions Chapter 32. Modular Forms
(Again, the inner sums are over all n ∈ Z such that (m, n) 6= (0, 0), (1, 0).) Notice that
the right side of the expression can be written as both G1 − H1 and G − H, using absolute
convergence. This shows that G1 − H1 = G − H, so by the work above on H and H1 , we
have G1 − G = H1 − H = 2πi z
. Therefore
XX
1 1 2 2
G1 − = = z G(z) = z G1 (z) − 2πiz.
z n 2
n m m− z
Comparing this to the formula for G1 (z) in Example 32.4.6, we see that
f 0 (z) 6i
= G1 (z) dz.
f (z) π
f 0 − z1
f 0 (z)
6i 1 dz 6i 2 dz dz
1
= G 1 − 2
= (z G1 (z) − 2πiz) 2
= + 12 .
f −z π z z π z f (z) z
Thus ln f − z1
and ln(z 12 f (z)) differ by a constant, which in turn implies
1
f − = Cz 12 f (z)
z
for some constant C. But evaluating at z = i shows that C = 1, so f (z) is weakly modular
and hence a cusp form of weight 12.
582
32.4. q-Expansions Chapter 32. Modular Forms
For each n ∈ N, let τ (n) be the nth coefficient in the q-expansion of the function f (z)
from above: ∞ ∞
Y X
n 24
f (z) = q (1 − q ) = τ (n)q n .
n=1 n=1
Example 32.4.12. As mentioned in Example 32.2.4, the first few values of τ (n) are τ (1) =
1, τ (2) = −24, τ (3) = 252, etc. Therefore, the q-expansion of f (z) is
(a) τ (n) = O(n6 ) from Hecke’s theorem (32.4.9), but Deligne’s estimate gives τ (n) =
O(n11/2+ε ) for any ε > 0.
(c) For all primes p and k ≥ 1, τ (pn+1 ) = τ (p)τ (pn ) − p11 τ (pn−1 ).
Properties (b) and (c) allow one to associate an L-function to τ that has an Euler product:
∞
X τ (n) Y 1
Lτ (s) = = .
n=1
ns p prime
1− τ (p)p−s + p11−2s
In particular, since τ is multiplicative, Lτ (s) is a Dirichlet L-series. Hecke showed that Lτ (s)
extends to an entire function on C and there is a functional equation
Example 32.4.13. Ramanujan’s function has many interesting arithmetic properties other
than multiplicativity, such as:
Amazingly however, the following conjecture is still open, although is has been shown nu-
merically for n ≤ 1015 .
583
Chapter 33
Hecke Operators
The Hecke operators are a set of powerful algebraic tools that encode the number theoretic
properties of the coefficients in a q-expansion of a modular form. For example, they give
a proof that Ramanujan’s function τ (n) satisfies the multiplicativity conditions seen at the
end of Section 32.4:
584
33.1. Hecke Operators on Lattices Chapter 33. Hecke Operators
Definition. A correspondence
P on E is an abelian group homomorphism T : XE → XE .
This can be written T (x) = y∈E ny (x)y for ny (x) ∈ Z such that all but finitely many ny (x)
nonzero.
T F : XE −→ C
X
x 7−→ (T F )(x) := (F ◦ T )(x) = ny (x)F (y).
y∈E
Let R be the set of lattices in C and for any fixed Λ ∈ R, let RΛ be the set of all lattices
contained in Λ.
Tn : XR −→ XR
X
Λ 7−→ Tn Λ := Λ0
Λ0 ∈RΛ
[Λ:Λ0 ]=n
Remark. Notice that any sublattice Λ0 ⊂ Λ of index n must contain nΛ, and since Λ/nΛ ∼ =
(Z/nZ)2 , the number of such Λ0 is equal to the number of subgroups of (Z/nZ)2 of order n.
In particular, when n = p is prime, (Z/pZ)2 has exactly p + 1 subgroups of order p.
(a) Rλ Rµ = Rλµ = Rµ Rλ .
(b) Rλ Tn = Tn Rλ .
585
33.1. Hecke Operators on Lattices Chapter 33. Hecke Operators
Proof. (a) and (b) are immediate from the definitions of the Hecke and homothety operators.
(c) Fix Λ ∈ R and suppose Λ00 is a sublattice of Λ of index mn. If (m, n) = 1, then
the canonical isomorphism Z/mnZ ∼ = Z/nZ ⊕ Z/mZ implies there is a unique sublattice Λ0
with Λ00 ⊂ Λ0 ⊂ Λ and such that [Λ : Λ0 ] = n and [Λ0 : Λ00 ] = m. By definition this means
Tn Tm = Tnm .
(d) Note that for any Λ ∈ R, Tpn Tp Λ, Tpn+1 Λ and Tpn−1 Rp Λ are all linear combinations of
sublattices of index pn+1 in Λ. Let Γ be such a lattice occuring with coefficient a in Tpn Tp ,
coefficient b in Tpn+1 Λ and coefficient c in Tpn−1 Rp Λ. Our goal is then to show that a = b+pc.
Note that by the remark above, b = 1 is automatic. First suppose that Γ 6⊂ pΛ. Then
X
Tpn−1 Rp Λ = Tpn−1 pΛ = Λ00
[Λ00 :pΛ]=pn−1
Corollary 33.1.3. The algebra generated by the Rλ and Tp for λ ∈ C× and p prime is
commutative and contains Tn for every n ≥ 1.
586
33.2. Hecke Operators on Modular Functions Chapter 33. Hecke Operators
587
33.2. Hecke Operators on Modular Functions Chapter 33. Hecke Operators
Next, we pass from lattice functions to modular functions. Let f (z) be a weakly modular
function on h of weight 2k. By Lemma 32.2.2, f corresponds to a lattice function F : R → C
of weight 2k satisfying
−2k ω1
F (ω1 , ω2 ) = ω2 f
ω2
for all ω1 , ω2 ∈ h.
Definition. For n ≥ 1, the Hecke transform of a weakly modular function f (z) of weight
2k is the function
(Tn f )(z) = n2k−1 (Tn F )(z, 1)
where F is the lattice function associated to f .
Lemma 33.2.4. For all n ≥ 1 and weakly modular functions f (z) of weight 2k,
X
(Tn f )(z) = n2k−1 f (σz).
σ∈Sn
Proposition 33.2.5. Let f (z) be a weakly modular function of weight 2k and let m, n ≥ 1.
Then
(b) Tm Tn f = Tn Tm f if (m, n) = 1.
X
γ(m) = a2k−1 cmn/a2 .
a|(m,n)
588
33.2. Hecke Operators on Modular Functions Chapter 33. Hecke Operators
(d) By definition,
2k−1
X
2k−1
X
−2k az + b a b
(Tn f )(z) = n f (σz) = n d f where σ = ∈ Sn
d 0 d
σ∈Sn
X X
= n2k−1 d−2k cm e2πim(az+b)/d
σ∈Sn m∈Z
X 0 m
= n2k−1 d−2k · dcm0 q am where m0 =
σ,m0
d
X X n 2k−1 0
= cm0 d/a q m
σ,m0
d
a|(n,m0 )
0
X
= γ(m0 )q m .
m0 ∈Z
(b) γ(1) = cn .
589
33.3. Eigenfunctions Chapter 33. Hecke Operators
33.3 Eigenfunctions
Let f (z) = ∞ m
P
m=0 cm q be a modular form of weight 2k. By Proposition 33.2.5(e), each
Hecke operator Tn is an operator on the spaces Mk and Sk of modular forms and cusp
forms. In this section, we study functions which are eigenvectors simultaneously for all Tn .
(a) c1 6= 0.
Corollary 33.3.2. Two modular forms of weight 2k which are eigenfunctions for all n ≥ 1
and have the same eigenvalues λ(n) are equal.
It follows from Theorem 32.4.9 and Corollary 32.4.10 that L(f, s) converges for Re(s) >
2k. In fact, Deligne’s improved bounds on the coefficients of the Fourier expansion of f imply
that when f is a cusp form, L(f, s) converges for Re(s) > k + 21 . When f is an eigenform,
L(f, s) has an Euler product, similar to other L-functions we have encountered.
590
33.3. Eigenfunctions Chapter 33. Hecke Operators
Proof. We first prove this for n = p prime. Let Gk (Λ) denote the Eisenstein series as a
lattice function (see Example 32.2.3). Then
X X 1
(Tp Gk )(Λ) = .
0 0
γ 2k
[Λ:Λ ]=p γ∈Λ r{0}
591
33.3. Eigenfunctions Chapter 33. Hecke Operators
Since σ2k−1 (p) = 1 + p2k−1 , we are finished with the proof for Tp . But by Corollary 33.1.3,
this is enough to show Gk (z) is an eigenform for all Tn , n ≥ 1. Moreover, our proof shows
that λ(p) = σ2k−1 (p). The relation σ2k−1 (pn )σ2k−1 (p) = σ2k−1 (pn+1 ) + pσ2k−1 (pn−1 ) is easy
to verify, and implies λ(pn ) = σ2k−1 (pn ) for all n ≥ 2. Finally, since Tn and Tm commute
when (m, n) = 1, we conclude that Gk (z) is an eigenform for all n ≥ 1 with eigenvalues as
claimed.
To describe the normalized eigenform, recall that by definition,
∞
4k X
k
Ek (z) = 1 + (−1) σ2k−1 (n)q n .
Bk n=1
Then the linear term of (−1)k B4kk Ek (z) has coefficient 1, so it is normalized. By the work
above, it is also an eigenform.
Corollary 33.3.6. The Dirichlet series attached to the normalized eigenform F (z) = (−1)k B4kk Ek (z)
is
L(F, s) = ζ(s)ζ(s − 2k + 1).
Proof. By Proposition 33.3.5, the Dirichlet series attached to F is
∞ X a2k−1 X ∞ ∞
X σ2k−1 (n) 1 X 1
L(F, s) = = =
n=1
ns a,d≥1
s
ad s
d=1
s
d a=1 a s−2k+1
Proof. The space S6 of cusp forms of weight 12 has dimension 1 by Corollary 32.3.4 and is
stable under each Tn by Proposition 33.2.5(e), so ∆(z) is indeed an eigenform. The other
statements follow immediately.
592
33.3. Eigenfunctions Chapter 33. Hecke Operators
We can now deduce the arithmetic properties of Ramanujan’s τ -function at the end of
Section 32.4.
Remark. There are similar results for the spaces Sk of dimension 1. By Corollary 32.3.4,
this happens when k = 6, 8, 9, 10, 11, 13 and the bases of these spaces are, respectively,
∆, ∆G2 , ∆G3 , ∆G4 , ∆G5 , ∆G7 .
593
33.4. Petersson Inner Product Chapter 33. Hecke Operators
Lemma 33.4.1. If f and g are two cusp forms of weight 2k, then
dy
µ(f, g) := f (z)g(z)y 2k dx ,
y2
where z = x + iy, is a Γ-invariant measure on h which is bounded on h/Γ.
Proof. It is clear
that µ is a measure.
To see that it is Γ-invariant, it is enough to check
0 −1 1 1
this for S = and T = by Theorem 32.1.1; these calculations are routine.
1 0 0 1
Finally, boundedness follows from the fact that f and g are cusp forms, so they decay rapidly
as iy → ∞.
Lemma 33.4.2. For all k ≥ 1, h·, ·i is a positive, nondegenerate, Hermitian inner product
on Sk .
Proof. Straightforward.
Proof. First note that both sides of the equation are well-defined since Tn acts on the space
of cusp forms Sk for each k ≥ 1. By Corollary 33.1.3, it suffices to prove the statement for
n = p prime. In this case we have
X
hTp f, gi = p2k−1 hf (σz), gi by Lemma 33.2.4
σ∈Sp
XZ
2k−1
=p f (σz)g(z)y 2k−2 dx dy
σ∈Sp D
XZ
= p2k−1 f (z)g(σ −1 z)y 2k−2 dx dy using modularity and z 7→ σ −1 z
σ∈Sp σ −1 D
X
= p2k−1 hf, g(σ −1 z)i
σ∈Sp
X
= p2k−1 hf, g(σz)i = hf, Tp gi.
σ∈Sp
594
33.4. Petersson Inner Product Chapter 33. Hecke Operators
Corollary 33.4.4. For each k ≥ 1, there exists a basis for Sk consisting of eigenforms which
are orthogonal with respect to the Petersson inner product and have eigenvalues which are
real numbers.
Proof. Since the Petersson inner product is Hermitian, hTn f, gi = hf, Tn gi implies each
Tn is self-adjoint. Moreover, the Tn commute by Corollary 33.1.3. Thus spectral theory,
in particular Proposition 29.4.5(b), shows that the Tn can be simultaneously diagonalized,
giving an orthogonal basis of eigenforms for Sk with real eigenvalues.
For a cusp form f (z) = ∞ m
P
m=1 cm q of weight 2k which is a normalized eigenform (i.e.
c1 = 1), define
Φf,p (T ) = 1 − cp T + p2k−1 T 2
for each prime p. This is a quadratic in T which factors as
for αp , αp0 ∈ C satisfying αp +αp0 = cp and αp αp0 = p2k−1 . The following result was a conjecture
of Ramanujan and Petersson until 1973, when it was proven by Deligne using his proof of
part of the Weil Conjectures.
(That the statements in the conjecture are all equivalent is easy to prove. Deligne proved
the deep fact that |αp | = |αp0 | = pk−1/2 using the Riemann hypothesis for curves over a finite
field; see Theorem 24.3.2.)
595
33.5. Theta Series Chapter 33. Hecke Operators
(Note that fˆ is also a Schwartz function on V 0 .) We have the following Poisson summation
formula (see Proposition 12.1.5 and Theorem 31.3.4) over V .
Proposition 33.5.1. For a lattice Λ ⊂ V , set v = µ(V /Λ). Then for any Schwartz function
f : V → C,
X 1X ˆ
f (x) = f (y).
x∈Λ
v y∈Λ0
Proof. After normalizing µ so that v = 1, fˆ becomes v1 fˆ, so proving the v = 1 case proves
the general case. Choose a Z-basis {e1 , . . . , en } for Λ, so that the isomorphism V ∼ = Rn
induces Λ ∼
= Zn . Then µ on V corresponds to dx1 · · · dxn on Rn , so pulling back the classical
Poisson summation formula (Proposition 12.1.5) on Zn ⊂ Rn to Λ ⊂ V gives the result.
Let {e1 , . . . , en } be an orthonormal basis for V with respect to the inner product h·, ·i
and let Φ be the fundamental parallelopiped spanned by the ei . We may normalize µ so that
µ(Φ) = 1.
596
33.5. Theta Series Chapter 33. Hecke Operators
Then by Proposition 12.1.3, f (x) = fˆ(x). Applying Proposition 33.5.1 to the lattice Λ
e =
e 0 = t−1/2 Λ0 , we get
t1/2 Λ, which has dual Λ
X X
ΘΛ (t) = e−πthx,xi = f (x)
x∈Λ x∈Λ
e
1 X
= f (y)
µ(V /Λ)
e
e0
y∈Λ
1 X 1
= n/2 e−πhx,xi/t = n/2 ΘΛ0 (t−1 ).
vt y∈Λ0 vt
Fix a Z-basis {e1 , . . . , en } of Λ and let A = (aij ) be the corresponding positive, symmetric
matrix defined by aij = hei , ej i. Then for an orthonormal basis {ε1 , . . . , εn i of V , let Q be
the change-of-basis matrix from {ei } to {εi }, so that A = Qt Q. Let Φ (resp. Φ0 ) be the
fundamental parallelopiped spanned by the ei (resp. the εi ). Then we have
Z Z
v = µ(V /Λ) = dµ = | det(Q)| dµ = | det(Q)| = | det(A)|1/2 .
Φ Φ0
Pn
If B = (bij ) = A−1 , then e0i = j=1 bij ej defines the dual basis {e01 , . . . , e0n } to {e1 , . . . , en }
with respect to h·, ·i. Thus by the same argument as above,
1
v 0 := µ(V /Λ0 ) = | det(B)|1/2 = | det(A)|−1/2 = .
v
Thus vv 0 = 1. The lattices satisfying v = v 0 = 1 are given a special name.
Definition. A lattice Λ ⊂ V is called unimodular if Λ = Λ0 , or equivalently, if det(A) = 1
for any positive, symmetric matrix A representing a basis for Λ. Further, Λ is even if
hx, xi = 0 mod 2 for all x ∈ Λ.
Definition. For an even, unimodular lattice Λ ⊂ V and each integer m ≥ 0, define rΛ (m) =
#{x ∈ Λ | hx, xi = 2m}. The theta function (or theta series) of Λ is the function
θΛ : h → C defined by
∞
X
θΛ (z) = rΛ (m)q m where q = e2πiz .
m=0
Proof. (a) It is easy to show that rΛ (m) = O(mn/2 ) if Λ has rank n. Thus the q-expansion
of θΛ (z) converges absolutely on h.
(b) Both sides of the expression are analytic, so it suffices to test equality on a subset of
h containing an accumulation point. For example, on the set z = it, t ∈ (0, ∞), the formula
597
33.5. Theta Series Chapter 33. Hecke Operators
to prove is θΛ (it) = t−n/2 θΛ − it1 . Notice that by definition of the lattice theta series ΘΛ ,
we have
∞
X ∞
X X X
θΛ (it) = rΛ (m)q m = e−2πthx,xi/2 = e−πthx,xi
m=0 m=0 x∈Λ x∈Λ
hx,xi=2m
Thus (ST ) · ω = −ω, which implies (ST )3 · ω = −ω, but this contradicts (ST )3 = 1 from
Theorem 32.1.1. Thus n ≡ 0 mod 8.
(b) now follows from (a) and Lemmas 32.2.1 and 33.5.3.
Corollary 33.5.5. For each even, unimodular lattice Λ of rank n, there exists a cusp form
fΛ (z) of weight n2 such that θΛ (z) = En/4 (z) + fΛ (z).
Proof. Both θΛ (z) and En/4 (z) have constant term 1, so their difference is a cusp form of
weight n2 .
(−1)n/4
Corollary 33.5.6. For all m ∈ N, rΛ (m) = σn/2−1 (m) + O(mn/4 ).
Bn/4
Proof. This follows from applying Theorem 32.4.9 to the cusp form fΛ (z).
Remark. As the last corollary shows, we can view the cusp form fΛ (z) like an “error term”
for the theta series θΛ (z). This cusp form is usually nonzero; however, Siegel proved that
the weighted mean of all the fΛ (z) is 0. Explicitly, for each n ≡ 0 mod 8 let Cn be the set
of isomorphism classes of rank n unimodular lattices and for each Λ ∈ Cn , let gΛ be the size
of the isomorphism class of Λ, which is always finite. Then Siegel showed that
X 1
fΛ (z) = 0.
Λ∈C
gΛ
n
598
33.5. Theta Series Chapter 33. Hecke Operators
1
P
Setting mn = Λ∈Cn gΛ , this says that
X 1
θΛ (z) = mn En/4 (z).
Λ∈C
gΛ
n
By Proposition 33.3.5, Ek (z) is an eigenform for the Hecke operators with eigenvalues
σ2k−1 (n), so this weighted mean of the θΛ (z) is also an eigenform with eigenvalues mn .
Example 33.5.7. Let n = 8. Then by Corollary 32.3.4, there are no cusp forms of weight
n
2
= 4, so there is a single rank 8 unimodular lattice Λ8 ∈ C8 for which
Using Example 32.4.7, we obtain rΛ8 (m) = 240σ3 (m) for all m ≥ 1.
Example 33.5.8. Similarly, when n = 16, any Λ ∈ C16 has theta series
θΛ (z) = E4 (z).
This shows θΛ8 ⊕Λ8 (z) = (θΛ8 (z))2 , so we recover the formula
∞
!2 ∞
X X
m
1 + 240 σ3 (m)q = 1 + 480 σ7 (m)q m
m=1 m=1
from Example 32.4.7. There is another rank 16 lattice Λ16 which is not isomorphic to E8 ⊕E8 ,
but by the above it has the same theta series.
Example 33.5.9. When n = 24, things get interesting since by Corollary 32.3.4, S12 6= 0.
Explicitly, M12 can be generated by E6 (z) and F (z) = (2π)−12 ∆(z) (this is the normalization
of the modular discriminant ∆(z) by Proposition 33.3.7). If Λ is a unimodular lattice of rank
24, then by Corollary 33.5.5, its theta series can be written
for some cΛ ∈ C. In fact, since the coefficients of the q-expansions of θΛ , E6 and F are all
rational, cΛ ∈ Q. Comparing these coefficients, we get the following identity for all m ≥ 1:
65520
rΛ (m) = σ11 (m) + cΛ τ (m).
691
65520
In particular, since τ (1) = 1, cΛ = rΛ (1) − 691
. It turns out that there are 24 different
unimodular lattices of rank 24, including:
599
33.5. Theta Series Chapter 33. Hecke Operators
600
Chapter 34
Level Structure
601
34.1. Congruence Subgroups Chapter 34. Level Structure
Definition. Fix an integer N ≥ 1. Then the level N modular group is the subgroup
Γ(N ) ≤ SL2 (Z) defined by
a b a b 1 0
Γ(N ) = SL2 (Z) : ≡ mod N .
c d c d 0 1
Definition. The set of cusps for a congruence subgroup Γ ≤ SL2 (Z) is the set of Γ-orbits
of P1 (Q) = Q ∪ ∞ in h∗ .
(2) f (z) is holomorphic at the cusps of Γ, i.e. for all γ ∈ Γ taking ∞ to a cusp
z0 = γ∞ ∈ h∗ , f (γz) is holomorphic at ∞.
A cusp form of weight 2k for Γ is a modular form which vanishes at every cusp z0 = γ∞
of Γ. Write Mk (Γ) and Sk (Γ) for the spaces of modular forms and cusp forms, respectively,
of weight 2k for Γ.
602
34.1. Congruence Subgroups Chapter 34. Level Structure
Example 34.1.3. The congruence subgroup Γ0 (2) has index [SL2 (Z) : Γ0 (2)] = 3 with coset
representatives
1 0 0 −1 0 −1
I= , A= , B= .
0 1 1 0 1 1
(Note that A = S, a generator of P SL2 (Z).) Thus a fundamental domain D(Γ0 (2)) may be
obtained as a union of translates of the fundamental domain D for P SL2 (Z):
ρ −ρ̄
i
AD
BD
Re(z)
−1 − 12 1
2 1
Im(z)
Example 34.1.4. More generally, a modular form f (z) of level N , i.e. a modular form for
the congruence subgroup Γ0 (N ), has q-expansion
∞
X
f (z) = an q n/N .
n=0
603
34.1. Congruence Subgroups Chapter 34. Level Structure
Definition. Let N ≥ 1, let Γ ≤ SL2 (Z) be a level N subgroup and let χ : (Z/N Z)× → C×
be a Dirichlet character mod N . A modular form of weight 2k for Γ with nebentypus χ is
a holomorphic function f : h → C such that
2k a b
f (γz) = (cz + d) χ(d)f (z) for all γ = ∈Γ
c d
and f (z) is holomorphic at the cusps of Γ. We write Mk (Γ, χ) and Sk (Γ, χ) for the spaces
of modular and cusp forms with nebentypus χ, and in the special case Γ = Γ0 (N ), we write
these as Mk (N, χ) and Sk (N, χ).
Remark. With level structure and nontrivial characters χ, we can have modular forms of
odd weight, i.e. holomorphic f (z) such that f (γz) = (cz + d)k χ(d)f (z).
604
34.2. Modular Curves Chapter 34. Level Structure
Theorem 34.2.1. h/Γ admits the structure of an open Riemann surface, that is, a sur-
face Y (Γ) of genus g with some number of punctures. Moreover, the action of Γ on the
extended half-plane h∗ = h ∪ P1 (Q) defines a compact Riemann surface X(Γ) = h∗ /Γ which
topologically is the compact surface of genus g underlying Y (Γ).
In other words, X(Γ) is the compactification of Y (Γ) obtained by filling in the cusps. For
the congruence subgroups Γ(N ), Γ0 (N ), Γ(1), we let the open and compact Riemann surfaces
Y (Γ) and X(Γ) be denoted Y (N ), Y0 (N ), Y1 (N ) and X(N ), X0 (N ), X1 (N ), respectively. For
Γ0 (N ), we have the following important interpretation.
Theorem 34.2.2. The complex points of Y0 (N ) are in bijection with the isomorphism classes
of pairs (E, C), where E is an elliptic curve and C ⊆ E(C) is a cyclic subgroup of order N .
Explicitly, [τ ] ∈ Y0 (N ) corresponds to (E, C) where
1
E = C/(Z + τ Z) and C= Z + τ Z /(Z + τ Z) .
N
Theorem 34.2.3. The complex points of Y1 (N ) are in bijection with the isomorphism classes
of pairs (E, P ), where E is an elliptic curve, P ∈ E(C) is a torsion point of order N and
(E, P ) ∼= (E 0 , P 0 ) if there exists an isomorphism E → E 0 mapping P 7→ P 0 . Explicitly,
[τ ] ∈ Y1 (N ) corresponds to (E, P ) where
1
E = C/(Z + τ Z) and P = Z + τ Z /(Z + τ Z).
N
Example 34.2.4. When N = 1, Γ(1) = Γ0 (1) = Γ1 (1) = SL2 (Z) and Y (1) is equal to the
j-line A1j ∼= C from Proposition 32.3.7. Then A1j is a moduli space for isomorphism classes
of all elliptic curves, with j ∈ A1j corresponding to the unique isomorphism class of elliptic
curves E with j-invariant j(E) = j (see Section 23.2).
Theorem 34.2.5 (Modularity). If E is a complex elliptic curve with j(E) ∈ Q, then there
exists a cover of compact Riemann surfaces X0 (N ) → E for some N ≥ 1.
For the remainder of the section, we focus on modular forms of level N , i.e. modular
forms for Γ0 (N ). The following result gives us two ways of constructing modular forms of
higher levels.
605
34.2. Modular Curves Chapter 34. Level Structure
Take f (z) ∈ Mk (N ). We can extend the action of SL2 (Z) on f (z) to an action of GL+
2 (Q)
– the positive determinant 2 × 2 invertible matrices over Q – by
k −2k a b
γ · f (z) = (det γ) (cz + d) f (γz) for γ = ∈ GL+2 (Q).
c d
Then f (az) is fixed under the action of α−1 Γ0 (N )α, so f (az) is weakly modular for Γ0 (aN ).
It is routine to check holomorphicity at the cusps, which gives f (az) ∈ Mk (aN ).
Example 34.2.7. For k = 1 and N = p prime, there are no oldforms of weight 2 and level
p. Thus M1 (p) consists entirely of newforms. To produce such a modular form, we look
for differential forms on the modular curve X0 (p) = h∗ /Γ0 (p). Note that the genus of this
curve is g(X0 (p)) = dim S1 (Γ0 (p)). It turns out that p = 11 is the smallest prime for which
g(X0 (p)) > 0, so it is the first prime for which we have cusp forms of weight 2 and level p.
Explicitly, X0 (11) is an elliptic curve which is the smooth projective completion of the affine
equation
y 2 + y = x3 − x2 − 10x − 20.
One can use this to show that Mk (11) = Sk (11) is the one-dimensional space spanned by
the cusp form
f (z) = q − 2q 2 − q 3 + 2q 4 + q 5 + 2q 6 + . . .
As mentioned above, this f is necessarily a newform of level 11.
606
34.3. Automorphic Forms Chapter 34. Level Structure
Given a modular form f ∈ Mk (N ) and an adèle g ∈ GL2 (AQ ), write g = γh∞ κ for
γ ∈ GL2 (Q), h∞ ∈ GL+ 2 (R) and κ ∈ K0 (N ). Then we define a function ϕf : GL2 (AQ ) → C
by
−k 2k a b
ϕf (g) = (det h∞ ) (ci + d) f (h∞ i) if h∞ = .
c d
Lemma 34.3.2. For each f (z) ∈ Mk (N ), ϕf is well-defined and independent of the decom-
position g = γh∞ κ.
607
Part VIII
Galois Cohomology
608