Primes II
Primes II
Adding for n = 1, 2, 3, . . . ,
n−s = 1 + 2−s + 2−2s + · · · 1 + 3−s + 3−2s + · · · 1 + 5−s + 5−2s + · · · · · · ,
X
each term on the left arising from just one product on the right. But for each
prime p, −1
1 + p−s + p−2s + · · · = 1 − p−s ,
and the result follows.
P −s
Euler’s Product Formula equates the Dirichlet series n on the left
with the infinite product on the right.
1–1
1.2. INFINITE PRODUCTS 1–2
To make the formula precise, we must develop the theory of infinite prod-
ucts, which we do in the next Section.
To understand the implications of the formula, we must develop the the-
ory of Dirichlet series, which we do in the next Chapter.
We say that the infinite product diverges if either the partial products do
not converge, or else they converge to 0 (as would be the case for example if
any factor were 0).
Q
Proposition 1.1. If cn is convergent then
cn → 1.
Proof I We have
Pn
cn = .
Pn−1
Since Pn → ` and Pn−1 → `, it follows that
`
cn → = 1.
`
J
It is usually more convenient to write the factors in the form
cn = 1 + an .
In these terms, the last Proposition states that
Y
(1 + an ) convergent =⇒ an → 0.
1.2. INFINITE PRODUCTS 1–3
θ
1
1.2.3 Convergence
Proposition 1.2. Suppose an 6= −1 for n ∈ N. Then
Y X
(1 + an ) converges ⇐⇒ Log(1 + an ) converges.
P
Proof I Suppose Log(1 + an ) converges to S. Let
X
Sn = Log(1 + am ).
m≤n
Then
eSn =
Y
(1 + am ).
m≤n
But
Sn → S =⇒ eSn → eS .
Thus (1 + an ) converges to es .
Q
Q
Conversely, suppose (1 + an ) converges. Let
Y
Pn = (1 + an ).
m≤N
Hence X
Log(Pn /PN ) = Log(1 + am ).
N <m≤m
Since
Pn → ` =⇒ Log(Pn /LN ) → Log(`/PN ),
P
we conclude that n>N (1 + an ) converges to Log(`/PN ); and in particular,
P
n≥0 Log(1 + an ) is convergent. J
|an | ≤ 1/2
|Log(1 + an )| ≤ 2|an |
for n ≥ N . Hence X
Log(1 + an ) converges.
J
1.3. PROOF OF THE PRODUCT FORMULA 1–6
1 −σ
= M − N −σ
σ
→ 0 as M, N → ∞.
Hence n−s is convergent, by Cauchy’s criterion.
P
where the second sum on the right extends over those n > N all of whose
prime factors are ≤ N .
As N → ∞, the right-hand side → n−s , since this sum is absolutely
P
n = pe11 · · · perr
with p1 , . . . , pr ≤ N .
P
Hence 1/n is convergent. But
1 Z n+1 dx
> .
n n x
Thus
N Z N +1
X
−1 dx
n ≥ = log(N + 1).
1 1 x
Since log N → ∞ as N → ∞ it follows that 1/n is divergent.
P
that if pn denotes the nth prime (so that p2 = 3, p5 = 11, etc) then
pn ∼ n log n.
To see that, note that π(pn ) = n (ie the number of primes ≤ pn is n).
Thus setting x = pn in the Prime Number Theorem,
pn
n∼ ,
log pn
ie
pn
→ 1.
n log pn
Taking logarithms,
log pn − log n − log log pn → 0;
hence
log n
→ 1,
log pn
ie
log pn ∼ log n.
We conclude that
pn ∼ n log pn ∼ n log n.
P P
Returning to Euler’s Theorem, we see that 1/p behaves like 1/n log n.
The latter diverges, but only just, as we see by comparison with
Z
dx
= log log x.
x log x
On the other hand,
X 1
p p log p
converges for any > 0, since
X 1
n n log1+ n
converges by comparison with
Z
dx 1
1+ = − log− x.
x log x
What is perhaps surprising is that it is so difficult to pass from Euler’s
Theorem to the Prime Number Theorem.
Chapter 2
Dirichlet series
2.1 Definition
Definition 2.1. A Dirichlet series is a series of the form
where ai ∈ C.
while
1−s = 1
for all s.
3. The term ‘Dirichlet series’ is often applied to the more general series
a0 λ−s −s −s
0 + a1 λ 1 + a2 λ 2 + · · · ,
2–1
2.2. CONVERGENCE 2–2
where
0 < λ0 < λ1 < λ2 < · · · ,
and
λ−s = e−s log λ
2.2 Convergence
Proposition 2.1. Suppose
Then
N
X N
X
an Bn = AN BN +1 − AM −1 BM − An bn+1 .
M M
2.2. CONVERGENCE 2–3
J
P P
Lemma 2. Suppose an converges and bn converges absolutely. Then
X
an Bn
converges.
Proof I By the previous Lemma,
N
X N
X
an Bn = AN BN +1 − AM −1 BM − An bn+1
M M
N
X
= AN (BN +1 − BM ) + (AN − AM −1 )BM − An bn+1 .
M
P P
Let an = A, bn = B. The partial sums of both series must be
bounded; say
|An | ≤ C, |Bn | ≤ D.
Then
N
X N
X
| an Bn | ≤ C|BN +1 − BM | + D|AN − AM −1 | + C |bn+1 |.
M M
As M, N → ∞,
N
X
BN +1 − BM → 0, AN − AM −1 → 0, |bn+1 | → 0.
M
Hence
N
X
an Bn → 0
M
as M, N → ∞; and therefore
P
an Bn converges, by Cauchy’s criterion. J
2.2. CONVERGENCE 2–4
Let s0 = s − s0 . Then <(s0 ) > 0. We apply the last Lemma with an n−s0
0
for an , and n−s for Bn . Thus
bn = Bn − Bn−1
0 0
= n−s − (n − 1)−s
Z n
0 dx
= −s0 x−s .
n−1 x
Hence
Z n
0 dx 0
|bn | ≤ |s | |x−s |
n−1 x
Z n
0 dx
= |s0 | x−σ ,
n−1 x
where σ 0 = <(s0 ).
Summing,
N Z N
0 dx
|bn | ≤ |s0 | x−σ
X
M M −1 x
|s0 | −σ 0 −σ 0
= (M − 1) − N .
σ0
It follows that
N
X
|bn | → 0 as M, N → ∞.
M
Thus |bn | is convergent, and so the conditions of the last Lemma are
P
is convergent. J
Corollary 2.1. A Dirichlet series either
1. converges for all s,
<(s) = σ0 ,
σ + iT X + iT
σ0 σ X
{S = x + it : σ ≤ x ≤ X; −T ≤ t ≤ T }
strictly to the right of <(s) = σ0 (Fig 2.1), since
N
X |s0 | −σ0
|bn | ≤ M
M σ0
|s − s0 | −(σ−σ0 )
≤ M
σ − σ0
in this region.
Thus f (s) is holomorphic in this rectangle. We conclude that f (s) is
holomorphic in the half-plane <(s) > σ0 . J
Proposition 2.3. If
<(s) ≥ <(s0 ).
Proof I This follows at once from the fact that each term
is a decreasing function of σ. J
<(s) = σ1 ,
and does not converge absolutely for any s to the left of this line.
σ0 ≤ σ1 ≤ σ0 + 1.
Proof I Suppose
<(s) > σ0 .
Then
f (s) = a1 1−s + a2 2−s + · · ·
is convergent. Hence
an n−s → 0 as n → ∞.
In particular, an n−s is bounded, say
|an n−s | ≤ C.
2.4. THE RIEMANN ZETA FUNCTION 2–7
But then
|an n−(s+1+) | ≤ Cn−(1+)
P −(1+)
for any < 0. Since n converges, it follows that
f (s + 1 + )
σ > σ0 =⇒ σ + 1 + ≥ σ1
σ0 + 1 ≥ σ1 .
J
Proposition 2.5. If an ≥ 0 then σ1 = σ0 .
Proof I This is immediate, since in this case
N (a)−s ,
X
ζk (s) =
a
2.4. THE RIEMANN ZETA FUNCTION 2–8
where a runs over the ideals in k (or rather, in the ring of integers
I(k) = k ∩ Z̄), and N (a) is the number of residue classes moda.
Since the unique factorisation theorem holds for ideals, the analogue of
Euler’s product formula holds:
Y −1
ζk (s) = 1 − N (p)−s ,
p
λ−s
X
ζE (s) = n ,
Let us recall how this is established, by comparing the sum with the
R −σ
integral x dx. If n − 1 ≤ x ≤ n,
Integrating, Z n
n−σ < x−σ dx < (n − 1)−σ ,
n−1
Summing from n = M + 1 to N ,
N Z N N +1
−σ
x−σ dx < n−σ ,
X X
n <
M +1 M M
2.4. THE RIEMANN ZETA FUNCTION 2–9
Proof I We have
1
ζ(s) = f (s)
1 − 21−s
for <(s) > 1. But the right-hand side is meromorphic in <(s) > 0, and so
defines an analytic continuation of ζ(s) (necessarily unique, by the theory of
analytic continuation) to this half-plane.
Since f (s) is holomorphic in this region, any pole of ζ(s) must be a pole
of 1/(1 − 21−s ), ie a zero of 1 − 21−s . But
Hence
21−s = 1 ⇐⇒ (1 − s) log 2 = 2nπi
for some n ∈ Z. Thus 1/(1 − 21−s ) has poles at
2nπ
s=1+ i (n ∈ Z).
log 2
At first sight this seems to give an infinity of poles of ζ(s) on the line
<(s) = 1. However, the following argument shows that f (s) must vanish at
all these points except s = 1, thus ‘cancelling out’ all the poles of 1/(1−21−s )
except that at s = 1.
Consider
Like f (s), this converges for all σ > 0. For if we group g(σ) in sets of three
terms
we see that each set is > 0. Thus the series either converges (to a limit > 0),
or else diverges to +∞.
On the other hand, we can equally well group g(σ) as
Now each group is < 0, if we omit the terms 1−σ + 2−σ . Thus g(σ) either
converges (to a limit < 1−σ + 2−σ ), or else diverges to −∞.
We conclude the g(σ) converges (to a limit between 0 and 1−σ + 2−σ ).
2.4. THE RIEMANN ZETA FUNCTION 2–11
ie
2πm
s=1+ i.
log 3
Thus ζ(s) can only have a pole where s is expressible in both forms
2πn 2πm
s=1+ i=1+ i (m, n ∈ Z).
log 2 log 3
But this implies that
m log 2 = n log 3,
ie
2m = 3n ,
which is of course impossible (by the Fundamental Theorem of Arithmetic)
unless m = n = 0.
We have therefore eliminated all the poles except s = 1. At s = 1,
1 1 1
f (1) = 1 − + − + · · · = log 2.
2 3 4
(This follows on letting x → 1 from below in log(1 + x) = x − x2 /2 + · · · .)
On the other hand, if we set s = 1 + s0 then
0
1 − 21−s = 1 − e−s log 2
2
= s0 log 2 − s0 /2! log2 2 + · · ·
= s0 log 2(1 − s0 /2 log 2 + · · ·
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–12
Thus
1 1
=
1−2 1−s 1 − 2−s0
1 s0
= (1 + log 2 + · · · )
log 2 s0 2
1
= + h(s),
log 2 s0
uous function f (x) on [a, b]. Note that f (x) is in fact uniformly continuous
on [a, b], ie given > 0 there exists a δ > 0 such that
We set
k∆k = max |xi+1 − xi |.
0≤i<n
Let X
S(f, ∆) = f (xi )(xi+1 − xi ).
0≤i<n
x ≤ y =⇒ f (x) ≤ f (y).
Then we set
X
SM (f, ∆) = f (xi )(M (xi+1 ) − M (xi )).
0≤i<n
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–14
Proof I The result follows in exactly the same way as for the Riemann in-
tegral above, with (1) replaced by
J
Definition 2.5. We call
I = lim SM (f, ∆)
k∆k→0
J
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–15
and so
A(f, ∆) = f (b) − f (a).
If f (x) is decreasing then −f (x) is increasing, so the result follows from the
last Proposition. J
Proposition 2.13. A function f (x) of class C 1 [a, b], ie with continuous
derivative f 0 (x) on [a, b], is of bounded variation.
Proof I Since f 0 (x) is continuous, it is bounded: say
|f 0 (x)| ≤ C.
and so
A(f, ∆) ≤ C(b − a).
J
Proposition 2.14. A real-valued function f (x) is of bounded variation on
[a, b] if and only if it can be expressed as the difference of two increasing
functions:
f (x) = M (x) − N (x),
where M (x), N (x) are monotone increasing.
Proof I If f (x) is expressible in this form then it is of bounded variation,
by Propositions 2.12 and 2.11.
For the converse, let
X
P (f, ∆) = (f (xi ) − f (xi−1 )),
i:f (xi )≥f (xi−1 )
X
N (f, ∆) = − (f (xi ) − f (xi−1 )).
i:f (xi )<f (xi−1 )
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–16
for each dissection ∆ of [a, b]. Then P (f, ∆), N (f, ∆) ≥ 0; and
It follows that
0 ≤ P (f, ∆), N (f, ∆) ≤ A(f, ∆).
Hence
P (f ) = sup P (f, ∆), N (f ) = sup N (f, ∆)
∆ ∆
are defined.
Lemma 3. We have
P (f ) − N (f ) = f (b) − f (a).
P (f ) ≥ P (f, ∆1 ) > P (f ) − ,
N (f ) ≥ N (f, ∆2 ) > N (f ) − .
But
P (f, ∆) − N (f, ∆) = f (b) − f (a).
It follows that
P (f ) − N (f ) − ≤ f (b) − f (a) ≤ P (f ) − N (f ) + .
P (f ) − N (f ) = f (b) − f (a).
J
Now suppose a ≤ x ≤ b. We apply the argument above to the interval
[a, x]. Let p(x), n(x) be the functions P (f ), N (f ) for the interval [a, x]. By
the last Lemma,
p(x) − n(x) = f (x) − f (a).
It is easy to see that p(x), n(x) are increasing functions of x. For suppose
a ≤ x < y ≤ b.
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–17
To each dissection
∆ : a = x0 < x 1 < · · · xn = x
of [a, x] we can associate the dissection
It follows that
p(y) ≥ p(x), n(y) ≥ n(x),
ie p(x), n(x) are monotone increasing. Since
Proof I It follows from Propositions 2.11 and 2.12 that a function of this
form is of bounded variation.
For the converse, note that if f (x) is complex-valued then it can be split
into its real and imaginary parts:
where fR (x), fI (x) are real-valued functions. It is easy to see that if f (x) is
of bounded variation then so are fR (x) and fI (x). Hence each is expressible
as a difference of increasing functions, say
But then
f (x) = MR (x) − NR (x) + iMI (x) − iNI (x),
which is of the required form. J
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–18
I = lim SU (f, ∆)
k∆k→0
SU (f, ∆) → I as k∆k → 0
Proposition 2.17. Suppose U (x) is of class C 1 [a, b], ie U (x) has continuous
derivative U 0 (x) on [a, b]; and suppose f (x)U 0 (x) is Riemann integrable on
[a, b]. Then f (x) is Riemann-Stieltjes integrable, and
Z b Z b
f (x)dU = f (x)U 0 (x)dx.
a a
where xi < ξi < xi+1 . Moreover, since U 0 (x) is continuous on [a, b], it is
absolutely continuous; so given any > 0 we can find δ > 0 such that
if xi+1 − xi < δ.
Hence
|SU (f, ∆) − S(f U 0 , ∆)| ≤ max|f |(b − a)
if k∆k < δ, from which the result follows. J
2.5.2 Discontinuities
Proposition 2.18. If f (x) is a function of bounded variation on [a, b] then
the left limit
f (x − 0) = lim f (t)
t→x−0
f (x + 0) = lim f (t)
t→x+0
f (x) = µ1 M1 (x)+· · ·+µr Mr (x) for all x =⇒ f (x−0) = µ1 M1 (x−0)+· · ·+µr Mr (x−0),
f (ξ − 0) = f (ξ) = f (ξ + 0).
j(ξ) = f (ξ + 0) − f (ξ − 0).
f (ξ − 0) ≤ f (ξ) ≤ f (ξ + 0).
Then X
j(ξi ) ≤ f (b) − f (a).
1≤i≤n
Corollary 2.4. Suppose M (x) is increasing on [a, b]; Then the number of
discontinuities with
j(x) = f (x + 0) − f (x − 0) ≥ 2−r
is
≤ 2r (f (b) − f (a)).
Using the Lemma, we can enumerate the discontinuities of M (x) by first
listing those with j(x) ≥ 1, then those with 1 > j(x) ≥ 2−1 , then those with
2−1 > j(x) ≥ 2−2 , and so on. In this way we enumerate all the discontinuities:
ξ0 , ξ1 , ξ2 , . . . .
J
Remarks. 1. Note that we are not claiming that the discontinuities can
be enumerated in increasing order, ie so that
That is not so, in general; f (x) could , for example, have a discontinuity
at every rational point.
f (x + 0) = f (x)
for all x ∈ [a, b), then the second contribution vanishes, and the dis-
continuity is completely determined by
To simplify the discussion, the functions we use have all been chosen
to be right-continuous; for example, we set
is absolutely convergent.
by Lemma 4. J
Proof I It is sufficient to prove the result in the case where U (x) is increas-
ing, by Proposition 2.15.
Let
f (x) = U (x) − J(x).
We have to show that f (x) is continuous.
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–23
≤ U (y) − U (x),
by Proposition 4. Thus
f (x) = U (x) − J(x) ≤ U (y) − J(y) = f (y),
ie f (x) is increasing.
Moreover,
0 ≤ f (y) − f (x) ≤ U (y) − U (x).
Hence
0 ≤ f (y) − f (y − 0) ≤ U (y) − U (y − 0).
In particular, if U (x) is left-continuous at y then so is f (x).
Now suppose U (x) has a discontinuity at y. If x < y then
J(y) − J(x) ≥ j(y) = U (y) − U (y − 0).
Hence
J(y) − J(y − 0) ≥ U (y) − U (y − 0),
ie
2. Our definition of J(x) entails that J(a) = 0. With that condition, the
splitting of U (x) is unique. If we drop the condition then J(x) and
f (x) are defined up to a constant.
xi < ξ ≤ xi+1 .
Then
Since
f (xi ) → f (ξ) as k∆k → 0,
the result follows. J
Rb
In practice we shall encounter the Riemann-Stieltjes integral a f (x)dU
in just two cases: the case above, where f (x) is continuous and U (x) is
purely discontinuous; and the case where U (x) ∈ C 1 [a, b], when (as we saw
in Proposition 2.17)
Z Z
f (x)dU = f (x)U 0 (x)dx.
R
Proof I We may suppose that U (x) R
is continuous. Then U (x)dV is cer-
tainly defined; we must show that V (x)dU is also defined.
Let
∆ : a = x0 < x 1 < · · · < x n = b
be a dissection of [a, b].
Our proof is based on the formula for ‘summation by parts’ (Lemma 1),
which we may re-write as
n−1
X n−1
X
Ai b i + ai+1 Bi = An Bn−1 − A1 B0 .
i=1 i=1
Ai = U (xi ), Bi = V (xi ).
This is almost SU (V, ∆). There is a discrepancy because we are taking the
value U (xi ) at the top of the interval [xi−1 , xi ] rather than at the bottom.
However, U (x) is continuous, and so uniformly continuous, on [a, b]. Thus
given > 0 we can find δ > 0 such that
A(V, ∆) ≤ C
an n−s .
X
f (s) =
if <(s0 ) > 0.
Let
an n−s .
X
V (x) =
n≤x
Then V (x) has discontinuities at each integer point x = n, with j(n) = n−s .
Thus by Proposition 2.22
N Z N
0) 0
a−(s+s x−s dV
X
n =
M +1 M
Z N
= U (x)dV,
M
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–27
where
0
U (x) = x−s .
Integrating by parts (by Proposition 2.23),
N Z N
0
a−(s+s )
= [U (x)V (x)]N
X
n M − V (x)dU
M +1 M
Z N
= [U (x)V (x)]N
M − V (x)U 0 (x)dx
M
Z N
0 dx
= [U (x)V (x)]N
M −s 0
x−s V (x) ,
M x
0
since U (x) has continuous derivative s0 x−(s +1) .
Since f (s) is convergent, V (x) is bounded, say
V (x) ≤ V.
Thus if σ 0 = <(s0 ),
N Z N
0)
−σ 0 −σ 0
0 dx
a−(s+s 0
x−σ
X
| n | ≤V M +N + |s |
M +1 M x
0
0 0
V |s | −σ0 −σ 0
= V M −σ + N −σ + M − N
σ0
|s0 | −σ0
!
0 0
≤V M −σ + N −σ + 0M
σ
→ 0 as M, N → ∞.
0 ≤ F (x) ≤ 1.
Now
Z ∞ Z ∞
dx
ζ(s) = s x−s dx − s x−s F (x)
"1 1 x
1−s ∞
# Z ∞
x dx
=s −s x−s F (x)
1−s 1 1 x
Z ∞
s dx
= −s x−s F (x) .
s−1 1 x
But the integral on the right converges if <(s) > 0, since
Z Y Z Y
−s dx dx
| x F (x) | ≤ x−σ
X x X x
1 −σ
= X − Y −σ
σ
→ 0 as X, Y → ∞.
Thus Z ∞
s dx
ζ(s) = + x−s F (x)
s−1 1 x
gives an analytic continuation of ζ(s) to <(s) > 0.
Moreover, since the integral is holomorphic in this region, we see that ζ(s)
has a single simple pole at s = 1 with residue 1 in the half-plane <(s) > 0.
2.5. THE RIEMANN-STIELTJES INTEGRAL 2–29
We can even extend ζ(s) further, to the half-plane <(s) > −1, if we take
a little care. (In Chapter 7 we shall show by an entirely different method
that ζ(s) can be continued analytically to the whole complex place C; so the
present exercise is just that — an exercise.)
Let
1
h(x) = F (x) − .
2
Then Z n+1
h(x)dx = 0.
n
Hence Z x
H(x) = h(t)dt
0
is bounded; in fact
1
|H(x)| ≤ .
4
Suppose <(s) > 0. Integrating by parts (in the usual sense),
Z ∞
dx 1 Z ∞ −s dx Z ∞ −s dx
F (x) = x + x h(x)
1 x 2 "1 # x 1 x
−s ∞ Z ∞
1 x h
−s−1
i ∞
= + x H(x) + x−s−2 H(x)dx
2 −s 1 1 1
Z ∞
1
= + x−s−2 H(x)dx.
2s 1
Thus Z ∞
s 1
ζ(s) = − −s x−s−2 H(x)dx.
s−1 2 1
But the integral on the right converges if <(s) > −1, since
Z Y
−s−2 1 Z Y −σ−2
| x H(x)dx| ≤ x dx
X 4 X
1
= X −(σ+1) − Y −(σ+1)
4(σ + 1)
→ 0 as X, Y → ∞.
Thus Z ∞
s 1
ζ(s) = − −s x−(s+2) H(x)dx
s−1 2 1
an n−s
X
an n−s
X
f (s) =
An = o(nσ )
σ > σ 0 > σ0 .
Then
0
f (σ 0 ) = an n−σ
X
0
is convergent. Hence an n−σ is bounded, say
0
|an n−σ | ≤ C.
Then
0
|an | ≤ Cnσ
2.6. THE RELATION BETWEEN AN AND σ0 2–31
ie
0
an = O(nσ ) = o(nσ ).
Conversely, suppose
An = O(nσ ),
say
|An | ≤ Cnσ ;
and suppose
σ 0 > σ.
Let X
A(x) = an .
n≤x
Then
|A(x)| = |A([x])| ≤ C[x]σ ≤ Cxσ .
Integrating by parts,
N Z N
−σ 0 0
x−σ dA
X
an n =
M +1 M
Z N
h 0
iN 0 dx
= x−σ A(x) + σ0 x−σ A(x) .
M M x
Hence
N Z N
−σ 0 σ−σ 0 σ−σ 0 0 dx
xσ−σ
X
| an n | ≤ C(M +N ) + Cσ
M +1 M x
σ
0
≤C 2+ N σ−σ .
σ0 −σ
Thus
N
0
an n−σ | → 0 as M, N → ∞.
X
|
M +1
is convergent; and so
σ 0 ≤ σ0 .
Since this holds for all σ 0 > σ,
σ ≤ σ0 .
J
2.7. DIRICHLET SERIES WITH POSITIVE TERMS 2–32
is σ0 = 0. (The terms in the series do not even → 0 for <(s) < 0.) But as we
shall see, this series extends analytically to an entire function, ie a function
holomorphic in the whole of C.
However, the following Proposition shows that if the coefficients an of the
Dirichlet series are positive then the converse does hold — f (s) cannot be
extended holomorphically across the line <(s) = σ0 .
Proposition 2.25. Suppose the Dirichlet series
an n−s
X
f (s) =
D0 = D(σ0 , δ) = {z ∈ C : |z − σ0 | < δ}
Let
δ
σ = σ0 + .
4
Then f (s) is holomorphic in
3δ 3δ
D1 = D(σ, ) = {z ∈ C : |z − σ| < } ⊂ D0 .
4 4
It follows by Taylor’s theorem that
1 00
f (s) = f (σ) + f 0 (σ)(s − σ) + f (σ)(s − σ)2
2!
for s ∈ D1 .
2.7. DIRICHLET SERIES WITH POSITIVE TERMS 2–33
D0
D1
D2
σ0 σ0 σ
Now
an n−s
X
f (s) =
near s = σ (since this point is in the half-plane of convergence). Moreover
this series converges uniformly and absolutely for s sufficiently close to σ,
say inside
δ
D2 = D(σ, .
8
It follows that we can differentiate term-by-term, as often as we like:
etc. In particular
f (k) (σ) = (−1)k an logk nn−σ ,
X
δ
σ0 = σ − .
4
2.7. DIRICHLET SERIES WITH POSITIVE TERMS 2–34
We have
1 (k)
f (σ 0 ) = f (σ)(σ 0 − σ)k
X
k k!
!k
1 δ
(−1) f (k) (σ)
k
X
= .
k k! 4
Since all the terms on the right are positive (the two factors (−1)k cancelling
out), the double series is absolutely convergent, and we can invert the order
of the summations:
!k
0 −σ 1 δ
logk n
X X
f (σ ) = an n
n k k! 4
The series on the right may seem complicated, but common-sense tells
us what the sum must be. We could have carried out the whole operation
entirely within the half-plane of convergence, in which case we know that
0
f (σ 0 ) = an n−σ .
X
and so
0
f (σ 0 ) = an n−σ nσ−σ
X
n
0
an n−σ .
X
=
n
Li(x) → ∞ as x → ∞.
3–1
3.1. STATEMENT OF THE THEOREM 3–2
It follows that
x x 1 x 1 x x
Li(x) = + 2 + 3 + ··· + n + O( n+1 ).
log x log x 2! log x (n − 1)! log x log x
Corollary 3.1. The Prime Number Theorem can be stated in the form:
π(x) ∼ Li(x).
3.2. PREVIEW OF THE PROOF 3–3
Remark. This is actually a more accurate form of the Prime Number Theo-
rem, in the following sense. It has been shown that
π(x) − Li(x)
changes sign infinitely often, ie however large x gets we find that sometimes
π(x) ≥ Li(x), and sometimes π(x) < Li(x).
On the other hand, it follows from the Remark above that Li(x) is sub-
stantially larger than x/ log x; and it has also been shown that π(x) > x/ log x
for all sufficiently large x.
an n−s ,
X
=
n
where
log p if n = pr
an =
0 otherwise.
3. The function ζ 0 (s)/ζ(s) has poles wherever ζ(s) has a pole or zero. It
follows from Euler’s Product Formula that ζ(s) has no zeros in <(s) >
1. Accordingly ζ 0 (s)/ζ(s) has a pole at s = 1 (with residue 1) and no
poles in <(s) > 1.
where
p−rs .
X X
h(s) = log p
r>1
The function h(s) converges for <(s) > 1/2, as can be seen by com-
parison with ζ(2s). Its partial sums are therefore of order o(n1/2+ ), by
Proposition 2.24. Consequently the contribution of h(s) can be ignored
in our argument.
5. We are left with the function
log p p−s
X
Θ(s) =
p
Z ∞
= x−s dθ,
0
where X
θ(x) = log p.
p≤x
ie
θ(x) = x + o(x).
where
log p − 1 if n = p,
an =
−1 otherwise.
Then Z ∞
Ψ(s) = x−s dψ,
0
where
ψ(x) = θ(x) − [x] = θ(x) − x + O(1).
The Prime Number Theorem, as we have seen, is equivalent to the
statement that
ψ(x) = o(x).
ψ(x) = o(x1/2+ )
for any > 0, which is more than enough to prove the Theorem.
In fact, Riemann showed that with a little more care one can deduce
from his hypothesis that
ie
ψ(x) = o(xσ+ )
ζ(1 + it) 6= 0
for t ∈ R \ {0}.
This proof of this result is, in a sense, the heart of the proof of the
Prime Number Theorem. The argument we use is rather strange; we
show that if ζ(s) had a zero at s = 1 + it then it would have a pole at
s = 1 + 2it, which we know is not the case.
10. This takes us a tiny step forward; it shows that Ψ(s) is holomorphic in
<(s) ≥ 1.
Proposition 2.24 only tells us that in this case
ψ(x) = o(x1+ )
satisfies some natural auxiliary conditions) then its partial sums satisfy
A(x) = o(xσ0 ).
ψ(x) = o(x),
and the result follows on dividing by f (x). This shows that the result holds
without assuming that ui (x) > 0. Indeed, by this argument the result holds
for complex-valued functions: if
Y
f (z) = u(z),
1≤i≤n
is holomorphic on U ; and
f 0 (z) X a0n (z)
=
f (z) n 1 + an (z)
on U .
3.4. FROM π(X) TO θ(X) 3–8
Pn (z) → f (z).
It follows that
Pn0 (z) → f 0 (z).
Hence
Pn0 (z) f 0 (z)
→ .
Pn (z) f (z)
But
Pn0 (z) X a0m (z)
= .
Pn (z) m≤n 1 + am (z)
We conclude that
X a0m (z) f 0 (z)
= .
n∈N 1 + am (z) f (z)
J
Thus
0 for x < 2
log 2for 2 ≤ x < 3
θ(x) =
log 6 for 3 ≤ x < 5
...
Proof I Suppose
x
π(x) ∼ Li(x) ∼ .
log x
3.4. FROM π(X) TO θ(X) 3–9
Then
X
θ(X) = log p
p≤x
Z X
= log 2 + log x dπ
e
Z X
1
= log 2 + [log x π(x)]X
e −
π(x)dx
e x
Z X
π(x)
= log 2 + π(X) log X − 1 − dx.
e x
Since π(x) ∼ x/ log x,
x
π(x) ≤ C
log x
for some constant C; and so
Z X Z X
π(x) dx
0≤ dx ≤ C = C Li(x) = o(x),
e x e log x
Now
θ(x) ∼ x =⇒ θ(x) ≤ Cx
for some C; and so
Z X
θ(x)
0≤ 2 dx
e x log x
Z X
dx
≤C 2 .
e log x
3.5. THE ZEROS OF ζ(S) 3–10
Hence Z X
θ(x) θ(x) dx
≥ π(x) ≥ +C 2 .
log x log x e log x
But
θ(x) x
θ(x) ∼ x =⇒ ∼ ∼ Li(x),
log x log x
while Z X
dx
2 = o(Li(x)),
e log x
θ(x) ∼ x.
For the right-hand side converges absolutely for <(s) > 1; and by the defini-
tion of convergence its value is 6= 0. J
We want to show that ζ(s) has no zeros on the line <(s) = 1. This
is equivalent to showing that ζ 0 (s)/ζ(s) has no poles on this line except at
s = 1.
Proposition 3.5. For <(s) > 1,
ζ 0 (z)
an n−s ,
X
=−
ζ(s)
where
log p if n = pr
an =
0 otherwise.
3.5. THE ZEROS OF ζ(S) 3–11
in <(s) > 1. Since the coefficients are all positive, the convergence is absolute.
But the series for Θ(s) consists of some of the terms of ζ 0 (s), and so also
converges absolutely in <(s) > 1. J
Proposition 3.7. For <(s) > 1,
ζ 0 (z)
= −Θ(s) + h(s),
ζ(s)
where h(s) is holomorphic in <(s) > 1/2.
Proof I We have
p−rs .
X X
h(s) = − log p
p r≥2
−s −σ
If σ = <(s) then |p | = p . Thus
|p−rs | = p−rσ
X X X X
log p log p
p r≥2 p r≥2
−2σ
X p
= log p
p 1 − p−σ
1
log p p−2σ
X
≤ −σ
1−2
1
= Θ(2σ),
1 − 2−σ
which converges for 2σ > 1, ie σ > 1/2, by Proposition 3.6. J
3.5. THE ZEROS OF ζ(S) 3–12
1 + 2it + σ
Proposition 3.8. The Riemann zeta function ζ(s) has no zeros on the line
<(s) = 1:
ζ(1 + it) 6= 0 (t ∈ R \ {0}).
Proof I We shall show (in effect) that if ζ(s) has a zero at s = 1 + it then
it must have a pole at s = 1 + 2it; but we know that is impossible, since the
only pole of ζ(s) in <(s) > 0 is at s = 1.
We work with Θ(s) rather than ζ(s). If ζ(s) has a zero of multiplicity m
at 1 + it then ζ 0 (s)/ζ(s) has a simple pole with residue m, and so Θ(s) has
a simple pole with residue −m. Similarly, where ζ(s) has a pole of order M ,
Θ(s) has a simple pole with residue M .
We are going to compare
log p p−(1+σ) ,
X
Θ(1 + σ) =
log p p−(1+σ) p−it ,
X
Θ(1 + it + σ) =
log p p−(1+σ) p−2it .
X
Θ(1 + 2it + σ) =
3.5. THE ZEROS OF ζ(S) 3–13
Note that
cos 2θ + 4 cos θ + 3 ≥ 0.
Proof I For τ ∈ R,
eiτ + e−iτ = 2 cos(τ ) ∈ R.
Raising this to the fourth power,
4
eiτ + e−iτ = e4iτ + e−4iτ + 4(e2iτ ) + e−2iτ ) + 6 ≥ 0,
ie
cos 4τ + cos 2τ + 3 ≥ 0.
Proof I We have
< p−2it + 4p−it + 3 = cos(2t log p) + 4 cos(t log p) + 3 ≥ 0,
−1 ≤ cos(t log p) ≤ 1,
we see that in order to reach −1/σ (let alone −m/σ), cos(t log p) must be
close to −1 for almost all p.
But
cos τ = −1 =⇒ cos 2τ = +1.
3.6. THE TAUBERIAN THEOREM 3–15
Thus follows that cos(2t log p) is close to 1 for almost all p; and that in turn
implies that
is close to 1/σ, which means that Θ(s) must have a pole with residue 1 (ie
ζ(s) must have a simple pole) at s = 1 + 2it, which we know is not the case.
1. bounded; and
Then Z ∞
F (s) = e−xs f (x)dx
0
Proof I Suppose
|f (x)| ≤ C.
For each X > 0, Z X
FX (s) = e−xs f (x)dx.
0
is an entire function, ie holomorphic in the whole of the complex plane C.
Suppose σ = <(s) > 0. If X < Y then
Z Y
FY (s) − FX (s) = e−xs f (x)dx.
X
Thus
Z Y
|FY (s) − FX (s)| ≤ C e−xσ dx
X
C −Xσ
= (e − e−Y σ).
σ
3.6. THE TAUBERIAN THEOREM 3–16
Thus
FY (s) − FX (s) → 0 as X, Y → ∞.
Hence Z ∞
F (s) = e−xs f (x)dx
0
{s = x + iy : −δ ≤ x ≤ 0; −R ≤ y ≤ R}.
γ2a Ri
−δ + Ri
γ1
γ2b
−δ − Ri γ2c −Ri
Note that we are playing with two constants, X and R, both tending to
∞. The interaction between them is subtle. First we fix R, and let X → ∞.
We shall show that there is a constant c such that
for sufficiently large X. Since this holds for all R, it will show that
FX (0) → F (0) as X → ∞,
as required.
First we consider
Z
ds
I1 (X, R) = (FX (s) − F (s)) λ(s) .
γ1 s
For σ = <(s) > 0, Z ∞
FX (s) − F (s) = e−xs dx.
X
Thus
Z ∞
|FX (s) − F (s)| ≤ C e−xσ dx
X
C
= e−Xσ .
σ
As to the factor λ(s),
|eXs | = eXσ ;
while if s = Reiθ then
s2
1+ = 1 + e2iθ ,
R2
3.6. THE TAUBERIAN THEOREM 3–18
and so
s2 2σ
|1 + 2
| = eiθ + e−iθ = 2 cos θ = .
R R
Hence
C −Xσ Xσ 2σ
|(FX (s) − F (s)) λ(s)| ≤ e ·e
σ R
2C
= .
R
(We see now how the two parts of λ(s) were chosen to cancel out the
factors e−Xσ and 1/σ.)
Since s = Reiθ ,
ds
= ieiθ dθ;
s
and so
2Cπ
|I1 (X, R)| ≤ .
R
Turning to the part γ2 of the integral in the negative half-plane, we con-
sider FX (s) and F (s) separately:
where
Z
ds
I20 (X, R) = FX (s)λ(s)
γ2 s
Z
ds
I200 (X, R) = F (s)λ(s) .
γ2 s
Since FX (s) is an entire function, we can replace the contour γ2 in the
integral I20 (X, R) by the half-circle γ20 of radius R in the negative half-plane
(Fig 3.3), ie the complementary half-circle to γ1 .
We have Z X
FX (s) = e−xs f (x)dx.
0
Thus if σ = <(s) ≤ 0 then
Z X
|FX (s)| ≤ C e−xσ dx
0
C −Xσ
≤ e .
−σ
As before,
|eXs | = eXσ ;
3.6. THE TAUBERIAN THEOREM 3–19
Ri
γ20 γ2
−Ri
while
s2
|1 + | = |eiθ + e−iθ |
R2
= 2|cos θ|
−2σ
= .
R
Thus
2Cπ
I20 (X, R) ≤ .
R
It remains to consider
Z
ds
I200 (X, R) = F (s)λ(s) .
γ2 s
We divide the integrand into two parts: the factor
eXs → 0 as X → ∞
for all s ∈ γ2 except for the two end-points ±Ri; while the remaining factor
s2
!
1
F (s) 1 + 2
R s
|F (s)| ≤ D.
I200 (X, R) → 0 as X → ∞.
3.7. PROOF 3–20
More precisely,
Z δ
|I2a (X, R)|, |I2c (X, R)| ≤ D e−Xσ dσ
0
D
≤ ,
X
while
|I2b (X, R)| ≤ 2Re−Xδ .
Thus all three parts of I2 (X, R) tend to 0, and so
I2 (X, R) → 0 as X → ∞
FX (0) − F (0) → 0 as X → ∞,
as required. J
3.7 Proof
We now have all the ingredients to complete the proof of the Prime Number
Theorem.
Proof I By Proposition 3.3, it is sufficient to prove that
θ(x) ∼ x.
θ(x) = O(x).
θ(x) ≤ Cx
for all x ≥ 0.
3.7. PROOF 3–21
But !
2n
≤ 22n ,
n
since the binomial coefficient is one term in the expansion of (1 + 1)2n . Thus
p ≤ 22n .
Y
n<p≤2n
Now let
ψ(x) = θ(x) − x.
We have to show that
ψ(x) = o(x).
For <(s) > 1, let Z ∞
Ψ(s) = x−s dψ.
1
Integrating by parts,
Z X Z X
−s
h
−s
iX dx
x dψ = x ψ(x) +s x−s ψ(x)
1 1 1 x
Z X
dx
= X −s ψ(X) − 1 + s x−s ψ(x) .
1 x
But
X −s ψ(X) → 0 as X → ∞
since
|ψ(X)| ≤ max(θ(X), X) ≤ C 0 X.
Thus
Z ∞
dx
Ψ(x) = 1 + s x−s ψ(x)
1 x
Z ∞
dx
=1+s x−s (θ(x) − x)
1 x
1
= 1 + s Θ(s) − .
s−1
Now Θ(s) has a pole at s = 1 with residue 1 (arising from the pole of
ζ(s)). It follows that Ψ(s) is holomorphic at s = 1; and it has no poles
elsewhere on <(s) = 1, since Θ(s) does not. Thus
Z ∞
1 dx
(Ψ(s) − 1) = x−s ψ(x)
s 1 x
is holomorphic in <(s) ≥ 1,
On making the change of variable x = et (we can think of this as passing
from the multiplicative group R+ to the additive group R),
Z ∞
1 dx
(Ψ(s) − 1) = x−s ψ(x)
s Z1 x
∞
= e−ts ψ(et )dt.
0
3.7. PROOF 3–23
converges to Ψ(1). (We only need the convergence, not the value.)
Changing variables back to x = et , we deduce that
Z ∞
ψ(x) Z ∞
θ(x) − x
2
dx = dx
1 x 1 x2
converges.
It remains to show that this implies that
θ(x) ∼ x.
(or both). In other words, there exists a δ > 0 such that either
θ(X) ≥ (1 + δ)X
θ(X) ≤ (1 − δ)X
ie
θ(x) − x ≥ 0
But the fact that there exist such intervals [X, (1 + δ)X] with arbitrarily
large X contradicts the convergence of
Z ∞
θ(x) − x
dx,
x2
which we have already established. We conclude that
θ(x)
lim sup ≤ 1.
x
3.7. PROOF 3–25
Similarly, suppose
θ(X) ≤ (1 − δ)X.
Since θ(x) is increasing, it follows that
(1 − δ)X ≤ x ≤ X =⇒ θ(x) ≤ θ(X) ≤ (1 − δ)X ≤ x,
ie
θ(x) − x ≤ 0
on the interval [(1 − δ)X, X].
More precisely,
Z X
θ(x) − x Z X
x − θ(x)
− 2
dx = dx
(1−δ)X x (1−δ)X x2
Z X
x − (1 − δ)X
≥ dx
(1−δ)X x2
Z 1
y − (1 − δ)
≥ dy
1−δ y2
Z 1
1
≥ (y − 1 + δ)dy
(1 − δ)2 1−δ
Z δ
1
≥ u du
(1 − δ)2 0
δ2
= .
2(1 − δ)2
Again, this contradicts the convergence of
Z ∞
θ(x) − x
dx.
x2
Hence
θ(x)
lim inf ≥ 1.
x
We have shown therefore that
θ(x)
→ 1,
x
ie
θ(x) ∼ x.
The proof of the Prime Number Theorem is complete. J
Chapter 4
χ : A → C× .
χ1 (a) = 1
for all a ∈ A.
χ(ab) = χ(a)χ(b)
4–1
4.1. CHARACTERS OF A FINITE ABELIAN GROUP 4–2
and therefore
since
χ(a)n = χ(an ) = χ(e) = 1.
Example. Suppose
Let ω = e2πi/n .
The cyclic group Cn has just n characters, namely
|χ(a)| = 1
for all a ∈ A.
χ(a−1 ) = χ(a).
|z| = 1 =⇒ z −1 = z̄.
J
4.1. CHARACTERS OF A FINITE ABELIAN GROUP 4–3
It follows that the characters can be identified with the group Z mod m;
hence
Cn∗ ∼
= Z/(n) ∼
= Cn .
We may say that the cyclic group Cn is self-dual.
Proposition 4.4. Every finite abelian group A is self-dual, ie
A∗ ∼
= A.
A = Cn1 × · · · × Cnr .
Lemma 8. If A = B × C then
A∗ = B ∗ × C ∗ .
4.1. CHARACTERS OF A FINITE ABELIAN GROUP 4–4
A → A∗∗ .
A∗∗ = A.
Proof I Since
|A∗∗ | = |A∗ | = |A|,
it is sufficient (by the Pigeon-Hole Principle) to show that the map
A → A∗∗
is injective.
Lemma 9. If a 6= e then there exists a character χ such that
χ(a) 6= 1.
J
We conclude that the homomorphism
A → A∗∗
χ(b) 6= 1.
Thus
X
(χ(b) − 1) χ(a) = 0,
a
and so
X
χ(a) = 0.
a
J
4.1. CHARACTERS OF A FINITE ABELIAN GROUP 4–7
a∈A a∈A
and the result follows from Proposition 4.6, since
χ−1 χ0 = χ1 ⇐⇒ χ = χ0 .
J
Proof I If a = e then χ(a) = 1 for all χ ∈ A∗ and the sum is evidently |A|.
Suppose a 6= e. By the Lemma to Proposition 4.5, we can find a χ0 ∈ A∗
such that
χ0 (a) 6= 1.
As χ runs over A∗ so does χ0 χ. Hence
(χ0 χ)(a)
X X
χ(a) =
χ∈A∗ χ∈A∗
= χ0 (a)
X
χ(a).
χ∈Aa st
Thus
(χ0 (a) − 1)
X
χ(a) = 0,
χ
and so
X
χ(a) = 0.
χ
J
4.1. CHARACTERS OF A FINITE ABELIAN GROUP 4–8
Proof I Since
f :A→C
form a vector space C(A) (over C) of dimension |A|, of which the |A| char-
acters of A are elements.
It is convenient to introduce an inner product in the space C(A) of func-
tions on A.
1. hg f i = hf gi;
2. hf f i ≥ 0, and hf f i = 0 ⇐⇒ f = 0;
3. hf λ1 g1 + λ2 g2 i = λ1 hf g1 i + λ2 hf g2 i.
with
1 X
λχ = hχ f i = χ(a)f (a).
|A| a∈A
Proof I The characters must form a basis for C(A), since they are linearly
independent and there are
|A| = dim C(A)
of them.
So certainly X
f= λχ χ,
χ
= λχ0 .
J
4.2. MULTIPLICATIVE CHARACTERS MODM 4–10
Corollary 4.3. Let cb (x) denote the characterestic function of the element
b, ie
1 if a = b,
cb (a) =
0 otherwise.
rs ≡ 1 mod m.
is injective, since
rs ≡ 1 mod 1.
J
4.2. MULTIPLICATIVE CHARACTERS MODM 4–11
Example.
(Z/1)× = {1},
(Z/2)× = {1},
(Z/3)× = {1, 2} = {±1} ∼ = C2 ,
(Z/4)× ∼
= {1, 3} = {±1} = C2 ,
(Z/5)× = {1, 2, 3, 4} = {±1, ±2} ∼ = C4 ,
(Z/6)× ∼
= {1, 5} = {±1} = C2 ,
(Z/7)× = {1, 2, 3, 4, 5, 6} = {±1, ±2, ±3} ∼
= C6 ,
(Z/8)× = {1, 3, 5, 7} = {±1, ±3} ∼ = C2 × C2 ,
(Z/9)× = {1, 2, 4, 5, 7, 8} = {±1, ±2, ±4} ∼
= C6 ,
is an isomorphism.
Suppose r ∈ Z/(m). Then
Hence Θ maps (Z/m)× onto (Z/m1 )× ×(Z/m2 )× , which proves the result. J
Example. Since gcd(4, 3) = 1,
1 7→ (1, 1),
5 7→ (1, 2),
7 7→ (3, 1),
11 7→ (3, 2).
4.2. MULTIPLICATIVE CHARACTERS MODM 4–12
φ(mn) = φ(m)φ(n).
gn = e
ge = e
xe − 1 = 0
over the field Z/(p). But this equation has at most e roots. It follows that
p − 1 ≤ e.
e = p − 1.
J
4.2. MULTIPLICATIVE CHARACTERS MODM 4–13
order(ai ) = pei i qi .
Then
bi = aqi i
has order ei .
But if A is a finite abelian group, and a, b ∈ A have orders r, s then
(ab)rs = 1 =⇒ n | rs.
On the other hand, since r, s are coprime we can find x, y ∈ Z such that
rx + sy = 1.
But then
(ab)sy = asy = a1−rx = a.
It follows that r | n. Similarly s | n. Since gcd(r, s) = 1 this implies that
rs | n.
Hence
n = rs.
Applying this to
a = b1 · · · br
we conclude that a has order
pe11 · · · perr = e.
J
By these two Lemmas, we can find an element a ∈ (Z/p)× of order p − 1.
Hence (Z/p)× is cyclic. J
4.2. MULTIPLICATIVE CHARACTERS MODM 4–14
(Z/pe )×
Proof I We have proved the result for e = 1. We derive the result for e > 1
in the following way.
The group (Z/pe )× has order
order(a mod p) = p − 1.
Evidently
order(a mod p) | order(a mod pe ).
Thus the order of a mod p is divisible by p − 1. It is therefore sufficient by
Lemma 11 to show that there exists an element of order pe−1 in the group.
The elements of the form x = 1 + py form a subgroup
S = {x ∈ (Z/pe )× : x ≡ 1 mod pe }
pe−2 2 2 pe−2 3 3
! !
pe−2 e−2
(1 + py) =1+p py + py + p y + ··· .
2 3
4.2. MULTIPLICATIVE CHARACTERS MODM 4–15
We claim that all the terms after the first two are divisible by pe , ie
pe−2 r r
!
pe | py
r
pr−1 > r,
eg because
pr−1 > (1 + 1)r−1 ≥ 1 + (r − 1) = r.
Thus any element of the form 1 + py where y is not divisible by p (for
example, 1 + p) must have multiplicative order pe−1 , and so must generate
S. In particular the subgroup S is cyclic, and so (Z/pe )× is cyclic. J
Turning to p = 2, it is evident that (Z/2)× is trivial, while (Z/4)× = C2 .
(Z/2e )× ∼
= C2 × C2e−2 .
Proof I Since
φ(2e ) = 2e−1 ,
(Z/2e )× contains 2e−1 elements. By the Structure Theorem for finite abelian
groups, it is sufficient to show that (Z/2e )× has exponent 2e−2 . For then one
of the factors in
(Z/2e )× = C2e1 × · · · × C2er
must be C2e−2 , and the remaining factor must be C2 .
4.2. MULTIPLICATIVE CHARACTERS MODM 4–16
(±1)2 = (±3)2 = 1.
2e−3 4 2 2e−3 6 3
! !
2 2e−3 e−3 2
(1 + 2 y) =1+2 2 y+ 2y + 2 y + ··· .
2 3
As before, all the terms after the first two are divisible by 2e , ie
pe−3 2r r
!
e
2 | 2 y
r
for r ≥ 2. For
2e−3 2r 2e−3 (2e−3 − 1) · · · (2e−3 − r + 1) 2r
!
2 = 2
r 1 · 2···r
(2e−3 − 1) · · · (2e−3 − r + 1) e−3 22r
= 2
1 · 2 · · · (r − 1) r
e−3 2r
!
2 − 1 e−3 2
= 2 .
r−1 r
22(r−1) > r,
eg because
Thus any element of the form 1 + 22 y with y odd (for example, 5) must
have multiplicative order 2e−1 , which as we have seen is sufficient to prove
the result. J
4.2. MULTIPLICATIVE CHARACTERS MODM 4–17
χ : (Z/m)∗ → C× .
χ : Z/(m) → C,
by setting
χ(r) = 0 if gcd(r, m) > 1.
Now we extend χ to a function
χ : N → Z/(m) → C
by composition. (It should cause no confusion that we use the same symbol
χ for all three functions.)
For example, suppose m = 6. Since φ(6) = 2, there are just 2 multiplica-
tive characters mod6, the principal character χ1 and the character
1 if r ≡ 1 mod 6,
χ(r) =
−1 if r ≡ 5 mod 6.
and χ(0) = 0, χ(1) = 1. (We include the latter condition to exclude the case
where f (n) = 0 for all n).
We say that χ(n) is strictly multiplicative if
χ(mn) = χ(m)χ(n)
f (n + m) = f (n)
for all n.
If is clear that if d | m then any multiplicative character modd defines a
function which is modular with modulus m.
The following result shows that every function f : N → C which is both
strictly multiplicative and modular arises in this way.
Proposition 4.17. Suppose f : N → C is modular modm. Then f (n) is
strictly multiplicative if and only if it is defined by a multiplicative character
modd for some d | m.
Proof I We argue by induction on m.
Suppose
f (d) 6= 0
for some proper divisor d | m, 1 < d < m. Then
d | m, d > 1 =⇒ f (d) = 0;
r = dr0 , m = dm0 .
4.3. DIRICHLET’S L-FUNCTIONS 4–19
Then
f (r) = f (d)f (r0 ) = 0.
On the other hand, if gcd(r, m) = 1 then r has a multiplicative inverse
modm, say
rs ≡ 1 mod m;
and so
f (r)f (s) = f (1) = 1 =⇒ f (r) 6= 0.
It follows that f (n) is defined by a function
χ : (Z/m)× → C× ,
which is readily seen to be a multiplicative character modm. J
where
Fp (s) = 1 + χ(p)p−s + χ(p2 )p−2s + · · · .
This follows from the fact that if
n = pe11 · · · perr
then
χ(n) = χ(pe11 ) · · · χ(perr ),
and so
χ(n)n−s = χ(pe11 )n−e1 s · · · χ(perr )n−er s ,
χ(n)n−s .
X
Lχ (s) =
n∈N
|S(x)| ≤ φ(m).
It follows that X
χ(r) = 0,
r∈Z/(m)
P
ie χ(r) vanishes over any complete set of residues. Hence
mq−1
X
S(mq − 1) = χ(n) = 0.
n=0
This sum contain ≤ m terms, of which at most φ(m) are non-zero. Since
|χ(n)| = 1 for each of these terms, we conclude that
|S(x)| ≤ φ(m).
J
4.3. DIRICHLET’S L-FUNCTIONS 4–21
φ(m)
|S(x)| ≤ .
2
For
[x] m(q+1)−1
X X
S(x) = χ(n) = − χ(n);
n=mq [x]+1
and these two sums together contain φ(m) non-zero terms, so one of them
contains ≤ φ(m)/2 such terms.
Integrating by parts,
N Z N
−s
x−s dS
X
χ(n)n =
M M
Z N
dx
= [x−s S(x)]N
M +s x−s S(x) .
M x
Thus
N Z N
−s −σ −σ dx
x−σ
X
| χ(n)n | ≤ φ(m)(M +N ) + |s|φ(m)
M M x
!
−σ −σ |s|
= φ(m) M +N + (M −σ − N −σ .
σ
φ(m) X
h(s) = Lχ (s) − = a(n)n−s ,
m
where
1 − φ(m)/m if gcd(n, m) = 1
a(n) =
−φ(m)/m if gcd(n, m) > 1
Evidently, X
a(r) = 0,
r∈Z/(m)
4.3. DIRICHLET’S L-FUNCTIONS 4–22
while |a(n)| < 1 for all n ∈ N. It follows by the argument we used above
that the Dirichlet series
a(n)n−s
X
h(s) =
n≥1
Proof I This follows in exactly the same way as for ζ(s). Thus if <(s) > 1
then
0
−s −1
Y
−s
χ(n)n−s ,
X X
1 − χ(p)p = χ(n)n +
p≤N n≤N
where the second sum on the right extends over those n > N all of whose
prime factors are ≤ N .
The sum
χ(n)n−s
X
n∈N, n6=0
as N → ∞.
J
Chapter 5
Dirichlet’s Theorem
r, r + m, r + 2m, . . . .
If r and m have a factor in common then clearly there is at most one prime
in this sequence, namely its first element r if r is prime:
Li(x) 1 x
πr,m ∼ ∼ .
φ(m) φ(m) log x
5–1
5.1. PREVIEW OF THE PROOF 5–2
an χ(n)n−s ,
X
=−
n
where
log p if n = pe
an =
0 otherwise.
3. Now we use the fact that we can pick out a particular residue class by
taking an appropriate linear combination of characters:
1 X L0 (s)
χ(r) χ an n−s ,
X
=
φ(m) χ Lχ (s) n≡r mod m
where the sum on the left runs over all the multiplicative characters
modm.
4. As before, it is convenient to ‘hive off’ the part of the Dirichlet series
on the right corresponding to higher prime-powers:
an n−s = Θr,m (s) + h(s),
X
where
log p p−s ,
X
Θr,m (s) =
p≡r mod
5.1. PREVIEW OF THE PROOF 5–3
while h(s) converges absolutely for <(s) > 1/2, by comparison with
ζ(2s), and so may be ignored in our argument.
5. As before (again!), Z ∞
Θr,m (s) = x−s dθr,m ,
0
where X
θr,m (x) = log p.
p≤x, p≡r mod m
ie
x
θr,m (x) = + o(x).
φ(m)
7. The function L0χ (s)/Lχ (s) has poles wherever Lχ (s) has a pole or zero.
It follows from the Product Formula that Lχ (s) has no zeros in <(s) >
1. Accordingly
1 X L0χ (s)
Θr,m (s) = − χ(r) + h(s)
φ(m) χ Lχ (s)
9. Having got over this hurdle, it follows that Θr,m (s) has a simple pole
at s = 1, arising from the pole of Lχ1 (s), with residue 1/φ(m), and no
other poles on the line <(s) = 1.
where
1
ψr,m (x) = θr,m (x) − [x]
φ(m)
1
= θr,m (x) − x + O(1).
φ(m)
log p p−s .
X
Θr,m (s) =
p≡r mod m
1 X L0 (s)
χ̄(r) χ = −Θr,m (s) + h(s),
φ(m) χ Lχ (s)
Differentiating logarithmically,
L0χ (s) X χ(p) log p p−s
=− −s
Lχ (s) p 1 − χ(p)p
where
p−es .
X X
hr,m (s) = − log p
p pe ≡r mod m
Since the function hr,m (s) consists of certain terms taken from the corre-
sponding series for h(s) in Proposition 3.7, and since we showed that this
series converges absolutely for <(s) > 1/2, it follows that hr,m (s) also con-
verges absolutely, and so is holomorphic, in <(s) > 1/2. J
1 X L0χ (s)
=− + h(s),
φ(m) χ Lχ (s)
where h(s) is holomorphic in <(s) > 1/2 (and so may be ignored).
Each character χ for which Lχ (1 + it) = 0 will contribute to the residue
of Θr,m (s) at s = 1 + it. More precisely, if the multiplicity of this zero is mχ
then
1 X
res1+it (Θr,m ) = − mχ .
φ(m) χ
(If Lχ (1 + it) 6= 0 then we set mχ = 0.) Similarly, if each Lχ (s) has a zero
with multiplicity Mχ at s = 1 + 2it then
1 X
res1+2it (Θr,m ) = − Mχ .
φ(m) χ
We know that Lχ1 (s) has a simple pole at s = 1. Suppose that, for
χ 6= χ1 , Lχ (s) has a zero with multiplicity µχ at s = 1. Then
1 X
res1 (Θr,m ) = 1− µχ .
φ(m) χ6=χ1
5.4. THE ZEROS OF Lχ (S) 5–7
But now, applying Lemma 6 to Θ1,m (s) in exactly the same way that we
applied it to Θ(s),
ie
X X X
Mχ + 4 mχ + 3 µχ ≤ 3.
mχ = 0 for all χ.
(For if mχ ≥ 1 for any χ this will already ‘out-vote’ the right-hand side.) In
other words,
Lχ (1 + it) 6= 0.
J
Proof I In the proof above, we lumped all the Lχ (s) together. We can
equally well consider the Lχ (s) separately, by modifying Lemma 6 slightly,
as follows.
Lemma 13. Let
χ(p) log p p−s .
X
Θχ (s) =
Then
< (Θχ (1 + 2i + σ) + 4Θχ (1 + it + σ) + 3Θχ (1 + σ)) ≥ 0
for any σ > 0.
Proof I If χ(p) 6= 0 then |χ(p)| = 1, say
χ(p) = eiθp .
It follows that
< χ2 (p)p−2it + 4χ(p)p−it + 3 = cos (2(t log p + θp ))+4 cos(t log p+θp )+3 ≥ 0,
ie
−Mχ − 4mχ + 3 ≥ 0,
where Mχ , mχ are the multiplicities of the zeros of Lχ2 (s) at s = 1 + 2it and
of Lχ (s) at s = 1 + it. Since Mχ and mχ are both non-negative integers, it
follows that
mχ = 0,
ie
Lχ (1 + it) 6= 0.
J
There is one important difference between the proofs of the Prime Number
Theorem and Dirichlet’s Theorem. In the earlier proof, we knew that ζ(s)
had a simple pole at s = 1. But now, while we know that Lχ1 (s) has a
simple pole at s = 1 we must also consider the behaviour of Lχ (s) at s = 1
for χ 6= χ1 .
Of course, Lχ (s) cannot have a pole at s = 1 if χ 6= χ1 , since we know
by Proposition 4.18 that Lχ (s) is holomorphic in <(s) > 0. However, it
could have a zero at s = 1, and this would affect the residue of Θr,m (s) at
s = 1, and that in turn would affect the number of primes in the arithmetic
sequence.
We must show that this does not in fact occur, ie
Lχ (1) 6= 0
if χ 6= χ1 .
It turns out that there are two very different cases to consider, according
as χ is real or not. For non-real characters, the result follows easily by the
argument used above to show that Lχ (1 + it) 6= 0. However, the real case is
a much harder nut to crack.
Definition 5.4. The multiplicative character χ(n) mod m is said to be real
if
χ̄ = χ,
5.4. THE ZEROS OF Lχ (S) 5–9
ie
for all n ∈ N.
Proof I If χ(n) ∈ {0, ±1} then evidently χ is real.
Conversely, suppose χ is real. If χ(n) 6= 0 then |χ(n)| = 1. Hence
χ(n) = ±1, since these are the only reals on the unit circle in C. J
Corollary 5.2. Suppose χ is a multiplicative character modm. Then
χ real ⇐⇒ χ2 = χ1 .
Lχ (1) 6= 0.
Proof I We have in effect already proved this result, in both of the proofs
of Proposition 5.5.
Thus in the first proof, taking any point s = 1 + it (whether Lχ (s) has a
zero there or not) it follows that
X
µχ ≤ 1.
χ
In other words,
Lχ (1) = 0
for at most one character χ.
But
Lχ (1) = 0 =⇒ Lχ (σ) → 0 as σ → 1 + 0
χ(n)nσ → 0
X
=⇒
χ(n)nσ → 0
X
=⇒
=⇒ Lχ̄ (σ) → 0
=⇒ Lχ̄ (1) = 0.
even if t = 0, ie
res1 (Θχ2 ) + 4 res1 (Θχ ) + 3 ≥ 0
But now if χ is not real then χ2 6= χ1 and so Θχ2 does not have a pole
at s = 1. Hence both residues are negative, and we deduce as before that
Θχ (s) cannot have a zero at s = 1. J
Remark. These proofs might be considered something of overkill. More sim-
ply,
log p p1+σ ≥ 0
X
Θ1,m (1 + σ) =
p≡ mod m
res1 (Θ1,m ) ≥ 0,
ie
X
1− µχ ≥ 0,
χ
Lχ (1) 6= 0.
The putative zero of Lχ (s) at s = 1 cancels out the pole of ζ(s), leaving a
function F (s) holomorphic in <(s) > 0.
The following result on the product of two Dirichlet series is readily es-
tablished.
Lemma 14. Suppose the two Dirichlet series
an n−s , bn n−s
X X
f (s) = g(s) =
5.4. THE ZEROS OF Lχ (S) 5–11
cn n−s ,
X
f (s)g(s) =
where X
cn = ad be ,
n=de
f (n)n−s ,
X
F (s) =
where X
f (n) = χ(d).
d|n
3. f (n2 ) > 0.
Proof I 1. In general, if χ(n) is multiplicative then so is
X
f (n) = d | nχ(d).
= f (n1 )f (n2 ).
2. Suppose
n = pe11 · · · perr .
Since f (n) is multiplicative,
But
f (pe ) = 1, 2e + 1 or 1
f (p2e ) > 0,
and so
f (n2 ) > 0.
J
Now suppose
f (n)n−s
X
F (s) =
has abscissa of convergence σ0 . Since the coefficients are non-negative this is
also the abscissa of absolute convergence.
By Proposition 2.25, since F (s) is holomorphic in <(s) > 0 it follows that
σ0 ≤ 0.
f (n)n−σ < ∞
X
= ζ(2σ).
Lχ (1) 6= 0
L0χ (s)
Lχ (s)
is holomorphic in <(s) ≥ 1; while on the other hand, Lχ1 (s) has a simple
pole at s = 1, by Proposition 4.18, and so
L0χ1 (s)
Lχ1 (s)
for <(s) > 1, we can apply our Tauberian Theorem, Proposition 3.9, with
1
F (s) = Ψr,m (s + 1)
s+1
and
f (x) = e−x ψr,m (ex ).
(As we noted earlier, the condition that f (x) is bounded follows at once from
the fact that
θr,m (x) ≤ θ(x) ≤ Cx
for some constant C.)
We conclude that
Z ∞ Z ∞
−t t ψr,m (x)
e ψr,m (e )dt = dx
0 1 x2
Z ∞
θr,m (x) − x/φ(m)
= dx
1 x2
converges; and from this we deduce, as before, that
x
θr,m (x) ∼ ,
φ(m)
6.1 Definition
Definition 6.1. For <(s) > 0 we set
Z ∞
dx
Γ(s) = xs e−x
0 x
The integral converges as x → ∞ for all s, since e−x → 0 faster than any
power xn → ∞. It converges at 0 for <(s) > 0 since
6–1
6.2. THE FIRST IDENTITY 6–2
Γ(s + 1) = sΓ(s).
Γ(n + 1) = n!
Proof I For n = 0,
Z ∞
Γ(1) = e−x dx
0
h i∞
= −e−x
0
= 1.
This holds for <(s) > 0. But the right-hand side is defined for <(s) > −n,
and so extends Γ(s) to this region.
6.4. AN ALTERNATIVE APPROACH 6–3
(The cut prevents us encircling 0 and thus passing from one branch of log z
to another.) On the upper edge of the cut θ = 0, and so
log z = log x
at z = x > 0.
Passing to
z s = es log z ,
we have
z s = xs
at z = x on the upper edge of the cut, while
z s = e2πis xs
6.4. AN ALTERNATIVE APPROACH 6–4
γ1
γ2
γ3
In other words, Z
f (z) dz
γ
Proof I As → 0,
Z ∞
dx
I1 (s) → − xs e−x = −Γ(s).
0 x
Similarly, Z ∞
dx
I3 (s) → e2πis xs e−x = e2πis Γ(s).
0 x
Also, if σ = <(s),
We conclude that
I(s) → (e2πis − 1)Γ(s)
as → 0. Since I(s) is in fact independent of , it follows that
ie
1
Γ(s) = I(s)
e2πis −1
for <(s) > 0. J
The integral I(s) converges for all s ∈ C, since the ‘diversion’ round 0
along γ2 avoids the problem of convergence at s = 0; it therefore defines an
entire function.
6.5. Γ(S) AS A LIMIT 6–6
Γ(s, n) → Γ(s).
6.5. Γ(S) AS A LIMIT 6–7
x2
n !
x x
log 1 − = −n + + ···
n n 2n2
1
= −t + O( ).
n
x 2
In fact, since each term −x, − 2n , . . . increases with n, this argument shows
that (1 − x/n)n increases monotonically to e−x , for each x ≥ 0.
Let
(1 − x/n)n if 0 ≤ x ≤ n
f (x, n) =
0 if x > n
Then
f (x, n) → e−x
uniformly in any finite range [0, X]; and
0 ≤ f (x, n) ≤ e−x
for all x.
It follows that if <(s) > 0 then
n
Z n
1 dx Z ∞ s dx
s
x 1− = x f (x, n) → Γ(s)
0 x x 0 x
as n → ∞.
6.5. Γ(S) AS A LIMIT 6–8
n!
= n ns+n
n s(s + 1) · · · (s + n)
n!ns
=
s(s + 1) · · · (s + n)
= Γ(s, n).
Γ(s, n) → Γ(s)
nr+s n!
Γ(s + r, n) =
(s + r)(s + r + 1) · · · (s + r + n)
s(s + 1) · · · (s + n)
= nr Γ(s, n).
(s + r)(s + r + 1) · · · (s + r + n)
Thus if n ≥ r,
Γ(s + r, n) nr
Γ(s, n) = .
s(s + 1) · · · (s + r − 1) (s + n + 1) · · · (s + n + r)
6.5. Γ(S) AS A LIMIT 6–9
Moreover,
nr 1
= s+1 s+r →1
(s + n + 1) · · · (s + n + r) (1 + n
) · · · (1 + n
as n → ∞. It follows that
Γ(s + r)
Γ(s, n) → = Γ(s).
s(s + 1) · · · (s + r − 1)
We have thus extended the result to <(s) > −r, and so to the whole plane
(excluding the poles s = 0, −1, −2, . . . ). J
We can re-write Γ(s, n) as
ns 1
Γ(s, n) = .
s (1 + s)(1 + 2s ) · · · (1 + ns )
Thus −1
s
s
Y
sΓ(s, n) = n 1+ .
1≤m≤n m
We can also re-write n as
23 n
n= ···
12 n−1
1
Y
= 1+ .
1≤m≤(n−1)
m
Thus s
1
ns =
Y
1+ .
1≤m≤(n−1)
m
Hence −s ( −1 s )
1 s 1
Y
sΓ(s, n) = 1 + 1+ 1+ .
n 1≤m≤n m m
Since (1 + n1 )s → 1, it follows that
( −1 s )
Y s 1
1+ 1+ → sΓ(s).
1≤m≤n m m
6.6. THE SECOND IDENTITY 6–10
and bounded) subset C not containing any of the poles, the function de-
fined by the infinite product is holomorphic in C. This gives a third way of
extending Γ(s) holomorphically to the entire plane.
Thus √
Γ(1/2) = ± π.
Since Z ∞
dx
Γ(1/2) = x1/2 e−x > 0,
0 x
it follows that √
Γ(1/2) = π.
J
Corollary 6.2. For each n ∈ N,
1 13 1 1
Γ(n + ) = · · · (n − ) √
2 22 2 π
ie
√
Γ(n)Γ(n + 12 ) = 21−2n πΓ(2n).
This strongly suggests — but does not establish— the following result.
6.8. THE EULERIAN INTEGRAL 6–12
t=0
X =x+y
x
O t=1
B(u, v) is often called the Eulerian integral of the first kind ; the integral
by which we defined Γ(s) being the Eulerian integral of the second kind.
Γ(u)Γ(v)
B(u, v) = .
Γ(u + v)
as required. J
This provides an alternative proof of our second identity
π
Γ(s)Γ(1 − s) = .
sin πs
For suppose 0 < <(s) < 1. Then
Γ(s)Γ(1 − s) = Γ(1)B(s, 1 − s)
Z 1
dt
= ts (1 − t)−s
0 t
Z 1 s
t dt
= .
0 1−t x
Let
t
u= .
1−t
As t increases from 0 to 1, u increases from 0 to ∞. Also
u 1
t= =1− ,
u+1 u+1
and so
dt u + 1 du
=
t u (u + 1)2
du
= .
u(u + 1)
Thus Z ∞
us
Γ(s)Γ(1 − s) = du
0 u(u + 1)
Now we can play the same ‘trick’ that we used to continue Γ(s) analyti-
cally:
1 Z
zs
Γ(s)Γ(1 − s) = 2πis dz,
e − 1 γ z(z + 1)
6.8. THE EULERIAN INTEGRAL 6–15
γ10
γ4 γ2
−1
γ30
where γ is the contour shown in Fig 6.1, with the proviso now that < 1, to
avoid the pole at s = −1.
But now let us ‘complete’ the contour with a large circle radius R (Fig 6.4).
Let
0
Z
zs
I (s) = dz,
γ 0 z(z + 1)
where γ 0 = γ10 +γ2 +γ30 +γ4 , with corresponding definitions of I10 (s), I2 (s), I30 (s), I4 (s).
As R → ∞,
I10 (s) → I1 (s), I30 (s) → I3 (s),
Also
Rσ
|I4 (s)| ≤ 2πR ;
R(R − 1)
and so
I4 (s) → 0
as R → ∞.
In fact I 0 (s) is independent of R (provided R > 1) by the same argument
that showed I(s) was independent of . Hence
I 0 (s) = I(s).
But now we can compute I 0 (s) by Cauchy’s Theorem. Since we are going
6.8. THE EULERIAN INTEGRAL 6–16
We conclude that
2πieπis
Γ(s)Γ(1 − s) = 2πis
e −1
2i
= πis π
e − e−πis
π
= ,
sin πs
since sin z = (eiz − e−iz )/2i.
Chapter 7
7–1
7.1. ANALYTIC CONTINUATION OF ζ(S) 7–2
2πi
γ1
γ2
γ3
−2πi
Proposition 7.2. The Riemann zeta function ζ(s) can be analytically con-
tinued to the whole complex plane C through the formula
1 Z
z s dz
Γ(s)ζ(s) = .
e2πis − 1 γ ez − 1 z
Proof I Let Z
z s dz
I(s) = ,
γ ez − 1 z
so that
I(s) = I1 (s) + I2 (s) + I3 (s),
7.1. ANALYTIC CONTINUATION OF ζ(S) 7–3
by Proposition 7.1.
On the other hand, the function
z
f (z) =
ez −1
is holomorphic, and so bounded, in |z| ≤ π, say
|f (z)| ≤ C,
ie
1
| | ≤ C|z|−1 .
ez −1
Hence
ie
1
ζ(s) = I(s).
e2πis −1
J
7.2. THE FUNCTIONAL EQUATION 7–4
Γ(1 − s) −πis Z z s dz
ζ(s) = e .
2πi γ ez − 1 z
Thus
1
ζ(s) = I(s)
Γ(s)(e2πis − 1)
Γ(1 − s) −πis
= e I(s).
2πi
J
Proposition 7.3. The only pole of ζ(s) in the entire complex plane C is the
simple pole (with residue 1) at s = 1.
2πi
γ10
γ4 γ2
γ30
−2πi
γ 0 = γ10 + γ2 + γ30 + γ4
(Fig 7.2), where γ10 runs from R to along the upper edge of the cut, γ2 is a
small circle radius as before, γ30 runs from to R along the lower edge of the
cut, and γ4 is the circle radius R = (2n+1)π considered above. Let us denote
the corresponding integrals along these contours by I10 (s), I2 (s), I30 (s), I4 (s),
so that
I 0 (s) = I10 (s) + I2 (s) + I30 (s) + I4 (s).
To avoid the poles of 1/(ez − 1) at z = 2nπi let us take
R = (2n + 1)π,
so that the circle γ4 passes mid-way between two poles at the top, and simi-
larly at the bottom.
7.2. THE FUNCTIONAL EQUATION 7–6
As n → ∞ (and so R → ∞),
I4 (s) → 0
I 0 (s) → I(s).
The function
1
f (z) =
ez −1
has poles at
z = 2nπi (n ∈ Z).
The following Lemma shows that provided we keep a reasonable distance
away from the poles, the function f (z) will remain reasonably small.
Lemma 17. There is a constant C such that
1
≤ C.
|ez − 1|
provided
|z − 2nπi| ≥ 1
for all n ∈ Z.
Proof I Since f (z) = 1/(ez − 1) is periodic with period 2πi, it is sufficient
to consider its value in the strip
S = {z = x + iy : −π ≤ y ≤ π}
πi
R
S
−πi
in
R = {z : −1 ≤ <(z) ≤ 1}
(Fig 7.3). Thus
|f (z)| ≤ c
in R \ D (since |z| ≥ 1 outside D).
On the other hand, if <(z) ≥ 1 then
|ez − 1| ≥ |ez | − 1 ≥ e − 1;
It follows that
1
≤ C = max(c, 1/(1 − e−1 ).
|ez − 1|
J
By the Lemma,
1
≤C
|ez − 1|
7.2. THE FUNCTIONAL EQUATION 7–8
as n → ∞.
But now. since the contour γ 0 is closed, we can compute the integral
I 0 (s) by Cauchy’s Theorem, from the residues of F (z) at its poles within the
contour. Since the contour runs in the ‘wrong’ direction (clockwise rather
than anti-clockwise), we must negate the sum. Thus
I 0 (s) = −2πi
X
(res2mπi (F ) + res−2mπi (F )) .
0<m≤n
In the neighbourhood of z = 0,
1 1 1
f (z) = = = + h(z),
ez −1 2
z + z /2 + · · · z
z = reiθ (0 ≤ θ ≤ 2π)
It follows that
eπi/2 e3πi/2
!
0 s
X
I (s) = −2πi (2nπ) +
0<m≤n 2nπi −2nπi
= (2π)s ns−1 e3πi/2 − eπi/2 .
X
0<m≤n
Since
ns−1 = ζ(1 − s),
X
we conclude that
1
Γ(s)ζ(s) = I(s)
e2πis −1
1
= lim I 0 (s)
e2πis − 1 n→∞
e3πis/2 − eπis/2
= (2π)s ζ(1 − s)
e2πis − 1
eπis/2 − e−πis/2
= (2π)s ζ(1 − s)
eπis − e−πis
1
= (2π)s ζ(1 − s)
eπis/2 + e−πis/2
1
= (2π)s ζ(1 − s),
2 cos(πs/2)
ie
2 cos(πs/2)Γ(s)ζ(s) = (2π)s ζ(1 − s).
All this was on the assumption that <(s) < 0. But now it follows by
analytic continuation that the result holds for all s ∈ C. J
The functional equation can be re-written in various ways, using the
properties of Γ(s) established in Chapter 6. In particular we can express it
in a form invariant under the transformation s → 1 − s. (In geometric terms,
this transformation describes reflection in the point s = 1/2.)
ξ(1 − s) = ξ(s).
7.2. THE FUNCTIONAL EQUATION 7–10
Proof I The function Γ(s/2) has poles at s = 0, −2, −4, . . . , while ζ(s) has
zeros at s = −2, −4, . . . . This leaves a pole at s = 0 which is cancelled by
the zero of the factor s, In addition, the pole of ζ(s) at s = 1 is cancelled by
the zero of the factor s − 1. Thus all possible poles of ξ(s) are accounted for,
and this function must be entire.
By the second gamma function identity (Proposition 6.7), with (1 − s)/2
in place of s,
1−s
1+s
π
Γ 2
Γ 2
= π(1−s)
sin 2
π
= ,
cos πs
2
Γ( 2s ) πs 1
1−s = 2
1−s
cos Γ(s)π − 2 .
Γ( 2 ) 2
ζ(1 − s) πs
= 21−s cos Γ(s)π −s
ζ(s) 2
s
Γ( 2 ) 1 −s
= 1−s π2 .
Γ( 2 )
Thus if we set
s
η(s) = Γ( )ζ(s)
2
then
η(1 − s) 1
= π 2 −s .
η(s)
then
θ(1 − s) 1
= π 2 −s .
θ(s)
Hence
η(1 − s) θ(1 − s)
=
η(s) θ(s)
ie
β(1 − s) = β(s),
where
β(s) = η(s)θ(s)
s
= π− 2 Γ s
2
ζ(s).
satisfies
ξ(1 − s) = ξ(s).
J
s 7→ 1 − 2
sends the left-hand half-plane <(s) < 0 into the half-plane <(s) > 1, where
ζ(s) is well-behaved.
We already know that ζ(s) has no poles in <(s) ≤ 0, by Proposition 7.3.
It does however have zeros, as we shall see.
Proposition 7.6. The Riemann zeta function ζ(s) has simple zeros at s =
−2, −4, −6, . . . . It has no other zeros (or poles) in <(s) ≤ 0.
7.4. THE VALUES OF ζ(2N ) 7–12
Proof I Since π −s/2 and Γ(s/2) have no zeros anywhere, it follows that any
zero of
s
ξ(s) = s(s − 1)π 2 Γ ss ζ(s)
must be a zero of ζ(s), except possibly for s = 0, 1.
At s = 1, ζ(s) has a simple pole which is cancelled out by the zero of
s − 1. Thus ξ(1) 6= 0; and since ξ(0) = ξ(1) by the functional equation
ξ(1 − s) = ξ(s), it follows that ξ(0) 6= 0. Thus
ξ(s) = 0 =⇒ ζ(s) = 0.
Now we know that ζ(s) has no zeros in <(s) ≥ 1 by Propositions 3.4 and
3.8. Hence ξ(s) has no zeros in <(s) ≥ 1. Thus ξ(s) has no zeros in <(s) ≤ 0,
since ξ(1 − s) = ξ(s).
It follows that ζ(s) has zeros in <(s) ≤ 0 just at those points where
s(s − 1)Γ(s/2) has poles. Now Γ(s/2) has simple poles at s = 0, −2, −4, . . . ;
but the pole at s = 0 is cancelled by the zero of s at this point. We conclude
that ζ(s) has simple zeros at s = −2, −4, −6, . . . , and that these are the only
zeros of ζ(s) in <(s) ≤ 0. J
Remarks. 1. Different authors use slightly different notations for the Bernouilli
numbers. As we shall see, the odd Bernouilli numbers all vanish after
the first. What we call B2n is sometimes denoted by Bn .
7.4. THE VALUES OF ζ(2N ) 7–13
Again, it will follow from our formulae for ζ(2n) that B2 , B4 , B6 , . . . are
alternatively positive and negative. Sometimes Bn is used to denote the
absolute value, so that Bn ≥ 0 for all n.
However, we shall stick with the definition above.
2. We can compute the Bernouilli numbers recursively from the identity
1 + 12 z + 16 z 2 + 1 3
24
z + 1 4
120
z + ··· B0 + B1 z + 12 B2 z 2 + 61 B3 z 3 + 1
B z4
24 4
+ · · · = 1.
B0 = 1.
Comparing coefficients of z, z 2 , z 3 , z 4 ,
B1 + 21 B0 =0 =⇒ B1 = − 12 ,
1
B + 12 B1 + 61 B0
2 2
=0 =⇒ B2 = − 16 ,
1
B + 41 B2 + 16 B1 + 24
6 3
1
B0 =0 =⇒ B3 = 0,
1 1 1 1 1 1
B + 12 B3 + 12 B2 + 24 B1 + 24 B0
24 4
=0 =⇒ B4 = − 30 .
B2n+1 = 0 (n = 1, 2, 3, . . . ).
Proof I Let
z
f (z) = .
ez − 1
Then
−z
f (−z) =
e−z
−1
zez
= z .
e −1
Thus
f (z) − f (−z) = −z.
On the other hand,
X zn
f (z) − f (−z) = 2 Bn .
n odd n!
It follows that
1
B1 = − , B3 = B5 = · · · = 0.
2
J
7.4. THE VALUES OF ζ(2N ) 7–14
Γ(2n)
ζ(1 − 2n) = − I2 ,
2πi
where
Z
I2 = F (z)dz
γ2
= 2πi res0 (F ),
by Cauchy’s Theorem.
But
z −2n
F (z) =
ez − 1
z
= z −2n−1
ez −1
−2n−1
X zr
=z Br
r≥0 r!
X z r−2n−1
= Br .
r≥0 r!
B2n
res0 (F ) = .
(2n)!
7.4. THE VALUES OF ζ(2N ) 7–15
Hence
Γ(2n) B2n
ζ(1 − 2n) = (−2πi)
2πi (2n)!
(2n − 1)!
=− B2n
(2n)!
B2n
=− .
2n
J
Thus
ζ(1 − 2n)
ζ(2n) = (−1)n 22n−1 π 2n
Γ(2n)
B2n
= (−1)n−1 22n−1 π 2n
2nΓ(2n)
B2n
= (−1)n−1 22n−1 π 2n .
(2n)!
J
For n = 1 this gives
1 1 2 π2
ζ(2) = 1 + + + · · · = π B2 = ,
22 32 6
a result which is probably familiar, and which can be proved in several ways.
For n = 2 it gives
1 1 1 4 π4
ζ(4) = 1 + + + · · · = − 3
π B4 = .
24 34 90
7.5. POSTSCRIPT 7–16
7.5 Postscript
In his seminal paper (Appendix B), Riemann gave a second proof of the
functional equation. Although at first sight this seems more complicated than
his first proof (given above) it has turned out to have far greater significance.
By Lemma 16, with πn2 in place of n,
Z ∞
2x dx
Γ(s)(πn2 )−s = xs e−πn .
0 x
Summing over n, as before,
Z ∞
ψ(x) − 1 dx
Γ(s)π −s ζ(2s) = zs ,
0 2 x
where ∞
2
e−πn x .
X
ψ(x) =
−∞
θ(x + 1) = θ(x).
0–18
CONTENTS 0–19