Zeta Course Notes 1
Zeta Course Notes 1
ADAM J HARPER
Abstract. These are rough notes covering the first block of lectures in “The Riemann
Zeta Function” course. In these first lectures we will introduce the zeta function ζ(s),
obtain some basic estimates for it, and use it to prove the Prime Number Theorem.
We will also develop a general procedure for widening the zero-free region for the zeta
function (given suitable analytic information), and thus improving the error term in
the Prime Number Theorem.
(No originality is claimed for any of the contents of these notes. In particular, they
borrow from the classic books of Ivić [1] and Titchmarsh [2].)
The Riemann zeta function ζ(s) is a meromorphic function on the entire complex
plane, but its definition is not straightforward to explain for all s ∈ C. We will begin by
defining the zeta function when <(s) > 1. Later we will extend the definition to cover
the range <(s) > 0, (which is by far the most important for applications), and finally
to cover all s ∈ C. (Note that, since the half plane {<(s) > 1} is a set containing a
limit point, the Identity Theorem from complex analysis implies there is at most one
analytic continuation of ζ(s) to a meromorphic function on C.)
Definition 1.1. For each s ∈ C such that <(s) > 1, the Riemann zeta function ζ(s) is
defined by
∞
X 1
ζ(s) := .
n=1
ns
Note that the series is absolutely convergent.
is the prototypical Dirichlet series, but we will meet some others later in the course.
To get a feel for how the zeta function behaves when <(s) > 1, we shall quickly prove
the following approximation and estimate.
Lemma 1.2. If s = σ + it, with σ > 1 and t ∈ R, then for any natural number x we
have
X 1 x1−s
ζ(s) = s
+ + O(|s|x−σ ).
n≤x
n s − 1
Consequently, if σ > 1 and |t| ≥ 2 then ζ(σ + it) = O(log |t|).
if <(s) > 1. This is not entirely correct because the integral doesn’t approximate the
sum very accurately at the beginning, but the lemma asserts that the approximation
becomes good for the later terms.
Note that Z ∞ −s ∞
dw w 1 1
s+1
= = ,
n w −s n s ns
so in fact we have
!
X 1 X Z ∞ dw Z ∞ X dw
s
=s s+1
=s 1 ,
n>x
n n>x n
w x x<n≤w
ws+1
where bwc here denotes the integer part of w. The integral is equal to x1−s /(s − 1), and
the error term is O(|s|x−σ ), as claimed.
Finally, applying the estimate we just proved, with x chosen as the integer part of
|t|, yields that
X 1
1 X 1
ζ(σ + it) = + O + O(1) = + O(1).
nσ+it |t| nσ+it
n≤|t| n≤|t|
The sum here is O( n≤|t| n1 ), and since that is O(log |t|) the claimed estimate follows.
P
The proof of Lemma 1.2 has lots of potential for improvement and further develop-
ment, and we shall revisit it shortly when we define the zeta function on the extended
range <(s) > 0.
For each <(s) > 1, the zeta function is built from some information about every
natural number n. Since each n is a (possibly empty) product of primes, in an essentially
unique way, one might hope that the values of the zeta function can be related to the
RIEMANN ZETA FUNCTION LECTURE NOTES 1 3
behaviour of ps for primes p only. The following result, the so-called Euler product
expression for ζ(s), provides such a connection.
where cP (n) is 1 if all the prime factors of n are ≤ P , and cP (n) is zero otherwise.
Then we see
Y −1 ∞ ∞
1 X 1 X 1
ζ(s) − 1− s = s
≤ <(s)
.
p≤P
p n=1,
n n=1,
n
cP (n)=0 cP (n)=0
When Riemann first proposed that one could use the zeta function, together with
complex analysis, to investigate the distribution of primes, he proposed working directly
with the counting function X
π(x) := 1.
p≤x
However, as we shall see it turns out to be technically easier to study a weighted counting
function, and that is now the standard approach.
∞
X Λ(n) X1 d 1 d X1 1
=− = − ,
n=1
ns k ds pks ds k pks
where the sums are over all prime powers pk , and the interchange of summation and
differentiation is justified because we have the uniform bound k1 ds
d 1
pks
≤ plog p
k<(s) , and
P log p
pk<(s)
is convergent. Then we see
X1 1 −1
X 1
= log 1 − s = log ζ(s),
k pks p
p
Note that ζ(s) 6= 0 if <(s) > 1, (since the zeta function is an absolutely convergent
product in which no term vanishes), and so the logarithmic derivative ζ 0 (s)/ζ(s) is well
defined if <(s) > 1. (In fact one sees this directly in the absolute convergence of all
series in the above calculations.)
RIEMANN ZETA FUNCTION LECTURE NOTES 1 5
Remark 2.2. In the foregoing calculations, we begin to see the relevance of vanishing of
the zeta function to the behaviour of prime numbers.
After the foregoing preparations, we would like to formulate a procedure for recov-
ering information about Ψ(x) from information about ∞ Λ(n)
P
n=1 ns . One can think of
the Dirichlet series ∞ Λ(n)
P
n=1 nσ+it as a kind of Fourier transform of the values of the von
Mangoldt function, in which the oscillating terms n−it = e−it log n are analogous to the
exponential phases e2πint in a Fourier series, and the terms n−σ are present to make
everything converge. With this in mind, one could reasonably hope to formulate a
procedure like Fourier inversion, allowing one to recover information about Ψ(x) by
P∞ Λ(n)
summing/integrating the series n=1 nσ+it over a suitable range of the “frequency”
variable t. This is the procedure that we shall now develop, in a quite general context.
Then (
c+iT 1
y c min{1, T | log } if y 6= 1
Z
1 ds
δ(y) − ys < y|
2πi c−iT s min{1, Tc } if y = 1.
Proof of Lemma 2.3. The obvious approach is to use Cauchy’s Residue Theorem, and
evaluate the integral by deforming the line of integration in a suitable way.
s
For example, if 0 < y < 1 then the integrand ys tends to zero as <(s) → ∞ (in a
uniform way, independently of =(s)), and is holomorphic on the line of integration and
to the right of it (the only pole being to the left, at s = 0). Thus Cauchy’s Residue
Theorem implies that
Z c+iT Z ∞+iT Z ∞−iT
1 s ds 1 s ds 1 ds
y =− y + ys ,
2πi c−iT s 2πi c+iT s 2πi c−iT s
R ∞+iT R∞ σ yc
and we certainly have c+iT y s ds s
≤ 1
T c
y dσ = T | log y|
. On the other hand, Cauchy’s
Residue Theorem also implies that
Z c+iT Z
1 s ds 1 ds
y =− ys ,
2πi c−iT s 2πi Γ(c,T ) s
where Γ(c, T ) is the arc of the circle centred at the origin, with radius |c + iT | =
√
c2 + T 2 , that runs from c + iT to c − iT on the right. And we have Γ(c,T ) y s ds
R
s
≤
c
√ y
R
c2 +T 2 Γ(c,T )
|ds| ≤ πy c .
s
Similarly, if y > 1 then the integrand ys tends to zero as <(s) → −∞, so one can
apply Cauchy’s Residue Theorem with the contour shifted to the left instead of the
6 ADAM J HARPER
right. This time the contour encloses the pole at s = 0, which contributes its residue of
1 to the value of the integral.
Finally, if y = 1 then the integral is quite easy to estimate directly (by real variable
methods).
1
R c+i∞ s ds
Note in particular that we have δ(y) = 2πi c−i∞
y s , (where the infinite integral is
R c+iT s ds
interpreted as limT →∞ c−iT y s ), and also that for any x > 0 and any n ∈ N we have
( c
Z c+iT s x 1
1 x ds nc
min{1, T | log(x/n)| } if n 6= x
δ(x/n) − s
< c
2πi c−iT n s min{1, T } if n = x.
Lemma 2.4 (Truncated Perron formula). Let x, c, T > 0, and suppose that ∞ |an |
P
n=1 nc
is convergent. Then
0 ∞
Z c+iT X ! ∞
!
X 1 an ds X |a n | 1
an = s
xs + O xc min{1, } ,
n≤x
2πi c−iT n=1
n s n=1
nc T | log(x/n)|
where 0n≤x denotes that if x is an integer, then the final summand ax is replaced by
P
(1/2)ax .
Lemma 2.4 provides our desired relationship between a Dirichlet series and the count-
ing function of its coefficients. Note that if T is chosen larger, meaning that we input
information about the Dirichlet series at a wider range of “frequencies” c + it, then the
“big Oh” error term becomes smaller.
Remark 2.6. The integral appearing in the truncated Perron formula is a little unsatis-
factory, since the factor 1/s in the integrand doesn’t decay very fast when |=(s)| → ∞.
RIEMANN ZETA FUNCTION LECTURE NOTES 1 7
This is precisely analogous to the way in which the usual Fourier transform of (the
characteristic function of) an interval only decays like 1/|t|. One can obtain faster de-
cay on the Fourier side by introducing smooth weight functions (like the Fejér kernel)
P0
on the “physical space” side, and the same is true here if one replaces n≤x an by
P
n an Φ(n/x), where Φ is a smooth function that approximates the indicator function
1[0,1] . This is sometimes technically very useful.
We conclude from all our work in this section that for any x > 1, and any 1 < c ≤ 2
and 1 < T ≤ x (say), we have
∞
Z c+iT X ! ∞
!
1 Λ(n) ds X Λ(n) 1
Ψ(x) = s
xs + O xc min{1, }
2πi c−iT n=1
n s n=1
nc T | log(x/n)|
Z c+iT 0 c X∞
1 ζ (s) ds x Λ(n) X 1
= − xs + O c
+ 2c Λ(n) min{1, }
2πi c−iT ζ(s) s T n=1 n T | log(x/n)|
x/2<n<2x
Z c+iT 0 c X∞
1 ζ (s) ds x log n X 1
= − xs + O c
+ 2c log(2x) min{1, }
2πi c−iT ζ(s) s T n=1 n T | log(x/n)|
x/2<n<2x
Z c+iT 0 2
xc
1 ζ (s) ds x log (2x)
= − xs + O 2
+ .
2πi c−iT ζ(s) s T (c − 1) T
It is usual to choose c = 1 + 1/ log x, so that both “big Oh” terms are O(x log2 (2x)/T ),
and the term xc+it in the integrand is of order x (which is the expected size of Ψ(x)).
At this point, a possible approach to investigating Ψ(x) would be to obtain very
precise information about ζ(1 + 1/ log x + it) (and ζ 0 (1 + 1/ log x + it)), on a good range
of t relative to x, and try directly to evaluate the contour integral above, up to a small
error. Some nice work in multiplicative number theory does proceed a bit like that, but
it is not the classical approach and it is not what we shall do at the moment. Instead
we shall extend the definition of the zeta function to hold when <(s) ≤ 1, and then use
Cauchy’s Residue Theorem to estimate the integral by deforming the line of integration.
We shall also start to investigate the zeros of the zeta function, (i.e. the values s at
ζ 0 (s) xs
which ζ(s) = 0), since these may produce poles of the integrand − ζ(s) s , which
produce residues.
As promised, we shall now give a definition of the zeta function that makes sense
whenever <(s) > 0, except at s = 1 (where the function has a simple pole), and agrees
with our original definition 1.1 when <(s) > 1.
8 ADAM J HARPER
Definition 3.1. For each s ∈ C such that <(s) > 0, except for s = 1, and for any
x > 0, the Riemann zeta function is defined by
Z ∞
X 1 x1−s {x} dw
ζ(s) := s
+ + s
− s {w} s+1
,
n≤x
n s − 1 x x w
Proof of well definedness. It is obvious that, for any fixed x, the right hand side in
Definition 3.1 defines a holomorphic function on the half plane <(s) > 0, except for a
simple pole at s = 1 with residue 1 (coming from the term x1−s /(s − 1)). Moreover, in
the proof of Lemma 1.2 we have already seen that for any x ∈ N, the right hand side in
Definition 3.1 coincides with ζ(s) for <(s) > 1.
It only remains to check that the right hand side takes the same value for any choice
of x, and (by analytic continuation) it will suffice to do that when <(s) > 1. If x > 0
is not an integer, and N = bxc + 1 is the smallest integer exceeding x, then we have
Z N Z N 1−s N −s N !
dw dw w w
s {w} s+1 = s (w − (N − 1)) s+1 = s − (N − 1)
x w x w 1−s x −s x
sN 1−s sx1−s N −1
= − + (N − 1)N −s −
1−s 1−s xs
1−s 1−s
N x 1 N −1
= − + x1−s + − s− .
s−1 s−1 N xs
Then we see
Z ∞ ! Z ∞ !
X 1 x1−s {x} dw X 1 N 1−s dw
s
+ + s −s {w} s+1 − s
+ −s {w} s+1
n≤x
n s − 1 x x w n≤N
n s − 1 N w
Z N
1 x1−s N 1−s {x} dw
= − s+ − + s −s {w} s+1 ,
N s−1 s−1 x x w
and since {x} = x − (N − 1) this all vanishes, so indeed
Z ∞
X 1 x1−s {x} dw
s
+ + s −s {w} s+1 = ζ(s) ∀<(s) > 1, ∀x > 0.
n≤x
n s−1 x x w
It follows immediately from Definition 3.1 that the final estimate in Lemma 1.2
continues to hold on, and slightly to the left of, the line <(s) = 1.
100
Lemma 3.2. For any t such that |t| is sufficiently large, and any σ > 1 − log |t|
(say),
we have
ζ(σ + it) = O(log |t|).
RIEMANN ZETA FUNCTION LECTURE NOTES 1 9
To prove the estimate for ζ 0 (σ + it) we simply differentiate Definition 3.1, finding
that
X log n x1−s log x Z ∞ Z ∞
0 x1−s {x} log x dw dw
ζ (s) = − s
− − 2
− s
− {w} s+1
+s {w} log w s+1
.
n≤x
n s − 1 (s − 1) x x w x w
Proof of Lemma 3.4. The idea is roughly to express e(f (n)) as a Fourier series (in fact
we will derive something like the Poisson summation formula). It will turn out that
Rb
the zero mode in the Fourier expansion produces the main term a e(f (x))dx, whilst all
the other modes contribute to the error term O(1/(1 − δ)) (under the hypotheses of the
lemma, which imply that the non-trivial Fourier phases e(−kx) oscillate more rapidly
than e(f (x)), and so produce cancellation).
More precisely, we have
K Z 1
e(f (n)) + e(f (n + 1)) X
= lim un (k), where un (k) = e(f (n + x) − kx)dx.
2 K→∞
k=−K 0
R n+1
We note that un (0) = n e(f (x))dx, and
1
1 1 0
Z
e(f (n + x))e(−kx)
un (k) = + f (n + x)e(f (n + x) − kx)dx
−2πik 0 k 0
e(f (n)) − e(f (n + 1)) 1 n+1 0
Z
= + f (x)e(f (x) − kx)dx
2πik k n
if k 6= 0, using integration by parts and the fact that e(−kx) = e(−k(n + x)). Here the
term e(f (n))−e(f
2πik
(n+1))
always cancels the corresponding term at −k, so can be ignored.
Therefore
X X X
e(f (n)) = (un (0) + un (k)) + O(1)
a<n≤b bac+1≤n≤bbc−1 k6=0
Z b X1Z bbc
= e(f (x))dx + O(1) + f 0 (x)e(f (x) − kx)dx
a k6=0
k bac+1
b bbc
f 0 (x) d
Z Z
X 1
= e(f (x))dx + O(1) + e(f (x) − kx)dx.
a k6=0
2πik bac+1 f 0 (x) − k dx
Finally, since f 0 (x) is monotonic on [a, b], and |f 0 (x)| < 1, we observe that f 0 (x)/(f 0 (x)−
k) is also monotonic (for each fixed k 6= 0). Now one can easily check the following sum-
mation lemma of Abel: if c1 ≥ c2 ≥ ... ≥ cN , and if d1 , ..., dN ∈ R are arbitrary, then
N N N −1 N0
X X X X
cn dn ≤ |cN | dn +(cN −1 −cN ) dn +...+(c1 −c2 )|d1 | ≤ (c1 −cN +|cN |) max
0
dn .
1≤N ≤N
n=1 n=1 n=1 n=1
0 0
So if f (x)/(f (x) − k) is monotone decreasing then, by approximating the real and
imaginary parts of the integral by Riemann sums, we see
Z bbc
f 0 (x) d
0
< e(f (x) − kx)dx
bac+1 f (x) − k dx
Z X
f 0 (x) d 1
max · max < e(f (x) − kx)dx ,
bac+1≤x≤bbc f 0 (x) − k bac+1≤X≤bbc bac+1 dx |k| − δ
RIEMANN ZETA FUNCTION LECTURE NOTES 1 11
similarly for the imaginary part. If instead f 0 (x)/(f 0 (x) − k) is monotone increasing
then a similar argument yields the bound O(1/(|k| − δ)), so finally summing over k
completes the proof of the lemma.
Proof of Theorem 3.3. Let N ≥ x be a large parameter, and recall that we write σ =
<(s) and t = =(s). Then directly from Definition 3.1, we have
Z ∞
X 1 N 1−s −σ dw X 1 N 1−s
ζ(s) = s
+ + O(N ) + O(|s| σ+1
) = s
+ + O(|s|N −σ /σ).
n≤N
n s − 1 N w n≤N
n s − 1
In particular, if we choose N large enough in terms of x, σ, |t| then the error term will
be O(x−σ ).
Now to prove Theorem 3.3, it will suffice to show that
! !
X 1 N 1−s X 1 x1−s
s
+ − s
+ = O(x−σ )
n≤N
n s − 1 n≤x
n s − 1
Note that Theorem 3.3 provides a good approximation of the zeta function by a
Dirichlet polynomial with around |t| terms (usually described as a Dirichlet polynomial
of length |t|), together with the easy to understand term x1−s /(s − 1). We usually think
of the length of a Dirichlet polynomial as a measure of its complexity. It turns out
that Dirichlet polynomials of length |t|, evaluated where =(s) |t|, can be understood
12 ADAM J HARPER
moderately well (we will see this later), but if one could approximate by a shorter
Dirichlet polynomial that would be even better.
We are now ready to prove the following fundamental result about primes.
(Actually Hadamard and de la Vallée Poussin did not originally obtain the quanti-
1/10
tative estimate Ψ(x) = x + O(xe−c log x ), but from a modern perspective it follows
easily from variants of their methods. A little later, de la Vallée Poussin obtained the
√
stronger estimate Ψ(x) = x + O(xe−c log x ), which is usually called the Prime Number
Theorem with classical error term. We will discuss that result a little later.)
P
Recall that Ψ(x) := n≤x Λ(n) is a counting function of weighted prime powers.
Given Theorem 4.1, it is easy (using summation by parts) to deduce an asymptotic for
the unweighted counting function π(x) as well.
Theorem 4.3 (Weak zero-free region, Hadamard, de la Vallée Poussin, 1896). There
exists a small absolute constant c > 0 such that the following is true. For any t ∈ R
and any σ ≥ 1 − c/ log9 (|t| + 2),
1
= O(log7 (|t| + 2)).
ζ(σ + it)
RIEMANN ZETA FUNCTION LECTURE NOTES 1 13
The zero-free region {s : σ ≥ 1 − c/ log9 (|t| + 2)} supplied by Theorem 4.3 is only
a small extension of the half plane {σ > 1}, where the zeta function trivially has no
zeros (because it is an absolutely convergent product). However, it does include the line
<(s) = 1, and is sufficient to imply quite spectacular results like Theorem 4.1.
Lots of our recent work on the zeta function has made use of its series expansion.
In contrast, the proof of Theorem 4.3 is mainly based on its Euler product expression,
showing that this continues to have an influence a little to the left of the half plane
{σ > 1} where it is valid.
Proof of Theorem 4.3. We may assume throughout that σ − 1 is less than a small con-
stant, since if σ − 1 is large then the result is a trivial consequence of the Euler product.
We may also assume that |t| is larger than a small constant, since otherwise ζ(σ + it) is
in the neighbourhood of the pole of the zeta function at s = 1, in which case |1/ζ(σ +it)|
is certainly small.
Let σ 0 > 1 be a number, to be chosen later in terms of σ and t. For any t0 we see
X
1
X cos(kt0 log p)
0 0 0 0
|ζ(σ +it )| = exp{< log ζ(σ +it )} = exp{−< log 1 − σ0 +it0 } = exp{ },
p
p k
kpkσ0
p
since we have the Euler product expression (Lemma 1.3) for σ 0 > 1. Here the sum
P
pk
is over all prime powers.
The key idea is to consider the product ζ(σ 0 )3 |ζ(σ 0 + it)|4 |ζ(σ 0 + 2it)|. We see
X 3 + 4 cos(kt log p) + cos(2kt log p)
ζ(σ 0 )3 |ζ(σ 0 + it)|4 |ζ(σ 0 + 2it)| = exp{ }
k
kpkσ0
p
X 2(1 + cos(kt log p))2
= exp{ } ≥ 1,
k
kpkσ0
p
in view of an elementary trigonometric identity. This means that the only way that
|ζ(σ 0 + it)| can be very small is if ζ(σ 0 ), |ζ(σ 0 + 2it)| are very large. In fact we have
1 (σ 0 − 1)3/4
|ζ(σ 0 + it)| ≥ ,
ζ(σ 0 )3/4 |ζ(σ 0 + 2it)|1/4 log1/4 (|t| + 2)
since ζ(s) has a simple pole at s = 1, and since ζ(σ 0 + 2it) = O(log(|t| + 2)) by the
second part of Lemma 1.2 (which was stated for |t| ≥ 2 but is valid, with the bound
O(log(|t| + 2)), provided that s is away from 1).
Finally we observe
Z σ0
0
|ζ(σ + it)| ≥ |ζ(σ + it)| − |ζ 0 (r + it)|dr = |ζ(σ 0 + it)| + O((σ 0 − σ) log2 (|t| + 2)),
σ
14 ADAM J HARPER
by the second part of Lemma 3.2 (which, again, was stated for large |t| but is valid
for small |t|, with the bound O(log2 (|t| + 2)), provided s is away from 1). So if σ ≥
1 + c/ log9 (|t| + 2) we can choose σ 0 = σ, and see
c3/4
|ζ(σ + it)| ;
log7 (|t| + 2)
whereas if 1−c/ log9 (|t|+2) ≤ σ ≤ 1+c/ log9 (|t|+2) we can choose σ 0 = 1+c/ log9 (|t|+2),
and see
(σ 0 − 1)3/4 c3/4 c
1/4
= 7 , and (σ 0 − σ) log2 (|t| + 2) = O( 7 ).
log (|t| + 2) log (|t| + 2) log (|t| + 2)
In particular, provided c is a sufficiently small (but fixed) constant then |ζ(σ + it)|
c3/4
log7 (|t|+2)
in this case as well.
Proof of Theorem 4.1. In view of the truncated Perron formula (Lemma 2.4) and the
subsequent discussion, for any large x and 1 < T ≤ x we have
x log2 x
Z 1+1/ log x+iT 0
1 ζ (s) s ds
Ψ(x) = − x +O .
2πi 1+1/ log x−iT ζ(s) s T
We shall try to evaluate the integral using Cauchy’s Residue Theorem and our preceding
estimates, and will succeed up to an error term involving T . Then we will choose T to
balance our two “big Oh” terms.
Indeed, by the residue theorem and Theorem 4.3 we see
Z 1+1/ log x+iT 0
1 ζ (s) ds
− xs
2πi 1+1/ log x−iT ζ(s) s
0 Z 1−c/ log9 (T +2)+iT 0
ζ (s) xs
1 ζ (s) ds
= Ress=1 − + − xs
ζ(s) s 2πi 1−c/ log9 (T +2)−iT ζ(s) s
Z 1+1/ log x+iT 0 Z 1+1/ log x−iT 0
1 ζ (s) s ds 1 ζ (s) ds
+ − x − − xs
2πi 1−c/ log9 (T +2)+iT ζ(s) s 2πi 1−c/ log9 (T +2)−iT ζ(s) s
Z 1−c/ log9 (T +2)+iT
ζ 0 (s)
1 ds
= x+ − xs
2πi 1−c/ log9 (T +2)−iT ζ(s) s
Z 1+1/ log x+iT 0 Z 1+1/ log x−iT 0
1 ζ (s) s ds 1 ζ (s) ds
+ − x − − xs ,
2πi 1−c/ log9 (T +2)+iT ζ(s) s 2πi 1−c/ log9 (T +2)−iT ζ(s) s
since the only pole of the integrand we encounter when moving to the left comes from
the simple pole of the zeta function at s = 1 (there being no zeros of zeta enclosed by
the relevant contour), and that gives rise to a residue x.
RIEMANN ZETA FUNCTION LECTURE NOTES 1 15
x
log9 (T + 2)
T
x log9 x
,
T
in view of Lemma 3.2 and Theorem 4.3 (and the assumption that T ≤ x). The same is
obviously true on the other short line [1 − c/ log9 (T + 2) − iT, 1 + 1/ log x − iT ], and so
both of those integrals contribute an error term O((x/T ) log9 x).
On the vertical line [1 − c/ log9 (T + 2) − iT, 1 − c/ log9 (T + 2) + iT ] we have instead
that
9
ζ 0 (s) xs x1−c/ log (T +2)
− ≤ max |ζ 0 (s)|
ζ(s) s |s| s∈[1−c/ log9 (T +2)−iT,1−c/ log9 (T +2)+iT ]
1
· max
9 9
s∈[1−c/ log (T +2)−iT,1−c/ log (T +2)+iT ] ζ(s)
9
x1−c/ log (T +2)
log7 x max |ζ 0 (s)| ,
|s| s∈[1−c/ log9 (T +2)−iT,1−c/ log9 (T +2)+iT ]
in view of Theorem 4.3 again. We need to be slightly careful when bounding max |ζ 0 (s)|,
since some of the relevant values of s are close to the pole at s = 1. However, since we
always have |s − 1| ≥ c/ log9 (T + 2), a quick check of the proof of Lemma 3.2 reveals
that
9
max 9
|ζ 0 (s)| log18 (T + 2) log18 x.
s∈[1−c/ log (T +2)−iT,1−c/ log (T +2)+iT ]
Remark 4.4. From this point on we will usually feel free to “absorb logarithmic terms”,
as was done in the last sentence of the proof of Theorem 4.1, without much discussion.
This leads to neater looking bounds, which are therefore easier to think about. Thus
the reader should make sure that he or she understands the justification for removing
the factor log26 x (and, in compensation, replacing c by c/2 in the exponent). Note this
is also an example of recycling letters to mean different things: the constant c in the
statement of Theorem 4.1 turned out to be half as big (in our writing of the argument)
as the constant c from Theorem 4.3.
Theorem 4.1 is a fundamental result about the distribution of primes, and is very
1/10
useful throughout number theory, but the error term O(xe−c log x ) that we obtained
there is a bit awkward and unsatisfactory. For example, if we wanted to investigate the
1/4
values of the von Mangoldt function Λ(n) in the fairly long interval (x, x + xe− log x ],
the obvious approach is to note that
1/4
X
Λ(n) = Ψ(x + xe− log x ) − Ψ(x)
1/4 x
x<n≤x+xe− log
1/4 1/10 1/10
= (x + xe− log x
+ O(xe−c log x
)) − (x + O(xe−c log x
))
1/4 1/10
= xe− log x
+ O(xe−c log x
).
But this is useless, since the “big Oh” error term is much larger than the supposed
1/4
main term xe− log x . Thus they might, for all we know at this point, cancel each other
completely, so we cannot even guarantee in this way that there will be a single prime
1/4
in the interval (x, x + xe− log x ] (for all large x).
Much later in the course we will develop a more sophisticated approach to studying
primes in intervals. But the obvious goal, for this and other applications, is simply to
improve the error term in the Prime Number Theorem. In this section we will develop a
general procedure for doing this, and use it to quickly prove the Prime Number Theorem
√
with classical error term O(xe−c log x ). In the next chapter of the course we will use the
same procedure, but inputting much more powerful analytic data about ζ(s), to obtain
the best known error term.
We saw in the proof of Theorem 4.1 that the quality of our error term depended on
how far we could shift the line of integration to the left, which in turn depended on
how large a zero-free region we have for the zeta function. To obtain a wider zero-free
region we shall prove an important technical theorem.
Theorem 5.1 (Landau, 1924). Let φ(t) ≥ 1 and w(t) ≥ 1 be non-decreasing functions
such that φ(t) → ∞ as t → ∞. Also let t0 ≥ 0 be any fixed constant.
RIEMANN ZETA FUNCTION LECTURE NOTES 1 17
So Theorem 5.1 implies that the zeta function is non-zero in the region
c
1− ≤ σ, t ≥ 3.
log(2t + 1)
Using Theorem 4.3 to handle those 0 ≤ t < 3, and adjusting the value of c suitably, we
in fact conclude that zeta is non-zero in the region
c
1− ≤ σ, t ≥ 0.
log(|t| + 2)
Finally, it is easy to check (e.g. in Definition 3.1) that we always have ζ(σ − it) =
ζ(σ + it), and so we have the result for negative t by symmetry.
Remark 5.3. It isn’t really necessary to use Theorem 3.3 to check the conditions in
the proof of Corollary 5.2: it would suffice to use Definition 3.1. One can also avoid
appealing to Theorem 4.3 to handle 0 ≤ t < 3, by choosing t0 smaller and noting that
ζ(s) is certainly non-zero in the neighbourhood of the pole at s = 1.
Next we shall prove Theorem 5.1, which requires two ingredients. The first of these,
which is crucial in essentially all known zero-free region arguments, is the fact that
ζ(σ 0 )3 |ζ(σ 0 + it)|4 |ζ(σ 0 + 2it)| ≥ 1 when σ 0 > 1, as we saw in the proof of Theorem 4.3.
The new ingredient is the following lemma.
18 ADAM J HARPER
Lemma 5.4. Let r, M > 0 and z0 ∈ C. Suppose that f (z) is a holomorphic function
on the disc |z − z0 | ≤ r, that f (z0 ) 6= 0, and that |f (z)/f (z0 )| ≤ M for all |z − z0 | ≤ r.
Then if f (z) 6= 0 in the right half of the disc (where <z ≥ <z0 ), we have
f 0 (z0 ) −8 log M X 1
< ≥ +< ,
f (z0 ) r z0 − ρ
ρ: f (ρ)=0,
|ρ−z0 |≤r/2,
<ρ<<z0
so in particular we have
f 0 (z0 ) −8 log M
< ≥ .
f (z0 ) r
Proof of Lemma 5.4. The proof is based on some facts from complex analysis.
Let Z denote the multi-set of all zeros of f (z) in the small disc |z −z0 | ≤ r/2, counted
with multiplicity. This must be a finite multi-set, since otherwise the zeros would have
a limit point and so f (z) would be identically zero, which is false by hypothesis. Then
we define a function g(z) on the large disc |z − z0 | ≤ r, by setting
(
1
Q
f (z) ρ∈Z z−ρ if z ∈
/Z
g(z) :=
limz0 →z g(z 0 ) if z ∈ Z.
Note that g(z) is holomorphic on the large disc |z −z0 | ≤ r, so by the maximum modulus
principle we have
g(z) g(z) f (z) Y z0 − ρ
max = max ≤ max · max ≤ M.
|z−z0 |≤r g(z0 ) |z−z0 |=r g(z0 ) |z−z0 |=r f (z0 ) |z−z0 |=r
ρ∈Z
z−ρ
Next, by construction the function g(z) is non-zero on the small disc |z − z0 | ≤ r/2,
so we can define h(z) := log(g(z)/g(z0 )) by taking the principal branch of the logarithm.
Then h(z) is holomorphic on the small disc |z − z0 | ≤ r/2, and there we have
But one can bound the modulus of a holomorphic function at a point given a bound
for its real part on a surrounding disc: since h(0) = 0, the Borel–Carathéodory theorem
implies that, for any r0 < r/2,
2r0
max 0 |h(z)| ≤ log M.
|z−z0 |≤r r/2 − r0
h(z)
Then Cauchy’s Integral Formula implies that |h0 (z0 )| = | 2πi 1
R
|z−z0 |=r/4 (z−z0 )2
dz| ≤
8(log M )/r.
Finally, note that
f 0 (z0 ) d d X 1 X 1
= log f (z)|z=z0 = log g(z)|z=z0 + = h0 (z0 ) + ,
f (z0 ) dz dz ρ∈Z
z0 − ρ ρ∈Z
z0 − ρ
RIEMANN ZETA FUNCTION LECTURE NOTES 1 19
from which the first lower bound claimed in the lemma follows. Also if <ρ < <z0 then
1 z0 − ρ
< =< > 0,
z0 − ρ |z0 − ρ|2
0
so the second lower bound < ff (z(z00)) ≥ −8 log M
r
is weaker than the first.
Now we are in a position to prove Theorem 5.1. The proof is, in overview, rather
similar to the proof of the weak zero-free region in Theorem 4.3, with Lemma 5.4
(applied with f = ζ) providing a size estimate for ζ 0 (s)/ζ(s) that takes the place of the
size estimates for ζ(s) and ζ 0 (s) in that proof. Because of the need to set the quantities
r, M in Lemma 5.4, there are however some additional fiddly details.
Proof of Theorem 5.1. Let t ≥ t0 and σ > 0. We wish to prove that if ζ(σ + it) = 0
c
then we must have σ < 1 − φ(2t+1)w(2t+1) , where c > 0 is a small constant that depends
(possibly) on the implicit constants in the conditions of the theorem. In view of Theorem
4.3, we may certainly assume that σ < 1, and we may also assume that t ≥ 10, say.
Let σ 0 > 1 be a number, to be chosen later in terms of σ and t. We will choose σ 0 − 1
sufficiently small that ζ 0 (σ 0 )/ζ(σ 0 ) is under the influence of the simple pole at 1, and
specifically so that ζ 0 (σ 0 )/ζ(σ 0 ) ≥ −(5/4)/(σ 0 − 1), say.
As calculated at the beginning of section 2, (and similarly as in the proof of Theorem
4.3), since σ 0 > 1 the Euler product expression implies that
∞ ∞ ∞
ζ 0 (σ 0 ) ζ 0 (σ 0 + it) ζ 0 (σ 0 + 2it) X Λ(n) X Λ(n) X Λ(n)
−3 − 4< −< = 3 0 + 4< 0 +it + <
ζ(σ ) 0 0
ζ(σ + it) 0
ζ(σ + 2it) n=1
n σ
n=1
n σ
n=1
nσ0 +2it
∞
X Λ(n)(3 + 4 cos(t log n) + cos(2t log n))
=
n=1
nσ 0
∞
X Λ(n)2(1 + cos(t log n))2
= ≥ 0.
n=1
nσ 0
Now let 0 < r ≤ 1 be another parameter (to be chosen later), and let M =
M (r, σ 0 , t) > 1 be such that |ζ(s)/ζ(σ 0 + it)| ≤ M for all |s − (σ 0 + it)| ≤ r, and
|ζ(s)/ζ(σ 0 +2it)| ≤ M for all |s−(σ 0 +2it)| ≤ r. Then our assumption that ζ 0 (σ 0 )/ζ(σ 0 ) ≥
−(5/4)/(σ 0 − 1), together with Lemma 5.4 applied to f = ζ, imply that
ζ 0 (σ 0 + it) 3 ζ 0 (σ 0 ) 1 ζ 0 (σ 0 + 2it) (15/16) 2 log M
−< 0
≥ 0
+ < 0
≥− 0 − .
ζ(σ + it) 4 ζ(σ ) 4 ζ(σ + 2it) σ −1 r
0 (σ 0 +it)
On the other hand, Lemma 5.4 also implies that −< ζζ(σ 8 log M 1
P
0 +it) ≤ r
−< ρ: ζ(ρ)=0, σ 0 +it−ρ
.
|ρ−(σ 0 +it)|≤r/2,
<ρ<σ 0
0
In particular, if ζ(σ + it) = 0 then either σ < σ − r/2, or else (taking only the term
20 ADAM J HARPER
ρ = σ + it in the sum, remembering that all the terms are non-negative) we must have
1 15/16 10 log M
≤ 0 + .
σ0 −σ σ −1 r
If we choose σ 0 = 1 + c min{1, r/ log M }, where c > 0 is a sufficiently small absolute
31/32
constant, then the right hand side will be ≤ c min{1,r/ log M }
, so we conclude that
Having established the classical zero-free region in Corollary 5.2, we are almost ready
√
to prove the Prime Number Theorem with classical error term O(xe−c log x ). To do this
we just need to obtain a bound for ζ 0 (s)/ζ(s) inside the classical zero-free region. (Recall
that when we proved our weak zero-free region, in Theorem 4.3, we proved a bound for
1/ζ(s) in that region at the same time.)
Lemma 5.5. There exists a small absolute constant c > 0 such that the following is
true. For any |t| ≥ 1 and any σ ≥ 1 − c/ log(|t| + 2),
ζ 0 (σ + it)
= O(log(|t| + 2)).
ζ(σ + it)
Sketch proof of Lemma 5.5. [[This proof is non-examinable, although it follows by slightly
adapting the proof of Lemma 5.4, and applying Corollary 5.2.]] We will only sketch the
argument.
RIEMANN ZETA FUNCTION LECTURE NOTES 1 21
One can check that, under the hypotheses of Lemma 5.4, one actually has
f 0 (z)
log M X 1
< ≥O +<
f (z) r z−ρ
ρ: f (ρ)=0,
|ρ−z0 |≤r/2,
<ρ<<z0
whenever |z − z0 | ≤ r/4 (say), and not just at z = z0 . Note that the sum is still over
those |ρ − z0 | ≤ r/2, regardless of the value of z. In particular, by applying Lemma 5.4
with the choices f = ζ, z0 = 9/8 + it, r = 1/2 and M = O(|t|), we obtain
ζ 0 (z) X 1
< ≥ O(log(|t| + 2)) + < .
ζ(z) z−ρ
ρ: ζ(ρ)=0,
|ρ−(9/8+it)|≤1/4
By Corollary 5.2, if z = σ + it for any σ ≥ 1 − c/(2 log(|t| + 2)) then we have <ρ < <z
for every ρ counted in the sum (in other words there are no zeros to the right of z).
1 z−ρ
This means that < z−ρ = < |z−ρ| 2 > 0 for every ρ in the sum, and so actually
ζ 0 (z)
< ≥ O(log(|t| + 2))
ζ(z)
for all such z.
Finally, the Borel–Carathéodory theorem (applied on a circle around the point 1 +
c/(2 log(|t| + 2)) + it) implies that
ζ 0 (σ + it)
= O(log(|t| + 2)) ∀|t| ≥ 1, ∀σ ≥ 1 − c/(4 log(|t| + 2)).
ζ(σ + it)
This is the claimed result, when c is replaced by c/4.
Theorem 5.6 (Prime Number Theorem with classical error term). For all x ≥ 2 we
have
√
Ψ(x) = x + O(xe−c log x ).
Proof of Theorem 5.6. The proof is exactly similar to the proof of the Prime Number
Theorem with weak error term (Theorem 4.1), but moving the line of integration further
to the left to exploit the classical zero-free region rather than the weak zero-free region.
We outline the main details.
Using the truncated Perron formula, Cauchy’s Residue Theorem, and the classical
zero-free region from Corollary 5.2, we obtain that
Z 1−c/ log(T +2)+iT 0 Z 1+1/ log x+iT 0
1 ζ (s) s ds 1 ζ (s) ds
Ψ(x) = x + − x + − xs
2πi 1−c/ log(T +2)−iT ζ(s) s 2πi 1−c/ log(T +2)+iT ζ(s) s
x log2 x
Z 1+1/ log x−iT
ζ 0 (s)
1 s ds
− − x +O ,
2πi 1−c/ log(T +2)−iT ζ(s) s T
for any large x and any 1 < T ≤ x.
22 ADAM J HARPER
√
Choosing T = exp{ log x} to balance the two terms, the theorem follows.
We end this chapter by stating a couple more facts about the zeta function, which I
would be embarrassed not to include somewhere in the course. Since we will not need to
use these results (except perhaps at the very end of the course), the proofs are omitted.
As yet we have only defined ζ(s) when <(s) > 0. However, Riemann himself showed
that one can analytically continue ζ(s) to a meromorphic function on the whole of C,
and in the process he obtained some very important symmetry information about the
zeta function.
Remark 6.2. When 0 < <(s) < 1, all the terms in the functional equation have straight-
forward definitions, and it asserts a certain symmetry of behaviour on either side of the
line <(s) = 1/2 (called the critical line). When <(s) ≤ 0, the functional equation allows
us to define ζ(s) in terms of ζ(1 − s) (which is already defined, since <(1 − s) ≥ 1) and
some other functions.
RIEMANN ZETA FUNCTION LECTURE NOTES 1 23
Γ(z) is meromorphic and non-zero on C, with simple poles at z = 0, −1, −2, .... Thus
ζ(s) must vanish when s = −2, −4, −6, ..., to cancel the poles of Γ(s/2) on the left hand
side of the functional equation. These zeros of zeta are called the trivial zeros, since
they don’t seem to encode information about the primes (unlike any zeros in the strip
0 < <(s) < 1).
Remark 6.3. Chapter 2 of Titchmarsh’s book [2] contains seven different proofs of the
functional equation (a particularly famous one, due to Riemann, using a functional
equation for ∞ −n2 πx
P
n=−∞ e )!
Remark 6.4. It is speculated that the reason the Riemann Hypothesis should hold is
because the Euler product should force all the non-trivial zeros of the zeta function to lie
close to the critical line <(s) = 1/2, and then the functional equation should somehow
force them to lie exactly on the line. In this course we will not get close to the critical
line and will not need the functional equation.
As an alternative to repeatedly using the truncated Perron formula, with the line of
integration moved to different positions, to study Ψ(x), one can use an explicit formula
that directly links Ψ(x) and the zeros of the zeta function. We state a version of this
below.
Theorem 6.5 (Explicit formula, von Mangoldt, 1895). For any 2 ≤ T ≤ x, we have
X xρ x log2 x
Ψ(x) = x − +O .
ρ T
ρ: ζ(ρ)=0,
|=(ρ)|≤T
The explicit formula is proved by starting with the truncated Perron formula, moving
the line of integration far to the left (picking up residues at the zeros of the zeta function,
which appear in the sum), and then estimating the contribution on the shifted line of
integration using the functional equation.
References
[1] A. Ivić. The Riemann zeta-function: theory and applications. Dover edition, published by Dover
Publications, Inc.. 2003
[2] E. C. Titchmarsh. The Theory of the Riemann Zeta-function. Second edition, revised by D. R.
Heath-Brown, published by Oxford University Press. 1986