Analytic Number Theory 1
Analytic Number Theory 1
Valentin Blomer
21. Oktober 2022
1
Inhaltsverzeichnis
0 Introduction 3
2
0 Introduction
Recommended literature:
• J. Brüdern, Einführung in die analytische Zahlentheorie
• How many primes are there in short interval (x, x + y] with y much less
than x?1
• in how many ways can one write a number as a sum of 2 (3, 4, . . .) primes?
• the same questions for squarefree numbers, sums of two squares, etc.
q (a prime, say), if they are q-adically close, and they are in a short interval, if they are
archimedically close
3
Example 1
We use the notation e(x) = e2πix . For n ∈ Z we have
Z 1
e(αn)dα = δn=0 .
0
Hence the number of integral solutions to n = x21 + x22 + x23 + x24 (sums of four
squares) is
X Z 1 Z 1 X 4
e(α(x21 + . . . + x24 − n))dα = e(−αn) e(αx2 ) dα.
0 0 √
x1 ,...,x4 ∈Z |x|≤ n
Example 2
For natural numbers n we have
X
µ(d) = δn=1
d|n
4
√
One can show that this is asymptotic to e−γ x/ log x, where γ = 0.577 . . .
is the Euler constant. Hence our heuristic argument gives the correct order of
magnitude, but the wrong constant. In particular, it is not allowed to ignore error
terms. Nevertheless, this method can be refined to prove interesting results (not
for counting primes, though), and leads to sieve theory.
Example 3
For c ∈ R let (c) be the vertical path c + it, −∞ < t < ∞. Then for c > 0 one
has Z (
1 −s ds 1, 0 < y < 1,
y =
2πi (c) s 0, y > 1.
(This is a non-trivial formula that we will prove later in (4.6)). The complex
contour integral is only conditionally convergent, but we ignore this for the
moment. Hence if an is any sequence of complex numbers, we have (ignoring
convergence)
Z −s Z X
X X X 1 n ds 1 an s ds
an = an δn/x≤1 = an = s
x .
n n
2πi (c) x s 2πi (c) n n s
n≤x
There are serious convergence issues, but they can be overcome in many appli-
cations. We will use this idea, for instance, to count primes up to x and prove
the prime number theorem.
5
1 Arithmetic functions and Dirichlet series
(1.1) Definition
An arithmetic function is a map f : N → C. An arithmetic function f 6= 0 is
multiplicative resp. completely multiplicative if f (nm) = f (n)f (m) for all
(m, n) ∈ N2 with (m, n) = 1 resp. for all (m, n) ∈ N2 . The set A of arithmetic
functions is a ring (an integral domain) with respect to pointwise addition and
Dirichlet convolution
X
(f ∗ g)(n) := f (d)g(n/d).
d|n
The identity element with respect to multiplication is the function η with η(1) =
1, η(n) = 0 for n > 1. (To show that there are no zero divisors let r, s minimal
with f (r) 6= 0 6= g(s) for 0 6= f, g ∈ A. Then (f ∗ g)(rs) = f (r)g(s) 6= 0.)
(1.2) Remarks
a) Since f (n) = f (1 · n) = f (1)f (n) for a multiplicative function f , we have
automatically f (1) = 1.
b) Multiplicative function are determined by their values on prime powers, com-
pletely multiplicative functions by their values on primes.
c) The units of A are precisely the function f with f (1) 6= 0: if f ∗ g = η, then
1 = η(1) = f (1)g(1). Conversely, if f (1) 6= 0, define its inverse g recursively by
g(1) = 1/f (1) and
1 X
g(n) = − f (d)g(n/d).
f (1)
d|n
d>1
(1.3) Lemma
The set of mutliplicative functions forms a subgroup of A∗ .
Proof. For (n, m) = 1 we have a bijection {d | nm} ↔ {d1 | n} × {d2 | m}. Let
f, g be multiplicative (hence invertible). Then
nm X X
X nm
(f ∗ g)(nm) = f (d)g = f (d1 d2 )g
d d1 d2
d|nm d1 |n d2 |m
X
X n n
= f (d1 )g f (d2 )g = (f ∗ g)(n) · (f ∗ g)(m)
d1 d2
d1 |n d2 |m
6
Then h is multiplicative and h = f −1 on prime powers, so f ∗ h = η on prime
powers, and hence by multiplicativity everywhere. We conclude that f −1 =
f −1 ∗ η = f −1 ∗ f ∗ h = h is multiplicative.
(1.4) Examples
• the function 1(n) = 1 for all n ∈ N is completely multiplicative;
P
• the divisor function τ (n) = d|n 1 = (1 ∗ 1)(n) is not completely mul-
tiplicative. We have τ (pe ) = e + 1.
P
• the iterated divisor function τk (n) = (1 ∗ . . . ∗ 1)(n) = a1 ···ak =n 1 with
τk (pe ) = k+e−1
e ;
• Euler φ-function φ(n). We have (φ ∗ 1)(n) = n (check on prime powers);
P
• the functions ω(n) = #{p | n : p prim} and Ω(n) = pk kn k are not
multiplicative;
(
log p, n = pk
• the function Λ(n) = (von Mangoldt function) sa-
0, otherwise
Q ep
tisfies (Λ ∗ 1)(n) = log n: for n = p we have
X X X
Λ(d) = ep log p = log pep = log n;
d|n p p
Proof. We have
X X
A(x)g(x) − an g(n) = an (g(x) − g(n))
y≤n≤x y≤n≤x
X Z x Z x X
= an g 0 (ξ)dξ = g 0 (ξ) an dξ.
y≤n≤x n y y≤n≤ξ
7
(1.6) Examples
a) The limit
X 1
γ := lim − log M = 0.577 . . .
M →∞ m
m≤M
b) We have X
τ (n) = x log x + (2γ − 1)x + O(x1/2 ),
n≤x
Proof: We have
X X X X X X √
τ (n) = 1= 1+ 1+ 1=2 1 + [ x]2 .
n≤x ab≤x ab≤x
√ ab≤x
√ ab≤x
√ ab≤x
√
a> x b> x a,b≤ x a> x
√ √
We have [ x]2 = x + O( x) and
X X X X x √
1= 1= − x + O(1)
√ √ √ b
ab≤x
√ b≤ x x<a≤x/b b≤ x
a> x
√ √ √ √ √
1 1
= x log x + γ + O √ − x[ x] + O( x) = x log x + γ − 1 + O( x).
x 2
(1.7) Lemma
For k ∈ N, ε > 0, there exists a constant C(k, ε) > 0 such that τk (n) ≤ C(k, ε)nε .
If p > C0 (k, ε) for some sufficiently large C0 (k, ε), then pαε > (α + k)k . We can
drop these factors and obtain
k+α−1
k+α−1
!C0 (k,ε)
τk (n) Y
α α
≤ ≤ max =: C(k, ε).
nε α
pαε α≥1 2αε
p kn
p≤C0 (k,ε)
8
(1.8) Definition and Lemma
Let f be an arithmetic
P∞ function. The Dirichlet series attached to f is the
−s
formal series n=1 f (n)n . If f, g are two functions whose Dirichlet series
converge absolutely for some s ∈ C, then
a)
∞ ∞ ∞
X (f ∗ g)(n) X f (n) X g(n)
= ;
n=1
ns n=1
ns n=1 ns
b)
∞ ∞
X f (n) Y X f (pk )
=
n=1
ns p
pks
k=0
Proof. a) We have
∞ ∞ ∞ ∞
X (f ∗ g)(n) X 1 X X f (a) X g(b)
= f (a)g(b) = .
n=1
ns n=1
ns a=1
as bs
ab=n b=1
b) Multiply the right hand side and use uniqueness of prime factorization.
(1.9) Examples
a) For <s > 1 we have
∞ Y −1
X 1 1
ζ(s) := = 1− s 6= 0.
n=1
ns p
p
The Riemann zeta function has meromorphic continuation to <s > 0 with only
a simple pole at s = 1: for N ∈ N we have by partial summation we have
X Z N
n−s = [N ]N −s + s [x]x−s−1 dx.
n≤N 1
The right hand side exists for <s > 0 (except at s = 1) and yields the desired
continuation.
b) By (1.4) we have
∞ ∞
X τ (n) X φ(n) ζ(s − 1)
s
= ζ(s)2 , <s > 1, s
= , <s > 2,
n=1
n n=1
n ζ(s)
∞ ∞
ζ 0 (s) X Λ(n) X µ(n) 1
− = , <s > 1, = , <s > 1.
ζ(s) n=1
ns n=1
n s ζ(s)
9
(1.10) Theorem
P∞ −s
If F (s) = n=1 an n is convergent for some s0 ∈ C, then it converges for
all s ∈ C with <s > <s0 , uniformly on compacta, and defines a holomorphic
function in this region.
For any ε > 0 there is M = M (ε) such that | M ≤n≤X an n−s0 | ≤ ε for all
P
X > M . Since <s ≥ <s0 we have
N Z N !
X a
n <(s0 −s)−1 |s − s0 | 1
≤ ε 1 + |s − s0 | t dt ≤ ε 1 + ≤ε 1+ .
ns <(s − s0 ) sin η
n=M M
(1.11) Theorem
There is a number σ0 ∈ R∪{±∞} (abscissa of convergence) such that a Dirichlet
P for all s with <s > σ0 and diverges for all s with <s < σ0 . If
series converges
σ0 > 0, i.e. if n an diverges, it is given by
PN ( N
)
log | n=1 an | X
α
σ0 = lim sup = inf α | an = O(N ) .
N →∞ log N n=1
Proof. The existence of σ0 follows from (1.10). The second equality is obvious.
P call−σthe right hand side γ. It remains to show γ = σ0 . Let σ > σ0 . Then
We
an n is convergent, hence by partial summation
N N Z NX
X X an σ an σ−1
an = σ
N − σ σ
t dt = O(N σ ),
n=1 n=1
n 1 n
n≤t
10
(1.12) Corollary
of convergence of n an n−s and σ1 the abscissa of con-
P
Let σ0 be the
Pabscissa
vergence of n |an |n−s . Then σ0 ≤ σ1 ≤ σ0 + 1.
Proof. Exercise
Proof. If not, then let m be the smallest index with am 6= bm . For σ > c we
have
∞ ∞ ∞ ∞
!
X an X bn X m σ X m σ
σ
0=m − = am − bm + a n − bn
n=1
nσ n=1 nσ n=m+1
n n=m+1
n
11
(1.15) Example
What is the “probability” that two numbers are coprime? By (1.4) we have
1 X 1 X X 1 X X X
1= µ(d) = µ(d) 1
x2 x2 x 2
n,m≤x n,m≤x d|(n,m) d≤x n≤x m≤x
(n,m)=1 d|n d|m
1 X x 2 X µ(d) 1 X1 1 X
= 2 µ(d) + O(1) = +O + 1
x d d2 x d x2
d≤x d≤x d≤x d≤x
∞
!
X µ(d) X 1 log x 1 1 log x 6
= 2
+O 2
+ + = +O → 2.
d d x x ζ(2) x π
d=1 d>x
Appendix: characters
(1.17) Definition
Let q ∈ N. A homomorphism χ : (Z/qZ)∗ → S 1 is called a Dirichlet character.
We extend χ to a completely multiplicative q-periodic function Z → S 1 ∪ {0}
by putting χ(n) = 0 for (n, q) 6= 1 and call this again a Dirichlet character. The
function
X∞ Y
L(s, χ) := χ(n)n−s = (1 − χ(p)p−s )−1
n=1 p
2 this is easy to see for cyclic groups and then follows in general
12
To show for instance the second formula for χ 6= χ0 , pick m with (m, q) = 1 and
χ(m) 6= 1, then
X X X
χ(n) = χ(nm) = χ(m) χ(n).
n (mod q) n (mod q) n (mod q)
(1.19) Definition
a) Let χ modulo q be a Dirichlet character. We say that χ has quasiperiod d
if χ(n) = χ(m) whenever n ≡ m (mod d) and (nm, q) = 1. The smallest quasi-
period is called conductor of χ.
b) If q ∗ | q, then χ mod q is induced by χ∗ mod q ∗ if χ(n) = χ∗ (n) for all
(n, q) = 1. A character χ mod q is called primitive if it is not induced by a
character χ∗ mod q ∗ with q ∗ < q.
(1.20) Lemma
a) The conductor of a character χ mod q is a divisor of q.
b) Every character χ mod q is induced by a unique character χ∗ mod q ∗ where
q ∗ is the conductor of χ. This character χ∗ is primitive.
13
any k ∈ Z such that (n + kq ∗ , q) = 1, and χ∗ (n) = 0 for (n, q ∗ ) > 1. Such a k
exists, for instance Y
k= p
p|q
p-q ∗ n
(1.21) Lemma
If a, b ∈ Z and χ is a primitive character modulo q, then
1 X
χ(ac + b) = χ(b)δq|a .
q
c (mod q)
Proof. First we observe that we can replace a by (a, q) without changing the
sum: indeed, if we write d = (a, q), a0 = a/d, q 0 = q/d with (a0 , q 0 ) = 1, then
X X X
χ(ac + b) = χ(a0 dc + b) = d χ(d(a0 c) + b)
c (mod q) c (mod q) c (mod q 0 )
X X
=d χ(dc + b) = χ(dc + b).
c (mod q 0 ) c (mod q)
Assume that χ(1 + ax) = 1 for all x ∈ Z with (1 + ax, q) = 1. Let (n, q) = 1 and
fix an integer n̄ such that n̄n ≡ 1 (mod q). Then χ(n + ay) = χ(n + nn̄ay) =
χ(n)χ(1 + an̄y) = χ(n) for all y ∈ Z with (1 + an̄y, q) = (n + ay, q) = 1, so that
by definition χ has quasiperiod a < q, contradiction. Hence there exists x with
χ(1 + ax) 6= 1, and so S = 0.
3 To check this, show that every p | q divides exactly one of n and kq ∗ , therefore it does
not divide n + kq ∗ .
14
(1.22) Definition
For a character χ modulo q we define the Gauß sum
X
τ (χ) = χ(h)e(h/q)
h (mod q)
(1.23) Theorem
Let χ be a character modulo q, a ∈ Z.
a) If (a, q) = 1, then4
X
χ(a)τ (χ̄) = χ̄(h)e(ha/q).
h (mod q)
Note in the first step that τ (χ) = χ(−1)τ (χ̄), so that |τ (χ)| = |τ (χ̄)|.
(1.24) Definition
A Dirichlet character is called even if χ(−1) = 1 and odd if χ(−1) = −1.
15
2 Fourier analysis and Poisson summation
The main result of this section is the Poisson summation formula. We give two
arithmetic applications. More will follow later.
(2.1) Definition
A Schwartz class function on R is a C ∞ -function f : R → C such that
f (n) (x) n,A (1 + |x|)−A for all n ∈ N0 and all A > 0. We denote the vector
space of such functions by S(R).
(2.2) Definition
For f ∈ L1 (R) we define the Fourier transform fb ∈ L∞ (R) by
Z
F(f )(y) = f (y) :=
b f (x)e(−xy)dx.
R
(2.3) Lemma
Let f, g ∈ L1 (R), c ∈ R.
a) The Fourier transform is a linear operator.
b) If h(x) = f (x − c), then bh(y) = e(−cy)fb(y).
h(y) = |c|−1 fb(y/c).
c) If h(x) = f (cx) with c 6= 0, then b
d) If h(x) = (f ∗ g)(x) = R f (t)g(x − t)dt, then h ∈ L1 (R) and b
R
h(y) = fb(y)b
g (y).
R R
e) We have R f (x)b g (x)dx = R fb(x)g(x)dx.
2
f) The function f (x) = e−πx is self-dual with respect to the Fourier transform.
(2.4) Theorem
a) If f ∈ S(R), then fb ∈ S(R). More precisely, if f (r) (x) (1 + |x|)−n−1−δ for
all r ≤ k and some δ > 0, then fb(n) (y) (1 + |y|)−k .
b) If f ∈ C 1 (R) ∩ L1 (R) and fb ∈ L1 (R), in particular if f, f 0 , f 00 (1 + |x|)−2
(by part a) with n = 0, k = 2, δ = 1), the Fourier inversion formula holds:
Z
f (x) = fb(y)e(xy)dy = fb(−x).
b
R
16
c) If f ∈ S(R), Parseval’s identity holds:
Z Z
2
|f (x)| dx = |fb(y)|2 dy.
R R
Proof. a) differentiation under the integral sign and integration by parts:
Z
1 1
fb(n) (y) r |(xn f (x))(r) |dx r
|y| R |y|
for all r ≤ k.
b) By Lebesgue’s dominated convergence theorem we have
Z Z Z
2 2
fb(y)e(−xy)dy = lim e−πε y fb(y)e(−xy)dy = lim fb(y)gε (y)dy
R ε→0 R ε→0 R
−πε2 y 2
where gε (y) = gε (y; x) = e e(−xy). By (2.3b, c, f) we compute
1 − π(x−t)2
gbε (t) = e ε2
ε
so that by (2.3e)
Z Z Z
1 π(x−y)2 f ∈C 1 (R) 1 π(x−y)2
fb(y)e(−xy)dy = lim f (y) e− ε2 dy = lim (f (x) + O(|y − x|)) e− ε2 dy
R ε→0 R ε ε→0 R ε
= lim (f (x) + O(ε)) = f (x).
ε→0
(2.5) Remarks
a) Theorem 2.4b, c holds with less restrictive assumptions on f . A standard
condition in b) is f ∈ L1 (R) ∩ C(R) and fb ∈ L1 (R), and f ∈ L1 (R) ∩ L2 (R) in
c).
b) The results in (2.3), (2.4) generalize in an obvious way to functions on Rn .
c) The Fourier transform translates smoothness in decay conditions (and vice
versa), and it translates small support into large support. In particular, there is
an uncertainty principle: the support of f and fb cannot be small simultaneously.
P
Proof. Let F (x) = P n∈Z f (x + n) and expand this 1-periodic function into a
Fourier series: F (x) = m∈Z a(m)e(mx) where
Z 1 Z 1X Z ∞
a(m) = F (x)e(−mx)dx = f (x+n)e(−mx)dx = f (x)e(−mx)dx = fb(m).
0 0 n∈Z −∞
Now put x = 0.
17
(2.7) Corollary
Let f ∈ S(R), a, q ∈ N, χ a primitive Dirichlet character modulo q.
a) We have
X 1 X b m am
f (m) = f e .
q q q
m∈Z m∈Z
m≡a (mod q)
b) We have
X τ (χ) X b m
f (m)χ(m) = f χ̄(m).
q q
m∈Z m∈Z
Proof. Exercise
(2.8) Corollary
2
e−πn x
P
√ Theta function Θ(x) =
The n∈Z satisfies the functional equation Θ(1/x) =
xΘ(x).
(2.9) Lemma
Let X and 0 < Z < Y be three real numbers. There exists a smooth, non-
negative function f satisfying the following properties:
a) the support of f is contained in [X − Z, X + Y + Z]
b) f = 1 on [X, X + Y ];
c) kf (j) k1 Z 1−j for all j ∈ N.
Proof. Exercise.
Proof. Let us first assume that χ is primitive (and q ≥ 2). We would like to
apply Poisson summation. To this end we need to smooth out the characteristic
function on the interval (M, M + N ]. Let w be a smooth bounded function with
support on [M −1, M +N +1] such that w = 1 on (M, M +N ] and kw(j) k1 j 1
for all j ∈ N. Then we have by (2.7b) that
Z
X X τ (χ) X nx
χ(n) = χ(n)w(n)+O(1) = χ̄(n) w(x)e − dx+O(1).
q R q
M <n≤M +N n∈Z n∈Z
18
for all j ∈ N. Hence
X 1 X q X q2
χ(n) 1 + √ + q 1/2 log q.
q n n>q n2
M <n≤M +N n≤q
Application: let q be a prime. Then any interval of length ≥ cq 1/2 log q with c
sufficiently large contains a quadratic residue and a quadratic non-residue.
(2.11) Lemma
For k ∈ Z, x ∈ R define the Bessel function
Z π
1
Jk (x) = e−ikφ+ix sin φ dφ.
2π −π
for all j ∈ N0 .
e) We have Jk (x) k min(1, |x|−1/2 ).
19
Proof. a) Follows directly from the theory of Fourier series.
b) Differentiate the generating series with respect to x:
and θ:
eiθ + e−iθ ix sin θ X
x e = kJk (x)eikθ
2
k∈Z
for any 0 < α < 1/10 such that |x|α > 100k. We can p estimate the other regions
of the integral in the same way. Choosing α 1/ |x| and assuming wlog that
x is sufficiently large (i.e. x k 2 ), we obtain the desired bound.
Remark: This improves what one gets from the elementary Lipschitz principle:
πR + O(R1/2 ).
20
Proof. Let 1 ≤ T ≤ R1/2 be another parameter and let w be a smooth bounded
function with support on [0, R + T ] such that w = 1 on [T, R] and kw(j) k1 j
T 1−j for all j ∈ N. Let r(n) be as in (1.18d). We have
X
r(n) = 4 χ−4 (d) τ (n) nε .
d|n
Hence
X X X
r(n) = r(n)w(n) + O(T Rε ) = w(x2 + y 2 ) + O(T Rε )
n≤R n≤R x,y∈Z
X Z
= w(x2 + y 2 )e(−nx − my)dx dy + O(T Rε ).
n,m∈Z R2
X Z ∞ √
=π r(`) w(r)J0 (2π r`)dr.
`≥1 0
1/3
Choosing T = R gives the result.
with ∞ √
Z
w̌(n) = π w(x)J0 (2π xn)dx.
0
21