100% found this document useful (1 vote)
1K views

Modular Functions and Dirichlet Series in Number Theory (Apostol) PDF

Uploaded by

Aidan Holwerda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
1K views

Modular Functions and Dirichlet Series in Number Theory (Apostol) PDF

Uploaded by

Aidan Holwerda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 218

Graduate Texts in Mathematics 41

EditOl·ial Board
s. Axler F.W. Gehring К.А. Ribet

Springer Science+Business Medi~ LLC


Graduate Texts in Mathematics
TAKEUTI/ZARING. Introduction to 35 ALExANDERIWERMER. Several Complex
Axiomatic Set Theory. 2nd ed. Variables and Banach Algebras. 3rd ed.
2 OXTOBY. Measure and Category. 2nd ed. 36 KELLEy/NAMIOKA et al. Linear
3 SCHAEFER. Topological Vector Spaces. Topological Spaces.
2nd ed. 37 MONK. Mathematical Logic.
4 HILTON/STAMMBACH. A Course in 38 GRAUERT/FRITZSCHE. Several Complex
Homological Algebra. 2nd ed. Variables.
5 MAC LANE. Categories for the Working 39 ARVESON. An Invitation to C*-Algebras.
Mathematician. 2nd ed. 40 KEMENy/SNELL/KNAPP. Denumerable
6 HUGHES/PIPER. Projective Planes. Markov Chains. 2nd ed.
7 SERRE. A Course in Arithmetic. 41 ApOSTOL. Modular Functions and Dirichlet
8 TAKEUTI/ZARING. Axiomatic Set Theory. Series in Number Theory.
9 HUMPHREYS. Introduction to Lie Algebras 2nd ed.
and Representation Theory. 42 SERRE. Linear Representations of Finite
10 COHEN. A Course in Simple Homotopy Groups.
Theory. 43 GILLMAN/JERISON. Rings of Continuous
II CONWAY. Functions of One Complex Functions.
Variable I. 2nd ed. 44 KENDIG. Elementary Algebraic Geometry.
12 BEALS. Advanced Mathematical Analysis. 45 LOEVE. Probability Theory J. 4th ed.
13 ANDERSON/FuLLER. Rings and Categories 46 LOEVE. Probability Theory II. 4th ed.
of Modules. 2nd ed. 47 MOISE. Geometric Topology in
14 GOLUBITSKy/GUILLEMIN. Stable Mappings Dimensions 2 and 3.
and Their Singularities. 48 SACHS/WU. General Relativity for
15 BERBERIAN. Lectures in Functional Mathematicians.
Analysis and Operator Theory. 49 GRUENBERG/WEIR. Linear Geometry.
16 WINTER. The Structure of Fields. 2nd ed.
17 ROSENBLATT. Random Processes. 2nd ed. 50 EDWARDS. Fermat's Last Theorem.
18 HALMOS. Measure Theory. 51 KLINGENBERG. A Course in Differential
19 HALMOS. A Hilbert Space Problem Book. Geometry.
2nd ed. 52 HARTSHORNE. Algebraic Geometry.
20 HUSEMOLLER. Fibre Bundles. 3rd ed. 53 MANIN. A Course in Mathematical Logic.
21 HUMPHREYS. Linear Algebraic Groups. 54 GRA VERlW ATKINS. Combinatorics with
22 BARNES/MACK. An Algebraic Introduction Emphasis on the Theory of Graphs.
to Mathematical Logic. 55 BROWN/PEARCY. Introduction to Operator
23 GREUB. Linear Algebra. 4th ed. Theory I: Elements of Functional
24 HOLMES. Geometric Functional Analysis Analysis.
and Its Applications. 56 MASSEY. Algebraic Topology: An
25 HEWITT/STROMBERG. Real and Abstract Introduction.
Analysis. 57 CROWELL/Fox. Introduction to Knot
26 MANES. Algebraic Theories. Theory.
27 KELLEY. General Topology. 58 KOBLITZ. p-adic Numbers, p-adic Analysis,
28 ZARISKI/SAMUEL. Commutative Algebra. and Zeta-Functions. 2nd ed.
VoU. 59 LANG. Cyclotomic Fields.
29 ZARISKI/SAMUEL. Commutative Algebra. 60 ARNOLD. Mathematical Methods in
Vol. II. Classical Mechanics. 2nd ed.
30 JACOBSON. Lectures in Abstract Algebra I. 61 WHITEHEAD. Elements of Homotopy
Basic Concepts. Theory.
31 JACOBSON. Lectures in Abstract Algebra II. 62 KARGAPOLOV/MERLZJAKOV. Fundamentals
Linear Algebra. of the Theory of Groups.
32 JACOBSON. Lectures in Abstract Algebra 63 BOLLOBAS. Graph Theory.
III. Theory of Fields and Galois Theory. 64 EDWARDS. Fourier Series. Vol. l. 2nd ed.
33 HIRSCH. Differential Topology. 65 WELLS. Differential Analysis on Complex
34 SPITZER. Principles of Random Walk. Manifolds. 2nd ed.
2nd ed.
(continued after index)
Tom M. Apostol

Modular Functions
and Dirichlet Series
in Number Theory
Second Edition

With 25 Illustrations

Springer
Тот М. Aposto!
Department of Mathematics
Ca!ifornia Institute of Techno!ogy
Pasadena, СА 91125
USA
Еl!i/щiа! В{){т!

S. Axler F. W. Gсhгiпg К. А. Ribet


Department of Dерагtmепt af Dерагtmепt af
Mathematics Mathcmatics Mathematics
San Francisco State Uпivеl'sitу of Мiсhigап University of California
University Апп АгЬог. М! 48109 at Berke1ey
San Francisco, СА 94132 U.S.A. Berkeley, СА 94720
U.S.A. U.S.A.

Mathematics Subject Classification (2000): 11-01, IIFXX

Library of Congress Cataloging-in-Publication Data


Apostol, Тот М.
Modular functions and Dirichlet series in number theory/Tom М. Apostol.-2nd ed.
р. cm.-{Graduate texts in mathematics; 41)
Includes bibliographical references.
ISBN 978-1-4612-6978-6 ISBN 978-1-4612-0999-7 (eBook)
DOI 10.1007/978-1-4612-0999-7
1. Number theory. 2. Functions, Elliptic. 3. Functions, Modular. 4. Series,
Dirichlet. 1. Title. 11. Series.
QA241.A62 1990
512'.7-<lc20 89-21760

Printed оп acid-free рарег.

© 1990 Springer Science+Business Media New York


OriginalIy published Ьу Sрriпgег-VегlаgNеw York 'пс. in 1990
Softcover reprint ofthe hardcover 2nd edition 1990
AII rights reserved. This work тау по! Ье translated ог copied in whole ог in part without the written
permission ofthe publisher (Springer Science+Business Media, LLC), ехсер! for brief excerpts in
connection with reviews or scholarly analysis. Use in connection with anу form of information stora-
ge and retrieval, electronic adaptation, computer software, or Ьу similar or dissimilar methodology
now known or hereafter developed is forbidden.
The use of general descriptive names, trade names, trademarks, etc., in this pubIication,
еуеп if the former аге по! especially identified, is по! to Ье taken as а sign that such
names, as understood Ьу the Trade Marks and Merchandise Marks Act, тау accordingly
Ье used freely Ьу апуопе.

Typeset Ьу Asco Trade Typesetting Ltd., Hong Kong.

9 8 7 6 5 4 3

ISBN 978-1-4612-6978-6 SPIN 10841555


Preface

This is the second volume of a 2-volume textbook* which evolved from a


course (Mathematics 160) offered at the California Institute of Technology
during the last 25 years.
The second volume presupposes a background in number theory com-
parable to that provided in the first volume, together with a knowledge of
the basic concepts of complex analysis.
Most of the present volume is devoted to elliptic functions and modular
functions with some of their number-theoretic applications. Among the
major topics treated are Rademacher's convergent series for the partition
function, Lehner's congruences for the Fourier coefficients of the modular
functionj(r), and Hecke's theory of entire forms with multiplicative Fourier
coefficients. The last chapter gives an account of Bohr's theory of equivalence
of general Dirichlet series.
Both volumes of this work emphasize classical aspects of a subject which
in recent years has undergone a great deal of modern development. It is
hoped that these volumes will help the nonspecialist become acquainted
with an important and fascinating part of mathematics and, at the same
time, will provide some of the background that belongs to the repertory of
every specialist in the field.
This volume, like the first, is dedicated to the students who have taken
this course and have gone on to make notable contributions to number
theory and other parts of mathematics.
T.M.A.
January, 1976
* The first volume is in the Springer-Verlag series Undergraduate Texts in Mathematics under
the title Introduction to Analytic Number Theory.

v
Preface to the Second Edition

The major change is an alternate treatment of the transformation formula for


the Dedekind eta function, which appears in a five-page supplement to Chap-
ter 3, inserted at the end of the book Gust before the Bibliography). Other-
wise, the second edition is almost identical to the first. Misprints have been
repaired, there are minor changes in the Exercises, and the Bibliography has
been updated.

T.M. A.
July, 1989
Contents

Chapter I
Elliptic functions
1.1 Introduction 1
1.2 Doubly periodic functions 1
1.3 Fundamental pairs of periods 2
1.4 Elliptic functions 4
1.5 Construction of elliptic functions 6
1.6 The Weierstrass SJ function 9
1.7 The Laurent expansion of SJ near the origin 11
1.8 Differential equation satisfied by SJ 11
1.9 The Eisenstein series and the invariants g2 and g3 12
1.10 The numbers e 1 , e 2 , e 3 13
1.11 The discriminant ~ 14
1.12 Klein's modular function J(r) 15
1.13 Invariance of J under unimodular transformations 16
1.14 The Fourier expansions of g2(r) and gir) 18
1.15 The Fourier expansions of ~(r) and J(r) 20
Exercises for Chapter 1 23

Chapter 2
The Modular group and modular functions
2.1 Mobius transformations 26
2.2 The modular group r 28
2.3 Fundamental regions 30
2.4 Modular functions 34

VB
2.5 Special values of J 39
2.6 Modular functions as rational functions of J 40
2.7 Mapping properties of J 40
2.8 Application to the inversion problem for Eisenstein series 42
2.9 Application to Picard's theorem 43
Exercises for Chapter 2 44

Chapter 3
The Dedekind eta function
3.1 Introduction 47
3.2 Siegel's proof of Theorem 3.1 48
3.3 Infinite product representation for Ll(r) 50
3.4 The general functional equation for 11(r) 51
3.5 Iseki's transformation formula 53
3.6 Deduction of Dedekind's functional equation from Iseki's
formula 58
3.7 Properties of Dedekind sums 61
3.8 The reciprocity law for Dedekind sums 62
3.9 Congruence properties of Dedekind sums 64
3.10 The Eisenstein series G 2 (r) 69
Exercises for Chapter 3 70

Chapter 4
Congruences for the coefficients of the modular function j
4.1 Introduction 74
4.2 The subgroup r o(q) 75
4.3 Fundamental region of r o(p) 76
4.4 Functions automorphic under the subgroup r o(P) 78
4.5 Construction of functions belonging to r o(P) 80
4.6 The behavior of fp under the generators of r 83
4.7 The function !per) = Ll(qr)/Ll(r) 84
4.8 The univalent function <I>(r) 86
4.9 Invariance of <I>(r) under transformations of r o(q) 87
4.10 The function j p expressed as a polynomial in <I> 88
Exercises for Chapter 4 91

Chapter 5
Rademacher's series for the partition function
5.1 Introduction 94
5.2 The plan of the proof 95
5.3 Dedekind's functional equation expressed in terms of F 96
5.4 Farey fractions 97

viii
5.5 Ford circles 99
5.6 Rademacher's path of integration 102
5.7 Rademacher's convergent series for pen) 104
Exercises for Chapter 5 110

Chapter 6
Modular forms with multiplicative coefficients
6.1 Introduction II3
6.2 Modular forms of weight k 114
6.3 The weight formula for zeros of an entire modular form 115
6.4 Representation of entire forms in terms of G4 and G6 II7
6.5 The linear space Mk and the subspace Mk,o 118
6.6 Classification of entire forms in terms of their zeros 119
6.7 The Hecke operators Tn 120
6.8 Transformations of order n 122
6.9 Behavior of Tnfunder the modular group 125
6.10 Multiplicative property of Hecke operators 126
6.11 Eigenfunctions of Hecke operators 129
6.12 Properties of simultaneous eigenforms 130
6.13 Examples of normalized simultaneous eigenforms 131
6.14 Remarks on existence of simultaneous ~igenforms in M 2k • O 133
6.15 Estimates for the Fourier coefficients of entire forms 134
6.16 Modular forms and Dirichlet series 136
Exercises for Chapt~r 6 138

Chapter 7
Kronecker's theorem with applications
7.1 Approximating real numbers by rational numbers 142
7.2 Dirichlet's approximation theorem 143
7.3 Liouville's approximation theorem 146
7.4 Kronecker's approximation theorem: the one-dimensional
case 148
7.5 Extension of Kronecker's theorem to simultaneous
approximation 149
7.6 Applications to the Riemann zeta function 155
7.7 Applications to periodic functions 157
Exercises for Chapter 7 159

Chapter 8
General Dirichlet series and Bohr's equivalence theorem
8.1 Introduction 161
8.2 The half-plane of convergence of general Dirichlet series 161
8.3 Bases for the sequence of exponents of a Dirichlet series 166

ix
8.4 Bohr matrices 167
8.5 The Bohr function associated with a Dirichlet series 168
8.6 The set of values taken by a Dirichlet seriesf(s) on a line
a = ao 170
8.7 Equivalence of general Dirichlet series 173
8.8 Equivalence of ordinary Dirichlet series 174
8.9 Equality of the sets Uiao) and Uiao) for equivalent
Dirichlet series 176
8.10 The set of values taken by a Dirichlet series in a neighborhood
of the line a = ao 176
8.11 Bohr's equivalence theorem 178
8.12 Proof of Theorem 8.15 179
8.13 Examples of equivalent Dirichlet series. Applications of Bohr's
theorem to L-series 184
8.14 Applications of Bohr's theorem to the Riemann zeta function 184
Exercises for Chapter 8 187

Supplement to Chapter 3 190

Bibliography 196

Index of special symbols 199

Index 201

x
1
Elliptic functions

1.1 Introduction
Additive number theory is concerned with expressing an integer n as a sum
of integers from some given set S. For example, S might consist of primes,
squares, cubes, or other special numbers. We ask whether or not a given
number can be expressed as a sum of elements of S and, if so, in how many
ways this can be done.
Letf(n) denote the number of ways n can be written as a sum of elements
of S. We ask for various properties of f(n), such as its asymptotic behavior
for large n. In a later chapter we will determine the asymptotic value of the
partition function p(n) which counts the number of ways n can be written as a
sum of positive integers S n.
The partition function p(n) and other functions of additive number theory
are intimately related to a class of functions in complex analysis called
elliptic modular functions. They playa role in additive number theory analo-
gous to that played by Dirichlet series in multiplicative number theory. The
first three chapters of this volume provide an introduction to the theory of
elliptic modular functions. Applications to the partition function are given
in Chapter 5.
We begin with a study of doubly periodic functions.

1.2 Doubly periodic functions


A function f of a complex variable is called periodic with period W if
f(z + w) = f(z)
whenever z and z + ware in the domain off If w is a period, so is nw for
every integer n. If WI and W2 are periods, so is mW I + nW 2 for every choice of
integers m and n.
I: Elliptic functions

Definition. A function f is called doubly periodic if it has two periods WI


and W2 whose ratio W2/W I is not real.

We require that the ratio be nonreal to avoid degenerate cases. For


example, if WI and W2 are periods whose ratio is real and rational it is easy
to show that each of WI and W2 is an integer multiple of the same period. In
fact, if W2/WI = alb, where a and b are relatively prime integers, then there
exist integers m and n such that mb + na = 1. Let W = mW I + nW2' Then
W is a period and we have

so WI = bw and W2 = aw. Thus both WI and W2 are integer multiples of w.


If the ratio W 2 /W I is real and irrational it can be shown thatfhas arbitrarily
small periods (see Theorem 7.12). A function with arbitrarily small periods
is constant on every open connected set on which it is analytic. In fact, at
each point of analyticity offwe have

f'(Z) = lim f(z + zn) - f(z),


%"-+0 Zn

where {zn} is any sequence of nonzero complex numbers tending to O. Iff


has arbitrarily small periods we can choose {zn} to be a sequence of periods
tending to O. Then f(z + zn) = f(z) and hence f'(z) = O. In other words,
f'(z) = 0 at each point of analyticity off, hencefmust be constant on every
open connected set in whichfis analytic.

1.3 Fundamental pairs of periods


Definition. Let f have periods WI' W2 whose ratio W2/WI is not real. The
pair (WI> W2) is called afundamental pair if every period of f is of the form
mW I + nW2, where m and n are integers.

Every fundamental pair of periods WI' W2 determines a network of


parallelograms which form a tiling of the plane. These are called period
parallelograms. An example is shown in Figure l.1a. The vertices are the
periods W = mW I + nW2' It is customary to consider two intersecting edges
and their point of intersection as the only boundary points belonging to the
period parallelogram, as shown in Figure 1.1 b.

Notation. If WI and W2 are two complex numbers whose ratio is not real
we denote by il(WI' W2), or simply by il, the set of all linear combinations
mWI + nW2, where m and n are arbitrary integers. This is called the lattice
generated by WI and W2'

2
1.3: Fundamental pairs of periods

(a) (b)

Figure 1.1

Theorem 1.1. If (w l , w 2 ) is a fundamental pair of periods, then the triangle


with vertices 0, w l , W2 contains no further periods in its interior or on its
boundary. Conversely, any pair of periods with this property isfundamental.
PROOF. Consider the parallelogram with vertices 0, W l , W l + W 2 , and W2'
shown in Figure 1.2a. The points inside or on the boundary of this parallel-
ogram have the form
z = txW l + 13w2'
where 0 S tx S 1 and 0 s 13 s 1. Among these points the only periods are 0,
Wl' W2, and W l + W2, so the triangle with vertices 0, Wb W2 contains no
periods other than the vertices.

o o
(a) (b)
Figure 1.2

3
I: Elliptic functions

Conversely, suppose the triangle 0, WI' W2 contains no periods other


than the vertices, and let W be any period. We are to show that W = mW l +
nW2 for some integers m and n. Since W2/W1 is nonreal the numbers WI and
W 2 are linearly independent over the real numbers, hence

where t 1 and t2 are real. Now let [t] denote the greatest integer:::;; t and
write

Then
W - [t 1]W 1 - [t 2]W 2 = r1w1 + r2w2'
If one ofr1 or r2 is nonzero, then r 1w 1 + r2w2 will be a period lying inside
the parallelogram with vertices 0, WI' W2, WI + W2 . But if a period w lies
inside this parallelogram then either w or WI + W2 - w will lie inside the

complete.
°
triangle 0, WI' W2 or on the diagonal joining WI and W2 , contradicting the
hypothesis. (See Figure 1.2b.) Therefore r1 = r2 = and the proof is
0

Definition. Two pairs of complex numbers (WI, ( 2) and (WI', wz'), each with
nonreal ratio, are called equivalent if they generate the same lattice of
periods; that is, if 0(W1' (2) = 0(w 1', W2').

The next theorem, whose proof is left as an exercise for the reader,
describes a fundamental relation between equivalent pairs of periods.

Theorem 1.2. Two pairs (WI' (2) and (WI', wz') are equivalent if, and only if,
there is a 2 x 2 matrix (; : ) with integer entries and determinant

ad - bc = ± 1, such that

or, in other words,

W/ = aW2 + bw l ,
WI' = CW2 + dWl'

1.4 Elliptic functions


Definition. A functionjis called elliptic ifit has the following two properties:
(a) j is doubly periodic.
(b) j is meromorphic (its only singularities in the finite plane are poles).

4
1.4: Elliptic functions

Constant functions are trivial examples of elliptic functions. Later we


shall give examples of nonconstant elliptic functions, but first we derive some
fundamental properties common to all elliptic functions.

Theorem 1.3. A nonconstant elliptic function has afundamental pair of periods.


PROOF. Iff is elliptic the set of points wherefis analytic is an open connected
set. Also, f has two periods with nonreal ratio. Among all the nonzero
periods of f there is at least one whose distance from the origin is minimal
(otherwisefwould have arbitrarily small nonzero periods and hence would
be constant). Let W be one of the nonzero periods nearest the origin. Among
all the periods with modulus IW I choose the one with smallest nonnegative
argument and call it WI' (Again, such a period must exist otherwise there
would be arbitrarily small nonzero periods.) If there are other periods
with modulus IWII besides WI and -WI' choose the one with smallest
argument greater than that of WI and call this Wz. If not, find the next
larger circle containing periods -# nWI and choose that one of smallest
nonnegative argument. Such a period exists since f has two noncollinear
periods. Calling this one Wz we have, by construction, no periods in the
triangle 0, WI> W z other than the vertices, hence the pair (WI' W2) is funda-
mentaL D

If f and g are elliptic functions with periods Wi and W 2 then their sum,
difference, product and quotient are also elliptic with the same periods. So,
too, is the derivative!,.
Because of periodicity, it suffices to study the behavior of an elliptic
function in any period parallelogram.

Theorem 1.4. If an elliptic function f has no poles in some period parallelogram,


then f is constant.
PROOF. Iffhas no poles in a period parallelogram, thenfis continuous and
hence bounded on the closure of the parallelogram. By periodicity, f is
bounded in the whole plane. Hence, by Liouville's theorem,fis constant. D

Theorem 1.5. If an ellipticfunctionfhas no zeros in some period parallelogram,


then f is constant.
PROOF. Apply Theorem 1.4 to the reciprocaI1/! D

Note. Sometimes it is inconvenient to have zeros or poles on the bound-


ary of a period parallelogram. Since a meromorphic function has only a
finite number of zeros or poles in any bounded portion of the plane, a period
parallelogram can always be translated to a congruent parallelogram with
no zeros or poles on its boundary. Such a translated parallelogram, with no
zeros or poles on its boundary, will be called a cell. Its vertices need not be
periods.
5
1: Elliptic functions

Theorem 1.6. The contour integral of an elliptic fimction taken along the
boundary of any cell is zero.
PROOF. The integrals along parallel edges cancel because of periodicity. 0

Theorem 1.7. The sum of the residues of an elliptic function at its poles in any
period parallelogram is zero.
PROOF. Apply Cauchy's residue theorem to a cell and use Theorem 1.6. 0

Note. Theorem 1.7 shows that an elliptic function which is not constant
has at least two simple poles or at least one double pole in each period
parallelogram.

Theorem 1.8. The number of zeros of an elliptic function in any period parallel-
ogram is equal to the number of poles, each counted with multiplicity.
PROOF. The integral

_1 f
f'(z) dz
2ni c /(z) ,

taken around the boundary C of a cell, counts the difference between the
number of zeros and the number of poles inside the cell. But f'1f is elliptic
with the same periods asf, and Theorem 1.6 tells us that this integral is zero.
o
Note. The number of zeros (or poles) of an elliptic function in any period
parallelogram is called the order of the function. Every nonconstant elliptic
function has order ;::: 2.

1.5 Construction of elliptic functions


We turn now to the problem of constructing a nonconstant elliptic function.
We prescribe the periods and try to find the simplest elliptic function having
these periods. Since the order of such a function is at least 2 we need a
second order pole or two simple poles in each period parallelogram. The
two possibilities lead to two theories of elliptic functions, one developed by
Weierstrass, the other by Jacobi. We shall follow Weierstrass, whose point
of departure is the construction of an elliptic function with a pole of order
2 at z = 0 and hence at every period. Near each period w the principal part
of the Laurent expansion must have the form

A B
.,-----:-;;-2
(z - w)
+- -.
z - OJ

6
1.5: Construction of elliptic functions

For simplicity we take A = 1, B = O. Since we want such an expansion near


each period w it is natural to consider a sum of terms of this type,

summed over all the periods w = mW l + nw 2 . For fixed z ¥= w this is a


double series, summed over m and n. The next two lemmas deal with con-
vergence properties of double series of this type. In these lemmas we denote
by Q the set of all linear combinations mW l + nW2, where m and n are
arbitrary integers.

Lemma 1. If IX is real the infinite series

L~
W
well
",*0

converges absolutely if, and only if, IX > 2.


PROOF. Refer to Figure 1.3 and let rand R denote, respectively, the minimum
and maximum distances from 0 to the parallelogram shown. If w is any of
the 8 nonzero periods shown in this diagram we have

r$;lwl$;R (for 8 periods w).

Figure 1.3

In the next concentric layer of periods surrounding these 8 we have 2·8 = 16


new periods satisfying the inequalities

2r $; Iwl $; 2R (for 16 new periods w).

In the next layer we have 3 . 8 = 24 new periods satisfying


3r $; I wi$; 3R (for 24 new periods w),
7
1: Elliptic functions

and so on. Therefore, we have the inequalities

~" ~ I~ I" ~ ~ for the first 8 periods w,

(2~)" ~ I~I" ~ (2~)" for the next 16 periods w,


and so on. Thus the sum S(n) = L Iwl-", taken over the 8(1 + 2 + ... + n)
nonzero periods nearest the origin, satisfies the inequalities
8 2·8 n·8 8 2·8 n·8
R" + (2R)" + ... + (nR)" ~ S(n) ~ ;;. + (2r)" + ... + (nr)'"
or
8 n 1 8 n 1
R" L k,,-l
k=l
~S(n)~a L k,,-l'
r k=l

This shows that the partial sums S(n) are bounded above by 8,(cx - l)/r" if
cx > 2. But any partial sum lies between two such partial sums, so all of the
L
partial sums of the series Iwl-" are bounded above and hence the series
converges if cx > 2. The lower bound for S(n) also shows that the series
diverges if cx ~ 2. D

Lemma 2. If cx > 2 and R > 0 the series

L 1"
/w/>R (z - w)

converges absolutely and uniformly in the disk Izl ~ R.


PROOF. We will show that there is a constant M (depending on Rand cx)
such that, if ex ~ 1, we have

(1)

for all w with Iwi> R and all z with IzI ~ R. Then we invoke Lemma 1 to
prove Lemma 2. Inequality (1) is equivalent to

(2) I-
z --
w
wi" > 1
-
-M'
To exhibit M we consider all w in n with Iwl > R. Choose one whose
modulus is minimal, say Iwl = R + d, where d > O. Then if Izl ~ Rand
Iw I ~ R + d we have

Iz : wi = 11 - ~I ~ 1 -I~I ~ 1 - R : d'

8
1.6: The Weierstrass f.J function

and hence

I ~I' > (1 __
W -
R ). =~M'
R+d
where
R )-.
M= ( 1 - - -
R +d .
This proves (2) and also the lemma. o
As mentioned earlier, we could try to construct the simplest elliptic
function by using a series of the form
1
L
well (z - w)
2'

This has the appropriate principal part near each period. However, the
series does not converge absolutely so we use, instead, a series with the
exponent 2 replaced by 3. This will give us an elliptic function of order 3.
Theorem 1.9. Let f be defined by the series
1
f(z) = L(
well Z - W
)3'

Thenfis an elliptic function with periods Wt, W2 and with a pole of order 3 at
each period W in n.
PROOF. By Lemma 2 the series obtained by summing over Iwi> R converges
uniformly in the disk Iz I :::; R. Therefore it represents an analytic function
in this disk. The remaining terms, which are finite in number, are also
analytic in this disk except for a 3rd order pole at each period w in the disk.
This proves thatfis meromorphic with a pole of order 3 at each w in n.
Next we show thatfhas periods Wt and W2' For this we take advantage
of the absolute convergence of the series. We have

But W - W t runs through all periods in n with w, so the series for f(z + wd
is merely a rearrangement of the series for f(z). By absolute convergence we
have f(z + Wt) = f(z). Similarly, f(z + W2) = f(z) so f is doubly periodic.
This completes the proof. 0

1.6 The Weierstrass f.J function


Now we use the function of Theorem 1.9 to construct an elliptic function
or order 2. We simply integrate the series forf(z) term by term. This gives us
a principal part -(z - w)-2/2 near each period, so we multiply by -2 to

9
1: Elliptic functions

get the principal part (z - W)-2. There is also a constant of integration to


reckon with. It is convenient to integrate from the origin, so we remove the
term z - 3 corresponding to w = 0, then integrate, and add the term z - 2.
This leads us to the function

1
2"
z
+ i Z

0 ",*0
L (t --2w )3 dt.
Integrating term by term we arrive at the following function, called the
Weierstrass f.J function.

Definition. The Weierstrass f.J function is defined by the series

Theorem 1.10. The function f.J so defined has periods w! and W2 . It is analytic
except for a double pole at each period w in Q. Moreover f.J(z) is an even
function of z.
PROOF. Each term in the series has modulus

1 1 1 1w 2 - (z - W)21 1z(2w - z) 1
1(z - W)2 - w 2 = w 2(z _ W)2 = w 2(z _ W)2 .

Now consider any compact disk Iz I ~ R. There are only a finite numoer of
periods w in this disk. If we exclude the terms of the series containing these
periods we have, by inequality (1) obtained in the proof of Lemma 2,

I(z _1 w)21 ~ 1~2'


where M is a constant depending only on R. This gives us the estimate

z(2w - z) 1 MR(2Iwl + R) MR(2 + R/lwl) 3MR


< < <--
1
w2(z _ W)2 - IW14 - IW13 - IW13
since R < Iwl for w outside the disk Izi ~ R. This shows that the truncated
series converges absolutely and uniformly in the disk Iz I ~ R and hence
is analytic in this disk. The remaining terms give a second-order pole at
each w inside this disk. Therefore f.J(z) is meromorphic with a pole of order 2
at each period.
Next we prove that f.J is an even function. We note that
(-z - W)2 = (z + W)2 = (z - (-w)f

Since -w runs through all nonzero periods with w this shows that f.J( -z) =
f.J (z), so f.J is even.

10
1.8: Differential equation satisfied by f.J

Finally we establish periodicity. The derivative of p is given by

p'(z) =- 2 L(
wen Z -
1 )3'
W

We have already shown that this function has periods WI and W2' Thus
p'(z + w) = p'(z) for each period w. Therefore the function p(z + w) - p(z)
is constant. But when z = -w/2 this constant is p(w/2) - p( -w/2) = 0
since p is even. Hence p(z + w) = p(z) for each w, so p has the required
periods. 0

1.7 The Laurent expansion of f.J near the


ongm
Theorem 1.11. Let r = min {Iwl:w"# O}. Thenfor 0 < Izl < r we have
1
L1 (2n + I)G2n+2z2n,
00
(3) p(z) = 2: +
Z n=
where

(4) for n ~ 3.

PROOF. If 0 < Izl < r then Iz/wl < 1 and we have


1
(z _ ",)2 (
1
)2 = w1 2
(
1 + n=L1(n + 1)
00 (z )n)
- ,
UJ
W
2 1 --z W
W

hence

Summing over all w we find (by absolute convergence)


1 1 1
= 2 + L (n + 1) L = 2 + L (n + I)G n +2zn,
00 00

p(z) ----;;:;:z zn
z n=1 w*O W Z n=1

where Gn is given by (4). Since p(z) is an even function the coefficients G2n + 1
must vanish and we obtain (3). 0

1.8 Differential equation satisfied by f.J


Theorem 1.12. The function p satisfies the nonlinear differential equation
[p'(Z)]2 = 4p 3(Z) - 60G 4 P(z) - 140G 6 •

PROOF. We obtain this by forming a linear combination of powers of p and


p' which eliminates the pole at z = O. This gives an elliptic function which has

11
1: Elliptic functions

no poles and must therefore be constant. Near z = 0 we have

p'(z) = - 3"2 + 6G 4 z + 20G 6 z 3 +"',


z
an elliptic function of order 3. Its square has order 6 since
, 2 4 24G 4
[p (z)] = Z6 - ~ - 80G 6 + .. "
where + ... indicates a power series in z which vanishes at z = O. Now
3 4 36G
4p (z) = 6 + - 2 -4 + 60G 6 + ...
z z
hence
60G 4
[p'(z)]
2
- 4p (z)
3
=- - --
Z2
140G 6 + ...
so
[p'(Z)]2 - 4 p 3(Z) + 60G 4 p(z) = -140G 6 + .. '.
Since the left member has no pole at z = 0 it has no poles anywhere in a
period parallelogram so it must be constant. Therefore this constant must
be -140G 6 and this proves the theorem. D

1.9 The Eisenstein series and the invariants


g2 and g3
Definition. If n 2: 3 the series
1
G="-
n L.. n
w*o W

is called the Eisenstein series of order n. The invariants g2 and g3 are the
numbers defined by the relations

The differential equation for p now takes the form


[p'(Z)]2 = 4 p 3(Z) - g2P(Z) - g3'
Since only g2 and g3 enter in the differential equation they should determine
p completely. This is actually so because all the coefficients (2n + 1)G 2n + 2
in the Laurent expansion of p(z) can be expressed in terms of g2 and g3'

Theorem 1.13. Each Eisenstein series Gn is expressible as a polynomial in g2


and g3 with positive rational coefficients. In jact, if b(n) = (2n + l)G 2n + 2
we have the recursion relations
b(l) = g2/20,
12
and
n- 2
(2n + 3)(n - 2)b(n) = 32: b(k)b(n - 1 - k) for n :::::: 3,
k= I

or equivalently,
m-2
(2m + l)(m - 3)(2m - 1)G 2m = 3 2: (2r - 1)(2m - 2r - 1)G2rG2m-2r
r= 2

for m :::::: 4.

PROOF. Differentiation of the differential equation for p gives another


differential equation of second order satisfied by p,
(5)

Now we write p(z) = Z-2 + 2::'=1


b(n)z2n and equate like powers of z
in (5) to obtain the required recursion relations. 0

1.10 The numbers e l , e 2 , e 3

Definition. We denote by e l , e2, e3 the values of p at the half-periods,

The next theorem shows that these numbers are the roots of the cubic
polynomial4 p 3 - g2 P - g3'

Theorem 1.14. We have


4p 3(Z) - g2 p(z) - g3 = 4(p(z) - ed(p(z) - e2)(p(z) - e3)'
Moreover, the roots e l , e2, e3 are distinct, hence g2 3 - 27g/ i= O.
PROOF. Since p is even, the derivative p' is odd. But it is easy to show that
the half-periods of an odd elliptic function are either zeros or poles. In fact,
by periodicity we have p'( --tw) = p'(w - -tw) = p'(-tw), and since g;/ is odd
we also have p'( --tw) = - p'(-tw). Hence p'(-tw) = 0 if p'(!w) is finite.
Since p'(z) has no poles at -tw l , -tw z , !(WI + w z ), these points must be
zeros of p'. But p' is of order 3, so these must be simple zeros of p'. Thus
p' can have no further zeros in the period-parallelogram with vertices
0, WI' W z , WI + w 2 . The differential equation shows that each of these points
is also a zero of the cubic, so we have the factorization indicated.
Next we show that the numbers el' e2' e3 are distinct. The elliptic function
p(z) - el vanishes at z = -tw l . This is a double zero since p '(-twd = O.
Similarly, p(z) - e2 has a double zero at -tW2' If e l were equal to e2' the
elliptic function p(z) - e l would have a double zero at -tWI and also a double

13
1: Elliptic functions

zero at 10)2' so its order would be ~ 4. But its order is 2, so el # e2' Similarly,
el # e3 and e2 # e3'
If a polynomial has distinct roots, its discriminant does not vanish. (See
Exercise 1.7.) The discriminant of the cubic polynomial
4x 3 - g2 X - g3
is g/ - 27g/. When x = SO(z) the roots of this polynomial are distinct so
the number 9 2 3 - 279 3 2 # O. This completes the proof. D

1.11 The discriminant ~

The number ~ = 9 2 3 - 279 / is called the discriminant. We regard the


invariants g2 and g3 and the discriminant ~ as functions of the periods WI
and W 2 and we write

The Eisenstein series show that g2 and g3 are homogeneous functions of


degrees -4 and -6, respectively. That is, we have

g2(AW 1, AW 2) = A-4 giw 1, w 2) and g3(AWl, AW 2) = A-6 g3 (W 1, w 2)


for any A # O. Hence ~ is homogeneous of degree - 12,

~(AWl> AW 2) = A-12~(Wl' w 2).


Taking A = l/wl and writing T = W2/Wl we obtain
g2(1, T) = w l 4 giw 1, w 2), g3(1, T) = w I 6 g 3(W 1, w 2),
~(1, T) = WI12~(Wl' w 2).

Therefore a change of scale converts 9 2, 9 3 and ~ into functions of one


complex variable T. We shalliabei WI and W2 in such a way that their ratio
T = W 2/W 1 has positive imaginary part and study these functions in the upper
half-plane Im(T) > O. We denote the upper half-plane Im(T) > 0 by H.
If T E H we write g2(T), g3(T) and ~(T) for g2(1, T) g3(1, T) and ~(1, T),
respectively. Thus, we have
+00 1
g2(T) = 60 m,n~-oo (m + mf'
(m. n) * (0,0)

+ co 1
g3(T) = 140 m,n~- ex; (m + m)6
(m,n)*(O,O)

and
~(T) = g/(T) - 27g/(r).
Theorem 1.14 shows that ~(r) # 0 for all T in H.

14
1.12: Klein's modular function J(T)

1.12 Klein's modular function J(r)


Klein's function is a combination of gz -and g3 defined in such a way that,
as a function of the periods W1 and W z , it is homogeneous of degree O.

Definition. If w zlw 1 is not real we define

J( ) _ gz\w 1, w z )
W 1,WZ - A( ).
I..\W 1 ,W Z

Since gz 3 and ,1 are homogeneous of the same degree we have J(),w 1 , AWz)
= J(Wb w z). In particular, if r E H we have

J(1, T) = J(w 1 , wz).


Thus J(w 1 , w z ) is a function of the ratio T alone. We write J(T) for J(l, T).
Theorem 1.15. Thefunctions gz(T), g3(T), ,1(T), and J(T) are analytic in H.

PROOF. Since ,1(T) -# 0 in H it suffices to prove that gz and g3 are analytic


in H. Both (10 and g3 are given by double series of the form
+00 1
m,n~-CX)
(m,n)*(O,O)
(m + my
'ith IY. > 2. Let r = x + iy, where y > O. We shall prove that if IY. > 2 this
series converges absolutely for any fixed T in H and uniformly in every strip
S of the form
s= {x+iy:lxl'::;A,y2b>0}.
(See Figure 1.4.) To do this we prove that there is a constant M > 0, depending
only on A and on b, such that
M
(6) ,------.,,- < ,------,.,-
1m + m la - 1m + nW
for all T in S and all (m, n) -# (0,0). Then we invoke Lemma 1.
To prove (6) it suffices to prove that
1m + mlz > Kim + nW
for some K > 0 which depends only on A and b, or that
(7) (m + nx)Z + (ny)Z > K(m Z + nZ).
If n = 0 this inequality holds with any K such that 0 < K < 1. If n -# 0
let q = min. Proving (7) is equivalent to showing that
(q + x)z + yZ
(8) > K
1 + qZ

15
1: Elliptic functions

--------+-----+-----~----~x
-A A

Figure 1.4

for some K > O. We will prove that (8) holds for all q, with
«5 2
K=---~
1 + (A + «5)2
if Ix I ~ A and y :2: «5. (This proof was suggested by Christopher Henley.)
If Iq I ~ A + «5 inequality (8) holds trivially since (q + X)2 :2: 0 and
y2 :2: «5 2.lflql > A + «5 then Ix/ql < Ixl/(A + «5) ~ A/(A + «5) < 1 so

11 + ~1:2:
q
1 -I~I >
q
1_ _ A
A+«5
=_«5
A+«5
hence
q«5
Iq + xl :2: A + «5
and

(9)

Now q2/(1 + q2) is an increasing function of q2 so


q2 (A + «5)2
-->--~--..:..---,--;::
1 + q2 - 1 + (A + «5)2
when q2 > (A + «5)2. Using this in (9) we obtain (8) with the specified K. 0

1.13 Invariance of J under unimodular


transformations
If WI' W2 are given periods with nonreal ratio, introduce new periods
WI', w 2 ' by the relations

16
1.13: Invariance of J under unimodular transformations

where a, b, c, d are integers such that ad - be = 1. Then the pair (w 1 ', w z')
is equivalent to (W1' w z ); that is, it generates the same set of periods n.
Therefore gz(w 1', w z') = gz(w 1, wz) and g3(W 1', w z') = g3(W 1, w z ) since g2
and g3 depend only on the set of periods n. Consequently, L1(w 1', W2') =
L1(w 1, w 2) and J(w 1', W2') = J(W1' W2)'
The ratio of the new periods is
, w2' aW 2 + bW 1 ar +b
r
+ d'
=-=-~--
w 1' cW z + dW 1 cr
where r = WZ /w 1 . An easy calculation shows that
Im(r') = Im(ar + b) = ad - be Im(r) = Im(r) .
cr+d Icr+dl 2 Icr+dl z
Hence r' E H if and only if r E H. The equation
ar
r' = - - -
+b
cr +d
is called a unimodular transformation if a, b, c, d are integers with ad - be = 1.
The set of all unimodular transformations forms a group (under composition)
called the modular group. This group will be discussed further in the next
chapter. The foregoing remarks show that the function J(r) is invariant
under the transformations of the modular group. That is, we have:

Theorem 1.16. If r E H and a, b, c, d are integers with ad - be = 1, then


(ar + b)/(cr + d) E Hand

(10) J( - b)
ar-+d = J(r).
cr +
Note. A particular unimodular transformation is r' = r + 1, hence (10)
shows that J(r + 1) = J(r). In other words, J(r) is a periodic function of r
with period 1. The next theorem shows that J(r) has a Fourier expansion.

Theorem 1.17. If r E H, J(r) can be represented by an absolutely convergent


Fourier series

L
00

(11) J(r) = a(n)e 2 "int.


n== - 00

PROOF. Introduce the change of variable

Then the upper half-plane H maps into the punctured unit disk
D = {x: 0 < Ix I < I}.
17
1: Elliptic functions

(See Figure l.S.) Each t in B maps onto a unique point ~ in D, but each
x in D is the ima.f.pfinfinitely many points in B.·U t andr map onto x
then ellrit . . el*i't"sot.and r dift'er by an integer.


H

figure 1.5

IfxeD, let
f(x) == J(t)
where tis any of the points in B whieh map onto x. Since J is periodic with
period I, J has the same value at a11 these points so f(x) is well-defined.
Now f is anaJytic in D because
, d. d dt , IdX J'(t}
f (x) = d- J(t) = d- J(t) -d = J (t) -d = 21t1e. Zxit'
X txt

so f'(x) exists at each point in D. Since f is analytic in D it bas a Laurent


expansion about 0,
ac
f(x) = L a(n)xn,
n= -00

absolutely convergent for each x in D. Replacing x by e2 1<if we see that J(t)


has the absolutely convergent Fourier expansion in (11). 0

Later we will show that a -n = 0 for n ~ 2, that a_I = 12 - 3, and that


the Fourier expansion of 123J(r) has integer coefficients. To do this we first
determine the Fourier expansions of 92(t), 93(t) and A(r).

1.14 The Fourier expansions of gir)


and g3(r)
Each Eisenstein series L(m.n)"(O,o) (m + nr)-k is a periodic function of t of
period 1. In particular, 92(r) and 93(r) are periodic with period L In this
section we determine their Fourier coefficients explicitly.
We recall that
1 1
9z(t) = 60 L
(m,n)"(O,O) (m + nt)
4' 93(t) L
= 140(m,n)"(O, 0) (m + nr)
6'

18
1.14: The Fourier expansions of g2(r) and g3(r)

These are double series in m and n. First we obtain Fourier expansions for
the simpler series
+00 1 +00 1
L
m=-oo(m + nr)4
and L
m= -00 (m + nr)

Lemma 3. If r E Hand n > 0 we have the Fourier expansions


+ 00 1 8n 4 00
L 4 = - L r3e21tirnt
",=-oo(m+ nr) 3 r=1
and
+00 1 8 6 00
L 6 = - ~ LrSe21tirnt.
"'=-00 (m + nr) 15 r=1

PROOF. Start with the partial fraction decomposition of the cotangent:

L
n cot nr = - + +00 1
r "'=-00 r+m
(1
- - - -1).
m
",*0

Let x = e21tit. If r E H then Ixl < 1 and we find


cos nr e21!it + 1 x +1
n cot nr = n -.-- = ni 21tit 1
Sill nr e -
= ni -x -- 1 =

In other words, if r E H we have

1- +
r
L --
+00
"'=-00 r+m
(1 -- 1)
m
= -ni 1 (+ 2L
00 e21!irt) .
r=1
",*0

Differentiating repeatedly we find


00 1 00
(12) L 2 = -(2ni)2 L re21tirt
- r 2 -",=_00(r+m) r=1
",*0

+ 00 1 00
-3' L - (2ni)4 L r3e21!irt
·"'=-00 (r + m)4 r= 1

and
+00 1 00
-5' L -(2ni)6 L rSe21tirt.
·m=-oo (r + m)6 r= 1

Replacing r by nr we obtain Lemma 3. o


19
1: Elliptic functions

Theorem 1.18. If, E H we have the Fourier expansions

g2(') = 4n 4 {I
3
+ 240
k= 1
f (J3(k)e21tikr}
and

where (Ja(k) = Ldlk da.


PROOF. We write
+00 1
g2(') = 60 m,n~-oo (m + nr)4
(m,n)*(O,O)

= 60{ I
m=-oo
~
m
f I (
+ n=lm=-oo (m+n,)
1 + (m-nr)
1 4 4)}
m*O(n=O)
00 +00 1 }
= 60 2((4) + 2n~' m=~oo +
{
(m nr)4

2n4 16n4
= 60 { - +- L L r3 xnr
00 00 }

90 3 n=1 r=1
where x = e 21tir • In the last double sum we collect together those terms for
which nr is constant and we obtain the expansion for g2(')' The formula
for g3(') is similarly proved. 0

1.15 The Fourier expansions of ~(T) and J( T)


Theorem 1.19. If, E H we have the Fourier expansion

L ,(n)e21tinr
00

L1(,) = (2n)12
n= 1
where the coefficients ,(n) are integers, with ,(1) = 1 and ,(2) = -24.

Note. The arithmetical function ,(n) is called Ramanujan's tau function.


Some of its arithmetical properties are described in Chapter 4.
PROOF. Let
00 00

A = L (J3(n)xn, B = L (Js(n)xn.
n=1 n= 1
Then

20
1.15: The Fourier expansions of A(r) and J(r)

Now A and B have integer coefficients, and


(1 + 240A)3 - (1 - 504B)2 = 1 + 720A + 3(240)2 A 2 + (240)3 A 3 - 1
+ 1008B - (504)2 B2
= 122(5A + 7B)
+ 12 3(100A 2 - 147B2 + 8000A 3).
But
00

5A + 7B = L {5O-3(n) + 7o- s(n)}xn


n~l

and

so
5d 3 + 7d s == 0 (mod 12).
Hence 12 3 is a factor of each coefficient in the power series expansion of
(1 + 240A)3 - (1 - 504B)2 so

12
i\(r) = 64n {123
27 n=1
f
r(n)e27rinr} = (2n)12
n=1
r(n)e 27rinr f
where the r(n) are integers. The coefficient of x is 122(5 + 7), so r(l) = 1.
Similarly, we find r(2) = - 24. 0

Theorem 1.20. If r E H we have the Fourier expansion

L c(n)e27rint,
00

12 3J(r) = e- 27rit + 744 +


n=1
where the c(n) are integers.

PROOF. We agree to write I for any power series in x with integer coefficients.
Then if x = e 27rir we have
g/(r) = ~~nI2(1 + 240x + 1)3 = ~~nI2(1 + 720x + f),
i\(r) = ~~nI2{123x(1 - 24x + I)}
and hence

) = 92 3(r) = 1 + 720x + f = _1_ (1 720x 1)(1 24x I)


J(r i\(r) 123 x (1 - 24x + f) 123 x + + + +
so
1
L c(n)xn,
00

12 3J(r) = - + 744 +
x n=1
where the c(n) are integers. o
21
1: Elliptic functions

Note. The coefficients c(n) have been calculated for n ;S; 100. Berwick
calculated the first 7 in 1916, Zuckerman the first 24 in 1939, and Van
Wijngaarden the first 100 in 1953. The first few are repeated here.
c(O) = 744
c(l) = 196, 884
c(2) = 21, 49~, 760
c(3) = 864,299,970
c(4) = 20,245,856,256
c(5) = 333,202,640,600
c(6) = 4,252,023,300,096
c(7) = 44, 656, 994, 071, 935
c(8) = 401,490,886,656,000
The integers c(n) have a number of interesting arithmetical properties. In
1942 D. H. Lehmer [20] proved that
(n + 1)c(n) == 0 (mod 24) for all n ;;:: 1.
In 1949 Joseph Lehner [23] discovered divisibility properties of a different
kind. For example, he proved that
c(5n)== 0 (mod 25),
c(7n)== 0 (mod 7),
c(11n) == 0 (mod 11).
He also discovered congruences for higher powers of 5, 7, 11 and, in a later
paper [24] found similar results for the primes 2 and 3. In Chapter 4 we will
describe how some of Lehner's congruences are obtained.
An asymptotic formula for c(n) was discovered by Petersson [31] in 1932.
It states that
e4 ",,"
c(n) '" M as n -+ 00.
v 2 n3 / 4
This formula was rediscovered independently by Rademacher [37] in
1938.
The coefficients r(n) in the Fourier expansion of ~(r) have also been
extensively tabulated by D. H. Lehmer [19] and others. The first ten entries
in Lehmer's table are repeated here:
r(l) = 1 r(6) = -6048
r(2) = -24 r(7) = -16744
r(3) = 252 r(8) = 84480
.(4) = -1472 r(9) = - 113643
• (5) = 4830 .(10) = -115920.

Lehmer has conjectured that r(n) =1= 0 for all n and has verified this for all
n < 214928639999 by studying various congruences satisfied by r(n). For
papers on r(n) see Section F35 of [27].

22
Exercises for Chapter I

Exercises for Chapter 1


1. Given two pairs of complex numbers (WI' W2) and (WI', W2') with nonreal ratios
W 2 /W I and W2'/WI" Prove that they generate the same set of periods if, and only if,

there is a 2 x 2 matrix (: :) with integer entries and determinant ± 1 such that

2. Let S(O) denote the sum of the zeros of an eJliptic function f in a period paraJlelo-
gram, and let S( 00) denote the sum of the poles in the same paraJlelogram. Prove
that S(O) - S(oo) is a period of f [Hint: Integrate z/,,(z)/f(z).]

3. (a) Prove that p(u) = p(v) if, and only if, u - v or u + v is a period of p.
(b) Let al,"" an and b l , ... , bm be complex numbers such that none of the numbers
pea;) - p(b) is zero. Let

f(z) = })I [f.J(z) - p(ad] 1.61 [p(z) - p(b,)].

Prove that f is an even eJliptic function with zeros at a I, ... , an and poles at
bl , .. ·, bm •
4. Prove that every even elliptic function f is a rational function of f.J, where the
periods of p are a subset of the periods of f

5. Prove that every elliptic function f can be expressed in the form

where R I and R 2 are rational functions and p has the same set of periods as f

6. Let f and 9 be two elliptic functions with the same set of periods. Prove that there
exists a polynomial P(x, y), not identically zero, such that
P[f(z), g(z)] = C
where C is a constant (depending on f and g but not on z).

7. The discriminant of the polynomial f(x) = 4(x - XI)(X - X2)(X - X3) is the
product 16{(x2 - Xd(X3 - X2)(X3 - Xd}2. Prove that the discriminant of f(x) =
4X 3 - ax - b is a 3 - 27b 2 •
8. The differential equation for p shows that gJ'(z) = 0 if z = w l /2, w 2 /2 or
(WI + w 2 )/2. Show that

p"(~I) = 2(e1 - e2)(el - e3)

and obtain corresponding formulas for gJ"(w2/2) and P"«WI + w2V2).


23
I: Elliptic functions

9. According to Exercise 4, the function p(2z) is a rational function of "J(z). Prove


that, in fact,

(2z) = {&i(z) + !92}2 + 293 &J(z) = _ 2&J(z) + !(&J~(Z))2.


&J 4&J3(Z) - 92&J(Z) - 93 4 &J (z)
10. Let WI and W2 be complex numbers with nonreal ratio. Letf(z) be an entire function
and assume there are constants a and b such that

fez + WI) = af(z), fez + W 2) = bf(z),


for all z. Prove that f(z) = Ae Bz , where A and B are constants.
11. If k 2': 2 and TE H prove that the Eisenstein series

(m, 0) ~ (0,0)

has the Fourier expansion


2(2rri)2k
I (J2k_l(n)e 2rrw ,.
00 .

G2k (T) = 2(2k) +


(2k-I)!0=1

12. Refer to Exercise 11. If T E H prove that


G 2k ( - liT) = T2k G2k (T)
and deduce that

G2k (i) = 0 if k is odd,


G2k(e2rri/J) = 0 if k ~ 0 (mod 3).

13. Ramanujan's tau function T(n) is defined by the Fourier expansion


00

.1(T) = (2rr)12 I T(n)e 2rrio "


n=l

derived in Theorem 1.19. Prove that

where fog denotes the Cauchy product of two sequences,


o
(f g)(n) =
0 I f(k)g(n - k),
k=O

and (J,(n) = Idlo d' for n 2': 1, with (J3(O) = 2io, (J5(O) = - 554'
[Hint: Theorem 1.18.J
14. A series of the form I:=
I f(ll)x nl(l - XO) is called a Lambert series. Assuming
absolute convergence, prove that
x"
I I
OC' 00

f(n) --0 = F(n)x n ,


0=1 I-x 0=1

where

F(n) = I fed).
dlo

24
Exercises for Chapter 1

Apply this result to obtain the following formulas, valid for Ix I < 1.
oc 11(11 jxn cp(I1)X n X
L
oc
(aj --n=x. (b) n~1 1 _ xn (1 _ X)2'
n=11 - x
.l.(I1)xn
L a.(I1)x n. L - - n = L xn2.
00 00

(d)
n=1 n=1 1 - x n=1
(e) Use the result in (c) to express g2(T) and g3(T) in terms of Lambert series in
x = e 2 n:it.

Note. In (a), /1(11) is the Mobius function; in (b), cp(l1) is Euler's totient; and in (d),
.l.(I1) is Liouville's function.

15. Let

and let

F(xj = L 1 + xn
n::;;;1
(nodd)

(a) Prove that F(x) = G(x) - 34G(x 2 ) + 64G(x 4 ).


(b) Prove that

(c) Use Theorem 12.17 in [4] to prove the more general result
~ 114k + 1 24k + 1 - 1
11
L
= I
1 + en .. = 8k + 4 B4k + 2 •
(n odd)

25
2
The modular group
and modular functions

2.1 Mobius transformations


In the foregoing chapter we encountered unimodular transformations
,
r =--
ar +b
er +d
where a, b, e, d are integers with ad - be = 1. This chapter studies such
transformations in greater detail and also studies functions which, like
J(r), are invariant under unimodular transformations. We begin with some
remarks concerning the more general transformations

(1) fez) = az +b
ez +d
where a, b, e, d are arbitrary complex numbers.
Equation (1) definesf(z) for all z in the extended complex number system
C* = C U {oo} except for z = -die and z = 00. We extend the definition
off to all of C* by defining
a
and f(oo) = -,
e
with the usual convention that zlO = 00 if z #- o.
First we note that

(2) few) _ fez) = (ad - be)(w - z),


(cw + d)(ez + d)
which shows thatfis constant if ad - be = O. To avoid this degenerate case
we assume that ad - be #- O. The resulting rational function is called a

26
2.1: Mobius transformations

Mobius transformation. It is analytic everywhere on C* except for a simple


pole at z = -die.
Equation (2) shows that every Mobius transformation is one-to-one on
C*. Solving (1) for z in terms off(z) we find
df(z) - b
z=
-ef(z) + a'
so f maps C* onto C*. This also shows that the inverse function f -1 is a
Mobius transformation.
Dividing by w - z in (2) and letting w -+ z we obtain
, ad-be
f (z) = (ez + d)2'
hence f'(z) -# 0 at each point of analyticity. Therefore f is conformal every-
where except possibly at the pole z = -die.
Mobius transformations map circles onto circles (with straight lines
being considered as special cases of circles). To prove this we consider the
equation
(3) Azz + Bz + Ez + C = 0,
where A and C are real. The points on any circle satisfy such an equation
with A -# 0, and the points on any line satisfy such an equation with A = O.
Replacing z in (3) by (aw + b)/(ew + d) we find that w satisfies an equation
of the same type,
A'ww + B'w + B'w + C' = 0

where A' and C' are also real. Hence every Mobius transformation maps a
circle or straight line onto a circle or straight line.
A Mobius transformation remains unchanged if we multiply all the
coefficients a, b, e, d by the same nonzero constant. Therefore there is no loss
in generality in assuming that ad - be = 1.
For each Mobius transformation (1) with ad - be = 1 we associate the
2 x 2 matrix

Then det A = ad - be = 1. If A and B are the matrices associated with


Mobius transformations f and g, respectively, then it is easy to verify
that the matrix product AB is associated with the composition fog, where

(f" g)(z) = f(g(z)). The identity matrix I = G~) is associated with the
identity transformation
1z +0
f(z) = z = Oz + l'
27
2: The modular group and modular functions

and the matrix inverse

A- 1 = (
-c
d -b)a
is associated with the inverse ofJ,
dz - b
f-l(Z) =- --
-cz + a
Thus we see that the set of all Mobius transformations with ad - be = 1
forms a group under composition. This chapter is concerned with an impor-
tant subgroup in which the coefficients a, b, c, d are integers.

2.2 The modular group r


The set of all Mobius transformations of the form
I aT + b
r =---
C"C + d'
where a, b, c, d are integers with ad - be = 1, is called the modular group and
is denoted by r. The group can be represented by 2 x 2 integer matrices

A = (: : ) with det A = 1,

provided we identify each matrix with its negative, since A and - A represent
the same transformation. Ordinarily we will make no distinction between

the matrix and the transformation. If A = (: :) we write

AT = aT + b.
cr +d
The first theorem shows that r is generated by two transformations,
1
Tr = T +1 and Sr = - - .
T

Theorem 2.1. The modular group r is generated by the two matrices

T = (~ !) and S = (~ -1)O·
That is, every A in r can be expressed in theform
A = Tn'STnzs ... STnk

where the ni are integers. This representation is not unique.

28
2.2: The modular group r

PROOF. Consider first a particular example, say

A = C~ 2~}
We will express A as a product of powers of Sand T. Since S2 = I, only the
first power of S will occur.
Consider the matrix product

AT" = C~ 2:)G ~) = C1 11~: 2:).


Note that the first column remains unchanged. By a suitable choice of n
we can make 111n + 251 < 11. For example, taking n = - 2 we find
11 n + 25 = 3 and

AT
-2
=
(4 1)
11 3'

Thus by multiplying A by a suitable power of T we get a matrix (: ~) with


Id I < Ic I. Next, multiply by S on the right:

AT
-2
S-
_ (4 1)(0 -1) _(1 -4)
11 3 1 0 - 3 -11 .
This interchanges the two columns and changes the sign ofthe second column.
Again, multiplication by a suitable power of T gives us a matrix with
Idl < lei. In this case we can use either T4 or T3. Choosing T4 we find

AT
-2
ST =
4 (1 -4)(1 1) G~).
3 -11 0 =

Multiplication by S gives

AT- 2 ST 4S = (~ -1)
-3 .
Now we multiply by T3 to get

AT- 2 ST 4ST 3 = (0 -1)(1 3) (0 -1)o =


1 -3 0 1
=
1 S.

Solving for A we find


A = ST- 3ST- 4ST 2 •
At each stage there may be more than one power of T that makes Id I < Ic I
so the process is not unique.
To prove the theorem in general it suffices to consider those matrices

A = (: ~) in r with c ~ O. We use ind~ction on c.


29
2: The modular group and modular functions

If e = 0 then ad = 1 so a = d = ± 1 and
±b)1 = T±h .
Thus, A is a power of T
If e = 1 then ad - b = 1 so b = ad - 1 and

A = G 1) GnG -~)(~
ad; = ~) = TaST d.

Now assume the theorem has been proved for all matrices A with lower
left-hand element <e for some e ~ 1. Since ad - be = 1 we have (e, d) = 1.
Dividing d by e we get
d = eq + r, where 0 < r < e.
Then

and

-1)o = (-a q +b
r
-a).
-e
By the induction hypothesis, the last matrix is a product of powers of S
and T, so A is too. This completes the proof. 0

2.3 Fundamental regions


Let G denote any subgroup of the modular group r. Two points rand r'
in the upper half-plane H are said to be equivalent under G if r' = Ar for
some A in G. This is an equivalence relation since G is a group.
This equivalence relation divides the upper half-plane H into a disjoint
collection of equivalence classes called orbits. The orbit Gr is the set of all
complex numbers of the form Ar where A E G.
We select one point from each orbit; the union of all these points is
called a fundamental set of G. To deal with sets having nice topological
properties we modify the concept slightly and define a fundamental region
of G as follows.

Definition. Let G be a subgroup of the modular group r. An open subset


RG of H is called a fundamental region of G if it has the following two
properties:
(a) No two distinct points of RG are equivalent under G.
(b) If r E H there is a point r' in the closure of RG such that r' is equivalent
to r under G.

30
2.3: Fundamental regions

For example, the next theorem will show that a fundamental region Rr
of the full modular group r consists of all t in H satisfying the inequalities
It I > 1, It + fl < 1.
This region is the shaded portion of Figure 2.1.

t = U + iv, v> 0

----------~----.----+-----.----+----------u
-1 -t o
Figure 2.1 Fundamental region of the modular group

The proof will use the following lemma concerning fundamental pairs of
periods.

Lemma 1. Given WI', w/ with W2'/W I' not real, let

Q = {mwI' + nw/: m, n integers}.


Then there exists a fundamental pair (WI, w 2 ) equivalent to (WI', w/) such
that

(::) = (: ~)(:::) with ad - bc = 1,

and such that

PROOF. We arrange the elements of Q in a sequence according to increasing


distances from the origin, say
Q = {O, WI' W 2 , ... }
where
0< IWII;s; Iw 2 1;s; .. · and argw n < argw n+ 1 if Iwnl = IWn+ll·
31
2: The modular group and modular functions

Let Wi = Wi and let W2 be the first member of this sequence that is not a
multiple of Wi' Then the triangle with vertices 0, Wi' W z contains no element
of n except the vertices, so (WI> w z ) is a fundamental pair which spans the
set n. Therefore there exist integers a, b, e, d with ad - be = ± 1 such that

(::) = G~)(::)
If ad - be = -1 we can replace c by -c, d by -d, and Wi by -WI and the
same equation holds, except now ad - be = 1. Because of the way we have
chosen WI> W z we have
and
since WI ± W 2 are periods in n occurring later than W2 in the sequence. 0

Theorem 2.2. If r' E H, there exists a complex number r in H equivalent to r'


under r sueh that
Irl ~ 1, Ir + 11 ~ Irl and Ir - 11 ~ Irl.

PROOF. Let WI' = 1, w 2 ' = r' and apply Lemma 1 to the set of periods
n = {m + nr': m, n integers}. Then there exists a fundamental pair WI' W z

with IW21 ~ Iwll, IWi ± wzi ~ IW21· Let r = WZ/w I· Then r = (: ~)r'
with ad - be = 1 and
Irl ~ 1, Ir ± 11 ~ Irl. o
N ate. Those r in H satisfying Ir ± 11 ~ Ir I are also those satisfying
Ir+ilsl.

Theorem 2.3. The open set


Rr = {rEH:lrl > 1,lr + il < I}
is a fundamental region for r. Moreover, if A E r and if Ar = r for some
r in R r , then A = I. In other words, only the identity element has fixed
points in R r .

PROOF. Theorem 2.2 shows that if r' E H there is a point r in the closure of
Rr equivalent to r' under r. To prove that no two distinct points of Rr are
equivalent under r, let r' = Ar where A = (: ~} We show first that
Im(r') < Im(r) if r E Rr and e ¥- O. We have
, Im(r)
Im(r) = Icr + dlz'

32
2.3: Fundamental regions

If TE Rr and c =f. 0 we have


leT + dl 2 + d)(ci + d) = C2 Ti + Cd(T + i) + d2 > c2 - Icdl + d2 •
= (CT

If d = 0 we find ICT + dl 2 > c 2 ~ 1. If d =f. 0 we have


c 2 - Icdl + d 2 = (lei - Idl)2 + Icdl ~ Icdl ~ 1
so again leT + d 12 > 1. Therefore c =f. 0 implies IC'r + d 12 > 1 and hence
Im(T') < Im(T). In other words, every element A of r with c =f. 0 decreases
the ordinate of each point T in R r .
Now suppose both T and T' are equivalent interior points of Rr . Then
,
T=--
aT +b and T=
dT' - b
.
CT + d -CT' + a
If c =f. 0 we have both Im(T') < Im(T) and Im(T) < Im(T'). Therefore c = 0
so ad = 1, a = d = ± 1, and

±1b) = T±b
.
But then b = 0 since both T and T' are in Rr so T = T'. This proves that no
two distinct points of Rr are equivalent under r.
Finally, if AT = T for some T in R r , the same argument shows that c = 0,
a = d = ± 1, so A = I. This proves that only the identity element has fixed
points in R r . D

Figure 2.2 shows the fundamental region Rr and some of its images under
transformations of the modular group. Each element of r maps circles into
circles (where, as usual, straight lines are considered as special cases of
circles). Since the boundary curves of Rr are circles orthogonal to the real

I T

Figure 2.2 Images of the fundamental region Rr under elements of r

33
2: The modular group and modular functions

axis, the same is true of every image f(Rr) under the elements f of r. The
set of all images f(R r ), where / E r, is a collection of nonoverlapping open
regions which, together with their boundary points, cover all of H.

2.4 Modular functions

Definition. A function/is said to be modular if it satisfies the following three


conditions:
(a) / is meromorphic in the upper half-plane H.
(b) f(Ar) = f(r) for every A in the modular group r.
(c) The Fourier expansion of/has the form

L
00

/(r) = a(n)e 2 1[int.


n= -m

Property (a) states that/is analytic in H except possibly for poles. Property
(b) states that / is invariant under all transformations of r. Property (c) is
a condition on the behavior offat the point r = ioo. If x = e2 1[it the Fourier
series in (c) is a Laurent expansion in powers of x. The behavior of/at ioo is
described by the nature of this Laurent expansion near O. If m > 0 and
a( -m) # 0 we say thatfhas a pole of order m at ioo. If m ~ 0 we say fis
analytic at ioo. Condition (c) states thatfhas at worst a pole of order mat
ioo.
The function J is a modular function. It is analytic in H with a first order
pole at ioo. Later we show that every modular function can be expressed as
a rational function of J. The proof of this depends on the following property
of modular functions.

Theorem 2.4. Iff is modular and not identically zero, then in the closure 0/ the
fundamental region R r , the number 0/ zeros off is equal to the number of
poles.

Note. This theorem is valid only with suitable conventions at the boundary
points of R r . First of ail, we consider the boundary of Rr as the union of
four edges intersecting at four vertices p, i, P + 1, and ioo, where p = e2 1[i/3
(see Figure 2.3). The edges occur in equivalent pairs (1), (4) and (2), (3).
Iffhas a zero or pole at a point on an edge, then it also has a zero or pole
at the equivalent point on the equivalent edge. Only the point on the leftmost
edge (1) or (2) is to be counted as belonging to the closure of R r .
The order of the zero or pole at the vertex p is to be divided by 3; the order
at i is to be divided by 2; the order at i 00 is the order of the zero or pole at
x = 0, measured in the variable x = e 2nit •

34
2.4: Modular functions

-- .... --
ooi
".-
....
/ "" ",

(1) (4)

(2) (3)

p p +1
Figure 2.3

PROOF. Assume first that / has no zeros or poles on the finite part of the
boundary of Rr . Cut Rr by a horizontal line, Im(r) = M, where M > 0 is
taken so large that all the zeros or poles of/ are inside the truncated region
which we call R. [If/had an infinite number of poles in Rr they would have
an accumulation point at ioo, contradicting condition (c). Similarly, since/
is not identically zero, / cannot have an infinite number of zeros in R r .]
Let oR denote the boundary of the truncated region R. (See Figure 2.4.)
Let Nand P denote the number of zeros and poles of/inside R. Then

N - P = _1
2ni
r f'(r) dr = _1
JOR /(r) 2ni
{f + f + f + f + f }
(1) (2) (3) (4) (5)

where the path is split into five parts as indicated in Figure 2.5. The integrals
along (1) and (4) cancel because of periodicity. They also cancel along (2)
and (3) because (2) gets mapped onto (3) with a reversal of direction under

-t +
-
iM . - - - - - - - - - - - - . , t + iM

R t

-
p p+i

Figure 2.4

35
2: The modular group and modular functions

(5)

j (I) (4) r

(2) (3)
~ ~

Figure 2.5

the mapping u = S(r) = -l/r, or r = S-IU = S(u). The integrand remains


unchanged because f[S(u)] = f(u) implies f'[S(u)]S'(u) = f'(u) so

f'(r) d = f'[S(u)] S'( ) d = f'(u) d


f(r) r f[S(u)] u u f(u) u.
Thus we are left with

N - P = 2ni
1 f(5)
f'(r)
f(r) dr.

We transform this integral to the x-plane, x = e 2 "it. As r varies on the


horizontal segment r = u + iM, -t ::; U ::; t, we have

so x varies once around a circle K of radius e - 2nM about x = 0 in the negative


direction. The points above this segment are mapped inside K, so f has no
zeros or poles inside K, except possibly at x = O. The Fourier expansion
gives us

f(r) = a-mm + ... = F(x),


x
say, with
f'(r) d _ F'(x) d
f'(r) = F'(x) ~:, f(r) r - F(x) x.
Hence

N - P = _1
2ni
f
(5)
f'(r) dr
f(r)
= - _1
2ni
i
JK
F'(x) dx = -(NF - PF) = PF - N F,
F(x)
where N F and PF are the number of zeros and poles of F inside K.

36
2.4: Modular functions

If there is a pole of order m at x = °then m > 0, N F = 0, P F = m so


PF - N F = m, and

N = P + m.
°
Thereforeftakes on the value in Rr as often as it takes the value 00.
If there is a zero of order n at x = 0, then m = -n so PF = 0, NF = n,
hence
N +n= P.
°
Again,ftakes the value in Rr as often as it takes the value 00. This proves
the theorem iff has no zeros or poles on the finite part of the boundary of R r .
Iffhas a zero or a pole on an edge but not at a vertex, we introduce detours
in the path of integration so as to include the zero or pole in the interior of R,
as indicated in Figure 2.6. The integrals along equivalent edges cancel as
before. Only one member of each pair of new zeros or poles lies inside the new
region and the proof goes through as before, since by our convention only
one of the equivalent points (zero or pole) is considered as belonging to the
closure of R r .

- t

Figure 2.6

If f has a zero or pole at a vertex p or i we further modify the path of


integration with new detours as indicated in Figure 2.7. Arguing as above we
find

N - P =1-
2ni
{(f + f) + f + f-
CI C3 C2
I 2 iM
/ +
1/2+iM
}f'(r}
-dr
f(r}

= -1
2ni
{(f + f) + f }f'(r}
CI
--
C3 C2 f(r}
dr + m,

where x - m is the lowest power of x occurring in the Laurent expansion near


x = 0, x = e 21tit•
37
2: The modular group and modular functions

-! + iM .------------, ! + iM

"

p + 1
Figure 2.7

Near the vertex p we write


f(r) = (r - p)kg(r), where g(p) "# 0,
The exponent k is positive iffhas a zero at p, and negative iffhas a pole at p,
On the path C 1 we write r - p = re i9 where r is fixed and a :s; :s; nl2 e
where a depends on r. Then
I'(r) k g'(r)
-=--+-
f(r) r - p g(r)
and

_1_
2ni
f
CI
I'(r) dr = _1_
f(r) 2ni
fa
1[(2
(~+
+ re,9)
re,9
g'(p
g(p
+ re i9 ))re i9 i de

- ka' r fll g'(p + reW) . ,


= -- + -
1T
'9 e,6d(}, where a = -2 - a,
2n 2n 1[(2 g(p + re' )

°since the integrand is bounded. Also,


°
As r -> 0, the last term tends to
a' -> nl3 as r -> so

lim _1
2ni
r ... O
f CI
I'(r) dr
f(r)
= - ~6'

Similarly,

lim _1_
2ni
r ... O
f C3
I'(r) dr =
f(r)
k
6

38
2.5: Special values of J

so
lim _1
r~O 2ni
(1 + 1
c, C3
)f'(r) dr =
f(r)
k

Similarly, near the vertex i we write


f(r) = (r - i)lh(r), where h(i) #- °
and we find, in the same way,

lim _1 f f'(r) dr =
r~O 2ni C2 f(r) 2
Therefore we get the formula
k I
N - P = m - 3 - 2'
Ufhas a pole at x = 0, and zeros at p and i, then m, k and I are positive and
we have
k I
N + 3 + 2 = P + m.

The left member counts the number of zeros offin the closure of Rr (with the
conventions agreed on at the vertices) and the right member counts the
number of poles. Iff has a zero of order n at x = then m = - n and the
equation becomes
°
I k
N +n +- + -
= P.
2 3
Similarly, if f has a pole at p or at i the corresponding term k/3 or 1/2 is
negative and gets counted along with P. This completes the proof. 0

Theorem 2.5. Iff is modular and not constant, then for every complex c the
function f - c has the same number of zeros as poles in the closure of R r .
In other words,ftakes on every value equally often in the closure of R r .
PROOF. Apply the previous theorem to f - c. o
Theorem 2.6. Iff is modular and bounded in H then f is constant.
PROOF. Since f is bounded it omits a value so f is constant. o
2.5 Special values of J
Theorem 2.7. The function J takes every value exactly once in the closure of
R r . In particular, at the vertices we have
J(p) = 0, J(i) = 1, J(ioo) = 00.

There is afirst order pole at ioo, a triple zero at p, and J(r) - 1 has a double
zero at r = i.

39
2: The modular group and modular functions

PROOF. First we verify that g2(P) = 0 and g3(i) = O. Since p3 = 1 and


p2 + p + 1 = 0 we have
1 1 1 1
60 g2(P) = ~n (m + np)4 ~n ( mp 3 + np)4 = p4 ~n (mp2 + n)4
1 1 1 1 1
= PJ::n (n - m - mp)4 = pJ..N (N + Mp)4 = 60p g2(P),

so g2(P) = O. A similar argument shows that g3(i) = O. Therefore

J( ) = g2 3(P) = 0 and J(') = g2 3(i) = 1


p ~(p) I g/(i) .

The multiplicities are a consequence of Theorem 2.4. o

2.6 Modular functions as rational functions


of J
Theorem 2.8. Every rational function of J is a modular function. Conversely,
every modular function can be expressed as a rational function of J.
PROOF. The first part is clear. To prove the second, suppose f has zeros at
Zl' ... , Zn and poles at Pl' ... , Pn with the usual conventions about multi-
plicities. Let

g(T) = Ii J(T) - J(Zk)


k= 1 J(T) - J(Pk)
where a factor 1 is inserted whenever Zk or Pk is 00. Then g has the same zeros
and poles asfin the closure of Rr , each with proper multiplicity. Therefore
f /g has no zeros or poles and must be constant, so f is a rational function
~~ 0

2.7 Mapping properties of J


Theorem 2.7 shows that J takes every value exactly once in the closure
of the fundamental region R r . Figure 2.8 illustrates how Rr is mapped by
J onto the complex plane.
The left half of Rr (the shaded portion of Figure 2.8a) is mapped onto the
upper half-plane (shaded in Figure 2.8b) with the vertical part ofthe boundary
mapping onto the real interval ( - 00, 0]. The circular part of the boundary
maps onto the interval [0, 1], and the portion of the imaginary axis v > 1,
u = 0 maps onto the interval (1, + (0). Points in Rr symmetric about the
imaginary axis map onto conjugate points in J(Rr). The mapping is con-
formal except at the vertices T = i and T = P where angles are doubled
and tripled, respectively.

40
2.7: Mapping properties of J

v
Rr

J
..
-/
/
P ",\
I
I \
I \
\
u
-1 0 1
0= J(p) 1 = J(i)

(a) (b)

Figure 2.8

These mapping properties can be demonstrated as follows. On the


imaginary axis in Rr we have. = iv hence x = e 2 "it = e- 2 "v > 0, so the
Fourier series
1
+ L c(n)x"
00
12 3J(.) = - (x = eZ"it)
x "=0
shows that J(iv) is real. Since J(i) = 1 and J(iv) -+ + 00 as v -+ + 00 the
portion of the imaginary axis 1 ~ v < + 00 gets mapped onto the real axis
1 ~ J(.) < +00.
On the left boundary of Rr we have. = -t + iv, hence x = eZ"it =
e- Z1tve-"i = _e- Z1tV < O. For large v (small x) we have J( -t + iv) < 0 so
J maps the line u = -t onto the negative real axis. Since J(p) = 0 and
J( 00) = 00, the left boundary of Rr is mapped onto the line - 00 < J(.) ~ O.
As the boundary of Rr is traversed counterclockwise the points inside Rr
lie on the left, hence the image points lie above the real axis in the image
plane.
Finally, we show that J takes conjugate values at points symmetric about
the imaginary axis, that is,

J(.) = J( -f).

To see this, write. = u + iv. Then

and

Thus. and - f correspond to conjugate points x and x, but the Fourier


series for J has real coefficients so J(.) and J( -f) are complex conjugates.
41
2: The modular group and modular functions

In particular, on the circular arc ti = 1 we have - i = -l/t, hence


J( -i) = J( -l/t) = J(t) so J is real on this arc.

2.8 Application to the inversion problem for


Eisenstein series
In the Weierstrass theory of elliptic functions the periods WI' W2 determine
the invariants 92 and 93 according to the equations
1
g2 = giwI' W2) = 60 L(mw, + nW2 )4
(4) 1
93 = 93(WI, w 2 ) = 140 L(mw, + nW2 )6'
A fundamental problem is to decide whether or not the invariants 92 and 93
can take arbitrary prescribed values, subject only to the necessary condition
92 3 - 279/ =1= O. This is called the inversion problem for Eisenstein series
since it amounts to solving the equations in (4) for WI and W2 in terms of 92
and 93' The next theorem shows that the problem has a solution.

Theorem 2.9. Given two complex numbers a2 and a3 such that a2 3 - 27a32 =1= O.
Then there exist complex numbers WI and W2 whose ratio is not real such
that
and
PROOF. We consider three cases: (1) a2 = 0; (2) a3 = 0; (3) a2a3 =1= O.
Case 1. If a2 = 0 then a3 =1= 0 since a2 3 - 27a32 =1= O. Let w, be any
complex number such that

and let W2 = pw" where p = e211i/3. We know that 93(1, p) =1= 0 because
92(1, p)= 0 and L\(1, p) = 92 3 - 279/ =1= O. Then
1
92(WI, W2) = 92(W" wlP) = -492(1, p) = 0 = a2
WI
and

Case 2. If a3 = 0 then a2 =1= 0 and we take w, to satisfy

Wl4 = _92_(1_,_i)

42
2.9: Application to Picard's theorem

and let W2 = iw i . Then


1
92(W I, ( 2) = 92(W 1 , iwd = -492(1, i) = a2
WI
and
1
93(W I, ( 2) = 93(W 1 , iwd = -693(1, i) = 0 = a3'
WI
Case 3. Assume a2 i= 0 and a3 i= O. Choose a complex r with 1m r > 0
such that
a/
J(r)= 3 27 2'
a2 - a3
Note that J(r) i= 0 since a2 i= 0 and that
J(r) - 1 27a/
(5)
J(r) ~
For this r choose WI to satisfy

and let W2 = rw i . Then


92(W I , ( 2) WI -492 (1, r) 292(1, r) a2
= 6 =W I - - - = -
93(W I, ( 2) WI 93(1, r) 93(1, r) a3'
so

(6)

But we also have


J(r) - 1 279/(WI, ( 2) 27(a3/a2)2 9 /(W I, ( 2) 27a/
J(r) 923(WI' ( 2) 3
92 (WI> ( 2) = a/92(w I, ( 2)'

Comparing this with (5) we find that 92(W I, (2) = a2 and hence by (6) we
also have 93(W I, ( 2) = a3' This completes the proof. D

2.9 Application to Picard's theorem


The modular function J can be used to give a short proof of a famous theorem
of Picard in complex analysis.

Theorem 2.10. Every nonconstant entire function attains every complex value
with at most one exception.

Note. An example is the exponential function f(z) = eZ which omits


only the value O.

43
2: The modular group and modular functions

PROOF. W~ assume / is an entire function which omits two values, say a


and b, a "# b, and show that/is constant. Let

( ) _ /(z) - a
9z - b .
-a
Then 9 is entire and omits the values 0 and 1.
The upper half-plane H is covered by the images of the closure of the
fundamental region Rr under transformations of r. Since J maps the closure
of Rr onto the complex plane, J maps the half-plane H onto an infinite-
sheeted Riemann surface with branch points over the points 0, 1 and 00
(the images of the vertices p, i and 00, respectively). The inverse function J- 1
maps the Riemann surface back onto the closure of the fundamental region
R r . Since J'(T) "# 0 if T "# p or T "# i and since J'(p) = J'(i) = 0, each single-
valued branch of r 1 is locally analytic everywhere except at 0 = J(p),
1 = J(i), and 00 = J( (0). For each single-valued branch of r 1 the composite
function
h(z) = r 1 [g(z)]

is a single-valued function element which is locally analytic at each finite


z since g(z) is never 0 or 1. Therefore h is arbitrarily continuable in the entire
finite z-plane. By the monodromy theorem, the continuation of h exists as a
single-valued function analytic in the entire finite z-plane. Thus h is an entire
function and so too is
qJ(z) = eih(z).

But 1m h(z) > 0 since h(z) E H so


IqJ(z) I = e-1mh(z) < 1.
Therefore qJ is a bounded entire function which, by Liouville's theorem,
must be constant. But this implies h is constant and hence 9 is constant since
g(z) = J[h(z)]. Therefore/is constant since/(z) = a + (b - a)g(z). 0

Exercises for Chapter 2


In these exercises, r denotes the modular group, Sand T denote its gen-
erators, S(T) = -l/T, T(T) = T + 1, and I denotes the identity element.
1. Find all elements A of r which (a) commute with S; (b) commute with ST.
2. Find the smallest integer n > 0 such that (ST)" = I.
3. Determine the point 't in the fundamental region Rr which is equivalent to
(a) (8 + 6i)/(3 + 2i); (b) (Wi + 11)/(6i + 12).

4. Determine all elements A of r which leave i fixed.

5. Determine all elements A of r which leave p = e2ni/ 3 fixed.

44
Exercises for Chapter 2

QUADRATIC FORMS AND THE MODULAR GROUP


The following exercises relate quadratic forms and the modular group r. We
consider quadratic forms Q(x, y) = ax 2 + bxy + cy2 in x and y with real
coefficients a, b, c. The number d = 4ac - b2 is called the discriminant of
Q(x, y).
6. If x and yare subjected to a unimodular transformation, say

(I) x = (Xx' + {3y', y = yx' + by', where (~ !) E r,


prove that Q(x, y) gets transformed to a quadratic form Ql(X', y') having the same
discriminant. Two forms Q(x, y) and Ql(X', y') so related are called equivalent. This
equivalence relation separates all forms into equivalence classes. The forms in a given
class have the same discriminant, and they represent the same integers. That is, if
Q(x, y) = n for some pair of integers x and y, then Ql(X', y') = n for the pair of
integers x', y' given by (1).
In Exercises 7 thru 10 we consider forms ax 2 + bxy + cy2 with d > 0,
a > 0, and c > 0. The associated quadratic polynomial
f(z) = az 2 + bz + c
has two complex roots. The root r with positive imaginary part is called the
representative of the quadratic form Q(x, y) = ax 2 + bxy + ci.
7. (a) If d is fixed, prove that there is a one-to-one correspondence between the set
of forms with discriminant d and the set of complex numbers T with Im(T) > O.
(b) Prove that two quadratic forms with discriminant d are equivalent if and only if
their representatives are equivalent under r.

Note. A reduced form is one whose representative r ERr. Thus, two


reduced forms are equivalent if and only if they are identical. Also, each
class of equivalent forms contains exactly one reduced form.
S. Prove that a form Q(x, y) = ax 2 + bxy + cyZ is reduced if, and only if, either
- a < b :S a < c or 0 :S b :S a = c.
9. Assume now that the form Q(x, y) = ax 2 + bxy + cy2 has integer coefficients
a, b, c. Prove that for a given d there are only a finite number of equivalence classes
with discriminant d. This number is called the class number and is denoted by h(d).
Hint: Show that 0 < a :S Jdi3 for each reduced form.
10. Determine all reduced forms with integer coefficients a, b, c and the class number
h(d) for each d in the interval 1 :S d :S 20.

CONGRUENCE SUBGROUPS

The modular group r has many subgroups of special interest in number


theory. The following exercises deal with a class of subgroups called con-
gruence subgroups. Let

and

45
2: The modular group and modular functions

be two unimodular matrices. (In this discussion we do not identify a matrix


with its negative.) If n is a positive integer write
A == B (mod n) whenever a == ct, b == /3, c == y and d == {) (mod n).
This defines an equivalence relation with the property that
and
implies
A1B1 == A2B2 (mod n) and A 1- 1 == A2 -1 (mod n).
Hence
A == B (mod n) if, and only if, AB- 1 == I (mod n),
where I is the identity matrix. We denote by r(n) the set of all matrices in r
congruent modulo n to the identity. This is called the congruence subgroup
of level n (stufe n, in German).
Prove each of the following statements:
11. pn) is a subgroup of r. Moreover, if B E r(n) then A-I BA E pn) for every A in r.
That is, pn) is a normal subgroup of r.
12. The quotient group rjr(n) is finite. That is, there exist a finite number of elements of
r, say A I> ••• , A k , such that every B in r is representable in the form
B = AiB(n) where 1 ::; i ::; k and B(n) E pnl.
The smallest such k is called the index of pnl in r.
13. The index of pn) in r is the number of equivalence classes of matrices modulo n.

The following exercises determine an explicit formula for the index.


14. Given integers a, b, e, d with ad - be == 1 (mod n), there exist integers IX, p, y, b
such that IX == a, p == b, y = e, b == d (mod n) with IX£; - py = 1.
15. If (m, n) = 1 and A E r there exists A in r such that

A == A (mod n) and A == I (mod m).


16. Letf(n) denote the number of equivalence classes of matrices modulo n. Thenfis a
multiplicative function.
17. If a, b, n are integers with n ;::: 1 and (a, b, n) = 1 the congruence
ax - by == 1 (mod n)
has exactly n solutions, distinct mod n.(A solution is an ordered pair (x, y) of integers.)
18. For each prime p the number of solutions, distinct mod pr, of all possible congruences
of the form
ax - by == 1 (mod prj, where (a, b, p) = \,
is equal to f(pr).
19. If p is prime the number of pairs of integers (a, b), incongruent mod pr, which satisfy
the condition (a, b, p) = 1 is pZr-Z(pZ - 1).
20. f(n) = n 3 Ldln !J.(d)jd Z , where!J. is the Mobius function.

46
3
The Dedekind eta function

3.1 Introduction
In many applications of elliptic modular functions to number theory the
eta function plays a central role. It was introduced by Dedekind in 1877
and is defined in the half-plane H = {t: Im(t) > O} by the equation

TI (1
00

(1) l1{t) = enit/12 - e2nint).


n=1

The infinite product has the form TI (1 - xn) where x = e 2nit . If t E H then
Ix I < 1 so the product converges absolutely and is nonzero. Moreover,
since the convergence is uniform on compact subsets of H, l1{t) is analytic
onH.
The eta function is closely related to the discriminant Ll(t) introduced
in Chapter 1. Later in this chapter we snow that
Ll(t) = (211:)12 11 24(t).

This result and other properties of 1J(t) follow from transformation formulas
which describe the behavior of l1(t) under elements of the modular group r.
For the generator Tt = t + 1 we have

= eni(t+ 1)/12 TI (1
00

(2) l1(t + 1) - e 2nin (t+ 1)) = e 1til12 11(t).


n= 1

Consequently, for any integer b we have

(3)
Equation (2) also shows that 11 24(t) is periodic with period 1.

47
3: The Dedekind eta function

For the other generator Sr = -l/r we have the following theorem.


Theorem 3.1. If r E H we have

(4) 11( ~1) = (-ir)I/2 11(r).


Note. We choose that branch of the square root function ZI/2 which is
positive when z > o.

This chapter gives two different proofs of (4). The first is a short proof of
C. L. Siegel [48] based on residue calculus, and the second derives (4) as a
special case of a more general functional equation which relates

l1(ar
cr
+
+d
b)
to l1(r) when

(; :) E rand c > o.
(See Theorem 3.4.) A third proof, based on interchange of summation in a
conditionally convergent iterated series, is outlined in the exercises.

3.2 Siegel's proof of Theorem 3.1


First we prove (4) for r = iy, where y > 0, and then extend the result to all
r in H by analytic continuation. If r = iy the transformation formula becomes
l1(i/y) = yl/2 11 (iy), and this is equivalent to
log l1(iM - log l1(iy) = t log y.
Now
:n:y
log l1(iy) = - -
12
+ log n (1 -
00

n=1
e- 2nny )

:n:y :n:y 00 en

L L
00

- 12 + n~1 10g(1 - e- 2nny ) = ---


12 n= 1 m= 1 m
:n:y 00 1 e- 2nmy :n:y en 1 1
- 12- m~I;;;1-e-2nmy - - + '\' - ~----.::=
12 mL: 1 m 1 - e2nmy ·
Therefore we are to prove that
1 1 1 1 :n: ( 1) 1
e2nm/y - 12 Y - Y = - 2" log y.
00 00
(5) m~ 1 ;;; 1 - e 2nmy - m~ 1 ;;; 1 -
This will be proved with the help of residue calculus.
For fixed y > 0 and n = 1,2, ... , let
1 :n:Nz
Fiz) = - - cot :n:iNz cot--
8z y ,

48
3.2: Siegel's proof of Theorem 3.1

-y y

-i

Figure 3.1

where N = n + 1. Let C be the parallelogram joining the vertices y, i, - y, - i


in that order. (See Figure 3.1.) Inside C, Fn has simple poles at z = ik/N and
at z = ky/N for k = ± 1, ±2, ... , in. There is also a triple pole at z = 0
with residue i(y - y-l)/24. The residue at z = ik/N. is
1 nik
8nk cot y'
Since this is an even function of k we have
1 nik
L" L
n
Res Fiz) = 2 8 k cot-.
k= -n z=ik/N k= 1 n Y
k,.O

But

cot i(}
cos i(}
= -.-.-
sm I(} = i e
e- 8
8
+ e88 =
- e -
e28 + 1 1(
1 I - e28 .
i e28 - 1 = i -
2)
Using this with () = nk/y we get
n 1"1 I n l 1
L Res F (z) = - L - - - L ----"--,,..,-
k= -n z=ik/N n 4ni k= 1 k 2ni k= 1 k 1 - e21tk / y '
k,.O

Similarly
n in 1 i nl 1
L Res Fn(z) =-4 L -k--2 L-k-l-~2"~kY'
k = -" z =ky/N n k =1 n k =1 - e
k,.O

Hence 2ni times the sum of all the residues of Fiz) inside C is an expression
whose limit as n -+ 00 is equal to the left member of(5). Therefore, to complete
the proof we need only show that

lim
n-+C()
f C
Fiz) dz =, -1 log y.

49
3: The Dedekind eta function

On the edges of C (except at the vertices) the function zFn(z) has, as


t
n -+ 00, the limit on the edges connecting y, i and - y, - i, and the limit - t
on the other two edges. Moreover, Fn(z) is uniformly bounded on C for all n
(because N = n + ! and y > 0). Hence by Arzeh't's bounded convergence
theorem (Theorem 9.12 in [3]) we have

.f
hm
n--+oo C
Fn(z) dz = f. dz
hm zFn(z)-
C n-4co Z

1
=- {f
8
-
Y

-i
+ Ii -
Y
f- + f- i} -
i
Y

-y
dz
Z

= ~ {-
4
fY + Ii} dz
_I Y Z

= ~ { - (log y+ ~i) + (~ - log y)} = - ~ log y.


This completes the proof. o
3.3 Infinite product representation for Ll(T)
In this section we express the discriminant d(r) in terms of I1(r) and thereby
obtain a product representation of d(r). The result makes use of the following
property of d(r).

Theorem 3.2. If (: :) E r then

de: : :) = (er + d)12d(r).


In particular,

d(r + 1) = d(r) and

PROOF. Since d(w l , w 2 ) is homogeneous of degree -12 we have


d(w l , w 2 ) = WI -12 d (l, r) = WI -12d (r),
where r = W 2 /W 1 . Also,
d(W 1 , w 2 ) = d(W I ', w 2 ')
if (WI' w 2 ) and (WI', w 2') are equivalent pairs of periods. Taking WI = 1,
w 2 = r, WI' = er + d, w 2 ' = ar + b, we find

d(r) = d(WI' W2) = d(er + d, ar + b) = (er + d)-12 d ( at +


1, ~-. b)
er + d
0

50
3.4: The general functional equation for ~(r)

Theorem 3.3. If r E H and x = e21tit we have

(6) L\(r) = (2n)121'/24(r) = (2n)12x n (1 -


00

X"f4.
"~ 1

Consequently,

L r(n)x" = x n (1
00 00

(7) - X")24 whenever Ix I < 1


"~1 "~1

where r(n) is Ramanujan's tau function.

PROOF. Let f(r) = L\(r)/I'/24(r). Then f(r + 1) = f(r) and f( -l/r) = f(r),
so f is invariant under every transformation in r. Also, f is analytic and non-
zero in H because L\ is analytic and nonzero and 1'/ never vanishes in H.
Next we examine the behavior of fat ioo. We have

I'/24(r) = e 21tit n (1 -
00

n~l
e 21ti"t)24 =x n (1 -
00

n~l
X")24 = x(l + I(x)),

where I(x) denotes a power series in x with integer coefficients. Thus, I'/24(r)
has a first order zero at x = O. By Theorem 1.19 we also have the Fourier
expansion
00

(8) L\(r) = (2n)12 L r(n)x" = (2n)12x(1 + I(x)).


"~ 1

Thus, near ioo the function f has the Fourier expansion

(9) f( ) = L\(r) = (2n)12x(1 + I(x)) = (2 )12(1 I())


r I'/24(r) x(l + I(x)) n + x,

so f is analytic and nonzero at ioo. Therefore f is a modular function which


never takes the value 0, so f must be constant. Moreover, (9) shows that
this constant is (2n)12, hence L\(r) = (2n)121'/24(r). This proves (6), and (7)
follows from (8). 0

3.4 The general functional equation for t](r)


Extracting 24th roots in the relation

L\(ar
cr
+
+d
b) = (cr + d)12L\(r)
and using (6) we find that

I'/(ar
cr
+
+d
b) = e(cr + d)1/21'/(r),

where e 24 = 1. For many applications of I'/(r) we require more explicit


information concerning e. This is provided in the next theorem.
51
3: The Dedekind eta function

Theorem 3.4 (Dedekind's functional equation). If (; :) E r, c > 0, and


'T E H, we have

(10) 11(:: : !) = e(a, b, c, d){ - i(cr + dW/ 2 11(r)


where

e(a, b, c, d) = ex p{ nie 1;cd + s( - d, C))}


and

(11) s(h, k) = L -kr (hr


k-1
r= 1
-k - -21).
-k - [hrJ
Note. The sum s(h, k) in (11) is called a Dedekind sum. Some of its properties
are discussed later in this chapter.

We will prove Theorem 3.4 through a sequence of lemmas. First we note


that Dedekind's formula is a consequence of the following equation, obtained
by taking logarithms of both members of (10),

(12) log 11(:: : !) = log I1(r) +n{a l;cd + s( -d, C)) +! log{ -i(cr + d)}.
From the definition of I1(r) as a product we have
nir nir
(13) log I1(r) = -
12
+ L log(1 -
00

n=l

e 2n •nt ) = - -
12
L A( -inr),
00

n=l

where A(X) is defined for Re(x) > 0 by the equation


e-2nmx
L --.
00

(14) A(X) = -log(1 - e- 2nx ) =


m=l m
Equations (12) and (13) give us

Lemma 1. Equation (12) is equivalent to the relation

(15) LA(-mr)
00 •
= LA
00 (
-in +-
ar - b) + -ni ( r ---
ar + b)
n=l n=l cr+d 12 cr+d

a +d
+ ni( ~ + s( -d, c) ) +! log{ - i(cr + d)}.

We shall prove (15) as a consequence of a more general transformation


formula obtained by Sh6 Iseki [17] in 1957. For this purpose it is convenient
to restate (15) in an equivalent form which merely involves some changes
in notation.

52
3.5: Iseki's transformation formula

Lemma 2. Let z be any complex number with Re(z) > 0, and let h, k and H be
any integers satisfying (h, k) = 1, k > 0, hH == -1 (mod k). Then Equation
(15) is equivalent to the formula

(16) I: A{~k
n= 1
(z - ih)} = I: A{~k (~z - iH)}
n= 1

n (z -
+ "21 log z - 12k ~1) + . 1rls(h, k).

PROOF. Given (: ;) in r, with c > 0, and given r with Im(r) > 0, choose
z, h, k, and H as follows:
k = e, h = -d, H = a, z = -i(cr + d).
Then Re(z) > 0, and the condition ad - be = 1 implies - hH - bk = 1, so
(h, k) = 1 and hH == -1 (mod k). Now b = -(hH + 1)/k and iz = er + d,
so
iz - d iz +h
r=--=--
e k
and hence

iz + h hH + 1 iz (
ar+ b = H -k- - k =I H+~. i)
Therefore, since er +d= iz, we have

ar +
er +d
b= ~k (H + ~).
z
Consequently

ar + b
r - cr + d =
1-
k (h H) + ki ( z - 1)
~ = - -e- +
a +d
ki ( Z - ~
1)
so

~~ (r -;;: ~) = -n{a l;ed ) - l;k (z -~).


Substituting these expressions in (15) we obtain (16). In the same way we
find that (16) implies (15). 0

3.5 Iseki's transformation formula


Theorem 3.5 (Iseki's formula). If Re(z) > ° °S and (I. S 1, °s {3 s 1, let
00

(17) A((I., {3, z) = L {A((r + (I.)z - i{3) + A((r + 1 - (I.)z + i{3)}.


r=O

53
3: The Dedekind eta function

Then if either 0 ~ ex ~ 1 and 0 < {3 < 1, or 0 < ex < 1 and 0 ~ {3 ~ 1,


we have

(18) A(ex, {3, z) = A(1 - {3, ex, Z-l) - nz i


n:O
(2)(iZ)-nB2_iex)Bn({3).
n

Note. The sum on the right of (18), which contains Bernoulli polynomials
Bix), is equal to

PROOF. First we assume that 0 < ex < 1 and 0 < {3 < 1. We begin with the
first sum appearing in (17) and use (14) to write
e27timfJ
L: A((r + ex)z - L: L - - e-27tm('+~)z.
ct:J ct:J ct:J

(19) i{3) =
,:0 ,:Om:J m
Now we use Mellin's integral for e- X which states that

(20) 1.
e- x = -2
m
IC
+

c-ct:Ji
ooi
r(s)x- S ds,

where c > 0 and Re(x) > O. This is a special case of Mellin's inversion formula
which states that, under certain regularity conditions, we have

cp(s) = f
oo
xs-Jt/I(x) dx
.
if, and only if, t/I(x) = -2.
1 I C +ooi
cp(s)x- S ds.
o nl c- ct:Ji

In this case we take cp(s) to be the gamma function integral,

r(s) = LX) xs-Je- x dx

and invert this to obtain (20). (Mellin's inversion formula can be deduced
from the Fourier integral theorem, a proof of which is given in [3]. See also
[49], p. 7.) Applying (20) with x = 2nm(r + ex)z and c = 3/2 to the last
exponential in (19) and writing I(c) for I~~~: we obtain

L: A((r + ex)z -
00
i{3) = L: L -m- -2.m
00 00 e27timfJ 1 f. r(s){2nm(r + ex)z} -s ds
f.
,:0 ,:0 m: J (3/2)

1 r(s) 00 1 e 21timfJ
L (r + ex)S L -y:tS ds
00
= -2
m. (3/2)
-(2)S
nz ,:0 m:J m

= -2. f.
1 r(s)
-(2)S ((s, ex)F({3, 1 + s) ds.
m (3/2) nz
54
3.5: Iseki's transFormation Formula

Here ((s, 0:) is the Hurwitz zeta function and F(x, s) is the periodic zeta function
defined, respectively, by the series
ex 1 oc e27rimx

((s,o:) = L (. + 0: y' and F(x, s) = L ~5-


r=O I m=1 m
where Re(s) > 1,0< 0: ::; 1, and x is real. In the same way we find

f. A((r + 1 -
r=O
o:)z + i{3) 1.
= -2
7rl
J
(312)
(2r(S))5 ((s, 1 - o:)F(1 - {3, 1
1[Z
+ 5) ds,

so (17) becomes

(21) A(o:, {3, z) 1.


= -2
m
J
(312)
z-5<1>(0:, {3, s) ds,

where

(22) <1>(0:, {3, s) = (r21[(s)) Ws,o:)F({3, 1 + s) +


5
((s, 1 - o:)F(1 - {3, 1 + s)}.

Now we shift the line of integration from c = 1 to c = -l Actually, we


apply Cauchy's theorem to the rectangular contour shown in Figure 3.2,

i + iT

~
-- i + iT

t
i-iT -- i-iT

Figure 3.2

and then let T ..... 00. In Exercise 8 we show that the integrals along the
horizontal segments tend to 0 as T ..... 00, so we get

i312) - i - 312) +R
where R is the sum of the residues at the poles of the integrand inside the
rectangle. This gives us the formula

A(o:, (3, z) = ~-:


2m
J
(-3;2)
z-5<1>(0:, (3, s) ds + R.

55
3: The Dedekind eta function

In this integral we make the change of variable u = - s to get it back in


the form of an integral along the ~ line. This gives us

(23) A(a, {3, z) 1.


= -2
7II
f(3/2)
z"<1>(a, {3, -u) du + R.

Now the function <1> satisfies the functional equation


(24) <1>( a, {3, - s) = <1>(1 - {3, a, s).
This is a consequence of Hurwitz's formula for ((s, a) and a proof is outlined
in Exercise 7. Using (24) in (23) we find that
(25) A(a, {3, z) = A(1 - {3, a, z - I) + R.
To complete the proof of Iseki's formula we need to compute the residue
sum R.
Equation (22) shows that <1>(a, {3, s) has a first order pole at each of the
points s = 1, °
and -1. Denoting the corresponding residues by R(1),
R(O) and R( - 1) we find
f(1) 1 (e 2 ninfJ e - 2 nin fJ )
+ F(1 L + -~2~
CL
R(1) =- {F({3, 2) - {3,2)} =- -2
2nz 2nz n= 1 n n
1 oc e2ninfJ 1 - (2ni)2 n
= -2
nz
L
n= - x
- z = -2
n nz.
2' B z({3) =-
z
B 2 ({3),
n*O

where we have used Theorem 12.19 of [4] to express the Fourier series as a
Bernoulli polynomial.
To calculate R(O) we recall that ((0, a) = 1 - a. Hence ((0, 1 - a) = a - 1
so
e 2nin f! _ e - 2ninfJ
L ----
CL

R(O) = ((0, a)F({3, 1) + ((0, 1 - a)F(l - {3, 1) = (1- a)


n= 1 n
co e2ninfJ oc e2ninfJ
= (1 - a) L -- = -BI(a) L - - = 2niBI(a)Bl({3),
n=-~ n n=-o: n
n*O ,dO

where again we have used Theorem 12.19 of [4]. To calculate R( -1) we


write

R( -1) = Res z-s<1>(a, {3, s) = lim (s + l)z-s<1>(a, {3, s)


s= - 1 s- - 1

= lim( - s + l) zs<1>(a, {3, - s).


s~ 1

Using the functional equation (24) we find

R( -1) = lim(1 - s)zs<1>(1 - {3, a, s) = - Res zS<1>(1 - {3, a, s).


s= 1

56
3.5: Iseki's transformation formula

Note that this is the same as R(l) = Res s = 1 z-S<l>(ex, {3, s), except that z is
replaced by - z - 1, ex by 1 - {3, and {3 by ex. Hence we have
R( -1) = - nzB2(ex).
Thus

R = R( -1) + R(O) + R(1) = -1tznto G)(iZ)-IIB2- n(ex)B n({3).


This proves Iseki's formula under the restriction 0 < ex < 1,0 < 13 < 1.
Finally, we use a limiting argument to show it is valid if 0 :s; ex :s; 1 and
o < [3 < 1, or if 0 :s; [3 :s; 1 and 0 < ex < 1. For example, consider the series
oc' oc oc' e2rrimfJ
LA((r+ex)z-i[3)= L L _ _ e- 2rrm (r+.)z
r=O r=OIll=1 m

L - - e- 2rrm.z L e-21tmrz
ex e2rrimp ex.
=
111=1 m r=O
(( e 21tim {J e - 2 rrmaz x

L
",=1
- - 1 _ e 2rrmz
m
L e 2rrimPJ.(m),
m=1

say, where
1 e - 2rrm.z

J,(m) = ;;; 1 _ e 2rrmz'

As m ~ CXJ, I,(m) ~ 0 uniformly in ex if 0 :s; ex :s; 1. Therefore the series

L
CJJ

e2rrimp J,(m)
m=1

converges uniformly in ex if 0 :s; ex :s; 1, provided 0 < {3 < 1, so we can pass


to the limit ex ~ 0 + term by term. This gives us
oc oc
lim L A((r + ex)z - i{3) = L A(rz - if3).
<1-0+ r=O r=O

Therefore, if 0 < [3 < 1 we can let ex ~ 0 + in the functional equation. The


other limiting cases follow from the in variance of the formula under the
following replacements:

ex ~ 1 - ex, {3~I-f3

ex ~ 13, 13 ~ 1 - ex, z~­


z

ex ~ 1 - {3, z ~-.
z
o
57
3: The Dedekind eta function

3.6 Deduction of Dedekind's functional


equation from Iseki's formula
Now we use Iseki's formula to prove Equation (16) of Lemma 2. This, in turn,
will prove Dedekind's functional equation for ,.,(r).
Equation (16) involves integers hand k with k > O. First we treat the case
k = 1 for which Equation (16) becomes

(26) "~lA{n(Z - ih)} = n~/{nG - iH)} + ~ log z - ;2 (z -~).


Since A(X) is periodic with period i this can be written as

(27) L A(nz) =
00

n= 1
LA
00

n= 1
- (n) + -21log z - -12n(z - -1)z .
Z

We can deduce this from Iseki's formula (18) by taking f3 = 0 and letting
0( -+ 0 +. Before we let 0( -+ 0 + we separate the term r = 0 in the first term

of the series on the left of(18) and in the second term ofthe series on the right
of (18). The difference of these two terms is A(o(Z) - A(iO(). Each of these tends
to 00 as 0( -+ 0+ but their difference tends to a finite limit. We compute this
limit as follows:
. 1 - e-2"i~
A(o(Z) - A(iO() = 10g(1 - e- 21t1 <%) - 10g(1 - e- 2,,<%Z) = log 1 _ e 2,,<%z'

By L'Hopital's rule,
. 1 - e- 2"i<% • 2ni i
hm 1
<%-0+ - e
2,,<%z = <%-0
hm -2 = -
nz Z
so
lim (A(O(Z) - A(iO(» = log ~ = ni - log z.
<%-0+ Z 2
Now when 0( -+ 0+ the remaining terms in each series in (18) double up
and we obtain, in the limit,

(28)
ni
- - log Z + 2 L
00
A(rz) = 2 A - - - L00 (r) nz + -n+-.
ni
2 r= 1 r= 1 Z 6 6z 2
This reduces to (27) and proves (16) in the case k = 1.
Next we treat the case k > 1. We choose rational values for 0( and f3 in
Iseki's formula (18) as follows. Take

0( =~, where 1 ::;;; Jl. ::;;; k - 1

58
3.6: Deduction of Dedekind's functional equation from Iseki's formula

and write

hJ1 = qk + v, where 1 ::;; v ::;; k - 1.

Now let

Note that v == hf.1 (mod k) so - Hv == - HhJ1 == f.1 (mod k), and therefore
-Hv/k == 14k (mod 1). Hence ex = f.1/k == -Hv/k (mod 1) and f3 = v/k ==
hf.1/k (mod 1). Substituting in Iseki's formula (18) and dividing by 2 we get

Rewrite this as follows:

Now sum both sides on f.1 for J1 = 1, 2, ... , k - 1 and note that

{rk + II: r = 0, 1,2, ... ; f.1 = 1,2, ... , k - I} = {n: 11 =j. °


(mod k)}

and similarly for the set of all numbers rk + k - f.1. Also, since v == hf.1 (mod k),
as f.1 runs through the numbers 1,2, ... , k - 1 then v runs through the same

59
3: The Dedekind eta function

set of values in some other order. Hence we get

n~1 A(~ (z - ih)) = n~1 A(k~ (-z1- iH)) + -rr2 (1-z - z) /1=L k-I

I
112
2"
k
n'tO (modk) ntO (modk)

- n(1
- - - Z L 11- + -rr(1- - )(k -
)k-1 Z 1)
2 Z /1= I k 12 Z

+ ni L - - - - - - L - + -
k-I Ii (v 1) ni k-I v rri
(k - 1)
/1=lk k 2 2/1=lk 4

x ((k - 1)(2k - 1) _ 3(k _ 1)


k
+ (k - 1)) + rr/f ~ (~ - ~)
/1= k k 2
I

<X)

L A( -n (1- - iH))
k Z
+ -rr ( Z
12
- -1) (
Z
1 - -1)
k
+ rri kL (v
- I -
P - - -1).
/1= I k k 2
n= I
n to (mod k)

But v was defined by the equation hp = qk + v, so we have

h: = q + ~, q= [h: J I = h: - [~ J.
Therefore

ki l ~ (~ _~) = hf ~ (h P_ [hPJ _~) = s(h, k).


/1= k k 2
I /1= k k k 2 I

Therefore we have proved that

(29)

Add this to Equation (27) which corresponds to the case k = 1:

m= I
f A(rnz) = m=f A(~) - ~12 (z - ~)z + ~2 log z.
I Z

This accounts for the missing terms in (29) with n == 0 (mod k), if we write
11 = rnk. When (27) is combined with (29) we get

f A(~k (z - ih)) = f A(~k (~z - iH)) -


n= I n= I
~
12k
(z - ~)z + ~2 log z+ nis(h, k).
60
3.7: Properties of Dedekind sums

This proves (16) which, in tum, completes the proof of Dedekind's functional
equation for 1](T). For alternate proofs see p. 190 and [18], [35], and [45]. 0

3.7 Properties of Dedekind sums


The Dedekind sums s(h, k) which occur in the functional equation for I1(T)
have applications to many parts of mathematics. Some of these are described
in an excellent monograph on Dedekind sums by Rademacher and Grosswald
[38]. We conclude this chapter with some arithmetical properties of the sums
s(h, k) which will be needed later in this book. In particular, Theorem 3.11
plays a central role in the study of the invariance of modular functions under
transformations of certain subgroups of r, a topic discussed in the next
chapter.
Note. Throughout this section we assume that k is a positive integer and
that (h, k) = 1.
Dedekind sums are defined by the equation

(30) s(h, k) = kit ~ (hr _ [hr] _ ~).


r= t k k k 2
First we express these sums in terms of the function «x)) defined by
_ {x - [xJ -! if x is not an integer,
«x)) - 0 · f··
1 X IS an mteger.

This is a periodic function of x with period 1, and « -x)) = -«x)). Actually,


((x)) is the same as the Bernoulli periodic function Bj(x) discussed in [4J,
Chapter 12. Since «(x)) is periodic and odd we find that

and, more generally,

"" ((lir))
L.
r mod k k
_0 for (Ii, k) = 1.

Since

the Dedekind sums can now be represented as follows:

(31)

This representation is often more convenient than (30) because we can exploit
the periodicity of «x)).

61
3: The Dedekind eta function

Theorem 3.6
(a) If h' = ±h (mod k), then 5(h', k) = ±5(h, k), with the same sign as
in the congruence. Similarly, we have:
(b) If hn =
± 1 (mod k) then s(n, k) = ± 5th, k).
(c) If h 2 + 1 = 0 (mod k), then 5th, k) = O.

PROOF. Parts (a) and (b) follow at once from (31). To prove (c) we note that
h2 + 1 =°
(mod k) implies h = -n
(mod k), where 11 is the reciprocal of
h mod k, so from (a) and (b) we get 5th, k) = -5th, k) = O. 0

For small values of h the sum 5th, k) can be easily evaluated from its
definition. For example, when h = 1 we find

s(l, k) =
r(r 1) 1 L r - -1 L r
L - -- -
k- 1
= 2:
k- 1 2 k- 1

r= 1 k k 2 k r= 1 2k r= 1

(k-l)(2k-l) k-l (k-l)(k-2)


6k 4 12k

Similarly, the reader can verify that

(2 k) = (k - l)(k - 5) if k is odd.
s , 24k

In general there is no simple formula for evaluating s(h, k) in closed form.


However, the sums satisfy a remarkable reciprocity law which can be used
as an aid in calculating 5th, k).

3.8 The reciprocity law for Dedekind sums

Theorem 3.7 (Reciprocity law for Dedekind sums). If h > 0, k > 0 and
(h, k) = 1 we have

12hks(h, k) + 12khs(k, h) = h 2 + k 2 - 3hk + 1.

PROOF. Dedekind first deduced the reciprocity law from the functional
equation for log I1(T). We give an arithmetic proof of Rademacher and
Whiteman [39], in which the sum L~= 1 «hr/k))2 is evaluated in two ways.
First we have

(32) L ((hr))2
k - = L -
((hr))2 = L -
((r))2 = L (r---1)2
k- 1
r =1 k r mod k k r mod k k r=1 k 2 .

62
3.8: The reciprocity law for Dedekind sums

We can also write

i
r= 1
((hl'))2 = kil (hI' _ [hl'J _~)2
k r= 1 k k 2

_
-
L (h2r2
k-l
-k 2+ [hrJ2
r= 1
1 hI'
-k +---+
4 k
[hrJ
-k -2hr
-k
[hrJ)
-k

L
= 2h k-l
r= 1
-kI' (hr-k - [hrJ
-k --21)

L [hrJ([hrJ
+ k-J - - + ) h L
1 - 2:
2
k-l
1'2 + -1 k-lL 1.
r= 1 k k k r= 1 4 r= 1

Comparing this with (32) and using (30) we obtain

In the sum on the left we collect those terms for which [hr/k] has a fixed value.
I'
Since 0 < < k we have 0 < hr/k < h and we can write

(34) [~ J= v-I, where v = 1,2, ... , h.

For a given v let N(v) denote the number of values of I' for which [hr/k] =
v - 1. Equation (34) holds if, and only if

v-l<-<v
hI' or
k(v - 1) kv
k ' h <I'<h'

equality being excluded since (h, k) = 1 and 0 < I' < k. Therefore, if
1 S; v S; h - 1, Equation (34) holds when ranges from [k(v - 1)/h] + 1 I'
to [kv/h], and hence

N(v) = [~: J - [k(V ~ 1)J if I S; v S; h - 1.

But when v = h the quotient kv/h = k and since I' = k is excluded we have

N(h) = k _ 1_ [k(h; 1]
63
3: The Dedekind eta function

Hence

(35) L [hrJ([hrJ
k- I- - + 1) = Lh (v - l)vN(v)
r= I k k v= I

v~/v
h
- 1)v ([kVJ
h - [k(V h- I)J) - h(h - 1)

L
= h-I [kVJ
- {(v - 1)v - v(v + I)}
1'= I h
+ kh(h - 1) - h(h - 1)

= Lv~
-2 h-I [k J+ h(h - l)(k - 1).
v= I h
Now we also have

2hs(k, h) = 2 L v(kV
h-I
- -
[kVJ
- -
1)
- = - 2L v -
h-I [kVJ + -2k L v2 - h-I
h-I
LV
1'=1 h h 2 1'=1 h h,,=l v=1

so (35) becomes

k- I
II:
[h J([h J)
+ 1 = 2hs(k, h) -
:
2k
h V~I v2 + V~I V + h(h
h- 1 h- I
- l)(k - 1).

We use this in (33) and multiply by 6k to obtain the reciprocity law. 0

3.9 Congruence properties of Dedekind sums


Theorem 3.8. The number 6ks(h, k) is an integer. Moreover, iff) = (3, k) we have
(a) 12hks(k, h) == 0 (mod f)k)
and
(b) 12hks(h, k) == h 2 + 1 (mod f)k).

PROOF. From (30) we find

6h k- I k- I [hrJ k- I
(36) 6ks(h, k) = k r~/2 - 6 r~/ k - 3 r~/'
Since 6 I::
i r2 = k(k - 1)(2k - 1) each term on the right of (36) is an
integer. Moreover, (36) shows that

6ks(h, k) == h(k - 1)(2k - 1) (mod 3)

so we have

(37) 12ks(h, k) == 2h(k - 1)(2k - 1) == h(k - l)(k + 1) (mod 3).


64
3.9: Congruence properties of Dedekind sums

If 31 k then 3,(h and (37) implies


12ks(h, k) == - h =1= 0 (mod 3).
If 3,(k then 31(k - l)(k + 1) and (37) implies

(38) 12ks(h, k) == 0 (mod 3).


In other words, 12ks(h, k) == 0 (mod 3) if, and only if, 3,(k. Hence, inter-
changing hand k, we have
12hs(k, h) == 0 (mod 3) if, and only if, 3,(h.
If () = 3 this implies (a) since (h, k) = 1. If () = 1, (a) holds trivially. Part (a),
together with the reciprocity law, gives (b) since k 2 - 3hk == 0 (mod (}k).
o
Note. Theorems 3.8(b) and 3.6(c) show that
s(h, k) = 0 if, and only if, h2 + 1 == 0 (mod k).

Theorem 3.9. The Dedekind sums satisfy the congruence

(39) 12ks(h, k) == (k - l)(k + 2) - 4h(k - 1) + 4 L [k


2hrJ (mod 8).
r<k/2

If k is odd this becomes

(40) 12ks(h,k) == k - 1 + 4 L [k2hrJ (mod 8).


r<k/2

PROOF. From (36) we obtain


k-l [hJ
12ks(h, k) = 2h(k - 1)(2k - 1) - 12r~\ r : - 3k(k - 1)

= -2h(k - 1) + 4hk(k -1) - 12r~/


k-l [h: ]
+ k(k - 1) - 4k(k - 1).

Now we reduce the right member modulo 8. Since 4k(k - 1) == 0 (mod 8)


this gives us

12ks(h, k) == - 2h(k - 1) - 4 r~/ :


k-\ [h ] + k(k - 1) (mod 8)

L [hrJ
== (k - l)(k - 2h) - 4 k-\ - (mod 8)
r= 1 k
r odd

== (k - l)(k - 2h) - 4kL - +4


- 1 [hrJ
L [2hrJ
-k (mod 8).
r= 1 k r<k/2

65
3: The Dedekind eta function

The next to last term is equal to

-4 L [hr]
k-I
- = 4L k-I ((hr))
- - 4 L -hr + 2 L 1
k-I k-I

r=1 k r=1 k r=1 k r=1

= 0 - 2h(k - 1) + 2(k - 1) = (k - 1)(2 - 2h).


Since
(k - l)(k - 2h) + (k - 1)(2 - 2h) = (k - l)(k + 2) - 4h(k - 1)

this proves (39).


When k is odd we have 4h(k - 1) '= 0 (mod 8) and
(k - l)(k + 2) = k 2 + k - 2 '= k - 1 (mod 8)
since k 2 '= 1 (mod 8). Hence (39) implies (40). o
Theorem 3.10. If k = 2)k l where A > 0 and kl is odd, then for odd h 2': 1
we have
(41) 12hks(h, k) '= h2 + k 2 + 1 + 5k - 4k L [2~V] (mod 2)+ 3).
v < h/2

PROOF. Since h is odd we can apply (40) to obtain, after multiplication by k,

12hks(k, h) '= k(h - 1) + 4k L [2~V] (mod 2)+ 3).


v < h/2

By the reciprocity law we have


12hks(h, k) = h2 + k 2 - 3hk + 1- 12hks(k, h)

'= h2 + k2 - 3hk + 1 - k(h - 1) - 4k L [2kV] (mod 2)+3)


v<h/2 h

'= h2 + k 2 + 1 + k - 4hk - 4k L [ ~
2kV] (mod 2)+3).
v <h/2 h
Since h is odd we have 4k(h + 1) '= 0 (mod 2;+3) hence k - 4hk '= 5k
(mod 2;' + 3) and we obtain (41). 0
Finally, we obtain a property of Dedekind sums which plays a central
role in the study of the invariance of modular functions under transforma-
tions of certain subgroups of the modular group. This will be needed in
Chapter 4.
Theorem 3.11. Let q = 3, 5, 7 or 13 and let r = 24/(q - 1). Given integers
a, b, c, d with ad - bc = 1 such that c = c1q, where C 1 > 0, let

b = {s(a, c) - a I;C d} - {s(a, cd - al~ld}


Then rb is an even integer.

66
3.9: Congruence properties of Dedekind sums

PROOF. Taking k = c in Theorem 3.8(b) we find


a+
12ae { s(a, c) - ~ d}
== a 2 + 1 - a(a + d) == -be (mod 8c),

where 8 = (3, c). The same theorem with k = CI = c/q gives, after multi-
plication by q,

12ac{S(a,e l ) - al~ld} == qa 2 + q - qa(a + d) == -qbc (mod 8 I c),

where 8 1 = (3, cd. Note that 81 18 so both congruences hold modulo 8 1 c.


Subtracting the congruences and multiplying by r we find
12acr6 == r(q - l)bc (mod 8 1 c).
But r(q - l)bc = 24bc == 0 (mod 8 1c) so this gives
12acr6 == 0 (mod 8 1 c).
Now (a, c) = 1 since ad - bc = 1. Also, 12e6 is an integer so we can cancel
a in the last congruence to get

(42) 12cr6 == 0 (mod 8 1c).


Next we show that we also have
(43) 12crb == 0 (mod 3c).
Assume first that q > 3. In this case 8 = (3, qcd = (3, cd = 8 1 so (42)
becomes
12cr6 == 0 (mod 8c).
If 8 = 3 this gives (43). But if 8 = 1 then 3,rc so 3,rCI and (38) implies
12cs(a, c) == 0 (mod 3) and 12cs(a, cd == 0 (mod 3). Hence
12cr6 == r(q - l)(a + d) = 24(a + d) == 0 (mod 3),
which, together with (42), implies (43).
Now assume that q = 3 so r = 12. Then 8 = 3 and 81 is 1 or 3. If 81 = 3
we get (43) by the same argument used above, so it remains to treat the case
8 1 = 1. In this case 3,rcl so (38) implies 12c l s(a, cd == 0 (mod 3), hence
12cs(a, cd == 0 (mod 9).

Also,
12c6 = 12cs(a, c) - (a + d) - 12cs(a, cd + 3(a + d)
== 12cs(a, c) + 2(a + d) (mod 9),
so
(44) 12rac6 = 12racs(a, c) + 2r(a 2 + ad) (mod 9).

67
3: The Dedekind eta function

But Theorem 3.8(b) gives us 12acs(a, c) == a 2 + 1 (mod 9) since 3 !c. Hence


(44) becomes
12racb == r(a 2 + 1) + 2ra 2 + 2rad (mod 9)
== 3ra 2 + r + 2r(1 + bc) == 3r + 2rbc == 0 (mod 9)
since r = 12 and 9112c. This shows that
12racb == 0 (mod 9).
Now 3,(a since (a, c) = 1 so we can cancel a to obtain 12rcb == 0 (mod 9)
which, with (42), implies (43).
Our next goal is to show that we also have
(45) 12crb == 0 (mod 24c)
since this implies rb is even and proves the theorem. To prove (45) we treat
separately the cases c odd and c even.
Case 1: c odd. Apply (40) with k = c to obtain

a +
12e { s(a, e) - ~ d}
== e - 1 + 4T(a, c) - (a + d) (mod 8)

where we have written


T(a, e) = L [- .
2av]
v<e/2 C

We only need the fact that T(a, e) is an integer. Applying (40) again with
k = el = c/q and multiplying by q we have

12c{S(a, CI) - al;e~} == c - q + 4qT(a, CI) - q(a + d) (mod 8).

Subtracting the last two congruences and multiplying by r we find


12erb == r(q - 1) + r(q - l)(a + d) == 0 (mod 8)
since r(q - 1) = 24 and 4r == 0 (mod 8). Combining this with (43) we obtain
(45) and the theorem is proved for odd c.
Case 2: c even. Write c = 2Ay with y odd. Now a is odd since (a, e) = 1 so
if a 2:: 1 we can apply Theorem 3.10 with k = c and h = a to obtain

12ac{S(a, c) - a l;ed} == a 2 + c2 + 1

+ 5c - 4eT(c, a) - a(a + d) (mod 2.\+ 3)


== e 2 + 5c - be - 4eT(e, a) (mod 2.\+ 3)
since ad - bc = 1. Similarly,

12ac{S(a, c l ) - al~ld} = CCI + 5c - qbe - 4cT(cI' a) (mod 2.\+ 3).

68
3.10: The Eisenstein series G2 (r)

Subtract. multiply by I' and use the congruence 4('1' == 0 (mod 2' + 3) to obtain
12earb == rcel(q - 1) + r(q - l)bc == 0 (mod 2-<+3).
Since a is odd we can cancel a to obtain
(46) 12erb == 0 (mod 2-<+ 3).
Now (43) states that 12erb == 0 (mod 3 ·2 Ay) which, together with (46)
implies (45) and proves the theorem for a ~ 1.
To prove it for a < 0, write b = b(a) to indicate the dependence on a.
If a' = a + te, where t is an integer, an easy calculation shows that
b(a') - b(a) = t(q - 1)/12 since s(a, c) = s(a', c) and s(a', el) = s(a, cd. There-
fore rb(a') - rb(a) = 2t, an even integer. Choosing t so that a' ~ 1 we know
rb(a') is even by the above argument, so rb(a) is also even. This completes
the proof. 0

3.10 The Eisenstein series G2 (r)


If k is an integer, k ~ 2, and if r E H the Eisenstein series
1
(47) G2k(r) = L
(m.n)*(O.O) (m + nr)
2k

converges absolutely and has the Fourier expansion


2(2ni)2k oc .
(48) G2k(r) = 2,(2k) + (2k _ I)! n~/'2k_l(n)e21tlnr

where, as usual, a<x(n) = Ldln d<X. The cases k = 2 and k = 3 were worked out
in detail in Chapter 1, and the same argument proves (48) for any k ~ 2. If
k = 1 the series in (47) no longer converges absolutely. However, the series
in (48) does converge absolutely and can be used to define the function G 2 (r).

Definition. If r E H we define
+ 2(2ni)2 L a(n)e21tinr.
00

(49) G2(r) = 2,(2)


n= I

If x = e 21tir
the series on the right of (49) is an absolutely convergent
power series for Ixl < 1 so G 2 (r) is analytic in H. This definition also shows
that G 2 (r + 1) = G 2 (r).
Exercises 1 through 5 describe the behavior of G 2 under the other generator
of the modular group. They show that

(50)

a relation which leads to another proof of the functional equation 1]( - l/r) =
(- ir)I/21](r).

69
3: The Dedekind eta function

Exercises for Chapter 3


1. If r E H prove that
, x 1
(51) G 2 (r) = 2((2) + L L 2'
n= - x. m=-x(m+m)
n*O

H illt: Start with Equation (12) of Chapter 1, replace r by liT, where II > 0, and sum
over all n ~ 1.

2. Use the series in (51) to show that

(52) T- 2 G2 - (-1)T
= 2(2) + Loc
m=-xn=-oc(m+nr)
L
x 1 2'

the iterated series in (52) being the same as that in (5 1) except with the order of sum-
mation reversed, Therefore, proving (50) is equivalent to showing that
x oc 1 x x 1 2rri
(53) L L
m=_",'n=_",(m+nr)2
= L L
n=_cx:m=_oc(m+IH)2
--
T

3. (a) Inthegammafunctionintegralf(z) = Ii;' e-'t Z - 1 dt make the change of variable


t = :W, where 01. > 0, to obtain the formula

(54) OI.-zf(z) = {x: e-"Uuz- I dll,

and extend it by analytic continuation to complex 01. with Re(OI.) > 0,


(b) Take z = 2 and 01. = - 2rri(m + nr) in (54) and sum over aliI! ~ 1 to obtain the
relation

I
n=_x(lH+m)2
1 = - 8rr 2 fox cos(2rrmll)9,(II) dtt,
n*O
where
xc
9,(11) = II Le 2 "in,u if II > 0
,,= 1
and

-\
9,(0) = lim 9,(11) = - , '
"-0+ 2mT
4. (a) Use Exercise 3 to deduce that

(55) L
Z

m=-, n=-x
x
L (nT
1
+ /11)2 = f r(t) cos(2rrmt) dr,
n*O

where
x
fir) = L g,(t + k),
k=O

70
Exercises for Chapter 3

(b) The series on the right of (55) is a Fourier series which converges to the value
t{f(O+) + I(I- )}, Show that
_IX
1(0+) = ' + L9,(k)
-2
mT k=l

and that

I( 1-) = L g,(k) = L CJ(I1)e2rri""


k:::; 1 n=1

and then use (55) to obtain (50),

5. (a) Use the product defining II(T) to show that

d
-4rci -log I1(T) = G 2(T),
dT

(b) Show that (50) implies

-d log 11
ciT
(-I)
~
T
d
= -log
tiT
I1(T) Iti
+ - -Iog(
2 dT
-iT),

Integration of this equation gives 11( - liT) = C( - iT)l!2l1(r) for some constant C.
Taking T = i we find C = 1.
6. Derive the reciprocity law for the Dedekind sums s(l!, k) from the transformation
formula for log I1(T) as given in Equation (12),

Exercises 7 and 8 describe properties of the function


r(s)
<I>(er:, (3, s) = (2n)S {((s, er:)F((3, 1 + s) + ((s, 1 - er:)F(1 - (3,1 + s)}
which occurs in the proof of Iseki's formula (Theorem 3,5), The properties
follow from Hurwitz's formula (Theorem 12,6 of [4]) which states that
r(s)
{e- mS F(a, s) + ems F( -a, s)},
'/2 '/2
W - s, a) = (2n)S

7. (a) If 0 < a < I and Re (5) > I, prove that Hurwitz's formula implies

r(1 - 5) '1) 2
F(a, 5) = (2rc)1 s {err'l -s / W- S, a) + err,'( s- 1/2
) W - 5, I - a)l,

(b) Use (a) to show that <1>(1X, fi, 5) can be expressed in terms of Hurwitz zeta functions
by the formula
$(0:, fi, 5) , 2
-"-- = err ,,/ {(Is, 0:)(( -5, I - fJ) + ((5, I - 0:)(( -s,13))
ns)r( -s)
+ e - rris/2 {(( - s, I - fiKIs, I - 0:) + (( - s, fJK(s, 0:))

and deduce that $(0:, fi, s) = $( I - fJ, 0:, - s),


8. This exerCise gives an estimate for the modulus of the function z-S$(o:, fi, s) which
occurs in the integral representation of 1\(0:, fi, s) in the proof of Iseki's formula
(Theorem 3.5),

71
3: The Dedekind eta function

(a) Show that the formula of Exercise 7(b) implies


-1!Z -s .
z-ScD(a, f3, s) = ~.~ {e- rrls / 2[((s, al'( -s, f3) + ((s, I - !XK( -s, I - fJ)]
S SIn 1!S

+ enis/ 2 [((s, aK( -5, I - f3) + ((5, I - aj(( -5, f3)]}.


(b) Forfixedzwithlargzl < 1!/2,chooseb > Osothatlargzl::; 1!/2 - b,andshow
that if 5 = IJ + it where IJ :2: we have-1
Iz-sl = O(elflln/2-b»),

where the constant implied by the O-symbol depends on z.


(c) If 5 = IJ + it where IJ :2: -1 and It 1 :2: I, show that

I (e- nlfl )
Is sin 1!51 = 0 -I-tl- ,

and that

-1
(d) If IJ :2: and It I :2: 1 obtain the estimate 1((5, a)1 = O(ltl') for some c > 0
(see [4], Theorem 12.23) and use (b) to deduce that
Iz-ScD(a, f3, 5)1 = O(ltI2e-Ie-lfl,').
This shows that the integral of z -ScD(~, f3, 5) along the horizontal segments of the
rectangle in Figure 3.2 tends to 0 as T --> Xo.

PROPERTIES OF DEDEKIND SUMS

9. If k :2: I the equation

5(h,k) = rm~k (G))((~))


is meaningful even if h is not relatively prime to k and is sometimes taken as the
definition of Dedekind sums. Using this as the definition of 5(h, k) prove that
5(qh, qk) = 5(h, k) if q > O.
10. If p is prime prove that
p- I

(p + I)s(h, k) = s(ph, k) + I 5(h + mk, pk).


m=O

11. For integers r, h, k with k :2: I prove that we have the finite Fourier expansion

(( -hr)) = - -
I k- I
I .
SIn ~-
21!hrv 1!V
cot -
k 2k ,= I k k

and derive the following expression for Dedekind sums:

I 1!hr 1!r
I
k-I
5(h, k) = - cot - cot-.
4k r= I k k
12. This exercise relates Dedekind sums with the sequence {u(Il)) of Fibonacci numbers
I, 1,2,3,5,8, ... , in which u(1) = u(2) = I and U(11 + 1) = U(I1) + !t(1I - I).
(a) If h = u(211) and k = u(211 + I) prove that 5(h, k) = O.
(b) If h = u(211 - I) and k = u(211) prove that 12hks(h, k) = h 2 + k 2 - 3hk + 1.

72
Exercises for Chapter 3

FORMULAS FOR EVALUATING DEDEKIND SUMS

The following exercises give a number of formulas for evaluating Dedekind


sums in closed form in special cases. Assume throughout that (h, k) = 1,
k ~ 1, h ~ 1.
13. If k == I' (mod 11) prove that the reciprocity law implies
12I1ks(h, k) = k 2 - {12s(l', 11) + 3}l1k + h2 + 1.

Use the result of Exercise 13 to deduce the following formulas:


14. If k == I (mod h) then 12hks(h, k) = (k - I)(k - h2 - I).
15. If k == 2 (mod h) then 12hks(h, k) = (k - 2)(k - 1(h 2 + I)).
16. If k == -I (mod h) then 12hks(h, k) = k 2 + (h 2 - 6h + 2)k + h2 + 1.

17. If k == I' (mod h) and if h == t (mod 1') where I' ;::: 1 and t = ± I, then
h2 - t(1' - 1)(1' - 2)h + 1'2 + 1
12hk5(h,k) = k 2 - k + h2 + 1.
I'

This formula includes those of Exercises 14 and 15 as special cases.


18. Show that the formula of Exercise 17 determines 5(h, k) completely when I' = 3
and when I' = 4.
19. If k == 5 (mod 11) and if h == t (mod 5), where t = ± 1 or ± 2, then
2 112 + 4t(t - 2)(t + 2)h + 26 2
12hk 5(h, k) = k - 5 k + h + 1.
20. Assume 0 < h < k and let 1'0, 1'1' ... , I'n+ 1 denote the sequence of remainders in the
Euclidean algorithm for calculating the gcd (h, k), so that

1'0 = k, 1'1 = h. I'j+ 1 == I'j_ 1 (mod 1'), 1:$ I'j+ 1 < I'j' I'n+ 1 = 1.
Prove that

s(lI,k)= ~
12j~1
nf {(-I)j+1 r/ +.r j _ 1

'jrj_1
2
+ I} _ (_I)" + I.
8
This also expresses 5(h, k) as a finite sum, but with fewer terms than the sum in the
original definition.

73
4
Congruences for the coefficients
of the modular function}

4.1 Introduction
The function j(,) = 12 3 J(,) has a Fourier expansion of the form
100.
j(,) = - + L c(n)xn, (x =e 21t1t )
x n:O
where the coefficients c(n) are integers. At the end of Chapter 1 we mentioned
a number of congruences involving these integers. This chapter shows how
some of these congruences are obtained. Specifically we will prove that
c(2n) == 0 (mod 211),
c(3n) == 0 (mod 35 ),
c(5n) == 0 (mod 52),
c(7n) == 0 (mod 7).
The method used to obtain these congruences can be illustrated for the
modulus 52. We consider the function
00

15(') = L c(5n)xn
n:l

obtained by extracting every fifth coefficient in the Fourier expansion of j.


Then we show that there is an identity of the form
(1)
where the ai are integers and <1>(,) has a power series expansion in x = e 2 "it
with integer coefficients. By equating coefficients in (1) we see that each
coefficient of 15(') is divisible by 25.
Success in this method depends on showing that such identities exist.
How are they obtained?

74
4.2: The subgroup ro(q)

Theorem 2.8 tells us that every modular function f is a rational function


of j. Sometimes this rational function is a polynomial in j with integer co-
efficients, giving us an identity of the form
f("C) = ad("C) + a2/(') + ... + ak/(')'
However, the function f5(') is not invariant under all transformations of
the modular group r and cannot be so expressed in terms of j(')' But we
shall find that f5{'r) is invariant under the transformations of a certain
subgroup of r, and the general theory enables us to express f5(') as a poly-
nomial in another basic function <1>(,) which plays the same role as j(,)
relative to this subgroup. This representation leads to an identity such as
(1) and hence to the desired congruence property.

The subgroup in question is the set of all unimodular matrices (: ;)


with c == 0 (mod 5). More generally we shall consider those matrices in r
with c == 0 (mod q), where q is a prime or a power of a prime.

4.2 The subgroup r o(q)


Definition. If q is any positive integer we define r o(q) to be the set of all

matrices G~) in r with c == 0 (mod q).

It is easy to verify that r o(q) is a subgroup of r. The next theorem gives a


way of representing each element of r in terms of elements of r o(p) when
p is prime. In the language of group theory it shows that r o(p) is of finite
index in r.

Theorem 4.1. Let S, = - 1/, and T, = , + 1 be the generators of the full


modular group r, and let p be any prime. Then for every V in r, V ¢ r o(P),
there exists an element P in r o(P) and an integer k, 0 :::; k < p, such that
V = PSTk.

PROOF. Given V = (~ ~) where C f= 0 (mod p). We wish to find

P = (: !} with c == 0 (mod p),

and an integer k, 0 :::; k < p, such that

75
4: Congruences for the coefficients of the modular function j

~) to get

e~) (~ ~)G -~r (~ ~)


All matrices here are nonsingular so we can solve for (:

= 1 = (_ ~ ~) = (~~ =~ ~).
Choosp. k to be that solution of the congruence
kC == D (mod p) with 0:::; k < p.
This is possible since C¢:O (mod p). Now take
c = kC - D, a = kA - B, b = A, d = C.
Then c == 0 (mod p) so PEro(P). This completes the proof. o
4.3 Fundamental region of r o(p)
As usual we write Sr =- 1/r and Tr = r + 1, and let Rr denote the funda-
mental region of r.

Theorem 4.2. For any prime p the set


p-l
Rr u USTk(Rr)
k=O

is a fundamental region of the subgroup r o(P).

This theorem is illustrated for p = 3 in Figure 4.1.

PROOF. Let R denote the set


p-l
R = Rr U U STk(R r )·
k=O

We will prove

(i) if r E H, there is a V in r o(P) such that Vr belongs to the closure of R, and


(ii) no two distinct points of R are equivalent under r o(P).

To prove (i), choose r in H, choose r 1 in the closure of Rr and choose


A in r such that Ar = r 1 . Then by Theorem 4.1 we can write
A- 1 = PW

where PEro(p) and W = lor W = ST k for some k, 0:::; k:::; p - 1. Then


P = A - 1 W - 1 and P - 1 = W A. Let V = P - 1. Then V E r o(p) and
Vr = WAr = Wr 1 .

Since W =I or W = ST k , this proves (i).


76
4.3: Fundamental region of r o(p)

I T

-I -2
I
o
Figure 4.1 Fundamental region for r 0(3)

Next we prove (ii). Suppose tiE R, t 2 E Rand Vt I = t 2 for some V in


r o(P). We will prove that t I
= t 2' There are three cases to consider:

(a) tl ERr, t2 ERr. In this case tl = t2 since V E r.


(b) tl ERr, t2 E STk(R r ).
(c) tl ESTkl(R r ), t2 EST k 2(R r )·

In case (b), t2 = STkt3 where t3 ERr. The equation


implies

V = ST k = (~ -1)
k .

This contradicts the fact that V E r o(p).


Finally, consider case (c). In this case
and
where tl' and t2' are in R r . Since Vt l = t2 we have VSTk't l ' = ST k2t / so
VSTkl = ST k2,

77
4: Congruences for the coefficients of the modular function i

Since VEro(p) this requires k2 == kl (mod pl. But both k l , k2 are in the
interval [0, p - 1], so k 2 = k I' Therefore
V = STOS = S2 = I

and r I = r 2' This completes the proof. o


We mention (without proof) the following theorem of Rademacher [34]
concerning the generators of r o(p). (This theorem is not needed in the
later work.)

Theorem 4.3. For any prime p> 3 the subgroup ro(p) has 2[p/12] +3
generators and they may be selected from the following elements,'

where Tr =r + 1, Sr = -l/r, and

V, - STkST-k'S _
k - -
(k' I)
-(kk' + I) -k'

where kk' == - 1 (mod p). The subgroup r 0(2) has generators T and VI;
the subgroup r 0(3) has generators T and V2 .

Here is a short table of generators:

p 2 3 5 7 11 13 17 19

Generators: T T T T T T T T
VI V2 V2 V3 V4 V4 V4 Vs
V3 Vs V6 Vs V7 VB
VB V9 VI2
VIO VI3 VI3

4.4 Functions automorphic Uflder the


subgroup r o(P)
We recall that a modular function f is one which has the following three
properties:

(a) f is meromorphic in the upper half-plane H.


(b) f(Ar) = f(r) for every transformation A in the modular group r.
(c) The Fourier expansion of f has the form

L
00

f(r) = ane27tin,.
n= -m

78
4.4: Functions automorphic under the subgroup r o(p)

If property (b) is replaced by


(b') f(Vr) = f(T) for every transformation V in ro(p),
then f is said to be automorphic under the subgroup r o(p). We also say that
f belongs to r o(p).
The next theorem shows that the only bounded functions belonging to
r o(p) are constants.
Theorem 4.4. If f is automorphic under r o(p) and bounded in H, then f is
constant.
r
PROOF.
P in r o(P) and an integer k, °: ;
According to Theorem 4.1, for every V in
k ::; p, such that
there exists an element

V = PA b
where Ak = ST k if k < p, and Ap = I. For each k = 0, 1, ... , p, let
r k = {PAk:PEro(p)}·
Each set r k is called a right coset of r o(P). Choose an element fk from the
coset r k and define a function fk on H by the equation
fk(T) = f(fk T).
Note that fp(T) = f(PT) = f(T) since PEr o(P) and f is automorphic under
ro(P). The function value h(T) does not depend on which element fk was
chosen from the coset r k because
fk(T) = f(fk T) = f(PA kT) = f(Ak T)
and the element Ak is the same for all members of the coset r k.
How doesfk behave under the transformations of the full modular group?
If V E r then
fk(VT) = f(fk VT).
Now fk V E
such that
r so there is an element Q in r o(P) and an integer m, °: ; m ::; p,

Therefore we have

Moreover, as k runs through the integers 0, 1, 2, ... , p so does m. In other


words, there is a permutation (J of {O, 1,2, ... , p} such that
fk(VT) = f<r(k)(T) for each k = 0, 1, ... ,p.
Now choose a fixed w in H and let
p
<p(T) = fl {fk(T) - f(w)}.
k~O

79
4: Congruences for the coefficients of the modular function.i

Then if V E r we have

cp(Vr) = n {h(Vr) -
p

k=O
f(w)} = n {f,1(kir) -
p

k=O
f(w)} = cp(r),

so cp is automorphic under the full group r. Now cp is bounded in H (since


each fk is). Therefore, cp omits some value hence, by Theorem 2.5, cp is
constant, so cp(r) = cp(w) for all r. But cp(w) = 0 because

n {fk(W) -
p
cp(w) = f(w)}
k=O
and the factor with k = p vanishes since fp = f Therefore cp(r) = 0 for all r.
Now take r = i. Then

n {h(O -
p
o= f(w)}
k=O
hence some factor is o. In other words, f(w) = h(O for some k. But w was
arbitrary so f can take only the values fo(i), ... ,fp(i). This implies that f is
constant. 0

4.5 Construction of functions belonging


to r o(p)
This section shows how to construct functions automorphic under the
subgroup r o(p) from given functions automorphic under r.

Theorem 4.5. If f is automorphic under r and if p is prime, let

fir) = ! Pf f(r + A.).


p ),=0 P
Thenfp is automorphic under r o(p). Moreover, iff has the Fourier expansion

= L a(n)e21tint
00

f(r)
n= -m

then fp has the Fourier expansion

L
00

fp(r) = a(np)e21tint.
n= -[m/pl

PROOF. First we prove the statement concerning Fourier expansions. We have


1 p-l
fp(r) = - L L
00
a(n)e 21t ;n(tH)/P
p ),=0 n=-m

1
L a(n)e 21t ;nt/ p L e 21t ;n)./p.
00 p-l
= -
p n=-m ),=0

80
4.5: Construction of functions belonging to r o(p)

But
if p,{'n
if pin
so

L L
00 00

flr) = a(n)e2nint/P = a(np)e2nint.


n= -m n= -[m/p]
pin

This shows that fp has the proper behavior at the point r = ioo. Also,
fp is clearly meromorphic in H because it is a linear combination offunctions
meromorphic in H.
Next we must show that
f/Vr) = fp(r) whenever V E r o(P).
F or this we use a lemma.
Lemma 1. If V E ro(P) and if 0 :$ .Ie :$ p - 1, let T).! = (r + .Ie)/p. Then
there exists an integer f1., 0 :$ f1. :$ P - 1 and a transformation in a;,
r O(P2) such that

Moreover, as .Ie runs through a complete residue system modulo p, so does f1..

First we use the lemma to complete the proof of Theorem 4.5, then we
return to the proof of the lemma.
If V E r o(P) we have

Now we use the lemma to write the last sum as

This proves that fp is invariant under all transformations in r o(P), so fp is


automorphic under r o(P). 0

PROOF OF LEMMA 1. Let V = (: ~), where c == 0 (mod p), and let .Ie be

given, 0 :$ .Ie :$ P - 1. We are to find an integer f1., 0 :$ f1. :$ P - 1 and a

transformation WI' = (~ ~) such that WI' E r O(p2) and


T)Y = a;, Til"
81
4: Congruences for the coefficients of the modular function j

Since T). = (~ ;) we must satisfy the matrix equation

or

(a + AC b + Ad) = (A All + BP)


pc pd C CIl + Dp
with C == 0 (mod p2). Equating entries we must satisfy the relations

{
A = a + AC
(2)
C = pc
(3) {All + Bp = b + Ad
CIl + Dp = pd
with
and AD - BC = 1.
Now (2) determines A and C. Since pic, we have C == 0 (mod p2). Substi-
tuting these values in (3) we must satisfy

(4) {
(a + AC)1l + Bp = b + Ad
CPIl + Dp = pd.
Choose Il to be that solution of the congruence
Ila == b + Ad (mod p)
which lies in the interval 0 S Il S P - 1. This is possible because ad - bc = 1
and pic imply p,( a. Note that distinct values of A mod p give rise to distinct
values of Il mod p. Then, since p Ie we have
Ila + IlAC == b + Ad (mod p)
or
(a + AC)1l == b + Ad (mod p).
Therefore there is an integer B such that
(a + AC)1l + Bp = b + Ad.
Therefore the first relation in (4) is satisfied. The second relation requires
D = d - ell. Thus, we have found integers II, A, B, C, D such that

Clearly AD - BC = 1 since all matrices in this equation have determinant


1 or p. This completes the proof of the lemma. 0
82
4.6: The behavior off~ under the generators of r

4.6 The behavior of Ip under the generators


ofr
Let Tr = r + 1 and Sr = - l/r be the generators of r. Since T E r o(p) we
have fiT!) = fir). The next theorem gives a companion result for fp(Sr).
Theorem 4.6. Iff is automorphic under r and if p is prime, then

f, (-
P
!)
r
= f, (r) + !p f(pr) - !P f(~).
P P
To prove this we need another lemma.

Lemma 2. Let Tl r = (r + A)/p. Thenfor each A in the interval 1 ::; A ::; p - 1


there exists an integer J1 in the same interval and a transformation V in
r o(p) such that
TlS = VT/l"
Moreover, as A runs through the numbers 1,2, ... ,p - 1, so does J1.

PROOF OF LEMMA 2. We wish to find G~) in r o(p) such that

or
-1)o = (a
c
aJ1
CJ1
+ bP).
+ dp
Take a = A, C = P and let J1 be that solution of the congruence
AJ1 == -1 (mod p)
in the interval 1 ::; J1 ::; p - 1. This solution is unique and J1 runs through a
reduced residue system mod p with A. Choose b to be that integer such that
aJ1 + bp = -1, and take d = - J1. Then CJ1 + dp = 0 and the proof is
complete. 0
PROOF OF THEOREM 4.6. We have

( 1)
pfp - - = L f (Sr+A)
p-!
--' =
(sr) + L
f -
p-!
f(TlSr)
r l=O p P l= 1

= f(- ~) + Pi! f(VTIl r) = f(rp) + Pi! f(TIL r) - f(~)


rp Il=! Il=O P

= f(rp) + pfi r ) - fG} 0

83
4: Congruences for the coefficients of the modular function;

4.7 The function cp(r) = ~(qr)/~(r)

The number of poles of an automorphic function in the closure of its fun-


damental region is called its valence. A function is called univalent on a
subgroup G if it is automorphic under G and has valence 1. Such a function
plays the same role in G that J plays in the full group r.
It can be shown (using Riemann surfaces) that univalent functions exist
on G if and only if the genus of the fundamental region RG is zero. [This is
the topological genus of the surface obtained by identifying congruent edges
of RG • For example, the genus of Rr is zero because Rr is topologically
equivalent to a sphere when its congruent edges are identified.]
Our next goal is to construct a univalent function on the subgroup
r o(p) whenever the genus of r o(p) is zero. This will be done with the aid of
the discriminant ~ = g2 3 - 27g/.
We recall that ~(,) is periodic with period 1 and has the Fourier expansion
(Theorem 1.19)

L ,(n)e 2 "int
00

~(,) = (2n)12
n= 1

where the T(n) are integers with T(1) = 1 and T(2) = - 24. However, ~(,) is
not invariant under all transformations of r. In fact we have

In particular,

~(, + 1) = ~(T) and ~( ~ 1) = ,12~(,).


Even though ~(,) is not invariant under r it can be used to construct functions
automorphic under the subgroup r o(q) for each integer q.

Theorem 4.7. For a fixed integer q, let


~(q,)
rp(T) = - - if, E H.
~(T)

Then rp is automorphic under r o(q). Moreover, the Fourier expansion ()f rp


has theform

where the b n are integers and x = e 2 "it.


84
4.7: The function qJ(r) = ~(qr)/~(r)

PROOF. First we obtain the Fourier expansion. We have

~(r) = (2lt)I2Jlr(n)xn = (2lt)I2X{1 + n~Ir(n + l)x n }

where x = e 21tit • Hence

so

<p(r) = ~(qr) =
~(r)
xq-I 1 + L:'=
1 r(n + l)x nq
1 +L:'=Ir(n+ l)x n
= X q- I(1 + f b xn)
n=I n

where the bn are integers.


Now <p is clearly meromorphic in H, and we will prove next that <p is
invariant under r o(q).
If V = (: !) E ro(q) then C = Clq for some integer CI' Hence

~(Vr) = (cr + d)I2~(r) = (clqr + d)I2~(r).

On the other hand,


ar + b a(qr) + bq
= = ( )
qVr q--d
cr + CI qr +d = W(qr),
where

W=(a bq ).
CI d
But WE r because det W = ad - bClq = ad - bc = 1. Hence
~(qVr) = ~(W(qr)) = (cI(qr) + d)I2~(qr),
so
~(qVr) (clqr + d)I2~(qr)
<p(Vr) = ~(Vr) = (clqr + d)I2~(r) = <per).

This completes the proof. D

Now <p has a zero of order q ~ 1 at 00 and no further zeros in H. Next we


show that <p does not vanish at the vertex r = 0 of the fundamental region
of r o(q). In fact, we show that <per) -. 00 as r -. O.

Theorem 4.8. If r E H we have

<p ((ji'-1) 1
= qI2<p(r)'

Hence <per) -. 00 as r -. O.

85
4: Congruences for the coefficients of the modular function j

PROOF. Since ~(-I/r) = r12~(r) we have

~( _ :r) = (qr)12~(qr)
so

4.8 The univalent function <1>(1')


The function cp has a zero of order q - 1 at 00 and no further zeros so its
valence is q - 1. We seek a univalent function automorphic under r o(q)
and this suggests that we consider cp2, where ex = l/(q - 1). The Fourier
expansion of cp~ need not have integer coefficients, since

cp~(r) = X(I + JlbnxnJ.


On the other hand we have the product representation

~(r) = (2n)12x TI (1
00

- xn)24
n= 1
so

where the coefficients dq(n) are integers. Therefore if ex = I/(q - 1) we have


00 )24~
(5) cp~(r) = x ( 1 + n~l dq(n)x n

and the Fourier series for cp~(r) will certainly have integer coefficients if
24ex is an integer, that is, if q - 1 divides 24. This occurs when q = 2, 3, 4, 5,
7, 9, 13, and 25.

Definition. If q - 1 divides 24 let ex = 1/(q - 1) and r = 24ex. We define the


function <I> by the relations

<I>(r) = cp~(r) = (~~7Y = (ry~rr~J·


86
4.9: Invariance of <1>(r) under transformations of r o(q)

The function <1> so defined is analytic and nonzero in H. The Fourier series
for <1> in (5) shows that <1> has a first order zero at 00 and that
1 1
<1>(T) = ~ + lex),
where lex) is a power series in x with integer coefficients.
Since cp is automorphic under ro(q) we have cp(VT) = cp(T) for every
element V of r o(q). Hence, extracting roots of order q - 1, we have
<1>(VT) = c:<1>(T)
where c: q - 1 = 1. The next theorem shows that, in fact, c: = 1 whenever
24/(q - 1) is an even integer and q is prime. This occurs when q = 2, 3, 5, 7,
and 13. For these values of q the function <1> is automorphic under ro(q).

4.9 Invariance of <l>(r) under transformations


of r o(q)
The properties of Dedekind sums proved in the foregoing chapter lead to a
simple proof of the in variance of the univalent function <1>(T).

Theorem 4.9. Let q = 2,3,5,7,01' 13, and let I' = 24/(q - 1). Then thefimction

(6) <1>( T) = (1]( qT ))Y


I](T)

is automorphic under the subgroup r o(q).


PROOF.Ifq = 2wehavel' = 24 and <1>(T) = ~(qT)/~(T).Inthiscasethetheorem
was already proved in Theorem 4.7. Therefore we shall assume that q ;;::: 3.
Let V = (: ~) be any element of r o(q). Then ad - bc = 1 and
c == 0 (mod q). We can suppose that c ;;::: O. If c = 0 then V is a power of
the translation TT = T + 1, and since I](T + 1) = e1ti!121](T) we find

<1>(T + 1) = (l](qT + q))Y = e 1tiY (Q-l)!12<1>(T) = <1>(T).


I](T + I}
Therefore we can assume that c > 0 and that c = Clq, where Cl > O.
Dedekind's functional equation for I](T) gives us

(7) I](VT) = c:(V){ -iter + d)}I!21](T}


where

(8)

87
4: Congruences for the coefficients of the modular function.i

We also have
a(qr) + bq )
l1(qVr) = 11( cl(qr) + d = I1(Vl qr)

where

v
I
= (ac i
bdq) .

Since VI E r we have

which, together with (7), gives us

But (8) shows that (r.(Vd/r.( v))r = e - "irb, where

. = {a~
(j
+ d + s(-d,c)} - {a12cI
+ d + s(-d,cd } .

Since ad - bc = 1 we have ad == 1 (mod c) and ad == 1 (mod CI) so


s( -d, c) = -s(a, c) and s( -d, CI) = -s(a, cd, and Theorem 3.11 shows that
r~ is an even integer. Therefore e-"ird = 1 and <I>(Vr) = <I>(r). 0

4.10 The function j p expressed as a


polynomial in <I>
If p is prime and if f is automorphic under r, we have shown that the
function

fir) = ~
p
Pf f(r + A)
).=0 P
is automorphic under r o(p), and its Fourier coefficients consist of every pth
coefficient of f To obtain divisibility properties of the coefficients of j p(r)
we shall express j p as a polynomial in the function <1>.
In deriving the differential equation for the Weierstrass f,J function we
formed a linear combination of f,J, f,J2 and f,J3 which gave a principal part
near z = 0 equal to that of [f,J'(z)Y The procedure here is analogous.
Both functions j p and <I> have a pole at the vertex r = 0 of the fundamental
region of r o(p). We form a linear combination of powers of <I> to obtain a
principal part equal to that of j P'

88
4.10: The function j p expressed as a polynomial in <ll

To obtain the order of the pole of jlr) at r = 0 we use Theorem 4.6


which gives us the relation

j (-
p
!)
r
= j (r)
P
+ !p j(pr) - !P j(::')
P

valid for prime p. Replacing r by pr in this formula we obtain

Theorem 4.10. If p is prime and r E H then

j (-
P
~)
pr
= j (pr)
P
+ !p j(p2r) - !P j(r).
Hence ifx = e 2nir we have the Fourier expansion

pj
p
(-~)
pr
= x- p2 _ x- 1 + I(x),
where I(x) is a power series in x with integer coefficients.
PROOF. We have
j(r) = x- 1 + c(O) + c(l)x + c(2)x 2 + ... ,
jp(r) = c(O) + c(p)x + C(2p)X2 + ... ,
pjp(pr) = pc(O) + pc(p)xP + pC(2p)X2p + ... ,
and

so

pjp( - ;r) = pjp(pr) + j(p2r) - j(r)

= x- p2 _ x- 1 + I(x). o
Now we can express jp as a polynomial in <1>.

Theorem 4.11. Assume p = 2, 3, 5, 7 or 13, and let


"I(pr
<1>(r) = ( "I(r)
»)r, where r =
24
p _ 1.

Then there exist integers a 1, ... , ap2 such that

PROOF. By Theorem 4.10 we have

pjp - pr( 1) = x _p2 - x -1 + J(x),

89
4: Congruences for the coefficients of the modular function j

and, since 12l.( = 1'/2, Theorem 4.8 gives us

pr/2C1>(_ ~) = _1_ = X-I + I(x).


pT CI>(T)
Let IjJ(T) = pr!2C1>( -1/(pT». Then the difference

pj (-
P
~)
pT
- {1jJ(T)}p2

has a pole of order :s; p2 - 1 at x = 0, and the Laurent expansion near x = 0


has integer coefficients. Hence there is an integer b l such that

pj ( 1)
P
- -
pT
- {1jJ(T)}P 2 - b l {1jJ(T)}P , - I

has a pole of order :s;p2 - 2 at x = 0, and the Laurent expansion near x = 0


has integer coefficients. In p2 steps we arrive at a function

f(-~)
pT
= pj (-~)
pTP
- {1jJ(T)}P' -bdljJ(TW'-1 - ... - b P'- I IjJ(T)

which is analytic at x = 0 and has a power series expansion with integer


coefficients. Moreover, all the numbers b l , . . . , bp'-I are integers. Replacing
T by - 1/(pT) we obtain

f(T) = pjp(T) - {pr/2C1>(TW' - b l {pr/2C1>(TW'-1 - ... - bp'-I {pr/2C1>(T)}.


Now f(T) is automorphic under r o(p) and analytic at each point T in H.
The functionfis also analytic at the vertex T = 0 (by construction). Therefore
f is bounded in H so f is constant. But this constant is pc(O) since CI>(T)
vanishes at 00. Thus we find
pjp(T) = {pr/2C1>(T)}p' + b l {pr/2C1>(TW'-1 + ... + bp'-I {pr/2C1>(T)} + pc(O)
so jiT) is expressible as indicated in (9). 0
Theorem 4.12. The coefficients in the Fourier expansion of j(T) satisfy the
following congruences.'
c(2n) == 0 (mod 2")
c(3n) == 0 (mod 35 )
c(5n) == 0 (mod 52)
c(7n) == 0 (mod 7).
PROOF. The previous theorem shows that for p = 2, 3, 5, 7 and 13 we have
c(pn) == 0 (mod p(r/2)-'),
where I' = 24/(p - I). Therefore we simply compute (1'/2) - 1 to obtain the
stated congruences. Note that (1'/2) - 1 = 0 when p = 13 so we get a trivial
congruence in this case. 0

90
Exercises for Chapter 4

N ate. By repeated application of the foregoing ideas Lehner [24] derived


the following more general congruences, valid for II. ;::: 1:
e(2 a n) == 0 (mod 2 3d 8)
eWn) == 0 (mod 3 2d 3)
c(san) == 0 (mod sa + I)
c(7 a n) == 0 (mod 7a ).

Since it is known that e(13) is not divisible by 13, congruences of the above
type cannot exist for 13. In 19S8 Morris Newman [30J found congruences
of a different kind for 13. He showed that

e(13np) + c(13n)e(13p) + p-I e( p


13n) == 0 (mod 13),

where p - I P == 1 (mod 13) and e(x) = 0 if x is not an integer. The congruences


of Lehner and Newman were generalized by Atkin and O'Brien [SJ in 1967.

Exercises for Chapter 4


1. This exercise relates the Dedekind function 17(r) to the Jacobi theta function 9(r)
defined on H by the equation
or: a:
:j(T) = I + 2 L e,in" = L e'in".
n=1 n=-:x:

The definition shows that .9 is analytic in H and periodic with period 2.


Jacobi's triple product identity (Theorem 14.6 in [4]) states that
oc a:
f1 (1 - x2n)(1 + X 2n - 1z2 )(1 + X 2n - 1 Z- 2 ) = L x m'z2m
n=1 m=-x

if z # 0 and Ix I < 1.
(a) Show that x and z can be chosen to give the product representation
oc
;1(r) = f1 (I - e 2 'in')(1 + e(2n-l)'i')2.
n=1

This implies that ,9(T) is never zero in H.


(b) If T E H prove that

(
.1(T) =
~2(' : 1)
.
~(r + I)
(c) Prove that:j( -I/T) = (-ir)I/2:1(r).
Hint: If Sr = -I/T, find elements A and B of r such that

sr; 1 A(r : I)
= and Sr + 1 = B(r + I).

91
4: Congruences for the coefficients of the modular function j

2. Let G denote the subgroup of r generated by the transformations Sand T2, where
Sr = - l/r and Tr = r + 1.

(a) If (: !) E G prove that a == d (mod 2) and b == c (mod 2).


(b) If V E G prove that there exist elements A and B of r such that

vr 2+ 1=A(r; 1) and Vr + 1= B(r + 1).

(c) If C!) E G and c > 0 prove that

I) ( cr +
ar b)
+ d = e(a, b, c, d){ - i(cr + d)} 1/21)( r),

where Iera, b, c, d) I = 1. Express e(a, b, c, d) in terms of Dedekind sums.

Exercises 3 through 8 outline a proof (due to Mordell [28]) of the multiplica-


tivity of Ramanujan's function r(n). We recall that

L r(n)e 21tinr = (27!)-12~(r) = e 21tir f1 (1


00 00

_ e21timr)24.
n = 1 m=l
3. Let p be a prime and let k be an integer, 1 :0; k :0; P - 1. Show that there exists an
integer h such that

rl2d(r ; h) = d(krp~ 1)
and that h runs through a reduced residue system mod p with k.
4. If p is a prime, define

Fp(r) = p11d(pr) Ld
+ -1 p-I (r + k)
-- .
P k=O P
Prove that:

(-1)
(b) Fp -r- = r 12 Fir).

Note: Exercise 3 will be helpful for part (b).


5. Prove that Fir) = r(p)d(r), where r(p) is Ramanujan's function.
6. Use Exercises 4 and 5 to deduce the formulas
(a) r(p"+I) = r(p)r(p") - p11r(p"-I) for n;::: 1.
(b) r(p'n) = r(p)r(p·-I n) - p 11 r(p·-2 n) for ct ;::: 2 and (n, p) = 1.
7. If ct is an integer, ct ;::: 0, and if (n, p) = 1, let
g(ct) = r(p'n) - r(p·)r(n).
Show that g(ct + 1) is a linear combination of g(ct) and g(ct - 1) for ct ;::: 2 and deduce
that g(ct) = 0 for all ct.

92
Exercises for Chapter 4

8. Prove that

r(m)r(n) = L dllr(m~).
dilm.n) d
In particular, when (m, n) = 1 this implies r(m)r(n) = r(mn).
9. If t E H and x = e 2 • iT prove that

{504JOO"s(n)xnf = {j(t)- 12 3 }J 1
r(I1)X n,

where 0"5(0) = - 1/504. Equate coefficients of xn to obtain the identity


n n-1
(504)2 L O"s(k)O"s(l1- k) = t(1l + I) - 984r(n) + L c(k)r(1l - k).
k~O k~ 1

10. Use Exercise 9 together with Exercise 10 of Chapter 6 to prove that


65520 n- I
- - {O"ll(n) - t(Il)} = t(n + I) + 24r(ll) + L c(k)t(n - k).
691 k~1

This formula, due to Lehmer [20], can be used to determine the coefficients c(1l)
recursively in terms of t(n). Since the right member is an integer. the formula also
implies Ramanujan's remarkable congruence

t(ll) == O"l1(n) (mod 691).

93
5
Rademacher's series for
the partition function

5.1 Introduction
The unrestricted partition function p(n) counts the number of ways a positive
integer n can be expressed as a sum of positive integers ::; n. The number of
summands is unrestricted, repetition is allowed, and the order of the sum-
mands is not taken into account.
The partition function is generated by Euler's infinite product

1
f1 L p(n)x",
00 00

(1) F(x) = --m =


m=ll-x "=0

where p(O) = 1. Both the product and series converge absolutely and repre-
sent the analytic function F in the unit disk Ix I < 1. A proof of (1) and other
elementary properties of p(n) can be found in Chapter 14 of [4]. This chapter
is concerned with the behavior of p(n) for large n.
The partition function p(n) satisfies the asymptotic relation

eKJR
p(n) '" Ii as n -+ 00,
4ny 3

where K = n(2/3)1/2. This was first discovered by Hardy and Ramanujan [13]
in 1918 and, independently, by J. V. Uspensky [52] in 1920. Hardy and
Ramanujan proved more. They obtained a remarkable asymptotic formula
of the form

(2) p(n) = L Pk(n) + O(n- 1/ 4 ),


k<a.../ii

94
5.2: The plan of the proof

where (J. is a constant and P 1(n) is the dominant term, asymptotic to


eKJiI/(4nJ3). The terms P 2 (n), P 3(n), ... are of similar type but with smaller
constants in place of K in the exponential. Since p(n) is an integer the finite
sum on the right of (2) gives p(n) exactly when n is large enough to insure
that the error term is less than 1/2. This is a rare example of a formula which
is both asymptotic and exact. As is often the case with asymptotic formulas
of this type, the infinite sum

(3)

diverges for each n. The divergence of (3) was shown by D. H. Lehmer [21]
in 1937.
Hans Rademacher, while preparing lecture notes in 1937 on the work of
Hardy and Ramanujan, made a small change in the analysis which resulted
in slightly different terms Rk(n) in place ofthe Pk(n) in (2). This had a profound
effect on the final result since, instead of (2), Rademacher obtained a con"
vergent series,
00

(4) p(n) = L Rk(n).


k= 1

The exact form of the Rademacher terms Rk(n) is described below in Theorem
5.10. Rademacher [35] also showed that the remainder after N terms is
O(n - 1/4) when N is of order .jn, in agreement with (2).
This chapter is devoted to a proof of Rademacher's exact formula for
p(n). The proof is of special interest because it represents one of the crowning
achievements of the so-called "circle method" of Hardy, Ramanujan and
Littlewood which has been highly successful in many asymptotic problems
of additive number theory. The proof also displays a marvelous application
of Dedekind's modular function I1(r).

5.2 The plan of the proof


This section gives a rough sketch of the proof. The starting point is Euler's
formula (1) which implies

F(x) =
n+ 1
~
L...,
p(k)x k
n+ 1
'fO
1 < IX I < 1,
X k=O X

for each n ::::: O. The last series is the Laurent expansion of F(x)/x n + 1 in the
punctured disk 0 < Ix I < 1. This function has a pole at x = 0 with residue
p(n) so by Cauchy's residue theorem we have

p(n)
1
= -2'
m
i F(x)
Ii'TI dx,
eX

95
5: Rademacher's series for the partition function

where C is any positively oriented simple closed contour which lies inside
the unit circle and encloses the origin. The basic idea of the circle method is
to choose a contour C which lies near the singularities of the function F(x).
The factors in the product defining F(x) vanish whenever x = 1, x 2 = 1,
x 3 = 1, etc., so each root of unity is a singularity of F(x). The circle method
chooses a circular contour C of radius nearly 1 and divides C into arcs
Ch,k lying near the roots of unity e21tih/k, where °
:$; h < k, (h, k) = 1, and
k = 1,2, ... , N. The integral along C can be written as a finite sum of integrals
along these arcs,

On each arc Ch,k the function F(x) in the integrand is replaced by an elemen-
tary function t/lh.k(X) which has essentially the same behavior as F near the
singularity e21tih/k. This elementary function t/lh,k arises naturally from the
functional equation satisfied by the Dedekind eta function I'/(r). The functions
F and 1'/ are related by the equation

F( e 21tit ) = e 1tit/ 12 /1'/( r),


and the functional equation for 1'/ gives a formula which describes the behavior
of F near each singularity e21tih/k. The replacement of F by t/lh,k introduces
an error which needs to be estimated. The integrals of the t/lh,k along Ch,k are
then evaluated, and their sum over h produces the term Rk(n) in Rademacher's
series.
In 1943 Rademacher [38] modified the circle method by replacing the
circular contour C by another contour in the r-plane, where x = e 21tit . This
new path of integration simplifies the estimates that need to be made and
clarifies the manner in which the singularities contribute to the final formula.
The next section expresses Dedekind's functional equation in terms of F.
Sections 5.5 and 5.6 describe the path of integration used by Rademacher,
and Section 5.7 carries out the plan outlined above.

5.3 Dedekind's functional equation expressed


in terms of F
Theorem 5.1. Let F(t) = 1/[1;:;= 1 (1 - t m ) and let

x = exp(2nih _ 2nz) 2niH 2n)


(5) x' = exp ( -k- - ~ ,
k k2 '

where Re(z) > 0, k > 0, (h, k) = 1, and hH == -1 (mod k). Then

(6) F(x) = e1tis(h,k) ( -Z)I/2 exp ( - n - -nz-) F(x')


k 12z 12k2 .

96
5.4: Farey fractions

If
Note. Izl is small, the point x in (5) lies near the root of unity e27tihlk,
whereas x' lies near the origin. Hence F(x') is nearly F(O) = 1, and Equation
(6) gives the behavior of F near the singularity e27tihlk. Aside from a constant
factor, for small Iz I, F behaves like

Zl/2 ex p ( I;Z).
PROOF. If(: ~) E r with c > 0, the functional equation for '1(T) implies

(7) 1 _ 1.
'1(r) - '1(T') { -1(CT + d)} 1/2 exp {.(a~
1tl
+ d + s( -d, c))},

where r' = (ar + b)/(n + d). Since F(e 27tit ) = e7tit/121'1(r), (7) implies

(8) F(e 27tit ) = F(e 27tit') exp('!i(Tl~ r')} -i(n + d)}1/2

x exp{ It{a l~cd + s( -d, C))}.


Now choose

a = H, c = k, d = - h, b =
hH + 1 and
iz + h
T=-k-·
k

Then

, iz- I +H
T=-~-
k
and (8) becomes

( (2ltih
F exp - - - -
k
2ltZ)) _- F(exp(2ltiH
k
- - - -2lt))z 1/2
k kz

1t
x exp{ 12kz - 12k
ltZ
+ nis(h, k) } .
When z is replaced by z/k this gives (6). o

5.4 Farey fractions


Our next task is to describe the path of integration used by Rademacher.
The path is related to a set of reduced fractions in the unit interval called·
Farey fractions. This section describes these fractions and some of their
properties.

97
5: Rademacher's series for the partition function

Definition. The set of Farey fractions of order n. denoted by Fn. is the set of
reduced fractions in the closed interval [0. 1] with denominators ::;n.
listed in increasing order of magnitude.

EXAMPLES

F1: ¥. t
F 2 : ¥. t. t
F 3: ¥. t. t. i. t
F4 : ¥.i.t.t.i.i.t
F 5: ¥. t. i. t. ~. t. i. j. t t t
F6: ¥.i.t.i.t.~.t.i.i.i.!.i.t
F7: ¥.+.i.!.i.~.t.~.i.t.4.i.i.~.i.!.i.~.t

These examples illustrate some general properties of Farey fractions.


For example. Fn c Fn+ i. so we get Fn+ 1 by inserting new fractions in Fn·
If (alb) < (Cld) are consecutive in Fn and separated in Fn + 1. then the fraction
(a + c)/(b + d) does the separating. and no new ones are inserted between
alb and cld. This new fraction is called the mediant of alb and cld.

Theorem 5.2. If(alb) < (cld). their mediant (a + c)/(b + d) lies between them.
PROOF

a+c a bc - ad c a+c bc - ad
and
b + d - b = b(b + d) > 0 d- b+d = d(b + d) > O. D

The above examples show that t and ~ are consecutive fractions in F n


for n = 5. 6. and 7. This illustrates the following general property.

Theorem 5.3. Given 0 ::; alb < cld ::; 1. If bc - ad = 1 then alb and cld are
consecutive terms in Fnfor the following values of n:
max(b. d) ::; n ::; b +d- 1.

PROOF. The condition bc - ad = 1 implies that alb and cld are in lowest
terms. If max(b. d) ::; n then b ::; nand d ::; n so alb and cld are certainly in
Fn. Now we prove they are consecutive if n ::; b + d - 1. If they are not
consecutive there is another fraction hlk between them. alb < hlk < cld.
But now we can show that k ~ b + d because we have the identity
(9) k = (bc - ad)k = b(ck - dh) + d(bh - ak).
But the inequalities alb < hlk < cld show that ck - dh ~ 1 and bh - ak ~ 1
so k ~ b + d. Thus. any fraction hlk that lies between alb and cld has
denominator k ~ b + d. Therefore. if n ::; b + d - 1. then alb and cld must
be consecutive in Fn. This completes the proof. D

98
5.5: Ford circles

Equation (9) also yields the following theorem.

Theorem 5.4. Given 0 :s; alb < c/d :s; 1 with be - ad = 1, let h/k be the
mediant q{ alb and c/d. Then alb < h/k < c/d, and these fractions satisfy
the unimodular relations
bh - ak = 1, ck - dh = 1.

PROOF. Since h/k lies between alb and c/d we have bh - ak ~ 1 and
ck - dh ~ 1. Equation (9) shows that k = b + d if, and only if, bh - ak =
ck - dh = 1. D

The foregoing theorems tell us how to construct Fn+l from Fn.

Theorem 5.5. The set Fn + 1 includes Fn' Each fraction in Fn + 1 which is not in
Fn is the mediant of a pair of consecutive fractions in Fn. Moreover, if
alb < c/d are consecutive in any Fn, then they satisfy the unimodular
relation bc - ad = 1.
PROOF. We use induction on n. When 11 = 1 the fractions 0/1 and 1/1 are
consecutive and satisfy the unimodular relation. We pass from F 1 to F 2 by
inserting the mediant 1/2. Now suppose alb and c/d are consecutive in Fn
and satisfy the unimodular relation bc - ad = 1. By Theorem 5.3, they will
be consecutive in F m for all m satisfying
max(b, d) :s; m :s; b + d - 1.
Form their mediant h/k, where h = a + c, k = b + d. By Theorem 5.4
we have bh - ak = 1 and ck - dh = 1 so hand k are relatively prime.
The fractions alb and c/d are consecutive in Fm for all m satisfying
max(b, d) :s; m :s; b + d - 1, but are not consecutive in Fk since k = b + d
and h/k lies in Fk between alb and c/d. But the two new pairs alb < h/k and
h/k < c/d are now consecutive in Fk because k = max(b, k) and k = max(d, k).
The new consecutive pairs still satisfy the unimodular relations bh - ak = 1
and ck - dh = 1. This shows that in passing from Fn to Fn + 1 every new
fraction inserted must be the mediant of a consecutive pair in F n' and the new
consecutive pairs satisfy the unimodular relations. Therefore F n+ 1 has these
properties if F n does. 0

5.5 Ford circles


Definition. Given a rational number h/k with (h, k) = 1. The Ford circle
belonging to this fraction is denoted by C(h," k) and is that circle in the
complex plane with radius 1/(2k2) and center at the point (h/k) + i/(2k 2)
(see Figure 5.1).
Ford circles are named after L. R. Ford [9] who first studied their
properties in 1938.
99
5: Rademacher's series for the partition function

. 1
radiUS = 2k2

h
k

Figure 5.1 The Ford circle C(h, k)

Theorem 5.6. Two Ford circles qa, b) and qc, d) are either tangent to each
other or they do not intersect. They are tangent if, and only if, bc - ad =
± 1. In particular, Ford circles of consecutive Farey fractions are tangent
to each other.
PROOF. The square of the distance D between centers is (see Figure 5.2)

D1 = ( ba - C)2
d +
(12b1- 2d1
1)1'

a c
b d
Figure 5.2

whereas the square of the sum of their radii is

(r +
1
Rf = ( 2b1
1
+ 2d1 .
)2
The difference D2 - (r + R)l is equal to

D1 - (r + R)2 = (ad ~ bey + C!2 - 2~1)1 - C!l + 2~ly


(ad - bC)2 - 1
= b2 d 1 2 O.

Moreover, equality holds if, and only if (ad - bC)2 = 1. o


100
5.5: Ford circles

Theorem 5.7. Let hdkl < h/k < h2/k2 be three consecutive Farey fractions.
The points of tangency of C(h, k) with C(h l , kd and C(h 2, k 2) are the points
h kl i
ttl(h, k) = k- k(k 2 + k 1 2) + k2 + k/
and

Moreover, the point of contact rtl(h, k) lies on the semicircle whose diameter
is the interval [hl/k l , h/k].
PROOF. We refer to Figure 5.3. Write rtl for rtl(h, k). The figure shows that

b
Q
2k/
,
---'--,-::,.----- ---- --- - --
"
;ii',

,
,,
,,
,
h
k
Figure 5.3

To determine a and b we refer to the similar right triangles and we get

k/
so

Similarly, we find

b 20 - 2k;2 1 k/ - k 2
so b = 2k2 k 2 + k12·
1 1
2k2 2k2 + 2kl2
These give the required formula for rt l , and by analogy we get the correspond-
ing formula for rt 2 .

101
5: Rademacher's series for the partition function

To obtain the last statement, it suffices to show that the angle in Figure e
5.3 is n12. For this it suffices to show that the imaginary part of ril(h, k) is
the geometric mean of a and d, where
kl , h hi 1
a = k(k 2 + k 1 2 ) and a = k - k"; - a = kk 1 - a.

(See Figure 5.4.) Now

a
Figure 5.4

5.6 Rademacher's path of integration


For each integer N we construct a path P(N) joining the points i and i + 1
as follows. Consider the Ford circles for the Farey series P v . If hl/kl < h/k
< h21k2 are consecutive in FN, the points of tangency of C(1l I' k d, C(h, k),
and C(h 2 , k 2 ) divide C(h, k) into two arcs, an upper arc and a lower arc.
P(N) is the union of the upper arcs so obtained. For the fractions 0/1 and III
we use only the part of the upper arcs lying above the unit interval [0, 1].

EXAMPLE. Figure 5.5 shows the path P(3).


Because of Theorem 5.7, the path P(N) always lies above the row of semi-
circles connecting adjacent Farey fractions in F N'
The path P(N) is the contour used by Rademacher as a path of integration.
It is convenient at this point to discuss the effect of a certain change of variable
on each circle C(h, k).

Theorem 5.S. The transformation

Z . 2( kh)
= -Ik r -

102
5.6: Rademacher's path of integration

i + I

o 1
:3
1
2
2
:3

Figure 5.5 The Rademacher path P(3)

maps the Ford circle C(h, k) in the T-plane onto a circle K in the z-plane of
radius! about the point z = ! as center (see Figure 5.6). The points of
contact rJ.1(h, k) and rJ.z{h, k) of Theorem 5.7 are mapped onto the points
k2 kk
zl(h,k) = k 2 + k 1 Z + i k Z + lk/
and

The upper arc joil1il1g rJ.1(h, k) with rJ.z(h, k) maps onto that arc of K which
does 110t touch the imaginary z-axis.

z-plane
Figure 5.6

103
5: Rademacher's series for the partition function

PROOF. The translation l' - (h/k) moves C(h, k) to the left a distance h/k,
and thereby places its center at i/{2k 2). Multiplication by - ik 2 expands the
radius to 1/2 and rotates the circle through n/2 radians in the negative
direction. The expressions for zl(h, k) and z2(h, k) follow at once. 0
Now we obtain estimates for the moduli of Z 1 and Z2 .

Theorem 5.9. For the points Z1 and Z2 of Theorem 5.8 we have


k k
(10) IZI(h, k)1 = J k2 + kl ' 2
IZ2(h, k)1 = Jk 2 + k22
Moreover, if z is on the chord joining Z 1 and Z2 we have

yfik
(11) Izi < t:/'
if hl/kl < h/k < h2/k2 are consecutive in F N' The length of this chord does
not exceed 2yfik/N.
PROOF. For IZ112 we have
e + k2k 2
IZ112 = (k 2 + kJf + k/' k2
There is a similar formula for IZ212. This proves (10). To prove (11) we note
that if z is on the chord, then IzI ~ max( Iz 1 I, Iz21), so it suffices to prove that
yfik yfik
(12) IZ11<N and IZ21<N'
For this purpose we use the inequality relating the arithmetic mean and the
root mean square:
k _
_ k1 + < (k2 + k 1 2)1/2
2 - 2 .
This gives us
(k 2 k 2)1/2 k + kiN + 1 N
+ 1 ~ yfi ~ yfi > yfi'

so (10) and (12) imply (11). The length of the chord is ~ IZI I + IZ21. 0

5.7 Rademacher's convergent series for p(n)


Theorem 5.10. If n ~ 1 the partition function p(n) is represented by the
convergent series

p(n)
1
= n yfi2 k=L1 Ak(n)jk -dn
00 d (Sinh{ ~ JfF1)f)
R 1
/1--
24

104
5.7: Rademacher's convergent series for p(n)

where

A k(n) = '\' e"is(h, k) - 2 "inh/k.


L.
o ,,;h< k
(h,k)= 1

PROOF. We have

(13) p{n) = -2'


1
1tl
fF(x)
eX
IiTI dx where F(x) =
m=l
TI (1 -
<Xl

Xm)-l =
<Xl

L p{n)xn;
n=O

C is any positively oriented closed curve surrounding X = 0 and lying inside


the unit circle. The change of variable

maps the unit disk IX I :::;; 1 onto an infinite vertical strip of width 1 in the
r-plane, as shown in Figure 5.7. As x traverses counterclockwise a circle of

x-plane
o
t-plane
Figure 5.7

radius e - 2" with center at x = 0, the point r varies from i to i + 1 along a


horizontal segment. We replace this segment by the Rademacher path P{N)
composed of the upper arcs of the Ford circles formed for the Farey series
FN' Then (13) becomes

p(n) = r
J
i+ 1
F(e 21!i')e- 2 1!inr dr = r
Jp(N)
F{e 2"i')e- 2 "inr dr.
i

In this discussion the integer n is kept fixed and the integer N will later be
allowed to approach infinity. We can also write

L(N) ktl O";~<k


(h,k)= 1
th,k) = b th,k)

where y{h, k) denotes the upper arc of the circle C(h, k), and Lh,k is an
abbreviation for the double sum over hand k.
105
5: Rademacher's series for the partition function

Now we make the change of variable

z = -lk . 2(r - kh)


so that
/1 iz
r = k + k2'
Theorem 5,8 shows that this maps C(h, k) onto a circle K of radius t about
z = t as center. The arc y(h, k) maps onto an arc joining the points zl(h, k)
and z2(h, k) in Figure 5.6. We now have

p(n) = L f(h, k) F ( exp (2nih


Z2

h,k z,(h,k)
2nz))
-- - -
k k
2 2i
k
.
e-2nlnh/k e2nnz/k 2 dz

= L ik - 2 e - 2ninh/k f Z2
(h, k) e2nnz/k2 F(ex p(2nih _ 2n2z)) dz,
h,k zI(h,k) k k
Now we use the transformation formula for F (Theorem 5.1) which states
that
Z)I/2
F(x) = w(h, k) ( k
( n
exp 12z - 12e
nz )
F(x'),
where
2nih 2nz)
x = exp ( -k- - k2 ' x , = exp (2niH 2n),
-k- - ~
and
w(h, k) = enis(h, k), hH == - 1 (mod k), (h,k) = 1.
Denote the elementary factor Zl/2 exp[7l'/(l2z)- nz/(12k 2 )] by '1\(z) and
split the integral into two parts by writing
F(x') = 1 + {F(x') - I}.
We then obtain
p(n) = L ik-s/2w(h,k)e-Zninh/k(ll(h,k) + 12(h,k»
h, k

where

11(/1, k) = f Z2 (h,k)

zdh.k)
'1\(z )e2nnz/k2 dz

f
and
I 2(h, k) = (h,k) '1\(Z){F(ex p(2:iH _ 2n)) _ 1}e2nnz/k2 dz.
Z2

z,(h,k) Z

We show next that 12 is small for large N. The path of integration in the
z-plane can be moved so that we integrate along the chord joining z dh, k)
and z2(h, k). (See Figure 5.8.) We have already estimated the length of this

106
5.7: Rademacher's convergent series for p(n)

z tfh, k)

o\

Figure 5.8

chord; it does not exceed 2.j2k/N. On the chord itself we have Izl
::; max{IZ11, IZ21} < .j2k/N. Note also that the mapping w = l/z maps
the disk bounded by K onto the half-plane Re(w) ~ 1. Inside and on the
circle K we have 0 < Re(z) ::; 1 and Re(1/z) ~ 1, while on K itself we have
Re{1/z) = 1.
Now we estimate the integrand on the chord. We have

::; Izll/2 exp{~ Re(!)}e2nrr/k2


12 z m=1
I: p(m)e-2rrmRe(1/Z)
L p(m)e- 2rr (m-(1/24))Re(I/Z)
00
< Izll/2e 2n1t
m=1

L p(m)e- 21t(m-(1/24))
00
::; Izll/2e 2nrr
m=l

L p(m)e- 2rr(24m-l)/24
00

= Izll/2e 2nrr
m=1

L p(24m -

< Izll/2e 2n1t l)e- 2rr (24m-l)/24


m=l

" p(24m - 1)j,24m-l (where.V = e- Zrr /24 )


= Izll/2e 2/Jrr L
m::::1

107
5: Rademacher's series for the partition function

where

L p(24m -
00

c = e Zn1t 1)yZ4m-1.
m= 1

The number c does not depend on z or on N. (It depends on n, but n is fixed


in this discussion.) Since z is on the chord we have Izl < j2k/N so the
integrand is bounded by C21/4(k/N)1/Z. The length of the path is less than
2j2k/N, so altogether we find
11 z(h,k)1 < Ck3/ZN-3/z

for some constant C, and therefore

L ik-SIZw(h, k)e-Z1tinhlklz(h, k)1


Ih,k < £ L
k= 1 05h<k
Ck- 1N- 3/Z
(h,k)= 1
N
:S; CN- 3/Z L 1 = CN- 1/Z.
k=l
This means we can write

L L
N
(14) p(n) = ik-SIZw(h,k)e-Z1tinhlkl1(h,k) + O(N- 1/2 ).
k= 1 05h<k
(h,k)= 1
Next we deal with 1 1(h, k). This is an integral joining zl(h, k) and zz(h, k)
along an arc of the circle K in Figure 5.8. We introduce the entire circle K

i
as path of integration and show that the error made is also O(N- 1/ Z ). We have

11(h,k) = i i
K(-)
-
0
z llh, k)
-
fO

z2(h,k)
=
K(-)
-J 1 -J 2 ,

where K( - ) denotes that the integration is in the negative direction along K.


To estimate IJ 11 we note that the length of the arc joining 0 to zl(h, k) is less
than

Since Re(1/z) = 1 and 0 < Re(z) :s; 1 on K the integrand has absolute value

I'¥ (z)eZnnZlk21 = eZn1tRe(z)lk2IzI1/2 exp{~ Re(~) - ~ Re(Z)}


k 12 z 12k2

so that

108
5.7: Rademacher's convergent series for p(n)

where C I is a constant. A similar estimate holds for 1J 21 and, as before,


this leads to an error term O(N- 1 / 2 ) in the formula for p(n). Hence (14)
becomes

p(n) = I
k~ I Osh<k
I ik-5/2w(h,k)e-21tinhik r
JK(-)
\l'k(z)e2n1tz/k2 dz + O(N- 1/2).
(h, k) ~ I

Now we let N --> 00 to obtain

p(n) = ik~IAk(n)k-5/2
00
JrK (_)ZI/2 exp {127
n + k2
2nz ( n - 1 )} dz,
24

where
Ak(n) = I e"is(h, k)- 21tinh/k.
Osh<k
(h, k)~ I

The integral can be evaluated in terms of Bessel functions. The change of


variable
1
W =-, dz = - ldw,
z w
gives us

p(n) = ~ J I Ak(n)k - 5/2 f-+:iiW- 5/2 exp { ~; + ~~ (n - ;4) ±} dw.

Now put t = nw/12 and the formula becomes

p(n) = 2nCn2Y/2 J/k(n)k- 5/2 2~i f_+:i it - 5/2 exp{t + 6nk22 (n - 2~)ndt
where e = n/12. Now on page 181 of Watson's treatise on Bessel functions
[53] we find the formula

Iv{z) = 2Z. e )" f C


+ ooi
t- V - 1el +(z2/4t)dt (ife > O,Re(v) > 0),
2m c- ooi

where fv(z) = i-vJJiz). Taking

} = {::2 (n - ;4)f/2
and v = 3/2 we get
)-3/4
1))
1
3 2(
n)3/2 n- / n - 24 (n ~(
k v"3 "n - 24.
-52
p(n) = (2n) ( 12
00

k~1 Ak(n)k / 6 3/4k 3/2 13/2

109
5: Rademacher's series for the partition function

But Bessel functions of half odd order can be reduced to elementary functions.
In this case we have

13 / 2 (z) _- ~z d (sinh
- -d ~~. z)
n Z Z

Introducing this in the previous formula we finally get Rademacher's


formula,

Exercises for Chapter 5


1. Two reduced fractions alb and eld are said to be similarly ordered ifk - a)(d - b) ~ O.
Let alibi < a21b 2 < ... denote the Farey fractions in F".
(a) Prove that any two neighbors a/b i and ai + Ilb i + I are similarly ordered.
(b) Prove also that any two second neighbors a/b i and ai+ 2lb i +2 are similarly ordered.
Note: Erdos [8] has shown that there is an absolute constant c > 0 such that the
kth neighbors a/b i and aiulbiH in FlO are similarly ordered if II> ck.

2. If a, b, c, d are positive integers such that alb < cld and if ;, and II are positive integers,
prove that the fraction

lies between alb and cld, and that (c - del/(eb - a) = ;./11. When A = fl, 0 is the
mediant of alb and cld.

3. If bc - ad = I and 11 > max(b, d), prove that the terms of the Farey sequence F"
between alb and cld are the fractions of the form (Aa + flC)/(Ab + luI) for which ).
and fl are positive relatively prime integers with ;,b + lid s 11. Geometrically. each
pair (A, fl) is a lattice point (with coprime coordinates) in the triangle determined by
the coordinate axes and the line bx + ely = 11. Neville [29] has shown that the
number of such lattice points is
3 112
2- + O(nlog/J).
TC bel

This shows that for a given n, the number of Farey fractions between alb and eld is
asymptotically proportional to l/(bd), the length of the interval [alb, ('1£1].

Exercises 4 through 8 relate Farey fractions to lattice points in the plane.


In these exercises, n ;;::; 1 and T" denotes the set of lattice points (x, y) in the
triangular region defined by the inequalities
1 :s; x :s; n, 1 :s; y :s; n, n + 1 :s; x +y :s; 2n.

110
Exercises for Chapter 5

Also, T,,' denotes the set of lattice points (x, y) in T" with relatively prime
coordinates.
4. Prove that alb and cld are consecutive fractions in the Farey sequence F" if, and only
if, the lattice point (b, d) E T~.

5. Prove that L(b.d)E 1';, I/(bd) = 1. Hint: Theorem 5.5.


6. Assign a weight fix, y) to each lattice point (x, y) and let S" be the sum of all the
weights in T",

S" = L fix, y).


(x.y)e Tn

(a) By comparing the regions T, and T,-l for I' ::::: 2 show that
r-l r-I

Sr - Sr-l = f(l', 1') + L {.f(k, 1') + f(l', k)} - 2: f(k. r - k),


k= 1 k= I

and deduce that


n n r- 1 n r- I

SII = L f(I',I') + L L {.f(k, 1') + f(l', k)} - 2: 2: f(k. r - k).


r=l r=2 k=l r=2k=1

Note: If fix, y) = 0 whenever (x, y) > 1 this reduces to a formula of 1. Lehner


and M. Newman [25],
n r- 1

(15) L
(x. y) e T~
f(x, y) = f(1, I) +
r=2
L L k::;: 1
{.f(k,l') + f(l', k) - f(k, I' - k)}.
(k.r)= 1

This relates a sum involving Farey fractions to one which does not.

7. Let

1
S" =
(b, d)
L T~ bd(b + d) .
E

(a) Use Exercise 5 to show that 1/(2n - I) ::; S" ::; I/(n + I).
(b) Choose f(x,·),) = I/(xy(x + y)) in (15) and show that

3 "r 1
S" = - - 2L L 2 •
2 r= 1 k= 1 I' (I' + k)
(k,r)= 1

When n --> 00 this gives a formula of Gupta [12J,

<X) r 1 3
r=
L1 L1 k=
1'2(1' + k) = 4'
(k.r)= 1

8. Exercise 7(a) shows that S" --> 0 as n --> x,. This exercise outlines a proof of the
asymptotic formula

(16) S" -
_ 12 log 2
2 +0
(lOg n)
7! n n2
obtained by Lehner and Newman in [25].

111
5: Rademacher's series for the partition function

Let

Ar = ~
k=l r2(r
1
+ k)
= I L Il(d)
k=l dllr.k)r 2(r + k)'
(k.r)= 1
so that

r>n
(a) Show that

Ar = L
d/r
f
h= 1
---.:--:-::-Il_(r/_d--::-)
r (h + d)

and deduce that

qJ(r)
Ar = log 2 - 3 + 0 ( 3"1 L 1J1(d) I) .
r r d/r

(b) Show that L~= 1 Ld/r IIl(d) I = O(n log n) and deduce that

1 (lOg
L 3" L IIl(d) I = 0 - 2
n) .
r> n r d/r n
(c) Use the formula Lr,; n q>(r) = 3n 2 /n 2 + O(nlog n)(proved in [4], Theorem 3.7)to
deduce that

(d) Use (a), (b), and (c) to deduce (16).

112
6
Modular forms with
multiplicative coefficients

6.1 Introduction
The material in this chapter is motivated by properties shared by the discrimi-
nant ~(t) and the Eisenstein series
1
G 2k(t) = L
(m,II)"(O.O)
(m + nt )2k'
where k is an integer, k ~ 2. All these functions satisfy the relation

(1) at + b)
f ( ct + d = (Ct + d)'f(t),

where r is an integer and (: !) is any element of the modular group r.


The function ~ satisfies (1) with r = 12, and G2k satisfies (1) with r = 2k.
Functions satisfying (1) together with some extra conditions concerning
analyticity are called modular forms. (A precise definition is given in the
next section.)
Modular forms are periodic with period 1 and have Fourier expansions.
For example, we have the Fourier expansion,

L t(n)e2ltillt,
00

~(t) = (2n)12
n=1

where t(n) is Ramanujan's function, and


Y(2k) 2(2ni)2k ~ ( ) 2ltillt
()
G2k t = 2.. +(2k_l)!I1~/"2k_1ne ,

where 11,,(n) is the sum of the ~th powers 'of the divisors of n.

113
6: Modular forms with multiplicative coefficients

Both r(n) and O"a(n) are multiplicative arithmetical functions; that is, we
have
(2) r(m)r(n) = r(mn) and O"a(m)O"a(n) = O"a(mn) whenever (m, n) = 1.
They also satisfy the more general multiplicative relations

(3) r(m)r(n) = L dllr(;~)


dl(m. n)

and

(4)

for all positive integers m and n. These reduce to (2) when (m, n) = 1.
The striking resemblance between (3) and (4) suggests the problem of
determining all modular forms whose Fourier coefficients satisfy a multi-
plicative property encompassing (3) and (4). The problem was solved by
Hecke [16] in 1937 and his solution is discussed in this chapter.

6.2 Modular forms of weight k


In this discussion k denotes an integer (positive, negative, or zero), H denotes
the upper half-plane, H = {r: Im(r) > O}, and r denotes the modular group.

Definition. A function f is said to be an entire modular form of weight k if it


satisfies the following conditions:

(a) f is analytic in the upper half-plane H.


ar+b)
(b) f ( - - = (cr + d)kf(r) whenever (a
cr + d c
(c) The Fourier expansion of f has the form

L c(n)e21tinr.
00

f(r) =
n~O

Note. The Fourier expansion of a function of period 1 is its Laurent


expansion near the origin x = 0, where x = e 21tir . Condition (c) states that
the Laurent expansion of an entire modular form contains no negative
powers of x. In other words, an entire modular form is analytic everywhere
in H and at ioo.

The constant term c(0) is called the value of f at ioo, denoted by f(ioo).
If c(O) = 0 the function f is called a cusp form (" Spitzenform" in German),
and the smallest r such that c(r) =1= 0 is called the order of the zero of fat ioo.
It should be noted that the discriminant ~ is a cusp form of weight 12 with
a first order zero at ioo. Also, no Eisenstein series GZk vanishes at ioo.

114
6.3: The weight formula for zeros of an entire modular form

Warning. Some authors refer to the weight k as the" dimension - k"


or the "degree -k." Others write 2k where we have written k.

In more general treatments a modular form is allowed to have poles in


H or at ioo. This is why forms satisfying our conditions are called entire
forms. The modular function J is an example of a nonentire modular form of
weight 0 since it has a pole at ioo. Also, to encompass the Dedekind eta func-
tion there are extensions of the theory in which k is not restricted to integer
values but may be any real number, and a factor £(a, b, c, d) of absolute
value 1 is allowed in the functional equation (b). This chapter treats only
entire forms of integer weight with multiplier £ = 1.
The zero function is a modular form of weight k for every k. A nonzero
constant function is a modular form of weight k only if k = O. An entire
modular form of weight 0 is a modular function (as defined in Chapter 2) and
since it is analytic everywhere in H, including the point ioc, it must be constant.
Our first goal is to prove that nonconstant entire modular forms exist
only if k is even and::::: 4. Moreover, they can all be expressed in terms of the
Eisenstein series G4 and G6' The proof is based on a formula relating the
weight k with the number of zeros of f in the closure of the fundamental
region of the modular group.

6.3 The weight formula for zeros of an entire


modular form
We recall that the fundamental region Rr has vertices at the points p, i,
P + 1 and ioo. Iff has a zero of order r at a point p we write r = N(p).

Theorem 6.1. Let f be an entire modular form of weight k which is not identically
zero, and assume f has N zeros in the closure of the fundamental region
R r , omitting the vertices. Then we have the formula

(5) k = 12N + 6N(i) + 4N(p) + 12N(ioc).

PROOF. The method of proof is similar to that of Theorem 2.4 where we


proved that a modular function has the same number of zeros as poles in
the closure of R r . Since f has no poles we can write

N = _1
2n:i
f
oR
f'(r) dr.
f(r)

The integral is taken along the boundary of a region R formed by truncating


the fundamental region by a horizontal line y = M with sufficiently large M.
The path oR is along the edges of R with circular detours made around the
vertices i, p and p + 1 and other zeros which might occur on the edges. By

115
6: Modular forms with multiplicative coefficients

calculating the limiting value of the integral as M -+ 00 and the circular


detours shrink to their centers we find, as in the proof of Theorem 2.4,
k 1 1
(6) N = - - - N(i) - - N(p) - N(ioo)
12 2 3 .
The only essential difference between this result and the corresponding
formula obtained in the proof of Theorem 2.4 is the appearance of the term
k/12. This comes from the weight factor (c. + d)k in the functional equation
f(A(.» = (c. + d)kf(.),
where A(.) = (a. + b)/(c. + d). Differentiation of this equation gives us
f'(A(.»A'(.) = (c. + d)kf'(.).+ kc(C't + d)k-lf(.)
from which we find

c.
f'(A(.»A'(.) 1'(.)
= --
~--,--,----,--,-- + -kc
-.
f(A(.» f(.) +d
Consequently, for any path y not passing through a zero we have

_1
2ni
f A(y)
f'(u) du = _1
f(u) 2ni
f
y
1'(.) d.
f(.)
+ _1
2ni
f ~d•.
y C't +d
Therefore the integrals along the arcs (2) and (3) in Figure 2.5 do not cancel
as they did in the proof of Theorem 2.4 unless k = O. Instead, they make a
contribution whose limiting value is equal to

-k
2ni
Ii ~d.
p = 2ni
-k -k (ni
(log i-log p) = 2ni 2" -"3
2ni)
=
k
12'

The rest of the proof is like that of Theorem 2.4 and we obtain (6), which
implies (5). 0

From the weight formula (5) we obtain the following theorem.

Theorem 6.2
(a) The only entire modular forms of weight k = 0 are the constant
functions.
= 2, the only entire modular form of weight k
(b) Ifk is odd, ifk < 0, or ifk
is the zero function.
(c) Every nonconstant entire modularform has weight k ~ 4, where k is even.
(d) The only entire cusp form of weight k < 12 is the zero function.

PROOF. Part (a) was proved earlier. To prove (b), (c) and (d) we simply refer
to the weight formula in (5). Since each integer N, N(i), N(p) and N(ioo) is
nonnegative, k must be nonnegative and even, with k ~ 4 if k =F O. Also,
if k < 12 then N(ioo) = 0 so f is not a cusp form unless f = O. 0
116
6.4: Representation of entire forms in terms of G4 and G6

6.4 Representation of entire forms in terms


ofG4 and G6
In Chapter 1 it was shown that every Eisenstein series Gk with k > 2 is a
polynomial in G4 and G6. This section shows that the same is true of every
entire modular form. Since the discriminant Il is a polynomial in G4 and
G6 ,
Il = g2 3 - 27g/ = (60G 4 )3 - 27(140G 6 )2,
it suffices to show that all entire forms of weight k can be expressed in terms
of Eisenstein series and powers of Il. The proof repeatedly uses the fact that
the product fg of two entire forms f and g of weights wI and W2' respectively,
is another entire form of weight WI + W2' and the quotient fig is an entire
form of weight WI - W2 if g has no zeros in H or at ioo.
Notation. We denote by M k the set of all entire modular forms of weight k.
Theorem 6.3. Let f be an entire modular form of even weight k ~ 0 and define
Gotr) = 1 for all 'to Then f can be expressed in one and only one way as a
sum of the type
[k112]
(7) f = LarGk-12rllr,
r=O
k-12r*2
where the ar are complex numbers. The cusp forms of even weight k are
those sums with ao = O.
PROOF. If k < 12 there is at most one term in the sum and the theorem can
be verified directly. If f has weight k < 12 the weight formula (5) implies
N = N(ioo) = 0 so the only possible zeros of f are at the vertices p and i.
For example, if k = 4 we have N(p) = 1 and N(i) = O. Since G4 has this
property,fIG 4 is an entire modular form of weight 0 and therefore is constant,
so f = a OG4 . Similarly, we find f = aOG k if k = 6, 8 or 10. The theorem
also holds trivially for k = 0 (since f is constant) and for k = 2 (since the
sum is empty). Therefore we need only consider even k ~ 12.
We use induction on k together with the simple observation that every
cusp form in Mk can be written as a product Ilh, where h E M k- 12 .
Assume the theorem has been proved for all entire forms of even weight
<k. The form G k has weight k and does not vanish at ioo. Hence if
c = f(ioo )/Gk(ioo) the entire form f - cGk is a cusp form in Mk so
f - cG k = Ilh, where hEM k_ 12. Applying the induction hypothesis to h
we have
[(k-12)/12] [k/12]
h= L br Gk - 12 - l2rllr = r=1L br_1Gk_12rllr-1.
r=O
k-12-12r*2 k-12r*2

117
6: Modular forms with multiplicative coefficients

Therefore f = cG k + !1h is a sum of the type shown in (7). This proves, by


induction, that every entire form of even weight k has at least one representa-
tion of the type in (7). To show there is at most one such representation we

°
need only verify that the products Gk - 12 ,!1' are linearly independent. This
follows easily from the fact that !1(ioo) = but G 2 ,(ioo) =f. 0. Details are left
as an exercise for the reader. 0

Since both !1 and G2 , can be expressed as polynomials in G4 and G6 ,


Theorem 6.3 also shows that f is a polynomial in G4 and G6 . The exact
form of this polynomial is described in the next theorem.

Theorem 6.4. Every entire modular form f of weight k is a polynomial in G4


and G6 of the type

(8) f = L ca,bG4aG6b
a, b

°
where the Ca, b are complex numbers and the sum is extended over all integers
a ~ 0, b ~ such that 4a + 6b = k.
PROOF. °
If k is odd, k < or k = 2 the sum is empty and f is 0. If k = 0, f is
constant and the sum consists of only one term, co, 0' If k = 4, 6, 8 or 10

°
then each of the respective quotients f/G 4 ,f/G 6 ,f/G42 and f1(G 4 G6) is an
entire form of weight and hence is constant. This proves (8) for k < 12 or
k odd. To prove the result for even k ~ 12 we use induction on k.
Assume the theorem has been proved for all entire forms of weight < k.
Since k is even, k = 4m or k = 4m + 2 = 4(m - 1) + 6 for some integer
m ~ 3. In either case there are nonnegative integers rand s such that
k = 4r + 6s. The form g = G/G 6s has weight k and does not vanish at ioo.
Hence if c = f(ioo)/g(ioo) the entire form f - cg is a cusp form in Mk so
f - cg =!1h where hEM k_ 12' By the induction hypothesis, h can be
°
expressed as a sum as in (8), taken over all a ~ 0, b ~ such that 4a + 6b =
k - 12. Multiplication by !1 gives a sum of the same type with 4a + 6b = k.
Hence f = cg + !1h is also a sum of the required type and this proves the
theorem. 0

6.5 The linear space Mk and the


subspace Mk,o
The results of the foregoing section can be described in another way. Let
Mk denote the set of all entire forms of weight k. Then Mk is a linear space
over the complex field (since M k is closed under addition and under multi-
plication by complex scalars). Theorem 6.3 shows that M k is finite-dimen-
sional with a finite basis given by the set of products Gk - 12,!1' occurring
in the sum (7). There are [k/12] + 1 terms in this sum if k "!- 2 (mod 12),

118
6.6: Classification of entire forms in terms of their zeros

and one less term if k == 2 (mod 12). Therefore the dimension of the space
Mk is given by the formulas

if k == 2 (mod 12),
(9)
if k 1= 2 (mod 12).

Another basis for Mk is the set of products G4 a G6 b where a 2:: 0, b 2:: and
4a + 6b = k (see Exercise 6.12).
°
The set of all cusp forms in M k is a linear subspace of M k which we denote
by M k, o. The representation in Theorem 6.3 shows that
(10) dim Mk,o = dim Mk - 1
since the cusp forms are those sums in (7) with ao = 0.
We also note that if k 2:: 12, f E M k , 0 if and only if f = Ah, where
hEM k_ 12 . Therefore the linear transformation T : M k- 12 ...... M k, 0 defined by
T(h) = llh
establishes an isomorphism between M k, 0 and M k _ 12' Consequently, if
k 2:: 12 we have
(11) dim Mk,o = dim M k- 12 •
The two formulas (11) and (10) imply
dim M k = 1 + dim M k - 12

if k ~ 12. This equation, together with the fact that dim Mk = 1,0,1,1, 1, 1
when k = 0, 2, 4, 6, 8, 10, gives another proof of (9).
EXAMPLES. Formula (9) shows that
dim M k = 1 if k = 4, 6, 8, 10, and 14.
Corresponding basis elements are G4 , G6 , G/, G4 G6 , and G/G 6 ·
Formulas (11) and (9) together show that
dim Mk,o = 1 if k = 12,16,18,20,22, and 26.
Corresponding basis elements are ll, llG 4 , llG 6 , llG 4 2 , llG 4 G6 , and llG/G 6 .

6.6 Classification of entire forms in terms of


their zeros
The next theorem gives another way of expressing all entire forms in terms
of G4 , G 6 , II and Klein's modular invariant J.

Theorem 6.5. Let f be an entire form of weight k and let Zl' ... , ZN denote the
N zeros off in the closure of Rr (omitting the vertices) with zeros of order
119
6: Modular forms with multiplicative coefficients

N(p), N(i) and N(ioo) at the vertices. Then there is a constant c such that

TI
N
(12) f(r) = cG4(r)N(p)G6(r)N(i)~(r)N(iOO)~(rt {J(r) - J(Zk)}'
k=l
PROOF. The product
N
g(r) = TI {J(r) - J(zd}
k=l

is a modular function with its only zeros in the closure of Rr at Zl' ... , ZN
and with a pole of order N at ioo. Since ~ has a first-order zero at ioo, the
product ~Ng is an entire modular form of weight 12N which, in the closure
of R r , vanishes only at Zb ... , ZN' Therefore the product
h = G4N(p)G6N(i)~N(ioo)~Ng

has exactly the same zeros asfin the closure of R r . Also, h is an entire modular
form having the same weight as f since
k = 4N(p) + 6N(i) + 12N(ioo) + 12N.

Therefore f /h is an entire form of weight 0 so f /h is constant. This proves (12).


o
6.7 The Heeke operators Tn
Hecke determined all entire forms with multiplicative coefficients by intro-
ducing a sequence of linear operators Tn' n = 1,2, ... , which map the linear
space M k onto itself. Hecke's operators are defined as follows.

Definition. For a fixed integer k and any n = 1, 2, ... , the operator 1'" is
defined on Mk by the equation

(13)

In the special case when n is prime, say n = p, the sum on d contains


only two terms and the definition reduces to the formula

(14) (Tpf)(r) = pk-l f(pr) + -1 p-l


Lf (r + b)
-- .
P b=O P
The sum on b is the operator encountered in Chapter 4. It maps functions
automorphic under r onto functions automorphic under the congruence
subgroup r o(p).
We will show that 1'" maps each f in M k on to another function in M k •
First we describe the action of 1'" on the Fourier expansion of f

120
6.7: The Heeke operators T"

Theorem 6.6. Iff E Mk and has the Fourier expansion


00

f(,) = L c(m)e 21tim"


m=O

then T" f has the Fourier expansion


00

(15) (T"f)(,) = L Yn(m)e 2"im"


m=O

where

(16)

PROOF. From the definition in (13) we find

(Tnf)(,) = nk - I
d-I
L d- L L C(m)e "im(nt+bd)/d
k
00
2 2

)k-I d-I
din b=O m=O

1
= L L
00 (
~ c(m)e 2 "imnt/d 2 - L e 2 "imb/d.
m=O din d d b=O
The sum on b is a geometric sum which is equal to d if dim, and is 0 otherwise.

()k-I
Hence
(T"f)(,) = L L
00
~ c(m)e 2 "imnt/d 2 •
m=O dln.dlm d
Writing m = qd we have

(T"f)(,) = f
Q=O din
L (~)k-I c(qd)e 2 "i nt/d.
d
Q

In the sum on d we can replace d by nld to obtain

(Tnf)(,) = f Ld
q=Odln
k- 1 C(qn)e 2 "iQdt.
d
If x = e 2 "it the last sum contains powers of the form x qd • We collect those
terms for which qd is constant, say qd = m. Then q = mid and dim so

(Tn f)(,) = L L
00

m=O din. dim


k-I
d c(mn)
(j2 x m ,
which implies (16). o
Our next task is to prove that T" maps Mk into itself. For this purpose
we note that the definition of T" f can be written in a slightly different form.
We write n = ad and let
aT + b
A'=-d-'

121
6: Modular forms with multiplicative coefficients

Then (13) takes the form

(17) (T"f)(r)=n k - 1 L d-kf(Ar)=~ L akf(Ar).


a?: l.ad=n n a?: l.ad=n
O~b<d O~b<d

The matrix (~ !) which represents A has determinant ad = n. To deter-


mine the behavior of T"f under transformations of the modular group r
we need some properties of transformations with determinant 11. These are
described in the next section.

6.8 Transformations of order n


Let n be a fixed positive integer. A transformation of the form
ar + b
Ar=--d'
er +
where a, b, e, d are integers with ad - be = n, is called a transformation of
order n. It can be represented by the 2 x 2 matrix

where, as usual, we identify each matrix with its negative.


We denote by nn) the set of all transformations of order n. The modular
group r is nl).
Two transformations Al and A2 in nn) are called equivalent, and we
write A 1 ~ A 2 , if there is a transformation V in r such that
Al = VA 2 •
The relation ~ is obviously reflexive, symmetric, and transitive, and hence
is an equivalence relation. Consequently, the set nn) can be partitioned
into equivalence classes such that two elements of nn) are in the same class
if, and only if, they are equivalent. The next theorem describes a set of
represen ta ti ves.

Theorem 6.7. In every equivalence class of nn) there is a representative of


triangular form

(a
o
b)
d'
where d > o.

PROOF. Let A = (: ~) be an arbitrary element of nn). If e = 0 there is


nothing more to prove. If e =1= 0 we reduce the fraction - ale to lowest terms.
That is, we choose integers rand s such that sir = -ale and (r, s) = 1.

122
6.8: Transformations of order 11

Next we choose two integers p and q such that ps - qr = 1 and let

v = (~ ~}
Then V Er and

VA=(~ !)G ;)=(~:::; ~~:::}


Since ra + sc = 0 and det(VA) = det V det A = n we see that VA E nn) so
VA '" A. Hence V A or its negative is the required representative. 0

Theorem 6.S. A complete system q{ nonequivalent elements in nn) is given by


the set q{ transformations of triangular form

(18)

where d runs through the positive divisors of n and, for each fixed d,
a = nld, and b runs through a complete residue system modulo d.
PROOF. Theorem 6.7 shows that every element in nn) is equivalent to one
of the transformations in (18). Therefore we need only show that two such
transformations, say
al
Al = ( 0 and

are equivalent if, and only if,


(19) and
If (19) holds then b 2 = b l + qd l for some integer q and we can take

V = (~ i).
Then V A I = A 2 so A I '" A 2·
Conversely, if Al '" A2 there is an element

V = (~ ~)
in r such that A2 = VAI. Therefore

(20)

Equating entries we find ral = Osor = Osinceal #- obecause ald l = n ~ 1.


Now ps - qr = 1 so ps = 1 hence both p and s are 1 or both are - 1. We can
assume p = s = 1 (otherwise replace V by - V). Equating the remaining
entries in (20) we find a 2 = ai' d 2 = d l , b 2 = b l + qd l , so b2 == b l (mod d t ).
This completes the p r o o f . · 0
123
6: Modular forms with multiplicative coefficients

Note. The sum in (17) defining T"f can now be written in the form
1
(21) (T"f)(r) = -
n
L akf(Ar),
A

where A runs through a complete set of nonequivalent elements in r(n) of


the form described in Theorem 6.8. The coefficient ak is the kth power of
the entry in the first row and first column of A.

Theorem 6.9. If A1 E r(n) and V1 E r, then there exist matrices Az in r(n)


and V2 in r such that
(22)
Moreover, if

Ai = (~ ~:) and

for i = 1,2, then we have


(23)
PROOF. Since det(A 1V1) = det A1 det V1 = n, the matrix A1 V1 is in r(n) so,
by Theorem 6.7, there exists A z in r(n) and Vz in r such that (22) holds. To
verify (23) we first note that A 1 V1 has the form

A1 V1 = (~ ~:)(;: ::) = C1*Y1 d 1*bJ


and that

_ 1
A2 -1 - - (d
n 0
z

Therefore (22) implies

V2 = A1 V1A z -1 = ~ (d 1*Y1 d 1*bJ( ~ -::)

Equating entries in the second row we find

d 1d 2 Y1 d2
Y2 = - - = - Y l
n a1
and
b2 = -d 1Y1 b 2 + d 1b 1a z = _ b2 Y1 + a2 b 1
n a1 a1

124
6.9: Behavior of T" f under the modular group

since a1d l = n. Hence


alY2 = d 2Yl and
and we obtain
al(Y2A2r + b2 ) = alY2A2r + al b2

which proves (23). D

6.9 Behavior of Tn/under the modular group

Theorem 6.10. IffEMk and V = G:)Er then

(24) (T.J)(Vr) = (yr + b)k(T"f)(r).


PROOF. We use the representation in (21) to write
1
(T"f)(r) = - L a/ f(Alr)
n AI

where A I = ( a0
l hi)
dI and A I runs through a complete set of noneqUlvalent .
elements in r(n). Replacing r by Vr we find
1
(25) (T"f)(Vr) = - L a/f(A I Vr).
n Al
By Theorems 6.7 and 6.9, there exist matrices

A2 = (a~ ~:) in r(n) and

such that
and
Therefore
a/f(AIVr) = a/f(V2A 2r) = a/(Y2 A2r + b 2 tf(A 2 r)
= a/(yr + b)kf(A 2 r)
since f E M k • Now as Al runs through a complete set of nonequivalent
elements of r(n) so does A 2 . Hence (25) becomes
1
(T" f)( Vr) = - (yr + b)k L a/ f(A2 r) = (yr + b)k(T" f)(r). D
n A,

125
6: Modular forms with multiplicative coefficients

The next theorem shows that each Hecke operator 1'" maps Mk into Mk
and also maps Mk,o into Mk,o,

Theorem 6.11. If f E Mk then T"f E M k , Moreover, iff is a cusp form then


7;, f is also a cusp form.
PROOF. Iff E Mk the definition of Tn shows that T"f is analytic everywhere
in H. Theorem 6.6 shows that T"f has a Fourier expansion of the required
form and that 1'" f is analytic at i 00. And Theorem 6.10 shows that 1'" f has
the proper behavior under transformations of r. Finally, if f is a cusp form,
the Fourier expansion in Theorem 6.6 shows that 1'" f is also a cusp form. D

6.10 Multiplicative property of Hecke


operators
This section shows that any two Hecke operators 1'" and Tm defined on M k
commute with each other. This follows from a multiplicative property of the
composition Tm 1'". First we treat the case in which m and n are relatively
prime.

Theorem 6.12. If (m, n) = 1 we have the composition property


(26)
PROOF. Iff E Mk we have
1
(T"f)(,) = - I ak f(A,),
n a<-I,ad=n
Osb<d

where A = (~ ~). Applying Tm to each member we have


1 1
{Tm(T,,(f))}(r)=- I ri- I akf(BAr),
m a~ 1,7b=m n a~ I, ad=n
osf3<b OSb<d

where B = (~ ~). This can be written as


1
(27) {(Tm T,,)(f)}(r) =~ I I (cw)kf(C,),
mn a<-I,ab=m a<-I.ad=n
Osf3<b Osb<d

where

C = BA = (~ ~) ( ~ ~) = (~ ab ~ Pd).
As d and b run through the positive divisors of nand m, respectively, the
product db runs through the positive divisors of mn since (m, n) = 1. The

126
6.10: Multiplicative property of Hecke operators

linear combination rxb + f3d runs through a complete residue system


mod db as band f3 run through complete residue systems mod d and b,
respectively. Therefore the matrix C runs through a complete set of non-
equivalent elements of r(mn) and we see that (27) implies (26). 0

The next theorem extends the composition property in (26) to arbitrary


m and n. For convenience in notation we write T(n) in place of 7;..

Theorem 6.13. Any two Hecke operators T(n) and T(m) defined on M k commute
with each other. Moreover, we have the compositionjormu/a
(28) T(m)T(n) = L dk- 1 T(mn/d 2 ).
d!<m,n)

PROOF. Commutativity follows from (28) since the right member is symmetric
in m and n. If (m, n) = 1 formula (28) reduces to (26). Therefore, to prove
(28) it suffices to treat the case when m and n are powers of the same prime p.
First we consider the case m = p and n = pr, where r ~ 1. In this case we
are to prove that
(29)
We use the representation in (17) and note that the divisors of pr have the
form pr where 0 s t S r. Hence we have

(30) {T(pr).f}(r) = p-r L p(r_r)k!(pr-rrr+ br).


Osr9 P
OSb,<p'

By (14) we have

{T(p)g }(r) = pk-lg(pr) + p- /i g(r +P b),


b=O
1

so when we apply T(P) to each member of (30) we find

{T(p)T(pr)!}(r)=pk-l-r L p(r-r)k! ( Pr+l-r ~ + Pb)


r
OsrSr P
Osb,<p'

+ p- 1- r L L ! (r-r
p(r - r)k p-l p r +r +b ~ + bp . r)
OsrSr b=O P
OSb,<p'

In the second sum the linear combination br + bpr runs through a complete
residue system mod pr+ 1. Since r - t = (r + 1) - (t + 1) the second sum,
together with the term t = 0 from the first sum, is equal to {T(pr+ 1).f} (r). In
the remaining terms we cancel a factor p in the argument of f, then transfer
the factor pk to each summand to obtain

{T(p)T(pr).f}(r) = {T(pr+l).f}(r) + p-l-r L p(r+l_r)k!(pr-r~_~ br).


1 srSr P
osb,<p'

127
6: Modular forms with multiplicative coefficients

Dividing each hI by pl-l we can write


hI = qlpl-l + r"
where 0 ::;; r l < pt-l and ql runs through a complete residue system mod p.
Since f is periodic with period 1 we have
pr-tt + hI) = (pr-I t + rl)
f( P t-l f 1
P 1 '

so as q, runs through a complete residue system mod p each term is repeated


p times. Replacing the index t by t - 1 we see that the last sum is pk-l times
the sum defining {T(pr-l)f}(t). This proves (29).
Now we consider general powers of the same prime, say m = pS and
n = pro Without loss of generality we can assume that r ::;; S. We will use in-
duction on r to prove that

(31) T(pr)T(ps) = Lr P'(k-l)T(pr+ S - 21) = L r+s)


dk- 1 T ( ~
1=0 dl(p'.pS) d
for all r and all s ~ r. When r = 1, (31) follows for all s ~ 1 from (29).
Therefore we assume that (31) holds for r and all smaller powers and all
s ~ r, and prove it also holds for r + 1 and all s ~ r + 1.
By (29) we have
T(p)T(pr)T(ps) = T(pr+l)T(pS) + pk-lT(pr-l)T(pS),
and by the induction hypothesis we have
r
T(p)T(pr)T(ps) = LP'(k-l)T(p)T(pr+s-21).
1=0

Equating the two expressions, solving for T(pr + 1 )T(pS) and using (29) in the
sum on t we find
r r
T(pr+l)T(ps) = Lpl(k-l)T(pr+S+l-U) + LP(I+l)(k-l)T(pr+s-I-2r)
1=0 1=0

_ pk-1T(pr-l)T(ps).

By the induction hypothesis the last term cancels the second sum over t
except for the term with t = r. Therefore
r
T(pr+l)T(pS) = Lpt(k-l)T(pr+s+1-21) + p(r+1)(k-l)T(ps-l-r)
1=0

r+ 1
= L p,(k-l)T(pr+l+S-U).
1=0

This proves (31) by induction for all r and all s ~ r, and also completes the
proof of (28). 0
128
6.11: Eigenfunctions of Hecke operators

6.11 Eigenfunctions of Hecke operators


In Theorem 6.6 we proved that if f E Mk and has the Fourier expansion
00

(32) f(r) = L c(m)xm,


m=O

where x = e Z7ti r, then Tnf has the Fourier expansion


00

(33) (T"f)(r) = L Yn(m)x m,


m=O

where

(34) ~ dk-l c(mn)


Yn(m) = '\' ([2 .
dl(n,m)

When m = 0 we have (n, 0) = n so the constant terms of f and T"f are


related by the equation
(35) Yn(O) = L dk-1C(0) = O"k-l(n)c(O)
din

for all n ~ 1. Similarly, when m = 1 we find


(36) Yn(l) = c(n)
for all n ~ 1.
The sum on the right of (34) resembles that which occurs in the multi-
plicative property of Ramanujan's function r(n) and the divisor functions
O"~(n). These examples suggest we seek those formsffor which the transformed
function T" f has Fourier coefficients
(37) Yn(m) = c(n)c(m)
since this would imply the multiplicative property

c(n)c(m) = L dk-lC(;~).
dl(n,m)

The relation (37) is equivalent to the identity


T"I = c(n)f
for all n ~ 1. A nonzero function f satisfying a relation of the form
(38) T"f = ).(n)f
for some complex scalar ).(n) is called an eigenfunction (or eigenform) of the
operator T", and the scalar ).(n) is called an eigenvalue of T". If f is an
eigenform so is cf for every c # O.

129
6: Modular forms with multiplicative coefficients

EXAMPLES. If a linear operator T maps a I-dimensional function space V


into itself, then every nonzero function in Vis an eigenfunction of T. Formula
(9) shows that
dim Mk =1 if k = 4,6,8, 10 and 14,
so each Hecke operator 1',. has eigenforms in M k for each of these values of k.
For example, the respective Eisenstein series G4 , G6 , G8 , G 10 and G 14 are
eigenforms for each 1',..
Similarly, formula (11) implies that
dim M k • O = 1 if k = 12,16,18,20,22 and 26,
so each 1',. has eigenforms in M k. 0 for each of these values of k. The respective
cusp forms~, ~G4' ~G6' ~G8' ~G10 and ~G14 are eigenforms for each 1',..

If f is an eigenform for every Hecke operator 1',., n ;;::: I, then f is called a


simultaneous eigenform. All the examples just mentioned are simultaneous
eigenforms.

6.12 Properties of simultaneous eigenforms


Theorem 6.14. Assume k is even, k ;;::: 4. If the space Mk contains a simultaneous
eigenform f with Fourier expansion (32), then c(1) =f. O.

PROOF. The coefficient of x in the Fourier expansion of T"f is i'n(l) = c(n).


Since f is a simultaneous eigenform this coefficient is also equal to A(n)c( I), so
c(n) = A(n)c(l)
for all n ;;::: 1. If c(1) = 0 then c(n) = 0 for all n ;;::: 1 and fer) = c(O). But then
c(O) = 0 since k ;;::: 4, hence f = 0, contradicting the definition of eigenform.
This proves that c( I) =f. O. D

An eigenform with c(l) = I is said to normalized. If Mk contains a simul-


taneous eigenform then it also contains a normalized eigenform since we
can always make c(l) = 1 by multiplying f by a suitable nonzero constant.
It is easy to characterize all cusp forms which are simultaneous eigenforms.
Since the zero function is the only cusp form of weight < 12 we need consider
only k ;;::: 12.

Theorem 6.15. Assume f E M k. 0 where k is even, k ;;::: 12. Theil f is a simul-


taneous normalized eigenform if; and only if; the coefficiellts ill the Fourier
expansion (32) sati~ry the multiplicative property

(39) c(m)c(n) = L dk - 1 c(nd1; )


dl(n.m)

for allm ;;::: I, n ;;::: I, ill which case the coeffiCient c(n) is an eigenvalue of T".

130
6.13: Examples of normalized simultaneous eigenforms

PROOF. The equation T,J = },(n)f is equivalent to the relation

(40) 'Yn(m) = A(n)c(m)

obtained by equating coefficients of xnt in the corresponding Fourier expan-


sions. Since f is a cusp form so is T" f hence (40) is to hold for all m ;:::; 1
and n ;:::; 1. Now 'Yn(l) = c(n) so (40) implies A(n) = c(n) if c(1) = 1, and
hence 'Yim) = c(n)c(m). On the other hand, Equation (34) shows that (40)
is equivalent to (39) if c(1) = 1. Therefore f is a normalized simultaneous
eigenform if, and only if, (39) holds for all m ;:::; 1, n ;:::; 1. 0

6.13 Examples of normalized simultaneous


eigenforms
The discriminant l1 is a cusp form with Fourier expansion
oc
l1( r) = (2n)12 L r(m)xm
m=l

where r(1) = 1. Therefore (2n) - 12l1( r) is a normalized eigenform for each


Tn with corresponding eigenvalue r(n). This also proves that Ramanujan's
function r(ll) satisfies the mUltiplicative property in (3).
The next theorem shows that the only simultaneous eigenforms in M 2k
which are not cusp forms are constant multiples of the Eisenstein series G2k .

Theorem 6.16. Assume that I E M 2" where k ;:::; 2, alld that I is Ilot a cusp
form. Theil I is a normalized simultaneous eigen{orm if, alld only if,
. (2k - 1)!
(41 ) fir) = 2(2ni)2k G2k(r).

PROOF. In the Fourier expansion (32) we have c(O) =f. 0 since f is not a cusp
form. The relation
(42) T,J = A(n)f

is equivalent to the relation


(43) 'Yn(m) = A(n)c(m)

obtained by equating coefficients of xnt in the corresponding Fourier expan-


sions. When m = 0 this becomes
'Yn(O) = A(n)c(O).

On the other hand, (35) implies )In(O) = 0' 2k _ 1(1l)c(0) smce f EM 2k' But
dO) =f. 0, so Equation (42) holds if, and only if,

A(n) = 0'2k-l(Il).

131
6: Modular forms with multiplicative coefficients

Using this in (43) we find that


'Yn(m) = (J" 2k - 1(n)c(m).
When m = 1 this relation, together with (36), gives us
c(n) = (J"2k-l(n)c(I).
Therefore,f is a normalized simultaneous eigenform in M 2k if, and only if,
c(n) = (J"2k-l(n)
for all n ~ 1. Since the Eisenstein series G 2k has the Fourier expansion
2(2ni)2k 00
G 2k(r) = 2(2k) + (2k _ I)! m~1(J"2k-l(m)Xm,

the function in (41) is normalized and its Fourier expansion is given by


(2k - 1)!
(44) f(r) = (2 ')2k (2k)
nl
+m=L1(J"2k-l(m)x
00 m
. o
Note. Since
Y(2k) ( k+ 1 (2n)2k
.. = -1) 2(2k)! B2k

where Bk is the kth Bernoulli number defined by


x ~ Bk k
--X::::-l
e
= L.
k=O
-k' x ,

the constant term in (44) is equal to - B2k/(4k). (See [4], Theorem 12.17.)
We can also write

Since the eigenvalue A,(n) in (42) is (J"2k-l(n), Theorem 6.16 shows that the
divisor functions (J"in) satisfy the multiplicative property in Equation (4)
when IX = 2k - 1. Actually, they satisfy (4) for all real or complex IX, but
(J"Cl(n) is the nth coefficient of an entire form only when IX is an odd integer ~ 3.

EXAMPLES. The problem of determining all entire noncusp forms whose


coefficients satisfy the multiplicative property (39) has been completely
settled by Theorem 6.16. For the cusp forms the problem has been reduced
by Theorem 6.15 to that of determining simultaneous normalized eigenforms
of even weight 2k ~ 12. We have already noted that the function (2n)-12.1(r)
is the only simultaneous normalized eigenform of weight 2k = 12. Also
there is exactly one simultaneous normalized eigenform for each of the
weights
2k = 16,18,20,22, and 26

132
6.14: Remarks on existence of simultaneous eigenforms in M 2k. 0

since dim M 2k. 0 = 1 for these weights. The corresponding normalized


eigenforms are given by

(2 ) - 12 ~(T) . G2k -( )
12 T
00
'\' ()
n{ 1 2(2k - 12) '\'
00
( ) m}
1! 2((2k-12) n~ITnx - B 2k - 1Z mf~\O"Zk-13mx .

We define T(O) = 0 and 0"2k-l(0) = - B2J(4k). Then the coefficients c(n) of


these eigenforms are given by the Cauchy product
4k - 24
L T(m)O"Zk-13(n
n
c(n) = - B - m).
Zk-12 m=O
They satisfy the multiplicative property

c(m)c(n) = L d2k-1C(~;)
dl(m.n)

for all m 2: 1, n 2: 1.

6.14 Remarks on existence of simultaneous


eigenforms in M 2k,O
Let K = dim M Zk. 0 where 2k 2: 12. Then we have

K = lG~J - 1 if2k == 2 (mod 12)

[~~ ] if 2k 1= 2 (mod 12).

Let e(k) denote the number of linearly independent simultaneous eigenforms


in M Zk. o. Clearly, e(k) ~ K. We have shown that e(k) = 1 when K = 1.
Hecke showed that e(k) = 2 when K = 2, and later Petersson [32] showed
that e(k) = K in all cases. He did this by introducing an inner product
(f, g) in M 2k. 0 defined by the double integral

(f,g) = SS f(T)g(T)V 2k - Zdudv


Rr

extended over the fundamental region Rr in the T = u + iv plane. Relative


to the Petersson inner product the Hecke operators are Hermitian, that is,
they satisfy the relation
(T"f, g) = (f, T"g)
for any two cusp forms in M 2k. o. Therefore, by a weIl known theorem of
linear algebra (see [2], Theorem 5.4) for each T" there exist K eigenforms
which form an orthonormal basis for M Zk. o. These need not be simultaneous
eigenforms for all the T". However, since the T" commute with each other,
another theorem of linear algebra (see [10], Ch. IX, Sec. 15) shows that

133
6: Modular forms with multiplicative coefficients

M 2k, 0 has an orthonormal basis consisting of K simultaneous eigenforms,


Each of these can be multiplied by a constant factor to get a new basis of
simultaneous normalized eigenforms. (The new basis will be orthogonal
but need not be orthonormal.) Since the Tn are Hermitian, the corresponding
eigenvalues are real. Details of the proofs of these statements can be found
in references [32], [26], or [11].

6.15 Estimates for the Fourier coefficients of


entire forms
Assume f is an entire form with Fourier expansion
ex;

(45) f(,) = L c(n)xn,


n=O

where x = e Write, = u + iv so that x = e-21tve21tiu. For fixed v> 0,


21tir •

as U varies from 0 to 1 the point x traces out a circle C(v) of radius e - 21tv
with center at x = O. By Cauchy's residue theorem we have

(46) c(n) 1.
= -2
m
fC(v)
~~~
x
dx = flf(U + iv)x- n duo
0

We shall use this integral representation to estimate the order of magnitude


of Ic(n) I· First we consider cusp forms of weight 2k.

Theorem 6.17. Iff E M 2k . o we have


c(n) = O(nk).

PROOF. The series in (45) converges absolutely if Ix I < 1. Since c(0) = 0 we


can remove a factor x and write

If(,)1 = IXIIJ1c(n)xn-11::s: IXIJ11c(n)llxln-1.


If, is in R r , the fundamental region of r, then, = u + iv with v ~ fi/2
> 1/2, so Ixl = e- hv < e-". Hence
I f(,)1 ::s: A Ixl = Ae- 21tv
where

= L Ic(n)le-(n-l)1t.
C1j

A
n=1

This implies
(47)

Now define
g( ,) = 11, - i I= v
134
6.15: Estimates for the Fourier coefficients of entire forms

iftEH. Then
g(At) = let + dl- 2g(t)
if A = (: ~) E r, so gk(At) = let + dl- 2kgk(t). Therefore the product
tp(t) = If(t)lgk(t) = If(t)lv k
is invariant under the transformations of r. Moreover, tp is continuous
in R r , and (47) shows that tp(t) -+ 0 as v -+ + 00. Therefore tp is bounded
in Rr and, since tp is invariant under r, tp is also bounded in H, say
Itp(t) I ~ M
for all t in H. Therefore

If(t)1 ~ Mv- k
for all t in H. Using this in (46) we find

le(n)1 ~ f If(u + iv)x-"I du ~ Mv-klxl-" = Mv- ke2,....

This holds for all v > O. When v = lin it gives us


le(n)1 ~ Mn ke21t = O(nk). o
Theorem 6.18. Iff EM 2k and f is not a eusp form, then

(48) e(n) = O(n 2k - 1).

PROOF. If f = G2k each coefficient c(n) is of the form a0"2k-l(n) where a is


independent of n. Hence

Now
2k-l locI
_
0"2k-l(n) - L
din
(n)
d -_ n
2k - 1
L d 2k-l ~ n 2k - L d 2k -
din
1

d= 1
_
1 - O(n
2k - 1
),

so (48) holds if f = G2k .


For a general noncusp form in M 2k, let il = f(ioo)/G 2k(ioo). Thenf - ilG 2k
is a cusp form so
f = ilG 2k +g
where gEM 2k, o. The Fourier coefficients of f are the sum of those of ilG 2k
and g so they have order of magnitude
O(n 2k - 1 ) + O(nk) = O(n 2k - 1 ). 0
135
6: Modular forms with multiplicative coefficients

Note. For cusp forms, better estimates for the order of magnitude of the
c(n) have been obtained by Kloosterman, Salie, Davenport, Rankin, and
Selberg (see [46]). It has been shown that
c(n) = O(n k- O / 4 )+<)

for every 8 > 0, and it has been conjectured that the exponent can be further
improved to k - ! + 8. For the discriminant d, Ramanujan conjectured the
sharper estimate
Ir(p)1 :s;; 2pll/2

for primes p. This was recently proved by P. Deligne [7].

6.16 Modular forms and Dirichlet series


Hecke found a remarkable connection between each modular form with
Fourier series

L c(n)e2rrint
00

(49) f(r) = c(O) +


n= 1

and the Dirichlet series

(50) ( ) _ ~ c(n)
cps-L..,.-s
n= 1 n

formed with the same coefficients (except for c(O)). If f EM 2k then c(n) =
O(nk) if fis a cusp form, and c(n) = O(n 2k - 1) iff is not a cusp form. Therefore,
the Dirichlet series in (50) converges absolutely for (1 = Re(s) > k + 1 if f is
a cusp form, and for (1 > 2k if f is not a cusp form.

Theorem 6.19. If the coefficients c(n) sati~fy the multiplicative property

(51) c(m)c(n) = L d2k-1C(;~)


dl(m,n)

the Dirichlet series will have an Euler product representation of the form

(52) cp(s) = Q1 _ c(P)p s


1
+ p2k Ip 2s'

absolutely convergent with the Dirichlet series.


PROOF. Since the coefficients are multiplicative we have (see [4], Theorem
11.7)

(53)

136
6.16: Modular forms and Dirichlet series

whenever the Dirichlet series converges absolutely. Now (51) implies

for each prime p. Using this it is easy to verify the power series identity

(1 - c(P)x + p2k-1 X2)( 1 + JI c(P")X") = 1

for all Ix I < 1. Taking x = p - S, we find that (53) reduces to (52). 0

EXAMPLE. For the Ramanujan function we have the Euler product represen-
tation

f r(n} = n 1 -
"=1 n p r(p)p
1
S + pll 2s

for (j > 7 since r(n) = O(n 6 ).

Hecke also deduced the following analytic properties of q>(s).

Theorem 6.20. Let q>(s) be the function defined for (j > k by the Dirichlet
series (50) associated with a modular form f(r) in Mk having the Fourier
series (49), where k is an even integer 2::: 4. Then q>(s) can be continued
analytically beyond the line (j = k with the following properties:
(a) If c(O) = 0, q>(s) is an entire function of s.
(b) If c(O) =F 0, q>(s) is analytic for all s except for a simple pole at s = k
with residue
( -1)k/2c(0)(271:Jk
r(k)

(c) Thefunction q> satisfies the functional equation

(2ntsr(s)q>(s) = (-1)k/2(2n)S-kr(k - s)q>(k - s).

PROOF. From the integral representation for r(s) we have

if (j > O. Therefore if (J > k we can multiply both members by c(n) and sum
on n to obtain

(2n)-sr(s)q>(s) = fooo {f(iy) - C(O)}y'-l dy.

137
6: Modular forms with multiplicative coefficients

f:
Since f is a modular form in Mk we have f(ijy) = (iy)kf(iy) so

(2n)-sr(s)cp(s) = {" {f(iy) - C(O)}y'-l dy + {(iy)-k fG) - c(O)}y·-l dy

= foo {f(iy) - C(O)}y'-l dy + i- k foo f(iw)W k- s - 1 dw _ c(O)


l i S

= {OO {f(iy) _ c(O)}y·-l dy

OO
+ (_1)k/2 fl {f(iw) - c(0)}W k - S- 1 dw

+ (_I)k/2c(0) fOO Wk- s - I dw _ c(O)


1 S

= fOO {f(iy) - c(O)}(y' + (-ll/2/-S) dy


1 y
1 (_I)k/2)
-c(O) ( s+~.

Although this last relation was proved under the assumption that (J > k, the
right member is meaningful for all complex s. This gives the analytic continua-
tion of cp(s) beyond the line (J = k and also verifies (a) and (b). Moreover,
replacing s by k - s leaves the right member unchanged except for a factor
(_I)k/2 so we also obtain (c). 0

Heeke also proved a converse to Theorem 6.20 to the effect that every
Dirichlet series cp which satisfies a functional equation of the type in (c),
together with some analytic and growth conditions, necessarily arises from
a modular form in M k • For details, see [15].

Exercises for Chapter 6


Exercises 1 through 6 deal with arithmetical functions f satisfying a relation
of the form

(54) f(m)f(n) = 1: a(d)f(:~)


dl(m,n)

for all positive integers m and n, where a is a given completely multiplicative


function (that is, a(l) = 1 and a(mn) = a(m)a(n) for all m and n). An arith-
metical function satisfying (54) will be called a-multiplicative. We write
f = 0 if f(n) = 0 for all n.
1. Assume f is (X-multiplicative and f oF O. Prove that f(1) = l. Also prove that cf is
(X-multiplicative if, and only if, c = 0 or c = 1.

138
Exercises for Chapter 6

2. If I and 9 are IX-multiplicative, prove that I +9 is IX-multiplicative if, and only if,
! = 0 or 9 = O.
3. Let II' ... , Ik be k distinct nonzero IX-multiplicative functions. If a linear combination

i= 1

is also IX-multiplicative, prove that:


(a) The functions II' ... ,Ik are linearly independent.
(b) Either all the C; are 0 or else exactly one of the C; is I and the others are O. Hence
either I = 0 or I = I; for some i. In other words, linear combinations of iX-
multiplicative functions are never IX-multiplicative except for trivial cases.

4. If I is IX-multiplicative, prove that

lX(n)!(I11) = I J1(d)!(l11nd)!(~).
dl" d
S. If I is multiplicative, prove that f is IX-multiplicative if, and only if,

(55)

for all primes p and all integers k ;::0: I.


6. The recursion relation (55) shows that I(p") is a polynomial in I(p), say
f(P") = Q,,(f(p)).
The sequence {Q,,(x)} is determined by the relations
QI(X) = x. Q2(X) = Xl - lX(p), Q,+ tfx) = xQ,(x) - lX(p)Q,_I(X) for r ;::0: 2.

Show that
Q,,(21X(p)I/1X) = lX(p),,/l U ,,(x),
where U ,,(x) is the Chebyshev polynomial of the second kind, defined by the relations
UI(x) = 2x, Ul(x) = 4Xl - I, U d1 (X) = 2xU,(x) - U,_I(X) for r;::O: 1.

7. Let E 2k(,) = -!G lk (,)/((2k). If x = e 2n ;, verify that the Fourier expansion of E lk (,)
has the following form for k = 2, 3, 4, 5, 6, and 7:

E 4 (,) = I + 240 I0"3(n)x",


,.,=1

n=:1

Cf)

E 8 (,) = I + 480 I0"7(n)x",


,.,=1
CfC

E 1O (,) = 1 - 264 I0"9(n)x",


n=1

65520 cx·
Ed,) = 1 + - - IO"II(n)x",
691 ,,~I
oc
E I4 (,) = I - 24 IO"13(n)x".
,.,=1

139
6: Modular forms with multiplicative coefficients

Derive each ofthe identities in Exercises 8, 9, and 10 by equating coefficients


in appropriate identities involving modular forms.
n-I

8. 0"7(n) = 0"3(n) + 120 I 0"3(m)0"3(n - m).


m=l

n-I

9. 110"9(n) = 210"5(n) - 100"3(n) + 5040 I 0"3(m)0"5(n - m).


m::;;:l

65 691 691
I
n-I
10. r(n) = - O"l1(n) +- 0"5(11) - - 0"5(m)0"5(n - m).
756 756 3 m~1

Show that this identity implies Ramanujan's congruence

r(ll) == 0" 1 dll) (mod 691).


11. Prove that the products Gk_11rllr which occur in Theorem 6.3 are linearly inde-
pendent.

12. Prove that the products G/G 6 b are linearly independent, where a and b are non-
negative integers such that 4a + 6b = k.

13. Show that the Dirichlet series associated with the normalized modular form

(2k - l)! ~ ( ) 1nimr


f(r) = .2k ((2k) + L.,O"2k- l me
(2m) m~ 1

is cp(s) = ((5)((S +1- 2k).

14. A quadratic polynomiall - Ax + Bx 2 with real coefficients A and B can be factored


as follows:
1 - Ax + Bx 2 = (1 - r\x)(l - r1x).
Prove that 1'\ = :x + ifJ and 1'2 = }' - ifJ, where :x, p, I' are real and fJ(1' - :x) = O.
Hence, if fJ "" 0 the numbers 1'1 and /'2 are complex conjugates.

Note. For the quadratic polynomial occurring in the proof of Theorem


6.19 we have

where

and
Peters son conjectured that /'1 and /'1 are always complex conjugates. This
implies

and

When c(n) = r(n) this is the Ramanujan conjecture. The Petersson conjecture
was proved recently by Deligne [7].

140
Exercises for Chapter 6

15. This exercise outlines Riemann's derivation of the functional equation

(56) n- S/ 2rG}(s) = n(S-I)/ 2rC; s)w - s)


from the functional equation (see Exercise 4.1)

(57) '9( ~ 1) = (-ir)I/2.9(r)

satisfied by Jacobi's theta function


ex)

,9('r) = 1 + 2 Lenin".
"=1
(a) If (J > 1 prove that

n-S/2r(~)n-s = 1°Oe-nn'xxs/2-1 dx

and use this to derive the representation

n- 2rG),(s)
S/ = 1 00
t/I(x)x s/2 - I dx,

where 2t/1(x) = 3(x)-1.


(b) Use (a) and (57) to obtain the representation

n-S/2r(:),(s) = __ 1_ + foo(XS/2-1 + X(I-s)/2-1)t/I(x)dx


2 s(s - 1) 1

for (J > 1.
(c) Show that the equation in (b) gives the analytic continuation of '(s) beyond the
line (J = 1 and that it also implies the functional equation (56).

141
7
Kronecker's theorem
with applications

7.1 Approximating real numbers by rational


numbers
e
Every irrational number can be approximated to any desired degree of
accuracy by rational numbers. In fact, if we truncate the decimal expansion
of e after n decimal places we obtain a rational number which differs from
e by less than 10-". However, the truncated decimals might have very large
denominators. For example, if
e= 11: - 3 = 0.141592653 ...
the first five decimal approximations are 0.1, 0.14, 0.141, 0.1415, 0.14159.
Written in the form alb, where a and b are relatively prime integers, these
rational approximations become
1 7 141 283 14159
10' 50' 1000' 2000' 100,000·
On the other hand, the fraction 1/7 = 0.142857 ... differs from eby less than
2/1000 and is nearly as good as 141/1000 for approximating e, yet its denomi-
nator 7 is very small compared to 1000.
This example suggests the following type of question: Given a real
e,
number is there a rational number h/k which is a good approximation to
e but whose denominator k is not too large?
This is, of course, a vague question because the terms" good approxima-
tion" and "not too large" are vague. Before we make the question more
precise we formulate it in a slightly different way. If e - h/k is small, then
(ke - h)/k is small. For this to be small without k being large the numerator
ke - h should be small. Therefore, we can ask the following question:

142
7.2: Dirichlet's approximation theorem

e
Given a real number and given 6 > 0, are there integers hand k such that
Ike - hi < 6?
The following theorem of Dirichlet answers this question in the affirma-
tive.

7.2 Dirichlet's approximation theorem


Theorem 7.1. Given any real e and any positive integer N, there exist integers
hand k with 0 < k :::; N such that
1
(1) Ike - hi <-
N'

PROOF. Let {x} = x - [x] denote the fractional part of x. Consider the
N + 1 real numbers
0, {e}, {2e}, ... , {Ne}.
All these numbers lie in the half open unit interval 0 :::; {me} < 1. Now
divide the unit interval into N equal half-open subintervals of length liN.
Then some subinterval must contain at least two of these fractional parts,
say rae} and {be}, where 0 :::; a < b :::; N. Hence we can write
1
(2) I{be} - rae} I < N'

But
{be} - rae} = be - [be] - ae + Cae] = (b - a)() - ([be] - Cae]).
Therefore if we let
k=b-a and h = [be] - Cae]
inequality (2) becomes
1
Ike - hi < N' with 0 < k :::; N.

This proves the theorem. o


Note. Given 6> 0 we can choose N > 1/6 and (1) implies Ike - hi < l:.

The next theorem shows that we can choose hand k to be relatively


prime.

Theorem 7.2. Given any real e and any positive integer N, there exist relatively
prime integers hand k with 0 < k :::; N such that
1
Ike - hi < N'

143
7: Kronecker's theorem with applications

PROOF. By Theorem 7.1 there is a pair h', k' with 0 < k' ::::; N satisfying

(3) If) - k'h'l < 1


Nk"

Let d = (h', k'). If d = 1 there is nothing to prove. If d > 1 write h' = hd,
k' = kd, where (h, k) = 1 and k < k' ::::; N. Then 11k' < 11k and (3) becomes

hili
If) - k <
Nk' < Nk'

from which we find Ikf) - hi < liN. o


Now we restate the result in a slightly weaker form which does not involve
the integer N.

Theorem 7.3. For every real f) there exist integers hand k with k > 0 and
(h, k) = 1 such that

PROOF . In Theorem 7.2 we have I/(Nk) ::::; l/k 2 because k ::::; N. o


Theorem 7.4. Iff) is real, let S(f)) denote the set of all ordered pairs of integers
(h, k) with k > 0 and (h, k) = 1 such that

If) - ~I < :2'


Then S(f)) has the following properties:
(a) S(f)) is nonempty.
(b) Iff) is irrational, S(f)) is an infinite set.
(c) When S(f)) is infinite it contains pairs (h, k) with k arbitrarily large.
(d) If f) is rational, S(f)) is a finite set.

PROOF. Part (a) is merely a restatement of Theorem 7.3. To prove (b), assume
f) is irrational and assume also that S(f)) is finite. We shall obtain a contra-
diction. Let

rJ. = mm . If) -
(h, k) E S(9)
hi
-.
k
Since f) is irrational, rJ. is positive. Choose any integer N > l/rJ., for example,
N = 1 + [1/rJ.]. Then liN < rJ.. Applying Theorem 7.2 with this N we obtain
a pair of integers hand k with (h, k) = 1 and 0 < k ::::; N such that

If) - ~I < k~'


144
7.2: Dirichlet's approximation theorem

Now 1/(kN) ~ l/k2 so the pair (h, k) E S(8). But we also have
1 1
-<-<01 so
kN - N '
contradicting the definition of 01. This shows that S(8) cannot be finite if 8 is
irrational.
To prove (c) assume that all pairs (h, k) in S(8) have k ~ M for some M.
We will show that this leads to a contradiction by showing that the number
of choices for h is also bounded. If (h, k) E S(8) we have
1
1 k8 - hi < k~ 1,

so
1h 1= 1h - k8 + k8 1~ 1h - k81 + 1k8 1< 1 + 1k8 1~ 1 + MIG I·
Therefore the number of choices for h is bounded, contradicting the fact that
S(8) is infinite.
To prove (d), assume 8 is rational, say G = alb, where (a, b) = 1 and b > O.
Then the pair (a, b) E S(8) because 8 - alb = O. Now we assume that S(G)
is an infinite set and obtain a contradiction. If S(8) is infinite then by part (c)
there is a pair (h, k) in S(G) with k > b. For this pair we have

o < I~b ~I
k - ~
< k2 '

from which we find 0 < 1ak - bh 1 < b/k < 1. This is a contradiction because
ak - bh is an integer. 0

Theorem 7.4 shows that a real number 8 is irrational if, and only if,
there are infinitely many rational numbers h/k with (h, k) = 1 and k > 0
such that

18 - k~I < ~k 2'

This inequality can be improved. It is easy to show that the numerator 1


t
can be replaced by (see Exercise 7.4). Hurwitz replaced t by a smaller
constant. He proved that 8 is irrational if, and only if, there exist infinitely
many rational numbers h/k with (h, k) = 1 and k > 0 such that

18 -;1 < }sk 2'

Moreover, the result is false if 1/)5 is replaced by any smaller constant. (See
Exercise 7.5.) We shall not prove Hurwitz's theorem. Instead, we prove a
theorem of Liouville which shows that the denominator k 2 cannot be re-
placed by k 3 or any higher power.
145
7: Kronecker's theorem with applications

7.3 Liouville's approximation theorem


Theorem 7.5. Let e be a real algebraic number of degree n ~ 2. Then there is a
positive constant C(e), depending only 011 e, such that for all integers hand
k with k > 0 we have

(4) Ie - ~I
k
> C(e)
kn '

e
PROOF. Since is algebraic of degree n, e is a zero of some polynomial f(x)
of degree 11 with integer coefficients, say

where f(x) is irreducible over the rational field. Since f(x) is irreducible it
has no rational roots so f(h/k) #- 0 for every rational h/k.
Now we use the mean value theorem of differential calculus to write

(5) fG) = fG) - f(e) = f'(~)(~ - e).

e
where ~ lies between and h/k. We wilJ deduce (4) from (5) by getting an upper
bound for 1f'(~)1 and a lower bound for If(h/k)l. We have

(h) n
f k = r ~o ar k = kn
(h)r N
where N is a nonzero integer. Therefore

(6)

which is the required lower bound. To get an upper bound for If'(~) I we let

d= Ie - H
If d > 1 then (4) holds with C(e) = 1, so we can assume that d < 1. (We
e
cannot have d = 1 since is irrationa1.) Since ~ lies between and h/k and e
e
d < 1 we have I~ - I < 1 so
I~I = Ie + ~ - el s; lei + I~ - el < lei + 1.
Hence
1f'(~)1 s; A(e) < 1 + A(e),
where A(e) denote the maximum value of I f'(x) I in the interval Ix I S; Ie I + 1.
Using this upper bound for If'(~) I in (5) together with the lower bound in (6)
we obtain (4) with C(e) = 1/(1 + A(e)). D

146
7.3: Liouville's approximation theorem

A real number which is not algebraic is called transcendental. A simple


counting argument shows that transcendental numbers exist. In fact, the
set of all real algebraic numbers is countable, but the set of all real numbers
is uncountable, so the transcendental numbers not only exist but they form
an uncountable set.
It is usually difficult to show that some particular number such as e or 11:
is transcendental. Liouville's theorem can be used to show that irrational
numbers that are sufficiently well approximated by rationals are necessarily
transcendental. Such numbers are called Liouville numbers and are defined
as follows.
Definition. A real number () is called a Liouville number if for every integer
r ~ 1 there exist integers hr and kr with kr > 0 such that

(7)

Theorem 7.6. Every Liouville number is transcendental.


PROOF. If a Liouville number () were algebraic of degree n it would satisfy
both inequality (7) and

I() - hrkr I> C(k,"(})


for every r ~ 1, where C((}) is the constant in Theorem 7.5. Therefore
C«(}) 1 1
0< k" < 0' or 0 < C((}) < k
r
r-n·
r r

The last inequality gives a contradiction if r is sufficiently large. 0

EXAMPLE. The number


'"
() = L
m=l
10m!

is a Liouville number and hence is transcendental. In fact, for each r ~ 1 we


can take kr = lOr! and
r 1
hr = kr L 10m!.
m=l

Then we have
h '" 1 1 "'I
0< () - k r =
r
L
m==r+l
10m! ~ lO(r+l)! ~ 10m
m-O

10/9 1 10/9 1
- - - - - - - - <k:
- 10(r+ i)! - k: lor!
-

so (7) is satisfied.
147
7: Kronecker's theorem with applications

°
Note. The same argument shows that 2:~=1 amlO- m! is transcendental if
am = or 1 and am = 1 for infinitely many m.

We turn now to a generalization of Dirichlet's theorem due to Kronecker.

7.4 Kronecker's approximation theorem:


the one-dimensional case
Dirichlet's theorem tells us that for any real (J and every
integers x and y, not both 0, such that
I': > °
there exist

l(Jx + yl < 1':.

In other words, the linear form (Jx + y can be made arbitrarily close to by a °
suitable choice of integers x and y. If (J is rational this is trivial because we
can make (Jx + y = 0, so the result is significant only if (J is irrational.
Kronecker proved a much stronger result. He showed that if (J is irrational
the linear form (Jx + y can be made arbitrarily close to any prescribed real
number 0:. We prove this result first for 0: in the unit interval. As in the proof
of Dirichlet's theorem we make use of the fractional parts {n(J} = n(J - [n(J].

Theorem 7.7. If (J is a given irrational number the sequence of numbers {n(J}


is dense in the unit interval. That is, given any 0:,
> 0, there exists a positive integer k such that
I':
°S 0: S 1, and given any

I {k(J} - 0:1 < 1':.

Hence, if h = [k(J] we have Ik(J - h - 0: I< 1':.

Note. This shows that the linear form (Jx +y can be made arbitrarily
close to 0: by a suitable choice of integers x and y.
PROOF. First we note that {n(J} #- {m(J} if m #- n because (J is irrational.
Also, there is no loss of generality if we assume < (J < 1 since n(J =
n[(J] + n{(J} and {n(J} = {n{{}}}.
°
° °
Let I': > be given and choose any 0:, S 0: S 1. By Dirichlet's approxima-
tion theorem there exist integers hand k such that Ik(J - hi < 1':. Now
either k(J > h or k(J < h. Suppose that k(J > h, so that < {k(J} < 1':. (The °
argument is similar if k(J < h.) Now consider the following subsequence of
the given sequence {n(J}:
{k(J}, {2k(J}, {3k(J}, ....

We will show that the early terms of this sequence are increasing. We have
k(J = [k(J] + {k(J}, so mk(J = m[k(J] + m{k(J}.
Hence

{mk(J} = m{k(J} if, and only if, {k(J} < ~.


m
148
7.S: Extension of Kronecker's theorem to simultaneous approximation

Now choose the largest integer N which satisfies {kG} < liN. Then we have
1 1
- - < {kG} <-
N + 1 N·
Therefore {mkG} = m{kG} for m = 1,2, ... , N, so the N numbers
{kG}, {2kG}, ... , {NkG}
form an increasing equally-spaced chain running from left to right in the
interval (0, 1). The last member of this chain (by the definition of N) satisfies
the inequality
N
- - 1 < {NkG} < 1,
N+
or
1
1- N + 1< {NkO} < 1.

Thus {NkO} differs from 1 by less than 1/(N + 1) < {kO} < c. Therefore the
first N members of the subsequence {nkO} subdivide the unit interval into
subintervals of length < B. Since a lies in one of these subintervals, the
theorem is proved. 0

The next theorem removes the restriction °: ; a ::;; 1.

Theorem 7.8. Givell any real a, allY irratiollal 8, alld all.\' > 0, there exist
°
f.

integers hand k with k > such that


IkG - h- al < e.
PROOF.
I{kG}
Write a = [a] + {a}. By Theorem 7.7 there exists k >
- {a} I < e. Hence
° such that

IkG - [kG] - (a - [a])1 < e


or
IkG - ([kG] - [a]) - al < e.
Now take h = [kO] - [a] to complete the proof. o
7.5 Extension of Kronecker's theorem to
simultaneous approximation
We turn now to a problem of simultaneous approximation. Given n irrational
numbers 8 1 , G2 , ... , Gn , and n real numbers ai' a2, ... , an' and given e > 0,
we seek integers hi, h2' ... , hn and k such that
IkG i - hi - ad < e for i = 1,2, ... ,11.

149
7: Kronecker's theorem with applications

It turns out that this problem cannot always be solved as stated. For example,
suppose we start with two irrational numbers, say (Jl and 2(Jl, and two real
numbers 1X1 and 1X2' and suppose there exist integers hI, h2 and k such that
Ik(Jl - hI - 1X11 < f:
and
12k(J1 - h2 - 1X21 < f:.
Multiply the first inequality by 2 and subtract from the second to obtain
12hl - h2 + 21X1 - 1X21 < 3f:.
Since f:, 1X1 and 1X2 are arbitrary and hl, h2 are integers, this inequality cannot
in general be satisfied. The difficulty with this example is that (Jl and 2(Jl are
linearly dependent and we were able to eliminate (J 1 from the two inequalities.
Kronecker showed that the problem of simultaneous approximation can
always be solved if (Jl' ... , (In are linearly independent over the integers;
that is, if
n

L Ci(Ji = 0
i; 1

with integer multipliers Cl' ... , Cn implies Cl = ... = Cn = O. This restriction


is compensated for, in part, by removing the restriction that the (Ji be
irrational. First we prove what appears to be a less general result.

Theorem 7.9 (First form of Kronecker's theorem). If lXI' ••• , IXn are arbitrary
real numbers, if (Jl' ... , (In are linearly independent real numbers, and if
f: > 0 is arbitrary, then there exists a real number t and integers hI, ... , hn
such that
It(Ji - hi - IX;I < f: for i = 1,2, ... , n.

Note. The theorem exhibits a real number t, whereas we asked for an


integer k. Later we show that it is possible to replace t by an integer k, but
in most applications of the theorem the real t suffices.

The proof of Theorem 7.9 makes use of three lemmas.

Lemma 1. Let {A·n} be a sequence of distinct real numbers. For each real t
and arbitrary complex numbers co, ... , CN define
N
f(t) = L creiO.•.
r;O

Then for each k we have

Ck = lim -1 iT f(t)e- iO. dt. k

T~oo T °
150
7.5: Extension of Kronecker's theorem to simultaneous approximation

PROOF. The definition of f(t) gives us


N
f(t)e- i,J..k = L cre iO.r- J..k)'.
r=O

Hence

f Tf(t)e-i,J..k dt =
o
f
r=O
C r fTeiO.r-J..k)' dt
0
+ Ck T,
r*-k

from which we find

Now let T ...... 00 to obtain the lemma. o


Lemma 2. If t is real, let

(8) F(t) = 1 + L• e 2"i{!Or-a rl,


r= 1

where oc 1, ... , oc. and (}1,"" (). are arbitrary real numbers. Let
L= sup IF(t)l.
-oo<t<+oo
Then the following two statements are equivalent:
(a) For every e > 0 there exists a real t and integers h b ... , h. such that
It(}r - OCr - hrl < e forr = 1,2,oo.,n.
(b) L = n + 1.
PROOF. The idea of the proof is fairly simple. Each term of the sum in (8)
has absolute value 1 so IF(t) I ::; n + 1. If (a) holds then each number t(}r - OCr
is nearly an integer hence each exponential in (8) is nearly 1 so IF(t) I is nearly
n + 1. Conversely, if (b) holds then IF(t) I is nearly n + 1 for some thence
every term in (8) must be nearly 1 since no term has absolute value greater
than 1. Therefore each number t(}r - OCr is nearly an integer so (a) holds.
Now we transform this idea into a rigorous proof.
First we show that (a) implies (b). If (a) holds take e = 1/(2nk), where
k 2:: 1, and let tk be the corresponding value of t given by (a). Then
2n(t k (}r - ocr) differs from an integer multiple of 2n by less than 11k so
1
cos 2n(t k (}r - ocr) 2:: cos k'
Hence

151
7: Kronecker's theorem with applications

and therefore L 2:: IF(t k ) I 2:: I + n cos(l/k). Letting k --> CIJ we find L 2:: 11 + 1.
Since L .:::;; n + I this proves (b).
Now we assume (a) is false and show that (b) is also false. If (a) is false there
exists an e > 0 such that for all integers 11 b ... ,17 11 and all real t there is a k,
1 .:::;; k .:::;; n, such that
e
(9) Ite k - !Y.k - I1kl 2:: 2n'

(We can also assume that e .:::;; n/4 because if(a) is false for e it is also false for
every smaller e.) Let Xr = te r - !Y. r - hr. Then (9) implies 12nxk I 2:: e so the
point 1 + e2rrixk lies on the circle of radius 1 about 1 but outside the shaded
sector shown in Figure 7.1.

Figure 7.1

Now 11 + eiE I < 2 so 11 + eiE I = 2 - (5 for some (5 > O. Hence


11 + e2rrixk I .: :; 11 + e iE I = 2 - (5,
so

IF(t)1 = 11 + Jl ezrrixrl.:::;; 11 + ezrrixkl + Jlle2"ixrl


r*k

.:::;; (2 - (5) + (n - 1) = n + 1 - (5.


Since this is true for all t we must have L .:::;; n + 1 - (5 < n + 1, contra-
dicting (b). 0

Lemma 3. Let g = g(x l , ... , xn) be the polynomial in n variables given by

g = 1+ Xl + X z + ... + XII'
and write
(10) gP = 1 + "L arl, ... ,r l1
X 1 r, •.• X nrn ,

152
7.5: Extension of Kronecker's theorem to simultaneous approximation

where p is a positive integer. Then the coefficients art, .... ,,, are positive
integers such that
(11)

and the number of terms in (10) is at most (p + l)n.


PROOF. Since 1 + L art, .... ,,, = gP(l, 1, ... , 1) = (1 + n)P this proves (11).
Let 1 +N be the number of terms in (10). We shall prove that
(12) 1 + N ::; (p + l)n
by induction on n. For n = 1 we have

(1 + xd P= 1 + G)Xl + (~)X12 + ... + x/


and the sum on the right has exactly p + 1 terms. Thus (12) holds for n = 1.
If n > 1 we have
gp= {(1 +x 1 + ... +xn-d+xn}P
= (1 + Xl + ... + Xn -l)P + (n(1 + ... + x n _ly-l xn + ... + x/,

so if there are at most (p + l)n - 1 terms in each group on the right there will
be at most (p + l)n terms altogether. This proves (12) by induction. 0
PROOF OF KRONECKER'S THEOREM. Choosing F(t) as in Lemma 2 we have
n
F(t) = 1 + L e 21ti (rO,-a,l.
r= 1

By Lemma 2, to prove Kronecker's theorem it suffices to prove that


L= sup !F(t)!=n+1.
-oo<t<+oo

The pth power of F(t) is a sum of the type discussed in Lemma 1,


N
(13) f(t) = P(t) = 1 + L c,eir)."
,= 1
with ,1,0 = 1 and A, replaced by 211:(rl e 1 + ... + rnen) if r ~ 1. The numbers
e
A, are distinct because the i are linearly independent over the integers. The
coefficients c, in (13) are the integers a,t, .... ," of Lemma 3 multiplied by a
factor of absolute value 1. Hence (11) implies
N
(14) 1 + Lie,! = 1 + ~>rt ..... rn = (1 + nY·
r= 1

By Lemma 1 we have

(15) c, = lim -1 IT FP(t)e- . H ;., dt.


T~oo T 0

153
7: Kronecker's theorem with applications

Now IF(t) I :s;; L so IFP(t)l :s;; U for all t, hence

I~ {T FP(t)e- itAr dtl :s;; ~ {TU dt = U.

Hence (15) implies lerl :s;; U for each r, and (14) gives us
(1 + n)p :s;; (N + I)U :s;; (p + l)nu

by Lemma 3. Therefore

from which we find

10g(n ~ 1) :s;; ~ log(p + 1).


Now let p -. 00. The last inequality becomes log[(n + 1)/L] :s;; 0, so L ~
n + 1. But L:s;; n + 1 hence L = n + 1, and this proves Kronecker's
theorem. 0

The next version of Kronecker's theorem replaces the real number t by an


integer k.

Theorem 7.10 (Second form of Kronecker's theorem). If OC 1 , ••• , OC n are

°
arbitrary real numbers, iffJ 1 , ..• ,fJn , 1 are linearly independent real numbers,
and if e > is given, then there exists an integer k and integers m1, •. , mn
such that
IkfJ i - mi - ocd < e for i = 1,2, ... , n.

PROOF. We apply the first form of Kronecker's theorem to the system


°
oc 1 , •.• , OC n , and {Od, {02}, ... , {On}, 1, with el2 instead of e, where e < 1.
Then there exists a real t and integers hI> ... , hn+ 1 such that
e
It{fJ;} - hi - ocd <"2 for i = 1,2, ... , n

and

(16)

The last inequality shows that t is nearly equal to the integer hn+ 1. Take
k = hn + 1 • Then (16) implies
Ik{O;} - hi - (Xd = It{Oi} - hi - (Xi + (k - t){fJ i } I
:s;; It{Oi} - hi - (Xd + Ik - tl < e.
154
7.6: Applications to the Riemann zeta function

Hence, writing {Oi} = 0i - [OJ, we obtain


Ik(Oi - [OJ) - hi - (Xii < e
or, what is the same thing,
IkO i - (hi + k[8J) - (Xii < e.
Putting mi = hi + k[8 i] we obtain the theorem. o
7.6 Applications to the Riemann zeta
function
With the help of Kronecker's theorem we can determine the least upper
bound and greatest lower bound of I((0- + it) Ion any fixed line 0- = constant,
0- > 1.

Definition. For fixed 0-, we define


m(o-) = infl((o- + it)1 and M(o-) = sup 1((0- + it) I,
t t

where the infimum and supremum are taken over all real t.

Theorem 7.11. For each fixed 0- > 1 we have

M(o-) = ((0-) and ( ) _ ((20-)


m 0- - ((0-).

PROOF. For 0- > 1 we have 1((0- + it) I :s; ((0-) so M(o-) = ((0-), the supremum
being attained on the real axis. To obtain the result for m(o-) we estimate the
reciprocal 11/ ((s) I. For 0- > 1 we have

(17) _1 I = fll1 - -si < fl (1 + -U) _ ((0-)


1 ((s) p P - p P - ((20-)"

Hence I((s) I ~ ((20-)/((0-) so m(o-) ~ ((20-)/((0-).


Now we wish to prove the reverse inequality m(o-) :s; ((20-)/((0-). The idea
is to show that the inequality
11 - p-si :s; 1 + p-u
used in (17) is very nearly an equality for certain values of t. Now
1 - p-s = 1 - p-u-it = 1 _ p-ue-itlo gp = 1 + p-Uei(-tlo gp -It),
so we need to show that - t log p - n is nearly an even multiple of 2n for
certain values of t. For this we invoke Kronecker's theorem. Of course,
there are infinitely many terms in the Euler product for 1/((s) and we cannot
expect to make - t log p - n nearly an even multiple of 2n for all primes p.
But we will be able to do this for enough primes to obtain the desired
inequality.

155
7: Kronecker's theorem with applications

Choose any e, 0 < e < n/2, and choose any integer 11 ~ 1. We apply
Kronecker's theorem to the numbers
-1
8k = 2nlogpk, k = 1,2, ... ,11,

where PI' ... , Pn are the first 11 primes. The 8i are linearly independent
because
n
L: ai log Pi =
i= I
° implies 10g(pI a, ... Pn a ") = 0

so PI a, ... Pn a " = 1 hence each ai = 0. We also take ex l = ex 2 = ... = exn = ~.


Then by Theorem 7.9 there is a real t and integers hI' ... , hn such that
It8 k - ex k - hk I < e/(2n), which means
(18) I-t log Pk - n - 2nh kl < e.
For this t we have
1 - Pk -s = 1 - Pk -<1 e -itlo gpk = 1 + Pk
e i(-tlo gpk -1t)
-<1

= 1 + Pk -<1 cos( - t log Pk - n) + iPk -<1 sin( - t log Pk - n),


so
11 - Pk - S I ~ 1 + Pk - <1 cos( - t log Pk - n).
But (18) implies
cosl-t log Pk - nl = cosl-t log Pk - n - 2nh k l > cos e,
since the cosine function decreases in the interval [0, n/2]. Hence
11 - Pk - S I > 1 + Pk - <1 cos e.
Now consider any partial product of the Euler product for 1/((5). For
a given e and 11 there exists a real t (depending on e and on 11) such that

(19) I}]I (1
n
- Pk -S)
Inn
= )I
11 - Pk -si > }]I (1 + Pk -<1 cos e).

Now
1 00

1((5)1 = })I 11 - Pk -si

and hence, by the Cauchy condition for convergent products, there is an


110 such that 11 ~ 110 implies

fill
Ik=n+1 - Pk - S I - 11 < e
or

n 11 -
00

1- e< Pk - S I < 1 + e.
k=n+l
156
7.7: A pplications to periodic functions

Using (19) with n 2 no we have


1
I((s) I =
nOOn
}]I11 - Pk -si J:t
11 - Pk -si > (1 - e) (1 }]I + Pk -a cos e).
This holds for n 2 no and a certain t depending on n and on e. Hence
1 l I n
m(cr) = inft 1((11
+ it)1 = s~p I((cr + it)1 2 (1 - e) }]I
(1 + Pk -a cos e).

Letting n -> 00 we find

-
1
m(l1)
2 (1 - e)
ro
k= I
n(1 + Pk -a cos e).

We will show in a moment that the last product converges uniformly for
o :$;
e :$; n/2. Therefore we can let e -> 0 and pass to the limit term by term
to obtain
_1_ > nro (1 + _a) _ ((11)
m(l1) - k= I Pk - ((2cr)"

This gives the desired inequality m(l1) :$; ((211)/((11).


To prove the uniform convergence of the product, we use the fact that
a product n (1 + fn(z)) converges uniformly on a set if, and only if, the
series Lfn{z) converges uniformly on this set. Therefore we consider the
L L L
series Pk -a cos e. But this is dominated by Pk -a :$; n -a = ((cr) so the
convergence is uniform in the interval 0 :$; e :$; n/2, and the proof is complete.
D
7.7 Applications to periodic functions
We say that n complex numbers WI' W 2 , ... , Wn are linearly independent
over the integers if no linear combination
aiw i + a 2w 2 + ... + anru n
with integers coefficients is 0 except when a I = a2 = ... = an = O.
Other-
wise the numbers WI' .. , Wn are called linearly dependent over the integers.
Elliptic functions are meromorphic functions with two linearly indepen-
dent periods. In this section we use Kronecker's theorem to show that there
are no meromorphic functions with three linearly independent periods
except for constant functions.
Theorem 7.12. Let WI and W 2 be periods off such that the ratio W 2 /W I is
real and irrational. 7hen f has arbitrarily small nonzero periods. That is,
given e > 0 there is a period W such that 0 < IW I < e.
PROOF. We apply Dirichlet's approximation theorem. Let = W2/WI' e
e
Since is irrational, given any e > 0 there exist integers hand k with k > 0
such that

157
7: Kronecker's theorem with applications

Multiplying by IWII we find


Ikw z - hw I I < B.

But W = kw z - hWI is a period of f with Iwi < B. Also, W =f. 0 since wz/w l
is irrational. 0

Theorem 7.13. Iff has three periods WI' Wz , W3 which are linearly independent
over the integers, then f has arbitrarily small nonzero periods.
PROOF. Suppose first that WZ/w l is real. If WZ/w l is rational then WI and
Wz are linearly dependent over the integers, hence WI' Wz , W3 are also depen-
dent, contradicting the hypothesis. If WZ/WI is irrational, thenfhas arbitrarily
small nonzero periods by Theorem 7.12.
Now suppose Wz /w l is not real. Geometrically, this means that WI and
Wz are not collinear with the origin. Hence W3 can be expressed as a linear
combination of WI and Wz with real coefficients, say
W3 = aWl + pw z , where a and p are real.
Now we consider three cases:
(a) Both a and p rational.
(b) One of a, p rational, the other irrational.
(c) Both a and Pirrational.
Case (a) implies WI' WZ , W3 are dependent over the integers, contradicting
the hypothesis.
For case (b), assume a is rational, say a = alb, and P is irrational. Then
we have

so

This gives us two periods bW3 - aWl and bw z with irrational ratio, hence f
has arbitrarily small periods. The same argument works, of course, if P is
rational and a is irrational.
Now consider case (c), both a and P irrational. Here we consider two
subcases.
(c l ) Assume a and P are linearly dependent over the integers. Then
there exist integers a and b, not both zero, such that aa + bP = O. By sym-
metry, we can assume that b =f. O. Then P = -aa/b and

so

Again we have two periods bW3 and bw! - aW 2 with irrational ratio, so f
has arbitrarily small nonzero periods.
(C2) Assume a and Pare linearly independent over the integers. Then by
Kronecker's theorem, given any B > 0 there exist integers hI> h2 and k

158
Exercises for Chapter 7

such that

Multiply these inequalities by 1WI I, 1w2 1, respectively, to get


clwll clw 2 1
1klY.w I - hi WI 1 < 1
+ WI 1 1
+ W2I'
1 1kf3w 2 - h 2W21 < 1
+ 1
WI
1
+ 1W2 1
Since kW3 = klY.w I + kf3w 2 we find, by the triangle inequality,
c(lwll + Iw 2 1)
Ikw3-hlwl-h2w21<1
+ IWI I + II<c.
W2

Thus kW3 - hlw i - h2W2 is a nonzero period with modulus <E;. 0

Note. In Chapter 1 we showed that a function with arbitrarily small


nonzero periods is constant on every open connected subset in which it is
analytic. Therefore, by Theorem 7.13, the only meromorphic functions
with three independent periods are constant functions.

Further applications of Kronecker's theorem are given in the next chapter.

Exercises for Chapter 7


1. Prove the following extension of Dirichlet's approximation theorem.
Given 11 real numbers B1 , .•. , B" and given an integer N ;::: 1, there exist integers
hl> ... , h" and k, with 1 ~ k ~ N", such that
1
IkBi - hi I < IV for i = 1, 2, ... , 11.

2. (a) Given 11 real numbers B1 . . . . ,B", prove that there exist integers hl, ... ,h" and
k > 0 such that

IBi - ih'l < kl 1 1/"


+ for i = 1,2, ... ,11.

(b) If at least one of the Bi is irrational, prove that there is an infinite set of n-tuples
(hdk, ... , h"/k) satisfying the inequalities in (a).
3. This exercise gives another extension of Dirichlet's approximation theorem. Given
m linear forms,

i = 1,2, ... ,m,

in n + m variables XI"'" X". Y1"'" Ym' prove that for each integer N > 1 there
exists integers X I" .. , X"' YI,' .. , Ym such that

1
ILi I < IV for i = 1, 2, ... , m

159
7: Kronecker's theorem with applications

and O<max{lxll, ... , Ixnl}::;N",;n. Hillt: Let Mj=ajlxl+···+ajnxn and


examine the points ({ M I)' ... , {M "'}) in the unit cube in m-space, where {M j} =
M j - [M;].
4. Let IJ be irrational, 0 < e< l. Then IJ lies between two consecutive Farey fractions,
say
a c
- < IJ <-.
b d
(a) Prove that either 0 - alb < 1/(2b 2 ) or ("Id - e
< 1/(2£1 2 ).
(b) Deduce that there exist infinitely many fractions hlk with (11, k) = 1 and k > 0
such that

2k2 .
l e-~I<-I
k

5. Let ex = (I + )5)/2. This exercise shows that the inequality

(20) l ex-~Ik <~k2

has only a finite number of solutions in integers hand k with k > 0 if 0 < c < 1/)5.
(a) Let fJ = ex - )5 so that ex and {3 are roots of the equation x 2 - x - I = O.
Show that for any integers hand k with k > 0 we have

~k ::; lex - ~k 11f3 - ~k I


2

and deduce that

(b) If (20) has infinitely many solutions hlk with k > 0, say h 11k 1, h 2lk· , ... ,show that
kn --> % as 11 --> % and use part (a) to prove that c ~ 11)5.
6. In Lemma 2, define

L = lim sup IF(t)1 instead of L = sup IF(t)l.


-oo<l<oc

Prove that the equation L = 11 + 1 is equivalent to the following statement: For


every c; > 0 and every T > 0 there exists a real t > T and integers hi'" ., hI! such that
Irei - l1i - exd < c; for every i = 1,2, ... ,11.
7. Prove that the multiplier r in the first form of Kronecker's theorem can be taken
positive and arbitrarily large. That is, under the hypotheses of Theorem 7.9, if T > 0
is given there exists a real r > T satisfying the 11 inequalities Irei - hi - ex i I < c:.
Show also that the integer multiplier k in the second form of Kronecker's theorem
can be taken positive and arbitrarily large.

160
8
General Dirichlet series and
Bohr's equivalence theorem

8.1 Introduction
This chapter treats a class of series, called general Dirichlet series, which
includes both power series and ordinary Dirichlet series as special cases.
Most of the chapter is devoted to a method developed by Harald Bohr [6]
in 1919 for studying the set of values taken by Dirichlet series in a half-plane.
Bohr introduced an equivalence relation among Dirichlet series and showed
that equivalent Dirichlet series take the same set of values in certain half-
planes. The theory uses Kronecker's approximation theorem discussed in
the previous chapter. At the end of the chapter applications are given to the
Riemann zeta function and to Dirichlet L-functions.

8.2 The half-plane of convergence of general


Dirichlet series
Definition. Let {A(n)} be a strictly increasing sequence of real numbers such
that A(n) -+ + 00 as n -+ 00. A series of the form

L a(n)e-s).(n)
ro

n;1

is called a general Dirichlet series. The numbers A(n) are called the
exponents of the series, and the numbers a(n) are called its coefficients.

As usual, we write s = (j + it where (j and t are real.


Note. When A(n) = log n then e-s).(n) = n- S and we obtain the ordinary
L
Dirichlet series a(n)n- s. When A(n) = n the series becomes a power series
in x, where x = e- s •

161
8: General Dirichlet series and Bohr's equivalence theorem

A general Dirichlet series is analogous to the Laplace transform of a


function, SO' f(t)e- st dt. As a matter offact, both Dirichlet series and Laplace
transforms are special cases of the Laplace-Stieltjes transform, Joe- st d(X(t).
When (X(t) has a continuous derivative (X'(t) = f(t) this gives the Laplace
transform of f When (X is a step function with jump a(n) at the point A{n)
the integral becomes the general Dirichlet series L a(n)e-s).(n). Much of
what we do here can be extended to Laplace-Stieltjes transforms, but we
shall not deal with these generalizations.
As is the case with ordinary Dirichlet series, each general Dirichlet
series has associated with it an abscissa (Jc of convergence and an abscissa
(Ja of absolute convergence. We could argue as in Chapter 11 of [4] to

prove the existence of (J c and (J a' Instead we give a different method of proof
which also expresses (Jc and (Ja in terms of the exponents A(n) and the
coefficients a(n).

Theorem 8.1. Assume that the series L a(n)e -s).(n) converges for some s with
positive real part, say for s = So with (Jo > O. Let
L - I' 10gIL~=1 a(k)1
- 1m sup
n-+oo
1( )
An
.

Then L :::;; (J 0 . Moreover, the series converges in the half-plane (J > L, and
the convergence is uniform on every compact subset of the half-plane
(J> L.

PROOF. First we prove that L :::;; (J o. Let A(n) denote the partial sums of the
coefficients,
n

A(n) = L a(k).
k=l

Note that A(n) > 0 for all sufficiently large n. If we prove that for every
e> 0 we have
(1) 10gIA(n)1 < ((Jo + e)A(n)
for all sufficiently large n, then it follows that

log IA(n) I
A(n) < (Jo +e
for these n, so L ::; (J 0 + e, hence L ::; (J o. Now relation (l) is equivalent to the
inequality
(2)

To prove (2) we introduce the partial sums


n
S(n) = L a(k)e-so)'(k).
k=l

162
8.2: The half-plane of convergence of general Dirichlet series

The Sen) are bounded since the series Lk'= 1 a(k)e-So)'(kl converges. Suppose
that ISen) I < M for all n. To express A(n) in terms of the Sen) we use partial
summation:
n n
A(n) = L a(k) = L a(k)e-So),(kleso).(kl
k=1 k=1
n
= L {S(k) - S(k - l)}eso)'(k),
k=1
provided S(O) = O. Thus
o 0-1
A(n) = L S(k)eso)'(kl - L S(k)eso),(k+ 1)
k= 1 k= 1
0-1
= L S(k){eso)'(k) - eso),(k+ I)} + S(n)eso).(nl.
k=1

Hence
n-l
IA(n)1 < M L leSo)'(k) - e So ).(k+l)1 + Metro).(n).
k=1

But

n-l
L Ieso)'(k) - eso).(k+ 1) I =
n- 1 1
L
So
f).(k+l) I
e SOU du :::;; ISo I L
n-l f).(k+l)
etrou du
k= 1 k= 1 )'(k) k= 1 )'(k)

= Iso I f).(nleaou du = ~ (etro).(n) - etro )'(ll) < ~ etro).(n).


).(1) 0'0 0'0

Thus

Now A,(n) ~ 00 as n ~ 00 so

ee).(n) > M(l + 1;:1)


if n is sufficiently large. Hence for these n we have IA(n)l < e(ao+e»).(n), which
proves (2) and hence (1). This proves that L :::;; 0'0'
Now we prove that the series converges for all s with 0' > L. Consider
any section of the series L a(n)e-s),(n), say L~=a. We shall use the Cauchy'
convergence criterion to show that this section can be made small when
a and b are sufficiently large. We estimate the size of such a section by using

163
8: General Dirichlet series and Bohr's equivalence theorem

partial summation to compare it to the partial sums A(n) = D= 1 a(k). We


have
b b
L a(n)e-SA(n) = L {A(n) - A(n - l)}e- SA (n)
n=a

L A(n){e-SA(n) -
b
= e-SA(n+ I)} + A(b)e-SA(b+ 1)

- A(a - l)e- s ).(a).

This relation holds for any choice of s, a and b. Now suppose s is any complex
number with (J > L. Let e = -!<(J - L). Then e > 0 and (J = L + 2e. By the
definition of L, for this e there is an integer N(e) such that forall n 2: N(e)
we have
10gIA(n)1
A(n) < L + e.
We can also assume that A(n) > 0 for n 2: N(e). Hence
IA(n)1 < e(L+.»).(n) for all n 2: N(e).

If we choose b 2: a > N(e) we get the estimate

Intaa(n)e-SA(n) I : : ; nta e(L+.»).(n) Ie-SA(n) - e -s).(n+ 1) I


+ e(L+.)A(b+ 1)e-uA(b+ 1) + e(L+f)A(a)e-U).(a).
The last two terms are e - .A(b + I) + e - fA (a) since L +e- (J = - e. Now we
estimate the sum by writing

le-SA(n) - e- SA (n+l)1 = I -s
A(n+ 1)
e- SU du ::::;
I lsi fA(n+ 1)
e- UU du
f
A(n) A(n)

so

Lb e(L+ f)A(n) Ie -s).(n) - e-SA(n+ I) I ::::; Is I L e(L+ f)A(n)


b f).(n+ 1)
e - uu du
n=a n=a A(n)

::::; lsi L lsi L


b fA(n+ 1) b fA(n+ 1)
e-UUe(L+f)U du = e- w du
n=a A(n) n=a A(n)

= lsi f )'(b+

).(a)
1)
e-' u du =
II
~ (e-fA(a)
e
_ e-d(b+ 1»).

Thus we have

t
In=aa(n)e-SA(n) I : : ; !.::!
e
(e-f).(a) - e-f)'(b+ 1») + e-f).(b+ I) + e-fA(a).

164
8.2: The half-plane of convergence of general Dirichlet series

Each term on the right tends to 0 as a ~ 00, so the Cauchy criterion shows
that the series converges for all s with 0' > L. This completes the proof.
Note also that this proves uniform convergence on any compact subset of
the half-plane 0' > L. 0

Theorem 8.2. Assume the series La(n)e-SA(n) converges for some s with 0' >0
but diverges for all s with 0' < O. Then the number

L -- l'1m sup 10g1Lk=1 a(k)1


n- 00
'( )
A n

is the abscissa of convergence of the series. In other words, the series


converges for all s with 0' :> L and diverges for all s with 0' < L.
PROOF. We know from Theorem 8.1 that the series converges for all s with
0' > L and that L cannot be negative. Let S be the set of all 0' > 0 such that
the series converges for some s with real part 0'. The set S is nonempty and
bounded below. Let O'c be the greatest lower bound of S. Then O'c> O. Each
0' in S satisfies L ~ 0' hence L ~ O'c' If we had O'c > L there would be a 0' in
the interval L < 0' < O'c' For this 0' we would also have convergence for all
s with real part 0' (by Theorem 8.1) contradicting the definition of O'c' Hence
O'c = L. But the definition of O'c shows that the series diverges for all s with
o ~ 0' < L. By hypothesis it also diverges for all s with 0' < O. Hence it
diverges for all s with 0' < L. This completes the proof. 0

As a corollary we have:

Theorem 8.3. Assume the series L a(n)e-SA(n) converges absolutely for some s
with 0' > 0 but diverges for all s with 0' < O. Then the number
_I' log Lk=l Ia(k) I
O'a - 1m sup '( )
n-oo ILn
is the abscissa of absolute convergence of the series.
PROOF. Let A be the abscissa of convergence of the series L la(n)le-SA(n).
Then, by Theorem 8.2,

A -I' log Lk=l Ia(k) I


- 1m sup
n-C()
'(n)
I\,.
.

We wish to prove that L la(n)le-O'A(n) converges if 0' > A and diverges if


0' < A. Clearly if 0' > A then the point s = 0' is within the half-plane of
convergence of L la(n)le-SA(n) so L Ia(n)le-O'A(n) converges.
Now suppose L Ia(n)le-O'A(n) converges for some 0' < A. Then the series
L la(n)le-SA(n) converges absolutely for each s with real part 0' so, in particular
it converges for all these s, contradicting the fact that A is the abscissa of
convergence of L la(n)1 e-SA(II).' 0

165
8: General Dirichlet series and Bohr's equivalence theorem

8.3 Bases for the sequence of exponents of a


Dirichlet series
The rest of this chapter is devoted to a detailed study of Harald Bohr's
theory with applications to the Riemann zeta-function and Dirichlet's
L-series. The first notion we need is that of a basis for the sequence of
exponents of a Dirichlet series.

Definition. Let A = {).(n)} be an infinite sequence of distinct real numbers. By


a basis of the set A we shall mean a finite or countably infinite sequence
B = {p(n)} of real numbers satisfying the following three conditions:
(a) The sequence B is linearly independent over the rationals. That is,
for all m 2 1, if

with rational multipliers rk, then each rk = O.


(b) Each ,l.(n) is expressible as a finite linear combination of terms of B,
say
q(n)

,l.(n) = L r n,k{3(k)
k=l

where the r n, k are rational and the number of summands q(n) depends
on n. (By condition (a), if ,l.(n) # 0 this representation is unique.)
(c) Each (3(n) is expressible as a finite linear combination of terms of A,
say
m(n)

(3(n) = L tn,k,l.(k)
k=l

where the tn,k are rational and m(n) depends on n.

EXAMPLE I. Let A be the set of all rational numbers. Then B = {I} is a basis.
EXAMPLE 2. Let A = {log n}. Then B = {log Pn} is a basis, where Pn is the
nth prime. It is easy to verify properties (a), (b) and (c). For independence
we note that
q
1: r
k=l
k log Pk =0 implies so

To express each ,l.(n) in terms of the basis elements we factor n and compute
log n as a linear combination of the logarithms of its prime factors. Property
(c) is trivially satisfied since B is a subsequence of A.

166
8.4: Bohr matrices

Theorem 8.4. Every sequence A has a subsequence which is a basis for A.


PROOF. Construct a basis as follows. For the first basis element take ),(n 1 ),
the first nonzero) (either )(1) or )(2)), and call this P(1). Now delete the
remaining elements of A that are rational multiples of P(1). If this exhausts
all of A take B = {P(1)}. If not, let )(112) denote the first remaining ), take
P(2) = )(n2), and strike out the remaining elements of A which are rational
linear combinations of P(l) and P(2). Continue in this fashion to obtain a
sequence B = (P(1), P(2), ...) = ()(nd, )(n2), ...). It is easy to verify that B
is a basis for A. Property (a) holds by construction, since each Pwas chosen
to be independent of the earlier elements. To verify (b) we note that every) is
either an element of B or a rational linear combination of a finite number of
elements of B. Finally, (c) holds trivially since B is a subsequence of A. 0

Note. Every sequence A has infinitely many bases.

8.4 Bohr matrices


It is convenient to express these concepts in matrix notation. We display the
sequences A and B as column matrices, using an infinite column matrix for A
and a finite or infinite column matrix for B, according as B is a finite or
infinite sequence.
We also consider finite or infinite square matrices R = (rij) with rational
entries. If R is infinite we require that all but a finite number of entries in each
row be zero. Such rational square matrices will be called Bohr matrices.
We define matrix addition and multiplication of two infinite Bohr
matrices as for finite matrices. Note that a sum or product of two Bohr
matrices is another Bohr matrix. Also, the product RB of a Bohr matrix R
with an infinite column matrix B is another infinite column matrix r.
Moreover, we have the associative property (R 1 R 2 )B = R 1 (R 2 B) if Rl
and R2 are Bohr matrices and B is an infinite column matrix.
In matrix notation, the definition of basis takes the following form. B is
called a basis for A if it satisfies the following three conditions:
(a) If RB = 0 for some Bohr matrix R, then R = O.
(b) There exists a Bohr matrix R such that A = RB.
(c) There exists a Bohr matrix T such that B = T A.
The relation between two bases Band r of the same sequence A can be
expressed as follows:

Theorem 8.5. If A has two bases Band r, then there exists a Bohr matrix
A such that r = AB.
PROOF. There exist Bohr matrices Rand T such that r = T A and A = RB.
Hence r = T(RB) = (TR)B = AB where A = TR. 0

167
8: General Dirichlet series and Bohr's equivalence theorem

Theorem 8.6. Let Band f be two bases for A, and write f = AB, A = RBB,
A = Rrr, where A, R B, Rr arC? Bohr matrices. Then RB = RrA.

Note. If we write AlB for R B, Alf for Rr and fiB for A, this last equation
states that
A A f
B r B'
PROOF. We have A = RBB and A = Rrf = RrAB. Hence RBB = RrAB,
so (RB - RrA)B = O. Since RB - RrA is a Bohr matrix and B is a basis, we
must have RB - RrA = O. 0

8.5 The Bohr function associated with a


Dirichlet series
To every Dirichlet series f(s) = L:::,,= 1 a(n)e-SA(n) we associate a function
F(ZI, Z2, ... ) of countably many complex variables ZI' Z2, ... as follows.
Let Z denote the column matrix with entries ZI' Z2, .... Let B = {f3(n)}
be a basis for the sequence A = {A(n)} of exponents, and write A = RB,
where R is a Bohr matrix.

Definition. The Bohr function F(Z) = F(Z1' Z2' ... ) associated with f(s),
relative to the basis B, is the series

L: a(n)e-(RZ)n,
00

F(Z) =
n= 1

where (RZ)n denotes the nth entry of the column matrix RZ.

In other words, if
gIn)
).(n) = L: r n,kf3(k)
k=1

then
oc
F(Z1,Z2"") = L: a(n)e-(rn.1=1+"·+rn,q(n)=q(n».
n=1

Note that the formal substitution Zm = sf3m gives Z = sB, RZ = sRB =


sA, so (RZ)n = s).(n) and hence

L: a(n)e-SA(n) = f(s).
00

F(sB) =
n=l

In other words, the Dirichlet series f(s) arises from F(Z) by a special choice
of the variables Z1' Z2, .... Therefore, if the Dirichlet series f(s) converges
for s = (J + it the associated Bohr series F(Z) also converges when Z = sB.

168
8.5: The Bohr function associated with a Dirichlet series

Moreover, if the Dirichlet series f(s) converges absolutely for s = a + it


then the Bohr series F(Z) converges absolutely for any choice of ZI' Z2,'"
with Re z. = ap(n) for all n. To see this we note that if Re z. = ap(n) then
Re Z = aB so

L la(n)e-(RZ)nl = L la(n)le-u(RB)n = L la(n)le-


00 00 00
UA(.) •
.=1 • =1 .=1

To emphasize the dependence of the Bohr function on the basis B we


sometimes write A = RBB and

L a(n)e-(RBZ)n .
00

F B(Z) =
• =1

Bohr functions FBand F r corresponding to different bases are related by


the following theorem.

Theorem 8.7. Let Band r be two bases for A and write r = AB for some
Bohr matrix A. Then
F B(Z) = F r(AZ).

PROOF. By Theorem 8.6 we have


A = RBB = Rrr, where RB = RrA.
Hence

L a(n)e-(RBZJ" = L a(n)exp{ -(RrAZ).} = Fr(AZ).


ex; 00

FB(Z) = 0
n=1 n=1

Definition. Assume the Dirichlet series f(s) = L:'=


1 a(n)e- SA (.) converges
absolutely for some s = a + it. We define U ia; B) to be the set of values
taken on by the associated Bohr function, relative to the basis B, when
Re Z = aB. Thus,

U f(a; B) = {F(Z): Re Z = aB}.


The next theorem shows that this set is independent of the basis B.

Theorem 8.8. If Band r are two basesfor A then Uia; B) = U f(a; n


PROOF. Choose any value F B(Z) in Uia; B), so that Re Z = aB. By Theorem
8.7 we have F B(Z) = F r(AZ), where r = AB. But

Re AZ = A Re Z = AaB = aAB = ar
so F B(Z) E U f( a; r). This proves U f( a; B) £;; U f( a; r), and a similar argument
gives Uia; r) £;; Uia; B). 0

169
8: General Dirichlet series and Bohr's equivalence theorem

Note. Since U f(a; B) is independent of the basis B we designate the set


U f(a; B) simply by U f(a).

8.6 The set of values taken by a Dirichlet


seriesf(s) on a line a = a o
This section relates the set U Aa o) with the set of values taken by the Dirichlet
series f(s) on the line a = 0'0'

Definition. If the Dirichlet series f(s) = I:'; 1 a(n)e - sA(n) converges absolutely
for a = 0'0 we let
Vf(ao) = {f(a o + it): -00 < t < +oo}
denote the set of values taken by f(s) on the line a = 0'0'
Since f(5) can be obtained from its Bohr function F(Z) by putting Z = aB,
it follows that Vf(ao) £:; U f(a O)' Now we prove an inclusion relation in the
other direction. r

Theorem 8.9. Assume 0'0 > aa' where aa is the abscissa of absolute convergence
of a Dirichlet series f(s). Then the closure of Vf(ao) contains U f(aO)' That
is, we have

VAa o) £:; U f(a O) £:; Vf(a O)' and hence U f(a O) = VAa o).
PROOF. The closure VAa o) is the set of adherent points of Vf(a O)' We are
to prove that every point u in U f(a O) is an adherent point of VAa o). In
other words, given u in U Aa o) and given e > 0 we will prove that there exists
a v in Vf(a O) such that lu - vi < e. Since v = f(a o + it) for some t, we are
to prove that there exists a real t such that
If(a o + it) - ul < e.
Since u E Uf(a O) we have u = F(Zlo Z2,"') where Zn = a o{3(n) + iYn' Hence
Z = aoB + iY, RZ = aoRB + iRY = aoA + iRY,
so
(RZ)n = aoA(n) + i(RY)n = aoA(n) + illn,
say. Therefore

= I a(n)e-aoA(n)e-iltn.
co
U
n;l

On the other hand, we have

L a(n)e-aOA(n)e-iIA(n),
co
f(a o + it) =
n;l

170
8.6: The set of values taken by a Dirichlet series f(s) on a line (J = (Jo

hence

L a(n)e-l1ol(nl(e-itl(nl -
00

I(a o + it) - u= e- ill ").


n~1

The idea of the proof from here on is as follows: First we split the sum into
two parts, L~~ 1 + L~~ N + l' We choose N so the second part L~~ N + 1 is
small, say its absolute value is < !t:. This is possible by absolute convergence.
Then we show that the first part can be made small by choosing t properly.
The idea is to choose t to make every exponential e - itl(nl very close to e - ill"
simultaneously for every 11 = I, 2..... N. Then each factor e - itA(nl - e - ill"
will be small, and since there are only N terms. the whole sum will be small.
Now we discuss the details. For the given t:, choose N so that

I
f
n~N+l
a(n)e-I1OA(nl(e-itl(nl - e-ill")1 < ~.
2
Then we have

This holds for any choice of t. We wish to choose t to make the first sum
< !t:. Since leitA(n l I = 1 we can rewrite the sum in question as follows:

Intla(n)e-aOA(lIl(e-itA(nl - e-ill")1 = Int/-itA(nla(n)e-aOl(nl(1 _ ei (tAlll-ll"l)I

L la(n)le-l1oA(nllei(O.(nl-ll"l -
N
5 11.
n~ 1

Let M = 1 + L~~l la(n)le-aoA(n l. For the given t: there is a b> 0 such


that

(3) Ie ix - 11 < 2~ if Ix I < b.

Suppose we could choose a real t and integers kl' ... , kN such that

(4)
where Ixnl < b for n = 1,2, ... , N. Then for this t we would have

By (3), this would give us


l ei(tl(nl-Il"l _ 11 < _t:_
2M'
171
8: General Dirichlet series and Bohr's equivalence theorem

and hence
e
LN la(n)le- uo1(n)le (tl(n)-l'n) -
i 11 < - LN la(n)le-uo1(n) < -.e
n= I 2M n=1 2

Thus, the proof will be complete if we can find t and integers kl' ... , kN to
satisfy (4). If the A(n) were linearly independent over the integers we could
apply Kronecker's theorem to A(1), ... A(N) and obtain (4). However the
A(n) are not necessarily independent so instead we apply Kronecker's
theorem to the following system:

where
(J = p(n) Yn
n 2nD' CXn = 2nD'
The p(n) are the elements of the basis B used to define F(Z), and the Yn are
the imaginary parts of the numbers Zn which determine u. The integers Q and
D are determined as follows. We express A in terms of B by writing
A(n) = rn, IP(1) + ... + rn,q(n)P(q(n)).
Then Q is the largest of the integers q(I), ... , q(N), and D is the least common
multiple of the denominators of the rational numbers ri,j that arise from the
A(n) appearing in the sum. There are at most q(I) + ... + q(N) such numbers
ri, j ' The numbers (In are linearly independent over the integers because B
is a basis.
By Kronecker's theorem a real t and integers hi' ... , hQ exist such that
()
It(Jk - CXk - hkl < 2nDA'

where
N q(n)
A = L L Irn,jl.
n= j=I I

For this t we have 12nDt(Jk - 2nDcxk - 2nDhkl < ()/A, or


()
Itp(k) - Yk - 2nDhkl < A'
Therefore tp(k) - Yk = 2nDhk + {)k, where I()k I < ()/ A. Now we can write
q(n) q(n)
tA{n) - JJ.n = t L rn,jpU) - L rn,jYj
j= I j= I
q(n) q(n)
= L rnjtpU) - Yj) = L rnj2nDhj + ())
j=1 j=1
q(n) q(n)
= 2n L hjDrn,j + L {)jrn,j
j= I j= I

172
8.7: Equivalence of general Dirichlet series

where k., is an integer and Ix.1 < «(j/A) Lj<:\ Irn,jl < (j. But this means we
have found a real t and integers kl' ... , kN to satisfy (4), so the proof is
oo~~. 0

8.7 Equivalence of general Dirichlet series


Consider two general Dirichlet series with the same sequence of exponents
A, say

L a(n)e-s).(n)
cc
and
n=1 n=1
Let B = {f3(I1)} be a basis for A and write A = RB, where R is a Bohr matrix.
Definition. We say the two series are equivalent, relative to the basis B, and
we write
L a(n)e-s).(n) '" L b(n)e-s).(n)
OC; 00

n=1 n=1

if there exists a finite or infinite sequence of real numbers Y = {Yn} such


that
b(n) = a(n)e ixn
where X = {xn} = R Y.
In other words, if we write
q(n)

A(I1) = L rn,kf3(k),
k= 1

equivalence means that for some sequence {Yn} we have


q(n) )
b(n) = a(l1) exp ( ik~/n'kYk .

Theorem 8.10. Two equivalent Dirichlet series have the same abscissa of
absolute convergence. Moreover, the relation", just defned is independent
of the basis B.
PROOF. Equivalence implies Ib(n)1 = la(n)1 so the series have the same
abscissa of absolute convergence.
Now let Band r be two bases for A, and assume that two series are equiv-
alent with respect to B. We will show that they are also equivalent with
respect to r.
Write A = RBB. Then there is a sequence Y = {Yn} such that b(n) =
a(n)e ixn , where X = {xn} = RB Y. Now write A = Rrr. If we show that for
some sequence V = {vn} we have X = Rr V then the two series will be
equivalent relative to r. The sequence
V = AY
173
8: General Dirichlet series and Bohr's equivalence theorem

has this property, where A is the Bohr matrix such that r = AB. In fact,
we have Rr V = RrA Y = RB Y = X, since RrA = R B. This completes the
proof. 0

Theorem 8.11. The relation ~ defined in the foregoing definition is all equiv-
alence relation. That is, it is reflective, symmetric, and transitive.
PROOF. Every series is equivalent to itself since we may take each Yn = O.
The corresponding Xn will then be zero.
If b(n) = a(n)e iXn then a(n) = b(n)e- ixn . Since X = RB Y we have -X =
R B ( - Y) so the relation is symmetric.
To prove transitivity we may use the same basis throughout and assume
that b(n) = a(n)e ixn , where X = RB Y for some Y, and that a(n) = c(n)e illn ,
where U = RB V for some V. Then b(n) = c(n)ei(Xn+u n) where
X + U = RB Y + RB V = RB(Y + V).
This completes the proof. o
8.8 Equivalence of ordinary Dirichlet series
Theorem 8.12. Two ordinary Dirichlet series

I a(~) and I b(~)


n=! n n=! n
are equivalent if, and only if, there exists a completely multiplicative function
fsuch that
(a) b(n) = a(n)f(n) for all n 2: 1, and
(b) I f(P) I = 1 whenever a(n) =1= 0 and p is a prime divisor of 11.
PROOF. For ordinary Dirichlet series the sequence of exponents A = {A(n)}
is {log n} and for a basis we may use the sequence B = {log Pn}, where Pn
denotes the nth prime. In fact, if we use the prime-power decomposition

(5)

where each exponent an. k 2: 0, we he 'Ie


cc
log n = L an,k log Pk'
k=!

so the integer powers may be used as entries in the Bohr matrix RB for which
A = RBB. In the sum and product only a finite number of the an.k are
nonzero.
We note that, because of the fundamental theorem of arithmetic, the
numbers an,k defined by (5) have the property
(6)
174
8.8: Equivalence of ordinary Dirichlet series

Now let A(s) = La(n)n- S , B(s) = L


b(n)n- s . Suppose that A(s) - B(s).
Then there exists a real sequence {yd such that

(7) b(n) = a(n) eXP{ik;!a",kYk}

where the integers a",k are determined by equation (5), Define a function f
by the equation

f(n) = exp{i f a",kYk}'


k=!
Property (6) implies that f(mn) = f(m)f(n) for all m and n, so f is completely
multiplicative, Equation (7) states that b(n) = a(n)f(n), and the definition of
f shows that I f(n) I = 1 for alln, so conditions (a) and (b) of the theorem are
satisfied.
Now we prove the converse. Assume there exists a completely multi-
plicative function f satisfying conditions (a) and (b). We must show that
there is a real sequence {yd satisfying (7) for alln. First we consider those n
for which a(n) = O. Property (a) implies b(n) = 0, so equation (7) holds for
such n since both sides are zero no matter how we choose the real numbers
Yk' We shall now construct the sequence {yd so that equation (7) also holds
for those n for which a(n) =1= O.
Assume, then, that n is such that a(n) =1= O. We use the prime-power
decomposition (5) and the completely multiplicative property of f to write

n g(n, k),
oc:
(8) fen) =
k=!
where

g(n, k) = {f(Pd an ' k if Pk In .


1 otherWise.
Condition (b) implies that I f(Pk) I = 1 for each prime divisor Pk of n. Therefore
for such primes we may write

f(Pd = exp(iYd,

where Yk = argf(Pk)' The real numbers Yk have been defined for those k such
that the prime Pk divides some n with a(n) =1= O. For the remaining k (if any)
we define Yk = O. Thus, Yk is well-defined for every integer k ~ 1 and we
have

g(n, k) = exp(ia",kYk)
for every k ~ 1. Equation (8) now becomes

f(n) = eXP{if,a",kYk}.
k= 1

175
8: General Dirichlet series and Bohr's equivalence theorem

This, together with property (a), shows that (7) holds for those n for which
a(n) =I O. Thus, (7) holds for all n so A(s) "" B(s). This completes the proof of
the theorem. D

8.9 Equality of the sets Ui(Jo) and Ui(Jo) for


equivalent Dirichlet series
TheoremS.13. Let f(s) and g(s) be equivalent general Dirichlet series, each of
which converges absolutely for (J = (Jo. Then
Uf((Jo) = Ui(Jo)·

PROOF. Let B = {,B(n)} be a basis for the sequence A of exponents. If


f(s) = L a(n)e-s),(n) and g(s) = L b(n)e-s),(n) then there is a real sequence
{yd such that

b(n) = a(n) ex p { -iJ>n,kYk}

The Bohr series of f and g are given by

F(z I' Z2' ... ) = n~1 a(n) exp { - ktrn'kZk}


and

G(ZI' Z2,···) = Jlb(n) ex p { - J>n,kZk}

Expressing the b(n) in terms of the a(n) we find

G(Zlo Z2,' .. ) =Jla(n)ex p{ - J>n,k(Zk + iYk)} = F(ZI + iYl,Z2 + iY2' ... ).

Since the real part of Zn + iYn is the real part of Zn' both series take the same
set of values on the lines Xn = (Jo{J(n). Hence Uf((Jo) = Ug((Jo), as asserted.
o

8.10 The set of values taken by a Dirichlet


series in a neighborhood of the
line (J = (Jo
Definition. Let f(s) be a general Dirichlet series which converges absolutely
for (J > aa' Given (j > 0 and (J 0 such that a 0 - (j > aa' we define the
set Wf(a O ; (j) as follows:
Wf((Jo;(j) = {.f(s):a o - (j < a < a o + (j, -00 < t < +oo}.

176
8.10: The set of values taken by a Dirichlet series in a neighborhood of the line (J = (J0

That is, Wf(O'o; b) is the set of values taken by f(s) in the strip
0'0 - b< 0' < 0'0 + b.
Also, if 0'0 > O'a we define
WiO'o) = n
o <(j<O'o- O'a
WiO'o; b).

Thus, Wf(O' 0) is the intersection of the sets of values taken by f(s) in all
such strips.

It is clear that Vf(O'o) ~ WiO'o) since every value taken by f(s) on the line
0' is also taken in every strip containing this line. Of course, it may
= 0'0
happen that ViO'o) = Wf(O'o) or that Vf(O'o) :t= Wf(O'o)'
In general, we have:

Theorem 8.14. Vf(O'o) !;;; Wf(O'o) !;;; Vf(O'o), hence Vf(O'o) = Wf(O'o)'
PROOF. We remark that this proof is entirely function-theoretic and has
nothing to do with the concept of a basis.
We are to prove that every point in Wf(O'o) is in the closure of ViO'o).
We will show that if WE Wf(O'o) then W is an adherent point of Vf(O'o). In
fact, we will prove that

for some real sequence {tn}.


Since WE WiO'o) this means that WE WiO'o; b) for all b > 0 such that
b < 0'0 - O'a' In particular, WE Wf(O'o; l/n) for all n ~ no for some no.
This means that for n ~ no we have W = f(sn) where Sn = O'n + itn and
0'0 - (l/n) < O'n < 0'0 + (l/n). Using the numbers tn so determined, consider
the difference
W - f(O'o + it n ) = f(O'n + it n) - f(O'o + it n)
where n ~ no. We shall express this difference in terms of the derivative
/,(s). Now just as in the case of ordinary Dirichlet series, the function f(s)
defined by

L a(n)e-s).(n)
r>:;.

f(s) =
n=1

is analytic within its half-plane of absolute convergence. In fact in the proof


of Theorem 8.1 we showed that the series converges uniformly on every
compact subset of the half-plane 0' > O'c. Therefore the sum is analytic in
the half-plane 0' > O'c. Moreover, we can calculate the derivative /'(s) by
term-by-term differentiation, so

L a(n)A(n)e-s).(n).
00

/'(s) = -
n=1

177
8: General Dirichlet series and Bohr's equivalence theorem

Hence if a 2:: ao then s is the half-plane of absolute convergence and we get

L la(n)IIA,(n)le-a).(n) = L la(n)le-ao').(n)IA,(n)le-(a-ao')).(n)
00 00

II'(s) I ::::;
n=l n=l
where aa < a o' < ao. Now IA,(n) Ie-(a-ao')).(n) --+ 0 as n --+ 00 so, in particular,
this factor is less than 1 for large enough n. Hence

L
00

II'(s) I ::::; la(n)le-ao'.l(n). K


n=l
for some K, which shows that I I'(s) I is uniformly bounded in the region
a 2:: ao'. Let ao' = a o - l/no and let M be an upper bound for II'(s) I in the
region a 2:: ao'. Then

Iw - f(q,o + itn)1 = If(a n + it n) - f(a o + itn)! = I f~"I'(a + itn)dal


M
::::; Mlan - aol ::::;-
n
if n 2:: no. Hence limn-+oo f(ao + it n) = w, so w is an adherent point of
~(ao). This completes the proof. 0

8.11 Bohr's equivalence theorem


We have just shown that Wf(ao) c;; Vf(ao). The next theorem shows that this
inclusion is actually equality.

Theorem 8.15. We have

The proof of Theorem 8.15 is lengthy and appears in Section 8.12. In


this section we show how Theorem 8.15 leads to Bohr's equivalence theorem.

Theorem 8.16 (Bohr's equivalence theorem). Let f and g be equivalent


Dirichlet series with abscissa of absolute convergence aa' Then in any
open half plane a> (:1 2:: aa the functions f(s) and g(s) take the same set
ofvalues.
PROOF. Let Sia 1) be the set of values taken by f(s) in the half-plane a > a l'
Then
Sial) = U Viao)·
0'0>0'1

Now we prove that


Si ( 1)= U
0'0>0'1
Wr(ao)·

178
8.12: Proof of Theorem 8.15

First of all, we have SJ{O"I) s;;; U"o>", WJ{O"o) because VACTo) S;;; WACTo). To
get inclusion in the other direction, assume WE U"o>", WJ{CTo). Then
W E WJ { CT 0) for some CT 0 > CT l' Hence W E WJ { CT 0; 0) for all 0 satisfying
o < c5 < CT o - CTa • In other words, f{s) takes the value W in every strip
CT o - 0 < 0" < CT o + 0 irO < 0 < CTO - CT a • In particular, when 0 = 0"0 - CT 1 ,
we have CT o - 0 = CT 1 sof(s) = w for some s with CT > CT 1 • Hence WES J (CT 1 ).
This proves that U"o>", WJ(CTo) s;;; SJ(CT 1), so the two sets are equal. Therefore,
we also have
SiCTd = U Wg(CTo)·
0'0>0'1

To prove Bohr's theorem it suffices to prove that


WJ(CTo) = Wg(CT o)
whenever f and g are equivalent. But f - g implies

UACT o) = Ug(CT o)'

Hence U ACTo) = U g(CT o)' But, in view of Theorem 8.9, this means
VACT o) = Yg(CTo).

But Theorem 8.15 states that VJ(CTo) = WJ(CTo) and Yg(CTo) = Wg(CT o) so Bohr's
equivalence theorem is a consequence of Theorem 8.15. D

8.12 Proof of Theorem 8.15


To complete the proof of Bohr's equivalence theorem we need to prove
Theorem 8.15, which means we must establish the inclusion relation.
(9)
The proof of (9) makes use of two important theorems of analysis which we
state as lemmas:

Lemma 1 (Helly selection principle). Let {Om,n} be a double sequence of real


numbers which is bounded, say
10m,nl < A for all m, n.
Then there exists a subsequence of integers n 1 < n2 < ... with nr -+ 00
as r -+ 00, and a sequence {On} of real numbers such that for every
m = 1, 2, ... , we have

lim Om,nr = Om·


r-rIO

Note. The important point is that one subsequence {nk} works for every m.
To show the true import of the Lemma, let us see what we can deduce in a
trivial fashion. Display the double sequence as an infinite matrix. Consider

179
8: General Dirichlet series and Bohr's equivalence theorem

the first row: {OI, n},~= I' This is a bounded infinite sequence so it has an
accumulation point, say ()I' Hence there is a subsequence {n r } such that
lim r _ oo ()I.nr = ()I' Similarly, for the second row there is an accumulation
point ()2 and a subsequence n,' such that lim n _ oo ()2,nr' = ()2' and so on. The
subsequence {n,'} needed for ()2 may be quite different from that needed for
()I' Helly's principle says that one subsequence works simultaneously for all
rows.

PROOF OF LEMMA 1. Let ()I be an accumulation point of the first row and
suppose the subsequence {n,!1)} has the property that

lim
r~<X)
()1,nr(l) = ()I'

In the second row, consider only those entries ()2. nr(I). This is a bounded
sequence which has a convergent subsequence with limit ()2, say. Thus,
lim () 2, n r (2) = () 2
r~oo

where {n r(2)} is a subsequence of {n r(1)}. Repeat the process indefinitely. At


the mth step we have a subsequence {n,<m)} which is a subsequence of all
earlier subsequences and a number ()m such that
lim ()m, nr(m) = ()m'

Now define a sequence {n r } by the diagonal process:

That is, n l is the first integer used in the first row, n2 the second integer used
in the second row, etc. Look at the mth row and consider the sequence
{()m,nJ. We assert that

Since nr = n,<r>, after the mth term in this row we have r > m so every integer
n,<r) occurs in the subsequence n,<m), so from this point on {n r} is a sub-
sequence of {nr(m)} hence ()m,nr ~ ()m' as asserted. 0

Lemma 2 (Rouche's theorem). Given two functions f(z) and g(z) analytic
inside and on a closed circular contour C. Assume
Ig(z)1 < If(z)1 on C.
Then f(z) and f(z) + g(z) have the same number of zeros insidl! C.
PROOF OF LEMMA 2. Let m = inf{lf(z)1 - Ig(z)l:z E C}. Then m > 0
because C is compact and the difference 1f(z) 1 - Ig(z) 1 is a continuous
function on C. Hence for all real t in the interval 0 :s; t :s; 1 we have
If(z) + tg(z) 1 2 If(z) 1 - 1tg(z) 1 2 If(z)1 - Ig(z)1 2 m > O.

180
8.12: Proof of Theorem 8.15

If 0 ::::; t ::::; 1 define a number qJ(t) by the equation

qJ(t) = _1 [f'(z) + tg'(z) dz.


2ni Jc f(z) + tg(z)
This number qJ(t) is an integer, the number of zeros minus the number of
poles of the function f(z) + tg(z) inside C. But there are no poles, so qJ(t) is
the number of zeros of f(z) + tg(z) inside C. But qJ(t) is a continuous function
of t on [0, 1]. Since it is an integer, it is constant: qJ(O) = qJ(1). But qJ(O) is the
number of zeros of f(z), and qJ(l) is the number of zeros of f(z) + g(z). This
proves Rouche's theorem. 0

PROOF OF RELATION (9). Vf(a O} £; Wiao). Assume v E Vf(ao}. Then either


v E Vf(a O) or v is an accumulation point of VAa o)' If v E Vf(a O) then v E Wf(a O)
since Vf(a O) £; Wf(O'o). Hence we can assume that v is an accumulation point
of VAO'o), and v rt Vf(O'o). This means there is a sequence {t n } of real numbers
such that
v = lim f(O'o + it n)·
n-oc,

We wish to prove that v E Wf(O'o). This means we must show that v E Wf(O'o; 6)
for every 6 satisfying 0 < 6 < 0'0 - O'a' In other words, if 0 < 6 < 0'0 - O'a
we must find an s = 0' + it in the strip
0'0 - 6< 0' < 0'0 +6
such that f(s) = v. Therefore we are to exhibit an s in this strip such that
f(s) = lim f(O' 0 + it.).

Let us examine the numbers f(O'o + it m ) for the given sequence {t n }. We


have

L a(n)e-CfoA(n). e-i1mA(n).
00

f(O'o + it m) =
n=1

The products tmA(n) form a double sequence. There exists a double sequence
of real numbers en, m such that
en,m = tmA(n) + 2nk m,n' with 0::::; e n. m < 2n,
where km," is an integer. If we replace tmA(n) by en,m in the series we don't
alter the terms, hence

+ it m) = L a(n)e - CfoA(n)e -
00

f(O' 0 i8 n ,,,,.
n= 1

By Lemma 1, there is a subsequence of integers {n r } and a sequence of real


numbers {em} such that
(10) Jim em,",
r~oo
= em'
181
8: General Dirichlet series and Bohr's equivalence theorem

Use this sequence {Om} to form a new Dirichlet series


0()

g(s) = L b(n)e-s).(n)
n=l
where

b(n) = a(n)e- iBn .

This has the same abscissa of absolute convergence as f(s). Now consider
the following sequence of functions:

J..(s) = f(s + it n)
where {n r } is the subsequence for which (10) holds. We assert that

(a) J..(s) -+ g(s) uniformly in the strip (10 - 0 < (1 < (10 + 0, hence, in
particular, in the circular disk Is - 0"0 I < O.
(b) g«(1o) = v.
(c) There is a d, 0 < d < 0, and an R such that fR(S) - v and g(s) - v have
the same number of zeros in the open disk Is - 0"01 < d.
If we prove (b) and (c) then fR(S) - v has at least one zero in the disk because
g(O"o) = v. But fR(S) = f(s + it nR ) and s + itnR is in the strip if s is in the disk,
so this proves the theorem. Now we prove (a), (b) and (c).

Proof of (a). We have

1J..(s) - g(s) I = In~la(n)e-s).(n)(e-iBn.nr - e-iBn)1

0()

:S; L Ia(n) le-a).(n) Ie- iBn,nr - e- iBn I


n= 1

N
:S; L la(n)le-(<ro-d»).(n)le-iBn,nr - e-iBnl
n=l

L
0()

+2 Ia(n) Ie-(<ro-d»).(n).
n=N+ 1
Now if e > 0 is given there is a number N = N(e) such that
e
2 L la(n)le-(<ro-d»).(n) < -,
0()

n=N+1 2
because the series L:=l la(n)le-(<ro-d»).(n) converges. In the finite sum L~=l
we use the inequality

Ie - ib - e- ia I = I~ s:e- it dt I :S; Ib - aI

182
8.12: Proof of Theorem 8.15

to write
Ie-iBn ... , - e-iBnl -< 10n,n,. - 0n I.

But if M(b) = 1 + L:'=l la(n)le-(ao-cl)l(n), there is an integer ro = ro(l»


such that for every n = 1,2, ... , N we have
I>
10n,n, - Onl < 2M(b) ifr;;::: roo

Therefore, if r ;;::: ro we have


I> N I> I> I>
1J,.(s) - g(s)l:s; 2M(b) n~lla(n)le-("O-cl)l(n) + "2 < "2 + "2 = 1>.
Since ro depends only on I> and on b this shows that J,.(s) -+ g(s) uniformly
in the strip 0'0 - b < 0' < 0'0 + bas r -+ 00. This proves (a).
Proof of (b). We use (a) to write
g(O'o) = Iimfr(O'o) = Iimf(O'o + itn ) = v.
r-+oo r-oo

Proof of (c). Assume first that 9 is not constant. Since g(O'o) = v there is a
positive d < b such that g(s) =I v on the circle
C = {s: Is - 0'0 I= d}.
Let M be the minimum value of Ig(s) - v Ion c. Then M > O. Now choose R
so large that I fR(S) - g(s) I < M on C. This is possible by uniform convergence
of the sequence {fR(S)}, since the circle C lies within the strip 10' - 0'01 < b.
Then on C we have
IfR(S) - g(s) I < M :s; Ig(s) - vi·
If G(s) = fR(S) - g(s) and F(s) = g(s) - v we have IG(s) I < IF(s) I on C with
F(s), G(s) analytic inside C. Therefore, by Rouche's theorem the functions
F(s) + G(s) and F(s) have the same number of zeros inside C. But F(s) + G(s)
= fR(S) - v, so fR(S) - v has the same number of zeros inside C as g(s) - v.
Now g(O'o) = v so g(s) - v has at least one zero inside C. Hence fR(S) - v
has at least one zero inside C. As noted earlier, this completes the proof if 9
is not constant.
To complete the proof we must consider the possibility that g(s) is constant
in the half-plane of absolute convergence. Then g'(s) = 0 for all s in this half-
plane, which means

= - L A(n)b(n)e-Sl(n) = O.
00

g'(s)
n= 1

But as in the case of ordinary Dirichlet series, if a general Dirichlet series


has the value 0 for a sequence of values of s with real parts tending to + 00
then all the coefficients must be zero. (See [4], Theorem 11.3.) Hence

183
8: General Dirichlet series and Bohr's equivalence theorem

A(n)b(n) = 0 for all n ~ 1. Therefore b(n) = 0 with at most one exception,


say b(nd, in which case A(nd = o. Therefore, since a(n) = b(n)e i8n , we must
have a(n) = 0 with at most one exception, say a(nl), and then A(nl) = o.
Hence the series for f(s) consists of only one term,J(s) = a(nde-Sl(n tl = a(nl),
so f(s) itself is constant. But in this case the theorem holds trivially. D

8.13 Examples of equivalent Dirichlet series.


Applications of Bohr's theorem to
L-series
Theorem 8.17. Let k ~ 1 be a given integer, and let X be any Dirichlet character
modulo k. Let L:'=I a(n)n- S be any Dirichlet series whose coefficients
have the following property:
a(n) =F 0 implies (n, k) = 1.
Then
t
n= I
a(~) '"
n
t
n= I
a(n)~(n).
n
PROOF. Since these are ordinary Dirichlet series we may use-Theorem 8.12
to establish the equivalence. In this case we take f(n) = x(n). Then f is
completely multiplicative and condition (a) is satisfied. Now we show that
condition (b) is satisfied. We need to show that I f(p) I = 1 if a(n) =F 0 and
pin. But a(n) =F 0 implies (n, k) = 1. Since pin we must have (p, k) = 1 so
If(P)1 = IX(p) I = 1 since X is a character. Therefore the two series are
equivalent. 0
Theorem 8.18. For a given modulus k, let ·XI' ... , X",(k) denote the Dirichlet
characters modulo k. Then in any half-plane of the form (1 > (11 ~ 1 the
set of values taken by the Dirichlet L-series L(s, Xi) is independent of i.
PROOF. Applying the previous theorem with a(n) = XI(n) we have

f XI~n) '" I XI(n)sx(n)


n= I n n= I n
for every character X modulo k. Here we use the fact that XI(n) =F 0 implies
(n, k) = 1. Thus each L-series L(s, X) is equivalent to the particular L-series
L(s, XI). Therefore, by Bohr's theorem, L(s, X) takes the same set of values as
L(s, xd in any open half-plane within the half-plane of absolute convergence.
o
8.14 Applications of Bohr's theorem to the
Riemann zeta function
Our applications to the Riemann zeta function require the following identity
involving Liouville's function A(n) which is defined by the relations
,1.(1) = 1, A(PI a, ... Prar ) = (_l)a, + ... +a r.

184
8.14: Applications of Bohr's theorem to the Riemann zeta function

The function ;.(n) is completely multiplicative and we have (see [4J, p. 231)
~ ).(n) _ (2s) 'f
L. 5- r() 1(J>1.
n=l n .. s

Theorem 8.19. Let A(n) denote Liouville's function and let

C(x) = L A(n).
n$x n
Then if (J > 1 we have
(2s) = Joc C(x) dx
(s - 1)(s) 1 x5 •

PROOF. By Abel's identity (Theorem 4.2 in [4]) we have

"
L...
A(n) ~ _ C(x)
5 - 5 +s
JX C(t)1 dt. 5+
n$x n n x 1 t
Keep (J > 0 aRd let x --+ 00. Then

C(x) =
x5
o(~
x"
" !)
n'tx n
= O(IOg x) = 0(1) as x
x"
--+ 00,

so we find
.~
L... s:tT
n= 1
),(n)
n
=s I
X'

1
C(t)
--.-+r dt,
t
for (J > O.

Replacing s by s - 1 we get

I: A(7)
n= 1 nit
(s - 1) Joo C~) dt
= for (J > 1.

Since the series on the left has sum (2s)/(s) the proof is complete. D

Now we prove a remarkable theorem discovered by P. Tunin [50] in


1948 which gives a surprising connection between the Riemann hypothesis
and the partial sums of the Riemann zeta function in the half-plane (J > 1.

Theorem 8.20. Let


n I
'n(s) = L -k
k=l
5'

If there exists an no such that (n(s) # 0 for all n ~ no and all (J > 1, then
(s) # 0 for (J > 1-
k - 5 and L~ = 1 A(k)k - 5
PROOF. First we note that the two Dirichlet series L~ = 1
are equivalent because A is completely multiplicative and has absolute

185
8: General Dirichlet series and Bohr's equivalence theorem

value 1. Therefore, by Bohr's theorem, 'l~) ::/; 0 for (J > 1 implies that
Lk=1 A(k)k- s ::/; 0 for (J > 1. But for s real we have

lim t
s- + 00 k= 1
A(~) = A(I) = 1.
k
Hence for all real s > 1 we must have Lk= 1 A(k)k- > O. S Letting s -+ 1+
we find

~ A(k) > 0
L.... if n ~ no·
k=1 k -
In other words, the function

(11) C(x) = L A(n)


nSx n
is nonnegative for x ~ no. Now we use the identity of Theorem 8.19,

,(2s) = foo C(x) dx


(s - lK(s) 1 XS '

valid for (J > 1. Note that the denominator (s - lK(s) is nonzero on the
real axis s > 1. and ,(2s) is finite for real s > t.
Therefore, by the integral
analog of Landau's theorem (see Theorem 11.13 in [4]) the function on the
t.
left is analytic everywhere in the half-plane (J > This implies that '(s) ::/; 0
for (J > t, and the proof is complete. 0

Turan's theorem assumes that the sum C(x) in (11) is nonnegative for all
x ~ no. In 1958, Haselgrove [14] proved, by an ingenious use of machine
computation, that C(x) is negative for infinitely many values of x. Therefore,
Theorem 8.20 cannot be used to prove the Riemann hypothesis. Subse-
quently, Tunin [51] sharpened his theorem by replacing the hypothesis
C(x) ~ 0 by a weaker inequality that cannot be disproved by machine
computation.

Theorem 8.21 (Turan). Let C(x) = Lnsx A(n)/n. !f there exist constants
0( > 0, c > 0 and no such that

log« x
(12) C(x) > - c - -
Jx
for all x ~ no, then the Riemann hypothesis is true.
PROOF. If e > 0 is given there exists an n1 ~ no such that c log« x ::;; x' for
all x ~ n1 so (12) implies
C(x) > _X'- 1/2 .

186
Exercises for Chapter 8

Let A(x) = C(x) + X<-1/2, where E is fixed and 0 < E < 1. Then A(x) > 0
for all x ;:::: nl' Also, for (J > 1 we have

f OO A(x) d - fOC! C(x) d


- s-x- -s-x+ foo x <-s-1/2 d
x
1 X 1 X 1

((2s) + ~ = f(s)
(s - 1)((s) s - "2 - E '

say. Arguing as in the proof of Theorem 8.20, we find that the function f(s)
is analytic on the real line S > 1 + E. By Landau's theorem it follows that
f(s) is analytic in the half-plane (J > 1 + E. This implies that ((s) # 0 for
(J > 1 + E, hence ((s) # 0 for (J > 1 since E can be arbitrarily small. D

Note. Since each function (n(s) is a Dirichlet series which does not vanish
identically there exists a half-plane (J > 1 + (In in which (n(s) never vanishes.
(See [4], Theorem 11.4.) The exact value of (In is not yet known. In his 1948
paper [50] Tunin proved that, for all sufficiently large n, (n(s) # 0 in the
half-plane (J > 1 + 2(1og log n)/log n, hence (J n .::; 2(1og log n)/log n for large
n. In the other direction, H. L. Montgomery has shown that there exists a
constant c > 0 such that for all sufficiently large n, (n(s) has a zero in the half-
plane (J > 1 + c(log log n)/log n, hence (In ;;:: c(1og log n)/log n for large n.
The number 1 + (In is also equal to the abscissa of convergence of the
Dirichlet series for the reciprocall/(n(s). If (J > 1 + (In we can write
1 _ f
l1ik)
(n(s) - k~ 1 7 '

where I1n(k) is the Dirichlet inverse of the function un(k) given by

un(k) =
,
{I 0
~f k .::; n,
If k > n.
The usual Mobius function l1(k) is the limiting case of I1n(k) as n -> 00.

Exercises for Chapter 8


1. If L: a(n)e-sJ.(n) has abscissa of convergence (Je < 0, prove that

_ I'
(Je - 1m sup
log I D"=n a(k)1
1 '
n-", Il(n)

2. Let (Je and (Ja denote the abscissae of convergence and absolute convergence of a
Dirichlet series. Prove that

(Ja - °: ;. log n
(Je ::; hm sup ,l.(n)·

°: ;
n-a;

This gives (Ja - (Je ::; 1 for ordinary Dirichlet series.

187
8: General Dirichlet series and Bohr's equivalence theorem

3. If log n/).(n) ---+ 0 as n ---+ 00 prove that


. logla(n)1
(Ja=(J,=lImsu p ).( .
n~oc n)

What does this imply about the radius of convergence of a power series?

4. Let Urn)} be a sequence of complex numbers. Let A denote the set of aU points
s = (J + it for which the series L a(n)e-sA(nl converges absolutely. Prove that A
is convex.

Exercises 5, 6, and 7 refer to the seriesfl\) = I:= I a(ll)e- SA (n) with exponents
and coefficients given as follows

n 1 2 3 4 5

A(n) -1 - log 2 -1 -log 2 -1 + log 2 0

a(n) 1
8
1
2" 4
1
-8
1 1
2"

n 6 7 8 9 10

A(n) 1 - log 2 log 2 1 log 3 1 + log 2

a(n) 1
8 -4
1 1
2" - 43 -8
1

Also, a(n + 10) = -i 2- n and A(n + 10) = (n + 1) log 3 for n ~ 1.


5. Prove that (Ja = -(log 2)/log 3.
6. Show that the Bohr function corresponding to the basis B = (I, log 2, log 3) is
1 - 2e-· 3
F(zl' Z2, Z3) = cos(izd - ti sin(iz2)(1 + cos(izd) + 2 - e ., ,

ifx3> -log2,zl,z2arbitrary.
7. Determine the set Vj(O). Hint: The points -1, 1 + i, 1 - i are significant.
8. Assume the Dirichlet series f(s) = L;'= I a(n)e-s!.(n) converges absolutely for (J > (Ja'

If (J > (J a prove that

·
I1m -
1 fT eA(o+it}f( (J + It')d t = {a(n) if).=).(Il)
T~+-,;2T -T 0 ifX#).(1),).(2), ....

9. Assume the series f(s) = L::'=I a(ll)e- S A( n l converges absolutely for (J > (Ja > O. Let
v(n) = eArn).

(a) Prove that the series g(s) = L:= 1 a(Il)e-S\'ln) converges absolutely if (J > O.
(b) If (J > (Ja prove that

r(s)f(s) = f' g(t)t S - 1 dt.

188
Exercises for Chapter 8

f'"
This extends the classic formula for the Riemann zeta function,
rs-I
r(s)((s) = -,- dr.
o e - 1
Hinr: First show that r(s)e-s).ln l = J~ e-'''ln1rs-1 cit.

189
Supplement to Chapter 3

Alternate proof of Dedekind's functional equation


This supplement gives an alternate proof of Dedekind's functional equation
as stated in Theorem 3.4:

Theorem. If A = (: ~) E rand c > 0, thenfor every T in H we have


(1)
aT
71 ( C'T + b)
+ d = e(A){ - i(cT + d)}127j(T),
J

where

(2) e(A) = exp{-1Ti(a 1;Cd - s(d, C))}


and s(d, c) is a Dedekind sum.

The alternate proof was suggested by Basil Gordon and is based on the fact
that the modular group r has the two generators TT = T + 1 and ST =
- liT. In Theorem 2.1 we showed that every A in r can be expressed in
the form
A = T"IST"2S· •• ST''r,
where the n i are integers. But T = ST-IST-IS, so every element of r also
has the form ST'"IS ••• STInk for some choice of integers m l , • • • , mk. The
idea of the proof is to show that if the functional equation (1) holds for a

particular transformation A = (: ~) in r with c > 0 and with e(A) as

specified in (2), then it also holds for the products AT'" and AS for every

190
Supplement to Chapter 3

integer m. (See Lemma 3 below.) In Theorem 3.1 we proved that it holds


for S. Therefore, because every element of r has the form STmlS •.. STmk,
it follows that the functional equation (1) holds for every A with e > O.
The argument is divided into three lemmas that show that the general
functional equation is a consequence of the special case in Theorem 3.1
together with three basic properties of Dedekind sums derived in Sections
3.7 and 3.8. The first two lemmas relate e(A) with e(ATm) and e(AS), where
T and S are the generators of the modular group.

Lemma l. If A = (~ ~) E rand e > 0, then for every integer m we

have

PROOF. We have AT'" = ( ae b)(l m) = (a am + b),


dOl e em + d so

e(ATm) = ex p{ 1Ti(a + ~~ + d - s(em + d, c»)}.


But s(em + d, c) = s(d. c) by Theorem 3.5(a), and hence, we obtain
Lemma I. o
Lemma 2. If A (~ ~) E rand e > 0, then we have

e-1Til4e(A) ifd>O,
e(AS) ={ e1Ti/4e(A) if d < O.

PROOF. We have

AS
= (ae b)(O
d 1
- 1) = (b
0 d
- a)
-c'
If d > 0, we represent the transformation AS by the matrix

AS = (~ -a),
-e
but if d < 0, then - d > 0 and we use the representation AS

(=~ ~).
For d > 0, we have

(3) e(AS) = ex p{ 1Ti(bl~e - s( -e, d»)}


= exp{ 1Ti(bl ; / + s(e, d»)}
191
Supplement to Chapter 3

because s( - e, d) = - s(e, d) by Theorem 3.5(a). The reciprocity law for


Dedekind sums implies
c d 1 1
s(e, d) + s(d, e) = 12d + 12e - 4 + 12cd"

We replace the numerator in the last fraction by ad - bc and rearrange


terms to obtain

b-e a+d 1
l2d + s(c, d) = ~ - s(d, c) - 4·
Using this in (3), we find e(AS) = e- 1Til4e(A).

If d < 0, we use the representation AS = (= ~ ;) to obtain

(4) e(AS) = ex p{ 1Ti( -!l;d -d»)}.


e - s(c,

In this case, -d'> 0 and we use the reciprocity law in the form

c d 1 ad - be
s(e, - d) + s( - d, e) = _ 12d - 12c - 4 - 12cd·

Rearrange terms and use s( -d, c) = -s(d, e) to obtain

-b+e a+d 1
-12d - s(c, -d) = ~ - s(d, e) + 4·
Using this in (4), we find that e(AS) e1Ti/4e(A). This completes the proof
of Lemma 2. o
Lemma 3. If Dedekind's functional equation
(5)

is satisfied for some A = (; ~) in r with e > 0 and e(A) given by (2),

then it is also satisfied for AT'" and for AS. That is, (5) implies

(6) T/(AT'"T) = e(ATm){ - i(cT + d + mc)}1/2T/(T),


and

(7) T/(AST) = e(AS){ -i(dT - eW /2 T/(T) ifd> 0,

whereas

(8) T/(AST) = e(AS){ - i( - dT + e)}1/2T/(T) if d < O.

192
Supplement to Chapter 3

PROOF. Replace r by T"'r in (5) to obtain


1)(AT mr) = s(A){ - i(cTmr + d)}1121)(Tmr)
= e(A){ - i(cr + mc + dW'2e1Timil21)(r).
Using Lemma 2 we obtain (6).
Now replace r by Sr in (5) to get
1)(ASr) = s(A){ - i(cSr + d)}1121)(Sr).
Using Theorem 3.1 we can write this as
(9) 1)(ASr) = s(A){ - i(cSr + d)}112 { - irr'21)(r).
If d > 0, we write
c dr - c
cSr + d = - - + d = - - '
r r'

hence,
- i(dr - c)
r + d) --
- I.( CS . e -1TiI2 ,
-Ir

and therefore, {-i(cSr + d)}112 {-ir}1/2 = e- 1TiI4{-i(dr - C)}112. Using this


in (9) together with Lemma 3, we obtain (7).
If d < 0, we write
c -dr +c
cSr + d = --r + d = - -
-r
-
so that in this case we have
- i( - dr + c) 1Ti12
- i(cSr + d) = . e,
-Ir

and therefore, {-i(cSr + d)}112 {-irfl2 = e1Ti/4{-i(-dr + C)f/2. Using


this in (9) together with Lemma 3, we obtain (8). 0

Remark on the root of unity e(A)


Dedekind's functional equation (I), with an unspecified 24th root of unity
e(A), follows immediately by extracting 24th roots in t~e functional equation
for ~(r). Much of the effort in this theory is directed at showing that the
root of unity s(A) has the form given in (2). It is of interest to note that a
simple argument due to Dedekind gives the following theorem:

Theorem. If (I) holds whenever A = (: !) E rand c ¥- 0, then

s(A) = ex p{1Ti(a l ;Cd - f(d, c»)}

for some rational number f(d, c) depending only on d and c.

193
Supplement to Chapter 3

PROOF. Let
aT +b I a'T + b'
AT=--- and
CT + d AT= CT+d

be two transformations in r having the same denominator CT + d. Then


ad-bc=1 and a'd - b'c = I,
so both pairs a, b and a', b' are solutions of the linear Diophantine equation
xd - yc = 1.
Consequently, there is an integer n such that
a' = a + nc, b' = b + nd.

Hence,

A'T = (a + nC)T + (b + nd) = aT + b + n = AT + n.


CT+d cT+d
Therefore, we have
7/(A'T) = 7/(AT + n) = e1Tin/127/(AT) = e1Tin/12e(A){ - i(CT + d)}1127/(T),
because of (1). On the other hand, (I) also gives us
7/(A 'T) = e(A'){ -i(CT + d)}1127/(T).
Comparing the two expressions for 7/(A 'T), we find e(A') e1TinI12e(A). But
n = (a ' - a)lc, so

= exp ( 7Ti(a12c a»)


l
-
e(A') e(A) ,

or

( ')
7TlQ
. e(A)I
exp - 12c = exp (. )
7TlQ
- 12c e(A).

This shows that the product exp( - ~~:)e(A) depends only on C and d. There-
fore, the same is true for the product

exp ( -
7Ti(a +
12c
d») e(A).
This complex number has absolute value 1 and can be written as

exp(
7Ti(a +
12c
d») e(A) = exp(-7Tif(d, c»
194
Supplement to Chapter 3

for some real number fed. c) depending only on c and d. Hence,

e(A) = exp{ 7T{a I;C d - fed. C))}.


Because e 24 = I, it follows that 12cf(d. c) is an integer, so fed. c) IS a
rational number. 0

195
Bibliography

1. Apostol, Tom M. Sets of values taken by Dirichlet's L-series. Proc. Sympos. Pure
Math., Vol. VIII, 133-137. Amer. Math. Soc., Providence, R.I., 1965. MR 31
# 1229.
2. Apostol, Tom M. Calculus, Vol. II, 2nd Edition. John Wiley and Sons, Inc. New
York, 1969.
3. Apostol, Tom M. Mathematical Analysis, 2nd Edition. Addison-Wesley Publishing
Co., Reading, Mass., 1974.
4. Apostol, Tom M. Introduction to Analytic Number Theory. Undergraduate Texts
in Mathematics. Springer-Verlag, New York, 1976.
5. Atkin, A. O. L. and O'Brien, J. N. Some properties of p(n) and c(n) modulo powers
of 13. Trans. Amer. Math. Soc. 126 (1967),442-459. MR 35 #5390.
6. Bohr, Harald. Zur Theorie der allgemeinen Dirichletschen Reihen. Math. Ann.
79 (1919), 136-156.
7. Deligne, P. La conjecture de Wei!. I. Inst. haut. Etud sci., Publ. math. 43 (1973),
273-307 (1974). Z. 287, 14001.
8. Erdos, P. A note on Farey series. Quart. 1. Math., Oxford Ser. 14 (1943), 82-85.
MR 5, 236b.
9. Ford, Lester R. Fractions. Amer. Math. Monthly 45 (1938), 586-601.
10. Gantmacher, F. R. The Theory oj Matrices, Vol. I. Chelsea Pub!. Co., New York,
1959.
II. Gunning, R. C. Lectures on Modular Forms. Annals of Mathematics Studies, No.
48. Princeton Univ. Press, Princeton, New Jersey, 1962. MR 24 #A2664.
12. Gupta, Hansraj. An identity. Res. Bull. Panjab Univ. (N.S.) 15 (1964), 347-349
(1965). MR 32 #4070.
13. Hardy, G. H. and Ramanujan, S. Asymptotic formulae in combinatory analysis.
Proc. London Math. Soc. (2) 17 (1918),75-115.
14. Haselgrove, C. B. A disproof of a conjecture of P6lya. Mathematika 5 (1958),
141-145. MR 21 #3391.
15. Hecke, E. Uber die Bestimmung Dirichletscher Reihen durch ihre Funktional-
gleichung. Math. Ann. 112 (1936), 664--699.

196
Bibliography

16. Hecke, E. Ober Modulfunktionen und die Dirichlet Reihen mit Eulerscher Produkt-
entwicklung. I. Math. Ann. JJ4 (1937), 1-28; II. 316-351.
17. Iseki, Shoo The transformation formula for the Dedekind modular function and
related functional equations. Duke Math. J. 24 (1957),653-662. MR 19, 943a.
18. Knopp, Marvin I. Modular Functions in Analytic Number Theory. Markham
Mathematics Series, Markham Publishing Co., Chicago, 1970. MR 42 #198.
19. Lehmer, D. H. Ramanujan's function ,(n). Duke Math. J. 10 (1943), 483-492.
MR 5, 35b.
20. Lehmer, D. H. Properties of the coefficients of the modular invariant J(,). Amer.
J. Math. 64 (1942), 488-502. MR 3, 272c.
21. Lehmer, D. H. On the Hardy-Ramanujan series for the partition function. J.
London Math. Soc. 12 (1937),171-176.
22. Lehmer, D. H. On the remainders and convergence of the series for the partition
function. Trans. Amer. Math. Soc. 46 (1939),362-373. MR 1, 69c.
23. Lehner, Joseph. Divisibility properties of the Fourier coefficients of the modular
invariant}(,). Amer. J. Math. 71 (1949), 136-148. MR 10, 357a.
24. Lehner, Joseph. Further congruence properties of the Fourier coefficients of the
modular invariantj(')' Amer. J. Math. 71 (1949), 373-386. MR 10, 357b.
25. Lehner, Joseph, and Newman, Morris. Sums involving Farey fractions. Acta
Arith. 15 (1968/69),181-187. MR 39 # 134.
26. Lehner, Joseph. Lectures on Modular Forms. National Bureau of Standards,
Applied Mathematics Series, 61, Superintendent of Documents, U.S. Government
Printing Office, Washington, D.C., 1969. MR 41 #8666.
27. LeVeque, William Judson. Reviews in Number Theory, 6 volumes. American Math.
Soc., Providence, Rhode Island, 1974.
28. Mordell, Louis J. On Mr. Ramanujan's empirical expansions of modular functions.
Proc. Cambridge Phil. Soc. 19 (1917), 117-124.
29. Neville, Eric H. The structure of Farey series. Proc. London Math. Soc. 51 (1949),
132-144. MR 10, 681f.
30. Newman, Morris. Congruences for the coefficients of modular forms and for the
coefficients ofj(')' Proc. Amer. Math. Soc. 9 (1958), 609-612. MR 20 #5184.
31. Petersson, Hans. Ober die Entwicklungskoeffizienten der automorphen formen.
Acta Math. 58 (1932),169-215.
32. Petersson, Hans. Ober eine Metrisierung der ganzen Modulformen. Jber. Deutsche
Math. 49 (1939), 49-75.
33. Petersson, H. Konstruktion der samtlichen Losungen einer Riemannscher
Funktionalgleichung durch Dirichletreihen mit Eulersche Produktenwicklung. I.
Math. Ann. 116 (1939), 401-412. Z. 21, p. 22; II. 117 (1939),39-64. Z. 22,129.
34. Rademacher, Hans. Ober die Erzeugenden von Kongruenzuntergruppen der
Modulgruppe. Abh. Math. Seminar Hamburg, 7 (1929),134-148.
35. Rademacher, Hans. Zur Theorie der Modulfunktionen. J. Reine Angew. Math.
·167 (1932), 312-336.
36. Rademacher, Hans. On the partition function p(n). Proc. London Math. Soc. (2)
43 (1937), 241-254.
37. Rademacher, Hans. The Fourier coefficients of the modular invariantj(r). Amer.
J. Math. 60 (1938),501-512.
38. Rademacher, Hans. On the expansion of th'e partition function in a series. Ann. of
Math. (2) 44 (1943), 416-422. MR 5, 35a.
197
Bibliography

39. Rademacher, Hans. Topics in Analytic Number Theory. Die Grundlehren der
mathematischen Wissenschaften, Bd. 169, Springer-Verlag, New York-Heidel-
berg-Berlin, 1973. Z. 253.10002.
40. Rademacher, Hans and Grosswald, E. Dedekind Sums. Carus Mathematical
Monograph, 16. Mathematical Association of America, 1972. Z. 251. 10020.
41. Rademacher, Hans and Whiteman, Albert Leon. Theorems on Dedekind sums.
Amer. J. Math. 63 (1941),377-407. MR 2, 249f.
42. Rankin, Robert A. Modular Forms and Functions. Cambridge University Press,
Cambridge, Mass., 1977. MR 58 #16518.
43. Riemann, Bernhard. Gessamelte Mathematische Werke. B. G. Teubner, Leipzig,
1892. Erliiuterungen zu den Fragmenten XXVIII. Von R. Dedekind, pp. 466-
478.
44. Schoeneberg, Bruno. Elliptic Modular Functions. Die Grundlehren der mathema-
tischen Wissenschaften in Einzeldarstellungen, Bd. 203, Springer-Verlag, New
York-Heidelberg-Berlin, 1974. MR 54 #236.
45. Sczech, R. Ein einfacher Beweis der Transformationsformel fUr log 1](z). Math.
Ann. 237 (1978), 161-166. MR 58 #21948.
46. Selberg, Atle. On the estimation of coefficients of modular forms. Proc. Sympos.
Pure Math., Vol. VIII, pp. 1-15. Amer. Math. Soc., Providence, R.I., 1965. MR 32
#93.
47. Serre, Jean-Pierre. A Course in Arithmetic. Graduate Texts in Mathematics, 7.
Springer-Verlag, New York-Heidelberg-Berlin, 1973.
48. Siegel, Carl Ludwig. A simple proofoflJ( - liT) = IJ(T)JTii. Mathematika I (1954),
4. MR 16, 16b.
49. Titchmarsh, E. C.lntroduction to the Theory of Fourier Integrals. Oxford, Clarendon
Press, 1937.
50. Tunin, Paul. On some approximative Dirichlet polynomials in the theory of the
zeta-function of Riemann. DallSke Vid. Selsk. MaI.-Fys. Medd. 24 (1948), no. 17,
36 pp. MR 10, 286b.
51. Tunin, Paul. Nachtrag zu meiner Abhandlung "On some approximative Dirichlet
polynomials in the theory of the zeta-function of Riemann." Acta Math. A cad.
Sci. Hungar. JO (1959),277-298. MR 22 #6774.
52. Uspensky, J. V. Asymptotic formulae for numerical functions which occur in the
theory of partitions [Russian]. Bull. A cad. Sci. URSS (6) 14 (1920), 199-218.
53. Watson, G. N. A Treatise on the Theory of Bessel Functions, 2nd Edition. Cambridge
University Press, Cambridge, 1962.

198
Index of special symbols

Q(W I , w z ) lattice generated by WI and Wz , 2


f.J(z) Weierstrass f.J-function, 10
Gn Eisenstein series of order 11, 11 ~ 3, 12
Gz Eisenstein series of order 2, 69
gZ,g3 invariants, 12
e l , ez, e3 values of f.J at the half-periods, 13
l1(Wl, w z), l1(t) discriminant g~ - 27g~, 14
H upper half-plane Im(t) > 0, 14
J(t) Klein's modular function g~/l1, 15
t(l1) Ramanujan tau function, 20
a.(n) sum of the IXth powers of divisors of 11, 20
r modular group, 28
S,T generators of r, 28
RG fundamental region of sub-group G of r, 30
R fundamental region of r, 31
1/(t) Dedekind eta function, 47
s(h, k) Dedekind sum, 52
A(X) -Iog(l - e- 2U ), 52
A(IX, {3, z) Iseki's function, 53
'(s, a) Hurwitz zeta function, 55
F(x, s) periodic zeta function, 55
j(t) 12 3J(t), 74
ro(q) congruence subgroup of r, 75

!p(t)
Ip-I
- IJ- ,
('+A) 80
P P
l=O

( l1(qt)Y/(Q-I)
<1>( t) 86
l1(t) ,
9(t) Jacobi theta function, 91
p(n) partition function, 94

199
Index of special symbols

F(x) generating function for p(Il), 94


Fn set of Farey fractions of order 11, 98
Mk linear space of entire forms of weight k, 117
Mk,o subspace of cusp forms of weight k, 119
T" Hecke operator, 120
r(11) set of transformations of order 11, 122
K dim M Zk,O, 133
EZk(r) normalized Eisenstein series, 139
F(Z) Bohr function associated with Dirichlet series, 168
VJ((To) set of values taken by Dirichlet series f(s) on line (T = (To, 170
'n(s) partial sums I k- S , 185
k5n

200
Index

A Bohr, equivalence theorem, 178


function of a Dirichlet series, 168
Abscissa, of absolute convergence, 165
matrix, 167
of convergence, 165
Additive number theory, I
Apostol, Tom M., 196
Approximation theorem, of Dirichlet, 143
c
of Kronecker, 148, 150, 154 Circle method, 96
of Liouville, 146 Class number of quadratic form, 45
Asymptotic formula for p(n), 94, 104 Congruence properties, of coefficients of
Atkin, A. O. L., 91, 196 j(r), 22, 90
Automorphic function, 79 of Dedekind sums, 64
Congruence subgroup, 75
Cusp form, 114

B
Basis for sequence of exponents, 166 D
Bernoulli numbers, 132 Davenport, Harold, 136
Bernoulli polynomials, 54 Dedekind, Richard, 47
Berwick, W. E. H., 22 Dedekind function '1(r), 47
Bessel functions, 109 Dedekind sums, 52, 61
Bohr, Harald, 161,196 Deligne, Pierre, 136, 140, 196

201
Index

Differential equation for p(z), II


Dirichlet, Peter Gustav Lejeune, 143
Dirichlet's approximation theorem, 143 G
Dirichlet L-function, 184 g2,g3,12
Dirichlet series, 161 General Dirichlet series, 161
Discriminant d(-r), 14 Generators, of modular group r, 28
Divisor functions a.(n), 20 of congruence subgroup r o(P), 78
Doubly periodic functions, 2 Grosswald, Emil, 61, 198
Gupta, Hansraj, III, 196

E
e l ,e 2 ,e3 ,13 H
Eigenvalues of Hecke operators, 129 Half-plane H, 14
Eisenstein series G., 12 Half-plane, of absolute convergence, 165
recursion formula for, 13 of convergence, 165
Elliptic functions, 4 Hardy, Godfrey Harold, 94, 196
Entire modular forms, 114 Hardy-Ramanujan formula for pen), 94
Equivalence, of general Dirichlet series, Haselgrove, C. B., 186, 196
173 Hecke, Erich, 114, 120, 133, 196, 197
of ordinary Dirichlet series, 174 Hecke operators T., 120
of pairs of periods, 4 Helly, Eduard, 179
of points in the upper half-plane H, 30 Helly selection principle, 179
of quadratic forms, 45 Hurwitz, Adolf, 55, 145
Estimates for coefficients of modular Hurwitz approximation theorem, 145
forms, 134 Hurwitz zeta function, 55, 71
Euler, Leonhard, 94
Euler products of Dirichlet series, 136
Exponents of a general Dirichlet
I
series, 161 InvariantsB2,B3,12
Inversion problem for Eisenstein
series, 42
F Iseki, Sho, 52, 197
Farey fractions, 98 Iseki's transformation formula, 53
Ford, L. R., 99, 196
Ford circles, 99
Fourier coefficients ofj(-r), 21, 74 J
divisibility properties of, 22, 74, 91 j(-r), J(-r), 74, 15
Functional equation, for "I(-r), 48, 52 Fourier coefficients of, 21
for .9(-r), 91 Jacobi, Carl Gustav Jacob, 6, 91,141
for C(s), 140 Jacobi theta function, 91, 141.
for t\(0(, p, z), 54 Jacobi triple product identity, 91
for clI(lX, p, s), 56, 71
Fundamental pairs of periods, 2
Fundamental region, of modular group K
r. 31 Klein, Felix, 15
of subgroup r o(P), 76 Klein modular invariant J(-r), 15

202
Index

Kloostennan. H. D .• 136
Knopp. Marvin I., 197
Kronecker, Leopold. 148 N
Kronecker approximation theorem, 148. Neville, Eric Harold, 110, 197
150,154 Newman, Morris, 91, Ill, 197
Normalized eigenform, 130

L
Lambert, Johann Heinrich, 24
o
Lambert series. 24 O'Brien, J. N., 91,196
Landau, Edmund. 186 Order of an elliptic function, 6
Lehmer. Derrick Henry. 22, 93, 95, 197
Lehmer conjecture, 22
Lehner, Joseph, 22, 91, Ill,) 97
p
LeVeque, William Judson, 197 f.J-function of Weierstrass, 10
Linear space Mk of entire forms, 118 Partition function pen), 1, 94
Linear subspace M k. 0 of cusp forms, 119 Period, 1
Liouville, Joseph, 5, 146, 184 Period parallelogram, 2
Liouville approximation theorem, 146 Periodic zeta function, 55
Liouville function ).(n), 25, 184 Petersson, Hans, 22, 133, 140, 197
Liouville numbers, 147 Petersson inner product, 133
Littlewood, John Edensor, 95 Petersson-Ramanujan conjecture, 140
Picard, Charles Emile, 43
Picard's theorem, 43
Product representation for A(t), 51
M
Mapping properties of J(t), 40
Mediant,98
Q
Mellin, Robert Hjalmar. 54 Quadratic forms, 45
Mellin inversion formula, 54
Mobius, Augustus Ferdinand, 24, 27, 187
Mobius function, 24, 187 R
Mobius transformation, 27 Rademacher, Hans, 22, 62, 95, 102,
Modular forms, 114 104, 197
and Dirichlet series, 136 Rademacher path of integration, 102
Modular function, 34 Rademacher series for pen), 104
Modular group r, 28 Ramanujan, Srinivasa, 20, 92, 94,
subgroups of, 46, 75 136,191
Montgomery, H. L., 187 Ramanujan conjecture, 136
Mordell, Louis Joel, 92, 197 Ramanujan tau function, 20, 22, 92,
Multiplicative property, of coefficients of 113, 131, 198
entire forms, 130 Rankin, Robert A., 136, 198
of Hecke operators, 126, 127 Reciprocity law for Dedekind sums, 62
of Ramanujan tau function, 93, 114 Representative of quadratic form, 45

203
Index

Riemann, Georg Friedrich Bernhard,


140, 155, 185, 198
Riemann zeta function, 20, 140, 155,
w
18S, 189 Watson, G. N., 109, 198
Rouche, Eugene, 180 Weierstrass, Karl, 6
Rouche's theorem, 180 Weierstrass 8c1-function, 10
Weight of a modular form, 114
Weight formula for zeros of an entire

s form, 115
Whiteman, Albert Leon, 62, 198
Sa lie, Hans, 136
Schoeneberg, Bruno, 198
Sczech, R., 61, 198 z
Selberg, Atle, 136, 198
Zeros, of an elliptic function, 5
Serre, Jean-Pierre, 198
Zeta function, Hurwitz, 55
Siegel, Carl Ludwig, 48, 198
periodic, 55
Simultaneous eigenforms, 130
Riemann, 140, 155, 185, 189
Spitzenform, 114
Zuckerman, Herbert S., 22
Subgroups of the modular groups, 46, 75

T
Tau function, 20, 22, 92, I 13, 131
Theta function, 91,141
Transcendental numbers, 147
Transformation of order n, 122
Transformation formula, of Dedekind,
48,52
of Iseki, 54
Tunin, Paul, 185, 198
Tunin's theorem, 185, 186

u
Univalent modular function, 84
Uspensky, J. V., 94, 198

V
Valence of a modular function, 84
Van Wijngaarden, A., 22
Values, of J(r), 39
of Dirichlet series, 170
Vertices of fundamental region, 34

204
Graduate Texts in Mathematics
(comillued from page ii)

66 WATERHOUSE. Introduction to Affine 100 BERG/CHRISTENSEN/RESSEL. Hannonic


Group Schemes. Analysis on Semigroups: Theory of
67 SERRE. Local Fields. Positive Definite and Related Functions.
68 WEIDMANN. Linear Operators in Hilbert 101 EDWARDS. Galois Theory.
Spaces. 102 VARADARAJAN. Lie Groups, Lie Algebras
69 LANG. Cyclotomic Fields II. and Their Representations.
70 MASSEY. Singular Homology Theory. 103 LANG. Complex Analysis. 3rd ed.
71 FARKAS/KRA. Riemann Surfaces. 2nd ed. 104 DUBROVIN/FoMENKOINOVIKOV. Modem
72 STILLWELL. Classical Topology and Geometry-Methods and Applications.
Combinatorial Group Theory. 2nd ed. Part II.
73 HUNGERFORD. Algebra. 105 LANG. SL 2(R).
74 DAVENPORT. Multiplicative Number 106 SILVERMAN. The Arithmetic of Elliptic
Theory. 3rd ed. Curves.
75 HOCHSCHILD. Basic Theory of Algebraic 107 OLVER. Applications of Lie Groups to
Groups and Lie Algebras. Differential Equations. 2nd ed.
76 IrTAKA. Algebraic Geometry. 108 RANGE. Holomorphic Functions and
77 HECKE. Lectures on the Theory of Integral Representations in Several
Algebraic Numbers. Complex Variables.
78 BURRIS/SANKAPPANAVAR. A Course in 109 LEHTO. Univalent Functions and
Universal Algebra. Teichmiiller Spaces.
79 WALTERS. An Introduction to Ergodic 110 LANG. Algebraic Number Theory.
Theory. III HUSEMOLLER. Elliptic Curves.
80 ROBINSON. A Course in the Theory of 112 LANG. Elliptic Functions.
Groups. 2nd ed. 113 KARATZAS/SHREVE. Brownian Motion and
81 FORSTER. Lectures on Riemann Surfaces. Stochastic Calculus. 2nd ed.
82 BOTT/Tu. Differential Fonns in Algebraic 114 KOBLITZ. A Course in Number Theory and
Topology. Cryptography. 2nd ed.
83 WASHINGTON. Introduction to Cyclotomic 115 BERGERIGOSTIAUX. Differential Geometry:
Fields. 2nd ed. Manifolds, Curves, and Surfaces.
84 IRELAND/RoSEN. A Classical Introduction 116 KELLEy/SRINIVASAN. Measure and Integral.
to Modem Number Theory. 2nd ed. Vol. I.
85 EDWARDS. Fourier Series. Vol. II. 2nd ed. 117 SERRE. Algebraic Groups and Class Fields.
86 VAN LINT. Introduction to Coding Theory. 118 PEDERSEN. Analysis Now.
2nd ed. 119 ROTMAN. An Introduction to Algebraic
87 BROWN. Cohomology of Groups. Topology.
88 PIERCE. Associative Algebras. 120 ZIEMER. Weakly Differentiable Functions:
89 LANG. Introduction to Algebraic and Sobolev Spaces and Functions of Bounded
Abelian Functions. 2nd ed. Variation.
90 BR0NDSTED. An Introduction to Convex 121 LANG. Cyclotomic Fields I and II.
Polytopes. Combined 2nd ed.
91 BEARDON. On the Geometry of Discrete 122 REMMERT. Theory of Complex Functions.
Groups. Readings in Mathematics
92 DIESTEL. Sequences and Series in Banach 123 EBBINGHAUS/HERMES et al. Numbers.
Spaces. Readings ill Mathematics
93 DUBROVIN/FoMENKOlNovIKOV. Modem 124 DUBROVIN/FoMENKOlNovIKOV. Modem
Geometry-Methods and Applications. Geometry-Methods and Applications.
Part I. 2nd ed. Part III.
94 WARNER. Foundations of Differentiable 125 BERENSTEIN/GAY. Complex Variables: An
Manifolds and Lie Groups. Introduction.
95 SHIRYAEV. Probability. 2nd ed. 126 BOREL. Linear Algebraic Groups. 2nd ed.
96 CONWAY. A Course in Functional 127 MASSEY. A Basic Course in Algebraic
Analysis. 2nd ed. Topology.
97 KOBLITZ. Introduction to Elliptic Curves 128 RAUCH. Partial Differential Equations.
and Modular Fonns. 2nd ed. 129 FULTON/HARRIs. Representation Theory: A
98 BROCKERIToM DIECK. Representations of First Course.
Compact Lie Groups. Readings in Mathematics
99 GROVE/BENSON. Finite Reflection Groups. 130 DODSON/POSTON. Tensor Geometry.
2nd ed.
131 LAM. A First Course in Noncommutative 165 NATHANSON. Additive Number Theory:
Rings. Inverse Problems and the Geometry of
132 BEARDON. Iteration of Rational Functions. Sum sets.
133 HARRIS. Algebraic Geometry: A First 166 SHARPE. Differential Geometry: Cartan's
Course. Generalization of Klein's Erlangen
134 ROMAN. Coding and Information Theory. Program.
135 ROMAN. Advanced Linear Algebra. 167 MORANDI. Field and Galois Theory.
136 ADKINS/WEINTRAUB. Algebra: An 168 EWALD. Combinatorial Convexity and
Approach via Module Theory. Algebraic Geometry.
137 AXLERIBoURDON/RAMEY. Harmonic 169 BHATIA. Matrix Analysis.
Function Theory. 2nd ed. 170 BREDON. Sheaf Theory. 2nd ed.
138 COHEN. A Course in Computational 171 PETERSEN. Riemannian Geometry.
Algebraic Number Theory. 172 REMMERT. Classical Topics in Complex
139 BREDON. Topology and Geometry. Function Theory.
140 AUBIN. Optima and Equilibria. An 173 DIESTEL. Graph Theory. 2nd ed.
Introduction to Nonlinear Analysis. 174 BRIDGES. Foundations of Real and
141 BECKERIWEISPFENNING/KREDEL. Grabner Abstract Analysis.
Bases. A Computational Approach to 175 LICKORISH. An Introduction to Knot
Commutative Algebra. Theory.
142 LANG. Real and Functional Analysis. 176 LEE. Riemannian Manifolds.
3rd ed. 177 NEWMAN. Analytic Number Theory.
143 DOOB. Measure Theory. 178 CLARKEILEDYAEV/STERNlWoLENSKI.
144 DENNIS/FARB. Noncommutative Nonsmooth Analysis and Control
Algebra. Theory.
145 VICK. Homology Theory. An 179 DOUGLAS. Banach Algebra Techniques in
Introduction to Algebraic Topology. Operator Theory. 2nd ed.
2nd ed. 180 SRIVASTAVA. A Course on Borel Sets.
146 BRIDGES. Computability: A 181 KRESS. Numerical Analysis.
Mathematical Sketchbook. 182 WALTER. Ordinary Differential
147 ROSENBERG. Algebraic K- Theory Equations.
and Its Applications. 183 MEGGINSON. An Introduction to Banach
148 ROTMAN. An Introduction to the Space Theory.
Theory of Groups. 4th ed. 184 BOLLOBAS. Modem Graph Theory.
149 RATCLIFFE. Foundations of 185 COx/LITTLElO'SHEA. Using Algebraic
Hyperbolic Manifolds. Geometry.
150 EISEN BUD. Commutative Algebra 186 RAMAKRISHNANIVALENZA. Fourier
with a View Toward Algebraic Analysis on Number Fields.
Geometry. 187 HARRIS/MoRRISON. Moduli of Curves.
151 SILVERMAN. Advanced Topics in 188 GOLDBLATT. Lectures on the Hyperreals:
the Arithmetic of Elliptic Curves. An Introduction to Nonstandard Analysis.
152 ZIEGLER. Lectures on Polytopes. 189 LAM. Lectures on Modules and Rings.
153 FULTON. Algebraic Topology: A 190 ESMONDE/MuRTY. Problems in Algebraic
First Course. Number Theory.
154 BROWN/PEARCY. An Introduction to 191 LANG. Fundamentals of Differential
Analysis. Geometry.
155 KASSEL. Quantum Groups. 192 HIRSCH/LACOMBE. Elements of Functional
156 KECHRIS. Classical Descriptive Set Analysis.
Theory. 193 COHEN. Advanced Topics in
157 MALLIAVIN. Integration and Computational Number Theory.
Probability. 194 ENGELINAGEL. One-Parameter Semi groups
158 ROMAN. Field Theory. for Linear Evolution &}uations.
159 CONWAY. Functions of One 195 NATHANSON. Elementary Methods in
Complex Variable II. Number Theory.
160 LANG. Differential and Riemannian 196 OSBORNE. Basic Homological Algebra.
Manifolds. 197 EISENBUD/HARRIS. The Geometry of
161 BORWEIN/ERDEL YI. Polynomials and Schemes.
Polynomial Inequalities. 198 ROBERT. A Course inp-adic Analysis.
162 ALPERIN/BELL. Groups and 199 HEDENMALM/KORENBLUM/ZHU. Theory
Representations. of Bergman Spaces.
163 DIXON/MORTIMER. Permutation Groups. 200 BAO/CHERN/SHEN. An Introduction to
164 NATHANSON. Additive Number Theory: Riemann-Finsler Geometry.
The Classical Bases.
201 HINDRY/SILVERMAN. Diophantine 204 ESCOFIER. Galois Theory.
Geometry: An Introduction. 205 FELlXlHALPERINITHOMAS. Rational
202 LEE. Introduction to Topological Homotopy Theory.
Manifolds. 206 MURTY. Problems in Analytic Number
203 SAGAN. The Symmetric Group: Theory.
Representations, Combinatorial Readings in Mathematics
Algorithms, and Symmetric Functions. 207 GODSILlRoYLE. Algebraic Graph Theory.
2nd ed.

You might also like