RM 1
RM 1
The Riemann hypothesis (RH) is perhaps the most important outstanding problem
in mathematics. This two-volume text presents the main known equivalents to RH
using analytic and computational methods. The books are gentle on the reader with
definitions repeated, proofs split into logical sections, and graphical descriptions of
the relations between different results. They also include extensive tables,
supplementary computational tools, and open problems suitable for research.
Accompanying software is free to download.
These books will interest mathematicians who wish to update their knowledge,
graduate and senior undergraduate students seeking accessible research problems in
number theory, and others who want to explore and extend results computationally.
Each volume can be read independently.
Volume 1 presents classical and modern arithmetic equivalents to RH, with some
analytic methods. Volume 2 covers equivalences with a strong analytic orientation,
supported by an extensive set of appendices containing fully developed proofs.
This series is devoted to significant topics or themes that have wide application in
mathematics or mathematical science and for which a detailed development of the
abstract theory is less important than a thorough and concrete exploration of the
implications and applications.
All the titles listed below can be obtained from good booksellers or from Cambridge
University Press. For a complete series listing visit
www.cambridge.org/mathematics.
119 M. Deza and M. Dutour Sikirić Geometry of Chemical Graphs
120 T. Nishiura Absolute Measurable Spaces
121 M. Prest Purity, Spectra and Localisation
122 S. Khrushchev Orthogonal Polynomials and Continued Fractions
123 H. Nagamochi and T. Ibaraki Algorithmic Aspects of Graph Connectivity
124 F. W. King Hilbert Transforms I
125 F. W. King Hilbert Transforms II
126 O. Calin and D.-C. Chang Sub-Riemannian Geometry
127 M. Grabisch et al. Aggregation Functions
128 L. W. Beineke and R. J. Wilson (eds.) with J. L. Gross and T. W. Tucker Topics in Topological
Graph Theory
129 J. Berstel, D. Perrin and C. Reutenauer Codes and Automata
130 T. G. Faticoni Modules over Endomorphism Rings
131 H. Morimoto Stochastic Control and Mathematical Modeling
132 G. Schmidt Relational Mathematics
133 P. Kornerup and D. W. Matula Finite Precision Number Systems and Arithmetic
134 Y. Crama and P. L. Hammer (eds.) Boolean Models and Methods in Mathematics, Computer
Science, and Engineering
135 V. Berthé and M. Rigo (eds.) Combinatorics, Automata and Number Theory
136 A. Kristály, V. D. Rădulescu and C. Varga Variational Principles in Mathematical Physics,
Geometry, and Economics
137 J. Berstel and C. Reutenauer Noncommutative Rational Series with Applications
138 B. Courcelle and J. Engelfriet Graph Structure and Monadic Second-Order Logic
139 M. Fiedler Matrices and Graphs in Geometry
140 N. Vakil Real Analysis through Modern Infinitesimals
141 R. B. Paris Hadamard Expansions and Hyperasymptotic Evaluation
142 Y. Crama and P. L. Hammer Boolean Functions
143 A. Arapostathis, V. S. Borkar and M. K. Ghosh Ergodic Control of Diffusion Processes
144 N. Caspard, B. Leclerc and B. Monjardet Finite Ordered Sets
145 D. Z. Arov and H. Dym Bitangential Direct and Inverse Problems for Systems of Integral and
Differential Equations
146 G. Dassios Ellipsoidal Harmonics
147 L. W. Beineke and R. J. Wilson (eds.) with O. R. Oellermann Topics in Structural Graph Theory
148 L. Berlyand, A. G. Kolpakov and A. Novikov Introduction to the Network Approximation Method
for Materials Modeling
149 M. Baake and U. Grimm Aperiodic Order I: A Mathematical Invitation
150 J. Borwein et al. Lattice Sums Then and Now
151 R. Schneider Convex Bodies: The Brunn–Minkowski Theory (Second Edition)
152 G. Da Prato and J. Zabczyk Stochastic Equations in Infinite Dimensions (Second Edition)
153 D. Hofmann, G. J. Seal and W. Tholen (eds.) Monoidal Topology
154 M. Cabrera Garcı́a and Á. Rodrı́guez Palacios Non-Associative Normed Algebras I: The
Vidav–Palmer and Gelfand–Naimark Theorems
155 C. F. Dunkl and Y. Xu Orthogonal Polynomials of Several Variables (Second Edition)
156 L. W. Beineke and R. J. Wilson (eds.) with B. Toft Topics in Chromatic Graph Theory
157 T. Mora Solving Polynomial Equation Systems III: Algebraic Solving
158 T. Mora Solving Polynomial Equation Systems IV: Buchberger Theory and Beyond
159 V. Berthé and M. Rigo (eds.) Combinatorics, Words and Symbolic Dynamics
160 B. Rubin Introduction to Radon Transforms: With Elements of Fractional Calculus and Harmonic
Analysis
161 M. Ghergu and S. D. Taliaferro Isolated Singularities in Partial Differential Inequalities
162 G. Molica Bisci, V. Radulescu and R. Servadei Variational Methods for Nonlocal Fractional
Problems
163 S. Wagon The Banach–Tarski Paradox (Second Edition)
164 K. Broughan Equivalents of the Riemann Hypothesis I: Arithmetic Equivalents
165 K. Broughan Equivalents of the Riemann Hypothesis II: Analytic Equivalents
166 M. Baake and U. Grimm Aperiodic Order II: Representation Theory and the Zelmanov Approach
Encyclopedia of Mathematics and Its Applications
KEVIN BROUGHAN
University of Waikato, New Zealand
University Printing House, Cambridge CB2 8BS, United Kingdom
One Liberty Plaza, 20th Floor, New York, NY 10006, USA
477 Williamstown Road, Port Melbourne, VIC 3207, Australia
4843/24, 2nd Floor, Ansari Road, Daryaganj, Delhi – 110002, India
79 Anson Road, #06-04/06, Singapore 079906
www.cambridge.org
Information on this title: www.cambridge.org/9781107197046
DOI: 10.1017/9781108178228
c Kevin Broughan 2017
This publication is in copyright. Subject to statutory exception
and to the provisions of relevant collective licensing agreements,
no reproduction of any part may take place without the written
permission of Cambridge University Press.
First published 2017
Printed in the United Kingdom by Clays, St Ives plc
A catalogue record for this publication is available from the British Library.
Library of Congress Cataloging-in-Publication Data
Names: Broughan, Kevin A. (Kevin Alfred), 1943– author.
Title: Equivalents of the Riemann hypothesis / Kevin Broughan,
University of Waikato, New Zealand.
Description: Cambridge : Cambridge University Press, 2017– |
Series: Encyclopedia of mathematics and its applications ; 164 |
Includes bibliographical references and index. Contents: volume 1. Arithmetic equivalents
Identifiers: LCCN 2017034308 | ISBN 9781107197046 (hardback : alk. paper : v. 1)
Subjects: LCSH: Riemann hypothesis.
Classification: LCC QA246 .B745 2017 |
DDC 512.7/3–dc23 LC record available at https://lccn.loc.gov/2017034308
ISBN – 2 Volume Set 978-1-108-29078-4 Hardback
ISBN – Volume 1 978-1-107-19704-6 Hardback
ISBN – Volume 2 978-1-107-19712-1 Hardback
Cambridge University Press has no responsibility for the persistence or accuracy of
URLs for external or third-party internet websites referred to in this publication
and does not guarantee that any content on such websites is, or will remain,
accurate or appropriate.
Dedicated to Jackie, Jude and Beck
RH is a precise statement, and in one sense what it means is clear, but what
it is connected with, what it implies, where it comes from, can be very un-
obvious.
Martin Huxley
Contents for Volume One
1 Introduction 1
1.1 Chapter Summary 1
1.2 Early History 1
1.3 Volume One Summary 8
1.4 Notation 12
1.5 Background Reading 13
1.6 Unsolved Problems 14
3 Estimates 40
3.1 Introduction 40
3.2 Constructing Tables of Bounds for ψ(x) 41
3.3 Exact Verification Using Computation 51
3.4 Estimates for θ(x) 54
3.5 More Estimates 65
3.6 Unsolved Problems 67
vii
viii Contents for Volume One
4 Classical Equivalences 68
4.1 Introduction 68
4.2 The Prime Number Theorem and Its RH Equivalences 69
4.3 Oscillation Theorems 81
4.4 Errors in Arithmetic Sums 88
4.5 Unsolved Problems 93
8.4 Integers Not Divisible by the Seventh Power of Any Prime 211
8.5 Integers Not Divisible by the 11th Power of Any Prime 214
8.6 Unsolved Problems 217
References 313
Index 321
Contents for Volume Two
1 Introduction 1
1.1 Why This Study? 1
1.2 Summary of Volume Two 2
1.3 How to Read This Book 7
2 Series Equivalents 8
2.1 Introduction 8
2.2 The Riesz Function 10
2.3 Additional Properties of the Riesz Function 14
2.4 The Series of Hardy and Littlewood 15
2.5 A General Theorem for a Class of Entire Functions 16
2.6 Further Work 22
x
Contents for Volume Two xi
6 Orthogonal Polynomials 93
6.1 Introduction 93
6.2 Definitions 94
6.3 Orthogonal Polynomial Properties 96
6.4 Moments 99
6.5 Quasi-Analytic Functions 104
6.6 Carleman’s Inequality 106
6.7 Riemann Zeta Function Application 113
6.8 Recent Work 116
14 Epilogue 359
References 473
Index 485
Illustrations
xiv
D
List of Illustrations xv
xvi
Preface
xvii
xviii Preface
of zeros that are on the critical line s = 12 [40], and increasing the size of the
region in the complex plane where ζ(s) can be proved to be non-zero [63].
This volume includes a detailed account of some recent work that
takes a different approach. It is based on inequalities involving some
simple accessible arithmetic functions. Broadly outlined RH implies that
an inequality is true for all integers, or all integers sufficiently large, or all
integers of a particular type. If RH fails, there is an integer in the given
range, or of the given type, for which the inequality fails, so the truth of
the inequality is equivalent to the truth of RH. Progress under this approach
is made whenever the nature of any counterexample is shown to be more
restricted than previously demonstrated. The reader may also wish to consult
the introductions to Chapters 3 to 7 for further details.
The relatively recent work depends critically on a range of explicit
estimates for arithmetic functions. These have been derived using greater
computing power than was available in the 1940s and 1960s when these
sorts of estimates were first published. In many cases the details are included,
along with simplified presentations.
Also included are a range of other equivalences to RH, some by now
classical. The more recent work depends on these classical equivalences for
both the results and techniques, so it is useful to set both out explicitly. It also
shows how some equivalences are more fundamental than others. This is not
to suggest new equivalences are easy consequences of older established ones,
even though this may be true in some cases.
The aim of these volumes is to give graduate students and number theory
researchers easy access to these methods and results in order that they might
build on them. To this end, complete proofs have been included wherever
possible, so readers might judge for themselves their depth and crucial steps.
To provide context, a range of additional equivalences has been included in
this volume, some of which are arithmetical and some more analytic. An
intuitive background for some of the functions employed is also included
in the form of graphical representations. Numerical calculations have been
reworked, and values different from those found in the literature have often
been arrived at.
To aid the reader, definitions are often repeated and major steps in proofs
are numbered to give a clear indication of the main parts and allow for easy
proof internal referencing. When possible, errors in the literature have been
corrected. Where a proof has not been verified, either because this author
was not able to fill gaps in the argument, or because it was incorrect, it
has not been included. There is a website for errata and corrigenda, and
readers are encouraged to communicate with the author in this regard at
[email protected]. The website is linked to the author’s homepage:
www.math.waikato.ac.nz/∼kab.
Preface xix
A note concerning the cover figure. This represents integral paths for the
flow ṡ = ζ(s) in a small region of the upper complex plane, rotated and
reflected in σ = 0. It was produced using an interactive program written by
the author and Francis Kuo in Java . It includes three critical zeros and three
trivial zeros.
Many people have assisted with the development and production of
these volumes. Without their help and support, the work would not have
been possible, and certainly not completed in a reasonable period of time.
They include Sir Michael Berry, Enrico Bombieri, Jude Broughan, George
Csordas, Daniel Delbourgo, Tomás Garcia Ferrari, Pat Gallagher, Adolf
Hildebrand, Geoff Holmes, Stephen Joe, Jeff Lagarias, Wayne Smith, Tim
Trudgian, John Turner and Michael Wilson. The support of the University of
Waikato and especially its Faculty of Computing and Mathematical Sciences
and Department of Mathematics and Statistics has been absolutely essential.
Cambridge University Press has also provided much encouragement and
support, especially Roger Astley and Clare Dennison. Last, but not least, I
am grateful for my family’s belief in me and support of my work.
Kevin Broughan
December 2016
Acknowledgements
The author gratefully acknowledges the following sources and/or permissions for the non-
exclusive use of copyrighted material.
Euclid: Figure 1.1, being an excerpt from Raphael’s fresco “School of Athens”, Visions of
America/Superstock/Getty Images.
L. Euler: Figure 1.2, Apic/Hulton Archive/Getty Images.
C. F. Gauss: Figure 1.3, Bettmann/Getty Images.
J. L. Dirichlet: Figure 1.4, Stringer/Hulton Archive/Getty Images.
B. Riemann: Figure 1.5, Author: Konrad Jacobs. Source: Archives of the Mathematisches
Forschungsinstitut Oberwolfach.
D. Hilbert: Figure 1.6, Ullstein bild/Getty Images.
Zeta flow: Figure 2.2, being figure 2, p. 990, of K. A. Broughan and A. R. Barnett, The
holomorphic flow of the Riemann zeta function, Mathematics of Computation 73 (2004),
287–1004. Permission of the American Mathematical Society.
Xi flow: Figure 2.3, being figure 1, p. 1274, of K. A. Broughan, The holomorphic flow of
Riemann’s function ξ(s), Nonlinearity 18 (2005), 1289–1294. Copyright c IOP Publishing
and London Mathematical Society. Permission of the American Mathematical Society. All
rights reserved.
J. E. Littlewood: Figure 4.1, used by permission of the Master and Fellows of Trinity College
Cambridge.
J.-L. Nicolas: Figure 5.1, used by permission of J.-L. Nicolas.
P. Erdős: Figure 6.1, Author: Kay Piene. Source: Ragni Piene and Archives of the
Mathematisches Forschungsinstitut Oberwolfach.
T. S. Trudgian: Figure 7.2, used by permission of T. S. Trudgian.
J. C. Lagarias: Figure 7.7, used by permission of J. C. Lagarias.
Y. Choie: Figure 8.1, used by permission of Y. Choie.
Section 8.5 was derived from a previously published article: K. A. Broughan and T. Trudgian,
Robin’s inequality for 11-free integers, Integers 15 (2015), A12 (5pp.).
xxi
1
Introduction
1
2 Introduction
1 1 1 π2
ζ(2) = 1 + 2
+ 2 + 2 + ··· = ,
2 3 4 6
1 1 1 π4
ζ(4) = 1 + 4 + 4 + 4 + · · · = ,
2 3 4 90
1 1 1
ζ(s) = 1 + s + s + s + · · · ,
2 3 4
1
−1
ζ(s) = 1− s .
p
p
For Euler, the variable s was an integer, and the product over all primes. He
tried in vain to find a closed expression for ζ(3) having the same style as that
for ζ(2) and ζ(4).
1.2 Early History 3
The next main player was Gauss (1777–1855; Figure 1.3), although
he does not appear to have used the zeta function. He was the giant of
nineteenth-century mathematics and physics. For example, he used calculus
to compute correctly the position of Ceres after it had passed behind the
Sun. Although he spent most of his life in his observatory in Göttingen in
Germany, he contributed to statistics, non-Euclidean geometry, curvature,
geodesy, electromagnetism and complex numbers, as well as to number
theory, for which he had an abiding passion.
Gauss was given a book of log tables, at around the age of 14, that included
a table of primes. He extended the table, counting the number of primes up to
a real positive variable x, now called π(x). He considered the average number
of primes in each interval of numbers [1, 2, 3, . . . , N], and in this way arrived
at his prime number conjecture, an inspired guess that was proved about 100
years later:
x π(x)
π(x) ∼ which means → 1, x → ∞.
log x x/log x
It was this conjecture that inspired many mathematicians during the
nineteenth century.
Dirichlet (1805–1859; Figure 1.4) was a fine teacher and had a great
influence on Bernhard Riemann, who attended his number theory lectures
in Berlin during 1847–1849. According to Jacobi, Dirichlet was not only
creative, but knew how to make a robust proof: “Only Dirichlet, not I, nor
4 Introduction
Cauchy, not Gauss, knows what a perfectly rigorous proof is, but we learn it
only from him. When Gauss says he has proved something it is very likely.
When Cauchy says it, it’s a fifty-fifty bet. When Dirichlet says it, it is certain.”
Dirichlet showed that there were an infinite number of primes in arithmetic
progressions whenever the step and initial value are coprime. He used groups,
characters and zeta functions. He was a wonderfully inspiring lecturer, and
joined Riemann in Göttingen in 1855.
According to Felix Klein: “Riemann was bound to Dirichlet by the
strong inner sympathy of a like mode of thought. Dirichlet loved to
make things clear to himself with an intuitive underpinning. Along with
this he would give acute, logical analyses of foundational questions
and avoid long computations as much as possible.” His manner suited
Riemann, who adopted it and worked in many ways according to Dirichlet’s
methods.
So we come to the central character in this account, Bernhard Riemann
(1826–1866; Figure 1.5). Riemann was impoverished for most of his life. He
was shy and withdrawn, sickly and a hypochondriac. Other than the two years
in Berlin, he spent most of his working life in the still very beautiful walled
German town of Göttingen, but he changed mathematics forever. As well as
numbers, formulae and concepts, the idea of mathematical spaces and of the
σ(n)
< eγ loglog n.
n
The nice surprise is that this inequality, with an explicit lower bound for n
replacing “sufficiently large”, and other related arithmetic inequalities and
equalities, is equivalent to the Riemann hypothesis. The original inequality
involving σ(n) is the most famous, and the equivalence is known as “Robin’s
theorem”. A detailed history of its evolution is given in [128] with quotes
from Erdős, Hardy, Rankin, Berndt and Nicolas.
8 Introduction
1
β < 1− .
R log γ
It is expected that the value of R will decrease in time. We give the derivation
of Landau, and quote more recent improvements.
Chapter 3 is devoted to numerical estimates, especially of the arithmetical
functions θ(x) and ψ(x). Note that there are additional estimates of products
of functions of primes in Section 5.2. Estimates in this chapter are primarily
derived without using RH. The material depends not only on the work of
Rosser and Schoenfeld in their separate and joint papers of 1941, 1962, 1963
and 1975, but also on von Mangold’s theorems of 1905.
Chapter 4 gives derivations of some well-known explicit classical equiv-
alences to RH. It may be skipped by anyone familiar with introductory
material. The proof of an important theorem of Landau, used in many cases in
the text where RH is assumed to be false, is included. The chapter concludes
with overview material on zero-free regions and a summary of heights up
to which the hypothesis has been shown to hold, with the corresponding
numbers of critical zeros.
The work of Rosser and Schoenfeld is remarkable in that it contains sharp
explicit results produced before computer-based methods became ubiquitous.
In this chapter we take some advantage of the much greater processing
speed now available to extend the ranges for which computer verification
of inequalities is practicable, and the greatly improved height below which
all the zeros of ζ(s), with positive imaginary part, have real part σ = 12 .
Improving these estimates is the subject of current research, since they have
a strong influence on the equivalences to RH.
Moving on to Chapter 5, Nicolas proved in 1983 that RH is true if and only
if for every non-prime primorial q we have
q
> eγ .
ϕ(q) loglog(q)
Here a primorial is the product of all primes up to a given prime, so the kth
primorial, say q, is q = Nk := p1 · · · pk , where pi is the ith prime starting with
p1 = 2.
Nicolas improved this result in 2012, and found four statements equivalent
to the Riemann hypothesis. Let Nk = 2 · 3 · · · pk be the kth primorial and let
n γ
c(n) := − e loglog(n) log n
ϕ(n)
10 Introduction
and define
1
β := = 2 + γ − log π − 2 log 2 = 0.046191 . . . ,
ρ
ρ(1 − ρ)
where ρ ranges through the non-trivial zeros of ζ(s) with increasing absolute
value of the imaginary part. Then RH is equivalent to each of the following:
(1) lim supn→∞ c(n) = eγ (2 + β) = 3.644415 . . . .
(2) For all n ≥ N120569 = 2 · 3 · · · 1591 883 we have c(n) < eγ (2 + β).
(3) For all n ≥ 2, c(n) ≤ c(N66 ) = c(2 · 3 · · · 317) = 4.0628356921.
(4) For all k ≥ 1 we have c(Nk ) ≥ c(N1 ) = c(2) = 2.208589 . . . .
From this one shows that the Riemann hypothesis is equivalent to the
following inequality holding for all n ≥ N120569 :
n eγ (2 + β)
< eγ loglog n + .
ϕ(n) log n
The chapter includes the derivation of useful estimates for treating the Euler
phi function, namely some products and sums of functions of primes. These
depend on the relationship between primes and zeta zeros, and form an
essential basis, along with other estimates, for developing the equivalences.
In Chapter 6, fundamental results to do with two types of number with
many divisors are developed. These are called superabundant and colossally
abundant. The work includes that of Alaoglu, Erdős and Nicolas. These
numbers appear as counterexamples to RH, so are used in the chapters that
follow. We include a corrected proof of a fundamental theorem of Alaoglu
and Erdős.
Chapter 7 is also a fundamental chapter. Grönwall proved in 1913 that
σ(n) σ(n)
G(n) := =⇒ lim sup = eγ = 1.78107 . . . .
n loglog(n) n→∞ n loglog n
Ramanujan showed, probably in 1915, that if RH is true then for n sufficiently
large we must have G(n) < eγ . Then Robin showed in 1983 that RH is true if
and only if n > 5040 implies G(n) < eγ . Also included is the equivalence of
Lagarias, which is dependent on the result of Robin, namely if Hn = 1 + 1/2 +
· · · + 1/n is the n harmonic number, then RH is equivalent to the inequality
σ(n) ≤ Hn + exp(Hn ) log(Hn ), n ≥ 1.
In Chapter 8, properties of numbers that do not satisfy Robin’s inequality
are explored. The work of Choie, Lichiardopol, Moree and Solé is developed
and extended. In particular it is shown that any integer greater than 5040
not satisfying Robin’s inequality must be even, fail to be squarefree and
not squarefull. The smallest such number must be a so-called Hardy–
Ramanujan number and superabundant. Then it is shown, successively, that
1.3 Volume One Summary 11
any counterexample must be divisible by the fifth power of at least one prime,
the seventh power and finally the 11th power of at least one prime.
We now move on to Chapter 9. Define a composite integer n > 1 to be left
abundant or GA1 if it satisfies G(n/p) ≤ G(n) for all primes p | n. Call it right
abundant or GA2 if G(n) ≥ G(an) for all natural numbers a > 1. Then in 2011
Caveney, Nicolas and Sondow showed that the Riemann hypothesis is true
if and only if 4 is the only number that is both left and right abundant. They
called such numbers extraordinary. They also showed that a right abundant
number n > 5040 exists if and only if the Riemann hypothesis is false, and,
if so, such an integer n is even with n > 108576 . The chapter concludes with
the equivalence of Nazardonyavi and Yakubovich based on “champions” for
G(n), which they call extremely abundant.
Although Chapter 9 represents the final section of the main body of this
text, a range of other equivalences that have caught the attention of the author,
and are not in Volume Two [32], have been included in Chapter 10. Proofs
are given, as in the earlier chapters, to show depth and illustrate technique:
Farey fractions, the Redheffer matrix, the divisibility graph, the Dirichlet eta
function, the derivative of the Riemann zeta function, a simple zeta-related
inequality, and an inequality for the real part of the logarithmic derivative
of the Riemann xi function, ξ(s). The chosen set of equivalences is far from
comprehensive. For detailed descriptions of these equivalences, the reader is
invited to consult Chapter 10. An improvement of a result of Lagarias, which
gives, for every t ≥ 168π, the imaginary part of a zeta zero γ with |t − γ| ≤ 1.5,
is included.
In the penultimate section of Chapter 10, an equivalence regarding the
maximum multiplicative order, ord n, of any element of the symmetric group
S n is outlined. Not all of the details are included – these are spread over
many papers. The development is based on the work of Landau and Shah,
and includes the theorem of Massias, Nicolas and Robin, which says that RH
is equivalent to the inequality
log ord(n) < li−1 (n), n sufficiently large,
where li(x) is the logarithmic integral. A new “straight line” proof of the
theorem of Shah which is fundamental to the work is included, as is an
introduction to the related work of Erdős and Turan on the statistics of S n .
In the final section of Chapter 10, a simple equivalence, but very influential
conjecture, going under the name of the “Hilbert–Pólya conjecture”, is
introduced.
In Figure 1.7 we give some of the dependences between the chapters in
this volume. An arrow from Chapter A to Chapter B means some results in
B depend on material in A. Not all dependences have been included, but the
network flow diagram provides a general overview.
12 Introduction
Ch 9:Extreme
Ch 5:Totient Ch 10:Other
Ch 7:Robin Ch 4:Classical
Ch 6:Abundant Ch 2:Zeta
Ch 3:Estimates
Ch 8:NotRobin
1.4 Notation
We use the following notation, sometimes without (further) explanation:
A note on the use of symbols: Many authors have built explicit rational
constants into their explicit results. We have attempted to avoid this practice,
since an improvement in a constant will invalidate a result. Constants are
called a1 , a2 , . . . , c1 , c2 , . . ., etc. and given explicit definitions. There is
also a problem in this part of the literature with overloaded symbols. Two
examples are the function names φ(·) and K(·), both of which have a variety
of meanings, sometimes in the same context. We have attempted to avoid
this practice.
The symbols p and q invariably represent rational prime numbers. The
Landau–Vinogradov symbols O, o, ,
and , with their usual meanings,
are also used. Additional notation is introduced as needed.
In addition, e, π and i have their usual meanings, γ is Euler’s constant, B is
Mertens’ constant, β is 2 + γ − log π − 2 log 2 and H is a known height up to
which all complex zeros of ζ(s) lie on the critical line.
Sautoy [151]. Books by Edwards [51], Patterson [131] and Borwein et al.
[13] give accessible mathematical introductions, whilst Titchmarsh [167],
Ivić [82] and Karatsuba and Voronin [88] are more advanced monographs.
2.1 Introduction
The purpose of this chapter is to give some insight into the nature of
the Riemann zeta function, and describe some of its properties, which are
essential for the development of arithmetic equivalences to RH.
Following this introductory section, the second section of this chapter
discusses some of the main properties of the Riemann zeta function, such as
how it is defined, the functional equation and the function’s zeros, which are
all important since they are the subject of the Riemann hypothesis. An idea
of how ζ(s) behaves is given with a presentation of plots of its modulus and
argument in small parts of the complex plane, including some of the complex
zeros. There is no attempt to be comprehensive. It is interesting to note that
few of the properties of ζ(s) are used in deriving equivalent formulations to
the hypothesis.
In Section 2.2, statements of the formulae of von Mangoldt are included,
which give explicit representations of errors in a form of the prime number
theorem in terms of sums of quantities dependent on the zeros of ζ(s). This
material is presented with references but without proofs, since there are a
number of excellent texts that give comprehensive expositions.
In Section 2.3 information is given on heights up to which all zeros of
ζ(s) have been proved, numerically, to be on the critical line. The zero-free
region of Landau is derived in Section 2.4. The zeta function properties and
zero-free regions are used in this volume in essential ways.
There are quite a few very important results in Section 2.6, which has
complete proofs. These include the product formula for ξ(s) given by
s
ξ(s) = ξ(0) 1− ,
ρ
ρ
and the explicit expression for the convergent sum
1 γ 1
= 1 + − log(4π).
ρ
ρ 2 2
15
16 The Riemann Zeta Function
Both the product and sum are over the non-trivial zeros of ζ(s). When each ρ
is associated with the zero 1 − ρ, the product and sum converge absolutely.
The following RHpack functions (see Appendix B) relate to the material
in this chapter: Psi0, PlotVonMangoldtPsi, BacklundB, ZetaZeroHeight,
LargestZetaZero and NextZetaZero.
(7) There are an infinite number of simple zeros of ζ(s) on the so-called
“critical line”, σ = 12 [76]. The Riemann hypothesis is the conjecture that all
of the non-trivial zeros of ζ(s) lie on this line. This is a long story, beginning
with Hardy and Littlewood [74]. Conrey [40] has shown that more than two-
fifths of the critical zeros are on this line.
To give the reader some intuitive understanding of how ζ(s) behaves
for small values of |s|, three graphical descriptions are given. The first is
Figure 2.1, which includes some of the contours for |ζ(s)|. We see the first
three zeros on the critical line s = 12 surrounded by closed curves.
The graphic of Figure 2.2 gives the direction modulo π of the complex
number ζ(s) at each s in the domain. It is a phase portrait of the flow ṡ = ζ(s).
The first three real zeros at s = −2, −4 and −6, where the flow is alternately a
source or a sink, and the first three complex zeros, where the flow is a source
focus, are shown in the figure.
The passage of ζ(s) → 1 as s → ∞ is also clear. For further information
see the papers [17, 19, 20, 23, 25, 26].
The graphic of Figure 2.3 is a depiction of part of the flow ṡ = ξ(s). It shows
the symmetry of the flow stemming from the functional equation, the first
three complex zeros, each of which is a centre for the flow, and the trending
of the separatrices to the positive and negative x-axis. For further information
see the papers [21, 22, 26].
30 0.75
2
2
1
25 0.1 0.25 0.5
0.75
1
1
0.5
0.1 0.25 0.75
20 1
1
15 0.75
0.1 0.25 0.5
0.5
0.75
1
10
0.0 0.2 0.4 0.6 0.8 1.0
25
20
15
10
0
−10 −5 0 5 10
(8) Bounds on the modulus of zeta, or its multiplicative inverse, are crucial
in estimating integrals using Cauchy’s theorem. One we use is the following:
assuming the Riemann hypothesis, for every > 0 and δ > 0 there is a T 0 > 0
such that for a positive constant K, dependent on the chosen constants, such
that for every s = σ + it with σ ≥ 12 + and t ≥ T 0 we have the bound |1/ζ(s)| ≤
Ktδ . Detailed derivations of bounds of this type are given in Volume Two
[32, Appendix I].
(9) We also need bounds on zeta and its derivative that apply on vertical
strips a ≤ σ ≤ b where a and b are any given real numbers. The bound is
unconditional and the implied constant and power A depend on a and b. There
is a constant A > 0 such that for every s = σ + it with a ≤ σ ≤ b and |t| ≥ δ > 0,
we have |ζ(s)|
t A and |ζ (s)|
t A [167].
2.2 Basic Properties 19
40
35
30
25
20
15
10
–15 –10 –5 0 5 10 15 20
At times, more precise bounds for the modulus of ζ(s) on the critical line
are needed. These have been the subject of close study over a considerable
period. For example, there is the inequality of Cheng and Graham [37]:
|ζ( 12 + it)| ≤ min(6t1/4 + 57, 3t1/6 log t), t ≥ 3. (2.1)
We make particular use of the inequality of Platt and Trudgian [134], which
is better than (2.1):
|ζ( 12 + it)| ≤ 0.732t1/6 log t, t ≥ 2. (2.2)
Some use is also made of a bound of Richert [139] as made completely
explicit by Ford [60], and which gives a bound for the entire right half of the
critical strip. It is most useful for σ near 1:
3/2
|ζ(σ + it)| ≤ 76.2|t|4.45(1−σ) log2/3 |t|, |t| ≥ 3, 1
2
≤σ<1 (2.3)
(see also Cheng [36]).
We also need to use explicit zero-free regions for ζ(s), beyond heights
at which all zeros are known to be on the critical line. These are studied in
some detail in Section 2.3. We use the result of Korobov [96] and Vinogradov
20 The Riemann Zeta Function
Table 2.1 Some improvements for Backlund’s zero counting estimate.
Year Author a1 a2 a3 T0
In spite of their central importance, I do not give their derivation as this is set
out fully in [51, sections 3.2 and 3.5]. Let ρ range over the complex zeros of
ζ(s) with increasing absolute value of the imaginary part, and let p represent
a rational prime and n a natural number. Then we have:
Theorem 2.1 (von Mangoldt) For x > 1 which is not a power of a prime
number we have
xρ ∞ xρ 1
1 ζ (0) 1
ψ(x) = x − + − = x− − log 1 − 2 − log 2π.
ρ
ρ n=1 2nx2n ζ(0) ρ
ρ 2 x
Because the final three items in Table 2.2 are “unpublished” at the time of
writing, we have used the 1986 value of van de Lune, te Riele and Winter
[111] in our own derivations in Chapter 3. However Platt’s value of H was
used, as reported in [133], to show that
which is his theorem 7.1. This result corresponds to that of Büthe, Franke,
Jost and Kleinjung of 2010 in [33], which was derived using RH. Thus there
is considerable confidence that Platt’s value of H is correct.
The most influential method to date for verifying RH, up to some height
H, has been Turing’s method, which was published in 1953, a year before
his most unfortunate death [174, 175]. Almost all methods following Turing
share important components with it. There are details in [51, section 8.2].
For an introduction see Booker’s article [12] or that by Hejhal and Odlyzko
[78]; see also Trudgian [172]. Here we give a brief outline, with many details
skipped.
We saw in Section 2.2 (11) that if we define E(t) := N(t) − F(t) we can
write |E(t)| ≤ B(t). Now −F(t) is a smooth decreasing function and N(t) a step
2.3 Zero-Free Regions 23
function that increases by 1 at each simple critical line zero, and by at least 2
if t is the ordinate of a pair of zeros off the critical line. We will see below that
detecting (and hence counting) zeros on the critical line is straightforward.
We have more analytic information on S (t), a good approximation to E(t),
defined below, which is contradicted if a pair of off-critical-line zeros occurs.
The great strength of Turing’s method is that it enables RH to be checked
by considering only the values of ζ(s), and related functions, with s on the
critical line.
Some further details: let γ(s) := π−s/2 Γ(s/2) and Λ(s) := γ(s)ζ(s). Then the
functional equation can be written Λ(s) = Λ(1 − s), and this can be used to
show that Λ( 12 + it) is real for all real t. The zeros of Λ( 12 + it) are precisely the
zeros of ζ( 12 + it). Now let ϑ(t) be the argument of γ( 12 + it), with ϑ(0) := 0, or
in other words
γ( 12 + it) = |γ( 12 + it)|eiϑ(t) .
Then it can be shown that
ϑ(t) 1
+ 1 = F(t) + O .
π t
The real-valued function of a real variable S (T ) is the argument of ζ(s) on
the critical line at the point 12 + iT which is not the ordinate of a zero. The
argument is fixed by taking the value found by continuous variation from 0
along the horizontal line from (∞, iT ) to ( 12 , iT ). At a zero the value is defined
to be limt→0+ S (t). See Figure 10.4 to get a feeling for how S (T ) behaves. It
is well known that we can write
T 7
T 1
N(T ) = log + + S (T ) + O .
2π 2πe 8 T
See for example Edwards [51, chapters 8 and 9]. It follows that
ϑ(t) 1
S (t) = N(t) − +1+O .
π t
Now |Λ(s)| decays exponentially on the critical line, so, for detecting zeros,
it is easier to work with the function
Λ( 1 + it)
Z(t) := 12 ,
|γ( 2 + it)|
which also has the same zeros for real t as ζ( 12 + it). Each simple zero on the
critical line for ζ(s) is accompanied by a sign change in Z(t). This function
also has the advantage in computations of requiring fewer terms in a summa-
tion than competing methods, via the famous Riemann–Siegel formula:
√
t/2π
cos(ϑ(t) − t log n) 1
Z(t) = 2 √ + O 1/4 .
n=1
n t
24 The Riemann Zeta Function
Turing developed an explicit form for this, which was recently improved by
Trudgian. For h > 0 and T > 168π we have
T +h
S (t) dt ≤ 2.067 + 0.059 log(T + h). (2.5)
T
–S(t)
1.0
0.5
t
1010 1020 1030 1040
–0.5
–1.0
t
980 1000 1020 1040 1060
–1
Figure 2.5 The function T (t) on [950, 1060] with a phantom zero at t = 1010.
Lemma 2.3 [105, section 79] If 1 < σ ≤ 2 and t ≥ 2 there is a real constant
b1 such that
ζ (σ + it) log p cos(mt log p)
− =
ζ(σ + it) p, m ≥ 1 pmσ
1 σ−β β
< log t + b1 − + ,
2 ρ
(σ − β)2 + (t − γ)2 β2 + γ2
Proof From Lemma 2.3, if we apply the stricter bounds 1 < σ ≤ 2 and t ≥ 2
we have
log p cos(mt log p) 1 σ−β β
< log t + b 1− + ,
p,m
pmσ 2 ρ
(σ − β)2 + (t − γ)2 β2 + γ2
and γ ≥ 21. This form for the zero-free region, together with a better
approximation for the integrals
∞ t
φm (t) log dt,
h 2π
than that which has been used here in Chapter 3, gives, on the face of it,
better results than those we computed and use when calculating estimates.
However, since we do not need sharper estimates than those obtained by
using larger values of H, and in the interests of simplicity of exposition, we
have not used R1 .
Finally, note some more recent results: Kadiri [84] showed that we could
choose R = 5.69693. He used Weil’s explicit formula with the method and
one of the test functions of Heath-Brown. For background definitions, proofs
and applications of the explicit formula, see Volume Two [32, Chapter 9].
The value was slightly improved by Jang and Kwon, who used an alternative
test function in the explicit formula and Platt’s height H = 3.0610046 × 1010
of 2011 to derive R = 5.68371; see [83]. This has been further improved
to R = 5.573412 by Mossinghoff and Trudgian [121] by finding a well-
adapted trigonometric polynomial (of degree 16) using simulated annealing,
and carefully analysing the error term of Kadiri. They used Platt’s value of H.
Ford used the value for H established by van de Lune, te Riele and Winter
[111], and Kadiri used the 2004 height of Wedeniwski. We have chosen to
use a value R = 8.0 in the computations reported in Chapter 3.
The details of these more recent results are outside the scope of this book.
The interested reader could consult the fundamental papers [77, 179], or the
references given above.
This is given in Theorem 2.11. First we have two lemmas, which take the
form of evaluations, and then a call-out to Hadamard’s product expansion,
which is proved in Volume Two [32, Appendix B].
Lemma 2.5 We have the special values:
Γ (1)
(1) = −γ,
Γ(1)
Γ (3/2)
(2) = 2 − γ − log 4,
Γ(3/2)
2.6 The Product Over Zeta Zeros 31
ζ (0)
(3) = log(2π).
ζ(0)
Proof (1) By Lemma H.1 of Volume Two [32], for all z which is not a
negative integer, we have the following representation for the logarithmic
derivative of Γ(z):
∞
Γ (z) 1 1 1
= −γ − + − ,
Γ(z) z n=1 n n + z
and
∞
dx 1
= .
N x s (s − 1)N s−1
Hence
∞ ∞ ∞
1 dx 1 {x}
s
− s
= s
− s s+1
dx.
n=N
n N x N N x
Recall the definition: the order of an entire function f (z) is the infimum of
real values b such that as z → ∞, f (z)
exp(|z|b ). Next we give Hadamard’s
theorem statement.
Theorem 2.7 (Hadamard factorization) Let f (z) be an entire function of
finite order ρ, having f (0) 0 and zeros z1 , z2 , . . . , with multiple zeros
appearing multiple times. Then f (z) has the absolutely and uniformly
converging on compact subsets product representation
∞
z Q(z/zn )
f (z) = e P(z)
1− e , (2.6)
n=1
zn
where P(z) is a polynomial of degree less than or equal to ρ, and Q(z) is a
polynomial of the form
z2 zm
Q(z) = z + + ··· + ,
2 m
where m ≥ 0 is the smallest integer such that
∞
1
< ∞.
|z
n=1 n
|m+1
Recall the definition:
ξ(s) := 12 s(s − 1)π−s/2 Γ(s/2)ζ(s).
Lemma 2.8 For all s ∈ C we have constants A and B such that
s s/ρ
ξ(s) = e A+Bs
1− e ,
ρ
ρ
where, if we combine the factors associated with (1 − s/ρ) and (1 − s/(1 − ρ)),
the product converges absolutely and uniformly on compact subsets of C.
Proof (1) It follows from properties of the zeros and poles of ζ(s) and Γ(s)
that ξ(s) is an entire function. We first show that ξ(s) has order at most 1. By
the functional equation we need only consider σ ≥ 12 . In this half plane, for
|s| sufficiently large and some constant c1 > 0 we have
| 21 s(s − 1)π−s/2 | ≤ exp(c1 |s|).
By Stirling’s approximation (see (H.5) in Theorem H.5 in Volume Two [32]),
there is a constant c2 > 0 such that, for s → ∞ in σ ≥ 12 ,
Γ(s/2) ≤ exp(c2 |s| log |s|).
Then using Abel’s theorem [6, theorem 4.2] for σ > 1, and extending the
domain of validity of the expression by analytic continuation to σ > 0,
∞
s {x}
ζ(s) = −s dx,
s−1 1 x s+1
34 The Riemann Zeta Function
To verify the latter statement, by Theorem 10.32 proved below, for each n ∈ N
there is a zeta zero γ jn such that |4n − γ jn | ≤ 1.5. Therefore if n n , we have
γ jn γ jn and we can write
1 1 1 ∞
1
∞
1
≥2 ≥ ≥ ≥ = ∞.
ρ
|ρ| γ>0 1/4 + γ2 γ>0 |γ| n=1 |γ jn | n=1 |4n − 2|
(3) It follows from Steps (1) and (2) and Hadamard’s theorem that there
are constants A, B such that for s ∈ C
s s/ρ
ξ(s) = e A+Bs
1− e . (2.7)
ρ
ρ
Therefore n(R) ≤ log M(2R)/log 2. Since we have M(R) ≤ ec1 R log R so n(R) ≤
c3 R log R, and therefore n(R) = O(R log R). This completes the proof.
We also need, frequently in the developments that follow in later chapters,
several expressions and estimates for sums over critical zeta zeros. Statement
(a) in Lemma 2.10 was, apparently, known to Riemann. We saw in the proof
of Lemma 2.8 that the convergence was conditional. In the second sum, as
usual 1/ρ is associated with 1/(1 − ρ), in which form the series is absolutely
convergent.
Lemma 2.10 We have
1 γ 1
(a) = 1 + − log 4π,
ρ
ρ 2 2
1
(b) = 2 + γ − log 4π,
ρ
ρ(1 − ρ)
1
(c) ≤ 0.046191441,
ρ
|ρ| 2
where the sums are over all of the non-trivial zeros of ζ(s).
Proof By the definition of ξ(s) and Lemma 2.8, for some real U, V we can
write
1 s
s s/ρ
−s/2
ξ(s) = s(s − 1)π Γ ζ(s) = eU−V s
1 − e = ξ(1 − s), (2.8)
2 2 ρ
ρ
Because ζ(ρ) = 0 if and only if ζ(1 −ρ) = 0, there are matching terms in these
sums that cancel, leading to 2V = 2 ρ 1/ρ and therefore
1 γ 1
V= = 1 + − log(4π). (2.10)
ρ
ρ 2 2
This completes the proof of part (a). Part (b) is immediate since
1 1 1
1
= + =2 .
ρ
ρ(1 − ρ) ρ
ρ 1−ρ ρ
ρ
Now choose a height H up to which all of the zeta zeros are on the critical
line and define
1 1
SH = and S ∞ = .
|γ|≤H
1/4 + γ2 |γ|>H
β2 + γ 2
We choose in this derivation H = 600 269.6771, giving 106 zeros with positive
imaginary part. Then
1
≤ S H + S ∞.
ρ
|ρ|2
and therefore
1 β 1 − β
S∞ < = + 2
|γ|>H
γ2 |γ|>H γ2 γ
1 β 1−β
< 1+ 2 + .
H |γ|>H β2 + γ2 (1 − β)2 + γ2
But
1
1 2β 1
2V = + = =⇒ S ∞ < 1 + 2 (2V − S H ),
ρ
ρ ρ̄ ρ
|ρ|2 H
and therefore
1
1
< S H + 1 + 2 (2V − S H ) < 0.046191441,
ρ
|ρ|2 H
where if we combine the factors (1 − s/ρ) and (1 − s/(1 − ρ)) the product
converges absolutely and uniformly on compact subsets of C. In addition
ξ(0) = 12 .
Proof (1) By Lemma 2.8 there are constants A, B such that for s ∈ C
s s/ρ
ξ(s) = e A+Bs
1− e . (2.11)
ρ
ρ
Use the definition of ξ and take the logarithmic derivative of both sides of
s s/ρ 1
e A+Bs
1 − e = s(s − 1)π−s/2 Γ(s/2)ζ(s)
ρ
ρ 2
to get
ζ (s) 1 1 1 Γ (s/2 + 1) 1 1
= B + log π − − + + .
ζ(s) 2 s − 1 2 Γ(s/2 + 1) ρ∈S s − ρ ρ
(2) From the definition of ξ(s) and the functional equation ξ(s) = ξ(1 − s)
we can write
1 1
ξ(0) = ξ(1) = π−1/2 Γ(1/2) lim(s − 1)ζ(s) = .
2 s→1 2
Thus, by (2.11), we have eA = ξ(0) = 12 .
(3) Next we consider the convergence of the product. With ρ = β + iγ let
s
s(1 − s)
1− = 1− .
ρ
ρ γ>0
ρ(1 − ρ)
This converges absolutely provided γ>0 1/|ρ(1 − ρ)| < ∞. To see this,
complete the square
1 ∞
1 1 d(t log t)
= <
< ∞.
|ρ(1 − ρ)|
γ>0
(ρ − ) +
γ>0
1 2
2
1
4 γ>0
|ρ − |
1 2
2 1 t2
where the product is over the non-trivial zeros of ζ(s). If s is not a zeta zero
we can take the logarithmic derivative to get
ζ (s) 1 1 1 Γ (s/2 + 1) 1
− = − log π + + − .
ζ(s) 2 s − 1 2 Γ (s/2 + 1) ρ
s−ρ
Finally substitute for ζ (0)/ζ(0) from Lemma 2.5 (3) to complete the proof
of the lemma.
3.1 Introduction
This chapter includes the derivation of estimates that are used throughout the
volume. The aim here is not to be comprehensive and derive all estimates of a
given type, or to present the best that are, in some sense, currently available.
Almost all of the results that are needed for deriving equivalences included
in the volume are deduced from first principles, in a manner which could be
followed step by step, should the reader so wish.
The main focus is on numerical estimates, especially of the arithmetical
functions ψ(x) and θ(x). Note that there are additional estimates of products
of functions of primes in Section 5.2. In the main, the estimates in this
chapter are derived without using RH, and so are significantly weaker than
Theorems 4.5 or 4.6, which give good estimates for ψ(x) and θ(x), which
could be regarded as best possible. The material is based on the work of
Rosser and Schoenfeld, scattered through their separate or joint papers of
1941, 1962 and 1975, but also depends on von Mangoldt’s theorems of 1905.
The work of Rosser and Schoenfeld is remarkable in that they produced
sharp explicit results before computer-based methods became easy to use
and ubiquitous. In this chapter we take some advantage of the much greater
processing speed available in the early twenty-first century to extend the
ranges for which computer verification of inequalities is practicable. We also
are able to take advantage of the much greater height, denoted by H, below
which all the zeros of ζ(s), with positive imaginary part, have real part σ = 12 .
Results are expressed in terms of key parameters, regarded as constants, but
which are likely to change
in time with further
improvements.
Recall that θ(x) := n≤x log p and ψ(x) := pm ≤x log p. In Section 3.2 tables
of values of b and b > 0 are derived such that
x ≥ eb =⇒ |ψ(x) − x| < b x,
where e is the exponential number. This represents a recalculation of the table
developed by Rosser and Schoenfeld in 1962 [145, table I], with improved
40
3.2 Constructing Tables of Bounds for ψ(x) 41
parameters and some more minor improvements. The range is increased from
b = 5000 to b = 10 000, but the vital tool is an algorithm to compute b given
b, rather than the table.
In Section 3.3 some details are given of the simple strategy used to derive
exact results from inequalities using floating-point arithmetic. In Section 3.4
estimates are given for θ(x) of the form
x
|θ(x) − x| < ,
log x
with x in a range depending on , and also the useful bounds 0.8x ≤ θ(x) ≤
1.02x valid for x ≥ 121. The inequality θ(x) < 1.000081x, used by some
authors, was able to be proved, but only by assuming RH. This proof is
included in the final section, as are some useful bounds for ψ(x) − θ(x).
Many authors simply assume these estimates, especially those from [145].
However, the work presented here, especially that in Chapters 5 and 7, is
totally dependent on their validity, so sufficient detail of derivations is given
in this chapter for readers to reproduce results, or to set about improving
them.
The RHpack (see Appendix B) functions ComputeKm, PsiSmallXEp-
silon, PsiLargeXEpsilon, PsiLargeXEpsilon2 and VerifyThetaX relate to the
material in this chapter.
we have
h
∞ t 1 + m log
2π
φm (t) log dt <
h 2π (1 + m log(h/2π)) log x
2
mh xm 1/(R log h) 1−
log(h/2π)Rm2 log2 h
=: B1 (m, x, h).
If however we have 0 < 4 log x < Rm log2 h, again with h ≥ 2eπ, then the bound
takes a simpler form:
he
∞ t 2 log
2π
φm (t) log dt < .
h 2π mh x log h)
m 1/(R
∞
Proof First integrate I(h) := h φm (t) log(t/2π) dt by parts, noting that
t
log t 1 + m log
2π dt = − 2π ,
tm+1 m2 tm
to get
h
1 + m log ∞ 1 + m log t log x
2π 2π log t φ (t) dt.
I(h) = 2 m 1/(R log h) + t m
mh x h Rm2 log2 t log 2π
2π
Therefore
⎛ ⎞
h ⎜⎜⎜ h ⎟⎟⎟
1 + m log ⎜⎜⎜ 1 + m log log x ⎟⎟⎟
2π ⎜ 2π ⎟
I(h) < 2 m 1/(R log h) + ⎜⎜⎜⎜ ⎟⎟⎟⎟ I(h).
mh x ⎜⎜⎜ h ⎟⎟⎟
⎝ Rm2 log2 h log ⎠
2π
Solving for I(h) gives the stated upper bound. Note that the condition on log x
ensures the denominator of the upper bound is positive. The more stringent
condition ensures the bracketed term in the denominator is greater than 12
and the numerator less than m log(he/2π). This completes the proof of the
lemma.
which is false.
44 Estimates
(6) If 0 > h there is a z with 0 ≥ z ≥ mh such that η(x + z) ≥ fm (x, h, z); if not
we would have
0 0
(−1) Em (x, h) =
m
··· η(x + y1 + · · · + ym ) dy1 · · · dym
h h
0 0
< ··· fm (x, h, y1 + · · · + ym ) dy1 · · · dym
h h
= (−1) Em (x, h).
m
(7) If δ and x satisfy 0 < δ < (1 − 1/x)/m, then −1 x ≤ η(x) ≤ 2 x. To see
this let h = δx with δ > 0. By (5) and the definition of 2 , there is a z ∈ [0, h]
such that
Em (x, δx) mδx
η(x + z) ≤ + − z ≤ x(1 + 2 ).
δm xm 2
Therefore, since z ≥ 0 and − 12 log(1 − 1/(x + z)2 ) ≤ − 12 log(1 − 1/x2 ) we get
1 1
ψ(x) ≤ ψ(x + z) ≤ x(1 + 2 ) − log(2π) − log 1 − 2 .
2 x
In other words, η(x) ≤ 2 x. The proof of the lower bound follows in a similar
manner.
Now define for m ≥ 1
xβ−1
V(m, x) := ,
ρ
|γ|m+1
Thus
h xρ+1 − (x + h)ρ+1
η(x + z) dz = .
0 ρ
ρ(ρ + 1)
Take absolute values of the result of Step (1) using the upper bound
xρ+m < x
β+m
ρ(ρ + 1) · · · (ρ + m) |γ|m+1
Hence
xβ−1
|Em (x, ±xδ)| ≤ xm+1 bm (δ)
ρ
|γ|m+1
= xm+1 bm (δ)V(m, x).
(3) Since we are given δ ≥ 2V(m, x)1/(m+1) we have
V(m, x) 1
≤ m+1 ,
δm+1 2
and therefore
Em (x, xδ) mδ
2 = +
xm+1 δm 2
xm+1 bm (δ) xβ−1 mδ
< m+1 m +
x δ ρ
|γ|m+1 2
δ V(m, x)
= 2bm (δ) m+1 + m
2 δ
δ bm (δ)
≤ +m .
2 2m
The proof for 1 is similar.
Recall the definition φm (γ) = exp(−log x/(R log γ))/|γ|m+1 . Also note that
we are designating H as a fixed height below which all complex zeros of ζ(s)
are on the critical line, and
xβ−1
V(m, x) := ,
ρ
|γ|m+1
and so
xβ−1
< 2 φ m (γ) ≤ φm (γ).
1
|γ|m+1 1 γ>H
β> 2 β> 2 , γ>0
where
a1 log h + a2 loglog h + a3
P(h) := 2
hm+1
and
1 a1 log h + a2
Q(h) := + .
2π h log h log(h/2π)
Proof By Lemma 4.2 (which relies simply on Lemma 4.1, a generalized
form of Abel summation) and Backlund’s bound B(T ), we can write
∞ t
1 a2 1
φm (γ) < log + a1 + φm (t) dt + 2B(h)φm (h).
h<γ h 2π 2π log t t
and 3.5 to obtain an upper bound for V(m, x). The reader might compare this
with [145, lemma 6]. Recall that we have defined B1 (m, x, H) in the proof of
Lemma 3.1:
h
1 + m log
2π
B1 (m, x, H) := .
1 + m log (h/2π) log x
2 m
mh x 1/(R log h) 1−
log (h/2π) Rm2 log2 h
Lemma 3.6 Let 0 < 2 log x < Rm log2 H. Then
km (0) P(H)
V(m, x) < √ + 1/(R log H) + Q(H)B1 (m, x, H) =: B2 (m, x, H).
x x
Next recall the definition:
1 1
η(x) := ψ(x) − x + log(2π) + log 1 − 2 , x > 1.
2 x
3.2 Constructing Tables of Bounds for ψ(x) 49
1 0.046300 2 0.0016700
3 0.000074400 4 4.46301 × 10−6
5 2.88354 × 10−7 6 1.93507 × 10−8
7 1.32606 × 10−9 8 9.19838 × 10−11
9 6.42733 × 10−12 10 4.5113 × 10−13
11 3.17539 × 10−14 12 2.23908 × 10−15
13 1.58067 × 10−16 14 1.11669 × 10−17
15 7.89294 × 10−19 16 5.58061 × 10−20
17 3.94653 × 10−21 18 2.79132 × 10−22
19 1.97443 × 10−23 20 1.3967 × 10−24
21 9.88051 × 10−26 22 6.98986 × 10−27
23 4.94499 × 10−28 24 3.49838 × 10−29
25 2.47499 × 10−30 26 1.75098 × 10−31
27 1.23877 × 10−32 28 8.76397 × 10−34
b m b
10 1 0.03604387616
11 1 0.02792600226
12 1 0.02166958397
13 1 0.01683159959
14 1 0.013082730087
15 1 0.010173897079
16 1 0.007914814691
17 1 0.006159239110
18 1 0.004794353324
19 1 0.003732913461
20 1 0.002907340379
25 1 0.0008427406872
30 1 0.0002748595135
35 1 0.0001515932079
40 1 0.0001348313906
45 2 0.0001315791411
50 2 0.00008556326873
55 2 0.00003720592016
60 2 0.000016278904663
65 2 7.594348788 × 10−6
75 2 4.457990402 × 10−6
80 2 4.377481479 × 10−6
85 2 4.330519430 × 10−6
90 2 4.286798206 × 10−6
95 3 2.5990044920 × 10−6
100 3 1.4581963574 × 10−6
3.3 Exact Verification Using Computation 51
Table 3.3 More values of b, m and b .
b m b
0.04
m=1
0.03
0.02
0.01
b
0 20 40 60 80 100
7. × 10–7
6. × 10–7
5. × 10–7
4. × 10–7
3. × 10–7
—m=3
2. × 10–7
1. × 10–7
b
0 200 400 600 800 100
5. × 10–8
4. × 10–8
3. × 10–8
2. × 10–8
m=7
1. × 10–8
b
0 2000 4000 6000 8000 10 000
To achieve that target we need to approximate f (n) and the terms of (ai ) with
a “local accuracy” > 0 so we can calculate rationals (bi ) and g(n) such that
|ai − bi | ≤ and | f (n) − g(n)| ≤ for all 1 ≤ i, n ≤ N0 . Then we need to have,
for each 1 ≤ n ≤ N0 ,
n
n
0 ≤ f (n) − ai ≤ g(n) + − bi + n,
i=1 i=1
so we should test 1 := 0 − (n + 1) ≤ g(n) − ni=1 bi , and that should be
sufficient to verify (3.1) exactly.
For example, if we wanted to show that θ(x) < x up to x = 1011 with
accuracy 10−10 we could set the local accuracy to = 10−23 and define
an = log n for prime n and zero otherwise.
(2) Continuing with this example, the equation θ(n) < n will have been
verified at integer values n of x in the given range. Since θ(x) is constant
between integers and f (x) = x is increasing, evaluation at integers is sufficient
to show it is true for all real numbers x up to N0 .
(3) In case the inequality is similar, for example, to 0.8x < θ(x) for 0 < x ≤
106 say, then the process adopted is similar, except we verify the equation,
using the approach of Step (1), 0.8(n + 1) < θ(n) at integers n. Then, since
54 Estimates
k1 (0)
∞
1.9 3
1 (x) := √ + S n x−1/Rn + + 2,
x n=9
x x
∞
log x
2 (x) := S n exp − ,
n=9
Rn
k1 (0) 1.9 3
3 (x) := √ + + 2 so 1 = 2 + 3 .
x x x
(2) Next we claim that for S r , when r ≥ 9 we have S r < (0.274)r exp(−r).
To see this, define
Δ := F(er ) − F(er−1 ),
Hence
1
Sr =
γ2
er−1 <γ≤e
r
19 + 2.24r
Δ
1
< + 2
e2r−2 2πn
n=1
+ er−1
r−a
1
Δ
2.24(r + 9) 1
< + 2
e2r−2 e2r−2 n=1 (e − 1)n
1+
Δ
56 Estimates
Δ
2.24(r + 9) 1 dt
< 2r−2
+ 2r−2 2
e e 0 e−1
1+ t
Δ
2.24(r + 9) Δ
= 2r−2
+ 2r−1
e e
2.24(r + 9) (r − a)(e − 1)
= +
e2r−2 2πer
< 16.6(r + 9)e−2r + 0.274(r − 2.25)e−r < 0.274re−r ,
where the last inequality is true for r ≥ 9. This completes the proof of the
claim.
(3) In this step we focus on the term that depends on the zeta zeros ρ where
ρ = β + iγ, namely 2 (x). Using the result of Step (2) we have
∞ ∞
log x
2 (x) = S n exp − < (0.274) ne−n x−1/Rn .
n=9
Rn n=9
Therefore
x2 x2 x2
ψ1 (x) − ≤ 2x 1 (x) =⇒ − 2x2 1 (x) ≤ ψ1 (x) ≤ + 2x2 1 (x).
2
2 2 2
Therefore, if h > 0, because 1 (x) > 1 (x + h) we have
ψ1 (x + h) − ψ1 (x) < 12 (x + h)2 − 12 x2 + 2(x + h)2 1 (x + h) + 2x2 1 (x)
≤ xh + 12 h2 + 21 (x)(2x2 + 2xh + h2 ).
Discarding the positive last term in the first lower bound below
ψ1 (x) − ψ1 (x − h) ≥ 12 x2 − 2x2 1 (x) − 12 (x − h)2 + 2(x − h)2 1 (x − h)
> xh − 12 h2 − 2x2 1 (x)
> xh − 12 h2 − 21 (x)(2x2 + 2xh + h2 ).
Thus, because ψ(x) is non-decreasing we have
1 21 (x) 2 ψ1 (x) − ψ1 (x − h)
x− h− (2x + 2xh + h2 ) ≤ ≤ ψ(x)
2 h h
ψ1 (x + h) − ψ1 (x) 1 21 (x) 2
≤ ≤ x+ h+ (2x + 2xh + h2 ).
h 2 h
√
Finally, set h := 4x 1 (x) and use (3.4) to derive
√
|ψ(x) − x| ≤ x 1 3 + 4 1 (x) + 81 (x)
⎛ ⎞
⎜⎜⎜ log x ⎟⎟⎟⎟
< 4x (log x)/8 exp ⎜⎝− ⎜ ⎟
R ⎠
⎛ ⎞
⎜⎜⎜ log x ⎟⎟⎟
= x 2 log x exp ⎜⎜⎝− ⎟⎟ ,
R ⎠
for x ≥ e140 , completing the proof of the estimate for ψ(x).
(5) Bringing the estimates together,
θ(x) − x ≤ ψ(x) − x ≤ 3.983x (log x)/8 exp(− (log x)/R) < (x)x.
√ √
Then, using Lemma 3.10 which gives ψ(x) − θ(x) < 1.02 x + 3 3 x (x > 0),
we get
√ √
θ(x) − x ≥ ψ(x) − x − 1.02 x − 3 3 x
√ √
≥ −3.983x (log x)/8 exp(− (log x)/R) − 1.02 x − 3 3 x
≥ −4x (log x)/8 exp(− (log x)/R) = −x(x),
with the last inequality holding for x ≥ e81 . Therefore
|θ(x) − x| < (x)x, x ≥ e140 .
This completes the proof.
3.4 Estimates for θ(x) 59
Table 3.4 Values of b = log x, and rounded log (x) for R = 8 and R = 18.
In Tables 3.4 and 3.5 we give values of log (x) for a range of values of b =
log x. The second and fifth columns of Table 3.4 are computed rounding the
result of Theorem 3.8, as are the R = 8 plots in Figures 3.4 and 3.5. The third
and sixth columns
are computed
using the Rosser–Schoenfeld expression for
(x), namely log x exp(− (log x)/R), as are the R = 18 plots in Figures 3.4
and 3.5.
As an application of these values, first consider Table 3.5 at line b = 106 .
Since exp(−346) < 5.43 × 10−151 , we get
6
|ψ(x) − x| ≤ x × 5.43 × 10−151 , x ≥ e10 .
At the other extreme, from Table 3.4, line b = 1000,
3
|ψ(x) − x| ≤ x × (0.00092), x ≥ e10 .
60 Estimates
x
b
2000 4000 6000 8000 10 000
–5
–10
–15
—R=18
–20
–25
–30 —R=8
x
b
200 000 400 000 600 000 800 000 1 × 106
–50
–100
–150
–200
—R=18
–250
–300
–350 —R=8
This compares badly with the value obtained in Section 3.2; see Table 3.3,
which gives
3
|ψ(x) − x| ≤ x × (4.45 × 10−9 ), x ≥ e10 .
There is one more preliminary lemma in this chapter, viz. Landau’s lemma
(Lemma 3.9). We have made some small changes in his proof and result,
but the interesting parts of a fascinating proof are his. Other than this, the
remaining material consists of essential estimates, with applications spread
3.4 Estimates for θ(x) 61
throughout the rest of the book. For example, Lemma 3.14 below is used in
Lemmas 3.10, 5.6, 7.7, 8.13 and 9.16, and in Theorems 5.25, 7.11 and 7.13.
Lemma 3.9 [105, pp. 89–90] For all x ≥ 1 we have
ψ(x) < αx + (log x + 2)(3 log x)
where
6 log 2 log 3 log 5 log 30
α := + + − < 1.105551.
5 2 3 5 30
Proof (1) For x ≥ 1 define
T (x) := log n = log (x!) .
1≤n≤x
Then
x−1 x−1
n+1
T (x) = log n + logx ≤ log t dt + logx
n
x
n=1 n=1
Hence there is a θ0 , which depends on x, with |θ0 | ≤ 1 such that we have the
estimate
T (x) = x log x − x + 1 + θ0 log x.
Define U(x) := T (x) − T (x/2) − T (x/3) − T (x/5) + T (x/30) and evaluate U(x)
using the estimate for T (x) to obtain the expression, for some θ1 with |θ1 | ≤ 1
and x ≥ 30,
log 2 log 3 log 5 log 30
U(x) = x + + − + θ1 (5 log x)
2 3 5 30
= βx + θ1 (5 log x), (3.5)
where β = 0.921292 . . . . Equation (3.5) was proved for x ≥ 30. A simple
computation shows it is also true for 1 ≤ x ≤ 30. Note that the combination of
T evaluations has been chosen to make this bound possible.
62 Estimates
x
(2) Next we claim we can write T (x) = n=1 ψ(x/n). To see this derive
x x
T (x) = log (x!) = log p + 2 + ···
p≤x
p p
∞ ∞
x x
= log p m
= log p m ,
p≤x m=1
p m=1 m√ p
p≤ x
and so
∞
x ∞ x
x
T (x) = log p = θ m
√
m=1 n=1 p≤ m x m=1 n=1
n
x ∞ x x
x
= θ m = ψ .
n=1 m=1
n n=1
n
Then substitute this expression for T (x) into the definition of U(x) to get
∞ x x x x ∞ x
x
U(x) = ψ −ψ −ψ −ψ +ψ = cn ψ ,
n=1
n 2n 3n 5n 30 n=1
n
we can write
log
√
x/log 2
when x ∈ [10 , 10 ].
5 18
64 Estimates
We can now complete the proof of (3.6) in the range x ≥ 1018 . Using (3.7)
and (a) and (b) again, we can write for x ≥ 1018
√ √ √
ψ(x) − θ(x) − θ( x ) = θ( 3 x ) + θ( n x )
n≥4
√ √
≤ θ( 3 x ) + θ( 4 x )(log x/log 2 − 3)
√3 log x/log 2 − 3 √
< 1.0009 x 1 + √
12
< 3 3 x.
x
This completes the proof of (c).
(d) A numerical check shows that for integers x in the range 0 < x ≤ 1010
we have
√
x − 2.05282 x < θ(x) < x. (3.8)
√
Thus, for 0 < x ≤ 1010 , by (3.8) we have x − 2.05282 x < θ(x), and so for
121 ≤ x ≤ 1010
2.05282
0.813337x ≤ x 1 − √ < θ(x).
x
√ √
By (c) we have ψ(x) − 1.0009 x − 3 3 x < θ(x) for x > 0, and by Table 3.2,
x(1 − 0.0361) < ψ(x) for x > 1010 . Therefore, for x ≥ 1010
1.0009 3
0.8x < x 1 − 0.0361 − √ − √6 < θ(x)
x x
which completes the proof.
A note regarding part (a) is needed here. It is known, see Platt and Trudgian
[135, theorem 1], that θ(x) < x for 2 < x < 1.39 × 1017 , improving on a result
by Schoenfeld [153, p. 360] (see also [142, (4), p. 194]). In addition Platt
and Trudgian showed that for all x > 0 we have θ(x) < (1 + 7.5 × 10−7 )x.
The paper [135] also contains the result (corollary 2), which is then an easy
consequence,
√ √
ψ(x) − θ(x) < (1 + 7.5 × 10−7 ) x + 3 3 x, x > 0.
The note [153, p. 360] states that for all 0 < x we have θ(x) < 1.000081x. This
was not proved by Schoenfeld in that paper, but is the subject of a final note,
with details to follow. However, the details have not been found. We have a
simple derivation of the estimate, using RH, which is all that is needed.
Lemma 3.11 If we assume RH, then for 0 < x we have θ(x) < 1.000081x.
Proof Since by Lemma 3.10, θ(x) < x for 0 < x ≤ 1011 , the upper bound
for θ(x) holds in that range. By Theorem 4.6, which depends on RH,
3.5 More Estimates 65
we have
√
x log2 x log2 x
θ(x) − x ≤ =⇒ θ(x) ≤ x 1 + √ < 1.000081x
8π 8π x
for x ≥ 1011 .
Using the same approach as for Lemma 3.10 (d), we obtain the following
list of lower bounds for θ(x):
Lemma 3.12 [145, theorem 10] We have θ(x) ≥ βx for x ≥ α when (α, β) are
in the list:
{(89 387, 0.995), (32 057, 0.990), (11 927, 0.985), (7481, 0.980),
(5381, 0.975), (3457, 0.970), (2657, 0.965), (1481, 0.960), (1433, 0.955),
(1427, 0.950), (853, 0.945), (809, 0.94), (599, 0.93), (557, 0.92),
(349, 0.91), (227, 0.89), (149, 0.86)}.
(d) Up to 1016 the result follows by (b). √ For x > 1016 use the pair
(0.98, 7481) from Lemma 3.12 and write, since x ≥ 7481 in this range,
√ √
ψ(x) − θ(x) ≥ θ( x ) ≥ 0.98 x.
|ψ(x) − x| < b x, x ≥ eb ,
so therefore
√ √ √ √
(1 − b )x − 1.0009 x − 3 3 x < ψ(x) − 1.0009 x − 3 3 x < θ(x).
If b j and b j+1 are in the tables with b j < b j+1 and j corresponds to a j = eb j ,
then for a j ≤ x ≤ a j+1 we have
⎛ ⎞
1 1 ⎜⎜⎜ 1.02 3 ⎟⎟⎟⎟
x 1− ≤ x 1− ⎜
≤ x ⎜⎝1 − j − √ − 2/3 ⎟⎠ < θ(x)
8 log x 8 log a j+1 aj aj
if
1.02 3 1
j + √ + 2/3 ≤ ,
aj aj 8 log a j+1
3.6 Unsolved Problems 67
for x ≥ 1011 . Now e25 < 1011 and we were able to verify the inequality for j
using four intervals for b, namely
[25, 30], [30, 40], [40, 100] and [100, 5000].
(3) For x ≥ e5000 use Theorem 3.8. In this range, with the notation in that
theorem and with R ≥ 8 we have
⎛ ⎞
⎜⎜⎜ log x ⎟⎟⎟ 1
(x) = 2 log x exp ⎜⎜⎝− ⎟⎟⎠ < .
R 8 log x
This completes the proof of (a) and therefore of (b)–(e) for x ≥ 1011 .
4.1 Introduction
In this chapter a number of classical equivalences to the Riemann hypothesis
are presented, several with complete proofs. In Section 4.2 the explicit
result of Schoenfeld is proved in detail, namely that RH is equivalent to the
inequality
√
x log2 x
|ψ(x) − x| ≤ ,
8π
being true for all x ≥ 74. We call this the Schoenfeld criterion. It is directly
related to the size of the error in different forms of the prime number theorem.
Note that RH is equivalent to the weaker form
√
|ψ(x) − x|
x log2 x or even |ψ(x) − x|
x1/2+ .
68
4.2 The Prime Number Theorem and Its RH Equivalences 69
and is such that Li(x) = li(x) − li(2) for x ≥ 2 with 1 < li(2) < 1.046.
Schoenfeld [153, 154] also found an explicit bound for the error in the
estimate ψ(x) ∼ x in the case of the function ψ(x), where ψ(x) := pn ≤x log p
counts primes with a weighted sum. This function, and its close companion
θ(x) := p≤x log p, are fundamental components of this study. They are named
70 Classical Equivalences
so therefore, noting that A(γn ) = A(b) and shifting the index in the first sum:
b
A(t) f (t) dt
a
4.2 The Prime Number Theorem and Its RH Equivalences 71
γ1 γn b
= A(t) f (t) dt + A(t) f (t) dt + A(t) f (t) dt
a γ1 γn
= A(a)( f (γ1 ) − f (a)) + A(γn )( f (b) − f (γn ))
n−1
+ (A(γ j ) f (γ j+1 ) − A(γ j ) f (γ j ))
j=1
= A(b) f (b) − A(a) f (a) + A(a) f (γ1 ) − f (γn )A(γn )
n−1
n
− A(γ j ) f (γ j ) + A(γ j−1 ) f (γ j )
j=1 j=2
= A(b) f (b) − A(a) f (a) + A(a) f (γ1 ) − f (γn )A(γn )
n−1
+ (A(γ j−1 ) − A(γ j )) f (γ j ) + A(γn−1 ) f (γn ) − A(γ1 ) f (γ1 )
j=2
n
= A(b) f (b) − A(a) f (a) + A(a) f (γ1 ) − f (γn )A(γn ) − a j f (γ j )
j=1
+ A(γn−1 ) f (γn ) − A(γ1 ) f (γ1 )
n
= A(b) f (b) − A(a) f (a) − a j f (γ j ).
j=1
By specializing to the case of the positive imaginary parts of zeta zeros, using
Lemma 4.1 we obtain a simplified upper bound for a functional sum. First
recall some definitions from Section 2.2 (11):
N(T ) := #{ρ : ζ(ρ) = 0, 0 < ρ ≤ T },
T T 7
F(T ) := log + ,
2π 2πe 8
B(T ) := a1 log T + a2 loglog T + a3 .
Lemma 4.2 [146, lemma 7, p. 255] Let 1 < a < b and the real function f (y)
be such that f (y) is continuous with f (y) ≤ 0 and f (y) > 0 for a ≤ y ≤ b. Then
b b
1 y a2 f (y)
f (γ j ) ≤ f (y) log dy + a1 + dy + 2B(a) f (a).
a<γ ≤b
2π a 2π log a a y
j
72 Classical Equivalences
Now Backlund’s explicit estimate for N(T ) (see Section 2.2) can be written,
for all T ≥ 2,
Lemma 4.3 [146, lemma 8, p. 256] Assume the Riemann hypothesis is true.
If x > 1 and 0 < δ < 1 − 1/x we have
1 1 √ δx
ψ(x) − x + log(2π) + log 1 − 2 ≤ x (S 1 (δ) + S 2 (δ)) + .
2 x 2
Proof (1) First we need to define some functions:
1 1
η(x) := ψ(x) − x + log(2π) + log 1 − 2 ,
2 x
h
E(x, h) := η(x + y) dy,
0
E(x, h) nhα
fn,α (x, h, z) := + − zhα−1 ,
hn 2
E(x, h) h
f (x, h, z) := f1,1 (x, h, z) = + − z,
h 2
E(x, −xδ) δ
1 := + ,
x2 δ 2
E(x, xδ) δ
2 := + .
x2 δ 2
Also assume x > 1 and x + h > 1. The variable h could be negative, indeed
we will choose later to have h = ±δx. In the properties below, (2) and (3) are
and (4) can be deduced by induction on n ≥ 1.
immediate
h
(2) f1,α (x, h, z) dz = E(x, h).
0 h
(3) fn,α (x, h, y1 + · · · + yn ) dyn = fn−1,α+1 (x, h, y1 + · · · + yn−1 ).
0 h h
(4) E(x, h) = ··· fn,α (x, h, y1 + · · · + yn ) dyn · · · dy1 .
0 0
(5) If 0 < h there is a z with 0 ≤ z ≤ h such that η(x + z) ≤ f (x, h, z): if not,
we would have by (2)
h h
E(x, h) = η(x + y) dy > f (x, h, y) dy = E(x, h),
0 0
which is false.
(6) If 0 > h there is a z with 0 ≥ z ≥ h such that η(x + z) ≥ f (x, h, z): if not,
we would have
0 0
−E(x, h) = η(x + y) dy < f (x, h, y) dy = −E(x, h).
h h
(7) If < δ < 1 − 1/x then −1 x ≤ η(x) ≤ 2 x: let h = δx with δ > 0. By (5)
there is a z ∈ [0, h] such that
E(x, δx) δx
η(x + z) ≤ + − z ≤ x(1 + 2 ).
δx 2
74 Classical Equivalences
Let h = ±δx and recall we are assuming RH is true. In the first sum use the
bound
ρ+1 δ
x − (x + h)ρ+1 δ(2 + δ)
≤ x (1 + y) = x3/2 .
3/2
(ρ + 1) 0 2
In the second sum use the upper bound |1 − (1 ± δ)1+ρ | ≤ (1 + (1 + δ)2 ) so that
ρ+1
x − (x + h)ρ+1 1
≤ x (2 + 2δ + δ ) .
3/2 2
ρ(ρ + 1) |ρ(ρ + 1)|
4.2 The Prime Number Theorem and Its RH Equivalences 75
Therefore
1 1
|E(x, ±xδ)| ≤ x3/2 (2 + δ)δ + 2x3/2 (2 + 2δ + δ2 ) .
0<γ≤T
|ρ| T <γ
|ρ(ρ + 1)|
Finally using this bound, the definitions of E(x, h), i , S i (δ) and Step (7), we
get
√ δx
|η(x)| ≤ x(S 1 (δ) + S 2 (δ)) + ,
2
and the proof of the lemma is complete.
Now let T = 2(2 + 2δ + δ2 )/(δ(2 + δ)) so T → ∞ as δ → 0+. Set D :=
158.84998, just less than the y-coordinate of the 58th zeta zero. We also
in Lemma 4.4 and in Theorem 4.5 use the choices a1 = 0.137 and a2 =
0.443, which are sufficient for our purposes, but not the best available. See
Section 2.2 and the unsolved problems at the end of this chapter.
Lemma 4.4 [146, lemma 9, p. 262] If T 1 ≥ D and δ > 0 then
2
2+δ T1
S 1 (δ) + S 2 (δ) < log +1 +1 .
4π 2π
Proof A computation gives
1
< 0.8113925.
γ≤D 1
4
+ γ2
Therefore
S 1 (δ) 1 T 0.224212 0.224212 2B(D)
< 0.8113925 + log2 − + +
2+δ 4π 2π T D D
1 D
− log2
4π 2π
1 T 0.224212
≤ log2 − − 0.791068.
4π 2π T
Again using Lemma 4.2, this time with f (y) = 1/y2 , we get
1 1
<
T <γ
|ρ(ρ + 1)| T <γ γ2
∞ ∞
1 1 y 0.443 dy 2B(T )
≤ 2
log dy + 0.137 + +
2π T y 2π log T T y3 T2
76 Classical Equivalences
1 T 1 0.443 2B(T )
= log + 1 + 2 0.137 + + .
2πT 2π 2T log T T2
Then, making the choice T = T 1 where (2 + δ)T 1 = 2(2 + 2δ + δ2 )/δ we get
2
2+δ T1
S 1 (δ) + S 2 (δ) < log +1 +1 .
4π 2π
We are now able to present Schoenfeld’s proof of one side of the
equivalence, wherein we assume the Riemann hypothesis is true.
Theorem 4.5 [153, theorem 10, p. 337] The Riemann hypothesis implies
that for all x ≥ 23 × 108 we have
√
x log x (log x − 2)
|ψ(x) − x| <
8π
and
√
x log x (log x − 2)
|θ(x) − x| < .
8π
Proof Let x ≥ ξ ≥ 828 000. In the proof, constants δ and αi with 1 ≤ i ≤ 6
will be defined progressively, with each of them depending on ξ in the given
range, which finally will be chosen explicitly to enable the proof to succeed.
Let
log2 ξ log2 ξ
α1 := √ and α2 := √ < 0.0126. (4.1)
π ξ π ξ
Then
log x α1
δ := √ ≤ ≤ α2 .
π x log x
Now let T 1 := 2(2 + 2δ + δ2 )/(δ(2 + δ)) and D := 158.84998 so there are
precisely 57 zeros ρ = 12 + iγ of ζ(s) with 0 < γ ≤ D and T 1 > D. Then if
we set
α2 (1 + α2 )
δ1 := < 0.00634 and α3 := 2 loglog ξ − 2(1 + δ1 ) > 2.841,
2 + α2
we get
√ √
T1 x 2 + 2δ + δ2 x
log + 1 = log + 1 ≤ log (1 + δ1 ) + 1
2π log x 2 + δ log x
1 1
≤ (log x − 2 loglog x + 2(1 + δ1 )) ≤ (log x − α3 ).
2 2
4.2 The Prime Number Theorem and Its RH Equivalences 77
Now define
α23 + 4.152828
α4 := 2α3 − .
log ξ
By Lemmas 4.3 and 4.4, since
2+δ 1 α1
< 1+ ,
4π 2π 2 log x
we can write
2
|ψ(x) − x| 1 α1 T1
< √ 1+ log + 1 + 1.038207
x 2π x 2 log x 2π
log x 1 1 1
+ √ + log(2π) + log 1 − 2
2π x x 2 x
1 α1
< √ 1+ (log2 x − 2α3 log x + α23 + 4.152828)
8π x 2 log x
log x log(2π)
+ √ + .
2π x x
Thus
|ψ(x) − x| 1 α1
< √ 1 + (log2 x − α4 log x)
x 8π x 2 log x
log x 8π log(2π)
+ √ 4+ √ .
8π x x log x
Next, note that α3 < 2 loglog ξ < log ξ, so α4 increases with α3 , and therefore,
for this range,
2.8412 + 4.152828
α4 > 2 × 2.841 − > 0.
log ξ
Therefore, using the upper bound which was derived for |ψ(x) − x|/x and
defining
α1 8π log(2π)
α5 := α4 − −4− √ ,
2 ξ log ξ
we get
|ψ(x) − x| log x (log x − α5 )
< √ .
x 8π x
Finally define
8.16π 24π
α6 := α5 − − 1/6 .
log ξ ξ log ξ
78 Classical Equivalences
To get the analogous inequality for θ(x), use parts (a), (b) and
√ (d)√of
Lemma 3.10, which gives for 0 < x, the bound ψ(x) − θ(x) < 1.02 x + 3 3 x.
Then
|θ(x) − x| |ψ(x) − x| ψ(x) − θ(x) log x
< + ≤ √ (log x − α6 ).
x x x 8π x
Let ξ = 108 to compute α5 > α6 > 2, which competes the proof of the
theorem.
√
Remark The inequality |θ(x) − x| ≤ x log2 x/(8π)√succeeds for 231 integers
n in the range [2, 598]. The inequality |ψ(x) − x| ≤ x log2 x/(8π) succeeds at
integers in the list
{13, 19, 20, 23, 24, 25, 26, 27, 29, 30, 31, 32, 33, 34, 35, 38, 39,
41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57,
58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73}
and fails at all other integers in [2, 73].
Theorem 4.6 [153, theorem 1] The Riemann hypothesis implies that for all
x ≥ 74 we have
√
x log2 x
|ψ(x) − x| <
8π
and for all x ≥ 599 we have
√
x log2 x
|θ(x) − x| < .
8π
Proof In the range x ≥ 23 × 108 the result follows from Theorem 4.5.
For 90 000 ≤ x < 23 × 108 we checked the inequalities through making a
computation at integers
1 √
n+1− n + 1 log(n + 1)(log(n + 1) − 2) < θ(n)
8π
1 √
< n+ n log n (log n − 2).
8π
Below 90 000 we checked the inequality
1 √ 1 √
n+1− n + 1 log2 (n + 1) < θ(n) < n + n log2 n.
8π 8π
The derivation in the case of ψ(x) is similar.
More information about this process of verifying estimates over finite
ranges numerically is given in Chapter 3.
4.2 The Prime Number Theorem and Its RH Equivalences 79
n√log n/(8π) numerically at integers and then the inequality π(n) > li(n + 1)
− n + 1 log(n + 1)/(8π), also at integers, demonstrating that the theorem is
true for all x ≥ 2657.
Remark
√ A computation shows that the inequality |π(n) − li(n)| <
n log n/(8π) is also true for a substantial number of integers n with 2 ≤ n ≤
2656. Indeed 1434 successes were found, splitting into about 40 subranges.
We now give an example of a well-known technique which is used a few
other times in this volume in case the Riemann hypothesis is false. Define
θ := sup{σ : ζ(σ + iγ) = 0} so RH is equivalent to θ = 12 , and always 12 ≤ θ ≤ 1.
Theorem 4.8 Let > 0 be given. Then ψ(x) − x = Ω± (xθ− ), where the implied
constant is arbitrary and when Ωt is defined in Section 4.3.
80 Classical Equivalences
Then for all λ < θ we have f (x) = Ω± (xλ ) where the implied constant C may
be chosen arbitrarily large. In addition there are real sequences xn , yn → ∞
such that
√ √
xn yn
f (xn ) − Li(xn ) > and f (yn ) − Li(yn ) < − .
29 log xn 29 log yn
Then in 1905 Landau developed a new method to attack these “oscillation”
problems. As usual, we define θ(x) := p≤x log p.
Theorem 4.11 (Landau) As x → ∞ we have θ(x) = Ω± (x).
The principal tool in this work is the theorem of Landau given in his 1905
paper [104], but also in [80, theorem H] and [166, chapter II.1, theorems 6
and 7]. The proof is included because the result is fundamental to this work,
and used in many places.
Theorem 4.12 (Landau) Let s ∈ C and let f (x) : [1, ∞) → R be measurable
and bounded on all bounded intervals. Suppose that
∞
dx
F(s) := f (x) s
1 x
has a finite abscissa of convergence σc so F(s) ∈ C if s > σc .
(a) If there exists an a ∈ R such that f (x) is non-negative or non-positive for
x ≥ a, then the integral F(s) for σ = s > σc has a singularity at s = σc
and F(s) converges in a half plane such that it is holomorphic for σ > σc
but not in any half plane σ > σc − for any > 0.
82 Classical Equivalences
Suppose that f (x) ≥ 0 for x ≥ a. (If f (x) ≤ 0 replace f (x) by − f (x).) Therefore
if b > a and n ≥ 0 we can write
∞ b
f (x) logn x f (x) logn x
(−1) F (α) =
n (n)
dx ≥ dx.
1 xα 1 xα
Therefore if s ∈ R satisfies α − δ < s ≤ α, F(s) is real also and we can write
∞ b ∞
(α − s)n b f (x) logn x (α − s)n f (x) logn x
F(s) ≥ dx = dx.
n=0
n! 1 xα 1 n=0 n! xα
To justify the interchange of the sum and the integral note that for 1 ≤ x ≤ b
we have
n
(s − α) f (x) log x ≤ M δ log b
n n n
n! xα n!
where M := max{| f (x)x−α | : 1 ≤ x ≤ b}, so the series is uniformly convergent.
Therefore
b b
f (x) (α−s) log x f (x)
F(s) ≥ α
e dx = dx.
1 x 1 xs
Because for real s ≥ a the integrand is positive, this integral is increasing with
b for b ≥ a, and bounded above by F(s). Hence the integral is convergent at
all points of the interval s ∈ (α − δ, α]. But this is impossible since the interval
properly contains s = σc . Therefore the function defined by the integral has a
singularity at s = σc .
(b) Assume, to derive a contradiction, that there exists a constant M > 0
such that f (x) ≤ Mxσc − for all x ≥ 1. Then if we define
g(u) := Me(σc −)u − f (eu )
4.3 Oscillation Theorems 83
To see this we first apply von Mangoldt’s theorem, Theorem 2.2, to get, for
some constant c ∈ R,
x xρ+1
(ψ(t) − t) dt = − − cx + O(1),
0 ρ
ρ(ρ + 1)
where the sum is over all of the non-trivial zeta zeros. This implies
δ eδ(ρ+1) − e−δ(ρ+1) ρ
A(δ) = − x + O(1). (4.4)
sinh(δ) ρ 2δρ(ρ + 1)
Next using
∞
d(t log t) 2 + log T log T
2
=
,
T t T T
we have
sin(γδ) xiγ sin(γδ) xiγ
δ ρ(ρ + 1) δ ρ(ρ + 1)
ρ γ≥1
sin(γδ) γ 1 |sin(γδ)|
+
1≤γ≤1/δ
γδ γ2 δ 1/δ<γ γ2
1 1 1
+
1≤γ≤1/δ
γ δ 1/δ<γ γ2
log2 (1/δ) + log(1/δ)
log2 (1/δ).
We have sin(γδ)/δ ≤ |γ| and by RH, 1/ρ = 1/(iγ) + O(1/γ2 ) and 1/(ρ + 1) =
1/(iγ) + O(1/γ2 ). Therefore
√ sin(γδ) xiγ √
A(δ) = − x + O( x)
ρ
γδ iγ
√ sin(γδ) xiγ − x−iγ √
= −2 x + O( x)
ρ
γδ iγ
√ sin(γδ) sin(γ log x) √
= −2 x + O( x),
γ>0
γδ γ
Therefore for x = N n exp(1/N), δ = 1/N and using the result of Step (3)
(this is the key to the proof) and Step (5), we see A(δ) can be expressed as
1
∞
sin(γ j /N) sin(γ j log x) √
A(δ) = −2x 2 · + O( x) (4.5)
j=1
γ j /N γj
1
sin(γ j /N) sin(γ j log x)
= −2x 2 ·
γ j ≤N log N
γ j /N γj
⎛ ⎞
⎜⎜⎜ √ sin(γ /N) sin(γ log x) ⎟⎟⎟ √
+ O ⎜⎜⎜⎜⎝ x ⎟⎟⎟⎟ + O( x)
j j
· ⎠
γ j >N log N
γ j /N γj
2
x 2 sin(γ j /N) √
1
= −2 + ( x)
N γ ≤N log N γ j /N
j
√ N log N 2
x sin(t/N) √
−2 log t dt + O( x)
N 1 t/N
√
1
x2
−2 N log N + O( x)
N
1 √
= −2x 2 log N + O( x).
(4.6)
Similarly for x = N n exp(−1/N) we have
∞
sin(γ j /N) sin(γ j log x) √
−2x 1/2
+2x1/2 log N + O( x). (4.7)
j=1
γ j /N γj
K
(6) Because x ≤ N N e1/N , T = N log N and K T log T N log2 N we get
loglog x
N log3 N.
Taking logarithms for a constant C > 0 we get from this
log N + logC + 3 loglog N = (1 + o(1)) log(N) ≥ logloglog x,
which implies
log N ≥ (1 + o(1)) logloglog x.
(7) Finally, by (4.5) and (4.7), and the bound established in Step (6), letting
δ → 0+ at a point of continuity of ψ(x) we can write
√
ψ(x) = x + Ω± ( x logloglog x).
The same is true at points of discontinuity, since the jump is O(log x).
(8) Considering the expansion of ψ(x) in terms of θ(x1/ j ), with 1 ≤ j ≤
log x/log 2, and the bound θ(x)
x, we can write
√ √
ψ(x) = θ(x) + θ( x) + O(x1/3 log x) =⇒ 0 ≤ ψ(x) − θ(x)
x.
4.3 Oscillation Theorems 87
√
Thus ψ(x) = θ(x) + O( x), and substituting this in (4.2) completes the
proof.
This theorem can be shown to be equivalent to
√
x logloglog x
π(x) = Li(x) + Ω± .
log x
The reader might compare this with theorem of Walfisz, giving the best-
known error in the prime number theorem.
Littlewood (Figure 4.1) gives an explicit form for the estimates of
Theorem 4.13. There are sequences of positive real numbers xn → ∞ and
yn → ∞ such that for all n ∈ N we have
√
xn logloglog xn
π(xn ) > Li(xn ) + π(xn )
3 log xn
and
√
yn logloglog yn
π(yn ) < Li(yn ) + .
3 log yn
It follows from these inequalities that there is an x > 0 such that π(x) >
li(x). The Skews number is the smallest natural number S for which this
inequality is true. This number is very celebrated, its proved upper bound
being at one time the largest number appearing in a mathematical proof.
Indeed Skews proved in 1955 [157] that
10964
S < 1010 .
There has been a great deal of work reducing the size of the known upper
bound and finding lower bounds for S . For example Zegowitz [183] showed
in 2010 that S < exp(727.951347) and Kotnik [97] in 2008 that 1014 < S .
For this section, and indeed all of this chapter, the reader would do well to
consider the article by Don Zagier [182].
Now we will see below one of the most closely considered equivalences
to RH, due originally to Littlewood (see Littlewood [109] or Edwards [51,
section 12.1]), namely that RH is equivalent to the estimate that for all > 0
μ(n)
x1/2+ .
n≤x
Lemma 4.14 [51, p. 54] Let a > 0, T > 0 and 0 < x < 1. Then
a+iT s
1 x xa
ds ≤ .
2πi a−iT s πT |log x|
Proof Let a < A and apply Cauchy’s residue theorem to the function x s /s on
the rectangle [a, A] × [−T, T ], on which the function is holomorphic. Thus the
contour integral of the function around the boundary of the rectangle is zero
so
a+iT s A+iT s A−iT s a+iT s
1 x 1 x 1 x 1 x
ds = − ds − ds − ds.
2πi a−iT s 2πi A−iT s 2πi a−iT s 2πi A+iT s
Therefore, since the modulus of the integrand is bounded by xA /A on the
right-hand vertical section of the contour and, if σ = s, by xσ /T on the
horizontal sections, we have
a+iT s A σ
1 x 1 2T xA 1 x T xA (xa − xA )
ds ≤ +2 dσ ≤ + .
2πi a−iT s 2π A 2π a T πA πT |log x|
Letting A → ∞ the lemma follows.
4.4 Errors in Arithmetic Sums 89
Lemma 4.15 [51, p. 55] Let a > 0, T > 0 and x > 1. Then
a+iT s
1 x xa
ds − 1 ≤ .
2πi a−iT s πT |log x|
Proof The proof is similar to that of Lemma 4.14, except this time integrate
around [−A, a] × [−T, T ] and evaluate the residue of the function x s /s at
s = 0.
Proof (1) First assume the given condition, i.e. for all > 0 we have
μ(n)
x1/2+ , x → ∞.
n≤x
This part of the proof will show the condition implies the Riemann hypothesis
is true. Now
1 ∞
μ(n)
= , s > 1.
ζ(s) n=1 n s
Using Abel summation [6, theorem 4.2] with M(x) := n≤x μ(n),
x x
μ(n) M(x) M(t)
s
= s
+ s dt.
n=1
n x 1 t s+1
(2) Now assume RH. Let T > 0 and x > 0, considered large, x N and
2+iT s
1 x ds
Δ(x, T ) := M(x) −
2πi 2−iT ζ(s) s
90 Classical Equivalences
2+iT
∞ x s ds
= M(x) −
1
μ(n)
2πi n s
2−iT n=1
2+iT
∞ x s ds
= μ(n) −
1
μ(n)
n<x 2πi 2−iT n=1 n s
2+iT
x s ds
= μ(n) −
1
μ(n)
n<x 2πi 2−iT n<x n s
2+iT
x s ds
−
1
μ(n) .
2πi 2−iT n>x n s
Therefore, using Lemma 4.14, Lemma 4.15, |μ(n)| ≤ 1, for values of x
restricted to being half an integer so that |log(x/n)| ≥ 2/n for x/2 ≤ n < 2x
and |log(x/n)| ≥ log 2 for x/2 > n or n > 2x, this gives
(x/n)2 (x/n)2
Δ(x, T ) ≤ +
n<x
πT |log(x/n)| n>x πT |log(x/n)|
(x/n)2 (x/n)2
≤ +
πT |log(x/n)| x/2≤n<2x πT |log(x/n)|
⎛
x/2>n or n>2x
⎞
x2 ⎜⎜⎜⎜ 1 1 1 ⎟⎟⎟
∞
≤ ⎜⎜ + ⎟⎟⎟
πT ⎝ log 2 n=1 n2 x/2<n≤2x n ⎠
2
x2 π
≤ + (log(2x) − log(x/2))
πT 6 log 2
x2
.
T
Now let > 0 be given and integrate x s /(ζ(s)s) around the rectangle
[ 12 + − iT, 2 − iT ] × 12 + − iT, 12 + + iT
to derive an alternative expression for the integral approximation to M(x)
using the approximation to 1/|ζ(s)|
δ tδ given, since we are assuming RH,
in item (8) of Section 2.2:
2+iT s 1/2+−iT s 1/2++iT s
1 x ds 1 x ds 1 x ds
= +
2πi 2−iT ζ(s) s 2πi 2−iT ζ(s) s 2πi 1/2+−iT ζ(s) s
2+iT
1 x s ds
+
2πi 1/2++iT ζ(s) s
1/2++iT0 s
δ−1 2
1 x ds
≤ 2Kδ T x +
2πi 1/2+−iT0 ζ(s) s
T
2 dt
+ x1/2+ Kδ tδ
2π T0 t
1/2+ δ
x K δ T
x1/2++2δ + .
πδ
4.4 Errors in Arithmetic Sums 91
M(x)
0.5
x
20 000 40 000 60 000 80 000 100 000
-0.5
√
Figure 4.2 The function M(x)/ x for 1 ≤ x ≤ 105 .
92 Classical Equivalences
D(N) N 1/2+ .
Then
n x
L(x) = μ 2 = μ(e) = M 2 .
d √ √ d
d |n
1≤n≤x 2 2 d≤ x e≤x/d d≤ x
From this it follows that if L(x)
x1/2+ then M(x)
x1/2+ . The criterion
then follows from the Littlewood criterion, Theorem 4.16.
4.5 Unsolved Problems 93
5.1 Introduction
This chapter is based on the work of Jean-Louis Nicolas, J. Barkley Rosser
and Lowell Schoenfeld, as published in a variety of articles, but most
especially in [126, 127, 145].
J. Barkley Rosser (1907–1989) was a logician and student of Alonzo
Church. Together they proved the “Church–Rosser theorem” of the lambda
calculus. He also undertook research in prime number theory where
“Rosser’s rule” is named after him. Less is known about Lowell Schoenfeld,
who worked in the main in number theory and complex analysis. His joint
1962 paper with Rosser, giving explicit estimates for many arithmetical
functions, has 358 citations registered by MathSciNet. Parts of that work are
detailed in Chapter 3.
Jean-Louis Nicolas received his Ph.D. in 1968 with a thesis entitled “The
order of the maximal element of the group S n and permutations of highly
composite numbers” supervised by Charles Pisot. His work on the symmetric
group is outlined in Chapter 10. His first student was Guy Robin, who
graduated in 1983, some of whose work is described in Chapter 7.
The equivalences to RH, which are the subject of this chapter, are
expressed in terms of the well-known arithmetical function ϕ(n). This was
defined by Euler in 1763 as “the multitude of numbers less than n which
have no common divisor with it”. He proved that
ϕ(n) 1
= 1− ,
n p|n
p
where the product is over primes which divide n, and proved his theorem
aϕ(n) ≡ 1 mod n when (a, n) = 1. It was Gauss who introduced the now
standard notation ϕ(n) in his 1801 “Disquisitiones Arithmeticae”, and
J. J. Sylvester in 1879 who called the function a “totient”.
94
5.1 Introduction 95
Some idea of the behaviour of ϕ(n) for large n can be obtained from the
average order
3n2
ϕ(1) + ϕ(2) + · · · + ϕ(n) ∼
π2
and the spread of values. These are such that {n/ϕ(n) : n ∈ N} is dense
in (1, ∞),
ϕ(n) ϕ(n)
lim inf =0 and lim sup = 1.
n→∞ n n→∞ n
In addition we have for all > 0
ϕ(n)
lim = ∞.
n→∞ n1−
Finally, in this brief survey of some of the properties of ϕ(n), we have
ϕ(n) loglog n
lim inf = e−γ
n→∞ n
but for n ≥ 2,
n 3
< eγ loglog n + ,
ϕ(n) loglog n
96 Euler’s Totient Function
where the lower bound also holds for 1 < x < 286. We also call on explicit
estimates from Chapters 3 and 4. These explicit estimates provide an essential
and important underpinning for the RH equivalences.
The first main result, of Rosser and Schoenfeld, Theorem 5.25, comes after
quite a large number of lemmas.
Theorem 5.25 Without any hypotheses, for all n ≥ 3 except for the ninth
primorial (see below) N9 = 2 · 3 · · · 23, we have
n 5
< eγ loglog n + . (5.1)
ϕ(n) 2 loglog n
The second main result, that of Nicolas, Theorem 5.29, is an essential tool,
used in this chapter and in Chapter 7, when we assume RH is false. Here we
use the function
1
f (x) := eγ log θ(x) 1− ,
p≤x
p
where as usual θ(x) := p≤x log p.
Lem 5.6
Lem 5.9
Lem 3.14
Lem 5.10
Lem 5.4
Thm 2.1
Lem 5.30
Lem 5.16
Lem 5.22
Lem 5.23
Lem 5.20
xβ−1
V(x) := .
ρ
γ2
This leads to the definition of the principal function in this part of the
development, φ(x, γ, R), which we write as φ(γ):
log x
exp −
R log γ
φ(γ) := .
γ2
We begin by deriving a preliminary upper bound for V(x) in terms of the
φ(γ) values. This will be made more explicit later.
Lemma 5.1 For x > 1 we have
1 1
V(x) ≤ √ + φ(γ).
x ρ γ2 H<γ
We are now able to bound sums of the φ(γ) in terms of two supplementary
functions which depend on a1 , a2 and a3 using the generalized form of Abel
summation.
Lemma 5.2 [145, theorem 27] If 2 ≤ h then
∞
t
φ(γ) < 2P(h)e−log x/(R log h) + Q(h) φ(t) log dt
h<γ h 2π
where
a1 log h + a2 loglog h + a3
P(h) =
h2
and
1 a1 log h + a2
Q(h) = + .
2π h log h log(h/(2π))
Proof By Lemma 4.2 and Backlund’s bound B(T ) we can write
∞ t
1 a2 1
φ(γ) < log + a1 + φ(t) dt + 2B(h)φ(h).
h<γ h 2π 2π log h t
Using this integral together with Lemmas 5.1 and 5.2 we get:
xβ−1 1 1
V(x) = ≤ √ + φ(γ)
ρ
ρ2 x ρ γ2 H<γ
k1 (0) k1 (0) x−1/(R log h) 1
< √ + φ(γ) + φ(γ) ≤ √ + +
x H<γ≤h h<γ
x H<γ≤h γ2 h<γ
γ2
k1 (0) 2B(h) 1 + log(h/(2π)) k1 (H)
< √ + 2 + Q(H) + 1/(R log h) .
x h h x
This completes the derivation.
Next we link ψ(x) with V(x). This takes the form of upper bounds for two
infinite integrals. Von Mangoldt’s first theorem plays a crucial role here.
Lemma 5.4 [145, lemmas 7 and 8] For all x > 1 we have the upper bounds
∞
(ψ(t) − t) log 2π 1 − log 2
(1) I1 := 2
dt < V(x) + +
x t x x3
and
∞
(ψ(t) − t)(1 + log t)
(2) I2 := dt
x t2 log2 t
2 + log x log 2π 1 − log 2
< V(x) + + ,
log2 x x x3
where log 2π < 1.84 and 1 − log 2 < 0.31.
Proof (1) Let ρ be a complex zeta zero and note that
∞
xρ−1
tρ−2 dt = − .
x ρ−1
Using von Mangoldt’s theorem, Theorem 2.1, and observing that |γ| ≤ |ρ| and
|γ| ≤ |1 − ρ| gives
∞ ∞
ψ(t) − t xβ−1 log 2π − 12 log(1 − 1/t2 )
I1 = dt
≤ + dt
x t2 ρ
|ρ(ρ − 1)| x t 2
log 2π x−2n−1
∞
≤ V(x) + +
x n=1
2n(2n + 1)
log 2π
∞
x−3
≤ V(x) + +
x n=1
2n(2n + 1)
log 2π 1 − log 2
< V(x) + + ,
x x3
which gives the bound for the integral.
5.2 Estimates for Euler’s Function ϕ(n) 103
Also
ρ−1 ∞ ρ−2 ∞ β−2
x t xβ−1 t
2 − 2 3
dt ≤
2
+2 3
dt
log x x log t log x x log t
∞
xβ−1 β−1 1 2xβ−1
< + 2x dt = .
log2 x 3
x t log t log2 x
Therefore
∞ ρ−2
1 t (1 + log t) 1 xβ−1 1 2xβ−1
dt < +
ρ x log2 t |ρ(ρ − 1)| log x |ρ − 1|2 log2 x
2 + log x xβ−1
< .
log2 x γ2
Summing we get
1 ∞ tρ−2 (1 + log t) 2 + log x
dt < V(x).
ρ x log2
t log2 x
ρ
Proof Rewrite the form of the integral and then use integration by parts:
∞ α ∞
t (1 + n log t) α d 1
dt = − t dt
x tn+1 log2 t x dt tn log t
∞
1 tα
= n−α +α n+1 log t
dt
x log x x t
∞
1 α
≤ n−α + tα−n−1 dt
x log x log x x
n 1
= ,
n − α x log x
n−α
and let u ≤ x ≤ v. Then, since L(x) and M(x, α) are decreasing as x increases,
we have
(1) In this case u = 108 , v = e600 . Let h → ∞ in Lemma 5.3 and set α = 0.31
to get κ(0.31) < 1.460 to obtain
L(u) = L(108 ) < 3.0 × 10−4 < M(e600 , 0.31) = M(v, 0.31)
=⇒ L(x) < M(x, 0.31) for 108 ≤ x ≤ e600 ,
and
3 6
L(e4×10 ) < 9.0 × 10−9 < M(e3×10 , 0.47).
so if
1
1
S := − log 1 − + ,
p>x
p p
By Lemma 3.10 we have θ(t) < 1.02t for all t > 0 so therefore using
Lemma 5.5 we get
1 ∞
1 + n log t 1.02n 1.02n
< 1.02 dt ≤ ≤ n−1 ,
p>x
p n
x
n 2
t log t (n − 1)x log x x log x
n−1
and so
1 1 1.02 1 1.02
< =
n≥2
n p>x
p n x log x n≥2
x n−2 (x − 1) log x
Proof (1) By Abel’s identity [6, theorem 4.2], with f (t) = 1/(t log t) and
an = log n for prime n and 0 otherwise, we have
1 x
θ(x) d 1
= − θ(t) dt
p≤x
p x log x 2 dt t log t
x ∞
θ(x) − x dt d 1
= + + B+ (θ(t) − t) dt,
x log x 2 t log t x dt t log t
where
∞
1 d 1
B := − loglog 2 − (θ(t) − t) dt ∈ R.
log 2 2 dt t log t
It follows that
1 ∞
θ(x) − x (θ(t) − t)(1 + log t)
− loglog x − B = + dt.
p≤x
p x log x x t2 log2 t
(3) To complete the proof we split the range. Firstly let 108 ≤ x ≤ e375 . By
Lemma 5.6 parts (1) and (3) we have in this range
0.31x
|θ(x) − x| < and L(x) < M(x, 0.31).
log x
Using the formula derived in Step (2), and the definition of M(x, α), namely,
1 α
M(x, α) := log x log 1 + 2
− ,
2 log x log x
we therefore have
1 0.31 M(x, 0.31)
− loglog x − B <
1.02
+ −
p≤x p log x2
log x (x − 1) log x
1 1.02
= log 1 + − .
2
2 log x (x − 1) log x
(4) Now let e375 ≤ x. Using Steps (2) and (4) of Lemma 5.6, we have in
this range
0.47x
|θ(x) − x| < and L(x) < M(x, 0.47),
log x
and the result follows as in Step (3).
5.2 Estimates for Euler’s Function ϕ(n) 109
Using these bounds and replacing B with the expression we have derived
gives
1 1
γ + loglog x − log 1 + < − log 1 −
2 log2 x p≤x
p
1
< γ + loglog x + log 1 + .
2 log2 x
Then because log(1 − x) < −log(1 + x) for 0 < x < 1 we get
1 1
γ + loglog x + log 1 − < − log 1 −
2 log2 x p≤x
p
1
< γ + loglog x + log 1 + .
2 log2 x
Finally, take exponentials to complete the proof.
To show how Lemma 5.15, and some of the results which follow, fit into
the overall scheme of the proof, we have another network flow diagram
(Figure 5.3).
We have defined S (x) := θ(x) − x. Analogous to J(x) let
∞
S (t) 1 1
K(x) := + dt. (5.7)
x t2 log t log2 t
Note that the prime number theorem gives S (t) = O(t/log t) and ensures the
convergence of the integral. Then we have an estimate for f (x) for x > 1:
Lemma 5.15 [126, proposition 1] For x > 1
S (x)2 1
K(x) − ≤ log f (x) ≤ K(x) + . (5.8)
2
x log x 2(x − 1)
Proof First use Abel summation [6, theorem 4.2] and θ(x) = x + S (x) to
derive an expression, in terms of S (x), for the partial sum over primes p
of prime reciprocals:
1 x
θ(x) θ(t)(1 + log t)
= + dt
p≤x
p x log x 2 t2 log2 t
x
S (x) 1 S (t)(1 + log t)
= + loglog x − loglog 2 + + dt,
x log x log 2 2 t2 log2 t
so
1 ∞
S (x) S (t)(1 + log t)
= + loglog x + B − dt
p≤x
p x log x x t2 log2 t
S (x)
= + loglog x + B − K(x),
x log x
with
∞
1 S (t)(1 + log t)
B := − loglog 2 + + dt.
log 2 2 t2 log2 t
Thus
1 S (x)
K(x) = − + + loglog x + B. (5.9)
p≤x
p x log x
Note also the given expression for B can be compared with the classical
formula given by Lemma 5.9:
1
1
= loglog x + B + O ,
p≤x
p log x
5.3 Preliminary Results With RH True 113
Now we are able to derive an expression for log f (x) where the reader will
γ
recall the definition f (x) := e log θ(x) p≤x (1 − 1/p) (see Figure 5.4). Then
1
log f (x) = loglog θ(x) + γ + log 1 −
p≤x
p
⎛ ⎞ ⎛ ⎞
⎜⎜⎜ 1 ⎟⎟⎟ ⎜⎜⎜ 1 1 ⎟⎟⎟
⎜
= ⎜⎝⎜loglog θ(x) + ⎟ ⎜
− + B⎟⎠⎟ + ⎜⎝⎜ log 1 − + + γ − B⎟⎟⎠⎟ .
p≤x
p p≤x
p p
Thus
log f (x) = U(x) + u(x),
where, using the classical expression for B, we have
1
1
u(x) = − log 1 − − .
p>x
p p
Therefore
1
U(x) := loglog θ(x) + − +B
p≤x
p
f(x)
0.99
0.98
0.97
0.96
0.95
0.94
x
0.93 200 400 600 800 1000
and
1 1 1
1 1
0 < u(x) < 1 + + 2 + ··· ≤ ≤ .
p>x
2p 2 p p n>x
2n(n − 1) 2(x − 1)
Note that by Lemma 3.10 (d), for x ≥ 121 we have θ(x) ≥ 4x/5. Note also that
for t > 1 the function
d2 1 + log t
(loglog t) = −
dt 2
t2 log2 t
is increasing. A direct computation gives, for x ≥ 121 and t := 4x/5,
1 + log t 25 log(4x/5) + 1 2
− 2
=− 2
≥− 2 .
2
t log t 16 x log (4x/5)
2 x log x
Now, using this bound, apply Taylor’s theorem to the function loglog x for
θ(x) = x + S (x), expanded about x, to get for x ≥ 121
S (x) S (x)2 S (x)
loglog x + − 2 ≤ loglog θ(x) ≤ loglog x + .
x log x x log x x log x
Then, to derive the upper bound of the lemma, use (5.9) to get
1
log f (x) = U(x) + u(x) < U(x) +
2(x − 1)
1 1
≤ loglog θ(x) + − + B+
p≤x
p 2(x − 1)
S (x) 1 1
≤ loglog x + + − + B++
x log x p≤x p 2(x − 1)
1
= K(x) + .
2(x − 1)
The derivation of the lower bound is similar using the lower bound from the
application of Taylor’s theorem.
Next define for complex z with z < 1 and real x > 1 the integral
∞ z
t 1 1
Fz (x) := + dt. (5.10)
x t
2 log t
log2 t
This very useful function has some easy-to-derive properties set out in
the next lemma. Again, integration by parts is the workhorse! There is an
illustrative plot of F1/2 (x) later in Figure 5.6.
If z = 1/2 then
1 4
|rz (x)| ≤ √ 1+ . (5.12)
|1 − z| x log2 x log x
If z = 1/2 then
2 2 2 2 8
√ − √ 2
≤ F1/2 (x) ≤ √ − √ 2
+ √ . (5.13)
x log x x log x x log x x log x x log3 x
Finally, if z = 1/3 we have
3
0 ≤ F1/3 (x) ≤ . (5.14)
2x2/3 log x
Proof Split the integral and integrate the first term by parts to derive
∞
1 1
Fz (x) = t z−2
+ dt
x log t log2 t
∞ z−2 ∞ z−2
t t
= dt + 2
dt
x log t x log t
∞ ∞ z−2
xz−1 tz−2 t
= + dt + dt
(1 − z) log x x (z − 1) log t
2 2
x log t
∞
xz−1 −ztz−2
= + dt
(1 − z) log x x (1 − z) log t
2
xz−1
= + rz (x).
(1 − z) log x
Integrating by parts again we have
∞
z xz−1 2tz−2
rz (x) = − + dt dt. (5.15)
1 − z (1 − z) log2 x x (z − 1) log t
3
Lemma 5.17 [127, lemma 2.3] Let κ = κ(x) := log x/log 2. For x ≥ 16 define
κ
H(x) := 1 + x1/n−1/3
n=4
Then
(a) for j ≥ 9 and x ≥ 2 j we have H(x) ≤ H(2 j ), and
(b) for j ≥ 35 and x ≥ 2 j we have L(x) ≤ L(2 j ).
Proof (a) The function H(x) is continuous and decreasing on the interval
[2 j , 2 j+1 ) for all j ≥ 1. Therefore to prove (a) we need only verify H(2 j ) ≥
H(2 j+1 ) for all j ≥ 9. If j ≥ 20 then
j
H(2 j ) − H(2 j+1 ) = 2 j(1/k−1/3) (1 − 21/k−1/3 ) − 2( j+1)(1/( j+1)−1/3)
k=4
≥2 j(1/4−1/3)
(1 − 21/4−1/3 ) − 2( j+1)(1/( j+1)−1/3)
H(x)
2.6
2.4
2.2
2.0
1.8
x
200 000 400 000 600 000 800 000 1×106
to get
min δi = δ1886 > 1.03795 > 1,
11≤i≤1921
so by Lemma 5.17
4
G(x) ≤ H(x) ≤ H(232 ) < 1.32 < .
3
(3) Finally let x ≥ 64 × 1022 ≥ 279 . Use θ(x) ≤ x + T (x) to write
κ
κ
ψ(x) − θ(x) = θ(x1/n ) ≤ (x1/n + T (x1/n )),
n=2 n=2
5.3 Preliminary Results With RH True 119
To plot the graph of F1/2 (x) in Figure 5.6, we used the representation
2 log x
F1/2 (x) = √ − Ei − .
x log x 2
where J1 (x) satisfies 0 ≤ J1 (x) ≤ log(2π)/(x log x), and where the sum over ρ
is over all of the imaginary zeros of ζ(s) with increasing absolute value of the
imaginary part.
(The reader can explore properties of this and similar sums in [51].)
F(1/2)(x)
0.10
0.08
0.06
0.04
0.02
x
200 400 600 800 1000
to obtain
∞ ∞
tρ+1
g (t) dt = g (t)( 12 t2 − ψ1 (t) − g1 (t)) dt
x ρ
ρ(ρ + 1) x
The first series is convergent. Note that ρ(1 − ρ) > 0 and recall that by
Lemma 2.10,
1
β := = 2 + γ − log(4π) ≤ 0.047.
ρ
ρ(1 − ρ)
Then using the estimate for rρ (x) given by inequality (5.12) of Lemma 5.16
we get for x ≥ e4
Fρ (x) 0.047 × 1.5 0.08
≤ √ ≤ √ ,
ρ ρ x log x x log x
which completes the proof of the lemma.
and
√
W(x) + 2 1 4β x0 log2 x0
log f (x) ≤ − √ + √ 2 + β + + ,
x log x x log2 x log x0 2(x0 − 1)
from which (5.25) follows on substituting for √ x0 .
Now let v := −log f (x) and v0 := (2 + β)/( x0 log x0 ). For all real v we have
ev − 1 ≥ v so, because
1
− 1 = ev − 1 ≥ v = −log f (x),
f (x)
the upper bound of (5.25) implies the lower bound of (5.26). Then by (5.5)
and (5.25) we can write
W(x) + 2 2+β
v≤ √ ≤ √ ≤ v0 < 3.13 × 10−6 .
x log x x log x
Then, to obtain the upper bound notice that
ev0 v2 ev0 (2 + β)2 ev0 (2 + β)2 6.63 × 10−5
ev − 1 − v ≤ ≤ ≤ √ √ < √ .
2 2x log2 x 2 x0 x log2 x x log2 x
Then
1 6.63 × 10−5 6.63 × 10−5
− 1 = ev − 1 ≤ v + √ ≤ −log f (x) + √ .
f (x) x log2 x x log2 x
This completes the proof.
Now we need three estimates, first given by Rosser and Schoenfeld and
proved here in Lemmas 3.14, 5.12 and 3.13, respectively:
(1) For 563 ≤ x we have x(1 − 1/(2 log x)) < θ(x) and for 1 < x we have
θ(x) < x(1 + 1/(2 log x)).
(2) For 286 ≤ x we have
p
γ 1
< e log x 1 + .
p≤x
p−1 2 log2 x
√
(3) For 1423 ≤ x ≤ 108 we have x − 2 x < θ(x).
The next theorem should be compared with an analogous upper bound for
the ratio σ(n)/n given in Chapter 7, which is also unconditional. It might
also be compared with the unconditional lower bound given in Chapter 6,
Lemma 6.8, where n > 1 is restricted to being superabundant, i.e. for all 1 ≤
m < n we have
σ(m) σ(n)
< .
m n
To show how it leads on to other results, we have a network flow diagram,
Figure 5.7. The proof is in six steps.
126 Euler’s Totient Function
Lem 5.9
Thm 5.29
Lem 5.27
Lem 5.12
Thm 5.34 Cor 5.35
Lem 5.33
Lem 3.13
Lem 5.32
(2) Next we verify the theorem for log n ≤ 294: indeed, if Nm is the mth
primorial, and g(n) the right-hand side of the inequality of this theorem,
which is increasing for n in this range, then by Step (1), if Nm ≤ n < Nm+1 ,
and we have verified the result of the theorem at each primorial in the range
9 < m ≤ M, then for these values of m we would have
n p
≤ < g(Nm ) ≤ g(n).
ϕ(n) p≤p p − 1
m
(3) Now let x ≥ 5 and suppose log n < θ(x). Choose m so pm+1 ≤ x < pm+2 .
If pm+1 was 2 or 3 then pm+2 would be 3 or 5 so we would have the bound
x < 5, which is false. Hence 5 ≤ pm+1 , and thus pm ≤ pm+1 − 2 ≤ x − 2. By Step
(1) we therefore have
n p
≤ .
ϕ(n) p≤x−2 p − 1
(4) The next step is quite clever. Let n > 1 be an integer and the real number
y ≥ 2 be chosen to satisfy three conditions:
288 ≤ log n + y,
log n < θ(log n + y),
0.9 log n
y−2 ≤ .
loglog n
Then, by Lemma 5.12, replacing x by log n + y and using Step (3), since
log n < θ(log n + y) and (y − 2)/log n ≤ 0.9/loglog n we get
n 0.5
< log(log n + y − 2) +
eγ ϕ(n) log(log n + y − 2)
y−2 0.5
≤ loglog n + log 1 + +
log n loglog n
1.4
≤ loglog n + .
loglog n
Therefore
n 5
< eγ loglog n + .
ϕ(n) 2 loglog n
(5) Now assume 255 ≤ log n ≤ 1340 and set y = 2 + 2 1 + log n. This
implies 288 ≤ log n + y ≤ 1420.
Therefore using [145, theorem 19] (or compare with Lemma √ 3.13(a)),
which in particular shows that for 0 < x ≤ 1420.9 we have x − 2 x < θ(x), we
get
log n = log n + y − 2 log n + y < θ(log n + y).
The other two conditions in Step (4) are readily found to hold, so in the given
range the theorem follows by the result of Step (4).
(6) Finally assume 1340 ≤ log n. Choose y as 0.9 log n/loglog n and use
Lemma 3.14(e). With these choices for log n and y each of the hypotheses
listed in (4) are easily checked. This completes the proof.
Recall Nk := p1 · p2 · · · pk is the kth primorial, with pi the ith prime and
p1 = 2. Also recall the definition given in the introduction to this chapter:
n γ
c(n) := − e loglog n log n. (5.28)
ϕ(n)
128 Euler’s Totient Function
The following lemma shows that c(n) is very well behaved. In particular it
has a local maximum on the right-hand side at every primorial, and that if
ω(n) is small so is c(n).
Lemma 5.26 [127, lemma 3.1] Let n ≥ 2 and k ≥ 1 with n, k ∈ N. Fix k and
let n be such that either k = ω(n) or Nk ≤ n < Nk+1 . Then in each of these cases
we have c(n) ≤ c(Nk ).
Proof If k = ω(n) then Nk ≤ n and ω(n) ≤ k. If ω(n) > k then n ≥ Nk+1 , so
Nk ≤ n < Nk+1 implies Nk ≤ n and ω(n) ≤ k also.
α
Write n = Pα1 1 · · · P j j where P1 < P2 < · · · < P j are primes and αi ≥ 1 integers.
Then each Pi ≥ pi , where pi is the ith prime, so
n j
1 1
j 1
k
Nk
= ≤ ≤ =
ϕ(n) i=1 1 − 1/Pi i=1 1 − 1/pi i=1 1 − 1/pi ϕ(Nk )
and therefore
Nk γ
c(n) ≤ − e loglog n log n =: h(n), (5.29)
ϕ(Nk )
where for fixed k the expression on the right-hand side of (5.29) defines a
function h(n). Interpolate h on [Nk , Nk+1 ) with a differentiable real function
h(x) so
1 Nk γ γ
h (n) = − e loglog n − 2e
2n log n ϕ(Nk )
1 Nk γ γ
≤ − e loglog Nk − 2e .
2n log n ϕ(Nk )
We claim the term in parentheses in the upper bound is negative. For k = 1, 2
this can be tested numerically. For k ≥ 3 it is also negative because N3 = 30,
so loglog Nk ≥ loglog 30 and we can use the bound given in Theorem 5.25,
namely for all n ≥ 3, without any special assumptions,
n 2.50637
≤ eγ loglog n + ,
ϕ(n) loglog n
where the slightly increased numerator 2.50637 > 2.5 has been introduced to
accommodate the exception N9 . Therefore h(n) ≤ h(Nk ) = c(Nk ) so by (5.29),
c(n) ≤ c(Nk ) as claimed.
In Figure 5.8 there is plotted c(N3 ) and c(n) for 30 = N3 ≤ n < N4 = 210.
For x ≥ 1 recall the definition
xiρ
W(x) = ,
ρ
ρ(1 − ρ)
5.4 Further Results With RH True 129
c(n)
n
0 50 100 150 200
–2
–4
which for real x is real, since the zeros of ζ(s) occur in complex conjugate
pairs.
Lemma 5.27 [127, proposition 3.1] Assume RH is true and that 109 ≤ pk ≤
x < pk+1 where pi is the ith prime. Then for c(Nk ) we have the upper bound
0.07 0.07
c(Nk ) ≤ eγ (2 + W(x)) − ≤ eγ (2 + β) −
log x log x
and the lower bound
3.7 3.7
eγ (2 − β) − ≤ eγ (2 + W(x)) − ≤ c(Nk ).
log x log x
Proof Firstly, the definitions of f (x) and c(n) ((5.5) and (5.28) respectively)
and the identity log Nk = θ(pk ) = θ(x) imply
1
c(Nk ) = eγ θ(x) log θ(x) −1 . (5.30)
f (x)
Equation (5.18) and the bound θ(x) ≥ 4x/5, x ≥ 121, from Lemma 3.10(d),
give
√ θ(x) 2 + log t
θ(x) log θ(x) − x log x = √ dt
x 2 t
log(4x/5) + 2
≤ |θ(x) − x| √
2 4x/5
130 Euler’s Totient Function
√
5 2 + log x
≤ T (x) √
4 x
√
5
= log2 x(log x + 2).
32π
Therefore, using x0 = 109 again:
√ √
θ(x) log θ(x) 5 log2 x (2 + log x)
√ − 1 ≤ √
x log x 32π x log x
√
5 log2 x0 (2 + log x0 ) 6.9 × 10−3
≤ √ ≤ .
32π x0 log x log x
Now use Lemma 5.24 and (5.30), (5.26) and (5.5), recall that by
Lemma 2.10
1
β := = 2 + γ − log(4π),
ρ
ρ(1 − ρ)
Theorem 5.29 If the Riemann hypothesis is false then there exists a real b
with 0 < b < 1/2 such that, as x → ∞,
1
log f (x) = Ω± b .
x
Proof The proof is in seven steps.
(1) First apply Theorem 4.12 with h(x) = J(x) for x ≥ 2 and h(x) = 0 for
1 ≤ x ≤ 2. For s > 1 set
∞ ∞ ∞
J(x) 1 R(t) 1 1
G(s) := dx = + dt dx.
2 xs 2 x s x t2 log t log2 t
132 Euler’s Totient Function
The prime number theorem gives |R(t)| = O(t/log t). Hence, the double
integral is absolutely convergent. Therefore we can use Fubini’s theorem to
change the order of integration and write
∞ t
R(t) 1 1 dx
G(s) = 2
+ 2
dt,
2 t log t log t 2 x s
which gives
∞
1 R(t) 1 1
G(s) = 2 J(2) −
1−s
+ dt . (5.31)
s−1 2 t1+s log t log2 t
Note that the term on the right-hand side inside the outer parentheses is
holomorphic for s > 1 and vanishes at s = 1.
(2) Next, for s > 1 we can write
∞ ∞ ∞ 2
R(t) ψ(t) dt dt
s+1
dt = s+1
dt − s
+ s
.
2 t 2 t 1 t 1 t
The third integral on the right-hand side is an entire function (1 − 21−s )/(s − 1)
which we call E1 (s).
Therefore for s > 1 we have, by Lemma 5.28,
∞
R(t) 1 ζ (s) 1
dt = − − + E1 (s). (5.32)
2 t s+1 s ζ(s) s − 1
We know from the relationship [6, theorem 13.11], applied with 0 < s < 1,
∞
(−1)n+1 1 1 1
(1 − 2 )ζ(s) =
1−s
= 1 − s + s − s + · · · > 0,
n=1
ns 2 3 4
that, since 1 − 21−s < 0 and ζ(0) = − 12 , ζ(s) has no zero in [0, ∞), so therefore
there is a real number δ > 0 such that there are no zeros of ζ(s) in a simply
connected set
that
∞
R(t) 1
dt = −G1 (s) + λ,
t s+1 log t
∞2
R(t) 1
dt = G2 (s) + λs + μ.
2 t s+1 log2 t
By (5.31) we have
1
G(s) = (G1 (s) − G2 (s) + E2 (s)), (5.33)
s−1
where E2 (s) is entire. The expression in parentheses in (5.33) is a function
holomorphic on Θ which by Step (1) vanishes at s = 1. Therefore the right-
hand side of (5.33) is holomorphic on Θ.
In the neighbourhood of a zero ρ of ζ(s) of multiplicity m we have
d 1 ζ (s) 1 m
G1 (s) ∼ − ∼ ,
ds s ζ(s) ρ (s − ρ)2
1 ζ (s) 1 m
G2 (s) ∼ − ∼− ,
s ζ(s) ρ s−ρ
so G1 (s) − G2 (s) would have a pole of order 2 at s = ρ and ρ ≥ 12 . This
shows that σ0 ≥ 12 so σ0 ∈ Θ and G(s) is holomorphic at σ0 .
If R(x) had the same sign for x sufficiently large, by Theorem 4.12, G(s)
would have a singularity at σ0 , but is holomorphic at that point. Therefore
R(x) changes sign infinitely often as x → ∞.
(4) Now let θ be a fixed real number satisfying 0 < θ < 1/2. Suppose there
exists a zero ρ of ζ(s) with ρ = β > 1 − b where b is a real number which
satisfies 1 − θ < b < 1/2. We will show that J(x) − x−b and J(x) + x−b do not
keep a constant sign as x → ∞. To see this first calculate the Mellin transform
∞
J(x) − x−b 1
dx = G(s) − + E3 (s), s > 1,
2 x s s − 1 +b
2
where E3 (s) := 1 x−b−s dx is an entire function. The second term extends to a
holomorphic function on Θb := Θ ∩ {s : s > 1 − b}.
If J(x) − x−b had a constant sign as x → ∞, by Theorem 4.12 the abscissa of
convergence of the Mellin transform would be less than or equal to 1 − b ≤ β.
This is impossible since, by Step (4), G(s) has a singularity ρ with real part
β. Then adopt the same procedure for J(x) + x−b to show that it too does not
ultimately have constant sign.
(5) Recall the definition
∞
S (t) 1 1
K(x) = + t dt
x t2 log t log2
134 Euler’s Totient Function
(7) This is the final step. We claim there are an infinite number of values of
i with y(xi ) > 0. Indeed, if the function y(x) was negative or zero at all local
extrema, it would be negative everywhere, and this would mean, for all x,
K(x) ≤ x−b . But, as has been shown, K(x) = Ω+ (x−b ) where b := (1 + b − θ)/2.
At such a point xi we have
S (xi )2
K(xi ) − > xi−b + O(xi−2b log xi ),
xi2 log xi
giving, by Lemma 5.15, log f (x) = Ω+ (x−b ). To show that log f (x) = Ω− (x−b ),
also consider K(x)+ x−b and use the same approach. This completes the proof.
0.15
0.10
0.05
n
0 20 000 40 000 60 000 80 000 100 000
Δk
k
1000 2000 3000 4000
–0.00002
–0.00004
–0.00006
–0.00008
–0.0001
definitions:
n γ
c(n) := − e loglog(n) log n,
ϕ(n)
xiρ
W(x) := ,
ρ
ρ(1 − ρ)
1
β := = 2 + γ − log(4π) = 0.046191 . . . ,
ρ
ρ(1 − ρ)
1
γ
f (x) := e log θ(x) 1− .
p≤x
p
For a plot of values of the sequence c(n) for n ∈ [1, 1000], see Figure 5.11.
Next we give a form of Dirichlet’s approximation theorem [5, section 7.2],
which we need to show the supremum of W(x) is β:
Lemma 5.32 Given > 0, fixed real numbers α1 , . . . , αN and a real t0 > 0,
then there exist integers x1 , . . . , xN and real t ≥ t0 such that
|tα j − x j | < , 1 ≤ j ≤ N.
Proof Let q ∈ N satisfy 1/q < and set a = (α1 , . . . , aN ) ∈ RN . Let the real
variable u take the 1 + qN values
0, t0 , 2t0 , . . . , qN t0 .
c(n)
n
200 400 600 800 1000
–2
–4
–6
Then, if as usual we denote the fractional part of the real variable x by {x}, the
1 + qN points ({uα1 }, . . . , {uαN }) ∈ [0, 1]N . This cube can be decomposed into
exactly qN regular subcubes each having all sides of length 1/q, so at least
two of these points are in a particular subcube. If the corresponding values
of u are u1 and u2 with u1 < u2 , then for 1 ≤ j ≤ N we have integers n j and m j
such that
1 1
|{u2 α j } − {u1 α j }| ≤ =⇒ |u2 α j − n j − u1 α j + m j | ≤
q q
1
=⇒ |(u2 − u1 )α j − (n j − m j )| ≤
q
=⇒ |tα j − x j | ≤ .
Finally observe that each x j is an integer and t = u2 − u1 ≥ t0 to complete the
proof.
Lemma 5.33 Assume RH is true. Then
lim sup W(x) = β.
x→∞
Proof Let > 0 and t0 > 0 be given. Choose N sufficiently large so that
N
2
> β − ,
j=1
|ρ j |2
Let α j := γ j /(2π) where ρ1 = 12 +iγ1 is the first complex zeta zero with positive
imaginary part. By Lemma 5.32 there is a t ≥ t0 such that for 1 ≤ j ≤ N we
have |tα j − x j | < /(2π). This enables us to write |tγ j − 2πx j | < which implies
cos(tγ j ) ≥ cos() ≥ 1 − . Therefore letting x = et we have
N
2 cos(γ j log x) 1
W(x) ≥ −
j=1
|ρ j |2 |ρ|>|ρ |
|ρ|2
N
N
2(1 − )
≥ −
j=1
|ρ j |2
≥ (1 − )(β − ) − .
Since x may be chosen arbitrarily large and W(x) ≤ β this shows that
lim supx→∞ W(x) = β.
140 Euler’s Totient Function
so (1) follows.
(2) Let, as before, x0 = 109 and k0 = π(x0 ) = 50 847 534. Setting k1 :=
120 568 and testing the equation numerically shows that for k1 < k ≤ k0 we
have c(Nk ) < eγ (2 + β). To do this we used a Mathematica procedure based on
the following pseudocode where γ is Euler’s constant and pk is the kth prime:
Set k0 , k1 and the precision d
Evaluate eγ numerically
Set the number of failures to 0
Set c to eγ (4 + γ − log(4π)) numerically
Set rk to the numerical product of pk /(pk − 1) from k = 1 to k = k1 − 1
Set LogNk to the numerical sum of log(pk ) from k = 1 to k = k1 − 1
For k from k1 to k0 update rk and LogNk and then evaluate
√
cNk which is (rk − eγ log(LogNk)) LogNk
Test to see if cNk > c, report any failure, update the number of failures
Return the number of failures
See the RHpack function NicolasTwoFailures. Also see Figure 5.12 for a plot
of c(Nk ) values.
For k > k0 Lemma 5.27 gives c(Nk ) < eγ (2+β) so, by Lemma 5.26, choosing
k appropriately, c(n) ≤ c(Nk ) < eγ (2 + β) so (2) follows.
(3) A Mathematica computation similar to that in (2) shows that for 1 ≤
k ≤ k0 we have c(N1 ) ≤ c(Nk ) ≤ c(N66 ) and then use Lemma 5.27 to complete
the proof that c(n) ≤ c(N66 ) for all n ≥ 2.
(4) By Lemma 5.27, for all k ≥ k0 we get
3.7
c(Nk ) ≥ eγ (2 − β) − = 3.30 · · · > c(2),
log x0
so, using the lower bound from the computation in (3), (4) is true also.
5.7 Nicolas’ Second Theorem 141
3.64
3.62
3.60
3.58
Now assume RH is false. By the prime number theorem in the form θ(x) ∼ x
and using Mertens’ theorem (see Lemma 5.12) we have lim x→∞ f (x) = 1.
Observe that f (x) is locally constant, which goes to the essence of
its definition. If pk is the kth prime, when pk ≤ x < pk+1 , f (x) =
eγ loglog(Nk )ϕ(Nk )/Nk . Also, when k → ∞, because log(1+y)/y → 1 as y → 0,
if we set 1 + 1/y = f (pk ) we get:
1 1
log 1 + ∼ log f (pk ) ∼ = f (pk ) − 1,
y y
so log f (pk ) ∼ f (pk ) − 1. Therefore, since log Nk ∼ pk and by Theorem 5.25
and Lemma 5.12 we have
Nk
∼ eγ loglog Nk ,
ϕ(Nk )
we can derive
Because RH fails, there is a real number b with 0 < b < 1/2 such that, by
Theorem 5.29, we have log f (x) = Ω± (1/xb ), so in this case, where f (x) is
constant between primes, we must have
e−γ c(Nk )
= Ω± (1),
(log pk )p1/2−b
k
so lim inf n→∞ c(n) = −∞ and lim supn→∞ c(n) = ∞. Therefore each of (1), (2),
(3) and (4) is false also. This completes the proof of the equivalences.
Now consider the inequality derived in Theorem 5.25, namely for all n ≥ 3
except n = N9 we have
n 5
< eγ loglog n + .
ϕ(n) 2 loglog n
From Theorem 5.34 part (2), we see that RH is equivalent to a stronger
form of this inequality, at least for n sufficiently large. We call this Nicolas’
second criterion, something of a misnomer since his second theorem is more
comprehensive, but is given because of its simplicity and because it takes the
same shape as other criteria.
Corollary 5.35 (Nicolas’ second criterion) The Riemann hypothesis is
equivalent to the following inequality holding for all n ≥ N120569 :
n eγ (2 + β)
< eγ loglog n + .
ϕ(n) log n
Note also from Chapter 6, Lemma 6.8, for n a sufficiently large
superabundant number with maximum prime divisor p, that, without RH,
n σ(n) n 0.6
(1 − (p)) < < < eγ loglog n + ,
ϕ(n) n ϕ(n) loglog n
where (p) := (1 + 3/(2 log p))/log p and provided n ≥ Nk with pk ≥ 20 000.
Remark A computation shows the inequality of Corollary 5.35 fails for
primorials Nk with k in the following ranges:
[9, 27 220], [28 168, 40 969], [41 368, 56 605],
[70 250, 82 904], [118 349, 120 568],
and succeeds for all other primorials with 1 ≤ k ≤ 120 568.
(3) Explain the remark following the corollary to Theorem 5.34 in terms of
the distribution of primes.
(4) Find a relationship between gaps, or sets of gaps, between primes which
is equivalent to the Riemann hypothesis.
(5) Prove that RH implies eγ loglog n < n/(ϕ(n) + 1) for all n ≥ 2.
(6) Lehmer’s conjecture of 1932: if ϕ(n) | n − 1 then necessarily n is prime.
(7) Carmichael’s totient function conjecture: for all m, #{n : ϕ(n) = m} > 1.
(8) The Duffin–Schaeffer conjecture of 1941: if f : N → (0, ∞) then for
almost all α (with respect to Lebesgue measure), the inequality
α − p < f (q)
q q
has infinitely many solutions in coprime integers p, q with q > 0 if and
only if the sum
∞
ϕ(q)
f (q) = ∞.
q=1
q
6
A Variety of Abundant Numbers
6.1 Introduction
In this chapter we present a study of superabundant and colossally abundant
numbers. The material is included because these are the numbers which
appear as possible counterexamples to the inequality of Robin, which we
will show in Chapter 7 to be equivalent to RH.
Paul Erdős (Figure 6.1) made fundamental contributions to our understand-
ing of these numbers, especially in two joint papers, one with Alaoglu in 1944
[3] and the other with Nicolas in 1975 [56]. Erdős was the most prolific and
connected mathematician of all time, in terms of both the number of articles
he published and the number of his joint authors. He never held a regular
position, but travelled the world, staying for a time with one or other of his
joint authors and discussing mathematical problems. These were in a wide
range: graph theory, combinatorics, analysis, probability and set theory, as
well as number theory.
All of the numbers of interest share the property that, apart from exceptions
like 1, they have many divisors, with many of the divisors being large. Here
now are some definitions. A positive integer n is said to be highly composite
if the number-of-divisors function d(n) satisfies, for 1 ≤ m < n, d(n) > d(m).
In other words, a highly composite number n has more divisors than any
smaller number. It is called abundant provided σ(n)/n > 2. We say it is
highly abundant if σ(n) > σ(m) for all m < n. It is superabundant if
σ(n)/n > σ(m)/m for all 1 ≤ m < n. Some authors denote these numbers as
“SA”. It is colossally abundant, sometimes denoted “CA”, if there exists an
> 0, depending on n, such that for all m > 1 we have
σ(m) σ(n)
≤ 1+ .
m1+ n
There are relations between these types of numbers. For example all
superabundant numbers are highly abundant and all colossally abundant
144
6.1 Introduction 145
numbers are superabundant. To see this, suppose that n > 1 were colossally
abundant but not superabundant. Then for some m < n we would have
σ(n)/n < σ(m)/m. But this would mean that for some > 0 we would have
Here we are interested in the ratios σ(n)/n and, for > 0, σ(n)/n1+ , over
initial sequences of natural numbers. If Nk = 2 · 3 · · · pk is the kth primorial,
then
σ(Nk )
k
σ(n)
= (1 + p j ) =⇒ lim sup = ∞,
Nk j=1 n→∞ n
but since for each prime p, σ(p)/p = 1 + 1/p, we have lim inf n→∞ σ(n)/n = 1.
It can be shown that {σ(n)/n : n ∈ N} is, in the usual topology, dense in (1, ∞).
Even though the values of both σ(n) and σ(n)/n exhibit a high degree
of randomness as functions of n, they have very nice Dirichlet series
representations. For example
∞
σ(n)
= ζ(s)ζ(s − 1), s > 2,
n=1
ns
and
∞
σa (n)σb (n) ζ(s)ζ(s − a)ζ(s − b)ζ(s − a − b)
= , s > σ0 ,
n=1
ns ζ(2s − a − b)
where
For further background material and references for σ(n) the reader might
consult [6, 75, 150].
The following RHpack (see Appendix B) functions relate to the mate-
rial in this chapter: HighlyCompositeQ, HardyRamanujanQ, SuperAbun-
dantQ, ColossallyAbundantQ, ColossallyAbundantInteger, ColossallyAbun-
dantF and CriticalEpsilonValues.
6.2 Superabundant Numbers 147
n = 2a2 3a3 · · · pa p ,
https://oeis.org/A004394/b004394.txt
which is:
Note that q x−1 ≤ r − 1 so q x ≤ q(r − 1) giving q x − 1 < q(r − 1). Using this we
have
with ai ≥ 0, am ≥ 1, with pi the ith prime. Let S := {1, 2, 4, 6, 24, . . .} be the set
of all superabundant numbers. Let A := max(ai : 1 ≤ i ≤ m) be the maximum
of the exponents of n so n | (p1 · · · pm )A . Let
F := {s ∈ S : pmA s}.
Because of Lemma 6.1, if pmA | s we would have n | s. Hence the set of
superabundant numbers s for which n s is a subset of those for which pmA
does not divide s, i.e. F.
We claim F is a finite set. To see this let s = pb11 · · · pl l be in F so bm < A.
b
0.04
0.03
0.02
0.01
α=2
x α=3
5 10 15 20
Figure 6.2 The functions F(3, α) and F(2, α) showing F(3, α) < F(2, α).
Lemma 6.12 [56, proposition 4(c)] If the sets (Eq )q∈P are pairwise disjoint,
the set of colossally abundant numbers is equal to the set of ni with i ≥ 1. The
function n → σ(n)/n1+i attains its maximum at the two points ni and ni+1 .
Proof Choose = i = F(q, β). For p q, since the Eq are pairwise disjoint,
E p so α p () is given by the expression of Lemma 6.10. By Lemma 6.9 the
exponent of q can be chosen to be β or β − 1. In the first case we get ni+1 and
in the second ni with
qβ+1 − 1 σ(ni ) σ(ni+1 )
q1+i = ⇐⇒ = ,
qβ − 1 n1+
i
i
n1+
i+1
i
σ(n)/n1+∈4
2.0
1.5
1.0
0.5
n
0 20 40 60 80 100 120
σ(n)/n1+∈
2.0
1.5
1.0
0.5
n
0 10 20 30 40 50 60 70
so the left-hand side is an integer and the right-hand side not an integer. (The
theorem of Gel’fond and Schneider, that if α and β are algebraic numbers
with α 0 or 1 and if β is not a rational number, then any value of αβ is a
transcendental number, would show that in fact is a transcendental number.)
6.3 Colossally Abundant Numbers 159
where the products are over all primes, and for all p, 0 ≤ α p () ≤ α p ( ).
(2) For each prime p define a function on (0, 1) by
1+t
p −1
f p (t) := log p ,
pt − 1
and note that f p (t) increases as t decreases, and is a continuous function of t.
(3) By Lemma 6.10 we have α p () = f p () − 1 and α p ( ) = f p ( ) −
1. For fixed p, as t decreases from to , since n and n are consecutive
colossally abundant numbers and f p (t) is continuous, α p ( ) must differ from
160 A Variety of Abundant Numbers
α p () by at most one, and in such a case where they differ, f p (i ) must be a
positive integer greater than 1, say f p (i ) = m. This means
pi +1 − 1 pm − 1
= pm =⇒ pi = ,
pi − 1 pm − p
which is a rational number.
(4) If this exponent jump occurs at three or more distinct primes p, q, r
then we would have a real i with pi , qi and ri each rational. But again by
Lemma 6.14, this is impossible. This completes the proof.
G(n) ≤ max(G(M),G(N)).
Proof By Lemmas 6.12 and 6.13 there exists an index i ≥ 1 such that M = ni
and N = ni+1 and such that these two numbers are critical points giving the
same maximum value for the function n → σ(n)/n1+i .
The function f (x) := i x − loglog x is concave upwards for x ≥ 2, so for
2 ≤ a ≤ ξ ≤ b we have f (ξ) ≤ max( f (a), f (b)). Because ni ≤ n ≤ ni+1 this
means
i log n − logloglog n
≤ max(i log ni − logloglog ni , i+1 log ni+1 − logloglog ni+1 ),
or taking exponentials
ni ni i ni+1
i
≤ max , . (6.3)
loglog n loglog ni loglog ni+1
6.4 Estimates for x2 () 161
Therefore
u2 √
(1 + z + z ) 1 + u −
2
≤ 2x(1 − 2.05u2 ) + 1.054 2x
log 2
2
√ log 2
= 2x + 1.06 2x − 4.1x < 2x,
log x
where the final inequality follows since for x ≥ 212 we have
2
√ log 2
1.06 2x − 4.1x < 0.
log x
This proves inequality (6.8) for all x > 212 . For 1530 ≤ x ≤ 212 , inequality
(6.14) can easily be verified numerically, so the proof of the lemma is
complete.
We also need the following with k = 3, stated in [56] to be proved along
the same lines as lemma 2(b) in [3].
Lemma 6.18 As → 0 we have xk ∼ (kx1 )1/k .
Proof We have for x = x1 and y = xk
1 1
log 1 + log 1 +
y + y2 + · · · + yk x
= = .
log y log x
Note that → 0+ if and only if x → ∞ if and only if y → ∞. Hence, using
log(1 + 1/x) ∼ 1/x as x → ∞ we get
yk log y ∼ x log x. (6.19)
Taking logarithms gives k log y ∼ log x. Inserting this in (6.19) then gives yk ∼
kx, completing the proof of the lemma.
(6) For fixed k, the function t → F(t, k) is strictly decreasing for all t > 0,
and tends to zero when t → ∞. Therefore, as k → ∞, if n is colossally
abundant with xk := xk (), k ∈ N,
xk ∼ (kx)1/k .
This result is in [3]. Prove it and then give an error estimate.
(7) It was proved in [3] that the number of superabundant numbers N(x) up
to x > 0 satisfies
log x loglog x
N(x)
.
(logloglog x)2
Improve this bound and find a lower bound for N(x).
(8) Find “book proofs” for some of the more difficult theorems and lemmas
in [3].
(9) Find an RH equivalence expressible in terms of superabundant numbers.
7
Robin’s Theorem
7.1 Introduction
This chapter has as its focus Robin’s theorem, an explicit inequality involving
the sum-of-divisors function, valid on an explicit range, its validity being
equivalent to RH. It forms part of what we call the Ramanujan–Robin
criterion. We include Ramanujan because this great Indian mathematician
first demonstrated the truth of the inequality assuming RH, albeit the
asymptotic form.
The life of Srinivasa Ramanujan (pronounced Ra-manu-jan) requires no
introduction, since it has been celebrated in books, plays and film. His vast
mathematical contributions have been treated in depth in the monumental
works of Bruce Berndt, including [10].
Guy Robin was supervised by Jean-Louis Nicolas (the Nicolas of
Chapter 5) at the Université de Limoges, and received his Ph.D. in 1983 with
a thesis entitled “Grandes valeurs de fonctions arithmétiques et problèmes
d’optimisation en nombres entiers”. One of his students, Pierre Dusart, is
making important contributions to the ongoing development of this approach
to RH. According to [128], Robin did not learn of Ramanujan’s unpublished
result, showing Robin’s inequality was necessary if RH was true, until several
years after he had discovered his theorem and proof.
We have already seen some introductory and historical information
regarding σ(n) in Chapter 6. It should come as no surprise that, because of the
close relationships between the sum-of-divisors function and Euler’s totient
function ϕ(n), and the equivalences to RH developed in Chapter 5, there are
equivalences to RH which can be expressed in terms of σ(n).
Paul Erdős was interested in common properties of σ(n) and ϕ(n) through-
out his career, and authored or co-authored many papers exploring this
connection. For example Alaoglu and Erdős [3] state that both σ(n) and ϕ(n),
for every > 0, apart from a set of density zero, are divisible by every prime
less than (loglog n)1− and by “relatively few” primes larger than (loglog n)1+ .
165
166 Robin’s Theorem
where f (x) g(x), for positive functions f (x) and g(x), means that there are
numbers α > 0, β > 0 with, as x → ∞, αg(x) ≤ f (x) ≤ βg(x). Moreover the
same is true [53, 58] for the size of the sets of values
#{σ(n) : n ≤ x} #{ϕ(n) : n ≤ x}.
In addition, the normal order of ω(σ(n)) is the same as the normal order
of ω(ϕ(n)), namely (loglog n)2 /2. Recall that we say arithmetic function f (n)
has normal order ρ(n), for some non-negative non-decreasing function ρ(n),
if for every > 0 we have
% &
# n ≤ x : f (n)
− 1 <
ρ(n)
lim = 1.
x→∞ x
See [75] or [95] for the development of this topic.
Work has also been done on special values of these functions; for example
see [27].
Recently there has been much work on “joint properties” shared by these
two functions. For instance there exists an infinite number of squarefree
integers n such that σ(n) ϕ(n) is a square [29]. In a similar vein Ford, Luca
and Pomerance [61] proved that the two functions have an infinite number
of values in common, that is σ(a) = ϕ(a ) for an infinite set of integer pairs
(a, a ) (see also Ford and Pollack [62, 63]). The case of when ϕ(n) divides
σ(n) has also been explored [28, 30].
Now we focus on the material in this chapter. Consider the growth of
σ(n)/n. On the one hand, whenever n is a prime we have σ(n)/n = 1 + 1/n,
so the range is arbitrarily close to 1. On the other hand, σ(n) cannot grow too
quickly. Grönwall [69] showed that
σ(n)
lim sup = eγ . (7.1)
n→∞ n loglog n
We do not use this result in this chapter, but it is proved and used in Chapter 9
as Theorem 9.2. What is interesting is the manner in which this limit superior
is approached. Ramanujan [136], assuming the Riemann hypothesis, proved
that σ(n)/(n loglog n) is asymptotically bounded above by eγ . That is he
showed:
7.1 Introduction 167
Theorem 7.2 [136, 137] Assume the Riemann hypothesis is true. Then there
exists an n0 ≥ 1 such that
σ(n)
< eγ , (7.2)
n loglog n
for all n ≥ n0 .
Robin [142], using explicit estimates for arithmetic functions given by
Rosser and Schoenfeld [145, 153, 154], set out in Chapter 3, gave an explicit
value of n0 in Theorem 7.2. In [142, proposition 3, p. 204], he proves:
Theorem 7.11 [142, theorem 1] Assume the Riemann hypothesis is true.
Then
σ(n) < eγ n loglog n (n ≥ 5041). (7.3)
Moreover the values of 1 < n ≤ 5040 for which the inequality in (7.3) fails are
listed in the set A, where
A := {1, 2, 3, 4, 5, 6, 8, 9, 10, 12, 16, 18, 20, 24, 30, 36, 48,
60, 72, 84, 120, 180, 240, 360, 720, 840, 2520, 5040}. (7.4)
The inequality in expression (7.3) is known as “Robin’s inequality”. Note
that in this volume the term does not include the lower bound n ≥ 5041 for
the range of validity, which can sometimes cause confusion. We attempt to
make the range explicit whenever the term is used.
Since Grönwall’s result (7.1) is unconditional, one may expect an explicit
inequality similar to inequality (7.2) in which limit of the right-hand side was
eγ . Robin proved such a result (with a different constant) in:
One may therefore combine Theorems 7.11 and 7.15 to give the following
equivalence for RH, which is known also as “Robin’s theorem”:
Theorem 7.16 (Ramanujan–Robin criterion) [142, theorem 1] A necessary
and sufficient condition for the Riemann hypothesis is that
σ(n) < eγ n loglog n (n ≥ 5041). (7.7)
Because of the function σ(n), the ratio of the left side to the right side of
inequality (7.3) is far from smooth.
We see in Figure 7.1 that, even for small values of n > 5040, the ratio is
frequently close to 1.0.
Lagarias [100] found a modification of Theorem 7.16, namely:
Theorem 7.18 (Lagarias’ criterion) [100, theorem 1.1] Let Hn = nj=1 j−1
represent the nth harmonic number in Q. A necessary and sufficient condition
for the Riemann hypothesis is that
σ(n) ≤ Hn + exp(Hn ) log(Hn ) (7.8)
is true for all n ≥ 1.
To summarize this chapter: Section 7.2 gives the proof of Ramanujan’s
theorem, which assumes RH. In Section 7.3 a lower bound
for p≤x (1 −
1/p) is derived. Section 7.4 gives an upper bound for p≤x (1 − 1/p2 ). In
Section 7.5 there is a lower bound for loglog N and in Section 7.8 an
σ(n)/neγ loglog(n)
0.8
0.6
0.4
0.2
n
5200 5400 5600 5800 6000
upper bound. Section 7.6 includes the proof of Robin’s theorem in the
case RH is true. Section 7.7 derives Robin’s unconditional upper bound for
σ(n)/n. Section 7.9 gives the proof of Robin’s theorem when RH is false.
In Section 7.10, we give the proof of Lagarias’ theorem, which depends on
Robin’s theorem. In Section 7.11 an unconditional upper bound related to
Lagarias’ formulation is presented, and in the final Section 7.12, some ideas
relating to the sum of unitary divisors of an integer are discussed.
The substantial contributions of Tim Trudgian (Figure 7.2) to this volume,
especially this essential chapter, are gratefully acknowledged.
The following RHpack (see Appendix B) functions relate to the material
in this chapter: RobinsInequalityQ, PlotRobinsInequality, SigmaOverN,
LagariasInequalityQ and UnitaryDivisorSigma.
To obtain an overview of the dependences between the results in this
chapter, the reader is invited to consult Figures 7.3, 7.4, 7.5 and 7.6.
Lem 3.14
Lem 7.7
Lem 7.10
Lem 5.7
Thm 4.6
Thm 6.16
Thm 7.11
Lem 5.24
Thm 7.2
Lem 7.8
Lem 7.1
Lem 6.17
Lem 7.9 Lem 2.10
Lem 3.12
Looking ahead, Lemma 8.9 of Akbary and Friggstad will show that if a
counterexample exists to Robin’s inequality greater than 5040, then the first
such example must be a superabundant number.
7.2 Ramanujan’s Theorem Assuming RH 171
Lem 3.13
Lem 3.10
Lem 7.5
Thm 2.1
Lem 7.4
Lem 7.6
Lem 7.8 Lem 7.7
Lem 3.14
Lem 7.3
Thm 4.6
Lem 3.11
Thm 7.13
Lem 7.12
Now we present the statement of Theorem 7.2. Note that the proof depends
on the result of Lemma 7.10, which is proved in Section 7.5.
Theorem 7.2 (Ramanujan) [136, 137] Assume the Riemann hypothesis is
true. Then there exists an n0 ≥ 1 such that
σ(n)
< eγ , (7.9)
n loglog n
for all n ≥ n0 .
Proof Assume there exists an arbitrarily large counterexample N to inequal-
ity (7.9), which by Lemma 7.1 is colossally abundant. Then, by Theorem 6.16
and the results of Chapter 6 we have
σ(N)
1
1 1
= 1+ 1 + + 2 × ··· , (7.10)
N x <p≤x
p x <p≤x p p
2 3 2
7.3 Ramanujan’s Theorem Assuming RH 173
√
where the xi are defined in Chapter 6. By Lemma 6.17, we have x2 < 2x, so
that we can bound (7.10) by
√
x2 <p≤x (1 − 1/p ) 2x<p≤x (1 − 1/p )
2
σ(N) 2
≤ ≤ . (7.11)
N p≤x (1 − 1/p) p≤x (1 − 1/p)
Using the Maclaurin series for −log(1 − z) and neglecting all terms other than
the first two we see that
⎛
⎜⎜⎜ 1 ⎞⎟
⎟⎟
1 − 2 = exp ⎜⎜⎜⎜⎝− + O 3/2 ⎟⎟⎟⎟⎠
1 1
p p 2 x
√ √
2x<p≤x 2x<p≤x
⎛ √ ⎞
⎜⎜⎜ − 2 1 ⎟⎟⎟
= exp ⎝ √ ⎜ +O √ ⎟⎠ , (7.12)
x log x x log2 x
which gives an upper bound for the numerator in (7.11). To bound the
denominator below, one may use the result of Nicolas, Lemma 5.24, namely
1
e−γ
−(2 + β)
1
1− ≥ exp √ +O √ , (7.13)
p≤x
p log θ(x) x log x x log2 x
and so
1
S (x)
= loglog θ(x) + B + loglog x − loglog θ(x) +
p≤x
p x log x
∞
S (t)(log t + 1)
− dt. (7.18)
x (t log t)2
The purpose of writing the expression this way is to combine it with one
which comes from the more commonly found definition for B, which can be
written:
1
1 1
1
log 1 − = B−γ− − log 1 − + . (7.19)
p≤x
p p≤x
p p>x p p
where
∞ ∞
R(t)(log t + 1) (ψ(t) − θ(t))(log t + 1)
J1 (x) = dt and K1 (x) = dt.
x (t log t)2 x (t log t)2
(7.21)
The following Lemma 7.3 has the same derivation using integration by
parts as that of Lemma 5.16, which has a more restricted formulation.
Lemma 7.3 [142, lemma 1] For n ≥ 1 and ρ a complex number with 0 <
ρ < 1, set ∞ ρ
t n log t + 1
Fρ,n (x) := dt. (7.22)
x t n+1
log2 t
Then
n xρ−n
Fρ,n (x) = + rρ,n (x), (7.23)
n − ρ log x
where ∞ ρ−n−1
ρ xρ−n t
rρ,n (x) := − 2 +2 dt . (7.24)
(ρ − n)2
log x x log3 t
If ρ = 12 then for n ≥ 1 we have |ρ| ≤ |ρ − n| and thus
1 4
|rρ,n (x)| ≤ 1+ . (7.25)
|ρ|xn−1/2 log2 x (2n − 1) log x
This completes the statement.
We shall apply Lemma 7.3 where ρ is a non-trivial zero of the Riemann
zeta function. In particular, if we assume the Riemann hypothesis is true we
have ρ = 12 .
The next lemma bounds not only the first integral appearing in (7.21), but
also a generalization, which we shall need in Section 7.4.
Lemma 7.4 [142, lemma 2] Assume the Riemann hypothesis is true. For
n ≥ 1 and for x ≥ 2, let
∞
n log t + 1
Jn (x) = R(t) dt (7.26)
x tn+1 log2 t
and recall β := ρ 1/|ρ|2 , the sum over the imaginary zeros of ζ(s). Then
1 4 2 log 2π
β n+ + + √
log x (2n − 1) log2 x β x
−
xn−1/2 log x
1 4
β n+ +
log x (2n − 1) log2 x
≤ Jn (x) ≤ . (7.27)
xn−1/2 log x
176 Robin’s Theorem
Proof Consider first the explicit formula in Theorem 2.1 (or see for example
[80, theorem 29]):
xρ
1 1
ψ(x) = x − − log 2π − log 1 − 2 (7.28)
ρ
ρ 2 x
and write
∞ % & ∞ g(t)tρ
1 1
g(t) ψ(t) − t + log 2π + log 1 − 2 dt = − dt (7.29)
x 2 t ρ x ρ
where
n log t + 1
g(t) := . (7.30)
tn+1 log2 t
For n ≥ 1 the dominated convergence theorem justifies the interchange of
summation and integration on the right-hand side of (7.29). Recalling that
R(t) = ψ(t) − t we may write
Jn (x) = Jn(1) (x) + Jn(2) (x), (7.31)
where
∞ ρ
t n log t + 1
Jn(1) (x) =− dt,
ρ x ρ tn+1 log2 t
∞ (7.32)
1 1 n log t + 1
Jn(2) (x) = − log 2π + log 1 − 2 dt.
x 2 t tn+1 log2 t
For t ≥ x ≥ 2 we have 0 < − 12 log(1 − t−2 ) < log 2π, and it then follows that
2 log 2π
− < Jn(2) (x) < 0. (7.33)
xn log x
Next use Lemma 7.3 with ρ = 12 + iγ to get
n xiγ rρ (x)
Jn(1) (x) = − . (7.34)
xn−1/2 log x ρ ρ(ρ − n) ρ
ρ
Using the upper bound (7.25), the bounds which have been derived for Jn(2) (x),
and the estimate
xiγ 1
≤ = β, (7.35)
ρ ρ(ρ − n) ρ
|ρ|2
completes the proof of the lemma.
We now proceed to bound the integral K1 (x) in (7.21). In order to do this
we require the following lemma to bound the difference ψ(x) − θ(x). This
gives the same result as Lemma 5.18 (b), but the proof is different. Note that
7.3 Preliminary Lemmas With RH True 177
the result improves on [153, (5.3*)], which, instead of the estimate (7.37),
has √ √
0.998697 x < ψ(x) − θ(x) < 1.001093 x + 3x1/3 . (7.36)
Lemma 7.5 [142, lemma 3] Assume the Riemann hypothesis is true. Then
for all x ≥ 121
0 < ψ(x) − θ(x) − x1/2 ≤ 43 x1/3 . (7.37)
Proof Let g(x) = (ψ(x) − θ(x) − x1/2 )/x1/3 . We may first check that g(x) ≤
1.332 for all x ≤ 4.1 × 1011 . To do this we need to examine only those primes
with p ≤ x1/2 , that is to say, the first 52 104 primes for which the equation is
able to be verified
numerically.
Since ψ(x) = n≥1 θ(x1/n ) we may write
ψ(x) − 2ψ(x1/2 ) = (θ(x1/n ) − 2θ(x1/(2n) )) = (−1)n+1 θ(x1/n ), (7.38)
n≥1 n≥1
so that
ψ(x) − θ(x) ≤ ψ(x1/2 ) + (ψ(x1/2 ) − θ(x1/2 )) + θ(x1/3 ). (7.39)
Next, by Lemma 3.10 (a), for 0 < x ≤ 4.1 × 1011 we have θ(x) < x. For 4.1 ×
1011 ≤ x ≤ 1.68 × 1023 we have x1/2 ≤ 4.1 × 1011 , and therefore
ψ(x) − θ(x) ≤ θ(x1/2 ) + 2(ψ(x1/2 ) − θ(x1/2 )) + θ(x1/3 )
2 2.666
≤ x + x 1 + 1/12 + 1/6 ≤ x1/2 + 1.24637x1/3 .
1/2 1/3
(7.40)
x x
Now consider x ≥ 1.68 × 1023 and make use of the following results due to
Rosser and Schoenfeld (recall we are assuming RH):
ψ(x) − θ(x) ≤ θ(x1/2 ) + 3x1/3 , x > 0,
θ(x) < 1.000081x, x > 0, (7.41)
1 √
|ψ(x) − x| ≤ x log2 x, x ≥ 74,
8π
which are, respectively Lemma 3.10, Lemma 3.11 and Theorem 4.6. It
follows using inequality (7.39) that
log2 x 1.000081 3
ψ(x) − θ(x) ≤ x + x 1.000081 +
1/2 1/3
+ + 1/6
32πx1/12 x1/12 x
≤ x + 1.33128x ,
1/2 1/3
(7.42)
which proves the first part of the lemma.
For the second part of the lemma when 121 ≤ x ≤ 1016 we use
Lemma 3.13 (b), √
ψ(x) − θ(x) > x, (7.43)
178 Robin’s Theorem
Using the result of Lemma 7.3 and the bound in the estimate (7.25), we bound
the right side of the estimate (7.47), noting from Lemma 5.16 that r 1 ,1 (x) ≤ 0,
3
to obtain
2 4 1
K1 (x) ≤ √ 1+ 2
+ 1/6 . (7.48)
x log x log x x
Finally, the bounds (7.48) and (7.27) can be used to complete the proof of the
lemma.
7.3 Preliminary Lemmas With RH True 179
−1
We are finally in a position to bound p≤x (1 − p ) from below. In
Lemma 7.7 we find an upper bound for the multiplicative inverse of this
product:
Lemma 7.7 [142, lemma 5] For x ≥ 20 000,
1
−1
2+β α(x)
1− ≤ eγ log θ(x) exp √ + √ 2
, (7.49)
p≤x
p x log x x log x
where
S 2 (x)(log x + 1.31) 8 + 4β 2 log x 2 log 2π log x
α(x) := 3/2
+(β−2)+ + 1/6 + . (7.50)
2x log x x x1/2
Proof We use Taylor’s formula to expand loglog θ(x) about the point x, so
S (x) S 2 (x)
loglog θ(x) = loglog x + − g(ξ), (7.51)
x log x 2
for some ξ ∈ (x, θ(x)) or (θ(x), x), where g(t) = (log t + 1)/(t2 log2 t). For x ≥
20 000 we use Lemma 3.14 (a):
x
θ(x) > x − (x ≥ 19 421). (7.52)
8 log x
Note that the function g(t) is decreasing, so we may use inequality (7.52) to
write g(ξ) < g(x − x/(8 log x)). This implies, since log(1 − ) < − for > 0,
1
log x + 1 −
8 log x
g(ξ) < 2 2 .
1 1
x2 1 − log x 1 −
2
8 log x 7.8 log2 x
Note that because −log(1 − ) < /(1 − ) for 0 < < 1 we have used:
2 2
1 1 1
log x 1 −
2
> log x − > log x 1 −
2
.
8 log x log x − 1 7.8 log2 x
If y := log x, consider for y ≥ log(20 000) > 9.90349 the expression
1
y+1−
0.44 8y
y + 1.26 + − 2 2 .
y 1 1
1− 1−
8y 7.8y2
This expression is strictly positive for y ≥ log(20 000): to see this convert it
to a polynomial in y with leading term 0.01y7 and then divide by y7 to get the
180 Robin’s Theorem
form k(y) + 0.01 where the rational function k(y) satisfies |k(y)| < 0.00007.
Therefore
log x + 1.26 + 0.44/log x log x + 1.31
g(ξ) < < . (7.53)
x2 log2 x x2 log2 x
This and (7.51) show that
S (x) S 2 (x) log x + 1.5626
loglog θ(x) ≥ loglog x + − . (7.54)
x log x 2 x2 log2 x
Insert this into (7.18) and (7.19) gives:
S (x)
loglog x − loglog θ(x) +
x log x
S (x)2 log x + 1.31 1
≤ − log 1 −
2 x2 log2 x p≤x
p
1
1 1
= −B + γ + + log 1 − +
p≤x
p p>x p p
S (x)2 log x + 1.31
≤ −B + γ + loglog θ(x) + B + − L1 (x),
2 x2 log2 x
so exponentiating we get
1
−1
S (x)2 log x + 1.31
γ
1− ≤ e log θ(x) exp −L1 (x) + .
p≤x
p 2 x2 log2 x
Finally we are able to use Lemma 7.6 to complete the derivation of the upper
bound.
7.4 Bounding p≤x (1 − p−2 ) From Above With RH True
In this section we make (7.12) explicit by giving a more precise form of the
sum of Lemma 5.7 in the case α = 2. The proof takes seven steps and relies
on many of the results which have been proved.
Lemma 7.8 [142, lemma 6] Assume RH is true. For x ≥ 20 000,
⎛ √ ⎞
1 ⎜⎜⎜ − 2 4 ⎟⎟⎟
⎜
1 − 2 ≤ exp ⎝ √ + √ 2 ⎠
⎟. (7.55)
√ p x log x x log x
2x<p≤x
Proof (1) We consider separately small and large values of x. For 2 × 104 ≤
x ≤ 4 × 109 we write
1
6
1
−1
1
−1
1− 2 = 2 1− 2 1− 2 . (7.56)
√ p π √ p p>x
p
2x≤p≤x p≤ 2x
7.4 Bounding p≤x (1 − p−2 ) From Above With RH True 181
We can estimate the final sum appearing in the upper bound of (7.58) by
using Abel summation and integrating by parts so that
1 ∞
−θ(x) 2 log t + 1
= 2 + θ(t) dt. (7.59)
p>x
p 2 x log x x t3 log2 t
Using the middle inequality in the estimate (7.41), and the lower bound
(7.52), we can bound the right side of (7.59) thus
1 1.000081 1 0.070912
≤ + 2
≤ √ , (7.60)
p>x
p 2 x log x 8x log x x log2 x
where we have used θ(x) < 1.000081x from Lemma 3.11. Substituting this
bound in estimate (7.58), then using estimates (7.57) and (7.60) we establish
the lemma for x ∈ [2 × 104 , 4 × 109 ].
(2) For larger x we make use of the results from Lemmas 7.3, 7.4 and 7.5.
We write
⎛ ⎞⎟
⎜⎜⎜ 1 ⎟⎟⎟ 1 x dθ(t)
− log ⎜⎜⎝ ⎜ 1 − 2 ⎟⎟⎠ ≥ = √ 2
x2 <p≤x
p √ p2 2x t log t
2x<p≤x
x x
dt dS (t)
= √ 2 + √ 2 =: I1 + I2 . (7.61)
2x t log t 2x t log t
Hence
x
√ √
dt 1 2 1 2 2
√
=− + √ + − √
2
2x t log t x log x x log 2x x log2 x x log2 2x
x
dt
+2 √ 3
2
2x t log t
√ √
2 log 2 2 2 1
≥ √ 1− − √ 2
− .
x log x log x x log 2x x log x
Therefore since for x > 2 we have
2
log x log x
+ √ < 1,
log 2x 2 2x
we get
√ √
2 2 (2 + log 2)
I1 ≥ √ − √ . (7.62)
x log x x log2 x
(4) For the second integral:
√
S (x) S ( 2x ) √
I2 ≥ 2 − + L2 ( 2x ) − L2 (x), (7.63)
x log x x log 2x
where ∞
2 log t + 1
L2 (y) = S (t) dt = J2 (y) − K2 (y). (7.64)
y t3 log2 t
By Lemma 7.5, we have
and so we may bound K2 (y) using the bounds on Fρ,n (x) in Lemma 7.3. Using
Lemma 7.4 to bound J2 (y) we find that
⎛ ⎞
1 ⎜⎜⎜ β + 29 4β 4 ⎟⎟⎟
L2 (y) ≤ 3/2 ⎜
⎝2β + + − ⎟⎠ . (7.66)
y log y log y 3 log2 y 3
It is straightforward to show that the right
√ side of estimate (7.66) is negative
for y ≥ 2. We may similarly bound L2 ( 2x ) from below:
√ √ √ √
L2 ( 2x ) ≥ J2 ( 2x ) − F 1 ,2 ( 2x ) − 43 F 1 ,2 ( 2x ),
2 3
and so
⎛ √ ⎞
1 ⎜⎜⎜ 2 4 ⎟⎟⎟
1 − 2 ≤ exp ⎜⎝− √ + √ 2 ⎠
⎟,
√ p x log x x log x
2x<p≤x
Lemma 7.9 [142, lemma 7] With the notation defined in the paragraph
above, if x ≥ 20 000 then i≥2 θ(xi ) ≥ 0.998x2 .
Proof To prove the lemma we make use of Lemma 3.12 which gives explicit
bounds of the form
θ(x) > αx (x ≥ a). (7.71)
Let A(x2 ) := θ(x2 ) + i≥3 θ(xi ). By Lemma 6.17 we have (x22 /2)1/k < xk , and so
⎛ ⎞
⎜⎜ x2 1/i ⎟⎟
A(x2 ) ≥ θ(x2 ) + ⎜
θ ⎜⎜⎝ 2 ⎟⎟⎟ . (7.72)
i≥3
2 ⎠
Now, let a and b with a < b be two positive real numbers. For y ∈ [a, b] we
have a bound of the form (7.71), so that
⎛ ⎛ 1/i ⎞⎞
⎛⎜⎜ a2 1/i ⎞⎟⎟ ⎜⎜⎜ 1 ⎜⎜⎜ a2 ⎟⎟⎟⎟⎟⎟⎟
A(y) ≥ αy + ⎜
θ ⎝⎜ ⎟ ⎜
⎠⎟ ≥ y ⎜⎝α + θ ⎜⎝ ⎟⎠⎟⎠ = yη, (7.73)
3≤i
2 b 3≤i 2
where η = η(α, a, b). The sum over all i ≥ 3 is finite, since the summands in
the expression (7.73) are zero whenever i > log(a2 /2)/log 2. We sum over
all 3 ≤ i ≤ log(a2 /2)/log 2 and use explicit numerical values for the θ(x)
evaluations needed on the right-hand side of (7.72). The range to be covered
needs to include the minimum value of x2 , which is not less than 193.
By choosing α, a and b appropriately we can bound A(y) in regions a ≤ y ≤
b. We do this in Table 7.1 for which we computed the values of η, given a, b
and α. The first four values of a, b and α are from [146, p. 265, corollary]
and the final values are from [145, theorem 10].
The lemma then follows from expression (7.73) and Table 7.1.
7.5 Bounding loglog N From Below With RH True 185
Table 7.1 Evaluation of η(α, a, b), given α, a and b.
a b α η
Take logarithms and use Lemmas 7.9 and 3.11, which gives θ(x) <
1.000081x, x > 0, to get
log N = θ(xi ) = θ(x) + θ(xi )
i≥1 i≥2
≥ θ(x) + c1 x2
c1 x2
= θ(x) 1 +
θ(x)
c1 x2
≥ θ(x) 1 +
1.000081x
c2 x2
≥ θ(x) 1 + ,
x
186 Robin’s Theorem
We now use the results of three of the lemmas which have been proved above,
all valid for x ≥ 20 000 and assuming RH. By Lemma 7.7,
1
−1
2+β α(x)
γ
1− ≤ e log θ(x) exp √ + √ ,
p≤x
p x log x x log2 x
where
S 2 (x)(log x + 1.31) 8 + 4β 2 log x 2 log 2π log x
α(x) := 3/2
+ (β − 2) + + 1/6 + ,
2x log x x x1/2
by Lemma 7.8,
⎛ √ ⎞
1 ⎜⎜⎜ 2 4 ⎟⎟⎟
1 − 2 ≤ exp ⎜⎝− √ + √ 2 ⎠
⎟,
√ p x log x x log x
2x<p≤x
where
c1 ≤ −0.7642, c2 = α(x) + 4.484, (7.77)
and where α(x) is defined in (7.50) as part of the statement of Lemma 7.7.
We will show that c1 + c2 /log x ≤ 0 for√certain ranges of x set out below.
(1) For x ≥ 100 000 we use |S (x)| ≤ x log2 x/(8π) (Theorem 4.6), which is
valid since we are assuming RH, to bound S (x) and hence α(x) from above.
This shows that α(x) ≤ 2.27146, thus c1 + c2 /log x ≤ −0.175 < 0.
(2) For x ∈ [2 × 104 , 105 ] we use inequality (7.52), i.e. 0 ≤ x − θ(x) ≤
x/(8 log x) (Lemma 3.14 (a)) to bound S (x), so c1 + c2 /log x ≤ −0.055 < 0 in
this case also.
(3) Finally, for x ≤ 20 000, we calculate all of the colossally abundant
numbers Ck with largest prime factor at most x. We do this using the
fact that the first 107 colossally abundant numbers satisfy that the ratio of
any two consecutive numbers is a single prime which can be determined
algorithmically. (A Mathematica implementation of this calculation is on the
page for A004990 in the Online Encyclopedia of Integer Sequences [158].)
Once the next prime greater than 20 000, namely 20 011, occurs (at the
2348th term) we have the complete set of primes which appear in order and
188 Robin’s Theorem
can easily generate the corresponding 2347 colossally abundant numbers; see
Table A.5.
Recall the definition G(n) := σ(n)/(n loglog n). By (1), (2) and (3) above
we have G(Ck ) < eγ for all Ck ≥ 5041. By Theorem 6.16, this shows that the
theorem is true for all n ≥ 5041. Finally, we compute G(n) for all 2 ≤ n ≤ 5040
to derive the set S . This completes the proof.
where, by Lemma 6.17, we have xk+1 > x1/(k+1) . We can rewrite the right-hand
side of (7.84) as
1
−1
1
1− 1 − 2 E(x), (7.85)
p≤x
p x <p≤x p
2
Therefore using (7.84), the bounds (7.86) and the estimate (7.87) we get
−1
σ(N) 1 1 1
≥ 1− 1− 2 1+O √ .
N p≤x
p √
2x<p≤x
p x log2 x
Nicolas, in Theorem 5.29, proved that if the Riemann hypothesis is false there
is a b with 0 < b < 12 such that
⎛ ⎞
⎜⎜⎜ 1
⎟⎟⎟
log f (x) = log ⎜⎜⎜⎝eγ 1 − log θ(x)⎟⎟⎟⎠ = Ω± (x−b ). (7.88)
p≤x
p
1.0
0.8
0.6
0.4
0.2
n
200 400 600 800 1000
Note that, in the expression (7.93), we did not require the addition of the
term Hn . It was pointed out by Kaneko [100, p. 542] that one could instead
prove that σ(n) ≤ exp(Hn ) log(Hn ) for n > 60 is equivalent to the Riemann
hypothesis.
In Figure 7.8 we see how close the left side gets to the right side of the
inequality (7.92), even for small values of n.
One could also use a result of Robin [142, section 6, proposition 2 (1)], which
improves the right-hand side of inequality (7.95) to 2.57n loglog n for n ≥ 7.
There is an editorial comment in [102] that includes an observation by
the GCHQ Problem Solving Group. They contest that one could replace the
number 2 in inequality (7.94) by any constant K > 1 at the cost of having to
check more cases by hand when K is close to one. An example is cited: when
K = 1.2 one needs to verify inequality (7.94) for 1 ≤ n ≤ 106 .
Further advances have since obviated the need for such a large check.
Akbary, Friggstad and Juricevic [1] showed that
σ(n) σ(180)
< loglog n ≤ 1.0339eγ loglog n (n ≥ 121).
n 180 loglog 180
This was improved by Akbary and Friggstad [2, p. 3] by considering
superabundant numbers – see also Lemma 8.9:
σ(n)
≤ 1.013617eγ loglog n (n ≥ 5041).
n
This, and a computer check for 1 ≤ n ≤ 5040, establishes the unconditional
inequality given in Theorem 7.19.
Theorem 7.19 Without any hypotheses
σ(n) ≤ Hn + 1.013617 exp(Hn ) log(Hn ),
for all n ≥ 1.
One could reduce the size of the coefficient of exp(Hn ) log(Hn ) by
testing Robin’s inequality against more superabundant numbers. Given the
unconditional nature of Theorem 7.19, we note that if Theorem 7.18 is false,
it cannot be false by much. In this sense Theorems 7.19 and 7.18 complement
Theorems 7.11 and 7.13.
8.1 Introduction
Recall the theorem of Robin in Chapter 7, Theorem 7.11, that the Riemann
hypothesis is equivalent to Robin’s inequality holding for all n > 5040:
σ(n)
< eγ loglog n.
n
A := {1, 2, 3, 4, 5, 6, 8, 9, 10, 12, 16, 18, 20, 24, 30, 36, 48,
60, 72, 84, 120, 180, 240, 360, 720, 840, 2520, 5040},
B := {2, 3, 5, 6, 10, 30},
C := {1, 4, 8, 9, 16, 36},
D := {4, 8, 9, 16, 36, 72, 108, 144, 216, 900, 1800, 2700, 3600,
44 100, 88 200}.
200
8.1 Introduction 201
the three successive sections, Sections 8.3, 8.4 and 8.5, the given cases of
r-free integers are proved.
It is somewhat confusing that references to “Robin’s inequality” sometimes
implicitly include the lower bound n > 5040. In this volume by Robin’s
inequality we always mean
σ(n)
< eγ loglog n,
n
and any lower bound is given explicitly.
The following RHpack (see Appendix B) variables and functions relate
to the material in this chapter: SetA, SetB, SetC, SetD, HardyRamanujanQ,
NthPrimeBounds, LogKthPrimorialBounds, KthApproximatePrimorial, Pri-
morialFormToInteger and HardyRamanujanToPrimorialForm.
Lemma 8.3 [38, lemma 2.3] All elements of S other than {1, 3, 5, 9} satisfy
Robin’s inequality and all elements other than {1, 3, 5, 9, 15} satisfy Nicolas’
inequality.
Proof For p ≥ 7 and n ∈ S ∩ [31, ∞) we can write
σ(n) n 3 5 p 3 5 7
≤ ≤ · · ≤ · · = 2.1875 < eγ loglog 31 ≤ eγ loglog n.
n ϕ(n) 2 4 p − 1 2 4 6
The remaining elements of S can be checked numerically.
Lemma 8.4 [38, theorem 1.1] All squarefree integers other than the set B,
satisfy Robin’s inequality.
Proof (1) Begin by noting that no element of B satisfies Robin’s inequality.
If p ≥ 7 is prime then p + 1 < 8p/7 < peγ loglog p, so all such primes satisfy
Robin’s inequality.
(2) We now proceed using induction. If m := ω(n) we have just shown the
lemma is true for m = 1. Assume it is true for all squarefree integers n with
ω(n) = m − 1 for some m ≥ 2 and let n = q1 · · · qm be an integer not in B and
with primes qi satisfying q1 < q2 < · · · < qm .
204 Numbers That Do Not Satisfy Robin’s Inequality
(3) Suppose first that qm ≥ log n. If, on the one hand, q1 · · · qm−1 ∈ B and
qm B, then by Lemma 8.2, n = (q1 · · · qm−1 )qm satisfies Robin’s inequality.
If, however, qm ∈ B and n B we must have n = 15, which can be shown to
satisfy Robin’s inequality directly.
If, on the other hand, q1 · · · qm−1 B, we must have, by the inductive
assumption,
(q1 + 1)(· · ·)(qm−1 + 1) < eγ q1 · · · qm−1 loglog(q1 · · · qm−1 ),
and so
(q1 + 1)(· · ·)(qm + 1) < eγ q1 · · · qm−1 (qm + 1) loglog(q1 · · · qm−1 ). (8.1)
Note that if 0 < a < b we have
log b − log a 1 b
dt 1
= > .
b−a b−a a t b
Therefore setting a = log(q1 · · · qm−1 ) and b = log(q1 · · · qm ) and using the
assumption qm ≥ log(n), we can write
loglog n − loglog(n/qm ) loglog(q1 · · · qm ) − loglog(q1 · · · qm−1 )
=
log qm log(q1 · · · qm ) − log(q1 · · · qm−1 )
1
>
log(q1 · · · qm )
loglog(q1 · · · qm−1 )
≥ ,
qm log qm
so that
(qm + 1) loglog(q1 · · · qm−1 ) ≤ qm loglog(q1 · · · qm ).
Therefore by (8.1)
(q1 + 1)(· · ·)(qm + 1) < eγ q1 · · · qm loglog(q1 · · · qm ),
so n ∈ R and the result of the lemma follows by induction.
(4) Suppose now it is the case that qm < log n. Recall we can assume m ≥ 2.
If qm = 3 then n = 6 but log 6 < 3. If qm = 5 then n ∈ {10, 15}, but log 10 < 5
and log 15 < 5. Hence we can assume qm ≥ 7. Note that for a > 0
a+1
dt 1
log(a + 1) − log(a) = < ,
a t a
so, by Lemma 8.1, we can write
m m
1 1
log(qi + 1) − log(qi ) < ≤ ≤ γ + loglog qm
i=1
q p≤q p
i=1 i m
< γ + logloglog(q1 · · · qm ).
Taking exponentials we get σ(n)/n < eγ loglog n, which completes the
induction step in this case also.
8.2 Hardy–Ramanujan Numbers 205
Lemma 8.5 [38, theorem 2.1] All odd integers other than the set
{1, 3, 5, 9, 15}, satisfy Nicolas’ inequality.
Proof Let m := ω(n) ≥ 1. If m = 1 then, by Lemma 8.3, n satisfies Nicolas’
inequality. If m ≥ 2 then n/ϕ(n) = γ(n)/ϕ(γ(n)). Therefore if r is a squarefree
number satisfying Nicolas’ inequality so does every integer n with γ(n) = r.
Now let n = q1 · · · qm be squarefree with the qi prime and 3 ≤ q1 < q2 < · · · <
qm . In this case n satisfies Nicolas’ inequality if and only if
m
1 qi
m
= < eγ loglog(q1 · · · qm ).
i=1
1 − 1/qi i=1 qi − 1
Because q1 /(q1 −1) ≤ 3/2 and for 2 ≤ i ≤ m, qi /(qi −1) < (qi−1 +1)/qi−1 , setting
n1 := 2n/q1 gives
3 qi + 1 σ(n1 )
m−1
n
< = .
ϕ(n) 2 i=1 qi n1
It follows that since n1 is squarefree, by Lemma 8.4, other than elements of
B, n1 satisfies Robin’s inequality, so we have
n σ(n1 )
< < eγ loglog n1 < eγ loglog n.
ϕ(n) n1
Therefore n would satisfy Nicolas’ inequality. Elements of B with m ≥ 2 can
be checked numerically.
If however m = 1 then n would be a prime power so would be in S . The
result then follows by Lemma 8.3.
Recall the definition θ(x) := p≤x log p for x ≥ 1. Recall that by
Lemma 3.10 (d), for x ≥ 121 we have θ(x) ≥ 0.8x. Recall also that by
Lemma 5.12 for x ≥ 286 we have
p
γ 1
≤ e log x 1 + .
p≤x
p−1 2 log2 x
These estimates enable us to derive the following upper bound for the left-
hand side of that expression in terms of the product of successive primes:
Lemma 8.6 [38, lemma 3.2] For all m > 4 we have
m
pi
≤ eγ log(2 log(p1 · · · pm )),
i=1
p i − 1
206 Numbers That Do Not Satisfy Robin’s Inequality
where as usual pi is the ith prime, so m(ē) has exponent pattern ē.
8.2 Hardy–Ramanujan Numbers 207
Lemma 8.8 [38, proposition 5.1] If Robin’s inequality holds for all Hardy–
Ramanujan integers n with 5041 ≤ n ≤ x it holds for all integers in this same
range.
Proof (1) First note that for given e > f > 0 and x ∈ (1, ∞) the function
1 − x−e
g(x) :=
1 − x− f
is strictly decreasing. To see this, observe that the numerator of the derivative
is ex f − f xe + f − e and its derivative is 0 at x = 1 and is negative for all x > 1.
(2) It follows from (1) that if p, q, f and e are positive integers with p, q
prime, p < q and e < f then
σ(p f qe ) 1 − p− f −1 1 − q−e−1
=
p f qe 1 − p−1 1 − q−1
−e−1
1− p 1 − q− f −1 σ(pe q f )
> = .
1 − p−1 1 − q−1 pe q f
(3) For primes p and q with p < q we have σ(pe )/pe > σ(qe )/qe . Therefore
if the exponent pattern is fixed, the maximum value of the ratio σ(n)/n will
be assumed when the primes are as small as possible, i.e. for n = m(ē) and if
n1 = pβ11 · · · pβmm is any other number for which the maximum is assumed, by
(2), we must have β1 ≥ · · · ≥ βm . Thus
σ(m(ē)) σ(n)
= max : n has factorization pattern ē .
m(ē) n
(4) This shows that if σ(n)/n ≥ eγ loglog n, then the same inequality holds
with n replaced by m(ē).
(5) Now we claim that if Robin’s inequality is true for the integer m(ē),
then it is true for every positive integer n having exponent pattern ē. To see
this, let n be such that it has exponent pattern ē and m(ē) ≤ 5040. Assume
also that n ≥ 5041 and that it does not obey Robin’s inequality. Since the
maximum number of prime factors of any number less than 5041 is 5, we
can write using (3)
σ(n) σ(m(ē)) 1 − p−6
eγ loglog n ≤ ≤ ≤ < 5,
n m(ē) p≤11
1 − p−1
so log(5041) ≤ log n < 14. Checking n values in this range numerically shows
that they satisfy Robin’s inequality, a contradiction, completing the proof of
the claim. Thus if n > 5040 and m(ē) ≤ 5040 then n obeys Robin’s inequality.
If however n > 5040 and m(ē) > 5040, since m(ē) is a Hardy–Ramanujan
number it must, by the hypothesis of the lemma, satisfy Robin’s inequality.
Hence, because of (3) and n ≥ m(ē),
σ(n) σ(m(ē))
≤ < eγ loglog(m(ē)) ≤ eγ loglog n.
n m(ē)
208 Numbers That Do Not Satisfy Robin’s Inequality
Proof (1) Suppose there exists an integer larger than 5040 which is 5-free
and does not satisfy Robin’s inequality. We claim that if n is the smallest
integer with this property then P(n) < log n. To see this first let q := P(n) with
n = mq. Since n is minimal, either m satisfies Robin’s inequality or it does
not and m ≤ 5040 so m ∈ A. If this latter is true, because
max {n ∈ A : P(n) ≤ 5} = 720,
and 5 × 720 = 5040, we can assume q ≥ 7. By Lemma 8.2, since n > 5040 and
does not satisfy Robin’s inequality, we obtain a contradiction in this case.
Hence m satisfies Robin’s inequality so m ≥ 7.
(2) Assume now, to get a contradiction, that we also have q ≥ log n. This
implies
q log q ≥ log n loglog n > log n loglog m,
which, since
loglog n − loglog m 1 log n
dt 1
= > ,
log q log n − log m log m t log n
implies first
q loglog m q(loglog n − loglog m) loglog m
> =⇒ >
log n log q log q log q
1
⇐⇒ 1 + loglog m < loglog n.
q
Then, since for all integers a, b ≥ 1 we have σ(ab) ≤ σ(a)σ(b), we get
σ(n) σ(qm) 1 σ(m) 1 γ
= ≤ 1+ < 1 + e loglog m < eγ loglog n,
n qm q m q
so n satisfies Robin’s inequality, a contradiction, completing the proof of the
claim.
(3) Note next that the set of 5-free Hardy–Ramanujan integers n > 5040
with P(n) ≤ 73 is finite. An explicit computation shows each member satisfies
Robin’s inequality.
(4) Now let t ≥ 2 and define for x ≥ 3
1 1 − 1/pt
Rt (x) := and S t (x) := .
p>x
1 − 1/pt p≤x
1 − 1/p
Then
log Rt (x) = − log(1 − p−t )
p>x
210 Numbers That Do Not Satisfy Robin’s Inequality
1
=
p>x m≥1
mptm
1
≤
p>x m≥1
(pm )t
1 t 1
≤ ≤ .
n>x
n t − 1 xt−1
t
(The reader might wish to experiment with the better upper bound given by
log Rt (x)−t≤ tx /(t − 1).
1/((t − 1)(x − 1)t−1 ). Therefore 1−t
(5) Because Rt (x)/ζ(t) = p≤x (1 − p ), using the result of Step (3) we can
write
t
log(1 − p−t ) = −log ζ(t) + log Rt (x) ≤ −log ζ(t) +
p≤x
(t − 1)xt−1
Hence we must have log m ≤ exp(S 5 (193)e−γ ) < 175 and the largest prime
less than 175 is 173.
We then repeat this process, replacing 193 with 173, and checking
exp(S 5 (173)e−γ ) ≤ 173, which is true. This continues until we reach the prime
73 for which exp(S 5 (73)) > 73: so the process necessarily stops:
193 → 173 → 151 → 137 → 109 → 103 → 97 → 89 → 83 → 79 → 73
and we conclude P(m) < log m ≤ 73.
(7) We are now able to give the final step in the proof. To obtain a
contradiction, let n be the smallest 5-free integer which is greater than
5040 and which does not satisfy Robin’s inequality. By Step (1) we have
P(n) < log n and by Step (6) P(n) ≤ 73.
Now let ē be the factorization pattern of n. Then necessarily m(ē) is 5-free
and m(ē) ≤ n. Since n is minimal, by Lemma 8.8 we cannot have 5041 ≤
m(ē) < n so either m(ē) = n, and n is Hardy–Ramanujan, or m(ē) ≤ 5040, so
by Lemma 8.8 Step (5) n satisfies Robin’s inequality, which is false. Hence n
is a Hardy–Ramanujan number, which by Step (3) implies n satisfies Robin’s
inequality. This contradiction completes the proof.
Thus we have shown:
Theorem 8.12 [38, theorem 1.5] Any integer greater than 5040 which does
not satisfy Robin’s inequality must be divisible by at least the fifth power of
at least one odd prime.
and therefore
1
log pn < loglog Nn + .
8 log pn
(4) Now write
−1
Ψt (Nn ) (1 − p−t p>pn (1 − p−t )
n
i ) 1
= = 1− .
Nn i=1
−1
1 − pi ζ(t) p≤pn
p
Therefore by Steps (1), (2) and (3) and Lemma 5.12, for n > π(20 000) = 2262
we get
Ψt (Nn ) exp(2/pn ) γ 1
≤ e log pn +
Nn ζ(t) 2 log pn
exp(γ + 2/pn ) 5
≤ loglog Nn +
ζ(t) 8 log pn
exp(γ + 2/pn ) 7
≤ loglog Nn + .
ζ(t) 10 log pn
This completes the proof of the lemma.
Recall the definition of n1 (t): it is the smallest integer n with n ≥ n0 and
such that exp(2/pn ) f (n) < ζ(t) where f (n) := 1 + 7/(10 log pn loglog Nn ).
Lemma 8.14 [159, corollary 9] For n ≥ n1 (t) and M ≥ Nn we have
Qt (M) < eγ .
Proof Since n ≥ n1 (t) we have
7
e 2/pn
1+ < ζ(t),
10 log pn
which implies by Lemma 8.13
Ψt (Nn ) exp(γ + 2/pn ) 7
Qt (Nn ) = ≤ 1+ < eγ .
Nn loglog Nn ζ(t) 10 log pn loglog Nn
Because M ≥ Nn , for some n2 ≥ n ≥ n1 (t) we have Nn2 ≤ M < Nn2 +1 . Therefore,
by Step (1) of Lemma 8.13 we have
Qt (M) ≤ Qt (Nn2 ) < eγ ,
and the proof of the lemma is complete.
Theorem 8.15 [159, theorem 10] If an integer N is 7-free and N > 5040 then
it satisfies Robin’s inequality.
Proof By Step (1) of Lemma 8.13 and Lemma 8.14, if n ≥ n1 (7) = 1292 and
N ≥ Nn then, since n is 7-free,
σ(N) ≤ Ψ7 (N) < Neγ loglog N,
214 Numbers That Do Not Satisfy Robin’s Inequality
so all such integers N ≥ Nn1 (7) < 3.51 × 104550 satisfy Robin’s inequality. By
the computations of Briggs [16, p. 253] Robin’s inequality is true for
10
5040 < N ≤ 1010 ,
and therefore true for all 7-free integers N > 5040.
Solé and Planat note that for a t-free integer n one has σ(n) ≤ Ψt (n). Using
their method, for a given t, first find an integer n1 (t) such that Rt (Nn1 (t) ) < eγ
8.5 Integers Not Divisible by the 11th Power of Any Prime 215
10
and Nn1 (t) < 1010 . Solé and Planat in Lemma 8.14 showed that for all
N > Nn1 (t) we have Qt (N) < eγ . Since Briggs [16] has proved that Robin’s
10
inequality is true for all 5041 ≤ n ≤ 1010 , this then shows that Robin’s
inequality is true for all such t-free integers n ≥ 5041.
The main idea of this section is to use explicit estimates on sums over
primes to bound the primorials. This makes for an easy computation, and
one which is used to verify Theorem 8.16.
First the function Qt (n) is estimated using bounds on primorials. For t ≥ 2
we have by Step (4) of Lemma 8.13
−t −1
1 − p−t p>pn (1 − p )
n
1
Qt (Nn ) = k
= (1 − p−1 )−1 . (8.5)
loglog Nn k=1 1 − p−1k
ζ(t) loglog N n p≤p n
The first product on the right-hand side of the expression (8.5) is estimated
using Lemma 6 of Solé and Planat, namely,
(1 − p−t )−1 ≤ exp(2/pn ), (8.6)
p≥pn
given in the unpublished result of Dusart [50, theorem 6.12] is used. This
improves on Lemma 5.12.
Now to apply the estimates (8.6) and (8.7) with x = pn , an explicit bound on
the nth prime is needed. It is possible to proceed without an explicit bound,
though this increases greatly the computation time when n is large. In fact, if
one considers the function
exp(2/x) log x
f (x) = , (8.8)
1
1−
5 log2 x
which is increasing for all x ≥ 5, one sees that it is sufficient to consider only
an explicit upper bound on pn . Consider
pn ≤ b1 (n) := n(log n + loglog n − 12 ), n ≥ 20, (8.9)
and
pn ≤ b2 (n) := n(log n + loglog n − 0.9484), n ≥ 39 017, (8.10)
216 Numbers That Do Not Satisfy Robin’s Inequality
due respectively to Rosser and Schoenfeld [145, (3.11)] and Dusart [49,
section 4].
Now we bound the factor loglog Nn in expression
(8.5) and make use of the
function θ(x) = p≤x log p. Since log Nn = ni=1 log pi = θ(pn ), bounds on Nn
can be derived using bounds on θ(pn ). These results are given in the following
lemma.
Lemma 8.18 For k ≥ 198,
loglog k − 2.1454
k log k + loglog k − 1 +
log k
loglog k − 2
≤ log Nk ≤ k log k + loglog k − 1 + . (8.11)
log k
Proof The left inequality follows from Robin [141, theorem 7] and is valid
for all k ≥ 3. The right inequality follows from Massias and Robin [114,
theorem B(v)] and is valid for all k ≥ 198.
Note that the constant “2” in the right inequality in (8.11) cannot be
improved. One could reduce the “2.1454” that appears in the left inequality
at the expense of taking a much larger k. As shown below at the end of this
section, this has very little influence on our problem.
Now we use the estimates (8.6)–(8.10) and Lemma 8.18 to estimate the
right-hand side of expression (8.5). We have
⎧
⎪
⎪
⎨eγ g1 (t, n), 430 ≤ n ≤ 39 016,
Qt (Nn ) ≤ ⎪
⎪ (8.12)
⎩eγ g2 (t, n), n ≥ 39 017,
where
f (bi (n))
gi (t, n) = % & ,
loglog n − 2.1454
ζ(t) log n log n + loglog n − 1 +
log n
for i = 1, 2. We require n ≥ 430 in the bounds (8.12) to ensure that the
conditions in the estimate (8.7) and Lemma 8.18 are met since p429 = 2971
and p430 = 2999. Following Solé and Planat, define n1 (t) to be the least value
of n ≥ 430 for which gi (t, n) < 1. Since gi (t, n) is decreasing in n, we will
have gi (t, n) < 1 for all n ≥ n1 (t). For such an n1 (t) consider the size of the
associated Nn1 (t) . The results are summarized in Table 8.1. Exact values of
Nn1 (t) for t = 11 and 12 are not given owing to the computational complexity
of calculating the nth primorial exactly.
As has been previously noted, Briggs [16] has proved that Robin’s
10
inequality is true for 5041 ≤ n ≤ 1010 , by demonstrating numerically it
holds for all colossally abundant numbers in this range, and then invoking
Theorem 6.16. To prove a statement such as “Robin’s inequality holds for all
8.6 Unsolved Problems 217
Table 8.1 Values of n1 (t) and Nn1 (t) .
10
t-free integers”, one must show that Nn1 (t) ≤ 1010 . The last entry in Table 8.1
shows that, at present, it is impossible to consider t = 12 without a new idea.
Conclusion. Here is a brief discussion of the possibility of proving
that Robin’s inequality is satisfied by all 12-free integers. Dusart [50]
(unpublished) has considered some improved versions of the estimates (8.7)
and (8.11), namely
−1 −1 γ 1
(1 − p ) < e log x 1 + , x ≥ 2973,
p≤x 5 log2 x
and
loglog k − 2.04
k log k + loglog k − 1 + ≤ θ(pk ), pk ≥ 1015 .
log k
These results appear respectively as Theorem 6.12 and Proposition 6.2 in
12
[50]. Even with these improvements one still has Nn1 (12) > 1010 . Without
injecting new ideas into the argument, one would have to increase the range
10
of Briggs’ computations beyond 1010 .
To extend the computation one need only check those numbers that are
colossally abundant and are divisible by the 11th power of some prime.
Presumably, this is a very thin set of numbers. Checking only these numbers
may motivate an extension of Briggs’ computations and hence the possibility
of extending the results in this section to t = 12.
9.1 Introduction
This chapter explores further properties of numbers for which Robin’s
inequality fails. It is based on two papers [34, 35], by the same three authors
published in 2011 and 2012: Geoffrey Caveney, Jean-Louis Nicolas and
Jonathan Sondow. Two new equivalences to RH are derived. It also includes
an equivalence from 2013 of Nazardonyavi and Yakubovich.
Recall the definition of Grönwall’s function:
σ(n)
G(n) := , n > 1,
n loglog n
and note the important property which is used in this chapter and proved in
Section 9.2:
Theorem 9.2 (Grönwall’s theorem) [69; 75, theorem 323] We have that
218
9.1 Introduction 219
where A as defined at the start of Chapter 8 is the set of integers not greater
than 5040 for which Robin’s inequality is false. This theorem includes an
equivalence for the Riemann hypothesis, namely if the Riemann hypothesis
is true then there are no other right abundant numbers, and if the hypothesis
is false then infinitely many right abundant numbers exist and in that case
Lem 5.13
Lem 9.7
Thm 9.8
Lem 9.6
Lem 9.3
Thm 9.2
Lem 9.5 Thm 7.16
Lem 9.4
Lem 6.1
Lem 6.2
Lem
Thm 7.13
Lem 3.14
Thm 7.16
Lem 9.16 Thm 9.17
Lem 9.12
Lem 6.10
Lem 9.15
Lem 9.5
Theorem 9.2 (Grönwall’s theorem) [69; 75, theorem 323] We have that
Proof (1) Let m ∈ N and for each j let p j be the jth prime with p1 = 2. Define
an integer nm by
nm n
m +1
Thus
log m
lim = 1.
m→∞ pn
m
It follows that as m → ∞
pnm = log m (1 + o(1)). (9.3)
(2) By Mertens’ formula (Lemma 5.12) we can write, as x → ∞,
1
1
1− = γ (1 + o(1)).
p≤x
p e log x
Let x = pm in this formula and use (9.3) to derive, using log(1 + o(1)) = o(1),
nm
1 1 1
1− = γ (1 + o(1)) = γ (1 + o(1)). (9.4)
j=1
p j e log p nm e loglog m
Lemma 9.6 [34, lemma 2] If n ∈ A is greater than 5, then G(n) < G(n/p) for
some prime p | n.
Proof The smallest prime factor of n ∈ A \ {1, 2, 3, 4, 5} for which G(n) <
G(n/p) is given by
(n, p) ∈{(6, 2), (8, 2), (9, 3), (10, 2), (12, 2), (16, 2), (18, 3), (20, 5), (24, 3),
(30, 3), (36, 2), (48, 3), (60, 5), (72, 3), (84, 7), (120, 2), (180, 5),
(240, 5), (360, 5), (720, 3), (840, 7), (2520, 7), (5040, 2)},
so this result is a simple computation.
Lemma 9.7 [34, lemma 3] If r ∈ A then there exists an > 0 such that for all
primes p ≥ 11 we have G(pr) < eγ − .
Proof Let n be a positive integer, and let p and q be odd primes not dividing
n with q < p. Then
σ(pn) p + 1 σ(n) q + 1 σ(n)
G(pn) = = < = G(qn).
pn loglog pn p n loglog pn q n loglog qn
No prime p ≥ 11 divides any element of A. Also G(11r) < eγ for all r ∈ A.
Therefore G(pr) ≤ G(11r) < eγ .
Finally if p ≥ 3 then
G(p2 ) (p2 + p + 1) loglog p
= < 1.
G(p) (p2 + p) loglog p2
This completes the proof.
Recall that in Chapter 7, Theorem 7.13, Robin’s unconditional upper
bound gives for all n > 1 the bound G(n) ≤ eγ + 1.1161/(loglog n)2 . We also
saw in Theorem 7.15 that if RH was false there exists a real number b with
1 − θ < b < 12 , where θ := sup{β : ζ(β + iγ) = 0}, such that when n → ∞ through
colossally abundant numbers
γ 1
G(n) = e 1 + Ω± .
logb n
Also recall from Chapter 6 we defined and used the function
1 1
F(z, k) := log 1 + .
z + · · · + z log z
k
k
<
(z + · · · + zk ) log zk
k
<
(k − 1 + zk ) log kx
k
≤
(k/2 + kx)log x
1 1
< log 1 + = F(x, 1),
x log x
so F(z, k) < F(x, 1). But this implies F(xk , k) = = F(x, 1) > F(z, k), and the
lemma follows since t → F(t, k) is strictly decreasing on (1, ∞).
Lemma 9.12 [35, lemma 2] Let N0 be a colossally abundant number of
parameter 0 , and N > N0 , for all n ≥ N0 , satisfies
σ(n) σ(N)
≤ 1+
n1+ N
for some fixed > 0. Then N is also colossally abundant of parameter .
228 Left, Right and Extremely Abundant Numbers
lemma.
Lemma 9.13 [35, lemma 3] Let N be a colossally abundant number of
parameter where < F(2, 1) and let x = x be the unique solution to
F(x, 1) = . Then √
(i) for some constant c > 0 we have log N ≤ θ(x) + c x, and
(ii) if N is the largest colossally abundant number of parameter then
√
θ(x) ≤ log N ≤ θ(x) + c x.
Proof Recall that x = x1 where for k ≥ 1 we have F(xk , k) = , and that the
sequence (xk ) is decreasing. Let K be the largest integer such that xK ≥ 2 so
that for all primes q with q | N we have 2 ≤ q ≤ xk and k = νq (N) ≤ K. Note
also that ν2 (N) = K or K − 1 and that if < F(2, 1) then x > 2 and K ≥ 1. Then
K
N= q. (9.9)
k=1 q≤xk
Therefore
log N ≤ θ(x1 ) + θ(x2 ) + · · · + θ(xK ).
Because t → F(t, k) is decreasing we have
F(2, K) ≥ F(xK , K) = = F(xK+1 , K + 1) > F(2, K + 1).
But
1 1 1 1 2
F(2, K) = log 1 + < K+1 ≤ K < K,
2K+1 − 2 log 2 (2 − 2) log 2 2 log 2 2
9.4 Riemann Hypothesis Equivalences 229
By Lemma 9.11 we have x2 ≤ (2x)1/2 and √x3 ≤ (3x)1/3 so using (9.10) and
θ(t) ∼ t as t → ∞, we get log N ≤ θ(x) + c x, which proves part (i) of the
lemma.
Equation (9.9) gives the largest colossally abundant number of parameter
as
K
N= q.
k=1 q≤xk
Lemma 9.14 [35, lemma 4] For some c > 0 there are infinitely many primes
√
p such that θ(p) < p − c p logloglog p, and infinitely many other primes p
√
such that θ(p) > p + c p logloglog p.
(5) Finally assume that the Riemann hypothesis is false. Let the real
number η with 12 < η ≤ 1 be defined by
η := sup ρ.
ζ(ρ)=0
Using Theorem 7.15 we have for some b with 1−η < b < 12 , as N → ∞ through
colossally abundant numbers,
γ 1
G(N) = e + Ω+ .
logb N
Therefore there are infinitely many positive integers N which are colossally
abundant and satisfy G(N) > eγ . Thus, for all real t we have max{G(n) :
n ≥ t} > eγ .
Let μ1 := μ = max{G(n) : n > 5040} and let a1 be the smallest integer and b1
the largest integer both greater than 5040 with G(a1 ) = G(b1 ) = μ1 . If an and
bn have been defined for 1 ≤ n < i, we set
and then let ai be the smallest integer and bi the largest integer, both greater
than bi−1 , such that G(ai ) = G(bi ) = μi . Because μi > eγ = lim supn→∞ G(n),
there is an infinite number of distinct integers ai , and they are such that if
n > ai then G(n) ≤ G(ai ). Therefore each ai is right abundant, so infinitely
many right abundant numbers exist.
(6) Now let an integer M > 5040 satisfy G(M) = μ. The method of the
previous paragraph can be used to show that M is necessarily right abundant.
Using Lemma 9.12 with N0 = 5040, 0 = 3/100 (in the notation of Section 6.3,
7 < 0.03 < 8 ), N = M and = 1/(log M loglog M), and employing also
Lemmas 9.16 and 9.5, we get N = M > e19747 > N0 . For n ≥ N0 , using the
definition of M we get G(n) ≤ G(M). By Lemma 9.15, because < N0 < M,
on the interval [N0 , ∞) the function g (t) attains its minimum at t = M, so for
n ≥ N0 we get
σ(n) loglog n G(n) G(M) G(M) σ(M)
= G(n) ≤ g (n) ≤ g (n) ≤ g (M) = 1+ .
n 1+ n e e e M
Therefore, by Lemma 9.12, M is also colossally abundant, completing the
proof of the theorem.
Lemma 9.20 [35, lemma 8] Let p ≥ 5 be an odd prime and N the largest
colossally abundant number of parameter with < 1/(log N loglog N). Then
N is not left abundant.
Proof Firstly we have
p + 1 σ(p)
p = = .
p p
Therefore, since F(p, 1) = , P(N) = p and p divides N at most to the first
power, inequality (9.12) with q = p is an equality
σ(N/p) σ(N)
= 1+ ,
(N/p)1+ N
and similarly for inequality (9.13). Defining g (t) and t0 as in Lemma 9.18
we get g (t) decreasing when t < t0 and
G(N/p)
= exp(g (N/p) − g (N)).
G(N)
Therefore the upper bound for of the hypothesis gives N/p < N < t0 , so
G(N) < G(N/p) and N is not left abundant. This completes the proof of the
lemma.
Theorem 9.21 [35, theorem 7] There are infinitely many colossally
abundant numbers which are not left abundant.
Proof Use Lemma 9.14 to find a prime p which satisfies
√
θ(p) < p − c p logloglog p,
where c is defined in Lemma 9.13, and then let := F(p, 1). Let N be the
largest colossally abundant number of parameter . Using Lemma 9.13 for
the upper bound for θ(p) we get, for p sufficiently large,
√ √ √
log N ≤ θ(p) + c p < p − c p logloglog p + c p < p.
Therefore
log(1 + 1/p) 1 1
= < < .
log p p log p log N loglog N
By Lemma 9.20, N is not left abundant. The result of the lemma then follows
√
since an infinite number of primes satisfy θ(p) < p − c p logloglog p.
A final note is now given here regarding a counting result which uses many
of the ideas in this chapter. A proper left abundant number is defined to
be a left abundant number with more than two prime factors. In [35, theorem
14] the authors showed that there exists a positive constant c such that
log x
Q(x) := #{N ≤ x : N is a proper left abundant number} ≤ exp c .
loglog x
9.7 Unsolved Problems 235
10.1 Introduction
In this chapter a selection of additional equivalences to RH are presented.
This is to show that the techniques used to derive those involving ψ(x), θ(x),
σ(n) and ϕ(n) are not unique, but that other methods and equivalences have
been derived. The equivalences are often surprising. Some are intricate, but
others are easy consequences of other equivalences. As before, in the main,
proofs are included to show depth or shallowness, and to provide the reader
with techniques which could be further developed. There is one notable
exception, namely the symmetric group application of Section 10.10 wherein
only the introductory proofs and ideas are presented. The selection is by no
means exhaustive, and tends to stray away from the arithmetic into other
realms, including the analytic, which is more property the domain of Volume
Two [32].
To begin, in Section 10.2 there is the interesting inequality, the Shapiro
criterion from 1976, which is Theorem 10.1. It is based on an arithmetic
function δ(n) which satisfies
n
ψ(t) dt = log δ(n)
0
236
10.1 Introduction 237
with denominators not greater than n, in increasing order. Then we have the
theorem of Franel and Landau of 1924:
Theorem 10.6 (Franel–Landau criterion) The Riemann hypothesis is equiva-
lent to the statement that for all > 0, as n → ∞,
m
n j
f j −
n1/2+ .
j=1
m
topological but not rigorous proof [161], that RH is equivalent to all of the
zeros of the derivative of ζ(s) in the critical strip being on or to the right
of the critical line. Part of this statement was given rigorous justification by
Levinson and Montgomery in 1974:
Theorem 10.16 (Levinson–Montgomery criterion) If the Riemann hypothe-
sis is true, all non-trivial zeros of the derivative of the Riemann zeta function
ζ (s) in the critical strip have real part greater than or equal to 12 . If the
Riemann hypothesis is false, the number of zeros of ζ (s) in the critical strip,
with positive imaginary part less than T and real part less than 12 , is O(log T ).
A simple inequality equivalence was found by Spira in 1965:
Theorem 10.18 (Spira criterion) The Riemann hypothesis is equivalent to the
statement that if σ > 12 and t ≥ 6.29073 then |ζ(1 − s)| > |ζ(s)|.
As part of a re-examination of a lemma of Lagarias, the present author was
able to improve his Riemann zeta imaginary “zero gap” explicit estimate:
Theorem 10.32 If t ≥ 168π then ζ(s) has a zero with imaginary part satisfying
|t − γ| ≤ 1.5.
Later, in Section 10.9 the details are given of work of Lagarias and
Garunkstis on an inequality equivalence to RH given in terms of the
logarithmic derivative of Riemann’s function ξ(s).
Theorem 10.35 (Garunkstis–Lagarias criterion) Let a satisfy 12 ≤ a ≤ 1. If ζ(s)
has no zeros for σ > a then for s > a we have
% &
ξ (σ) ξ (σ + it)
= inf :t∈R .
ξ(σ) ξ(σ + it)
In Section 10.10 a summary and proofs of foundational results are
presented of a completely different type of equivalence. Nonetheless it was
considered that this should be included because the methods used related
strongly to those used in Chapters 5 and 7. Let g(n) be the maximum
multiplicative order of an element of the symmetric group S n . Then the
theorem of Massias, Nicolas and Robin of 1988 is:
Theorem 10.42 (Symmetric group criterion) The Riemann hypothesis is
equivalent to the statement that for all n sufficiently large we have
log g(n) < li−1 (n) ,
where li(n) is the Cauchy principal value of the logarithmic integral,
n
dt
li(n) := ,
0 log t
and li−1 (n) represents its functional inverse.
10.2 Shapiro’s Criterion 239
Finally in Section 10.11 we touch on the large body of work which goes
under the heading the “Hilbert–Pólya conjecture”. Broadly speaking, this is
the idea that there is a physical system which underlies the zeta function
so that the critical zeros are associated with eigenvalues of a self-adjoint
transformation on a (dense subset of) a Hilbert space.
The following RHpack (see Appendix B) functions relate to the material
in this chapter: FareyFractions, RedhefferMatrix, DivisibilityGraph, Plot-
DivisibilityGraph, Xi, Lambda, S, IntegerHeight and ZetaZeroCount.
for example
δ(11) = 219 · 310 · 56 · 74
and
δ(102) = 2486 · 3288 · 5174 · 7148 · 1191 · 1389 · 1785 · 1983
· 2379 · 2973 · 3171 · 3765 · 4161 · 4359 · 4755 · 5349
· 5943 · 6141 · 6735 · 7131 · 7329 · 7923 · 8319 · 8913 · 975 · 101.
Note that with this definition we have δ(1) = 1, and for n ∈ N, ψ1 (n) = log δ(n).
Theorem 10.1 (Shapiro criterion) The inequality
2
1 n2
− < 36n3 , n ∈ N, (10.1)
1≤ j≤δ(n) k 2
Proof (1) First assume RH is true and let S be the non-trivial zeros of ζ(s).
Then for ρ ∈ S we have 1 − ρ = ρ so by Lemma 2.10 we get
1 1 1
≤ = = γ + 2 − log(4π).
ρ∈S
|ρ| · |1 + ρ| ρ∈S
|ρ| 2
ρ∈S
ρ(1 − ρ)
Using the explicit formula from Theorem 2.2, by RH we have |xρ+1 | = x3/2 .
We also have the sum
∞
1
= log 2,
n=1
2n(2n − 1)
so we get for x ≥ 1
⎛ ⎞
x2 ⎜⎜⎜ ζ ζ ⎟
+ log 2⎟⎟⎟⎟ .
+ +
1 (0) (−1)
ψ1 (x) − ≤ x ⎜⎜⎜⎝
3/2
⎟⎠
2 |ρ| · |1 + ρ|
ρ∈S
ζ(0) ζ(−1)
Thus, by Lemma 2.5 and the evaluation |ζ (−1)/ζ(−1)| = 1.985 . . . < 2 we can
write
x2
ψ1 (x) − ≤ x (γ + 2 − log(4π) + log(2π) + 2 + log 2) < 5x .
3/2 3/2
2
But for all n ∈ N we have
n
1
n
1
−1 + < log n < .
j=1
j j=1
j
Since for x ≥ 1 we have ψ1 (x) = log δ(x), we get
1
− ψ1 (x) < 1.
1≤ j≤δ(x) j
Therefore
1 x2
− < 1 + 5x3/2 ≤ 6x3/2 ,
j≤δ(x) j 2
which gives the inequality (10.1).
(2) Now assume inequality (10.1) is true for all n ∈ N. Then for x a positive
integer we have ψ1 (x) = log δ(x). Hence |ψ1 (x) − x2 /2| < 1 + 6x3/2 . Since
ψ(x) ≤ log x ≤ x log x ≤ x3/2 ,
j≤x
Therefore
ψ1 (x) − x2 /2 9
≤ s+1/2 .
x s+2 |x |
(3) Next, taking the logarithmic derivative of the Euler product for ζ(s),
and using partial summation, the definition of ψ1 (x) and integration by parts,
we can derive for s > 1
∞
ζ (s) ψ(x)
− =s dx
ζ(s) x s+1
∞
1
dψ1 (x)
=s
x s+1
1
∞
ψ1 (x)
= s(s + 1) dx.
1 x s+2
Therefore
∞
ζ (s) s(s + 1) ψ1 (x) − x2 /2
− − = s(s + 1) dx. (10.2)
ζ(s) 2(s − 1) 1 x s+2
The inequality derived in Step (2) implies that the integral of (10.2) converges
to a function of s which is holomorphic on s > 12 . But this means ζ(s) has
no zero in this region, so RH is true. This completes the proof.
Note, as far as numerical checking of inequality (10.1) is concerned, that
the expression
2 log δ(n) − 1 < 12
n2 √n (10.3)
is better.
A subset T ⊂ N is computable if there is an algorithm to determine in
a finite number of steps whether or not an arbitrary given natural number
is a member of T [44]. From the theory of algorithms it follows that RH is
decidable, i.e. its truth or negation are able to be proved. In addition, using the
inequality (10.1) it would be possible to write down an explicit diophantine
equation with integer coefficients which has no solutions in integers if and
only if RH is true. See Davis et al. [44].
This equivalence is not at all obvious, even though it relies on the classical
Littlewood criterion, Theorem 4.16. Before giving the proof we need several
lemmas. The proof is taken in the main from Edwards [51] which was taken
from the work of Franel and Landau of 1924, based on, apparently, the work
of the former [64].
Lemma 10.2 We have for all n ≥ 1
n
#Fn = 1 + ϕ( j).
j=1
For n ∈ N let Bn (x) be the nth Bernoulli polynomial. One way of defining
Bn (x) is that it is the unique polynomial which satisfies
x+1
Bn (t) dt = xn .
x
These polynomials have many properties which are useful. The first few are
B0 (x) = 1,
B1 (x) = x − 12 ,
B2 (x) = x2 − x + 16 ,
B3 (x) = x3 − 32 x2 + 12 x.
Here we use also the related period-1 function bn (x) := Bn (x − x).
Lemma 10.4 For all n, k ∈ N and u ∈ R we have
1 k−1
Bn (ku) = kn−1 Bn (u) + Bn u + + · · · + Bn u + ,
k k
with the same identity holding with Bn replaced by bn .
Proof First we derive the identity for Bn (u). We have for any real x, changing
variables and then splitting the integral domain into k subintervals of equal
length,
x+1/k
1 kx+1
Bn (kt) dt = Bn (u) du = kn−1 xn
x k kx
x+1
= kn−1 Bn (t) dt
x
x+1/k
1 k−1
= k n−1
Bn (t) + Bn t + + · · · + Bn t + dt.
x k k
Because the integrands on both sides of this equation are polynomials, they
must be equal for every value of t. This gives the first identity.
Now let 0 ≤ u < 1/k ≤ 1 so 0 ≤ ku < 1 and we can write, since u + j/k < 1
for all j ≤ k − 1,
k−1
bn (ku) = Bn (ku) = kn−1 Bn (u) + · · · + Bn u +
k
k−1
= kn−1 bn (u) + · · · + bn u + ,
k
so the identity in bn (u) is true for these values of u. If 0 ≤ u < 1 there is an l < k
so u = l/k + (u − l/k) with 0 ≤ u − l/k < 1/k. Then, since bn (u) has period 1,
l l k−l−1
bn (ku) = bn k u − =k n−1
bn u − + · · · + b n u +
k k k
244 Other Equivalents to the Riemann Hypothesis
k−l k−l+1
=k n−1
bn u + + · · · + bn u +
k k
1 k−1
=k n−1
bn (u) + bn u + + · · · + bn u + ,
k k
where, for example, bn (u) occurs as the (l + 1)th term in the sum in the second
line of the derivation. This completes the proof of the lemma.
Now define the “sawtooth function” b(u) := b1 (u) = u − u − 12 .
Lemma 10.5 For all j, k ∈ N we have
1
( j, k)2
I( j, k) := b( ju)b(ku) du = ,
0 12 jk
where ( j, k) on the right-hand side represents the greatest common denomi-
nator (GCD) of j and k.
Proof First let k = 1. Then, since b(u) has period 1, and using Lemma 10.4
with n = 1,
j−1
1 j u 1 1 k u
I( j, 1) = b(u) b du = b(k + u) b + du
j 1 j j k=0 0 j j
1 1
1 u 1 1
= b(u) b j · du = (u − 12 )2 du = .
j 0 j j 0 12 j
Next in case ( j, k) = 1 note that kv/ j modulo 1 when 0 ≤ v ≤ j − 1, since k has
a modulo j inverse, gives each rational v/ j exactly once, so that
1 1 ku
I( j, k) = b(u) b j · du
j 0 j
1 1 1 1
= b(u) b(ku) du = I(k, 1) = .
j 0 j 12 jk
In the general case let c = ( j, k), the GCD, and set j = cα and k = cβ so
(α, β) = 1. Then
1
I( j, k) = b(cαu) b(cβu) du
0
c
1
= b(αu) b(βu) du
c 0
1
= b(αu) b(βu) du
0
1 c2
= I(α, β) = = .
12αβ 12 jk
This completes the proof.
10.3 Farey Fractions 245
Proof (1) The proof is in five steps. First assume the statement is true. Fix
n ∈ N and, for 0 ≤ j ≤ m, write f j instead of f jn . Also use the same notation as
in Lemmas 10.2, 10.3 and 10.4. Let g(u) := exp(2πiu) in identity (10.4) to get
m ∞ k x
e2πi f j = e2πi j/k M .
j=1 k=1 j=1
k
For 1≤ j ≤ m set δ j := f j − j/m, the so-called “discrepancy” at j. Now for
k > 1, kj=1 exp(2πi j/k) = 0 and for k = 1 the sum is 1 = k, so the right-hand
side of this expression simplifies directly to M(x). Thus
m
m j
2πi f j
|M(x)| = e = exp 2πi + δ j
j=1 j=1 m
m m
= e 2πi j/m 2πiδ j
(e − 1) + e2πi j/m
j=1 j=1
m
m
m
≤ |e 2πiδ j
− 1| = 2 |sin(πδ j )| ≤ 2π |δ j |
x1/2+ .
j=1 j=1 j=1
Next in Steps (3) and (4) we use the definition 1 of G(u) and the expression
we have derived to evaluate the integral I2 := 0 G(u)2 du.
(3) Since Fn = {1 − f j : 0 ≤ j ≤ m} we have
m m
G(u) = b(u + 1 − f j ) = b(u − f j )
j=1 j=1
and b(0) = − 12 , b(0−) = 12 . This shows that G(u) decreases by 1 at each f j and
increases like m · u as u increases from f j to f j+1 . Also because
m−1
m−1
m−1
m−1
m−1
( f j − 12 ) = b( f j ) = b(1 − f j ) = (− f j + 12 ) = − b( f j ),
j=1 j=1 j=1 j=1 j=1
we have m−1 j=1 b( f j ) = 0 so limu→0+ G(u) = − 2 . All this goes to show that we
1
(5) For given j, k ≥ 1 let c = ( j, k) and set j = αc, k = βc. By the Riemann
hypothesis and Theorem 4.16, we can assume for every > 0 there is an
implied constant, depending only on , such that M(x)
x1/2+ for all x > 0.
Using this to bound our expression for I2 gives
∞ ∞ 1/2+
( j, k)2 x2
I2
j=1 k=1
12 jk jk
∞
∞
∞
c2
x 1+2
0.4
0.3
0.2
0.1
n
0 20 40 60 80 100
√
Figure 10.1 The Farey sums over n with 1 ≤ n ≤ 100.
FareySumj
0.008
0.006
0.004
0.002
j
0 500 1000 1500 2000 2500 3000
and call it the Redheffer matrix of order n. The determinant of this matrix
was first evaluated by Ray Redheffer in 1976 [138]. It was also evaluated,
using a combinatorial argument, by Herbert Wilf in 2006 [180]. Here we
present the method of Bryan Gillespie from 2011 [67], which better reveals
the underlying structure and gives a vast generalization.
10.4 Redheffer Matrix 249
If f is completely multiplicative this looks much simpler. In this case, for all
n ≥ 1 we have [6, theorem 2.17] f −1 (n) = μ(n) f (n).
Given any arithmetic function f : N → C with f (1) 0, and a natural
number n, define an n × n matrix by Rn ( f ) := (ri, j ) where, for 1 ≤ i, j ≤ n,
ri, j = f ( j/i) if i | j, ri, j = 1 if i > 1 and j = 1 and ri, j = 0 otherwise.
Lemma 10.7 [67, theorem 2.7] Let f be an arithmetic function with f (1) 0.
Then
⎛ ⎞
⎜⎜ n ⎟⎟⎟
n⎜
det Rn ( f ) = f (1) ⎜⎜⎜⎝1 + f ( j)⎟⎟⎟⎠ .
−1
j=2
5 1
In this context, other than the first and last, the vertices in a cycle are distinct.
A cycle is called odd if it consists of an odd number of edges. Similarly even
cycles are defined. Let o(Gn ) be the number of odd cycles of Gn and e(Gn )
the number of even cycles. Then we have the following parity equivalence to
RH:
Theorem 10.10 (Divisibility graph criterion) [9] The Riemann hypothesis
is true if and only if for all > 0, as n → ∞, we have
|o(Gn ) − e(Gn )|
n1/2+ .
Proof Let In be the n × n identity matrix and set Bn := Rn − In . Let Gn be
the directed graph with adjacency matrix Bn = (bi, j ) so the vertices of Gn are
{1, . . . , n} and an edge in Gn connects i to j if and only if bi, j = 1. Then Gn is a
divisibility graph. Define a polynomial in Z[x] by
qn (x) := det(Bn + xIn ).
Each cycle
1 → i1 → i2 → · · · → ik−1 → 1
in Gn , starting and ending at the vertex 1, having length k and distinct i j 1,
contributes to the determinant of Bn + xIn a term with value (−1)k−1 times the
determinant of the (n − k) × (n − k) submatrix which is obtained by deleting all
rows and columns indexed 1, i1 , i2 , . . . , ik−1 from that matrix. The submatrix is
upper triangular, so its determinant is simply xn−k , the product of terms on the
diagonal. Every cycle includes 1. Therefore each k-cycle in Gn contributes
(−1)k−1 xn−k to det(Bn + xIn ). In addition to cycles, the determinant includes
the diagonal xn and no other non-zero terms.
Now let c(n, k) be the number of distinct cycles of length 2 ≤ k ≤ n in Gn , so
n
qn (x) = xn + (−1)k−1 c(n, k)xn−k .
k=2
Then
n
det Rn = qn (1) = 1 + (−1)k−1 c(n, k) = 1 + o(Gn ) − e(Gn ).
k=2
Therefore
|o(Gn ) − e(Gn )| = |det(Rn ) − 1|.
By Theorem 10.9, it follows that RH is true if and only if for all > 0, |o(Gn )−
e(Gn )|
n1/2+ , which completes the proof.
252 Other Equivalents to the Riemann Hypothesis
Then we have:
∞
Lemma 10.11 If σ > 0 the series n=1 (−1)
n+1
/n s converges and has sum
η(s).
Proof A simple calculation shows that none of the zeros of the factor 1 − 21−s
have real part in (0, 1).
10.7 The Derivative of ζ(s) 253
Recall Table 2.1 in Section 2.2. Let T ≥ e and denote by N(T ) the number
of zeros, including multiplicity, of ζ(s) in the rectangle [0, 1] × [0, T ] ⊂ C.
Then
T T T 7
N(T ) − log − + < 0.122 log T + 0.278 loglog T + 2.510.
2π 2π 2π 8
(10.6)
then we have
1
1
I := = − − σ I1 . (10.9)
ρ
s−ρ 2
By Lemma 10.13 we have an estimate for the logarithmic derivative of Γ(w)
where w = u + iv:
Γ (w) 1 1
= log w − + R, where |R| ≤ , |w| ≥ 2, u ≥ 0.
Γ(w) 2w 10|w|2
Therefore for |s| ≥ 3 and σ ≥ 0 we can write
Γ (s/2 + 1) σ+2 2
= log |1 + s/2| − + R1 , where |R1 | ≤ .
Γ(s/2 + 1) |s + 2|2 5|s + 2|2
(10.10)
10.7 The Derivative of ζ(s) 255
By (10.6), (10.7)–(10.10) and using β = 12 for |γ| < 103 , we see that when t =
10 and σ ∈ [0, 12 ] we have (ζ (s)/ζ(s)) < 0. This was the method of Levinson
and Montgomery. Alternatively it is much easier to check this condition
numerically, where we find that in the given range (ζ (s)/ζ(s)) < −0.2.
Consider now the line σ = 0. By (10.8), using 0 < β ≤ 12 all terms in I1 are
positive, so I < 0. Then using (10.7) and (10.10), when t ≥ 10, evaluating the
bounds for each term in (10.7), we have (ζ (s)/ζ(s)) < 0 also.
On the line σ = 12 , unless we are at a zero of ζ(s), we have I = 0. If ρ0 =
1
2
+ iγ0 is a zero on this line then the first sum for I1 is positive and, in the
second sum, the term 1/|s−ρ0 |2 can be made arbitrarily large for s sufficiently
close to ρ0 . Thus on a sufficiently small semicircle with centre ρ0 and with
σ < 12 we get I1 > 0, and thus I < 0. Hence on these semicircles we also have
(ζ(s)/ζ(s)) < 0. Replacing the segment σ = 12 by a suitably indented contour
then gives (ζ (s)/ζ(s)) < 0 for t ≥ 10 on that contour.
Now assume there exists a real sequence (T j ) j≥0 with T j → ∞ such that
(ζ (s)/ζ(s)) < 0 on the segment (0, 12 )×{T j }. On the indented contour around
[10i, 1
2
+ 10i] × [iT j , 1
2
+ iT j ]
we must have (ζ (s)/ζ(s)) < 0, so the change in argument of ζ (s)/ζ(s) is
zero on the contour. By the principle of the argument, this implies the number
of zeros of ζ (s) and ζ(s) are the same inside each of these contours, and the
proof is complete in this case.
Now suppose no such sequence (T j ) exists. This requires that there exists
a t0 > 0 such that, for all t ≥ t0 , (ζ (s)/ζ(s)) ≥ 0 for s = σ + it and some
σ ∈ (0, 12 ). Because in (10.7) for t sufficiently large the first term on the right
can be made arbitrarily small and the third term arbitrarily large and negative,
this implies I > 0, so at least one term in the sum expression for I1 must be
negative. Therefore there exists a β < 12 such that
( 12 − β)2 > (t − γ)2 + (σ − 12 )2
so |t − γ| < 12 . If t = n is a sufficiently large integer, there must be at least one
zero ρ = β+iγ of zeta with β < 12 and |γ −n| < 12 . This shows N − (T ) ≥ T +O(1).
Now one can use Lemma 10.14 to show that the change in argument of
ζ(σ + it) and of ζ (σ + it) for fixed t ≥ t1 and σ ∈ [0, 12 ] are both O(log t).
Alternatively, this is Step (4) of Theorem I.1 of Volume Two [32]. These
derivations Also use the estimates |ζ(s)|
t A and |ζ (s)|
t A for −3 ≤ σ ≤ 1
and some A > 0 which is property (9) in Section 2.2.
Using this and the negativity (ζ (s)/ζ(s)) on the other three components
of the indented rectangular contour around [0, 12 ] × [10, t] completes the
proof.
Theorem 10.16 [107, theorem 1; 162] If the Riemann hypothesis is true, all
non-trivial zeros of the derivative of the Riemann zeta function ζ (s) in the
256 Other Equivalents to the Riemann Hypothesis
critical strip have real part greater than or equal to 12 . Even if the Riemann
hypothesis is false, the number of zeros of ζ (s) in this strip with imaginary
part less than T > 0 and real part less than 1/2 is O(log T ).
Proof If the Riemann hypothesis holds, then for all T > 0, N − (T ) = 0 so
N − (T ) > T + O(1) is false for all T ≥ T 0 . Hence, by Lemma 10.15, N1− (T ) =
N − (T ) = 0 for all T > 0, so ζ (s) has no zero with 0 < σ < 12 .
Conversely if N1− (T ) = 0 for all T > 0 then again, by Lemma 10.15, N − (T ) =
O(log T ). (Note that this is not sufficient to give the Riemann hypothesis.)
Recently Suriajaya derived an asymptotic formula for the number of zeros
of the kth derivative of ζ(s), assuming RH [164].
where we have used the mean value theorem to obtain η with 12 < η < σ.
If 0 ≤ x ≤ 12 we have 12 log(1 − x) ≥ −x. Therefore for |t| ≥ 12 we can derive
the lower bound
⎛ ⎞
1 ⎜⎜⎜⎜ | sin π(σ − 12 )| ⎟⎟⎟ | sin π(σ − 12 )| 1 −π|t|
log ⎝1 − ⎟⎠ ≥ − > −2π σ − e .
2 cosh πt cosh πt 2
Therefore, dividing by σ − 12 we get
h(s) ∂
> log |Γ(σ + it)| − 2πe−π|t| − log 2π.
σ − 12 ∂σ σ=η
1 σ σ2 − t 2
> log(σ2 + t2 ) − −
2 2(σ2 + t2 ) 12(σ2 + t2 )2
√ ∞
3 1
− dx.
36 0 ((σ + x)2 + t2 )2
To bound the integral let y := σ + x and then tan θ := y/|t| to get
∞ ∞
1 1
dx ≤ dy
0 ((σ + x) + t ) 0 (y + t )
2 2 2 2 2 2
π/2
1 π
= 3 cos2 (θ) dθ = .
|t| 0 4|t|3
Using this bound and 12 < σ < 1 we get
√
h(s) 1 3π
> log |t| − 2 − − 2πe−π|t| − log 2π.
σ − 12 2t 72|t|3
When |t| ≥ 2π + 0.1 the right-hand side is strictly positive, and therefore so is
h(s). This completes the proof of the lemma.
258 Other Equivalents to the Riemann Hypothesis
Note that, for s = 12 + it, h(s) = 1, and for σ > 12 , ∂h(σ + it)/∂σ can easily
be shown, using the method of Lemma 10.17, to be quite positive.
Theorem 10.18 (Dixon–Spira criterion) [47, 162] The Riemann hypothesis
is equivalent to the statement that, if 12 < σ < 1 and t ≥ 6.29073, then
|ζ(1 − s| > |ζ(s)|.
Proof If the Riemann hypothesis is false and s is a zero not on the critical
line, then both sides of the inequality are zero, so it is false. If the Riemann
hypothesis is true, then Lemma 10.17 shows the inequality is true.
Corollary 10.19 For |t| > 10 and 1 ≥ σ > 1
2
we have tσ−1/2 > |Λ(σ + it)| >
t(σ−1/2)/5 .
Proof First we derive an upper bound for h(s)/(σ − 12 ), where h(s) is defined
in Lemma 10.17. We have
cos(πσ) |sin(π(σ − 12 ))| 1 −π|t|
log 1 + ≤ < π σ− e
cosh(πt) cosh(πt) 2
and
∂ 1 σ σ2 − t 2
log |Γ(σ + it)| ≤ log(σ2 + t2 ) − −
∂σ 2 2(σ2 + t2 ) 12(σ2 + t2 )2
√ ∞
3 1
+ dx
36 0 ((σ + x)2 + t2 )2
√
1 1 3π
≤ log(σ + t ) +
2 2
2
+ .
2 12t 72|t|3
Therefore since σ ≤ 1
√
h(s) 1 1 3π
< log(1 + t ) +
2
+ − log(2π)
σ− 2 2
1
12t 2 72|t|3
√
1 3π √
≤ log |t| + + − log( 2π) < log |t|
12t2 72|t|3
for |t| > 1, giving the upper bound.
Next we derive a lower bound. Using the notation and argument of
Lemma 10.17 we first evaluate the integral:
∞
dx tan−1 (t/σ) σ
I(σ, t) := = − 2 2 2 .
0 ((σ + x)2 + t2 )2 2|t| 3 2t (σ +t )
Therefore, if we define as before
σ σ2 − t 2 1
F(σ, t) := − − + log(t2 + σ2 ),
2(t + σ ) 12(t + σ )
2 2 2 2 2 2
10.9 The Real Part of the Logarithmic Derivative of ξ(s) 259
we can write
√
h(s) 3
> F(σ, t) − I(σ, t) − 2πe−π|t| − log 2π.
σ− 21
36
Since the integral is decreasing in σ, and F(σ, t) is increasing, we can
substitute σ = 12 and obtain a lower bound
√
h(s) 3 1
> F( 2 , t) −
1
I( , t) − 2πe−π|t| − log 2π.
σ − 12 6 2
Then for all σ > 12 we get h(s)/(σ − 12 ) > G(t) where
1 4t2 − 1 1
G(t) := log(4t2 + 1) + + √
2 3(4t + 1)
2 2
12 3t2 (4t2 + 1)
−1
1 tan (2t)
− 2 − √ − 2πe−πt − log(4π).
4t + 1 24 3 |t|3
Finally this expression is greater than 0.201675 log |t| > log |t|/5 for all |t| ≥ 10.
This completes the proof.
This concept, considered along with Theorem 10.23, sheds new light on the
Riemann hypothesis and its generalizations.
Lemma 10.20 The zeros of the Riemann function ξ(s) form an admissible
zero set.
Proof By Lemma 2.8, Step (2), ρ 1/|ρ|2 converges.
In what follows we need the Weierstrass product, which is developed and
proved in Volume Two [32, Theorem B.13]. See also Alfors [4, chapter 5].
Lemma 10.21 (Weierstrass product) The Weierstrass product
s
m(ρ)
s
s
m(ρ)
fΩ (s) := sm(0)
1− 1− 1−
ρ∈Ω∩R
ρ ρ∈Ω, ρ>0
ρ ρ̄
We claim fΩ,n → fΩ uniformly on K. Let > 0 be given. By Steps (1) and
(2) there is a r > 0 such that the corresponding L is a compact subset of V.
Therefore fΩ,n → fΩ uniformly on L and thus there is an N such that for all
s ∈ L and n ≥ N we have
| fΩ,n (s) − fΩ (s)| < r.
In particular this inequality holds on each B(s, r] for every s ∈ K, so by
Cauchy’s bound applied to fΩ,n (s) − fΩ (s) we have for all s ∈ K and n ≥ N
| fΩ,n (s) − fΩ (s)| < .
(4) Since the convergence is uniform on compact subsets K for both fΩ,n
and fΩ,n , and fΩ,n (s) 0 on K, convergence is uniform for the logarithmic
derivative also. This completes the proof.
Theorem 10.23 [98, theorem 1.1] If Ω ⊂ C is an admissible zero set with
corresponding entire function fΩ (s) then the following two conditions are
equivalent for a given real θ:
(a) All elements ρ ∈ Ω have ρ ≤ θ.
(b) For s > θ we have
f (s)
Ω > 0.
fΩ (s)
Proof If s > θ and (b) is true, then fΩ (s)/ fΩ (s) is well defined at s ∈ C,
hence does not have a pole, so s Ω. Thus if ρ ∈ Ω we have ρ ≤ θ, which is
condition (a).
Since Ω is admissible, by Lemma 10.22 there is a Mittag–Leffler expansion
for fΩ (s)/ fΩ (s). If (a) is true and s > θ then s > ρ for all ρ ∈ Ω. We
claim that if c > 0 is real and ρ = σ + iγ, z = x + iy, then if x > σ we have
(c/(z − ρ)) > 0. This follows directly from the identity
c c(x − σ)
= .
z−ρ (x − σ)2 + (y − γ)2
262 Other Equivalents to the Riemann Hypothesis
It follows that every term in the expansion of ( fΩ (s)/ fΩ (s)) is positive and
therefore so is the sum. Therefore condition (b) is true also.
In Garunkstis [66] from 2002, there is a better result than that given in [24],
and we give the details of this below. Let 12 ≤ a ≤ 1 and suppose that ζ(s) has
no zero when σ > a. Then Garunkstis [66] showed that for all such σ we have
for all t ∈ R
ξ (σ + it) ξ (σ)
≥ .
ξ(σ + it) ξ(σ)
There are a number of lemmas leading to this result; some are due to
Lagarias [98].
Note that (see Volume Two [32, Theorem 4.6] or Edwards [51, chapter 2])
we can write the Weierstrass product form of ξ(s) valid for all s ∈ C
s
ξ(s) = ξ(0) 1− , (10.12)
ρ
ρ
where the product is over all of the complex zeros ρ of zeta taken by
combining complex conjugate pairs.
Lemma 10.24 [98; 99, lemma 3.1] For non-zero t, σ and γ we have
1 1 2
+ 2 ≥ 2 ⇐⇒ 3γ2 ≥ σ2 + t2 . (10.13)
σ + (t + γ)
2 2 σ + (t − γ)2 σ + γ2
Proof No denominator is zero, so clearing denominators of the first
inequality, dividing by 2 and simplifying gives the second inequality. These
steps can be reversed so the second implies the first.
Note that if t = 0 the left-hand inequality is an equality.
Lemma 10.25 [66, lemma 2] Assume all zeros ρ = β +iγ of ζ(s) satisfy β < a.
For a < σ ≤ 10 and 0 < |t| ≤ 21 we have
ξ (σ + it) ξ (σ)
> .
ξ(σ + it) ξ(σ)
Proof First note that 1 − a < β < a and |γ| ≥ 14.134 . . . . If we set σ0 := σ − β
and let t satisfy |t| ≤ 21 we get 3γ2 > 3(14)2 > 102 + 212 ≥ σ20 + t2 so the second
inequality of Lemma 10.24 holds strictly, hence so does the first. Since σ > a
we can write using (10.12)
ξ (σ + it) σ−β σ−β
= +
ξ(σ + it) γ>0
(σ − β)2 + (t − γ)2 (σ − β)2 + (t + γ)2
2(σ − β) ξ (σ)
> =
γ>0
(σ − β)2 + γ2 ξ(σ)
and the proof of the lemma is complete.
10.9 The Real Part of the Logarithmic Derivative of ξ(s) 263
√
Lemma 10.26 [66, lemma 3] Let 1
2
≤ β ≤ a ≤ 1, σ > a and γ ≥ (σ + 12 )/ 3.
Then we have two inequalities:
σ−a σ+a−1 σ−β σ+β−1
+ ≤ + ,
(σ − a) + γ
2 2 (σ + a − 1) + γ
2 2 (σ − β) + γ
2 2 (σ + β − 1)2 + γ2
σ−β σ+β−1 2(σ − 12 )
+ ≤ .
(σ − β)2 + γ2 (σ + β − 1)2 + γ2 (σ − 12 )2 + γ2
Proof Define
σ− x
f (x) := and g(x) := f (x) + f (1 − x)
(σ − x)2 + γ2
with x ∈ [a − 1, a]. Then
2(σ − x)3 − 6(σ − x)γ2
f (x) = .
((σ − x)2 + γ2 )3
√
Because γ ≥ (σ + 12 )/ 3 we have f (x) ≤ 0 so f (x) is concave down, giving
2 f ((x+y)/2) ≥ f (x)+ f (y). Setting x = β, y = 1−β gives the second inequality.
Since 12 < β ≤ a, g ( 12 ) = 0 and g (x) < 0 for 12 < x ≤ a the first inequality follows
also.
Lemma 10.27 [66, lemma 4] For all a and σ with a < σ ≤ 9/2 we have
ξ (σ)
0< ≤ 0.047(σ − 12 ).
ξ(σ)
Proof Taking the logarithmic derivative of the definition of ξ(s) we get for
ζ(s) 0
ξ (s) 1 1 1 1 Γ (s/2) ζ (s)
= + − log π + + .
ξ(s) s s − 1 2 2 Γ(s/2) ζ(s)
From the Laurent expansion of ζ(s) about s = 1 (Volume Two [32, Theorem
I.2] or [82]) we get, with C being Euler’s constant,
ζ (s) 1
=− + C + O(|s − 1|).
ζ(s) s−1
We also know [Γ ( 12 )/Γ( 12 )]/2 = −C/2 − log 2 (using for example the
representation for Γ (z)/Γ(z) given in Theorem I.1 in Volume Two [32]
– and see Lemma 2.5 in this volume) so we can compute ξ (1)/ξ(1) =
0.0230957 . . . .
Now let A := {ρ : ζ(ρ) = 0, β = 12 , γ > 0} and B := {ρ : ζ(ρ) = 0, β > 12 ,
γ > 0}. Then by Lemma 10.26 we have for σ ≥ a
ξ (σ) 1 1
= +
ξ(σ) γ>0 σ − ρ σ − ρ̄
264 Other Equivalents to the Riemann Hypothesis
2(σ − 12 ) σ−β σ−1+β
= +2 +
ρ∈A
(σ − 12 )2 + γ2 ρ∈B
(σ − β) + γ
2 2 (σ − 1 + β)2 + γ2
2(σ − 1
) 4(σ − 12 )
≤ 2
+
ρ∈A
(σ − 12 )2 + γ2 ρ∈B
(σ − 12 )2 + γ2
2(σ − 12 )
=
(σ − 12 )2 + γ2
γ>0
1
≤2 (σ − 12 ).
γ>0
γ 2
(i) there are two non-trivial zeros or a double zero of ζ(s) with |t − γ| ≤ 5, or
(ii) there is a complex zero ρ = β + iγ of ζ(s) with |t − γ| ≤ 2. Then
ξ (σ + it)) ξ (σ)
≥ .
ξ(σ + it) ξ(σ)
Proof Case (i): In case both zeros, not necessarily distinct, lie on the critical
line, ρ1 = 12 + iγ1 and ρ2 = 12 + iγ2 say, then by Lemma 10.27
ξ (σ + it) σ − 12 σ − 12
≥ +
ξ(σ + it) (σ − 12 )2 + (t − γ1 )2 (σ − 12 )2 + (t − γ2 )2
2(σ − 12 ) ξ (σ)
≥ > .
4 2 + 52 ξ(σ)
In case one of the zeros is off the critical line, consider the two zeros ρ1 =
β + iγ and ρ = 1 − β + iγ with 12 < β and |t − γ| ≤ 5. Then since β < a using
10.9 The Real Part of the Logarithmic Derivative of ξ(s) 265
Lemma 10.26
ξ (σ + it) σ−β σ−1+β
≥ +
ξ(σ + it) (σ − β) + (t − γ)
2 2 (σ − 1 + β)2 + (t − γ)2
σ−β σ−1+β
≥ +
(σ − β) + 25 (σ − 1 + β)2 + 25
2
σ−a σ−1+a
≥ +
(σ − a) + 25 (σ − 1 + a)2 + 25
2
= h(σ, a)(σ − 12 ),
where we define
⎛ ⎞
σ−a σ−1+a ⎜⎜⎜⎜ 1 ⎟⎟⎟⎟
h(σ, a) := + ⎝ ⎠.
(σ − a)2 + 25 (σ − 1 + a)2 + 25 σ − 12
Using Mathematica to evaluate the derivatives of h(σ, a) and simplify the
resulting expressions we find
∂h(σ, a) (2σ − 1)(a2 − 11a − 20 + σ − σ2 )(a2 + 9a − 30 + σ − σ2 )
=−
∂σ ((σ − a)2 + 25)2 ((σ − 1 + a)2 + 25)2
and
∂h(9/2, a) 32(2a − 1)(16a4 − 32a3 − 1288a2 + 1340a − 29 031)
= .
∂a (4a2 − 36a + 181)2 (4a2 + 28a + 149)2
Now we have assumed a ≤ σ ≤ 9/2 and 12 < a ≤ 1 giving ∂h(9/2, a)/∂σ ≤ 0
and thus h(σ, a) ≥ h(9/2, a). In the range a ∈ [ 12 , 1], ∂h(9/2, a)/∂a ≤ 0 giving
h(9/2, a) ≥ h(9/2, 1) = 0.0483518 . . . .
Feeding this inequality back into the lower bound for the logarithmic
derivative of ξ(s) and then using Lemma 10.27 completes the proof in this
case.
Case (ii): If the zero does not lie on the critical line then we have case (i)
which is already covered. Hence we can write ρ = 12 +iγ and assume |t −γ| ≤ 2.
Then by (10.12) we have
ξ (σ + it) σ − 12
≥
ξ(σ + it) (σ − 12 )2 + (t − γ)2
1 1 1
≥ 2 2 σ − = 0.05 σ −
4 +2 2 2
ξ (σ)
≥
ξ(σ)
by Lemma 10.27 and the proof is complete.
266 Other Equivalents to the Riemann Hypothesis
Table 10.1 The first 20 Riemann zeros and gaps.
1 14.134725141734693790 6.8873144970368612022
2 21.022039638771554993 3.9888179413741337706
3 25.010857580145688763 5.4140185457138244471
4 30.424876125859513210 2.5101854618796764804
5 32.935061587739189691 4.6511165710864815666
6 37.586178158825671257 3.3325408533218239302
7 40.918719012147495187 2.4083542687675043321
8 43.327073280914999519 4.6780776002521602084
9 48.005150881167159728 1.7686815965051424540
10 49.773832477672302182 3.1964890000421584622
11 52.970321477714460644 3.4759262193489341602
12 56.446247697063394804 2.9007963055389582753
13 59.347044002602353080 1.4847345220074567646
14 60.831778524609809844 4.2807655234717968166
15 65.112544048081606661 1.9672664814125670536
16 67.079810529494173714 2.4665911816798055384
17 69.546401711173979253 2.5207559633079283296
18 72.067157674481907583 3.6375330246020255858
19 75.704690699083933168 1.4401493697908722044
20 77.144840068874805373 2.1925349513745625501
Note that ρ300 = 0.5 + i541.847 . . . . Table 10.1 gives the first 20 imaginary
parts of the imaginary zeros of zeta. The third column is the size of the gap
to the next zero. Figure 10.4 is a plot of the first 300 coordinate gaps.
We see from the data in Table 10.1 that when 21 ≤ t ≤ 79 there is either a
zero with |t − γ| ≤ 2 or two zeros with |t − γ| ≤ 5.
There was some discussion of the error in finding the number of zeros N(T )
of ζ(s) in the critical strip up to a given height in Chapter 2, Section 2.3. It is
a well-studied problem and well known that we can write
T 7
T 1
N(T ) = log + + S (T ) + O ,
2π 2πe 8 T
where S (T ) is the argument of ζ(s) on the critical line at the point 12 + iT
which is not the ordinate of a zero. The argument is fixed by taking the
value found by continuous variation from an argument value of 0 along the
horizontal line from (∞, iT ) to ( 12 , iT ). At a zero the value is defined to be
limt→0+ S (t). This function and its properties are discussed fully by Edwards
[51, chapters 8 and 9] for example. In this section we use the explicit estimate
of Trudgian:
10.9 The Real Part of the Logarithmic Derivative of ξ(s) 267
γn+1–γn
6
n
0 50 100 150 200 250 300
Figure 10.4 The first 300 imaginary coordinate gaps between zeros of ζ(s).
1.0
0.5
T
410 420 430 440 450
–0.5
–1.0
Proof Firstly
1 1 Γ (1 + 2it) Γ (1 − 2it)
ϑ (t) = − log π + + .
2 4 Γ(1 + 2it) Γ(1 − 2it)
Now use the estimate given in Lemma 10.13, namely
Γ (s) 1 1
= log s − + R1 , where |R1 | ≤ , |s| > 2, s > 0,
Γ(s) 2s 10|s|2
to get
1 2t 21
ϑ (t) = log + R2 , where |R2 | ≤ , t > 2,
2 π 160t2
completing the proof.
Lemma 10.31 For 168π ≤ t < t + h, if ζ(s) has no zero with imaginary part
in [t, t + h] and S (t) has no zero in (t, t + h) then h < 1.5 uniformly in t.
Proof Assume, to get a contradiction, that the given conditions hold with
h = 1.5. If the sign of S (t) is uniformly negative on [t, t + 1.5], because
ϑ(t) is concave up, considering the area of a triangle we can write, using
Lemma 10.30 and (10.14),
t+h
h2
S (u) du ≥ ϑ (t).
t
2π
10.9 The Real Part of the Logarithmic Derivative of ξ(s) 269
One could find the explicit value of T 0 and relate this result to that of
Theorem 10.32 for example.
Lemma 10.34 [98, lemma 3.6] If σ > 9/2 and |t| ≥ 21 then
ξ (σ + it) ξ (σ)
≥ .
ξ(σ + it) ξ(σ)
Proof Let (σ, t) be in the given ranges, so in particular σ > 1. From
the definition of ξ(s), taking the logarithmic derivative and using the
representation of the logarithmic derivative of Γ(z) (see Volume Two [32,
Theorem H.1])
∞
1 Γ (s/2) γ 1 1 1
=− − + − ,
2 Γ(s/2) 2 s n=1 2n 2n + s
and
ζ (s) ∞
Λ(n)
=− cos(t log n).
ζ(s) n=1
nσ
From this we can conclude the minimum of this latter sum is attained at t = 0,
since each cosine term is bounded above by 1 and each coefficient is positive.
Now let
ξ (s) ζ (s) 1 1 1 Γ (s/2) 1
g(s) := − = + + − log π.
ξ(s) ζ(s) s s − 1 2 Γ(s/2) 2
Then
This completes the proof since all the terms in the sum are positive.
Theorem 10.35 [66, p. 2] Let a satisfy 12 ≤ a ≤ 1. If ζ(s) has no zeros for
σ > a then for s > a we have
% &
ξ (σ) ξ (σ + it)
= inf :t∈R .
ξ(σ) ξ(σ + it)
Proof This result follows directly from Lemmas 10.25, 10.27, 10.28
and 10.34, together with Corollary 10.33.
are central to any study of finite groups. This is because of Caley’s theorem,
which states that any group of order n is isomorphic to a subgroup of S n .
Indeed S n has many subgroups; for example S 4 has 30 and S 5 over 100.
These groups appear in many parts of mathematics and its applications,
including Galois theory, the definition of the determinant, symmetries of
objects including molecules, cryptography, etc. For low values of n these
groups have concrete isomorphic realizations: for example S 1 consists only
of the identity, S 2 is the largest Abelian S n , S 3 is the group of rotational and
reflection symmetries of an equilateral triangle, S 4 is the group of rotations
about opposite faces, diagonals and edges of a cube, and S 5 is the Galois
group of the general polynomial of the 5th degree.
In this section we need only a few simple properties of S n . A cycle is
a permutation which can be represented as an m-tuple of distinct elements
(n1 , n2 , . . . , nm ), with m ≤ n, and such that the corresponding permutation is
n1 → n2 → · · · → nm → n1 .
Values of g(n) are given in Table A.6 and in part A000793 of the
Online Encyclopedia of Integer Sequences [158]. Landau’s theorem [103]
is included here as Theorem 10.39. We based our presentation of this result
on the proof of William Miller [118].
In 1939 Shah [156] improved Landau’s asymptotic formula by proving
loglog n 1
log g(n) = n log n 1 + +O .
2 log n log n
10.10 The Order of Elements of the Symmetric Group 273
400
300
200
100
n
200 400 600 800 1000
1≤i≤k ai . Let {ai } be a set with the smallest such sum. Since removing a term
with ai = 1 reduces the sum, but preserves the value of the LCM, we can
assume each ai ≥ 2. If we could write one of the ai as ai = xy with (x, y) = 1 and
both x, y ≥ 2 then since x+y < xy = ai we could replace ai with the two integers
x and y, and again reduce the sum. Hence we can assume each of the ai is
a power of a prime. Finally if two of the ai were powers of the same prime
we could delete the one with the smaller or equalpower, and get a smaller
sum, but the same LCM and so satisfying h(m) > 1≤i≤k ai . But in case the ai
are powers of distinct primes we have h(m) = ki ai , a contradiction, which
proves the claim.
(2) We will now show that there exists a permutation on n elements of order
m if and only if h(m) ≤ n. To see this, first suppose h(m) ≤ n. Let m = 1≤i≤k pαi i
and define a permutation σ ∈ S n with cycle lengths (pαi i ) and n − h(m) fixed
points. Then the order of this permutation is m. If such a permutation of order
m exists let (ai ) be its cycle lengths so their sum is n and LCM is m. Hence,
by Step (1) we have h(m) ≤ n.
(3) Now we claim that among all of the permutations on n symbols with
maximum order there exists at least one with non-trivial cycle orders being
powers of distinct primes: let m be the maximum order of a permutation
so we can write g(n) = m. Then by Step (2) we have h(g(n)) ≤ n. Use the
construction of Step (2), with m having this value g(n), to construct the
required permutation.
10.10 The Order of Elements of the Symmetric Group 275
(4) We can now complete the proof. Let θ := maxh(m)≤n m. Now by Step (3)
using a permutation of maximum order, h(g(n)) ≤ n so g(n) ≤ θ. If on the
other hand m is such that h(m) ≤ n, by Step (2) there is a permutation of order
m so m ≤ g(n). Hence θ ≤ g(n) so θ = g(n) and the proof of the lemma is
complete.
This section includes a version of Landau’s theorem log g(n) ∼ n log n
[105, pp. 222–229] based on the proof in Miller’s paper [118], which on the
face of it looks very similar to Landau’s. In Table A.6 the first 160 values
of g(n) are given. Extensive evaluations were carried out by Morain up to
n = 32 000 with some details
in [120].
For x > 0 let A(x) := p≤x p be the sum of all primes up to x.
Lemma 10.37 As x → ∞ we have
2
x2 x
A(x) = +O .
2 log x log2 x
Proof This is a simple application√ of Abel’s identity [6, theorem 4.2],
splitting the integral for the error at x.
The reader may wish to compare the use of Lemma 10.37 with the method
of Shah [156] who takes A(x) to the fourth order. This is not needed in the
proof, but deriving it could be good practice for the energetic reader.
Lemma 10.38 As x → ∞ we have
2
x2 1 x2 1 x2 x
A(x) = + 2
+ 3
+O .
2 log x 4 log x 4 log x log4 x
Proof First use induction to show that
x
p=− π(n) + π(x)(x + 1).
p≤x n=1
Using these expressions and the prime number theorem in the form
π(n) = Li(n) + O(x exp(−c log x ))
enables us to derive
x x
n n 2n n
π(n) = + + +O
n=1 n=2
log n log2 n log3 n log4 n
276 Other Equivalents to the Riemann Hypothesis
x x
1 1 2 u du
= u + + du + O
2 log u log2 u log3 u 4
2 log u
x2 3 x2 7 x2 x2
= + + + O .
2 log x 4 log2 x 4 log3 x log4 x
Therefore
2
x2 1 x2 1 x2 x
A(x) = p= + + + O .
p≤x
2 log x 4 log2 x 4 log3 x log4 x
This completes the proof.
Theorem 10.39 [105, 118] As n → ∞ we have log g(n) ∼ n log n.
Proof (1) Let n be a positive integer and let P = P(n) be the largest prime
such thatthe sum of the primes less than P does not exceed n and define
F(n) := p<P p. Then because log F(n) = θ(P − 1) and P ∼ P − 1 we have, by
∼ P.
the prime number theorem, log F(n)
(2) Now for x > 0 let A(x) := p≤x p. By the definitions of A(x) and P we
have A(P − 1) ≤ n < A(P). But for all x > 0 we have A(x − 1) ∼ A(x) and
therefore, by Lemma 10.37, we have P2 /(2 log P) ∼ n, which implies that
P2 /(2n log P) → 1.
If P is not asymptotic to n log n then there is an > 0 such that for an
infinite number of values of n one of the inequalities
P ≤ (1 − ) n log n or P ≥ (1 + ) n log n
√
holds. Because x → x2 /log x is increasing for x > e, if the first inequality is
true then
P2 (1 − )2 log n
≤ .
2n log P log n + loglog n + 2 log(1 − )
However, as n → ∞ the right-hand side approaches (1 − )2 but the left-hand
side tends to 1. Similarly the second inequality does not hold for infinitely
many n. Hence P ∼ n log n and so, by Step (1) we have log F(n) ∼ n log n.
(3) Because, by Step (2) of Lemma 10.36, there is a permutation on n
letters of order F(n) we have F(n) ≤ g(n).
(4) Let q1 , . . . , q s be all of the distinct primes dividing g(n) in increasing
order. Then we claim that
s
log q j < 2 + log F(n) + log P.
j=1
Therefore
loglog n 1
α= +O .
2 log n log n
(2) By Steps (1) and (6) of the proof of Theorem 10.39 we have
√
θ(P − 1) ≤ log g(n) ≤ 2 + θ(P − 1) + log P + 2P log(2P).
Therefore, using the result of Step (1),
⎛ ⎛√ ⎞⎞
⎜⎜⎜ ⎜⎜⎜ P log P ⎟⎟⎟⎟⎟⎟
log g(n) = θ(P − 1) ⎜⎝1 + O ⎜⎝ ⎟⎠⎟⎠
θ(P)
1 log P
= (P − 1) 1 + O 1+O √
log P P
1 1 log P
= P 1+O 1+O 1+O √
P log P P
1
= P 1+O
log P
loglog n 1 1
= n log n 1 + +O 1+O
2 log n log n log n
loglog n 1
= n log n 1 + +O
2 log n log n
and this completes the proof.
Figure 10.7 is a plot of log g(n)/ n log n.
Nicolas studied the properties of g(n) in detail in the 1960s, and Massias
continued this study, publishing related work in his Limoges doctoral thesis
in 1985. Several papers were also published on the topic by these two authors
together with Robin. See the references in [112] and [113]. The first set of
results stated below are for the main part proved in detail in Nicolas’ paper
[125], where the methods are similar to those used in Chapter 6. There is
also a summary with sketched proofs in [113]. The RH equivalence is proved
in [112]. The introductory results we have given above should be sufficient
background for the reader to read these papers.
Recall the definition: h(1) = 0 and, if n = p|n pv p (n) is the standard prime
factorization set,
h(n) := pv p (n) .
p|n
0.9
0.8
0.7
0.6
n
0 50 100 150 200
Figure 10.7 Values of the function log g(n)/ n log n for 3 ≤ n ≤ 1000.
It follows from this expression that we have m = g(n) for some n if and only
if m > m implies h(m ) > h(m).
Definition of the set G: we say that n ∈ G if there exists ρ > 0 such that
xii − xii−1
= ρ,
log xi
which enables us to write
∞
nρ = p. (10.19)
i=1 p≤xi
If θ < 1 then
log g(n) = li−1 (n) + O (n log n)θ/2 .
(iii) If there exists a zero of ζ(s) with real part θ > 12 then
log g(n) = li−1 (n) + Ω± (n log n)θ/2 .
(iv) If the Riemann hypothesis is true then for all n sufficiently large
log g(n) < li−1 (n).
(v) If the Riemann hypothesis is true then as n → ∞
⎛ √ ⎞
−1
⎜⎜⎜ 2 − 2 ⎟⎟⎟
log g(n) = li (n) − ⎝ ⎜ ⎟⎠ (n log n)1/4 + Ω± (n log n)1/4 .
3
Theorem 10.42 (Symmetric group criterion) The Riemann hypothesis is
equivalent to the statement that for all n sufficiently large we have
log g(n) < li−1 (n),
where li(n) is the Cauchy principal value of the logarithmic integral
n
dt
li(n) := .
0 log t
relating to the stability of a rotating fluid on the same page as notes on the
zeros of ζ(s) (see du Sautoy [151, p. 286]). The condition for stability of
the fluid subject to perturbation was that a set of eigenvalues should be on a
straight line! An implicit connection between RH and something physical!
Andrew Odlyzko attempted, in the early 1980s while Pólya was still alive,
to trace the formulation of the conjecture back to Pólya and Hilbert by
corresponding with Pólya (see Volume Two [32, Chapter 5]). Pólya recalled
answering a question of Landau as to whether or not there was a physical
reason why RH should be true. Pólya’s answer was that it would be the case
if the nontrivial zeros of the function ξ(s) were so connected with the physical
problem that RH would be equivalent to all of the eigenvalues of that problem
being real. Odlyzko was not able to find any reported statement by Hilbert,
but the name of the conjecture is now well established by use.
While up to the time of writing Hilbert–Pólya has failed to provide a proof
of RH, the conjecture has inspired a great deal of work and progress in
the intersection of mathematics and physics. The main focus has been to
design the Hilbert space H, domain Δ and transformation T . Macroscopic
classical mechanical systems, quantum mechanical and quite a few others
have been investigated, but no one seems to have come close to providing
what is needed. Some believe asking for the transformation to be self-adjoint
is too much, or that the structures should be constructed out of mathematics
alone, i.e. starting with the integers. Sir Michael Berry and his co-workers
have given quite precise specifications on what a successful structure might
need to satisfy [11].
As part of this story of successful synergies between mathematics and
physics, there is the well-known and often retold interaction between Hugh
Montgomery and Freeman Dyson over tea in the Institute for Advanced
Study, Princeton [151, p. 262]. This became related to the great body of
work on random matrices, which was originally developed as a way to study,
statistically, the energy levels of heavy nuclei, i.e. those with large numbers
of neutrons and protons. The relationships with the results of computations
of zeta zero imaginary parts, especially those carried out by Andrew Odlyzko
[79, 130], have been startling.
Background reading might include Conrey [41], Katz and Sarnak [90] and
Rudnick and Sarnak [147]. On the physics side, there is the survey preprint
of Schumayer and Hutchinson [155].
One of the remarkable particular discoveries of random matrix theory
has been very precise expressions for moments of distribution functions,
providing corresponding conjectures for the moments of ζ(s). Here are the
conjectured moments of John Keating and Nina Snaith [91] for ζ(s) for all
k ∈ N and s on the critical line:
k 2
1 T 1
|ζ( 2 + it)|2k dt ∼ gk a(k) log 2π
T
,
T 0
284 Other Equivalents to the Riemann Hypothesis
where
1
k 2
∞
Γ(m + k)
2
1
a(k) := 1−
p
p m=0
m!Γ(k) pm
and
and if
n−2
G(k + 1)2
G(n) := i! =⇒ gk = .
i=0
G(2k + 1)
Estimates for moments of zeta have been very significant, and have been
so for a long time. For example Landau in his treatise of 1908 included the
following theorem for the case k = 1 [51, section 9.7]: Given > 0 and σ0 > 12
there is a T 0 such that for all σ ≥ σ0 and T ≥ T 0 we have
T
1
|ζ(σ + it)| dt − ζ(2σ) < .
2
T −1 1
This estimate was used by Bohr and Landau [51, section 9.6], together with
Jensen’s formula (Volume Two [32, Theorem B.5]), to show that the number
of zeta zeros in any rectangle R := [δ, 1] × [0, T ] for any δ > 12 is bounded
above by Kδ T . Since the total number of zeta zeros N(T ) in [0, 1] × [0, T ]
satisfies N(T ) ∼ T/(2π) log(T/(2πe)), the proportion of roots in R divided by
the total number up to T tends to zero as T → ∞. This result is still one of
the best pieces of analytic evidence for the truth of RH.
For more recent significant developments on the topic of estimates for
moments and references, there is the article of Soundararajan [160].
Searching for a single self-adjoint transformation which would resolve
RH could be asking too much. For example associating each zero with an
individual transformation depending on the zero would give the same result.
The work of Ross Barnett and the author leading to [21], consistent with
random matrix theory, seems to indicate a relationship between rotations and
zeta zeros, which has yet to be fully developed.
Proof If RH is true let H = 2 (N, C) with Euclidean inner product, and for n ∈
N let en := (0, 0, . . . , 1, 0, . . . ) be the nth standard basis element. If ρn = 12 + iηn
10.12 Epilogue 285
Let Δ := {x ∈ H : T (x) < ∞}. Since the linear subspace generated by any
finite subset of the (en ) is in Δ, Δ is dense in H. Since by RH each ηn is real,
T is self-adjoint. Setting T ρ = T completes the derivation.
10.12 Epilogue
Looking back over the equivalences to RH described in Chapters 4
through 10, the reader might be surprised at their variety, but also at the
few properties of ζ(s) which have been used to derive them: von Mangoldt’s
theorem, a simple zero-free region, both known for over 100 years, and a
better critical zero line height H – but even deriving improvements to H often
uses methods based on those dating back to Turing. The functional equation
barely appears, neither does the lower bound for the proportion of zeros on
the critical line, nor upper bounds for zeros possibly off the line.
In addition, the properties of primes that are used are slight and based on
not much more than the prime number theorem – no quadratic reciprocity,
no modular restrictions. It is some explicit estimates for arithmetic quantities
which are used to derive the simple inequalities which have been shown to
be equivalent to RH. Work on these estimates has thus been shown to be
invaluable.
Another observation is that when RH fails, the failure has dramatic
consequences. Just one off-critical-line zero will result in an infinite number
of failures for an equivalent inequality.
One approach to resolving RH, begun as reported in Chapter 8, would
be to further explore properties of potential counterexamples, or sets of
counterexamples, i.e. integers which do not satisfy an equivalent inequality.
Finding a set of restrictions which are inconsistent would be the goal.
Another approach would be to work at weakening a given inequality and
strengthening a related unconditional form, until they merge.
Bringing to bear on equivalences recent better understanding of prime
properties, such as primes in progressions and infinite subsequences of
primes with pn+1 − pn ≤ K for some explicit (and small) value of K, as well
as more classical properties, could also be fruitful. In this way it is properties
of primes which could resolve RH, rather than the reverse!
In addition to the equivalences described here, there are a host of other
equivalences which have been demonstrated. These are often fascinating and,
on the face of it, it is difficult to detect any connection to RH. Many of these
are described in Volume Two [32].
286 Other Equivalents to the Riemann Hypothesis
One thing is for certain – when it comes to equivalences to RH, the last
word has not been spoken.
σ(n)
G(n) := , n ≥ 2.
n loglog n
1 25 · 32 · 5 · 7 0.0252580790
2 2 · 3 · 5 · 7 · 112 · 132 · 171 · · · 1131
8 5 3 2
0.0238889777
3 28 · 35 · 53 · 72 · 112 · 132 · 171 · · · 1271 0.0237040357
4 28 · 35 · 53 · 72 · 112 · 132 · 171 · · · 1311 0.02342819838
5 28 · 35 · 53 · 72 · 112 · 132 · 171 · · · 1371 0.0232960582
6 28 · 35 · 53 · 72 · 112 · 132 · 171 · · · 1391 0.0228625950
7 29 · 35 · 53 · 72 · 112 · 132 · 171 · · · 1391 0.0228130145
287
288 Tables
σ(n)
> 1.
neγ loglog n
1 1 1 —
2 2 2 −2.29784
3 3 3 7.95991
4 4 2 3.00812
5 5 5 1.41579
6 6 2·3 1.92545
7 8 23 1.43797
8 9 32 1.03024
9 10 2·5 1.21174
10 12 22 · 3 1.43927
11 16 24 1.06673
12 18 2 · 32 1.14614
13 20 22 · 5 1.07462
14 24 23 · 3 1.21395
15 30 2·3·5 1.10079
16 36 22 · 32 1.1196
17 48 24 · 3 1.07517
18 60 2 ·3·5
2
1.11527
19 72 23 · 32 1.04641
20 84 2 ·3·7
2
1.00581
21 120 23 · 3 · 5 1.07559
22 180 22 · 32 · 5 1.03387
23 240 24 · 3 · 5 1.0231
24 360 23 · 32 · 5 1.02942
25 720 24 · 32 · 5 1.00087
26 840 23 · 3 · 5 · 7 1.0094
27 2 520 23 · 32 · 5 · 7 1.01322
28 5 040 24 · 32 · 5 · 7 1.00556
A.3 Superabundant Numbers 289
1 1 1 —
2 2 2 −2.29784
3 4 22 1.750
4 6 2·3 2.000
5 12 22 · 3 2.333
6 24 23 · 3 2.500
7 36 22 · 32 2.528
8 48 24 · 3 2.583
9 60 22 · 3 · 5 2.800
10 120 23 · 3 · 5 3.000
11 180 22 · 32 · 5 3.033
12 240 24 · 3 · 5 3.100
13 360 23 · 32 · 5 3.250
14 720 24 · 32 · 5 3.385
15 840 23 · 3 · 5 · 7 3.429
16 1 260 22 · 32 · 5 · 7 3.467
17 1 680 24 · 3 · 5 · 7 3.543
18 2 520 23 · 32 · 5 · 7 3.714
19 5 040 24 · 32 · 5 · 7 3.838
20 10 080 25 · 32 · 5 · 7 3.900
21 15 120 24 · 33 · 5 · 7 3.937
22 25 200 24 · 32 · 52 · 7 3.966
23 27 720 23 · 32 · 5 · 7 · 11 4.052
24 55 440 24 · 32 · 5 · 7 · 11 4.187
25 110 880 25 · 32 · 5 · 7 · 11 4.255
26 166 320 24 · 33 · 5 · 7 · 11 4.294
27 277 200 24 · 32 · 52 · 7 · 11 4.327
28 332 640 25 · 33 · 5 · 7 · 11 4.364
29 554 400 25 · 32 · 52 · 7 · 11 4.396
30 665 280 26 · 33 · 5 · 7 · 11 4.398
31 720 720 2 · 32 · 5 · 7 · 11 · 13
4
4.509
290 Tables
1 2 2 2
2 6 2·3 3
3 12 22 · 3 2
4 60 22 · 3 · 5 5
5 120 23 · 3 · 5 2
6 360 23 · 32 · 5 3
7 2 520 23 · 32 · 5 · 7 7
8 5 040 24 · 32 · 5 · 7 2
9 55 440 24 · 32 · 5 · 7 · 11 11
10 720 720 24 · 32 · 5 · 7 · 11 · 13 13
11 1 441 440 25 · 32 · 5 · 7 · 11 · 13 2
12 4 324 320 25 · 33 · 5 · 7 · 11 · 13 3
13 21 621 600 25 · 33 · 52 · 7 · 11 · 13 5
14 367 567 200 25 · 33 · 52 · 7 · 11 · 13 · 17 17
15 6 983 776 800 25 · 33 · 52 · 7 · 11 · 13 · 17 · 19 19
16 160 626 866 400 25 · 33 · 52 · 7 · 11 · 13 · 17 · 19 · 23 23
17 321 253 732 800 26 · 33 · 52 · 7 · 11 · 13 · 17 · 19 · 23 2
18 9 316 358 251 200 26 · 33 · 52 · 7 · 11 · 13 · 17 · 19 · 23
· 29 29
19 288 807 105 787 200 26 · 33 · 52 · 7 · 11 · 13 · 17 · 19 · 23
· 29 · 31 31
20 2 021 649 740 510 400 26 · 33 · 52 · 72 · 11 · 13 · 17 · 19 · 23
· 29 · 31 7
21 6 064 949 221 531 200 2 · 3 · 5 · 7 · 11 · 13 · 17 · 19 · 23
6 4 2 2
· 29 · 31 3
22 224 403 121 196 654 400 26 · 34 · 52 · 72 · 11 · 13 · 17 · 19 · 23
· 29 · 31 · 37 37
23 9 200 527 969 062 830 400 26 · 34 · 52 · 72 · 11 · 13 · 17 · 19 · 23
· 29 · 31 · 37 · 41 41
24 c24 26 · 34 · 52 · 72 · p5 · · · p14 43
25 c25 27 · 34 · 52 · 72 · p5 · · · p14 2
26 c26 27 · 34 · 52 · 72 · p5 · · · p15 47
27 c27 27 · 34 · 52 · 72 · p5 · · · p16 53
28 c28 27 · 34 · 52 · 72 · p5 · · · p17 59
29 c29 27 · 34 · 53 · 72 · p5 · · · p17 5
30 c30 27 · 34 · 53 · 72 · p5 · · · p18 61
A.5 Primes to Make Colossally Abundant Numbers 291
{1, 2, 3, 4∗ , 5, 6∗ , 8∗ , 9, 10∗ , 12∗ , 14∗ , 15, 16∗ , 18∗ , 20∗ , 22∗ , 24∗ , 26∗ ,
28∗ , 30, 36, 40, 42, 48, 50, 54, 60∗ , 66, 70, 72, 78, 84, 90∗ , 96, 102, 108,
114, 120∗ , 126, 132, 138, 140, 144, 150∗ , 156, 162, 168, 174, 180∗ , 186,
192, 198, 204, 210, 216, 222, 228, 234, 240, 246, 252, 258, 264, 270,
276, 294, 300, 306, 312, 330, 336, 342, 360, 378, 390, 396, 420∗ , 450,
462, 468, 480, 504, 510, 528, 540, 546, 570, 588, 600, 630∗ , 660, 672,
690, 714, 720, 750, 756, 780, 798, 810, 840∗ , 858, 870, 882, 900, 924,
930, 960, 966, 990, 1008, 1020, 1050∗ , 1080, 1092, 1110, 1122, 1134,
1140, 1170, 1176, 1200, 1218, 1230, 1260∗ , 1290, 1302, 1320, 1350,
1380, 1386, 1410, 1428, 1440, 1470∗ , 1500, 1530, 1554, 1560, 1590,
1596, 1620, 1638, 1650, 1680∗ , 1710, 1722, 1740, 1770, 1800, 1830,
1848, 1860, 1890∗ , 1920, 1932, 1950, 1980, 2010, 2040, 2070, 2100∗ ,
2130, 2142, 2160, 2184, 2190, 2220, 2250, 2280, 2310, 2340, 2370,
2394, 2400, 2430, 2460, 2490, 2520, 2550, 2580, 2610, 2640, 2670,
2700, 2730, 2760, 2772, 2790, 2820, 2850, 2856, 2880, 2910, 2940,
2970, 3000, 3030, 3060, 3090, 3120, 3150, 3180, 3210, 3234, 3240,
3270, 3276, 3300, 3330, 3360, 3390, 3420, 3450, 3480, 3510, 3540,
3570, 3600, 3630, 3660, 3690, 3696, 3720, 3780, 3810, 3822, 3870,
3900, 3930, 3960, 3990, 4020, 4080, 4110, 4140, 4158, 4170, 4200,
4230, 4260, 4290, 4350, 4368, 4380, 4410, 4440, 4560, 4590, 4620∗ ,
4650, 4680, 4770, 4830, 4920, 4950, 5040, 5070, 5082, 5100, 5130,
5160, 5220, 5250, 5280, 5460, 5520, 5544, 5550, 5580, 5610, 5670,
5700, 5850, 5880, 5940, 6006, 6090, 6120, 6210, 6240, 6270, 6300,
6510, 6600, 6630, 6720, 6840, 6900, 6930∗ , 6960, 7020, 7140, 7260,
7350, 7410, 7560, 7590, 7650, 7770, 7800, 7854, 7920, 7980, 8160,
8190, 8250, 8280, 8400, 8550, 8580, 8610, 8670, 8778, 8820, 8910,
8970, 9030, 9120, 9180, 9240∗ , 9282, 9360, 9450, 9570, 9660, 9690,
9750, 9870, 9900}
A.8 Maximum Order of an Element of the Symmetric Group 293
Table A.5 Initial values of h(n); compare with item A008475 in OEIS [158].
1 0 13 13 25 25 37 37
2 2 14 9 26 15 38 21
3 3 15 8 27 27 39 16
4 4 16 16 28 11 40 13
5 5 17 17 29 29 41 41
6 5 18 11 30 10 42 12
7 7 19 19 31 31 43 43
8 8 20 9 32 32 44 15
9 9 21 10 33 14 45 14
10 7 22 13 34 19 46 25
11 11 23 23 35 12 47 47
12 7 24 11 36 13 48 19
Table A.6 Initial values of g(n); compare with item A000793 in OEIS [158].
1 1 13 60 25 1 260 37 13 860
2 2 14 84 26 1 260 38 16 380
3 3 15 105 27 1 540 39 27 720
4 4 16 140 28 2 310 40 30 030
5 6 17 210 29 2 520 41 32 760
6 6 18 210 30 4 620 42 60 060
7 12 19 420 31 4 620 43 60 060
8 15 20 420 32 5 460 44 60 060
9 20 21 420 33 5 460 45 60 060
10 30 22 420 34 9 240 46 60 060
11 30 23 840 35 9 240 47 120 120
12 60 24 840 36 13 860 48 120 120
Appendix B
RHpack Mini-Manual
B.1 Introduction
This appendix is the manual for a set of functions written to assist the reader
to reproduce, and possibly extend, the calculations mentioned in the main
part of the book. The software for the package is provided over the world
wide web at the webpage for the book linked to the author’s homepage:
www.math.waikato.ac.nz/∼kab
and is in the form of a standard Mathematica add-on package. To use the
functions in the package you will need to have a version of Mathematica at
level 7.0 or higher.
B.1.1 Installation
First connect to the website given in the paragraph above and click on the
link for RHpack listed under the heading “Software” to get to the RHpack
homepage. Instructions on downloading the files for the package will be
given on the homepage. If you have an earlier version of RHpack, first delete
the file RHpack.m. The name of the package file is RHpack.m. To install
the package, if you have access to the file system for programs on your
computer, place a copy of the file in the standard repository for Mathematica
packages or any other directory, which will be listed by evaluating $Path in
Mathematica, to which you have access. You can then type <<RHpack.m
and then press the Shift and Enter keys to load the package. You may need
administrator or super-user status to complete this installation. Alternatively,
place the package file RHpack.m anywhere in your own file system where it
is safe and accessible.
Instructions for Windows systems The package can be loaded by typing
SetDirectory["c:\\your\\directory\\path"]<Shift/Enter>
<<RHpack.m;ResetDirectory[]<Shift/Enter>
294
B.1 Introduction 295
or
Get["RHpack.m",Path->{"c:\\your\\directory\\path"}]
<Shift/Enter>
where the path-name in quotes should be replaced by the actual path-name
of directories and subdirectories which specify where the package has been
placed on the computer you are using. If the double backslash does not work,
try a single backslash.
Instructions for Unix/Linux systems These are the same as for
Windows, but the path-name syntax style should be like
/usr/home/your/subdirectory
Instructions for Macintosh systems These are the same as for Windows,
but the path-name syntax style should be like
HD:Users:ham:Documents:
All systems The package should load printing a message. The functions
of RHpack are then available to any Mathematica notebook you subsequently
open.
BacklundB (blb)
This function computes an upper bound for the error in the approx-
imation to the number of critical zeta zeros up to height T , given
by
|N(T ) − F(T )| ≤ B(T ).
We have B(T ) := a1 log T + a2 loglog T + a3 where the values ai are stored
in the global variables $a1 , $a2 , $a3 , and are best possible at the time of
writing. See Section 2.2.
BacklundB[T, d] −→ r
T is a positive real number representing the height up to which zeta zeros
are being counted,
d is a precision to which evaluations to compute B(T ) will be made
internally,
r is a floating-point number being the value of B(T ).
CAPrimes (cap)
This function computes a list of primes for generating colossally
abundant numbers – see Table A.5. This has been shown to work up
to n = 1011 .
CAPrimes[n] −→ L
n is a natural number being the number of primes which are to be generated,
L is an ordered list of primes such that the product of the first j primes is the
jth colossally abundant number.
B.2 RHpack Functions 297
CheckCNk (cnk)
This function computes the k ≤ k0 such that c(N1 ) ≤ c(Nk ) ≤ c(N66 ) fails,
to precision d – see Theorem 5.34.
CheckCNk[k0, d] −→ L
k0 is an integer being the upper bound for which the values of c(Nk ) are
checked,
d is a natural number being the precision to which internal calculations are
made,
L is a list of values of k for which either c(Nk ) < c(N1 ) or c(Nk ) > c(N66 ).
ColossallyAbundantF (caf)
This function evaluates the function F(x, α) defined in Section 6.3, to
precision d.
ColossallyAbundantF[x, a, d] −→ r
a is a natural number,
x is a real number greater than 1,
d is a natural number being the precision for internal computations,
r is a floating-point number being the value of
1
log x 1 + a a−1 .
x + x + ··· + x
ColossallyAbundantInteger (cai)
This function returns the colossally abundant integer corresponding to a
given e > 0 – see Section 6.3.
ColossallyAbundantInteger[e] −→ L
e is a positive real number,
L is a list, the first element being a colossally abundant number, the second
being a list of prime powers for the prime divisors of n, in ascending
order.
ColossallyAbundantQ (caq)
This function determines whether n is a colossally abundant integer –
see Table A.4.
ColossallyAbundantQ[n] −→ P
n is a natural number,
P is True if n is colossally abundant and False otherwise.
298 RHpack Mini-Manual
CompositeQ (coq)
This function determines whether or not n is a composite integer.
CompositeQ[n] −→ P
n is a natural number,
P is True if n is composite, False otherwise.
ComputeC (coc)
This function evaluates c(n) = (n/ϕ(n) − eγ loglog n) log n, to precision
d – see Section 5.4.
ComputeC[n, d] −→ r
n is a natural number 2 or more,
d is the precision to which evaluations to compute c(n)will be made
internally,
r is a floating-point number.
ComputeDeltas (cod)
This function computes the values of
ψ(qi ) − θ(qi )
δi := √
qi+1
which are used in the proof of Lemma 5.18. The qi are the ordered
squarefull powers of primes up to 5993 . It also prints a table of the first
11 δi .
ComputeDeltas[d] −→ min
d is a natural number being the precision for internal computations,
min is the minimum value of the δi .
Computef (cof)
This function evaluates f (x) = eγ log θ(x) p≤x (1 − 1/p), to precision d –
see Chapters 5 and 7.
Computef[x, d] −→ r
x is a positive real number,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the value of f (x).
ComputeH (coh)
This function computes the value of
log
x/log 2
H(x) := 1 + x1/n−1/3 .
n=4
B.2 RHpack Functions 299
ComputeKm (ckm)
Note the definition
1
km (h) = ,
γ>h
γm+1
where ρ = β+iγ are complex zeta zeros. This function finds upper bounds
for the values km (0) recursively, to precision d, starting with the bound
k1 (0) ≤ 0.0463 – see Table 3.1.
ComputeKm[m, d] −→ L
m is a natural number,
d is a natural number being the precision for internal computations,
L is a list of floating-point numbers representing computed upper bound for
{k1 (0), k2 (0), . . . , km (0)}.
ComputeTheta (cth)
This function evaluates θ(x) = p≤x log p.
ComputeTheta[x, d] −→ r
x is a real number,
d is a natural number being the precision for internal computations,
r is a floating-point number which is an approximation to θ(x).
ComputeThetaValues (ctv)
This function computes b values, at integer points commencing at 1, of
a list of θ(x) values, to precision d.
ComputeThetaValues[b, d] −→ L
b is a natural number being the number of θ(x) values to be computed,
d is a natural number being the precision for internal computations,
L is a list of b θ(x) values.
CriticalEpsilonValues (cev)
This function computes a list of critical values at primes up to b and
powers up to jmax, in decreasing order – see Section 6.3.
CriticalEpsilonValues[b, jmax] −→ L
300 RHpack Mini-Manual
b is a natural number,
jmax is a natural number,
L is a list of critical epsilon values in decreasing order in symbolic form for
primes up pb and exponents j up to jmax.
DivisibilityGraph (dvg)
This function returns the n × n divisibility graph of Section 10.5.
DivisibilityGraph[n] −→ L
n is a natural number,
L is a list of terms i → j where 1 ≤ i, j ≤ n is a subset of pairs, representing
the divisibility graph.
ExtraordinaryQ (exq)
This function determines whether n is an extraordinary number up to a
finite number of multiples of n – see Chapter 9.
ExtraordinaryQ[n, b, d] −→ P
n is a natural number,
b is a natural number,
d is a natural number being the precision for internal computations,
P is True if n is left abundant and right abundant up to multiples not greater
than b, both to precision d, otherwise False.
ExtremelyAbundantQ (eaq)
This function determines whether n is an extremely abundant number –
see Table A.1
ExtremelyAbundantQ[n, d] −→ P
n is a natural number with n ≥ 10 080,
d is a natural number being the precision for internal computations,
P is True if G(m) < G(n), to precision d, for 10 080 ≤ m < n, otherwise False.
FareyFractions (ffr)
This function computes the list of Farey fractions for denominator n, in
increasing order – see Section 10.3.
FareyFractions[n] −→ L
n is a natural number ,
L is a list of the Farey fractions {0, 1/n, . . . , 1} of order n.
FHalf (fha)
This function computes F1/2 (z) – see Lemma 5.16.
FHalf[x, d] −→ r
B.2 RHpack Functions 301
FindNextXA (fnx)
This function finds an extremely abundant number larger than a given
extremely abundant number – see Chapter 9. This function does not
check that its argument is extremely abundant, and simply checks the
size of Grönwall’s function G(n) against its values at multiples by primes
which preserve its Hardy–Ramanujan status. Because of this, some
extremely abundant numbers will be missed.
FindNextXA[n] −→ m
n is a natural number which should be a Hardy–Ramanujan number,
m is a natural number with G(m) > G(n) or False if it fails to find such an m.
GenerateCA (gca)
This function generates the b smallest colossally abundant numbers, with
some restriction on the size of b – see Table A.4.
GenerateCA[b] −→ L
b is a natural number,
L is a list of b colossally abundant numbers in increasing order.
GenerateHC (ghc)
This function generates all highly composite numbers up to b – see
Section 8.2.
GenerateHC[b] −→ L
b is a natural number,
L is a list of the highly composite numbers up to b.
GenerateHR (ghr)
This function generates a list of all Hardy–Ramanujan numbers up to b
– see Section 8.2.
GenerateHR[b] −→ L
b is a natural number,
L is a sorted list of all Hardy–Ramanujan numbers less than or equal to b.
GenerateLandau (gla)
This function generates values of Landau’s function g(n) up to n = b –
see Section 10.10. It uses the algorithm of Heinz and Alcover.
GenerateLandau[b] −→ LL
302 RHpack Mini-Manual
b is a natural number,
LL is a list of lists, each sublist being a pair {n, g(n)}.
GenerateSA (gsa)
This function generates all superabundant numbers up to b – see
Section 6.2.
GenerateSA[b] −→ L
b is a natural number,
L is a list of all superabundant numbers up to b in increasing order.
GronwallG (grg)
This function evaluates Grönwall’s function G(n) = σ(n)/(n loglog n), to
precision d.
GronwallG[n, d] −→ r
n is a natural number,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the value of G(n).
HardyRamanujanQ (hrq)
This function determines whether n is a Hardy–Ramanujan integer – see
Section 8.2.
HardyRamanujanQ[n] −→ P
n is a natural number,
P is True if n is a Hardy–Ramanujan number, otherwise False.
HardyRamanujanToPrimorialForm (hr2pf)
This function converts a HardyRamanujan integer to primorial form –
see Section 6.1.
HardyRamanujanToPrimorialForm[n] −→ L
n is a natural number which is Hardy–Ramanujan,
L is a list of primes in non-increasing order of prime value such that n is the
product of the primorials corresponding to those primes to the given
powers such that each prime is the largest in an associated primorial.
HighlyCompositeQ (hcq)
This function determines whether n is a highly composite integer – see
Section 6.1.
HighlyCompositeQ[n] −→ P
B.2 RHpack Functions 303
n is a natural number,
P is True if n is highly composite, otherwise False.
KFreeQ (kfq)
This function determines whether the integer n is k-free.
KFreeQ[n, k] −→ P
n is a natural number,
k is a natural number,
P is True if n is k-free, otherwise False.
KthApproximatePrimorial (app)
This function returns the kth primorial, to precision d – see Chapter 5.
KthApproximatePrimorial[k, d] −→ r
k is a natural number,
304 RHpack Mini-Manual
KthPrimorial (kpp)
This function returns the kth primorial Nk as an exact integer.
KthPrimorial[k] −→ m
k is a natural number,
m is a natural number being the value of Nk , the kth primorial.
LagariasInequalityQ (liq)
This function checks σ(n) < Hn + exp(Hn ) log(Hn ), to precision d – see
Section 7.10.
LagariasInequalityQ[n ,d] −→ P
n is a natural number,
d is a natural number being the precision for internal computations,
P is True if the inequality is satisfied at precision d, False otherwise.
Lambda (lam)
This function evaluates the function Λ(s) of Section 10.8 for complex s.
Lambda[s, d] −→ r
s is a real or complex number,
d is a natural number being the precision for internal computations,
r is a floating-point complex number.
LargestZetaZero (lzz)
This function finds the zeta zero ρ = 1
2
+ iγ with maximum γ ≤ h, to
precision d for h ≤ 106 .
LargestZetaZero[h, d] −→ r
h is a positive real number with h ≤ 106 ,
d is a natural number being the precision for internal computations,
r is the value of the y-coordinate of the largest zeta zero with imaginary part
less than or equal to h.
LeftAbundantQ (laq)
This function determines whether n is a left abundant number – see
Chapter 9.
LeftAbundantQ[n, d] −→ P
n is a natural number,
B.2 RHpack Functions 305
LogKthPrimorialBounds (lppb)
This function gives upper and lower bounds of Lemma 8.18 for log Nk .
LogKthPrimorialBounds[k, d] −→ L
k is a natural number with k ≥ 198,
d is a natural number being the precision for internal computations,
L is a list of two floating-point numbers {l, u} such that l ≤ log Nk ≤ u.
M
This function evaluates the sum of the Möbious μ(n) function up to x > 0
– see Chapter 4 and Section 10.3.
M[x] −→ s
x is a positive real number,
x
s is an integer representing the sum n=1 μ(n).
MaxPrime (mxp)
This function finds the maximum prime less than or equal to x.
MaxPrime[x] −→ p
x is a real number,
p is a prime number which is maximal such that p ≤ x.
NextZetaZero (nzz)
This function finds the next zeta zero with height larger than h, to
precision d. It is suitable only for modest values of h.
NextZetaZero[h, d] −→ r
h is a positive real number,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the y-coordinate of the smallest
zeta zero ρ with γ > h, where ρ = 12 + i 12 .
NiceFactorInteger (nfi)
This function factors a natural number n with primes to the power 1
appearing as singletons.
NiceFactorInteger[n] −→ L
n is a natural number,
306 RHpack Mini-Manual
L is list of list pairs {p, e} where p is prime and pe n and e ≥ 2 and singletons
p where p1 n.
NicolasInequalityQ (niq)
This function checks n/ϕ(n) < eγ loglog n to precision d – see Chapter 5.
NicolasInequalityQ[n, d] −→ P
n is a natural number,
d is a natural number being the precision for internal computations,
P is True if the inequality is satisfied to precision d and False otherwise.
NicolasTwoFailures (n2f)
This function computes the number of failures to Nicolas inequality at
primorials
n
− e loglog n log n < eγ (4 + γ − log π − 2 log 2)
γ
ϕ(n)
in the range k1 ≤ k ≤ k0 with n = Nk – see the proof of Theorem 5.34 and
Corollary 5.35. This function is suitable only for small values of k.
NicolasTwoFailures[k1, k0, d] −→ n
k1 is a natural number,
k0 is a natural number with k1 ≤ k0,
d is a natural number being the precision for internal computations,
n is a non-negative integer representing the number of integer points k in
[k1, k0] at which the inequality fails at n = Nk .
NOverPhi (nop)
This function evaluates n/ϕ(n), to precision d – see Chapter 5.
NOverPhi[n, d] −→ r
n is a natural number,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the value n/ϕ(n).
NthHarmonicNumber (nha)
This function evaluates the n harmonic number, Hn , to precision d – see
Section 7.10.
NthHarmonicNumber[n, d] −→ r
n is a natural number,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the value of Hn .
B.2 RHpack Functions 307
NthPrime (npr)
This function returns the nth prime pn where p1 = 2 – see Section 8.5.
NthPrime[n] −→ c
n is a natural number,
d is a natural number being the precision for internal computations,
p is a rational prime being the nth largest prime number.
NthPrimeBounds (prb)
This function gives the upper and lower bounds of Rosser and
Schoenfeld [145] for the nth prime – see Section 8.5.
NthPrimeBounds[n, d] −→ L
n is a natural number being the index for the nth prime pn with n > 20,
d is a natural number being the precision for internal computations,
L is a list of two floating-point numbers, being {l, u} where l ≤ pn ≤ u.
PlotCNk (pcnk)
This function prints a plot of the values of c(Nk ) for k1 ≤ k ≤ k0 – see
Chapter 5. It returns the value of c(Nk ) at k = k0.
PlotCNk[k1, k0, d] −→ r
k1 is a natural number,
k0 is a natural number with k1 ≤ k0,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the value of c(Nk0 ).
PlotDivisibilityGraph (pdg)
This function returns a plot of the divisibility graph of Section 10.5. The
edges are directed and vertices labelled 1 through n.
PlotDivisibilityGraph[n] −→ G
n is a natural number,
G is a labelled, directed graph.
PlotEulerPhiRatio (pep)
This function prints a plot of the graph of the proportion of positive
integers n for which loglog n < n/ϕ(n) up to b – see Chapter 5. It returns
the value of this proportion at b.
PlotEulerPhiRatio[b] −→ r
b is a positive real number,
308 RHpack Mini-Manual
PlotRobinsInequality (pri)
This function prints a plot of n vs σ(n)/n and eγ loglog n on the same
axes, for a ≤ n ≤ b – see Chapter 7.
PlotRobinsInequality[a, b, d] −→ L
a is a natural number,
b is a natural number with a < b,
d is a natural number being the precision for internal computations,
L is a two-element list being {σ(b)/b, eγ loglog b}.
PlotVonMangoldtPsi (pvm)
This function prints a plot of the explicit formula for ψ0 (x) – see
Section 2.2. It returns the value of this function at the end point of its
domain.
PlotVonMangoldtPsi[a, b, nx, d] −→ r
a is a positive real number being the start of the domain to be plotted,
b is a positive real number with a < b being the end point of the domain to
be plotted,
r is a floating-point number being the value of ψ0 (b),
nx is a natural number being the number of terms of the form xρ /ρ used to
approximate ψ0 (x),
d is a natural number being the precision for internal computations.
PowerfulQ (pfq)
This function determines whether a natural number n is powerful.
PowerfulQ[n] −→ P
n is a natural number,
P is True if for each prime p | n we have p2 | n, otherwise False.
PrimePower (prp)
This function returns the maximum power to which p divides n. The
number p need not be prime.
PrimePower[p, n] −→ e
p is a natural number with p > 1,
n is a natural number
e is a non-negative integer being ν p (n).
B.2 RHpack Functions 309
PrimorialFormQ (pfq)
This function determines whether a list of integers is in primorial form, i.e. a
list of primes in non-decreasing order
PrimorialFormQ[L] −→ P
L is a list of integers,
P is True if each element of L is prime, else False.
PrimorialFormToInteger (pf2i)
This function computes the integer corresponding to the primorial form.
PrimorialFormToInteger[P] −→ n
P is a list of primes in non-decreasing order,
n n is a natural number being the product of the primorials corresponding to
the primes in P.
Psi
This function computes ψ(x) using the expression
∞
ψ(x) = θ x1/n .
n=1
Psi0
This function computes ψ0 (x) using nz terms – see Section 2.2.
Psi0[x, nz, d] −→ r
x is a positive real number being the point at which ψ0 is to be evaluated,
nz is a natural number being the number of terms in the von Mangoldt sum
which will be used,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the value of ψ0 (x).
PsiLargeXEpsilon (ple)
This function computes
(x) = log x exp(− log x/R ),
to precision d, for R ≥ 16 – see Section 3.4.
PsiLargeXEpsilon[x, R, d] −→ r
310 RHpack Mini-Manual
PsiLargeXEpsilon2 (ple2)
This function computes
(x) = 2 log x exp(− log x/R ),
to precision d for 8 ≤ R ≤ 18 – see Section 3.4.
PsiLargeXEpsilon2[x, R, d] −→ r
x is a positive real number,
R is a real number in the range 8 ≤ R ≤ 18,
d is a natural number being the precision for internal computations,
r is a floating-point number representing (x).
PsiSmallXEpsilon (pse)
This function computes b based on Rosser and Schoenfeld’s 1962
approach, to precision d – see Section 3.2.
PsiSmallXEpsilon[b, m, d] −→ r
b is a positive real number such that x = eb is the point at which ψ(x) is
estimated,
m is a natural number being the parameter of the estimation,
d is a natural number representing the precision for internal computations,
r is a floating-point number representing the value of b .
RedhefferMatrix (rem)
This function gives the divisibility matrix of Redheffer – see
Section 10.4.
RedhefferMatrix[n] −→ LL
n is a natural number,
LL is a list of n lists, being the square divisibility matrix. It can be printed in
standard matrix form using the function MatrixForm.
RightAbundantQ (raq)
This function determines whether n is a right abundant number up to
multiples a · n with 1 ≤ a ≤ b – see Chapter 9.
RightAbundantQ[n, b, d] −→ P
n is a natural number,
b is a natural number,
B.2 RHpack Functions 311
b is a natural number,
P is True if G(n) ≥ G(an) for 1 < a ≤ b, False otherwise.
RobinsInequalityQ (riq)
This function checks the inequality σ(n)/n < eγ loglog n, to precision d –
see Chapter 5.
RobinsInequalityQ[n,d] −→ P
n is a natural number,
d is a natural number being the precision for internal computations,
P is True if the inequality is satisfied, otherwise False.
S
This function computes the argument of ζ( 12 + it) as in Section 10.9.
S[t] −→ r
t is a positive real number,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the value of S (t) = N(t) −
(ϑ(t)/π + 1).
SuperAbundantQ (saq)
This function determines whether n is a superabundant integer – see
Chapter 6.
SuperAbundantQ[n] −→ c
n is a natural number,
P is True if n is superabundant, False otherwise.
UnitaryDivisorSigma (uds)
This function evaluates the sum the unitary divisors of n – see
Section 7.12.
UnitaryDivisorSigma[n] −→ m
312 RHpack Mini-Manual
n is a natural number,
m is a natural number being the sum of the unitary divisors of n.
VerifyThetaX (vth)
This function verifies the equation θ(x) < x for a ≤ x ≤ b to precision d –
see Section 3.3
VerifyThetaX[a, b, d] −→ bad
a is a real number being the initial point of the x values to be tested,
b is a real number being the end point of the domain of x values to be tested,
d is a natural number being the precision for internal computations,
bad is a list of integers representing points in the domain where the
inequality θ(x) < x fails at precision d/2.
Xi
This function evaluates the Riemann ξ(s) function of Section 10.9 at
complex numbers s.
Xi[s, d] −→ c
s is a real or complex number,
d is a natural number being the precision for internal computations,
c is a floating-point complex number.
ZetaZeroCount (zzc)
This function returns a count of the zeta zeros up to height T up to height
106 – see Section 2.2.
ZetaZeroCount[T, d] −→ n
T is a positive real number not greater than 106 ,
d is a natural number being the precision for internal computations,
n is a non-negative integer, representing the number of zeros of ζ(s) on the
critical line, above the x-axis, with imaginary part satisfying 0 ≤ γ ≤ T ,
to precision d.
ZetaZeroHeight (zzh)
This function computes the ordinate of the nth highest positive
imaginary part of a zeta zero, to precision d – see Section 2.2.
ZetaZeroHeight[n, d] −→ r
n is a positive integer being the rank of the zero with n ≤ 107 ,
d is a natural number being the precision for internal computations,
r is a floating-point number representing the y-coordinate of the nth zero.
References
[1] A. Akbary, Z. Friggstad and R. Jurecevic, Explicit upper bounds for p≤pω(n) p/(p − 1),
Cont. Discrete Math. 2 (2007), 153–160.
[2] A. Akbary and Z. Friggstad, Superabundant numbers and the Riemann hypothesis,
Amer. Math. Monthly 116 (2009), 273–275.
[3] L. Alaoglu and P. Erdős, On highly composite and similar numbers, Trans. Amer.
Math. Soc. 56 (1944), 448–469.
[4] L. Alfors, Complex Analysis, 2nd edn, McGraw-Hill, 1966.
[5] T. M. Apostol, Modular Functions and Dirichlet Series in Number Theory, Springer,
1976.
[6] T. M. Apostol, Introduction to Analytic Number Theory, 2nd edn, Springer, 1990.
[7] R. J. Backlund, Über die nullstellen der Riemannschen Zetafunktion, Acta Math. 41
(1918), 345–375.
[8] R. Balasubramanian, S. Kanemitsu and M. Yoshimoto, Euler products, Farey series
and the Riemann hypothesis II, Publ. Math. Debrecen 69 (2006), 1–16.
[9] W. W. Barrett, R. W. Forcade and A. D. Pollington, On the spectral radius of a (0,1)
matrix related to Mertens’ function, Linear Algebra Appl. 107 (1988), 151–159.
[10] B. C. Berndt, Ramanujan’s Notebooks, Parts I–V, Springer, 1985–1998.
[11] M. V. Berry and J. P. Keating, The Riemann zeros and eigenvalue asymptotics, SIAM
Rev. 41 (1999), 236–266.
[12] A. R. Booker, Turing and the Riemann hypothesis, Notices Amer. Math. Soc. 53
(2006), 1208–1211.
[13] P. Borwein, S. Choi, B. Rooney and A. Weirathmueller, The Riemann Hypothesis: A
Resource for the Afficionado and Virtuoso Alike, CMS Books in Mathematics, Springer,
2008.
[14] R. P. Brent, On the zeros of the Riemann zeta function in the critical strip, Math. Comp.
33 (1979), 1361–1372.
[15] R. P. Brent, J. van de Lune, H. J. J. te Riele and D. T. Winter, On the zeros of the
Riemann zeta function in the critical strip II, Math. Comp. 39 (1982), 681–688.
[16] K. Briggs, Abundant numbers and the Riemann hypothesis, J. Exp. Math. 15 (2006),
251–256.
[17] K. A. Broughan, Structure of sectors of zeros of entire flows, Topology Proc. 27 (2002),
1–16.
[18] K. A. Broughan, Vanishing of the integral of the Hurwitz zeta function, Bull. Austral.
Math. Soc. 65 (2002), 121–127.
313
314 References
[19] K. A. Broughan, Holomorphic flows on simply connected subsets have no limit cycles,
Meccanica 38 (2003), 699–709.
[20] K. A. Broughan and A. R. Barnett, The holomorphic flow of the Riemann zeta function,
Math. Comp. 73 (2004), 987–1004.
[21] K. A. Broughan, Holomorphic flow of the Riemann xi function, Nonlinearity 18 (2005),
1269–1294.
[22] K. A. Broughan and A. R. Barnett, Linear law for the logarithms of the Riemann
periods at simple critical zeros, Math. Comp. 75 (2006), 891–902.
[23] K. A. Broughan and A. R. Barnett, Corrigendum to “The holomorphic flow of the
Riemann zeta function”, Math. Comp. 76 (2007), 2249–2250.
[24] K. A. Broughan, Extension of the Riemann ξ-function’s logarithmic derivative
positivity region to near the critical strip, Can. Math. Bull. 52 (2009), 186–194.
[25] K. A. Broughan and A. R. Barnett, Gram lines and the average of the real part of the
Riemann zeta function, Math. Comp. 81 (2012) 1669–1679.
[26] K. A. Broughan, Website for phase portraits of the zeros of ζ(s) and ξ(s). www.math.
waikato.ac.nz/∼kab.
[27] K. A. Broughan, J.-M. De Koninck, I. Katai and F. Luca, On integers for which the sum
of divisors is the square of the squarefree core, J. Integer Seq. 15 (2012), art. 12.7.5.
[28] K. A. Broughan and D. Delbourgo, On the ratio of the sum of divisors and Euler’s
totient function I, J. Integer Seq. 16 (2013), art. 13.8.8 (17pp.).
[29] K. A. Broughan, K. Ford and F. Luca, On square values of the product of the Euler
totient function and the sum of divisors function, Colloq. Math. 130 (2013), 127–137.
[30] K. A. Broughan and Q. Zhou, On the ratio of the sum of divisors and Euler’s totient
function II, J. Integer Seq. 17 (2014), art. 14.9.2 (22pp.).
[31] K. A. Broughan and T. S. Trudgian, Robin’s inequality for 11-free integers, Integers
15 (2015), #A12 (5pp.).
[32] K. A. Broughan, Equivalents of the Riemann Hypothesis II: Analytic Equivalences,
Cambridge University Press, 2017.
[33] J. Büthe, J. Franke, A. Jost and T. Kleinjung, A practical analytic method for
calculating π(x), Preprint.
[34] G. Caveney, J.-L. Nicolas and J. Sondow, Robin’s theorem, primes, and a new
elementary reformulation of the Riemann hypothesis, Integers 11 (2011), #A33 (10pp.).
[35] G. Caveney, J.-L. Nicolas and J. Sondow, On SA, CA and GA numbers, Ramanujan J.
29 (2012), 359–384.
[36] Y. Cheng, An explicit upper bound for the Riemann zeta-function near the line σ = 1,
Rocky Mountain J. Math. 29 (1999), 115–140.
[37] Y. Cheng and S. W. Graham, Explicit estimates for the Riemann zeta function, Rocky
Mountain J. Math. 34 (2004), 1261–1280.
[38] Y. Choie, N. Lichiardopol, P. Moree and P. Solé, On Robin’s criterion for the Riemann
hypothesis, J. Théor. Nombres Bordeaux 19 (2007), 357–372.
[39] S. Chowla, On abundant numbers, J. Indian Math. Soc. (2) 1 (1934), 41–44.
[40] J. B. Conrey, More than two fifths of the zeros of the Riemann zeta function are on the
critical line, J. reine angew. Math. 399 (1989), 1–16.
[41] J. B. Conrey, L-functions and random matrices, in Mathematics Unlimited – 2001 and
Beyond, pp. 331–352, Springer, 2001.
[42] H. Davenport, Über numeri abundantes, Sitzungsber. Preuss. Akad. Wiss. Berlin 27
(1933), 830–837.
[43] H. Davenport, Multiplicative Number Theory, 3rd edn, Springer, 2000.
[44] M. Davis, Y. Matijasevic̆ and J. Robinson, Hilbert’s tenth problem: diophantine
equations: positive aspects of a negative solution, in Mathematical Developments
References 315
Arising from Hilbert Problems, Proc. Symp. Pure Math., XXVIII, De Kalb, IL, 1974,
pp. 323–378. American Mathematical Society, 1976.
[45] A. Derbal, Grandes valeurs de la fonction σ(n)/σ∗ (n), C. R. Math. Acad. Sci. Paris
364 (2008), 125–128.
[46] L. E. Dickson, History of the Theory of Numbers, vol. I, Dover, 2005.
[47] R. D. Dixon and L. Schoenfeld, The size of the Riemann zeta-function at places
symmetric with respect to the point 1/2, Duke Math. J. 33 (1966), 291–292.
[48] P. Dusart, Inégalités explicites pour ψ(X), θ(X), π(X) et les nombres premiers, C. R.
Math. Acad. Sci. Soc. R. Can. 21 (1999), 53–59.
[49] P. Dusart, The kth prime is greater than k(log k + loglog k − 1) for k ≥ 2, Math. Comp.
68 (1999), 411–415.
[50] P. Dusart, Estimates of some functions over primes without RH, Preprint,
arXiv:1002.0442v1, 2010.
[51] H. M. Edwards, Riemann’s Zeta Function, Academic Press, 1974; reprinted by Dover,
2001.
[52] P. Erdős, On the density of abundant numbers, J. London Math. Soc. 9 (1934), 278–
282.
[53] P. Erdős, On the normal number of prime factors of p − 1 and some related problems
concerning Euler’s φ-function, Quart. J. Math. 6 (1935), 205–213.
[54] P. Erdős, Some remarks on Euler’s ϕ function, Acta Arith. 4 (1958), 10–19.
[55] P. Erdős and P. Turan, On some problems of a statistical group theory I,
Z. Wahrscheinlichkeitstheorie verw. Geb. 4 (1965), 175–186.
[56] P. Erdős and J.-L. Nicolas, Repartition des nombres superabondants, Bull. Soc. Math.
France 103 (1975), 65–90.
[57] L. Faber and H. Kadiri, New bounds for ψ(x), Math. Comp. 84 (2015), 1339–1357.
[58] K. Ford, The distribution of totients, Ramanujan J. 2 (1998), 67–151 (Paul Erdős
memorial issue).
[59] K. Ford, Zero-free regions for the Riemann zeta function, in Number Theory for the
Millennium II, Urbana, IL, 2000, pp. 25–56, A. K. Peters, 2002.
[60] K. Ford, Vinogradov’s integral and bounds for the Riemann zeta function, Proc.
London Math. Soc. (3) 85 (2002), 565–633.
[61] K. Ford, F. Luca and C. Pomerance, Common values of the arithmetic functions φ and
σ, Bull. London Math. Soc. 42 (2010), 478–488.
[62] K. Ford and P. Pollack, On common values of φ(n) and σ(n) I, Acta Math. Hungar.
133 (2011), 251–271.
[63] K. Ford and P. Pollack, On common values of φ(n) and σ(n) II, Algebra Number Theory
6 (2012), 1669–1696.
[64] J. Franel and E. Landau, Les suites de Farey et le probléme des nombres premiers,
Göttinger Nachr. (1924), 198–206.
[65] J. A. Gallian, Contemporary Abstract Algebra, 5th edn, Houghton Mifflin, 2002.
[66] R. Garunkstis, On a positivity property of the Riemann ξ-function, Lithuanian Math. J.
43 (2002), 140–145.
[67] B. R. Gillespie, Extending Redheffer’s matrix to arbitary arithmetic functions,
Bachelor with Honors Thesis, Pennsylvania State University, 2011.
[68] X. Gourdon, The 1013 first zeros of the Riemann zeta function, and zeros computation
at very large height, Preprint, 2004.
[69] T. H. Grönwall, Some asymptotic expressions in the theory of numbers, Trans. Amer.
Math. Soc. 14 (1913), 113–122.
[70] E. Grosswald, Oscillation theorems of arithmetical functions, Trans. Amer. Math. Soc.
126 (1967), 1–28.
316 References
[71] J. L. Hafner, New omega theorems for two classical lattice point problems, Invent.
Math. 63 (1981), 181–186.
[72] R. R. Hall and W. K. Hayman, Hyperbolic distance and distinct zeros of the Riemann
zeta-function in small regions, J. reine angew. Math. 526 (2000), 35–59.
[73] G. H. Hardy, The average order of the arithmetical functions P(x) and Δ(x), Proc.
London Math. Soc. 15 (1916), 192–213.
[74] G. H. Hardy and J. E. Littlewood The zeros of Riemann’s zeta function on the real line,
Math. Z. 10 (1921), 283–317.
[75] G. H. Hardy and J. M. Wright, An Introduction to the Theory of Numbers, 6th edn,
Oxford University Press, 2008.
[76] D. R. Heath-Brown, Simple zeros of the Riemann zeta function on the critical line, Bull.
London Math. Soc. 11 (1979), 17–18.
[77] D. R. Heath-Brown, Zero-free regions for Dirichlet L-functions, and the least prime in
an arithmetic progression, Proc. London Math. Soc. 64 (1992), 265–338.
[78] D. A. Hejhal and A. M. Odlyzko, Alan Turing and the Riemann zeta function, in Alan
Turing – His Work and Impact, eds S. B. Cooper and J. van Leeuwen, Elsevier, 2012.
[79] G. A. Hiary and A. M. Odlyzko, The zeta function on the critical line: numerical
evidence for moments and random matrix theory models, Math. Comp. 81 (2012),
1723–1752.
[80] A. E. Ingham, The Distribution of Prime Numbers, Cambridge University Press, 1932.
[81] A. Ivić, Two inequalities for the sum of divisors functions, Univ. Novom Sadu, Zb.
Rad. Prirod.-Mat. Fak. 7 (1977), 17–22.
[82] A. Ivić, Riemann Zeta Function: Theory and Applications, Dover, 2003.
[83] W.-J. Jang and S-H. Kwon, A note on Kadiri’s explicit zero free region for the Riemann
zeta function, J. Korean Math. Soc. 51 (2014), 1291–1304.
[84] H. Kadiri, Une région explicite sans zéros pour la fonction ζ de Riemann, Acta Arith.
117 (2005), 303–339.
[85] S. Kanemitsu and M. Yoshimoto, Farey series and the Riemann hypothesis, Acta Arith.
75 (1996), 351–374.
[86] S. Kanemitsu and M. Yoshimoto, Farey series and the Riemann hypothesis III,
Ramanujan J. 1 (1997), 363–378.
[87] S. Kanemitsu and M. Yoshimoto, Euler products, Farey series and the Riemann
hypothesis, Pub. Math. Debrecen 56 (2000), 431–449.
[88] A. A. Karatsuba and S. M. Voronin, Riemann zeta function (transl. from Russian by N.
Koblitz), De Gruyter, 1994.
[89] N. M. Katz and P. Sarnak, Zeroes of zeta functions and symmetry, Bull. Amer. Math.
Soc. 36 (1999), 1–26.
[90] N. M. Katz and P. Sarnak, Random matrices, Frobenius eigenvalues, and monodromy,
AMS Colloquium Publications, 45. American Mathematical Society, 1999.
[91] J. P. Keating and N. C. Snaith, Random matrix theory and ζ( 21 + it), Comm. Math.
Phys. 214 (2000), 57–89.
[92] D. E. Knuth, The Art of Computer Programming I: Fundamental Algorithms, 3rd edn,
Addison-Wesley, 1997.
[93] H. von Koch, Sur la distribution des nombres premiers, Acta Math. 24 (1901),
159–182.
[94] H. von Koch, Über die Riemannsche Primzahlfunction, Math. Ann. 55 (1902),
440–464.
[95] J.-M. De Koninck and F. Luca, Analytic Number Theory: Exploring the Anatomy of
Integers, American Mathematical Society, 2012.
References 317
[96] N. M. Korobov, Estimates of trigonometric sums and their applications, Uspehi Mat.
Nauk 13 (1958), 185–192.
[97] T. Kotnik, The prime counting function and its analytic approximations, Adv. Comp.
Math. 29 (2008), 55–70.
[98] J. C. Lagarias, On a positivity property of the Riemann ξ-function, Acta Arith. 99
(1999) 217–213.
[99] J. C. Lagarias, Errata: On a positivity property of the Riemann ξ-function, Acta Arith.
99 (1999) 765.
[100] J. C. Lagarias, An elementary problem equivalent to the Riemann hypothesis, Amer.
Math. Monthly 109 (2002), 534–543.
[101] J. C. Lagarias, Problem 10949, Amer. Math. Monthly 109 (2002), 569.
[102] J. C. Lagarias and W. Janous, A generous bound for divisor sums: problem 10949,
Amer. Math. Monthly 111 (2004), 264–265.
[103] E. Landau, Über die Maximalordnung der Permutationen gegebenen Grades, Arch.
Math. Phys. Ser. 3 5 (1903).
[104] E. Landau, Über einen satz von Tschebyschef, Math. Ann. 61 (1905), 527–550.
[105] E. Landau, Handbuch der lehre von der Verteilung der Primzahlen, 2nd edn, vols 1
and 2, Chelsea, 1953.
[106] S. Lang, Introduction to Transcendental Numbers, Addison-Wesley, 1966.
[107] N. Levinson and H. L. Montgomery, Zeros of the derivatives of the Riemann zeta-
function, Acta Math. 133 (1974), 49–65.
[108] J. E. Littlewood, Quelques conséquences de l’hypothése que la fonction ζ(s) n’a pas
de zéros dans le demi-plan s > 12 , C. R. Acad. Sci. Paris Sér. I Math. 158 (1912),
263–266.
[109] J. E. Littlewood, Sur la distribution des nombres premiers, C. R. Acad. Sci. Paris Sér.
I Math. 158 (1914), 1869–1872.
[110] E. R. Lorch, Spectral Theory, Oxford University Press, 1962.
[111] J. van de Lune, H. J. J. te Riele and D. T. Winter, On the zeros of the Riemann zeta
function in the critical strip IV, Math. Comp. 46 (1986), 667–681.
[112] J.-P. Massias, J.-L. Nicolas and G. Robin, Evaluation asymptotique de l’ordre
maximum d’un element du groupe symétrique, Acta Arith. 50 (1988), 221–242.
[113] J.-P. Massias, J.-L. Nicolas and G. Robin, Effective bounds for the maximal order of
an element in the symmetric group, Math. Comp. 53 (1989), 665–678.
[114] J.-P. Massias and G. Robin, Bornes effectives pour certaines fonctions concernant les
nombres premiers, J. Théor. Nombres Bordeaux 8 (1996), 215–242.
[115] F. Mertens, Über einize asymptotische Gesetse der Zahlentheorie, J. reine angew.
Math. 77 (1874), 46–62.
[116] M. Mikolás, Farey series and their connection with the prime number problem I, Acta
Univ. Szeged. Sect. Sci. Math. 13 (1949), 93–117.
[117] M. Mikolás, Farey series and their connection with the prime number problem II, Acta
Sci. Math. Szeged 14 (1951), 5–21.
[118] W. Miller, The maximum order of an element of a finite symmetric group, Amer. Math.
Monthly 94 (1987), 497–506.
[119] H. L. Montgomery, The pair correlation of zeros of the zeta function, in Analytic
Number Theory, Proc. Symp. Pure Math. XXIV, pp. 181–193, American Mathematical
Society, 1973.
[120] F. Morain, Tables sur la fonction g(n), Département de Mathématiques, Université de
Limoges, 1988.
[121] M. J. Mossinghoff and T. S. Trudgian, Non-negative trigonometric polynomials and a
zero-free region for the Riemann zeta-function, Preprint, arXiv:1410.3926v1, 2014.
318 References
[147] Z. Rudnick and P. Sarnak, Zeros of principal L-functions and random matrix theory. A
celebration of John F. Nash, Jr., Duke Math. J. 81 (1996), 269–322.
[148] K. Sabbagh, The Riemann Hypothesis, Farrar, Straus and Giroux, 2003.
[149] B. Sagan, The Symmetric Group: Representations, Combinatorial Algorithms, and
Symmetric Functions, Springer, 2001.
[150] J. Sándor, D. S. Mitrinović and B. Crstici, Handbook of Number Theory I, Springer,
2006.
[151] M. du Sautoy, The Music of the Primes, HarperCollins, 2003.
[152] E. Schmidt Über die Anzahl der Primzahlen unter gegebener Grenze, Math. Ann. 57
(1903), 195–204.
[153] L. Schoenfeld, Sharper bounds for the Chebyshev functions ϑ(x) and ψ(x) II, Math.
Comput. 30 (1976), 337–360.
[154] L. Schoenfeld, Corrigendum: Sharper bounds for the Chebyshev functions ϑ(x) and
ψ(x) II, Math. Comput. 30 (1976), 900.
[155] D. Schumayer and D. A. W. Hutchinson, Physics of the Riemann hypothesis, Rev. Mod.
Phys. 83 (2011), 307–330.
[156] S. M. Shah, An inequality for the arithmetic function g(x), J. Indian Math. Soc. 3
(1939), 316–318.
[157] S. Skews, On the difference π(x) − li(x), J. London Math. Soc. 8 (1933), 277–283.
[158] N. J. A. Sloan, Online Encyclopedia of Integer Sequences (OEIS), https://oeis.org.
[159] P. Solé and M. Planat, The Robin inequality for 7-free integers, Integers 11 (2011),
#A65 (8pp.).
[160] K. Soundararajan, Moments of the Riemann zeta function, Annals Math. 170 (2009),
981–993.
[161] A. Speiser, Geometrisches zur Riemannschen Zetafunktion, Math. Ann. 110 (1934),
514–521.
[162] R. Spira, An inequality for the Riemann zeta function, Duke Math. J. 32 (1965), 247–
250.
[163] R. Spira, Zeros of ζ (s) and the Riemann hypothesis, Illinois J. Math. 17 (1973),
147–152.
[164] A. I. Suriajaya, On the zeros of the k-th derivative of the Riemann zeta function under
the Riemann hypothesis, Funct. Approx. Comment. Math. 53 (2015), 69–95.
[165] S. M. Szalay, On the maximal order in S n and S n∗ , Acta Arith. 37 (1980), 321–331.
[166] G. Tenenbaum, Introduction to Analytic and Probabilistic Number Theory, Cambridge
University Press, 1995.
[167] E. C. Titchmarsh and D. R. Heath-Brown, The Theory of the Riemann Zeta-Function,
2nd edn, Oxford University Press, 1986.
[168] T. S. Trudgian, Improvements to Turing’s method, Math. Comp. 80 (2011), 2259–2279.
[169] T. S. Trudgian, An improved upper bound for the argument of the Riemann zeta-
function on the critical line II, J. Number Theory 134 (2014), 280–292.
[170] T. S. Trudgian, The sum of the unitary divisor function, Publ. Inst. Math. (Belgrad)
(N.S.) 97 (2015), 175–180.
[171] T. S. Trudgian, A short extension of two of Spira’s results, J. Math. Inequalities 9
(2015), 795–798.
[172] T. S. Trudgian, Improvements to Turing’s method II, Rocky Mountain J. Math. 46
(2016), 325–332.
[173] T. S. Trudgian, Updating the error term in the prime number theorem, Ramanujan J.
39 (2016), 225–234.
[174] A. M. Turing, A method for the calculation of the zeta-function, Proc. London Math.
Soc. 48 (1943), 180–197.
320 References
[175] A. M. Turing, Some calculations of the Riemann zeta-function, Proc. London Math.
Soc. 3 (1953), 99–117.
[176] I. M. Vinogradov, A new estimate for ζ(1 + it), Izv. Akad. Nauk SSSR, Ser. Mat. 22
(1958), 161–164.
[177] A. Walfisz, Weylsche Exponentialsummen in der neueren Zahlentheorie, Mathematis-
che Forschungsberichte, XV, VEB Deutscher Verlag der Wissenschaften, 1963.
[178] S. Wedeniwski, Section 1.1 History, ZetaGrad website, www.zetagrid.net/zeta/
math/zeta.result.100billion.zeros.html.
[179] A. Weil, Sur les “formules explicites” de la théorie des nombres premiers, Comm.
Sém. Math. Univ. Lund (1952), 252–265.
[180] H. S. Wilf, The Redheffer matrix of a partially ordered set, Electron. J. Combin. 11
(2004/2006), Research Paper 10 (5pp.).
[181] M. Yoshimoto, Farey series and the Riemann hypothesis II, Acta Math. Hungar. 78
(1998), 287–304.
[182] D. Zagier, The first 50 million prime numbers, Math. Intelligencer (1977), 7–19.
[183] S. Zegowitz, On the positive region of π(x) − li(x). Master’s Thesis, University of
Manchester, 2010.
Index
Abel’s identity, 70, 79, 106, 107, 194, 275 CAPrimes (RHpack), 287, 296
abundant number, 144–146 Cauchy, A. L., 5
adjoint of a transformation, 282 Caveney, G., 11, 218, 314
admissible zero set, 259, 261 CheckCNk (RHpack), 98, 297
Akbary, A., 170, 197, 313 Cheng, Y., 19, 314
Alaoglu, L., 10, 144, 313 Choi, S., 14, 21, 313
Alfors, L., 260, 313 Choie, Y., 10, 200, 201, 314
AllPrimesFactor (RHpack), 296 Chowla, S., 145, 314
Apostol, T. M., 16, 26, 31–33, 70, 79, 88, 89, 91, colossally abundant number, 12, 144, 153–161,
106, 107, 109, 112, 132, 146, 194, 249, 252, 170, 212, 219, 280, 290
275, 313 ColossallyAbundantF (RHpack), 146, 297
applications of S n , 272 ColossallyAbundantInteger (RHpack), 146, 297
author homepage, xviii ColossallyAbundantQ (RHpack), 146, 297
CompositeQ (RHpack), 298
computable, 241
background reading, 13 ComputeC (RHpack), 98, 298
Backlund, R. J., 20, 21, 39, 47, 48, 54, 72, 101, ComputeDeltas (RHpack), 98, 118, 298
254, 313 Computef (RHpack), 98, 298
BacklundB (RHpack), 16, 296 ComputeH (RHpack), 98, 298
Balasubramanian, R., 247 ComputeKm (RHpack), 41, 49, 299
Barnett, A. R., 17, 284, 314 ComputeTheta (RHpack), 69, 299
Barrett, W. W., 251, 313 ComputeThetaValues (RHpack), 69, 299
Berndt, B. C., 7, 165, 313 Comrie, L. J., 21
Berry, M. V., xx, 283, 313 Conrey, J. B., xviii, 283, 314
Bohr, H., xvii critical epsilon value, 153, 161
Bombieri, E., xvii, xx critical line, 6, 17
Borwein, P., 14, 21, 93, 313 CriticalEpsilonValues (RHpack), 146, 299
Brent, R. P., 21, 313 Crstici, B., 96, 319
Briggs, K., 216, 313 cycle in Sn , 272
Broughan, K. A., 11, 17, 97, 166, 200, 201,
238, 259, 260, 262, 278, 284, 313,
314 Davenport, H., 145, 314
Büthe, J., 22, 314 Davis, M., 239, 315
De Koninck, J.-M., 96, 109, 166, 314, 316
de la Vallée-Poussin, 69
CA number, 144 definition of F(p, α), 153
Caley’s theorem, 272 definition of G(n), 160
321
322 Index
PlotRobinsInequality (RHpack), 169, 308 Rosser, J. B., 9, 20, 21, 29, 39–41, 49, 54, 94, 96,
PlotVonMangoldtPsi (RHpack), 16, 308 98–110, 125, 167, 177, 216, 318
Pollack, P., 166, 315 Rudnick, Z., 283, 319
Pollington, A. D., 251, 313
Pomerance, C., 166, 315
PowerfulQ (RHpack), 308 S n , 238, 271, 272
prime number theorem, 15, 69–80 S (RHpack), 239, 311
PrimePower (RHpack), 308 Sándor, J., 96, 319
primorial, 9, 96 SA number, 144
PrimorialFormQ (RHpack), 309 Sabbagh, K., 13, 319
PrimorialFormToInteger (RHpack), 202, 287, 309 Sagan, B., 272, 319
proper left abundant number, 235 Sarnak, P., 269, 283, 316, 319
properties of Sn , 272 Schmidt, E., 81, 319
Psi (RHpack), 69, 309 Schoenfeld criterion, 68
Psi0 (RHpack), 16, 309 Schoenfeld, L., 9, 20, 21, 29, 39–41, 54, 64, 69,
PsiLargeXEpsilon (RHpack), 41, 309 94, 96, 98–110, 125, 167, 177, 198, 216, 315,
PsiLargeXEpsilon2 (RHpack), 41, 310 318, 319
PsiSmallEpsilon (RHpack), 51 Schumayer, D., 283, 319
PsiSmallXEpsilon (RHpack), 41, 310 self-adjoint transformation, 282
set A, 167, 200, 203, 219, 224
set B, 200, 203
Ramanujan, S., 6, 10, 11, 165, 318 set C, 200
Ramanujan–Robin criterion, 214 set D, 200, 206
Rankin, R., 7 set G, 280
Redheffer criterion, 247–250 set R, 200
Redheffer matrix, 11 set S , 203
Redheffer, R., 248, 318 SetA (RHpack), 202, 311
RedhefferMatrix (RHpack), 239, 310 SetB (RHpack), 202, 311
RH, xvii, xviii, 89, 94, 96, 97, 111, 117, 119, 123, SetC (RHpack), 202, 311
144, 164, 165, 168, 169, 173, 174, 180, SetD (RHpack), 202, 311
184–188, 191, 198, 201, 203, 218, 219, 227, Shah, S. M., 11, 319
235, 236, 238, 251, 253, 256, 259, 271, 273, Shapiro criterion, 236, 239–241
279, 283, 285 Siegel, C. L., 6, 24
RH classical equivalences, 68–92 SigmaOverN (RHpack), 169, 311
RHpack, xix, 12, 41, 69, 98, 120, 140, 146, 169, Skews number, 88
202, 219, 239, 287, 294, 295 Skews, S., 88, 319
Richert, H.-E., 19, 318 Sloan, N. J. A., 147, 319
Riemann hypothesis, xvii, 6–8, 15, 17, 21, 68, 89, Snaith, N. C., 283, 316
145, 167, 214, 219, 225, 231, 235, 273, 283 Solé, P., 10, 200, 212, 314, 319
Riemann hypothesis history, 1–7 Sondow, J., 11, 218, 314
Riemann xi function, 11, 16 Soundararajan, K., 284, 319
Riemann zeta function, 2, 8, 11, 15–39 Speiser, A., 237, 253, 319
Riemann, B., 5, 8, 21, 24, 283, 318 Spira, R., 238, 253, 255, 256, 319
Riemann–Siegel formula, 24 squarefree core, 205
right abundant number, 11, 218, 219, 231, 235 squarefree number, 201–203
RightAbundantQ (RHpack), 219, 310 squarefull number, 201, 202, 206
Robin’s inequality, 12, 201, 203, 206, 212, 214, standard notation, 12
218 sum-of-divisors function, 7
Robin’s inequality failures, 200–217 summary of Volume One, 8–12
Robin’s theorem, 165–193 superabundant number, 10, 12, 144, 147–161,
Robin, G., 7, 10, 11, 64, 144, 160, 163, 165–193, 170, 201, 208, 218, 280, 289
197, 273, 317, 318 SuperAbundantQ (RHpack), 146, 311
RobinsInequalityQ (RHpack), 169, 311 Suriajaya, A. I., 256, 319
Robinson, J., 239, 315 symmetric group criterion, 271–282
Rooney, B., 14, 21, 313 Szalay, S. M., 273, 319
Index 325
table for Nicolas’ reversed inequality, 292 Walfisz, A., 69, 91, 320
table for Robin’s inequality, 288 Wedeniwski, S., 21, 30, 320
table of ni and i , 157 Weierstrass product, 26, 260, 262
table of colossally abundant numbers, 290 Weil, A., 30, 320
table of extremely abundant numbers, 287 Weirathmueller, A., 14, 21, 313
table of maximum symmetric group orders, 293 Wilf, H. S., 248, 320
table of superabundant numbers, 289 Wilson, M., xx
tables of bounds for ψ(x), 41–51 Winter, D. T., 21, 30, 313, 317
te Riele, H. J. J., 21, 30, 91, 313, 317, 318 Wright, J. M., 96, 109, 146, 166, 316
Tenenbaum, G., 81, 319
Titchmarsh, E. C., 14, 16, 21, 319
trivial zeros, 16 Xi (RHpack), 239, 312
Trudgian, T. S., xx, 19–21, 29, 30, 64, 93, 196,
198, 200, 201, 266, 314, 317–319
Turan, P., 11, 315 Yakubovich, S., 11, 147, 218, 219, 318
Turing, A. M., 21, 285, 320 Yohe, J. M., 21
Turner, J. C., xx Yoshimoto, M., 247, 316, 320