Localglobal
Localglobal
KEITH CONRAD
1. Introduction
Hensel created p-adic numbers near the start of the 20th century and they had some early
positive influences: Hensel used them to solve a conjecture of Dedekind about discriminants
in algebraic number theory and they directly inspired Steinitz [10] to develop a general
theory of fields in order to account for the p-adic numbers alongside previously known fields
of numbers and functions. (See [8] for a discussion of Steinitz’s paper in English.) Most
mathematicians viewed p-adic numbers with suspicion, perhaps in part because of their
unclear foundations and Hensel’s mistaken proof by p-adic numbers that e is transcendental.
The first indication that the p-adic numbers for all p together have a conceptual role in
mathematics came in the 1920s when Hasse discovered that Minkowski’s work on quadratic
forms over the rational numbers could be streamlined by expressing the results in terms
of quadratic forms over the real numbers and all p-adic numbers. Hasse became a strong
advocate of this point of view for number theory, which came to be called the local-global
principle or Hasse principle. Roughly speaking, it states
a theorem or property holds over Q if and only if it holds over R and Qp for all p
or, more generally,
study a problem over Q by studying it in R and all Qp .
The local-global principle is not a definite theorem, but more of a philosophy. It plays a
role comparable in number theory to the idea in geometry of studying global properties of
a curve or surface through the local geometric properties near each point on the curve or
surface. We call Q a global field and the fields R and Qp local fields.1
We will see what the local–global principle is about by taking a classical theorem about
sums of two squares in the integers and reformulating it as a theorem in R and every Zp .
A few examples will show the deficiency of working with Zp and what is gained by working
with Qp instead. Then we will state Hasse’s version of Minkowski’s theorem on quadratic
forms over Qp as an example where the local–global principle works. Next we will see
counterexamples to the local–global principle. Finally we will discuss a local–global result
for powers and for heights.
Example 2.2. Let m = 15 = 3 · 5. Its only prime factor that is congruent to 3 mod 4 is
3, which divides 15 only once. You can check that 15 is not a sum of two squares. The
number m = 45 = 32 · 5 is divisible by 3 two times and 45 = 9 + 36 = 32 + 62 .
To reformulate Theorem 2.1 in terms of p-adic numbers, we describe when a (nonzero)
p-adic integer is a sum of two squares in Zp .
Theorem 2.3. For a prime p ≡ 1 mod 4, every p-adic integer is a sum of two squares of
p-adic integers.
Proof. If p ≡ 1 mod 4 then a theorem of Fermat from number theory says −1 mod p is a
square: −1 ≡ s20 mod p for some integer s0 . Then Hensel’s lemma lifts that up to saying
s2 + 1 = 0 has a solution in Zp : f (x) = x2 + 1 has f (s0 ) ≡ 0 mod p and f 0 (s0 ) = 2s0 6≡
0 mod p.
For each t in Zp ,
(1 + t)2 + (s(t − 1))2 = 1 + 2t + t2 − (t2 − 2t + 1) = 4t,
so t = ((1 + t)/2)2 + (s(t − 1)/2)2 and the numbers (1 + t)/2 and s(t − 1)/2 are in Zp since
p 6= 2.
Theorem 2.4. For a prime p ≡ 3 mod 4, a nonzero p-adic integer t is a sum of two squares
in Zp if and only if ordp (t) is even.
Proof. Write t = pe u, with e ≥ 0 and u ∈ Z×p.
2 2
Step 1: We can write u = x + y for some x and y in Zp .
This will follow from the pigeonhole principle to solve the equation as a congruence mod
p first and then from Hensel’s lemma to lift the solution mod p to a solution in Zp . We
start by considering the two sets
A = {y 2 mod p : 0 ≤ y ≤ p − 1}, B = {u − x2 mod p : 0 ≤ x ≤ p − 1}.
For odd prime p, the number of squares in Z/(p), including 0, is (p + 1)/2. Therefore
|A| = (p + 1)/2 and |B| = (p + 1)/2. Since |A| + |B| = p + 1 > |Z/(p)|, the sets A and
B must overlap by the pigeonhole principle: there are x0 and y0 from 0 to p − 1 such that
y02 ≡ u − x20 mod p, so u ≡ x20 + y02 mod p. At least one of x0 or y0 is nonzero modulo p.
Since the congruence is symmetric in x0 and y0 we can assume without loss of generality
that x0 6≡ 0 mod p. Then define
f (X) = X 2 + (y02 − u) ∈ Zp [X].
We have f (x0 ) ≡ 0 mod p and f 0 (x0 ) = 2x0 6≡ 0 mod p, so by Hensel’s lemma there is an
x ∈ Zp such that f (x) = 0, so x2 + y02 = u.
Step 2: e is even.
Write e = 2k. Then t = p2k u = p2k (x2 + y 2 ) by Step 1, so t = (pk x)2 + (pk y)2 .
Step 3: e is odd.
Assuming t = x2 + y 2 in Zp we will get a contradiction. Since ordp (t) is odd we can’t
have x or y equal to 0 (otherwise t would be a square, hence of even p-adic valuation). We
also must have ordp (x) = ordp (y), since if the two valuations were not equal then x2 and y 2
would have different valuations, making ordp (t) = max(2 ordp (x), 2 ordp (y)), which is even.
Write x = pn v and y = pn w where n ≥ 0 and v and w are in Z× p . Then
t = x2 + y 2 = p2n (v 2 + w2 ).
THE LOCAL-GLOBAL PRINCIPLE 3
polynomial equation has a solution in a complete field than in a field like Q: an approximate
solution can often be refined to an exact solution using limits. For example, the equation
x2 + y 2 − z 2 = c clearly has a real solution for every c ∈ Q× because we see both plus and
minus signs. The condition for a quadratic form equation over Qp to be solvable in Qp is
a bit harder than checking signs of coefficients, but it is algorithmic and can be found in
detailed treatments of the Hasse–Minkowski theorem. (See [5].)
4. Counterexamples
If we move beyond quadratic forms, which have degree 2, to polynomial equations of
degree 3 or higher, then we find more counterexamples to the local-global principle for
Z-solutions as we saw in Examples 3.1 and 3.2 and also even counterexamples to the local–
global principle for Q-solutions.
Example 4.1. The equation y 2 = x3 − 51 has the rational solution (1375/9, 50986/27),
which is a solution in Zp for p 6= 3. At p = 3 set x = 1 to make the equation y 2 = −50,
which has a solution in Z3 since −50 ≡ 1 mod 3. Therefore the equation y 2 = x3 − 51 has
a solution in R (obviously) and in each Zp ,4 but by methods of algebraic number theory or
elliptic curves it can be shown this equation has no solution in integers.
Example 4.2. Here is a famous example of Selmer [9]: the cubic equation
3x3 + 4y 3 + 5z 3 = 0
has a solution other than (0, 0, 0) in R and in each Qp , but it has no solution in Q other
than (0, 0, 0). p
In R we have the solution ( 3 5/3, 0, −1). To show there is a solution besides (0, 0, 0) in
each Qp we follow a method I learned from Kevin Buzzard. The basic idea is to show there
is a nonzero solution modulo p and then lift that solution p-adically by Hensel’s lemma. We
will separately treat the cases p = 3, p = 5, and p 6= 3 or 5.
To find a 3-adic solution, set x = 0 and z = −1, making the equation 4y 3 − 5 = 0, or
3
y = 5/4. Although 5/4 ≡ −1 mod 9 and −1 is a 3-adic cube, this congruence modulo 9
isn’t sharp enough to conclude by Hensel’s lemma that 5/4 is a 3-adic cube: to use Hensel’s
lemma (in the form |f (α)|3 < |f 0 (α)|23 ), we seek an α ∈ Z× 3
3 such that |α − 5/4|3 < 1/9,
i.e., α3 ≡ 5/4 mod 27. The choice α = 2 works, so 5/4 is a 3-adic cube and we can solve
Selmer’s equation in Q3 as (0, y, −1) where y 3 = 5/4 in Z3 .
If p 6= 3 and the p-adic integer a is a nonzero cube mod p then a is a cube in Z× p by
3
Hensel’s lemma for X − a. In particular, for p = 5, set y = −1 and z = −1 to make
Selmer’s equation 3x3 − 4 − 5 = 0, or x3 = 3. Since 3 ≡ 23 mod 5, by Hensel’s lemma for
X 3 − 3 with approximate solution 2 we see that 3 is a 5-adic cube. We get a 5-adic solution
to Selmer’s equation as (x, −1, −1) where x3 = 3 in Z5 .
From now on let p be a prime other than 3 or 5 (this includes allowing p = 2). Then
3, 5 6≡ 0 mod p. We are going to look at the group (Z/(p))× , which is cyclic of order p − 1.
What proportion of the group is filled up by cubes?
• If p ≡ 1 mod 3 then the cubes in (Z/(p))× are a subgroup of index 3.
• If p 6≡ 1 mod 3 then (3, p − 1) = 1, so every number in (Z/(p))× is a cube.
If 3 mod p is a cube then 3 is a cube in Zp by Hensel’s lemma for X 3 − 3, so we can solve
Selmer’s equation as (x, −1, −1) where x3 = 3 in Qp .
4More generally, for every integer k the equation y 2 = x3 + k has a solution in R and in every Z .
p
6 KEITH CONRAD
If 3 mod p is not a cube then not all numbers in (Z/(p))× are cubes. Thus p ≡ 1 mod
3, so the nonzero cubes mod p are a subgroup of (Z/(p))× that has index 3 and coset
representatives {1, 3, 9}: for every a 6≡ 0 mod p we have a ≡ b3 , 3b3 , or 9b3 mod p for some
b 6≡ 0 mod p. We will apply this with a = 5.
• If 5 ≡ b3 mod p then 5 is a cube in Zp by Hensel’s lemma for X 3 − 5, and we can
solve Selmer’s equation as (−y, y, −1) where y 3 = 5 in Zp .
• If 5 ≡ 3b3 mod p then 5/3 is a cube in Zp by Hensel’s lemma and we can solve
Selmer’s equation as (x, 0, −1) where x3 = 5/3.
• If 5 ≡ 9b3 mod p then 5 · 3 = 15 is a cube in Zp by Hensel’s lemma and we can solve
Selmer’s equation as (3t, 5, −7) where t3 = 15. That is, 3a3 + 4b3 + 5c3 = 0 where
a = 3t, b = 5, and c = −7. By homogeneity there is a solution (3t/7, 5/7, −1) too.
This completes the proof that Selmer’s equation has local solutions everywhere. That the
equation has no rational solution besides (0, 0, 0) is harder. There are proofs using algebraic
number theory [4] or elliptic curves [3, pp. 86–87].
This example implies 3x3 + 4y 3 = 5 has a solution in R and every Qp since in each
completion we found a solution to 3x3 + 4y 3 + 5z 3 = 0 where z = −1.
Even though the local–global principle for cubic equations is not always true, the Birch
and Swinnerton-Dyer conjecture describes a relationship between the behavior of rational
solutions of cubic equations (elliptic curves) and their real and p-adic solutions.
many primes satisfying each of those congruence conditions, so we can always show −2 is
not a square using local considerations if any finite number of completions is removed.
Theorem 5.1 with finitely many completions removed works not just for n = 2, but for all
n up to 7. That is, if 2 ≤ n ≤ 7 then the nth powers in Q× are precisely the nonzero rational
numbers that are nth powers in all but finitely many completions of Q. However, when
n = 8 the pattern breaks: being an 8th power in all but at most finitely many completions
of Q need not imply being an 8th power in Q.
Example 5.2. We show 16 is an 8th power in every completion of Q except Q2 . Write
X 8 − 16 = (X 4 − 4)(X 4 + 4) = (X 2 − 2)(X 2 + 2)(X 2 + 2X + 2)(X 2 − 2X + 2).
The two quadratic factors both have discriminant −4, so there is an 8th root of 16 in a
completion of Q as long as that completion contains a square root of 2 or −2 or −4.
Clearly 2 is a square in R. For each odd prime p, one of the numbers 2, −2, or −4 is a
square in (Z/(p))× (more generally, if a, b 6≡ 0 mod p then a, b, or ab is a square in (Z/(p))×
– this is because the squares in (Z/(p))× are a subgroup of index 2), so 2, −2, or −4 is a
square in Qp by Hensel’s lemma. Therefore 16 an 8th power in every completion of Q other
than Q2 , where it obviously is not since ord2 (16) is not a multiple of 8.
A similar example occurs for each n divisible by 8 (and not for any other n): 2n/2 is an
nth power in each Qp and R except for Q2 . Indeed, in each of those fields we can write
16 = x8 so 2n/2 = 16n/8 = xn . Here is a version of this story using just modular arithmetic
rather than anything p-adic.
Theorem 5.3. For a ∈ Z, if xn ≡ a mod p is solvable for all but finitely many primes p
then a = bn for some b ∈ Z if 8 - n, and a is either bn or 2n/2 bn for some b ∈ Z if 8 | n.
When p - n and p - a, solvability of the mod p congruence xn ≡ a mod p in Z/(p) and
solvability of the equation xn = a in Zp are equivalent by Hensel’s lemma.
Theorem 5.3 was first proved by Trost [11] in 1934.5 Nobody noticed and the theorem was
rediscovered by Ankeny and Rogers [1] in 1951. The criterion describing when a nonzero
number in Q or any finite extension of Q is an nth power if it is an nth power in all but
finitely many completions is called the Grunwald–Wang Theorem [14]. It was originally just
Grunwald’s theorem (1933), with no exceptional case recognized when 8 | n, until Wang [12]
found a counterexample to it 15 years after Grunwald’s paper appeared [6] and 6 years after
Whaples [13] had published a second proof of Grunwald’s incorrect theorem. The setting in
which Wang worked was sufficiently technical that the very simple form of Example 5.2 as
a counterexample was not noticed for a while. That is probably how [1] could appear a few
years after [12] in the same journal with no mention of their connection or the link to [11].
The product formula seems innocuous, but it is a fundamental result gluing together all
the different absolute values of Q and can in fact be taken as a starting point for algebraic
number theory [2].
Theorem 6.2. Let r ∈ Q be rational. Then
Y
H(r) = max(|r|v , 1),
v
Q
Multiplying this by 1 in the form v |b|v , from the product formula, we get
Y Y
Y a Y Y |a|v |a|v
max ,1 = |b|v max ,1 = |b|v max ,1 = max (|a|v , |b|v )
v
b v v v
|b|v v
|b|v v
and now we use the fact that a and b are relatively prime integers: for each prime p, |a|p ≤ 1
and |b|p ≤ 1, with either |a|p = 1 or |b|p = 1 because p can’t divide both a and b. Thus
max(|a|p , |b|p ) = 1 for all p, so
Y
max (|a|v , |b|v ) = max(|a|∞ , |b|∞ ),
v
which by definition is the height of r.
More generally, for n ≥ 1 and rational numbers r0 , . . . , rn that are not all 0 we have
Y
(6.2) max(|r0 |v , . . . , |rn |v ) = max(|a0 |∞ , . . . , |an |∞ )
v
where a0 , . . . , an are the numerators of the ri ’s when we write them with a least common
denominator d: ri = ai /d for all i. (Necessarily the numerators a0 , . . . , an are relatively
prime as an n-tuple, since otherwise d would not be a least common denominator.) When
n = 1, r0 = r, and r1 = 1 we recover Theorem 6.2 from (6.2) since r = a/b and 1 = b/b
is the representation of r and 1 with least common denominator when a/b is the reduced
form of r.
If we replace ri in (6.2) with sri for some Q common factor s ∈ Q× then the left side of
(6.2) is unchanged by the product formula ( v |s|v = 1). This makes (6.2) the starting
point for the study of heights on projective n-space over Q.
References
[1] N. C. Ankeny and C. A. Rogers, A Conjecture of Chowla, Annals of Math. 53 (1951), 541–550.
[2] E. Artin and G. Whaples, Axiomatic Characterization of Fields by the Product Formula for Valuations,
Bull. Amer. Math. Soc. 51 (1945), 469–492.
[3] J. W. S. Cassels, “Lectures on Elliptic Curves,” Cambridge Univ. Press, Cambridge, 1991.
[4] K. Conrad, Selmer’s Example, https://kconrad.math.uconn.edu/blurbs/gradnumthy/selmerexample.
pdf.
[5] A. Gamzon, The Hasse–Minkowski Theorem, UConn Honors Thesis, 2006. URL http://digital
commons.uconn.edu/srhonors theses/17/.
[6] W. Grunwald, Ein allgemeines Existenztheorem für algebraische Zahlkörper, J. Reine Angew. Math. 169
(1933), 103–107.
[7] K. Ireland and M. Rosen, “A Classical Introduction to Modern Number Theory,” 2nd ed., Springer,
1990.
[8] P. Roquette, In Memoriam Ernst Steinitz, http://www.rzuser.uni-heidelberg.de/∼ci3/STEINITZ.pdf.
[9] E. Selmer, The Diophantine equation ax3 + by 3 + cz 3 = 0, Acta Arithmetica 85 (1951), 203–362.
[10] E. Steinitz, Algebraische Theorie der Körper, J. reine angew. Math. 137 (1910), 167-309
[11] E. Trost, Zur Theorie der Potenzreste, Nieuw Arch. Wisk. 18 (1934), 58–61.
[12] S. Wang, A Counter-Example to Grunwald’s Theorem, Annals of Math. 49 (1948), 1008–1009.
[13] G. Whaples, Non-analytic class field theory and Grunwald’s theorem, Duke Math. J. 9 (1942), 455–473.
[14] Wikipedia, Grunwald–Wang Theorem, https://en.wikipedia.org/wiki/Grunwald-Wang_theorem.