Reverse Math
Reverse Math
4.2. Z2 .
Definition 4.2. Z2 , the theory of full second order arithmetic consists of:
• The axioms P − ,
• The second order induction axiom
∀X(0 ∈ X ∧ ∀x(x ∈ X → Sx ∈ X) → ∀x x ∈ X),
• The comprehension axiom
∃X∀x(x ∈ X ↔ φ(x))
for each formula φ where X does not appear free.
3
Proof. First, we consider the proof in general, without worrying about ACA0 .
The idea is that we take the infinite sequence hsn i so that each of its initial
segments is also an initial segment of infinitely many elements of S. For
definiteness, we simply take the sn least such that this is the case.
Formally, suppose we have defined s0 , . . . , sm (possibly with m = −1 for
the base case) such that hs0 , . . . , sm i is an initial segment of infinitely many
elements of S. (This is true by assumption in the base case, since the empty
sequence is an initial segment of all sequences, and so in particular of all
the infinitely many elements of S.) By assumption, there are finitely many
t1 , . . . , tr such that hs0 , . . . , sm , ti i ∈ S. Consider those elements τ of S
such that hs0 , . . . , sm i is an initial segment of τ , and which are not equal to
hs0 , . . . , sm i. This is an infinite set S 0 ⊆ S. For each τ ∈ S 0 , there is exactly
one i ≤ r such that hs0 , . . . , sm , ti i is an initial segment of τ , so we can define
a function f : S 0 → [1, r]. By the infinite pigeonhole principle, there is an
i ≤ r so that f (τ ) = i for infinitely many τ ∈ S 0 ; we take sm+1 = ti .
To formalize this argument in ACA0 , we first show that we can write
down an arithmetic formula (using S as a parameter) defining an example
of the desired sequence. It is convenient to do this in two steps. First, we
simply define S ∗ to be the set of sequences hs0 , . . . , sm i ∈ S such that there
are infinitely many τ ∈ S extending hs0 , . . . , sm i; this is easily expressed with
a few numeric quantifiers (“for every r there is a t > r coding a sequence
extending hs0 , . . . , sm i”).
We wish to show that for every σ ∈ S ∗ , there is an immediate extension
of σ ∈ S ∗ . To see this, let σ ∈ S ∗ be given; then σ ∈ S and there are
infinitely many extensions of σ ∈ S. We carry out the argument above:
let Sσ = {τ | σ is an initial segment of τ }, let Sσ0 be the set of immediate
extensions of σ in S, and define the coloring f : Sσ → Sσ0 . Since Sσ0 is finite,
there is an element of Sσ0 which is the value of f infinitely often, so this value
of Sσ0 belongs to S ∗ .
Now we define our actual infinite sequence, T , to consist of those pairs
(m, sm ) such that there is a hs0 , . . . , sm i ∈ S ∗ such that there is no i ≤ m
and t < si such that hs0 , . . . , si−1 , ti ∈ S ∗ . The definition ensures that for
each m, there is at most one sm such that (m, sm ) ∈ T . To see that there
is such an sm , let T ∗ = {m | ∃sm (m, sm ) ∈ T }; by the induction axiom, it
suffices to show that 0 ∈ T ∗ and whenever m ∈ T ∗ , also m + 1 ∈ T ∗ . The
arguments are essentially the same; suppose (0, s0 ), . . . , (m, sm ) ∈ T . Then
hs0 , . . . , sm i = σ ∈ S ∗ , so there is some t with hs0 , . . . , sm , ti ∈ S ∗ . Let U be
the set of t such that for all t0 < t, hs0 , . . . , sm , t0 i 6∈ S ∗ ; clearly 0 ∈ U , but
there must be some t 6∈ U —take t + 1 where hs0 , . . . , sm , t + 1i ∈ S ∗ , which
9
In the L∀ case, if the main variable is not X, the claim follows immediately
from the inductive hypothesis. So consider the difficult case:
ΓACA0 , Γ, ψ[X/Y ] ⇒ Σ
ΓACA0 , Γ, ∀Y ψ ⇒ Σ
ΓACA0 , Γ, ψ ⇒ Σ
ΓACA0 , Γ0 , ∃Xψ ⇒ Σ0
ΓACA0 , Γ, ψ ⇒ Σ
ΓACA0 , Γ0 , ∀Xψ ⇒ Σ0
11
Note that RCA0 has slightly more induction than its comprehension
scheme suggests; we have to add induction over Σ1 formulas. It turns out
that without the additional induction axiom, the system is too weak even
to use as a base theory for many purposes.
One might wonder why, instead, we don’t add comprehension for Σ1 for-
mulas. The answer is that this theory is equivalent to ACA0 ! The reason
is that we can use multiple steps: if we want {x | ∀y∃zφ(x, y, z)}, we can
first define S1 to be {(x, y) | ∃zφ(x, y)}, define S2 to be {x | ∃y (x, y) 6∈ S1 },
and finally our desired set is {x | x 6∈ S2 }.
In particular, this means that to show that an extension of RCA0 includes
ACA0 , it suffices to show that it has comprehension for Σ1 formulas with
parameters.
In RCA0 , the definition of a real number we gave above does not work.
We should define a real number to by a Cauchy sequence which converges
with a specified rate of convergence.
Definition 4.17. In RCA0 , a real number is a sequence of rational numbers
hqn i such that
∀k ∈ N∀m, n(k < m < n → |qm − qn | < 2−k ).
In ACA0 , this is equivalent to the original definition, in the sense that
ACA0 proves that every number in the original sense is equal to a real
number in this stronger sense. On the other hand, RCA0 cannot prove
useful properties of real numbers in the sense of ACA0 .
The next two theorems are typical results in the area known as Reverse
Mathematics.
Theorem 4.18. Over RCA0 , the Bolzano-Weierstraß Theorem implies
ACA0 .
We mean the Bolzano-Weierstraß Theorem for real numbers in the sense
of RCA0 , of course.
Proof. Fix some Σ1 formula ∃xφ(x, y). We wish to show that {y | ∃xφ(x, y)}
actually exists. The idea is that we’ll define a sequence of rational numbers
which is already convergent, and whose limit encodes exactly this set. More
precisely, we will arrange to converge to
X
2−y .
y|∃xφ(x,y)
sequence, so there is any x such that ∃xφ(x, y) exactly if the y-th digit in
0
the expansion of qy+1 is 1.
This means that the Bolzano-Weierstraß Theorem is equivalent to ACA0
(given that we are working in at least RCA0 ).
Theorem 4.19. Over RCA0 , König’s Lemma implies ACA0 .
Proof. Again, fix some Σ1 formula ∃xφ(x, y). We define a tree of sequences
by saying that σ = (s0 , . . . , sn ) ∈ T if for each i ≤ n, either:
• si = 0 and there is no x ≤ n such that φ(x, i) holds, or
• si > 0 and si − 1 is least such that φ(si − 1, i).
Again, notice that this is given by a formula with only bounded quantifiers.
This is clearly finitely branching; actually, each σ has at most two extensions—
one by 0 and one by the least si −1 with φ(si −1, i). To see that this is infinite,
it suffices to show that for each n, there is a sequence hs0 , . . . , sn i ∈ T ; we
may take this to be the sequence where si is x + 1 where x ≤ n is least such
that φ(x, i), if there is any, and 0 otherwise. This sequence always exists,
and always belongs to T .
By König’s Lemma, there is an infinite sequence hsn i through this. We
claim that ∃xφ(x, y) iff sy 6= 0. Suppose ∃xφ(x, y); then there is a least such
x. The sequence hs0 , . . . , sy−1 , 0, . . . , sx i 6∈ T , so we cannot have sy = 0. On
the other hand, suppose ∀xφ(x, y). Then hs0 , . . . , sy i ∈ T implies sy = 0
since the second clause can never be satisfied.
Just as the first order part of ACA0 is exactly PA, it can be shown that
the first order part of RCA0 is exactly IΣ1 .
4.6. WKL0 and other theories weaker than ACA0 .
One of the surprising discoveries in the early investigation of reverse math-
ematics was that statements from analysis and algebra tended to group into
a small number of categories. In fact, almost all the early theorems investi-
gated turned out to be equivalent to one of five theories, the so-called Big 5
theories of reverse mathematics. RCA0 and ACA0 are two of them.
Definition 4.20. The theory WKL0 consists of the axioms of RCA0 to-
gether with the following axiom:
Suppose S is a tree of sequences of 0’s and 1’s containing
infinitely many elements. Then there is an infinite sequence
hsn i such that all its initial segments are in S.
This axiom is called “weak König’s lemma”. It is clearly a special case
of König’s lemma, so WKL0 is between RCA0 and ACA0 . To see that
WKL0 is actually stronger than RCA0 is typically an exercise in com-
putability: it is easy to show that there is a model of RCA0 where the sets
are exactly the computable sets, and one then shows that is not a model
of WKL0 . On the other hand, it can be shown that not only is WKL0
weaker than ACA0 , it’s actually quite close to RCA0 .
14
There are several proofs of this theorem; the simplest one, due to Har-
rington, uses forcing to show that any model of RCA0 can be extended
to a model of WKL0 by adding additional sets. An alternate proof, due
to Ferreira and Ferreira, uses cut-elimination. It immediately follows that
the same holds for Π11 sentences, since deducing ∀Xφ(X) is equivalent to
deducing φ(X) with X free.
One consequence is that the proof-theoretic ordinal of WKL0 is the same
as that of RCA0 (ω ω ); this is easily seen since stating that an ordinal is
well-founded is a Π11 sentence. Nonetheless, WKL0 proves many theorems
about the existence of sets that RCA0 does not. For example:
Theorem 4.22. The statement that every consistent theory has a complete
consistent extension is equivalent to WKL0 (using RCA0 as a base theory).
Proof. Recall that we can formalize all our proof theory, interpreting for-
mulas by sequences of symbols, which are in turn coded by numbers, and
deductions as sequences of formulas which can again be coded by numbers.
Let S be a set of formulas with the property that there is no deduction of
S 0 ⇒ ⊥ where S 0 ⊆ S.
For each sequence σ of 0’s and 1’s, we will define a finite set of formulas
Xσ : if n < |σ| and n codes a formula φ then n ∈ Xσ if the n-th position of σ
is 1, and the number coding ¬φ is in Xσ if the n-th position of σ is 0. Now
we define a tree T consisting of those σ such that there is no deduction of
S 0 ⇒ ⊥ where S 0 ⊆ S ∪ Xσ and the code of the deduction is ≤ |σ|.
Note that we really need the bound on the code of the deduction in
order to prove in RCA0 that the tree exists: the statement that S ∪ Xσ is
consistent—that is, that there is no deduction at all—is Σ1 .
We claim that for every n, there is a σ ∈ T with |σ| = n: if not, for all the
finitely many σ of length n we have a deduction of ⊥ from some subset of
S ∪Xσ . But given a deduction of ⊥ from S0 ∪{φ} and a deduction of ⊥ from
S1 ∪ {¬φ}, we may easily derive a deduction of ⊥ from S0 ∪ S1 . Iterating
this argument finitely many times, we obtain a deduction of ⊥ from some
subset of S, contradicting our assumption. S
By WKL0 , there is an infinite branch Λ through T . Take X = σ@Λ Xσ .
Clearly S ⊆ X and X is consistent: if there were any deduction of a contra-
diction, it would be coded by some number, and use only a finite subset of
X, and therefore at some finite length an initial segment would be rejected
from T . For any formula φ, φ is coded by some n, so when |σ| = n + 1,
we have ensured either φ is in Xσ or the negation of φ is. Therefore X is a
complete, consistent extension of S, as desired.
For the converse, suppose every consistent theory has a complete exten-
sion, and let T be a tree of 0’s and 1’s with infinitely many elements. We
define a language with countably many zero-ary relation symbols Pσ , one
for each sequence σ. Define a theory S to consist of:
15
W
• For each n, the formula σ,|σ|=n Pσ ,
• If |σ| = |τ |, Pσ → ¬Pτ ,
• If σ 6∈ T , ¬Pσ .
Observe that this theory is consistent: a deduction of a contradiction would
use finitely many sentences from S, and in particular, only contains symbols
Pσ with |σ| ≤ n for some n. But the formulas with such propositional
variables have a model, namely take any σ ∈ T and make Pτ true iff τ v σ.
By
S assumption, this theory has a complete extension X, and we take Λ =
Pσ ∈X σ.
Π11 -CA0 obviously includes ACA0 . The proof-theoretic ordinal of Π11 -CA0
is too complicated to describe here, but we briefly explain why. Suppose we
wanted to replace the Π11 comprehension axiom the same way we replaced
the arithmetic comprehension axiom. The main case is where we introduce
some formula on the left-hand side:
ΓACA0 , Γ, ∀x(x ∈ X ↔ ∀Y φ(x, Y )) ⇒ Σ.
In ACA0 , we could replace the variable X with ∀Y φ(x, Y ) everywhere it
appeared in the deduction. Here, if we can try to do the same, we discover
that the variable X could have been used to deduce instances of ∃Y ¬φ(x, Y )!
In other words, the set X consisting of those x where ∀Y φ(x, Y ) was true
might have used the set X itself to demonstrate that ∀Y φ(x, y) was false for
some particular x!
This is not a minor issue. Π11 -CA0 introduces a deep new obstacle to
cut-elimination: which elements belong to a set defined by a Π11 formula
depends on a quantification over all sets, including the set which is in the
process of being defined. This is a genuine circularity, and raises genuine
new difficulties, both mathematically and philosophically. (We should con-
sider that it is not obvious, when stated like this, that Π11 comprehension is
actually well-defined at all: how can we be sure that we won’t accidentally
write down some set X with the property that x ∈ X iff x 6∈ X?) This the-
ory is impredicative: the meaning of a set defined by a Π11 formula depends
essentially on a family of objects—the sets of numbers—which has not yet
been completely constructed.
For the sake of comparison with the literature, we mention that the proof
theoretic ordinal of Π11 -CA0 is given as the limit of a sequence of ordinals,
with most (though not all) of the technical work needed to produce the first
one (the later ones iterate the same idea relative to the first one). The first
of these ordinals is known as the Howard-Bachmann ordinal, and is usually
written ψΩ+1 . Here Ω is (a name for) some very large ordinal, larger than
any other in the system—for example, ℵ1 —Ω+1 is the first fixed point of
the α 7→ ω α which is larger than Ω, and ψ is a collapsing function which
takes a very large ordinal and “collapses” it to a smaller one. (Alternatively,
one can view Ω+1 as representing the function α 7→ α+1 and ψ as being
an operation which chooses an ordinal which is both a fixed point of this
function and also which is larger than the chosen fixed point of all “easily
defined” functions which grow more slowly.) Note that this definition is
itself impredicative—we define the ordinal in terms of larger ordinals (or
the class of functions on ordinals).