Mathematical Logic I
Mathematical Logic I
CONTENTS
Introduction
A. The Topic . . . . . . . . . . . . . . .
B. Metalanguage and Object Language
C. Set Theory . . . . . . . . . . . . . .
D. Mathematical Induction . . . . . . .
1
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
1
1
2
3
6
.
.
.
.
.
.
.
.
.
8
8
9
12
16
17
19
22
23
25
.
.
.
.
.
.
.
.
28
28
31
35
39
40
41
43
50
.
.
.
.
.
56
56
59
63
64
68
F.
G.
H.
4
ii
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
71
77
80
85
85
86
88
94
95
96
98
101
102
104
107
INTRODUCTION
A.
The Topic
B.
Metalinguistic variables
Ordinary English doesnt really use variables, but
they make our lives a lot easier. Since the metalanguage is usually used in this course to discuss the
object language, the variables we use most often
in the metalanguage are variables that are used to
talk about all or some expressions of the metalanguage. Especially when we get to predicate logic,
where the object language itself contains variables,
again, we dont want to get the variables of the
object language confused with those of the metalanguage. Since predicate logic uses letters like
x and y as variables in the object language, it is
important to be clear when a variable is part of the
language. This can be done by making the metalanguages variable distinctive. For example, I use
fancy script letters like A and B in the metalanguage to mean any object-language expression of
a certain specified type. For example, I might write
something like:
C.
Set Theory
P Q
P
Q
3
Definition: The intersection of and , written , is the set that contains everything that
is a member of both and .
Sets
Definition: If and are sets, then they are disDefinition: A set is a collection of entities for
joint iff they have no members in common, i.e., iff
which it is determined, for every entity of a given
= .
type, that the entity either is or is not included in the
set.
{A} means the set containing A alone. Generally, {A1 , . . . , An } means the set containing all
of A1 , . . . , An , but nothing else.
The members of sets are not ordered, so from
{A, B} = {C, D} one cannot infer that A = C,
only that either A = C or A = D.
Definition: If and are sets, the union of Definition: If R is a binary relation, then the doand , written , is the set that contains ev- main of R is the set of all A for which there is an B
such that hA, Bi R.
erything that is a member of either or .
4
sitive iff for all A, B and C in the field of R, if and = Z but there is no set W such that W
and W
= .
hA, Bi R and hB, Ci R then hA, Ci R.
Definition: A binary relation R is an equiva- Definition: If is a set, then A is denumerable
lence relation iff R is symmetric, transitive and iff is equinumerous with the set of natural numbers
{0, 1, 2, 3, 4, . . . , (and so on ad inf.)}.
reflexive.
Definition: If R is an equivalence relation then, Definition: Aleph null, also known as aleph
the R-equivalence class on A, written [A]R , is naught, written 0 , is the cardinal number of any
the set of all B such that hA, Bi R.
denumerable set.
Definition: If is a set, then is finite iff either
= or there is some positive integer n such that
Definition: A function (in extension) is a bi- is equinumerous with the set {1, . . . , n}.
nary relation which, for all A, B and C, if it includes
hA, Bi then it does not also contain hA, Ci unless Definition: A set is infinite iff it is not finite.
B = C.
Definition: A set is countable iff it is either finite
So if F is a function and A is in its domain, then or denumerable.
there is a unique B such that hA, Bi F ; this
unique B is denoted by F (A).
Homework
Assuming
that , and Z are sets, R is a relaDefinition: An n-place function is a function
whose domain consists of n-tuples. For such a tion, F is a function, and A and B are any entities,
function, we write F (A1 , . . . , An ) to abbreviate informally verify the following:
(1) A {B} iff A = B
F (hA1 , . . . , An i).
(2) if and Z then Z
Definition: An n-place operation on is a (3) if and then =
function whose domain is n and whose range is (4) ( ) Z = ( Z)
a subset of .
(5) ( ) Z = ( Z)
(6) = and =
Definition: If F is a function, then F is one-one (7) =
iff for all A and B in the domain of F , F (A) = (8) ( ) ( ) =
F (B) only if A = B.
(9) 1 =
Functions
D.
Mathematical Induction
Well also be expanding the logic of the metalan- This principle is often used in logical metatheory.
induction. Actuguage by allowing ourselves the use of mathemati- It is a corollary of mathematical
0
ally,
it
is
a
version
of
it.
Let
be
the property a
cal induction, a powerful tool of mathematics.
number has if and only if all wffs of the logical
Definition: The principle of mathematical in- language having that number of logical operators
have . If is true of the simplest well-formed forduction states the following:
If ( is true of 0), then if (for all natural numbers n, mulas, i.e., those that contain zero operators, then
0
if is true of n, then is true of n + 1), then is 0 has . Similarly, if holds of any wffs that are
constructed out of simpler wffs provided that those
true of all natural numbers.
simpler wffs have , then whenever a given natural
0
0
To use the principle mathematical induction to ar- number n has then n + 1 also has . Hence, by
rive at the conclusion that something is true of mathematical induction, all natural numbers have
0
all natural numbers, one needs to prove the two , i.e., no matter how many operators a wff contains, it has . In this way wff induction simply
antecedents, i.e.:
reduces to mathematical induction.
Similarly, this principle is usually utilized by
Base step. is true of 0
proving the antecedents, i.e.:
Induction step. for all natural numbers n, if is
Base step. is true of the simplest well-formed
true of n, then is true of n + 1
formulas (wffs) of that language; and
Typically, the induction step is proven by means of
Induction step. holds of any wffs that are cona conditional proof in which it is assumed that
structed out of simpler wffs provided that
is true of n, and from this assumption it is shown
those simpler wffs have .
that must be true of n + 1. In the context of this
conditional proof, the assumption that is true of Again, the assumption made when establishing the
n is called the inductive hypothesis.
induction step that holds of the simpler wffs is
From the principle of mathematical induction, called the inductive hypothesis.
one can derive a related principle:
Well also be using:
Definition: The principle of complete (or
strong) induction states that:
If (for all natural numbers n, whenever is true of
all numbers less than n, is also true of n) then is
true of all natural numbers.
The principle of proof induction is an obvious corollary of the principle of complete induction. The
steps in a proof can be numbered; were just applying complete induction to those numbers.
Homework
Answer any of these we dont get to in class:
(1) Let be the property a number x has just in
case the sum of all numbers leading up to and
including x is x(x+1)
. Use the principle of math2
ematical induction to show that is true of all
natural numbers.
(2) Let be the property a number x has just in
case it is either 0 or 1 or it is evenly divisible by
a prime number greater than 1. Use the principle of complete induction to show that is
true of all natural numbers.
(3) Let be the property a wff A of propositional
logic has if and only if has a even number of
parentheses. Use the principle of wff induction
to show that holds of all wffs of propositional
logic. (If needed, consult the next page for a
definition of a wff in propositional logic.)
(4) Consider a logical system for propositional
logic that has only one inference rule: modus
ponens. Use the principle of proof induction to
show that every line of a proof in this system
is true if the premises are true.
UNIT 1
METATHEORY FOR PROPOSITIONAL LOGIC
A.
,
Conjunction
&,
Disjunction
+
Material conditional
,
Material biconditional
,
Definition: A well-formed formula* (abbrevi- Feel free to use whatever signs you prefer. I might
ated wff) is defined recursively as follows:
not even notice.
1
Here we are not really using the phrase A , since this definition is in the metalanguage and is not part of
English. Nor, however, are we mentioning it, since A is not a part of the object language. Really we should be using
special quasi-quotation marks, also known as Quine corners, where pA q is the object language expression formed by
concatenating to whatever expression A is. Although imprecise, I forgo Quine corners and rely just on context, to avoid
a morass of these marks, and to allow for another use of the same notation Mendelson uses in chap. 3.
B.
Parentheses Conventions
The chart above also gives the ranking of the connectives used when omitting parentheses. Sometimes when a wff gets really complicated, its easier
to leave off some the parentheses. Because this
leads to ambiguities, we need conventions regarding how to read them. When parentheses are omitted, and it is unclear which connective has greater
scope, the operator nearer the top on the list above
should be taken as having narrow scope, and the
operator nearer the bottom of the list should be
taken as having wider scope. For example:
The Semantics of
Propositional Logic
ABC
is an abbreviation of:
(A (B C))
whereas:
(P
T
T
F
F
ABC
is an abbreviation of:
((A B) C)
T
T
T
F
Q)
T T
F T
T T
F F
(Q
F T
F F
T T
F F
(P
T T
T T
F
F
T
F
Q))
T
T
F
F
F
T
T
F
A
wff B iff there is no possible truth-value assignment Definition: Two wffs A and B are said to be conto the statement letters making them up that makes sistent or mutually satisfiable if and only if there
A true and B false.
is at least one truth-value assignment to the statement letters making them up that makes both A
Abbreviation: The notation:
and B true.
A B
means that A logically implies B. Note that this
sign is part of the metalanguage; it is an abbreviation of the English words . . . logically implies . . . .
The sign is not used in the object language.
So A (B D) and A (B C ) are
nonsense.
Definition: Two wffs A and B are logically
equivalent if and only if every possible truth-value
assignment to the statement letters making them up
give them the same truth value.
10
Proof:
(1) Assume that A B. We need to show that
(A B) is a tautology.
(2) Suppose for reductio ad absurdum (indirect
proof) that (A B) is not a tautology. This
means that there is some truth-value assignment that does not make (A B) true. It
must make (A B) false.
(3) According to the truth table rules for the sign
, this means it must make A true and B
false.
(4) However, this contradicts the assumption that
A B, since that rules out any truth-value
assignment making A true and B false.
(5) Our supposition at line (2) must be mistaken,
and so (A B) after all.
Lines (1)(5) represent a conditional proof in the
metalanguage that if A B then (A B).
We need to go the other way as well.
(6) Assume that (A B).
(7) Assume for reductio ad absurdum that it is not
true that A B. This means that there is at
least one truth-value assignment that makes
A true but B false.
(8) Since there is at least one truth-value assignment that makes A true and B false, there is
at least one truth-value assignment that makes
(A B) false, given the rules for constructing truth tables for the sign .
(9) However, this contradicts our assumption at
line (6). Hence, A B after all.
Proof:
Similar to previous example.
11
(2) This that both A and (A B) are tautologies, i.e., that every possible truth-value assignment to the statement letters making them up
makes them true.
(3) Suppose for reductio ad absurdum that there
were some truth-value assignment (row of a
truth table) making B false.
(4) Notice that because every truth-value assignment makes (A B) true, if it makes B false
it must make A false as well.
(5) From lines (3) and (4) we get the result that
there is a truth-value assignment making A
false.
(6) However, it follows from line (2) that no truthvalue assignment makes A false.
(7) Lines (5) and (6) are a contradiction, and so our
assumption at line (3) is false, and so B.
(8) Therefore, by conditional proof, if A and
(A B) then B.
e
Result (Adequate Connectives): Every possible truth function can be represented by means
of the connectives , and alone.
C.
Proof:
Well prove this somewhat informally.
(1) Assume that A is some wff built using any set
of truth-functional connectives, including, if
you like, connectives other than our five. (A
might make use of some three or four-place
truth-functional connectives, or connectives
such as the exclusive or, or any others you
might imagine for bivalent logic.)
(2) What were going to show is that there is a wff
B formed only with the connectives ,
and that is logically equivalent with A .
(3) In order for it to be logically equivalent to A ,
the wff B that we construct must have the
same final truth value for every possible truthvalue assignment to the statement letters making up A , or in other words, it must have the
same final column in a truth table.
(4) Let P1 , P2 , . . . , Pn be the distinct statement
letters making up A . For some possible truthvalue assignments to these letters, A may be
true, and for others A may be false. The
only hard case would be the one in which
A is contingent. Clearly tautologies and selfcontradictions can be constructed with the
12
Example: Suppose the letters involved are (6) The wff B constructed in step (5) is logically
A, B and C. This means that there
equivalent to A . Consider that for those truthare eight possible truth-value assignments,
value assignments making A true, one of the
corresponding to the eight rows of a truth
conjunctions making up the disjunction B is
table. We construct an appropriate contrue, and hence the whole disjunction is true as
junction for each.
well. For those truth-value assignments making A false, none of the conjunctions making
A B C
Conjunction
up B is true, because each conjunction will
T T T
ABC
contain at least one conjunct that is false on
T T F
A B C
that truth-value assignment.
T F T
A B C
T F F
A B C
Example: Let us construct a truth table for the
F T T
A B C
formula we constructed during our last step:
F T F
A B C
F F T
A B C
(ABC)(ABC) (ABC)
F F F
A B C
T TTT T T T TT F F T T F TF T F T
b) From the resulting conjunctions, form a
T T T F F T T T T TT F T F T F T F F
complex disjunction formed from those
T F F F T F T F F FF T F F TF F F T
conjunctions formed in step a) for which
T F F F F F T F F FT F F F TF F F F
the corresponding truth-value assignment
F F T F T F F F T F F T T T FTTT T
makes A true.
F F T F F F F F T FT F F T FTT F F
Example: Suppose for the example above
F F F F T F F F F FF T F T FF F F T
that the final column of the truth table for
F F F F F F F F F FT F F T F F F F F
A is as follows (just at random):
By examining the final column for this truth
A B C
A
table, we see that it has the same final column
T T T
T
as that given for A .
T T F
T
T F T
F
(7) This establishes our result. The example was
T F F
F
arbitrary; the same process would work regardF T T
T
less of the number of statement letters or final
F T F
F
column for the statement involved.
e
F F T
F
F F F
F
13
Reducing Further
B
T
F
T
F
(A | B)
F
T
T
T
(A | A ) A
((A | A ) | (B | B)) (A B)
((A | B) | (A | B)) (A B)
and just for kicks, we can add:
(A | (B | B)) (A B)
(((A | A ) | (B | B)) | (A | B)) (A B)
Hence, forms using the Sheffer stroke can be substituted in the proof above.
e
Another way is with the Sheffer/Peirce dagger,
written (neither . . . nor . . . ), which has the
truth table:
Proof:
Note that
(A B) (A B) and
(A B) (A B)
and so the former forms can be used in place of the
latter forms in the proof above.
e
14
A
T
T
F
F
B
T
F
T
F
(A B)
F
F
F
T
Proof:
It suffices to note that:
(A A ) A
((A A ) (B B)) (A B)
((A B) (A B)) (A B)
Austere Syntax
Proof:
(1) Suppose there were some other binary connective # that was adequate by itself.
(2) We know immediately that (A # B) must
be false when A and B are both true. If not,
then it would impossible to form something
equivalent to a contradiction, since the top
row of the truth table (the truth-value assignment making all statement letters true) would
always make a wff true.
(3) For similar reasons, (A #B) must be true
when A and B are both false, or else it would
be impossible to form something equivalent to
a tautology.
Definition: A(n official) well-formed formula
(4) Lines (2) and (3) give us this much of the table
(wff) is defined recursively as follows:
for #:
(i) Any statement letter is a wff;
(ii) if A is a wff then so is A ;
A B A #B
(iii) if A and B are wffs then so is (A B);
T T
F
(iv) nothing that cannot be constructed by repeated
T F
?
applications of the above is a wff.
F T
?
F
F
T
We can continue to use the signs , and ,
The question is how to fill in the remaining ?s. but treat them as mere abbreviations. They are
15
definitional shorthands, just like the conventions DN: From A infer A . From A infer A .
we adopted regarding parentheses:
I: From A infer A B. From A infer B A .
I: From A and B infer A B.
Abbreviations:
I: From A B and B A infer A B.
6I: From A andA infer 6.
(A B) abbreviates (A B)
6O: From 6 infer A .
(A B) abbreviates (A B)
O: From (A B) infer A B.
(A B) abbreviates ((A B) (B A ))O: From (A B) infer A . From (A
B) infer B.
Whenever one of these signs appears, what is really O: From (A B) infer A B.
meant is the wff obtained by replacing definiens O: From (A B) infer A B.
with the definiendum. o, e.g.,
Additional proof techniques
CD: Start a subderivation assuming A . If you de(P Q) (R S)
rive B, you may end the subderivation and infer
is just a shorthand abbreviation for
A B.
ID: Start a subderivation assuming A . If you de((P Q) (R S))
rive 6, you may end the subderivation and infer
A . OR Start a subderivation assuming A . If
Similarly, (P P ) means (P P ), and you derive 6, you may end the subderivation and
(P P ) means (P P ).
infer A .
Here there are 21 inference rules and 3 additional proof techniques.
D.
Our next topic is proofs or deductions in the object language. You learned a deduction system for
propositional logic in your first logic course. Most
likely, it was what is called a natural deduction
system, and contained 15 or more rules of inference. There are many competing natural deduction systems out there. The following are derived
loosely on the systems of Kalish and Montague,
Gentzen and Fitch, respectively.
Examples:
(1) Hardegrees System
Inference rules
O: From A B and A infer B. From
A B and B infer A .
O: From A B and A infer B. From A B
and B infer A .
O: From A B infer A . From A B infer B.
O: From A B infer A B. From A B
infer B A .
Inference rules
MP: From A B and A infer B.
MT: From A B and B infer A .
DS: From A B and A infer B .
HS: From A B and B C infer A C .
Simp: From A B infer A .
Conj: From A and B infer A B.
Add: From A infer A B.
CD: From A B and (A D) (B C )
infer D C
Abs: From A B infer A (A B)
Replacement rules
DN: Replace A with A or vice versa.
Com: Replace A B with B A or vice versa.
Replace A B with B A or vice versa.
Assoc: Replace A (B C ) with (A B)
C or vice versa. Replace A (B C ) with
(A B) C or vice versa.
Dist: Replace A (B C ) with (A B)
(A C ) or vice versa. Replace A (B
C ) with (A B) (A C ) or vice versa.
16
P Q
Q
Q P
P
E.
Axiomatic System L
18
6. ` A A
(Self-Imp)
7. A ` A
5, 6 MP
Written this way, every single line becomes a metatheoretic result. Moreover, it shows which lines in
a proof are justified by which premises, and which
lines were justified without using any premises.
(When a premise is introduced, it is its own justification.) Here we see that in the first proof every line was a theorem, but in the second proof,
some lines were theorems, but others required the
assumption at line 1. The disadvantage of this notation is that it is more to write, which gets tedious
especially when more than one premise is involved.
You can do much the same thing by abbreviating
using line numbers, e.g., by writing line 3 instead
as:
3. [1] ` A A
1, 2 MP
with the [1] representing line 1, and so on.
F.
1. ` A ((A A ) A )
A1
2. ` A (A A )
A1
3. ` (A ((A A ) A ))
((A (A A )) (A A )) A2
4. ` (A (A A )) (A A ) 1, 3 MP
5. ` A A
2, 4 MP
And for the second, I prefer to write:
A ` A
Premise
` A (A A )
A1
A ` A A
1, 2 MP
` (A A ) ((A A ) A )
Proof:
A3 (1) Assume that {C } ` A . This means that
5. A ` (A A ) A
3, 4 MP
there is a proof, i.e., an ordered sequence of
1.
2.
3.
4.
19
20
(Self-Imp)
15. ` (A A ) (A (A A ))
A1
16. ` A (A A )
14, 15 MP
Line 7 is another MP step:
17. ` (A ((A A ) A ))
((A (A A )) (A A ))
A2
18. ` (A (A A )) (A A )
13, 17 MP
19. ` A A
16, 18 MP
Weve transformed our original 7 step proof into a
19 step proof for the result we were after. Notice
that in the new proof, every single step is a theorem; the hypothesis is removed entirely. The final
line shows that all wffs of the form A A
are theorems.
This procedure can be lengthy, but it sure is
effective! The proofs that result from the transformation procedure are not usually the most elegant ones possible Notice, e.g., that lines 2 and
7 are identical, so we could have skipped lines 3
7! However, we followed the recipe provided on
the previous page blindly, since we know that that
procedure will work in every case.
Since we know we can always transform the
one kind of proof into the other, from here on out
(well, except in tonights homework), whenever
you have a result of the form A ` B, just go
ahead and conclude ` A B, annotating with
DT. (In effect, this allows you to do conditional
proofs in our System L.)
8. ` (A A ) ((A A ) A )
A3
9. ` ((A A ) ((A A ) A ))
(A ((A A )
((A A ) A )))
A1
10. ` A ((A A )
((A A ) A ))
8, 9 MP Important Derived Rules for System L
Line 5 is gotten at by MP, so case c) again:
The following are either proven in your book, as11. ` (A ((A A ) ((A
signed for homework, or not worth our time to
A ) A ))) ((A (A
A )) (A ((A A ) A ))) bother proving now.
Remember that (A B) is defined as (A
A2
B) and (A B) is defined as (A B), etc.
12. ` (A (A A ))
(A ((A A ) A )) 10,11 MP
Derived rules:
My name/abbreviation:
13. ` A ((A A ) A ) 7, 12 MP
A B, B C ` A C
Syllogism (Syll)
Line 6 appeals to a theorem schema. Strictly speak- A (B C ) ` B (A C )
ing we should write out the intermediate steps, but
Interchange (Int)
to save time we can treat it like an axiom, and use A B ` B A
Transposition (Trans)
the method for case b):
A B ` B A
Transposition (Trans)
14. ` A A
(Self-Imp) A B, B ` A
21
A ` A
Double Negation (DN)
A ` A
Double Negation (DN)
A ` A B
False Antecedent (FA)
A `BA
True Consequent (TC)
A , B ` (A B) True Ant/False C.(TAFC)
(A B) ` A
True Antecedent (TA)
(A B) ` B
False Antecedent (FC)
A B, A B ` B
Inevitability (Inev)
A A `A
Redundancy (Red)
A `A A
Redundancy (Red)
A `A B
Addition (Add)
A `BA
Addition (Add)
A B`BA
Commutativity (Com)
A B`BA
Commutativity (Com)
A B`BA
Commutativity (Com)
(A B) C ` A (B C )
Associativity (Assoc)
A (B C ) ` (A B) C
(Assoc)
(A B) C ` A (B C )
(Assoc)
A (B C ) ` (A B) C
(Assoc)
A B`A
Simplification (Simp)
A B`B
Simplification (Simp)
A ,B ` A B
Conjunction Intro (Conj)
A B, B A ` A B
Biconditional Intro (BI)
A ,B ` A B
Biconditional Intro (BI)
A , B ` A B
Biconditional Intro (BI)
A B`A B
Biconditional Elim (BE)
A B`BA
Biconditional Elim (BE)
A B, A ` B
Bic. Modus Ponens (BMP)
A B, B ` A
Bic. Modus Ponens (BMP)
A B, A ` B Bic. Modus Tollens (BMT)
A B, B ` A
Bic. Modus Tollens (BMT)
Proof:
(1) Assume ` A . This means that there is a sequence of wffs B1 , B2 , . . . , Bn constituting
proof of A in which every step is either an
axiom or derived from previous steps by MP.
(2) We shall show by proof induction that every
step of such a proof is a tautology. We assume
as inductive hypothesis that all the steps prior
to a given step Bi are tautologies. We now
need to show that Bi is a tautology.
(3) Bi is either an axiom or derived from previous steps by MP. If it is an axiom, then it is a
tautology. (A simple truth table for the three
axiom schemata shows that all instances are
tautologies.) By an earlier result (see p. 3), anything derived from MP from tautologies is also
a tautology. Hence, Bi is a tautology.
(4) By proof induction, all steps of the proof are
tautologies, including the last step, which is A .
Hence A .
e
G.
Corollary (Consistency): System L is consistent, i.e., there is no wff A such that both ` A
and ` A .
Proof:
By the definition of a tautology, every truth-value negation is a tautology, and hence neither it nor
assignment makes both A true and A true.
its negation is a theorem. The other sense of comHowever, no truth-value assignment can make pleteness is the converse of soundness, i.e., that
both A and A true, and so our assumption is everything that should be provable in the system
impossible.
e given the semantics of the signs it employs is in
fact provable. This notion of completeness was
Here we see that consistency is a corollary of first used by Kurt Gdel, and is sometimes called
semantic completeness. System L is complete in
soundness. Heres another.
this sense. Before we prove this, we first need to
prove something else.
Corollary: If {B1 , B2 , . . . , Bn } ` A then
{B1 , B2 , . . . , Bn } A .
Composition Lemma
Proof:
The reason is that if
{B1 , B2 , . . . , Bn } ` A
then by multiple applications of the deduction theorem,
` (B1 (B2 . . . (Bn A ))).
Then, by soundness, we can conclude:
(B1 (B2 . . . (Bn A )))
Then by simple reflections on the rules governing
truth tables, it is obvious that:
{B1 , B2 , . . . , Bn } A
In other words, only logically valid arguments have
proofs in L.
e
H.
Completeness
Proof:
Example: For illustration purposes only, well
We show this by wff induction.
assume A contains only three statement letBase step: Let A be a statement letter. Then the
ters P , Q and R.
only statement letter making up A is A itself. If
f assigns T to A , then A and hence ` A . (2) As a tautology, every truth-value assignment
to those statement letters makes A true.
Similarly, if f assigns F to A , then A , and
(3)
By the Composition Lemma, it follows that for
hence ` A .
every for set that contains either Pi or Pi
Induction step: Because all complex wffs are built
but not both for each i such that 1 i n, we
up using the signs and , we need to show two
have ` A .
things, (a) if A takes the form B then the above
holds for A assuming it holds of B, and (b) if A
Example: Consider the truth table for A ; it is a
takes the form B C , then the above holds of A
tautology, true on every row. Each row gives us
assuming it holds of B and C .
a different result from the Composition Lemma,
but always a different way of proving A .
First lets show part (a).
Suppose A takes the form B. If f makes A true,
P Q R A
Result of lemma
then it must make B false. By our assumption,
T T T T
{P, Q, R} ` A
` B, which is the same as ` A , which is
T T F T
{P, Q, R} ` A
what we want. If f makes A false, it must make
T F T T
{P, Q, R} ` A
B true. By our assumption ` B, and by (DN)
T F F T
{P, Q, R} ` A
` B, which is the same as ` A .
F T T T
{P, Q, R} ` A
Now lets show part (b).
F T F T
{P, Q, R} ` A
Suppose A takes the form B C . If f makes
F F T T
{P, Q, R} ` A
A true, it must make either B false or C true.
F F F T {P, Q, R} ` A
If f makes B false, then by our assumption `
B, and by the derived rule (FA), it follows that (4) By the Deduction Theorem, we can conclude
that if is a set containing either Pi or Pi
` B C , i.e., ` A . If f makes C true, by
for each i such that 1 i n 1, we have
our assumption ` C and so by the derived rule
both ` Pn A and ` Pn A .
(TC), we get ` B C , i.e., ` A . On the
By the derived rule (Inev), we can conclude
other hand, if f makes A false, it must make B
`A.
true and C false. By the assumption, ` B and
` C . Then by the derived rule (TAFC), we get
Example: What were doing here taking the
` (B C ), or in other words, ` A .
last statement letter or negation in each
This completes the induction step, and hence the
premise set and removing it by the deducComposition Lemma follows by wff induction. e
tion theorem, thereby making it an antecedent.
However, since we have both the case with the
We are now ready to tackle completeness.
affirmative antecedent and the case with the
negative antecedent, they drop off by (Inev).
{P, Q} ` R A
so {P, Q} ` A
{P, Q} ` R A
{P, Q} ` R A
so {P, Q} ` A
{P, Q} ` R A
{P, Q} ` R A
so {P, Q} ` A
{P, Q} ` R A
{P, Q} ` R A
so {P, Q} ` A
{P, Q} ` R A
Result (Completeness): System L is semantically complete, i.e., for any wff A , if A then
`A.
Proof:
(1) Assume that A , and let the statement letters
making it up be P1 , . . . , Pn .
24
A , then
Corollary: if B1 , . . . , Bn
B1 , . . . , Bn ` A . For every valid argument,
there is a deduction for it in our very minimal
System L.
In effect, such an assignment is something that assigns either 0, 1 or 2 to each statement letter. This
is rather like a truth-value assignment, which maps
statement letters to T and F, except that here there
are more possibilities.
Proof:
Indirectly a schmuth-value assignment deterSee the proof of the converse of the above, given
as a corollary to Soundness on p. 23, and run it in mines a schmuth-value for complex wffs according
the other direction.
e to the following charts:
A B A B
0
0
0
0
1
2
Corollary: For any wff A , ` A iff A . (All
A A
0
2
2
and only tautologies are theorems of L.)
0
1
1
0
2
1
1
1
1
2
2
0
1
2
0
2
0
0
Proof: Combine Soundness and Completeness.
2
1
0
2
2
0
I.
A
0
0
0
1
1
1
2
2
2
(B
0
0
2
1
0
2
0
0
0
1
2
2
0
0
0
1
0
2
Proof:
Suppose A is select, i.e., has schmuth value 0 for
every possible schmuth-value assignment. Similarly, suppose that A B is select, i.e., has 0 for
every possible schmuth-value assignment. Then
B must be select as well. We can see this by the
schmuth table rules for . If B were not select,
then it would have 1 or 2 as value for some assignment. If so, then A B and A could not both
be select, because A B has value 2 when A
has 0 and B gets 1 or 2 as value.
e
A)
0
0
2
0
0
0
2
1
2
1
0
1
2
2
0
2
0
2
B)
2 1 0
0
2 1 1
0
0
2 0 2
0
2 1 0
2 1 1
0
0
2 0 2
2 1 0
0
2 1 1
0
0
0 0 2
(( A
1 0
1 0
1 0
1 1
1 1
1 1
0 2
0 2
0 2
B) A )
2 0 0 0
2 1 0 0
0 2 0 0
2 0 0 1
2 1 0 1
0 2 2 1
0 0 2 2
2 1 0 2
2 2 0 2
Proof:
Suppose we had an axiom system in which our
only axiom schemata were (A2) and (A3) and our
only inference rule were modus ponens. If so, then
every theorem of the system would be select, since
the axioms are select and everything derived from
select wffs by MP is also select. Because some
instances of (A1) are not select, this means some
instances of (A1) would not be theorems of this system. Hence, not all instances of (A1) are derivable
from (A2), (A3) and MP alone.
e
We then define a notion of grotesqueness. A complex wff is grotesque if and only if it comes out
with value 0 using these revised rules for any possible assignment of 0, 1 or 2 to all its statement
letters.
It turns out that all instances of (A1) and (A3)
are grotesque, but some instances of (A2) are not.
Modus ponens preserves grotesqueness. So (A2) is
independent of (A1) and (A3).
For homework, youll be proving the independence of (A3) from (A1) and (A2). Relatively, thats
the easiest, since it doesnt require three values,
and can be done with assignments into {0, 1}, provided that one changes the rule governing how the
value for A is determined by the value of A .
These independence results establish that there
is no redundancy in our axiom schemata; we
couldnt simply remove one of them and be left
with a complete system.
In one sense, our system is as minimal as possible, but in another sense it isnt. We cant simply
remove any of the ones we have, but we could start
with completely different axiom schemata. Several
rival axiomatizations are possible; you can find a
list of some of them in your book pp. 4546. Axiomatizations have been found in which there is
only one axiom schema. Just like the decision to
use both and instead of |, however, there
are diminishing returns to minimalism. The proofs
in such systems for even the most mundane results often require an insane number of steps and
insanely complicated axioms.
However, in case youre curious, the first complete system for propositional logic using a single
axiom schema was discovered by Jean Nicod in
1917, and it uses the Sheffer stroke instead of
and . An axiom is any instance of the single
schema:
(A | (B | C )) | ((D | (D | D)) |
((E | B) | ((A | E ) | (A | E ))))
The only inference rule is: From A | (C | B) and
A infer B.
Are you glad I didnt make you use that system?
27
UNIT 2
METATHEORY FOR PREDICATE LOGIC
A.
I use the unitalicized letters x, y and z as object language variables, and italicized letters in
the same rangex, y, z as metalinguistic
schematic letters for any object-language variables.
Thus, e.g.,
(x)(F x Gx)
Definition: A predicate letter is one of the uppercase letters from A to T , written with a numerical
Schematically represents all of (x)(F x Gx)
superscript 1, and with or without a numerical
and (y)(F y Gy) and (x3 )(F x3 Gx3 ),
subscript.
and so on. The difference is subtle, and usually not
so important to keep straight. After all, object lan- Examples: A1 , R2 , H 4 , F 1 , G3 , etc.
2
4
guage variables tend to be interchangeable; these
do not mean anything different. This is why Im us- Even when italicized, take these to be object laning notation that does not emphasize the difference. guage constants; script letters such as P are used
Still, we do need a technical means for differentiat- in their place schematically if need be.
ing between the two when it is necessary.
The superscript indicates how many terms the
Officially, Mendelson only uses xn , and not predicate letter takes to form a statement. A prediyn or zn , although he doesnt stick to this. His cate letter with a superscript 1 is called a monadic
variables are always italicized; the only differ- predicate letter. A predicate letter with a superence beween object language and metalanguage script 2 is called a binary or dyadic predicate
is whether a particular numerical subscript occurs, letter.
28
It is customary to leave these superscripts off everyone else. Here, I follow Mendelson, though
when it is obvious from context what they must be. Ill put hard brackets to another use in a minute.
E.g., R2 (a, b) can be written simply R(a, b).
However, I adopt the convention that if the
m
Officially Mendelson only uses An .
terms in an atomic formula contain no function letters, the parentheses and commas may be removed.
Definition: A function letter is one of the lowercase letters from f to l, written with a numerical
superscript 1, and with or without a numerical Examples: F x is shorthand for F 1 (x), and
subscript.
Rab is shorthand for R2 (a, b).
Examples: f 1 , g 2 , h33 , etc.
The numerical superscript indicates how many Definition: A well-formed formula (wff) is reargument places the function letter has. A func- cursively defined as follows:
(i) any atomic formula is a wff;
tion letter with a superscript 1 is called a monadic
(ii) if A is a wff, then A is a wff;
function letter; a function letter with a superscript
(iii)
if A and B are wffs, then (A B) is a wff;
2 is called a binary/dyadic function letter, etc.
Here too, it is customary to leave these super- (iv) If A is a wff and x is an individual variable,
then ((x) A ) is a wff;
scripts off when it is obvious from context what
(v) nothing that cannot be constructed by repeated
they must be. E.g., f 1 (x) can be written simply
applications of the above is a wff.
f (x).
Definition: A term of the language is defined recursively as follows:
(i) all individual variables are terms;
(ii) all individual constants are terms;
(iii) if F is a function letter with superscript n,
and t1 , . . . , tn are terms, then F (t1 , . . . , tn )
is a term;
(iv) nothing that cannot be constructed by repeated
applications of the above is a term.
Examples: a, x, f (a), g(x, f (y)), etc.
(((x) F x) Ga)
(x) F x Ga
((x)(F x Ga))
1
29
Abbreviations:
((x) A ) abbreviates ((x) A )
(A B) abbreviates (A B)
(A B) abbreviates (A B)
(A B) abbreviates
((A B) (B A ))
B.
Examples:
1. If A [x] is F x, then A [y] is F y.
2. If A [x] is (y) R(y, x) then A [f (b)] is
(y) R(y, f (b)).
3. If A [z] is F z Gz, then A [d] is F d Gd.
4. If A [x] is F x (x) Gx, then A [d] is Definition: An interpretation M consists of the
F d (x) Gx.
following four things:
5. If A [x] is F a then A [y] is F a.
1. The specification of some non-empty set D to
serve as the domain of quantification for the
Mendelson writes A (x) and A (t) instead of A [x]
language.
and A [t]. I think my notation makes it clearer
This set is the sum total of entities the quantithat these signs are parts of the metalanguage, and
fiers are interpreted to range over. The domain
that the parentheses that appear here are not the
might include numbers only, or people only, or
parentheses used in atomic formula or in function
anything else you might imagine. The domain
terms.
of quantification is sometimes also known as
Similarly, I write A [x, y] for an arbitrary wff
the universe of discourse.
that may or may not contain x and y free, and in 2. An assignment, for each individual constant in
the same context I use A [t, s] for the result of subthe language, some fixed member of D for which
stituting t for all free occurrences of x, and s for
it is taken to stand.
all free occurrences of y, in A [x, y].
For a given constant c, this member is denoted
in the metalanguage by (c)M .
Examples:
3. An assignment, for each predicate letter with su1. If A [x, y] is Rxy, then A [a, b] is Rab.
perscript n in the language, some subset of Dn .
2. If A [x, y] is (z)(Rzx Ryz), then A [a, b]
That is, the interpretation assigns to each prediis (z)(Rza Rbz).
cate letter a set of n-tuples from D.
31
For a given predicate letter P n , this set is denoted in the metalanguage by (P n )M . This
set can be thought of as the extension of the
predicate letter under the interpretation.
4. An assignment, for each function letter with superscript n in the language, some n-place operation on D.
In other words, each function letter is assigned
a set of ordered pairs, the first member of which
is itself some n-tuple of members of D, and the
second member of which is some member of
D. This set of ordered pairs is a function, so
for each n-tuple in its domain, there is a unique
element of D in its range. So if D is the set
of natural numbers, a two-place function letter F 2 might be assigned the addition operation, i.e., the set of all ordered pairs of the form
hhn, mi, pi such that n + m = p. This operation
can be thought of as the mapping, or functionin-extension, represented by the function letter
under the interpretation.
In a sense, the four parts of a model fix the meanings of the quantifiers, constants, predicate letters
and function letters, respectively. (Or at the very
least, they fix as much of their meanings as is relevant in an extensional logical system such as firstorder predicate logic.) This leaves only something
to be said about variables.
Sequences
Each model is associated with a certain domain or
universe of discourse. Variables are allowed to take
different values within that domain. A variable
is given a value by what is called a (denumerable)
sequence.
Definition: A denumerable sequence or variable assignment for domain D is a function whose
domain is the set of positive natural numbers, and
whose range is a subset of D.
What does this have to do with assigning values to
the variables?
We first note that while there are an infinite
number of variables (since we can always use different subscripts), we can arrange them in a fixed
32
Therefore, each sequence correlates every variable of the language with a member of the domain
of that interpretation. For a given variable x and
sequence s, in the metalanguage, we use s(x) to
denote the member of D which s correlates with
x. Hence, s3 (y) is Paul. Given the assignment
made for the constants and function letters in M,
derivatively, each sequence correlates every term
of the language with a member of D. If c is an
individual constant, then let s(c) be (c)M . Then for
function terms, let s(F (t1 , . . . , tn )) be the entity
in D such that hhs(t1 ), . . . , s(tn )i, i (F )M .
A sequence acts just as an assignment of values
to the variables. Within a given interpretation M,
an open wff might be satisfied by some and not
others.
Satisfaction
Definition: The notion of satisfaction is defined
recursively. For a given interpretation M with domain
D:
(i) If A is an atomic wff P(t1 , . . . , tn ), then sequence s satisfies A iff the ordered n-tuple
formed by those entities in the domain D that
s correlates with t1 , . . . , tn is in the extension
of P for interpretation M, or more precisely,
hs(t1 ), . . . , s(tn )i (P)M .
(ii) Sequence s satisfies a wff of the form A iff s
does not satisfy A .
(iii) Sequence s satisfies a wff of the form (A
B) iff either s does not satisfy A or s does
satisfy B.
(iv) Sequence s satisfies a wff of the form (x) A
iff every sequence s that differs from s at most
with regard to what entity of D it correlates
with the variable x satisfies A .
Roughly speaking, each sequence assigns a member of D to each free variable, and from there, one
can determine whether or not the wff is satisfied
by that variable assignment as one would expect.
The notion of satisfaction is important because
it is used to define the notion of truth. (This is the
heart of Tarskis formal semantics.)
33
(leaving off any subscript) means that A is logically valid. Because interpretations are analogous
for truth-value assignments in propositional logic, Abbreviation: Similarly
this definition is analogous to the definition of a
A
tautology given in our last unit; this is why the
notation is appropriate.
means that A is a logical consequence of .
What interpretations are possible? Do we know
how many? (In a footnote, Mendelson, rather
dubiously, equates interpretations with possible Definition: A wff A is said to be logically equivworlds. This is misleading in many ways, but it alent to a wff B iff in every interpretation M, A
can sometimes be helpful to think of it in this way.) and B are satisfied by the same sequences.
It is impossible that both A and A .
A B
means that A and B are logically equivalent.
C.
Countermodels and
Semantic Trees
break down how the satisfaction of a given wff de- Afterwards, reapply rule for any T (y) B (or
pends on its parts, to see if the proposal is possible. F (y) B) lines that were applied earlier on the
When a certain possibility might be true in more
current branch.
than one way, the tree branches to explore both
possibilities.
Atomic Formulas
T P(t1 , . . . , tn )
Semantic Tree Rules
Check to see whether F P(t1 , . . . , tn ) appears
previously on branch. If so, close branch with 6.
Here are the rules for the primitive connectives.
If not, do nothing.
Negations
T (A )
..
.
F P(t1 , . . . , tn )
Check to see whether T P(t1 , . . . , tn ) appears
previously on branch. If so, close branch with 6.
If not, do nothing.
FA
F A
..
.
TA
Conditionals
T (A B)
..
.
FA
Disjunctions
T (A B)
..
.
TB
F (A B)
..
.
TA
TA
FB
TB
F (A B)
..
.
FA
FB
Universal Quantifier
T (x) A [x]
..
.
Conjunctions
T (A B)
..
.
T A [t1 ]
..
.
T A [tn ]
(for all closed terms ti occurring on this branch of
tree)
F (x) A [x]
..
.
TA
TB
F (A B)
..
.
FA
F A [c]
(where c is some new constant unused in tree)
36
FB
Biconditionals
T (A B)
..
.
TA
TB
FA
FB
F (A B)
..
.
TA
FB
FA
TB
Existential Quantifiers
T (x) A [x]
..
.
T A [c]
(where c is some new constant unused in tree)
Afterwards, reapply rule for any T (y) B (or
F (y) B) lines on the current branch.
F (x) A [x]
..
.
T (x)(F x Gx)
T (x)(Gx Hx)
F (x)(F x Hx)
F (F a Ha)
T Fa
F Ha
T (F a Ga)
F Fa
6
T Ga
T (Ga Ha)
F Ga
6
F Fa
6
T Ha
6
F Ga
F F b Gb
F Fb
F Gb
6
D.
An Axiom System
Abbreviation:
` A and simply ` A
Proof:
are defined as you might expect. In this unit, unless Follows directly from (A4) by MP.
otherwise specified, ` means `PF .
1
PF stands for Full Predicate calculus, i.e., the calculus within a syntax including all possible constants, predicate letters
and function letters. The Pure Predicate calculus, PP, is the same, but excluding all constants or function letters from the
syntax. Mendelson gives these abbreviations in Chapter 3. There are predicate calculi that are neither pure nor full.
39
Proof:
The following schema shows the object-language
Proof:
steps necessary.
1. A [t, t] ` A [t, t]
(Premise) (1) Assume the complex antecedent of DT. We will
show, using proof induction, that for every step
2. A [t, t] ` A [t, t]
1 SL (DN)
Bi in the proof B1 , . . . , Bn of A from {C },
3. ` (x) A [t, x] A [t, t]
A4
that it holds that ` C Bi . We are enti4. A [t, t] ` (x) A [t, x]
2, 3 SL (MT)
tled to assume that we have already gotten
5. A [t, t] ` (x) A [t, x]
4 definition of
` C Bj for all steps Bj prior to Bi .
e (2) Because B is a step in the proof of A from
i
{C }, the cases we have to consider are that:
(a) Bi is a member of , (b) Bi is C , (c) Bi is
an axiom, (d) Bi follows from previous steps
Result (Sub or Repl): A [x] ` A [t], where t is
in the proof by MP, and (e) Bi follows from a
free for x in A [x]. (The rule of substitution or
previous step by an application of Gen obeying
replacement of free variables.)
the restriction mentioned above. We consider
each case.
Case (a). Bi is a member of . Hence ` Bi ,
and by SL, ` C Bi .
Proof:
Case (b). Bi is C . Then C Bi is simSchematically:
ply C C , a simple tautology, whence
1. A [x] ` A [x]
(Premise)
` C Bi .
2. A [x] ` (x) A [x]
1 Gen
Case (c). Bi is an axiom. Hence ` Bi and by
3. A [x] ` A [t]
2 UI
SL, ` C Bi . A fortiori, ` C Bi .
e
Case (d). Bi follows from previous members
of the series by MP. Therefore there are preE. The Deduction Theorem in
vious members of the series Bj and Bk such
that Bj takes the form Bk Bi . By the
Predicate Logic
inductive hypothesis, we already have both
` C Bk and ` C (Bk Bi ). By
The deduction theorem does not hold generally
SL, ` C Bi .
in the first-order predicate calculus PF, nor would
we want it to. After all, in the semantics of prediCase (e). Bi follows from a previous member of
cate logic, it is not the case that F x (x) F x,
the series by an application of Gen obeying the
and similarly in the system of deduction, while we
restriction mentioned above. Therefore, there
have F x ` (x) F x by Gen we should not have
is a previous step Bj such that Bi takes the
` F x (x) F x. We therefore state and prove
form (x) Bj for some variable x. Because of
the deduction theorem in the following restricted
the restriction, either obtaining Bj did not deform:
pend on having C in the premise set, or C does
40
F.
The natural deduction rule of Existential Instantiation or Existential Elimination (EI, O) recommends that from a given existentially quantified statement, one should infer the corresponding
statement with the quantifier removed, and some
new or unused constant in place of the variable.
(Mendelson calls this Rule C for choice.) HowObviously, in the proof establishing that F x ` ever, note that for most wffs A [x] it is not the case
(x) F x, Gen is applied to a step that both depends that
(x) A [x] A [c]
on F x and makes use of a variable occurring free
in F x. (Note that invoking the Sub or Repl for any constant c. Thus, e.g., we ought not have
derived rule requires the same.) So we cannot con- (x) F x ` F (c) for any constant c, even an unused
one. Within an interpretation M, every constant c
clude ` F x (x) F x.
However, such is not the case with the proof: is assigned a fixed entity of the domain, viz., (c)M .
There simply is no inferring that (c)M is in the
extension of the predicate letter F , viz., (F )M ,
1. (x) F x ` (x) F x
Premise simply on the assumption that something is. This
2. ` (x) F x F y
(A4) so-called rule of natural deduction is logically
3. (x) F x ` F y
1, 2 MP invalid, and should be done away with. Luckily,
4. (x) F x ` (y) F y
3 Gen we dont need it. Bearing in mind Exercise 2.32d
from your homework, we have:
This we transform as follows:
(New-DR) If A does not contain x free, then
(x)(B A ) ` (x) B A .
` (x) F x (x) F x
` (x) F x F y
` (x) F x ((x) F x F y)
` (x) F x F y
` (y)((x) F x F y)
` (y)((x) F x F y)
((x) F x (y) F y)
7. ` (x) F x (y) F y
1.
2.
3.
4.
5.
6.
1.
2.
3.
4.
(x) F x ` (x) F x
(x) F x ` F a
(x) F x ` F a Ga
(x) F x ` (x)(F x Gx)
Premise
1 EI/O
2 SL
4 EG
Premise
Premise
2 SL
3 EG
4 DT
New-DR. Then, along with the existential state {C1 [c1 ], . . . , Cm1 [cm1 ]} ` Cm [cm ] B
ment to which you applied EI, and MP, you get the
(5) Pick some variable y that does not occur free
result. Let us state this result more formally.
anywhere in the series A1 , . . . , An (preferDefinition: Pseudo-derivability or ` : ` B
ably xm ). Replace cm with y everywhere in
iff there is an ordered series of wffs A1 , . . . , An where
the proof for {C1 [c1 ], . . . , Cm1 [cm1 ]} `
An is B, and for each step Ai where 1 i n, eiCm [cm ] B. The result will also be a proof.
ther:
Notice that cm does not occur anywhere in the
(a) Ai is an axiom;
set {C1 [c1 ], . . . , Cm1 [cm1 ]}, since it was
(b) Ai is a member of ;
new when we introduced it. So there is no
(c) there is some previous step in series, Aj , such
reason it must be used rather than the variable.
that Aj takes the form (x) C [x], and Ai takes (6) Hence, {C1 [c1 ], . . . , Cm1 [cm1 ]} `
the form C [c], where c is a constant that does
Cm [y] B.
not occur in any previous step of the pseudo- (7) By Gen, {C1 [c1 ], . . . , Cm1 [cm1 ]} `
proof, nor in B, nor in any premise in (i.e.,
(y)(Cm [y] B).
Ai was derived by the pseudo-rule, EI);
(8) Because y does not occur free in the proof, and
(d) Ai follows from previous steps by MP;
B is An , B does not contain y free. Hence,
(e) Ai follows from a previous step by Gen, but not
by (New-DR), {C1 [c1 ], . . . , Cm1 [cm1 ]} `
using a variable that is free in some previous
(y) Cm [y] B.
step of the series C [c] arrived at by EI.
(9) Because Cm [cm ] was arrived at in the
original pseudo-derivation by EI on
some wff of the form (xm ) Cm [xm ],
which either is (y) Cm [y], or can be
used to get it, it must be that
Result: If ` B then ` B.
{C1 [c1 ], . . . , Cm1 [cm1 ]} ` (y) Cm [y]. Thus,
(The non-necessity of EI.)
by MP, {C1 [c1 ], . . . , Cm1 [cm1 ]} ` B.
(10) By the same procedure described in steps (5)
(9), we can eliminate Cm1 [cm1 ] from the
premise set, and so on, until we have elimProof:
inated everything except the members of .
Hence, ` B.
e
(1) Assume ` B, and let A1 , . . . , An be the This proof shows us that we dont need Existensteps of the pseudo-proof.
tial Instantiation, very much like we dont need
42
G.
Proof:
Suppose for reductio ad absurdum that there is a
wff A such that ` A and ` A . By soundness,
A and A . In other words, every sequence
in every model satisfies both A and A . But a
sequence satisfies A iff it does not satisfy A , so
any arbitrary sequence will both satisfy and not
satisfy A , which is absurd.
e
Ultimately, we also want to prove the completeness
of PF. Well get there, but we first need to prove a
number of lemmas.
Proof:
Every instance of the axiom schemata is logically
valid. (This can be verified using semantic trees.)
MP and Gen preserve logical validity. (In the case
of Gen, note that a wff is logically valid iff it is
satisfied by all sequences in all models. If an open
wff is satisfied by all sequences in a model, then the
corresponding wff with one of the variables bound
with an initial quantifier will also be satisfied by
all sequences in that model.) If ` A , then A is
derivable from the axioms by some finite number
of steps of MP and Gen, each preserving validity,
and hence, A .
e
Proof:
All wffs are built up of the simple signs: (, , ,
), , , , as well as the individual constants,
variables, predicate letters and function letters.
A. We define a function g that associates each simple sign with a different natural number.
(1) Let g(() = 3, g()) = 5, g(, ) = 7, g() =
9, g() = 11, and g() = 13.
(2) If c is a constant, and n is the number of its
subscript (if c has no subscript, then n = 0),
then depending on which letter of the alphabet
is used, let k be either 1, 2, 3, 4 or 5 (1 for a, 2
for b, etc.), and let g(c) = 7 + 8(5n + k).
(3) If x is a variable, and n is the number of its
subscript (if x has no subscript, then n = 0),
then depending on which letter of the alphabet
is used, let k be either 1, 2, or 3 (1 for x, 2 for
y and 3 for z), and let g(x) = 13 + 8(3n + k).
(4) If F is a function letter, and n is the number of
its subscript (if F has no subscript, then n = 0)
and m is the number of its superscript, then
depending of which letter of the alphabet is
used (f through l), let k be one of 1 through
7, and let g(F ) = 1 + 8(2m 37n+k ).
43
Proof:
As above, with the set of closed wffs (and their
Gdel numbers) substituted for W (and C).
Were inching closer to completeness. Before moving on, I want to make note of some differences between my proof of completeness and Mendelsons.
Mendelson prefers to speak of different first-order
theories. Remember than a first-order theory is
an axiomatic system gotten by adding additional
axioms to the axioms of PF. Really, talking about
what theorems are provable in a given system K,
where the additional axioms of K are the members
of a set is equivalent to speaking about what
is provable in the barebones system PF beginning
with as a set of premises, since clearly:
`K A iff `PF A
Its really a matter of taste whether we view the
proof as being about different theories or as being about different premise sets. I prefer to speak
about premise sets, since that we dont have to deal
with any sense of ` other than `PF . But the
differences are trivial.
Before moving on, let us introduce some new
metalinguistic definitions.
Definition: A set of wffs is said to be consistent iff there is no wff B such that both ` B and
` B . (Otherwise, is inconsistent.)
Definition: A set of wffs is said to be maximal
iff for every closed wff B, either B or B .
Definition: A set of wffs is said to be universal
iff for every wff B[x] that contains at most x free,
if it is the case for all closed terms t that B[t] ,
then (x) B[x] .
We now move on to our next important Lemma on
the way to completeness.
44
Proof:
(1) Assume that is a consistent set of closed wffs.
(2) For convenience, we assume that none of the
constants e, e1 , e2 , e3 , . . . , etc., occur anywhere in the wffs in .2
(3) By the denumerability of the set of closed wffs
of the language, we can arrange them in an
infinite sequence:
A1 , A2 , A3 , . . . , etc.
Making use of this sequence, let us recursively
define an infinite sequence of sets of wffs:
0 , 1 , 2 , . . . , etc.
As follows:
a) Let 0 = .
b) We define n+1 in terms of n in one of the
following three ways:
(i) if n {An+1 } is consistent, then let
n+1 = n {An+1 };
(ii) if n {An+1 } is inconsistent
and An+1 does not take the form
(x) B[x], then let n+1 = n
{An+1 };
(iii) if n {An+1 } is inconsistent
and An+1 does take the form
(x) B[x], then let n+1 = n
{An+1 } {B[ex ]}, where ex is
the first member of the sequence
e, e1 , e2 , e3 , . . . , that does not
occur in n .
(4) Let be the union of all of the members of the
-sequence (i.e., 0 1 2 . . . etc.)
(5) Obviously, . This establishes part (a) of
the consequent of the Lemma.
(6) Every member of the -sequence is consistent.
We prove this by mathematical induction.
Base step: 0 is , and it is consistent ex hypothesi.
Induction step: Suppose n is consistent. It
follows that n+1 is consistent by a proof by
cases:
2
If this assumption is not warranted, we could use another denumerable sequence of constants, e.g., the bs or the cs, or
even add a new sequence of constants o, o1 , o2 , o3 , . . . , to the language if need be.
45
By MP, n {B[ex ]} ` C
c) So for all closed wffs A1 , A2 , . . . , etc., either it or its negation is included in .
C .
d) This establishes part (c) of the consequent
Because An+1 is closed and it
takes the form (x) B[x], the
of the Lemma.
wff B[ex ] is also closed.
(9) Finally, is also universal.
By DT, n ` B[ex ] (C
a) We show this by reductio. Suppose othC ).
erwise, i.e., suppose that there is a wff
` (C C ), and so by SL,
B[x] that contains at most x free, such
n ` B[ex ].
that for all closed terms t, B[t] , but
ex is not included in n .
(x) B[x]
/ .
Hence, we can replace ex
b) (x) B[x] is closed, so because is maxiwith the variable x throughmal, it must be that (x) B[x] .
out the proof for n ` B[ex ]
c) Because (x) B[x] is closed, it also foland the result will also be a
lows that (x) B[x] is a member of the A proof. Hence n ` B[x].
sequence, i.e., (x) B[x] is An+1 for some
By Gen, n ` (x) B[x],
number n.
which is the same as n `
/ ,
d) Obviously, however, since (x) B[x]
An+1 .
it follows that n+1 is not obtained from
So n is inconsistent, which
n using case (i).
contradicts the inductive hye) Nor was it obtained using case (ii), since
pothesis.
An+1 is of the form (x) B[x].
Hence n+1 is consistent.
f) This leaves case (iii), so n+1 is n
{An+1 } {B[ex ]}.
(7) It follows from (6) that is consistent.
g) Hence for some x, B[ex ] n+1 and so
a) Note that the -sequence is constantly exB[ex ] .
panding: For all j and k such that j < k,
h) But by our assumption, it holds for all
j k . Crudely, can be thought of as
closed terms t that B[t] .
the upper limit of the expansion.
i) All constants, ex included, are closed
b) So every finite subset of is a subset of
terms, so B[ex ] . j) Hence, both
some i for some suitably large i.
` B[ex ] and ` B[ex ].
c) However, every proof from has only a fij) However, this is impossible, because we
nite number of steps, and hence only makes
have already shown to be consistent.
use of a finite subset of .
k) Our supposition has been shown to be imd) If there were some B such that both `
possible, hence is universal.
B and ` B, for some suitably large i,
l)
This establishes part (d) of the consequent
it would have to be that both i ` B and
of the Lemma.
i ` B.
e) This is impossible because all the members
(10) By suitably defining , we have shown each of
of the -sequence are consistent by (6).
parts (a)-(d) of the consequent of the Lemma on
f) Hence, is consistent.
the basis of the assumption of its antecedent.
g) This establishes part (b) of the consequent
Hence, the Lemma is established.
e
of the Lemma.
(8) is obviously maximal as well.
a) All closed wffs are members of the sequence A1 , A2 , . . . , etc.
b) For each Ai , either it or its negation is a
member of i , and i .
Weve just shown that beginning with any consistent set of sentences, we can keep adding to it
ad infinitum to get a maximally consistent set of
sentences of the language.
We pause again for a new definition:
46
Proof:
(1) Assume that is a consistent, maximal, and
universal set of closed wffs. We can then describe a denumerable model M for using the
following procedure.
(2) Essentially, well let all the closed terms of the
language stand for themselves. (Another possible way of constructing a model would be to
let each closed term stand for its Gdel number. However, let us proceed using the former
method.)
(3) Let the domain of quantification D of M be the
set of closed terms of the language of first-order
predicate logic. Note that there are denumerably many closed terms, so M is a denumerable
model.
(4) For each constant c, let (c)M be c itself. So, for
example, (a)M is a, (b12 )M is b12 , etc.
(5) For each function letter F with superscript n,
let (F )M be that n-place operation on D which
includes all ordered pairs of the form
hht1 , . . . , tn i, F (t1 , . . . , tn )i
i.e., the operation that has the closed term
F (t1 , . . . , tn ) as value for ht1 , . . . , tn i as argument.
Example: The operation (f 1 )M , which M assigns to the monadic function letter f 1 , will
contain such ordered pairs as ha, f 1 (a)i,
hb12 , f 1 (b12 )i, and hf 1 (a), f 1 (f 1 (a))i,
and so on.
(6) For each predicate letter P with superscript
n, let (P)M be that subset of Dn that includes the n-tuple ht1 , . . . , tn i iff the atomic
wff P(t1 , . . . , tn ) is included in .
47
If (B C ) , then
would be inconsistent, because
` (B C ) B and
` (B C ) C .
So B C , i.e., A .
Putting these two results together,
we get that A iff M A .
Case (c): A takes the form (x) B[x], where
B[x] contains fewer connectives,
and B[x] contains at most x free.
First we prove that if A then
M A .
Suppose A , i.e.,
(x) B[x] .
Because B[x] contains at most
x free, for all closed terms t ,
B[t] is a closed wff.
Because is maximal, for all
closed terms t, either B[t]
or B[t] .
However, since is consistent,
it must be that for all closed
terms t , B[t] .
By the inductive hypothesis, for
all closed terms t, M B[t].
Because the domain of quantification for M is D, and D
consists of the set of closed
terms, and every closed term
is interpreted as standing for itself, a sequence of M will satisfy
B[x] iff it satisfies B[t] for that
closed term t that gets assigned
to x in that sequence.
Because all sequences of M satisfy B[t] for all closed terms t,
all sequences of M will satisfy
B[x], and hence all sequences
of M will satisfy (x) B[x].
Hence, M (x) B[x], i.e.,
M A .
We now prove that if M A then
A .
Suppose M A , i.e., all sequences of M satisfy (x) B[x].
Hence, all sequences of M sat-
Proof of (MLb):
(1) Assume that is a consistent set of closed wffs.
(2) By LEL, there is a set of closed wffs such
that: (a) , (b) is consistent, (c) is
maximal, and (d) is universal.
(3) By MCL, there is an interpretation M that is a
Proof:
denumerable model for .
This biconditional breaks down into:
(4) So for all closed wffs A , if A , then M A .
(MLa) If a set of closed wffs has a denumerable (5) Because , for all closed wffs A , if A
model, then is consistent.
then A .
49
Corollary: If A then ` A .
Proof:
Follows from minor modifications on the above
proof.
e
Proof:
(1) Suppose A , but suppose for reductio ad absurdum that it is not the case that ` A .
(2) Let B be the universal closure of A , i.e., if the
free variables of A are x1 , . . . , xn , then B is
(x1 ) . . . (xn ) A .
(3) Universal closure preserves truth in an interpretation, so B.
(4) B has no free variables left, so B is closed.
(5) The singleton set containing B alone, {B},
must be consistent. Heres a proof of this by
reductio:
a) Suppose there were some C such that
{B} ` C and {B} ` C .
b) By SL, {B} ` C C .
c) Since B is closed, so is B, and so by DT,
we have ` B (C C ).
d) But ` (C C ), so by SL, ` B.
Unfortunately, this proof does not, as in the Propositional Calculus (System L), provide a recipe for
constructing a proof of any given logical truth in
PF. We have simply proven that any given logical truth must be derivable, because if it were not,
there would exist a countermodel to its logical validity.
The completeness of the first-order predicate
calculus was first proven by Kurt Gdel in 1930,
and so this is sometimes called Gdels Completeness Theorem, although his way of proving it
was actually very different from ours. (It was first
proven our way by Leon Henkin in 1949.) However,
Gdel is much more famous for his incompleteness
theorems than his completeness theorem.
H.
Identity Logic
50
Syntax
Officially, the syntax is unchanged. We already had
I 2 as a predicate letter. We are simply fixing its
intended meaning.
However, it is useful to introduce abbreviations
such as the following.
Abbreviations:
Proof:
Direct from (A6) by universal instantiation.
t = u abbreviates I 2 (t, u)
t 6= u abbreviates I 2 (t, u)
(1 x) A [x] abbreviates (x) A [x]
[(x) (y)(A [x] A [y] x = y)],
where y is the first variable not occurring in A [x].
(n+1 x) A [x] abbreviates
(y)(A [y] (n x)(x 6= y A [x])),
where y is the first variable not occurring in A [x].
Result (LL/Sub=):
t = u, A [t, t] `PF= A [t, u], for all terms t and
u that are free for all variables in A [x, y], and
where A [t, u] arises from A [t, t] by replacing
some or all occurrences of t with u. (Leibnizs
law)
Proof:
Derived from (A7) by Gen on both x and y, then
universal instantiation to t and u, and MP 2. It
may be necessary to do some bound variable juggling, but this is no problem.
System of Deduction
Result (Sym=): t = u `PF= u = t, for any
Definition: The first-order predicate calculus
terms t and u. (Symmetry of identity.)
with identity, or System PF = is the system obtained from PF by adding the following axiom and
axiom schema:
Proof:
(A6) (x) x = x
t = u, t = t `PF= u = t is an instance of LL, and
(A7) x = y (A [x, x] A [x, y]),
for all instances in which y is free for x in we have `PF= t = t by reflexivity.
A [x, x], and A [x, y] is obtained from A [x, x]
by replacing some, but not necessarily all, free
occurrences of x with y.
Result (Trans=): t = u, u = v `PF= t = v for
any terms t, u and v. (Transitivity of identity.)
Definition: A first-order theory with identity
[equality] is any first-order theory that has all theorems of PF = formulable in its syntax as theorems
(i.e., it is a theory built on PF in which (A6) is either Proof:
an axiom or theorem, and all instances of (A7) are u = v, t = u ` = t = v is an instance of LL.
PF
either axioms or theorems.) This includes PF= itself.
The deduction theorem and replacement for exisSome easy theorems and derived rules:
tential instantiation are unchanged by the addition.
51
Result (Soundness):
For all wffs A , if `PF= A then = A .
Proof:
(1) Assume that K is a first-order theory in which
(A6) is an axiom or theorem, and all instances
of (A7) in which A [x, x] is an atomic formula
with no individual constants are either axioms
or theorems. We shall prove that all instances
of (A7) can be derived regardless of the complexity of A [x, x], by wff induction.
(2) Base step: A [x, x] is atomic. By hypothesis,
(A7) is a theorem of K for all cases in which
A [x, x] is an atomic formula with no individual constants. All others can be obtained by
Gen and universal instantiation.
(3) Induction step: We assume that all instances
of (A7) hold for instances of A [x, x] that are
simpler than a given instance, and need to
show that for the given instance of A [x, x]
(A7) holds as well. This proceeds by a proof by
cases of the possible make-up of the instance
of A [x, x] in question.
Case (a): A [x, x] takes the form B[x, x].
i) Let C [x] be B[z, x]. Clearly,
C [x] is the same complexity as
B[x, x].
ii) By the inductive hypothesis,
we have this instance of (A7):
`K x = y (C [x] C [y]).
iii) By manipulating variables
with Gen and UI, we get:
`K y = x (C [y] C [x]).
iv) Because we have atomic
instances, we have: `K x =
y (x = x y = x), and so
with (A6) and SL we get:
`K x = y y = x.
v) So by SL:
`K x = y (C [y] C [x]).
vi) That is: `K x = y
(B[z, y] B[z, x]).
vii) By Gen on z and UI to x we
get: `K x = y (B[x, y]
B[x, x]).
viii) By SL: `K x = y
(B[x, x] B[x, y]), i.e.,
`K x = y (A [x, x]
A [x, y]).
Case (b): A [x, x] is of form (B[x, x]
52
C [x, x]).
i) By the inductive hypothesis,
we have: `K x = y
(C [x, x] C [x, y]).
ii) By the same procedure
described in the previous case:
`K x = y (B[x, y]
B[x, x]).
iii) By MP:
x = y `K B[x, y] B[x, x].
iv) Similarly:
x = y `K C [x, x] C [x, y].
v) Clearly, if we add
B[x, x] C [x, x] as a further
premise, we could complete a
syllogism, i.e.:
x = y, B[x, x] C [x, x] `K
B[x, y] C [x, y].
vi) By DT 2, we have: `K x =
y ((B[x, x] C [x, x])
(B[x, y] C [x, y])), which
is: `K x = y (A [x, x]
A [x, y]).
Case (c): A [x, x] takes the form
(z) B[x, x, z].
i) By the inductive hypothesis:
`K x = y (B[x, x, z]
B[x, y, z]).
ii) By MP: x = y `K
B[x, x, z] B[x, y, z].
iii) Hence, by UI and MP:
x = y, (z) B[x, x, z] `K
B[x, y, z].
iv) By Gen:
x = y, (z) B[x, x, z] `K
(z) B[x, y, z].
v) By DT 2, we have: `K x =
y ((z) B[x, x, z]
(z) B[x, y, z]), which is:
`K x = y (A [x, x]
A [x, y]).
(4) Hence, regardless of the complexity of A [x, x],
we have the appropriate instance of (A7).
Therefore, all instances of (A7) are theorems of
K.
(5) K is a first-order theory (one built by expanding
PF by adding proper axioms). Hence K has all
Proof:
(1) Assume that M is a model for the set of axioms
of PF= .
(2) It does not follow from this that M is a normal model, i.e., it does not follow that (I 2 )M
only consists of ordered pairs of the form ho, oi
of objects o included in the domain of quantification D of M. However, we do know the
following things about (I 2 )M :
a) Because M makes (A6) true, (I 2 )M must
be a reflexive relation in the set-theoretic
sense.
b) Because M makes the instance of (A7),
x = y (x = x y = x), true, and because it is reflexive (so all sequences satisfy
x = x), (I 2 )M must also be a symmetric
relation in the set-theoretic sense.
c) Because M makes the instance of (A7),
x = y (x = z y = z), true, and
because it is symmetric, (I 2 )M must also
be a transitive relation in the set-theoretic
sense.
d) So, (I 2 )M must be an equivalence relation.
e) Let us call this equivalence relation E. For
any object o in the domain D of M , [o]E is
the E-equivalence class on o; i.e., the set of
p such that ho, pi E.
53
(P)M .
By our description of M*
E, (I 2 )M is the set of ordered pairs that contains h[o]E , [p]E i iff ho, pi E. Because E is an
equivalence relation, ho, pi E iff [o]E = [p]E .
(5) We now prove that M A iff M A for all
wffs A . Note that this will be the case when
A is satisfied by all sequences in one interpretation when it is satisfied by all sequences in
the other. Each sequence s of M of the form:
o1 , o2 , o3 , . . . corresponds to a sequence s0 of
M* of the form [o1 ]E , [o2 ]E , [o3 ]E , . . . . For each
such sequence pair, s and s0 , it is apparent that
for any term t, s0 (t) is [s(t)]E . We now prove
that for all wffs A , for all such sequence pairs
s and s0 , sequence s (of M) will satisfy A iff
the corresponding sequence s0 (of M*) satisfies
A , by wff induction.
Induction step: Assume as inductive hypothesis that it holds for all wffs B simpler than
A that, for all such sequence pairs s and s0 , s This proof is left as an exam question, but it re54
55
UNIT 3
PEANO ARITHMETIC AND RECURSIVE FUNCTIONS
A.
The System S
y = x (y = z x = z)
3 UI3
x=yy=x
(Sym=T) UI2
x = y (y = z x = z)
4, 5 SL
(x) (y) (z)(x = y (y = z x = z))
6 Gen3
(Trans=) follows by UI3, and MP2.
e
4.
5.
6.
7.
`S
`S
`S
`S
Proof:
This follows from (S9) and MP2.
Proof:
Demonstration of (A6):
1. `S x + 0 = x
(S5)
2. `S x = y (x = z y = z)
(S1)
3. `S (x) (y) (z)(x = y (x = z y = z))
2 Gen3
4. `S x + 0 = x (x + 0 = x x = x) 3 UI3
5. `S x = x
1, 4 MP2
6. `S (x) x = x
5 Gen
14. `S y + z 0 = (y + z)0
13 Gen, UI
15. x = y x+z = y+z, x = y `S x+z 0 = y+z 0
12, 13, 14 Trans=, Sym=
16. `S (x = y x + z = y + z)
(x = y x + z 0 = y + z 0 )
15 DT2
17. `S (z)[(x = y x + z = y + z)
(x = y x + z 0 = y + z 0 )]
16 Gen
18. `S (z)(x = y x + z = y + z)
7, 17 MI
19. `S (x) (y) (z)(x = y x + z = y + z)
18 Gen2
e
The proofs of the following are in the book:
1. We add to the syntax of predicate logic the following subnective, which yields a term for any
variable x and wff A [x].
{x|A [x]}
This is read, the set of all x such that A [x].
2. All occurrences of x in a term of the form
{x|A [x]} are considered to be bound.
3. We also choose a two-place predicate letter E 2
to use for the membership relation. An expression of the form (t u) is shorthand for
E 2 (t, u), and (t
/ u) is shorthand for E 2 (t, u).
Axiomatization
The system F contains analogues of axiom
schemata (A1) through (A7) of the predicate calculus with identity (PF= ), the inference rules MP
and Gen, and the following two additional axiom
schemata:
(A8) (x)(A [x] x {y|A [y]}), for all cases
in which the variable y is free for x in A [x].
B.
Syntax
Suppose you wanted to construct an axiomatic system for mathematics, but did not want to take ,
+, 0 , and 0 as primitive, and instead wanted to
define them. One initially attractive way would be
to do this within an axiomatic set theory, in a way
such as the following, which Im calling system
F. This system is not Freges system, but a crude
oversimplification thereof.
59
Results
With these definitions in place, one can derive
Peanos postulates as theorems in the following
forms:
(P1) `F 0 N
(P2) `F x N x0 N
(P3) `F x N 0 6= x0
(P4) `F x N y N (x0 = y 0 x = y)
(P5) `F A [0] (x)(A [x] A [x0 ])
(x)(x N A [x])
As well as analogues of Mendelsons other axioms:
(S5F) `F x N x + 0 = x
(S6F) `F x N y N (x + y 0 ) = (x + y)0
(S7F) `F x 0 = 0
(S8F) `F x N y N (x y 0 ) = ((x
y) + x)
Also, we have, e.g.:
`F ((1 x) A [x]) ({x|A [x]} 1)
`F ((2 x) A [x]) ({x|A [x]} 2)
`F ((3 x) A [x]) ({x|A [x]} 3)
And so on.
Disaster
The system F, unfortunately, is inconsistent due to
Russells paradox:
`F {x|x
/ x}
/ {x|x
/ x}
{x|x
/ x} {x|x
/ x}
Proof: Direct from (A8) and universal instantiation.
Whence both `F {x|x
/ x} {x|x
/ x}, and
`F {x|x
/ x}
/ {x|x
/ x}.
Hence `F A for all wffs A , making the system
entirely unsuitable for mathematics. In this system
we have both `F 1 + 1 = 2 and `F 1 + 1 = 3!
Poor Frege.
Homework
Without using Russells paradox or other contradiction, prove `F {x} = {y} x = y.
Some History
In the late 19th century, Euclids axiomatization of
geometry came under new scrutiny. Many mathematicians began to investigate the axiomatization
of arithmetic as well. In 1879 German mathematician Richard Dedekind surmised that five principles formed the basis of all pure arithmetic.
In the 1880s and 1890s, the adequacy of those
five principles (and others) was studied in depth,
most importantly by a group of Italian mathematicians lead by Giuseppe Peano. In order to consider
them more systematically, Peano urged that the
principles be written using a rigorously defined
symbolic notation for logic and set theory, which
he was still developing at that time. Given Peanos
60
Russell decided to publish vol. I with only a preliminary discussion of the contradiction and possible
ways of avoiding it, leaving a complete solution
of the inconsistency within the formal system for
further development in vol. II.
While finishing vol. I in 19011902, Russell did
a search of recent literature on the foundations
of mathematics, and in so doing rediscovered the
works of Gottlob Frege. Frege, working in almost
complete isolation, had already in his 1884 Grundlagen der Arithmetik (trans. Foundations of Arithmetic), given a list of basic principles of arithmetic
very similar to Dedekinds, but also suggested, like
Russell, that given suitable definitions in terms of
notions of pure logic, that these principles could be
derived from logical principles alone. In fact, Frege
had already developed the core of an axiomatic system for logic in his 1879 work Begriffsschrift, and
in his later 1893 magnum opus, Grundgesetze der
Arithmetik, vol. I. (trans. Basic Laws of Arithmetic),
Frege expanded that system by adding axioms for
value-ranges (in effect, class theory), and had begun to derive the elementary truths of number
theory. While Russell was delighted to find such
common ground between his work and Freges,
he also discovered that Freges system fell prey to
his paradox, and was therefore inconsistent. He
broke the news gently to Frege in a letter. Here is a
translation of that letter, as well as Freges response
(both originally written in German):
61
Dear Colleague:
[16 June 1902]
I have known your Grundgesetze
der Arithmetik for a year a half, but
only now have I been able to find the
time for the thorough study I intend
to devote to your writings. I find myself in full accord with you on all main
points, especially in your rejection of
any psychological element in logic and
in the value you attach to a conceptual
notation for the foundations of mathematics and of formal logic, which, incidentally, can hardly be distinguished.
On many questions of detail, I find discussions, distinctions and definitions
in your writings for which one looks
arithmetic but the only possible foundations for arithmetic as such . . . Your
discovery is at any rate a very remarkable one, and it may perhaps lead to a
great advance in logic, undesirable as
it may seem at first sight . . .
The second volume of my
Grundgesetze is to appear shortly. I
shall have to give it an appendix where
I will do justice to your discovery. If
only I could find the right way of looking at it!
Yours sincerely,
Gottlob Frege
Unfortunately, the hastily prepared solution Frege
included in an Appendix to vol. II of Grundgesetze
was unsuccessful, and leads to a similar, but more
complicated, contradiction. For Russells part, it
took him seven more years to find a solution he
was happy with. By then, volume II of the Principles had grown so big, and had deviated so far from
the plan laid out in vol. I that Russell, along with his
new collaborator, Alfred North Whitehead, decided
to rename it Principia Mathematica, which was itself split into three volumes, published in 1910,
1911 and 1913. (Principia dropped set theory as
such, and instead re-interpreted talk of classes in
mathematics using notions not involving sets, but
instead higher-order quantification over propositional functions divided into ramified types.)
Meanwhile, other mathematicians had developed consistent set theories whose axioms, however, did not seem to have the character of selfevidence usually thought to be required of logical
truths. Such mathematicians still thought much of
mathematics could be reduced to set theory, but
denied that set theory was a branch of logic. The
first system was developed by Ernst Zermelo in
1908, added to, and made more rigorous by Adolf
Frnkel in 1922. Their system is now called ZF or
Zermelo-Frnkel set theory. Another was suggested by John von Neumann in 1925, and expanded by Paul Bernays and Kurt Gdel in the
1930s and is now called NBG set theory. Two more
set theories, NF and ML, were developed by W. V.
Quine in 1937 and 1940. New versions continue
62
C.
(S6)
(S5)
1, 2 LL
3 def. 1
e
Numerals
Result (1): `S x 1 = x
The following were either proven in your homework, or follow from those results.
(S8) Gen, UI
(S7)
1, 2 LL
(S5), (Com+)
3, 4 Trans=
5 def. 1
e
Result (2): `S x 2 = x + x.
Proof:
In the language of system S, the numerals consti- 1. `S
2. `S
tute the following series of closed terms:
3. `S
0 00 000 0000 00000 000000 0000000
4.
`S
0, 0 , 0 , 0 , 0 , 0 , 0 , 0 , . . . , etc.
Let us now introduce a metalanguage function, n
that yields, for a given number, the numeral of S for
that number. We define this function recursively
in the metalanguage as follows:
x 000 = (x 00 ) + x
x 00 = x
x 000 = x + x
x2=x+x
(S8) Gen, UI
Above, def. 1
1, 2 LL
3 def. 2
e
00000
So 2 is 0 , 5 is 0
0000000000000000000000000
, and 25 is 0
Result (+1): `S x + 1 = x0
Proof:
1. x + 0 = 0 `S x + 0 = 0
(Premise)
2. `S 0 + 0 = 0
(S5) Gen, UI
3. x + 0 = 0 `S x + 0 = 0 + 0
1, 2 LL
4. x + 0 = 0 `S x = 0
3 Canc+
5. `S 0 = 0
Ref=
6. x + 0 = 0 `S x = 0 0 = 0
3, 4 SL
7. `S x + 0 = 0 (x = 0 0 = 0)
6 DT
63
`S x + y 0 = (x + y)0
(S6)
0
`S 0 6= (x + y)
(S3) Gen, UI
`S (x + y)0 = 0 0 = (x + y)0 (Sym=T) UI2
`S (x + y)0 6= 0
9, 10 MT
`S x + y 0 6= 0
8, 11 LL
0
0
`S x + y = 0 (x = 0 y = 0)
12 SL
`S [x + y = 0 (x = 0 y = 0)]
[x + y 0 = 0 (x = 0 y 0 = 0)]
13 SL
15. `S (y){[x + y = 0 (x = 0 y = 0)]
[x + y 0 = 0 (x = 0 y 0 = 0)]}
14 Gen
16. `S (y)[x + y = 0 (x = 0 y = 0)] 7,15
MI
17. `S x + y = 0 (x = 0 y = 0)
16 UI
e
8.
9.
10.
11.
12.
13.
14.
or that:
000...(n times)...0 = 000...(m times)...0 `S 000...(nm times)...0 = 0
However, the negations of these follow from (S3),
or (S3) and Sym=, and UI. So, by DT and MT,
we get that `S 000...(n times)...0 6= 000...(m times)...0 , i.e.,
`S n 6= m.
e
D.
Ordering, Complete
Induction and Divisibility
Proof:
Abbreviations:
Assume that n 6= m. Hence what we need to prove
(t < u) for (x)(x 6= 0 t + x = u), where x
is a statement of the form:
is the first variable not in t or u
0
`S 000...(n times)... 6= 000...(m times)...0
(t u) for (t < u) (t = u)
0
where one side has more -signs than the other. (t > u) for (u < t)
Perform a reductio in the object language, taking as (t u) for (u t)
a premise the wff, 000...(n times)...0 = 000...(m times)...0 . By
(t u) for (t < u)
successive applications of (S4) and MP, depending
on whether n < m or m < n youll get either that: and we define (t u), (t u) & (t u) similarly.
000...(n times)...0 = 000...(m times)...0 `S 0 = 000...(mn times)...
64
Proof:
1. `S x + 0 = x
(S5)
2. `S x + y = x + 0 y = 0
(Canc+T) Gen, UI, Com+
3. `S x + y = x y = 0
1, 2 LL
4. `S y 6= 0 x + y 6= x
3 SL
5. `S (y 6= 0 x + y 6= x)
4 DN
6. `S (y 6= 0 x + y = x)
5 def.
7. `S (y) (y 6= 0 x + y = x)
6 Gen
8. `S (y)(y 6= 0 x + y = x) 7 DN, def.
9. `S x x
8 defs. <,
e
Result (Trans<):
`S x < y (y < z x < z)
(Ref) `S
(Anti-Sym<) `S
(Trans ) `S
(Trans<) `S
(Order) `S
`S
`S
(to =) `S
(0) `S
(0<) `S
(0) `S
(0) `S
(< Succ) `S
(< Succ) `S
`S
(+Pres) `S
(+Pres<) `S
(Pres) `S
(Pres<) `S
(Canc+<) `S
(Canc+) `S
(Canc<) `S
(Canc) `S
(OrdC) `S
xx
x<yyx
x y (y z x z)
x y (y < z x < z)
x=yx<yy<x
x<yxy
xyx>y
xyxyx=y
0x
0 < x0
x0
x=0x0
x < x0
x < y x0 y
x < y0 x y
xx+y
y 6= 0 x < x + y
y 6= 0 x x y
x 6= 0 (y > 1 x < x y)
x<y x+z <y+z
xy x+z y+z
z 6= 0 (x < y x z < y z)
z 6= 0 (x y x z y z)
(x)[((y)(y < x A [y])
(y)(y x B[y]))
(y)(A [y] B[y])]
`S (x)[((y)(y x A [y])
(y)(y > x B[y]))
(y)(A [y] B[y])]
Proof:
1. x < y `S (z)(z 6= 0 x + z = y) Pr, def. <
2. y < z `S (x)(x 6= 0 y + x = z) Pr, def. <
3. x < y `S b 6= 0 x + b = y
1 Rule C
4. y < z `S c 6= 0 y + c = z
2 Rule C
5. x < y `S x + b = y
3 SL
6. y < z `S y + c = z
4 SL
7. x < y, y < z `S (x + b) + c = z
5, 6 LL
8. x < y, y < z `S x + (b + c) = z
7 Assoc+
9. `S b + c = 0 (b = 0 c = 0) (0+), Gen, UI
10. x < y `S b + c 6= 0
3, 9 SL
`S (x = 0 . . . x = n) x n.
(3) We need:
`S (x = 0 . . . x = n x = n + 1)
xn+1
`S x = n0 x n0
(5) For the right-to-left conditional, first note by New Forms of Induction
the definition of , we have:
x n0 `S x < n0 x = n0
Result (CI):
`S (x)((y)(y < x A [y]) A [x])
(x) A [x]
(The principle of strong or complete mathematical induction.)
Proof:
For the proof, like any conditional, we begin by
assuming the antecedent, abbreviated as ($).
1. ($) `S (x)((y)(y < x A [y]) A [x])
(Pr)
Rather than directly proceeding to derive
(x) A [x], we instead attempt to show
(x) (z)(z x A [z]) by normal (weak)
induction on x.
2. z 0 `S z = 0
(Pr), (Zero) SL
3. ($) `S (y)(y < 0 A [y]) A [0]
1 UI
4. `S y 0
(Zero) Gen, UI
5. `S y < 0 A [y]
4 SL
6. `S (y)(y < 0 A [y])
5 Gen
7. ($) `S A [0]
3, 6 MP
66
8. ($), z 0 `S A [z]
2, 7 LL
9. ($) `S z 0 A [z]
8 DT
10. ($) `S (z)(z 0 A [z])
9 Gen
This establishes the base step. Next:
11. (z)(z x A [z]) `S z x A [z]
(Pr), UI
12. `S z x0 z < x0 z = x0
Taut, def.
13. `S z < x0 z x
(<Succ), Gen, UI
14. (z)(z x A [z]) `S z < x0 A [z]
11, 13 SL
0
15. (z)(z x A [z]) `S y < x A [y]
14 Gen, UI
16. (z)(z x A [z]) `S (y)(y < x0
A [y]) 15 Gen
0
17. ($) `S (y)(y < x A [y]) A [x0 ] 1 UI
18. ($), (z)(z x A [z]) `S A [x0 ] 16, 17 MP
19. ($), (z)(z x A [z]) `S z = x0 A [z]
18 PF=
0
20. ($), (z)(z x A [z]) `S z x A [z]
12, 14, 19 SL
21. ($), (z)(z x A [z]) `S (z)(z x0
A [z])
20 Gen
22. ($) `S (z)(z x A [z]) (z)(z x0
A [z])
21 DT
23. ($) `S (x)[(z)(z x A [z])
(z)(z x0 A [z])]
22 Gen
This establishes the induction step, whence:
24. ($) `S (x) (z)(z x A [z])
10, 23 MI
25. ($) `S x x A [x]
24 UI2
26. `S x x
(Ref)
27. ($) `S A [x]
25, 26 MP
28. ($) `S (x) A [x]
27 Gen
29. `S (x)((y)(y < x A [y]) A [x])
(x) A [x]
28 DT
e
Divisibility
While the division function cannot be defined for
the natural numbers alone, the relation of divisibility can be so defined.
Abbreviation: t|u for (x)(u = t x), where x
is the first variable that does not occur in t and u.
This can be read as u is evenly divisible by t, t
evenly divides u, or as u is a multiple of t.
Proof:
1. `S x = x 1
2. `S (y)(x = x y)
3. `S x|x
(1), Sym=
1 EG
2 def. |
e
Proof:
1. `S
2. `S
3. `S
4. `S
Proof:
This is roughly the transposition of (CI), with
A [x] substituted for A [x]. See book for details.e
67
x=x1
x=1x
(y)(x = 1 y)
1|x
(1), Sym=
1 (Com), Trans=
2 EG
3 def. |
e
Proof:
1. `S 0 = x 0
2. `S (y)(0 = x y)
3. `S x|0
E.
Expressibility and
(S7) Sym=
Representability
1 EG
2 def. |
e Weve now seen that number-theoretic relations
such as <, , |, etc., can be defined in S, even
though they were not taken as primitive predicate
letters. It is also easy to see that certain functions
on the natural numbers, such as the squaring funcResult (Trans|): `S x|y y|z x|z
tion, n2 , could be defined in S. Our topic over the
next few days will involve general results about
what sort of mathematical functions and relations
can be expressed or represented in system S (and
Proof:
similar systems), and what sort cannot be.
1. x|y y|z `S x|y
(Premise) SL
In the metatheory, functions and relations are
2. x|y y|z `S y|z
(Premise) SL
considered set-theoretically. An n-place relation,
3. x|y y|z `S (z)(y = x z)
1 def. |
for example, is considered to be a set of n-tuples.
4. x|y y|z `S (x)(z = y x)
2 def. |
An n-place function is considered as a set of or5. x|y y|z `S y = x b
3 Rule C
dered pairs, the first elements of which are them3 Rule C
6. x|y y|z `S z = y c
selves n-tuples. For most purposes, however,
5, 6 LL
7. x|y y|z `S z = (x b) c
we can think of them more informally as argu8. x|y y|z `S z = x (b c) 7 Assoc, Trans=
ment/value mappings.
9. x|y y|z `S (y)(z = x y)
8 EG
Let N be the set of natural numbers
10. x|y y|z `S x|z
9 def. |
11. ` x|y y|z x|z
10 DT {0, 1, 2, . . . }. We then define the following:
S
68
Within a given mathematical system such as S, 1. The identity relation on the set of natural numbers is expressible in S by the wff x1 = x2 , since:
some functions and relations may be definable and
(a) If k1 = k2 , then k1 is the same as k2 , so
some may not be definable. Let us make this more
`S k1 = k2 is an instance of (Ref=).
precise.
(b) Result (Num6=), on p. 64, established that
Below, we assume that K is an axiom system
for any natural numbers k1 and k2 , if k1 6=
with numerals for natural numbers (e.g., System S).
k2 , then `S k1 6= k2 .
Definition: A given n-place number-theoretic rela- 2. The less than relation is expressible in S by the
tion R is said to be expressible in K iff there is a wff
wff x1 < x2 , i.e., (x)(x 6= 0 x1 + x = x2 ).
A [x1 , . . . , xn ] with x1 , . . . , xn as its free variables 3. The zero function, whose value is 0 for any
such that, for any natural numbers k1 , . . . , kn :
natural number as argument, is strongly repre(i) If R holds for hk1 , . . . , kn i, then
sentable in S (or any other theory with identity)
`K A [k1 , . . . , kn ];
by the wff (x1 = x1 y = 0), since:
(ii) If R does not hold for hk1 , . . . , kn i, then
(a) For any natural number k,
`K A [k1 , . . . , kn ].
`S (k = k 0 = 0)
(b) `S (1 y)(x1 = x1 y = 0)
Definition: A given n-place number-theoretic 4. The successor function is strongly representable
function F is said to be representable in K iff there
in S by y = x01 .
is a wff A [x1 , . . . , xn , y] with x1 , . . . , xn and y as 5. The projection functions Uin are functions
its free variables such that, for any natural numbers
which, for any n arguments, simply return their
k1 , . . . , kn and m:
ith argument as value. E.g., U34 (5, 8, 2, 13) = 2,
(i) If the value of F for hk1 , . . . , kn i as argument
and U34 (7, 1, 0, 16) = 0. They are strongly repreis m, then `K A [k1 , . . . , kn , m];
sentable in S (or any other theory with identity)
(ii) `K (1 y) A [k1 , . . . , kn , y].
by wffs of the form
(Note that A [x1 , . . . , xn , y] might be an identity
statement of the form y = F (x1 , . . . , xn ), where
F is a function letter, but it need not be; it could
instead be any wff containing y and x1 , . . . , xn free
satisfying the above conditions.)
Definition: A given n-place number-theoretic
function F is said to be strongly representable in
K iff there is a wff A [x1 , . . . , xn , y] with x1 , . . . , xn
and y as its free variables such that, for any natural
numbers k1 , . . . , kn and m:
(i) If the value of F for hk1 , . . . , kni as argument
is m, then `K A [k1 , . . . , kn , m];
(ii) `K (1 y) A [x1 , . . . , xn , y].
There are only denumerably many wffs within our
language, but there is a non-denumerably infinite
69
(x1 = x1 . . . xn = xn y = xi )
since:
(a) For any hk1 , . . . , kn i, the value of Uin is ki ,
and `S (k1 = k1 . . . kn = kn ki =
ki ).
(b) `S (1 y)(x1 = x1 . . . xn = xn y =
xi ).
Sketch of proof:
Note that part (ii) of the definition of strong representability entails (by Gen and UI) part (ii) of
the definition of representability, so the right-toleft conditional holds. For the left-to-right conditional, note that if F is represented in K by
A [x1 , . . . , xn , y]. Then, we can construct a wff
B[x1 , . . . , xn , y] with the following form:
0 if R holds for hk , . . . , k i,
1
n
CR (k1 , . . . , kn ) =
1 if not.
Result: Number-theoretic functions defined by
substitution of strongly representable functions
within strongly representable functions are also
strongly representable. More precisely, if F is an
n-place function whose value for hk1 , . . . , kn i is
g(h1 (k1 , . . . , kn ), . . . , hm (k1 , . . . , kn )) where g
and h1 , . . . , hm are all strongly representable in
K, then F is also strongly representable in K.
Examples:
(a) C< (3, 7) = 0 but C< (7, 3) = 1, etc.
(b) C= (2, 2) = 0 but C= (2, 3) = 1, etc.
(c) C| (3, 27) = 0 but C| (3, 26) = 1, etc.
Note that this is the reverse of many programming
languages, etc., in which the Boolean number 1
is used for truth, and 0 is used for falsity.
Sketch of proof:
Suppose that g is (strongly) represented in K by
the wff B[x1 , . . . , xm, y] and h1 through hm are
(strongly) represented by the wffs A1 [x1 , . . . xn , y]
through Am [x1 , . . . xn , y]. It is then possible to rep70
Proof:
(1) To see the truth of the left-to-right conditional,
note that if R is expressed in K by the wff
A [x1 , . . . , xn ], then CR can be represented by
the wff:
(A [x1 , . . . , xn ] y = 0)
(A [x1 , . . . , xn ] y = 1).
(2) For the right-to-left conditional, note that
if CR is represented in K by some wff
A [x1 , . . . , xn , y], then given that `K 0 6= 1, R
can be expressed by the wff A [x1 , . . . , xn , 0].
Example: C< can be represented in S by the wff
((x)(x 6= 0 x1 + x = x2 ) y = 0)
( (x)(x 6= 0 x1 + x = x2 ) y = 1).
Recursive Functions
We have been discussing number-theoretic functions and relations and what it is for them to be
71
f (x1 , . . . , xn ) =
g(h1 (x1 , . . . , xn ), . . . , hm (x1 , . . . , xn ))
72
Proof:
This follows by substitution, since:
Result: If n-place function f is (primitive)
recursive, then so is the n-place function g
whose value, g(. . . , xi , . . . , xj , . . . ), is always
f (. . . , xj , . . . , xi , . . . ).
(Permuting variables.)
Proof:
Again, using substitution and projection:
g(. . . , xi , . . . , xj , . . . ) = f (. . . , Ujn (x1 , . . . , xn ),
. . . , Uin (x1 , . . . , xn ), . . . ) e
Proof:
This can be proven by induction on k. For k = 0,
the n-place constant function is the same as the
n-place zero function. For the rest, the n-place constant function whose value is always k + 1 can be
defined by substitution since:
n
Ck+1
(x1 , . . . , xn ) = N (Ckn (x1 , . . . , xn ))
Proof:
Our method is similar to the above:
g(x1 , . . . , xn ) = f (U1n (x1 , . . . , xn ),
U1n (x1 , . . . , xn ), . . . , Unn (x1 , . . . , xn )) e
The practical effect of these three results, especially
when combined together, is that it strengthens the
substitution rule so that not all the hs need to be
n-place functions, nor do they have to put the variables in the same order as f , nor do they have to
make use of all the xs, etc. Similar results follow
for the g and h used in the recursion rule. (We
shall simply put this into practice from now on.)
73
the list above is the notation of ordinary mathematics. We have not shown how these functions could
be represented in System S or any other axiomatic
system built upon the predicate calculus (at least
not yet anyway).
f (x1 , . . . , xn , z)
z<y
Proof:
The function g can be defined by recursion as follows:
g(x1 , . . . , xn , 0) = 0
g(x1 , . . . , xn , y + 1) = g(x1 , . . . , xn , y) +
f (x1 , . . . , xn , y)
e
Similar results follow for the form:
X
= f (x1 , . . . , xn , z)
zy
f (x1 , . . . , xn , z) =
f (x1 , . . . , xn , z)
z<y+1
zy
f (x1 , . . . , xn , z) =
y<z<v
. y)
z<(v
f (x1 , . . . , xn , z + y + 1)
f (x1 , . . . , xn , z)
z<y
stands for the (n + 1)-place bounded product function g, whose value for hx1 , . . . , xn , yi as argument is the product of all the values of f for
Remember that all the mathematical notation on hx1 , . . . , xn , 0i through hx1 , . . . , xn , y 1i.
74
R(x , . . . , x , z)
z<y
. . . is the
bounded existential quantifier, since
Proof:
z<y
z
The characteristic function for Q can be defined
same
as
notnot- . . .).
e
in terms of the characteristic function for R by
z<y
substitution as follows:
Similar results follow for bounded universal
Y
quantifiers
using , and doubly bounded ones, etc.
CQ (x1 , . . . , xn , y) =
CR (x1 , . . . , xn , z)
z<y
The notation:
zz<y R(x1 , . . . , xn , z)
is used to stand for the function g whose value
for hx1 , . . . , xn , yi as argument is the least number z less than y for which the relation R holds
for hx1 , . . . , xn , zi if there is such a z, and whose
value is y if there is no such z.
Similar results follow for bounded existential quantifiers using , doubly bounded existential quantifiers, etc.
The notation:
z
R(x , . . . , x , z)
n
Proof:
Function g can be defined using the characteristic
is used in the metalanguage to stand for the rela- function of R by substitution as follows:
tion Q that holds for hx1 , . . . , xn , yi just in case the
X
Y
relation R holds for all ordered (n + 1)-tuples of
g(x1 , . . . , xn , y) =
CR (x1 , . . . , xn , w)
the form hx1 , . . . , xn , zi where z < y.
z<y wx
z<y
Proof:
CQ (x1 , . . . , xn , y) = sg
z<y
CR (x1 , . . . , xn , z)
Similar results follow for bounded -operators 6. The function (x)y whose value is the exponent
using instead of <, and doubly bounded on the y th prime in the prime factorization of x,
operators.
is primitive recursive:
(x)y = zz<x ((py )z |x and not-((py )z+1 |x))
Result: The relations and functions listed below
are primitive recursive.
Result: If the functions g1 , . . . , gm are all (primitive) recursive, and if the relations R1 , . . . , Rm
are all (primitive) recursive, then so is the function f whose value can be informally characterized as follows:
.
x)
C< (x, y) = sg(y
f (x1 , . . . , xn ) =
g1 (x1 , . . . , xn )
if R1 (x1 , . . . , xn )
..
gm (x1 , . . . , xn )
if Rm (x1 , . . . , xn )
Proof:
The above definition is equivalent to the following:
p0 = 2
py+1 = zz(py )!+1 (py < z and Pr(z))
f (x1 , ..., xn ) =
(g1 (x1 , . . . , xn ) sg(CR1 (x1 , . . . , xn ))) + . . .
+ (gm (x1 , . . . , xn ) sg(CRm (x1 , . . . , xn ))) e
G.
h1, 0i
2
h1, 1i
4
h1, 2i
7
h1, 3i
11
h1, 4i
16
h2, 0i
5
h2, 1i
8
h2, 2i
12
h2, 3i
17
9
14
20
h3, 1i h4, 1i
13
19
h3, 2i
18
a0 , a1 , a2 , . . . , ak
78
by substitution as follows:
f #(x1 , . . . xn , y) =
z<y
(p`~(x)+z )(y)z
(b) On the other hand, suppose that f # has already been shown to be (primitive) recursive.
One can then obtain f by substitution as follows:
z<`~(y)
Course-of-Values Recursion
Because a single number can be used to encode a
finite sequence of numbers, it is possible to define
a function whose value for y as argument encodes
the sequence (course) of values of another function for all arguments leading up to and including
y.
If f is a (n + 1)-place number-theoretic function, then the notation f # is used for the
(n + 1)-place number-theoretic function whose
value for hx1 , . . . , xn , yi is the number that encodes the series of values for f for all (n + 1)tuples starting with hx1 , . . . , xn , 0i and ending
with hx1 , . . . , xn , y 1i.
Gdels -Function
The upshot of all this is that it provides yet another method of talking indirectly about sequences
Consider the following primitive recursive funcof numbers. Each sequence corresponds to a b
tion, defined as follows:
and c, which, when combined together with the
-function, give us a way of retrieving elements
(x1 , x2 , x3 ) = rm(1 + ((x3 + 1) x2 ), x1 )
in the sequence. Claims made about sequences
Surprisingly, for any series of natural numbers of numbers can be transformed into claims made
with n + 1 members
about the b and c that it would be appropriate to
use as the first two arguments to the -function
k0 , k1 , k2 , . . . , kn
for that sequence.
one can find two fixed natural numbers b and c
This will also help us prove that all recursive
such that, for any i, such that i n, (b, c, i) = ki .
functions are representable in S.
To see this, first, let c be
For further discussion of the Chinese Re(max(n, k0 , k1 , k2 , . . . , kn ))!
mainder Theorem, see the book, p. 184, 188,
419,
and:
http://www.cut-the-knot.org/
Next, consider the following sequence:
blue/chinese.shtml
u0 , u1 , u2 , . . . , un
where each ui = 1 + ((i + 1) c), for all i n.
No two members of the u-series have a factor
in common other than 1. (It is a matter of tedious
arithmetic to show this.)
It follows from this and a known principle of
modular arithmetic, the Chinese remainder theorem,
that there is at least one number b such that the
remainder upon division of b by ui is always ki
for every i from 0 to n. (The proof of this is more
tedious arithmetic.)
Therefore, because each ui is 1 + ((i + 1) c),
it follows that rm(1 + ((i + 1) c), b) = ki , which
is to say that (b, c, i) = ki .
H.
Representing Recursive
Functions in System S
Our next task is to show that every recursive function is representable in System S.
Recall that recursive functions are those obtained from the initial functions (the zero function,
the successor function and the projection functions) by some finite number of applications of the
substitution, recursion and choice of least rules.
Proof:
1. Suppose that g is represented in S by the wff
E [x1 , . . . , xn , xn+1 , y]. By definition, then:
(1a) if the value of g for hk1 , . . . , kn , kn+1 i as
argument is m, then
`S E [k1 , . . . , kn , kn+1 , m]; and
(1b) `S (1 y) E [k1 , . . . , kn , kn+1 , y].
2. We can represent f in S using the wff:
E [x1 , . . . , xn , y, 0]
(z)(z < y E [x1 , . . . , xn , z, 0])
3. We need to show that:
(3a) If the value of f for hk1 , . . . , kn i is j, then,
`S E [k1 , . . . , kn , j, 0]
(z)(z < j E [k1 , . . . , kn , z, 0])
(3b) `S (1 y)(E [k1 , . . . , kn , y, 0]
(z)(z < y E [k1 , . . . , kn , z, 0]))
4. To show (3a), first assume that the value of f
for hk1 , . . . , kn i is j. Then j must be the least y
such that g(k1 , . . . , kn , y) = 0. Hence, by (1a):
(4a) `S E [k1 , . . . , kn , j, 0]
For any 0 i < j, g(k1 , . . . , kn , i) is
something other than 0, so, by (1a) and
(1b), we can get:
(4b) `S E [k1 , . . . , kn , 0, 0]
E [k1 , . . . , kn , 1, 0] . . .
E [k1 , . . . , kn , j 1, 0]
By a corollary proven on p. 66, it follows
from (4b) that:
Proof:
(This proof is very complex, perhaps the most complex single proof of the semester. It is given in
detail in the book. Im not going to try to recreate
all the details here, but will merely give a rough
outline.)
1. Suppose that f is obtained from g and h recursively as suggested above. Then, if m is the
value of f for some hx1 , . . . , xn , yi, there must
be some finite sequence,
81
v0 , v1 , . . . , vy
(the course-of-values of f for all arguments leading up to and including y) where vy is m and
also: v0 = g(x1 , . . . , xn ), and for all 0 i < y,
vi+1 = h(x1 , . . . , xn , i, vi )
E.g., if f (x, y) is xy , the v-sequence would be:
1, x, x2 , x3 , . . . , xy
E [x1 , . . . , xn , z3 , y3 , y4 ])
multiplication (this plays the role of h). Making some minor simplifications, these are represented in S by the wffs y = 1 and y = x1 x2 .
According to the above recipe, the function xx1 2
is represented by the following wff:
(z3 ) z3 < 3
(y3 ) (y4 )(Bt[z1 , z2 , z3 , y3 ]
Bt[z1 , z2 , z30 , y4 ] y4 = y3 1)
Corollary: All recursive number-theoretic relations are expressible in System S, including all
primitive recursive ones.
Proof:
By definition, their characteristic functions are recursive, and hence their characteristic functions
are representable in S. We have already established
that whenever a relations characteristic function
is representable in a given theory with identity, the
relation is expressible in that theory.
e
This gives us an intuitive sense of the strength
of system S; more or less, it has as theorems the
appropriate arithmetical results regarding all recursive functions and relations, i.e., those that can in
principle be calculated by a mechanical procedure
by computer, calculator or similar device.
84
UNIT 4
GDELS RESULTS AND THEIR COROLLARIES
A.
The System ,
Axiomatization
We normally think of the wffs in a logical system The system has one axiom: 2
The system has one inference rule:
as having meaning, or at least as having a meanadd circle: if A is a wff, from A , infer A #.
ing given an interpretation, such as the standard
A theorem is any wff that can be derived from
interpretation for System S. However, it is possible
the
axiom by some finite number of applications of
to think of an axiomatic system as just a system of
the inference rule.
rules for manipulating syntactic strings.
Hence, the following are theorems:
Consider the following simple system for ma2#
nipulating strings of symbols:
2##
2###
2####
Syntax
and so on . . .
The basic syntactic units are the signs 2 and #.
A formula is any string of one or more of
Metatheory
these two signs, such as: #, 2#, #2##2
or 2##2#2.
Schmdel Numbering
A well-formed formula (wff) is any formula
that begins with 2. So 2#2##2, 2222, Since the system has no intended meaning, notions
2##, and 2#2 are all wffs. However, such as completeness and soundness do not apply.
However, this does not mean that we cannot
##2#2 and #22#2 are not wffs.
prove anything about it. We can prove, e.g., that
not every wff is a theorem, etc.
Metalogical results for System , can be
Semantics
made simpler by coordinating every wff with its
The wffs of system , do not have any intended Schmdel number. Schmdel numbering is much
interpretation or meaning. (This is not to say that easier than Gdel numbering. To get the Schmdel
they cannot be interpreted as having a meaning, number of a string of signs for ,, simply replace
however.) The system is only intended to be a every 2 with the digit 1 and every # with the
game of string manipulation for the very easily digit 0, and think of the result as a numeral writamused.
ten in binary notation. Let the Schmdel number
85
of the wff be the number that this binary numeral This says that x = 1, or 2 and nothing odd above 2
signifies.
divides evenly in to x. In effect, we can prove that
2##### is a theorem of , in S, since:
Examples: Hence the Schmdel number of
2##2# is 18 (10010 in binary), and the
`S 32 = 1 2|32
Schmdel number of 2##### is 32 (100000 in
binary).
(y)(y > 2 2|y y|32)
Similarly, we can prove in S that 2####2 is not
a theorem of ,, since:
`S 33 = 1 (2|33
B.
System S as Metalanguage
Because all the number-theoretic properties and
relations one would need to do metatheory for ,
are recursive, it turns out that System S could be
used a metalanguage for System ,. For example,
x is a power of 2 can be expressed in S using the
wff:
86
being a number that encodes a finite sequence of Gdel numbers whose corresponding wffs, in order, constitute a proof of the
wff with Gdel number n, etc.
These properties and relations are entirely
arithmetical in nature, just like being a power
of 2 is entirely arithmetical in nature.
Gdelian Results
Gdel found a trick to make it possible, for any
system that can do enough mathematics to express
recursive properties and relations, to construct a
closed wff written entirely in the language of that
system that in effect says that its own Gdel number is not the Gdel number of a theorem of that
system. It then follows that if the system is consistent, it cannot be complete.
1. Suppose that for System S the wff in question
is abbreviated as G . Note that since G is built
up entirely in the syntax of S, it is really a
mathematical statement, involving only 0, 0 , +, ,
=, variables, and the logical constants.
Note that:
(1a) G is true in the standard interpretation for
S iff not-`S G .
(1b) G is true in the standard interpretation
iff `S G .
2. Suppose for reductio that both:
(2a) System S is consistent, i.e., there is no wff
A such that `S A and `S A .
(2b) System S is complete, i.e., for all wffs A ,
if A is true in the standard interpretation,
then `S A .
3. It follows that G is not true in the standard interpretation. If it were true, by (2b) it would be
a theorem, but by (1a) it would also not be a
theorem, which is impossible.
4. Since G is closed, and it is not true in the standard interpretation, G is true in the standard
interpretation. It then follows by (1b) that `S G ,
but it follows from (2b) that `S G . Hence, S is
inconsistent, which contradicts (2a).
5. Because S appears to be consistent, we must
conclude that it is incomplete.
Note that this means that there are purely arithmetical truths written in the syntax of System S
87
metalogic, which has led to many other interesting 5. If P is a predicate letter, and n is the number
results besides Gdels.
of its subscript and m is the number of its superscript, then depending of which letter of the
alphabet is used (A through T), let k be one of
C. Arithmetization of Syntax
1 through 20, and let g(P) = 3 + 8(2m 3(20n+k) ).
We start with the process of Gdel numbering;
note that because of differences in the way I originally laid out the syntax and the way Mendelson
did, there are some very subtle but unimportant
differences in our way of doing Gdel numbering
below.
iff
y<x
Proof:
All standard first-order theories use all variables,
y
y<x
is primitive recursive.
89
Atfml(x) or
(Fml(y) and x = 2
y) or
y<x
y
y<x z<x
x = 23 y 211 z 25 ) or
(Fml(y) and EVbl(z) and
x = 23 23 213 z 25 y 25 ))
(For some, I give the arithmetical formulas characterizing them in the metalanguage; for the rest,
consult the book. For the most part, they can be recursively characterized fairly easily with the functions used to do encoding, especially. A rare
few involve course-of-values recursion or the like.)
(a) EVbl(x): x is the Gdel number of a singlesymbol string consisting of a variable alone,
y
i.e.:
(Vbl(y) and x = 2 ).
y
y<x
alone, i.e.:
(IC(y) and x = 2 ).
y
(k)
y<x
(FL(y) and x = 2 ).
y
y<x
(PL(y) and x = 2 ).
y
(l)
y<x
23 23 213 2z 25 y 25 25 25 )
y<x
(Ax(y) and x = 2 ) or
y
y<x
y
x = y 2 ) or
(Prf(y)
y<x z<x
Roughly, this says, x is the Gdel number of a proof iff either (i) it encodes a
sequence consisting of an axiom by itself, (ii) it is obtained from the Gdel
number of a shorter proof by appending
the Gdel number of a new Gen step to
the encoding, (iii) it is obtained from the
Gdel number of a shorter proof by appending the Gdel number of a new MP
step to the encoding, or (iv) it is obtained
92
Result: For any theory K (such as S) with a system of numerals, a recursive vocabulary and axiom set, and for which the following principle
holds:
(%) For any natural numbers r and s, if
`K r = s then r = s.
For any n-place number-theoretic function f , if
f is representable in K, then f is recursive.
Proof:
(5c) So we can obtain PA by substitution:
(1) Assume K is such a theory and assume that f
is representable in K.
PA (z1 , . . . , zn , u, v) =
(2) By the definition of representability, there is
Pf(v, Sub(. . . Sub(Sub(c, Num(u), 29),
some wff A [x1 , . . . , xn , y] with x1 , . . . , xn and
Num(z1 ), 45) . . . , Num(zn ), 21+24n))
y as its free variables such that, for all natural
numbers k1 , . . . , kn , and m:
Since Pf, Sub and Num are recursive, so
(2a) If the value of f for hk1 , . . . , kn i as arguis PA .
ment is m, then `K A [k1 , . . . , kn , m];
(6) For any natural numbers k1 , . . . , kn , r and j,
(2b) `K (1 y) A [k1 , . . . , kn , y].
if PA (k1 , . . . , kn , r, j), then the value of f for
(3) Let c be the Gdel number of the wff
hk1 , . . . , kn i as argument is r.
A [x1 , . . . , xn , y] that represents f .
(6a) Assume PA (k1 , . . . , kn , r, j).
(4) Consider the (n + 2)-place number-theoretic
(6b) By the definition of PA ,
relation PA such that PA (z1 , . . . , zn , u, v) iff v
`K A [k1 , . . . , kn , r].
is the Gdel number of a proof in K of the wff:
(6c) f is a number-theoretic function;
so it must have some value s for
hk1 , . . . , kn i as argument.
By (2a),
A [z1 , . . . , zn , u]
`K A [k1 , . . . , kn , s].
(6d) By (2b), (6b) and (6c), `K r = s.
Or in other words, PA holds for z1 , . . . , zn , u
(6e) By principle (%), it must be that r = s.
and v iff v is the Gdel number of an object- (7) For any hz1 , . . . , zn i, f will have some
language proof essentially to the effect that
value, u, and by (2a) there will be a proof
for A [z1 , . . . , zn , u] in K. Hence for any
hz1 , . . . , zn i there will be a sequence:
f (z1 , . . . , zn ) = u.
u, v
(5) We can prove that PA is a recursive relation.
Where v is the Gdel number of a proof of
(5a) Because K has a recursive vocabulary and
A [z1 , . . . , zn , u] in K, which is to say:
axiom set, the following functions and
PA (z1 , . . . , zn , u, v).
relation, discussed above, are recursive:
Pf(x, y), Sub(x, y, z), and Num(x).
Let w be the number that encodes the above
(5b) Note that c is the Gdel numsequence, i.e., 2u 3v . Hence:
ber of A [x1 , . . . , xn , y], and 29 is
the Gdel number of y, and the
PA (z1 , . . . , zn , (w)0 , (w)1 ).
Gdel numbers of x1 , . . . , xn are
(8) Then f can be obtained from PA using the
45, 69, 93, . . . (increases by 24) . . . ,
choice of least rule and the function (x)y . Con(21 + 24n).
sider:
Then we can see that:
Sub(c, Num(u), 29) is the Gdel number
f (z1 , . . . , zn ) =
of A [x1 , . . . , xn , u], and so
(w(PA (z1 , . . . , zn , (w)0 , (w)1 )))0
Sub(Sub(c, Num(u), 29), Num(z1 ),
45) is the Gdel number of
This function will return the u such that w is
A [z1 , x2 , . . . , xn , u].
the least number that encodes a sequence u, v
Repeating this process, we can see that:
such that:
Sub(. . . Sub(Sub(c, Num(u), 29), Num(z1 ),
45) . . . , Num(zn ), 21 + 24n) is the Gdel
PA (z1 , . . . , zn , u, v)
number of A [z1 , . . . , zn , u].
93
The System RR
The syntax and intended semantics for RR are the
same as for system S. For its deductive theory, it
consists of the logical axioms (A1)(A5), the inference rules MP and Gen, and the following proper
axioms.
(RR1) x = x
(RR2) x = y y = x
(RR3) x = y (y = z x = z)
(RR4) x = y x0 = y0
(RR5) x = y (x + z = y + z z + x = z + y)
94
(x) (y) x y = y x
(x) (y) (z)(x + y) + z = x + (y + z)
(x) (y) (z) x (y + z) = (x y) + (x z)
However, the lack of such principles does not interfere with results about which number-theoretic
functions are representable, and which numbertheoretic relations are expressible. Recall that the
definitions of expressibility and representability
primarily have to do with getting the appropriate
theorems for the right numerals, not for getting
general results stated with quantifiers. In fact . . .
`K B E [pBq]
The proofs of these results for RR are almost exactly the same as the corresponding proofs for S.
It would be matter of tedious backtracking to see
this. This is not surprising, since RR was custom
tailored to allow these results to go through. In the
proof of these results for S, we rarely appealed to
theorems that require (S9), and in those few occasions in which we did, we have been given a new
axiom of RR that works just as well.
Obviously, RR is incomplete, and too weak for
what we wanted. However, it will turn out to be
useful later in the unit to have a weaker system
with only a finite number of proper axioms, to
make certain other things easier to prove, especially Churchs theorem.
95
wff of the form A [x], the value of D is the Gdel (7) We can now prove the biconditional:
number of the formula obtained by substituting the
`K B E [pBq]
numeral for the Gdel number of A [x] for all free 1. B `K B
(Premise)
occurrences of x in A [x].
2. B `K (y)(D[p, y] E [y])
1 (5b)
2 UI
3. B `K D[p, q] E [q]
Proof:
4. B `K E [q]
3, (6a), MP
(1) Assume that K is a theory meeting conditions 5. B ` E [pBq]
4 (5c)
K
(i)(iii) above, and then consider any wff E [x] 6. ` B E [pBq]
5 DT
K
having x as its only free variable.
7. E [pBq] `K E [pBq]
(Premise)
(2) Because K has a recursive vocabulary and a 8. D[p, y] ` D[p, y]
(Premise)
K
system of numerals, the function D for K is a 9. D[p, y] ` y = q
8, (6a), (6b) PF=
K
recursive number-theoretic function.
9 (5c)
10. D[p, y] `K y = pBq
(3) Because all recursive functions are repre- 11. E [pBq], D[p, y] ` E [y]
7, 10 LL
K
sentable in K, and D is a recursive function, 12. E [pBq] ` D[p, y] E [y]
11 DT
K
there is some wff D[x, y], such that, for all nat- 13. E [pBq] ` (y)(D[p, y] E [y])
12 Gen
K
ural numbers k and m:
14. E [pBq] `K B
13 (5b)
(3a) If D(k) = m, then `K D[k, m].
15. `K E [pBq] B
14 DT
(3b) `K (1 y) D[k, y].
16. `K B E [pBq]
6, 15 SL
(4) Now consider the following wff:
This establishes the theorem.
e
(4a) (y)(D[x, y] E [y])
This wff more or less says that E holds of the This establishes the theorem. The Fixed-Point TheGdel number obtained from x by the diago- orem makes a certain kind of self-reference possible, which leads to all sorts of fun results.
nalization function.
(5) Let p be the Gdel number of the wff (4a).
(5a) p is p(y)(D[x, y] E [y])q
Consider now the following closed wff, F. -Consistency, True
which hereafter well call B:
Theories and Completeness
(5b) B is (y)(D[p, y] E [y])
This wff, B, says that E holds of the Definition: A theory K with a system of numerGdel number obtained from p from the als is said to be -consistent iff for every wff A [y]
diagonalization function. Let q be the containing y as its only free variable, if it is true for
every natural number n that
Gdel number of B. Hence:
(5c) q is pBq
`K A [n],
Notice that (4a) is itself of the form A [x].
Hence, the value of the diagonalization then it is not the case that `K (y) A [y].
function for its Gdel number p, will be Basically, a system is -consistent if whenever you
the Gdel number of B, i.e., q:
can prove that A [y] holds for each particular
(5d) D(p) = q
numbers, you cannot then also prove the quantiNotice that because B says that E holds fied statement there is some number y such that
of the Gdel number obtained from p A [y].
from the diagonalization function, and
q is the Gdel number of B, B in effects Definition: A theory K with the same syntax as S
says that E holds of its own Gdel num- is said to be a true arithmetical theory iff all its
proper axioms are true in the standard interpretation.
ber.
(6) By (5d) and (3a) and (3b), we can conclude:
(6a) `K D[p, q]
(6b) `K (1 y) D[p, y]
Proof:
Both are true arithmetical theories.
Proof:
Suppose K is -consistent. Then it cannot be inconsistent, because every wff is provable in an
inconsistent system. Assume for reductio that K is
inconsistent. For any wff A [y] containing y as its
only free variable, it will be true for every natural
number n that `K A [n], but it will also hold that
`K (y) A [y]. So K is not -consistent.
e
Result: All theorems of a true arithmetical theory are true in the standard interpretation.
Proof:
1. Assume K is a theory with the characteristics
above.
2. Because K is -consistent, it is consistent.
3. Because K has a recursive axiom set, the number
theoretic relation Pf(x, y), that holds between x
and y iff x is the Gdel number of a proof in K
of the wff with Gdel number y, is a recursive
relation.
4. Because every recursive relation is expressible
in K, Pf is expressible in K. Hence there is some
wff Pf [x1 , x2 ] such that, for all natural numbers
k1 and k2 :
(4a) If Pf holds for hk1 , k2 i, then
Definition: For a given closed wff A in a system
`K Pf [k1 , k2 ].
K, A is called an undecidable sentence iff neither
(4b) If Pf does not hold for hk1 , k2 i, then
`K A , nor `K A .
`K Pf [k1 , k2 ].
5. Consider the following wff:
Within a given interpretation, every closed
wff is either true, or its negation is true. Because S aims to capture everything that is
true in the standard interpretation, it could
be complete in Gdels sense only if it is complete in Posts sense, because to capture all
truths, for every closed wff A , it must capture either A or A , depending on which
is true in the standard interpretation.
Unfortunately S is complete in neither sense,
because Gdel showed that any theory similar to S has undecidable sentences.
G.
(y) Pf [y, x]
Because every recursive function is representable in K, the Fixed-Point Theorem is applicable to the above wff, and hence, there is a
closed wff , which well call G , such that:
(5a) `K G (y) Pf [y, pG q]
In effect, G is equivalent to the assertion that no
natural number is the Gdel number of a proof
of G in K, i.e., G asserts that it is not provable.
6. We must now prove that G is an undecidable
sentence of K. Let q be the Gdel number of G .
7. We will first show that it is not the case that
`K G by reductio.
(7a) Assume that `K G .
(7b) Then there must be some proof of G in K.
This proof must have a Gdel number, r.
98
Proof:
Hence Pf(r, q).
They have the features necessary for the applicaBy (4a) `K Pf [r, q].
The Gdel number of G is q, so q is pG q. bility of Gdels theorems.
Hence, `K Pf [r, pG q].
But, by (7a) and (5a), `K (y) Pf [y, pG q].
Hence `K Pf [r, pG q].
Corollary: Any theory K to which the above theBy (7h) and (7f), K is inconsistent, contra- orem applies, with the same syntax as S and RR
dicting (2) above.
and the same intended semantics as S and RR
8. Hence it is not the case that `K G . This means
(including S and RR themselves) is also incomthat no natural number is the Gdel number of a
plete in Gdels sense.
proof of G in K. Hence, for all natural numbers
n, the relation Pf does not hold for hn, qi.
9. From (8) and (4b), we can conclude that:
Proof:
(9a) For all natural numbers n, `K Pf [n, q].
For every undecidable sentence, either it or its
(9b) The Gdel number of G is q, so q is pG q.
negation is true in the standard interpretation, and
(9c) Hence, (9a) means that for all natural numhence there are sentences that are true in the stanbers n, `K Pf [n, pG q].
dard interpretation, but are not theorems of K. e
(9d) Because K is -consistent, from (9c) we
can infer that 0K (y) Pf [y, pG q].
In particular, the Gdel sentence G of K is true in
10. We now show that it is not the case that `K G , the standard interpretation but is not a theorem of
again by reductio.
K. As we have just seen, for K, neither `K G nor
(10a) Assume that `K G .
`K G . Since G is closed, either G or G must be
(10b) By (5a) and (10a), `K (y) Pf [y, pG q]. true in the standard interpretation. However, since
(10c) This abbreviates to `K (y) Pf [y, pG q]. G asserts its own unprovability, and, in fact, G is
(10d) But (10c) contradicts (9d).
not provable in K, we can conclude that G is true.
11. Hence neither `K G nor `K G . Since G
Notice that the Gdel sentence G of some apis closed, G is an undecidable sentence of K. plicable theory K is a wff written entirely the synQED.
e tax of S. Interpreted with the standard interpretation, it a sentence about natural numbers, built entirely out of the symbols 0, 0 , +, , =, bound
variables and logical signs. Moreover, it is true.
Corollary: All theories to which Gdels first
Hence, it seems that not all truths of arithmetic
theorem applies has are incomplete in Posts
can be captured in any recursively axiomatizable,
sense.
-consistent theory.
We can consistently add the Gdel sentence G
of some theory K to that theory as a new axiom,
to obtain the theory KG . Since KG has a different
Proof:
All have at least one decidable sentence, and hence axiom set from K, the number-theoretic property
do not fall under this definition of completeness.e PrAx will be different for KG from what it was
for K, but it will still be recursive, and hence the
relation Pf will also be different but still be recursive. Hence, there will be a different wff Pf [x, y]
Corollary: Systems S and RR have undecidable
that expresses the new Pf-relation, and a different
sentences, and hence, are incomplete in Posts
Gdel sentence G , different from G , which is an
sense.
undecidable sentence of KG . We can continue the
adding all we like one by one; well never achieve
completeness.
(7c)
(7d)
(7e)
(7f)
(7g)
(7h)
(7i)
99
3.
4.
5.
Proof:
1. Assume that K is such a theory. So all
recursive functions and relations are representable/expressible in K. Hence the relation Pf
is expressible in K and the function Neg (whose
value, for any Gdel number of a wff, is the
Gdel number of the negation of that wff) is
representable in K. Let the wff that expresses
Pf be Pf [x1 , x2 ], and the let the wff that represents Neg be Neg [x, y]. Hence, for all natural
numbers k1 and k2 :
(1a) If Pf holds for hk1 , k2 i, then `K
Pf [k1 , k2 ].
(1b) If Pf does not hold for hk1 , k2 i, then
`K Pf [k1 , k2 ].
(1c) If Neg(k1 ) = k2 then `K Neg [k1 , k2 ].
(1d) `K (1 y) Neg [k1 , y].
2. Consider the following open wff, hereafter abbreviated as E [x]:
(z)(Pf [z, x] (y)(Neg [x, y]
(z1 )(z1 z Pf [z1 , y])))
6.
This wff says that, for all z, z is the Gdel number of a proof in K of the wff with Gdel number
x only if, for any number y that is the Gdel
100
H.
Churchs Thesis
101
I.
Recall that the wff Pf [x1 , x2 ] expresses the relation Pf that holds between x1 and x2 just in case
x1 is the Gdel number of a proof of the wff with
Gdel number x2 .
Abbreviation: We shall now introduce the following new abbreviation:
Bew [x]
is shorthand for
(y) Pf [y, x]
While this wff does not express in S the property of being the Gdel number of a theorem of S,
this is its meaning in the standard interpretation.
(This abbreviation derived from the German word
beweisbar, meaning provable.)
Definition: The Hilbert-Bernays derivability
conditions are the following three results, for any
wffs A and B:
(HB1) If `S A , then `S Bew [pA q].
(HB2) `S Bew [pA Bq]
(Bew [pA q] Bew [pBq])
(HB3) `S Bew [pA q] Bew [pBew [pA q]q].
Similar results hold not only for S, but for any recursively axiomatizable extension of S.
For homework, you will prove (HB1). It follows
fairly easily from the fact that P f [x1 , x2 ] expresses
Pf in S. (HB2) and (HB3) are more difficult to prove,
but follow in a similar way.
3.
4.
5.
6.
C `L ? E
`L ? C E
`L ? C
`L ? E
(Premise)
1 def. C
1, 2 MP
3 DT
4 def. C
4, 5 MP
Proof:
(1) Assume that `S Bew [pA q] A .
(2) If the wff A is closed, then the wff Bew [x]
A has exactly one free variable. Hence, by the
Fixed-Point theorem, there is some wff L such
that:
(2a) `S L (Bew [pL q] A )
Notice that L asserts of itself that if it is provable, then A is true. Assume for a conditional
proof that L is provable. Because of what
L says, it follows that if it is provable, then
A holds. Weve assumed that it is provable.
Hence, A holds. Discharging the assumption,
if L is provable, then A holds. But this is
what L says. Our conditional proof is a proof
of L . Hence, L is provable, and so is A .
(3) Making this more formal:
1. `S L (Bew [pL q] A )
(2a), SL
2. `S Bew [pL (Bew [pL q] A )q]
1, (HB1)
3. `S Bew [pL q]
Bew [pBew [pL q] A q] 2, (HB2), MP
4. `S Bew [pBew [pL q] A q]
(Bew [pBew [pL q]q] Bew [pA q]) (HB2)
5. `S Bew [pL q] (Bew [pBew [pL q]q]
Bew [pA q])
3, 4 SL
6. `S Bew [pL q] Bew [pBew [pL q]q]
(HB3)
7. `S (Bew [pL q]
(Bew [pBew [pL q]q] Bew [pA q]))
((Bew [pL q] Bew [pBew [pL q]q])
(Bew [pL q] Bew [pA q]))
(A2)
8. `S Bew [pL q] Bew [pA q]5, 6, 7 MP2
9. `S Bew [pA q] A
Assumed at (1)
10. `S Bew [pL q] A
8, 9 SL
Proof:
Immediate by the right-to-left half of the biconditional, and Lbs theorem.
e
Since S is a true arithmetical theory, H is true in
the standard interpretation, despite the intuition
that H could just as easily have been disprovable.
Lbs theorem also leads to the result that the
consistency of S cannot be proven in S itself, even
though there is a wff of S whose meaning in the
standard interpretation is that S is consistent.
The result that Peano Arithmetic, or any extension thereof, cannot be used to prove its own consistency, was one of the original incompleteness
results first proved by Gdel in 1931. Although,
Gdel proved this result in a different way, Lbs
theorem provides us with a fairly easy proof of this
result.
Abbreviation: Let Con S be an abbreviation for
the following closed wff of system S:
(x) (y) (Neg [x, y] Bew [x] Bew [y])
Bearing in mind that Neg [x, y] represents the function Neg(x), whose value for a given Gdel number
of a wff as argument, is the Gdel number of the
103
While Con S seems to make a metatheoretic assertion about the system S, taken with the standard interpretation it is simply an assertion about
numbers and their arithmetical properties. It is yet
another example of a truth of arithmetic that Peano
arithmetic fails to capture. Hence, this too shows
that system S is incomplete.
As with Gdels first incompleteness result,
Result: If S is consistent, then 0S Con S .
adding additional axioms, even Con S itself, will
(Gdels Second Incompleteness Theorem)
not yield a complete system. Let us consider the
system S* obtained from S by adding Con S as an
axiom. While it is easily shown that S* is consistent (at least if S is consistent), there will then be
Proof:
1. Assume S is consistent, and assume for reductio a different wff Con S that, for similar reasons, will
not be a theorem of S*.
that `S Con S .
This also shows that there are limitations to
2. Since `S 0 6= 1, by (HB1), we have
the extent to which S (or any other consistent sys`S Bew [p0 6= 1q].
tem) can properly be used for the metalanguage in
3. By UI on Con S , we get
which to conduct its own metatheory.
`S (Neg [p0 = 1q, p0 6= 1q]
q
p
q
p
In fact, there are no closed wffs A for which
Bew [ 0 = 1 ] Bew [ 0 6= 1 ]).
4. Because Neg [x, y] represents the Neg function, it is provable in S that A is not a theorem of S.
(This can be seen by careful reflection on steps of
we have in S that `S Neg [p0 = 1q, p0 6= 1q].
the proof of Gdels second theorem.) While S can
5. By (2), (3), (4) and SL we get that
be used to prove of itself that certain sentences
`S Bew [p0 = 1q].
q
p
are theorems, it cannot be used to prove that any
6. By (A1), `S Bew [ 0 = 1 ]
(0 6= 1 Bew [p0 = 1q]). sentences arent theorems.
The last point is actually the same as the point
7. By (5) and (6), `S 0 6= 1 Bew [p0 = 1q].
that
Bew [x] does not express in S the property of
8. By transposition on (7), we get:
being a theorem of S, which well discuss further
`S Bew [p0 = 1q] 0 = 1.
below.
9. By (8) and Lbs theorem, it follows that
`S 0 = 1!
10. Since `S 0 6= 1, this means that S is inconsistent,
contrary to our hypothesis. The assumption J. Recursive Undecidability
that `S Con S must be mistaken.
e
Definition: An axiomatic system K is said to be
recursively decidable iff the following numberA similar result will hold for any extension of S,
theoretic property is recursive:
or generally, for any system with a recursive axiom set in which all recursive relations/functions TK (x) : x is the Gdel number of a theorem of K.
are expressible/representable, and for which the
(If a system is not recursively decidable, then it is
Hilbert-Bernays derivability conditions hold.
said to be recursively undecidable.)
We might put it this way: if a given axiomatic
system for number-theory is sufficiently strong, The notion of a recursively decidable system should
then if it is consistent, it cannot be used to prove not be confused with the notion of a decidable senits own consistency.
tence. A system can be recursively decidable while
Precisely because S is (we hope!) consistent, nevertheless having undecidable sentences.
Con S , is true in the standard interpretation. HowNotice that a theory can be recursively axiomaever, it is not a theorem of S.
tizable without being recursively decidable.
negation of that wff, the above wff in effect says
that it is not the case of any wff that both it and
its negation are provable. Assuming that S is consistent, Con S , is true in the standard interpretation.
However, it is not a theorem of S.
104
Proof:
1. Assume that K is a theory with the characteristics above, and assume for reductio that K is
recursively decidable.
2. Then, the number-theoretic property TK is recursive.
3. Because all recursive number-theoretic relations
are expressible in K, TK is expressible in K by
some wff T [x]. By the definition of expressibility, for all natural numbers n:
(3a) If TK holds for n, then `K T [n].
(3b) If TK does not hold for n, then `K T [n].
4. The above leads to a Gdel-like sentence, asserting its own unprovability. However, when
constructed using T rather than Bew , with (3b),
this will lead to an inconsistency.
5. By the Fixed-Point Theorem, there is some
closed wff W such that:
Proof:
By the above, TS is not recursive, and a number
theoretic property is expressible in S if and only if
it is recursive.
e
The wff Bew [x] means that x is the Gdel number of a theorem of S, but it does not express that
property. A principle similar to (3b) does not hold
for Bew . Otherwise, since G is not a theorem of S,
we would be able to prove that
`S Bew [pG q]
and hence G itself, and S would be inconsistent.
105
Result (Tarskis Theorem): The number-theoretic property Tr, which holds of a given natural
number x iff x is the Gdel number of a wff
that is true in the standard interpretation, is not
arithmetical.
K.
Churchs Theorem
Proof:
1. Assume that K* is obtained from K by adding the
particular wffs A1 , . . . , An as axioms. Assume
that K* is recursively undecidable, but assume
for reductio that K is recursively decidable.
2. Because K and K* have the same syntax, the
wffs A1 , . . . , An may be used as hypotheses in
K.
3. It is obvious that for every wff E ,
{A1 , . . . , An } `K E iff `K E .
107
Proof:
1. Assume for reductio that PF is recursively decidable, i.e., that the number-theoretic property
TPF is recursive.
2. Consider now the system PS , the predicate calculus in the language of arithmetic. This is
the system that has the same syntax as S and
RR but does not contain any proper axioms.
Hence its only axioms are the instances of axiom
schemata (A1)(A5) that contain no constants
other than 0 (a), no predicate-letters other
than = (I 2 ), and no function signs other than
0 (f 1 ), + (f12 ), and (f22 ).
108
A , B, C , etc., 3
, ,
/ 4, 59
{. . .}, 4
, , 4
, , , 4
, 4
h. . .i, 4
, n , 4
[A]R , 5
=, 5, 60
0 , 5
, , , , , 8, 16
, , 10, 3334
2, 11
|, , 14
`, 18, 39
, 29
, 30
A [x], 31
(X)M , 31
s(t), 33
6, 36
` , 42
g( ), 43, 8889
=, 6=, 51
(n x), 51
= , 52
0
, +, , 0, 56
{x|A [x]}, 59
n, 63
<, >, , , , , , , 64
t|u, 67
CR , 70
GF , 71
Uin , 69, 71
Z(x), 71
y(. . .), 72
x
, , 75
x<y x<y
zz<y (. . .), 76
, 79
f #, 79
, Bt, 82
pA q, 95
Pf , 98
Neg , 100
Bew , 102
aleph null/naught, 5
argument places, 29
arithmetical property, 106
arithmetization (of syntax), 87
atomic formula, 29
axiom, 17, 39
axiom schema, 17
axiomatic systems, 17
base step, 6
beta function, 82
binary, 4, 28, 29
bound occurrence, 30
bound variable, 30
bounded -operator, 76
bounded product, 74
bounded sum, 74
cardinality/cardinal number, 5
Cartesian product, 4
characteristic function, 70
choice of least rule, 72
Churchs theorem, 107
109
length, 77
liar paradox, 107
Lindenbaum extension lemma, 44
Lbs paradox, 102
Lbs theorem, 103
logical axiom, 17, 39
logical consequence, 10, 34
logically equivalent, 10, 34
logically imply, 10, 34
logically true, 34
logically valid, 34
logicism, 61
mathematical induction, 6, 58
maximal, 44
member, 4
metalanguage, 2
metalinguistic variables, 3
metatheory, 1
method of infinite descent, 67
model, 31
model for, 34
modeling lemma, 49
modus ponens, 7, 39
monadic, 28, 29
n-place function, 5
n-place operation, 5
n-place relation, 4
n-tuple, 4
natural deduction, 16
negation of a relation, 75
Nicod system, 27
normal model, 52
null set, 4
number-theoretic function, 68
number-theoretic relation, 68
numeral, 63
object language, 2
-consistent, 96
one-one function, 5
open formula, 30
operation, 5
operator, 8
order, 64
ordered n-tuple, 4
ordered pair, 4
overbar, 63
parentheses conventions, 9, 29
Peano arithmetic, 56
Peano axioms/postulates, 56
Peirce dagger, 14
predicate calculus, 39
predicate calculus with identity, 51
predicate letter, 28
primitive recursive, 72
projection function, 69, 71
proof, 17
proof induction, 6
proper axiom (RR), 94
proper axiom (S), 57
proper subset, 4
proper subtheory, 94
property, 4
propositional connective, 8
pseudo-derivability, 42
pure predicate calculus, 39
range, 5
recursion, 72
recursive axiom set, 91
recursive function, 72
recursive property/relation/set, 72
recursive vocabulary, 89
recursively axiomatizable, 101
recursively decidable, 104
reflexive, 5
relation, 4, 68
relative complement, 4
representable, 69
restricted -operator, 72
Robinson arithmetic, 94
Russells paradox, 60
satisfaction/satisfy, 33
satisfiable, 10, 34
schematic letter, 3
Schmdel number, 85
schmtingent, 26
schmuth tables, 25
schmuth-value assignment, 25
scope, 30
select, 26
self-contradiction, 10
111
semantic tree, 36
sentence, 30
sequence, 32
set, 4
Sheffer stroke, 14
Sheffer/Peirce dagger, 14
signum, 74
singleton, 4
Skolem-Lwenheim theorem, 50
smaller, 5
sound/soundness, 22
standard model for S, 57
statement letter, 8
strong induction, 6
strongly representable, 69
subset, 4
substitution, 71
subtheory, 94
successor, 57, 71
symmetric, 5
syntax, 8
system ,, 85
system F, 59
system L, 17
system N , 106
system PF, 39
system PF= , 51
system PP, 39
system PS , 108
system RR, 94
system S, 56
universal, 44
universe of discourse, 31
urelement, 4
use and mention, 2
valid, 34
variable, 28
variable assignment, 32
well-formed formula (wff), 8, 15, 29, 85
wff induction, 6
zero function, 69
112