Equational Logic 1 (Tourlakis) - Paper
Equational Logic 1 (Tourlakis) - Paper
George Tourlakis
November 2, 1998
Abstract
We present two different formalizations of Equational Predicate Logic, that is, first order logic
that uses Leibniz’s substitution of “equals for equals” as a primary rule of inference.
We prove that both versions are sound and complete. A by-product of this study is an
alternative proof to that contained in [GS3], that the “full” Leibniz rule is strictly stronger
than the “no-capture” Leibniz rule, this result obtained here for a complete Logic.
We also show that under some reasonable conditions, propositional Leibniz, no-capture
Leibniz, and a full-capture version are all equivalent, provided that the latter is restricted to
act on universally valid premises whenever capture is allowed.
Introduction.
colleagues at York University who, like the author, are teaching “Discrete Math”
using the [GS1] text).
We aim in this paper to provide a precise syntax and semantics of a first
order language and classical logic, as faithfully as possible to the spirit of the
Equational Logic outlined in [GS1], and then to carefully develop the metatheory
to the point that soundness can be derived in a self contained manner. We
also address (and prove) completeness by showing that the Equational Logic
of [GS1]—as it is systematically reincarnated here—is an extension of some
standard first order logic (our “standard” yardstick is the approach taken in
Shoenfield [Sh]).
As a by-product, we have reduced the very large number of Predicate Logic-
specific axiom schemata offered in [GS1] (Chapters 8 and 9) to just two, and
have reduced the additional “Leibniz” rule that is proposed in [GS1]—(8.12)
in Chapter 8—to a derived rule status (with a correction). Naturally, these
reductions will make Equational Calculus even more attractive, as they will
reduce the burden placed on memory of both students and instructors—not to
mention that it makes the metamathematics much easier.
Formalizing an informal logic is not easy. One has to try to understand what
the “intentions” of the informal version were, and implement them faithfully
through the syntactic apparatus. Unfortunately, it is in the nature of “informal
intentions” to be vague.
In this instance we inferred, partly from the practices in [GS1] and partly
from those in [GS3] the following two important points, which shaped our for-
malization (sections 1 and 2):
(2) Should we, or should we not, restrict the rule “Leibniz” so that no “capture”
of object variables occurs during its application?
† Obviously, one cannot, e.g., “prove p ∨ q from p”, if every rule is undefined except when
it applies to universally valid premises. With such rules, all theorems have to be universally
valid—and p ∨ q is not.
1. Syntax 3
The layout of the paper is as follows: Section 1 introduces the formal lan-
guage. Section 2 builds a theory on top of this language (we refer to any such
theory as an E-theory or E-logic). Section 3 introduces a host of metatheorems—
that we will use towards the metatheoretical results of sections 4, 5 (on sound-
ness and completeness, respectively) and 6—including the Deduction Theorem.
The latter is heavily used in section 6, where the equivalence between our version
of E-logic and the “original” version of [GS1] is proved. Most of the technical
apparatus developed in this paper was aimed at making the proofs in section 6
“accessible”. The “main lemma” in sections 4–5 is lemma 5.3 that shows the
eliminability of propositional variables. In section 7 we look at an alternative
formalization, based on a weaker form of the Leibniz rule that avoids object
variable “capture”. In the course of the discussion it emerges that “no-capture
Leibniz” is equivalent to a “propositional” version of Leibniz and to a full cap-
ture version—which restricts its premises to be provable with no nonlogical
axioms in those cases that capture occurs. It is also seen that either of these
formulations is strictly weaker than “full” Leibniz (Inf2 of section 2) providing
a simple alternative proof to a result of [GS3].
A word on the proof of the Deduction Theorem is pertinent here. We opted to
do it the “hard way”, that is, do it within the Equational Calculus setting rather
than in an equivalent “standard” calculus.‡ Handling the “Leibniz” rule in the
present setting required a few lemmata and a second induction (on formulas)
within the proof, making it uncharacteristically long (compare with the proof
in [Sh]). This anecdote fortifies our belief that the Equational Calculus is best
suited to do proofs, rather than study them.
1. Syntax
We think of a first order language, L, as triple L = (V, Term, Wff), where V
is the alphabet, i.e., the set of basic syntactic objects (symbols) that we use to
built “terms” and “formulas”. We start with a description of V , and then we
describe the set of terms (Term) and the set of formulas (Wff).
Alphabet
† One cannot gloss over this by “dummy renaming”. Interestingly, in [GS3] it is stated that
the Leibniz in [GS1] avoids capture. This must be a typo in [GS3], given the “evidence” to
the contrary.
‡ This would necessitate a metaproof of that equivalence, and in view of the presence of
propositional variables and the rule of substitution in [GS1]—none of which are present in, say,
[Sh, En, Men, Man, Ba]—would lengthen our exposition in a different, unnecessary, direction.
4
4. Brackets, ( and ).
8. A set of symbols (possibly empty) for constants. We normally use the meta-
symbols a, b, c, d, e, with or without subscripts, to stand for constants unless
we have in mind some alternative “standard” notation in selected areas of
application of the 1st order logic (e.g., ∅, 0, ω, etc.).
9. A set of symbols for predicates or relations (possibly empty) for each possible
“arity” n > 0. We normally use P, Q, R with or without primes to stand for
predicate symbols.
10. Finally, a set of symbols for functions (possibly empty) for each possible
“arity” n > 0. We normally use f, g, h with or without primes to stand for
function symbols.
1.1 Remark. Any two symbols mentioned in items 1–10 are distinct. More-
over (if they are built from simpler “sub-symbols”, e.g., x1 , x2 , x3 , . . . might
really be x|x, x||x, x|||x, . . . ), none is a substring (or subexpression) of any other.
Terms and Formulas
† In [GS3], while substitution is dropped in favour of adopting axiom schemata throughout,
propositional variables are retained, presumably to facilitate statement and application of the
Leibniz rule.
1. Syntax 5
1.2 Definition. (Terms) The set of terms, Term, is the ⊆-smallest set of
strings or “expressions” over the alphabet 1–10 with the following two proper-
ties:
Any of the items in 1 or 8 (a, b, c, x, y, z, etc.) are in Term.
If f is a function‡ of arity n and t1 , t2 , . . . , tn are in Term, then so is the
string f t1 t2 . . . tn .
1.3 Definition. (Atomic Formulas) The set of atomic formulas, Af, contains
precisely:
1) The symbols true, false, and every Boolean variable (that is, p, q, . . . ).
2) The strings t = s for every possible choice of terms t, s.
3) The strings P t1 t2 . . . tn for every possible choices of n-ary predicates P (for
all choices of n > 0) and all possible choices of terms t1 , t2 , . . . , tn .
a) Af ⊆ Wff.
b) If A, B are in Wff, then so are (A ≡ B), (A ⇒ B), (A ∧ B), (A ∨ B).
c) If A is in Wff, then so is (¬A).
d) If A is in Wff and x is any object variable (which may or may not occur (as
a substring) in the formula A), then the string ((∀x)A) is also in Wff.
We say that A is the scope of (∀x).
has now become “bound”. Of course, notwithstanding the notation A[x] (which
only says that x may occur in A), x might actually not be a substring of A.
In that case, intuitively, ((∀x)A), ((∃x)A) and A “mean” the same thing. This
intuition is actually captured by the axioms and rules of inference, as we show
in section 3 (3.7).
(4) In [GS1] the preferred metanotation for ((∀x)A) and ((∃x)A) is (∀x| :
A) and (∃x| : A), respectively. They also have special (meta) notation for
“bounded” quantification, that is (∀x|B : A) and (∃x|B : A) stand for ((∀x)(B ⇒
A)) and ((∃x)(B ∧ A)) respectively. In [GS1] the “B” part is called the “range”
of the quantification.
(5) To minimize the use of brackets in the metanotation we adopt standard
priorities, that is, ∀, ∃, and ¬ have the highest, and then we have (in decreasing
order of priority) ∧, ∨, ⇒, ≡. All associativities are right (this is in variance with
[GS1], but is just another acceptable—and common—convention of how to be
sloppy in the metalanguage, and get away with it).
(6) The language just defined, Wff, is one-sorted, that is, it has a single sort
or type of object variable. If one wants to admit several sorts, e.g., i1 , i2 , . . . for
variables intended to take integer values upon interpretation, r1 , r2 , . . . intended
to take real values, b1 , b2 , . . . intended to take Boolean values (true or false),
etc., then we need a more elaborate definition of syntax.
We will need a different quantifier symbol for each sort, say, ∀i , ∀r , ∀b , etc.,
and for each sort a dedicated “equality” predicate (=i , =r , =b , etc.). Moreover,
function symbols will have a sort or type associated with them (which will be
the type of their right field in the intended interpretation). This will induce
sorts (types) to all terms.
In the end of the day, we will turn around and use metanotation, suppressing
the sort-subscripts (exactly as [GS1] do) and will let the context fend for the
various (intended) sorts in the applications.
For this reason we do not burden the formal definition with many sorts. This
deliberate omission will not restrict the applicability (to the multi-sorted case)
of the few proof-theoretic and model-theoretic results included here.
It should be stated at once that should we feel the need to employ a Boolean
object variable type explicitly, b1 , b2 , . . . , we would not allow the bi to be the
same as the v1 , v2 , . . . . The former are terms, the latter are (atomic) well formed
formulas.
2. Axioms and Rules of Inference
The axioms and rules of inference will be chosen from [GS1]. We will not allow
schemata in the so-called propositional (or Boolean) axioms, in order to agree
with the present [GS1] edition. This decision will necessitate the presence of a
substitution rule among the rules of inference. We will deviate somewhat in the
choice of axioms outside the propositional domain, to maximize convenience. In
particular, we will only adopt one among the proposed axioms in [GS1] (Ax2
2. Axioms and Rules of Inference 7
below), and one easy to memorize theorem that occurs in [GS1] (namely, the
“standard” axiom-schema of instantiation or specialization) as our axiom Ax3.
This apparatus will be sufficient to ensure completeness as we will show.
2.2 Remark. (1) In any formal setting that introduces many-sorts explicitly
in the syntax, one will need as many versions of Ax2–Ax5 as there are sorts.
An alternative formal approach is to have only one sort of object variable but
introduce one-place predicates Di , Dr , Db , etc., in the alphabet—one for each
sort—with the intended meanings: Di (x) “says” that x is an integer, Db (x)
“says” that x is boolean, etc.
Thus, instead of, say, (∀b x)A one would write (∀x)Db A—which is a short
form of (∀x)(Db (x) ⇒ A)—etc.
(2) Axioms Ax4–Ax5 characterize equality between “objects”. [GS1], while
they employ this type of “Leibniz” in their exposition, they do not seem to
actually formally adopt the point of view of Predicate Calculus with equality.
In any case, adding these two axioms makes this system (explicitly) applicable
to mathematical theories such as number theory and set theory, and does not
in any way burden what we would like to outline here. Indeed, we will only use
these axioms in section 6.
(3) In Ax3 and Ax5 we imposed the condition that t must to be “substi-
tutable” in x. I.e., it should not contain free variables that can be “captured”
by quantification, as this would distort and invalidate the formula.
The following example is often given in the literature in order to illustrate
what might happen after a mindless substitution:
Take A to stand for (∃y)¬x = y. Then (∀x)A[x] ⇒ A[y] is
(∀x)(∃y)¬x = y ⇒ (∃y)¬y = y
and x = y ⇒ A ≡ A[y] is
x = y ⇒ (∃y)¬x = y ≡ (∃y)¬y = y
2.4 Remark. (1) Let P be any 1-ary (“unary”) predicate and a any constant.
Clearly, (∀x)(P x ⇒ P x) ⇒ P a ⇒ P x is not universally valid, as an interpreta-
tion over N with P x translated to say “x is even” and a to be “the” 0 shows.
Indeed a falsifying instance would be (∀x)(P N x ⇒ P N x) ⇒ P N 0 ⇒ P N 1, where
P N denotes the “concrete” interpretation of P over N that we have just sug-
gested.
Yet, (∀x)(P x ⇒ p) ⇒ P a ⇒ p is universally valid. This explains the
“condition” for the substitution rule.
(2) Leibniz for formulas is the primary rule in [GS1] that supports “equa-
tional reasoning”. In standard approaches to logic it is not a primary rule, rather
it appears as the well known “derived rule” (metatheorem) that if Γ ` A ≡ B †
and if we replace one or more occurrences of the subformula A of a formula D
(here D is C[p := A]) by B, to obtain D0 (that is C[p := B]), then Γ ` D ≡ D0 .
Shoenfield [Sh] calls this derived rule “the equivalence theorem”.
(3) [GS1] use “=” for “≡” in contexts where they want the symbol to act con-
junctionally, rather than associatively, e.g., in successive steps of an equational-
style proof.
† The
meaning of the symbol ` is defined in 2.5.
10
We next define Γ-theorems, that is, formulas we can prove from the set of
formulas Γ (this may be empty).
Th1. ThmΓ contains as a subset the closure under Inf1† of: All the axioms in
the group Ax1 and all the instances of axiom schemata Ax2–Ax5.
2.6 Remark. The restriction in the application of Inf1 means that we use it
simply to eliminate propositional variables, in particular to turn the axioms of
group Ax1 to axiom-schemata. We also, in effect, eliminate its anachronistic
presence from our logic, by “applying it up in front” in the course of a deduction.
See also 3.24.
2.7 Definition. (Γ-proofs) A finite sequence A1 , . . . , An of members of Wff is
a Γ-proof iff every Ai , for i = 1, . . . , n is one of
Pr2. A member of Γ.
† “The closure of a set S under an operation O” is the ⊆-smallest set containing the members
` A ≡ A ≡ true (2)
Γ ` A ≡ A ≡ true (3)
Γ ` A ≡ true.
Proof.
(∀x)true
= h“Leibniz-rule” and ` true ∨ true ≡ truei
(∀x)(true ∨ true)
= hAx2i
true ∨ (∀x)true
= hby ` true ∨ A ≡ truei
true
3.3 Metatheorem. For any Γ, any formula A and any choice of object vari-
able x, Γ ` A iff Γ ` (∀x)A.
Proof.
(→) We assume that Γ ` A. Thus,
(∀x)A
= h“Leibniz-rule” and Metatheorem 3.1i
(∀x)true
= hTheorem 3.2i
true
(←) We assume that Γ ` (∀x)A. By Ax3, ` (∀x)Ax ⇒ A[x := x], that is,
` (∀x)A ⇒ A, hence Γ ` (∀x)A ⇒ A.
x is substitutable in x always.
By the hypothesis and modus ponens (a valid derived rule), we obtain Γ `
A.
3.4 Corollary. (“Strong” Generalization) For any formula A, A ` (∀x)A.
A[x1 , x2 ]
hgeneralizationi
(∀x1 )A[x1 , x2 ]
hspecialization; x1 := zi
A[z, x2 ]
hgeneralizationi
(∀x2 )A[z, x2 ]
hspecialization; x2 := wi
A[z, w]
hNow z := t1 , w := t2 , in any order,
is the same as “simultaneous substitution”i
hgeneralizationi
(∀z)A[z, w]
hspecialization; z := t1 i
A[t1 , w]
hgeneralizationi
(∀w)A[t1 , w]
hspecialization; w := t2 i
A[t1 , t2 ]
3.6 Theorem. For any formulas A, B such that the object variable x is not
free in A, ` A ⇒ (∀x)B ≡ (∀x)(A ⇒ B).
3.7 Corollary. For any formulas A in which the object variable x is not free,
` A ≡ (∀x)A.
14
A ⇒ (∀x)A
= hby 3.6i
(∀x)(A ⇒ A)
= h` A ⇒ A ≡ true and Leibnizi
(∀x)true
= hby 3.2i
true
The above Corollary fulfils the promise included in remark 1.5(3). The case for
the metasymbol ∃ follows trivially by a sequence of equivalences:
(∃x)A
hformalizing the “text” (∃x)Ai
¬(∀x)¬A
= hby 3.7 and Leibniz applied to ¬pi
¬(¬A)
=
A
3.9 Corollary. For any formulas A, B such that the object variable x is not
free in A, Γ ` B ⇒ A iff Γ ` (∃x)B ⇒ A.
3. Some Metatheorems (and the occasional theorem) 15
NB. We often write this (under the stated conditions) as ` (∀x)A[x] ≡ (∀z)A[z].
Proof. We know that ` (∀x)A ⇒ A[x := z] (Ax3), since z is substitutable in x
under the stated conditions. Thus, by 3.3,
(∀z) (∀x)A ⇒ A[x := z]
= hby 3.6, since z is not free in (∀x)A)i
(∀x)A ⇒ (∀z)A[x := z]
Noting that x is not free in (∀z)A[x := z] and is substitutable in z (in A[x := z]),
we can repeat the above argument to get ⇐.
Pri1. Atomic
Let P denote the set of all prime formulas in our language. Clearly, P contains
each propositional variable v1 , v2 , . . . .
That is, a prime formula has no “explicit” propositional connectives (in the case
Pri2 any connectives are hidden inside the scope of (∀x)).
Clearly, A ∈ Wff iff A is a Propositional Calculus formula over P (i.e,
propositional variables will be all the strings in P − {true, false}).
3.12 Definition. (Tautologies in Wff) A formula A ∈ Wff is a tautology iff it
is so when viewed as a Propositional Calculus formula over P.
We call the set of all tautologies, as defined here, Taut. The symbol |=Taut A
says A ∈ Taut.
Whatever power we have got stems directly from the axioms and the rules of inference, and
the definition of Γ-theorem or Γ-proof. All else that we add is just “convenience”.
16
While a definition for an infinite set of premises is possible, we will not need it
here.
3.14 Metatheorem. (Post’s Tautology Theorem) If A ∈ Wff is a tautology in
the sense of 3.12, then ` A with a proof that uses only (substitution instances—
under Inf1—of ) axioms in the list Ax1.
Proof. First, we note the following equivalences. (In the course of this proof,
p, q, r stand for any member of P − {true, false}, not just for propositional
variables of our language.)
|=Taut true ≡ ¬p ∨ p, and also ` true ≡ ¬p ∨ p
|=Taut false ≡ ¬p ∧ p, and also ` false ≡ ¬p ∧ p
|=Taut p ⇒ q ≡ ¬p ∨ q, and also ` p ⇒ q ≡ ¬p ∨ q
|=Taut p ∧ q ≡ ¬(¬p ∨ ¬q), and also ` p ∧ q ≡ ¬(¬p ∨ ¬q)
|=Taut (p ≡ q) ≡ ((p ⇒ q) ∧ (q ⇒ p)), and also ` (p ≡ q) ≡ ((p ⇒ q) ∧ (q ⇒ p))
where `, everywhere above, is achieved by staying within the axiom group Ax1.
Thus, if we transform A into A0 by applying any sequence of the above
equivalences to eliminate all occurrences of true and false and all the connectives
except ¬ and ∨, then we have, on the one hand, that |=Taut A0 and on the other
hand (by the Leibniz rule) that ` A ≡ A0 .
Thus, by equanimity, it suffices to prove ` A0 .
A better way to say all this is that, without loss of generality, we assume that
the only connectives in A are among ∨ and ¬ and that the constants true and
false do not occur.
Moreover, since ` A ∨ A ≡ A, we may assume without loss of generality that
A is a string A1 ∨ · · · ∨ An with n ≥ 2, so that none of the Ai is a formula C ∨ D.
We are assuming metanotational abbreviations when it comes to bracketing.
Let us call an Ai reducible iff it has the form ¬(C ∨ D) or ¬(¬C). Otherwise
it is irreducible. Thus, the only possible irreducible Ai have the form p or ¬p
(where p ∈ P − {true, false}). We say that A is irreducible iff all the Ai are.
We define the reducibility degree of Ai to be the number of ¬ or ∨ connectives
in it, not counting a possible leftmost ¬. The reducibility degree of A is the sum
of the reducibility degrees of all its Ai .
So let |=Taut A, where A is the string A1 ∨ · · · ∨ An , n ≥ 2, where none of
the Ai is a formula C ∨ D, and prove (following Shoenfield) by induction on the
reducibility degree of A that ` A.
3. Some Metatheorems (and the occasional theorem) 17
Proof. NB. This statement is the same as that in definition 2.5, with a difference
only in the third clause (compare (iii) here with Th3 in 2.5).
\Γ the set inductively defined here, and show
For a proof, let us call Thm
\ Γ = ThmΓ .
that Thm
(⊆) We do induction on Thm \ Γ . We note that ThmΓ satisfies
\ Γ (compare Th1–Th2 with
(1) it contains the same initial objects as Thm
(i)–(ii)),
(2) it is closed under Inf2,
(3) it is closed under rules (iii), by 3.15 and transitivity of `.
\ Γ being the ⊆-smallest with properties (1)–(3), we are done.
The set Thm
(⊇) We do induction on ThmΓ . We note that Thm \Γ satisfies
(4) it contains the same initial objects as ThmΓ (compare (i)–(ii) with
Th1–Th2),
(5) it is closed under Inf2,
(6) it is closed under rules Inf3–Inf4 since each of them has its conclusion
tautologically implied by the premises(s), so, it is a rule of type (iii).
The set ThmΓ being the ⊆-smallest with properties (4)–(6), we are done
once more.
A ⇒ (B ≡ C) ` A ⇒ ((∀x)B ≡ (∀x)C).
(1) A ⇒ (B ≡ C) hhypothesisi
(2) A ⇒ (B ⇒ C) h(1) and 3.15i
(3) A ⇒ (∀x)(B ⇒ C) hx not free in A. (2) and 3.8 usedi
(4) (∀x)(B ⇒ C) ⇒ (∀x)B ⇒ (∀x)C h3.18i
(5) A ⇒ (∀x)B ⇒ (∀x)C h(3), (4), and 3.15i
Since
and
Γ ` A ⇒ H[p := C] ≡ H[p := D]
thus (1) follows from 3.15 and
A ⇒ G[p := C] ≡ G[p := D] , A ⇒ H[p := C] ≡ H[p := D]
|=Taut A ⇒ (G ◦ H)[p := C] ≡ (G ◦ H)[p := D]
Finally, let E be (∀x)G. By I.H., Γ ` A ⇒ G[p := C] ≡ G[p := D] .
Since A is closed, x is not free in A. Thus, by 3.19,
Γ ` A ⇒ ((∀x)G)[p := C] ≡ ((∀x)G)[p := D]
A1 , . . . , Ar |=Taut B. (2)
A ⇒ A1 , . . . , A ⇒ Ar |=Taut A ⇒ B
Γ ` (A ⇒ B)[x1 , . . . , xn ] (1)
† Let a truth assignment make the left of |=
Taut true. The “hard case” is when it makes
A true at the same time. This forces all the Ai to be true, and by (2), B to be true.
‡ Thus, in particular, Γ0 is “aware” of some additional facts. For example, Γ0 ` e = e,
where [x1 , . . . , xn ] is the list of all free variables in A, is equivalent (by 3.22) to
Γ0 ` (A ⇒ B)[e1 , . . . , en ] (2)
Γ0 , A[e1 , . . . , en ] ` B[e1 , . . . , en ]
4. Soundness
The easiest way to introduce Tarski semantics is to follow Shoenfield.
We would now like to take a formula A, transform each one of its syntactic
ingredients—except free variables—S into its “concrete” counterpart S I to fi-
nally obtain AI . We could then say that A is valid in M, and write |=M A to
mean that AI is true for all values (from M ) “plugged into” its free variables.
This can be done as follows (among other ways).
This is the correct definition for “logically implies” in those approaches that
allow “strong generalization” (3.4). In particular, this definition allows that
A |= (∀x)A—as we indeed require in order to “match” the syntactic A ` (∀x)A.
The flip side of this is that |= A ⇒ B and A |= B are not equivalent.
4. Soundness 25
4.7 Definition. (First order theories) Given a first order language L = (V, Term,
Wff), a (first order) theory Γ over L consists of all the formulas in Γ taken as
nonlogical axioms, along with the logical axioms (see 2.5), and the rules of in-
ference Inf2–Inf4. ThmΓ are the theorems of the theory.
A pure theory is one with Γ = ∅.
4.8 Remark. In the next section we will consider two different ways to “do
logic”, and therefore will have two kinds of first order theories. One will be as in
the definition above. We will call it an “E-theory”, to indicate that the logical
axioms and especially the rules of inference support equational reasoning. The
other will be as developed in [Sh]. We will call such a theory an “S-theory”.
4.9 Definition. (Soundness) A pure theory is sound, iff ` A implies |= A, that
is, iff all the theorems of the theory are universally valid.
B ∨ (∀x)C ≡ (∀x)(B ∨ C)
|=M C ≡ D (4)
One way to show completeness of the pure E-theory is to offer a proof in the
style of Henkin. A much easier way is to show that the pure E-theory is at least
as powerful as the pure S-theory.
Shoenfield allows no propositional variables, and admits therefore axiom
schemata throughout.
The axioms in Shoenfield are
S-Ax1 The schema A ∨ ¬A
S-Ax2 The dual of Ax3, namely, A[x := t] ⇒ (∃x)A, where t is substitutable
in x
5. Completeness 27
S-Inf2
A⇒B
(∃x)A ⇒ B
provided x is not free in B.
Let us write `theory A (where “theory” is “E” or “S”) to indicate what axioms
and what rules were responsible for the proof of A. We have at once,
Proof. The trivial proof on theorems (of S) is omitted. We only note that
`E A ∨ ¬A by 3.14.
5.3 Lemma. Let A be a formula over the language L of section 1, and let p
be a propositional variable that occurs in A.
Extend the language L by adding P , a new 1-ary predicate symbol.
Then, |= A iff |= A[p := (∀x)P x] and `E A iff `E A[p := (∀x)P x].
Completeness and soundness were proved for a pure “E-theory” that has a
different (much smaller) axiom set than the one given in [GS1]. The next section
shows the equivalence of the present pure theory with the one offered in [GS1].
6. Comparison with the Equational Logic of [GS1]
We prove that each rule or axiom on quantification that is offered in [GS1],
chapters 8 and 9, is “derived” (hence redundant or dependent) in the E-logic
that we have presented here (definitions 2.1, 2.3 and 2.5). We write ` meaning
`E throughout.
Nomenclature and numbers given in brackets are those in [GS1].
A.1 “Leibniz (8.12)” The following two rules are meant to supplement the
“propositional” version given in [GS1], towards predicate calculus duty
([GS1], p.148).
A≡B
(∀x)(C[p := A] ⇒ D) ≡ (∀x)(C[p := B] ⇒ D)
6. Comparison with the Equational Logic of [GS1] 29
and
D ⇒ (A ≡ B)
(1)
(∀x)(D ⇒ C[p := A]) ≡ (∀x)(D ⇒ C[p := B])
that is
|= x = 0 ⇒ (x = 0 ≡ true)
but
6|= (∀x)(D ⇒ C[p := A]) ≡ (∀x)(D ⇒ C[p := B])
that is
A.3 “One-point rule (8.14)”. Provided that x is not free in t, and t is substi-
tutable in x in A,† ` (∀x)(x = t ⇒ A) ≡ A[x := t].
(⇒) By Ax3, ` (∀x)(x = t ⇒ A) ⇒ t = t ⇒ A[t]. By 3.5 and Ax4,
` t = t, hence ` (∀x)(x = t ⇒ A) ⇒ A[t] by 3.15.
(⇐) A standard result (from Ax4, Ax5) is that ` x = y ⇒ y = x (e.g.,
[Sh]). By 3.5, ` x = t ⇒ t = x, thus (3.15 and Ax5) ` A[t] ⇒ x = t ⇒ A.
As x is not free in A[t], 3.8 yields ` A[t] ⇒ (∀x)(x = t ⇒ A).
1. (∀x)(∀y)(A ∧ B ⇒ C)
2. A ∧ B ⇒ C h1. and specializationi
3. A ⇒ B ⇒ C h2. and 3.15i
4. A ⇒ (∀y)(B ⇒ C) h3. and 3.8; y not free in Ai
5. (∀x)(A ⇒ (∀y)(B ⇒ C)) h4. and 3.4i
This concludes that for every formula A, `GS1 A implies `E A, where `GS1
indicates provability in the system outlined in [GS1].
Conversely, the set of axioms in [GS1]—along with the (corrected) “aug-
mented Leibniz” (8.12)—subsume our version of pure E-theory (our Ax3 is a
theorem in [GS1]). Thus,
6.2 Corollary. The pure first order theory outlined in [GS1] is sound and
complete.
(see the remark following 3.4) that since Inf2 yields “strong generalization”,
first order Logics such as [Bou, En] do not support it.
We show how such Logics can be recast in the Equational Logic paradigm.
We base our discussion here, for the sake of concreteness, to the version of
first order Logic as it is defined in [En]. We start by describing this logic.
We allow propositional variables in formulas only for the purpose of making
the statement of “Leibniz”, below, easy. The only connectives are ¬, ⇒, ∀. The
remaining connectives— ≡, ∨, ∧, ∃ —are introduced as abbreviations, and the
same holds for the constants (0-ary propositional connectives in reality) true
and false (for example, in view of the theorem ` (A ≡ A) ≡ (B ≡ B), we
abbreviate any occurrence of A ≡ A by true).
There is only one rule of inference, modus ponens.
The axioms (schemata) are all the possible “partial” generalizations† of the
following
En3. All formulas of the form (∀x)(A ⇒ B) ⇒ (∀x)A ⇒ (∀x)B (see 3.18),
En4. All formulas of the form A ⇒ (∀x)A, provided x is not free in A (see 3.7).
In this logic the following weaker version of Inf2, let us call it no-capture Leibniz,
is a derived rule:
A≡B
C[p := A] ≡ C[p := B]
provided neither A nor B contain free variables that are captured by quantifiers
during the above substitutions.
For a proof, we note that the following Metatheorem (“restricted general-
ization”) holds in [En]:
Γ ` G iff Γ ` (∀x)G, provided no formula in Γ has x free. Here then is the
proof of no-capture Leibniz, A ≡ B ` C[p := A] ≡ C[p := B], by induction on
the formula C.
Basis. C is atomic. If C is the string p, then C[p := A] is A and C[p := B]
is B, so our conclusion is our hypothesis.
In all other cases C[p := A] ≡ C[p := B] is the tautology C ≡ C.
Induction Step(s) (I.S.).
may or may not occur in A. The well known “universal closure of A” is a special case.
† The tautology theorem holds in [En].
7. Concluding remarks; an alternative formalism for Equational Logic 33
1. A h giveni
2. A ⇒ B h giveni
3. true ⇒ B h1., 2., 3.1, L2 and Inf3i
4. B h3. and 3.15i
Thus, in the presence of En1–En5, and Inf3, Inf4, L2 and no-capture Leibniz
are equivalent, and the system so founded is equivalent to that in [En].
34
Indeed, the proof of modus ponens above makes it clear that all we really
need is a propositional Leibniz, P L, that is
A≡B
, where p is not in the scope of a quantifier (P L)
C[p := A] ≡ C[p := B]
7.1 Metatheorem. In the presence of the axioms En1–En5 and the rules
Inf3, Inf4, all the following versions of the Leibniz rule are equivalent: P L,
“no-capture”, and L2. Adopting any of them yields a sound and complete first
order logic (equivalent to that in [En]).
In this logic, “full” Leibniz (Inf2) is not derivable.
8. Bibliography
[Ba] Barwise, J. “An introduction to first-order logic”, in Handbook of Math-
ematical Logic (J. Barwise, Ed.), 5–46, Amsterdam: North-Holland Pub-
lishing Company, 1978.