0% found this document useful (0 votes)

41 views31 pages

Atp BW

The document discusses automated theorem proving, which is using a computer program to prove mathematical theorems. It covers various logics that can be used for automated theorem proving like propositional logic, first-order logic, modal logic, and description logic. It also discusses different theorem proving techniques like resolution, tableaux, and sequent calculus that can be applied depending on the logic. The document provides examples of applying these techniques to prove theorems in different logics.

Uploaded by

Houssem Hamdi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views31 pages

Atp BW

Uploaded by

Houssem Hamdi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

Automated

Theorem Proving
Scott Sanner, Guest Lecture
Topics in Automated Reasoning
Thursday, Jan. 19, 2006

Introduction

• Def. Automated Theorem Proving:

Proof of mathematical theorems by a
computer program.

• Depending on underlying logic, task

varies from trivial to impossible:
– Simple description logic: Poly-time
– Propositional logic: NP-Complete (3-SAT)
– First-order logic w/ arithmetic: Impossible

1
Applications
• Proofs of Mathematical Conjectures
– Graph theory: Four color theorem
– Boolean algebra: Robbins conjecture

• Hardware and Software Verification

– Verification: Arithmetic circuits
– Program correctness: Invariants, safety

• Query Answering
– Build domain-specific knowledge bases,
use theorem proving to answer queries

Basic Task Structure

• Given:
– Set of axioms (KB encoded as axioms)
– Conjecture (assumptions + consequence)

• Inference:
– Search through space of valid inferences

• Output:
– Proof (if found, a sequence of steps
deriving conjecture consequence from
axioms and assumptions)

2
Many Logics / Many Theorem
Proving Techniques

Focus on theorem proving for logics

with a model-theoretic semantics (TBD)
• Logics:
– Propositional, and first-order logic
– Modal, temporal, and description logic

• Theorem Proving Techniques:

– Resolution, tableaux, sequent, inverse
– Best technique depends on logic and app.

Example of Propositional
Logic Sequent Proof

• Given: • Direct Proof:

– Axioms: (I)
A |- A
None (¬R)
– Conjecture: |- ¬A, A

A ∨ ¬A ? (∨R2)
|- A∨¬A, A
(PR)
• Inference: |- A, A∨¬A
(∨R1)
– Gentzen |- A∨¬A, A∨¬A
Sequent (CR)

Calculus |- A∨¬A

3
Example of First-order Logic
Resolution Proof

• Given: • CNF:
¬Man(x) ∨ Mortal(x)
– Axioms: Man(Socrates)
∀x Man(x) ⇒ Mortal(x) ¬Mortal(y) [Neg. conj.]
Man(Socrates)
– Conjecture: • Proof:
∃y Mortal(y) ?
1. ¬Mortal(y) [Neg. conj.]
2. ¬Man(x) ∨ Mortal(x) [Given]
• Inference: 3. Man(Socrates) [Given]
4. Mortal(Socrates) [Res. 2,3]
– Refutation
5. ⊥ [Res. 1,4]
Resolution Contradiction ⇒ Conj. is true

Example of Description Logic

Tableaux Proof

• Given: • Proof:
Check unsatisfiability of
– Axioms: ∃Child.¬Male  ∀ Child.Male
None
– Conjecture: x: ∃Child.¬Male  ∀ Child.Male
¬∃
∃ Child.¬Male ⇒ x: ∀ Child.Male [  -rule ]
∀ Child.Male ? x: ∃Child.¬Male [  -rule ]
x: Child y [ ∃-rule ]
y: ¬Male [ ∃-rule ]
• Inference: y: Male [ ∀-rule ]
<CLASH>
– Tableaux
Contradiction ⇒ Conj. is true

4
Lecture Outline
• Common Definitions
– Soundness, completeness, decidability

• Propositional and first-order logic

– Syntax and semantics
– Tableaux theorem proving
– Resolution theorem proving
• Strategies, orderings, redundancy, saturation
optimizations, & extensions

• Modal, temporal, & description logics

– Quick overview of logics / TP techniques

Entailment vs. Truth

• For each logic and theorem proving
approach, we’ll specify:
– Syntax and semantics
– Foundational axioms (if any)
– Rules of inference

• Entailment vs. Truth

– Let KB be the conjunction of axioms
– Let F be a formula (possibly a conjecture)
– We say KB |- F (read: KB entails F) if F can be
derived from KB through rules of inference
– We say KB |= F (read: KB models F) if semantics
hold that F is true whenever KB is true

5
Model-theoretic semantics
• Model-theoretic semantics for logics
– An interpretation is a truth assignment to atomic
elements of a KB: I〈C,D〉 = {〈F,F〉, 〈F,T〉, 〈T,F〉, 〈T,T〉}
– A model of a formula is an interpretation where
it is true: I〈C,D〉 = 〈F,T〉 models C∨
∨D,,C⇒
⇒D, but not C∧
∧D
– Two properties of a formula F w.r.t. axioms of KB:
• Validity: F is true in all models of KB
• Satisfiability: F is true in ≥1 model of KB

• Think of truth in a set-theoretic manner

Models of KB
KB |= C C KB
⊆ Models of C

Soundness, Completeness,
and Decidability

• Two properties of ATP inference systems:

– Soundness: If KB |- C then KB |= C
– Completeness: If KB |= C then KB |- C

• For a given logic, an ATP decision

procedure returns true or false for KB |- C
• For a logic, a sound and complete decision
procedure has one of following properties:
– Decidable: Decision procedure guaranteed to
terminate in finite time
– Semidecidable: Decision procedure guaranteed
to terminate for either true or false, but not both
– Undecidable: No termination guarantee

6
Prop. Logic Syntax
• Propositional variables: p, rain, sunny
• Connectives: ⇒ ⇔ ¬ ∧ ∨
• Inductive definition of well-formed
formula (wff):
– Base: All propositional vars are wffs
– Inductive 1: If A is a wff then ¬A is a wff
– Inductive 2: If A and B are wffs then
A ∧ B, A ∨ B, A ⇒ B, A ⇔ B are wffs
• Examples:
– rain, rain ⇒ ¬ sunny
– (rain ⇒ ¬ sunny) ⇔ (sunny ⇒ ¬rain)

Prop. Logic Semantics

• For a formula F, the truth I(F) under
interpretation I is recursively defined:
– Base:
• F is prop var A then I(F)=true iff I(A)=true
– Recursive:
• F is ¬C then I(F)=true iff I(C)=false
• F is C ∧ D then I(F)=true iff I(C)=true & I(D)=true
• F is C ∨ D then I(F)=true iff I(C)=true or I(D)=true
• F is C ⇒ D then I(F)=true iff I(¬C ∨ D)=true
• F is C ⇔ D then I(F)=true iff I(C ⇒ D)=true &
I(D ⇒ C)=true

• Truth defined recursively from ground up!

7
CNF Normalization
• Many prop. theorem proving techniques req.
KB to be in clausal normal form (CNF):
– Rewrite all C ⇔ D as C ⇒ D ∧ D ⇒ C
– Rewrite all C ⇒ D as ¬C ∨ D
– Push negation through connectives:
• Rewrite ¬(C ∧ D) as ¬C ∨ ¬D
• Rewrite ¬(C ∨ D) as ¬C ∧ ¬D
– Rewrite double negation ¬ ¬ C as C
– Now NNF, to get CNF, distribute ∨ over ∧:
• Rewrite (C ∧ D) ∨ E as (C ∨ E) ∧ (D ∨ E)
• A clause is a disj. of literals (pos/neg vars)
• Can express KB as conj. of a set of clauses

CNF Normalization Example

• Given KB with single formula:
– ¬ (rain ⇒ wet) ⇒ (inside ∧ warm)
• Rewrite all C ⇒ D as ¬C ∨ D
– ¬ ¬ (¬ rain ∨ wet) ∨ (inside ∧ warm)
• Push negation through connectives:
– (¬ ¬ ¬ rain ∨ ¬ ¬ wet) ∨ (inside ∧ warm)
• Rewrite double negation ¬ ¬ C as C
– (¬ rain ∨ wet) ∨ (inside ∧ warm)
• Distribute ∨ over ∧:
– (¬rain ∨ wet ∨ inside) ∧ (¬rain ∨ wet ∨ warm)
• CNF KB: {¬ ¬rain ∨ wet ∨ inside, ¬rain ∨ wet ∨ warm}

8
Prop. Theorem Proving
• A ⇒ B iff A ∧ ¬B is unsatisfiable
• Decision procedure for propositional
logic is decidable, but NP-complete
(reduction to 3-SAT)
• State-of-the-art prop. unsatisfiability
methods are DPLL-based
A Instantiate prop vars
true false
until all clauses falsified,
B B backtrack and do for all
true false true false instantiations ⇒ unsat!

• Many optimizations, more next week

Prop. Tableaux Methods

Given negated query F (in NNF), use rules to
recursively break down:
– α-Rule: Given A∧B add A and B
– β-Rule: Given A∨B branch on A and B
– 〈Clash〉〉: If A and ¬A occur on same branch
– Clash on all branches indicates unsat!
A∧¬A∨¬B∧B

A ∧ ¬ A β-Rule ¬B ∧ B β-Rule
A α-Rule ¬B α-Rule
¬A α-Rule B α-Rule
〈Clash〉〉 〈Clash〉〉
Note: Inverse method is inverse of tableaux - bottom up

9
Propositional Resolution
• One rule:
Rule: Example application:
A ∨ B ¬B ∨ C ¬precip ∨ ¬freezing ∨ snow ¬snow ∨ slippery

A∨C ¬precip ∨ ¬freezing ∨ slippery

• Simple strategy is to make all

possible resolution inferences

• Refutation resolution is sound and

complete

Resolution Strategies
Need strategies to restrict search:
– Unit resolution:
• Only resolve with unit clauses
• Complete for Horn KB
• Intuition: Decrease clause size
– Set of support:
• SOS starts with query clauses
• Only resolve SOS clauses with non-SOS clauses
and put resolvents in SOS
• Intuition: KB should be satisfiable so refutation
should derive from query
– Input resolution:
• At each step resolve only with input (KB or query)
• I.e., don’t resolve non-input clauses
• Linear input: also allow ancestor ⇒ complete

10
Ordering Strategies
• Refutation of a clause requires
refutation of all literals
• Enforce an ordering on proposition
elimination to restrict search
– Example order: p then r then q
– General idea behind Davis-Putnam (DP) &
directional resolution (Dechter & Rish)

• Effective, but does not work with all

resolution strategies, e.g. SOS +
ordered resolution is incomplete

Prop. Inference Software

• Mainly DPLL SAT algorithms
– zChaff – highly optimized & documented
DPLL solver, source available
– siege – best performing DPLL solver,
source not available
– 2clseq – DPLL solver with constraint
propagation (balance search / reasoning)

• For some applications: BDDs

– BDDs maintain all possible models in a
canonical data structure
– CUDD ADD/BDD Package – very efficient

11
First-order logic
• Refer to objects and relations b/w them
• Propositional logic requires all
relations to be propositionalized
– Scott-at-home, Scott-at-work,
Jim-at-subway, etc…
• Really want a compact relational form:
– at(Scott, home), at(Scott, work),
at(Jim, subway), etc…
• Then can use variables and quantify
over all objects:
– ∀x person(x) ⇒ ∃y at(x,y) ^ place(y)

First-order Logic Syntax

• Terms (technical definition is inductive b/c of fns)
– Variables: w, x, y, z
– Constants: a, b, c, d
– Functions over terms: f(a), f(x,y), f(x,c,f(f(z)))
• Predicates: P(x), Q(f(x,y)), R(x, f(x,f(c,z),c))
• Connectives: ⇒ ⇔ ¬ ∧ ∨
• Quantifiers: ∀ ∃
• Inductive wff definition:
– Same as prop. but with following modifications…
– Base: All predicates over terms are wffs
– Inductive: If A is a wff and x is a variable term
then ∀x A & ∃x A are wffs

12
First-order Logic Semantics

• Interpretation I = (∆
∆I,•I)
– ∆I is a non-empty domain
– •I maps from predicate symbols P of arity n
into a subset of ×1…n ∆I (where P is true)
• Example
– ∆I is {Scott, Jim}
– •I maps at(•,•) into { 〈Scott, loc(Scott)〉〉,
〈Jim,
Jim loc(Jim)〉〉 }
– All other ground predicates are false in I,
e.g. at(Scott, loc(Jim)), at(Scott, Scott)
• NB: FOL has ∞ interpretations/models!

Substitution and Unification

• Substitution
– A substitution list θ is a list of variable-term pairs
• e.g., θ={x/3,y/f(z)}
– When θ is applied to an FOL formula, every free
occurrence of a variable in the list is replaced
with the given term
• e.g. (P(x,y) ^ ∃x P(x,y))θ = P(3,f(z)) ^ ∃x P(x,f(z))
• Unification / Most General Unifier
– The unifier UNIF(x,y) of two predicates/terms is a
substitution that makes both arguments identical
• e.g. Unif( P(x,f(x)), P(y, f(f(z))) ) = {x/f(1), y/f(1), z/1}
– The most general unifier MGU(x,y) is just that…
all other unifiers can be obtained from the MGU
by additional subst. (MGU exists for unifiable args)
• e.g. MGU( P(x,f(x)), P(y, f(f(z))) ) = {x/f(z), y/f(z)}

13
Skolemization
• Skolemization is the process of getting rid
of all ∃ quantifiers from a formula while
preserving (un)satisfiability:
– If ∃x quantifier is the outermost quantifier,
remove the ∃ quantifier and substitute a new
constant for x
– If ∃x quantifier occurs inside of ∀ quantifiers,
remove the ∃ quantifier and substitute a new
function of all ∀ quantified variables for x
• Examples:
– Skolemize( ∃w ∃x ∀y ∀z P(w,x,y,z) ) =
∀y ∀z P(c,d,y,z)
– Skolemize( ∀w ∃x ∀y ∃z P(w,x,y,z) ) =
∀w ∀y P(w,f(w),y,f(x,y))

CNF Conversion

CNF conversion is the same as the

propositional case up to NNF, then do:
– Standardize apart variables (all quantified
variables should have different names)
• e.g. ∀x A(x) ∧ ∃x ¬A(x) becomes ∀x A(x) ∧ ∃y ¬A(y)
– Skolemize formula
• e.g. ∀x A(x) ∧ ∃y ¬A(y) becomes ∀x A(x) ∧ ¬A(c)
– Drop universals
• e.g. ∀x A(x) ∧ ¬A(c) becomes A(x) ∧ ¬A(c)
– Distribute ∨ over ∧

14
First-order Theorem Proving

• Tableaux methods
– Preferred for some types of reasoning and for
subsets of FOL (guarded fragment, set theory)
– Highly successful for description and modal
logics which conform to guarded fragment of FOL
• Resolution Methods
– Most successful technique for a variety of KBs
– But… search space grows very quickly
– Need a variety of optimizations in practice
• strategies, ordering, redundancy elimination
• FOL TP complete ☺, but semidecidable
– Will return in finite time if formula entailed
– May run forever if not entailed

First-order Tableaux

Given negated query F (in NNF), use rules to

recursively break down:
– α-Rule, β-Rule: Same as for prop tableaux
– γ-Rule: Given ∀x A(x) add A(?v) for variable ?v
– δ-Rule: Given ∃x A(x) add A(f) for Skolem function f
– 〈Clash〉〉: If unifiable A and ¬A occur on same branch

∀x A(x) ∧ ∃x ¬A(x) ∨ ∃x,y ¬B(x,y) ∧ ∀x,y B(x,y)

∀x A(x) ∧ ∃x ¬A(x) β -Rule ∃x,y ¬B(x,y) ∧ ∀x,y B(x,y) β-Rule

A(?y) α / γ -Rule ¬B(c,d) α / δ / δ -Rule
¬A(c) α / δ -Rule B(?y,?z) α / γ / γ -Rule
〈Clash〉〉 〈Clash〉〉

15
First-order Resolution
• Binary Resolution Rule
Rule: Example application:
C∨D ¬E ∨ F P(3)∨Q(f(x))∨R(y) ¬Q(y)
θ=MGU(D,E)
(C ∨ F)θ P(3) ∨ R(f(x))

• Factoring Rule
Rule: Example application:
C∨D∨E P(z) ∨ Q(3) ∨ Q(z)
θ=MGU(C,D)
Cθ ∨ E P(3) ∨ Q(3)

Example of First-order Logic

Resolution Proof

16
Importance of Factoring
• Without the factoring rule, binary
resolution is incomplete
• For example, take the following
refutable clause set:
– { A(w) v A(z), ~A(y) v ~A(z) }

• All binary resolutions yield clauses

of the same form
• Clause set is only refutable if one of
the clauses is first factored

Search Control
Additional refinements of prop strategies
yield goal-directed / bottom-up search:
– SLD Resolution
• KB of definite clauses (i.e. Horn rules), e.g.
Uncle(?x,?y) := Father(?x,?z) ∧ Brother(?x,?y)
• Resolution backward chains from goal of rules
• With negation-as-failure semantics, SLD-
resolution is logic programming, i.e. Prolog
– Negative and Positive Hyperresolution
• All negative (positive) literals in nucleus clause
are simultaneously resolved with completely
positive (negative) satellite clauses
• Positive hyperres yields backward chaining
• Negative hyperres yields forward chaining

17
Database-style Inference
• Naïve approaches to resolution perform one
inference per step
• For SLD or neg. hyperres and KBs w/ large
numbers of constants / functions, can store
clause terms and perform DB-like res, e.g.
– CNF KB = { R(a,b), R(b,a), R(b,c), R(c,b),
¬R(x,y) ∨ ¬R(y,z) ∨ R(x,z) }
– Use DB join/project during SLD or neg. hyperres:
R(x,y) R(y,z) R(x,z)
{ 〈a,b〉〉, 〈b,a〉〉,
〈b,c〉〉, 〈c,b〉〉 }
× { 〈a,b〉〉, 〈b,a〉〉,
〈b,c〉〉, 〈c,b〉〉 }
⇒ { 〈a,a〉〉, 〈a,c〉〉, 〈b,b〉〉,
〈c,c〉〉, 〈c,a〉〉, 〈c,c〉〉 }

• Can cache inferences for reuse (tabling)

• Huge improvement for instance-heavy KBs

Term Indexing
• Term indexing is another general technique
for fast retrieval of sets of terms / clauses
matching criteria
• Common uses in modern theorem provers:
– Term q is unifiable with term t, i.e., ∃θ s.t. qθ = tθ
– Term t is an instance of q, i.e., ∃θ s.t. qθ = t
– Term t is a generalization of q, i.e., ∃θ s.t. q = tθ
– Clause q subsumes clause t, i.e., ∃θ s.t. qθ ⊆t
– Clause q is subsumed by clause t, i.e., ∃θ s.t. tθ ⊆q
• Techniques: (Google for “term indexing”)
– Path indexing
– Code, context, & discrimination trees

18
Age-weight Ratio
• During a resolution strategy, have two sets:
– Active: Set of active clauses for resolving with
– Frontier: Candidate clauses to resolve with Active

• Idea: Store the frontier in two queues

– Age queue: Standard FIFO queue
– Weight queue: Priority queue where clause priority
determined by heuristic measure:
• Number of literals, number of terms, etc…

• A:W ratio: Choose A clauses from age queue

for every W chosen from weight queue
– Retains completeness of strategy if A is non-zero
• I.e., fair b/c all clauses eventually selected
– Can speed up inference by orders of magnitude!

Redundancy Control
• Redundancy of clauses is a huge problem in
FOL resolution
– For clauses C & D, C is redundant if ∃θ s.t. Cθ
θ⊆D
as a multiset, a.k.a. θ-subsumption
– If true, D is redundant and can be removed
• Intuition: If D used in a refutation, Cθ
θ could be
substituted leading to even shorter refutation

• Two types of subsumption where N is a new

resolvent and A ∈ Active:
– Forward subsumption: A θ-subsumes N, delete N
– Backward subsumption: N θ-subsumes A, delete A

• Forward / backward subsumption expensive

but saves many redundant inferences

19
Saturation Theorem Proving

• Given a set of clauses S:

– S is saturated if all possible inferences
from clauses in S generate forward
subsumed clauses
– Thus, all new inferences can be deleted
without sacrificing completeness
– If S does not contain the empty clause
then S is satisfiable
• Saturation implies no proof possible!
• Usually need ordering restrictions to
reach saturation (if possible)…

Simplification Orderings
For complete ordered resolution in FOL,
must use term simplification orderings:
– Well-founded (Noetherian): If there is no
infinitely decreasing chain of terms s.t.
t0 t1 t2 … t∞
– Monotonic: If s t then f[[s]] f [t]] (f[[s]] and
f[[t]] are identical except for [term]])
– Stable under Subst.: If s t then sθ tθ

Examples: (Google for following keywords)

– Knuth-Bendix ordering
– Lexicographic path ordering

20
Literal Ordering & Selection

• Can extend term ordering to literals lit:

– If literals equal but opposite sign, then
negative literal lit positive literal
– Otherwise, treat literals as terms (modulo sign)
and literal ordering lit is just term ordering

• A selection function selects literals, and

must adhere to following rules:
– At least one literal must be selected
– Either a negative literal is among the selection,
or all maximal positive literals w.r.t. lit are
selected

• Show selected literals by underscore

– e.g., { A ∨ ¬B ∨ ¬C , D ∨ E ∨ ¬F, ¬G ∨ H ∨ I }

Ordered Resolution w/ Selection

• Binary Ordered Res w/ Selection

Rule: Example application:
C∨D ¬E ∨ F P(3)∨Q(f(x))∨R(y) ¬Q(y)
θ=MGU(D,E)
(C ∨ F)θ P(3) ∨ R(f(x))

• Ordered Factoring w/ Selection

Rule: Example application:
C∨D∨E P(z) ∨ Q(3) ∨ Q(z)
θ=MGU(C,D)
Cθ ∨ E P(3) ∨ Q(3)

21
Clause Orderings & Redundancy

• Must define specialized redundancy criterion

for forward and backward subsumption /
deletion when using ordered resolution:
– Define bag (clause) extension of literal ordering:
• {x,y1,…,ym} bag {x1,…,xn,y1,…,ym} if ∀i x lit xi
– Can define redundancy w.r.t. bag ordering:
• Clause C is redundant w.r.t. set of clauses S, if
∃ C1,…,Cn ∈ S, n ≥ 0, s.t. ∀i Ci bag C and C1,…,Cn |= C
– Under ordered res, even if C θ-subsumes D, D is not
redundant (and can’t be deleted) unless C bag D

• NB: Search restrictions of ordered res far

outweigh weakened notion of redundancy
• Ordered res is effective saturation strategy!

Equality
• A predicate w/ special interpretation
• Could axiomatize:
– x=x (reflexive)
– x=y ⇒ y=x (symmetric)
– x=y ∧ y=z ⇒ x=z (transitive)
– For each function f:
• x1=y1 ∧ … ∧ xn=yn ⇒ f(x1,…,xn)=f(y1,…,yn)
– For each predicate P:
• x1=y1 ∧ … ∧ xn=yn ∧ P(x1,…,xn) ⇒ P(y1,…,yn)
• Too many axioms… better to reason
about equality in inference rules

22
Inference
Rules for Equality
• Demodulation (incomplete)
Rule: Literal containing z Example application:
x=y L[z] ∨ D x=f(x) P(3) ∨ Q
θ=MGU(x,z) θ={x/3}
L[yθ] ∨ D P(f(3)) ∨ Q

• Paramodulation (complete)
Rule: Literal containing z Example application:
x=y ∨ C L[z] ∨ D x=f(x)∨C P(3)∨Q
θ=MGU(x,z) θ={x/3}
(L[y] ∨ C ∨ D)θ P(f(3))∨C∨Q

Equational Programming
• Used extensively for algebraic group
theory proofs
• All axioms and conjectures are unit
equality predicates with arithmetic
functions on the LHS and RHS, e.g.
– a*(x+y) = a*x+a*y ?

• In this case, associative-

commutative (AC) unification
(Stickel) important for efficiency, e.g.
– MGU(x+3*y*y, z*3*z+1) = {x/1, y/z}

23
First-order theorem
proving software
Many highly optimized first-order
theorem proving implementations:
– Vampire (1st place for many years in
CADE TP competition)
– Otter (Foundation for modern TP, still
very good, usually 2nd place in CADE)
– SPASS (Specialized for sort reasoning)
– SETHEO (Connection tableaux calculus)
– EQP (Equational theorem proving
system, proved Robbins conjecture)

First-order TP Progress
• Ever since the 1970s I at various times investigated
using automated theorem-proving systems. But it
always seemed that extensive human input--typically
from the creators of the system--was needed to make
such systems actually find non-trivial proofs.
• In the late 1990s, however, I decided to try the latest
systems and was surprised to find that some of them
could routinely produce proofs hundreds of steps long
with little or no guidance. … the overall ability to do
proofs--at least in pure operator systems--seemed vastly
to exceed that of any human.
--Steven Wolfram, “A New Kind of Science”

24
On the other hand…
• Success of modern theorem provers
relies largely on heuristic tuning
• Input KBs are analyzed for properties
which determine strategies and
various parameters of inference
• Still an art as much as a science,
much room for more principled tuning
of parameters, e.g.
– Automatic partitioning of KBs to induce
good literal orderings (McIlraith and Amir)

Gödel’s Incompleteness
Theorem
• FOL inference is complete (Gödel)
• So what is Gödel’s incompleteness
theorem (GIT) about?
• GIT: Inference in FOL with arithmetic
(+,*,exp) is incomplete b/c set of
axioms for arithmetic is not
recursively enumerable.
• Read: Inference rules are sound and
complete, but no way to generate all
axioms required for arithmetic!

25
Modal Logic
• Logic of knowledge and/or belief, e.g.
– English: Scott knows that you know that Scott
knows this lecture is boring
– Modal Logic Kn (n agents): KScottKyouKScott LIB

• Possible worlds (Kripke) semantics

– Each modal operator Ki corresponds to a set of
possible interpretations (i.e., possible worlds)
– Different axioms (T,D,4,5,…) correspond to
relations b/w worlds, Axiom 4: Kiϕ => KiKiϕ
– Semantics: Kiϕ iff ϕ is true in all worlds agent i
considers possible according to axioms & KB

• Postpone reasoning until DL…

Temporal Logic
• A modal logic where the possible
worlds are linked by time:
– LTL: Linear temporal logic w1 w2 w3
• World states evolve
deterministically w4
• State can involve action w2
w5
– CTL: Computation tree logic w1
w6
• World states can evolve w3
non-deterministically
w7
• Temporal operators specify
conditions on world evolution
• Used for verification, safety checks

26
LTL Temporal Operators

• G f: always f f f f f f f

• F f: eventually f f f

• X f: next state Xf f

• f U r: until f f f f r

• f R r: releases r r r r,f

Temporal Logic Inference

• Because time evolves infinitely,
propositional SAT methods won’t work for
LTL/CTL verification (will branch infinitely)
• However, LTL/CTL inference is monotonic!
– To check condition, start with set of all worlds
– Evolve world one step, remove states not
satisfying condition
– Continue evolution until set does not change…
this is set of all states for which condition holds
• For propositional temporal logic, number of
worlds is finite ⇒ termination ⇒ decidable!
• BDD data structure used to compactly
encode sets of worlds and evolve worlds.

27
Description Logic

• A concept oriented logic:

English FOL DL
Dog with a DWS(x) ⇔ DWS ⇔
Spot Dog(x) ^ (∃y.has(x,y) Dog  ∃has.Spot
(DWS) ^ Spot(y))

Large Dog LDWDS(x) ⇔ LDWDS ⇔

with a Dark (Dog(x) ^ Large(x)) ^ Dog  Large 
Spot (∃y.has(x,y) ∃has.(Spot  Dark)
(LDWDS) ^ (Spot(y) ^ Dark(y))

• Guarded fragment subset of FOL

Description Logic (DL)

Inference
• Natural correspondence between ALC DL
and modal logic (Schild):
– Modal propositions are concepts that hold in
possible worlds w, e.g. lecture is boring: LIB(w)
– Modal operators Ki are DL roles that link possible
worlds: Kscott(w1, w2)
– If Scott knows that the lecture-is-boring then
∀w2 Kscott(w1, w2)⇒LIB(w2) (w1 is a free variable)
– Or in DL notation ∀Kscott.LIB
• Since decidable tableaux methods known
for modal logics, these were imported into
DL and later extended to expressive DLs
• Benefit of DL: Decidable subset of FOL that
is ideal for conceptual ontology reasoning!

28
Example of Description Logic
Tableaux Proof

DL Reasoner
Output (FaCT++)
Taxonomy encodes all ⇒ relations

29
Modal, Verification, and
DL Inference Software
• Modal logic
– MSPASS (converts modal formula to FOL)
– By correspondence, also DL reasoners
• Verification (temporal and non-temporal)
– PVS (interactive TP for HW/SW verification)
– ALLOY (first-order HW/SW model checker)
– NuSMV (BDD-based LTL/CTL HW/SW verif.)
• DL Reasoning
– Classic (limited DL, poly-time inference)
– Racer (expressive DL, highly optimized)
– FaCT++ (very expr. DL, highly optimized)

Repositories of TP Problems

Many repositories of theorem proving

knowledge bases:
– TPTP: Thousands of Problems for TPs
• Algebraic group theory, geometry, set theory,
topology, software verification, NLP KBs
– SATLIB: Library of Prop. SAT problems
• Hardware verification, industrial planning
problems, hard randomized problems
– Open/ResearchCyc: Public version of Cyc
• Large common-sense repository expressed in
higher-order logic
– Semantic Web: DL ontologies in OWL
• The web is the limit!

30
Concluding Thoughts
• Many logics, inference techniques,
and computational guarantees
• Have to balance expressivity and
computational tradeoffs with task-
specific needs (Brachman & Levesque, 1985)
• Woods (1987): Don’t blame the tool!
– A poor craftsman blames the tool when
their efforts fail
– An experienced craftsman uses the right
tool for the job

5105 AI Propositional Logic
No ratings yet
5105 AI Propositional Logic
48 pages
Proposition Al 2
No ratings yet
Proposition Al 2
88 pages
Fallsem2015 16 Cp3066 Qz01ans PDNF and PCNF
No ratings yet
Fallsem2015 16 Cp3066 Qz01ans PDNF and PCNF
28 pages
07 Logic
No ratings yet
07 Logic
82 pages
Knowledge Reasoning
No ratings yet
Knowledge Reasoning
40 pages
13 Logic
No ratings yet
13 Logic
70 pages
Ai9 - Propositional Theorem Proving
No ratings yet
Ai9 - Propositional Theorem Proving
37 pages
Propositional Logic
No ratings yet
Propositional Logic
42 pages
NLP Unit 4
No ratings yet
NLP Unit 4
40 pages
AI07
No ratings yet
AI07
42 pages
AI Unit 3
No ratings yet
AI Unit 3
154 pages
Ai Online
No ratings yet
Ai Online
10 pages
CHP1
No ratings yet
CHP1
26 pages
Completeness
No ratings yet
Completeness
64 pages
Knowledge Representation - PPT Unit3
No ratings yet
Knowledge Representation - PPT Unit3
42 pages
Dokumen - Pub Ai in Cybersecurity 1nbsped 3319988417 9783319988412
No ratings yet
Dokumen - Pub Ai in Cybersecurity 1nbsped 3319988417 9783319988412
302 pages
FOL Uncertainty
No ratings yet
FOL Uncertainty
108 pages
Predicate Logic
No ratings yet
Predicate Logic
81 pages
Artificialintelligencepptmy 150322111018 Conversion Gate01
No ratings yet
Artificialintelligencepptmy 150322111018 Conversion Gate01
84 pages
Lec13 Fol
No ratings yet
Lec13 Fol
38 pages
Propositional Logic & Reasoning: Vikram Pudi IIIT Hyderabad
No ratings yet
Propositional Logic & Reasoning: Vikram Pudi IIIT Hyderabad
40 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
82 pages
Inference in FOL (AIMA Ch. 9)
No ratings yet
Inference in FOL (AIMA Ch. 9)
39 pages
Resolution Davis Putnam
No ratings yet
Resolution Davis Putnam
51 pages
CSE 381 Lecture 05 Logical Agents LastUpdate Mar 07 2021
No ratings yet
CSE 381 Lecture 05 Logical Agents LastUpdate Mar 07 2021
47 pages
Logical
No ratings yet
Logical
58 pages
Artificial Intelligence PPT-7 - Inference in FOL
No ratings yet
Artificial Intelligence PPT-7 - Inference in FOL
46 pages
Tutorial 2
No ratings yet
Tutorial 2
3 pages
Chapter 4 - Knowledge and Inference
No ratings yet
Chapter 4 - Knowledge and Inference
49 pages
Logic Fol 2
No ratings yet
Logic Fol 2
43 pages
Glossary of Artificial Intelligence
No ratings yet
Glossary of Artificial Intelligence
62 pages
Lectures
No ratings yet
Lectures
12 pages
Notes 8: Predicate Logic and Inference: ICS 270a Spring 2003
No ratings yet
Notes 8: Predicate Logic and Inference: ICS 270a Spring 2003
35 pages
PL Model Checking Efficient PL Model Checking
No ratings yet
PL Model Checking Efficient PL Model Checking
17 pages
1004 Theorem Proving 2018
No ratings yet
1004 Theorem Proving 2018
34 pages
FA15 CS188 Lecture 08 PL Semantics and Inference - pptx-2
No ratings yet
FA15 CS188 Lecture 08 PL Semantics and Inference - pptx-2
25 pages
Ai 3,4,5 Vtu nOTES
No ratings yet
Ai 3,4,5 Vtu nOTES
22 pages
Mod1 08 Reasoning
No ratings yet
Mod1 08 Reasoning
23 pages
Propositional Logic and Its Applications in Artificial Intelligence
No ratings yet
Propositional Logic and Its Applications in Artificial Intelligence
28 pages
Impossibility Results For Logic Dberlog 2011, Week 6: 1 Logics
No ratings yet
Impossibility Results For Logic Dberlog 2011, Week 6: 1 Logics
19 pages
15 KB Systems Part3 6up
No ratings yet
15 KB Systems Part3 6up
7 pages
Module 1 Artificial Intelligence Fundamentals
No ratings yet
Module 1 Artificial Intelligence Fundamentals
27 pages
Logic and Computation Exercises
No ratings yet
Logic and Computation Exercises
111 pages
Knowledge Representation
No ratings yet
Knowledge Representation
8 pages
07a Logic 4p
No ratings yet
07a Logic 4p
9 pages
Chapter 1. Logical Reasoning Propositional Logic: Pedro Cabalar
No ratings yet
Chapter 1. Logical Reasoning Propositional Logic: Pedro Cabalar
60 pages
Lecture Levine and Stuart
No ratings yet
Lecture Levine and Stuart
35 pages
Ai Unit 3 Bec Final
No ratings yet
Ai Unit 3 Bec Final
81 pages
Comparison of Two Theorem Provers: Isabelle/HOL and Coq
No ratings yet
Comparison of Two Theorem Provers: Isabelle/HOL and Coq
18 pages
Firstorderlogic JSN
No ratings yet
Firstorderlogic JSN
31 pages
Artificial Intelligence Course Code ECE4 PDF
No ratings yet
Artificial Intelligence Course Code ECE4 PDF
72 pages
16 FirstOrderLogic
No ratings yet
16 FirstOrderLogic
79 pages
Rule Based Programming: 3rd Year, 2nd Semester
No ratings yet
Rule Based Programming: 3rd Year, 2nd Semester
17 pages
SNOMED CT Editorial Guide
No ratings yet
SNOMED CT Editorial Guide
285 pages
Knowledge Representation
No ratings yet
Knowledge Representation
50 pages
KRR4 Notes
No ratings yet
KRR4 Notes
7 pages
Knowledge Representation Using Logic
No ratings yet
Knowledge Representation Using Logic
55 pages
2024 CS1200 Lec15 Notes
No ratings yet
2024 CS1200 Lec15 Notes
6 pages
B) Ontology Alignment and Mapping
No ratings yet
B) Ontology Alignment and Mapping
23 pages
Unit 3 AI
No ratings yet
Unit 3 AI
41 pages
2025 MMP AI-KRR Unit 5 Structured Representation of Knowledge and Ontology (Part 2)
No ratings yet
2025 MMP AI-KRR Unit 5 Structured Representation of Knowledge and Ontology (Part 2)
62 pages
Advanced OWL
No ratings yet
Advanced OWL
42 pages
Ontology-Aided Software Engineering
No ratings yet
Ontology-Aided Software Engineering
163 pages
Knowledge Representation: Unit-6
No ratings yet
Knowledge Representation: Unit-6
101 pages
Habilitation
No ratings yet
Habilitation
186 pages
(Lecture Notes in Computer Science 6450 - Transactions On Computational Collective Intelligence) Grzegorz J. Nalepa, Weronika T. Furmańska (Auth.), Ngoc Thanh Nguyen, Ryszard Kowalczyk (Eds.) - Transa
No ratings yet
(Lecture Notes in Computer Science 6450 - Transactions On Computational Collective Intelligence) Grzegorz J. Nalepa, Weronika T. Furmańska (Auth.), Ngoc Thanh Nguyen, Ryszard Kowalczyk (Eds.) - Transa
204 pages
Propositional Logic or How To Reason Correctly: Chapter 8 (New Edition) Chapter 7 (Old Edition)
No ratings yet
Propositional Logic or How To Reason Correctly: Chapter 8 (New Edition) Chapter 7 (Old Edition)
28 pages
PESIT Bangalore South Campus: Internal Assessment Test Ii-Solution
No ratings yet
PESIT Bangalore South Campus: Internal Assessment Test Ii-Solution
8 pages
CS6180 Lect4
No ratings yet
CS6180 Lect4
6 pages
CS2742 Midterm Test 2 Study Sheet
No ratings yet
CS2742 Midterm Test 2 Study Sheet
5 pages
First-Order Logic: CS472 - Fall 2007 Thorsten Joachims
No ratings yet
First-Order Logic: CS472 - Fall 2007 Thorsten Joachims
8 pages
An IoT-Enabled Ontology-Based Intelligent Healthcare Framework For Remote Patient Monitoring
No ratings yet
An IoT-Enabled Ontology-Based Intelligent Healthcare Framework For Remote Patient Monitoring
20 pages
Atp BW
No ratings yet
Atp BW
31 pages
Unit 3.2 Ai
No ratings yet
Unit 3.2 Ai
46 pages
Artificial Intelligence I: Knowledge Repre-Sentation
No ratings yet
Artificial Intelligence I: Knowledge Repre-Sentation
30 pages
CS2742 Midterm Test 1 Study Sheet Propositional Logic
No ratings yet
CS2742 Midterm Test 1 Study Sheet Propositional Logic
3 pages
CO3-Session 1
No ratings yet
CO3-Session 1
18 pages
A Logic-Based Approach To Web Services Composition and Verification Using OWL-S
No ratings yet
A Logic-Based Approach To Web Services Composition and Verification Using OWL-S
5 pages
Interior Design
No ratings yet
Interior Design
78 pages
Unit Ii - Cs6010 Sna: Unitii Modelling, Aggregating and Knowledge Representation 9
No ratings yet
Unit Ii - Cs6010 Sna: Unitii Modelling, Aggregating and Knowledge Representation 9
17 pages
Arthur Prior and Hybrid Logic
No ratings yet
Arthur Prior and Hybrid Logic
37 pages
Forward Chaining and Backward Chaining in Ai: Inference Engine
No ratings yet
Forward Chaining and Backward Chaining in Ai: Inference Engine
18 pages
AI-unit 2-QB
No ratings yet
AI-unit 2-QB
4 pages
AI Notes
No ratings yet
AI Notes
6 pages
Introducing To Logic and Law
No ratings yet
Introducing To Logic and Law
13 pages
Matching Knowledge Bases
No ratings yet
Matching Knowledge Bases
14 pages
Improved Matchmaking Algorithm For Semantic Web Services Based On Bipartite Graph Matching
No ratings yet
Improved Matchmaking Algorithm For Semantic Web Services Based On Bipartite Graph Matching
9 pages
An Ontology-Based Context Model To Manage Users Preferences and Conflicts
No ratings yet
An Ontology-Based Context Model To Manage Users Preferences and Conflicts
24 pages
Foundations of Elementary Analysis
From Everand
Foundations of Elementary Analysis
Roshan Trivedi
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)

Atp BW

Uploaded by

Atp BW

Uploaded by

Automated

• Def. Automated Theorem Proving:

• Depending on underlying logic, task

• Hardware and Software Verification

Basic Task Structure

Focus on theorem proving for logics

• Theorem Proving Techniques:

• Given: • Direct Proof:

Example of Description Logic

• Propositional and first-order logic

• Modal, temporal, & description logics

Entailment vs. Truth

• Entailment vs. Truth

• Think of truth in a set-theoretic manner

• Two properties of ATP inference systems:

• For a given logic, an ATP decision

Prop. Logic Semantics

• Truth defined recursively from ground up!

CNF Normalization Example

• Many optimizations, more next week

Prop. Tableaux Methods

A∨C ¬precip ∨ ¬freezing ∨ slippery

• Simple strategy is to make all

• Refutation resolution is sound and

• Effective, but does not work with all

Prop. Inference Software

• For some applications: BDDs

First-order Logic Syntax

Substitution and Unification

CNF conversion is the same as the

Given negated query F (in NNF), use rules to

∀x A(x) ∧ ∃x ¬A(x) ∨ ∃x,y ¬B(x,y) ∧ ∀x,y B(x,y)

∀x A(x) ∧ ∃x ¬A(x) β -Rule ∃x,y ¬B(x,y) ∧ ∀x,y B(x,y) β-Rule

Example of First-order Logic

• All binary resolutions yield clauses

• Can cache inferences for reuse (tabling)

• Idea: Store the frontier in two queues

• A:W ratio: Choose A clauses from age queue

• Two types of subsumption where N is a new

• Forward / backward subsumption expensive

• Given a set of clauses S:

Examples: (Google for following keywords)

• Can extend term ordering to literals lit:

• A selection function selects literals, and

• Show selected literals by underscore

Ordered Resolution w/ Selection

• Binary Ordered Res w/ Selection

• Ordered Factoring w/ Selection

• Must define specialized redundancy criterion

• NB: Search restrictions of ordered res far

• In this case, associative-

• Possible worlds (Kripke) semantics

• Postpone reasoning until DL…

Temporal Logic Inference

• A concept oriented logic:

Large Dog LDWDS(x) ⇔ LDWDS ⇔

• Guarded fragment subset of FOL

Description Logic (DL)

Many repositories of theorem proving

You might also like

• Can extend term ordering to literals lit: