Konolige - Abduction Vs Closure in Causal Theories
Konolige - Abduction Vs Closure in Causal Theories
Konolige - Abduction Vs Closure in Causal Theories
Elsevier
Research Note
Abstract
Konolige, K., Abduction versus closure in causal theories (Research Note), Artificial
Intelligence 53 (1992) 255-272.
There are two distinct formalizations for reasoning from observations to explanations, as in
diagnostic tasks. The consistency based approach treats the task as a deductive one, in
which the explanation is deduced from a background theory and a minimal set of
abnormalities. The abductive method, on the other hand, treats explanations as sentences
that, when added to the background theory, derive the observations. We show that there is
a close connection between these two formalizations in the context of simple causal theories:
domain theories in which a set of sentences are singled out as the explanatorily relevant
causes of observations. There are two main results, which show that (with certain caveats)
the consistency based approach can emulate abductive reasoning by adding closure axioms
to a causal theory; and that abductive techniques can be used in place of the consistency
based method in the domain of logic based diagnosis. It is especially interesting that in the
latter case, the abductive techniques generate only relevant explanations, while diagnoses
may have irrelevant elements,
1. Introduction
Reasoning to t h e b e s t e x p l a n a t i o n is a c o m m o n t a s k in m a n y areas of
a r t i f i c i a l i n t e l l i g e n c e . O n e o f t h e c l e a r e s t e x a m p l e s is d i a g n o s i s , in w h i c h o n e
r e a s o n s f r o m o b s e r v a t i o n s s u c h as p a t i e n t s y m p t o m s to t h e i r u n d e r l y i n g c a u s e s ,
a disease or physiological malfunction. In the literature, there are two fun-
damentally d i f f e r e n t f o r m a l i z a t i o n s o f this t a s k [9, 11]. I n o n e , t h e p r o c e s s o f
f i n d i n g a c a u s e is t r e a t e d as a s t r a i g h t f o r w a r d a b d u c t i v e t a s k . R e p r e s e n t a t i v e o f
this approach is the set-covering model of diagnosis [12], which assumes two
disjoint sets, d a set of disorders, and m a set of manifestations. Disorders are
assumed to "cause" manifestations, represented by a relation d x m. The
problem of diagnosis is recast as the problem of finding a minimal cover for
observed manifestations m' C m, that is, a minimal subset of d that causes m'.
The competing formalization, the consistency based approach, is best repre-
sented by Reiter's logic based theory of diagnosis [14]. In this theory, the
functionality of a system containing a finite number of components is character-
ized by a set of first-order sentences, the domain theory. The special predicate
ab(c) is used to state that the component c is abnormal or not functioning
correctly. The observed behavior of the system is represented by a set of
sentences. A diagnosis of the behavior is a minimal set of abnormality
assumptions that is consistent with the observations and the domain theory.
These two formalizations seem fundamentally different. The abductive ap-
proach looks for a set of causes that will imply the observations; the con-
sistency based approach looks for a set of abnormality assumptions that are
consistent with the observations. Nevertheless there is a connection between
the two: Reiter showed how to express the set-covering model within his
framework. Recently, Console [2] and Poole [9] have shown that either
formalization can be used in restricted settings to compute the same explana-
tions for diagnostic tasks. In the abductive framework, the domain theory has
axioms that relate causes and their effects, e.g., c i D e would be used to say
that the effect e is a result of cause q . A corresponding consistency based
theory is created by adding closure axioms stating that the only way to achieve
an effect is by the set of causes given (e ~ c~ v c 2 v • • "). The closure axioms
are local in that they are easily derived by looking at all the implications that
have a c o m m o n head atom. The explanations computed by the two methods
are the same, as long as the domain theory contains just Horn clause
implications from causes to effects, and is acyclic.
This result applies to diagnostic tasks that require explanations, that is, the
unexpected observations must be predicted or explained from the assumed
malfunctions. In the literature, explanatory diagnosis is usually signalled by the
presence of fault models [4, 15]. Reiter's framework may also be used for a
weaker form of diagnosis, which could be called excusing diagnosis: identify
components that, if malfunctioning, would cancel or excuse predicted normal
behavior of the system that conflicts with the observations. Here we look only
at the case of explanatory diagnosis (and causal explanation in general), since
excusing diagnosis has no analog in the abductive framework.
The restrictions on the domain theory for the Console/Poole result are very
tight; in particular, there can be no correlation information (e.g., that two
causes are mutually exclusive, or that one effect is the negation of another) or
uncertainty (e.g., a cause implying a disjunction of effects). In this paper we
will examine the connection between abduction and closure in the setting of
Abduction versus closure in causal theories 257
There are both positive and negative results. With an appropriate notion of
explanatory closure, given certain technical conditions, the consistency based
approach will compute the same explanations as the abductive approach.
However, the utility of the former method is open to question, since local
closure will no longer suffice for explanatory closure; there seems to be no way
to close the domain theory other than by computing all explanations. Further,
the consistency based method is strictly stronger than the abductive one in
explanatory diagnostic tasks, and the answers it produces may have elements
that are not relevant to a causal explanation.
A second area that we address is whether abductive methods may be used in
the setting of logic based diagnosis with fault models. This area is closely
related to the previous one, except that we assume that there is already a
closed theory, and that the causes take on a specific form, namely normality
and abnormality assumptions about components. Our main result here is that
the abductive method produces kernel diagnoses, ~ but without any of the
irrelevant causes that may be present in the latter.
The next three sections of this note describe simple causal theories, and
define abductive and consistency based methods in this context. Section 5
develops the concept of explanatory closures, and Section 6 gives the main
results on emulating abduction with the consistency based method. Section 7
describes how abductive methods can perform logic based diagnosis.
wet-lawn wet-road
2 Allowing only atoms simplifies the analysis, but is not restrictive, since we can include
equivalences such as c --- ~, where ~b is a complex sentence.
Abduction versus closure in causal theories 259
The consistency based approach has been most clearly developed in the
domain of diagnosis, especially in [3, 14]. In this section we will modify the
terminology slightly to apply to the more general causal theories of Section 2,
and to make comparison to the abductive approach easier. The particulars of
the diagnostic task are discussed later, in Section 7.
0 0 0 {-Td [ d E D} is consistent.
When a denial set is maximal (that is, there is no other denial set that
contains it), no more negative causes can be consistently added to it, and thus it
is possible to deduce a set of (positive) causes from the maximal denial set, the
domain theory, and the observations [14]:
,~ O 0 0 { - T d ] d ~ D } ] - C - D. (1)
where m is the observed symptom and d i are diseases that cause the symptom.
Abduction versus closure in causal theories 261
These axioms give necessary conditions for the observation, namely, that one
of a set of diseases be present. Later papers refer to implications of this sort as
fault models [4, 15].
The difference between the consistency based approach and abduction is
twofold. First, the form of inference is distinct: rather than abducing causes
that imply the observations O given the domain theory 27, the consistency
approach tries to minimize the extent of the causation set C by denying as
many of its elements as possible. Second, these methods encode knowledge of
the domain differently: in the abductive framework, there are implications
from the causes to the effects, while in the consistency based systems, if we
want to derive explanatory rather than excusing diagnoses, the most important
information seems to be the implication from observations to possible causes.
Despite these differences, it is known that, under certain conditions on the
domain theory, abductive and consistency based explanations coincide.
Theorem 4 (Console, Poole). 4 Let (C, E, 2f) be a simple causal theory over a
propositional language, with ,~ a set of nonatomic definite clauses whose directed
graph is acyclic. Let C be a set of atoms that do not appear in the head of any
clause of 2f, and E any set of atoms. Let 17 be the Clark completion [1] of ~,.
Then the CBEs of ( C, E, 17 ) are exactly the ABEs of ( C, E, 27 ).
The simple causal theory of Fig. 1 does not satisfy the conditions, because it
contains the equivalence sun -~ rain, and sprinkler is a cause that appears as the
head of a clause. If we eliminate these anomalies, then the Clark completion of
the domain theory is:
wet-road =--rain,
wet-lawn - rain v sprinkler, (2)
sprinkler-~ warm ^ sun.
The CBEs of wet-lawn are {rain} and {sun, warm}; the ABE {sprinkler} is
missing.
For more complicated domain theories, Clark completion does not give the
required closure over abductive explanations. If the theory has cycles, for
example {a ~ b, a' ~ b, b ~ a}, then the completion will only pick out a subset
of the abductive explanations (in this case, b ~ a). If there is disjunction in the
head of a clause, the completion is undefined.
In the next sections we will extend the scope of Theorem 4 by considering a
more general notion of completion for a simple causal theory, that of explana-
tory closures.
4 Neither of these authors states the t h e o r e m in this form, although Poole ll0] is close. It is clear
that the t h e o r e m follows from their results. Poole's t h e o r e m as stated s e e m s to have a broader
application, but personal correspondence with him disclosed that the conditions of application are
as given here.
262 K. Konolige
5. Explanatory closures
Example 5. Let ( { a l , a 2, a3}, {g~, g2, g3}, ~ ) be a simple causal theory, with
2: equal to the conjunction of
a~ A a 2 ~ g l , a~ A a 2 ~ g 2 v g 3 , -qa~ v -~a 2 v ~a 3 ,
a z A a 3 D g2, a 2 /x a 3 D gl v g3 , g~ v g2 v g3 ,
a 3 m a I D g3 , a3 ^ al D gl v g2 •
gl D a I A a 2 ,
g2 ~ a2 A a 3 ,
g3 ~ a3 A a 1 .
Abduction versus closure in causal theories 263
Pl v P 2 V "'" vpn
V, V,(g, ^
=- V i ( g i A 7A~ ^ -TA~...)
where Eg i is the cautious explanation for gi, and the A is are all abductive
explanations for it. T h e p r o p o s i t i o n follows by tautological c o n s e q u e n c e . []
E x a m p l e 7. Let ( { a l , a3, a4} , {gl, g2, g3}, ~ > be a simple causal t h e o r y , with
equal to the c o n j u n c t i o n o f
a I 3 gl , 7a 1 v -la 3 ,
al D g2 , aa D gl v g 2 ,
aa ~ g3 ,
a4 ~ g3 •
T h e closures of this t h e o r y are
gl~al ,
g2 Da~ ,
g3 ~ a3 v a 4 .
If the first two closures are added to ~,, a I becomes true, a 3 becomes false, and
the only explanation for g3 is a 4.
This example shows that some causes may b e c o m e true or false, thus
modifying the available abductive explanations. However, no truly " n e w "
explanations are created by the addition of closures, since every explanation
must be the subset of one of the original ones.
Escape causes are the same idea as the unknown faults of [4, 15].
Abduction versus closure in causal theories 265
with respect to the augmented theory, and let H = ~ ' U (T(gi)}. Then 171 is
consistent, and for an arbitrary effect g, a subset A C D is an abductive
explanation o f g w.r.t. H if and only if it is an abductive explanation o f g w. r. t.
2f.
plvpEv'"vp,.
Because the only expressions containing r~ are of the form r~ D gi, the above
sentences are theorems of ~ ' only if there are corresponding theorems of
with each 7r~ replaced by -Tg~. This is impossible, since such a set is
unsatisfiable.
Assume A C_ C is an abductive explanation for g w.r.t. F/, but not w.r.t. ~.
By reasoning similar to that in the proof of Proposition 8, the following must
be theorems of ~ ' , where each p~ is either g~ or -Tg~:
Let a and a' be the causes. Adding the local closure g D a v b is insufficient,
because it is subsumed by g D c D b, so that a as a cause of g will never be
inferred. Any local closure for g cannot find the connection between c and b,
and thus has the chance of being incorrect.
Loops in the implication structure also cause problems for other global
closure methods such as circumscription, which is equivalent to Clark comple-
tion for the restricted language [13]. In the case of the above example,
minimizing g while holding the causes fixed yields g 3 b, which is again
stronger than the explanatory closure.
266 K. Konolige
Theorem 10. Let (C, E, ,~ ) be a simple causal theory, and suppose that
( C, E, 1I), its closure, is consistent and does not entail an effect g. Let A be an
A B E for g in ~, and suppose that A is consistent with 11 and independent in I1.
Then A is a subset (not necessarily proper) of some CBE for g in II.
Conversely, every CBE for g in 11 is a superset (not necessarily proper) of
some A B E for g in Z.
atoms. Second, there is a theory of causation for action types, which is used to
generate the closure conditions by circumscription. That is, the causation
axioms state what must follow if the preconditions of an action hold and the
action takes place; circumscription then generates the closure axioms. Minimi-
zation over miracles gives the desired explanations.
Another good example of the derivation of closure axioms by circumscriptive
techniques is Kautz's theory of plan recognition [5]. The domain theory is a
hierarchical set of actions; the causes are the goals at the highest level of the
hierarchy, the so-called E N D events. Relations between actions at different
levels in the hierarchy are given by a first-order domain theory. Circumscrip-
tion is used to close off the axioms, producing the explanatory closure axioms.
Given a set of observed actions, minimizing over the E N D event produces an
explanation of the observations. ~
The motivation behind the multiple circumscription in systems such as [5, 6]
has often been obscure. Given the results of this paper, it should be clear that
the circumscriptions are performing abduction by using closure and minimiza-
tion. Whether the circumscription corresponds to an appropriate closure can be
tested by checking whether it produces the explanatory closure axioms, and
whether adding these axioms changes the set of abductive explanations. The
examples and propositions of Section 5 should be helpful in this regard; for
example, by adding escape causes it is always possible to retain the original
causal structure. In general, if there are cycles in the implication structure of
the causal domain theory, then neither circumscription nor local closure will
work correctly in generating explanatory closures.
T h e accounts of these systems are of necessity somewhat simplified, but the basic structure is
correct.
Abduction versus closure in causal theories 269
that the input of device i is a logical 1, and ~ i n i that it is a 0 (similarly for out i
and output). The domain axioms are
9We could also change the vocabulary and add ok i =-~ab i as a new set of causes. But we are
trying to stay as close as possible to the logic based diagnosis terminology.
270 K, Konolige
,' 1
8. Conclusion
contain statements that the only causes are the known ones. The appropriate
closure axioms are identified in this paper as explanatory closures. The main
result of the paper is that minimization of causes in the closed theory produces
almost the same explanations as abduction in the original causal theory. The
caveat is that the abductive explanations are generally weaker than their
consistency based counterparts. There are two reasons for this: adding closures
may change the available abductive explanations; and the consistency based
method can conclude causes that are intuitively irrelevant to the observed
behavior.
If one is interested in the representation of domain knowledge, then the
abductive approach offers several advantages. It does not require the assump-
tion of complete knowledge of causation, and it is not necessary to assert the
explanatory closures. Adding the closures can lead to inconsistency and change
the available abductive explanations (although it will not add new ones). The
computational aspect of adding closures is also discouraging, since there is no
general local method that accomplishes the addition. Stronger global methods
such as circumscription will work only in special circumstances.
In logic based diagnosis, using an abductive method may be appropriate if it
is important to distinguish causes relevant to producing the observations from
those that are mere side-effects. But one must be careful here in giving too
much weight to the term "causally relevant", since the simple causal theories
we have introduced give only a very narrow interpretation of the complex
concept of causation.
Acknowledgement
I would like to thank David Poole, Eunok Paek, Oskar Dressier, and
Nicolas Helft for many helpful discussions on the evolving draft. The two
anonymous referees contributed many useful comments that I have used in
revising the paper. The research reported here was supported partially by the
NTT Corporation, and partially by the Office of Naval Research under
Contract No. N00014-89-C-0095.
References
[1] K. Clark, Negation as failure, in: Logic and Data Bases (Plenum, New York, 1978) 293-322.
[2] L. Console, D.T. Dupre, and P. Torasso, Abductive reasoning through direct deduction from
completed domain models, in: Z.W. Ras, ed., Methodologies for Intelligent Systems 4
(North-Holland, New York, 1988) 175-182.
[3] J. de Kleer, A. Mackworth, and R. Reiter, Characterizing diagnoses, in: Proceedings
AAAI-90, Boston, MA (1990).
[4] J. de Kleer and B.C. Williams, Diagnosis with behavioral modes, in: Proceedings IJCAI-89,
Detroit, MI (1989).
272 K. Konolige
[5] H. Kautz, A formal theory for plan recognition, Tech. Report TR-215, University of
Rochester (1987).
[6] V. Lifschitz and A. Rabinov, Miracles in formal theories of action, ArtiJ~ lntell. 38 (2) (1989)
225 -237.
[7] J. McCarty, First order theories of individual concepts and propositions, in: B. Meltzer and
D. Michie, eds., Machine Intelligence 9 (Edinburgh University Press, Edinburgh, 1979)
120-147.
[8] L. Morgenstern and L.A. Stein, Why things go wrong: A formal theory of causal reasoning,
in: Proceedings AAAI-88, St. Paul, MN (1988) 518-523.
[9] D. Poole, Representing knowledge for logic-based diagnosis, in: Proceedings of the Interna-
tional Conference on Fifth Generation Computing Systems, Tokyo (1988) 1282-1290.
If0] D. Poole, A methodology for using a default and abductive reasoning system, Tech. Report,
Department of Computer Science, University of Waterloo, Waterloo, Ontario (1988).
[11] D. Poole, Normality and faults in logic-based diagnosis, in: Proceedings IJCA1-89, Detroit,
MI (1989).
[12] J.A. Reggia, D.S. Nau, and Y. Wang, A formal model of diagnostic inference I. Problem
formulation and decomposition, Inf. Sci. 37 (1985).
[13] R. Reiter, Circumscription implies predicate completion (sometimes), in: Proceedings AAAI-
82, Pittsburg, PA (1982) 418-420.
[14] R. Reiter, A theory of diagnosis from first principles, Artif. lntell. 32 (1987) 57-95.
[15] P. Struss and O. Dressier, Physical negation-integrating fault models into the general
diagnostic engine, in: Proceedings 1JCAI-89, Detroit, MI (1989) 1318-1323.