Artificial Intelligence

Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 13

ARTIFICIAL INTELLIGENCE

Architecture of Intelligence
Abstract
We start by making a distinction between mind and cognition, and by positing that
cognition is an aspect of mind. We propose as a working hypothesis a Separability
Hypothesis which posits that we can factor off an architecture for cognition from a more
general architecture for mind, thus avoiding a number of philosophical objections that
have been raised about the "Strong AI" hypothesis. hus the search for an architectural
level which will e!plain all the interesting phenomena of cognition is likely to be futile.
here are a number of levels which interact, unlike in the computer model, and this
interaction makes e!planation of even relatively simple cognitive phenomena in terms of
one level "uite incomplete.
I. Dimensions for Thinking About Thinking
A major problem in the study of intelligence and cognition is the range of#often implicit
#assumptions about what phenomena these terms are meant to cover. Are we just
talking about cognition as having and using knowledge, or are we also talking about other
mental states such as emotions and subjective awareness$ Are we talking about
intelligence as an abstract set of capacities, or as a set of biological mechanisms and
phenomena$ hese two "uestions set up two dimensions of discussion about intelligence.
After we discuss these dimensions we will discuss information processing,
representation, and cognitive architectures.
A. Dimension 1. Is intelligence separable from other mental phenomena?
When people think of intelligence and cognition, they often think of an agent being in
some knowledge state, that is, having thoughts, beliefs. hey also think of the underlying
process of cognition as something that changes knowledge states. Since knowledge states
are particular types of information states the underlying process is thought of as
information processing. %owever, besides these knowledge states, mental phenomena
also include such things as emotional states and subjective consciousness. &nder what
conditions can these other mental properties also be attributed to artifacts to which we
attribute knowledge states$ Is intelligence separable from these other mental phenomena$
It is possible that intelligence can be e!plained or simulated without necessarily
e!plaining or simulating other aspects of mind. A somewhat formal way of putting this
Separability Hypothesis is that the knowledge state transformation account can be
factored off as a homomorphism of the mental process account. hat is' If the mental
process can be seen as a se"uence of transformations' M1 (()M2 (()..., where Mi is the
complete mental state, and the transformation function *the function that is responsible
for state changes+ is F, then a subprocess K1 (() K2 ((). . . can be identified such that
each Ki is a knowledge state and a component of the corresponding Mi, the
transformation function is f, and f is some kind of homomorphism of F. A study of
intelligence alone can restrict itself to a characteri,ation of K-s and f, without producing
accounts of M-s and F. If cognition is in fact separable in this sense, we can in principle
design machines that implement f and whose states are interpretable as K-s. We can call
such machines cognitive agents, and attribute intelligence to them. %owever, the states of
such machines are not necessarily interpretable as complete M-s, and thus they may be
denied other attributes of mental states.
B. Dimension 2: Functional versus Biological
he second dimension in discussions about intelligence involves the e!tent to which we
need to be tied to biology for understanding intelligence. .an intelligence be
characteri,ed abstractly as a functional capability which just happens to be reali,ed more
or less well by some biological organisms$ If it can, then study of biological brains, of
human psychology, or of the phenomenology of human consciousness is not logically
necessary for a theory of cognition and intelligence, just as en"uiries into the relevant
capabilities of biological organisms are not needed for the abstract study of logic and
arithmetic or for the theory of flight. /f course, we may learn something from biology
about how to practically implement intelligent systems, but we may feel "uite free to
substitute non(biological *both in the sense of architectures which are not brain(like and
in the sense of being un( constrained by considerations of human psychology+ approaches
for all or part of our implementation. Whether intelligence can be characteri,ed abstractly
as a functional capability surely depends upon what phenomena we want to include in
defining the functional capability, as we discussed. We might have different constraints
on a definition that needed to include emotion and subjective states than one that only
included knowledge states. .learly, the enterprise of AI deeply depends upon this
functional view being true at some level, but whether that level is abstract logical
representations as in some branches of AI, 0arwinian neural group selections as proposed
by 1delman, something intermediate, or something physicalist is still an open "uestion.
III. Architectures for Intelligence
We now move to a discussion of architectural proposals within the information
processing perspective. /ur goal is to try to place the multiplicity of proposals into
perspective. As we review various proposals, we will present some judgements of our
own about relevant issues. 2ut first, we need to review the notion of an architecture and
make some additional distinctions.
A. Form an !ontent Issues in Architectures
In computer science, a programming language corresponds to a virtual architecture. A
specific program in that language describes a particular *virtual+ machine, which then
responds to various inputs in ways defined by the program. he architecture is thus what
3ewell calls the fi!ed structure of the information processor that is being analy,ed, and
the program specifies a variable structure within this architecture. We can regard the
architecture as the form and the program as the content, which together fully instantiate a
particular information processing machine. We can e!tend these intuitions to types of
machines which are different from computers. 4or e!ample, the connectionist
architecture can be abstractly specified as the set 5536, 5nI6, 5n/6, 5,i6, 5wij66, where
536 is a set of nodes, 5nI6 and 5n/6 are subsets of 536 called input and output nodes
respectively, 5,i6 are the functions computed by the nodes, and 5wij6 is the set of
weights between nodes. A particular connectionist machine is then instantiated by the
"program" that specifies values for all these variables.
We have discussed the prospects for separating intelligence *a knowledge state process+
from other mental phenomena, and also the degree to which various theories of
intelligence and cognition balance between fidelity to biology versus functionalism. We
have discussed the sense in which alternatives such as logic, decision tree algorithms, and
connectionism are all alternative languages in which to couch an information processing
account of cognitive phenomena, and what it means to take a 7nowledge 8evel stance
towards cognitive phenomena. We have further discussed the distinction between form
and content theories in AI. We are now ready to give an overview of the issues in
cognitive architectures. We will assume that the reader is already familiar in some
general way with the proposals that we discussing. /ur goal is to place these ideas in
perspective.
B. Intelligence as "ust !omputation
&ntil recently the dominant paradigm for thinking about information processing has been
the uring machine framework, or what has been called the discrete symbol system
approach. Information processing theories are formulated as algorithms operating on data
structures. In fact AI was launched as a field when uring proposed in a famous paper
that thinking was computation of this type *the term "artificial intelligence" itself was
coined later+ . 3atural "uestions in this framework would be whether the set of
computations that underlie thinking is a subset of uring(computable functions, and if so
how the properties of the subset should be characteri,ed.
9ost of AI research consists of algorithms for specific problems that are associated with
intelligence when humans perform them. Algorithms for diagnosis, design, planning, etc.,
are proposed, because these tasks are seen as important for an intelligent agent. 2ut as a
rule no effort is made to relate the algorithm for the specific task to a general architecture
for intelligence. While such algorithms are useful as technologies and to make the point
that several tasks that appear to re"uire intelligence can be done by certain classes of
machines, they do not give much insight into intelligence in general.
!. Architectures for Deliberation
%istorically most of the intuitions in AI about intelligence have come from introspections
about the relationships between conscious thoughts. We are aware of having thoughts
which often follow one after another. hese thoughts are mostly couched in the medium
of natural language, although sometimes thoughts include mental images as well. When
people are thinking for a purpose, say for problem solving, there is a sense of directing
thoughts, choosing some, rejecting others, and focusing them towards the goal. Activity
of this type has been called "deliberation." 0eliberation, for humans, is a coherent goal(
directed activity, lasting over several seconds or longer. 4or many people thinking is the
act of deliberating in this sense. We can contrast activities in this time span with other
cognitive phenomena, which, in humans, take under a few hundred milliseconds, such as
real(time natural language understanding and generation, visual perception, being
reminded of things, and so on. hese short time span phenomena are handled by what we
will call the subdeliberative architecture, as we will discuss later.
:esearchers have proposed different kinds of deliberative architectures, depending upon
which kind of pattern among conscious thoughts struck them. wo groups of proposals
about such patterns have been influential in AI theory(making' the reasoning view and
the goal(subgoal view.
1. Deliberation as #easoning
;eople have for a long time been struck by logical relations between thoughts and have
made the distinction between rational and irrational thoughts. :emember that 2oole-s
book on logic was titled "8aws of hought." houghts often have a logical relation
between them' we think thoughts A and 2, then thought ., where . follows from A and
2. In AI, this view has given rise to an ideali,ation of intelligence as rational thought, and
conse"uently to the view that the appropriate architecture is one whose behavior is
governed by rules of logic. In AI, 9c.arthy is mostly closely identified with the logic
approach to AI, and <9c.arthy and %ayes, =>?>@ is considered a clear early statement of
some of the issues in the use of logic for building an intelligent machine.
:esearchers in AI disagree about how to make machines which display this kind of
rationality. /ne group proposes that the ideal thought machine is a logic machine, one
whose architecture has logical rules of inference as its primitive operators. hese
operators work on a storehouse of knowledge represented in a logical formalism and
generate additional thoughts. 4or e!ample, the Aapanese 4ifth generation project came up
with computer architectures whose performance was measured in *millions of+ inferences
per second. he other group believes that the architecture itself *i.e, the mechanism that
generates thoughts+ is not a logic machine, but one which generates plausible, but not
necessarily correct, thoughts, and then knowledge of correct logical patterns is used to
make sure that the conclusion is appropriate.
%istorically rationality was characteri,ed by the rules of deduction, but in AI, the notion
is being broadened to include a host of non(deductive rules under the broad umbrella of
"non(monotonic logic" <9c.arthy, =>BC@ or "default reasoning," to capture various
plausible reasoning rules. here is considerable difference of opinion about whether such
rules e!ist in a domain(independent way as in the case of deduction, and how large a set
of rules would be re"uired to capture all plausible reasoning behaviors. If the number of
rules is very large, or if they are conte!t(dependent in complicated ways, then logic
architectures would become less practical.
At any point in the operation of the architecture, many inference rules might be applied to
a situation and many inferences drawn. his brings up the control issue in logic
architectures, i.e., decisions about which inference rule should be applied when. 8ogic
itself provides no theory of control. he application of the rule is guaranteed, in the logic
framework, to produce a correct thought, but whether it is relevant to the goal is decided
by considerations e!ternal to logic. .ontrol tends to be task(specific, i.e., different types
of tasks call for different strategies. hese strategies have to be e!plicitly programmed in
the logic framework as additional knowledge.
2. Deliberation as $oal%&ubgoaling
An alternate view of deliberation is inspired by another perceived relation between
thoughts and provides a basic mechanism for control as part of the architecture. houghts
are often linked by means of a goal-subgoal relation. 4or e!ample, you may have a
thought about wanting to go to 3ew 0elhi, then you find yourself having thoughts about
taking trains and airplanes, and about which is better, then you might think of making
reservations and so on. 3ewell and Simon <=>DE@ have argued that this relation between
thoughts, the fact that goal thoughts spawn subgoal thoughts recursively until the
subgoals are solved and eventually the goals are solved, is the essence of the mechanism
of intelligence. 9ore than one subgoal may be spawned, and so backtracking from
subgoals that didn-t work out is generally necessary. 0eliberation thus looks like search
in a problem space. Setting up the alternatives and e!ploring them is made possible by
the knowledge that the agent has. In the travel e!ample above, the agent had to have
knowledge about different possible ways to get to 3ew 0elhi, and knowledge about how
to make a choice between alternatives. A long term memory is generally proposed which
holds the knowledge and from which knowledge relevant to a goal is brought to play
during deliberation. his analysis suggests an architecture for deliberation that retrieves
relevant knowledge, sets up a set of alternatives to e!plore *the problem space+, e!plores
it, sets up subgoals, etc.
he most recent version of an architecture for deliberation in the goal(subgoal framework
is Soar <3ewell, =>>C@. Soar has two important attributes. he first is that any difficulty it
has in solving any subgoal simply results in the setting up of another subgoal, and
knowledge from long term memory is brought to bear in its solution. It might be
remembered that 3ewell-s definition of intelligence is the ability to reali,e the knowledge
level potential of an agent. 0eliberation and goal(subgoaling are intended to capture that
capability' any piece of knowledge in long term memory is available, if it is relevant, for
any goal. :epeated subgoaling will bring that knowledge to deliberation. he second
attribute of Soar is that it "caches" its successes in problem solving in its long term
memory. he ne!t time there is a similar goal, that cached knowledge can be directly
used, instead of searching again in the corresponding problem space.
his kind of deliberative architecture confers on the agent the potential for rationality in
two ways. With the right kind of knowledge, each goal results in plausible and relevant
subgoals being setup. Second, "logical rules" can be used to verify that the proposed
solution to subgoals is indeed correct. 2ut such rules of logic are used as pieces of
knowledge rather than as operators of the architecture itself. 2ecause of this, the
verification rules can be conte!t( and domain(dependent.
/ne of the results of this form of deliberation is the construction of special purpose
algorithms or methods for specific problems. hese algorithms can be placed in an
e!ternal computational medium, and as soon as a subgoal arises that such a method or
algorithm can solve, the e!ternal medium can solve it and return the results. 4or e!ample,
during design, an engineer might set up the subgoal of computing the ma!imum stress in
a truss, and invoke a finite element method running on a computer. he deliberative
engine can thus create and invoke computational algorithms. he goal(subgoaling
architecture provides a natural way to integrate e!ternal algorithms.
In the Soar view, long term memory is just an associative memory. It has the capability to
"recogni,e" a situation and retrieve the relevant pieces of knowledge. 2ecause of the
learning capability of the architecture, each episode of problem solving gives rise to
continuous improvement. As a problem comes along, some subtasks are solved by
e!ternal computational architectures which implement special purpose algorithms, while
others are directly solved by compiled knowledge in memory, while yet others are solved
by additional deliberation. his cycle make the overall system increasingly more
powerful. 1ventually, most routine problems, including real(time understanding and
generation of natural language, are solved by recognition. *:ecent work by ;atten
<;atten, et al, =>>E@ on the use of compiled knowledge in natural language understanding
is compatible with this view.+
0eliberation seems to be a source of great power in humans. Why isn-t recognition
enough$ As 3ewell points out, the particular advantage of deliberation is distal access to
and combination of knowledge at run(time in a goal(specific way. In the deliberative
machine, temporary connections are created between pieces of knowledge that are not
hard(coded, and that gives it the ability to reali,e the knowledge level potential more. A
recognition architecture uses knowledge less effectively' if the connections are not there
as part of the memory element that controls recognition, a piece of knowledge, though
potentially relevant, will not be utili,ed in the satisfaction of a goal.
As an architecture for deliberation, the goal(subgoal view seems to us closer to the mark
than the reasoning view. As we have argued elsewhere <.handrasekaran, =>>=@, logic
seems more appropriate for justification of conclusions and as the framework for the
semantics of representations than for the generative architecture.
AI theories of deliberation give central importance to human(level problem solving and
reasoning. Any continuity with higher animal cognition or brain structure is at the level
of the recognition architecture of memory, about which this view says little other than
that it is a recognition memory. 4or supporting deliberation at the human level, long term
memory should be capable of storing and generating knowledge with the full range of
ontological distinctions that human language has.
'. Is the &earch (ie) of Deliberation Too *arro)?
A criticism of this picture of deliberation as a search architecture is that it is based on a
somewhat narrow view of the function of cognition. It is worth reviewing this argument
briefly.
Suppose a 9artian watches a human in the act of multiplying numbers. he human,
during this task, is e!ecuting some multiplication algorithm, i.e., appears to be a
multiplication machine. he 9artian might well return to his superiors and report that the
human cognitive architecture is a multiplication machine. We, however, know that the
multiplication architecture is a fleeting, evanescent virtual architecture that emerged as an
interaction between the goal *multiplication+ and the procedural knowledge of the human.
With a different goal, the human might behave like a different machine. It would be
awkward to imagine cognition to be a collection of different architectures for each such
taskF in fact, cognition is very plastic and is able to emulate various virtual machines as
needed.
Is the problem space search engine that has been proposed for the deliberative
architecture is also an evanescent machine$ /ne argument against it is that it is not
intended for a narrow goal like multiplication, but for all kinds of goals. hus it is not
fleeting, but always operational.
/r is it$ If the sole purpose of the cognitive architecture is goal achievement *or "problem
solving"+, then it is reasonable to assume that the architecture would be hard(wired for
this purpose. What, however, if goal achievement is only one of the functions of the
cognitive architecture, common though it might be$ At least in humans, the same
architecture is used to daydream, just take in the e!ternal world and enjoy it, and so on.
he search behavior that we need for problem solving can come about simply by virtue
of the knowledge that is made available to the agent-s deliberation from long term
memory. his knowledge is either a solution to the problem, or a set of alternatives to
consider. he agent, faced with the goal and a set of alternatives, simply considers the
alternatives in turn, and when additional subgoals are set, repeats the process of seeking
more knowledge. In fact, this kind of search behavior happens not only with individuals,
but with organi,ations. hey too e!plore alternatives, but yet we don-t see a need for a
fi!ed search engine for e!plaining organi,ational behavior. 0eliberation of course has to
have the right sort of properties to be able to support search. .ertainly ade"uate working
memory needs to be there, and probably there are other constraints on deliberation.
%owever, the architecture for deliberation does not have to be e!clusively a search
architecture. Aust like the multiplication machine was an emergent architecture when the
agent was faced with that task, the search engine could be the corresponding emergent
architecture for the agent faced with a goal and e"uipped with knowledge about what
alternatives to consider. In fact, a number of other such emergent architectures built on
top of the deliberative architecture have been studied earlier in our work on Generic ask
architectures <=>B?@. hese architectures were intended to capture the needs for specific
classes of goals *such as classification+.he above argument is not to deemphasi,e the
importance of problem space search for goal achievement, but to resist the identification
of the architecture of the conscious processor with one e!clusively intended for search.
he problem space architecture is still important as the virtual architecture for goal(
achieving, since it is a common, though not the only, function of cognition.
/f course, that cognition goes beyond just goal achievement is a statement about human
cognition. his is to take a biological rather than a functional standard for the ade"uacy
of an architectural proposal. If we take a functional approach and seek to specify an
architecture for a function called intelligence which itself is defined in terms of goal
achievement, then a deliberative search architecture working with a long term memory of
knowledge certainly has many attractive properties for this function, as we have
discussed.
D. &ubeliberative Architectures
We have made a distinction between cognitive phenomena that take less than a few
hundred milliseconds for completion and those that evolve over longer time spans. We
discussed proposals for the deliberative architecture to account for phenomena taking
longer time spans. Some form of subdeliberative architecture is then responsible for
phenomena that occur in very short time spans in humans. In deliberation, we have access
to a number of intermediate states in problem solving. After you finished planning the
3ew 0elhi trip, I can ask you what alternatives you considered, why you rejected taking
the train, and so on, and your answers to them will generally be reliable. Hou were
probably aware of rejecting the train option because you reasoned that it would take too
long. /n the other hand, we have generally no clue to how the subdeliberative
architecture came to any of its conclusions.
9any people in AI and cognitive science feel that the emphasis on comple! problem
solving as the door to understanding intelligence is misplaced, and that theories that
emphasi,e rational problem solving only account for very special cases and do not
account for the general cognitive skills that are present in ordinary people. hese
researchers focus almost completely on the nature of the subdeliberative architecture.
here is also a belief that the subdeliberative architecture is directly reflected in the
structure of the neural machinery in the brain. hus, some of the proposals for the
subdeliberative architecture claim to be inspired by the structure of the brain and claim a
biological basis in that sense.
1. Alternative +roposals
he various proposals differ along a number of dimensions' what kinds of tasks the
architecture performs, degree of parallelism, whether it is an information processing
architecture at all, and, when it is taken to be an information processing architecture,
whether it is a symbolic one or some other type.
With respect to the kind of tasks the architecture performs, we mentioned 3ewell-s view
that it is just a recognition architecture. Any smartness it possesses is a result of good
abstractions and good inde!ing, but architecturally, there is nothing particularly
complicated. In fact, the good abstractions and inde!ing themselves were the result of the
discoveries of deliberation during problem state search. he real solution to the problem
of memory, for 3ewell, is to get chunking done right' the proper level of abstraction,
labeling and inde!ing is all done at the time of chunking. In contrast to the recognition
view are proposals that see relatively comple! problem solving activities going on in
subdeliberative cognition. .ognition in this picture is a communicating collection of
modular agents, each of whom is simple, but capable of some degree of problem solving.
4or e!ample, they can use the means(ends heuristic *the goal(subgoaling feature of
deliberation in the Soar architecture+.
0eliberation has a serial character to it. Almost all proposals for the subdeliberative
architecture, however, use parallelism in one way or another. ;arallelism can bring a
number of advantages. 4or problems involving similar kinds of information processing
over somewhat distributed data *like perception+, parallelism can speed up processing.
&ltimately, however, additional problem solving in deliberation may be re"uired for
some tasks.
2. &ituate !ognition
:eal cognitive agents are in contact with the surrounding world containing physical
objects and other agents. A new school has emerged calling itself the situated cognition
movement which argues that traditional AI and cognitive science abstract the cognitive
agent too much away from the environment, and place undue emphasis on internal
representations. he traditional internal representation view leads, according to the
situated cognition perspective, to large amounts of internal representation and comple!
reasoning using these representations. :eal agents simply use their sensory and motor
systems to e!plore the world and pick out the information needed, and get by with much
smaller amounts of internal representation processing. At the minimum, situated
cognition is a proposal against e!cessive "intellection." In this sense, we can simply view
this movement as making different proposals about what and how much needs to be
represented internally. he situated cognition perspective clearly rejects the former view
with respect to internal *sub(deliberative+ processes, but accepts the fact deliberation does
contain and use knowledge. hus the 7nowledge 8evel description could be useful to
describe the content of agent-s deliberation.
(. !oncluing #emarks
We started by asking how far intelligence or cognition can be separated from mental
phenomena in general. We suggested that the problem of an architecture for cognition is
not really well(posed, since, depending upon what aspects of the behavior of biological
agents are included in the functional specification, there can be different constraints on
the architecture. We reviewed a number of issues and proposals relevant to cognitive
architectures. 3ot only are there many levels each e!plaining some aspect of cognition
and mentality, but the levels interact even in relatively simple cognitive phenomena.

You might also like