Multi-Robot Decision Making Using Coordination Graphs
Jelle R. Kok
Matthijs T. J. Spaan
Nikos Vlassis
Intelligent Autonomous Systems Group, Informatics Institute
Faculty of Science, University of Amsterdam, The Netherlands
{jellekok,mtjspaan,vlassis}@science.uva.nl
Abstract
Within a group of cooperating agents the decision
making of an individual agent depends on the actions
of the other agents. In dynamic environments, these
dependencies will change rapidly as a result of the
continuously changing state. Via a context-specific
decomposition of the problem into smaller subproblems, coordination graphs offer scalable solutions to
the problem of multiagent decision making. We will
apply coordination graphs to the continuous domain
by assigning roles to the agents and then coordinating
the different roles. Finally, we will demonstrate this
method in the RoboCup soccer simulation domain.
Introduction
In this paper we will describe a framework to coordinate multiple robots using coordination graphs (CGs). We
assume a group of robotic agents that are embedded
in a continuous and dynamic domain and are able to
perceive their surroundings with sensors. The continuous nature of the state space makes the direct application of context-specific CGs difficult. To alleviate
the problem, we propose a discretization of the state
by assigning roles to the agents, and subsequently apply the CG-based method to the derived set of roles.
It turns out that such an approach offers additional
benefits: the set of roles allows for the definition of
natural coordination rules that exploit prior knowledge about the domain. This greatly simplifies the
modeling and the solution of the problem at hand.
The setup of the paper is as follows. In Section 2
we review the coordination problem from a game-theoretic point of view, and in Section 3 we explain
the concept of a CG. In Section 4 we will describe
our framework to coordinate agents in a continuous
dynamic environment using roles, followed by an extensive example using the RoboCup soccer simulation
domain in Section 5. Finally, we give our conclusions
and discuss possible further extensions in Section 6.
Figure 1: An example coordination graph for a problem with four agents G1, G2, G3, and G4.
Coordination graphs
A coordination graph (CG) represents the coordination requirements of a system [4]. A node in the
graph represents an agent, while an edge in the graph
defines a (possibly directed) dependency between two
agents. Only interconnected agents have to coordinate their actions at any particular instance. Figure 1
shows a possible CG for a 4-agent problem. In this
example, G2 has to coordinate with G1 , G4 has to coordinate with G3 , G3 has to coordinate with both G4
and G1 , and G1 has to coordinate with both G2 and
G3 . When the global payoff function can be decomposed as a sum of local payoff functions, the global
coordination problem can be replaced by a number of
easier local coordination problems. The agents can
then find the joint optimal action by using an efficient
variable elimination algorithm in combination with a
message passing scheme [4].
The algorithm assumes that each agent knows its
neighbors in the graph (but not necessarily their payoff functions, which might depend on other agents).
Each agent is eliminated from the graph by solving a local optimization problem that involves only its own action and the actions of its neighbors: the agent collects all payoff functions in which it is involved, maximizes over its own action for every possible action combination of its neighbors, and communicates the resulting conditional payoff function to one of its neighbors. When the last agent has computed its optimal action, a second pass in the reverse order fixes the action of every agent.
In a context-specific CG, the dependencies are specified compactly using value rules: propositions over state and action variables together with a payoff that is added to the global payoff when the proposition holds [5]. Consider for example the rule
⟨ in-front-of-same-door(G1, G2) ∧ a1 = enterDoor ∧ a2 = enterDoor : −50 ⟩
This rule indicates that when the two agents are located in front of the same door and both select the
same action (entering the door), the global payoff
value will be reduced by 50. When the state is not
consistent with the above rule (because the agents are
not located in front of the same door), the rule does
not apply and the agents do not have to coordinate
their actions. By conditioning on the current state
the agents can discard all irrelevant rules, and as a
consequence the CG is dynamically updated and simplified. Each agent thus only needs to observe that
part of the state mentioned in its value rules.
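To make this concrete, the following minimal Python sketch shows one possible representation of the rule above and of the conditioning step; the data layout and the function names are our own illustration, not part of the framework in [5]:

```python
# One possible encoding of a value rule: a context predicate over the
# state, the action conditions, and the payoff added when it fires.
door_rule = (
    lambda s: s["in_front_of_same_door"],      # context on the state
    {"G1": "enterDoor", "G2": "enterDoor"},    # coordinated actions
    -50,                                       # payoff contribution
)

def relevant_rules(rules, state):
    """Discard all rules whose context does not hold in the current
    state; only the remaining rules induce coordination dependencies."""
    return [r for r in rules if r[0](state)]

# No shared door nearby: the rule is discarded, so G1 and G2 do not
# need to coordinate and the graph loses the corresponding edge.
print(relevant_rules([door_rule], {"in_front_of_same_door": False}))  # []
```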
For a more extensive example, see Figure 2. Beneath the left graph all value rules, defined over binary
action and context variables, are depicted together
with the agent each rule applies to. The coordination
dependencies between the agents are represented by
directed edges, where each (child) agent has an incoming edge from the (parent) agent that affects its
decision. After the agents observe the current state,
x = true, the last rule does not apply anymore and
can be removed. As a consequence, the dependency between G3 and G4 disappears, the edge between them can be deleted, and the optimal joint action can be computed on the simplified graph.
Figure 2: The initial coordination graph for agents G1–G4 (left), with the value rules ⟨a1 ∧ a3 ∧ x : 4⟩, ⟨a1 ∧ a2 ∧ x : 5⟩, ⟨a2 ∧ x : 2⟩, ⟨a3 ∧ a2 ∧ x : 5⟩, and ⟨a3 ∧ a4 ∧ ¬x : 10⟩, and the simplified graph after conditioning on x = true (right), with the remaining rules ⟨a1 ∧ a3 : 4⟩, ⟨a1 ∧ a2 : 5⟩, ⟨a2 : 2⟩, and ⟨a3 ∧ a2 : 5⟩.
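As an illustration, the following Python sketch runs the variable elimination algorithm on these four conditioned rules; the table-based representation, the elimination order, and the tie-breaking are our own choices:

```python
from itertools import product

# A payoff function is (scope, table): scope is a tuple of agent ids,
# table maps a tuple of their boolean actions to a payoff.
def rule(scope, value):
    # A value rule pays `value` when all actions in its scope are True.
    return (scope, {acts: value if all(acts) else 0
                    for acts in product([False, True], repeat=len(scope))})

# The four rules of Figure 2 after conditioning on x = true.
funcs = [rule((1, 3), 4), rule((1, 2), 5), rule((2,), 2), rule((2, 3), 5)]

def eliminate(funcs, agent):
    # Max out `agent`: merge the payoff functions it appears in into a
    # new function over its neighbours and record its best response.
    involved = [f for f in funcs if agent in f[0]]
    rest = [f for f in funcs if agent not in f[0]]
    nbrs = tuple(sorted({a for s, _ in involved for a in s} - {agent}))
    new_table, best = {}, {}
    for ctx in product([False, True], repeat=len(nbrs)):
        def local(act):
            joint = dict(zip(nbrs, ctx))
            joint[agent] = act
            return sum(t[tuple(joint[a] for a in s)] for s, t in involved)
        best[ctx] = max([False, True], key=local)
        new_table[ctx] = local(best[ctx])
    return rest + [(nbrs, new_table)], (nbrs, best)

# Forward pass: eliminate the agents one by one.
strategies, remaining = [], funcs
for agent in (3, 2, 1):
    remaining, strat = eliminate(remaining, agent)
    strategies.append((agent, strat))

# Backward pass: fix the actions in reverse elimination order.
joint = {}
for agent, (nbrs, best) in reversed(strategies):
    joint[agent] = best[tuple(joint[a] for a in nbrs)]
print(joint)  # {1: True, 2: True, 3: True}: global payoff 4+5+2+5 = 16
```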
Coordination in continuous domains
We are interested in problems that involve multiple robots that are embedded in a continuous domain, have sensors with which they can observe their
surroundings, and need to coordinate their actions.
As a main example we will use the RoboCup simulation soccer domain (see [3] and references therein), in
which a team of eleven agents has to fulfill a common goal.
Experiments
We have implemented this framework in our simulation robot soccer team UvA Trilearn [3] to improve the passing of the ball between teammates. The
RoboCup soccer server [2] provides a fully distributed
dynamic multi-robot domain with both teammates
and adversaries. It models many real-world complexities such as noise in object movement, noisy sensors
and actuators, limited physical ability and restricted
communication.
The RoboCup soccer simulation does not allow
agents to communicate with more than one agent at
the same time, which makes it impossible to apply the
original variable elimination algorithm. Therefore, we
have decided to make the state fully observable to all
agents. This makes communication superfluous, since
each agent can model the complete variable elimination algorithm by itself (see [6] for details). This has
no effect on the outcome of the algorithm.
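Schematically, the communication-free setup amounts to every agent running the identical, deterministic computation and executing only its own part of the result. The wrapper below is a hypothetical sketch; `solve` stands for the conditioning and elimination steps sketched earlier:

```python
def act(agent_id, state, value_rules, solve):
    # With a fully observable state every agent computes the SAME
    # joint action (solve must be deterministic, including its
    # tie-breaking), so no communication is needed.
    joint_action = solve(value_rules, state)
    return joint_action[agent_id]  # execute only our own component
```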
In the non-coordinating case a teammate moves to
the interception point only after he has observed a
change in the ball velocity (after someone has passed
the ball) and concludes that he is the fastest teammate
to the ball. Before the ball changes velocity, he has
no notion of the fact that he will soon receive the ball
and does not coordinate with the passing player.
To accomplish coordination, all agents are first dynamically assigned a role based on the current state.
Next, these roles are coordinated by performing the
variable elimination algorithm using predefined value
rules that make use of the available actions and context variables. Hereafter, we will describe in more
detail how we use coordination graphs to coordinate the passer with the receiver, and also the receiver with the second receiver, that is, the player to whom the first receiver
will pass the ball next.
First, we have implemented a role assignment
function that assigns the roles interceptor, passer,
receiver, and passive among the agents using the
continuous state information. The assignment of roles
can be computed directly from the current state information. For instance, the fastest player to the ball will be assigned the interceptor role. A toy sketch of such a state-based assignment follows.
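This sketch is purely illustrative: the criteria, the threshold, and the helper names are our own assumptions, not the actual UvA Trilearn assignment function:

```python
import math

def assign_roles(positions, ball):
    # Toy assignment: the player at the ball becomes the passer,
    # otherwise the closest player becomes the interceptor; the most
    # advanced remaining teammate becomes the receiver, the rest are
    # passive. (In this toy, "fastest to the ball" is just "closest".)
    def dist(i):
        return math.hypot(positions[i][0] - ball[0],
                          positions[i][1] - ball[1])

    roles = {i: "passive" for i in positions}
    closest = min(positions, key=dist)
    roles[closest] = "passer" if dist(closest) < 0.7 else "interceptor"
    others = sorted((i for i in positions if i != closest),
                    key=lambda i: -positions[i][0])  # x points at the goal
    if others:
        roles[others[0]] = "receiver"
    return roles

positions = {1: (0.0, 0.0), 2: (10.0, 5.0), 3: (20.0, -3.0)}
print(assign_roles(positions, ball=(0.3, 0.0)))
# {1: 'passer', 2: 'passive', 3: 'receiver'}
```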
The value rules are then defined over the set of roles rather than over the individual agents.² For the passer (agent i) we use the following rules:¹

⟨ p1^passer ; has-role-receiver(j) ∧ ¬isPassBlocked(i, j, dir) ∧ ai = passTo(j, dir) ∧ aj = moveTo(dir) : u(j, dir) ⟩, j ≠ i
⟨ p2^passer ; is-empty-space(i, n) ∧ ai = dribble(n) : 30 ⟩
⟨ p3^passer ; ai = clearBall : 10 ⟩
⟨ p4^passer ; is-in-front-of-goal(i) ∧ ai = score : 100 ⟩

For a receiver (agent i), the rules coordinate it both with the player that controls (or will control) the ball and with the other receiver:

⟨ p5^receiver ; has-role-interceptor(j) ∧ ¬isPassBlocked(j, i, dir) ∧ ai = moveTo(dir) : u(i, dir) ⟩, j ≠ i
⟨ p6^receiver ; has-role-passer(j) ∧ has-role-receiver(k) ∧ ¬isPassBlocked(k, i, dir) ∧ aj = passTo(k, dir2) ∧ ak = moveTo(dir2) ∧ ai = moveTo(dir) : u(i, dir) ⟩, j, k ≠ i
⟨ p7^receiver ; ai = moveToStratPos : 10 ⟩

The rules for the interceptor and the passive players do not depend on the actions of the other agents:

⟨ p8^interceptor ; ai = intercept : 100 ⟩
⟨ p9^passive ; ai = moveToStratPos : 10 ⟩

The payoff u(i, dir) is a heuristic measure of how promising a pass in direction dir to agent i is; by changing these payoffs one can change the complete strategy of the team when playing different kinds of opponents.

¹ North is directed towards the opponent goal and center corresponds to a pass directly to the current agent position.
² Note that we enumerate all rules using variables. The complete list of value rules is the combination of all possible instantiations of these variables. In all rules, dir ∈ D.
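Following footnote 2, each rule template is expanded into one concrete value rule per instantiation of its variables. A minimal sketch; the direction set D and the encoding are assumed by us:

```python
from itertools import product

# Assumed direction set D (the text only fixes north = toward the
# opponent goal and center = a pass to the agent's current position).
D = ["n", "ne", "e", "se", "s", "sw", "w", "nw", "center"]

def instantiate_p1(i, agents):
    # All concrete instances of the passer template p1 for agent i:
    # one rule per teammate j != i and per direction dir in D.
    for j, d in product(agents, D):
        if j != i:
            yield {"context": [("has-role-receiver", j),
                               ("not-isPassBlocked", i, j, d)],
                   "actions": {i: ("passTo", j, d), j: ("moveTo", d)},
                   "value": ("u", j, d)}

print(sum(1 for _ in instantiate_p1(1, [1, 2, 3])))  # 2 teammates x 9 = 18
```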
The above rules contain many context-dependencies, represented in the state variables. In
Figure 3 we have simplified the coordination graph by
conditioning on the roles. If we now condition further on the specific context variables, we obtain the
graph depicted in Figure 4, corresponding to the
following value rules (for simplicity we assume that
only the context variables ¬isPassBlocked(1, 2, s),
¬isPassBlocked(2, 3, nw), and is-empty-space(1, n) are true):
G1: ⟨ p1^passer ; a1 = passTo(2, s) ∧ a2 = moveTo(s) : 50 ⟩
    ⟨ p2^passer ; a1 = dribble(n) : 30 ⟩
    ⟨ p3^passer ; a1 = clearBall : 10 ⟩
G2: ⟨ p7^receiver ; a2 = moveToStratPos : 10 ⟩
G3: ⟨ p6^receiver ; a1 = passTo(2, dir) ∧ a2 = moveTo(dir) ∧ a3 = moveTo(nw) : 30 ⟩
    ⟨ p7^receiver ; a3 = moveToStratPos : 10 ⟩
Now the variable elimination algorithm can be performed. Each agent is eliminated from the graph by
maximizing its local payoff. If agent 1 is
eliminated first, it gathers all value rules that contain
a1 and distributes the resulting conditional strategy to its neighbors:
⟨ a2 = moveTo(s) ∧ a3 = moveTo(nw) : 80 ⟩
⟨ a2 = moveTo(s) ∧ ¬(a3 = moveTo(nw)) : 50 ⟩
⟨ ¬(a2 = moveTo(s)) : 30 ⟩

After agents 2 and 3 have been eliminated in the same way and the conditional strategies are propagated back, the resulting joint action is a1 = passTo(2, s), a2 = moveTo(s), and a3 = moveTo(nw), with a global payoff of 80.
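These strategy rules can be verified mechanically. The toy enumeration below (our own encoding, with the p6 rule simplified to the pass direction s that is actually chosen) reproduces the payoffs 80, 50, and 30:

```python
# Candidate actions of the passer after conditioning (Figure 4).
A1 = ["passTo(2,s)", "dribble(n)", "clearBall"]

def payoff(a1, a2, a3):
    total = 0
    if a1 == "passTo(2,s)" and a2 == "moveTo(s)":
        total += 50                       # p1: pass to the first receiver
    if a1 == "dribble(n)":
        total += 30                       # p2: dribble north
    if a1 == "clearBall":
        total += 10                       # p3: clear the ball
    if a1 == "passTo(2,s)" and a2 == "moveTo(s)" and a3 == "moveTo(nw)":
        total += 30                       # p6: second receiver is set up
    return total

# Maximize over a1 for every joint action of the neighbours 2 and 3.
for a2 in ("moveTo(s)", "other"):
    for a3 in ("moveTo(nw)", "other"):
        best = max(A1, key=lambda a1: payoff(a1, a2, a3))
        print(a2, a3, "->", best, payoff(best, a2, a3))
# moveTo(s) moveTo(nw) -> passTo(2,s) 80
# moveTo(s) other      -> passTo(2,s) 50
# other     moveTo(nw) -> dribble(n)  30
# other     other      -> dribble(n)  30
```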
Figure 4: The coordination graph of Figure 3 after conditioning on the context variables. The passer (agent 1)
decides to pass the ball to the first receiver (agent 2),
while the second receiver (agent 3) moves to a good
position for the first receiver to pass the ball to.
Table 1: Results of 10 games against ourselves, with
and without coordination in passing.

                  With             Without
  Wins            5                2
  Draws           3                3
  Losses          2                5
  Avg. score      0.9 (±1.19)      0.2 (±0.42)
  Passing %       82.72 (±2.06)    64.62 (±2.17)
Conclusions
We showed how coordination graphs can be successfully applied to cases where a group of robotic
agents are embedded in a dynamic and continuous domain. We assigned roles in order to abstract from the
continuous state to a discrete context, allowing the application of existing techniques for discrete-state CGs.
Currently, we assume that each agent observes that
part of the state that affects its local decisions and
its role assignment. As future work, we would like
to apply the same framework to domains where the
agents do not observe all required state information.
Possible solutions would be to make the action of the
References
[1] C. Boutilier. Planning, learning and coordination in multiagent decision processes. In Proc. Conf. on Theoretical Aspects of Rationality and Knowledge, 1996.
[2] M. Chen, E. Foroughi, F. Heintz, Z. Huang, S. Kapetanakis, K. Kostiadis, J. Kummeneje, I. Noda, O. Obst, P. Riley, T. Steffens, Y. Wang, and X. Yin. RoboCup Soccer Server for Soccer Server Version 7.07 and later, 2002. At http://sserver.sourceforge.net/.
[3] R. de Boer and J. R. Kok. The incremental development of a synthetic multi-agent system: The UvA Trilearn 2001 robotic soccer simulation team. Master's thesis, University of Amsterdam, The Netherlands, Feb. 2002.
[4] C. Guestrin, D. Koller, and R. Parr. Multiagent planning with factored MDPs. In Advances in Neural Information Processing Systems 14. The MIT Press, 2002.
[5] C. Guestrin, S. Venkataraman, and D. Koller. Context-specific multiagent coordination and planning with factored MDPs. In Proc. 18th National Conf. on Artificial Intelligence (AAAI), Edmonton, Canada, July 2002.
[6] J. R. Kok, M. T. J. Spaan, and N. Vlassis. An approach to noncommunicative multiagent coordination in continuous domains. In M. Wiering, editor, Benelearn 2002: Proceedings of the Twelfth Belgian-Dutch Conference on Machine Learning, pages 46–52, Utrecht, The Netherlands, Dec. 2002.
[7] M. J. Osborne and A. Rubinstein. A Course in Game Theory. MIT Press, 1994.
[8] M. T. J. Spaan, N. Vlassis, and F. C. A. Groen. High level coordination of agents based on multiagent Markov decision processes with roles. In A. Saffiotti, editor, IROS'02 Workshop on Cooperative Robotics, Lausanne, Switzerland, Oct. 2002.