A Review of Probabilistic Macroscopic Models For Swarm Robotic Systems
A Review of Probabilistic Macroscopic Models For Swarm Robotic Systems
Abstract. In this paper, we review methods used for macroscopic modeling and analyzing collective behavior of swarm robotic systems. Although the behavior of an individual robot in a swarm is often characterized by an important stochastic component, the collective behavior
of swarms is statistically predictable and has often a simple probabilistic description. Indeed, we show that a class of mathematical models
that describe the dynamics of collective behavior can be generated using
the individual robot controller as modeling blueprint. We illustrate the
macroscopic modelling methods with the help of a few sample results
gathered in distributed manipulation experiments (collaborative stick
pulling, foraging, aggregation). We compare the models predictions to
results of probabilistic numeric and sensor-based simulations as well as
experiments with real robots. Depending on the assumptions, the metric
used, and the complexity of the models, we show that it is possible to
achieve quantitatively correct predictions.
Vision
exactly the same from a few units to thousands of units; (ii) exibility: units can
be dynamically added or removed, they can be given the ability to reallocate
and redistribute themselves in a self-organized way; (iii) robustness: the resulting
collective system is robust not only through unit redundancy but also through
unit simplicity and an appropriate balance between exploitative and exploratory
behavior.
The main diculty in designing swarm robotic systems with desirable selforganized behavior is understanding the eect individual robot characteristics
have on the collective behavior. In the past, few analysis tools have been available to researchers. Experiments with physical robots are very costly and time
consuming, and systematically studying group behavior is often impractical.
Simulations, such as with embodied simulators [5, 13], attempt to realistically
model the environment, the robots imperfect sensing of and interactions with
it. Though simulations are much faster and much more reliable than experiments,
their results are not easily generalizable. Exhaustive scan of the design parameter space is often required to reach any conclusion. Moreover, simulations do not
scale well with the system size unless computation is performed in parallel,
the greater the number of agents, the longer it takes to obtain results.
Macroscopic modeling and mathematical analysis oer an alternative to experiments and simulations. Using mathematical analysis we can quickly and
eciently study swarm robotic systems, predict their long term behavior, gain
insight into system design: e.g., how individual robot characteristics aect group
behavior. Additionally, mathematical analysis may be used to select parameters
that optimize group performance, prevent instabilities, etc. Finally, results of
analysis can be used as feedback to guide performance-enhancing modications
of the robot controller.
In this paper we survey existing work on modeling collective behavior of
robot swarms with macroscopic models. The robots themselves in these systems
are simple, usually using reactive control: robots decide about future actions
based solely on input from sensors (including communication with other robots)
and the action they are currently executing. They do not rely on abstract representation, planning, or higher order reasoning functions. Such robots can be
represented as stochastic Markov processes. An equation, known as the Rate
Equation, describes the dynamics of their collective behavior. The Rate Equation formalism can be derived from theory of stochastic processes [8], although in
practice, the equations are usually phenomenological and can be easily written
down by considering details of the individual robot controller. The Rate Equation approach has been applied to study several distributed robot systems [14,
10, 7, 11, 1]. Below we review the elements of the mathematical formalism and
illustrate with a few sample results from the robotics domain.
Models can generally be broken into two classes: microscopic and macroscopic. Microscopic descriptions treat the robot as the fundamental unit of the
model. These models describe the robots interactions with other robots and the
environment. Solving or simulating a system composed of many such agents gives
researchers an understanding of the global behavior of the system. Examples
of such microscopic models are reported in [12, 6]; they have been used to
study collective behavior of a swarm of robots engaged in object aggregation and
collaborative pulling. Rather than compute the exact trajectories and sensory
information of individual robots, the robots interactions with other robots and
the environment are modeled as a series of stochastic events, with probabilities
determined by simple geometric considerations and systematic experiments with
one or two real robots. Running several series of stochastic events in parallel, one
for each robot, allows researchers to study the collective behavior of the swarm.
A macroscopic model, on the other hand, directly describes the collective behavior of the robotic swarm. It is computationally ecient because it uses fewer
variables. Macroscopic models have been successfully applied to a wide variety
of problems in physics, chemistry, biology and the social sciences. In these applications, the microscopic behavior of an individual (e.g., a Brownian particle in
a volume of gas or an individual residing in US) is quite complex, often stochastic and only partially predictable, and certainly analytically intractable. Rather
than account for the inherent variability of individuals, scientists model the behavior of some average quantity that represents the system they are studying
(e.g., volume of gas or population of US). Such macroscopic descriptions often
have a very simple form and are analytically tractable. It is important to remember that such models do not reproduce the results of a single experiment
rather, the behavior of some observable averaged over many experiments or
observations. The two description levels are, of course, related: we can start from
the Stochastic Master Equation that describes the evolution of a robots probability density and get the Rate Equation, a macroscopic model, by averaging
it [8]. In most cases, however, Rate Equations are phenomenological in nature,
i.e., not derived from rst principles. Below we show how to formulate the Rate
Equations describing dynamics of a homogeneous robot swarm by examining the
details of individual robot controller.
The Rate Equation is not the only approach to modeling collective behavior. Anderson [2], for example, shows how geometric analysis can be used to
predict distribution of individuals playing spatial participative games from the
microscopic rules each individual is following.
2.1
the unpredictability and incorporate it directly into the robots behavior: e.g.,
the simplest eective policy for obstacle avoidance is for the robot to turn a
random angle and move forward. In summary, the behavior of robots in a swarm
is so complex, it is best described probabilistically, as a stochastic process.
start
searching
homing
pickup
(1)
n
The conditional probabilities dene the transition rates for a Markov process
p(n, t + t|n , t)
.
t0
t
W (n|n ; t) = lim
(2)
(3)
A Markov processs future state depends only on its present state and none of the
past states.
where Nn (t) is the average number of robots in state n at time t. This is the socalled Rate Equation. It is sometimes also written in a discrete form, as a nite
dierence equation that describes the behavior of N (kT ), k being an integer
and T the discretization interval: (N (t + T ) N (t))/T . Equation 3 has the
following interpretation: the number of robots in state n will increase in time
due to transitions to state n from other states, and it will decrease in time due
to the transitions f rom state n to other states.
Rate Equations are deterministic. In stochastic systems, however, they describe the dynamics of average quantities. How closely the average quantities
track the behavior of the actual dynamic variables depends on the magnitude of
uctuations. Usually the larger the system, the smaller are the (relative) uctuations. In a small system, the experiment may be repeated many times to average
out the eect of uctuations. The agreement increases as the size of the system
grows.
2.2
The Rate Equation is a useful tool for mathematical analysis of collective dynamics of robot swarms. To facilitate the analysis, we begin by drawing the
macroscopic state diagram of the system. The collective behavior of the swarm
is captured by an FSA that is functionally identical to the individual robot FSA,
except that each state of the automaton now represents the number of robots
executing that action [10, 7, 11]. Not every microscopic robot behavior need to
become a macroscopic state. In order to keep the model tractable, it is often
useful to coarse-grain it by considering several related actions or behaviors as a
single state. For example, we may take the searching state of robots to consist
of the actions wander in the arena, detect objects and avoid obstacles. When
necessary, the searching state can be split into three states, one for each behavior; however, we are often interested in the minimal model that captures the
important behavior of the system. Coarse-graining presents a way to construct
such a minimal model.
The macroscopic automaton can be directly translated into the Rate Equations. Each state in the automaton becomes a dynamic variable Nn (t), with its
own Rate Equation. Every transition will be accounted for by a term in the
equation: a positive term for the incident (W (n|n )Nn ) arrows and negative
term for the outgoing (W (n |n)Nn ) arrows.
Finding an appropriate mathematical form for the transition rates is the main
challenge in studying real systems. The transition is triggered by some stimulus
be it another robot in a particular state, an object to be picked up, etc. In
order to compute the transition rates, we assume, for simplicity, that robots and
stimuli are uniformly distributed. The transition rates then have the following
form: W (n|n ) M , where M is the environmental stimulus encountered (e.g.,
number of sticks in the arena). The proportionality factor connects the model to
experiments, and it depends on the rate at which a robot detects sticks. It can be
roughly estimated from rst principles (scattering cross section approach [10]),
measured from simulations or experiments with one or two robots, or left as a
model parameter. There will be cases where the uniformity assumption fails: e.g.,
in overcrowded scenarios where robots, depending on their obstacle avoidance
controller, tend to clump, forming robotic clouds [11]. If the transition rates
cannot be calculated from rst principles, it may be expedient to leave them as
parameters of the model and obtain them by tting the model to data.
The Rate Equation has been used to study a variety of distributed robot systems.
Below we illustrate the approach with a few sample results from swarm robotic
experiments, for which a body of experimental and simulations data exists.
3.1
The stick-pulling experiments were carried out to study dynamics of collaboration in robots [6]. The robots task was to locate sticks scattered around the
arena and pull them out of their holes. A single robot cannot complete the task
on its own: rather, when a robot nds a stick, it lifts it partially out of the
hole and waits for a period specied by its gripping time parameter for a second
robot to nd it. If a second robot nds the rst during this time interval, it will
pull the stick out; otherwise, the rst robot releases the stick and returns to the
searching state.
Lerman et al. [10] studied a minimal continuous time model of the system.
A minimal model includes only the salient details of the process it describes.
They found that this model reproduced key experimental observations and qualitatively agreed with results of experiments and simulations (see Figure 2(a)).
Martinoli & Easton [11] formulated a more detailed model based on nite difference equations that accounts for every state in the robot control diagram.
Figure 2 depicts the collaboration rate, the rate at which robots pull sticks
out, as a function of the individual robot gripping time parameter for the minimal
(a) and the detailed (b) models. Figure 2(b) also shows results of embodied and
probabilistic numeric simulations for the same set of parameters. One can see
quantitative agreement already with swarms as small as 8 robots. The minimal
model shows the same qualitative behavior as the more detailed model.
3.2
Mathematical models have been applied to study collective collection experiments (aggregation and foraging). In the aggregation experiments, the task was
to gather small objects in a single cluster starting from a situation where they
were all randomly scattered in an arena [12, 1]. Swarms of robots of dierent
group size, or diering in the sensing and actuation capabilities, were used to
aggregate dierent types of objects. These publications considered both microscopic and macroscopic models as well as a few metrics for measuring the evolution of aggregation (average cluster size, number of clusters, size of the biggest
24 robots
0.5
3
16 robots
8 robots
10
20
30
40
Gripping time parameter
(a)
50
0
0
100
200
300
400
500
600
(b)
Fig. 2. Collaboration rate per robot vs gripping time parameter for dierent robot
group sizes and 16 sticks. (a) Results of the minimal model for 8 (short dash), 16 (long
dash) and 24 (solid line) robots. (b) Results for detailed model (solid lines), embodied
simulations (dotted lines), the microscopic model (dashed lines).
cluster). Figure 3(a) shows the results of macroscopic models predictions compared to realistic embodied simulation for swarm sizes of one and ve robots (see
[1] for details). It is worth nothing that, although certain swarm sizes considered
were extremely small, quantitative agreement between model and realistic simulation was achieved. The authors also report experiments using variable swarm
sizes, by enabling robots to decide whether to continue aggregating the objects
or rest. Also in this scenario, theoretical predictions were extremely faithful not
only in predicting dynamics of aggregation but also the number of active workers
over time.
In foraging experiments, Lerman and Galstyan studied the inuence of physical interference on the swarm performance [7]. Interference is a critical issue in
swarm robotics, in particular in foraging experiments where there is a spatial
bottleneck at the predened home region where the collected objects must
be delivered. When two robots nd themselves within sensing distance of one
another, they will execute obstacle avoidance maneuvers. Because this behavior
takes time, interference decreases robots eciency. Clearly, a single robot working alone is relatively more ecient, because it does not experience interference
from other robots (the larger the swarm, the greater the degree of interference).
However, parallel work helps speed up the foraging process and increases the
system robustness in case of individual robot failures.
Figure 3(b) shows the total time required to complete the task for two dierent interference strengths, as measured by the avoiding time . For both cases
task completion time is minimized for some swarm size and increases for larger
swarms. The greater the eect of interference (larger ), the smaller the optimal
swarm size. Results show good quantitative agreement with embodied simulations with swarms of one to 20 robots.
1600
avoid time = 3s
avoid time = 1s
model (3 s)
1400
model (1 s)
1200
time (s)
1000
800
600
400
200
0
0
10
number of robots
(a)
(b)
Fig. 3. (a) Evolution of the mean cluster size in an arena with 20 objects to gather
and swarms of one and ve robots. Macroscopic model (dashed lines) and embodied
simulation (continuous lines) are compared. (b) Time it takes the swarm of robots
to collect objects in the arena for two dierence interference strengths. Symbols are
results of embodied simulations, while lines give the models predictions.
Discussion
In this paper we have reviewed methods for macroscopically modeling and analyzing the behavior of robot swarms. Our analysis is based on the theory of
distributed stochastic processes, which is applicable to robot swarms because
the behavior of each robot is inherently probabilistic in nature and often not
completely predictable, and its future state depends only on its present state.
Despite the inherent unpredictability, the probabilistic description of the collective behavior is surprisingly simple. We showed that Rate Equations describe
how the average collective system properties change in time. These equations can
be easily written down from the details of the individual robot controller. We
illustrated the formalism by reporting a few sample results from swarm robotics
experiments presented in the past. Analysis yields important insights into the
system, such as what are the important parameters that determine the behavior,
how to optimize swarm performance, etc.
Much work remains to be done in extending stochastic mathematical models
to new domains and overcoming limitations of the current models. For example,
Lerman & Galstyan [9, 4] have moved beyond simple Markov processes to study
distributed systems composed of adaptive robots that can change their behavior
based on their estimates of the global state of the system. Another unexplored
area is in modeling systems in which position has to be taken into account.
Such systems include any that are based on diusing pheromone elds. Another
research direction is to move beyond the mean-eld approximation and develop
exact statistical formulations of problems. Such formulations will enable us to
study directly stochastic eects, including the strength of uctuations.
Acknowledgments
References
1. Agassounon, W., Martinoli, A. and Easton, K. 2004 Macroscopic Modeling of Aggregation Experiments using Embodied Agents in Teams of Constant and TimeVarying Sizes. Special issue on Swarm Robotics, Dorigo, M. and Sahin, E. editors,
Autonomous Robots, 17(2-3):163191.
2. Anderson, C. 2003. Linking Micro- to Macro-level Behavior in the AggressorDefender-Stalker Game, in Workshop on the Mathematics and Algorithms of Social
Insects (MASI-2003), December, 2003, Atlanta, GA.
3. Bonabeau, E., Dorigo M. and Theraulaz, G. 1999. Swarm Intelligence: From Natural
to Articial Systems. Oxford University Press, New York.
4. Galstyan, A. and Lerman, K. 2004. Analysis of a Stochastic Model of Adaptive
Task Allocation in Robots, to appear in Workshop on Engineering Self-Organizing
Systems at AAMAS-2004.
5. Gerkey, B. P., Vaughan, R. T., Stoy, K., Howard, A., Sukhatme G. S., Mataric, M.
J. 2001. Most Valuable Player: A Robot Device Server for Distributed Control, in
Proc. of the IEEE/RSJ International Conference on Intelligent Robots and Systems
(IROS 2001), Wailea, Hawaii, October 29 - November 3, 2001.
6. Ijspeert, A. J., Martinoli, A., Billard, A. and Gambardella L. M. 2001. Collaboration
through the Exploitation of Local Interactions in Autonomous Collective Robotics:
The Stick Pulling Experiment. Autonomous Robots 11(2):149171.
7. Lerman, K. and Galstyan, A. 2002a. Mathematical model of foraging in a group of
robots: Eect of interference. Autonomous Robots, 13(2):127141.
8. Lerman, K. and Galstyan, A. 2002b. Two paradigms for the design of articial
collectives. In Proc. of the First Annual workshop on Collectives and Design of
Complex Systems, NASA-Ames, CA.
9. Lerman, K. and Galstyan, A. 2003. Macroscopic Analysis of Adaptive Task Allocation in Robots. pp. 19511956. In Proc. of the Int. Conf. on Intelligent Robots and
Systems (IROS-2003), Las Vegas, NV.
10. Lerman, K., Galstyan, A., Martinoli, A. and Ijspeert, A. 2001. A macroscopic
analytical model of collaboration in distributed robotic systems. Articial Life
Journal, 7(4):375393.
11. Martinoli, A., Easton, K. and Agassounon, W. 2004. Modeling of Swarm Robotic
Systems: A Case Study in Collaborative Distributed Manipulation. Special Issue
on Experimental Robotics, Siciliano, B., editor, Int. Journal of Robotics Research,
23(4):415436.
12. Martinoli, A., Ijspeert, A. J., and Gambardella, L. M. 1999. A probabilistic model
for understanding and comparing collective aggregation mechanisms. pp. 575584.
In D. Floreano, J.-D. Nicoud, and F. Mondada, editors, LNAI:1674, Springer, New
York, NY.
13. Michel, O. 2004. Webots: Professional Mobile Robot Simulation. Int. J. of Advanced Robotic Systems, 1(1):3942.
14. Sugawara, K. and Sano, M. 1997. Cooperative acceleration of task performance:
Foraging behavior of interacting multi-robots system. Physica D100:343354.