Cmpe536 L8

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 88

Swarm

Intelligence
Outline
• Background
o What is a Swarm Intelligence (SI)?
o Examples from nature
o Origins and Inspirations of SI
• Ant Colony Optimization
• Particle Swarm Optimization
What is a Swarm?

• A loosely structured collection of interacting agents


o Agents:
• Individuals that belong to a group (but are not necessarily identical)
• They contribute to and benefit from the group
• They can recognize, communicate, and/or interact with each other
• The instinctive perception of swarms is a group of
agents in motion – but that does not always have to be
the case.
• A swarm is better understood if thought of as agents
exhibiting a collective behavior
Swarm Intelligence (SI)
• An artificial intelligence (AI) technique based on the
collective behavior in decentralized, self-organized
systems
• Generally made up of agents who interact with each
other and the environment
• No centralized control structures
• Based on group behavior found in nature
Examples of Swarms in
Nature:
• Classic Example: Swarm of Bees
• Can be extended to other similar systems:
o Ant colony
• Agents: ants
o Flock of birds
• Agents: birds
o Traffic
• Agents: cars
o Crowd
• Agents: humans
o Immune system
• Agents: cells and molecules
Two Common SI
Algorithms
Inspiration from swarm intelligence has led to some highly
successful optimisation algorithms.

o Ant Colony (-based) Optimisation – a way to solve


optimisation problems based on the way that ants indirectly
communicate directions to each other.

o Particle Swarm Optimisation — a different way to solve


optimisation problems, based on the swarming behaviour of
several kinds of organisms.
Biologically Inspired
Computation

Ant Colony Optimisation


Ant Colony
Optimization (ACO)
• The study of artificial systems modeled after the
behavior of real ant colonies and are useful in solving
discrete optimization problems
• Introduced in 1992 by Marco Dorigo
o Originally called it the Ant System (AS)
o Has been applied to
• Traveling Salesman Problem (and other shortest path problems)
• Several NP-hard Problems
• It is a population-based metaheuristic used to find
approximate solutions to difficult optimization problems
Overview
“Ant Colony Optimization (ACO) studies
artificial systems that take inspiration
from the behavior of real ant colonies
and which are used to solve discrete
optimization problems.”
-Source: ACO website, http://iridia.ulb.ac.be/~mdorigo/ACO/about.html
Artificial Ants
• A set of software agents
• Stochastic
• Based on the pheromone model
o Pheromones are used by real ants to mark paths. Ants follow these paths (i.e.,
trail-following behaviors)
• Incrementally build solutions by moving on a graph
• Constraints of the problem are built into the heuristics of
the ants
A key concept: Stigmergy
Stigmergy is:
indirect communication via interaction with the environment.

• A problem gets solved bit by bit ..


• Individuals communicate with each other in the above way, affecting what each other does
on the task.
• Individuals leave markers or messages – these don’t solve the problem in themselves, but
they affect other individuals in a way that helps them solve the problem …
Naturally Observed Ant Behavior

All is well in the world of the ant.


Naturally Observed Ant Behavior

Oh no! An obstacle has blocked our path!


Naturally Observed Ant Behavior

Where do we go? Everybody, flip a coin.


Naturally Observed Ant Behavior

Shorter path reinforced.


Stigmergy in Ants
Ants are behaviorally unsophisticated, but collectively they
can perform complex tasks.

Ants have highly developed sophisticated sign-based


stigmergy
o They communicate using pheromones;
o They lay trails of pheromone that can be followed by other
ants.

• If an ant has a choice of two pheromone trails to follow,


one to the NW, one to the NE, but the NW one is
stronger – which one will it follow?
Pheromone Trails
Individual ants lay pheromone trails while travelling from
the nest, to the nest or possibly in both directions.
The pheromone trail gradually evaporates over time.
But pheromone trail strength accumulate with multiple ants
using path.

Food source
Nest
Pheromone Trails continued
Initial state:
E t=0 E t=1 E
no ants

30 ants 30 ants

D 15 ants
D 10 ants
D
15 ants 20 ants
d=1 d = 0.5
τ = 15 τ = 30

H C H C H C
τ = 15 τ = 30
d=1 d = 0.5
15 ants 15 ants 10 ants 20 ants
B B B

30 ants 30 ants

A A A

(a) (b) (c)


Ant Colony Optimisation
Algorithms: Basic Ideas
Ants are agents that:
Move along between nodes in a graph.

They choose where to go based on pheromone strength.

An ant’s path represents a specific candidate solution.

When an ant has finished a solution, pheromone is laid on its path, according
to quality of solution.

This pheromone trail affects behaviour of other ants by `stigmergy’ …


Using ACO
• The optimization problem must be written in the form of a path
finding problem with a weighted graph
• The artificial ants search for “good” solutions by moving on the
graph
o Ants can also build infeasible solutions – which could be helpful in solving some
optimization problems
• The metaheuristic is constructed using three procedures:
o ConstructAntsSolutions
o UpdatePheromones
o DaemonActions
Construct Ants Solutions
• Manages the colony of ants

• Ants move to neighboring nodes of the graph

• Moves are determined by stochastic local decision policies based


on pheromone trails and heuristic information

• Evaluates the current partial solution to determine the quantity of


pheromones the ants should deposit at a given node
Update Pheromones
• Process for modifying the pheromone trails
• Modified by
o Increase
• Ants deposit pheromones on the nodes (or the edges)
o Decrease
• Ants don’t replace the pheromones and they evaporate
• Increasing the pheromones increases the probability of
paths being used (i.e., building the solution)
• Decreasing the pheromones decreases the probability of
the paths being used (i.e., forgetting)
Daemon Actions
• Used to implement larger actions that require more than
one ant

• Examples:
o Perform a local search
o Collection of global information
Applications Of ACO
• Vehicle routing with time window constraints
• Network routing problems
• Assembly line balancing
• Heating oil distribution
• Data mining
E.g. A 4-city TSP
Initially, random levels of pheromone are scattered on the edges

A B

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP
An ant is placed at a random node

A B

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP
The ant decides where to go from that node,
based on probabilities
calculated from: A B
- pheromone strengths,
- next-hop distances.

Suppose this one chooses BC

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP
The ant is now at C, and has a `tour memory’ = {B, C} – so he cannot
visit B or C again. A B
Again, he decides next hop
(from those allowed) based
on pheromone strength
and distance;
suppose he chooses
CD

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP
The ant is now at D, and has a `tour memory’ = {B, C, D}
There is only one place he can go now:
A B

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP
So, he has nearly finished his tour, having gone over the links:
BC, CD, and DA.
A B

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP
So, he has nearly finished his tour, having gone over the links:
BC, CD, and DA. AB is added to complete the round trip.
A B

Now, pheromone on the tour


is increased, in line with the
fitness of that tour.

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP

A B

Next, pheromone everywhere


is decreased a little, to model
decay of trail strength over
time

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
E.g. A 4-city TSP
We start again, with another ant in a random position.

A B
Where will he go?

Next , the actual algorithm


and variants.

C D
Pheromone

AB: 10, AC: 10, AD, 30, BC, 40, CD 20


Ant
The ACO algorithm for the TSP
We have a TSP, with n cities.
1. We place some ants at each city. Each ant then does this:
• It makes a complete tour of the cities, coming back to its starting city, using a transition rule
to decide which links to follow. By this rule, it chooses each next-city at random, but based
partly by the pheromone levels existing at each path, and based partly by heuristic
information.

2. When all ants have completed their tours.


Global Pheromone Updating occurs.
• The current pheromone levels on all links are reduced (I.e. pheromone levels decay over
time).
• Pheromone is laid (belatedly) by each ant as follows: it places pheromone on all links of its
tour, with strength depending on how good the tour was.
The ACO algorithm for the TSP

[a simplified version with all essential details]


We have a TSP, with n cities.
1. We place some ants at each city. Each ant then does this:
• It makes a complete tour of the cities, coming back to its starting city, using a transition rule
to decide which links to follow. By this rule, it chooses each next-city at random, but biased
partly by the pheromone levels existing at each path, and biased partly by heuristic
information.
2. When all ants have completed their tours.
Global Pheromone Updating occurs.
• The current pheromone levels on all links are reduced (I.e. pheromone levels decay over
time).
• Pheromone is lain (belatedly) by each ant as follows: it places pheromone on all links of its
tour, with strength depending on how good the tour was.
Then we go back to 1 and repeat the whole process many times, until
we reach a termination criterion.
ACO Algorithm

Set all parameters and initialize the pheromone trails


Loop
Sub-Loop
Construct solutions based on the state transition rule
Apply the online pheromone update rule
Continue until all ants have been generated
Apply Local Search
Evaluate all solutions and record the best solution so far
Apply the offline pheromone update rule
Continue until the stopping criterion is reached
ACO System, cont.
• Often applied to TSP (Travelling Salesman Problem):
shortest path between n nodes
• Algorithm in Pseudocode:
o Initialize Trail
o Do While (Stopping Criteria Not Satisfied) – Cycle Loop
• Do Until (Each Ant Completes a Tour) – Tour Loop
• Local Trail Update
• End Do
• Analyze Tours
• Global Trail Update
o End Do

41
Ant Systems Algorithm for TSP
Initialize

Place each ant in a randomly chosen city

For Each Ant

Choose NextCity(For Each Ant)

yes more cities


to visit

No

Return to the initial cities

Update pheromone level using the tour cost for each ant

No
Stopping
criteria
yes

Print Best tour


43
44
45
46
47
48
Ant Colony Optimization
Characteristics of the algorithm

 An ant is a solution.

 Solutions (ants) are at different places in the solution space.

 How they change is based on the probability of changing to a different schedule.

 An ant completes its tour after selection a choice for each stand.

 Utilities (objective function values) of each tour are calculated.

 Pheromone levels are updated after all of the ants have completed all of
their tours.
Ant Colony Optimization

Characteristics of the algorithm

The probability of moving from Point A to Point B


depends on the attractiveness of the move and the A
trail level.

Attractiveness - an a priori desirability (typically


based on the inverse distance) of the move. ?
Trail level - an a posteriori, computed attractiveness.
This is computed after an ant completes a solution,
increasing or decreasing the desirabilities of C
components of a solution. B

 
 
 Amount of pheromone A B Desirabili ty of state transition A B  
Prob A B   
  Amounts of pheromones Desirabili ties  
 
Ant Colony Optimization

Characteristics of the algorithm

Amounts of pheromones are updated after


a solution has been generated. A
(1-p) prior pheromone + pheromone

Where: ?
p = a coefficient related to the evaporation rate.

if AB path is used, pheromone levels increase


C
pheromone = Q / Lk B
Where:
Q = a constant
Lk = solution length (quality)

Else pheromone levels decrease, since in this case

pheromone = 0
Ant Colony Optimization

Advantages:

• It is intuitive to biologically-minded people, mimicking nature.

• The system is built on positive feedback (pheromone attraction)


and negative attractiveness (pheromone evaporation).

• Pheromone evaporation helps avoid convergence to a local optima.

Disadvantages:

• For routing problems it may make more sense, but for harvest
scheduling problems, it requires a conceptual leap of faith.

• Fine-tuning the sensitive parameters may require significant effort.


Ant Colony Optimization

Necessary parameters

1) The initial population of ants (an ant cycle).

2) The total number of tours.

3) The evaporation rate.

4) The amount of pheromone applied to a section of a cycle or solution.

5) The constant Q.
Methodology
Parameters of ACO Algorithm

 ij : Pheromone trail of combination (i,j)


ij : Local heuristic of combination (i,j)
Pij : Transition probability of combination (i,j)
 : Relative importance of pheromone trail
 : Relative importance of local heuristic
q0 : Determines the relative importance of exploitation
versus exploration
 : Trail persistence (pheromone decay rate)
 ij
The transition rule
is the amount of pheromone currently on the path that goes
directly from city i to city j.
ij is the heuristic value of this link – in the classic TSP
application, this is chosen to be 1/distance(i,j) -- i.e. the shorter
the distance, the higher the heuristic value.
pk (i, j ) is the probability that ant k will choose the link that goes
from i to j

State Transition Probability:


( ij ) (ij ) 
P k ij 
 il il
(
lU
) 
( ) 

unvisited cities SetU


Where our ant is at city i, and j is a city as yet unvisited on its
tour, and the summation is over all of k’s unvisited cities
Global pheromone update
 k
ij is amount of pheromone added to the (i, j) link by ant k.

NA is the number of ants

 is a parameter called the pheromone decay rate.

Lk is the length of the tour completed by ant k


Pheromone
 ijk at the next iteration becomes: evaporation

NA

Where  ijnew   ijold    ijk


k 1
Q Q
  Q ,
k
 
k
ij
or   k k
ij
ij
d ij L
Generate initial ants
Ant Colony Optimization

For all ants,


Basic Process develop an ant tour

Assess the quality of the


ant tours

Change probabilities of
pieces of each
potential tour

Has the
best solution Yes
Update pheromone levels been improved? Save best solution

No

Have we
No reached the
stopping
criteria?

Yes
Stop and report
the best solution
found during search
A simple TSP example []

[]

A B 2

[]

C 3

[]

4
D
E []

dAB =100;dBC = 60…;dDE =150


5
Iteration 1

[A] [B]

2
1
A B
[C]

[D] [E]

4 D 5

E
How to build next sub-solution?

[A]

[A]
A B

1

[A] [  ij ( t )] 
[  ij ] 

  [  ( t )] C[  ]  if j  allowed k

pij ( t )  
k
ik ik
kallowedk
1
 [A,D] [A]

0 otherwise
1
1
D
E
Iteration 2
[E,A] [C,B]

5
3
A B
[B,C]

2
C
[A,D]
[D,E]

D 4

E
Iteration 3

[D,E,A] [E,A,B]

4
5
A B
[A,D,C]

[B,C,D]
[C,B,E]

D 3

E
Iteration 4
[B,C,D,A] [D,E,A,B]

2 4
A B
[E,A,B,C]

5
C

[C,B,E,D]
[A,DCE]

3
D 1
E
Iteration 5
[C,B,E,D,A] [A,D,C,E,B]

1
3
A B
[D,E,A,B,C]

4
C

[E,A,B,C,D]

[B,C,D,A,E]

5
D
E 2
Path and Pheromone Evaluation
[A,D,C,E,B]
Q
 if ( i , j )  tour
L1 =300  ik, j   Lk
1 0 otherwise

[B,C,D,A,E]

L2 =450
2
[C,B,E,D,A]

 total
3 A ,B
   
1 L =260
A ,B
3
2
A ,B   3
A ,B   4
A ,B   5
A ,B
[D,E,A,B,C]

L4 =280
4

[E,A,B,C,D]

L5 =420
5
End of First Run

Save Best Tour (Sequence and length)

All ants die

New ants are born


ACO algorithms for TSP
• Ant System
• Elitist Ant System
• Rank-based Ant System
• Ant Colony System
• MAX-MIN Ant System

67
Ant System: Pheromone
initialization
• Pheromone initialization
Τij = m / Cnn,
where:
 m – number of ants
 Cnn – path length of nearest-neighbor algorithm

68
Ant System: Tour construction
• Ant k is located in city i
k
• N i is the neighborhood of city i
• Probability to go to city : j  N ik

 
 ij ij
p 
k

 il il
 
ij

lN ik

69
Tour construction: comprehension

 ij ij
p 
k

 il il
 
ij

lN ik

• α = 0 – greedy algorithm
• β = 0 – only pheromone is at work
 quickly leads to stagnation

70
Ant System: update pheromone
trails – evaporation
Evaporation for all connections∀(i, j) ∈ L:

τij ← (1 – ρ) τij,

ρ ∈[0, 1] – evaporation rate

Prevents convergence to suboptimal solutions

71
Ant System: update pheromone
trails – deposit
• Tk – path of ant k
• Ck – length of path Tk
• Ants deposit pheromone on visited arcs:
m
 ij   ij    , i, j  L
k
ij
k 1

1 / C k
, i , j   T k
 ijk  
0, otherwise
72
Elitist Ant System
• Best-so-far ant deposits pheromone on each iteration:

m
 ij   ij     e k
ij
best
ij
k 1

1 / C best , i, j  T best


 best
ij 
0, otherwise

73
Rank-based Ant System
• Another improvement over AS.
• Each ant deposits an amount of pheromone that decreases
with its rank.
• In each iteration, only the best (w-1) ranked ants and the
best-so-far ant are allowed to deposit pheromone.

74
MAX-MIN Ant System
1. Only iteration-best or best-so-far ant deposits pheromone
2. Pheromone trails are limited to the interval [τmin, τmax]

 new
ij   old
ij   e
ij
 min   ij   max

75
Ant Colony System
• Differs from Ant System in three points:
 More aggressive tour construction rule
 Only best ant evaporates and deposits pheromone
 Local pheromone update

76
Ant Colony System
1.Tour Construction

 
arg max lN k  ilil , if q  q0
j i

like Ant System , otherwise

2. Local pheromone update:

τij ← (1 – ξ)τij + ξτ0,


77
Ant-Q & Ant Colony System (ACS)

arg max[( il ) ( il )  ] q  q0 Exploitation


 lU
v

 V q  q0 Exploration

( iv ) ( iv ) 
Piv 
 ( il ) (il ) 
lU

Local Updating
 ijnew   ijold  (1   ) 0
(Online Updating)
Global Updating
(Offline Updating)  ijnew   ijold  (1   ) ije
Not just for TSP of course
ACO is naturally applicable to any sequencing problem, or
indeed any problem
All you need is some way to represent solutions to the problem as
paths in a network.
E.g.
Single machine scheduling with due-dates
These jobs have to be done; their length represents the time they
will take.

A
B
C
D
E
E.g.
Single machine scheduling with due-dates
These jobs have to be done; their length represents the time they
will take.

Each has a `due date’, when it needs to be finished

A 3pm
B 3:30pm
C 5pm
D 4pm
E 4:30pm
E.g.
Single machine scheduling with due-dates
These jobs have to be done; their length represents the time they
will take.

Each has a `due date’, when it needs to be finished

A 3pm Only one `machine’ is


available to process these jobs,
B 3:30pm so can do just one at a time.
C 5pm
D 4pm [e.g. machine might be
human tailor, photocopier,
E 4:30pm Etc …]
An example schedule
2 pm 3 pm 4 pm 5 pm 6 pm

A due 3pm B – 3:30 C - 5pm D – 4pm E -4:30

A is 10min late Fitness might be average lateness;


B is 40min late in this case 46min
C is 20min early (lateness = 0)
D is 90min late or fitness could be Max lateness,
E is 90min late in this case 90min
Another schedule
2 pm 3 pm 4 pm 5 pm 6 pm

B – 3:30 A due 3pm D – 4pm E -4:30 C - 5pm

A is 70min late Fitness might be average lateness;


B is 30min early (0 lateness) in this case again 46min
C is 60min late
D is 50min late or fitness could be Max lateness,
E is 50min late in this case 70min
Applying ACO to this
problem
Just like with the TSP, each ant finds paths in a network, where,
in this case, each job is a node. Also, no need to return to start
node – path is complete when every node is visited.
Initially, random levels of pheromone are scattered on the edges,
an ant starts at a Start node (so the first link it chooses defines
the first task to schedule on the machine); as before it uses a
transition rule to take one step at a time, biased by pheromone levels,
and also a heuristic score, each time choosing the next machine
to schedule.
A B

E Start

C D
See here if you’re very interested in ACO:

• http://iridia.ulb.ac.be/dorigo/ACO/ACO.html

ACO is a thriving and maturing research area – it has its own


conferences. It gets very good results on some difficult problems.
Following the above link will help you find examples.

ACO research and practice tends to concentrate on:


• hybridisation with other methods; e.g. it is common to
improve an individual ant’s solution by local search, and then
lay pheromone.
• New and adaptive ways to control the relative influence of
heuristics, pheromone strength and pheromone decay.
Resources
 Ant Colony Optimization by Marco Dorigo and Thomas Stϋtzle, The
MIT Press, 2004
 Swarm Intelligence by James Kennedy and Russell Eberhart with
Yuhui Shi, Morgan Kauffmann Publishers, 2001
 Advances in Applied Artificial Intelligence edited by John Fulcher, IGI
Publishing, 2006
 Data Mining: A Heuristic Approach by Hussein Abbass, Ruhul Sarker,
and Charles Newton, IGI Publishing, 2002
 “Ant Colony Optimization” Curatored by Marco Dorigo,
http://www.scholarpedia.org/article/Ant_Colony_Optimization
 “Ant Colony Optimization” by Marco Dorigo,
http://iridia.ulb.ac.be/~mdorigo/ACO/ACO.htm;
 “Particle Swarm Optimization” http://www.swarmintelligence.org
 “Swarm Intelligence” http://en.wikipedia.org/wiki/Swarm_intelligence
 Picture of Flik, http://www.pixar.com

You might also like