0% found this document useful (0 votes)

57 views38 pages

An Analysis of The Search Mechanisms of The Bees Algorithm

Uploaded by

almanea.lamia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

57 views38 pages

An Analysis of The Search Mechanisms of The Bees Algorithm

Uploaded by

almanea.lamia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 38

University of Birmingham

An Analysis of the Search Mechanisms of the Bees

Algorithm
Baronti, Luca; Castellani, Marco; Pham, Duc

DOI:
10.1016/j.swevo.2020.100746
License:
Creative Commons: Attribution-NonCommercial-NoDerivs (CC BY-NC-ND)

Document Version
Early version, also known as pre-print

Citation for published version (Harvard):

Baronti, L, Castellani, M & Pham, D 2020, 'An Analysis of the Search Mechanisms of the Bees Algorithm',
Swarm and Evolutionary Computation, vol. 59, 100746. https://doi.org/10.1016/j.swevo.2020.100746

Link to publication on Research at Birmingham portal

General rights
Unless a licence is specified above, all rights (including copyright and moral rights) in this document are retained by the authors and/or the
copyright holders. The express permission of the copyright holder must be obtained for any use of this material other than for purposes
permitted by law.

•Users may freely distribute the URL that is used to identify this publication.
•Users may download and/or print one copy of the publication from the University of Birmingham research portal for the purpose of private
study or non-commercial research.
•User may use extracts from the document in line with the concept of ‘fair dealing’ under the Copyright, Designs and Patents Act 1988 (?)
•Users may not further distribute the material nor use it for the purposes of commercial gain.

Where a licence is displayed above, please note the terms and conditions of the licence govern your use of this document.

When citing, please reference the published version.

Take down policy
While the University of Birmingham exercises care and attention in making items available there are rare occasions when an item has been
uploaded in error or has been deemed to be commercially or otherwise sensitive.

If you believe that this is the case for this document, please contact [email protected] providing details and we will remove access to
the work immediately and investigate.

Download date: 11. Dec. 2023

An Analysis of the Search Mechanisms
of the Bees Algorithm

Luca Barontia , Marco Castellania , Duc Truong Phama

a Department of Mechanical Engineering, University of Birmingham, United Kingdom

Abstract
The Bees Algorithm has been successfully applied for over a decade to a large
number of optimisation problems. However, a mathematical analysis of its
search capabilities, the effects of different parameters used, and various de-
sign choices has not been carried out. As a consequence, optimisation of the
Bees Algorithm has so far relied on trial-and-error experimentation. This paper
formalises the Bees Algorithm in a rigorous mathematical description, beyond
the qualitative biological metaphor. A review of the literature is presented,
highlighting the main variants of the Bees Algorithm, and its analogies and dif-
ferences compared with other optimisation methods. The local search procedure
of the Bees Algorithm is analysed, and the results experimentally checked. The
analysis shows that the progress of local search is mainly influenced by the size
of the neighbourhood and the stagnation limit in the site abandonment proce-
dure, rather than the number of recruited foragers. In particular, the analysis
underlines the trade-off between the step size of local search (a large neigh-
bourhood size favours quick progress) and the likelihood of stagnation (a small
neighbourhood size prevents premature site abandonment). For the first time,
the implications of the choice of neighbourhood shape on the character of the lo-
cal search are clarified. The paper reveals that, particularly in high-dimensional
spaces, hyperspherical neighbourhoods allow greater search intensification than
hypercubic neighbourhoods. The theoretical results obtained in this paper are
in good agreement with the findings of several experimental studies. It is hoped
that the new mathematical formalism here introduced will foster further under-
standing and analysis of the Bees Algorithm, and that the theoretical results
obtained will provide useful parameterisation guidelines for applied studies.
Keywords: Bees Algorithm, Optimisation, Statistical Analysis

∗ MarcoCastellani
Email addresses: [email protected] (Luca Baronti), [email protected]
(Marco Castellani), [email protected] (Duc Truong Pham)

Preprint submitted to Swarm and Evolutionary Computation February 12, 2020

1. Introduction

The Bees Algorithm [1] is a nature-inspired intelligent technique that has

found application in a wide range of complex optimisation problems [2, 3]. The
main idea motivating this algorithm is to model the foraging behaviour of honey
bees to address the exploration vs exploitation trade-off. According to that
model, agents simulating scout bees randomly explore the solution space looking
for areas of high fitness. The scouts that found the most promising solutions
recruit (through performing a waggle dance [4]) other agents (forager bees) for
local exploitative search. Local search is conducted in parallel at different sites,
that is, within neighbourhoods centred on the solutions marked by the scouts.
Despite the initial idea being to maintain a clear separation between the
global explorative and local exploitative search efforts, it soon became clear
[5] that other factors such as the number of parallel local searches influence
the exploration vs. exploitation balance. Several empirical studies [6, 7] tried
to shed light on the properties of the Bees Algorithm and related optimisation
techniques, and how their parameterisation affects the search effort. However, to
the best of the authors’ knowledge, a theoretical analysis of the Bees Algorithm
behaviour was never attempted.
This paper addresses the above gap in the literature. Understanding in de-
tail the dynamic behaviour of a population-based optimisation algorithm on
arbitrarily complex fitness landscapes is extremely complex. For this reason,
the literature on nature-inspired algorithms overwhelmingly relies on qualita-
tive biological analogies and empirical comparisons. However, by investigating
well-defined cases under a theoretical framework, important insights on the al-
gorithm behaviour can be gained [8]. This study focuses on the local search
procedure of the Bees Algorithm, using the clearly delimited boundaries of the
site neighbourhood to infer important properties.
The proposed study is timely, as a large number of variants in operators
and parameterisations have been developed for this popular algorithm [9]. In
the light of the No Free Lunch Theorem [10], it is essential to unravel the
implications of these different choices of operators and parameters.
The rest of this paper is organised as follows: a general formulation of the
Bees Algorithm is given in section 2. The main variants of (section 3) and
related algorithms to (section 4) the Bees Algorithm are discussed. The second
part of the paper focuses on the local search (section 5) and site abandonment
(section 6) procedures. Section 7 analyses the implications of using different
neighbourhood shapes. The main findings are discussed in section 8, and the
conclusions are drawn in section 9.

2. Formal Definition of the Bees Algorithm

The Bees Algorithm iteratively looks for better solutions to a specified opti-
misation problem. The algorithm is terminated when a given stopping criterion
is met (e.g. a pre-set number of optimisation cycles has elapsed, a solution of

2
satisfactory quality is found). Despite minor differences, the notation concern-
ing the main parameters and operators of the Bees Algorithm is consistent in
the literature. With some minor changes, it is also used in this paper:
• ns number of scout bees used only in the global search;
• nb number of sites where local search is performed;
• nr number of recruited forager bees for each of the nb sites;
• stlim number of cycles of local stagnation before a site is abandoned;
• ngh initial neighbourhood size of the nb sites;
• α neighbourhood shrinking parameter (0 < α < 1);
In the standard Bees Algorithm, the parameter ns describes the total number
of scouts used for random exploration (here ns) plus the number of scouts
(nb) marking the neighbourhoods (sites) selected for local search. That is,
nsstandard = ns + nb. Also, it is customary to allocate a larger number of
foragers (nre) to the very best ne < nb (elite) sites, and less (nrb < nre) to the
remaining nb − ne best sites. This distinction is not necessary for the analysis
proposed in this paper, and for the sake of compactness is dropped. Henceforth,
the parameter nr will refer likewise to nre or nrb.
In this study only continuous optimisation is considered, and each solution
is represented by an N -dimensional vector of real-valued decision variables sg =
{sg [1], ..., sg [N ]} ∈ Rn . The solutions are evaluated by a fitness function F
specific to the problem domain, which the algorithm aims to maximise. The
analysis of this paper is equally valid for a minimisation problem (min{F (·)} ≡
max{−F (·)}).
In this paper, each of the s ∈ {s(1) , . . . , s(nb) } nb sites selected for local
search is denoted by a centre sg and two additional variables: the time-to-live
integer variable sttl , and the local search edge se . The time-to-live variable sttl
is a counter that indicates the number of remaining cycles of stagnation before
the site is abandoned. The edge se defines the current spatial extent (henceforth
called search scope) of the local search.
For the sake of simplicity, unless otherwise stated, all the decision variables
will be henceforth defined in the same interval. Accordingly, se and ngh are
scalars, and the search scope at a given site s is delimited by a hypercube C
of edge se centred in the solution sg . Hereafter, this region will be indicated
as C(sg , se ). Local search is performed uniformly sampling nr solutions inside
C(sg , se ). In the general case that the interval of definition is not equal for
all parameters, se and ngh will be defined as vectors of size N . In this case,
local search is performed inside a box (i.e. an N -orthotope) of edges se =
{se [1], . . . , se [N ]} centred in sg . When relevant, the consequences of using box
sampling rather than cubic sampling will be discussed.
The algorithm steps are described in box 1. Except for minor changes (i.e.
no elite sites), the procedure described in box 1 can be regarded as the Stan-
dard Bees Algorithm (SBA [5]). In the neighbourhood shrinking procedure

3
Bees Algorithm: Main Steps

1. (Initialisation) The initial population P is created sampling ns+nr·nb

solutions at random across the N -dimensional solution space. Each
solution sg marks the centre of a neighbourhood (site) s of edge se (i.e.
a search scope C(sg , se )). The two variables sttl and se are initialised
for each site as follows: sttl = stlim and se = ngh · (M − m), where
M and m are respectively the upper and lower limit of the interval of
definition of the variables;
2. (Selection) The best nb sites s(1) , . . . , s(nb) centred on the solutions of
highest fitness are selected from P for local search (Waggle Dance),
the others are removed from the population;
3. (Local Search) For each of the nb sites s ∈ {s(1) , . . . , s(nb) } in P
selected for local search, the following steps are performed:
(a) If sttl = 0 the site is abandoned (Site Abandonment), and nr
new solutions are randomly sampled across the search space. The
best v of these nr solutions is used as the centre for a new site s,
which is initialiased with sttl = stlim and se = ngh · (M − m);
(b) (Foraging) If sttl > 0, nr solutions v1 , . . . , vnr are randomly sam-
pled with uniform distribution within C(sg , se ). The solution v of
highest fitness is selected, whilst the other solutions are discarded:
i. if F (v) > F (sg ), v replaces sg as the site centre (sg = v).
The time to live is set to sttl = stlim and the edge is kept
unchanged (se );
ii. If F (v) ≤ F (sg ), the local search is said to stagnate.
The edge is reduced by a constant factor se = αse
(Neighbourhood Shrinking), and the time to live is re-
duced to sttl = sttl − 1;
4. (Global Search) ns new (scout) solutions v1 , . . . , vns are uniformly
sampled in the search space. They become the centres of ns new sites,
each initialised with sttl = stlim and se = ngh · (M − m).
5. (Population Update) The nb scouts marking the centres of the sites
evolved via local search (3) and the ns scouts created by global search
(4) make up the new population P ;
6. (Stopping Criterion) if the stopping criterion is not met go back to
step 2, otherwise return the solution of highest fitness found;

Algorithm 1: The Bees Algorithm

4
(described in step 3b) if the interval of definition of the parameters is not the
same for all variables, the i-th dimension of the box is reduced as sei = αsei . The
initialisation and site abandonment procedures are designed to keep constant at
each generation the sampling rate of the solution space1 .
Local search aims to find the fitness optimum within a neighbourhood cen-
tred on a promising solution. Because the centre of the neighbourhood is up-
dated as better solutions are found (step 3), the scope of local search dynamically
changes, and eventually includes the local attractor point in the search space
(i.e. a local optimum). It should be noted that, like any stochastic optimisa-
tion procedure, local search is not guaranteed to stop at the local optimum. In
particular, local search may be prematurely abandoned when (i) it stagnates
for stlim iterations (e.g. stops on a flat surface) or (ii) global search finds more
promising regions (fitter solutions) elsewhere in the search space (step 2).
Global random search aims to find previously unexplored regions of high
fitness in the search space. Global search can also be used to increase adaptation
to changes in dynamic fitness landscapes. The solutions found via local (i.e. the
centres of the nb neighbourhoods) and global search are ranked at the end of
every optimisation cycle, and the fittest nb solutions are kept as seeds (centres)
for the next optimisation cycle. As the local exploitation of one given site
progresses, the probability that this site is abandoned because random search
found a fitter solution decreases. For this reason, some authors do not use global
search [5], or give randomly generated solutions (young bees) time to ’grow up’
[11].

3. Main Variants of the Bees Algorithm

Besides the standard procedure described in section 2, many different vari-

ants of the Bees Algorithm exist. A recent survey [9] separated three main
branches of the Bees Algorithm, namely the Basic Bees Algorithm (BBA),
the Shrinking-based Bees Algorithm (ShBA), and the Standard Bees Algorithm
(SBA).
The BBA refers to the basic form of the Bees Algorithm, which is mentioned
in several articles [5, 12, 13, 14] and performs parallel local searches using fixed
neighbourhoods (i.e. no neighbourhood shrinking).
The ShBA [1] includes the neighbourhood shrinking procedure. The heuris-
tics behind the neighbourhood shrinking procedure is to intensify the exploita-
tion effort as the local search progresses, focusing the sampling on increasingly
smaller regions of the solution space.
Finally the SBA, so termed in numerous articles [5, 11, 15], includes the
neighbourhood shrinking and site abandonment procedures. The heuristics be-
hind site abandonment is to abandon a site once the local search stagnates, to
avoid being trapped into local fitness peaks.

1 This is particularly useful when the performance of the BA is compared with the perfor-

mance of other techniques.

5
Many recruitment, neighbourhood alteration, and site abandonment heuris-
tics were proposed in the literature.
Ghanbarzadeh [16] proposed two methods for setting the number of recruited
foragers proportionally to a) the fitness or b) the location of the sites. Other
authors proposed recruitment schemes where the number of foragers was pro-
portional to the fitness of the site, and decreased it progressively by a fixed
amount [13], or according to a fuzzy logic policy [17]. Pham et al [18] used
Kalman filtering to allocate number of bees to the sites selected for local search.
This strategy was used to train a Radial Basis Function neural network, and
improved the learning accuracy and speed of the neural network. Finally, Iman-
guliyev [19] proposed a recruitment scheme where the number of foragers for a
site was computed on the efficiency rate of the site, rather than its fitness score.
In its basic instance [16], the search scope of a site is changed (reduced)
when local search fails to improve. Ahmad [20] proposed two different methods
to change dynamically the neighbourhood of a site: a) BA-NE where the search
scope is increased if a better solution is found and kept invariant otherwise,
and b) BA-AN, where the neighbourhood is asymmetrically increased along the
direction that led to the last improvement and decreased otherwise.
When a site is abandoned, the best-so-far local solution is usually kept in
memory [5]. However, in some cases [13, 21] all the local solutions found before
abandoning a site are retained for later use. In Hierarchical Site Abandonment
[17] when a site s is abandoned, all the other sites with fitness lesser or equal
to s are abandoned too.

4. Related Techniques

The Bees Algorithm is part of a large family of metaheuristics mimicking

the collective intelligence of honey bees [22]. Despite the common inspiration,
these algorithms often differ for the operators used and the solution sampling
strategy. The main differences between the Bees Algorithm and a number of
other algorithms [23] based on the foraging behaviour of honey bees have been
discussed in the literature [5, 6, 7]. An exhaustive comparison would be beyond
the scope of this paper. This section will focus on two algorithms that have
important similarities with the local search strategy of the Bees Algorithm:
Variable Neighbourhood Search (VNS) [24] and LJ Search [25].

4.1. Variable Neighbourhood Search

Variable Neighbourhood Search iterates cycles of local search around a seed
solution using neighbourhoods of different size. This idea has been successfully
used [26] in numerous applications, like the Traveling Salesman [24] and the
Vehicle Routing [27, 28] problems. The main steps of the VNS algorithm are:

6
Variable Neighbourhood Search: Main Steps

1. Initialise k neighbourhoods S1 , . . . , Sk of variable size around a

randomly generated centre x, initialise i = 1;

2. Take neighbourhood Si :
(a) sample a solution v uniformly inside the neighbourhood Si ;
(b) apply a local search procedure using v as seed to find a new
solution v 0 ;
i. if v 0 is fitter that x, set x = v 0 and i = 1;
ii. else set i = i + 1;

3. If i > k, terminate the algorithm and return the best found solu-
tion, otherwise iterate from step 2;

VNS is akin to local neighbourhood search at a BA site. A variant of this

approach, called Reduced VNS (RVNS) [29, 30], skips the local search at step
2b, to keep v 0 = v. RVNS is equivalent to a BBA where nb = 1, nr = 1, and
ns = 0 (no global search). Parallel VNS [31, 32] resembles more closely the BA,
since it performs a number of concomitant local VNS search procedures.
The main difference between VNS and the BA is that the former uses a
randomly generated sequence of neighbourhood sizes for local search, whilst the
latter uses a fixed (BBA) or deterministically shrunk (ShBA) neighbourhood
size. In addition, VNS uses a fixed number of foragers per optimisation cycle,
whilst the number of foragers is determined by the quality of the neighbourhood
in the BA. That is, as mentioned in Section 2, the standard Bees Algorithm al-
locates more foragers to the most promising (elite) ne neighbourhoods, and less
to the remaining neighbourhoods. The decision on the allocation of foragers per
site is dynamically updated at every optimisation cycle

4.2. LJ Search
The LJ Search Method was successfully used to optimise feedback control
in nonlinear systems [33], as well as time-optimal [34] and time-delay [35, 36]
systems. Given an N-dimensional minimisation problem, the LJ Search Method
pseudocode is:

7
LJ Search: Main Steps

1. Let se1 be the N-dimensional vector of the spans (max−min value)

of the interval of definition of the N decision variables, s1 the seed
solution, and t = 1 an index;
2. Sample nr solutions v1 , . . . , vnr as follows:

vi = st + u · set i ∈ {1, . . . , nr}

where u is a vector of N real values independently sampled with

uniform distribution in [−0.5, 0.5];
3. Compute the next solution as:

st+1 = arg min {F (v) | v ∈ {st , v1 , . . . , vnr }}

4. Reduce the ranges of a given factor 0 < α < 1:

set+1 = αset

and increment the counter t = t + 1;

5. check if the stop criterion is met. If so return st , otherwise go
back to step 2;

Except for the initialisation of the neighbourhood, the LJ Search algorithm

closely resembles the BA local search procedure at one site with neighbourhood
shrinking. That is, the ShBA can be described as a multi-LJ Search method.
Surprisingly, to the best of the authors’ knowledge, the similarity between LJ
Search and the Bees Algorithm has so far been overlooked in the literature.
A mathematical analysis of the properties of the LJ Search method was pub-
lished by Gopalakrishnan [37]. In particular, Gopalakrishnan [37] proved that
the succession of solutions s1 , . . . , sn converges to a local optimum. Unfortu-
nately, the proof of convergence is only valid for an infinite number of iterations.
Moreover, in his analysis Gopalakrishnan [37] took in consideration hyperspher-
ical neighbourhoods instead of the hypercubic ones that are actually used in
the LJ Search algorithm. As will be shown in section 7, the use of different
neighbourhoods has important consequences on the properties of the search.
The next sections analyse the properties of the local search procedure in the
Bees Algorithm.

5. Analysis of Local Search Properties

This section clarifies the properties of the local search procedure of the SBA,
such as its average step size and speed of convergence. These properties are

8
estimated from an a-posteriori analysis on the ’lifetime’ of a generic site s, from
its discovery by a scout to abandonment when local search stalls (i.e. it fails
to progress for stlim iterations). The case that the site is replaced by a more
promising site found via global search is not included. If needed, the results of
the below analysis are applicable to describe the behaviour of local search from
any point in time, not necessarily the discovery of the site, until abandonment.
Importantly, the final solution may not be the local optimum, that is, local
search may only provide an approximation of the local optimum.

5.1. Local Search: Introduction and Definitions

The following nomenclature will be used:
• s is the site, described at any iteration (cycle) t by a triple st = {sgt , set , sttl
t }:
the centre sgt , the edge set and the time to live sttl
t ;

• n is the number of local search cycles from discovery to abandonment of

the site;
• sg1 is the starting point (site centre) of the local search procedure;

• sgt with 1 < t < n is the site centre after t local search cycles;
• sgn is the final result of the local search, namely, the neighbourhood centre
at the last local search cycle, before the site is abandoned (i.e. sttl
n = 0);

• S is the series of solutions found by local search at site s:

S = {sg1 , . . . , sgn } (1)

The solution found in the tth local search cycle Lnr (sgt ) can be formalised as the
result of the following endomorphic function (maximisation problem):

Lnr (sgt ) = arg max{F (v) | v ∈ {sgt , v1 , . . . , vnr }} (2)

with vi ∼ C(sgt , set ) as a uniform sampling of a solution vi in the hypercube

centred in sgt with edge set . This sampling will be further discussed in section 7.
Since this is an a posteriori analysis, it will be assumed that every set of
candidate solutions sampled at each cycle is known.
A local maximum Lopt is defined as:

Lopt is a l.m. ⇔ ∃ > 0 | F (Lopt ) ≥ F (v) (3)

for all v ∈ C(Lopt , ). If Lopt is the optimum of the subregion C(sgt , set ), the
operator Lnr provides a stochastic approximation of the local optimum within
C(sgt , set ). The expected quality of this approximation increases monotonically
with the number nr of candidate solutions sampled.
Lopt = lim Lnr (s) = arg max {F (v) | v ∈ C(sgt , set )} (4)
nr→∞ v

9
The series of solutions S defined in eq. (1) shares the same convergence prop-
erties of the LJ Search proved in [37]. Namely, without site abandonment,
a number of steps n exists such that the series of solutions S will eventually
converge to a local optimum.
Due to the monotonically increasing nature of the series F (sg1 ), . . . , F (sgn )
(see eq. (2)), sgn is the best solution found in the n iterations of local search at
site s.
The standard neighbourhood shrinking heuristic can be formally defined for
a hypercube as follows (0 < α < 1):
(
e set Lnr (sgt ) 6= sgt
st+1 = (5)
αset Lnr (sgt ) = sgt

Neighbourhood shrinking can be similarly defined in the more general case of

box sampling, where the interval of definition is not the same for all the N
variables. In this latter case, set is the N-dimensional vector of search scope
edges, and each edge is reduced of the same fixed factor α. It also holds for
isotropic (hyperspherical) sampling. The standard site abandonment heuristic
is applied when:
sgn = sgn−1 = · · · = sgn−stlim (6)
In other terms, the local search is terminated when stlim consecutive fix points 2
of Lnr are found. It is also true that:

sgt = sgt+k ⇔ sgt = sgt+1 = · · · = sgt+k (7)

for any positive index k ∈ N.

The above definitions and properties are shared by most BA techniques, or
can be easily modified to include other BA variants.
Proposition 1 (Search scope Intersection: Successive Solutions). Given two
consecutive solutions sgt and sgt+1 , sgt is included in the search scope of sgt+1

sgt ∈ C(sgt+1 , set+1 ) (8)

Proof. The proof is trivial: if local search stagnates at cycle t, sgt+1 = sgt and
set+1 < set (neighbourhood shrinking). Then sgt ∈ C(sgt+1 , set+1 ) = C(sgt , set+1 ). If
local search progresses at cycle t, sgt+1 6= sgt and set+1 = set (no neighbour-
hood shrinking). Remembering that sgt+1 ∈ C(sgt , set ), it follows that sgt ∈
C(sgt+1 , set ) = C(sgt+1 , set+1 ).
This property also holds in case of box and hyperspherical sampling is used.

2A fix point x of a function f is a point such as f (x) = x

10
5.2. Bounds on Reach
Hereafter, the distance in the solution space that local search is able to cover
at a given site in a given number of cycles will be indicated as the reach of local
search. That is, the reach is the distance between the starting point of local
search (s1 ) and the best approximation of the local optimum after n cycles (sn ),
namely d(s1 , sn ). The upper and lower boundaries of the reach are defined as
follows:
Proposition 2 (Reach). The reach of local search in n hlearning cycles at a
√ i
g se1 N
given site s centred on solution s1 is bounded within the 0, n 2 interval,
where se1 is the site edge at the start of the search.
Proof. Minimum reach occurs when local search stalls since the very beginning,
namely sgt = sgt+1 ∀t ∈ {1, . . . , n = stlim}, and thus sg1 = sgn .
At cycle t, local search is bounded within the hypercube C centred in sgt ,
where the farthest solutions lie at the four vertexes of C. To attain maximum
reach, local search must progress at each cycle, so as the initial site edge se1 is not
reduced (i.e. no neighbourhood shrinking). The maximum step size per cycle is
bounded by the distance between the centre and a vertex of the N -dimensional
√
hypercube C, that is dv = set N/2. The upper bound of the reach at a given site
is therefore n times dv .
Proposition 2 gives the boundaries of ’how far’ local search can travel in n
learning cycles. The maximum step size is achievable only when the segment
that joins sg1 to sgn is parallel to the diagonal of the hypercube C, and every pair
√
of subsequent solutions sgt and sgt+1 (1 ≤ t < n) are distant d(sgt , sgt+1 ) = set N/2.
For example, this would be the case of a fitness landscape consisting of a sloped
hyperplane aligned with the diagonal of the hypercube C, or a hypersphere of
centre c lying in the direction of one of the diagonals of the hypercube centred
in sg1 .
Considering unitary time steps per iterations, the reach can be considered
as a measure of the ’travelling speed’ of the local search in the solution space.
Closely related to the reach is the convergence time of the local search (i.e.
the number of iterations taken to reach the local attractor). If the distance
between the centre of the site s1 and the optimum Lopt is d(sg1 , Lopt ), according
to Proposition 2 the minimum number of iterations nmin required to reach Lopt
are:
2d(sg1 , Lopt )
nmin = √ (9)
se1 N
This is the lower bound on the convergence time, and can be used to evaluate
the efficiency of local search on different fitness landscapes.
In the more general case of asymmetric boundaries (i.e. box sampling), the
maximum reach can be computed as follows:
v
uN −1
nu X 2
maxreach = t se [i] (10)
2 i=0 1

11
where se1 [i] is the i-th component of vector se1 . Finally, it is worth mentioning
that most - if not all - BA variants use a hypercube to define the scope of local
search. Consequently, the foragers are sampled inside an anisotropic region, and
the maximum reach depends on the orientation of the segment that joins sg1 to
Lopt respect to the diagonal of the hypercube.

5.3. Expected Progress

To understand the behaviour of local search, it is useful to calculate the
expected step size of one iteration of the procedure under certain assumptions
on the fitness surface.
Proposition 3 (Expected Step Size). Given a strict monotonic increasing one-
dimensional fitness landscape in [0, `] ∈ R (e.g. a straight line), and a site
centred in set = `/2 with edge sgt = `, the expected step size of one local search
iteration (i.e. the average distance between sgt and sgt+1 ) is:

0.5nr+1 + nr `
d(sgt , sgt+1 ) = ` − (11)
nr + 1 2
Proof. See electronic appendix.

The result of proposition 3 is valid for any locally monotonic fitness slope,
as long as C is fully in the monotonic region. Its validity extends also to multi-
dimensional surfaces such as regions of hyperplanes or hyperspheres. For this
to happen, the search scope must be isotropic (i.e. a hypersphere), the fitness
landscape must be strictly monotonic along the straight line joining the centre
of C to the local fitness maximum, and the fitness landscape inside the search
scope must be symmetric respect to said straight line. If these conditions are
verified, the expected step size will be given by eq. (11) for the direction where
the slope is monotonic, and zero (no bias) in the other directions. The above
conditions apply in the common case where local search is climbing one side of
a fairly regular hill or slope, but C does not include the fitness maximum yet.

5.3.1. Expected Step Size: Experimental Verification

The theoretical result of proposition 3 was verified numerically (figure 1) on
three 2D cases: on an inclined plane (leftmost column), near the top of a spher-
ical hill (the hill top is at the border of the search scope, rightmost column),
and far from the top of the spherical hill (middle column). The neighbourhood
is a circle of radius 0.5 centred in {0.5, 0.5}. In all cases, the fitness surface is
monotonically increasing along the horizontal diameter line, and symmetric with
respect to that line. The number of foragers was varied (nr = 1, 10, 20). The
plots show that the average (red triangle) of 103 independent local search trials
is always in good agreement with the theoretical expectation (at the bottom of
each sub-figure) along the horizontal diameter line (where the fitness landscape
is monotonic), and aligned to the centre of the search scope in the vertical
direction (i.e. no bias in the vertical direction).

12
Far from the peak, where the curvature of the sphere is small, the spread of
the solutions on the fitness landscape is large, and indistinguishable from the
spread on the planar surface. Near the hill top, where the curvature is large,
the solutions are tightly clustered near the fitness maximum. This behaviour
suggests that local search becomes increasingly focused and exploitative as it
approaches the local fitness maximum.
Figure 1 also shows little difference between the spread of the solutions
obtained using 10 and 20 foragers. Indeed, the expected step size grows in
sublinear fashion with the number of foragers (eq: 11). Figure 2 shows how the
average step size of 104 independent local search trials varies with the number
of foragers (nr). Also in this case, the search trials were performed in a circle
of radius 0.5 centred in {0.5, 0.5}, and the plot shows the result of local search
(st+1 ) along the direction of the slope of an inclined plane. The numerical
averages (blue dot) are in good agreement with the theoretical expectations of
eq. (11) (red line). The plot highlights how the result of local search quickly
reaches the borders of the neighbourhood, that is the asymptotic value of 1. In
general, it can be said that the size of the neighbourhood determines more than
the number of foragers the ability of local search to quickly climb (descend) a
fitness slope.

6. Site Abandonment: Local Search Stalling Probability

This section analyses the probability that a site may be abandoned due to
lack of progress of local search.

6.1. Site Abandonment: Definitions and Properties

Let sgt be the centre of site s at the tth local search cycle and C(sgt , set ) the
search scope. Let LRst and GRst be the subregions of C including solutions of
respectively lower or equal, and higher fitness. Explicitly:

LRst ⊆ C(sgt , set ) v ∈ LRst ⇔ F (v) ≤ F (sgt )

(12)
GRst ⊂ C(sgt , set ) v ∈ GRst ⇔ F (v) > F (sgt )

where
LRst ∪ GRst = C(sgt , set ) (13)
According to the above definitions, it can be said that local search progresses if
the output of the endomorphism Lnr belongs to GRst :

Lnr (sgt ) = sgt+1 6= sgt ⇔ sgt+1 ∈ GRst

(14)

In general, LRst and GRst may include non-contiguous subregions, since the
region covered by C may contain several local optima.
Hereafter, the border (hypersurface) of an N -dimensional region A will be
indicated as A− , and the volume of A will be indicated as V(A). To analyse the

13
(a) F H-Plane (b) F H-Sphere (x = 10) (c) F H-Sphere (x = 1)
nr = 1
nr = 10
nr = 20

Figure 1: Search results (st+1 ) of 103 independent local search trials in three 2D fitness
landscapes: a plane sloped in the horizontal direction (left column), a hypersphere with
centre in x = 10, y = 0.5 (middle column), and a hypershpere centred in x = 1, y = 0.5 (right
column). An isotropic circular search scope of centre sgt = [0.5, 0.5] (green square) and radius
srt = 0.5 was used. The number of foragers nr was set to 1 (top row), 10 (middle row), and 20
bottom row. The blue dots represent the solutions found in the local search trials, and their
arithmetic average is marked by the red triangle. The maximum is always on the border of
the search scope, at the right-end extreme of the horizontal diameter line. At the bottom of
each panel, the expected step size (11) in the direction of the maximum is shown.

14
Average Step Size Covered by a Single L(sgt )
0.5

Average d(st , st+1 )

0.4

0.3

0.2
predicted
sampled
0.1
0 5 10 15 20
nr
Figure 2: Result of local search using an isotropic search scope of radius srt = 0.5 and centred
in sgt = {0.5, 0.5} on a sloped planar fitness surface. The predicted value (red line) along the
direction of the slope was calcuated from eq. (11), and closely matches the average values of
104 independent local search runs (blue dots).

likelihood that local search stalls and the site is abandoned, it is useful to define
the following two ratios:

V(LRst ) V(GRst )
|LRst | = |GRst | = (15)
V (C(sgt , set )) V (C(sgt , set ))

Also, from eq. (15) it follows that a solution sgt is the optimum of the subregion
C(sgt , set ) iff:

F (sgt ) ≥ F (v) ⇔ |LRst | = 1 ∧ |GRst | = 0 (17)

for all v ∈ C(sgt , set ). The local exploitative search of the BA aims to locate Lopt ,
inside the search scope C(sgt , set ). If the GRst region is significantly smaller than
the search scope, the probability of finding a better solution than sg is small,
and progress may be slow or stop. The neighbourhood shrinking procedure may
mitigate this problem, progressively reducing the search scope and increasing
the probability that a forager is generated inside GRst . For this to happen,
neighbourhood shrinking needs to keep GRst inside the search scope. Unless it

15
is a local optimum, it can be shown that the site centre sg is at least contiguous
to GRst . That is:
Proposition 4. A solution sg is either a local optimum of the fitness function
F , or lies on the border GR−
st of GRst .

Proof. This can be proven by contradiction:

sg 6∈ GR−
st ⇔ ∃ > 0 | v ∈ LRst ∀v ∈ B(sg , ) (18)

where B(sg , ) is an N -dimensional ball of radius and centred in sg . However:

v ∈ LRst ⇔ F (v) ≤ F (sg ) (19)

so sg is either a local optimum or is inside GR−

st .

This property holds for virtually any BA variant. An important conse-

quence is that there is no neighbourhood reduction that completely excludes
GRst from the search scope. Accordingly, neighbourhood shrinking does not af-
fect the ability of local search to progress to higher regions of fitness. If the GRst
region includes multiple peaks and sg is not on the main peak, neighbourhood
shrinking might make the main peak unreachable. If instead the fitness land-
scape is locally convex, neighbourhood shrinking will not affect the ability of
local search to reach the main peak in GRst . However, neighbourhood shrink-
ing affects the lower bound of the convergence time towards the optimum (eq.
9). Thus, it can be said that neighbourhood shrinking is most beneficial in the
latest iterations of the local search at a site, when a good local optimum - or
the global optimum - has been approximately located and the search focus is
shifted from speed to the accuracy of the solution. In many cases, it can be
argued that local search is indeed more likely to stall (and hence neighbourhood
shrinking to be performed) once local search approaches the local peak, and
GRst becomes increasingly small.

6.2. Stalling Probability Without Neighbourhood Shrinking

Let us consider a site s centred on sgt at cycle t. The probability that local
search without neighbourhood shrinking stalls at s will be henceforth indicated
as P (sgt = sgt+sttl ). It is computed as follows:
t

Proposition 5 (Stalling Probability Without Shrinking). Given site s centred

on sgt at cycle t, the probability that local search without neighbourhood shrinking
stalls is: ttl
P (sgt = sgt+sttl ) = |LRst |nr·st (20)
t

Proof. See electronic appendix.

16
One important aspect of the stalling probability is that, since local search
is random, it is not affected by the slope of the fitness surface. Proposition 5
is valid regardless whether sttl
t = stlim, that is, local search has not begun to
stagnate yet, or sttl
t < stlim and local search has already begun to stagnate.
Variants that use a dynamic assignment of foragers, like [13, 18, 17], yield
a more complex behaviour that leads to a different stalling probability formu-
lation. Some ideas on how to deal with these variants will be discussed later
in this section. If neighbourhood shrinking is used, the progressive reduction of
the search scope needs to be taken into account. In this case, it is possible that
if local search is trapped in a secondary peak, the GRst region may be lost as
the search scope is reduced.

6.3. Stalling Probability With Neighbourhood Shrinking

Let us consider the case where after t cycles, local search stagnates for k
cycles at sgt inside the basin of attraction of a local optimum Lopt (sgt 6= Lopt and
GRst is one unique region). In this case, proposition 4 stipulates that sgt lies on
the border of GRst . The most likely cause for repeated stalling of local search is
that GRst is small compared to C(sgt , set ). However, if GRst is small in comparison
with C(sgt , set ), and sgt is on the border of GRst , the distance d between sgt and
Lopt is likely to be small compared to the search edge (d(sgt , Lopt ) set ). That
is, GRst is likely to be far from the border of the search scope, and is not going to
be changed by the neighbourhood shrinking procedure. One possible exception
would be that GRst was very long and thin (e.g. a narrow valley in a 2D space),
and sgt and Lopt were on the two opposite sides of GRst (in the 2D example, on
the two sides of the valley). In this case. neighbourhood shrinking might reduce
GRst , although of a very small amount (one extreme of the valley might be cut
out).
If the GRst region at time t is unchanged after k successive applications of
the neighbourhood shrinking procedure, it is possible to compute the relative
coverages (eq. 15) of LRst+k and GRst+k as follows:
Lemma 1 (Coverage Reduction with constant GRst ). Let sgt be the centre of
site s at cycle t in the N -dimensional solution space. If local search stagnates for
k cycles (sgt = sgt+k ), and the region GRst 6= ∅ is not changed by neighbourhood
shrinking, the relative coverages of LRst+k and GRst+k become:

17
a portion of GRst is close to the border of C(sgt , set ), neighbourhood shrinking
reduces mostly the largest area (LRst ), and GRst+1 ' GRst . Equation (22)
shows that the relative coverage of GRst , and hence the likelihood of sampling a
fitter solution than sgt , grows (1/α > 1) exponentially. Neighbourhood shrinking
is therefore a powerful heuristics to foster progress in the the local search pro-
cedure. Neighbourhood shrinking introduces also a trade-off between reducing
the reach of local search, and hence slowing down the convergence to the local
optimum (see eq. 9), and making local search progress more likely, thus avoiding
several cycles of stalling. The probability of a complete stalling of local search
(i.e. site abandonment) can be calculated from eq. (22) as follows:
Proposition 6 (Stalling Probability With Neighbourhood Shrinking and Con-
stant GRst ). Let sgt be the centre of site s at cycle t in the N -dimensional
solution space. The probability that local search stagnates for k cycles if GRst is
not changed by neighbourhood shrinking is:
k nr
g g
Y 1
P (st = st+k ) = 1 − hN |GRst | (23)
α
h=1

Proof. After h cycles of stalling, the probability Pnr=1 (sgh = sgh+1 ) of not sam-
pling a single solution fitter than sgt in C(sgt , set+h ) is determined by the relative
coverage of LRst+h , which is defined in eq. (21):
1
Pnr=1 (sgh = sgh+1 ) = |LRst+h | = 1 − |GRst | (24)
αkN
The probability of stalling at any cycle h is equal to the probability of not
picking a fitter solution than sgt in nr independent samples of C(sgt , set+h ):
P (sgh = sgh+1 ) = Pnr=1 (sgh = sgh+1 )nr (25)
The probability of k consecutive cycles of stalling is calculated from eqs. (24)
and (25):
k−1
P (sgt = sgt+k ) = P (sgh = sgh+1 ) =
Y

h=1
(26)
k−1
Y nr
1
= 1 − hN |GRst |
α
h=1

This result is valid as long as the number of recruited bees is constant for
the k cycles monitored. If the number of bees changes at every iteration, for
example as in Packianather et al [13], nr in eq. (23) should be replaced by a
variable number nrk .
The stalling probability can never be 0, since LRst 6= ∅ for any st . It
should also be noted that the results of propositions 5 and 6 are independent of
the neighbourhood shape. The implications of using hyperspherical instead of
hypercubic neighbourhoods will be discussed in Section 7.

18
6.4. A large stlim or nr?
Proposition 6 is important to understand the behaviour of the Bees Algo-
rithm when neighbourhood shrinking does not change GRst , or at least does
not change it significantly. As discussed in Section 6.3, this occurrence is most
likely when neighbourhood search is near the local optimum, that is, GRst is
small and near the centre of C. In this case, the probability that local search
stagnates is large (|GRst | is small), and the site may be abandoned after stlim
cycles of stalling before the local optimum is found (local search stalls).
The probability that local search stalls depends on the number nr of solutions
that are sampled in one local search cycle, the stalling limit stlim, and the size of
the search scope. The larger nr and stlim are, the more likely is to pick at least
one solution within GRst , and thus the smaller is the likelihood that local search
stalls. However, the effect of nr and stlim on the stalling probability is not the
same, due to the nonlinear reduction of the search scope by neighbourhood
shrinking. Given a fixed number of sampling opportunities (equal to nr · stlim),
the question is whether it is more beneficial to sample thoroughly C for lesser
iterations (large nr), or sample less intensely C for longer times (large stlim).
In this section, it is assumed that nr and stlim can be increased by an
integer factor q > 1, and the local search stalling probability will be indicated
as Pnr (st = st+stlim ), where the index nr accounts for the number of candidate
solutions sampled in C in one local search cycle.
Lemma 2. Let sgt be the centre of site s at cycle t in the N-dimensional so-
lution space. Assuming that GRst is not changed by neighbourhood shrinking,
an increase in the stalling limit by an integer factor q > 1 modifies the stalling
probability of local search as follows:

Pnr (sgt = sgt+q·stlim ) = Pnr (sgt = sgt+stlim ) · T (27)

where T as:
q·stlim nr
Y 1
T = 1− |GRst | (28)
αkN
k=stlim+1

Proof. From Proposition 6, Pnr (sgt = sgt+q·stlim ) is equals to:

19
Lemma 3. Let st be the centre of site s at cycle t in the N-dimensional so-
lution space. Assuming that GRst is not changed by neighbourhood shrinking,
an increase in the number of foragers by an integer factor q > 1 modifies the
stalling probability of local search as follows:

Pq·nr (sgt = sgt+stlim ) = Pnr (sgt = sgt+stlim )q (30)

Proof. The proof is straightforward:

stlim
Y q·nr
Pq·nr (sgt = sgt+stlim ) = 1 − β k |GRst | =
k=1
!q (31)
stlim
Y nr
k
= 1 − β |GRst | = Pnr (sgt = sgt+stlim )q
k=1

The next remark will prove that if GRst is not changed by neighbourhood
shrinking, increasing the stalling limit by an integer factor q > 1 has more effect
on decreasing the stalling probability than increasing the number of foragers by
the same factor.
Proposition 7 (stlim vs. nr). Let st be the centre of site s at cycle t in the
N-dimensional solution space. Assuming that GRst is not changed by neigh-
bourhood shrinking, an increase in the stalling limit of an integer factor q > 1
reduces the stalling probability more than an equal increase in the number of
foragers.
Pnr (st = st+q·stlim ) < Pq·nr (st = st+stlim ) (32)
Proof. See electronic appendix.
Proposition 7 can also be proven considering a fixed number of available
sampling opportunities T = (q · nr) · stlim = nr · (q · stlim) of the search scope.
If the choice is to increase the number of foragers, C will be sampled q · nr
times for at most stlim cycles of stalling before being abandoned. If GRst is
unchanged by neighbourhood shrinking, all candidate solutions will be sampled
1

with a stalling probability πnr ≥ A = 1 − αstlim·N |GRst | (see eq. (A.32) in
the electronic appendix). If instead the choice is to increase the stalling limit,
C will be sampled nr times for q · stlim cycles, and (q · stlim − stlim) · nr of
1
these samples will have a stalling probability πstlim ≤ A = 1 − αstlim·N |GRst |
(see eq. (A.33) in the electronic appendix).
As long as GRst is not disjoint multimodal, proposition 7 gives the practi-
tioner a useful guideline to parameterize the Bees Algorithm. This case is not
uncommon as the scope of the local search has narrowed down on the attraction
basin of one peak of performance. If GRst contain several peaks, there is the
risk that repeated applications of the neighbourhood shrinking procedure may
cut the main peak out of GRst . In this latter case, a high nr ensures that many
sampling attempts are made before the main peak is lost. Unfortunately, the

20
actual fitness landscape is not known, and trial-and-error is usually needed to
address the nr vs. stlim trade-off. However, several empirical studies [5, 6, 7]
obtained the best performances over a large set of varied benchmarks using large
stlim values, suggesting a wide applicability of proposition 7.

7. Local Search Scope Shape

Among the numerous variants of the BA, the shape of the search scope is one
of the least researched features in the literature. In the standard formulation of
the Bees Algorithm (Section 2), the search scope C(sgt , set ) of a site s at the cycle
t, is defined as a hypercube of side set centred in sgt . A new candidate solution
v ∈ C(sgt , set ) is generated uniformly sampling the hypercube C(sgt , set ).
The main limitation of this hypercubic sampling is the anisotropic character
of the search, which has the shortest extent in the direction of the coordinate
axes, and the longest aligned with the diagonals of the C(sgt , set ) hypercube. This
anisotropy introduces a bias in the local search.
Moreover, as the dimensionality of the solution space increases, the volume
of the C(sgt , set ) hypercube exponentially increases, making the sampling more
sparse (curse of dimensionality, [38]).

7.1. Isotropic Local Search

An isotropic search scope can be implemented using a hypersphere (ball )
B centred in sgt of radius srt . The reader is referred to [39] and [40] for the
algorithmic details of how to achieve a uniformly distributed spherical sampling.
Cubic sampling can be replaced by spherical sampling keeping the rest of
the Bees Algorithm unchanged. Neighbourhood shrinking in this case shrinks
the hypersphere radius instead of the hypercube edge.
However, replacing cubic with spherical sampling does change the properties
of the search. For instance, the maximum reach of local search √
(proposition 2)
se N
in one cycle changes from the diagonal of the hypercube t 2 to the radius
of the hypersphere srt , and does not scale any more with the dimensionality of
the search space. Moreover, if cubic sampling is used, the volume of the search
scope grows with the number of dimensions:

V(C(sgt , set )) = (set )N (33)

whilst the volume of the hypersphere initially grows and then decreases with
the number of dimensions [41]:

π /2
N

V(C(sgt , srt )) = VN · (srt )N = (sr )N (34)

Γ ( /2 + 1) t
N

where Γ is gamma function. More precisely, keeping the radius srt fixed, the
volume increases for the first N ∗ dimensions, where

N ∗ = {N | DN −1 < srt ≤ DN } (35)

21
Parameter N nr set sttl
t sgt GRst center GRst radius
Value 4 15 10 8 [1, 0, 0, 0] [0, 0, 0, 0] 1

Table 1: Parameters setting used in the tests.

with
Γ (N/2 + 3/2)
DN = √ (36)
πΓ (N/2 + 1)
and sharply decreases afterwards, approaching zero for large N values. As
mentioned in Section 6, replacing cubic with spherical sampling does not alter
the validity of propositions 5 and 6.
Proposition 8 (Scope Variation Invariance). Let C(sgt , set ) and B(sgt , sr ) be the
local search scope using respectively cubic and spherical sampling, and |GRst |C
and |GRst |S be the relative coverage of the GRst region using respectively cubic
and spherical sampling. If neighbourhood shrinking does not change the GRst
region, shrinking the edge/radius of the search scope of a factor α leads to the
same change in the respective coverages:
1
C(sgt , αset ) ⇒ |GRst |C
αN (37)
1
B(sgt , αsrt ) ⇒ N |GRst |S
α
Proof. See supplementary material.
The consequence of proposition 8 is that the stagnation probability is com-
puted in the same way (proposition 6) regardless of the kind of sampling used.
However, the different behaviour of the search scope volume in the two cases
has important implications for high dimensional spaces.
A possible enhancement of the current algorithm would be to switch the
shape of the search scope opportunistically to foster the exploratory (cubic
sampling) or exploitative (spherical sampling) goal of local search.

7.2. Stalling Probability: Experimental Verification

To verify and visualise the theoretical predictions of section 6, and how the
stalling probability varies with the shape of the neighbourhood, the following
experimental tests were carried out. The situation where local search has con-
verged inside a large basin of attraction was mimicked. A four-dimensional
fitness landscape of hyperspherical shape was considered. The hyperspherical
basin had unitary radius, and was centred in the origin of the Cartesian space.
In this landscape, local search was performed using a hypecubic neighbourhood
g
of edge set = 10, time to live sttl
t = 8, and initially centred in st = [1, 0, 0, 0].
Fifteen forager bees were used to search the neighbourhood. The parameters of
the example are summarised in table 1.

22
In this case, the V (GRst ) V (C(sgt , set )) region is a hypersphere centred
in the origin with unitary radius3 . As per proposition 4, sgt lies on the (open)
surface of the hypersphere GRst . It can be shown that he following variables
take the values:
|GRst | ≈ 4.9348 · 10−4
V(C(sgt , set )) = 104 (38)
V(GRst ) ≈ 4.9348

and, by complementarity:

V(LRst ) ≈ 9995.0652 |LRst | ≈ 0.9995 (39)

When no neighbourhood shrinking is used (case 1), the stalling probability is

given by proposition 5:

P (sgt = sgt+sttl ) = 0.999515·8 ≈ 0.9425 (40)

When neighbourhood shrinking is used (case 2, shrinking factor α = 0.9),

the GRst region is unchanged, therefore its volume is the same. Local search
stalls after sttl
t = 8 consecutive cycles of stagnation. For each of these cycles of
stagnation, neighbourhood shrinking is applied. The volumes of the initial and
final search scope are:

V(C(sgt , set )) = 104

(41)
V(C(sgt+sttl , set+sttl )) = 343.3684
t t

where set+sttl ≈ 4.3047. The initial and final relative coverage of the GR regions
t
are:
|GRst | ≈ 4.9348 · 10−4 |GRst+sttl | ≈ 1.4372 · 10−2 (42)
t

According to proposition 6 the stalling probability is now equal to:

ttl
st nr
1
P (sgt sgt+sttl )
Y
= = 1− kN
|GRst | =
α
k=1
(43)
8 15
Y 1 −4
= 1− (4.9348 · 10 ) ≈ 0.67144
0.9k·4
k=1

When a hyperspherical (isotropic) neighbourhood of radius sr = 5 (equivalent

to a cubic sampling with edge set = 10) is used (case 3), the search scope volume
V(B(sgt , srt )) and the relative coverage of GRs are:

|GRst | ≈ 1.6 · 10−3

V(B(sgt , srt )) ≈ 3084.2514 (44)
|LRst | ≈ 0.9984

3 The actual radius of GRst is 1− since sgt 6∈ GRst .

23
Sampling Type
Cubic Spheric
Predicted Experimental Predicted Experimental
Without NS 0.9425 0.9429 0.8252 0.8249
With NS 0.6714 0.67137 0.2725 0.2716

Table 2: Predicted stalling probability and experimental frequency of stalling events for the
four cases described in section 7.2.

If neighbourhood shrinking is not used, the predicted stalling probability is

(proposition 5):
P (sgt = sgt+sttl ) = 0.998415·8 ≈ 0.8252 (45)
t

If neighbourhood shrinking is performed (case 4), the volumes of the initial

and final (after sttl
t = 8 consecutive cycles of stagnation) search scope are:

V(B(sgt , srt )) ≈ 3084.2514

(46)
V(B(sgt+sttl , srt+sttl )) ≈ 105.9034
t t

and the initial and final GRs relative coverages are:

|GRst | ≈ 1.6 · 10−3 |GRst+sttl | ≈ 4.6597 · 10−2 (47)

The predicted stalling probability is (proposition 6):

8 15
Y 1
P (st = st+sttl ) = 1− (1.6 · 10−3 ) ≈ 0.2725 (48)
0.9k·4
k=1

The theoretical predictions were numerically tested, performing 106 independent

optimisation runs for each of the above four cases. Table 2 compares the pre-
dicted and experimental stalling frequency (number of times local search stalled
divided by the total number of runs) for the four cases.
The empirical results prove the validity of the theoretical predictions. In
particular, it is apparent that neighbourhood shrinking increases the probabil-
ity of progress in local search, thus reducing the stalling probability. At the
same time, the empirical examples show the significance of the consequences
associated to the choice of neighbourhood shape. In general, the analysis of this
section points out that the standard practice of using cubic sampling should
be reevaluated in term of the search bias introduced, and the evolution of the
stalling probabilities with repeated iterations of neighbourhood shrinking. In
detail, a hypersphere of diameter 2srt has a smaller hypervolume than a hyper-
cube of edge set = 2srt , and determines a more exploitative search with higher
probability of finding solutions in GRst .
The tests for the four cases were also repeated varying the dimensionality of
the fitness landscape from 2 to 12. The experimental results are shown in fig. 3
and confirm neighbourhood shrinking and hyperspherical sampling are effective
policies against premature stalling.

24
Stalling Probability Varying the Dimensionality

0.8
Stalling Probability

0.6

0.4

Cubic Without NS
0.2
Cubic With NS
Spheric Without NS
0 Spheric With NS

2 4 6 8 10 12
Number of Dimensions (N )

Figure 3: Stalling probability using different sampling methods, with and without the neigh-
bourhood shrinking. All the parameters are kept fixed except the number of dimensions of
the problem.

8. Discussion

There is a marked imbalance in the Swarm Intelligence literature, with a

prevalence of experimental over theoretical studies. Despite the large success
in applications, mathematical analysis of the algorithms is still limited, leav-
ing several open questions on the behaviour, parameterisation, complexity, and
nature of the various algorithms. Explanation of the metaheuristics is often
limited to the biological metaphor, which may prevent the reader from gaining
a full understanding of the search mechanisms [42]. The scarcity of analyti-
cal foundations in Swarm Intelligence research has been pointed out by several
authors [43, 44, 45].
In this paper, the properties and main features of the Bees Algorithm were
formalised and analysed. Despite a number of experimental studies [5, 6, 7, 9]
benchmarked the capabilities of the Bees Algorithm, an analytical investigation
of its behaviour and operators had never been carried out. The results of the
proposed study clarify and support the previous experimental findings, as well
as reveal so far overlooked properties. The main findings are summarised below.
The similarities and differences between the Bees Algorithm and standard
optimisation methods were discussed. In particular, the Bees Algorithm can be
regarded as a parallel version of the LJ Search and VNS methods, where the
sampling of the neighbourhood is adaptively allocated (waggle dance) according
to the fitness of the seed solution. In terms of local search, the main difference

25
with the two aforementioned methods is in the way the neighbourhood is varied:
the Bees Algorithm uses neighbourhood shrinking, whilst VNS tries a number
of randomly generated shapes, and standard LJ shrinks the neighbourhood re-
gardless of the progress of local search. Also, the Bees Algorithm terminates
the local search after stlim stagnation cycles, whilst LJ Search customarily ter-
minates the search after a fixed number of iterations regardless of the progress.
In terms of overall metaheuristic, the Bees Algorithm performs several local
searches in parallel, adaptively shifting the sampling effort at each generation
according to the progress of the search. Neighbourhoods can be abandoned
due to lack of progress, or replaced with more promising ones found via global
search. For a comparison between the Bees Algorithm and akin swarm optimi-
sation techniques [46, 23] the reader is referred to [5].
The theoretical analysis of the properties of local search showed that the
expected step size quickly approaches the maximum value as the number of for-
ager bees is increased. If local search is desired to quickly climb (descend) the
fitness slope, a large neighbourhood size is more beneficial than a large number
of foragers. Analysis of the stalling probability also found limited benefits in
increasing the number of local foragers. That is, neighbourhood shrinking and a
large stagnation limit are the most effective policies against premature stagna-
tion of local search. This latter result is in good agreement with the indications
of several experimental studies [5, 6, 7], where best performances were obtained
using the largest allowed value for the stagnation limit stlim.
One of the main contributions of this theoretical analysis regards the shape
of the local neighbourhood. For ease of implementation, nearly all versions of
the Bees Algorithm used hypercubic local neighbourhoods. As demonstrated,
hypercubic sampling biases the search along the directions of the diagonal, and
has poor exploitation capabilities in high dimensional spaces due to the curse
of dimensionality. That is, the volume of hypercubic neighbourhoods is a power
function of the search scope edge se . As suggested in section 7, the neighbour-
hood shape might be varied during the search to switch from explorative (cubic
sampling) to exploitative (spherical sampling) search strategies.

9. Conclusions

The Bees Algorithm is a popular optimisation method inspired by the for-

aging behaviour of honey bees. Despite several experimental investigations, the
properties of the Bees Algorithm have never been formally analysed. This paper
covers this gap, focusing particularly on the properties of local search.
The main indications are that the local search capabilities of the Bees Al-
gorithm are mainly determined by the size and shape of the neighbourhood,
and the number of allowed stagnation cycles. A large neighbourhood enables a
quicker progress on the fitness landscape. Conversely, reducing the neighbour-
hood size helps avoiding premature stagnation of local search. The effect of
increasing forager recruitment on the expected search step size (section 5) and
stagnation probability (section 6) grows sublinearly with the number of bees.

26
The shape of the neighbourhood function has been so far largely overlooked
in the Bees Algorithm literature. However, it was shown in section 7 that
the customary choice of hypercubic sampling creates large neighbourhoods in
high-dimensional spaces due to the curse of dimensionality. On the other hand,
hyperspherical sampling creates neighbourhoods of sizes that vary according to
the gamma function, and tend to become small in high-dimensional spaces (zero
for infinitely high-dimensional spaces). Thus, the exploitation capability of local
search is highly influenced by the choice of neighbourhood shape.
Overall, the Bees Algorithm can be seen as a parallel adaptive version of the
LJ Search and VNS algorithms (section 4), in which the modification of the
neighbourhood size and allocation of sampling opportunities are dynamically
adjusted according to the fitness of the neighbourhood centres and the local
progress of the search. Differently from LJ Search and VNS, the Bees Algorithm
also keeps on searching the fitness landscape for new promising neighbourhoods
via the global search procedure.
Throughout the paper, the Bees Algorithm was presented in a rigorously
mathematical and algorithmic format, beyond the customary qualitative de-
scription based on the biological metaphor. It is hoped that this new formalism
improves the understanding of the Bees Algorithm, and spurs new analytical
studies on its properties, and its similarities to and differences with other Swarm
Intelligence metaheuristics.

[1] D. T. Pham, A. Ghanbarzadeh, E. Koç, S. Otri, S. Rahim, M. Zaidi, The

bees algorithm — A novel tool for complex optimisation problems, in: In-
telligent Production Machines and Systems, Elsevier, 454–459, 2006.

[2] D. T. Pham, M. Castellani, H. A. Le Thi, Nature-inspired intelligent op-

timisation using the bees algorithm, in: Transactions on Computational
Intelligence XIII, Springer, 38–69, 2014.
[3] D. T. Pham, L. Baronti, B. Zhang, M. Castellani, Optimisation of Engineer-
ing Systems With the Bees Algorithm, International Journal of Artificial
Life Research (IJALR) 8 (1) (2018) 1–15.
[4] K. Frisch, The role of dances in recruiting bees to familiar sites, Animal
Behaviour 16 (4) (1968) 531–533.
[5] D. T. Pham, M. Castellani, The Bees Algorithm: Modelling Foraging Be-
haviour to Solve Continuous Optimization Problems, Proceedings of the
Institution of Mechanical Engineers, Part C: Journal of Mechanical Engi-
neering Science 223 (12) (2009) 2919–2938.
[6] D. T. Pham, M. Castellani, Benchmarking and comparison of nature-
inspired population-based continuous optimisation algorithms, Soft Com-
puting 18 (5) (2014) 871–903.

[7] D. T. Pham, M. Castellani, A comparative study of the Bees Algorithm as

a tool for function optimisation, Cogent Engineering 2 (1).

27
[8] A. Auger, B. Doerr, Theory of randomized search heuristics: Foundations
and recent developments, vol. 1, World Scientific, 2011.
[9] W. A. Hussein, S. Sahran, S. N. H. S. Abdullah, The variants of the Bees
Algorithm (BA): a survey, Artificial Intelligence Review 47 (1) (2017) 67–
121.

[10] D. H. Wolpert, W. G. Macready, No free lunch theorems for optimization,

IEEE transactions on evolutionary computation 1 (1) (1997) 67–82.
[11] M. Castellani, Q. T. Pham, D. T. Pham, Dynamic optimisation by a modi-
fied bees algorithm, Proceedings of the Institution of Mechanical Engineers,
Part I: Journal of Systems and Control Engineering 226 (7) (2012) 956–971.
[12] D. T. Pham, M. Castellani, A. Fahmy, Learning the inverse kinematics of
a robot manipulator using the bees algorithm, in: Industrial Informatics,
2008. INDIN 2008. 6th IEEE International Conference on, IEEE, 493–498,
2008.

[13] M. Packianather, M. Landy, D. T. Pham, Enhancing the speed of the Bees

Algorithm using Pheromone-based Recruitment, in: Industrial Informatics,
2009. INDIN 2009. 7th IEEE International Conference on, IEEE, 789–794,
2009.
[14] B. Yuce, M. S. Packianather, E. Mastrocinque, D. T. Pham, A. Lambiase,
Honey bees inspired optimization method: the bees algorithm, Insects 4 (4)
(2013) 646–662.
[15] Q. T. Pham, D. T. Pham, M. Castellani, A modified bees algorithm and
a statistics-based method for tuning its parameters, Proceedings of the
Institution of Mechanical Engineers, Part I: Journal of Systems and Control
Engineering 226 (3) (2012) 287–301.
[16] A. Ghanbarzadeh, Bees Algorithm: a Novel Optimisation Tool, Ph.D. the-
sis, Engineering, 2007.
[17] D. Pham, A. H. Darwish, Fuzzy selection of local search sites in the Bees
Algorithm, in: Proceedings of the 4th Virtual International Conference on
Intelligent Production Machines and Systems, 1–14, 2008.
[18] D. T. Pham, H. A. Darwish, Using the bees algorithm with Kalman filtering
to train an artificial neural network for pattern classification, Proceedings
of the Institution of Mechanical Engineers, Part I: Journal of Systems and
Control Engineering 224 (7) (2010) 885–892.

[19] A. Imanguliyev, Enhancements for the Bees Algorithm, Ph.D. thesis,

Cardiff University, 2013.
[20] S. Ahmad, A study of search neighbourhood in the bees algorithm, Ph.D.
thesis, Cardiff University, 2012.

28
[21] D. T. Pham, E. Koç, Design of a two-dimensional recursive filter using the
bees algorithm, International Journal of Automation and Computing 7 (3)
(2010) 399–402.
[22] A. Rajasekhar, N. Lynn, S. Das, P. N. Suganthan, Computing with the col-
lective intelligence of honey bees–a survey, Swarm and Evolutionary Com-
putation 32 (2017) 25–48.
[23] D. Karaboga, B. Basturk, A powerful and efficient algorithm for numerical
function optimization: artificial bee colony (ABC) algorithm, Journal of
global optimization 39 (3) (2007) 459–471.

[24] N. Mladenović, P. Hansen, Variable neighborhood search, Computers &

operations research 24 (11) (1997) 1097–1100.
[25] R. Luus, T. Jaakola, Optimization by direct search and systematic reduc-
tion of the size of search region, AIChE Journal 19 (4) (1973) 760–766.
[26] P. Hansen, N. Mladenović, Variable neighborhood search, in: Handbook of
metaheuristics, Springer, 145–184, 2003.
[27] L.-M. Rousseau, M. Gendreau, G. Pesant, Using constraint-based operators
with variable neighborhood search to solve the vehicle routing problem
with time windows, in: Proceedings of the 1st Workshop on Integration
of AI and OR Techniques in Constraint Programming for Combinatorial
Optimization Problems, 43–58, 1999.
[28] O. Bräysy, Local search and variable neighborhood search algorithms for
the vehicle routing problem with time windows, Vaasan yliopisto, 2001.
[29] R. Whitaker, A fast algorithm for the greedy interchange for large-scale
clustering and median location problems, INFOR: Information Systems
and Operational Research 21 (2) (1983) 95–108.
[30] N. Mladenović, J. Petrović, V. Kovačević-Vujčić, M. Čangalović, Solving
spread spectrum radar polyphase code design problem by tabu search and
variable neighbourhood search, European Journal of Operational Research
151 (2) (2003) 389–399.

[31] F. Garcia-López, B. Melián-Batista, J. A. Moreno-Pérez, J. M. Moreno-

Vega, The parallel variable neighborhood search for the p-median problem,
Journal of Heuristics 8 (3) (2002) 375–388.
[32] A. Djenić, N. Radojičić, M. Marić, M. Mladenović, Parallel VNS for bus
terminal location problem, Applied Soft Computing 42 (2016) 448–458.
[33] R. Luus, Optimal control by direct search on feedback gain matrix, Chem-
ical Engineering Science 29 (4) (1974) 1013–1017.

29
[34] R. Luus, A practical approach to time-optimal control of nonlinear sys-
tems, Industrial & Engineering Chemistry Process Design and Development
13 (4) (1974) 405–408.
[35] S. H. Oh, R. Luus, Optimal feedback control of time-delay systems, AIChE
Journal 22 (1) (1976) 140–147.

[36] G. G. Nair, Suboptimal control of nonlinear time-delay systems, Journal of

Optimization Theory and Applications 29 (1) (1979) 87–99.
[37] G. Gopalakrishnan Nair, On the convergence of the LJ search method,
Journal of Optimization Theory and Applications 28 (3) (1979) 429–434.

[38] R. E. Bellman, Adaptive control processes: a guided tour, Princeton uni-

versity press, 2015.
[39] J. Cook, Rational formulae for the production of a spherically symmetric
probability distribution, Mathematics of Computation 11 (58) (1957) 81–
82.

[40] M. E. Muller, A note on a method for generating points uniformly on n-

dimensional spheres, Communications of the ACM 2 (4) (1959) 19–20.
[41] T. Stibor, J. Timmis, C. Eckert, On the use of hyperspheres in artificial im-
mune systems as antibody recognition regions, in: International Conference
on Artificial Immune Systems, Springer, 215–228, 2006.
[42] C. L. Camacho-Villalón, M. Dorigo, T. Stützle, Why the Intelligent Wa-
ter Drops Cannot Be Considered as a Novel Algorithm, in: International
Conference on Swarm Intelligence, Springer, 302–314, 2018.
[43] X.-S. Yang, Swarm-based metaheuristic algorithms and no-free-lunch the-
orems, in: Theory and New Applications of Swarm Intelligence, InTech,
1–16, 2012.
[44] J. Swan, S. Adriaensen, M. Bishr, E. K. Burke, J. A. Clark, P. De Caus-
maecker, J. Durillo, K. Hammond, E. Hart, C. G. Johnson, et al., A re-
search agenda for metaheuristic standardization, in: Proceedings of the XI
metaheuristics international conference, 7–10, 2015.
[45] A. P. Piotrowski, Across Neighborhood Search algorithm: A comprehensive
analysis, Information Sciences 435 (2018) 334–381.
[46] J. Kennedy, R. Eberhart, Particle swarm optimization, in: Proceedings of
the 1995 IEEE International Conference on Neural networks, IEEE, 1942–
1948, 1995.
[47] S. M. Ross, Introduction to probability models, Academic press, 2014.

30
An Analysis of the Search Mechanisms
of the Bees Algorithm
(Electronic Appendix)
Luca Baronti, Marco Castellani, and Duc Truong Pham
Department of Mechanical Engineering, University of Birmingham, United
Kingdom

A. Theorems Proofs

A.1. Expected Step Size

Proposition 3 (Expected Step Size). Given a strict monotonic increasing one-
dimensional fitness landscape in [0, `] ∈ R (e.g. a straight line), and a site
centred in sgt = `/2 with edge set = `, the expected step size of one local search
iteration (i.e. the average distance between sgt and sgt+1 ) is:

0.5nr+1 + nr `
d(sgt , sgt+1 ) = ` − (A.1)
nr + 1 2
Proof. The goal is to calculate the expected output of the stochastic local search
operator sgt+1 = Lnr (sgt ) defined in eq. (2), with the search scope within [0, `].
This output can be expressed as the following continuous random variable:

X = arg max{F (sgt ), F (x1 ), . . . , F (xnr )} =

xi
(A.2)
= arg max {φ(x1 ), . . . , φ(xnr )}
xi

where:
xi ∼ U (0, `) and φ(xi ) = max{F (sgt ), F (xi )} (A.3)
The expected value E of a continuous random variable Y defined in the
interval [a, b] is computed as [47]:
Z b
E[Y ] = y · P DFY (y)dy (A.4)
a

where P DFY (y) is the probability density function of Y . The probability den-
sity function of a random variable Y is equal to the derivative of the cumula-
tive distribution function CDFY (y). In the case of the variable X defined in
eq. (A.2):
nr
Y
CDFX (x) = P (φ(xi ) ≤ x) = CDFφ(x) (x)nr (A.5)
i=1

31
where P (φ(xi ) ≤ x) is the probability that one random sample xi of the search
scope is less or equal to x. Differentiating CDFX (x) and plugging the derivative
into eq. (A.4):
Z l
E[X] = φ(x) · nr · CDFφ(x) (x)nr−1 P DFφ(x) (x)dx (A.6)
0

Using the Law of the Unconscious Statistician [47] is possible to make the fol-
lowing substitution:
Z `
E[X] = φ(x) · nr · CDFφ(x) (x)nr−1 P DFφ(x) (x)dx
0
Z `
(A.7)
nr−1
= φ(x) · nr · CDFx (x) P DFx (x)dx
0

The cumulative distribution function and the probability density function of a

random variable X sampled with uniform probability in U (0, `) are:

0
 x<0
CDFU (x) = x` 0≤x<` (A.8)

1 y≥`


and (
1
` 0≤x≤`
P DFU (x) = (A.9)
0 elsewhere

From eq. (A.2) it is known that x ∼ U (0, `), also F is assumed to be monotonic4 ,
therefore:
Z `
E[X] = max{x, sgt } · nr · CDFU (x)nr−1 P DFU (x)dx =
0
Z `/2 x nr−1 1 Z ` x nr−1 1
= `/2 · nr · dx + x · nr · dx =
0 ` ` `/2 ` `
`/2 l
nr · xnr nr · xnr+1
= + =
2nr(`nr−1 ) 0 `nr (nr + 1) `/2 (A.10)
nr+1 nr · `(1 − 0.5nr+1 )
= 0.5 `+ =
nr + 1
nr0.5nr+1 ` + 0.5nr+1 ` + nr · ` − nr0.5nr+1 `
= =
nr + 1
0.5nr+1 + nr
=`
nr + 1

4 In the proof the case of monotonic increasing fitness is considered. This is not a loss of

generality since only the expected step size is considered, not the direction.

32
Therefore the average step size is:
0.5nr+1 + nr `
d(sgt , sgt+1 ) = |sgt+1 − sgt | = ` − (A.11)
nr + 1 2

A.2. Coverage Reduction with constant GRst

Lemma 1 (Coverage Reduction with constant GRst ). Let sgt be the centre of
site s at cycle t in the N -dimensional solution space. If local search stagnates for
k cycles (sgt = sgt+k ), and the region GRst 6= ∅ is not changed by neighbourhood
shrinking, the relative coverages of LRst+k and GRst+k become:
1 1
|LRst+k | = (|LRst | − 1) + 1 = 1 − |GRst | (A.12)
αkN αkN
1
|GRst+k | = |GRst | (A.13)
αkN
Proof. Since GRst is not changed, GRst = GRst+j and V(GRst ) = V(GRst+j )
∀j = 1, . . . , k. Also, remembering eq. (13):
LRst ∪ GRst = C(sgt , set ) ⇒ V(LRst ) + V(GRst ) = V(C(sgt , set )) (A.14)
However, neighbourhood shrinking reduces the local search edge of a factor α
(Section 2 and eq. 5). That is, set+1 = αset and after k successive repetitions
of neighbourhood shrinking set+k = αk set . The volume of the search scope is
reduced accordingly:
V(C(sgt+k , set+k )) = V(C(sgt , αk set )) = αkN V(C(sgt , set )) (A.15)

Since GRst is not reduced, the reduction LRst will be equal to the reduction of
C(sgt , set )). That is:
V(LRst+k ) = V(LRst ) − V(C(sgt , set )) − V(C(sgt+k , set+k ))

(A.16)
From the definition of the relative coverage (eqs. (15) and (A.15)):
V(LRst+k ) V(LRst+k )
|LRst+k | = g = kN (A.17)
V(C(st+k , st+k ))
e α V(C(sgt , set ))
V(LRst+k )
And from eq. (A.16), αkN V(C(sg e is equals to:
t ,st ))

1 V(LRst ) − (V(C(sgt , set )) − V(C(sgt+k , set+k )))

· =
αkN V(C(sgt , set ))
1 V(LRst ) − (V(C(sgt , set )) − αkN V(C(sgt , set )))
= · =
αkN V(C(sgt , set ))
(A.18)
1 V(LRst ) (1 − αkN ) V(C(sgt , set ))
= · − · =
αkN V(C(st , set ))
g
αkN V(C(sgt , set ))
1 (1 − αkN )
= · |LRst | −
αkN αkN

|GRst+k | = 1 − |LRst+k | =
(A.20)

1 1
= 1 − 1 − kN |GRst | = kN |GRst |
α α

A.3. Stalling Probability Without Shrinking

Proposition 5 (Stalling Probability Without Shrinking). Given site s centred
on sgt at cycle t, the probability that local search without neighbourhood shrinking
stalls is: ttl
P (sgt = sgt+sttl ) = |LRst |nr·st (A.21)
t

Proof. The site stalls if the search stagnates for the next sttl
t cycles, that is, if all
the nr candidate solutions generated during sttl t local search cycles lie in LRsk .
When local search stagnates, the centre of the site is unchanged, and if the
search scope is not changed (no neighbourhood shrinking), |LRst | is constant:

|LRst | = |LRsk | ∀k ∈ N | t ≤ k < t + sttl

t (A.22)

The joint probability that all the solutions sampled during one given cycle
k of local search belong to LRsk is indicated as:

P (v ∈ LRsk ) ∀v ∈ {v1 , . . . , vnr } vi ∼ C(sgk , sek ) (A.23)

Due to the uniform sampling, the probability that one solution is picked from
LRsk corresponds to |LRsk |. Remembering eq. (A.22), it follows that:

P (v ∈ LRsk ) = |LRsk |nr = |LRst |nr (A.24)

The stalling probability can then be computed as the joint probability of sttl
t
consecutive stagnations of local search cycles:
t+sttl
t −1
Y ttl
P (st = st+sttl
t
)= |LRsk |nr = |LRst |nr·st (A.25)
k=t

34
A.4. stlim vs. nr
Proposition 7 (stlim vs. nr). Let st be the centre of site s at cycle t in the
N-dimensional solution space. Assuming that GRst is not changed by neigh-
bourhood shrinking, an increase in the stalling limit of an integer factor q > 1
reduces the stalling probability more than an equal increase in the number of
foragers.
Pnr (st = st+q·stlim ) < Pq·nr (st = st+stlim ) (A.26)
Proof. Remembering lemmas 2 and 3, eq. (32) can be re-written as:

A < Pnr (st = st+stlim )q (A.27)

with
q·stlim nr
Y 1
A = Pnr (st = st+stlim ) 1− kN
|GR st | (A.28)
α
k=stlim+1

with Pnr (st = st+stlim ) a non-null probability, and hence a positive real number.
Equation (A.27) can thus be rewritten as:
q·stlim nr
Y 1
1− kN
|GR st | < Pnr (st = st+stlim )q−1 (A.29)
α
k=stlim+1

Remembering proposition 6:

stlim
Y nr !q−1
q−1 1
Pnr (st = st+stlim ) = 1 − kN |GRst | (A.30)
α
k=1

Equation (A.29) becomes:

q·stlim nr stlim (q−1)·nr
Y 1 Y 1
1− kN
|GR st | < 1− kN
|GRst | (A.31)
α α
k=stlim+1 k=1

The two terms inside the brackets on the right and left hand sides of A.31 express
the relative coverage of LRst at time k. That is, they represent the probability
of picking a solution of lower fitness than st inside C at time k. They become
smaller as k increases (α < 1), and thus:
1 1
1− |GRst | ≤ 1 − |GRst | (A.32)
αstlim·N αkN
for all k ∈ {1, stlim}. Likewise:
1 1
1− |GRst | ≤ 1 − |GRst | (A.33)
αkN αstlim·N
for all k ∈ {stlim + 1, q · stlim}. Accordingly:

X<Y and W <Z (A.34)

Equation (A.31) (W < Y ) is certainly true if W < Z ≤ X < Y . Setting

1
A = 1 − αstlim·N |GRst | , Z ≤ X can be rewritten as:

q·stlim stlim
nr (q−1)·nr
Y Y
(A) ≤ (A) (A.36)
k=stlim+1 k=1

Developing the sequence of products:

nr·q·stlim (q−1)·nr·stlim
(A) ≤ (A) (A.37)

Given that A is a stalling probability, and hence 0 < A ≤ 1, the inequality

eq. (A.37) is true because the left hand side is raised to a higher power than the
right hand side of the inequality.

A.5. Scope Variation Invariance

Proposition 8 (Scope Variation Invariance). Let C(sgt , set ) and B(sgt , sr ) be the
local search scope using respectively cubic and spherical sampling, and |GRst |C
and |GRst |S be the relative coverage of the GRst region using respectively cubic
and spherical sampling. If neighbourhood shrinking does not change the GRst
region, shrinking the edge/radius of the search scope of a factor α leads to the
same change in the respective coverages:
1
C(sgt , αset ) ⇒ |GRst |C
αN (A.38)
1
B(sgt , αsrt ) ⇒ N |GRst |S
α
Proof. This can be directly proven as follows:

V(GRst ) V(GRst )
= =
V(C(sgt , αset )) (αset )N
(A.39)
1 V(GRst ) 1 V(GRst )
= N · = N ·
α (set )N α V(C(sgt , set ))

36
V(GRst ) V(GRst )
= =
V(B(sgt , αsrt )) VN · (αsrt )N
(A.40)
1 V(GRst ) 1 V(GRst )
= N · = N ·
α r
VN · (st )N α V(B(sgt , srt ))

Bee Algorithm
100% (1)
Bee Algorithm
37 pages
Bees Algorithm
100% (1)
Bees Algorithm
9 pages
Bees Algorithm
No ratings yet
Bees Algorithm
220 pages
The Bees Algorithm: Modelling Foraging Behaviour To Solve Continuous Optimization Problems
No ratings yet
The Bees Algorithm: Modelling Foraging Behaviour To Solve Continuous Optimization Problems
21 pages
Mathematics 10 02211 v2
No ratings yet
Mathematics 10 02211 v2
38 pages
Ai 05 00109
No ratings yet
Ai 05 00109
19 pages
A Comprehensive Review of Bat Inspired Algorithm: Variants, Applications, and Hybridization
No ratings yet
A Comprehensive Review of Bat Inspired Algorithm: Variants, Applications, and Hybridization
33 pages
The Bees Algorithm and Mechanical Design
No ratings yet
The Bees Algorithm and Mechanical Design
6 pages
2015 Elsevier Composite Artificial Bee Colony Algorithms From Component Based Analysis To High Performing Algorithms
No ratings yet
2015 Elsevier Composite Artificial Bee Colony Algorithms From Component Based Analysis To High Performing Algorithms
20 pages
Randomized Memetic Artificial Bee Colony Algorithm
No ratings yet
Randomized Memetic Artificial Bee Colony Algorithm
11 pages
Bees Algorithm1
No ratings yet
Bees Algorithm1
11 pages
Biomimetics 09 00634
No ratings yet
Biomimetics 09 00634
13 pages
Simultaneous Design Optimization of SemiRigid Plane Steel Frames With Semi-Rigid Bases Using Bees and Genetic Algorithms
No ratings yet
Simultaneous Design Optimization of SemiRigid Plane Steel Frames With Semi-Rigid Bases Using Bees and Genetic Algorithms
20 pages
Honey Badger Algorithm Notes
No ratings yet
Honey Badger Algorithm Notes
6 pages
Final Manuscript
No ratings yet
Final Manuscript
15 pages
Honey Badger Algorithm New Metaheuristic Algorithm For Solving Optimization Problems-2022
No ratings yet
Honey Badger Algorithm New Metaheuristic Algorithm For Solving Optimization Problems-2022
27 pages
Bees Algorithm1
No ratings yet
Bees Algorithm1
11 pages
Bees Algorithm
No ratings yet
Bees Algorithm
10 pages
Honey-Bees Mating Optimization (HBMO) Algorithm: A New Heuristic Approach For Water Resources Optimization
No ratings yet
Honey-Bees Mating Optimization (HBMO) Algorithm: A New Heuristic Approach For Water Resources Optimization
20 pages
المستند a85
No ratings yet
المستند a85
19 pages
Foraging Bee Optimization Algorithm
No ratings yet
Foraging Bee Optimization Algorithm
14 pages
Aderhold 2010
No ratings yet
Aderhold 2010
12 pages
Ijcs 48 3 04
No ratings yet
Ijcs 48 3 04
9 pages
Chaotic Bee Swarm Optimization Algorithm For Path Planning of Mobile Robots
No ratings yet
Chaotic Bee Swarm Optimization Algorithm For Path Planning of Mobile Robots
6 pages
Abdel-Basset Et Al 2021
No ratings yet
Abdel-Basset Et Al 2021
37 pages
A Powerful and Efficient Algorithm For Numerical
No ratings yet
A Powerful and Efficient Algorithm For Numerical
14 pages
1 s2.0 S0045782521005259 Main
No ratings yet
1 s2.0 S0045782521005259 Main
45 pages
Owt 2019 Presentation
No ratings yet
Owt 2019 Presentation
19 pages
2015 Elsevier A Directed Artificial Bee Colony Algorithm
No ratings yet
2015 Elsevier A Directed Artificial Bee Colony Algorithm
9 pages
tối ư hóa
No ratings yet
tối ư hóa
8 pages
13-12-22 - (Inderscience) Comparison of Bald Eagle Search (BES) Algorithm Paper - Rebuttal
No ratings yet
13-12-22 - (Inderscience) Comparison of Bald Eagle Search (BES) Algorithm Paper - Rebuttal
14 pages
Robot Use ABO
No ratings yet
Robot Use ABO
8 pages
Sample
No ratings yet
Sample
9 pages
2015 Elsevier Reduction of Artificial Bee Colony Algorithm For Global Optimization
No ratings yet
2015 Elsevier Reduction of Artificial Bee Colony Algorithm For Global Optimization
5 pages
Artificial Bee Colony Algorithm: A Survey: Jagdish Chand Bansal
No ratings yet
Artificial Bee Colony Algorithm: A Survey: Jagdish Chand Bansal
37 pages
An Empirical Comparison Between The Artificial Bee
No ratings yet
An Empirical Comparison Between The Artificial Bee
5 pages
Bees
No ratings yet
Bees
1 page
ESL Brains Where Would You Rather Live TV 6285
No ratings yet
ESL Brains Where Would You Rather Live TV 6285
5 pages
Lec #15-ABC
No ratings yet
Lec #15-ABC
17 pages
Firefly Dilip
No ratings yet
Firefly Dilip
11 pages
AIWH
No ratings yet
AIWH
19 pages
Chun Feng2014
No ratings yet
Chun Feng2014
9 pages
83-522-1-PB Bees
No ratings yet
83-522-1-PB Bees
16 pages
Paper1-Fast-Convergent Artificial Bee Colony With An Adaptive Local Search
No ratings yet
Paper1-Fast-Convergent Artificial Bee Colony With An Adaptive Local Search
9 pages
Comparison of Artificial Bee Colony Algorithm With Other Algorithms
No ratings yet
Comparison of Artificial Bee Colony Algorithm With Other Algorithms
4 pages
Applied Sciences: A Spring Search Algorithm Applied To Engineering Optimization Problems
No ratings yet
Applied Sciences: A Spring Search Algorithm Applied To Engineering Optimization Problems
21 pages
Unit 3
No ratings yet
Unit 3
37 pages
Artificial Bee Colony Algorithm
No ratings yet
Artificial Bee Colony Algorithm
25 pages
Using The Bees Algorithm To Schedule Jobs For A Machine: D.T. Pham, E. Koç, J.Y. Lee, J. Phrueksanant
No ratings yet
Using The Bees Algorithm To Schedule Jobs For A Machine: D.T. Pham, E. Koç, J.Y. Lee, J. Phrueksanant
10 pages
Mixolydian Scales - Trumpet - Lexcerpts
No ratings yet
Mixolydian Scales - Trumpet - Lexcerpts
1 page
Bee Colony Algorithm
No ratings yet
Bee Colony Algorithm
21 pages
Paraflex Cram Type o 2x18 Playa Edition CC v4
No ratings yet
Paraflex Cram Type o 2x18 Playa Edition CC v4
8 pages
Artificial Bee Colony Algorithm
No ratings yet
Artificial Bee Colony Algorithm
22 pages
Applied Mathematics and Computation
No ratings yet
Applied Mathematics and Computation
19 pages
Artificial Bee Colony Algorithm For Traveling Salesman Problem
No ratings yet
Artificial Bee Colony Algorithm For Traveling Salesman Problem
5 pages
OpenShift - Container - Platform 4.6 Service - Mesh en US
No ratings yet
OpenShift - Container - Platform 4.6 Service - Mesh en US
229 pages
Honey
No ratings yet
Honey
5 pages
Artificial Bee Colony Harmony Search and Bee Algorithms On Numerical Optimization
No ratings yet
Artificial Bee Colony Harmony Search and Bee Algorithms On Numerical Optimization
6 pages
CKLA G5 U2 Early-American-Civilizations TG
No ratings yet
CKLA G5 U2 Early-American-Civilizations TG
317 pages
Book Review Medicine Your Fingertips by Gireesh Ku
No ratings yet
Book Review Medicine Your Fingertips by Gireesh Ku
2 pages
Jaya A Novel Optimization Algorithm: What, How and Why?: Hari Mohan Pandey
No ratings yet
Jaya A Novel Optimization Algorithm: What, How and Why?: Hari Mohan Pandey
3 pages
Communication in The Workplace
100% (2)
Communication in The Workplace
120 pages
NORTH Kellyville Growth Centres Precinct Development Control Plan - in Force - 06 July
No ratings yet
NORTH Kellyville Growth Centres Precinct Development Control Plan - in Force - 06 July
322 pages
Tortoise Finds His Home by Maya Fowler, Katrien Coetzer, and Damian Gibbs
No ratings yet
Tortoise Finds His Home by Maya Fowler, Katrien Coetzer, and Damian Gibbs
19 pages
The Mediating Role of Brand Image in The Relationship Between Storytelling Marketing and Purchase Intention: Case Study of PX Mart
No ratings yet
The Mediating Role of Brand Image in The Relationship Between Storytelling Marketing and Purchase Intention: Case Study of PX Mart
14 pages
Dipa Karmakar - in Perfect Balance: Author: Sreelata Menon Illustrator: Sonal Gupta Vaswani
No ratings yet
Dipa Karmakar - in Perfect Balance: Author: Sreelata Menon Illustrator: Sonal Gupta Vaswani
34 pages
Addressing Mental Health in Maldives - WHO Report
No ratings yet
Addressing Mental Health in Maldives - WHO Report
16 pages
43 - Bunty and Bubbly
No ratings yet
43 - Bunty and Bubbly
17 pages
Mouse in The House: Author: Sowmya Rajendran Illustrator: Tanaya Vyas
No ratings yet
Mouse in The House: Author: Sowmya Rajendran Illustrator: Tanaya Vyas
15 pages
Io Non Ho Paura!
No ratings yet
Io Non Ho Paura!
17 pages
American Farmhouse - Jay Osborne - Ebook
No ratings yet
American Farmhouse - Jay Osborne - Ebook
56 pages
I Want To Be A Baker FKB 1
No ratings yet
I Want To Be A Baker FKB 1
26 pages
TN23 Precastdrainagepits
No ratings yet
TN23 Precastdrainagepits
12 pages
Red Hat Satellite-6.11-Administering Red Hat Satellite-En-Us
No ratings yet
Red Hat Satellite-6.11-Administering Red Hat Satellite-En-Us
169 pages
Red Hat Enterprise Linux-7-Kernel Administration Guide-En-US
No ratings yet
Red Hat Enterprise Linux-7-Kernel Administration Guide-En-US
73 pages
Configuring Robot Movement
No ratings yet
Configuring Robot Movement
12 pages
Akn Ls Act 2010 8 Eng 2010 06 14
No ratings yet
Akn Ls Act 2010 8 Eng 2010 06 14
35 pages
7G Unit 2 First-Day Fly STUDENT COPY Ed2.0
No ratings yet
7G Unit 2 First-Day Fly STUDENT COPY Ed2.0
8 pages
SRS Report w1957448
No ratings yet
SRS Report w1957448
51 pages
Article
No ratings yet
Article
12 pages
Towards A Dementia Plan: A WHO Guide
No ratings yet
Towards A Dementia Plan: A WHO Guide
82 pages
Tennis: Development, Globalisation, and Sociology in The International Context
No ratings yet
Tennis: Development, Globalisation, and Sociology in The International Context
5 pages
Processor-In-The-Loop Demonstration of MPC For HEVs Energy Management System
No ratings yet
Processor-In-The-Loop Demonstration of MPC For HEVs Energy Management System
6 pages
The Diagnostic
No ratings yet
The Diagnostic
10 pages
9G Unit 6 What Is Public Art STUDENT COPY Ed1.0 - Google Docs1
No ratings yet
9G Unit 6 What Is Public Art STUDENT COPY Ed1.0 - Google Docs1
4 pages
Interpolation and Extrapolation Optimal Designs 2: Finite Dimensional General Models
From Everand
Interpolation and Extrapolation Optimal Designs 2: Finite Dimensional General Models
Giorgio Celant
No ratings yet
Learn Design and Analysis of Algorithms in 24 Hours
From Everand
Learn Design and Analysis of Algorithms in 24 Hours
Alex Nordeen
No ratings yet
Search Algorithm: Fundamentals and Applications
From Everand
Search Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
From Everand
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
Fouad Sabry
No ratings yet
Differential Evolution: Fundamentals and Applications
From Everand
Differential Evolution: Fundamentals and Applications
Fouad Sabry
No ratings yet

An Analysis of The Search Mechanisms of The Bees Algorithm

Uploaded by

An Analysis of The Search Mechanisms of The Bees Algorithm

Uploaded by

University of Birmingham

An Analysis of the Search Mechanisms of the Bees

Citation for published version (Harvard):

Link to publication on Research at Birmingham portal

When citing, please reference the published version.

Download date: 11. Dec. 2023

Luca Barontia , Marco Castellania , Duc Truong Phama

Preprint submitted to Swarm and Evolutionary Computation February 12, 2020

The Bees Algorithm [1] is a nature-inspired intelligent technique that has

2. Formal Definition of the Bees Algorithm

1. (Initialisation) The initial population P is created sampling ns+nr·nb

Algorithm 1: The Bees Algorithm

3. Main Variants of the Bees Algorithm

Besides the standard procedure described in section 2, many different vari-

mance of other techniques.

The Bees Algorithm is part of a large family of metaheuristics mimicking

4.1. Variable Neighbourhood Search

1. Initialise k neighbourhoods S1 , . . . , Sk of variable size around a

VNS is akin to local neighbourhood search at a BA site. A variant of this

1. Let se1 be the N-dimensional vector of the spans (max−min value)

vi = st + u · set i ∈ {1, . . . , nr}

where u is a vector of N real values independently sampled with

st+1 = arg min {F (v) | v ∈ {st , v1 , . . . , vnr }}

4. Reduce the ranges of a given factor 0 < α < 1:

and increment the counter t = t + 1;

Except for the initialisation of the neighbourhood, the LJ Search algorithm

5. Analysis of Local Search Properties

5.1. Local Search: Introduction and Definitions

• n is the number of local search cycles from discovery to abandonment of

• S is the series of solutions found by local search at site s:

S = {sg1 , . . . , sgn } (1)

Lnr (sgt ) = arg max{F (v) | v ∈ {sgt , v1 , . . . , vnr }} (2)

with vi ∼ C(sgt , set ) as a uniform sampling of a solution vi in the hypercube

Lopt is a l.m. ⇔ ∃ > 0 | F (Lopt ) ≥ F (v) (3)

Neighbourhood shrinking can be similarly defined in the more general case of

sgt = sgt+k ⇔ sgt = sgt+1 = · · · = sgt+k (7)

for any positive index k ∈ N.

sgt ∈ C(sgt+1 , set+1 ) (8)

2A fix point x of a function f is a point such as f (x) = x

5.3. Expected Progress

5.3.1. Expected Step Size: Experimental Verification

6. Site Abandonment: Local Search Stalling Probability

6.1. Site Abandonment: Definitions and Properties

LRst ⊆ C(sgt , set ) v ∈ LRst ⇔ F (v) ≤ F (sgt )

Lnr (sgt ) = sgt+1 6= sgt ⇔ sgt+1 ∈ GRst

Average d(st , st+1 )

F (sgt ) ≥ F (v) ⇔ |LRst | = 1 ∧ |GRst | = 0 (17)

Proof. This can be proven by contradiction:

where B(sg , ) is an N -dimensional ball of radius  and centred in sg . However:

v ∈ LRst ⇔ F (v) ≤ F (sg ) (19)

so sg is either a local optimum or is inside GR−

This property holds for virtually any BA variant. An important conse-

6.2. Stalling Probability Without Neighbourhood Shrinking

Proposition 5 (Stalling Probability Without Shrinking). Given site s centred

Proof. See electronic appendix.

6.3. Stalling Probability With Neighbourhood Shrinking

Pnr (sgt = sgt+q·stlim ) = Pnr (sgt = sgt+stlim ) · T (27)

Proof. From Proposition 6, Pnr (sgt = sgt+q·stlim ) is equals to:

Pq·nr (sgt = sgt+stlim ) = Pnr (sgt = sgt+stlim )q (30)

Proof. The proof is straightforward:

7. Local Search Scope Shape

7.1. Isotropic Local Search

V(C(sgt , set )) = (set )N (33)

V(C(sgt , srt )) = VN · (srt )N = (sr )N (34)

N ∗ = {N | DN −1 < srt ≤ DN } (35)

Table 1: Parameters setting used in the tests.

7.2. Stalling Probability: Experimental Verification

V(LRst ) ≈ 9995.0652 |LRst | ≈ 0.9995 (39)

When no neighbourhood shrinking is used (case 1), the stalling probability is

P (sgt = sgt+sttl ) = 0.999515·8 ≈ 0.9425 (40)

When neighbourhood shrinking is used (case 2, shrinking factor α = 0.9),

V(C(sgt , set )) = 104

According to proposition 6 the stalling probability is now equal to:

When a hyperspherical (isotropic) neighbourhood of radius sr = 5 (equivalent

|GRst | ≈ 1.6 · 10−3

3 The actual radius of GRst is 1− since sgt 6∈ GRst .

Lopt is a l.m. ⇔ ∃ > 0 | F (Lopt ) ≥ F (v) (3)

where B(sg , ) is an N -dimensional ball of radius and centred in sg . However: