Towards Learning-Based Distributed Task Allocation Approach For Multi-Robot System

This paper presents a novel approach using Graph Convolutional Networks (GCNs) to enhance the Consensus-Based Bundle Algorithm (CBBA) for task allocation in multi-robot systems. The integration of GCNs aims to improve the efficiency and accuracy of task distribution by learning and predicting the score function essential for decision-making in real-time environments. The proposed AI-enhanced CBBA is evaluated against existing methods, demonstrating its potential in managing complex task allocations effectively.

Uploaded by

huyixiang01

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views6 pages

Towards Learning-Based Distributed Task Allocation Approach For Multi-Robot System

Uploaded by

huyixiang01

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

2024 10th International Conference on Automation, Robotics, and Applications

Towards Learning-based Distributed Task Allocation

Approach for Multi-Robot System
Zakaria Chekakta∗ , Nabil Aouf∗ , Shashank Govindaraj† , Fabio Polisano† and Geert De Cubber‡
∗
City, University of London, United Kingdom
† Space
Applications Services NV/SA, , Brussels, Belgium
‡ Royal Military Academy, Brussels, Belgium
2024 10th International Conference on Automation, Robotics and Applications (ICARA) | 979-8-3503-9424-5/24/$31.00 ©2024 IEEE | DOI: 10.1109/ICARA60736.2024.10553196

Emails: [email protected], [email protected], [email protected],

[email protected], [email protected]

Abstract—This paper introduces a novel application of Graph times, task durations, task deadlines, and fuel constraints are
Convolutional Networks (GCNs) for enhancing the efficiency of factors. Finding the optimal solution to this task allocation
the Consensus-Based Bundle Algorithm (CBBA) in multi-robot problem in real-time environments becomes computationally
task allocation scenarios. The proposed approach in this research
lies in the integration of a learning-based strategy to approximate unfeasible as the number of tasks/agents grows. However,
the heuristic methods traditionally used for scoring in the CBBA distributed algorithms assume ideal communication conditions
framework. By employing GCNs, the proposed methodology aims and rely on consensus for consistent situational awareness
to learn and predict the score function, which is crucial for (SA). Current state-of-the-art consensus-based task allocation
task allocation decisions in multi-robot systems. This approach algorithms incorporate heuristics into agent score functions in
not only streamlines the allocation process but also potentially
improves the accuracy and efficiency of task distribution among order to optimize a given objective. While extensive research
robots. The paper presents a detailed exploration of how GCNs has been done in the area of multi-agent learning of optimal
can be effectively tailored for this specific application, along policies [8], [9]. Each solution involves trade-offs between
with results demonstrating the advantages of this learning- efficiency, optimality, and robustness [10], [11], [12], [13].
based approach over conventional heuristic methods in various An alternative approach to resolving this dilemma is the
simulated multi-robot task allocation scenarios.
Keywords—Task Allocation, Multirobot System, Distributed implementation of the auction algorithm. In the auction algo-
Algorithms, Graph Convolutional Neural Networks rithm, agents bid on individual tasks, and a central system or
designated agent acts as an auctioneer to select the winning
I. I NTRODUCTION bids. The bundle algorithm simplifies this by having agents bid
The task allocation problem aims to find a globally fea- on groups of tasks, or bundles, rather than single tasks. While
sible allocation of tasks to agents while optimizing one or both methods offer dynamic and potentially efficient task
more objectives. For Multi-Robot Systems (MRS) with varied allocation, they may not be as robust as consensus algorithms
capabilities, two main challenges arise: the high computa- in adapting to changes in communication networks. However,
tional complexity of traversal algorithms and the limitations traditional auction algorithms are generally more computa-
of centralized algorithms, including reduced task range and tionally efficient compared to consensus algorithms, which
single point of failure risk [1], [2]. To address these, heuristic excel in robustness but may lack in speed, especially in large-
algorithms are used as a more efficient, though not always scale systems. The choice between these methods depends
optimal, alternative to traversal algorithms. The effectiveness on the MRS’s specific needs for adaptability, efficiency, and
of a given heuristic is dependent on various factors including communication robustness.
the constraints and parameters of the problem being solved The Consensus-Based Bundle Algorithm (CBBA) [4] dis-
and the objective being optimized [3]. Additionally, distributed cussed in this paper is a hybrid approach for task allocation
algorithms replace centralized ones, enhancing task range in MRS, combining auction-based methods and consensus
and system robustness by distributing decision-making across algorithms. It uses auctions to distribute tasks among robots
agents. Distributed consensus-based algorithms can solve task and employs a consensus mechanism to resolve any conflicts
allocation problems in a cooperative planning process consist- arising from overlapping bids or dependencies. CBBA stands
ing of two phases [4]. In the first phase, an agent constructs out for its efficient convergence, quickly reaching a stable state
a schedule of selected tasks through an internal decision- of task allocation. Additionally, it guarantees at least 50%
making process. This process has previously been referred to optimality in its solutions, when the bidding price has the
as a utility function [5], a score function [6], or an objective diminishing marginal gain (DMG) property [14]. Balancing
function [7]. In the second phase, agents communicate bids speed and efficiency with a reasonable level of accuracy. The
on their selected task allocation and resolve conflicts by CBBA effectively addresses the challenges of distributed task
assigning tasks to the agents with the highest bids. Agents allocation by combining the dynamic nature of auctions with
perform one task at a time, and each agent can be assigned the conflict resolution capabilities of consensus algorithms.
multiple tasks that they execute based on a schedule. Travel On the other hand, Graph Convolutional Networks (GCNs)

979-8-3503-9424-5/24/$31.00 ©2024 IEEE 34

Authorized licensed use limited to: National Univ of Defense Tech. Downloaded on December 15,2024 at 14:06:06 UTC from IEEE Xplore. Restrictions apply.
exhibit exceptional capabilities in their application to large- TABLE I
scale robotic teams. These networks showcase exceptional PARAMETERS D ESCRIPTION
performance and demonstrate an impressive ability to gener- Parameter Description
alize across a wide array of complex tasks [15]. This includes N Number of agents
sophisticated applications like coordinated flocking, advanced M Number of tasks
xij = 1 If agent i is assigned to task j and 0 otherwise
navigation strategies, and precise control mechanisms [16], xi ∈ {0, 1}M Vector with the jth element as xij
[17], [18], [19]. The proficiency of GCNs in seamlessly Ia = {1, · · · , N } Set of agents
adapting to these diverse and challenging tasks underscores It = {1, · · · , M } Set of tasks
ηi ∈ (It ∪ ∅)Ki Ordered set of tasks assigned to agent i
their pivotal role in revolutionizing the capabilities of multi- ∅ Empty set symbolizing no task
robot systems. Sij (xi , ηi ) Score function of assigning task j to agent i
In this study, we introduce an advanced AI-enhanced Ki Maximum number of tasks to agent i
Consensus-Based Bundle Algorithm (AI-CBBA), tailored
specifically for optimizing task allocation for multi-robot sys-
tems. The paper begins by constructing a formal model for the
In the CBBA framework, the task allocation process com-
task allocation problem. This is followed by an exploration
prises two distinct phases. The initial phase is dedicated to
of the consensus-based bundle algorithm, providing a brief
the generation of bids for tasks (Path Planning) by individual
understanding of its mechanisms. Building upon this, the study
agents, while the second phase, known as conflict resolution,
delves into a set of heuristic methods designed for the original
centers on the exchange of information among agents regard-
CBBA, enhancing its efficiency and effectiveness. A pivotal
ing their bids and the provisional allocation of tasks.
component of our approach is the integration of a GCN-based
a) Phase I: Task Planning (Bundle Building): The path
architecture to predict the score function. The paper final-
planning algorithm presented in Algorithm 1 outlines the
izes with a presentation of results and a detailed discussion,
procedure for constructing an optimized task bundle for an
where the performance of our proposed algorithm is compared
individual agent within a multi-agent system. At the beginning
against the existing state-of-the-art solutions, demonstrating its
of each iteration, the agent’s task state is initialized with
potential in managing complex task allocations. The proposed
the current bundle, owned tasks, potential task set, and task
algorithm has been used for Explosive Ordnance Disposal
sequence. The core of the algorithm lies in the while-loop,
mission [20].
which ensures that only the maximum permissible number of
II. L EARNING - BASED D ISTRIBUTED TASK A LLOCATION tasks Ki are considered for bundle construction. Within the
loop, the algorithm performs a sequence of steps to evaluate
A. Problem Formulation
which tasks from the set of unallocated tasks It − ζi (t) should
In this section, we introduce a mathematical formulation be included in the agent’s bundle.
for the task allocation problem aimed at maximizing the total For each of these tasks, a score S̃ij is computed,
score. We propose a binary integer programming model where (
the decision variable xij signifies whether task j is assigned to 0, if j ∈ ζi ,
S̃ij [ζi ] = ηi ⊕n {j} ηi (2)
agent i, and the score function Sij (xi , ηi ) measures the utility maxn≤|ηi | {Θi − Θi }, otherwise.
of such an assignment.
The formulation is bound by constraints ensuring each agent where Θηi i is the total reward minus the cost for the
can undertake no more than a set number of tasks Ki , each sequence ηi , and ”⊕n ”, denotes the operation that inserts task
task may only be allocated to one agent, and the total number j at the n-th position in the sequence ηi .
of tasks assigned does not exceed the number of available tasks The decision to add a task to the bundle is contingent
or the cumulative maximum capacities of the agents. Table I upon the score S̃ij exceeding a dynamic threshold ωij , which
outlines the parameters used in our model. reflects the task’s relative value and competitiveness against
bids from other agents.
N X
X M Following the scoring process, the algorithm identifies the
maximize Sij (xi , ηi )xij task with the highest utility to be included in the agent’s
x
i=1 j=1 path and updates the task sequence and the set of potential
M
X tasks accordingly. Task ownership is reassigned to reflect
subject to xij ≤ Ki , ∀i ∈ Ia , this inclusion, and the associated thresholds are updated,
j=1 which will influence subsequent iterations and allocations. The
N (1)
X algorithm concludes its current iteration when the task bundle
xij ≤ 1, ∀j ∈ It , is solidified, meeting the capacity constraints of the agent. This
i=1 iterative process is executed by each agent in a distributed
N X
M N
X X manner, ensuring the system converges to a consistent global
xij = min{M, Ki },
assignment that maximizes the overall score functions of all
i=1 j=1 i=1
participating agents, thereby optimizing the allocation of tasks
xij ∈ {0, 1}, ∀(i, j) ∈ Ia × It . across the agent network.

35
Authorized licensed use limited to: National Univ of Defense Tech. Downloaded on December 15,2024 at 14:06:06 UTC from IEEE Xplore. Restrictions apply.
Algorithm 1 Path Planning Algorithm for agent-i/iteration (t+ for their unparalleled prowess in feature extraction and rep-
1) resentation learning. Among the array of deep learning archi-
1: Process: Construct Bundle(νi (t), σi (t), ζi (t), ηi (t)) tectures, Convolutional Neural Networks (CNNs) have gar-
2: νi (t + 1) = νi (t) nered widespread acclaim, especially for their performance
3: σi (t + 1) = σi (t) in processing data characterized by a Euclidean or grid-like
4: ζi (t + 1) = ζi (t) topology.
5: ηi (t + 1) = ηi (t) Despite their success, traditional CNNs encounter signif-
6: while |ζi | ≥ Ki do icant challenges when confronted with data embedded in
η ⊕ {j}
7: S̃ij [ζi ] = maxn≤|ηi | Θi i n − Θηi i , j ∈ It − ζi (t) non-Euclidean spaces, such as the intricate webs of social
8: νij = I(S̃ij >ωij ) ∀j ∈ I t
or information networks, where translation invariance is no
9: Ji = argmax S̃ij [ζi ] × νij longer a given. To bridge this gap, Graph Convolutional
j Networks (GCNs) have been introduced as a robust method
η ⊕n {Ji }
10: ni,Ji = argmaxΘi i for navigating the complex terrain of graph-structured data.
n GCNs have revolutionized our ability to tap into the rich
11: ηi = ηi ⊕ni,Ji {Ji }
vein of information contained within non-Euclidean domains,
12: ζi = ζi ⊕end {Ji }
enabling the extraction of salient features that conventional
13: ωiJi (t + 1) = S̃iJi
methods would struggle to discern.
14: σiJi (t + 1) = i
Considering a graph G = (V, E) with V as the set of
15: End Process vertices and E as the edges denoting relationships, graph
convolutions can be processed in either the spatial or spectral
domains. Spatially, convolutions aggregate feature information
b) Phase II: Conflict Resolution Procedure: During con- from a node’s local neighborhood directly, leveraging residual
flict resolution, agents communicate their bid values and the connections for deep memory across layers. Each vertex is
provisional winners for each task. The task is provisionally equipped with its own neural network, and its activation in
awarded to the agent with the highest marginal score for that (k)
the k th layer, denoted by hv , is given by the equation:
task. An agent that has been outbid for a task must relinquish  
the task and any subsequent tasks in its bundle that were X
dependent on it. h(k)
v =σ
W (k) xv + θ(k) h(k−1)
u

This phase operates under the principle of Diminishing u∈N (v)
Marginal Gain (DMG), which posits that the marginal score (k) (k)
for a task, denoted by S̃ij [ηi ], should not increase with the where W and θ are learned parameters for intra- and
addition of tasks to the agent’s bundle. Formally, this is inter-nodal connections, respectively, and σ(·) represents a
expressed as: nonlinear activation function.
For the spectral domain, graph convolutions apply through
the transformation of features into the Fourier space using
S̃ij [ηi ] ≥ S̃ij [ηi ⊕end {j}], (3)
eigendecomposition of the normalized graph Laplacian L =
1 1
where ηi is the current task bundle for agent i, and j is a I − D− 2 AD− 2 = U ΛU T . Here, U contains the eigenvec-
new task being considered. tors, Λ is a diagonal matrix of eigenvalues, and the Fourier
Convergence to a stable task allocation and a guarantee of transformed features are U T x. A filter parameterized by Θ
at least 50% optimality are ensured by the CBBA under the operates on these transformed features, which is expressed as:
DMG condition for the scoring function. Should the scoring
function not naturally fulfill the DMG criterion, a warping gθ′ ⋆ x = U gθ ΛU T x
mechanism is applied. The warping adjusts the score S̃ij [ηi ] where gθ′ denotes the filtered signal. The adjacency matrix,
to:
with self-loops, is denoted by Ã = A + IN , and the layer-
wise propagation in the spectral GCN, which is utilized in
S̃ij [ηi ] = min{S̃ij [η]}, ∀η ⊆ ηi , (4) this research, follows:
which assists in algorithm convergence when the natural 1 1

H (l+1) = σ D̃− 2 ÃD̃− 2 H (l) W (l)
scoring function lacks diminishing returns.
In practical applications, such as multi-robot systems, graph
B. Learning-based Optimization structures capture the complexity of interactions within the
In recent years, learning-based optimization has emerged system and between agents and environments. The agent-
as a frontier in advanced computational methodologies, of- entity graph and task-entity graph encode these interactions.
fering profound insights into complex problem-solving. This Through machine learning methodologies, specifically graph
paradigm shift has been largely propelled by the advent and convolutional networks, we analyze these complex structures.
subsequent dominance of deep learning techniques, known For instance, we encode the position and attributes of tasks in

36
Authorized licensed use limited to: National Univ of Defense Tech. Downloaded on December 15,2024 at 14:06:06 UTC from IEEE Xplore. Restrictions apply.
a vector, apply a GCN to learn meaningful features from these The scoring functions for the heuristics are defined as
relationships, and use the extracted features to understand follows:
the underlying data structure. The proposed distributed task

H = γij − ∆Eij [ζi ]
 1

allocation algorithm depicted in Figure 1 demonstrates the

 H2 = γij
application of spectral GCNs for such feature extraction. γ
H3 = ∆Eijij[ζi ] (6)

 H4 = γij −∆Eij [ζi ]


Ei [ζi ]

where γij denotes the reward associated with task j by agent

i, ∆Eij [ζi ] represents the additional energy expenditure for
incorporating task j into the sequence, and Ei [ζi ] symbolizes
the remaining energy budget of agent i.
To optimally leverage these heuristics, we employ a machine
learning model trained via a neural network to predict the
effectiveness of each heuristic extension. We adopt a Graph
Convolutional Network (GCN) tailored for learning from
graph-structured data. The architecture of the developed GCN
model, as depicted in Figure 2, comprises:
Fig. 1. Distributed Task Allocation
• A tripartite Graph CNN structure with convolutional

In alignment with the CBBA’s foundational principles, we layers yielding 32-, 16-, and 8-dimensional feature maps,
utilize ξi (t) to signify the task sequence within agent i’s bundle designed to distill environment-specific information such
at time t, where ξi represents the ordered set of tasks. Notably, as task connectivity and site distances.
the sequence ξi (t) does not necessarily correspond with the • A mean pooling layer follows to aggregate node features

order of tasks within the bundle ζi (t). The path length, now into a comprehensive graph-level representation.
interpreted as energy consumption, is represented by D[ξi (t)]. • Two dense layers, each with eight neurons, to process the

To adapt CBBA for scenarios with energy constraints, each pooled graph features.
agent commences by initializing their bundle ζi (t) to include • The model culminates in an output layer that provides a

starting point µi and a terminating point νi . For an agent predictive assessment of the aggregate rewards.
i’s current bundle ζi (t), the marginal utility Sij [ζi (t)] of
appending task j is conceptualized as the task’s reward less
the incremental energy cost, now expressed as:
(
γij − ∆Eij [ζi (t)], if ∆Eij [ζi (t)] ≤ Ei [ζi (t)],
Sij [ζi (t)] =
0, otherwise
(5)
Here, Ei [ζi (t)] is the residual energy for agent i post-
traversal of ξi (t), while ∆Eij [ζi (t)] denotes the additional
energy required if task j is to be included in ξi (t).
Fig. 2. Proposed Graph Convolutional Network
In the revised marginal utility equation, if the vertex x is in
close proximity to νi , it is conceivable for ∆Eij [ζi (t)⊕{x}] to By integrating the GCN predictions with our heuristic
be less than or equal to ∆Eij [ζi (t)], potentially contravening framework, we aim to enhance the decision-making process
the DMG principle. To mitigate this and ensure convergence in the allocation of tasks, ensuring an informed and adaptive
when utilizing non-DMG score functions, a warping mech- approach.
anism is introduced, adjusting the score to minξ⊆ζi (t) Sij [ξ],
thereby aiding the convergence process where traditional DMG III. R ESULTS AND DISCUSSION
is not inherently present. Figure 3 presents a series of bar charts comparing the perfor-
The score function’s direct correlation to both reward and mance of the predictive model across four different heuristic
energy consumption raises concerns about its scale invariance. methods: H1, H2, H3, and H4. For each heuristic, Series1
When the mapping of tasks to agents is scaled linearly, the rel- represents the values obtained using the heuristic method,
ative value of the scores, and consequently the task allocation while Series2 represents the predicted values generated by the
decisions, may be altered. In our proposed methodology, we model. The predictions for H1 are closely aligned with the
introduce a suite of heuristic extensions to the CBBA, each heuristic values, indicating a high degree of accuracy for this
characterized by a novel scoring function. The diversity of method. This is particularly evident in instances where the two
these heuristic extensions is tailored to address the varying series produce almost identical bar heights (e.g., at intervals
demands of distinct allocation problems, potentially outper- 1, 4, 6, and 10). This indicates that the model has learned the
forming the application of a single heuristic in all scenarios. pattern for H1 and can replicate its decision-making process

37
Authorized licensed use limited to: National Univ of Defense Tech. Downloaded on December 15,2024 at 14:06:06 UTC from IEEE Xplore. Restrictions apply.
with high fidelity. For H2, the model appears to have greater
variance in its predictive accuracy. While some predictions are
quite close to the heuristic values (e.g., intervals 5 and 8), there
are others where there is a noticeable difference (e.g., intervals
2 and 9). The model demonstrates a similar pattern of accuracy
with the H3 method as with H1, with many of the predictions
being close to the heuristic values. The close correspondence
in intervals 3, 4, and 7 suggests that the model is largely
effective in estimating the H3 heuristic method’s outcomes.
Lastly, H4 shows a mixed pattern where the model accurately
predicts the heuristic values in several intervals (such as 2, 5,
and 7), but also deviates significantly in others (such as 1, 8,
and 10).
Overall, across all four heuristic methods, the model seems
capable of making reasonably accurate predictions and appears
to be a promising tool for replicating the patterns in different
heuristic methods. However, the variations in predictive ac-
curacy across different methods and intervals show that there
are few unique characteristics in each heuristic that the model
is variably capturing. Further analysis would be beneficial to
understand these differences, refine the model accordingly, and
potentially improve its predictive performance.

Fig. 4. AI-enhanced CBBA vs Original (Typical) CBBA vs Real score

might be effective for a smaller number of tasks, its efficiency

Fig. 3. Series1: The Heuristic method, Series2: The prediction made by
diminishes as the task count rises. The ICBA presents an
the model improvement over CBBA, as evidenced by the lower trajectory
of its curve. Prim’s algorithm, traditionally used for finding a
The two graphs presented in Figure 4 offer a visualization minimum spanning tree and here adapted for task allocation,
of the performance of AI-enhanced CBBA and Typical CBBA demonstrates a performance that initially parallels the ICBA
(Original) in comparison to the real score values over a series but eventually outperforms it as the number of tasks becomes
of tasks or evaluations.It is observed that both AI-CBBA larger. This indicates Prim’s effectiveness in creating more
and Typical CBBA closely track the real score values.The efficient path plans over larger task sets, by leveraging its
AI-enhanced CBBA appears to follow the trend of the real inherent nature of connecting points in a graph minimally.
score more tightly than the Typical CBBA, showing that Lastly, the AI-CBBA shows the best performance among
the integration of AI methods within the CBBA framework all the algorithms, maintaining the lowest average distance
enhances its ability to mirror the actual score outcomes. across the task spectrum. Its curve shows that the incorpo-
In Figure 5, we present a performance comparison of ration of AI techniques into the standard CBBA framework
CBBA, ICBA, Prim’s algorithm, and AI-CBBA relative to the significantly optimizes the allocation and sequencing of tasks.
number of tasks assigned, with the average distance metric This optimization stems from the AI’s capability to learn
as the evaluation criterion. As the number of tasks increases from the environment and the ability to predict more efficient
from 0 to 50, all algorithms exhibit an increasing trend in the allocations or paths.
average distance, which is intuitive since more tasks typically The data presented in Table II offers insightful revelations
translate to greater traversal distances for agents. The CBBA concerning the time efficiency of the proposed approach,
shows the steepest increase in distance, indicating that while it evaluated under varying operational scales characterized by the

38
Authorized licensed use limited to: National Univ of Defense Tech. Downloaded on December 15,2024 at 14:06:06 UTC from IEEE Xplore. Restrictions apply.
R EFERENCES
[1] M. Campion, P. Ranganathan, and S. Faruque, “Uav swarm communica-
tion and control architectures: a review,” Journal of Unmanned Vehicle
Systems, vol. 7, no. 2, pp. 93–106, 2018.
[2] Y. Zhou, B. Rao, and W. Wang, “Uav swarm intelligence: Recent
advances and future trends,” Ieee Access, vol. 8, pp. 183 856–183 878,
2020.
[3] P. E. Hart, N. J. Nilsson, and B. Raphael, “A formal basis for the heuristic
determination of minimum cost paths,” IEEE transactions on Systems
Science and Cybernetics, vol. 4, no. 2, pp. 100–107, 1968.
[4] H.-L. Choi, L. Brunet, and J. P. How, “Consensus-based decentralized
auctions for robust task allocation,” IEEE transactions on robotics,
vol. 25, no. 4, pp. 912–926, 2009.
[5] G. A. Korsah, A. Stentz, and M. B. Dias, “A comprehensive taxonomy
for multi-robot task allocation,” The International Journal of Robotics
Research, vol. 32, no. 12, pp. 1495–1512, 2013.
[6] L. Johnson, H.-L. Choi, S. Ponda, and J. P. How, “Allowing non-
submodular score functions in distributed task allocation,” in 2012 IEEE
51st IEEE Conference on Decision and Control (CDC). IEEE, 2012,
pp. 4702–4708.
[7] L. B. Johnson, H.-L. Choi, and J. P. How, “The role of information
Fig. 5. Average Distance vs Tasks assumptions in decentralized task allocation: A tutorial,” IEEE Control
Systems Magazine, vol. 36, no. 4, pp. 45–58, 2016.
[8] L. Busoniu, R. Babuska, and B. De Schutter, “A comprehensive survey
of multiagent reinforcement learning,” IEEE Transactions on Systems,
number of tasks and the agents. The first column, representing Man, and Cybernetics, Part C (Applications and Reviews), vol. 38, no. 2,
the minimum time, consistently increases with the number pp. 156–172, 2008.
[9] L. Panait and S. Luke, “Cooperative multi-agent learning: The state of
of agents. The average time, denoted in the second column, the art,” Autonomous agents and multi-agent systems, vol. 11, pp. 387–
similarly ascends with the number of agents. Maximal time 434, 2005.
in the third column escalates as well with agent count. The [10] M. Barer, G. Sharon, R. Stern, and A. Felner, “Suboptimal variants of
the conflict-based search algorithm for the multi-agent pathfinding prob-
standard deviation of time, illustrated in the fourth column, lem,” in Proceedings of the International Symposium on Combinatorial
indicates a growing variability in the time to complete tasks. Search, vol. 5, no. 1, 2014, pp. 19–27.
[11] J. P. Van Den Berg and M. H. Overmars, “Prioritized motion planning
for multiple robots,” in 2005 IEEE/RSJ International Conference on
TABLE II Intelligent Robots and Systems. IEEE, 2005, pp. 430–435.
T IME R ESULTS S UMMARY IN [ S ] [12] T. Standley and R. Korf, “Complete algorithms for cooperative pathfind-
ing problems,” in IJCAI. Citeseer, 2011, pp. 668–673.
Number of Agents Min Time Avg Time Max Time Std Dev [13] W. Wu, S. Bhattacharya, and A. Prorok, “Multi-robot path deconfliction
2 0.0488 0.5393 0.9020 0.2682 through prioritization by path prospects,” in 2020 IEEE international
4 0.0917 1.0006 1.5168 0.4583 conference on robotics and automation (ICRA). IEEE, 2020, pp. 9809–
6 0.1372 1.4613 2.1640 0.6599 9815.
8 0.2154 1.9308 2.8104 0.8517 [14] C. Luo, Q. Huang, F. Kong, S. Khan, and Q. Qiu, “Applying machine
10 0.2462 2.3975 3.4392 1.0542 learning in designing distributed auction for multi-agent task allocation
with budget constraints,” in 2021 20th International Conference on
Advanced Robotics (ICAR). IEEE, 2021, pp. 356–363.
IV. C ONCLUSION [15] J. Blumenkamp, S. Morad, J. Gielis, Q. Li, and A. Prorok, “A framework
for real-world multi-robot systems running decentralized gnn-based
In this study, we successfully integrated Graph Convolu- policies,” in 2022 International Conference on Robotics and Automation
tional Networks (GCNs) into the Consensus-Based Bundle (ICRA). IEEE, 2022, pp. 8772–8778.
[16] A. Khan, E. Tolstaya, A. Ribeiro, and V. Kumar, “Graph policy gradients
Algorithm (CBBA) to enhance task allocation in multi-robot for large scale robot control,” in Conference on robot learning. PMLR,
systems, creating an AI-enhanced version (AI-CBBA). This 2020, pp. 823–834.
integration marks a shift from traditional heuristic methods [17] E. Tolstaya, F. Gama, J. Paulos, G. Pappas, V. Kumar, and A. Ribeiro,
“Learning decentralized controllers for robot swarms with graph neural
to a learning-based approach. AI-CBBA outperforms existing networks,” in Conference on robot learning. PMLR, 2020, pp. 671–682.
algorithms like original CBBA, Improved CBBA (ICBA), and [18] Q. Li, F. Gama, A. Ribeiro, and A. Prorok, “Graph neural networks
Prim’s algorithm in task allocation efficiency. It excels in for decentralized multi-robot path planning,” in 2020 IEEE/RSJ Inter-
national Conference on Intelligent Robots and Systems (IROS). IEEE,
managing complex task loads, demonstrating AI’s capability to 2020, pp. 11 785–11 792.
learn and optimize both task allocation and sequencing. Our [19] R. Kortvelesy and A. Prorok, “Modgnn: Expert policy approximation in
findings indicate that AI-CBBA could significantly advance multi-agent systems with a modular graph neural network architecture,”
in 2021 IEEE International Conference on Robotics and Automation
multi-robot system coordination, promising improvements for (ICRA). IEEE, 2021, pp. 9161–9167.
complex operations across various domains. [20] E. Ghisoni, S. Govindaraj, A. M. C. Faulı́, G. De Cubber, F. Polisano,
N. Aouf, D. Rondao, Z. Chekakta, and B. de Waard, “Multi-agent
ACKNOWLEDGMENT system and ai for explosive ordnance disposal,” CEIA HUMANITARIAN
CLEARANCE TEAMWORK, p. 26.
The research presented in this paper was financed by the
European Commission and managed by the European Defense
Agency in the framework of the Preparatory Action on De-
fense Research under Grant Agreement 884866 (AIDED).

39
Authorized licensed use limited to: National Univ of Defense Tech. Downloaded on December 15,2024 at 14:06:06 UTC from IEEE Xplore. Restrictions apply.

Real-Time Taxi Spatial Anomaly Detection Based On Vehicle Trajectory Prediction
No ratings yet
Real-Time Taxi Spatial Anomaly Detection Based On Vehicle Trajectory Prediction
12 pages
2011 - P - IROS - A Fast Distributed Auction and Consensus Process Using Parallel Task Allocation and Execution
No ratings yet
2011 - P - IROS - A Fast Distributed Auction and Consensus Process Using Parallel Task Allocation and Execution
6 pages
Heterogeneous Multi-Agent Task Allocation Based On Graph-Based Convolutional Assignment Neural Network
No ratings yet
Heterogeneous Multi-Agent Task Allocation Based On Graph-Based Convolutional Assignment Neural Network
19 pages
PSO-based Distributed Algorithm For Dynamic Task Allocation in A Robotic Swarm
No ratings yet
PSO-based Distributed Algorithm For Dynamic Task Allocation in A Robotic Swarm
10 pages
6 2015 A Heuristic Distributed Task Allocation Method For Multivehicle Multitask Problems and Its Application PK
No ratings yet
6 2015 A Heuristic Distributed Task Allocation Method For Multivehicle Multitask Problems and Its Application PK
14 pages
Bi CL
No ratings yet
Bi CL
6 pages
Ictai09 Finalversion
No ratings yet
Ictai09 Finalversion
8 pages
Motion Planning
No ratings yet
Motion Planning
48 pages
34852-Article Text-38919-1-2-20250410
No ratings yet
34852-Article Text-38919-1-2-20250410
9 pages
FA-QABC-MRTA: A Solution For Solving The Multi-Robot Task Allocation Problem
No ratings yet
FA-QABC-MRTA: A Solution For Solving The Multi-Robot Task Allocation Problem
12 pages
FreeTutorials Us
No ratings yet
FreeTutorials Us
15 pages
Test 2 28
No ratings yet
Test 2 28
17 pages
A Survey On Task Allocation and Scheduling in Robotic Network Systems
No ratings yet
A Survey On Task Allocation and Scheduling in Robotic Network Systems
43 pages
Busqueda Con Robot
No ratings yet
Busqueda Con Robot
11 pages
Adaptive Usvs Swarm Optimization For Target Tracking in Dynamic Environments
No ratings yet
Adaptive Usvs Swarm Optimization For Target Tracking in Dynamic Environments
9 pages
Federated and Transfer Learning: Roozbeh Razavi-Far Boyu Wang Matthew E. Taylor Qiang Yang
No ratings yet
Federated and Transfer Learning: Roozbeh Razavi-Far Boyu Wang Matthew E. Taylor Qiang Yang
371 pages
Bischoff Et Al - 2020 - Multi-Robot
No ratings yet
Bischoff Et Al - 2020 - Multi-Robot
8 pages
Maximizing Distributed Task Allocation Using Cost Reduction Based Task Reassignment in Multi Robot System
No ratings yet
Maximizing Distributed Task Allocation Using Cost Reduction Based Task Reassignment in Multi Robot System
20 pages
Learning NP-Hard Multi-Agent Assignment Planning Using GNN: Inference On A Random Graph and Provable Auction-Fitted Q-Learning
No ratings yet
Learning NP-Hard Multi-Agent Assignment Planning Using GNN: Inference On A Random Graph and Provable Auction-Fitted Q-Learning
12 pages
Distributed Optimization Methods For Multi-Robot Systems Part 2 Survey
No ratings yet
Distributed Optimization Methods For Multi-Robot Systems Part 2 Survey
16 pages
Robotics 04 00316 v2
No ratings yet
Robotics 04 00316 v2
25 pages
Meta Federated Reinforcement Learning For Distributed Resource Allocation
No ratings yet
Meta Federated Reinforcement Learning For Distributed Resource Allocation
11 pages
Learning Scalable Policies Over Graphs For Multi-Robot Task Allocation Using Capsule Attention Networks
No ratings yet
Learning Scalable Policies Over Graphs For Multi-Robot Task Allocation Using Capsule Attention Networks
8 pages
Multi-Robot Systems As Computer Clusters
No ratings yet
Multi-Robot Systems As Computer Clusters
14 pages
A Distributed Task Allocation Algorithm For A Multi-Robot System in Healthcare Facilities
No ratings yet
A Distributed Task Allocation Algorithm For A Multi-Robot System in Healthcare Facilities
26 pages
What Are The Theoretical Foundations of The GWO-PS
No ratings yet
What Are The Theoretical Foundations of The GWO-PS
4 pages
Distributed Channel Allocation For Mobile 6G Subnetworks Via Multi-Agent Deep Q-Learning
No ratings yet
Distributed Channel Allocation For Mobile 6G Subnetworks Via Multi-Agent Deep Q-Learning
6 pages
Particle Swarm
No ratings yet
Particle Swarm
7 pages
Multiprocessor Scheduling Using Particle Swarm Opt
No ratings yet
Multiprocessor Scheduling Using Particle Swarm Opt
15 pages
Task Petri Nets For Agent Based Computing
No ratings yet
Task Petri Nets For Agent Based Computing
12 pages
09-1171A-C Developer Guide Problem Solving Handbook
No ratings yet
09-1171A-C Developer Guide Problem Solving Handbook
112 pages
(BASE) A Scheduling Method For Multi Robot Assembly of A 2021 Robotics and Computer
No ratings yet
(BASE) A Scheduling Method For Multi Robot Assembly of A 2021 Robotics and Computer
10 pages
Multi-Robot Task Allocation in Uncertain Environments
No ratings yet
Multi-Robot Task Allocation in Uncertain Environments
9 pages
Distributed Machine Learning For Multiuser Mobile Edge Computing Systems
No ratings yet
Distributed Machine Learning For Multiuser Mobile Edge Computing Systems
14 pages
Cooperative Distributed Robust Trajectory Optimization Using Receding Horizon Milp, TCST.2010.2045501
No ratings yet
Cooperative Distributed Robust Trajectory Optimization Using Receding Horizon Milp, TCST.2010.2045501
9 pages
Nia, Si
No ratings yet
Nia, Si
70 pages
Combinatorial Optimization and Reasoning With Graph Neural Networks
No ratings yet
Combinatorial Optimization and Reasoning With Graph Neural Networks
58 pages
Combinatorial Optimization and Reasoning With Graph Neural Networks
No ratings yet
Combinatorial Optimization and Reasoning With Graph Neural Networks
61 pages
A Coalition Formation From Software Agents To Robot Dectection
No ratings yet
A Coalition Formation From Software Agents To Robot Dectection
34 pages
Graph Coloring by Multiagent Fusion Search: Journal of Combinatorial Optimization, 2009, 18 (2) : 99-123
No ratings yet
Graph Coloring by Multiagent Fusion Search: Journal of Combinatorial Optimization, 2009, 18 (2) : 99-123
23 pages
Lesson 3 - Part 1 Uninformed - Searching Algorithms
No ratings yet
Lesson 3 - Part 1 Uninformed - Searching Algorithms
56 pages
DIS 11-12 W06 Lecture PDF
No ratings yet
DIS 11-12 W06 Lecture PDF
90 pages
Robust Deep Learning For Wireless Network Optimization
No ratings yet
Robust Deep Learning For Wireless Network Optimization
7 pages
Distributed Optimization Methods For Multi-Robot Systems Part I Tutorial
No ratings yet
Distributed Optimization Methods For Multi-Robot Systems Part I Tutorial
17 pages
Hierarchical Distributed Control For Global Network Integrity Preservation in Multirobot Systems
No ratings yet
Hierarchical Distributed Control For Global Network Integrity Preservation in Multirobot Systems
14 pages
Distributed Subgradient Methods For Multi-Agent Optimization
No ratings yet
Distributed Subgradient Methods For Multi-Agent Optimization
28 pages
Letter: A Novel Competition-Based Coordination Model With Dynamic Feedback For Multi-Robot Systems
No ratings yet
Letter: A Novel Competition-Based Coordination Model With Dynamic Feedback For Multi-Robot Systems
3 pages
IJETR041324
No ratings yet
IJETR041324
5 pages
Research Proposal
No ratings yet
Research Proposal
4 pages
16945-Article Text-20439-1-2-20210518
No ratings yet
16945-Article Text-20439-1-2-20210518
9 pages
Multi-Robot Coordination Using Embedded Controller
No ratings yet
Multi-Robot Coordination Using Embedded Controller
7 pages
A Dynamic AI-based Algorithm Selection For Virtual Network Embedding
No ratings yet
A Dynamic AI-based Algorithm Selection For Virtual Network Embedding
17 pages
NeurIPS 2019 Exact Combinatorial Optimization With Graph Convolutional Neural Networks Paper
No ratings yet
NeurIPS 2019 Exact Combinatorial Optimization With Graph Convolutional Neural Networks Paper
13 pages
Applsci 12 11116 With Cover
No ratings yet
Applsci 12 11116 With Cover
5 pages
Consensus and Cooperation in Networked Multi-Agent Systems
No ratings yet
Consensus and Cooperation in Networked Multi-Agent Systems
19 pages
A Multi-Robot-Based Architecture and A Trust Model
No ratings yet
A Multi-Robot-Based Architecture and A Trust Model
21 pages
RL Chap 3
No ratings yet
RL Chap 3
15 pages
Combinatorial Optimization and Reasoning With Graph Neural Networks
No ratings yet
Combinatorial Optimization and Reasoning With Graph Neural Networks
8 pages
Edge-Weighted Consensus-Based Formation Control Wi
No ratings yet
Edge-Weighted Consensus-Based Formation Control Wi
19 pages
Cloud Brokering
From Everand
Cloud Brokering
Felipe Díaz-Sánchez
No ratings yet
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
From Everand
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
Fouad Sabry
No ratings yet
Lecture 1 - Intro
No ratings yet
Lecture 1 - Intro
31 pages
Neural Networks and Deep Learning Notes
No ratings yet
Neural Networks and Deep Learning Notes
88 pages
Rishti ALI: Indian Institute of Technology, Kanpur
No ratings yet
Rishti ALI: Indian Institute of Technology, Kanpur
3 pages
2023 AUSSEF Analysing Flood Hazards in Queensland Using Deep Learning Neutral Networks
No ratings yet
2023 AUSSEF Analysing Flood Hazards in Queensland Using Deep Learning Neutral Networks
28 pages
Immediate Download Comprehensible Science ICCS 2021 Lecture Notes in Networks and Systems 315 Tatiana Antipova Ebooks 2024
No ratings yet
Immediate Download Comprehensible Science ICCS 2021 Lecture Notes in Networks and Systems 315 Tatiana Antipova Ebooks 2024
35 pages
DL Practical 3
No ratings yet
DL Practical 3
5 pages
ISE-2 5 DL Marks New Imp
No ratings yet
ISE-2 5 DL Marks New Imp
17 pages
VB Final AI ML Resume
No ratings yet
VB Final AI ML Resume
1 page
NMITCON 2024 Submitted 23 24 B27 Lip Reading Using Deep Learning Project Paper
No ratings yet
NMITCON 2024 Submitted 23 24 B27 Lip Reading Using Deep Learning Project Paper
8 pages
IITD CPADSAI F5904ed02d
No ratings yet
IITD CPADSAI F5904ed02d
20 pages
Violence Detection
No ratings yet
Violence Detection
8 pages
Abebeand Nardos Vol 35 Art 3
No ratings yet
Abebeand Nardos Vol 35 Art 3
13 pages
Unit 2
No ratings yet
Unit 2
34 pages
Isb Lai-B16 Brochure
No ratings yet
Isb Lai-B16 Brochure
27 pages
NNDL QB Part B
No ratings yet
NNDL QB Part B
7 pages
Main Project-PPT Script
No ratings yet
Main Project-PPT Script
13 pages
Ad3511-Deep Learning-Lab Manual
No ratings yet
Ad3511-Deep Learning-Lab Manual
53 pages
Multilayer Neural Network
No ratings yet
Multilayer Neural Network
27 pages
Dilip Final Report
No ratings yet
Dilip Final Report
46 pages
Deep Fake Detection Document
No ratings yet
Deep Fake Detection Document
10 pages
Lab Manual R20A6610 Deep Learning Year-IV Semester-I
No ratings yet
Lab Manual R20A6610 Deep Learning Year-IV Semester-I
68 pages
1 s2.0 S2405918822000174 Main
No ratings yet
1 s2.0 S2405918822000174 Main
22 pages
CSCE689 DRL Project Report
No ratings yet
CSCE689 DRL Project Report
7 pages
BatteryML Paper
No ratings yet
BatteryML Paper
22 pages
BTP Report Final 2
No ratings yet
BTP Report Final 2
24 pages
AI-3 Nep
No ratings yet
AI-3 Nep
25 pages
Soft Computing Unit - 3
No ratings yet
Soft Computing Unit - 3
34 pages
Machine Learning With Unstructured Data
No ratings yet
Machine Learning With Unstructured Data
25 pages
Books - Sheet1
No ratings yet
Books - Sheet1
2 pages

Towards Learning-Based Distributed Task Allocation Approach For Multi-Robot System

Uploaded by

Towards Learning-Based Distributed Task Allocation Approach For Multi-Robot System

Uploaded by

2024 10th International Conference on Automation, Robotics, and Applications

Towards Learning-based Distributed Task Allocation

Emails: [email protected], [email protected], [email protected],

979-8-3503-9424-5/24/$31.00 ©2024 IEEE 34

where γij denotes the reward associated with task j by agent

Fig. 4. AI-enhanced CBBA vs Original (Typical) CBBA vs Real score

might be effective for a smaller number of tasks, its efficiency

You might also like