SSRN 4916874
SSRN 4916874
ed
Towards Autonomous Airport Surface Movement Operations
using Hierarchical Multi-Agent Planning
iew
Malte von der Burg [email protected]
Jorick Kamphof
Joost Soomers
Alexei Sharpanskykh [email protected]
TU Delft, Faculty of Aerospace Engineering
Kluyverweg 1, 2629 HS Delft, The Netherlands
ev
Abstract
Coordinating the movements of aircraft along the surface of busy airports is a complex
r
task involving both humans and machines. To reduce the aircraft emissions, new engine-
off taxiing techniques are available. The most promising one uses tugs to tow aircraft
to and from the runways. As this adds additional vehicles to the traffic, the operational
er
complexity for Air Traffic Controllers increases. Especially in congested traffic situations,
this may overwhelm their cognitive capacity. Artificial intelligence systems could provide
decision-support, and take over repetitive tasks of human operators, such as air traffic
controllers. This paper explores how a hierarchical multi-agent system model can facilitate
pe
both safe and efficient airport surface movement operations in a fully-automated setting. To
this end, we combine and extend state-of-the-art multi-agent motion planning algorithms,
yielding the Multi-Agent Motion Planning on Airport Surfaces (AS-MAMP) algorithm: it
builds upon Priority-Based Search (PBS) and its greedy variant GPBS with the new Safe
Interval Motion Planning (SIMP) algorithm as its low-level solver. We include the different
processes during aircraft taxiing such as pushback, tug coupling, and decoupling into path
planning using a novel activity-based search. Motions of aircraft and ground vehicles are
ot
based on finite acceleration rates, and are calculated in continuous space and time. SIMP
detects and resolves conflicts taking the two-dimensional, circular shapes of vehicles into
account. This ensures collision-free routing also for the confined spaces of the taxiway
tn
systems of large airports. By evaluating four cases based on the potential bottlenecks in
taxiway networks, we confirm that our system design yields safe and efficient operations.
Using sensitivity analyses, we show that both the chosen search-mode in PBS as well as
changes to the acceleration of agents have the strongest influence on the taxi time and
taxi distance as key performance indicators. Moreover, given our simulation conditions, we
show that the multi-agent system model can handle runway throughput levels that match
rin
1. Introduction
The air transport sector faces three connected challenges. First, the demand is predicted to
increase to around 10 billion annual passengers by 2050 (IATA, 2021), more than doubling
the amount of 2019. Second, the sector strives to achieve net-zero emissions by the same
Pr
time (IATA, 2021) to meet the Paris Agreement (UNFCCC, 2015). As third challenge,
the level of safety must be sustained or rather increased. Zooming in on airports as the
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
key points in the air transport network, recent incidents during taxiing and at runways
(SKYbrary, 2024a, 2024b) underscore the importance of safety of the operations.
The safe and efficient planning and execution of all ground movements is referred to
as airport surface movement operations (ASM Ops), and is a challenging task especially
iew
at large airports involving both humans and machines (Morris et al., 2016). Air Traffic
Control Officers (ATCOs) provide clearances and guidance for all vehicles moving along the
taxiway network. Once an arriving flight has touched down and vacated the runway, its
pilot follows the route instructed by ATCOs through the airport’s taxiway system towards
the bay area and its assigned stand. In some cases, the stand is still occupied by another
aircraft, so that the arriving aircraft must wait e.g. at a remote holding platform. After
ev
reaching the stand, the turn-around processes commence, some of which are to unload the
baggage, refuel, clean and let new passengers board the aircraft. Once ready for departure,
the aircraft is pushed back from the stand, the pilots start the engines, and taxi along the
taxiways towards a designated runway. They enter the runway and take off once cleared
to do so by a tower controller. Besides aircraft, ground vehicles such as aircraft towing
r
vehicles that use or cross active taxiways must be integrated into the flow as well. While
the aircraft pilots and ground vehicle drivers must ensure the safety of their vehicle at all
er
times, they rely on the overall coordination by and communication through ATCOs.
For ASM Ops, the three challenges of meeting the predicted demand, reducing emissions,
and improving safety are highly interrelated. To address the demand, either bigger or
more aircraft are needed, and airports must facilitate this growth with additional capacity.
pe
However, it is expected that infrastructural expansions of airports are insufficient to achieve
higher airport capacity (Eurocontrol, 2018). Thus, the congestion on the airport surface
will increase. This, in turn, will amplify the workload of ATCOs (Chua et al., 2017)
which increases the risk of human errors and may induce additional delays and congestion.
Furthermore, the taxi times of aircraft, i.e. the time they spent taxiing over the airport
ot
surface, become harder to predict. In addition to having a direct influence on the respective
flights, this may trigger knock-on effects on the entire network, resulting in even higher
inefficiencies (Eurocontrol, 2021).
Regarding emissions, the taxi times can reach a significant share of the total flight time
tn
especially for short-haul flights, with combined taxi-in and taxi-out times reaching up to
30 min for European flights (Eurocontrol, 2023). Furthermore, an analysis using a fuel-
burn model showed that between 2% (long-haul flight, 9630km) and 17% (short-haul flight,
350km) of total fuel is burnt during taxiing (Rowland, 2022). The corresponding emissions
rin
can be effectively reduced when the aircraft do not have to use their engines for taxiing.
In recent years, various new engine-off techniques have been explored and potentially yield
significant decreases in carbon and noxious emissions (Lukic et al., 2019; Zoutendijk et al.,
2023). On the other hand, solutions such as the TaxiBot concept from IAI (2013), which
can tow aircraft to runways and back, add additional vehicles to the traffic, again amplifying
ep
the issues of airport capacity, ATCO workload, and increased probability for human errors.
To address these interrelated challenges, both technological and operational advances are
urgently needed that make the operations safer, more efficient, predictable, and minimize
the associated emissions. The Airport Collaborative Decision Making (A-CDM) concept
Pr
from Eurocontrol (2016) has been implemented at various airports yielding more informed,
predictable, and efficient decision making. Furthermore, multiple airports are working on
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
further implementing the services of the Advanced Surface Movement Guidance and Control
System (A-SMGCS), offering improved safety nets and situation awareness for ATCOs
(ICAO, 2004; Lane et al., 2020). One possible direction is to further increase the level
of automation of the operations. Artificial intelligence systems could provide extensive
iew
decision-support for ATCOs, and even take over certain repetitive tasks. To this end,
both NASA and DLR have developed future concepts of operations for air transport in
the US and in Europe, respectively (Hooey et al., 2014; Okuniek et al., 2016). In both
concepts, the surface trajectory-based operations (STBO) use four-dimensional trajectories
(4DTs) along the taxiing route of aircraft. Each concept was evaluated using human-in-
the-loop experiments. In a joint effort, NASA and DLR have also developed a shared
ev
concept of operations and evaluated it using fast-time simulations (Okuniek et al., 2018).
Moreover, previous research assessed the achievable positioning tolerances of pilots and
auto-pilot systems when following the 4DTs, and revealed that trajectory errors below 22 m
are feasible to achieve, with mean errors even below 7 m for (partially) automated aircraft
taxiing (Bernatzky, 2019).
r
However, high levels of automation are deemed to be reachable only in the far-term
(Hooey et al., 2014). Indeed, such human-automation teaming does not yet exist in real-
er
world applications since today’s technology is not sophisticated enough (Rieth & Hagemann,
2022). To gain the trust of ATCOs, the artificial side must be proven to be capable of han-
dling tasks autonomously in a safe and efficient way. This necessitates a control system
model that realistically incorporates the interrelated processes of ASM Ops as well as the
pe
new engine-off taxiing techniques to compute conflict-free yet environmentally friendly tra-
jectories also in congested traffic situations. To this end, both the various processes as
well as the heterogeneous fleet of differently sized aircraft and ground vehicles must be
represented in detail. Nonetheless, in previous work, such models used multiple simplifying
assumptions to handle the inherent complexity. For instance, the fast-time simulations to
ot
test the shared concept of NASA and DLR relied on fixed taxiing routes within only one bay
area of the studied airport layout (Okuniek et al., 2018). As part of an exploratory control
system model to test new operational concepts such as engine-off taxiing or the effect of
existing ATC procedures, a powerful and flexible path planning algorithm is missing that
tn
as randomness can be explicitly modelled with an agent-based model (Helbing & Balietti,
2015). Furthermore, multi-agent simulations allow for inherent modularity, flexibility, and
expressiveness that are beneficial when considering a detailed model. We embedded the
newly developed algorithm for multi-agent motion planning on airport surfaces (AS-MAMP)
into the MAS model. To yield fast solutions, we deploy a two-level, sub-optimal solver. As
ep
its high-level search, multi-agent coordination is based on the Priority-Based Search (PBS)
algorithm (Ma, Harabor, et al., 2019) and its greedy variant GPBS (Chan et al., 2023).
Trajectories that satisfy the constraints from the high-level are calculated with the novel
Safe Interval Motion Planning (SIMP) algorithm, which is one of the main contributions
Pr
of this work. It is inspired by the Safe Interval Path Planning (SIPP) algorithm (Phillips
& Likhachev, 2011) and its variants. SIMP further extends the capabilities of SIPP-based
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
low-level planners to deal with all important characteristics of ASM Ops such as the agent
kinodynamics and sizes, safety zones around agents, and the different operations like push-
back, engine-start, tug coupling, and decoupling.
Based on the MAS concept, we developed a simulator in Python that implements the
iew
AS-MAMP algorithm. We already used it in two previous papers (von der Burg & Sharpan-
skykh, 2023, 2024). Their focus was on the application and evaluation of operational con-
sequences of AS-MAMP-based models for realistic ASM ops in existing airports, and not
on the algorithmic basis underlying these models, which is the main focus of this paper.
In the following, we first provide a description of ASM Ops and define a list of re-
quirements that we deem indispensable to model the operations realistically (Section 2).
ev
Based on these, we outline previous research related to modelling ASM Ops and multi-
agent path planning algorithms in Section 3. We then describe the multi-agent system
model for autonomous ASM Ops (Section 4) including a detailed description of the AS-
MAMP algorithm integrated into the MAS (Section 5). We evaluate the path planning in
r
Section 6 by discussing the influence of important parameters on the AS-MAMP algorithm
using four case studies based on potential bottleneck in airport layouts. In Section 7, we
provide an overview of future explorations of the path planning algorithm and the MAS
er
model. Finally, a summary of the main findings and contributions of this work are outlined
in Section 8.
pe
2. Airport Surface Movement Operations (ASM Ops) and Modelling
Requirements
Currently, aircraft use their own engines to taxi on the airport surface. We refer to this
taxiing mode as multi-engine taxiing (MET) in this paper. For outbound aircraft, the
existing procedures can be split into three parts: 1) the pushback and (possibly) push-pull
ot
it is carried out to avoid engine blast. After landing, arrival flights taxi to the stand, during
which their engines cool down.
In future ASM Ops, engine-off taxiing techniques may be deployed such as the TaxiBot
system (2013), which we refer to as tug-enabled taxiing (TET) in this work. Based on the
rin
TaxiBot specification and interviews with operational experts, outbound TET operations
can be divided into four stages: 1) pushback and direction switch, 2) tug-enabled taxiing to
a decoupling location close to the runway, 3) decoupling and all-clear signal between tug and
aircraft, and 4) taxiing onto the runway. Dependent on the airport regulations, engine-start
can potentially be performed during part 2). Nonetheless, holding at the decoupling location
ep
may be necessary when the duration of pushback, taxiing, and decoupling is shorter than the
time needed to start the engines. For arriving aircraft, TET comprises three relevant stages:
1) exiting the runway and taxiing to a coupling location close to the runway, 2) coupling to
the waiting tug, and 3) tug-enabled taxiing to the ramp. For both flight directions, the tugs
Pr
have to drive to the coupling location, and travel to the next assignment or the tug-base
once decoupled. Airports may provide extra service-roads for the tugs, or require them to
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
use the taxiway network managed by ground control. Eventually, the tugs are expected to
be equipped with electric engines (Zoutendijk et al., 2023). In turn, this requires recharging
over the day of operations, either at dedicated charging stations or at the stand, affecting
the task assignment and route planning of tugs after decoupling.
iew
To enable autonomous operations, a control system must be capable to coordinate the
movements of the mixed fleet of aircraft deploying either taxiing technique as well as the
return movements of tugs. Based on the current ASM Ops and with the focus on path
planning, we identified the following characteristics of ASM Ops. They pose modelling
requirements when developing a comprehensive model to study new operational concepts
of ASM Ops in detail:
ev
(1) Multiple actors: pilots, ground vehicle drivers, and other ground personnel have to
share limited resources, i.e. runways, taxiways, and gate areas. Furthermore, these
actors have to fulfil the general objectives of safe and efficient operations as well
as their individual tasks. Since these can be conflicting with one another and between
r
actors, control and coordination is necessary.
(2) Layout: most airports have a marked network of taxiways creating a two-dimensional
er
layout with confined space to be traversed. For large airports, such as Amsterdam
Airport Schiphol (AAS), the taxiway system is rather complex and contains not only
straight and curved segments, but also compounded intersections, runway crossings
pe
and go-arounds, as well as long single-lane taxiways. While aircraft and other vehicles
can generally stop and wait at arbitrary locations, airports usually have designated
holding points. In most cases, aircraft cannot turn around on a taxiway segment,
meaning that deadlocks must be anticipated as these are time-consuming to resolve.
(3) Online setting: aircraft appear and disappear continuously at the airport boundaries
ot
(4) Interrelated operations: at most airports, multiple routes between a runway and
bay area exist, and various processes take place in parallel (Stergianos et al., 2016).
tn
way mode of operation (RMO) determining the active runways, safety distances
between vehicles, traffic rules such as standard taxiway directions, or prioritiza-
tion of certain vehicles;
c. time constraints that must either be surpassed or reached in time (i.e. dead-
ep
lines), e.g. an arriving flight whose assigned stand is still occupied, a departing
flight that must take off within a departure slot due to airspace restrictions (i.e.
CTOT-slots assigned by Eurocontrol), or aircraft in need of a tug or towing
vehicle;
Pr
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
(5) Heterogeneous vehicle fleet: besides different aircraft types, the use of different
engine-off taxiing techniques will lead to a heterogeneous fleet of aircraft and addi-
tional vehicles like tugs. Factors include:
iew
a. differing vehicle dimensions between different aircraft types as well as in
relation to tugs, since the geometric shapes may overlap with multiple other
taxiway segments or entire intersections;
b. kinodynamics or non-holonomic properties of the vehicles, accounting for fi-
nite acceleration and deceleration, minimal speed to avoid stop-and-go, speed
restrictions on both straight as well as curved segments, and sufficient safety
ev
distances.
r
(7) Uncertainty in operations: in actual operations, changes to planned actions or
anticipated effects arise that have to be handled accordingly, also during the routing
er
of vehicles (Lee, 2014). For that, examples include:
tently, or in some cases also intentionally, divert from cleared routes or com-
mands from ATCOs;
e. delays, in procedures / actions, communication, and various other forms (Morris
et al., 2016);
rin
f. disruptive events, at least those that are confined, e.g. vehicle breakdowns or
blocked taxiways.
(8) Interaction with human operators: to conduct future research in the direction of
human-automation teaming, the model needs to be flexible enough to eventually deal
ep
In summary, the long-term goal of a model of an autonomous control system for ASM
Ops must be to ensure safe and efficient operations that are as predictable and environmen-
Pr
tally friendly as possible, while handling operational uncertainties as well as inputs from
human operators.
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
3. Related Work
In its current form, ASM Ops are controlled and coordinated within a centralized system by
air traffic control officers (ATCOs) who coordinate the flow of aircraft from a central loca-
iew
tion, i.e. the air traffic control tower. Most previous work modelled ASM Ops as centralized
system, and studied related sub-problems. Other research has focused on developing new
concepts of operations of ASM Ops as a centralized, but also as a distributed system. We
will describe these in the next subsection.
Most previous research used a pre-defined priority order to plan the routes of aircraft
sequentially, usually based on the first-come-first-served principle. However, paths can also
ev
be planned without such stringent order with potentially better solutions. To this end, the
domain of multi-agent path planning (MAPP) provides a multitude of algorithms to solve
both simplified and complex problems. Interestingly, many MAPP-related papers motivate
their work with ASM Ops (e.g. Cohen et al., 2019; Ma et al., 2016; Walker, 2022), while
such algorithms are often neglected in operations-focused research (e.g. Jiang et al., 2021;
r
Yin et al., 2024; Zaninotto et al., 2021; X. Zou et al., 2018). To close this gap, we provide
a comprehensive review of MAPP algorithms and their underlying concepts in Section 3.2.
er
Finally, in Section 3.3, we outline how we address the modelling requirements of ASM
Ops defined in Section 2. Our approach uses a multi-agent system model with integrated
multi-agent motion planning algorithm. We conclude this section by listing the scientific
pe
contributions presented in this paper.
In previous research, specific sub-problems of ASM Ops were studied. The Ground Routing
Problem (GRP) addresses the task of scheduling the aircraft movements on the airport
ot
surface between runways and stands in an efficient and safe manner while respecting the
operational requirements. Atkin et al. (2010) reviewed and compared various mixed-integer
linear programming (MILP) and genetic algorithm (GA) approaches. While these were
used in various models for (multi-)objective optimization to study variations of this routing
tn
and scheduling problem, the authors conclude that more research is needed that takes the
interdependencies of the processes affecting ASM Ops into account. The Runway Schedul-
ing Problem (RSP) addresses the task of maximizing the utilization of the available runway
capacity through scheduling the landing or takeoff sequence at runways. Both Bennell et
rin
al. (2011) and Lieder et al. (2015) reviewed the common solution approaches that used
either mixed-integer programming (MIP), dynamic programming (DP), branch-and-bound
(B&B), or heuristic algorithms. Furthermore, Guépet et al. (2017), provide a review of
studies that fully or partially integrate ground and runway operations at airports, i.e. com-
bining both GRP and RSP. Using a heuristic sequential approach, they show that a better
ep
integration of these two sub-problems reduces the taxi times significantly. Other studies
argued that most inefficiencies of taxiing operations originate from runway congestion, and
that these are effectively reduced through stand holding, i.e. by delaying the time the
aircraft is pushed back from the stand (Balakrishnan & Jung, 2007; Ravizza et al., 2014).
Pr
Guépet et al. (2016) provide a review of deterministic and queuing models that address this
sub-problem of ASM Ops. Furthermore, Ashok et al. (2017) assessed that a pushback con-
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
trol strategy, but also derated takeoffs, i.e. taking off with lower thrust levels, are effective
to reduce the air quality impacts of aircraft operations.
Li, Gong, et al. (2019) argued that the congestion and uncertainties in the air transport
system are the major constraints on the available capacity. They used a fast-time simulation
iew
for anticipating delays through a probabilistic approach in planning and scheduling of ASM
Ops. Yin et al. (2024) reviewed approaches for taxi time prediction and analysis. Different
models are used to evaluate and optimize taxi time predictions under uncertainty, e.g. using
a stochastic model (Lee, 2014), chance-constraint programming (X. Wang et al., 2021),
or a fuzzy rule-based approach (Brownlee et al., 2018). Other studies used data-driven
approaches to predict the landing times of aircraft (Dhief et al., 2020; Z. Wang et al.,
ev
2018), or to visually show the runway exit prediction (Woo et al., 2022).
As another research direction, new concepts of operations (CONOPS) are studied. Mor-
ris et al. (2016) proposed a centralized automation system to handle tower operations. They
used discretized time and a grid-based layout with obstacles to form taxiways of Dallas Fort
r
Worth airport (DFW). Uncertainty during taxiing of aircraft is handled through continuous
scheduling and monitoring. Using a discrete-time Markov chain (DTMC) they were able
to partly transfer uncertainties during plan execution into probabilistic behaviour models.
er
Both NASA and DLR have developed new CONOPS to increase efficiency and predictabil-
ity as well as to reduce the environmental impact of ASM Ops. NASA’s CONOPS for
far-term Surface Trajectory-Based Operations (STBO) envisions that pilots actively partic-
ipate in precisely meeting time-based goals (Hooey et al., 2014). It comprises two phases: in
pe
phase 1, minimal flight deck changes are required for pilots to conform to time-based clear-
ances at specific trajectory points. In phase 2, high precision and conformance is enabled
by providing full 4D trajectories comprising (x,y)-locations at all times t along the taxi
route, which requires advanced flight deck equipment. However, the authors note that this
phase demands further research. They developed decision-support tools for ATCOs to eval-
ot
uate their CONOPS using human-in-the-loop simulations, which showed promising results
(Hayashi et al., 2015; Jung et al., 2011). Furthermore, NASA conducted studies to test how
well pilots are able to conform to 4D taxi clearances when supported with different flight
deck displays providing trajectory information (Bakowski et al., 2015, 2017). DLR imple-
tn
mented both a departure management system (DMAN) and a surface management system
(SMAN) to test the Airport Collaborative Decision Making (A-CDM; Eurocontrol, 2016)
and the Advanced Surface Movement Guidance and Control System (A-SMGCS; ICAO,
2004) concepts with human-in-the-loop simulations (Gerdes & Schaper, 2015). The simula-
rin
tor has a built-in conflict detection and resolution functionality to monitor the conformance
to the 4DT clearances. In a joint effort, NASA and DLR also developed a shared concept of
operations and evaluated it using fast-time simulations (Okuniek et al., 2018). They used
pre-defined routes between each origin-destination pair combined with the first-come-first-
served principle to assign priorities between aircraft. However, both aspects likely lead to
ep
as studied by Udluft (2017). In such a system, communication and control mechanisms are
shifted to multiple local entities. While they result in higher robustness, resilience, and bet-
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
ter solutions for local problems, a global optimum cannot be guaranteed and the available
global resources may not be used efficiently.
iew
Much research has concentrated on the classical multi-agent path finding (MAPF) problem
in which several assumptions are utilized to simplify the search: the environment is defined
as static, both time and space are discretized, agents are depicted as points that only
occupy a single location per timestep, and move between these within unit time (Stern,
2019). Moreover, it is typically assumed that the generated plans are perfectly executed
ev
(Ma et al., 2017). Furthermore, in most MAPF-related research, an offline setting (i.e. pre-
known start and goal locations of all agents) is assumed. Comprehensive overviews of the
MAPF problem and its algorithmic landscape are provided in (Stern & Sturtevant, 2019,
with MAPF definitions, variants, benchmarks), (Felner et al., 2017, with MAPF variants
r
focused on search-based optimal solvers), and (Gao et al., 2024, with classic but also beyond
classic MAPF solvers).
er
Two-level solvers have proven to be the most suitable to solve such problems: they
combine the strengths of different algorithms and yield better solution quality and lower
computational times than other approaches such as reduction-based, rule-based, or other
one-level search-based solvers. Prominent examples of optimal two-level MAPF solvers are
pe
M* (Wagner & Choset, 2011), the Increasing Cost Tree Search (ICTS) (Sharon et al., 2013),
and Conflict-Based Search (CBS) (Sharon et al., 2012). Especially the latter one has found
broad use for classical MAPF instances. Furthermore, to reduce the computational time
needed to run this optimal solver, sub-optimal variants were developed that still return
near-optimal solutions in most cases (Barer et al., 2014; Cohen et al., 2016).
ot
CBS and its variants are based on the ideas of building a binary conflict-tree during the
search, and separating conflict detection and conflict resolution into two different search
levels. In the high-level, conflicts are detected sequentially. For every occurring conflict,
two new child nodes, one per conflicting agent, are added to the search tree including a
tn
new constraint that prohibits one of them of occupying the conflicting state. Then, the
low-level search resolves the conflict by finding a new path for the constrained agent, for
which any single-agent path finding (SAPF) algorithm such as the well-known A* algorithm
may be used. This process is repeated until the solution is conflict-free, usually employing
rin
a best-first search strategy. Barer et al. (2014) proposed the bounded-suboptimal variant
Enhanced CBS (ECBS). It uses a focal search on the low-level of CBS to find paths which
satisfy the high-level constraints, but also minimize the unresolved conflicts with the paths
of other agents. By reducing the number of conflicts that the high-level still has to resolve,
this procedure speeds up the search. Li, Ruml, and Koenig (2021) introduced Explicit
ep
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
slow for applications requiring close to real-time path planning computations, especially
when relaxing the assumptions of classical MAPF.
In contrast to CBS and its variants, Ma et al. introduced the Priority-Based Search
(PBS) algorithm. PBS generates a priority-tree to explore the state space by applying
iew
different priority orderings for the agents. During conflict resolution, the lower prioritized
agent has to avoid all paths of those higher in priority. To speed up the algorithm, a
depth-first search is applied, and an optionally provided initial priority order can further
improve its runtime. While PBS is unbounded suboptimal and incomplete, the authors
claim that it returns near-optimal solutions. For that, they compared PBS to CBS, also on
an airport-like map (DAO-map brc202d), yielding similar solution qualities but a greater
ev
performance of PBS regarding success rate and runtime. PBS was used in several recent
studies (Li et al., 2023; Zheng et al., 2023), and Chan et al. (2023) deem PBS to be the
leading suboptimal yet fast MAPF algorithm. They further improved its runtime especially
for MAPF instances with many agents and obstacles by introducing Greedy PBS (GPBS)
that greedily minimizes the number of conflicts between agents to speed up its execution.
r
In the depth-first search within the priority-tree, the child node with the lowest number of
remaining conflicting agent pairs is expanded next. Furthermore, they introduced several
er
GPBS extensions to further improve its execution speed. They adopt the partial expansion
(PE) technique from EPEA* (Goldenberg et al., 2014) to randomly create only one of
the two child nodes (the second child node is only created when backtracking), and target
reasoning (TR; Li et al., 2020) to first resolve conflicts with an agent waiting at its goal.
pe
Furthermore, besides the new priority order added to a child node of N , they propose to
also include the set of induced constraints (IC) between the existing priorities: when agent
ai is given higher priority than agent aj , ICi≺j (N ) mark the implicit priority relations in N
between all agents higher in priority than ai that must have higher priority than any agents
lower in priority than aj . In GPBS with IC, the conflicting pair with the largest |ICi≺j (N )|
ot
the idea of windowing the search, which is also referred to as bounded-horizon planning in
other articles (Li, Tinka, et al., 2021). Using a windowed approach, conflict resolution is
done only up to a pre-defined time horizon w, while conflicts beyond that limit are ignored.
This decreases the search space and therefore results in a lower runtime of the path finding
rin
algorithm.
In previous research, some of the assumptions of classical MAPF mentioned above were
relaxed. Švancara et al. (2019) explored the online MAPF variant in which new agents may
dynamically appear and disappear. In such an online setting, highest solution quality can be
achieved by re-planning the paths of all existing and newly appearing agents. Despite being
ep
computationally expensive, this may change existing agent plans, which can be undesirable
in certain cases as it requires additional communication and reaction overhead. Walker et al.
(2018) defined the MAPFR problem to solve MAPF instances with continuous, non-uniform
edge weights. Furthermore, they generalized ICTS for non-unit costs called Extended-ICTS
Pr
(E-ICTS) by increasing the target cost in the high-level search with a cost-interval instead
of a unit-cost increment: the low-level search has to find a solution whose cost is within
10
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
the updated cost-range. Besides the optimal version, they also introduced two bounded-
suboptimal variants of E-ICTS.
Li, Surynek, et al. (2019) explored MAPF instances with Large Agents (LA-MAPF), in
which agents occupy not one but a set of points in a graph at any timestep. While this
iew
poses the possibility to introduce safety distances between agents, it also necessitates new
conflict types and increases the amount of possible collisions. Especially for conflict-based
approaches like CBS, this may lead to many additionally created nodes in the constraint
tree increasing its runtime. To mitigate this effect, the authors introduced Multi-Constraint
CBS (MC-CBS) in which closely related conflicts are added as multiple constraints to a
newly generated child node in the high-level search of CBS. Walker et al. (2020) generalized
ev
MC-CBS through the use of bipartite reduction (BR), forming CBS with time-annotated
bicliques (CBS+TAB). They showed that CBS+TAB is complete and optimal, and claim
that it can be used on both discretized and continuous environments. Furthermore, they
introduced two sub-optimal yet complete variants called CBS+TAB with conditional con-
straints (CBS+TCC) and CBS+TAB with conflicting paths (CBS+TCP). Both are based
r
on the idea of additionally using conditional constraints in the low-level solver. When the
low-level cannot find a solution, it is invoked again with deactivated conditional constraints
er
to guarantee completeness while otherwise allowing for improved performance.
Ma et al. (2018) formulated the problem of MAPF with Deadlines (MAPF-DL) in which
the number of agents reaching their goal within a deadline shall be maximized. To this end,
they adapted CBS to deal with deadlines (CBS-DL), introduced Death-Based Search (DBS),
pe
and combined both to form meta-agent DBS (MA-DBS). Like CBS, DBS is a complete and
optimal two-level solver. However, rather than imposing agent constraints on nodes or
edges, DBS labels individual agents as unsuccessful. Using a best-first search, it finds a
conflict-free solution with the smallest set of unsuccessful agents.
Ma et al. (2017) generalized the MAPF problem by taking delay probabilities into
ot
account (MAPF-DP) and provide robust plan execution policies. In comparison to this,
Atzmon et al. (2020) directly considered robustness during planning. In their approach, k-
robust MAPF plans are generated that allow up to k unexpected delays to occur per agent
before being invalid. Semiz and Polat (2021) studied Incremental-MAPF (I-MAPF) to re-
tn
plan when environmental changes occur that temporarily make locations unavailable for
agents. MAPF plans can usually not be directly executed by real-world robots (Yakovlev
et al., 2019). To address this issue, Hoenig et al. (2016) proposed MAPF-POST which
uses a temporal network to post-process MAPF solutions so that non-holonomic robots can
rin
However, from the literature that they reviewed, only the search-based and sampling-based
techniques have been applied for multi-agent cases. Sampling-based approaches are often
11
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
based on the RRT* algorithm and are utilized to find paths and generate roadmaps in open
environments. For quadrotor swarms in three-dimensional open spaces, Hönig et al. (2018)
presented an approach that generates a common roadmap between agents. Their MAPF
solver with Generalized Conflicts (MAPF/C) then annotates the roadmap with conflicts
iew
and yields coordinated, discretized plans for all agents that are then iteratively refined to
yield arbitrarily smooth trajectories. Kottinger et al. (2022) introduced Kinodynamic CBS
(K-CBS) which is a decentralized MRMP algorithm. In the high-level search, K-CBS de-
tects collision time intervals in an existing motion graph generated by any sampling-based
solver. To resolve conflicts, this graph is subsequently refined by sampling an alternative
motion and checking its set of discretized points for validity. They claim that their approach
ev
is general, scalable, and yields probabilistically complete solutions.
However, the taxiway layouts of airports do not resemble open spaces, but rather consist
of sparsely located intersections connected by roads in a continuous, two-dimensional Eu-
clidean environment. Sampling-based techniques are thus not applicable to coordinate the
agents in ASM Ops. Instead, the taxiway network can be transformed into a graph using
r
the physical coordinates of intersections as nodes and the length of the interconnecting taxi-
ways as edge weights, yielding a representation in continuous space. Note that alternative
er
transformations are also feasible as done by Morris et al. (2016), which used a grid-based
layout with obstacles to form the taxiway network of Dallas Fort Worth airport (DFW).
For both, the above mentioned two-level search-based solvers can be applied when cou-
pled with a low-level algorithm that can deal with the continuous dimensions and non-
pe
holonomic agent properties. Phillips and Likhachev (2011) introduced the Safe Interval
Path Planning (SIPP) algorithm, which is in first instance a SAPF solver. SIPP is based on
the idea of summarizing timesteps into time intervals: the duration that a moving obstacle
occupies a graph location is defined as unsafe interval (USI), while the time between two
USIs as safe interval (SI). This representation of the time dimension reduces the amount of
ot
potential states at every graph location and therefore the search space. Besides decreasing
the runtime significantly, this also enables planning with continuous time. In contrast to
A*, SIPP generates successive states based on the set of possible motions from the current
one. The earliest arrival time at the new location is then compared against the safe intervals
tn
of that location. A successor is only generated when the motion is feasible, i.e. fulfils the
underlying kinematic conditions. This makes SIPP well suited to cope with continuous time
and space, arbitrary motion durations, and different agent speeds.
For this reason, SIPP was subsequently applied as low-level solver to solve multi-agent
rin
instances. Andreychuk et al. (2019) introduced and later improved (2021) CCBS as optimal
solver based on CBS and SIPP. Their algorithm is capable of dealing with non-uniform
action durations, continuous time, as well as accounting for the geometric shape of agents
and obstacles. Although the environment is discretized into cells, i.e., expressed as a 2k -
connected grid, they claim that CCBS can work on any graph. Recently, Walker et al. (2024)
ep
chuk, 2017), and later improved to yield optimal solutions called Time-Optimal Any-Angle
SIPP (TO-AA-SIPP; Yakovlev & Andreychuk, 2021). However, the authors note that TO-
12
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
AA-SIPP might be too slow to be applicable to multi-agent planning. To address this issue,
Y. Zou and Borst (2024) recently introduced Zeta*-SIPP that reduces its runtime by up
to 90 % on average. However, these algorithms do not yet account for both kinodynamic
constraints and the size of agents. To this end, Yakovlev et al. (2019) proposed modifica-
iew
tions to AA-SIPP in combination with a prioritized multi-agent planner to generalize the
algorithm to handle different sizes and speeds of agents as well as movements in continuous
time.
Ma, Hönig, et al. (2019) used SIPP with a reservation table (SIPPwRT) to generalize
SIPP in continuous time and large agents moving on 4-neighbour grids. The shape of agents
is expressed by a safety radius and can be increased to add a safety margin around the agent.
ev
Together with the velocities of two agents, but without accounting for acceleration and
deceleration rates, they then compute time offsets to set safe intervals for lower prioritized
agents. The authors claim that SIPPwRT is therefore guaranteed to be collision-free, and
tested their algorithm for multi-agent pickup and delivery tasks in warehouses.
r
Li et al. (2022) generalized SIPP to handle soft obstacles, called SIPP with Soft Con-
straints (SIPPS). Each state S in SIPPS also contains a lower bound estimate of the number
of soft collisions along the path to S. Soft collisions mark the pending conflicts with the
er
plans of other agents that are not yet set as hard constraints. SIPPS thus minimizes the
number of unresolved conflicts when creating a new plan: in the open-list, the state with
the lowest number of soft collisions is expanded next, breaking ties in favour of states with
pe
lowest f-score. The authors used SIPPS in their proposed algorithm MAPF-LNS2 that
carries out a large neighbourhood search (LNS) to gradually minimize the total number of
collisions by repeatedly replanning for a subset of agents. Within a defined runtime limit,
MAPF-LNS2 is thus able to either return a conflict-free solution or one with only a few
unresolved conflicts. Moreover, SIPPS could also be adapted to pick the state with the
lowest f-score and breaking ties based on the lowest number of soft collisions so that it
ot
preserves the properties of e.g. CBS and its variants like EECBS when used as its low-level
solver. In GPBS, Chan et al. (2023) deployed both alternatives of SIPPS.
Ali and Yakovlev (2023) argued that all previous work on SIPP does not account for
tn
the kinodynamic constraints of agents prohibiting them to stop instantaneously. They show
that these agent properties make vanilla SIPP and its variants incomplete. To address this
issue, they proposed SIPP with (Wait) Interval Projection (SIPP-IP) and prove that it is
both complete and optimal. In SIPP-IP, each state S has an associated non-overlapping,
projected safe interval representing the time interval that can be safely reached from the safe
rin
agent properties and behaviour as well as randomness can be integrated into an agent-based
model (Helbing & Balietti, 2015). Furthermore, multi-agent simulations allow for inherent
13
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
modularity, flexibility, and expressiveness that are beneficial when considering a detailed
model. These allow us, in combination with the hierarchical structure of the MAS model, to
address Requirements 1, 7, and 8 from Section 2. Further details are provided in Section 4
that outlines the architecture of the MAS model.
iew
Based on the path planning concepts reviewed in Section 3.2, we embedded a tailored
algorithm for multi-agent motion planning on airport surfaces (AS-MAMP) into the multi-
agent system to satisfy the remaining requirements from Section 2 for modelling ASM Ops
in detail. As outlined in Section 5, we combine PBS with our newly developed Safe-Interval
Motion Planning (SIMP) algorithm. The latter is an augmented version of SIPP that
takes the heterogeneous, non-holonomic agent properties as well as their dimensions into
ev
account (satisfying Requirement 5). It uses motions in continuous time and space along
a graph that represents the taxiway network (Requirement 2). Furthermore, SIMP can
cope with the different interrelated processes (Requirement 4) and has a low computational
time (Requirement 6) given the complexity of the search with kinodynamics and shapes of
agents. To further cut down on runtime and to satisfy Requirement 3, we use a windowed
r
search. This also allows us to deal with uncertainty in the operations (Requirement 7): the
routes can be re-planned if, for example, the schedules change or taxiways are blocked or
er
become unavailable. Moreover, we can guarantee conflict-free plans for all concurrent routes
(Requirement 1) since we explicitly detect and resolve all conflicts between the shapes of two
agents that account for both safety distances and positioning tolerances. We can verify this
property directly through the simulator, in which we assume instantaneous communication
pe
and perfectly executed commands at this point. Time constraints of agents (Requirement
4c) are handled by the Routing Agent as part of the MAS: it assesses whether e.g. holding is
necessary and adjusts the activity sequence representing the taxiing processes of the agents
accordingly.
In summary, the main contributions of this paper are:
ot
• the AS-MAMP algorithm for conflict-free multi-agent motion planning that accounts
tn
for the heterogeneous fleet of agents, their shapes and kinodynamic constraints;
• the SIMP algorithm as low-level solver in PBS that extends the ideas of Ali and
Yakovlev (2023) to deal with kinodynamic agent properties, and that calculates the
rin
• the underlying motion principles based on constant acceleration as well as the treat-
ment of space-time reservations.
lored to ASM Ops, we believe that the presented ideas could be useful for other applications
requiring a detailed representation of agent kinodynamics and shapes. Due to the inherent
complexity of embedding the operational details of such real-world systems in the algo-
rithms, we did not yet implement high-level alternatives like EECBS or MAPF-LNS2, or
Pr
techniques to further improve SIMP such as using the soft constraints from SIPPS. We
leave this for future work, as further outlined in Section 7.
14
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
4. Multi-Agent System Model for Autonomous Airport Surface
Movement Operations
In this section, the conceptual architecture of our hybrid multi-agent system (MAS) for
autonomous airport surface movement operations (ASM Ops) is described. Fig. 1 shows
iew
its control structure, comprising both centralized and distributed agents. The centralized
Airport Ops Agent schedules the flights and tasks of ground vehicles, and determines con-
straints for the taxiway network (i.e., the Airport Layout) originating from the runways in
use or temporarily unavailable taxiway segments. Both schedules and layout constraints
are shared with the centralized Routing Agent. Its task is to compute conflict-free routes
for all vehicles that are moving or will move over the airport surface within the predefined
ev
planning window wplng , and to re-compute these at least once every re-planning period
hplng . The generated routing plans are communicated to the Localized Agents that are
positioned at every junction in the taxiway network. Each Localized Agent controls those
Moving Agents that are travelling towards its location, sends them instructions according
r
to the routing plans, and monitors the execution of those instructions. To do so, they use
the Surface Movement Radar (SMR) to obtain the position and speed of a Moving Agent.
All necessary information such as the past and planned route, activity sequence, and cur-
er
rent SMR data is stored in the Datastrip (DS) associated to each Moving Agent. Once a
Moving Agent has passed the position of a Localized Agent, it hands over the control to the
next Localized Agent by transferring the respective DS. The Moving Agents represent the
pe
aircraft pilots and tug drivers, and are modelled to be fully cooperative. We thus assume
that they execute the received instructions to the best of their possibilities when moving
over the airport surface.
E CENTRAL AGENTS
N Global Clock
V Airport Ops Agent
I
tn
Airport Layout
R
Routing Agent
O
N Surface Movement Radar
M
LOCALIZED AGENTS
E
rin
N Datastrips
T MOVING AGENTS
Figure 1: Architecture of hybrid multi-agent system (MAS) for autonomous airport surface
ep
The Environment comprises a Global Clock to synchronize the time between all agents, the
Airport Layout representing the taxiway network, the Surface Movement Radar to track
15
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
Moving Agents on the airport surface, and the Datastrips associated to every Moving Agent.
The latter is used to pass control between Localized Agents and to store the necessary
operational as well as analysis data.
iew
The taxiway network is represented by a graph G = (V, E) with nodes V and directional
edges E. An exemplary Airport Layout is shown in Fig. 2. Tug decoupling locations (orange
nodes) are specific to each runway (grey edges) and are used for tugs to couple to or decouple
from aircraft that taxi with tug-enabled taxiing. Each bidirectional road segment between
two nodes is constructed from two unidirectional edges that connect the nodes. Taxiway
edges (black) as well as service road edges (blue) can be obtained from the actual locations
ev
of these taxiways at an airport, such as Amsterdam Airport Schiphol. Ramp nodes (green)
represent the aircraft parking positions. The airport layout graph is fully accessible to
the Routing Agent and remains static throughout the simulation. However, edges can be
constrained to prohibit vehicles to be routed over these. Such layout constraints are set by
the Airport Ops Agent and described below.
r
er
pe
ot
tn
rin
paths are shown as black, runways as grey, and service roads as blue edges. Small nodes
represent taxiway intersections (black) or service road intersections (blue). Large nodes
indicate aircraft ramps (green), stopbars (red), decoupling locations (orange), or their as-
sociated all-clear points (grey).
Pr
16
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
4.2 Specification of Airport Ops Agent
The Airport Ops Agent manages all information that influences the airport surface move-
ment operations as a whole: it schedules the flights and tasks of ground vehicles, handles
necessary changes of these, and determines the active runways, i.e., the runway mode of
iew
operation (RMO) for the next 15 minutes. Furthermore, it notifies the Routing Agent of
any temporarily unavailable taxiway segments. The Airport Ops Agent has the following
properties:
P1. Receive Request Property: both Routing Agent and Localized Agents can send
information that affect the operations to the Airport Ops Agent, which then acts
ev
accordingly. For instance, a Localized Agent could report a blocked taxiway segment,
or the Routing Agent may request a change in the schedules to allow for conflict-free
routing.
P2. Update Schedule Entry Property: the Airport Ops Agent schedules all flights
r
and tasks of tugs and updates the entry in the respective schedule accordingly, in-
cluding any necessary changes that arise during the operations (cp. P1). The updated
schedules as well as the time point of the earliest change are shared with the Routing
Agent.
er
P3. Update RMO Property: at each time point t, the Airport Operations Agent checks
whether the active runways have to change. If so, it adjusts the blockage intervals
pe
from t + 15 min onwards for all edges affected by the RMO change. It then shares the
new set of layout constraints with the Routing Agent.
P4. Request Replanning Property: if the time point of earliest change in either sched-
ules or RMO falls within the planning window wplng , the Airport Ops Agent sends a
request for immediate re-planning of the routes to the Routing Agent.
ot
gathers the updated schedule data, layout constraints, along with local information from
the Localized Agents, and converts this into route parameters per agent. Then, it computes
a plan for all vehicles within the planning window wplng using the AS-MAMP algorithm
specified in Section 5, and shares the updated routing plans with the Localized Agents. The
rin
window, while the Localized Agents do so when the deviation between planned and
executed route of a Moving Agent exceeds time thresholds. The request is stored
internally and further handled in P2.
P2. Check Replanning Property: at each time point t the Routing Agent checks
Pr
whether a re-planning request was made or the internal re-planning timer elapsed. If
so, the timer is reset, and the routes are re-planned by executing P3, P6, and P7.
17
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
P3. Update Constraints for Route Planning Property: the layout constraints from
the Airport Ops Agent are gathered and stored internally. These change foremost
when the RMO switches, and allow the Routing Agent to anticipate on upcoming
taxiway closures or re-openings corresponding to the altered RMO.
iew
P4. Handle Time Constraints of Moving Agent Property: to address potential
time constraints of a Moving Agent (Requirement 4c), the Routing Agent assesses
whether these are achievable and whether remote and/or stand holding is necessary,
which influences the activity sequence of the Moving Agent that it defines through
P5. Furthermore, it sets operational constraints pertaining to the time constraints,
ev
to account for these during path planning done by its P7.
P5. Define Activity Sequence of Moving Agent Property: to include the various
parts of the route of a Moving Agent such as push-pull manoeuvres, coupling / de-
coupling of tugs, or holding in path planning, the Routing Agent converts these into
r
one of the following three activities:
er
• go-to activities comprise a start location and a set of goal locations, so that
in path planning, two degrees of freedom exist: time and route. This activity
is used as input to taxi from one point at the airport to another point at the
airport, for example for regular taxiing.
pe
• follow activities consist of a predefined list of path segments that must se-
quentially be part of the route. Therefore, time is the only remaining degree of
freedom in planning, and the path cannot be changed. This activity is used for
instance for pushback and push-pull manoeuvres.
• wait activities prescribe a waiting location and waiting duration that have to
ot
Using a combination of these, the Routing Agent defines an activity sequence accord-
ing to the schedule of a Moving Agent, which has to be executed in the respective
order.
P6. Update Route Parameters of Moving Agents Property: per Moving Agent
rin
that is scheduled to taxi within the planning window, the Routing Agent collects all
necessary parameters to plan its route. This encompasses the kinodynamic represen-
tation according to the vehicle type of the Moving Agent, the activity sequence derived
from its schedule (shared by the Airport Ops Agent), and in case the Moving Agent
is already taxiing also the latest execution data (shared by the controlling Localized
ep
Agent).
P7. Do Route Planning Property: the Routing Agent invokes the AS-MAMP algo-
rithm (cp. Section 5) to re-plan the routes of all agents that are part of the current
Pr
planning round. The resulting new conflict-free routes are shared with the Localized
Agents.
18
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
4.4 Specification of Localized Agents
Each Localized Agent controls the Moving Agents that are travelling towards its location
and hands over the control to the next Localized Agent once a Moving Agent has passed its
location. Based on the generated routing plans, it gives instructions to the Moving Agents,
iew
and monitors their execution. It acquires the necessary position and speed data from the
radar. When the executed route deviates from the plan, a Localized Agent can adjust the
route locally to minimize the deviations. If the deviation to the planned route exceeds
time thresholds, it can request re-planning of the routes. The Localized Agents thus act
as link between the centralized planning and the local operations. They have the following
properties:
ev
P1. Receive Updated Routing Plan Property: once a new routing plan is issued by
the Routing Agent, the Localized Agent updates the planned routes of the Moving
Agent under its control and generates new corresponding instructions by executing
r
P2.
er
nication, the Localized Agents send instructions for executing the next part of the
planned routes to the Moving Agents. This property is thus executed whenever a
Moving Agent has to change its speed or heading within the pre-defined execution
pe
time window wexec . Furthermore, coupling/decoupling as well as engine-start instruc-
tions are sent.
P3. Monitor Execution of Planned Routes Property: at each time point t, each
Localized Agent obtains the positions, headings, and velocities of all Moving Agents
under its control from the radar. The agent then estimates the time and speed at
ot
which each Moving Agent will pass its location, and shares these with the Routing
Agent. The Localized Agents also use the radar data to monitor that their instructions
are executed according to the planned routes. If deviations arise, they can primarily
tn
re-instruct the Moving Agents to close the gap. As last resort, they can execute P4
to trigger centralized re-planning of the routes.
P4. Request Replanning Property: with this property, the Localized Agents request
re-planning from the Routing Agent iff the deviation between a planned and executed
rin
P5. Communicate with Moving Agent Property: the Localized Agents communi-
cate with the Moving Agents under their control whenever required. Thus, they can
pass on information from or about a Moving Agent to the centralized agents.
ep
P6. Handover Control Property: whenever a Moving Agent has passed the location
of a Localized Agent, it transfers the respective Datastrip and thus the control re-
sponsibility to the closest next Localized Agent on the planned route of the Moving
Pr
Agent. Once a Moving Agent has reached its goal, the DS is placed in storage, and
can be utilized for post-operation analyses.
19
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
4.5 Specification of Moving Agents
Moving Agents represent either an aircraft or a tug. They are modelled to have a circular size
with a shape diameter corresponding to their respective type. All aircraft are categorized
according to the six size-types of the ICAO aerodrome reference codes (ICAO, 2016), as
iew
listed in Table 1. Tugs are modelled with a diameter of 12 m based on the reference TaxiBot
system (2013).
ev
ICAO-B 25 Cessna, Learjet
ICAO-C 40 A320, B737, EMB 170/190
ICAO-D 54 A300F, B767
ICAO-E 72 A350, B747, B787
ICAO-F 80 A380, B747-800F
r
Table 1: Categorization of aircraft sizes into ICAO-types
er
When consecutive aircraft take off from a runway, the Moving Agents have to keep a
time-based separation minima to ensure sufficient wake turbulence separation. The times
are dependent on the type-combination of leading and following aircraft, as listed in Table 2.
pe
The values were mapped to the ICAO-types from Table 1 on the basis of the RECAT-EU
time-based separation minima (Rooseleer & Treve, 2018).
Following
ICAO-A ICAO-B ICAO-C ICAO-D ICAO-E ICAO-F
Leading
ot
ICAO-A 80 60 60 60 60 60
ICAO-B 100 60 60 60 60 60
ICAO-C 120 60 60 60 60 60
tn
Table 2: Minimal separation times in [s] for departing aircraft to mitigate wake turbulence.
rin
The role of the Moving Agents is to execute the instructions that they received from the
Localized Agents. They thus have the following properties:
ep
P1. Execute Instructions Property: the Moving Agents execute the received instruc-
tions to the best of their possibilities to limit the amount of necessary re-instructions.
The trajectories provided to the Moving Agents include an allowable positioning tol-
erance around the speed profile in analogy to the Ownship-display used in (Bakowski
Pr
et al., 2017).
20
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
P2. Communicate with Localized Agent Property: in any case, the Moving Agents
can communicate with the controlling Localized Agent to exchange status updates
and relevant operational information. They thus notify the Localized Agent whenever
a process such as tug decoupling or engine-start is estimated to finish. As further
iew
example, they could notify the Localized Agent that their preferred maximal speed is
lower than expected, so that the routing plans can be adapted accordingly.
ev
planning algorithm for ASM Ops (AS-MAMP): it is a two-level solver with a low-level
search to find individual agent trajectories, and a high-level search to coordinate all agents
so that their trajectories are conflict-free. AS-MAMP combines the newly developed Safe
Interval Motion Planning (SIMP) algorithm as low-level with an adapted version of PBS (cp.
Section 3.2) as high-level. In the following, we first provide a general overview of the AS-
r
MAMP algorithm, and elaborate on the underlying concepts in the successive subsections
of this section.
er
PBS de-conflicts agents whose space-time trajectories intersect by constructing a priority
ordering: it maintains a priority tree with each parent-node having up to two child-nodes
to establish a priority-relation between a conflicting pair of agents. Per child-node, a new
priority-pair is added that constrains one of the two agents to avoid the path of the other
pe
agent. Based on the sum-of-cost of the paths of all agents in each child node, the algorithm
then chooses the priority order that results in the lowest overall cost, and expands this node
next. In contrast to the original PBS algorithm, we define the cost of an agent trajectory as
a linear combination of its taxi time and taxi distance. The high-level search continues until
a child-node is expanded without any collisions. The changes to PBS are further outlined
ot
temporarily block the nodes and edges along their path as well as those swept by their
shape. The corresponding time durations per node, edge, or part of an edge form unsafe
intervals (USIs) during which other agents must not traverse the respective parts of the
graph underlying the airport layout. Further details on the graph reservations are discussed
rin
in Section 5.4, and Section 5.5 describes the conflict detection procedure.
To resolve a conflict, the plan of each of the two agents involved in the collision is
updated in the newly created PBS child-nodes by invoking the low-level solver. Its task
is to compute a new trajectory for a single agent that respects all reservations of agents
higher in priority. As such, we have developed the Safe Interval Motion Planning (SIMP)
ep
algorithm. It is based on the SIPP algorithm (cp. Section 3.2) as it reasons with safe time
intervals, defining states on nodes, and using motion principles for the mapping between
states. However, we modified the search by defining safe intervals not only on nodes but
also on edges, and embedded new mechanisms to deal with the kinodynamic restrictions,
Pr
the constraints set by the Routing Agent, and the reservations of agents higher in priority.
These are motivated and briefly described in the following.
21
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
Due to the kinodynamic constraints of Moving Agents, speeding up or slowing down
may take place over multiple edges. Additionally, it may not be possible to reach the
required lower speed to traverse curved segments. Therefore, we do not assume a constant
velocity per edge, but compute motions with piecewise constant acceleration. Per vehicle
iew
type, a maximal speed limit is set for both straight and curved edges. Furthermore, to avoid
stop-and-go movements, the agents need to have a minimal velocity with the exception of
special locations in the taxiway network to fulfil wait-activities. Instead of using a set of
predefined motions between states, we calculate feasible motions on-the-fly that respect the
kinodynamic agent constraints. To this end, we define motion principles as outlined in
Section 5.2.
ev
Due to the kinodynamically confined motions, and dependent on the reservations set by
other agents, reaching a node as early as possible does not necessarily lead to the fastest
motion over multiple edges. Therefore, in contrast to classical MAPF solvers and also the
original SIPP algorithm, past motions influence both the feasibility and cost of upcoming
states. As noted by (Ali & Yakovlev, 2023), this violates a core assumption of SIPP and
r
makes it incomplete. To handle the minimal speed requirements, we take their approach of
projecting wait intervals further: states are defined with Feasible Motion Intervals (FMI)
er
marking both time interval and speed range that can be reached from the previous state.
This approach is further outlined in the state definition and representation of the state
space (cp. Section 5.7).
We minimize the amount of generated states by defining a heuristic function that uses
pe
domain-specific circumstances to estimate the remaining cost to reach the goal location: it
takes the vehicle category, travel direction, and activity sequence including the start and
goal velocities into account (cp. Section 5.6). Furthermore, when generating successor states
(cp. Section 5.9), we anticipate on upcoming speed restrictions and potential overlaps with
the reserved space of other vehicles. Nonetheless, due to the nature of the motion principles,
ot
overlaps may remain along edges. If the contemplated motion over an edge overlaps with
the motion in the same direction of another vehicle, we analytically detect an overlap and
resolve it accordingly. If necessary, we use a backtracking scheme to do so (cp. Section 5.8).
The SIMP algorithm terminates if either a new plan is found, or no unexplored states are
tn
in Algorithm 3 in Appendix A for reference. Changes to the original algorithm from (Ma,
Harabor, et al., 2019) are marked by red line-numbers.
PBS optimizes the sum-of-cost of the agent trajectories. Thus, we define the cost of a
PBS search-node N as
∑A
ep
with the number of agents A, the cost cplan of the trajectory of agent i, and (cprio )i as
dimensionless cost to weigh the importance of the route of agent i for the overall optimization
Pr
goal. For instance, aircraft can be assigned a higher value for cprio than tugs to increase
their prioritization: an increase in plan cost of the aircraft is penalized more than that of the
22
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
tug, potentially influencing the resulting priority order. The cost of each agent’s trajectory
is defined as linear combination of taxi time ttaxi and taxi distance dtaxi :
iew
with cd [s/m] translating the travelled distance into a cost with unit time. The low-level
solver of AS-MAMP uses the same cost-function to determine an agent trajectory, which is
further outlined in Section 5.6.
In the root-node of PBS, the trajectories of all agents are calculated taking merely the
constraints set by the Routing Agent (cp. its P3 and P3) into account. This represents the
single-agent motion planning (SAMP) solution of the AS-MAMP search for each Moving
ev
Agent. Note that a partial priority-ordering can be set in the root-node of the adapted
version of PBS as well.
The next search-node is picked based on the chosen search-mode: in this work, we dis-
tinguish between either a depth-first, best-first, or greedy search. The depth-first search
r
uses the top node of the stack as new parent-node. Newly created child-nodes are thus
placed on top of the stack in order of decreasing cost. The greedy search follows the
approach of Chan et al. (2023) to greedily place the child-node with the lowest number of
er
remaining conflict-pairs on top of the stack irrespective of its sum-of-cost. However, in
contrast to the original Greedy PBS (GPBS) implementation, we do not alter the low-level
solver. With these two search-modes, PBS only backtracks when it cannot find a solution
pe
in the explored branch. While these quickly lead to a solution, the resulting priority or-
derings may be sub-optimal and thus lead to higher taxi times of the agents. In contrast,
the best-first search returns the optimal solution within the priority-tree as it continues
to pick the node with the lowest cost in the priority-tree until the solution is conflict-free.
However, this search-mode is both time-wise and memory-wise only feasible for small agent
populations. In Section 6, we use this search-mode to assess the suboptimality of the depth-
ot
first and greedy searches for the analysed case studies. Note that other search-modes can
be used as well, such as a focal search, or other means to pick child nodes that lead to a
faster and/or higher quality solution. We will explore such in future research.
tn
The authors of GPBS also introduced the concept of using induced constraints (IC)
to further speed up the high-level search. While their use does not necessarily sacrifice
solution quality, it does affect the chosen conflict to be resolved (cp. Section 5.5) and thus
the resulting priority order. We adopt this technique as variation of all three search-modes,
and study its influence in Section 6. In the following, we will outline the remaining building
rin
• since the radius of curvature r of all taxiway segments is large in comparison to the
Pr
travelled distance and the change in velocity within that distance, we neglect the
normal component of the acceleration
23
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
von der Burg, Kamphof, Soomers, & Sharpanskykh
ed
• despite that, velocity limits have to be met to traverse curved taxiway segments that
have a radius of curvature below a pre-defined threshold value.
These allow us to use the equations for rectilinear motions, i.e. with constant accelera-
tion along a straight line:
iew
acceleration a(t) = const. in [m/s²] (3)
velocity v(t) = vi + a t in [m/s] (4)
1
distance d(t) = di + vi t + a t2 in [m] (5)
2
ev
with elapsed time t, initial velocity vi , and initial distance di . Since we consider relative
distances when traversing taxiway segments, di = 0. Rewriting these equations yields:
√
final velocity vf = vi + a t = vi2 + 2 a d (6)
r
2d vf − vi
traversal time t= = (7)
vi + vf a
travelled distance
er 1
d = vi t + a t 2 =
2
vi + vf
2
t
We apply these equations to calculate on-the-fly motions along the graph edges during
(8)
pe
single-agent path planning. Since the above equations are only valid for constant accel-
eration, we split motions into piecewise segments of constant acceleration if needed, and
allow for kinks both at nodes and anywhere on edges. In general, the velocity profile is
optimized for smallest traversal time and highest final velocity. The velocity is bounded
by an agent-dependent maximal velocity vmax as well as a minimal velocity vmin when in
motion.
ot
During planning, reaching a full stop is only allowed at the end-nodes of edges. The
agents can wait and hold arbitrarily long at these, as long as no reservations from other
agents exist. While waiting is bound to nodes, holding may also occur at the beginning
(but not at the end) of a motion along an edge. By convention, a motion always connects
tn
the state at the beginning of an edge with the state at the beginning of a subsequent edge.
A trajectory over multiple edges is thus split into one motion per edge before generating
the resulting states on the respective nodes (cp. Section 5.7). Note that the duration of
motions can have arbitrary values, allowing for execution in continuous time.
rin
Fig. 3 illustrates different motions over one or multiple edges with a path length of
d = 250 m. The exemplary motions are plotted in both a (d,t)-graph (top) and a (d,v)-
graph (bottom). Speed limits and minimal traversal times as restrictions are depicted by
red shades. In the first column, a velocity limit of vf,lim = 5 m/s must be reached at
the end of the path. Thus, to minimize the traversal time, acceleration from vi < vf,lim
ep
to vf = vf,lim takes place immediately (blue line), while deceleration from vi > vf,lim to
vf = vf,lim happens towards the end of the path (orange line).
In contrast, when a minimal traversal time due to time restrictions has to be exceeded
(second column), the motion is optimized for maximal final velocity: acceleration is carried
Pr
out towards its end (blue line), while deceleration occurs immediately (orange line). Notice
that while both motions take the same time, the blue motion will have a significantly shorter
24
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4916874
Towards Autonomous ASM Ops using Hierarchical Multi-Agent Planning
ed
Y U H V W U L F W L R Q W U H V W U L F W L R Q Y W U H V W U L F W L R Q D F F G H F P R W L R Q
W L P H W >