Computational Logistics: Joachim R. Daduna Gernot Liedtke Xiaoning Shi Stefan Voß
Computational Logistics: Joachim R. Daduna Gernot Liedtke Xiaoning Shi Stefan Voß
Daduna
Gernot Liedtke
Xiaoning Shi
Stefan Voß (Eds.)
LNCS 14239
Computational
Logistics
14th International Conference, ICCL 2023
Berlin, Germany, September 6–8, 2023
Proceedings
Lecture Notes in Computer Science 14239
Founding Editors
Gerhard Goos
Juris Hartmanis
Computational
Logistics
14th International Conference, ICCL 2023
Berlin, Germany, September 6–8, 2023
Proceedings
Editors
Joachim R. Daduna Gernot Liedtke
Hochschule für Wirtschaft und Recht Berlin Deutsches Zentrum für Luft- und Raumfahrt
Berlin, Germany (DLR)
Berlin, Germany
Xiaoning Shi
Deutsches Zentrum für Luft- und Raumfahrt Stefan Voß
(DLR) University of Hamburg
Berlin, Germany Hamburg, Germany
© The Editor(s) (if applicable) and The Author(s), under exclusive license
to Springer Nature Switzerland AG 2023
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of
the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation,
broadcasting, reproduction on microfilms or in any other physical way, and transmission or information
storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now
known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this book
are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the
editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors
or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in
published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
For decades, logistics tasks were of minor importance in economics as well as in busi-
ness practice; the basic functions transportation, transshipment and warehousing were
regarded as necessary factors, but not essential from a market-related as well as a compet-
itive point of view. Discussions about logistics were mainly related to the military sector,
from which, however, the key impact for a fundamental rethinking came at the end of
the 1940s. The employment of qualified former military personnel with leadership and
management experience in the civilian economy resulted in a transfer of essential logis-
tical principles into operational planning and control, the importance of which increased
steadily with the internationalization and, in particular, the globalization of the economy.
In the context of further developments, it became clear that logistics is not every-
thing, but without logistics, everything is nothing. It became increasingly obvious that
the mobility of goods is and will continue to be an essential prerequisite for ensuring sat-
isfactory efficiency in service provision processes organized on the basis of the division
of labor within the framework of spatially decentralized structures, which increasingly
determined the industrial sector as well as retail trade. Linked to this was the neces-
sity to install suitable external and internal information management for the continuous
monitoring and control of a globally connected cooperation.
The essential importance of reliable and efficient flows of goods and information
has become apparent to the general public. This holds especially as a result of the
consequences of the Covid-19 pandemic and the effects of the Russian war against
Ukraine. But even singular events, such as a well-observed accident in the Suez Canal
not too long ago, can lead to serious disruptions in logistical processes, combined with
significant negative economic consequences. Even despite increasing de-globalization
and backshoring, efficiency and costs must not be the only key objectives; ensuring
resilient logistics structures must also be included. Ultimately, this also means that
efficient logistics must be seen as a necessary core competence for the economic success
of companies, especially in the manufacturing and retail trade sectors.
In recent years, far-reaching progress has been made in logistics as well as in informa-
tion and communication technology, for example in the areas of theoretical foundations,
the available technical systems and also in operational realizations. In the foreground
were the development and improvement of algorithms, the increased automation in trans-
port and warehousing up to autonomous systems, the use of processes from the field of
artificial intelligence as well as the development of new materials and manufacturing
processes and also advances in communication technology. This results in the mandatory
necessity of a problem-adequate and object-oriented combination of logistical functions,
quantitative solution procedures and methods for decision support with elements of effi-
cient information management in theory and practice. To push these developments is the
central aim of the International Conferences on Computational Logistics (ICCL), which
have been held worldwide for 13 years now.
Starting as a joint endeavor between Hamburg and Shanghai the ICCL was held in
Shanghai (China) in 2010 (as well as in 2012), followed in 2011 by Hamburg (Germany),
2013 Copenhagen (Denmark), 2014 Valparaiso (Chile), 2015 Delft (The Netherlands),
vi Preface
2016 Lisbon (Portugal), 2017 Southampton (UK), 2018 Salerno (Italy), 2019 Barran-
quilla (Colombia), 2020 as well as 2021 Enschede (The Netherlands (online)) and 2022
Barcelona (Spain). The 14th ICCL conference, which took place in Berlin (Germany)
in September 2023, also aimed to bring together representatives from fields of research
and development as well as from practice with the objective of an intensive exchange
of knowledge and experiences. Presentations and discussions of current research results
as well as developments in various areas of logistics were at the foreground. The main
topics of this conference were grouped into five streams as follows:
1. Computational Logistics
Starting from observing the recent trend of using generative artificial intelligence
tools, the first paper provides an insight into this regarding logistics. Moreover, among
others, this group of papers considers issues of recommendation systems and logistics
platforms, internal hospital logistics, cybersecurity aspects for synchromodal freight
operations, as well as an emprical study regarding cross-border e-commerce.
2. Maritime Shipping
The largest group of papers, extending a major focus of the ICCL conferences right
from the start, concerns maritime shipping, including considerations of inland water-
ways. Specifically, the included papers span a wide area from developing digital
twins in seaports, tramp ship routing, liquified natural gas (LNG) delivery programs,
conventional as well as roll-on roll-off (RORO) stowage planning up to problems
regarding the allocation of shore-side electricity for grouped container terminals,
among others.
3. Vehicle Routing
Vehicle routing problems make up the core of many logistics considerations. Beyond
the inclusion of advances in classical vehicle routing, we encounter quite a few prac-
tical applications using a variety of modeling approaches. In more detail, we have
works dealing with the vehicle routing problem with time windows, a truck-drone
routing problem, risk and uncertainty considerations, e.g., in humanitarian logistics
as well as an e-waste collection problem. An interesting twist are novel applications
including the snow grooming routing problem, which is an extension of an arc rout-
ing problem with profits. The problem arises in cross-country skiing facilities when
scheduling the grooming of track networks.
4. Traffic and Transport
This volume extends the interest in public transport works, compared to earlier ones.
This includes papers on rolling stock circulation planning as well as robust bus driver
rostering with uncertainty, but also the prediction and analysis of travel times for
transit ferries and ridesharing issues in rural areas with autonomous electric vehicles.
The usability of public transport regarding health care services in rural areas as well
as questions regarding the combination of public transport and freight transport are
considered, too.
5. Combinatorial Optimization
While many of the above problems might be considered as combinatorial optimization
problems, there is more. Applications relate to location planning, job shop scheduling,
but also various path planning problems and rich lot-sizing problems.
Preface vii
The editors thank all the authors for their submissions as well as the program commit-
tee and the reviewers for their helpful support and important feedback which have made
a significant contribution to the realization of this book. Reviewing was single blind with
33 full papers accepted out of 71 submissions. Each paper was assigned to at least two
reviewers and the average number of reviews received per submission was 2.42. Finally,
we would like to express our thanks to Julia Bachale for her comprehensive support and
assistance during the preparation of the conference and also the manuscript. We trust that
the present volume supports the more important advances within computational logistics
and inspires all participants and readers to further research activities.
Computational Logistics
Maritime Shipping
Allocation of Shore Side Electricity: The Case of the Port of Hamburg . . . . . . . . 139
Jingjing Yu, Philip Cammin, and Stefan Voß
Vehicle Routing
A Regret Policy for the Dynamic Vehicle Routing Problem with Time
Windows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235
Peter Dieter
Combinatorial Optimization
UAV Path Planning for Area Coverage and Energy Consumption in Oil
and Gas Exploration Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 467
Salim Sulaiman Maaji and Dario Landa-Silva
xiv Contents
Stefan Voß(B)
1 Introduction
After more than 40 years [48], artificial intelligence (AI) is gaining rejuvenated
popularity based on powerful computational environments as well as advances in
information systems research. Acronyms and phrases like deep learning, machine
learning, or generative AI currently make it into the popular press implying hopes
and fears all around the globe. Generative AI can be a system using AI techniques
being able to generate text, software, images, or other types of files/documents
in response to prompts, questions or requests; see, e.g., [41]. At the core of these
systems we see comprehensive language models being able to produce data based
on appropriately trained systems using possibly large data sets (making this a
big data application).
Popular generative AI systems include ChatGPT, a chatbot (a chatbot may
be seen as a computer program that simulates and processes human conversa-
tion) built by OpenAI using a large language model like GPT-3 or GPT-4.1
1
See, e.g., https://openai.com/blog/chatgpt; last access 30 April 2023.
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 3–17, 2023.
https://doi.org/10.1007/978-3-031-43612-3_1
4 S. Voß
Other systems include Bard, which is related to the company Google2 , Perplex-
ity3 , Bing4 , as well as the Chinese counterpart Ernie available on Baidu [42,50].
A gentle introduction can be found at different places, e.g., in [19].
Generative AI has potential applications across a wide range of industries
and domains, including software development, marketing, and fashion [4]. Most
remarkable in the open public seems to be the discussion regarding its use in
education (see, e.g., [25]).
Notable examples regarding the use of ChatGPT in logistics include [47] with
a focus on Industry 5.0 and [52] concentrating on potential transportation appli-
cations. Other examples reaching out towards logistics in specific domains as
well as supply chain management are [6,17]. In a subsequent companion paper,
we focus on issues of bus bunching and bus bridging (see, e.g., [9]) in public
transport and also compare answers from ChatGPT with those from Bing [45].
In general, however, the logistics domain has not yet been comprehensively cov-
ered regarding the description of using generative AI systems. Moreover, while
attempting to showcase the opportunities and future applications, the limita-
tions regarding content also need to be exemplified. In this paper, we aim to
provide some insights regarding the use of ChatGPT in the logistics domain and
exemplify possible limitations by investigating a specific problem in stochastic
vehicle routing which has not yet been comprehensively covered in the academic
literature, indicating that even large language models might face some difficulties
for handling it, possibly due to the missing academic considerations in related
works.
The remainder of the paper is structured as follows. First, we provide a
literature review and problem description regarding the specific vehicle routing
problem (VRP) to be considered. Moreover, we provide some entries towards
using ChatGPT and generative AI to conduct academic studies and establish
related reports. Then, in Sect. 3, we utilize ChatGPT in an attempt to gain
insights regarding the advancements of a specific problem, namely the stochastic
VRP with an uncertain number of vehicles. Parts of the paper are using results
from ChatGPT where practical.5 Finally conclusions are drawn and selected
future research needs are exemplified.
Asking ChatGPT about important references on the VRP gives five mean-
ingful entries including the following (plus one reference that could not be found
in popular databases): [11,43].
VRPs are among the most studied problems in operations research and dis-
crete applied mathematics with quite a few textbooks and well-known compendi-
ums devoted to the problem with a most recent one being [12]. Also surveys seem
to be a dime a dozen including those focusing, e.g., on specific issues such as
logistics distribution [18], reverse logistics [36] or stochastic VRPs [32].
Related to the VRP (and our research focus; see below) is also the topic of
vehicle scheduling where the basic idea is to cover a given set of trips by means of
a set of vehicles. To be more specific and choosing a specialized problem setting,
given a set of timetabled trips and a fleet of vehicles the integrated vehicle and
duty scheduling problem seeks to find minimum-cost vehicle blocks (the schedule
of a vehicle) and valid driver duties such that each active trip is covered by
one block, each active trip segment is covered by one duty, and each deadhead,
pull-in, and pull-out trip used in the vehicle schedule is also covered by one duty.
That is, a vehicle performs a feasible sequence of trips from the time it leaves a
depot until it returns to the depot. Example references include [7,8,21].
Using different words, this can be described as follows:
“The integrated vehicle and duty scheduling problem (IVDSP) is a com-
binatorial optimization problem that involves scheduling a set of vehicles
and their drivers over a planning horizon to fulfill a given set of trips or
duties. The objective of the IVDSP is to minimize the total cost, which
typically includes the cost of vehicle operations, driver labor, and penalties
for violations of operational constraints.
The IVDSP is a challenging problem due to its complexity and the large
number of decision variables involved. The problem involves simultane-
ously scheduling vehicles and drivers, considering factors such as vehicle
capacity, driver skills, rest and work regulations, and travel times. Addi-
tionally, the problem often includes uncertain demand, which further com-
plicates the planning process.” (ChatGPT)
6 S. Voß
hardly be found with [38] being an exception, as pointed out in the previous
paragraph.
The above considerations open up the avenue towards formulating and inves-
tigating an SVRP or a vehicle scheduling problem with an uncertain number of
vehicles. Regarding mathematical formulations, this relates to issues of right-
hand-side (RHS) uncertainty. Note that in mathematical programming, RHS
uncertainty refers to the uncertainty in the values of the right-hand side coeffi-
cients of a linear programming or mixed-integer programming model. To exem-
plify, we may consider RHS uncertainty in the context of SVRPs as well as inte-
grated vehicle and crew/duty scheduling problems and show that these problem
domains may actually be treated in a way that allows for deriving reasonable
solution approaches as well as opening up for future research efforts [8]. The
consideration of an uncertain number of vehicles may not only be motivated by
means of disaster relief issues but also be motivated by means of electric vehicles
having a restricted usable time of the battery, possibly making them unavailable
after some time.
2.2 Generative AI
6
The dataset, code, and models are publicly available at https://github.com/Hello-
SimpleAI/chatgpt-comparison-detection; last access 30 April 2023.
8 S. Voß
through thinking, experiences and senses [29]. It is the ability to generate knowl-
edge by using existing information (as is also a common and well-known issue in
information management; see, e.g., [44]). We do not claim that ChatGPT and
generative AI tools are able towards successfully pursuing this generation, but
the fear of related claims is there.
Finally, we should note that comparative studies of different chatbots may
also be conducted but are still rare; see, e.g., [28]. This is seen in front of what
happened during the last 20 years regarding the use of chatbots especially in
supply chain management where simple tasks such as tracking shipments or
managing inventory are accompanied by related tools. Marrying natural lan-
guage processing and optimization is also on the horizon with first efforts being
on the way [34].
3 Example Explorations
First of all, we can put forward the core focus of our paper to ChatGPT in
asking “Successfully Using ChatGPT in Logistics: Are We There Yet?” leading
to the following answer.
The answer seems good despite the fact that the given references do not seem
to exist. It is an experience.
To be more specific, the mentioned authors exist and have worked on similar
problems but they have not published those specific papers. The titles of those
papers seem to be invented and they cannot be found with popular search engines
(including Google Scholar). The journals exist, but they have other works in the
mentioned issues and page numbers.
Now, let us focus on a specific issue in mathematical programming for solving
the SVRP with an uncertain number of vehicles.
Question: What is right-hand-side uncertainty?
Then, however, we get four references that are non-existing (or can not be
found at the places/journals that they are claimed to be cited from). That is,
ChatGPT tends to generate “hallucinated” information beyond the given knowl-
edge; see, e.g., [2] for a related consideration. Hallucinations are factual inaccura-
cies, but they seem to appear as scientifically plausible (at least at first sight). Of
course this is problematic because given references may give a false impression
of existing work that does not exist.
In a yet not comprehensible form, we have worked on RHS uncertainty for
the integrated vehicle and duty/crew scheduling problem. Again, the aim is to
build minimum cost vehicle and daily crew schedules and rosters that cover a
given set of timetabled trips as well as all daily crew duties incorporating a pre-
defined days-off pattern. Details of the problem setting and a modeling approach
can be found in [7,8,21], where the first puts up the overall problem while the
latter extend in various dimensions especially including delay propagation and
uncertainty in the number of vehicles. Those works had been done without using
ChatGPT, and even attempting to let ChatGPT rejuvenate written code in
Python towards solving the problem failed. Though, when we go back to a simple
VRP or even further towards using it to provide code to input instances of
the traveling salesman problem to a commercial solver like CPLEX, we might
succeed.7
4 Conclusions
In this paper, we attempted to investigate the use of a specific generative artificial
intelligence system, widely discussed even in the open public with quite a bit of
media coverage. If generally known issues are considered, ChatGPT, the system
7
Question: Can you write code to input a problem instance of the traveling salesman
problem to CPLEX?
Answer: “Yes, I can provide a code snippet in Python to input a TSP instance
to CPLEX. Here is an example code that reads a TSP instance from a file in the
TSPLIB format, creates a CPLEX model, and solves it: ... ” (ChatGPT).
Using ChatGPT in Logistics 13
8
Examples might be the use of INCOTERMS in logistics; see, e.g., https://iccwbo.
org/business-solutions/incoterms-rules/incoterms-2020/; last access 30 April 2023.
14 S. Voß
References
1. AI HLEG: Ethics Guidelines for Trustworthy AI. European Commis-
sion, Brussels (2019). https://digital-strategy.ec.europa.eu/en/library/ethics-
guidelines-trustworthy-ai. High-Level Expert Group on Artificial Intelligence.
Accessed 22 June 2023
2. Bang, Y., et al.: A multitask, multilingual, multimodal evaluation of ChatGPT on
reasoning, hallucination, and interactivity (2023). https://doi.org/10.48550/arXiv.
2302.04023. Available on Arxiv
3. Berhan, E., Beshah, B., Kitaw, D., Abraham, A.: Stochastic vehicle routing prob-
lem: a literature survey. J. Inf. Knowl. Manag. 13(3), 1450022 (2014). https://doi.
org/10.1142/S0219649214500221
4. Castelli, M., Manzoni, L.: Editorial: generative models in artificial intelligence
and their applications. Appl. Sci. 12(9), 4127 (2022). https://doi.org/10.3390/
app12094127
5. Du, H., et al.: Chat with ChatGPT on intelligent vehicles: an IEEE TIV perspec-
tive. IEEE Trans. Intell. Veh. 8(3), 2020–2026 (2023). https://doi.org/10.1109/
TIV.2023.3253281
6. Frederico, G.F.: ChatGPT in supply chains: initial evidence of applications and
potential research agenda. Logistics 7(2), 26 (2023). https://doi.org/10.3390/
logistics7020026
7. Ge, L., Kliewer, N., Nourmohammadzadeh, A., Voß, S., Xie, L.: Revisiting the
richness of integrated vehicle and crew scheduling. Public Transp. (2022). https://
doi.org/10.1007/s12469-022-00292-6
8. Ge, L., Nourmohammadzadeh, A., Voß, S., Xie, L.: Robust optimization for inte-
grated vehicle and crew scheduling based on uncertainty in the main inputs. In: The
Fifth Data Science Meets Optimisation Workshop at IJCAI-2022, Vienna (2022).
https://sites.google.com/view/ijcai2022dso/. Accessed 30 Apr 2023
9. Ge, L., Voß, S., Xie, L.: Robustness and disturbances in public transport. Public
Transp. 14, 191–261 (2022). https://doi.org/10.1007/s12469-022-00301-8
10. Gendreau, M., Laporte, G., Séguin, R.: Stochastic vehicle routing. Eur. J. Oper.
Res. 88, 3–12 (1996). https://doi.org/10.1016/0377-2217(95)00050-X
11. Golden, B., Raghavan, S., Wasil, E. (eds.): The Vehicle Routing Problem: Latest
Advances and New Challenges. Springer, New York (2008). https://doi.org/10.
1007/978-0-387-77778-8
12. Golden, B., Wang, X., Wasil, E.: The Evolution of the Vehicle Routing Problem.
Springer, Cham (2023). https://doi.org/10.1007/978-3-031-18716-2
13. Guo, B., et al.: How close is ChatGPT to human experts? Comparison corpus, eval-
uation, and detection (2023). https://doi.org/10.48550/arXiv.2301.07597. Avail-
able on Arxiv
Using ChatGPT in Logistics 15
14. Haman, M., Školnı́k, M.: Using ChatGPT to conduct a literature review. Account.
Res. 1–3 (2023). https://doi.org/10.1080/08989621.2023.2185514
15. Heilig, L., Lalla-Ruiz, E., Voß, S.: Digital transformation in maritime ports: anal-
ysis and a game theoretic framework. NETNOMICS 18, 227–254 (2017). https://
doi.org/10.1007/s11066-017-9122-x
16. Kasneci, E., et al.: ChatGPT for good? On opportunities and challenges of large
language models for education. Learn. Individ. Differ. 103, 102274 (2023). https://
doi.org/10.1016/j.lindif.2023.102274
17. Kim, J., Lee, J.: How does ChatGPT introduce transport problems and solutions
in North America? Findings (2023). https://doi.org/10.32866/001c.72634. Entry
72634
18. Konstantakopoulos, G., Gayialis, S., Kechagias, E.: Vehicle routing problem and
related algorithms for logistics distribution: a literature review and classifica-
tion. Oper. Res. Int. J. 22, 2033–2062 (2022). https://doi.org/10.1007/s12351-
020-00600-7
19. Lin, H.Y.: Large-scale artificial intelligence models. Computer 55(5), 76–80 (2022).
https://doi.org/10.1109/MC.2022.3151419
20. McGee, R.W.: How would American history be different if LBJ had lost the 1948
election? A ChatGPT essay (2023). https://doi.org/10.2139/ssrn.4413418. Avail-
able at SSRN
21. Mesquita, M., Moz, M., Paias, A., Pato, M.: A decomposition approach for the
integrated vehicle-crew-roster problem with days-off pattern. Eur. J. Oper. Res.
229, 318–331 (2013). https://doi.org/10.1016/j.ejor.2013.02.055
22. Minoux, M.: Robust linear programming with right-hand-side uncertainty, duality
and applications. In: Floudas, C., Pardalos, P. (eds.) Encyclopedia of Optimization,
pp. 3317–3327. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-74759-
0 569
23. Minoux, M.: Two-stage robust LP with ellipsoidal right-hand side uncertainty is
NP-hard. Optim. Lett. 6(7), 1463–1475 (2012). https://doi.org/10.1007/s11590-
011-0341-z
24. Möller, P. (ed.): ChatGPT and the Like: Artificial Intelligence in Logis-
tics. DHL (2023). https://dhl-freight-connections.com/en/trends/chatgpt-and-
the-like-artificial-intelligence-in-logistics/. Accessed 10 May 2023
25. Mollick, E.R., Mollick, L.: Using AI to implement effective teaching strategies in
classrooms: five strategies, including prompts (2023). https://doi.org/10.2139/ssrn.
4391243. Available at SSRN
26. Neugebauer, J., Heilig, L., Voß, S.: Digital twins in seaports: current and future
applications. In: Daduna, J., et al. (eds.) ICCL 2023. LNCS, vol. 14239, pp. 202–
218. Springer, Cham (2023)
27. Nourmohammadzadeh, A., Voß, S.: An effective matheuristic approach for robust
bus driver rostering with uncertain daily working hours. In: Daduna, J., et al. (eds.)
ICCL 2023. LNCS, vol. 14239, pp. xx–yy. Springer, Cham (2023)
28. O’Leary, D.E.: An analysis of three chatbots: BlenderBot, ChatGPT and LaMDA.
Intell. Syst. Account. Finance Manag. 30(1), 41–54 (2023). https://doi.org/10.
1002/isaf.1531
29. Otero, I., Salgado, J.F., Moscoso, S.: Cognitive reflection, cognitive intelligence,
and cognitive abilities: a meta-analysis. Intelligence 90, 101614 (2022). https://
doi.org/10.1016/j.intell.2021.101614
30. Ouorou, A.: Robust models for linear programming with uncertain right hand side.
Networks 68(3), 200–211 (2016). https://doi.org/10.1002/net.21693
16 S. Voß
31. Oyola, J., Arntzen, H., Woodruff, D.L.: The stochastic vehicle routing problem, a
literature review, part II: solution methods. EURO J. Transp. Logist. 6(4), 349–388
(2017). https://doi.org/10.1007/s13676-016-0099-7
32. Oyola, J., Arntzen, H., Woodruff, D.L.: The stochastic vehicle routing problem, a
literature review, part I: models. EURO J. Transp. Logist. 7(3), 193–221 (2018).
https://doi.org/10.1007/s13676-016-0100-5
33. Özaltın, O.Y., Prokopyev, O.A., Schaefer, A.J.: The bilevel knapsack problem with
stochastic right-hand sides. Oper. Res. Lett. 38(4), 328–333 (2010). https://doi.
org/10.1016/j.orl.2010.04.005
34. Ramamonjison, R., et al.: NL4Opt competition: formulating optimization problems
based on their natural language descriptions (2023). https://doi.org/10.48550/
arXiv.2303.08233. Available on Arxiv
35. Ritzinger, U., Puchinger, J., Hartl, R.F.: A survey on dynamic and stochastic
vehicle routing problems. Int. J. Prod. Res. 54, 215–231 (2016). https://doi.org/
10.1080/00207543.2015.1043403
36. Sar, K., Ghadimi, P.: A systematic literature review of the vehicle routing problem
in reverse logistics operations. Comput. Ind. Eng. 177, 109011 (2023). https://doi.
org/10.1016/j.cie.2023.109011
37. Savelsbergh, M., Ulmer, M.: Challenges and opportunities in crowdsourced delivery
planning and operations. 4OR 20, 1–21 (2022). https://doi.org/10.1007/s10288-
021-00500-2
38. Shi, Q., Yang, W., Yang, Z.M., Zhao, Q.C.: Relief materials vehicles planning in
natural disasters. IEEE/CAA J. Automatica Sin. 5(2), 595–601 (2018). https://
doi.org/10.1109/JAS.2017.7510850
39. Soeffker, N., Ulmer, M.W., Mattfeld, D.C.: Stochastic dynamic vehicle routing in
the light of prescriptive analytics: a review. Eur. J. Oper. Res. 298, 801–820 (2022).
https://doi.org/10.1016/j.ejor.2021.07.014
40. Solis, T.: Die ChatGPT-Richtlinien der 100 größten deutschen Univer-
sitäten (2023). https://www.scribbr.de/ki-tools-nutzen/chatgpt-universitaere-
richtlinien/. Accessed 22 June 2023
41. Stokel-Walker, C., Van Noorden, R.: The promise and peril of generative AI. Nature
614, 214–216 (2023). https://doi.org/10.1038/d41586-023-00340-6
42. Sun, Y., et al.: ERNIE: enhanced representation through knowledge integration
(2019). https://doi.org/10.48550/arXiv.1904.09223. Available on Arxiv
43. Toth, P., Vigo, D.: Vehicle Routing: Problems, Methods, and Applications, 2nd
edn. SIAM, Philadelphia (2014)
44. Voß, S., Gutenschwager, K.: Informationsmanagement. Springer, Berlin (2001).
https://doi.org/10.1007/978-3-642-56878-7
45. Voß, S.: Bus bunching and bus bridging: what can we learn from generative AI
tools like ChatGPT? Sustainability 15(12, article #9625) (2023). https://doi.org/
10.3390/su15129625
46. Wang, D., Lu, C.T., Fu, Y.: Towards automated urban planning: when genera-
tive and ChatGPT-like AI meets urban planning (2023). https://doi.org/10.48550/
arXiv.2304.03892. Available on Arxiv
47. Wang, F.Y., Yang, J., Wang, X., Li, J., Han, Q.L.: Chat with ChatGPT on Industry
5.0: learning and decision-making for intelligent industries. IEEE/CAA J. Auto-
matica Sin. 10(4), 831–834 (2023). https://doi.org/10.1109/JAS.2023.123552
48. Winston, P.H.: Artificial Intelligence. Addison-Wesley, Boston (1992). (1st edn. in
1977)
Using ChatGPT in Logistics 17
49. Xiong, C., Chen, X., He, X., Lin, X., Zhang, L.: Agent-based en-route diversion:
dynamic behavioral responses and network performance represented by macro-
scopic fundamental diagrams. Transp. Res. Part C Emerg. Technol. 64, 148–163
(2016). https://doi.org/10.1016/j.trc.2015.04.008
50. Yang, Z.: The bearable mediocrity of Baidu’s ChatGPT competitor. MIT Tech-
nology Review (2023). https://www.technologyreview.com/2023/03/22/1070154/
baidu-ernie-bot-chatgpt-reputation/. Accessed 10 May 2023
51. Zhang, C., et al.: One small step for generative AI, one giant leap for AGI: a com-
plete survey on ChatGPT in AIGC era (2023). https://doi.org/10.48550/arXiv.
2304.06488. Available on Arxiv
52. Zheng, O., Abdel-Aty, M., Wang, D., Wang, Z., Ding, S.: ChatGPT is on the
horizon: could a large language model be all we need for intelligent transportation?
(2023). https://doi.org/10.48550/arXiv.2303.05382. Available on Arxiv
53. Zhu, J.J., Jiang, J., Yang, M., Ren, Z.J.: ChatGPT and environmental research.
Environ. Sci. Technol. (2023). https://doi.org/10.1021/acs.est.3c01818
When Routing Meets Recommendation:
Solving Dynamic Order
Recommendations Problem
in Peer-to-Peer Logistics Platforms
Zhiqin Zhang1 , Waldy Joe1 , Yuyang Er2 , and Hoong Chuin Lau1(B)
1
School of Computing and Information Systems, Singapore Management University,
Singapore, Singapore
{zqzhang.2020,waldy.joe.2018}@phdcs.smu.edu.sg, [email protected]
2
AI Singapore, Singapore, Singapore
[email protected]
1 Introduction
Peer-to-Peer (P2P) transportation platforms, such as Uber, Lyft, and Food-
Panda, have been gaining popularity in recent years with greater adoption of
internet technology (such as e-commerce and mobile apps) and the growing
demand for more efficient urban mobility. These platforms include ride-hailing
or ride-sharing and crowdsourced delivery service providers where the former
provide transportation services for people while the latter for goods. The key
feature of such platforms is that the platform owners do not own the physical
assets (fleet of vehicles) nor employ the transport operators (drivers or riders,
used interchangeably) unlike the traditional taxi companies or logistics service
providers. Thus, the efficiency of such P2P transportation model lies in the suc-
cessful matching of demand and supply, i.e., how to match the delivery tasks
(or orders or jobs, used interchangeably) with the appropriate drivers that will
result in successful assignment and completion of the jobs. This is an interesting
research challenge as unlike in the traditional routing problems, the drivers have
a choice to accept or reject the jobs.
The work presented in this paper is motivated by a real-world problem sce-
nario involving a P2P logistics platform uParcel1 , where new delivery tasks are
being recommended throughout the day in the form of a ranked list (also known
as menu) to individual crowdsourced drivers who have already existing pre-
scheduled delivery tasks. This problem is essentially a combination of a user rec-
ommendation problem [23] and a Same-Day Delivery Problem (SDDP) which is
a variant of a Dynamic Vehicle Routing Problem (DVRP) [5]. For a successful
matching of delivery tasks to drivers, the recommendations need to take into
account not only the platform’s profitability (which in classical SDDP is deter-
mined by how good the delivery routes are), but also the drivers’ preferences for
the tasks recommended.
Both, the user recommendation problem and the SDDP have been widely
studied and researched in their respective fields. However, there is a great poten-
tial for synergy between the two research domains in addressing the problem pre-
sented in this paper. The traditional Recommender Systems (RS) are only able
to capture users’ general personal preferences [29] while the Sequential Recom-
mender Systems (SRS) go one step further by taking into account the sequential
dependencies of user-item interactions for more accurate recommendation [7].
Although an SRS takes into account the existing state of a user, it is only able
to recommend the user’s next action, i.e., in the context of e-commerce, the item
that he or she will purchase next. This is insufficient in addressing the problem
in this paper which is akin to recommending a new product given that users
have purchased a list of items (pre-scheduled orders) and that they are planning
to purchase another list of items in the future (new orders that arrive dynami-
cally). Meanwhile, current works in solving the SDDP assume that the drivers
1
uParcel is a Singapore start-up company which offers on-demand delivery and courier
services for business and consumers. See https://www.uparcel.sg/ (last access date
02 July 2023).
20 Z. Zhang et al.
will always accept the assigned delivery tasks [27,28], which is not the case in a
crowdsourced setting.
To address this gap, we bring together two key ideas from Reinforcement
Learning and Recommender Systems. More precisely, we propose an adaptive
recommendation heuristic (ARH) that incorporates reinforcement learning (RL)
to learn the parameter selection policy within the heuristic and a RS model called
xDeepFM [15] to predict the driver’s probability of accepting an order based on
their preference. ARH relies on a linear scalarization function that takes into
account both routing information and order-driver interaction in generating the
recommendation, and the RL-trained policy is utilized to adaptively determine
the exact weightages of this linear function at a given pre-decision state.
2 Related Works
Same-Day Delivery Problem. There are two broad categories of approaches for
solving SDDP or DVRP in general namely offline or pre-processsed decision
support and online decision [24]. In offline approaches, policies or values for
decision-making are computed prior to the execution of a plan. Here, the problem
is usually formulated and solved as an Markov Decision Process (MDP). Unfor-
tunately, MDP-based approaches (specifically tabular-based ones) fall into the
curse of dimensionality and hence are not suitable for most real-world problems
[19]. Approximate Dynamic Programming (ADP) approaches [20] are commonly
used to tackle the scalability issue, and one such ADP method for the DVRP is
Approximate Value Iteration (AVI) [1]. Increasingly, there also have been many
recent works that addressed to solve the DVRP using RL [6,8,13,14]. Mean-
while, lookahead approaches have been applied successfully to solve dynamic
and stochastic routing problems. These are commonly termed as rollout algo-
rithms, e.g., [25], the pilot method, e.g., [17] and Multiple Scenario Approach
(MSA), e.g., [4,28]. The approaches proposed in these works are not directly
applicable to our problem since we are dealing with crowdsourced drivers; how-
ever, like many current works that use RL to solve the DVRP, we utilize RL to
learn a policy to guide our recommendation heuristic.
that integrates the Factorization Machine (the wide component) and the Multi-
Layer Perceptron (the deep component) was proposed in [11]. xDeepFM, an
extension of the DeepFM that can jointly model explicit and implicit feature
interactions was proposed in [15]. In this paper, we select xDeepFM to predict
drivers’ probability of selecting an order as it combines both content-based and
content filtering (hybrid approach) and mitigates the cold start and data sparsity
issues by relying on the factorization of the sparse user-item matrix.
Dynamic Matching Problem. [3] introduced the concept of dynamic order match-
ing in a Peer-to-Peer (P2P) logistics platform, which shares similarities with the
problem addressed in this paper. The authors framed the problem as a two-
stage decision problem and proposed a multiple scenario approach that involved
sampling various driver selection scenarios and solving an integer program for
each scenario to generate the final menu. However, it should be noted that the
authors of [3] considered a simplified version of the DORP where drivers are
only allowed to select one order, and no time windows or capacity constraints
are imposed. Furthermore, the authors modeled the drivers’ preferences using a
pre-determined utility function. In contrast, the problem addressed in this paper
takes into account real operational constraints and incorporates drivers’ prefer-
ences based on historical data, making it a more realistic and comprehensive
representation of the DORP.
We assume that dynamic orders are generated throughout the planning horizon,
and at a predetermined frequency, the platform consolidates the newly arrived
dynamic orders and any unassigned orders and generates a personalized order
recommendation list for the crowdsourced drivers. The dynamic orders consist
of pickup and delivery tasks, order size, time window requirements for both
pickup and delivery, and an expiration time. If a driver arrives at the pickup
or delivery location earlier than the specified time window, a waiting time will
be incurred. In the event that no driver accepts an order and arrives at the
pickup location after the specified time limit, the order will be automatically
canceled. A simple illustration of the proposed problem can be found in the
supplementary materials2 .
The objective of the problem is to maximize the number of orders fulfilled
within the given planning horizon, specifically, a full working day. This objec-
tive can be achieved by ensuring that the order recommendation list considers
both the driver’s preference and routing considerations. The driver’s preference
ensures that they have a high probability of selecting at least one of the recom-
mended orders, while the routing considerations ensure that drivers with optimal
delivery routes can fulfill more orders while minimizing delivery costs.
2
https://anonymous.4open.science/r/iccl2023-7ADC/SupplementaryMaterials.pdf.
22 Z. Zhang et al.
the pre-decision state to the post-decision state, which occurs after executing
the action xk . However, this second transition introduces additional complexity
due to multiple uncertainties that influence the outcome of the action taken.
Specifically, there are two intermediate states that arise from two sources of
uncertainty, namely the selection behaviors of the drivers and the assignment
rule.
Reward/Objective Function. The reward R(Sk , xk ) of an action xk is defined
as the total number of orders fulfilled for the decision epoch k. The solution to
this Markov Decision Process (MDP) is a policy that generates the menus at
each state. The optimal solution is the policy that maximizes the current reward
and the total expected rewards for future states.
4 Solution Approach
Generating the personalized order recommendation lists directly based on the
model formulation presented in the previous section will involve an enumeration
of all possible permutations of unassigned orders of variable length for each
driver. Even if we limit the length of the lists, the action space will still be
combinatorial. Thus, to address the curse of dimensionality, we design a heuristic
approach to generate the recommendation lists.
The remaining part of this section is organised as follows. We first introduce
how our proposed ARH generates the recommendation lists. We then explain
the two key components within the ARH, namely how RL is used to train the
weightage selection policy for this heuristic; and how the drivers’ preference is
modelled and learnt from historical data by using xDeepFM.
In DORP, the state space and action space are too large to enumerate for large-
scale problems. Compared with most dynamic same-day delivery problems such
as [10,12] where a decision involves the assignment of one dynamic order to
one rider, our problem involves a multi-order multi-driver matching, which leads
to a very large state space. In addition, the action space also faces the “curse
of dimensionality”. Unlike the action space in [3], the recommendation lists for
riders in DORP need to be sorted by the relevance (rider-order relevance) which
contributes to a larger action space. This task is even made more challenging
as decisions need to be done in real-time. Thus, we propose a heuristic solution
approach to aggregate the action space.
Heuristic Scoring Function. In our heuristic, the key idea is to compute a
driver-order pair score based on the routing cost and riders’ personal preference.
For a given rider-order pair score, the higher this score is, the more suitable
for this rider to serve this order by our consideration. The recommendation list
for each rider is then generated by these scores. The function to calculate the
driver-order score is given as follows:
24 Z. Zhang et al.
– A routing heuristic to calculate the cost for a given driver to serve a given
order.
– RL-trained policy to determine the weights attached to each objective at a
given pre-decision state.
– Current time t for this decision epoch k. As the authors of [6,26] select the
current point of time of this decision epoch, we also include this feature in
our representation of state.
26 Z. Zhang et al.
The action is to select one heuristic to calculate the rider-order pair score for
this current decision epoch. To be more specific, we use five parameters in our
proposed heuristic. These five values of α are 0, 0,25, 0.5, 0.75 and 1, respectively.
The transition and reward function remain the same as the original MDP.
Then, we formulate this problem and use a RL approach to train a policy to
select a proper heuristic to calculate the rider-order score in each decision epoch.
By these scores, we can directly generate the personalized recommendation lists
to riders.
Deep Q-Learning for Weight Selection. For a given state Sk , the action xk is
to select the most proper heuristic to generate the personalized recommendation
lists for riders. Thus, the solution of the DORP is to learn a policy π ∈ Π that
selects the action for each state. The optimal solution π∗ can maximize the total
reward. Q-learning [30] learns a value Q(Sk , x) for each state and each action
pair. This Q-value can estimate the immediate reward plus the expected future
rewards if the action x is taken to the state Sk . Definitely, we cannot calculate
and learn all these state-action pairs due to the size of the state and action space
(even after aggregation). Thus, we use the DQN approach to estimate this value.
When Routing Meets Recommendation 27
from the pickup point to the dropoff point. The challenge is the former. Unlike in
typical RS, this feature is highly dynamic. The main challenge lies in calculating
this value in real time as there could be some error in calculating expected versus
actual values since such highly dynamic data may not be available or accurate.
If it is not handled properly, these errors could accumulate over time, resulting
in incorrect predictions. To overcome this, we propose to use a proxy feature,
i.e., last known location to represent the current location of the driver in order
to calculate the distance of the current location to the pickup location. During
training, the last known location of a driver is defined as the location at the time
point when jobs have last been accepted by this driver.
Negative Sampling. The given dataset consists of historical jobs accepted by
individual drivers (item features), as well as a list of drivers (user features). For
training the model, the accepted jobs form the positive training samples. As the
platform does not allow drivers to reject jobs, all non-accepted jobs are auto-
matically considered implicit negative samples. Hence most samples recorded
are positive use cases. Thus, implicit negative samples are generated to ensure
a more balanced data during model training. However, not all negative samples
are useful and they may in turn introduce noise, hence negative sampling was
used to decrease the amount of negative use cases. The chosen approach is by
filtering within a time window and computing distance relative to the driver’s
most recent job/known location. We made the assumption that an active driver
who accepted order(s) or performing a job at a given time window is likely to
have seen and “rejected” the non-accepted jobs at the same time window.
5 Experiments
We evaluate our proposed approach using a real-world dataset that we collected
from a local logistics platform. We ensure that the problem settings and input
data closely resemble the real-world scenario.
where N is the number of total training instances, yi and yˆi is predicted value 0
or 1 (accept or not) and the related label.
Metrics. To evaluate the performance of the various baseline models, we use
Precision@k and Recall@k where k refers to the top-k order recommendations
given to a driver. To be more specific, the precision@k is defined as the number
of recommended orders at top-k that are accepted by a given driver divided by
k which is the number of top-k orders the model recommended to this driver.
# of recommended orders @k that are accepted
P recision@k = (6)
# of recommended orders @k
30 Z. Zhang et al.
The recall@k is the proportion of accepted orders found in the top-k recom-
mendations.
# of recommended orders @k that are acceptd
Recall@k = (7)
total # of accepted orders
We choose k = 3 and 5 as these are realistic lengths of a recommendation list
that are both not too long (difficult to make decision) but at the same time give
drivers enough options to choose.
Table 1. xDeepFM model with additional distance features outperforms the other
baseline models in both precision and recall measures.
Firstly, we show the evaluation results of the prediction model. Then, we compare
the results of our ARH method with three baseline models described in Sect. 5.1.
xDeepFM. Figure 3 visualizes the convergence of the models with different
components. In each graph, the horizontal axis is the epochs, while the vertical
axis is the log loss. The blue and orange lines show the loss in the training data
set and test data set, respectively. From these graphs, we can find that the FM
model cannot predict a good result as the loss on test data set converge to about
1.25 to 1.5 after training. If introducing the DNN, the test loss of the prediction
model will reduce to about 0.6 but it is not stable, comparing with the FM and
the CIN model. The final prediction model, xDeepFM with all components can
get the best results as the test loss converge to the smallest value.
Table 1 also shows the results that xDeepFM with the full suite of components
outperforms the classical FM or FM with the subsets of the components. In
addition, inclusion of distance-based feature improves the prediction.
ARH. After having trained 200 episodes (one episode includes 144 epochs) on
the training dataset, our proposed ARH with the trained DQN policy can achieve
better results on most test datasets. Table 2 shows the results of the performance
of our proposed ARH and three baseline algorithms. The entries are the order
fulfilled rate defined as Eq. 9, so larger numbers represent better performance.
The first column is instances in the two weeks data set. The number in the brack-
ets represents the total number of the dynamic orders in one instance. Columns
two to four are the results of three baseline models described in Sect. 5.1. The
fifth column shows the best results among these three baselines. The last column
shows the performance of our proposed ARH. From this table, we can observe
that our proposed ARH can get the best results in 10 instances. ARH is the
worst only in instance 12 with a slight difference. For the remaining instances,
the ARH can outperform some of the baselines. As the statistical results shows,
the average order fulfill rate of the ARH on the test dataset is the highest and
the its variance is the smallest.
Apart from the order fulfill rate, we present a summary of the comparison
on two other metrics (namely, the total routing cost and the driver preference
measured by the probability of the driver accepting the order recommended to
him/her based on the driver’s historical data). Figure 4 shows that for rout-
ing cost, RH (i.e., solely considering routing cost) achieves the best perfor-
mance compared with the other three algorithms (which is expected). Similarly,
DP performs best in terms of driver preference compared with the other three
When Routing Meets Recommendation 33
References
1. Agussurja, L., Cheng, S.F., Lau, H.C.: A state aggregation approach for stochastic
multiperiod last-mile ride-sharing problems. Transp. Sci. 53(1), 148–166 (2019).
https://doi.org/10.1287/trsc.2018.0840
2. Akiba, T., Sano, S., Yanase, T., Ohta, T., Koyama, M.: Optuna: a next-
generation hyperparameter optimization framework. In: Proceedings of the 25th
ACM SIGKDD International Conference on Knowledge Discovery & Data Mining,
pp. 2623–2631 (2019). https://doi.org/10.1145/3292500.3330701
3. Ausseil, R., Pazour, J.A., Ulmer, M.W.: Supplier menus for dynamic matching
in peer-to-peer transportation platforms. Transp. Sci. 56(5), 1304–1326 (2022).
https://doi.org/10.1287/trsc.2022.1133
4. Bent, R.W., Van Hentenryck, P.: Scenario-based planning for partially dynamic
vehicle routing with stochastic customers. Oper. Res. 52(6), 977–987 (2004).
https://doi.org/10.1287/opre.1040.0124
5. Berbeglia, G., Cordeau, J.F., Laporte, G.: Dynamic pickup and delivery problems.
Eur. J. Oper. Res. 202(1), 8–15 (2010). https://doi.org/10.1016/j.ejor.2009.04.024
6. Chen, X., Ulmer, M.W., Thomas, B.W.: Deep Q-learning for same-day delivery
with vehicles and drones. Eur. J. Oper. Res. 298(3), 939–952 (2022). https://doi.
org/10.1016/j.ejor.2021.06.021
7. Chen, X., et al.: Sequential recommendation with user memory networks. In: Pro-
ceedings of the Eleventh ACM International Conference on Web Search and Data
Mining, pp. 108–116 (2018). https://doi.org/10.1145/3159652.3159668
8. Chen, Y., et al.: Can sophisticated dispatching strategy acquired by reinforcement
learning? In: Proceedings of the 18th International Conference on Autonomous
Agents and MultiAgent Systems, pp. 1395–1403 (2019). https://doi.org/10.48550/
arXiv.1903.02716
9. Cheng, H.T., et al.: Wide & deep learning for recommender systems. In: Proceed-
ings of the 1st Workshop on Deep Learning for Recommender Systems, pp. 7–10
(2016). https://doi.org/10.1145/2988450.2988454
10. Dayarian, I., Savelsbergh, M.: Crowdshipping and same-day delivery: employing
in-store customers to deliver online orders. Prod. Oper. Manag. 29(9), 2153–2174
(2020). https://doi.org/10.1111/poms.13219
34 Z. Zhang et al.
11. Guo, H., Tang, R., Ye, Y., Li, Z., He, X.: DeepFM: a factorization-machine
based neural network for CTR prediction. arXiv preprint arXiv:1703.04247 (2017).
https://doi.org/10.48550/arXiv.1703.04247
12. Hou, S., Gao, J., Wang, C.: Optimization framework for crowd-sourced delivery
services with the consideration of shippers’ acceptance uncertainties. IEEE Trans.
Intell. Transp. Syst. 24(1), 684–693 (2022). https://doi.org/10.1109/TITS.2022.
3215512
13. Joe, W., Lau, H.C.: Deep reinforcement learning approach to solve dynamic vehicle
routing problem with stochastic customers. In: Proceedings of the International
Conference on Automated Planning and Scheduling, vol. 30, pp. 394–402 (2020).
https://doi.org/10.1609/icaps.v30i1.6685
14. Li, X., et al.: Learning to optimize industry-scale dynamic pickup and deliv-
ery problems. In: 2021 IEEE 37th International Conference on Data Engineering
(ICDE), pp. 2511–2522. IEEE (2021). https://doi.org/10.1109/ICDE51399.2021.
00283
15. Lian, J., Zhou, X., Zhang, F., Chen, Z., Xie, X., Sun, G.: xDeepFM: combining
explicit and implicit feature interactions for recommender systems. In: Proceedings
of the 24th ACM SIGKDD International Conference on Knowledge Discovery &
Data Mining, pp. 1754–1763 (2018). https://doi.org/10.1145/3219819.3220023
16. McMahan, H.B., et al.: Ad click prediction: a view from the trenches. In: Proceed-
ings of the 19th ACM SIGKDD International Conference on Knowledge Discovery
and Data Mining, pp. 1222–1230 (2013). https://doi.org/10.1145/2487575.2488200
17. Mendoza, J.E., Castanier, B., Guéret, C., Medaglia, A.L., Velasco, N.: Construc-
tive heuristics for the multicompartment vehicle routing problem with stochastic
demands. Transp. Sci. 45(3), 346–363 (2011). https://doi.org/10.1287/trsc.1100.
0353
18. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature
518(7540), 529–533 (2015). https://doi.org/10.1038/nature14236
19. Pillac, V., Gendreau, M., Guéret, C., Medaglia, A.L.: A review of dynamic vehicle
routing problems. Eur. J. Oper. Res. 225(1), 1–11 (2013). https://doi.org/10.1016/
j.ejor.2012.08.015
20. Powell, W.B.: Approximate Dynamic Programming: Solving the Curses of Dimen-
sionality, vol. 842. Wiley, Hoboken (2011)
21. Powell, W.B.: Designing lookahead policies for sequential decision problems in
transportation and logistics. IEEE Open J. Intell. Transp. Syst. 3, 313–327 (2022).
https://doi.org/10.1109/OJITS.2022.3148574
22. Rendle, S.: Factorization machines. In: 2010 IEEE International Conference on
Data Mining, pp. 995–1000. IEEE (2010). https://doi.org/10.1109/ICDM.2010.
127
23. Resnick, P., Varian, H.R.: Recommender systems. Commun. ACM 40(3), 56–58
(1997)
24. Ritzinger, U., Puchinger, J., Hartl, R.F.: A survey on dynamic and stochastic
vehicle routing problems. Int. J. Prod. Res. 54(1), 215–231 (2016). https://doi.
org/10.1080/00207543.2015.1043403
25. Secomandi, N.: A rollout policy for the vehicle routing problem with stochastic
demands. Oper. Res. 49(5), 796–802 (2001). https://doi.org/10.1287/opre.49.5.
796.10608
26. Ulmer, M.W., Thomas, B.W.: Meso-parametric value function approximation for
dynamic customer acceptances in delivery routing. Eur. J. Oper. Res. 285(1), 183–
195 (2020). https://doi.org/10.1016/j.ejor.2019.04.029
When Routing Meets Recommendation 35
27. Ulmer, M.W., Thomas, B.W., Mattfeld, D.C.: Preemptive depot returns for
dynamic same-day delivery. EURO J. Transp. Logist. 8(4), 327–361 (2019).
https://doi.org/10.1007/s13676-018-0124-0
28. Voccia, S.A., Campbell, A.M., Thomas, B.W.: The same-day delivery problem for
online purchases. Transp. Sci. 53(1), 167–184 (2017). https://doi.org/10.1287/trsc.
2016.0732
29. Wang, S., Hu, L., Wang, Y., Cao, L., Sheng, Q.Z., Orgun, M.: Sequential recom-
mender systems: challenges, progress and prospects. In: 28th International Joint
Conference on Artificial Intelligence, IJCAI 2019, pp. 6332–6338 (2019). https://
doi.org/10.24963/ijcai.2019/883
30. Watkins, C.J., Dayan, P.: Q-learning. Mach. Learn. 8(3), 279–292 (1992). https://
doi.org/10.1007/BF00992698
A Reactive-Periodic Hybrid Optimization
for Internal Hospital Logistics
1 Introduction
This research was conducted at the Fraunhofer Institute for Material Flow and Logistics
as part of the KIK-Dispo project, funded by the Federal Ministry of Education and
Research of Germany (Grant No. 01IS19041B). The responsibility for the content lies
with the authors.
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 36–55, 2023.
https://doi.org/10.1007/978-3-031-43612-3_3
A Reactive-Periodic Hybrid Optimization for Internal Hospital Logistics 37
health systems, which results in guaranteeing higher levels of health services for
a major part of the population. According to data from 1980 onwards, in most
OECD1 countries the percentage of GDP spent on health has increased over
time [9]. Where, for example, Germany in 2020 allocated 12.5% of its GDP for
healthcare, with approximately 30% of that expenditure being directed towards
hospital services [18]. Logistics operation among different hospital activities is
responsible for 20% to 40% of total hospital cost [11]. Therefore one of the
interesting topics for researchers is investigating efficient algorithms for reducing
the costs of logistics services.
In order to enhance the efficiency of hospital logistics, a crucial part of the
cost, Internal Hospital Logistics (IHL) optimization must be prioritized. Effective
management of IHL has been a longstanding challenge. Recently, the COVID-19
pandemic has added additional challenges to IHL operations by placing a heavy
burden on their processes. Developing efficient IHL operations frees up valuable
transport resources, particularly when hospitals face strict distancing guidelines
and limited transport resources. In such circumstances, hospitals may need to
quickly expand intensive care capacities to ensure continued care in the event of
increased patient numbers. Additionally, there has been a significant increase in
demand for protective equipment and hygiene products. Logistics management
must therefore implement fast, pragmatic solutions to prevent such bottlenecks
within hospitals.
IHL’s timely transport services are typically planned between different loca-
tions within the hospital with an intention of maintaining continuous care. As
such, the problem of collecting and delivering patients and materials (e.g., medi-
cal equipment and consumables) within hospitals can be considered as the Pickup
and Delivery Problem with Time Windows (PDPTW) [8]. However, in real-world
IHL problems, uncertainty in the transport data (e.g. unknown arrival times or
time windows) makes it challenging to solve such a dynamic problem, efficiently.
To effectively handle the dynamic PDPTW challenges, several steps need to
be taken. One crucial step is collecting and arranging transports for optimization.
This involves determining how to collect data on new transports, how to integrate
them with existing ones, and how to prioritize them. Another step is selecting
an appropriate algorithm that can handle the dynamic nature of the PDPTW.
The algorithm should rapidly and efficiently re-optimize schedules when new
transports arrive or conditions change. Finally, it is essential to implement the
schedules produced by the algorithm. This can be achieved using an automated
transport management system that assigns transports to transporters, tracks
their progress, and updates schedules in real-time. Addressing all these steps is
crucial for providing high-quality services at a minimal cost.
Like many other optimization problems, the PDPTW can be solved using
exact and heuristic methods. These methods are mostly differentiated by their
1
The OECD (Organisation for Economic Co-operation and Development) is an inter-
national organization made up of 38 member countries. These member countries are
generally considered to be developed, industrialized economies with relatively high
levels of economic growth, stability, and well-being.
38 E. Ehsanfar et al.
According to a study [2], a hospital with 1000 beds typically requires approxi-
mately 750 transportations daily. Transports are requested by clinical and execu-
tive staff and conducted by transporters. In this paper, data from a pilot hospital
in Germany with approximately 2100 beds is utilized. This data is collected from
specialized software designed for logistics and services management. The soft-
ware facilitates the hospital in acquiring data pertaining to new transportation
requests, assimilating them with the already existing ones, and classifying them
based on critical factors such as urgency and proximity. The average statistics
for transports and tours in this hospital are presented in Table 1.
In order to enhance the efficacy of IHL transport operations, medical institu-
tions often implement management software systems, which facilitate the record-
ing, processing, and handling of all transportation operations while also furnish-
ing electronic assistance for manual dispatch and automatic scheduling. Auto-
mated scheduling software allows hospitals to distribute transport services based
on current demand and available resources. In the daily operations of hospi-
tals, medical staff members enter transport orders into the logistic management
software. These orders contain information about each transport’s pickup and
delivery locations, time windows, transport type, and priority. Logistic managers
then assign these transports to available transporters based on their skills and
availability.
Table 1. The average daily statistics for transports and tours in the pilot hospital.
Transports Tours
Number or Transports 1900 Number of Tours 122
Completed Transports 1500 Tours at Night Shifts 26
Emergency Transports 40 Tours at Day Shifts 96
Patient Transports 1100 Average Duration 550 min
Cito Transports 220 Average Number of Transports 19
Material Transports 570
Fig. 1. Example of a transport tour based on real data collected from a hospital. The
tour consists of 22 transports that were performed between 14 locations. (Color figure
online)
From the perspective of hospitals, reducing logistic costs and improving trans-
port service quality are two critical objectives. Providing high-quality service
means ensuring that each transport service is provided within its predefined
time window. Longer travel times result in higher costs for the hospital, partic-
ularly for skilled transporters who are paid more per hour of work. As a result,
hospitals aim to efficiently utilize these valuable resources. Minimizing the total
travel time of transporters helps to meet the hospitals’ demand for cost reduc-
tion and efficient resource utilization, while also ensuring that transport services
are provided within the designated time windows.
42 E. Ehsanfar et al.
The problem involves assigning each transport to a transporter with the nec-
essary skill and determining the sequence of transports to minimize the total
travel time for all transporters. The objective of the current paper is to mini-
mize the sum of the lateness of all transports and the total travel time for all
transporters while also considering the various constraints and characteristics of
the transport and transporters. For example, each transport should be assigned
to a transporter with the necessary skill, and it is necessary to ensure that each
transport is conducted within its given time window.
3 Algorithm
The dynamic and complex nature of hospital logistics has resulted in an increased
demand for efficient algorithms that can quickly adapt to unexpected changes
to maintain high-quality patient care services. In response, a novel algorithm is
presented in this section, which consists of a pre-processing phase followed by a
two-step heuristic solver.
3.1 Pre-processing
Urgent Pool: To schedule transports from the urgent pool, the algorithm fol-
lows a reactive scheduling policy, which focuses on urgent transports. We clas-
sify the urgent pool based on their priority level as either emergencies or highly
urgent transports. Emergencies have the highest priority and must be handled
immediately without any combination with other transports. Examples of emer-
gency transports include emergency medical supplies for the operating room,
moving patients who have suffered accidents, or transporting critical patients
44 E. Ehsanfar et al.
from the emergency department to the intensive care unit. As soon as the sys-
tem identifies an emergency, it assigns the best available transporter, i.e., the
available skilled transporter closest to the pickup location. The process of assign-
ing the best transporters to each transport is shown in Algorithm 2.
Step 1 - Earliest due date first (EDDF) Algorithm 6: The earliest due date
first algorithm is employed to optimize tours considering the service due date
such that transports with earlier due dates are assigned first. As a result of
step 1, a set of tours is constructed as the initial solution.
Step 2 - R&R algorithm: The R&R algorithm is applied to improve the initial
solution. This algorithm is repeated several times to optimize tours while
minimizing total service lateness. The number of transports to be reassigned
and the number of iterations at each iteration can be adjusted based on the
transports pool size and the desired solution quality. In the next part, we will
discuss R&R in general and the algorithm in more detail.
has since been extensively studied and improved by researchers in the field. For
example, Ropke and Pisinger [15] developed various ruin methods such as ran-
dom, worst, cluster, or history-based, and then a greedy algorithm was used to
recreate the solution. Later, Pisinger and Ropke [10] proposed an R&R algorithm
that employs adaptive large neighborhood search. More recently, the state-of-
the-art R&R algorithm has been proposed for the Vehicle Routing Problem with
Time Windows (VRPTW) [3].
The R&R algorithm is particularly effective in dealing with complex and
“discontinuous” problems, where small changes in the solution can result in large
differences in quality. This is often the case in the VRPTW, where solutions must
meet a variety of constraints and a small change in the scheduled transportation
can cause a solution to become infeasible [3]. Using a large number of transports
to remove from the solution can significantly enhance the performance of the
R&R algorithm, allowing it to explore a larger solution space and increase the
likelihood of finding high-quality solutions [3].
This paper optimizes scheduled transportation by employing the R&R algo-
rithm (Algorithm 7). The algorithm begins by calculating an empty travel factor
for each tour, which represents the total empty travel duration of each trans-
porter divided by the total tour duration. This factor indicates the amount of
time each transporter spends traveling between locations without performing
any services. Tours with higher empty travel factors are more likely to benefit
from improvement.
The R&R algorithm sorts tours based on their empty travel factor and
attempts to reduce the empty travel duration by reassigning transports to other
transporters.
Ruin: The iterative R&R algorithm initiates by identifying the tours with the
highest empty travel time factor. At each iteration, the tours are sorted based on
this factor, and the tour with the highest score is chosen to be ruined. Ruining
the chosen tour involves removing all transports from the tour and reintroducing
them back into the transport pool.
Recreation: The recreate step has two sub-steps: (1) inserting the removed
transports into another existing tour or (2) creating a new tour with an empty
transporter. When deciding on the insertion location for a removed transport,
a simple criterion of minimizing the overall lateness caused by the insertion can
be taken into account.
When determining the insertion location for a removed transport, the inser-
tion is accepted into the existing tour if it adds zero lateness. If there is a late-
ness, the tour is continued to be built, and the overall lateness is compared to
the previous overall lateness. If the new tour has less lateness, the new insertion
is accepted. From an operational perspective, it is crucial to acknowledge that
transporters have limited knowledge of their future transports, usually, they are
only aware of a small number of upcoming ones. Therefore, the creation of new
tours during rescheduling does not present significant challenges. This is mainly
because the rescheduling process and the generation of new tours occur seam-
lessly in the background, ensuring a streamlined and effective workflow for the
A Reactive-Periodic Hybrid Optimization for Internal Hospital Logistics 49
The hybrid algorithm proposed in this paper is illustrated in Fig. 2. The imple-
mentation of the hybrid algorithm starts with creating a list of transporters that
includes their names, availability, and the last seen location of each. If a trans-
porter has conducted any transport previously, the last location is considered
as the delivery location of the last transport. Otherwise, it is assumed that the
transporters begin their shift from the logistics center of the hospital.
4 Computational Experiment
A set of computational experiments was conducted based on real-world hospital
data. The hospital in question has an average of 42 transporters per working
shift, with facilities located on different campuses, buildings, levels, and rooms,
totaling 364 locations. The computational experiments involved simulating 6
months of logistic planning, from January 1st, 2019 to June 30th, 2019, based on
actual historical data. All transport orders and schedules were stored in the hos-
pital’s logistic management systems. Transporter route schedules were assigned
manually by logistic operators or automatically by a greedy algorithm. The
transport orders were sorted based on their arrival time to ensure chronological
processing. The list of available transporters was obtained from the historical
data, and this information was used to create shifts for each transporter. Table 2
summarizes the primary features of the ILH data. The optimization period was
set to 10 min. Therefore, the system waits for 10 min for re-optimization and
then optimizes the non-urgent transports. We conducted the experimental results
using a machine with 40 CPU cores and 187 GB of RAM. In all cases, the process
of reactive assignments occurred within a matter of milliseconds, while periodic
assignments, including the ruin & recreate phase, took a maximum of 1.43 s per
period.
Table 2. The primary features of the historical data and their respective types
To create shifts, the first timestamp that a transporter appeared in the data
was set as the start time of the shift, and the last timestamp was set as the
A Reactive-Periodic Hybrid Optimization for Internal Hospital Logistics 51
end time of the shift. This allowed for the assignment of transports to specific
transporters based on their availability during their shifts. A loop was used to
feed each transport into the hybrid algorithm, and each was assigned to the most
suitable transporter. The waiting time for each transporter was considered the
same as the real data to ensure accurate simulation. Once tours were scheduled,
lateness and travel time were recorded.
To assess the performance of the hybrid algorithm, the results were com-
pared to the actual historical data. The evaluation metrics consisted of the total
lateness for both emergency and non-emergency services, the total transporters’
travel time, the percentage of emergency transports delivered on time, and the
travel time balance index (T T BI) among transporters. A higher T T BI value
indicates an uneven distribution of travel time among the transporters. The
T T BI is calculated using the following formula:
where max travel time is the total travel time taken by the transporter with
the highest workload, min travel time is the total travel time taken by the
transporter with the lowest workload, and average travel time is the average
travel time taken by all the transporters. A higher T T BI indicates an uneven
travel time distribution among the transporters.
It was observed that weekends and holidays had an average of 423 daily
transports per day, whereas workdays had an average of 2070 daily transports.
Therefore, two types of days were defined in hospital operations: busy and non-
busy days, where non-busy days were considered those with less than 500 trans-
ports per day. Table 3 presents the average daily evaluation parameters for both
busy and non-busy days, where the results of the hybrid algorithm are compared
with the hospital records. Additionally, the benchmarks for each day of the week
from Monday to Sunday were evaluated and presented in Tables 4, 5, 6, 7 and
8. The results indicate that the approach was effective in optimizing scheduled
tours for different days and transport volumes.
Table 3. The average daily evaluation parameters for days with less than and more
than 500 transports during the six-month period of historic hospital operation data.
The results of the hybrid algorithm are compared with the hospital records
Table 4. The average daily sum of lateness, in minutes, for emergency transports on
weekdays within the experience horizon, as obtained from the results of the hybrid
algorithm and hospital records
Table 5. The average on-time delivery percentages of emergency transports for week-
days within the experience horizon, comparing the results of the hybrid algorithm with
hospital records.
Table 6. The average daily TTBI parameter for weekdays within the experience hori-
zon for the results of the hybrid algorithm and the hospital records.
Table 7. The average daily sum of lateness, in minutes, for non-urgent transports on
weekdays within the experience horizon, as obtained from the results of the hybrid
algorithm and hospital records
Table 8. The average daily sum of total travel time, in minutes, on weekdays within the
experience horizon, as obtained from the results of the hybrid algorithm and hospital
records
References
1. Bent, R., Van Hentenryck, P.: A two-stage hybrid algorithm for pickup and delivery
vehicle routing problems with time windows. Comput. Oper. Res. 33(4), 875–893
(2006). https://doi.org/10.1016/j.cor.2004.08.001
2. Bredehorn, T., Johow, L.: KI in der Krankenhaus-Transport-Disposition (2020).
https://publica.fraunhofer.de/handle/publica/265840
3. Christiaens, J., Vanden Berghe, G.: Slack induction by string removals for vehicle
routing problems. Transp. Sci. 54(2), 417–433 (2020). https://doi.org/10.1287/
trsc.2019.0914
A Reactive-Periodic Hybrid Optimization for Internal Hospital Logistics 55
Department of Management Information Systems, San Diego State University, San Diego,
CA 92182, USA
[email protected]
1 Introduction
If anything, the last three years have taught the maritime industry that the supply chain
is vulnerable and very much dependent on global events. Ports around the world have
witnessed some of the biggest cargo congestion challenges in modern history. This
congestion, driven by a continuous increase in the gap between consumer demand and
logistics supply, exacerbated by supply-chain disruptions (among which, cybersecurity
breaches are a main concern), is negatively impacting the maritime footprint, despite
large investments by the industry to streamline, modernize, and reduce supply chain
carbon emissions. Consumer demand for goods shipped across the world is set to be
steady or increase. Major ports around the world are witnessing similar trends, leading
to a rise in regional congestion and emissions at bottlenecks, whether in the port, at port
gates, or in dray corridors. Saxon and Stone (2017) build on the container disruption of
the shipping business and how the industry unfolds in the next 50 years. They suggest
that by 2067, the annual growth of global trade of containers will range from 1.9% to
3.2% in the high case scenario. This equates to an increase from 182M TEUs (2016)
to 464M TEUs and 858M TEUs by 2067 respectively for low and high. Every year,
container ships plying the world’s waterways spew about 1 billion metric tons of CO2
into the air (about 3% of global greenhouse gas emissions). Transportation stakeholders
must continue to invest and develop groundbreaking solutions that monitor, track, and
reduce cargo’s carbon footprint.
The increase of a global cargo movement, coupled with the added complexity of
equipment and systems that are deployed to optimize the movement of goods, pose a
new challenge of measuring the impact of intermodal container travel. This comes at a
critical time as private sectors and government agencies are seeking reliable measure-
ment sources to accurately assess carbon neutrality and drive decarbonization. Despite
many efforts to measure carbon impact, there continue to be a lack of standardization
and discrepancies among the metrics deployed to measure GHG emissions and their
impact. The goal is to accurately measure GHG emissions and levelized costs of Mt-km
(LCOTKM) analyzing the multi-variation by modes of energy and goods transportation
and compare to projected values. Figure 1 shows a shipping emissions projection.
To control carbon emissions, we first need to accurately measure their impact. We
propose a Digital-Twin enabled, AI-driven RT Distributed Optimization Platform, based
upon a system level view of the intermodal network, mapping energy consumption, and
calculating emissions, and offering suggestions for route and mode switching of contain-
ers, which could greatly improve those metrics. Due to the diversity and complexity of
58 C. Paternina-Arboleda et al.
data this Platform would depend upon, success requires a massive data ingestion from
hundreds of data sources while combining extensive data modeling, AI processing,
and algorithms along with simulation. Investing in a high-fidelity simulation Platform
such as Argonne’s POLARIS simulation (Auld et al., 2016), and proven optimization
meta-modeling information systems (Velasquez-Bermudez, 2020), is more beneficial
and rewarding in the long run, as real-time Intermodal efficient mode switch is essential
for these systems to convey good results. Jubiz-Diaz et al. (2021) modeled the Colombia
Intermodal Transportation Network, and examine the relationship between infrastruc-
ture investment, freight accessibility, and GDP, with a data-driven geospatial approach,
targeting investments with greatest GDP impact.
A digital twin is a virtual model designed to accurately reflect a physical object
by using real-time data (IBM, n.d.). According to IBM, there are four types of digital
twins depending on the level of magnification. To successfully implement a digital
twin infrastructure capable of real-time coordination and reduce carbon emissions for
intermodal freight transportation systems in the United States, several sources of data
and information will need to be integrated to form various component twins.
An AI-driven real-time distributed optimization (RT-DO) system is deployed, con-
sidering the dynamic nature of the intermodal freight transportation system. The model
uses real-time data and analytics to optimize operations, reducing costs and carbon emis-
sions. The system considers real-time data on transportation routes, cargo volumes, and
costs. The modeling methodology is based on the concepts of advanced parallel, and dis-
tributed analytics, as support to “artificial brains,” the decision support systems in the era
of Industry 4.0. The system is deployed at different levels. The first level presents real-
time distributed optimization mathematical models for cargo routing which are used to
determine the most efficient/sustainable path for traffic flow in a transportation network,
considering a wide range of factors such as intermodal switching, traffic, and demand
resource limitations. Intermodal switching refers to the transfer of cargo between differ-
ent modes of transportation, such as from a ship to a truck or from a train to a waterway.
We base our RT Optimization modeling framework in Large-Scale Optimization theo-
ries, such as Benders/Lagrangean decomposition, and a novel Event-Driven Real-Time
modeling structure described in Abril et al. (2023). Paternina-Arboleda et al. (2008)
show an integrated simulation-optimization framework for logistical systems.
The main contribution of the research paper entitled “Cybersecurity Considerations
for the Design of a Digital Twin Enabled AI-Driven Real-Time Distributed Optimization
of Container Carbon Emissions Reduction for Synchromodal Freight Operations” lies
in highlighting and addressing the crucial cybersecurity considerations associated with
the design and implementation of a digital twin-enabled AI-driven real-time distributed
optimization system for synchromodal freight operations.
The paper identifies the specific cybersecurity challenges that need to be taken into
account when designing such a system, including securing the data flow, ensuring the
security of digital twin models, and safeguarding the system against cyber-attacks. It pro-
poses practical solutions to address these challenges, such as implementing encryption
mechanisms, access control policies, anomaly detection systems, and other cybersecurity
measures.
Cybersecurity Considerations for the Design 59
From the cybersecurity perspective, Wang and Liu (2022) discusses the increasing
application of cyber-physical systems (CPSs) in the rail industry and the corresponding
rise in cyber threats that can cause significant failures and consequences. The authors
propose a risk management methodology for addressing cyber security risks in rail
CPSs, focusing on proactive identification, clear definition, and proper handling of these
risks. Beaumont (2018) presents a case study of automated maritime container terminals
(CTs). It has the aim of demonstrating that the risks derived from the use of technol-
ogy associated with the Fourth Industrial Revolution (4IR) are both real and dangerous.
In the shipping industry, the four largest shipping lines in the world have been hit by
cyber-attacks since 2017 (Song, 2021). Cybersecurity is significant in maritime trans-
portation because the costs of cyberattacks can be vast and the consequences fatal. For
instance, digital navigation systems could be manipulated so that they sheer off or run
aground, which would endanger the lives of the crew, people at sea and on land. The
financial effects on shipowners and ship operators would also be immense (Tsvetkova
et al., 2021). Woschank et al. (2020) mentions that AI may be used also in higher-level
processes to detect fraud, prevent cybersecurity threats, and generally optimize higher-
level processes for Smart Logistics in the future. De la Peña-Zarzuelo et al. (2020),
present Internet of Things and sensing solutions, cybersecurity, horizontal and vertical
system integration, cloud computing, 3D printing and additive manufacturing, big data
and business analytics, augmented reality and simulation and modeling are the pillars
of Industry 4.0. From all viewpoints, cybersecurity is a top concern for all AI-driven
digital systems of today’s logistics industry.
The paper emphasizes the critical importance of protecting the integrity, confidential-
ity, and availability of data in a digital twin-enabled AI-driven system. It underscores the
potential risks and vulnerabilities that can arise in the transportation industry, especially
when leveraging advanced technologies for real-time optimization and carbon emissions
reduction. This paper is significant as it sheds light on the importance of cybersecurity
in the context of digital twin-enabled AI-driven systems for synchromodal freight oper-
ations. It provides valuable insights for researchers, practitioners, and decision-makers
in the transportation industry, offering a foundation for the development and imple-
mentation of secure and resilient systems that can effectively reduce carbon emissions,
optimize operations, and improve sustainability.
Fig. 2. Methodology for predicting Emissions Inventories (an example for Port Emissions),
Paternina-Arboleda et al. (2023)
We are expanding the methodology to account for geospatial emissions for the inter-
modal network to be optimized based on the predicted metrics, with valuable insights
to improve environmental practices and foster sustainable development.
and provide reliable and efficient services, (d) have an appropriate level of redundancy,
to have multiple options for reaching their destination in the event of disruptions, and
(e) be environmentally sustainable (Yang et al., 2023). The global resiliency metric
of a system depends on the resiliency of each individual transportation mode within
because a transportation system is made up of multiple modes, each of which has its
own characteristics and vulnerabilities. The mode switching optimization models must
be tightly coupled with the disruption models. Some metrics proposed for our system
are:
• System capacity as a measure of the ability to handle traffic volumes during
normal/peak periods, considering number/capacity of lanes, number/frequency of
vehicles, availability of modes.
• Travel time reliability as a measure of the predictability of travel times of a network.
This metric considers i.e., congestion, incidents, and weather conditions that can
cause delays or disruptions.
• System adaptability as a measure of the ability of a transportation network to adapt to
changes in demand, technology, or other factors that can impact services. This metric
considers factors such as new technologies, flexibility to adjust based on demand,
and respond to disruptions.
• System redundancy as a measure of availability of alternative routes or modes in the
event of disruptions to the primary network. This metric considers factors such as
number and location of alternative routes, availability of transit or other modes, and
ability to switch between modes.
• Environmental impact as a measure of the sustainability, which takes into consid-
eration factors such as energy consumption, emissions, and the impact on local
ecosystems.
The resiliency of each transportation mode is influenced by a range of factors, includ-
ing quality and age of infrastructure, availability of alternative transportation modes,
the level of investment in research and development, and the degree of coordination
and communication between transportation providers and policymakers (Trucco and
Petrenj, 2023). Understanding resiliency of individual transportation modes is critical
to developing strategies for enhancing resiliency of whole transportation systems. Our
proposed solution is designed to determine the sensitivity to inputs such as routes, low
carbon infrastructure rollout, GHG reductions, and other variables that can help foster
resiliency in several ways. The solution can help identify vulnerabilities in the transporta-
tion system by simulating the effects of various input scenarios on system performance
via a digital twin. We can simulate the impact of a disruption to a key route or a delay
in the rollout of low carbon infrastructure and help identify areas where the system is
most vulnerable.
We could prioritize investments in infrastructure and other resources based on the
potential impact on system performance (identify most critical investments to enhance
resiliency of the system and reduce vulnerability to disruptions), while testing strate-
gies to enhance the resiliency of the system. It provides a visual representation of the
system’s performance under different input scenarios to help understand the potential
impact of actions and decisions on the resiliency of the system to promote more effective
collaboration. Intermodal logistics can be impacted by a range of disruptions (local and
62 C. Paternina-Arboleda et al.
global). Some examples are: (a) natural disasters, (b) labor strikes, (c) disruptions in the
supply chain, (d) global trade disruptions, (e) Pandemics, (f) Network Congestion, and
the focus of this article, (g) Cybersecurity threats. Effective management of disrup-
tions requires a comprehensive understanding of the system’s vulnerabilities, as well
as proactive planning and response strategies to mitigate the impact of disruptions and
maintain the resiliency of the transportation system.
in response to changing conditions for GHG emissions or the economic cost of avoided
carbon, while meeting efficiency needs (economic cost/freight ton-Km).
• Step 3 – Generation of the visual user Interface: The system automatically generates
a user interface. The shell windows are associated with main/secondary tables.
• Step 4 – Analysis and Review of the Algebraic Model Formulation: It involves coordi-
nating two simultaneous activities: Review loaded algebraic formulation into MMIS,
and results.
• Step 5 – Run the system and store the Algebraic Model results: Performed
automatically.
• Step 6 – Set the Algebraic Model: Necessary changes to the MMIS and IDIS. This
cyclic process ends when the implemented model produces the correct results, ready
to be delivered.
• Step 7 – Data Access by the end-user: Finally, the end-user can access the model for
use, based on the data stored in the IDIS and results generated by the models.
The predictions from all three base models will be combined for the final regional
inference of the best route the container should take based on the location, traffic condi-
tions, and weather. The algorithms for the AI need to focus specifically on minimizing
Cybersecurity Considerations for the Design 65
the CO2 emissions overall based on the ton-mileage and minimizing the time-to-arrival
when taking the route by avoiding any traffic. Since CO2 emissions are based on ton-
mileage, without exact emissions per hour data, the best the system can do is avoid
idling in traffic to reduce those idling emissions by minimizing the time-to-arrival and
ton-mileage for the route (Cammin et al., 2023; Paternina-Arboleda et al., 2023).
The AI system will continuously update this information as historical data con-
tinues to be generated and new optimized or updated routes are discovered. Figure 4
shows a high-level view of the AI Process from collecting regional historical data to
generating decisions from inferences made by the system. In previous research articles,
Paternina-Arboleda and Das (2001, 2005) demonstrate how artificial intelligence tech-
niques, specifically Multi-Agent RL, can be used to address complex logistics problems.
By leveraging the power of AI, logistics companies can optimize their operations, reduce
costs, and improve efficiency and customer satisfaction.
seasons have different weather patterns that must be kept in mind. Comparing yearly
data could prove to show an abundance of statistical insights. By gathering this historical
data, the AI model will have some data points and ability to make rudimentary decisions.
There is a variety of different information metrics that will assist in the training of
the model. To start off, priority levels need to be set for each method of transportation,
cost, schedule, weather, payload type and traffic patterns. These set priority levels will
lead the model to make more accurate predictions. These priority levels will also be a
large part of the COA selection list that the model outputs.
The mode of transportation and associated personnel are required to physically move
the goods from point A to point B. Trucks are, and will continue to be, the primary
method of travel. However, the transportation method could also be a boat or train and
the associated personnel for each method. Driver information for each situation will have
to be inputted into the system and protected. Driver will be used as a colloquial term in
this paper to represent a truck driver, ship driver and train driver into one central category.
This information varies from driver to driver and method to method. For example, a truck
driver will need to have all their medical information, personal information such as social
and next of kin, time on the road, consecutive days traveled, average speed, follow on
trip and GPS positioning data.
The system will need to keep payload information with set priorities for different
manifests. Each item or good will include the name, Radio Frequency ID (RFID), weight
and priority level. Each item in the manifest will combine to bring the overall payload
to a quantifiable level of priority. This will be inserted into the model for further course
of action training of data.
For the data to reach the Hub and central data repository securely, a data transportation
method needs to be established. The endpoint devices and users will be connected to
the Hub via a private Virtual Private Network (VPN). A VPN is used to “mitigate the
security risks inherent to providing remote network access by offering strong encryption
to provide data security and strong authentication to limit access to applications based
on defined security policies” (Loshin, 2019). Aggarwal, (2022), and Aggarwal (2023)
define data encryption as a security tool that transforms plaintext data into encoded data
called ciphertext that can only be read with a unique key that is given at the time of
the encryption, which can be used for both in transit and at rest data. The two different
types of VPN that the system can implement are TLS VPN or IPsec VPN in tunnel
mode. Both modes reach the same conclusion of secure data traffic and communication
through slightly different means. A TLS VPN lives in the application layer, layer 4.
IPSec VPNs live in the network layer, layer 3. In other words, “The major difference
between an IPsec VPN and an SSL VPN comes down to the network layers at which
encryption and authentication are performed” (Loshin, 2019). Again, both VPNs provide
a private, secure network for communication, data storage, data transmission and network
operations. The project manager can implement whichever one his engineering team
recommends.
The above criteria, data, and information can be combined for model use to output the
Courses of Action (COAs). The model will use the data as inputs and produce COAs that
show cost-reduction, environmental benefit, scheduling patterns, transportation method,
Cybersecurity Considerations for the Design 67
destinations, and departure suggestions. These COAs will be the optimal transportation
strategy and give management options to pursue.
The goal of our system is to provide transportation efficiency through automation, costs-
reducing methods, and increase sustainability of the transportation ecosystem. Artificial
intelligence (AI) has the potential to improve the intermodal freight transportation sys-
tem’s efficiency by optimizing operations in real-time and self-evolving the models to
make better/faster decisions. The system accomplishes this goal. For the system to give
accurate predictive and reactive solutions, there needs to be data structure, software sys-
tem, and hardware system requirements of the system. When implemented correctly, the
system will output a corrective course of action that is an optimal solution.
Identifying and establishing a data structure is the beginnings of building out the
system. The data structure needs to be set up in a way that optimizes the dashboard’s
utilities. The structure of the data will be in a data frame form. What this will do is
allow for data to be structured cleanly and in a format that can be queried. A data frame
is defined as “a data structure that organizes data into a 2-dimensional table of rows
and columns, much like a spreadsheet” (“Data Frames,” 2022). This allows for easy
sorting and indexing of the raw data. The data structure will be set up very similarly
to an excel spreadsheet with numbered rows and columns. Data frames are “one of the
most common data structures used in modern data analytics because they are a flexible
and intuitive way of storing and working with data” (“Data Frames,” 2022). This ease of
implementation and flexibility will pay dividends once queries, analysis and data start
to flow through. As an AI-based model that will be learning from the data provided, the
flexibility of data frames allows for results to be optimized.
The proposed decision support system follows as an extension of Port Community
Systems (PCS), Freight optimization systems, and Freight brokering systems. A PCS is
a platform that connects the various stakeholders in a port community, such as shipping
companies, freight forwarders, port authorities, and customs officials, to facilitate com-
munication, coordination, and collaboration. Data is a crucial component of a PCS, and
various data types are involved in the system (Moros-Daza et al., 20167, 2018, 2020).
Vehicle routing optimization (VRO) is the process of finding the most efficient routes for
vehicles to transport goods or people. Data is a crucial component of VRO, and various
data types are involved in the optimization process (De la Cruz et a., 2013, Amador et al.,
2014, Palma et al., 2019). A freight brokering system is a platform that connects ship-
pers with carriers to facilitate the movement of goods. Data is a crucial component of a
freight brokering system, and various data types are involved in the system (Moros-Daza
et al., 2019). Sarabia et al. (2006), present simulation-based decision support models as
computer-based tools used to optimize river cargo transportation. These models simu-
late the river system and cargo flows to help decision-makers identify the most efficient
transportation routes and schedules. Various data types are involved in SDSM for river
cargo transportation. The typical data types that need to be consider in such systems
follow (Table 1).
68 C. Paternina-Arboleda et al.
This section focuses on the design of the data flow system that enables real-time synchro-
modal transportation optimization using digital twin technology and AI. The proposed
system aims to provide a comprehensive view of the transportation network, including
infrastructure, assets, and operations, to optimize the transportation flow and minimize
carbon emissions. The system’s design includes the integration of various data sources,
such as real-time sensor data, historical transportation data, and external data sources,
to enable the AI algorithms to make informed decisions in real-time. Figure 4 shows the
context diagram for the system data flow.
The primary data input into the system at regular intervals will be the GPS technology
tracking the containers. The GPS tracking will be used by the system to create the content
twin of the containers as they are moved between modes of transport from starting point
to destination. Supplemental information such as the current engine being used to move
the container, current starting point, and destination will also need to be gathered each
time the container changes mode of transport until the container reaches its destination.
This information will allow the system to track CO2 emissions based on the engine
being used for transport and calculate anticipated & actual CO2 emissions based on the
starting point, destination, and actual miles traveled.
To create the content twins of weather and traffic conditions, real-time weather,
and traffic data APIs such as Weather.gov, WeatherAPI.com and Google Traffic will be
utilized to access the available data from the cloud. With this information collected, the
system will be able to monitor the weather conditions and local traffic conditions where
transport is taking place. Adjustments to routes will be made possible by analyzing the
weather and traffic data allowing the system to make routing decisions in a timely and
emission friendly manner. Figure 5 shows the context diagram of the system data flow.
The core component twins of the container, weather, and traffic will work together
to form three distinct asset twins within the digital twin system. Container and weather
component twins will form a single asset twin to study the interactions of the container
routes and weather. Similarly, the container and traffic component twins will form a
single asset twin to study the interactions between container routes and traffic.
Finally, the weather and traffic component twins will form a single asset twin to study
the interactions between weather and traffic where intermodal transport occurs. The data
collected from these three asset twins will be processed and analyzed by the system
for optimization of routes, schedules, and CO2 emission reduction during intermodal
container transport, forming the complete digital twin data structure (Fig. 6).
Real-time data can be obtained from a variety of sources. Advanced analytics and ML
algorithms can be used to process and analyze real-time data, providing insights and sug-
gestions to stakeholders in real-time. Table 1 specifies the update frequency, latency, and
spatial granularity of dynamic data. The goal is for system level optimizations to be com-
puted within a day and dynamic adjustments in under 1 h. Real-time data can be critical
for optimizing intermodal transportation networks by providing up-to-date information
on infrastructure, traffic patterns, weather conditions, and other factors that can impact
transportation operations. Some common types of RT data used in intermodal transporta-
tion optimization include Vessel AIS, GPS data, Rail freight traffic, road Traffic, Port
traffic, Weather, Inventory and shipment data, and data from Customer Experience (CX)
and Customer Relationship Management (CRM) platforms along with feedback chan-
nels (for which Natural Language Processing models are needed). All the above sources
can provide RT or near-RT information. Effective use of RT data improves network
optimization, leading to increased efficiency, reduced costs, and improved customer
satisfaction.
70 C. Paternina-Arboleda et al.
4 Cybersecurity Considerations
The CIA triad, which stands for Confidentiality, Integrity, and Availability, is a fun-
damental concept in cybersecurity that applies to various systems and technologies,
including digital twins. Let’s discuss the CIA triad in the context of ports, container, and
shipping digital twin systems:
Cybersecurity Considerations for the Design 71
Confidentiality: Confidentiality in the context of digital twin systems for ports, con-
tainers, and shipping refers to protecting sensitive information from unauthorized access.
It involves ensuring that only authorized individuals or entities can access and view the
data related to the digital twin system. Confidentiality measures can include strong
access controls, encryption of sensitive data, and secure communication channels to
prevent eavesdropping or data interception. For example, in a shipping digital twin, con-
fidential information may include cargo details, shipping schedules, or trade secrets of
the involved parties. Safeguarding this information is critical to prevent unauthorized
disclosure or exploitation by malicious actors.
Integrity: Integrity refers to the assurance that the data and information within the
digital twin system remain accurate, unaltered, and trustworthy. It involves protecting
the data from unauthorized modifications, ensuring that it reflects the true state of the
physical system being represented. In the context of ports, containers, and shipping
digital twins, maintaining data integrity is crucial to avoid manipulation or tampering that
could result in incorrect decisions, compromised safety, or financial losses. Implementing
integrity controls such as data validation, digital signatures, and audit trails can help
ensure the reliability and authenticity of the digital twin’s data.
Availability: Availability refers to ensuring that the digital twin system is accessible
and operational whenever it is needed. It involves safeguarding the system against disrup-
tions, outages, or denial-of-service attacks that could render it inaccessible or unusable.
For ports, containers, and shipping digital twins, availability is vital for maintaining
smooth operations and efficient logistics. Downtime or unavailability of the digital twin
system can lead to delays, inefficiencies, and potential financial losses. Implementing
robust backup and disaster recovery mechanisms, redundancy, and proactive monitoring
can help ensure high availability of the digital twin system.
In brief, the CIA triad provides a comprehensive framework for addressing cyber-
security concerns in ports, container, and shipping digital twin systems. By prioritizing
confidentiality, integrity, and availability, organizations can establish a strong security
foundation to protect sensitive data, maintain the accuracy of information, and ensure
the continuous and reliable operation of their digital twin systems.
The first code type that logically should be secured is the source code. The source
code that is used as a baseline configuration for all development could potentially be
compromised and go undetected until deployment. This type of source code data poi-
soning and embedded viruses can hide if the malicious actor enters them smartly in
the beginning of the development process. These data vulnerabilities can be part of
the implementation process and deployed to production units without the developer’s
knowledge. Drivers and cyber operations dashboards would not be able to track security
alerts and be utilized as they were intended. To mitigate this risk, a best practice known
as “Shift-Left” can be implemented into the systems’ security posture.
The Development, Security, Operations (DevSecOps) process incorporates secu-
rity into the early stages of the software development life cycle. According to IBM,
DevSecOps “integrates application and infrastructure security seamlessly into Agile
and DevOps processes and tools. It addresses security issues as they emerge when they
are easier, faster, and less expensive to fix. Additionally, DevSecOps makes applica-
tion and infrastructure security a shared responsibility of development, security, and IT
operations teams, rather than the sole responsibility of a security silo” (“DevSecOps,”
2023). DevSecOps gives developers access to security tools throughout the process so
that there are no integration problems at the end. The security vulnerabilities and flaws
would be shifted left and found sooner rather than later.
“Shift left”, promotes the integration of security measures into the early stages of
the DevOps process (“DevSecOps”, 2023). By moving security from the right (end)
to the left (beginning) of the development cycle, software engineers are encouraged to
prioritize security from the very beginning of the project. In a DevSecOps environment,
cybersecurity architecture and development engineering is incorporated as part of the
development team to guarantee that every phase has security measures. Furthermore,
the configuration items in the stack are securely patched, configured, and documented.
This will benefit change management and version control measures. By adopting a shift
left approach, DevSecOps teams can detect security risks and vulnerabilities early and
take immediate action to address them. This ensures that not only is the product being
built efficiently, but that security is also being implemented simultaneously.
The next cybersecurity risk that will be relevant to the system is the risk of lateral
movement, network device compromises and privilege escalation after bypassing the
initial access controls and authentication stages. This risk is because of the increased
attack surface of traditional cybersecurity measures that focus on this initial access
barrier (North/South traffic) and allow freedom to move around within the network
(East/West traffic). To mitigate this risk, using zero trust cybersecurity practices would
be a sufficient mitigation path. Zero Trust Architecture (ZTA) takes an orthogonal view
of cybersecurity by accounting for internal movement and east/west traffic within a
network. This lets ZTA be viewed as “data-centered” rather than “user-centered”. ZTA
secures the data that flows all through the network while traditional measures attempted
to secure the users in entering the network. The United States Government Accountabil-
ity Office (GAO), states “Zero trust architecture (ZTA) is a cybersecurity approach that
authenticates and authorizes every interaction between a network and a user or device”
(Zero Trust Architecture, 2022). In a normal cybersecurity paradigm, a network user is
allowed to roam freely east and west through the network once access is granted. An
Cybersecurity Considerations for the Design 75
attack surface is much smaller using a zero-trust mindset because there are consistent
authorization measures that are implemented into the system design. This will be imple-
mented as Policy-as-Code and will be automated through firewall parameters, router
rules and ACLs. ZTA will be very important for the system because of the data struc-
ture and hybrid-cloud environment. The GAO describes the importance of ZTA with
some background. “Given the increasingly complex nature of IT networks, including
cloud and hybrid environments, ZTA’s goals are to reduce opportunities for attackers by
restricting access and to detect attacks by monitoring user behavior and other network
activity” (Zero Trust Architecture, 2022). Given the hybrid data storage and data move-
ment paths through secure VPNS, this major reduction of attack surface will directly
lower system’s security risk.
The last major cybersecurity risks the system will be faced with is being in the dark
about security threats and attacks. The system will need the ability to receive notifications
of cyber security attacks. Whenever there is data out there that can be compromised, the
system vulnerabilities will be attacked by cyber criminals. The criminals hope to gain
some sort of gain. Whether that gain be a financial, competitive advantage, competitor
insights or some sort of notoriety. When these cyber-attacks occur, the system will have
a prompt, security alert and security management subsystem to alert security analysts of
the breach. As mentioned, the Security Incident/Event Management (SIEM) subsystem
is sufficient to mitigate this cybersecurity risk and accomplish security monitoring. This
subsystem will be expanded on in the next section to automate the responses. However,
to automate a response to a security breach, a type of SIEM subsystem is the foundation
for mitigating security alerts that will arise. The SIEM tools are “a way to centrally
collect pertinent log and event data from various security, network, server, application,
and database sources. SIEMs then detect and alert on security events” (Froehlich, 2023).
This allows security analysts to review the log data, identify the threat, isolate, and
mitigate as appropriate. As mentioned, this will be automated by another subsystem in
later phases. Another benefit of the SIEM subsystem is that “Aggregated data is analyzed
by the SIEM in real time to spot potential security issues. Because multiple data sources
are analyzed, the SIEM identifies threats by correlating information from more than one
source” (Froehlich, 2023). The system will leverage this feature to collect and store data
at each local host and aggregated host data at the hub. Additionally, “The SIEM then
intelligently ranks the events in order of criticality” (Froehlich, 2023). The SIEM AI
feature that stacks and ranks the security alerts will be very beneficial to cost savings
and efficiency. Buckbee (2022) states some of the top SIEM tools used including Splunk,
IBM QRadar, and Logrhythm.
Table 2 shows a summary of the cybersecurity considerations and the measures to
counteract.
76 C. Paternina-Arboleda et al.
There is a large need for the system to have overarching security protections on sensitive
data to protect customer, supplier, and employee information. Data Privacy rulesets are
baked into the source code that the system uses. Both data at rest and data in transit
should have cybersecurity measures built into the system design.
The primary method to store and secure the data will be in a cloud instance. This
allows the cloud storage infrastructure to be managed at the Hub and treats the production
units as Thin Clients. Thin clients are defined as “a virtual desktop computing model
that runs on the resources stored on a central server instead of a computer’s resources”
(Gillis, 2021). Both cloud and hard drive based local backup methods have advantages
and disadvantages. To take advantage of both and fully address any possible data loss,
a hybrid backup strategy is recommended, where backups are created locally and in the
cloud. The 3-2-1 backup rule industry standard describes that the data is kept in three
places, across two media, with one backup stored offsite, such as in the cloud.
Local hard drive data storage will be utilized on an incremental, scheduled basis. This
local, hard drive backup refers to the process of backing up the system, applications, and
data to a device located internally or as a connected component, such as tape, disk, hard
disk, flash drive, CD, external hard drive, or other media. This type of backup will be
located and kept solely by data custodians located on site at the Hub. Due to the growing
storage capacity restraints for the massive amounts of data that gets produced, this will
not be a full data storage strategy. However, a very important data storage strategy if the
cloud was ever compromised.
On-Prem means the data is stored on the Hub’s premises but in a cloud storage
location. Cloud based data storage on-premises backup refers to the process of backing
up the system, applications, and data to a public or private cloud that is located on the
Hub premises. The cloud will be configured as a secure database that collects the data
and stores it into the centralized servers. These servers are located on site at the Hub.
Off-premises backup refers to the process of backing up the system, applications, and
data to a public or private cloud that is located off the premises. (i.e., on a production
unit). This will be the primary method of collecting and storing data for the different
transportation methods and personnel involved. Off Prem is when the data is not part of
the centralized cloud storage infrastructure. Rather, a means to store local data of driver
information and movement in a separate, secure cloud storage that acts as a thin client.
Thin clients are very useful because it reduces the cyber-attack surface and protects the
data that is generated by each unit. However, there is a drawback. This drawback being
“Normally thin clients take the form of low-cost computing devices that heavily rely on
a server for computation” (Gillis, 2021). This puts the computational power back onto
the server side located at the Hub instead of the clients themselves on the truck, train, or
boat.
Protecting data at rest within the Hub database and storage locations can be accom-
plished by enforcing three key security procedures of database encryption, access and
authentication (“Protecting data at rest,” 2023). Enforcing access controls and proper
authentication using MFA points protects the core cybersecurity triad. Confidentiality,
data integrity and availability are protected because there is no unauthorized access to the
data being stored. All data that is meant to be seen by authorized personnel is available.
78 C. Paternina-Arboleda et al.
If there is unauthorized access, the database encryption is the fallback security measure
that ensures the sensitive data is protected. The method to protect sensitive data in transit
is to set up an enterprise level Virtual Private Network (VPN) and direct, secure con-
nection between the Hub and fielded units. This method of protecting sensitive data in
transit allows the system to mitigate security risks inherent to providing remote network
access by offering strong encryption to provide data security and strong authentication,
to limit access to applications based on security policies (Loshin, 2019).
Having the ability to hide the identity of information is another way to protect
private data, as it would make it difficult to pinpoint whose data it is. QueryPie (2022)
goes into detail about the difference between anonymization and pseudonymization, but
how both can be beneficial to protecting personal data. Anonymization is the process of
eliminating all identifiers from the data, whereas pseudonymization uses a pseudonym
identifier takes place of the actual identifier of the data.
Cybersecurity and data privacy are prioritized steps to ensuring safe use of the pro-
posed optimization system. To reach the appropriate level of security and privacy for
the system, there are several legal and regulatory issues that must be addressed. Intellec-
tual Property (IP) rights are one of the leading legal obstacles that must be considered
when using AI-driven models and algorithms. Medeiros and Sanft (2018) state that IP
is associated with inventions, brands, new technologies, source code, and artistic work
that have patents, trademarks, and copyright ownership.
6 Managerial Insights
This article highlights the critical managerial insights for designing an AI-driven dis-
tributed optimization system with a strong focus on cybersecurity considerations. By
adopting a cybersecurity-first approach, investing in robust cybersecurity measures, fos-
tering awareness, collaborating with industry partners, conducting regular risk assess-
ments, and planning for incident response and recovery, managers can enhance the secu-
rity and resilience of the system, protecting valuable data and ensuring the successful
implementation of container carbon emissions reduction in freight operations.
It highlights the importance of adopting a cybersecurity-first approach when design-
ing an AI-driven distributed optimization system for container carbon emissions reduc-
tion in freight operations. Managers should prioritize cybersecurity considerations right
from the system’s design phase to mitigate potential risks and safeguard sensitive data.
It highlights the importance of investing in robust cybersecurity measures. Given
the increasing connectivity and interdependence of systems in the freight industry,
it is imperative to invest in robust cybersecurity measures. Managers should allocate
resources for implementing encryption mechanisms, access controls, anomaly detection
systems, and other cybersecurity solutions to protect the system against cyber-attacks
and unauthorized access.
The article also highlights the importance of fostering a culture of cybersecurity
awareness. It is essential to conduct regular risk assessments and audits to identify
vulnerabilities, evaluate the effectiveness of implemented cybersecurity measures, and
ensure compliance with relevant industry standards and guidelines. Managers should
allocate resources for periodic assessments, penetration testing, and audits to detect and
address any weaknesses or gaps in the system’s security.
Last, the article shows the importance of planning for incident response and recovery.
Despite preventive measures, it is important to have a well-defined incident response
plan in place. Managers should establish protocols for detecting, responding to, and
recovering from cybersecurity incidents. This includes defining roles and responsibili-
ties, establishing communication channels, and regularly testing incident response plans
through tabletop exercises and simulations.
7 Conclusions
The design of a digital twin-enabled AI-driven real-time distributed optimization system
for synchromodal freight operations has the potential to significantly reduce carbon
emissions and increase efficiency in the transportation industry. However, with the use
of advanced technologies also comes the potential for cybersecurity risks.
This article highlights the key cybersecurity considerations that need to be addressed
in the design of such a system. These include securing the data flow, ensuring the security
of the digital twin models, preserving the privacy for all stakeholders, and protecting the
system against cyber-attacks. The proposed solutions to these challenges include the use
of encryption, access control, and anomaly detection systems, and mitigation strategies
when an attack happens, among others.
The success of the proposed system depends on the implementation of these cyber-
security measures. The transportation industry must prioritize cybersecurity to ensure
Cybersecurity Considerations for the Design 81
the safety and integrity of the transportation network and its stakeholders. By adopting a
cybersecurity-first approach, the users of this system can mitigate risks and ensure that
the benefits of the system are fully realized.
Future research should focus on further refining the proposed cybersecurity solutions
and addressing new cybersecurity challenges that may arise as the system is deployed.
One area that requires attention is the potential impact of emerging technologies, such as
quantum computing, on the security of the system. Quantum computing has the potential
to break the encryption algorithms currently used to secure data, and thus, there is a need
to explore new encryption methods that are resistant to quantum attacks.
Another area of interest is the development of improved, more automated machine
learning algorithms that can detect and prevent cyber-attacks in real-time. Traditional
cybersecurity measures, such as firewalls and intrusion detection systems, may not be
sufficient in detecting sophisticated cyber-attacks that can exploit system vulnerabilities.
Machine learning algorithms can learn from past cyber-attacks and detect new and
unknown cyber-attacks, leading to better and more effective cybersecurity measures.
Furthermore, future research should also explore the potential of blockchain technol-
ogy in securing the system. Blockchain technology provides a decentralized and tamper-
proof data storage and transaction control mechanism that can enhance data security and
integrity, making it a promising technology for securing digital twin-enabled AI-driven
real-time distributed optimization systems.
Overall, this paper describes certain topics as to how a cybersecurity-aware design
approach can enable the successful implementation of a digital twin-enabled AI-driven
real-time distributed optimization system for synchromodal freight operations, leading
to significant benefits for the transportation industry and the environment.
References
Cost of a Data Breach Report 2022. IBM (2022). https://www.ibm.com/reports/data-breach?
utm_medium=OSocial&utm_source=Blog&utm_content=SSSWW&utm_id=Security-Intell
igence-Blog-Banners%20&_ga=2.169531361.1119445732.1681783239-1093745095.168
1783239. Accessed 14 Apr 2023
What is DevSecOps? IBM (2023). https://www.ibm.com/topics/devsecops#Benefits%20of%20D
evSecOps. Accessed 17 Apr 2023
Zero Trust Architecture. GAO (2022). https://www.gao.gov/products/gao-23-106065. Accessed
15 Apr 2023
Protecting data at rest. AWS Inc. (2023). https://docs.aws.amazon.com/wellarchitected/latest/sec
urity-pillar/protecting-data-at-rest.html. Accessed 17 Apr 2023
US Department of Commerce, National Oceanic and Atmospheric Administration: API web
service (2023). https://www.weather.gov/documentation/services-web-api. Accessed 10 Apr
2023
NIST. Post-quantum cryptography: CSRC (2017). https://csrc.nist.gov/projects/post-quantum-cry
ptography. Accessed 18 Apr 2023
NIST. Round 3 submissions - post-quantum cryptography: Computer Security Resource Cen-
ter (CSRC) (2023). https://csrc.nist.gov/Projects/post-quantum-cryptography/post-quantum-
cryptography-standardization/round-3-submissions. Accessed 17 Apr 2023
NVIDIA. NVIDIA H100 Tensor Core GPU Datasheet. NVIDIA (2022). https://resources.nvidia.
com/en-us-tensor-core/nvidia-tensor-core-gpu-datasheet. Accessed 13 Apr 2023
82 C. Paternina-Arboleda et al.
Lipner, S.B., et al.: Security kernels. In: Proceedings of the May 6–10, 1974, National Computer
Conference and Exposition on - AFIPS 1974, vol. 43, pp. 977–978 (1974). https://doi.org/10.
1145/1500175.1500361
Loshin, P.: IPsec vs. SSL VPN: comparing speed, security risks and technology. Tech Target
(2019). https://www.techtarget.com/searchsecurity/tip/IPSec-VPN-vs-SSL-VPN-Comparing-
respective-VPN-security-risks
Moros-Daza, A., Solano, N.C., Amaya, R., Paternina, C.: A multivariate analysis for the creation
of Port Community System approaches. Transp. Res. Procedia 30, 127–136 (2018). ISSN:
2352-1465. https://doi.org/10.1016/j.trpro.2018.09.015
Moros-Daza, A., Hoz, D.-D., Jaller-Martelo, M., Paternina-Arboleda, C.D.: Using advanced infor-
mation systems to improve freight efficiency: results from a pilot program in Colombia. In:
Paternina-Arboleda, C., Voß, S. (eds.) ICCL 2019. LNCS, vol. 11756, pp. 22–38. Springer,
Cham (2019). https://doi.org/10.1007/978-3-030-31140-7_2
Moros-Daza, A., Amaya-Mier, R., Paternina-Arboleda, C.: Port Community Systems: a structured
literature review. Transp. Res. Part A Policy Pract. 133, 27–46 (2020). ISSN: 0965-8564. https://
doi.org/10.1016/j.tra.2019.12.021
Möller, M., Vuik, C.: On the impact of quantum computing technology on future developments
in high-performance scientific computing. Ethics Inf. Technol. 19(4), 253–269 (2017). https://
doi.org/10.1007/s10676-017-9438-0
Nichols, B.: The Current State of DevSecOps Metrics. CMU (2021). https://insights.sei.cmu.edu/
blog/the-current-state-of-devsecops-metrics/
Palma-Blanco, A., González, E.R., Paternina-Arboleda, C.D.: A two-pheromone trail ant colony
system approach for the heterogeneous vehicle routing problem with time windows, multiple
products and product incompatibility. In: Paternina-Arboleda, C., Voß, S. (eds.) ICCL 2019.
LNCS, vol. 11756, pp. 248–264. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-
31140-7_16
Paternina-Arboleda, C.D., Das, T.K.: A multi-agent reinforcement learning approach to obtaining
dynamic control policies for stochastic lot scheduling problem. Simul. Model. Pract. Theory
13(5), 389–406 (2005)
Paternina-Arboleda, C.D., Das, T.K.: Intelligent dynamic control policies for serial production
lines. IIE Trans. 33(1), 65–77 (2001). https://doi.org/10.1023/A:1007641824604
Paternina-Arboleda, C.D., Montoya-Torres, J.R., Fabregas-Ariza, A.: Simulation-optimization
using a reinforcement learning approach. In: 2008 Winter Simulation Conference, Miami, FL,
USA, pp. 1376–1383 (2008). https://doi.org/10.1109/WSC2008.4736213
Paternina-Arboleda, C.D., Agudelo-Castañeda, D., Voß, S., Das, S.: Prediction of SO2 ports
emissions inventories. Sustainability (2023, Submitted)
Rakos, D.: Understanding GPU caches. [web log] (2021). https://www.rastergrid.com/blog/gpu-
tech/2021/01/understanding-gpu-caches/. Accessed 12 Apr 2023
Sarabia, Carolina, M., John, H., Rios-Griego, Carlos, D., Paternina-Arboleda: Simulation-based
decision support models for river cargo transportation. In: 2006 IEEE Systems and Information
Engineering Design Symposium, pp. 142–145 (2006)
Saxon, S., Stone, M.: Container shipping: the next 50 years. A McKinsey & Company report
(2017)
Saxena, A.: What are AI accelerators and how does it work? BISinfotech (2022). https://www.bis
infotech.com/what-are-ai-accelerators-and-how-does-it-work/. Accessed 14 Apr 2023
Shankland, S.: Quantum computers could crack today’s encrypted messages. That’s a prob-
lem. CNET (2021). https://www.cnet.com/tech/computing/quantum-computers-could-crack-
todays-encrypted-messages-thats-a-problem/. Accessed 17 Apr 2023
Song, D.: A literature review, container shipping supply chain: planning problems and research
opportunities. Logistics 5, 41 (2021). https://doi.org/10.3390/logistics5020041
84 C. Paternina-Arboleda et al.
Trucco, P., Petrenj, B.: Characterisation of resilience metrics in full-scale applications to interde-
pendent infrastructure systems. Reliab. Eng. Syst. Saf. 235, 109200 (2023). ISSN: 0951-8320.
https://doi.org/10.1016/j.ress.2023.109200
Tsvetkova, A., Gustafsson, M., Wikström, K.: Digitalizing maritime transport: digital innovation
as a catalyzer of sustainable transformation. In: Montero, J., Finger, M. (eds.) A Modern Guide
to the Digitalization of Infrastructure. Elgar Modern Guides, pp. 123–148. Edward Elgar (2021)
Velasquez-Bermudez, J.M., Abril, D., Paternina-Arboleda, C.D.: Optimizing port operations to
cope with shipping congestion in South American countries. OR/MS Today 49(2) (2022)
Velásquez-Bermúdez, J.M., Khakifirooz, M., Fathi, M. (eds.): Large Scale Optimization in Supply
Chains and Smart Manufacturing: Theory and Applications. Springer, Cham (2020). ISBN:
978-3-030-22790-6. https://doi.org/10.1007/978-3-030-22788-3
Wang, Z., Liu, X.: Cyber security of railway cyber-physical system (CPS) – a risk management
methodology. Commun. Transp. Res. 2, 100078 (2022). ISSN: 2772-4247. https://doi.org/10.
1016/j.commtr.2022.100078
Woschank, M., Rauch, E., Zsifkovits, H.: A review of further directions for artificial intelligence,
machine learning, and deep learning in smart logistics. Sustainability 12, 3760 (2020). https://
doi.org/10.3390/su12093760
Yang, Z., et al.: Indicator-based resilience assessment for critical infrastructures – a review. Saf.
Sci. 160, 106049 (2023). ISSN: 0925-7535. https://doi.org/10.1016/j.ssci.2022.106049
Zemp, B.: The Intersection Between AI and Blockchain Technology – Industries of Tomor-
row. Forbes (2023). https://www.forbes.com/sites/forbesbooksauthors/2023/02/28/the-inters
ection-between-ai-and-blockchain-technology--industries-of-tomorrow/?sh=1731ee7d4de7.
Accessed 14 Apr 2023
Zhang, W., Guhathakurta, S., Khalil, E.B.: The impact of private autonomous vehicles on vehicle
ownership and unoccupied VMT generation. Transp. Res. Part C Emerg. Technol. 90, 156–165
(2018)
Customer’s Choice in the Context
of Cross-Border E-Commerce: An
Application of Structural Equation
Modelling
1 Introduction
In the post-pandemic era, all business models, i.e., traditional business models,
E-Commerce and Cross-Border E-Commerce (CBEC) have some gradual devel-
opments. With the popularity of smart phones and other devices, the devel-
opment of the Internet of Things (IoT) technologies, the establishment of 5G
Supported by the Project ‘FuturePorts’, DLR (Deutsches Zentrum für Luft- und Raum-
fahrt, German Aerospace Center).
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 85–102, 2023.
https://doi.org/10.1007/978-3-031-43612-3_5
86 Y. Liu and X. Shi
1
ESEM+CFA is a combination of ESEM and CFA. It exhibits the similarity with
ESEM within CFA, which is introduced in Mplus User’s Guide [29].
2
SmartPLS 4: the current Version is SmartPLS 4.0.9.4, https://www.smartpls.com/.
Customer’s Choice in Cross-Border E-Commerce 87
amounts of feedback within the limited time. Thirdly, instead of naming the
factors in ESEM in advance, significant factors are to be redefined by running
ESEM+CFA. Fourthly, in ESEM indicators have cross-loading, the assignment
of indicators may differ from our original design. In other words, it is challenging
for the process of factor redefinition. Moreover, while refining the model fit, the
modification indices (MI) are not unified. Therefore, two distinctive processes
are discussed and applied based on a better result. Lastly, some evaluation cri-
teria of PLS-SEM are applied for assessing the measurement models with some
concerns on their validity. The factor redefining process provided by this research
is going to overcome the above-mentioned challenges.
2 Literature Review
CBEC is a busi-
ness model based
on Internet and E-
Commerce websites
or platform, where
two parties involved
in the transaction
come from different
countries or terri-
tories [15]. Hu and
Luo [17] state that
CBEC is efficient
and low-cost, and it
is an application of Fig. 1. Process of CBEC
internet technology
on the field of for-
eign trade. Different roles in CBEC are characterized by different functions
and responsibilities (Fig. 1). For instance, customs is regulating and facilitating
international trade [6] and the customs also aims to maintain a certain degree
of fairness and transparency in global trade. It is vital for all parties involved
in CBEC to know customs declarations and regulations. The customs releases
declared items comprehensively, such as, the calculation of customs duties includ-
ing over-size or over-weight products’ duties and taxes. When the sellers launch
their products, the CBEC platform should automatically inform them about the
relevant regulations. Generally speaking, the owner of products and its represen-
tative are responsible to submit declarations [45]. For the purpose of reducing the
operational time, such a declaration should be submitted with considerations of
accuracy, authenticity and completeness. It is noted for cross-border trade that
the payment of customs duties complies with Incoterms3 such as Delivered Duty
Paid (DDP), Delivered Duty Unpaid (DDU), etc., where DDP is relative widely
used due to efficiency and safety concerns in E-Commerce [9].
3
Incoterm: International Commercial Term.
88 Y. Liu and X. Shi
purchases on the same platform. A series of studies have explored the repurchase
intention or behaviour with SEM.
Methodologies, principles, concepts, the criteria of metrics, and theories of
SEM are listed in this section as prerequisite for the next sections. RMSEA
(Root Mean Square Error of Approximation), CFI (Comparative Fit Index), TLI
(Tucker Lewis Index), df (Degree of Freedom), and the probability of RMSEA ≤
0.05 as criteria are comprehensively introduced in the references [21,37,41]. For
evaluation of the measurement model in SEM, an Average Variance Extracted
(AVE) ≥ 0.5 is acceptable [13]. However, Shrestha [41] states if composite reality
> 0.6 but AVE < 0.5, the convergent validity remains acceptable. The theory
of ESEM, guideline of Mplus, and examples of Mplus support our research [2,
27,29]. Modification benefits a better model fit. With regard to investigating
the exploration of the ESEM framework, at the outset, a pair with highest
modification index is modified [48]. Even though many articles have explored
the CRI prior to the pandemic, it remains valuable to investigate it in the post
pandemic era.
3 Methodology
3.2 ESEM+CFA
In the early stage, EFA is applied to observe the extracted factor numbers, which
will be utilized in ESEM+CFA. ESEM within CFA is introduced in Mplus User’s
Guide [29] and Mai and Wen’s study [25]. Instead of ESEM within CFA, we
prefer to call it ESEM+CFA, as we think that this model is constructed by
two independent models. Hence, this model is an integration, not an inclusion.
Accordingly, we suppose two different modification processes in Sect. 3.3.
Beyond their original struc-
tural models they also share the Table 1. Criteria for Model Evaluation
same structural model through Metrics Cut-off value Sources
RMSEA ≤0.08 [8, 12, 13, 21, 29, 35,
connection with each other. Being 37, 41, 48]
SRMR ≤0.08
different from typical SEM, the CFI ≥0.8
hypothesis is not to be established TLI ≥0.8
Probability of ≥ 0.05
in advance. After estimation, the RMSEA≤0.05
Chi Square –
factors are named based on their Chi Square ≤3
df
indicators. In our view, the CFA Cronbach’s alpha ≥0.6
AVE ≥0.5
or EFA is an analysis method for R Square ≥0.5
≥0.9
factor analysis. While factor anal- NFI (Normed Fit
Index)
ysis do not place emphasize on Indicators loading ≥0.6
Composite ≥0.6
the analysis of factor relationship, reliability
HTMT ≤0.9
SEM concentrates on it. ESEM is (Heterotrait-
Monotrait Ratio)
a SEM based on EFA, highlight-
ing the analysis of significant relationships among factors. Such an exploratory
process is not only for the relationship among factors, but also for indicators.
The definition of ESEM also provides us a new understanding of SEM. For
instance, how to understand the Confirmatory Structural Equation Modelling
(CSEM). In other words, if researchers plan to verify an old model, whether
this process can be called CSEM. Furthermore, there are two kinds of SEM,
i.e., PLS-SEM (Partial Least Squares SEM) and CB-SEM (Covariance Based
SEM). They can be distinguished by some differences including distinct criteria
for model assessment. PLS-SEM is for the extension of a verified model, theory
testing and comparison is a feature of CB-SEM [13,14]. To conduct an analysis
of ESEM+CFA based on CB-SEM, Mplus is to be applied. ESEM, Bi-factor,
ESEM+CFA are highlights of Mplus [29]. At present Mplus 8.9 is available [29],
and in our study Mplus 8.3 is deployed for data implementation. To observe
the measurement model, SmartPLS 4 is also applied. The applied criteria are
succinctly depicted in Table 1, providing a overview.
3.3 Modification
3.4 Questionnaire
Questionnaires Content
Collection Period Feb 20. 2023 to March 17. 2023
Numbers of question 16
Total participants 187
Response Rate around 36%
Languages English, Chinese, German
Female, Male 46%, 54%
Spread medium Linkedin, WeChat, Email and
What’sApp, Survey Circle, SMS
4
UK, Australia, Japan, Panama, Norway, Ireland, and Switzerland.
92 Y. Liu and X. Shi
satisfaction, online service quality, offline service quality, trust, and technical
factors. 28 indicators are analyzed using SPSS to study EFA. and six indica-
tors from CRI are employed for CFA. Their indicators are defined from different
literature. The keywords of indicators are succinctly described in Table 3.
Table 3. Keywords of indicators
4.1 Results
(1) 28 indicators are estimated on SPSS with rotated Varimax under Maxi-
mum Likelihood. According to the estimation performed by SPSS, the KMO
(Kaiser-Meyer-Olkin) is 0.878, and the p-value is approximate to 0. Hence,
in accordance with [41] the result is acceptable. Therefore, 28 indicators are
extracted into six factors to be applied for the subsequent steps.
(2) The integration of ESEM+CFA is implemented on the Mplus. Based on
the first-round estimation, RMSEA is 0.076, CFI is 0.878, the probability of
RMSEA is 0.000, and SRMR is 0.050. So the model does not fit. Afterwards,
it is to be modified using the MI until no pair exceeds 10. According to the
first modification method, 12 steps are carried out. After being modified
by the second modification process, its model fit is also acceptable. Despite
this, it does not outperform the first one (Table 4b).
(3) After being modified, 3 factors have significant impacts on the CRI. Accord-
ing to the result and their indicators, they are redefined as Satisfaction (SA),
MIS Technical factor (MIS), and Facilitation (FA), respectively (Fig. 5a).
Customer’s Choice in Cross-Border E-Commerce 93
(a) With the first modification process (b) With the second modification process
Other 3 factors are not going to be defined any more. However, the coefficient
of Facilitation is negative, i.e. −0.216, it is inconsistent with common sense.
Following that, in order to proceed with further exploration these 3 factors
with CRI are extracted for establishing a new model further (Fig. 5a).
(4) As estimated, Facilitation has no significant impact on CRI anymore. This
provides a new vision which is different from current policies and com-
mon sense which is previously accepted [10,40,47]. In contrast to that,
the impacts of MIS Technical factor and Satisfaction persist significantly.
Figure 5a → Fig. 5b demonstrates the transformation of model. In the
extracted model, correlations that cross structural model have existed, there-
fore, model-crossing is allowed. This step is implemented on Mplus. Follow-
ing that, the further evaluation of measurement model and structural model
are required.
The minimum path loading ωmin is inversely derived in accordance with our
sample size and applied significance level 5% [13]. Based on observation, the path
loading of Satisfaction and MIS Technical Factor exceed 0.167, but Facilitation
not. Apart from that, it is discussed whether the factor loading needs to be
higher than 0.4, 0.5 or 0.7 [4]. CRI is an endogenous variable in the model,
its R Square on SmartPLS is 0.645, while on Mplus is 0.641. Therefore, the
power of prediction of CRI is moderate [13]. Table 5 explicates the result of
the extracted model estimated on Mplus and SmartPLS, but Mplus can output
the probability of RMSEA ≤ 0.05 individually. Regardless of the differences of
outputs, the results are almost the same. Hence, the structural model of the
extracted model fits well.
With the result of the extracted model, a proactive business strategy jour-
ney is designed (Fig. 8). In terms of that, some items are hoped to applied for
improvement. Even though not all of the indicators loading are acceptable. For
the purpose of further observation and verification on this extracted model and
those of criteria forward this diagram is derived from the valid p-value.
4.2 Discussions
(1) The measurement models are going to be optimized by removing indica-
tors, namely the indicator with lowest loading is deleted initially. In line
with modification only one pair is deleted in each step. In Fig. 6, Facilita-
tion remains insignificantly. There is no cross-loading among different mea-
surement model. Additionally, all of 13 indicators are reliable theoretically.
Table 8a and 8b depict the results of the variation in the measurement model
and the model fit. All of the metrics are definitely improved, especially AVE.
The results of convergent reliability of CRI, Facilitation and MIS Technical
Factor meet the required criteria and are deemed as acceptable, and AVE
of Satisfaction is approximate to 0.5.
96 Y. Liu and X. Shi
Criteria Result
Composite
Cronbach’s RMSEA 0.057 Acceptable
reliability AVE
alpha
(rho c) CFI 0.967 Acceptable
CRI 0.842 0.792 0.539 TLI 0.955 Acceptable
FA 0.845 0.856 0.744 SRMR 0.050 Acceptable
MIS 0.771 0.769 0.632 NFI 0.912 Acceptable
SA 0.826 0.829 0.499 Chi Square
1.535 Acceptable
df
(a) Result of measurement model
(b) Result of model fit
(2) Variation of the sample size within ESEM+CFA. To investigate the effect of
the sample size, the different sample sizes are applied for additional compar-
ison on Mplus; therefore, experiments with three different sample sizes are
conducted. They are distinguished by the response time, namely the first
week (75 available responses), the second week (124 available responses)
and overall, respectively. The result of 75 sample sizes shows the model fit is
not acceptable. After modification, almost all metrics dissatisfy with cut-off
value. With the increased sample sizes the model fit is definitive improved.
The Fig. 7 displays that the probability of RMSEA ≥ 0.05 is influenced by
sample size most significantly. Beyond that, all of metrics are refined with
the increased sample sizes. Accordingly, it is valuable to study on it with a
larger sample size in the future.
(3) Comparison with previous research. On one hand, our experiment explicates
the significant recursive relationship of “Satisfaction → CRI” and “MIS
Technical → CRI”. As described in literature, Satisfaction has significant
impacts on CRI. Perhaps the definition of factors and the design of indicators
jointly affect the result. On the other hand, the loading reliability of “as
a preference for future shopping” in our case is regarded as acceptable.
However, in [28], “as a preference for future shopping” and “recommend
it to others” are both acceptable. Maybe the different research objectives,
sample sizes, different backgrounds, different models or respondents (target
groups) lead to the above-mentioned different results.
still space to refine and leverage our understanding on the CBEC with focus on
customer intention, choice and derived cargo volumes.
References
1. Amazon: Valid Tracking Rate (VTR) (2023). https://sellercentral.amazon.co.uk/
help/hub/reference/external/G201817070?locale=en-GB. Accessed 11 Apr 2023
2. Asparouhov, T., Muthén, B.: Exploratory structural equation modeling. Struct.
Equ. Model. 16(3), 397–438 (2009). https://doi.org/10.1080/10705510903008204
3. Cao, X., Shen, B., Liu, Y., Wang, M.: Research on evaluation of B to C e-commerce
website based on AHP and grey evaluation. In: Second International Symposium
on Electronic Commerce and Security, vol. 2, pp. 405–408 (2009). https://doi.org/
10.1109/ISECS.2009.242
4. Cheung, G., Cooper-Thomas, H., Lau, R., Wang, L.: Reporting reliability, con-
vergent and discriminant validity with structural equation modeling: a review and
best-practice recommendations. Asia Pac. J. Manage. (2023). https://doi.org/10.
1007/s10490-023-09871-y
5. Corbitt, B.J., Thanasankit, T., Yi, H.: Trust and e-commerce: a study of consumer
perceptions. Electron. Commer. Res. Appl. 2(3), 203–215 (2003). https://doi.org/
10.1016/S1567-4223(03)00024-3
6. Customs & Excise Division Dominica: Role of Customs (2022). http://www.
customs.gov.dm/index.php/about-customs/role-of-customs. Accessed 20 Mar
2023
7. Dan, V.: Empirical and Nonempirical Methods, pp. 1–3. Wiley (2017). https://
doi.org/10.1002/9781118901731.iecrm0083
8. Dash, G., Paul, J.: CB-SEM vs PLS-SEM methods for research in social sciences
and technology forecasting. Technol. Forecast. Soc. Chang. 173, 121092 (2021).
https://doi.org/10.1016/j.techfore.2021.121092
9. Deutsche Post Group DHL: Customs, duties & taxes (2023). https://www.
dhl.com/discover/en-my/e-commerce/e-commerce-advice/ddu-vs-ddp-for-
ecommerce-business. Accessed 17 Mar 2023
10. Fang, L., Shi, X., Hu, H.: Analysis of influencing factors and location prediction
of dry port based on logit model. J. Chongqing Jiaotong Univ. 5, 31 (2012)
11. Gaskell, T.: The process of empirical research: a learning experience?
Res. Post-Compulsory Educ. 5(3), 349–360 (2000). https://doi.org/10.1080/
13596740000200084
12. Hair, J., Black, W., Babin, B.: Multivariate Data Analysis: A Global Perspective.
Global Edition, Pearson Education (2010)
13. Hair, J., Hult, G.T.M., Ringle, C., Sarstedt, M., Danks, N., Ray, S.: Partial Least
Squares Structural Equation Modeling (PLS-SEM) Using R: AWorkbook. Springer,
Heidelberg (2021). https://doi.org/10.1007/978-3-030-80519-7
14. Hair Jr., J.F., Matthews, L.M., Matthews, R.L., Sarstedt, M.: PLS-SEM or CB-
SEM: updated guidelines on which method to use. Int. J. Multivariate Data Anal.
1, 107–123 (2017). https://doi.org/10.1504/IJMDA.2017.087624
100 Y. Liu and X. Shi
15. Hazarika, B.B., Mousavi, R.: Review of cross-border e-commerce and directions for
future research. J. Glob. Inf. Manag. 30(2), 1–18 (2021). https://doi.org/10.4018/
JGIM.20220301.oa1
16. Hellier, P., Geursen, G., Carr, R., Rickard, J.: Customer repurchase intention: a
general structural equation model. Eur. J. Mark. 37, 1762–1800 (2003). https://
doi.org/10.1108/03090560310495456
17. Hu, B., Luo, Q.: Cross-border e-commerce mode based on Internet +. In: IOP
Conference Series: Materials Science and Engineering, vol. 394, no. 5, p. 052014
(2018). https://doi.org/10.1088/1757-899X/394/5/052014
18. Kaiser, U., Kesler, R., Reisinger, M.: The interplay between product variety and
customer satisfaction: theory and evidence (2023). https://doi.org/10.2139/ssrn.
4022124. Accessed 09 Apr 2023
19. Kawa, A., Zdrenka, W.: Conception of integrator in cross-border E-commerce.
LogForum 12, 63–73 (2016). https://doi.org/10.17270/J.LOG.2016.1.6
20. Khan, M.K., Nawaz, M., Ishaq, M.I., Tariq, M.I.: Product versus service: old myths
versus new realities. J. Basic Appl. Sci. Res. 4(1), 15–20 (2014)
21. Kline, R.: Principles and Practice of Structural Equation Modeling. Methodology
in the Social Sciences, 4th edn. Guilford, London (2015)
22. Lei, F., Lei, S., Liu, J., Wu, L.: Studying customer behavior of shopping on e-
commerce platforms from the perspective of college students. In: 2020 International
Conference on E-Commerce and Internet Technology (ECIT), pp. 41–44 (2020).
https://doi.org/10.1109/ECIT50008.2020.00017
23. Luo, H., Liang, Y., Wu, Z., Liu, Z., Shu, B., Luo, Y.: Research on the impact
of cross-border e-commerce platform quality on customer cross-buying intention.
In: 16th International Conference on Service Systems and Service Management
(ICSSSM), pp. 1–6 (2019). https://doi.org/10.1109/ICSSSM.2019.8887725
24. Luo, H., Zhang, X., Huang, J., Su, F., Huang, W., Luo, Y.: An empirical study on
customer repurchase willingness of fresh e-commerce products based on customer
perceived value. In: 16th International Conference on Service Systems and Service
Management (ICSSSM), pp. 1–6 (2019). https://doi.org/10.1109/ICSSSM.2019.
8887610
25. Mai, Y., Wen, Z.: Exploratory structural equation modeling (ESEM): an integra-
tion of EFA and CFA. Adv. Psychol. Sci. 21(5), 934 (2013). https://doi.org/10.
3724/SP.J.1042.2013.00934
26. Matthew, D., Hellianto, G.R., Putra, N.S., Sundjaja, A.M.: The effect of monthly
promotion, gamification, user interface usability & attractiveness on the market-
place repurchase intention. In: International Conference on Informatics, Multime-
dia, Cyber and Information System, pp. 193–199 (2021). https://doi.org/10.1109/
ICIMCIS53775.2021.9699114
27. Morin, A.J.S., Marsh, H.W., Nagengas, B.: Supplementary materials for: Chapter
10. Exploratory structural equation modeling (2013). https://www.statmodel.
com/download/Chapter%2010%20-%20ESEM-%20supplements%2020131.pdf.
Accessed 11 Feb 2023
28. Mou, J., Cui, Y., Kurcz, K.: Trust, risk and alternative website quality in B-Buyer
acceptance of cross-border e-commerce. J. Glob. Inf. Manag. 28, 167–188 (2020).
https://doi.org/10.4018/JGIM.2020010109
29. Muthén, L.K., Muthén, B.O.: MPlus version 8 user’s guide. Muthén & Muthén
(2017)
30. Nurdani, Y., Sandhyaduhita, P.I.: Impact of express delivery service quality
towards repurchase intention by B2C and C2C: a case of Indonesia. In: Inter-
Customer’s Choice in Cross-Border E-Commerce 101
47. Tousypanah, H., Shi, X., Viohl, B., Hoffmann, J.: Analysis of influencing factors
and location prediction of dry port based on logit model. UNCTAD Transp. Newsl.
43, 6–10 (2009)
48. Tóth-Király, I., Bõthe, B., Rigó, A., Orosz, G.: An illustration of the exploratory
structural equation modeling (ESEM) framework on the passion scale. Front. Psy-
chol. 8 (2017). https://doi.org/10.3389/fpsyg.2017.01968
49. Valdez-Juárez, L.E., Gallardo-Vázquez, D., Ramos-Escobar, E.A.: Online buyers
and open innovation: security, experience, and satisfaction. J. Open Innov.: Tech-
nol. Mark. Complexity 7(1), 37 (2021). https://doi.org/10.3390/joitmc7010037
50. Voß, S., Mejia, G., Voß, A.: Mystery shopping in public transport: the case of
bus station design. In: Stephanidis, C., Marcus, A., Rosenzweig, E., Rau, P.-L.P.,
Moallem, A., Rauterberg, M. (eds.) HCII 2020. LNCS, vol. 12423, pp. 527–542.
Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60114-0 36
51. Yu, Y.W., Chiu, S.C., Chumpoothepa, S., Chen, J.K.: Factors influencing online
repurchase intention: the case of clothing customers in Thailand. In: Proceedings
of PICMET 2014 Conference: Portland International Center for Management of
Engineering and Technology; Infrastructure and Service Integration, pp. 658–669
(2014)
Maritime Shipping
Towards a Deep Reinforcement Learning
Model of Master Bay Stowage Planning
1 Introduction
In the past century, maritime transport has become the backbone of global trade
and modern consumerism. Many transported goods are shipped by container ves-
sels of liner shipping companies. To ensure timely arrivals and resource-efficient
operations, these liner companies use stowage planning to allocate containers to
vessel slots at each port of the voyage. The goal is to maximize vessel utilization
and minimize operational costs by creating robust stowage plans. This is an NP-
hard task due to (hatch-)overstowage [18,35], which is further complicated by
the problem size (20,000 Twenty Foot Equivalent Unit (TEU) vessels visiting at
least 10 ports) and combinatorial aspects as container dimensions, seaworthiness
and stowage regulations, demand uncertainty and planning best practices.
Several contributions have been unable to directly solve their problem for-
mulation (e.g., [8,23]). Consequently, it is suggested to hierarchically decompose
the stowage problem into master bay and slot planning (e.g., [28,37]), which is
further explained in Sect. 3. Even though plenty of contributions tried, a scalable
algorithm for a representative decomposed problem is yet to be found.
This paper will provide a proof of concept for a novel application of deep rein-
forcement learning (DRL) to solve the master bay planning problem (MBPP)
This work is partially sponsored by the Danish Maritime Fund under grant 2021-069.
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 105–121, 2023.
https://doi.org/10.1007/978-3-031-43612-3_6
106 J. van Twiller et al.
as a Markov decision process (MDP). To the best of our knowledge, the only
stowage contributions involving reinforcement learning (RL) relax many of the
complex combinatorial aspects [33,38]. Furthermore, DRL is used to solve opti-
mization problems similar to MBPP [14,17,22,24]. Hence, we believe that DRL
implementations can contribute to solving the stowage planning problem.
In our work, we model an episodic MDP that maximizes vessel utilization
and minimizes hatch-overstowage for equal-sized cargo with two weight classes,
while satisfying demand and location capacity, as well as ensuring longitudinal
and vertical stability. We refer to Sect. 4 for details on the problem. This is not a
full-featured MBPP, but rather a non-trivial problem to provide us with a proof
of concept. Each episode generates a Gaussian equivalent to the Mixed instances
by [4]. We have implemented a proximal policy optimization (PPO) architecture
that learns an actor and critic network to find a policy that can efficiently solve
the MDP. This architecture is compared against an equivalent mixed integer
program (MIP) to evaluate performance.
Our experiments show that PPO can learn a policy that optimizes the objec-
tive function on a limited training budget. Subsequently, the policy can be gener-
alized to efficiently find reasonable solutions for a non-trivial MBPP in a fraction
of the MIP runtime on limited hardware. Thus, we have provided preliminary
evidence for the potential of DRL in stowage planning.
The remainder of this paper is structured as follows: Sect. 2 introduces the
domain of container vessel stowage planning. Section 3 outlines related work,
and Sect. 4 defines the MBPP with a MIP formulation. In Sect. 5, some RL
preliminaries are provided, while our MDP and PPO architecture are described
in Sect. 6. Finally, Sect. 7 compares the results of our PPO architecture with a
MIP solver, and Sect. 8 concludes the main findings of this study.
and 30 tons, depending on the load. Note that empty containers also need to be
redistributed from time to time. Any container has an origin named port of load
(POL), and a destination called the port of discharge (POD). Furthermore, cargo
types can also differ, as Specials include reefers, IMDG that carry dangerous
goods, and OOG containers that deviate from the typical cargo dimensions. To
ensure safety, IMDGs are often separated from other cargo types.
Above and below-deck locations are separated by hatch covers, and thus the
cover must be removed to reach below-deck cargo in any bay. If cargo destined for
future ports is still placed on the cover, cranes must remove and reload (restow)
the on-deck containers that overstow the below-deck cargo. This is referred to
as hatch overstowage. Note that minimizing hatch overstowage is an NP-hard
task, which prevents us from finding an optimal solution in polynomial time [35].
Equivalently, stack overstowage can also occur if stacks are not in ascending order
of PODs from top to bottom. Furthermore, vessels must be seaworthy according
to safety regulations. In general, the cargo must be distributed along the vessel
to ensure acceptable levels of stability and stress forces [18].
3 Related Work
In essence, container vessel stowage planning is a multi-port problem, which
aims to balance a myriad of combinatorial aspects [18]. The field can roughly be
subdivided into single-port work to create light-weight operational stowage plans
(e.g., [2,3,10,23]), and multi-port work to generate realistic stowage plans (e.g.,
[4,8,28,30,31,37]). As demonstrated by [8,23], an exact and optimal solution to
the multi-port problem is yet to be found. Consequently, [37] suggested a hierar-
chical decomposition into master bay planning to allocate groups of containers to
general locations on the vessel (e.g., [7,9,26]), and slot planning to subsequently
allocate containers to slots in locations (e.g., [19,21,29]). For a comprehensive
description of hierarchical decomposition, we refer to [18]. Recently, heuristic
frameworks have also gained traction as an alternative to hierarchical decompo-
sition (e.g., [23,27]).
108 J. van Twiller et al.
s.t. xb,d
τ,k ≤ qτ,k ∀p ∈ P, τ ∈ T OB (p), k ∈ K (3)
b∈B d∈D
xb,d
τ,k ≤ cb,d ∀p ∈ P, b ∈ B, d ∈ D (4)
k∈K τ ∈T OB (p)
xb,d
(p,j),k ≤ M zb,p ∀p ∈ P, b ∈ B, d ∈ DH (5)
k∈K j∈P :j>p
xb,d
(i,p),k ≤ M zb,p ∀p ∈ P, b ∈ B, d ∈ DH (6)
k∈K i∈P :i<p
xb,d
τ,k − M (1 − zb,p ) ≤ yb,p ∀p ∈ P, b ∈ B, d ∈ DO (7)
k∈K τ ∈T ROB (p)
The primary decision variable of our MIP model is xb,d (i,j),k ∈ R≥0 , which is
the number of TEU from port i to j of class k planned to be stowed in bay b
and deck d. The secondary decision variable yb,p ∈ R≥0 is the number of TEU
that causes hatch overstowage in bay b at port p, while the last decision variable
zb,p ∈ {0, 1} indicates whether the hatch is opened in bay b at port p. Equation 2
shows that the MIP aims to maximize vessel utilization and minimize hatch
overstowage, where f1 is the gain per loaded TEU and f2 is the cost per hatch
overstow in TEU. Equation 3 limits the total vessel utilization by demand qτ,k ,
110 J. van Twiller et al.
while Eq. 4 limits the onboard cargo by the capacity volume cb,d . Equation 5
and 6 connects stowed containers xb,d(i,j),k to hatch moves zb,p using the big M
notation, whereas Eq. 7 computes hatch restows based on whether the cargo is
placed on the to-be-opened hatch. Let us define LCG(p) = LM (p)/TW (p) and
VCG(p) = VM (p)/TW (p)
at port p, and their optimal equivalents LCG ∗ =
∗
b LP b /|B| and V CG = d VP d /|D|. Equations 8 and 9 limit the difference
between the actual and optimal LCG with parameter υ. Similarly, Eqs. 10 and
11 limit the difference between the actual and optimal VCG with parameter
υ. Note that Eqs. 8–11 are linear transformations of |LCG(p) − LCG ∗ | < υ
and |VCG(p) − VCG ∗ | < υ. Equations 12–14 define the total weight TW (p),
longitudinal moment LM (p), and vertical moment VM (p). Notice that LP b and
VP d refer to the longitudinal and vertical position of locations respectively.
Ts Ts−t
t
J = Eπ γ rt (15) V (st ) = Eπ k
γ rt+k |s = st (16)
t=0 k=0
Ts−t
Q(st , at ) = Eπ k
γ rt+k |s = st , a = at (17)
k=0
V (s) = max R(s, a) + γ P(s |s, a)V (s ) (18)
a
s
Instead, one could consider learning from experience using model-free meth-
ods, often suffering from bias. For example, Monte Carlo methods perform trial-
and-error simulations, whereas temporal difference learning obtains approximate
functions with bootstrapping [34]. Another way is through DRL, which approx-
imates good solutions with parameterized functions in large state and action
spaces (e.g., policy πθ ). The policy gradient theorem states that the derivative
of the expected return is proportional to the terms in Eq. 19, where ψπ (st ) is the
on-policy probability of state st . Doing so, πθ can be learned by updating θ with
respect to expected return J [34]. Due to large policy updates, gradient-based
methods can suffer from variance. Anybody unfamiliar with RL is suggested to
read more on the topic in [34].
∇J(θ) ∝ Eπ ψπ (st ) ln ∇πθ (at |st )Qπ (st , at ) (19)
st at
⎧ b,d
⎪
⎨uτ,k (t) if τ ∈ T OB (p) \ {τt }
ub,d (t + 1) = lkb,d (t) if τ = τt ∀b ∈ B, d ∈ D, τ ∈ T, k ∈ K (20)
τ,k
⎪
⎩
0 otherwise
Since agents struggle with sparsity [13], a constant reward signal is neces-
sary for each action. Equation 21 defines the reward of satisfying demand with
reward coefficient f1 and input τt = (p, j ) for current port p and future port
j . A penalty is incurred with coefficient f3 if the agent ships more cargo than
demanded. Note that f3 > f1 holds to enforce feasibility.
f1 b,d,k τ ∈T OB (p) ub,d b,d uτt ,k ≤ qτt ,k
b,d
τ,k , if
rτDS
t
= ∀k ∈ K (21)
−f3 b,d,k τ ∈T OB (p) ub,d
τ,k , otherwise
Equation 26 defines a reward function R(st , at ) for each step t, where the
above-mentioned terms are summed. However, a final term penalizes unsatisfied
demand by including the upper bound of rτDSt
with parameter f1 . Hence, we
maximize a reward function with an upper bound of 0.
r(st , at ) = rτDS
t
+ rτLS
t
+ rτVS
t
+ rτCS
t
+ rτHO
t
− f1 qτ,k (26)
k,τ ∈T OB (p)
Fig. 2. Overview of master bay planning MDP with colours corresponding to PODs.
Let us define the loss function with respect to ω as the mean squared error
between the estimated value function and the sum of discounted rewards in
Eq. 29. Given the estimated advantage Ât , Eq. 30 defines a clipped loss func-
tion with respect to θ, where pt (θ) = ππθ θ (a(at |s t)
t |st )
is the importance sampling
old
ratio between the current and previous policy distribution. The clipping limit
prevents disruptive changes to θ. In Eq. 31, both functions are combined with
entropy regularization, where cf 1 and cf 2 are coefficients and S[πθ ](st ) is the
entropy term that promotes exploration based on πθ and st . Hence, Eq. 31 is
maximized to update θ and thereby policy πθ .
2
VF
t
L (ω) = Êt Vω (st ) − t
γ r(st , at ) (29)
i=0
LCLIP (θ) = Êt min(pt (θ)Ât , clip(pt (θ), 1 − , 1 + )Ât ) (30)
LCLIP +V F (θ) = Êt LCLIP (θ) − cf 1 LV F (ω) + cf 2 S[πθ ](st ) (31)
7 Results
Table 1. Evaluation of PPO and MIP on two sets of 100 test instances generated from
Gaus-MBPP and Unif-MBPP. Results are expressed in the mean average and standard
deviation metrics for the objective value (Obj.) and runtime in seconds (Time). The
number of feasible solutions (#) is given without std. This also holds for the optimality
gap (Gap) that is computed relative to the optimal MIP objective.
8 Conclusion
This paper presents a DRL approach towards solving the MBPP in container
vessel stowage planning. In particular, we introduce a MDP equivalent to a MIP
model with NP-hard combinatorial aspects, as well as suggest PPO to solve the
MDP. These preliminary experiments show that PPO learns to optimize the
objective value during training on limited hardware, after which its policy can
be generalized to efficiently find reasonable solutions for test instances. Hence,
we have provided a proof of concept for applying PPO to the MBPP.
In future work, we will extend the environment to become a full-featured
MBPP with a representative demand simulator. The algorithm will also be
improved to increase performance and deal with more complex problems. Since
voyages are inherently sequential, we could implement temporal dynamic poli-
cies using recurrent neural networks (e.g., long short-term memory networks).
Considering the cellular shape of container vessels, we might also leverage graph
representation learning in our architecture.
Towards Deep Reinforcement Learning for Master Bay Planning 119
References
1. Akiba, T., Sano, S., Yanase, T., Ohta, T., Koyama, M.: Optuna: a next-generation
hyperparameter optimization framework (2019). http://arxiv.org/abs/1907.10902
2. Ambrosino, D., Anghinolfi, D., Paolucci, M., Sciomachen, A.: An experimental
comparison of different heuristics for the master bay plan problem. In: Festa,
P. (ed.) SEA 2010. LNCS, vol. 6049, pp. 314–325. Springer, Heidelberg (2010).
https://doi.org/10.1007/978-3-642-13193-6 27
3. Ambrosino, D., Sciomachen, A., Tanfani, E.: Stowing a containership: the master
bay plan problem. Transp. Res. Part A 38(2), 81–99 (2004). https://doi.org/10.
1016/j.tra.2003.09.002, http://linkinghub.elsevier.com/retrieve/pii/S09658564030
00892, http://www.sciencedirect.com/science/article/pii/S0965856403000892
4. Avriel, M., Penn, M., Shpirer, N., Witteboon, S.: Stowage planning for container
ships to reduce the number of shifts. Ann. Oper. Res. 76(1-4), 55–71 (1998).
http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=18925445&
site=ehost-live, http://link.springer.com/article/10.1023/A:1018956823693
5. Azevedo, A., Cassilda Maria, R., de Sena, G.J., Chaves, A.A., Neto, L.L.S., Moretti,
A.C.: Solving the 3D container ship loading planning problem by representation by
rules and meta-heuristics. Int. J. Data Anal. Tech. Strategies 6(3), 228–260 (2014).
https://doi.org/10.1504/IJDATS.2014.063060
6. Bergstra, J., Bardenet, R., Bengio, Y., Kégl, B.: Algorithms for hyper-parameter
optimization. In: NIPS 2011: Proceedings of the 24th International Con-
ference on Neural Information Processing Systems, pp. 2546–2554 (2011).
https://proceedings.neurips.cc/paper files/paper/2011/file/86e8f7ab32cfd12577bc
2619bc635690-Paper.pdf
7. Bilican, M.S., Evren, R., Karatas, M.: A mathematical model and two-stage heuris-
tic for the container stowage planning problem with stability parameters. IEEE
Access 8, 113392–113413 (2020). https://doi.org/10.1109/ACCESS.2020.3003557
8. Botter, R., Brinati, M.: Stowage container planning: a model for getting an optimal
solution. Comput. Appl. Autom. Shipyard Oper. Ship Design VII(C), 217–228
(1992)
9. Chao, S.L., Lin, P.H.: Minimizing overstowage in master bay plans of large con-
tainer ships. Maritime Econ. Logist. 23(1), 71–93 (2021). https://doi.org/10.1057/
s41278-019-00126-6
10. Delgado, A., Jensen, R.M., Guilbert, N.: A placement heuristic for a commercial
decision support system for container vessel stowage. In: 38th Latin America Con-
ference on Informatics, CLEI 2012 - Conference Proceedings (2012). https://doi.
org/10.1109/CLEI.2012.6427181
11. Ding, D., Chou, M.C.: Stowage planning for container ships: a heuristic algo-
rithm to reduce the number of shifts. Eur. J. Oper. Res. (2015). https://
doi.org/10.1016/j.ejor.2015.03.044, http://linkinghub.elsevier.com/retrieve/pii/
S0377221715002660
12. Dubrovsky, O., Levitin, G., Penn, M.: A genetic algorithm with a compact solu-
tion encoding for the container ship stowage problem. J. Heuristics 8(6), 585–
599 (2002). https://doi.org/10.1023/A:1020373709350, http://link.springer.com/
article/10.1023/A:1020373709350
13. Engstrom, L., et al.: Implementation matters in deep policy gradients: a case study
on PPO and TRPO (2020). http://arxiv.org/abs/2005.12729
14. Foa, S., Coppola, C., Grani, G., Palagi, L.: Solving the vehicle routing problem
with deep reinforcement learning (2022). http://arxiv.org/abs/2208.00202
120 J. van Twiller et al.
15. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press (2016). http://
www.deeplearningbook.org
16. Hamedi, M.: Containership load planning with crane operations. Ph.D. thesis, Uni-
versity of Maryland (2011)
17. Hottung, A., Kwon, Y.D., Tierney, K.: Efficient active search for combinatorial
optimization problems. In: International Conference on Learning Representations
2022 (2021). http://arxiv.org/abs/2106.05126
18. Jensen, R.M., Pacino, D., Ajspur, M.L., Vesterdal, C.: Container vessel stowage
planning. Weilbach (2018)
19. Kebedow, K.G., Oppen, J.: Including containers with dangerous goods in the slot
planning problem. In: 2019 Proceedings of the International Conference on Indus-
trial Engineering and Operations Management, pp. 225–232 (2019)
20. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: 2015 Inter-
national Conference on Learning Representations (2014). http://arxiv.org/abs/
1412.6980
21. Korach, A., Brouer, B.D., Jensen, R.M.: Matheuristics for slot planning of con-
tainer vessel bays. Eur. J. Oper. Res. 282(3), 873–885 (2020). https://doi.org/10.
1016/j.ejor.2019.09.042
22. Kwon, Y.D., Choo, J., Kim, B., Yoon, I., Gwon, Y., Min, S.: POMO: policy opti-
mization with multiple optima for reinforcement learning (2020). https://doi.org/
10.48550/ARXIV.2010.16011, https://arxiv.org/abs/2010.16011
23. Larsen, R., Pacino, D.: A heuristic and a benchmark for the stowage planning
problem. Maritime Econ. Logist. 23(1), 94–122 (2021). https://doi.org/10.1057/
s41278-020-00172-5
24. Mirhoseini, A., et al.: A graph placement methodology for fast chip design. Nature
594(7862), 207–212 (2021). https://doi.org/10.1038/s41586-021-03544-w
25. Mnih, V., et al.: Playing Atari with Deep Reinforcement Learning (2013)
26. Pacino, D.: An LNS approach for container stowage multi-port master planning.
In: Computational Logistics, pp. 35–44 (2013). https://doi.org/10.1007/978-3-642-
41019-2 3, http://link.springer.com/10.1007/978-3-642-41019-2 3
27. Pacino, D.: Crane intensity and block stowage strategies in stowage planning. In:
Cerulli, R., Raiconi, A., Voß, S. (eds.) ICCL 2018. LNCS, vol. 11184, pp. 191–206.
Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00898-7 12
28. Pacino, D., Delgado, A., Jensen, R.M., Bebbington, T.: Fast generation of near-
optimal plans for eco-efficient stowage of large container vessels. In: Böse, J.W., Hu,
H., Jahn, C., Shi, X., Stahlbock, R., Voß, S. (eds.) ICCL 2011. LNCS, vol. 6971, pp.
286–301. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-24264-
9 22
29. Pacino, D., Jensen, R.M.: A 3-phase randomized constraint based local
search algorithm for stowing under deck locations of container ves-
sel bays (2010). https://en.itu.dk/-/media/EN/Research/About-ITU-Research/
Technical-Reports/2010/ITU-TR-2010-123-pdf
30. Parreño-Torres, C., Çalık, H., Alvarez-Valdes, R., Ruiz, R.: Solving the generalized
multi-port container stowage planning problem by a matheuristic algorithm. Com-
put. Oper. Res. 133, 105383 (2021). https://doi.org/10.1016/j.cor.2021.105383
31. Roberti, R., Pacino, D.: A decomposition method for finding optimal container
stowage plans. Transp. Sci. 52(6), 1444–1462 (2018). https://doi.org/10.1287/trsc.
2017.0795
32. Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy
optimization algorithms (2017). http://arxiv.org/abs/1707.06347
Towards Deep Reinforcement Learning for Master Bay Planning 121
33. Shen, Y., Zhao, N., Xia, M., Du, X.: A deep Q-learning network for ship stowage
planning problem. Pol. Maritime Res. 24(S3), 102–109 (2017). https://doi.org/10.
1515/pomr-2017-0111
34. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press,
Cambridge (2018)
35. Tierney, K., Pacino, D., Jensen, R.M.: On the complexity of container stowage
planning problems. Discrete Appl. Math. 169, 225–230 (2014). https://doi.org/10.
1016/j.dam.2014.01.005, http://dx.doi.org/10.1016/j.dam.2014.01.005
36. van Twiller, J., Sivertsen, A., Pacino, D., Jensen, R.M.: Literature survey on the
container stowage planning problem. Under review (2023)
37. Wilson, I.D., Roach, P.A.: Container stowage planning: a methodology for generat-
ing computerised solutions. J. Oper. Res. Soc. 51(11), 1248–1255 (2000). https://
doi.org/10.1057/palgrave.jors.2601022, https://www.tandfonline.com/doi/full/10.
1057/palgrave.jors.2601022
38. Zhao, N., Guo, Y., Xiang, T., Xia, M., Shen, Y., Mi, C.: Container ship stowage
based on Monte Carlo tree search. J. Coastal Res. 83, 540–547 (2018). https://
doi.org/10.2112/SI83-090.1
39. Zhu, H., Ji, M., Guo, W.: Integer linear programming models for the containership
stowage problem. Math. Probl. Eng. 2020 (2020). https://doi.org/10.1155/2020/
4382745
The Dynamic RORO Stowage Planning
Problem
1 Introduction
The interest in reducing Green House Gas (GHG) emissions within the ship-
ping sector has greatly increased over the past few years [17]. Governments and
organizations are adopting new targets for reducing GHG emissions. The Inter-
national Maritime Organization’s (IMO) target is to reduce GHG emissions by
50% from shipping by 2050 compared to the GHG levels in 2008 [2]. The GHG
emissions from shipping depend on various parameters, i.e., ship size, weight,
hull shape, ballast water management, and sailing speed [17]. As the shipping
industry is experiencing a general increase in demand, it is paramount to develop
new strategies and technologies for the industry [2,6].
One of these shipping industries is Roll-on/Roll-off (RORO) shipping. RORO
shipping focuses on the transportation of wheeled cargo such as semi-trailers,
cars, buses, and farm equipment. RORO ships have the advantage of being able
This work is partly funded by the Innovation Fund Denmark (IFD) under File No.
0177-00022B.
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 122–138, 2023.
https://doi.org/10.1007/978-3-031-43612-3_7
The Dynamic RORO Stowage Planning Problem 123
2 Related Work
Initial research regarding the optimization of stowage plans for RORO shipping
utilized a lane layout. The deck would be divided into a set of strips; whereafter
the cargo would be stowed in a FILO queue. A mathematical model was devel-
oped to maximize revenue gained by stowing cargo in FILO lanes. The model
would define the lanes’ length, width, and height and simultaneously ensure
ship stability [12]. The mathematical model could not solve large instances, so
a heuristic approach was applied to large real-life instances. The authors also
studied optimizing the routing and scheduling of RORO vessels [13].
124 A. R. Main et al.
A different approach was later proposed, where the lane layout was replaced
by a grid, which allowed for sideways movement of cargo. A mathematical model
was proposed to assign positions of cargo on board. Several objectives were
utilized for finding good positioning, including grouping constraints of cargo
to minimize shifting. The solutions structure was evaluated by calculating the
shortest path for cargo to exit the vessel. The complexity of the model, however,
would limit its use in real-life instances [5]. Later a shortest path heuristic for
evaluating the stowage plan was presented [3].
Following the grid layout’s introduction, research focused on increasing the
performance of using MIP formulations for stowage planning and maximizing
revenue. The discretization of the vessel layout was optimized to reduce the
need for a very fine grid representation. Furthermore, refrigerated cargo and
hazardous cargo were considered during the stowage process. The need for shift-
ing of cargo was removed by computing the shortest path from a position to an
exit and utilizing inequality constraints to force a non-shift policy. The math-
ematical model also includes ship stability; however, the model was shown to
solve medium-sized problems well but not large instances [14].
An Adaptive Large Neighborhood Search was presented to minimize the num-
ber of cargo shifts. The study introduced a mathematical model to optimize
cargo assignment on board and showed that it was incapable of solving large
instances. This method showed that a ship could be stowed efficiently for each
deck by minimizing the number of shifts. The model utilizes Dijkstra’s algorithm
to evaluate how many shifts are generated by a given stowage plan [4].
The grid layout increases the computational complexity. A third approach to
the deck layout is to use slots. The slots represent specific positions on the deck,
and cargo can be assigned to these positions. To reduce the GHG emission by
shipping cargo, the assignment of cargo could be used to minimize the amount
of ballast water needed to stabilize the ship, thus reducing dead weight [8] and
thereby fuel consumption. The authors also conducted studies to estimate cargo
discharge times [9] and optimize the discharging process by dual cycling [10].
Drawing on the prior research within RORO stowage planning, the study
of this paper focused on the interconnection between maximizing revenue and
GHG emissions. A grid representation was utilized to describe the shifting and
cargo movement better. Ballast tanks and stability constraints were not included.
However, dynamic cargo arrival times were added to analyze the relationship
between time spent stowing the vessel, fuel consumption, and revenue.
The following mathematical model is an extension of the work done by [3]. The
original model has been modified to include the arrivals of each individual piece
of cargo. Furthermore, the objective function has been modified to be a weighted
function between fuel consumption and revenue. The weight α can be set to how
important it is for shippers to reduce GHG emissions vs. transporting cargo.
The function for fuel consumption is assumed to be non-linear with respect to
The Dynamic RORO Stowage Planning Problem 125
sailing speed. The consumption of fuel has previously been described as cubic
[15]. For the mathematical model, the function is linearized by discretizing it
into W points, where B(v) is the fuel consumption, v is the speed, v ∗ is the
design speed, and B(v ∗ ) is the design consumption.
v 3
B(v) = B(v ∗ ) (1)
v∗
A predefined route for port calls is assumed to be available, resulting in a
set of ports P . The ports are assumed to be separated into a loading region
P L and an unloading region P U . Each piece of cargo can be picked up at its
loading port PcL and unloaded at its unloading port PcU . In the original model,
each piece of cargo c ∈ C consisted of several vehicles of the same width, length,
and weight. However, in the following model, each piece of cargo is assumed to
have a single vehicle. This allows for a better description of the individual arrival
times of each cargo. The inter-arrival times of the cargo were assumed to follow
an exponential distribution.
The deck layout of the ship follows a grid representation of I rows and J
columns resulting in (i, j) pairs of locations. Each piece of cargo is modeled as
a rectangle. The positioning of a piece of cargo follows the cargo’s lower left
corner. Due to cargo width and height, a piece of cargo will span over several
grid points. The grid layout contains restricted areas where, e.g., pillars, ramps,
and walls are located.
If a piece of cargo is shifted, it is removed from the vessel and returned to its
original position. The model guarantees that the ship makes it in time for the
next port. The ship can, therefore, in order not to experience tardiness regulate
sailing speed.
Table 1. Sets, Parameters, and Decision Variables for the mathematical model
Sets
C Set of cargoes
P Set of ports
I Set of rows
J Set of columns
N Set of squares where cargo can be placed N ⊆ I × J
Nc Set of squares where the lower left corner of cargo c ∈ C can be
placed, Nc ⊆ N
Qijc The set of squares where placing cargo c ∈ C would cover position
(i, j)
Dijcd If cargo d ∈ C is in position (i, j). The set contains all squares that
would initiate a shift of cargo d if cargo c ∈ C uses these squares
(continued)
126 A. R. Main et al.
Table 1. (continued)
Sets
L
Dijcd Same as Dijcd , however, specifically for when cargo is being loaded
individually and sequentially
Bij Set of neighboring squares for position (i, j)
PL Set of loading ports, P L ⊆ P
PU Set of unloading ports, P U ⊆ P
PcL Set of loading ports p ∈ P L for cargo c ∈ C
PcU Set of unloading ports p ∈ P U for cargo c ∈ C
P LU Set of ports where shifting can occur P LU = P \{1, |P |}
PcLU Set of ports where shifting can occur for cargo
c ∈ C, P LU = P \{1, |P |}
CpR Set of cargo to be loaded or unloaded at port p ∈ P
CpN Set of cargo not to be loaded or unloaded at port p ∈ P
CpRL Set of cargo to be loaded at port p ∈ P including p = 1
W Set of discrete points for nonlinear fuel consumption discretization
EI , EJ Set of exit squares for row and column pair (I, J).
Parameters
α Parameter for choosing the importance level between fuel
consumption and revenue
Mdp Big M
Rc Number of vehicles in cargo c ∈ C
hc Handling time of cargo c ∈ C
Acp Arrival time of cargo c ∈ C at port p ∈ P
Tpmean Average time to sail between ports p, p + 1 ∈ P
Tpslow Time to sail between ports p, p + 1 ∈ P at the slowest vessel speed
possible
Tpf ast Time to sail between ports p, p + 1 ∈ P at the fastest vessel speed
possible
C rev Revenue gain for transporting a piece of cargo
C f uel Fuel cost per ton
Cwcon
Coefficient of fuel consumption at discrete point w ∈ W
time
Cpw Coefficient of time to sail at discrete point w ∈ W at port p ∈ P
(continued)
The Dynamic RORO Stowage Planning Problem 127
Table 1. (continued)
Sets
Decision Variables
xijc 1 if the lower left corner of a vehicle from cargo c ∈ C is placed in
position (i, j), 0 otherwise
ycdp 1 if cargo c ∈ C is stowed before cargo d ∈ C in port p ∈ P L , 0
otherwise
δijcp 1 if cargo c is shifted at position (i, j) in port p
dijcp Supply or demand variable for cargo c at position (i, j) in port p. If
dijcp > 0, then position (i, j) is a supply square. If dijcp < 0, it is a
demand square. If dijcp = 0, it is a transit square
fijklcp Flow of cargo c from position (i, j) to position (k, l) at port p.
Bp Fuel consumption per time unit between port p and p + 1
STcp Start time of loading of cargo c ∈ C at port p ∈ PcL
T T Sp Time to sail between port p and p + 1 ensuring
Tpf ast ≤ T T Sp ≤ Tpslow
λwp Used to select the discrete point w ∈ W of the nonlinear fuel
consumption function between port p and p + 1
Deptp Departure time at port p ∈ P
The notation for the mathematical model is described in Table 1, and the
formulation is as follows.
min α C f uel Bp Tpmean − (1 − α) C rev xijc (2)
p∈P LU c∈C (i,j)∈Nc
Subject to:
xijc ≤ Rc ∀c ∈ C (3)
(i,j)∈Nc
xi j c ≤ 1 ∀(i, j) ∈ N (4)
(i ,j )∈Qijc
in position (i, j), which is the chosen spot for cargo c. Constraints (8)–(9) state
the demand and supply for cargo c in port p similar to (6)–(7). Constraints 10
ensure that the outflow minus the inflow must be equal to the demand/supply in
the position (i, j) for each cargo c in port p. Constraint (11) ensure that if a piece
of cargo that is being loaded or unloaded in port p uses a position (k, l) from a
stationary piece of cargo d, it would result in the shifting of cargo d. Constraints
(12) ensure that the start time plus handling time for stowing cargo c in port p is
less than the starting time of cargo d if we stow cargo c before cargo d. Constraints
(13) ensure a sequence of stowing one piece of cargo c after another d in port p.
Constraints (14) ensure that the start time for stowing cargo c in port p must be
greater than or equal to the arrival time of cargo c if at all chosen to be stowed.
Constraints (15) ensure that if position (i, j) is chosen for cargo c, all other pieces
of cargo stowed after cannot cross the covered positions by cargo c. Constraints
(16) ensure that the departure time must be greater than the starting time of cargo
c plus the handling time of cargo c. Constraints (17) calculate the fuel consumption
for a given λwp value chosen. Constraints (18) calculate the time to sail between
port p and p + 1 given a λwp -value chosen. Constraints 19 ensure only one point
on the discretized non-linear fuel consumption curve can be picked. Constraints
(20) calculate the time to sail between ports p and p + 1. The departure time is
given to be between the time it takes to sail at the fastest and slowest speed. The
departure time is penalized if shifting of cargo occurs. Constraints (21) ensure that
the ship leaves at the latest time possible. The remaining constraints (22)–(30) are
for variable declarations.
4 A Heuristic Approach
To represent the deck layout without having an over-representation of the spaces
taken up by cargo, a grid size of 150 × 400 is needed [4]. Since the MIP formu-
lation is not able to solve the problem even for toy-size instances, we present a
heuristic approach. The heavy computations required to evaluate a single solu-
tion indicate the need to use strongly guided solution methods that do not
require many iterations. We propose a heuristic procedure where a pool of can-
didate solutions is iteratively improved by a truncated local search.
The pseudo-code for the heuristic is presented in Algorithm 1. The algorithm
encodes the solution as a sequence of cargo loads. A deterministic procedure then
decodes the sequence by assigning specific positions to the cargo. The initial
solution is constructed by a sequence where cargo is sorted in descending order
with regard to their leg length (line 1). The cargo positions are generated by
the deterministic decoding heuristic_placement (described later). An initial
pool of solutions is created in line 2. For the first iteration of the heuristic, the
pool only contains a single solution; however, later, it will represent the set of
current solutions. Out of all the solutions in the pool of current solutions, only
the best β solutions are selected for further processing (line 4). The next step
of the algorithm is to generate a new pool of solutions (lines 13–15). For each
of the best solutions (sols), η truncated local search procedures are executed.
Each of the found solutions is added to the new pool of current solutions.
130 A. R. Main et al.
The local search procedure performs κ swaps in the stowage sequence. From
the heuristic placement, it is known which piece of cargo blocks most other cargo.
The most blocking cargo is swapped to a new random position within the stowage
sequence. Using the heuristic placement again, the new sequence is evaluated. If
the solution is not better, the second worst piece of cargo is tried swapped. If
the solution is better, a new, better stowage sequence has been discovered, and
the most blocking cargoes have therefore changed. The local search continues
until a total of κ swaps have been performed.
During the search, all solutions are evaluated using an approximation. Given
a cargo stowage sequence, the placement heuristic assigns cargo to a specific
position iteratively. For each piece of cargo, a set of available positions is calcu-
lated (starting from the top-right corner of the cargo). A position is not available
if any of the grid positions, potentially occupied by the cargo, are already occu-
pied. This is calculated for the entire leg the cargo will be on board. Figure 1
shows an example set of available positions in green. Notice how only positions in
direct contact with the vessel structure or other placed cargo are available. Each
available position is evaluated based on its potential impact on the objective
function. Positions towards the bow and starboard are prioritized. Calculating
shifting costs is computationally heavy; we approximate the evaluation consid-
ering cargo c ∈ C blocking cargo d ∈ C iff PdL > PcL ∨ PdU < PcU looking from
cargo c towards the bow.
The Dynamic RORO Stowage Planning Problem 131
5 Computational Study
The instances were available online [4] and were changed to fit the new problem
formulation in relation to cargo only containing a single vehicle and adding
arrival times of cargo. The inter-arrival times of the cargo were assumed to
follow an exponential distribution with μ = 60, and it was assumed that 20%
of cargo was available to stow immediately. The distances between ports were
generated by selecting routes between ports within the European RORO market.
The fuel costs were estimated through available bunker prices [16]. The revenue
generated for a piece of cargo was selected by analyzing pricing for spot cargo in
the RORO market. Due to a lack of better data, fuel consumption was estimated
based on the smallest vessel from LinerLib [1]. The grid resolutions used range
from 75 × 200 to 150 × 400 and are described as large instances. The number of
port calls ranges from 6–10, and the number of cargo units is app. 200–300.
The study was conducted on an Intel R
Xeon R
Gold6226R processor. The
mathematical model was solved for toy instances with a grid resolution of 10 × 10
up to 35 × 35. The number of cargoes was 77–193. The MIP was given 4 h to
perform precomputation of needed sets and model generation and given 1 h to
solve the problem. The problem was solved using Gurobi 9.1.0 and four processor
cores. It was not possible to generate solutions for the instances within 1 h for
grid resolutions as low as 10 × 10 and 77 pieces of cargo. The heuristic was tested
on several large problem instances. For each of these problem instances, several
different arrival patterns were generated. Furthermore, the algorithm was tested
several times for each instance with different seeds to evaluate the algorithm’s
performance. The heuristic was implemented in Julia 1.6.1 and proved capable
of generating stowage plans.
132 A. R. Main et al.
Fig. 2. The average objective function plotted for different α-values along with the
95% confidence interval.
The initial analysis showed that increasing the α-value will result in less
cargo being stowed. This is intuitively supported by the nature of the objective
function. Increasing the α-value will emphasize fuel consumption (and thereby
GHG emissions) more than revenue. However, the variance increases as α goes to
1, as can be seen in Fig. 2. Instances generally differ regarding the number of port
calls and the different port sequences. The timing becomes more pronounced as
the cargo stowed does not dominate the objective function. If we look at the
amount of cargo stowed in Fig. 3 and the number of shifts occurring in the
stowage plan in Fig. 4, the number decreases with increasing α-values. This also
holds true for the variance. However, this is expected as the variance must follow
when less cargo is stowed.
We aimed to investigate how the heuristic behaved if the coefficients of the
objective function changed. The objective function was affected by the cargo’s
revenue, the fuel cost, and indirectly by the handling time of the cargo. The han-
dling time impacted the vessel’s departure time and, thereby, fuel consumption.
From the initial analysis, we can conclude that the algorithm is of most interest
if 0.00 ≤ α ≤ 0.30. The most cargo was stowed in this interval, so a sensitivity
analysis is conducted within this range.
The Dynamic RORO Stowage Planning Problem 133
Fig. 3. The average number of stowed cargo plotted for different α-values along with
the 95% confidence interval.
Fig. 4. The average number of shifts incurred in the stowage plan plotted for different
α-values along with the 95% confidence interval.
An initial baseline was established for the sensitivity analysis. The problems
were solved for a handling time of 2 min per cargo, a revenue of 270 euros
per cargo, and a fuel consumption cost of 520 euros per ton. A baseline was
generated for each α-value. For the baseline, the mean and standard deviation
were calculated across the instances for how many units of cargo were stowed
on board. The sensitivity analysis used the same setup as the one used for
the computational study above. The first analysis was conducted where the
handling time was kept fixed, whereafter the other parameters varied slightly.
The percentage change in the number of cargo stowed changed significantly when
revenue and fuel costs fluctuated, as shown in Fig. 5. For an α-value of 0.0, there
is very little change in the amount of cargo stowed. However, as α increases, the
variance becomes significantly more noticeable. Lowering the revenue results in
134 A. R. Main et al.
Fig. 5. The percentage change in the average number of cargoes stowed and the 95%
confidence intervals for fixed handling time.
Fixing the fuel costs at the baseline level will allow the handling time to
fluctuate. The heuristic was highly susceptible to increased handling time, as
seen in Fig. 6. An increase in handling time of a single minute resulted in a
significant drop in stowed cargo irrespective of the α-value compared to the
baseline. The susceptibility of the algorithm is most likely connected to the
stowage sequence and the number of cargoes stowed. If the stowage list is badly
ordered in relation to cargo arrival, a lot of time will be spent waiting for cargo.
This is only exacerbated by potential cargo shifts. The time penalty for each
shift is double the handling time. This results in the algorithm being penalized
heavily in relation to fuel consumption incurred. This also holds true for an α-
value of 0.0. No direct cost is incurred at this alpha value; however, shifts result
in time being spent not stowing cargo. Thereby less cargo could end up being
stowed as the ship is forced to leave port.
The Dynamic RORO Stowage Planning Problem 135
Fig. 6. The percentage change in the average number of cargoes stowed and the 95%
confidence intervals for fixed fuel costs.
136 A. R. Main et al.
Fig. 7. The percentage change in the average number of cargoes stowed and the 95%
confidence intervals for fixed revenue.
6 Conclusion
This study of the dynamic RORO stowage planning problem focused on cargo
arriving at the port over time. The cargo had to be assigned a position on
board the vessel. Sailing between ports, the stowage plan should consider the
fuel consumption incurred from waiting for arriving cargo, and the vessel could
leave early to reduce costs. Furthermore, the number of cargo re-handling could
be minimized to reduce time spent stowing the ship. A MIP formulation was
presented; however, it could not solve even toy-sized instances of the problem,
so a heuristic was applied instead. The heuristic proved effective in generating
feasible stowage plans. The heuristic showed a clear sensitivity to changes in
coefficients for revenue and fuel consumption costs. However, the algorithm was
highly sensitive to the handling time of each piece of cargo on board the vessel.
Given how sensitive the solutions are to changes in the parameters, it might
be advisable to study different cost structures and to perform a deeper analysis
of alternative objectives as seen in [18]. Furthermore, an investigation into a
different problem representation could prove advantageous in relation to the
algorithm’s performance.
The Dynamic RORO Stowage Planning Problem 137
References
1. Brouer, B.D., Alvarez, J.F., Plum, C.E.M., Pisinger, D., Sigurd, M.M.: A base
integer programming model and benchmark suite for liner-shipping network design.
Transp. Sci. 48(2), 281–312 (2014). https://doi.org/10.1287/trsc.2013.0471
2. Faber, J., et al.: Fourth IMO GHG study 2020. Technical report, International
Maritime Organization, London (2021)
3. Hansen, J.R., Fagerholt, K., Stålhane, M.: A shortest path heuristic for evaluat-
ing the quality of stowage plans in roll-on roll-off liner shipping. In: Bektaş, T.,
Coniglio, S., Martinez-Sykora, A., Voß, S. (eds.) ICCL 2017. LNCS, vol. 10572, pp.
351–365. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-68496-3 24
4. Hansen, J.R., Fagerholt, K., Stålhane, M., Rakke, J.G.: An adaptive large neigh-
borhood search heuristic for the planar storage location assignment problem: appli-
cation to stowage planning for Roll-on Roll-off ships. J. Heuristics 26(6), 885–912
(2020). https://doi.org/10.1007/s10732-020-09451-z
5. Hansen, J.R., Hukkelberg, I., Fagerholt, K., Stålhane, M., Rakke, J.G.: 2D-packing
with an application to stowage in roll-on roll-off liner shipping. In: Paias, A., Ruth-
mair, M., Voß, S. (eds.) ICCL 2016. LNCS, vol. 9855, pp. 35–49. Springer, Cham
(2016). https://doi.org/10.1007/978-3-319-44896-1 3
6. IEA: International Shipping. Technical report, International Energy Agency, Paris
(2022). https://www.iea.org/reports/international-shipping. Accessed 29 Apr 2023
7. Jia, B., Fagerholt, K.: Step-wise stowage planning of roll-on roll-off ships trans-
porting dangerous goods. Maritime Transp. Res. 2(June), 100029 (2021). https://
doi.org/10.1016/j.martra.2021.100029
8. Jia, B., Fagerholt, K., Reinhardt, L.B., Rytter, N.G.M.: Stowage planning with
optimal ballast water. In: Lalla-Ruiz, E., Mes, M., Voß, S. (eds.) ICCL 2020. LNCS,
vol. 12433, pp. 84–100. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-
59747-4 6
9. Jia, B., Rytter, N.G.M., Reinhardt, L.B., Haulot, G., Billesø, M.B.: Estimating
discharge time of cargo units – a case of Ro-Ro shipping. In: Paternina-Arboleda,
C., Voß, S. (eds.) ICCL 2019. LNCS, vol. 11756, pp. 122–135. Springer, Cham
(2019). https://doi.org/10.1007/978-3-030-31140-7 8
10. Jia, B., Tierney, K., Reinhardt, L.B., Pahl, J.: Optimal dual cycling operations in
roll-on roll-off terminals. Transp. Res. Part E: Logist. Transp. Rev. 159(February),
102646 (2022). https://doi.org/10.1016/j.tre.2022.102646
11. Mattfield, D.C.: The Management of Transshipment Terminals, 1st edn. Springer,
New York (2006). https://doi.org/10.1007/0-387-30854-7
12. Øvstebø, B.O., Hvattum, L.M., Fagerholt, K.: Optimization of stowage plans for
RoRo ships. Comput. Oper. Res. 38(10), 1425–1434 (2011). https://doi.org/10.
1016/j.cor.2011.01.004
13. Øvstebø, B.O., Hvattum, L.M., Fagerholt, K.: Routing and scheduling of RoRo
ships with stowage constraints. Transp. Res. Part C: Emerg. Technol. 19(6), 1225–
1242 (2011). https://doi.org/10.1016/j.trc.2011.02.001
14. Puisa, R.: Optimal stowage on Ro-Ro decks for efficiency and safety. J. Marine Eng.
Technol. 20(1), 17–33 (2018). https://doi.org/10.1080/20464177.2018.1516942
15. Reinhardt, L.B., Plum, C.E., Pisinger, D., Sigurd, M.M., Vial, G.T.: The liner
shipping berth scheduling problem with transit times. Transp. Res. Part E: Logist.
Transp. Rev. 86, 116–128 (2016). https://doi.org/10.1016/j.tre.2015.12.006
16. Ship & Bunker: EMEA Average (2023). https://shipandbunker.com/prices/av/
global/av-g20-global-20-ports-average#VLSFO. Accessed 29 Apr 2023
138 A. R. Main et al.
1 Introduction
Shore side electricity (SSE) has been given great attention to significantly reduce
at-berth emissions from vessels. For instance, as of 2023, the Port of Los Angeles
invested in 79 SSE supply points and requires all container, cruise, and reefer
vessels to use SSE or an approved emission control technology during berthing
(POLA, 2023). By 2021, the average coverage rate of SSE at 11 major coastal
ports in China is about 71.5%, and vessels that have berthing durations longer
than three hours are required to use SSE (CAA, 2022). The European Com-
mission aims at reducing net greenhouse gas emissions by at least 55% by 2030
compared to 1990 levels; for instance, vessels larger than 5,000 gross tonnages
berthing at European ports for more than two hours would be required to use
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 139–153, 2023.
https://doi.org/10.1007/978-3-031-43612-3_8
140 J. Yu et al.
SSE as of January 2030 (see the “Fit for 55” package (EP, 2023)). In 2021,
the ports of Rotterdam, Hamburg, Bremen, Antwerp, and Haropa agreed to a
memorandum of understanding (MOU) to provide SSE to ultra-large container
vessels, 2028 at the latest (POA, 2021).
More specifically, the Port of Hamburg contribution manifests in the provi-
sion of eight SSE supply points for container vessels (HPA, 2022). From an eco-
nomic perspective, Williamsson et al. (2022) note that one of the obstacles to the
adoption of SSE is the huge initial investment in SSE supply points. For example,
the cost of constructing one SSE supply point at the Port of Hamburg is about
nine million EUR (HPM, 2019). The correct allocation of SSE supply points to
terminals is crucial to maximize the SSE consumption, and thus, to maximize
the environmental benefit with a limited investment. Consequently, allocation
plans should be scrutinized by using suitable methods; however, we deem the
existing methods having some drawbacks. Our literature review outlines that
most methods are lacking consideration of the temporal overlap of arriving ves-
sels (neglecting the berth allocation problem (BAP)); this potentially reduces
the accuracy concerning past assessment periods. Moreover, existing methods do
not consider terminal-assigned port calls for a terminal group with the shared
objective to maximize the overall environmental benefit; this requires separate
applications of methods for each port or terminal, as well as further analysis to
find the best allocation that maximizes the overall environmental benefit.
Consequently, this paper integrates the shared investment and shared goal by
terminals to maximize the overall environmental benefit by maximizing the SSE
consumption. Different scenarios are explored, which concern a variable number
of SSE supply points and SSE-enabled vessels. The objective of this paper is
(1) to optimize the allocation of SSE supply points among container terminals
under different scenarios with respect to an uncertain future, and (2) to provide
the environmental benefit of this allocation. The aim of this paper is to adapt
the methodological support for SSE adoption to fit a grouped terminal setting
and to investigate the current plan for the Port of Hamburg. Toward this aim,
an optimization model and a solution approach using a genetic algorithm (GA)
are implemented and the obtained allocations are discussed. To the best of our
knowledge, this paper is the first to focus on the shared investment and shared
goal of a terminal group. The case study concerns the three container terminals
operated by the Hamburger Hafen und Logistik AG (HHLA) at the Port of
Hamburg.
The remainder of this paper is structured as follows. Section 2 presents a review
of relevant studies. Section 3 then describes the optimization model. In Sect. 4, the
developed optimization model is applied in a case study followed by a presentation
of the results and discussions. At last, conclusions are summarized in Sect. 5.
2 Literature Review
Table 1 summarizes the most recent literature concerning the adoption of SSE in
maritime ports. The first research branch, assessment, primarily estimates the
Allocation of Shore Side Electricity 141
environmental benefit of using SSE for ports for a certain time horizon. Herein,
we present the building blocks that are changed for different future scenarios. For
instance, Lathwal et al. (2021) estimate the environmental benefits of using SSE
with varying electricity generation structures, such as reducing the share of coal
generation and increasing the share of renewable generation in 2030 compared
to 2017. This leads to different emission factors. Most works in this branch
assume all vessels to be SSE-enabled and SSE-served. Contrary, Dai et al. (2019)
experiment with the number of SSE supply points for ten consecutive years. The
authors calculate the SSE consumption by multiplying the increasing adoption
rate of SSE with the total energy consumption of vessels during berthing. The
adoption rate is a simple fraction between the total number of berths and the
number of SSE supply points. Nguyen et al. (2021) feature two scenarios based
on two shipping routes, each of which has a different number of SSE-enabled
vessels. Gore et al. (2023) consider different numbers of SSE-enabled vessels
based on their calling frequency and make assumptions on the subsequently
required number of SSE supply points. In summary, the building blocks that
are changed for different future scenarios comprise varying emission factors, the
number of SSE-enabled vessels, the number of SSE supply points, as well as the
annual energy demand at berths.
The second branch, optimization, in a nutshell, yields the number of SSE
supply points to be installed to abate air emissions. Unlike the works in the
assessment branch, only the emission factors are subject to change to build dif-
ferent scenarios. Peng et al. (2019, 2021) optimize which specific berths of one
terminal should be equipped with SSE supply points to minimize the cost and
minimize the CO2 emissions, providing a trade-off analysis. Similarly, Vaishnav
et al. (2016) maximize inter alia the economic benefits. In contrast to the assess-
ment works, Peng et al. (2019, 2021) do not change the number of SSE-enabled
vessels for scenarios. To address the question of which berths should be equipped
with SSE supply points, it is useful to know which berths are actually used the
most. Insights to more detailed operational data may be helpful, such as those
data obtained by solving the berth allocation problem for vessels. Berths with
the longest utilization rate, and occupied by large vessels with a higher energy
demand are to be preferred. The work by Peng et al. (2021) solves the BAP.
The authors assume all vessels to be SSE-enabled, arrival times to be based on a
probability distribution, their berthing durations to be based on averaged histor-
ical vessel size, and the berth utilization to be based on the case study port. The
use of averaged historical data is commonly observed in strategic-oriented works
as can be seen in Table 1, and applies to both the optimization and assessment
branches. The takeaway is that the works in the optimization branch do not
consider a varying number of SSE-enabled vessels for different scenarios. This
cannot reflect current or near-future scenarios. Taken together, the references
foci are either concerned only with a single terminal or a group of ports. We
deem it necessary to address the rate of SSE-enabled vessels, as well as a new
scope of application, i.e., a group of terminals. We justify this grouping with the
case of the Port of Hamburg where HHLA operates three terminals whose berths
142
Table 1. Summary of literature regarding the investment of SSE supply points in ports
Empty bullet points (◦) signal varying parameters leading to multiple scenarios, whereas filled bullet points (•) signal fixed parameters.
143
144 J. Yu et al.
are planned to be equipped with SSE supply points. Moreover, the use of vessels’
actual arrival times and berthing durations (port call data) is instrumental in
increasing the accuracy of the results, at least for an a posteriori analysis. For
future scenarios with uncertain port call data, an a priori analysis can be fed
with adapted/generated port call data, e.g., increasing the number of port calls.
Based on the literature review, we consider the following aspects to build
various scenarios: The number of SSE-enabled vessels and the number of SSE
supply points. Furthermore, we solve a BAP using a varying number of SSE-
enabled vessels together with the use of vessels’ actual arrival times and berthing
durations. The aim is to identify the number of berths that should be equipped
with SSE supply points, i.e., to optimize the allocation of SSE supply points
among the three terminals of HHLA in our case study.
3 Optimization Model
In this section, we develop an optimization model to allocate SSE supply points
to different terminals, and to solve the BAP, with the objective to maximize
the total SSE consumption of all vessels. Each vessel can only berth at its indi-
vidually defined target terminal. The set of terminals can be appended, imagin-
ably even spanning multiple ports. The BAP is simplified by using the historic
berthing start and departure time as parameters. Let us introduce the assump-
tions, input sets and parameters, as well as the decision variables:
Assumptions
Sets
Parameters
Decision variables
Subject to
yijg = 1 ∀j ∈ Vg , ∀g ∈ G (2)
i∈Bg
SavedEmissions = C SSE EFkf uel − EFkSSE ∀k ∈ K (5)
T otalEmissions = Wj EFkf uel − SavedEmissions ∀k ∈ K (6)
j∈V
4 Case Study
The case study is based on the container terminals operated by HHLA in the
Port of Hamburg. Those terminals comprise the Container Terminal Altenwerder
(CTA), the Container Terminal Burchardkai (CTB), and the Container Terminal
Tollerort (CTT) with four, ten, and four berths, respectively. In the aforemen-
tioned MOU of five large ports in Europe (including the Port of Hamburg), it
is stated that “[Article 2] The Parties agree that all berths at container termi-
nals serving, on a regular basis, Ultra Large Container vessels (ULCV) with a
length overall (LOA) above 366 m, will be subject to this MoU [...] OPS [onshore
power supply] installations, as referred to in Article 2, must be operational by
1 January 2028 at the latest” (POA, 2021). The plan of the Port of Hamburg is
to allocate five SSE supply points to HHLA’s container terminals by 2024, i.e.,
CTA (1), CTB (3), and CTT (1) (HPA, 2022).
146 J. Yu et al.
The dataset includes the entire port calls made at those terminals during Decem-
ber 2021. Those accumulate to 473 port calls by 135 unique vessels. The dataset
has been obtained on 27 January 2022 from HHLA (2020).
For each port call, the vessel name, vessel type (seagoing vessel, feeder),
scheduled berthing start time, departure time, and target terminal are available.
Based on the vessel name, the deadweight tonnage (DWT) and vessel length
in meters are collected from vesselfinder.com. The power of a vessel’s auxiliary
engines is estimated as P aux = 0.22 × P main , where P main is the power of
the main engines estimated by the DWT regression equation, i.e., P main =
2.4351×DW T 0.8673 (Wan et al., 2020). Based on some collected sample data, we
adjust the regression equation by multiplying a correction factor of 23 . The power
of the auxiliary engines is used to estimate the energy consumption of the vessel
during berthing by Eq. (4). The load factor, λaux , is highly correlated with the
port and its determination would constitute a separate project (see, e.g., efforts
by the Port of Los Angeles to express a nonlinear relationship between the load
factor and capacity, where container vessels are binned by twenty-foot equivalent
unit (TEU) (POLA, 2022)); here, we simplify and set λaux as 0.1. Emission
factors are used to assess the environmental benefit of using SSE (UBA, 2022)
compared to using auxiliary engines with low sulphur marine gas oil (LSMGO)
(POLA, 2022) and are shown in Table 2.
We use two aspects to develop a series of scenarios: the number of SSE sup-
ply points and the number of SSE-enabled vessels. Increasing the number of
SSE supply points enables the exploration of the trade-offs between investments
and environmental benefits. With a maximum of nine SSE supply points to be
allocated among the three terminals, 150 allocations exist (φig ≥ φ(i+1)g , ∀i ∈
Bg , ∀g ∈ G). This is sufficient to provide some solution space under and above
the planned allocation of five SSE supply points by the Port of Hamburg.
The number of SSE-enabled vessels is uncertain and beyond the decision
scope of the port. It is reasonable to suspect that larger container vessels are
prioritized for being equipped with SSE in the initial build or for retrofitting.
Hence, DWT classes are suitable to increase the number of SSE-enabled vessels
for different scenarios. The inspection of the extended port call data shows that
Allocation of Shore Side Electricity 147
all vessels having a length larger than 366 m fall in the class of vessels ≥140,000
DWT. This class represents the type of vessels that the Port of Hamburg aims
at providing SSE first. In total, three DWT classes are defined to provide some
solution space for future developments, i.e., ≥200,000 DWT, ≥140,000 DWT,
and ≥100,000 DWT. The vessels in the three DWT classes are subsequently
treated as SSE-enabled vessels. Figure 1 provides the distribution of port calls
at each terminal in three DWT-intervals: [100,000, 140,000) DWT, [140,000,
200,000) DWT, and [200,000, 250,000) DWT.
Parameter Values
Maximum number of 300
generations
Population 100, 120
Crossover probability 0.8
Mutation probability 0.01
Crossover function scattered, intermediate
Mutation function adaptive feasible
148 J. Yu et al.
4.3 Application
The presented optimization model, presented in Sect. 3, is solved using the MAT-
LAB GA optimization toolbox. The performance of the GA is largely influ-
enced by its parameters. The parameter settings of the GA are listed in Table 3,
which are based on initial experiments with respect to the scenarios developed
in Sect. 4.2.
Based on the 150 allocations and the three DWT classes (i.e., ≥200,000 DWT,
≥140,000 DWT, and ≥100,000 DWT) described in Sect. 4.2, two population
values, two crossover functions, and three GA execution repetitions to increase
the probability to find better solutions, there are 5,400 solutions obtained.
For each scenario, the best solution is selected and shown in Fig. 2. The objective
value represents the SSE consumption of SSE-enabled vessels. Table 4 presents
the two best solutions for each scenario in more detail. Moreover, the best solu-
tion that adheres to the MOU-based constraint is shown. The MOU text provides
some room for interpretation concerning when a berth regularly serves vessels in
class ≥140,000 DWT: “The Parties agree that all berths at container terminals
serving, on a regular basis, Ultra Large Container vessels (ULCV) with a length
overall (LOA) above 366 m [140,000 DWT], will be subject to this MoU” (POA,
2021). The MOU could imply that each terminal should be allocated at least one
SSE supply point (MOU-based constraint), to be able to serve vessels of class
≥140,000 DWT at all terminals.
For the class ≥200,000 DWT, three supply points are sufficient to meet the
energy demand of all SSE-enabled vessels. The best solution is to allocate two
Table 4. Allocation of SSE supply points for different scenarios
Scenario SSE allocation SSE SOx (t) NOx (t) PM2.5 (t) PM10 (t) CO2 (kt)
Class No. CTA CTB CTT (GWh) T S % T S % T S % T S % T S %
≥0 0 0 0 0 n/a 1.97 n/a n/a 9.39 n/a n/a 0.80 n/a n/a 0.90 n/a n/a 3.27 n/a n/a
≥200 1 0 1 0 0.50 1.85 0.12 6.13 8.57 0.82 8.74 0.72 0.08 10.50 0.81 0.09 10.13 3.11 0.16 4.90
≥200 1 0 0 1 0.33 1.89 0.08 4.11 8.84 0.55 5.86 0.75 0.06 7.05 0.83 0.06 6.80 3.16 0.11 3.29
≥200 2 0 1 1 0.83 1.77 0.20 10.24 8.02 1.37 14.60 0.66 0.14 17.55 0.74 0.15 16.92 3.00 0.27 8.19
≥200 2 0 2 0 0.62 1.82 0.15 7.65 8.36 1.02 10.91 0.70 0.11 13.12 0.78 0.11 12.65 3.07 0.20 6.12
≥200 3 0 2 1 0.96 1.74 0.23 11.76 7.81 1.57 16.77 0.64 0.16 20.16 0.72 0.17 19.44 2.96 0.31 9.41
≥200 3 0 1 2 0.83 1.77 0.20 10.24 8.02 1.37 14.60 0.66 0.14 17.55 0.74 0.15 16.92 3.00 0.27 8.19
≥200 3 1 1 1 0.83 1.77 0.20 10.24 8.02 1.37 14.60 0.66 0.14 17.55 0.74 0.15 16.92 3.00 0.27 8.19
≥140 1 0 1 0 0.85 1.76 0.21 10.42 7.99 1.39 14.85 0.66 0.14 17.85 0.74 0.15 17.21 2.99 0.27 8.33
≥140 1 0 0 1 0.71 1.80 0.17 8.75 8.21 1.17 12.48 0.68 0.12 15.00 0.77 0.13 14.47 3.04 0.23 7.00
≥140 2 0 1 1 1.56 1.59 0.38 19.17 6.82 2.57 27.33 0.54 0.26 32.86 0.61 0.28 31.68 2.77 0.50 15.32
≥140 2 0 2 0 1.54 1.60 0.37 18.89 6.86 2.53 26.93 0.54 0.26 32.38 0.62 0.28 31.22 2.77 0.49 15.10
≥140 3 0 2 1 2.26 1.42 0.55 27.77 5.67 3.72 39.60 0.42 0.38 47.60 0.48 0.41 45.90 2.54 0.73 22.20
≥140 3 0 3 0 1.74 1.55 0.42 21.38 6.52 2.86 30.49 0.51 0.29 36.65 0.58 0.32 35.34 2.71 0.56 17.09
≥140 3 1 1 1 1.58 1.59 0.38 19.37 6.79 2.59 27.61 0.54 0.27 33.20 0.61 0.29 32.01 2.76 0.51 15.48
≥140 4 0 3 1 2.52 1.36 0.61 30.94 5.25 4.14 44.11 0.38 0.43 53.03 0.44 0.46 51.13 2.46 0.81 24.73
≥140 4 0 2 2 2.28 1.42 0.55 27.99 5.64 3.75 39.91 0.42 0.38 47.98 0.48 0.41 46.26 2.54 0.73 22.38
≥140 4 1 2 1 2.28 1.42 0.55 27.97 5.64 3.74 39.88 0.42 0.38 47.94 0.48 0.41 46.23 2.54 0.73 22.36
≥140 5 0 4 1 2.54 1.35 0.61 31.24 5.21 4.18 44.54 0.37 0.43 53.55 0.43 0.46 51.63 2.45 0.82 24.97
≥140 5 1 3 1 2.53 1.36 0.61 31.11 5.22 4.16 44.36 0.37 0.43 53.33 0.44 0.46 51.42 2.45 0.81 24.87
≥140 6 0 4 2 2.56 1.35 0.62 31.46 5.18 4.21 44.85 0.37 0.43 53.92 0.43 0.47 51.99 2.44 0.82 25.15
≥140 6 1 4 1 2.56 1.35 0.62 31.44 5.18 4.21 44.82 0.37 0.43 53.89 0.43 0.47 51.96 2.45 0.82 25.13
≥140 7 1 4 2 2.58 1.35 0.62 31.65 5.15 4.24 45.13 0.37 0.44 54.26 0.43 0.47 52.32 2.44 0.83 25.31
≥140 7 0 4 3 2.56 1.35 0.62 31.46 5.18 4.21 44.85 0.37 0.43 53.92 0.43 0.47 51.99 2.44 0.82 25.15
≥100 1 0 1 0 0.85 1.76 0.21 10.48 7.98 1.40 14.94 0.66 0.14 17.96 0.74 0.16 17.32 2.99 0.27 8.38
≥100 1 0 0 1 0.71 1.80 0.17 8.75 8.21 1.17 12.48 0.68 0.12 15.00 0.77 0.13 14.47 3.04 0.23 7.00
≥100 2 0 1 1 1.56 1.59 0.38 19.17 6.82 2.57 27.33 0.54 0.26 32.86 0.61 0.28 31.68 2.77 0.50 15.32
≥100 2 0 2 0 1.55 1.59 0.37 19.02 6.84 2.55 27.12 0.54 0.26 32.60 0.61 0.28 31.43 2.77 0.50 15.21
≥100 3 0 2 1 2.20 1.44 0.53 27.03 5.77 3.62 38.54 0.43 0.37 46.33 0.50 0.40 44.68 2.56 0.71 21.61
≥100 3 1 2 0 1.85 1.52 0.45 22.78 6.34 3.05 32.49 0.49 0.31 39.05 0.56 0.34 37.65 2.67 0.59 18.21
≥100 4 0 3 1 2.55 1.35 0.62 31.33 5.19 4.19 44.68 0.37 0.43 53.71 0.43 0.46 51.79 2.45 0.82 25.05
≥100 4 1 2 1 2.37 1.39 0.57 29.14 5.49 3.90 41.56 0.40 0.40 49.96 0.46 0.43 48.17 2.51 0.76 23.30
≥100 5 1 3 1 2.78 1.30 0.67 34.22 4.81 4.58 48.79 0.33 0.47 58.65 0.39 0.51 56.55 2.37 0.89 27.36
≥100 5 2 2 1 2.66 1.32 0.64 32.74 5.00 4.38 46.69 0.35 0.45 56.12 0.41 0.48 54.11 2.41 0.86 26.18
≥100 6 2 3 1 3.00 1.24 0.73 36.91 4.45 4.94 52.64 0.29 0.51 63.28 0.35 0.55 61.01 2.30 0.96 29.51
Allocation of Shore Side Electricity
≥100 6 1 4 1 2.90 1.27 0.70 35.59 4.62 4.76 50.75 0.31 0.49 61.01 0.37 0.53 58.83 2.34 0.93 28.46
≥100 7 3 3 1 3.20 1.20 0.77 39.28 4.13 5.26 56.01 0.26 0.54 67.33 0.31 0.58 64.93 2.24 1.03 31.41
≥100 7 2 4 1 3.13 1.21 0.76 38.52 4.23 5.15 54.92 0.27 0.53 66.03 0.33 0.57 63.66 2.26 1.01 30.80
• Total amount of emissions in the current scenario (T), i.e., emissions by all port calls ≥0k DWT. Saved emissions in the current scenario, absolute (S), percent (%)
compared to the baseline scenario.
149
• The baseline scenario requires vessels to use LSMGO only (first row). • The SSE consumption is the objective value (SSE).
150 J. Yu et al.
SSE supply points to CTB and one to CTT, i.e., CTA (0), CTB (2), CTT (1),
leading to 0.96 GWh of SSE consumption. Generally, this allocation correlates
to the distribution of port call DWT-classes at each terminal as shown in Fig. 1.
As mentioned before, the class ≥140,000 DWT covers the prioritized vessels
based on the MOU. Based on Fig. 2, four supply points appear to be sufficient,
and Table 4 confirms a marginal performance difference between the best allo-
cations using four and five SSE supply points. The best allocation using four is
CTA (0), CTB (3), CTT (1), 2.52 GWh, as well as CTA (0), CTB (4), CTT (1),
2.54 GWh, using five. This best allocation using five supply points is slightly bet-
ter compared to the allocation planned by the Port of Hamburg, i.e., CTA (1),
CTB (3), CTT (1) with 2.53 GWh of SSE consumption.
Concerning the class ≥100,000 DWT, up to seven SSE supply points can be
exhausted, leading to 3.20 GWh of SSE consumption. Notably, in this class, the
allocation CTA (1), CTB (3), CTT (1), leading to 2.78 GWh, as planned by the
Port of Hamburg, is the best for allocating five supply points.
Negative implications for the MOU-based constraint are exhibited, only if
less than five supply points are allocated. More concretely, for the class ≥200,000
DWT and allocating three supply points, the only feasible allocation is CTA (1),
CTB (1), CTT (1) that decreases SSE consumption by 0.13 GWh compared to
the best allocation, i.e., CTA (0), CTB (2), CTT (1). For the classes ≥140,000
DWT and ≥100,000 DWT, more significant implications are observed with three
and four supply points. Regarding ≥140,000 DWT, for instance, the SSE con-
sumption is reduced by 0.68 GWh and 0.24 GWh.
In summary, Fig. 2 shows SSE supply points and SSE-enabled vessels to cor-
relate positively with the consumed SSE until the energy demand is fully met.
The energy demand by port calls in class ≥140,000 DWT is almost satisfied
by four supply points. Allocating more than four supply points yields a much
lower cost-effectiveness, e.g., an increase from four to five supply points leads
to only 0.02 GWh more SSE consumption, while an increase from three to four
yields 0.26 GWh. However, in the advent that class ≥100,000 DWT vessels are
SSE-enabled, further increasing the supply points to seven is beneficial.
Let us describe the limitations of this study. First, the results are limited by
the data set of one month. Future studies could include annual data sets. Sec-
ond, this study assumes vessels to use SSE during the entire berthing duration.
However, only a partial consumption may take place with respect to time of
usage energy pricing (Yu et al., 2022), or insufficient power supply due to energy
production or infrastructure limitations. The third limitation is the heuristic
approach that cannot guarantee to find the best solution for a BAP that could
further increase the SSE consumption and therefore change the ranking of the
presented allocation plans. Fourth, the BAP in this paper is simplified by using
the historic scheduled berthing start and departure time of our case study termi-
nals that have no SSE supply points yet. If the port had SSE supply points, the
times might have been adjusted in favor of SSE consumption. Fifth, the auxiliary
engines’ power and load factor are based on simplified methods where obtaining
precise values requires a case study per port. Moreover, the emission factors could
Allocation of Shore Side Electricity 151
be implemented to change over the assessed time period. At last, the potential
use of scrubber and heavy fuel oil is neglected, leading to uncertainty of the
presented SOx emissions. At the end of 2020, about 14% of the container vessel
world fleet had been equipped with scrubbers (ICCT, 2021). The SSE imple-
mentation performance depends on the SSE allocation plan and other factors
such as the BAP plan, electricity pricing, incentives, and the number of SSE-
enabled vessels. An emissions inventory, such as presented in Table 4, thus can
only be regarded a composite index, a proxy measure for the SSE implementa-
tion performance. However, emissions inventory methodologies and key figures
can be standardized (Cammin and Voß, 2021), and subsequently be utilized to
establish sensible goals, i.e., specific, measurable, achievable, relevant, and time-
bound (SMART) goals (Drucker, 1954). A component of a SMART goal could
be the saved emissions relative to the baseline emissions, as presented in Table 4.
So far, SMART goals incorporating such quantifications of emissions have not
been declared in the MOU. Thus, rethinking the design of goals could benefit
future evaluations of SSE implementations.
5 Conclusions
In this study, we addressed the SSE supply point allocation problem. More
specifically, we implemented an optimization model for a terminal group with
the shared objective to maximize the environmental benefit by using SSE while
solving the BAP. We applied our model to a case study concerning the largest
container terminal group in the Port of Hamburg under a series of scenarios with
varying numbers of SSE supply points (from one to nine) and SSE-enabled ves-
sels (classes ≥200,000 DWT, ≥140,000 DWT, and ≥100,000 DWT). Based on
the results, we discussed the trade-offs between the environmental benefit and
the allocation of SSE supply points. For instance, the energy demand of vessels
in classes ≥200,000 DWT, ≥140,000 DWT, and ≥100,000 DWT are almost sat-
isfied by three, four, and seven supply points, respectively. With a total number
of five supply points, as planned by the Port of Hamburg, the best plan enables
0.96 GWh, 2.54 GWh, and 2.78 GWh of SSE consumption, for vessels in class
≥200,000 DWT, ≥140,000 DWT, and ≥100,000 DWT, respectively. We believe
our work can spark discussions revolving the intelligent use of funds, concerning
terminal groups, and future port cooperations. Future studies could extend this
study by considering the entirety of terminals of the ports of Rotterdam, Ham-
burg, Bremen, Antwerp, and Haropa that are subject to the MOU to promote
SSE, and to expose the implications to local air quality.
References
CAA. Evaluation of Air and Climate Synergies of Typical Chinese Ports. Clean Air Asia
(2022). http://allaboutair.cn/uploads/soft/230308/1-23030Q34007.pdf. Accessed 15
Apr 2023
152 J. Yu et al.
Cammin, P., Voß, S.: Towards smart maritime port emissions monitoring: a platform
for enhanced transparency. In: Chandra Kruse, L., Seidel, S., Hausvik, G.I. (eds.)
DESRIST 2021. LNCS, vol. 12807, pp. 71–76. Springer, Cham (2021). https://doi.
org/10.1007/978-3-030-82405-1 9
Dai, L., Hao, H., Wang, Z., Shi, Y., Ding, W.: An environmental and techno-economic
analysis of shore side electricity. Transp. Res. Part D: Transp. Environ. 75, 223–235
(2019). https://doi.org/10.1016/j.trd.2019.09.002
Drucker, P.: The Practice of Management. Harper, New York (1954)
EP: Sustainable maritime fuels ‘Fit for 55’ package: the FuelEU Maritime
proposal (2023). https://www.europarl.europa.eu/RegData/etudes/BRIE/2021/
698808/EPRS BRI(2021)698808 EN.pdf
Gore, K., Rigot-Müller, P., Coughlan, J.: Cost-benefit assessment of shore side elec-
tricity: an Irish perspective. J. Environ. Manage. 326, 116755 (2023). https://doi.
org/10.1016/j.jenvman.2022.116755
Herrero, A., Piris, A.O., Diaz-Ruiz-Navamuel, E., Gutierrez, M.A., Lopez-Diaz, A.-I.:
Influence of the implantation of the onshore power supply (OPS) system in Spanish
medium-sized ports on the reduction in CO2 emissions: the case of the Port of
Santander (Spain). J. Marine Sci. Eng. 10(10), 1446 (2022). https://doi.org/10.3390/
jmse10101446
HHLA. Report vessel operations: Sailing list. Hamburger Hafen und Logistik AG
(2022). https://coast.hhla.de/report?id=Standard-Report-Segelliste. Accessed 10
Feb 2023
HPA. Plugging in to eliminate emissions. Hamburg Port Authority (2022). https://
www.hafen-hamburg.de/en/port-of-hamburg-magazine/hafen-und-klima/stecker-
rein-emissionen-aus/. Accessed 10 Feb 2023
HPM. Cleaner air and climate protection in Port of Hamburg - New shore-based power
units for cruise liners and containerships. Hamburg Port Marketing (2019). https://
www.hafen-hamburg.de/en/press/news/cleaner-air-and-climateprotection-in-port-
of-hamburg-new-shore-based-power-units-for-cruiseliners-and-containerships-
36499/. Accessed 10 Feb 2023
ICCT. Global scrubber washwater discharges under IMO’s 2020 fuel sulfur limit. Inter-
national Council on Clean Transportation (2021). https://theicct.org/sites/default/
files/publications/scrubber-discharges-Apr2021.pdf. Accessed 10 Mar 2023
Lathwal, P., Vaishnav, P., Morgan, M.G.: Environmental and health consequences of
shore power for vessels calling at major ports in India. Environ. Res. Lett. 16(6),
064042 (2021). https://doi.org/10.1088/1748-9326/abfd5b
Martı́nez-López, A., Romero, A., Orosa, J.A.: Assessment of cold ironing and LNG as
mitigation tools of short sea shipping emissions in port: a Spanish case study. Appl.
Sci. 11(5), 2050 (2021). https://doi.org/10.3390/app11052050
Nguyen, D.-H., et al.: Reduction of NOx and SO2 emissions by shore power adoption.
Aerosol Air Qual. Res. 21(7), 210100 (2021). https://doi.org/10.4209/aaqr.210100.
Peng, Y., Dong, M., Li, X., Liu, H., Wang, W.: Cooperative optimization of shore
power allocation and berth allocation: a balance between cost and environmental
benefit. J. Clean. Prod. 279, 123816 (2021). https://doi.org/10.1016/j.jclepro.2020.
123816
Peng, Y., Li, X., Wang, W., Wei, Z., Bing, X., Song, X.: A method for determining
the allocation strategy of on-shore power supply from a green container terminal
perspective. Ocean Coast. Manag. 167, 158–175 (2019). https://doi.org/10.1016/j.
ocecoaman.2018.10.007
Allocation of Shore Side Electricity 153
1 Introduction
Stockyard in dry bulk terminal acts as a centralized storage area for bulk mate-
rials, and also functions as an essential buffer for differences between incoming
and outgoing streams of bulk materials [16]. Under the influence of the increas-
ing demand for bulk cargo and the trend towards larger vessels, storage space
in the stockyard tends to become a scarce resource, leading terminal operators
to focus on stockyard storage space management. Utilizing the storage space
reasonably can help improve the operational efficiency of the cargo transport
chain. Therefore, it is necessary to study the storage space allocation problem
in dry bulk terminals.
Apart from the operation efficiency, the port now needs to take responsibility
for environmental concerns driven by the growing emphasis on a cleaner environ-
ment, neutral climate, and sustainability. A large amount of dust is generated
during the operation in the dry bulk stockyard, such as stacking materials into
the empty fields of stock pads. Mist cannon (also known as dust cannon, or
dust suppression cannon) is a widely used facility to control dust. The fogging
process involves the action of fog nozzles which nebulize water into very small
micro-droplets of water under pressure. The fog drives airborne dust particles
to the ground and wets the surface to prevent fugitive dust particles. When
water is combined with dust in the air, due to the adhesion of the surface of
water molecules, it will be combined with the dust, and the effect of gravity will
drop after condensing, to achieve the purpose of dust suppression. Materials are
stacked into stock pads accompanied by the water spray of the mist cannon. We
consider mist cannon operation in the space allocation process, which is rarely
addressed in the existing literature.
The stockyard is a complex logistics system that consists of components
of different operation machines like stacker-reclaimers and mist cannons. We
study Storage Space Allocation Problem (SSAP) in the stockyard of dry bulk
terminals, which together considers the assignment of stacker-reclaimers and
mist cannons for incoming materials. In this study, we consider import dry bulk
terminals in which cargo flows from the berth area to the stockyard area. We
aim to determine the specific storage locations for incoming materials with the
objective of minimizing energy consumption. The contributions of the paper
include the following: (1) We first model the operations of mist cannons to control
the dust and study the storage space allocation, stacker-reclaimer assignment
and mist cannon assignment in an integrated manner. (2) We incorporate energy
consumption into the optimisation objective, corresponding to the requirements
of sustainable operations.
The rest of the paper is organized as follows. Section 2 reviews relevant papers
in the literature. Section 3 describes the storage space allocation problem fol-
lowed by mathematical formulation in Sect. 4. Section 5 shows the preliminary
results from computational experiments to test the effectiveness of the mathe-
matical model. Finally, conclusions are drawn in Sect. 6.
2 Literature Review
Storage space management in dry bulk terminal stockyards has received far
less attention than that in container terminal yards. Storage space allocation
problem (SSAP) is usually investigated with other operational problems (such as
berth allocation problem (BAP), train scheduling, etc.) in an integrated manner.
Table 1 lists the related work on the SSAP problem in dry bulk terminals.
Ouhaman et al. [13] studied the SSAP in an export dry bulk terminal. The
authors formulated the problem as a mixed-integer linear programming (MILP)
problem and proposed a heuristic method to solve large-scale data sets. They
limited the problem to the material flow from the production plant to storage
156 X. Tang and F. Schulte
Table 1. Related literature on storage space allocation problem in dry bulk terminals
hangars. Tang et al. [17] addressed the integrated berth scheduling and SSAP in
the context of bulk raw material ports of large iron and steel companies. They
developed an integer programming model to minimize total costs and solved the
model using a Benders decomposition-based approach. Sun et al. [16] followed
up on the work of Tang et al. [17]. They first developed a mixed-integer pro-
gramming (MIP) formulation that could avoid generating scattered small fields
and schedule the unloading, stacking and reclaiming operations. A logic-based
Benders approach was proposed to solve the problem optimally. SSAP in large
iron ore terminal stockyards was studied in the work of Tang et al. [18]. They
used continuous variables to describe the specific locations of the empty fields
to highlight the continuous cargo flow characteristic of dry bulk stockyard oper-
ation. A MIP model was developed and a heuristic algorithm based on genetic
Stockyard Space Allocation with Mist Cannons and Energy Expenditure 157
algorithm was proposed to solve it. Ago et al. [1] considered both the SSAP and
conveyor belt routing problems, using a Lagrangian decomposition algorithm.
Menezes et al. [12] studied a production planning and scheduling problem in
bulk cargo terminals. The problem considered planning and scheduling the flow
of products between supply, storage, and demand nodes and a branch and price
algorithm was applied to solve it. De Andrade et al. [3,4] extended this problem
by integrating berth allocation and yard assignment decisions.
Boland et al. [7] studied stockyard management in coal export terminals,
considering the rail system transporting coal from mines to the terminal. The
decisions in this study included berth allocation, stockpile location and the
start times of stockpile assembly and reclaiming decisions. Robenek et al. [14]
solved the integrated berth allocation and yard assignment problem in import
and export bulk ports as a single large-scale optimization problem. They con-
structed a MIP model and applied an exact algorithm based on a branch and
price framework. Al-Hammadi and Diabat [2] added constraints on the stacking
space capacity to the model of Robenek et al. [14]. Babu et al. [5] minimized the
delay of ships in bulk terminals by simultaneously considering ship scheduling,
stockyard planning, and train scheduling.
As an important component in the stockyard, the stacker-reclaimer is also
a factor that could influence the operation in the stockyard, which is consid-
ered in the decision-making of stockyard operation. Hu and Yao [9] discussed
the stacker-reclaimer scheduling problem (SRSP) with the objective of mini-
mizing the maximum completion time. The scheduling problem is formulated
as a MIP model, and a genetic algorithm is developed to solve it. Belassiria et
al. [6] also studied SRSP and developed two different heuristic methods, a branch
and bound method, and a tabu search algorithm. Hanoun et al. [8] addressed
the operational scheduling of the continuous coal handling problem. A model
of stockyard operations within a coal mine was described and the problem was
formulated as a Bi-Objective Optimization Problem (BOOP). In van Vianen et
al.’s work [20], simulation was applied to reschedule the stacker-reclaimers oper-
ation to increase the dry bulk terminal’s performance by reducing the waiting
time of cargo trains being loaded at the terminal. Unsal and Oguz [19] integrated
berth allocation, yard space allocation, and SRSP and modelled the master and
subproblems with MIP and constraint programming, respectively, solving the
problem with a logic-based Benders decomposition algorithm.
While the climate and environmental sustainability objectives are increas-
ingly recognized in port (operations) research [10,11,15], the existing research
related to the considered problem mostly focuses on the operation efficiency of
the stockyard with little consideration of environmental aspects. Dust suppres-
sion in the stockyard is paid little attention to. Our study is an attempt to
incorporate environmental protection considerations and neutral climate targets
into stockyard space management.
158 X. Tang and F. Schulte
3 Problem Description
The whole stockyard consists of some long stock pads, which may be pre-occupied
by existing stockpiles, leaving empty fields available for incoming materials.
Figure 1 shows the layout of a typical dry bulk terminal stockyard. One stacker-
reclaimer (abbreviated as “SR” below) travels on a track between two pads, and
the two pads are served by the same SR. Between two stock pads is a space
interval. Three intervals can be found in Fig. 2, and we label them as Int 1, 2,
and 3. Like SR, mist cannon (abbreviated as “MC” below) is also mobile. The
difference is that MC can move among intervals instead of moving on a fixed
track. They usually move from one interval to another through the end area of
the stock pad. When an SR stacks materials into the stockyard, the MC sprays
water mist to control the dust generated. There are two moving routes for the
SR in Fig. 2. After completion of handling material 1, if the MC continues to
serve material 2, it needs to move from Int 1 to Int 2, which brings vertical
transport distance. Otherwise, if the MC continues to serve material 3 planned
to be allocated in pad 2, it only needs to move horizontally.
specific storage position for each incoming material with the objective of mini-
mizing the energy consumption of operations in the stockyard. The main energy
consumers in the stockyard are conveyor belts, SRs, and MCs. The energy con-
sumption of SR and MC are further divided into two parts: energy consumption
when handling materials and moving from the position of handled material to
the position of material to be next served. To calculate energy consumption, we
multiply the power of operating machines by the time consumed during handling
or moving. Time can be obtained by dividing distance by moving speed.
When calculating the travel distance, we need to mark the position of each
material and the working machine. We, therefore, introduce a virtual material 0
for each SR and MC (each SR and MC will at least serve material 0) and then
set the position of the virtual material as the initial position of the machines.
Hence, the moving distance can be normally calculated.
4 Mathematical Model
4.1 Assumptions
The following assumptions are made in this problem:
(1) There are plenty of empty fields to accommodate the incoming materials
over the whole planning period, which means there is always enough space
for stacking.
(2) One material can be stacked next to another material if a safe distance is
maintained to avoid mixing. We consider a dedicated terminal for a partic-
ular cargo type. For example, in an iron ore terminal, the different types of
iron ore are usually distinguished from each other by the size of the diameter
of the particles and there is no strict restriction that they cannot be stacked
next to each other, only that a safe distance is maintained to avoid mixing.
(3) Once the goods are stacked in the yard, the position will not change during
the planning period, which is consistent with the situations in practice.
(4) The operating capability of machines in the stockyard is constant during
the handling process.
(5) The moving distance is calculated from the end coordinate of the material
that has just been completed handling to the start coordinate of the material
to be next served.
160 X. Tang and F. Schulte
Decision Variables
mi mi mi Pc xsi + p ly p kip
min Psrh + Pwch + Pc +
i
capsrh i
capsrh i
capc i
specm
θ0js |xsj − xss0 | θijs xsj − xsi − li
+ Psrm + Psrm
s j
spesrm s i j
spesrm
μ0jw dis n nαjn − n nαnw st
+ Pwcm
w j
spewcm
μijw dis n nαjn − n nαin
+ Pwcm (1)
w i j
spewcm
ρ0jw |xsj − xsw0 | π0jw (L − xsw0 + L − xsj )
+ Pwcm + Pwcm
w j
spewcm w j
spewcm
ρijw xsj − xsi − li
+ Pwcm
w i j
spewcm
πijw (L − xsi − li + L − xsj )
+ Pwcm
w i j
spewcm
162 X. Tang and F. Schulte
s.t.
kip = 1, ∀ i ∈ C \ {0} (2)
p∈P
x0 s = 1, ∀s∈S (3)
xis = 1, ∀ i ∈ C \ {0} (4)
s∈S
kip ≥ 1 + M (1 − xis ) , ∀ i ∈ C \ {0} , s ∈ S (5)
{p∈P |asp =1}
kip ≤ 1 − M (1 − xis ) , ∀ i ∈ C \ {0} , s ∈ S (6)
{p∈P |asp =1}
θijs ≤ 1 + M (1 − xjs ) , ∀ j ∈ C \ {0} , s ∈ S (7)
i
θijs ≥ 1 − M (1 − xjs ) , ∀ j ∈ C \ {0} , s ∈ S (8)
i
θijs ≤ 1 + M (1 − xis ) , ∀ i ∈ C \ {0} , s ∈ S (9)
j∈C\{0}
xis > 1 − M (1 − δ) , ∀s∈S (10)
i
θ0js ≥ 1 − M (1 − δ) , ∀s∈S (11)
j∈C\{0}
θ0js ≤ 1 + M (1 − δ) , ∀s∈S (12)
j∈C\{0}
as the above two constraints, but they describe the situation of an MC moving
from the initial position to its first serving material. Constraints (41) and (42)
are the premise of variable ρijw and πijw . Constraint (43) and (44) state that
only one of the two variables ρijw and πijw can be 1 if there is pair i→j in
the material handling sequence of MC w. Constraint (45) and (46) show the
value range of start and end coordinates. Lastly, constraint (47) determines the
domains of 0–1 variables.
5 Computational Experiments
We generated 12 instances and divided them into 3 sets according to the number
of stock pads, SRs, and MCs. We consider |C| materials, |P | stock pads, |S| SRs
and |W | MCs in computational experiments. The configuration of each instance
can be found in Table 1. Set 1 has 4 stock pads with 2 SRs and 1 MC. Set 2 has
the same configuration as set 1, except for 2 MCs. Set 3 has 6 stock pads with
3 SRs and 3 MCs. Each set consists of computational instances with different
material numbers. The material information (mass and length) in instances with
the same material numbers is the same. (Instances 1, 5, and 9 have the same
information, which is the same situation for instances 2, 6, and 10; instances 3,
7, and 11 and instances 4, 8, and 12.) The material information is generated
randomly.
Table 2 shows the results obtained within the time limit. Since the first three
terms in the objective (1) are fixed terms, we use “Objective value of variable
parts” in the table to explicitly show the objective value dependent on decision
variables, that is, the value of objective (1) excluding the first three terms. The
“objective value” in the following text refers to the value of objective excluding
the fixed terms. Optimal solutions can be found for 9 of the 12 instances. The
increase in the number of stock pads, SRs, MCs and materials expands the scale
of the model, thus making the computation time increase dramatically. It can
be noticed that the results of instances 1, 5, and 9 are the same (Instances 2, 6,
and 10 are in the same situation.), which means the change of some parameters
like the number of stock pads, SRs and MCs have little influence on optimal
solutions to some extent.
We compare the composition of objective value (of variable parts) of instance
5–8 in Fig. 3. The energy consumption of the conveyor belt moving to the spec-
ified stacking position for materials takes a larger proportion compared to the
other two components (energy consumption of SR and MC) during moving. It
is observed that there is a clear absolute variation of the energy consumption of
the belt conveyor moving among different instances, while the change of energy
166 X. Tang and F. Schulte
5 5 4 2 2 15.25 9.91
6 5 4 2 2 13.25 41.12
2
7 6 4 2 2 19.76 104.54
8 7 4 2 2 24.17 3600
9 5 6 3 3 15.25 226.45
10 5 6 3 3 13.25 216.92
3
11 6 6 3 3 19.76 3600
12 7 6 3 3 23.36 3600
With the calculation results in Sect. 5.1, we can extract some managerial insights
which may be helpful for storage space management in the stockyard.
(1) The number of stock pads open can be determined according to the number
of incoming materials. Stockyards do not always need to open all the stock
pads. We can find that the objective values of instances 1,5,9 are the same,
which means the results obtained in the background of 4 stock pads and 6
stock pads are the same. Opening stock yards depending on the number of
materials may help save operational costs of the stockyard.
(2) Materials are stacked into the adjacent stock pads which one stacker-
reclaimer and mist cannon can cover, which could shorten the travel dis-
tance of SR and MC, thus saving the energy consumed. As shown in Fig. 4
and 5, all the materials are stacked in the adjacent pad 1 and 2.
(3) Materials tend to be stacked in the stock pads close to the stockyard entrance
so that the conveyor belts do not need to move a long distance, which
could reduce the energy consumption of the conveyor belt. Figure 4 and 5
show that materials are stacked in the pad 1 and 2 which are closest to the
entrance of the stockyard.
Stockyard Space Allocation with Mist Cannons and Energy Expenditure 167
6 Conclusion
We study the Storage Space Allocation Problem with the consideration of oper-
ation machines in the stockyard. Existing studies investigate the storage space
allocation problem with other operation problems such as berth allocation and
stacker reclaimer scheduling. However, they pay little attention to environmen-
tal sustainability. We take the operation of mist cannon to control dust during
stacking into consideration and aim to minimize the energy consumption during
stacking operation. A mixed integer programming model is established, and we
show the effectiveness of the proposed model by conducting computational exper-
iments on some small-scale instances. Some managerial insights are extracted,
which may help in storage space management as a reference.
Our future work will focus on the improvement of the formulation of the
model and the development of an efficient algorithm to solve the model for large-
scale instances. And we need to collect operational data from real-world practice
as the input of computational experiments to validate the results considering
practical implications.
References
1. Ago, M., Nishi, T., Konishi, M.: Simultaneous optimization of storage allocation
and routing problems for belt-conveyor transportation. J. Adv. Mech. Design Syst.
Manuf. 1(2), 250–261 (2007)
2. Al-Hammadi, J., Diabat, A.: An integrated berth allocation and yard assignment
problem for bulk ports: formulation and case study. RAIRO - Oper. Res. 51(1),
267–284 (2017). https://doi.org/10.1051/ro/2015048. Identifier: ro151165
3. de Andrade, J.L.M., Menezes, G.C.: An integrated planning, scheduling, yard allo-
cation and berth allocation problem in bulk ports: model and heuristics. In: Mes,
M., Lalla-Ruiz, E., Voß, S. (eds.) ICCL 2021. LNCS, vol. 13004, pp. 3–20. Springer,
Cham (2021). https://doi.org/10.1007/978-3-030-87672-2 1
4. de Andrade, J.L.M., Menezes, G.C.: A column generation-based heuristic to solve
the integrated planning, scheduling, yard allocation and berth allocation problem
in bulk ports. J. Heuristics 29(1), 39–76 (2022). https://doi.org/10.1007/s10732-
022-09506-3
5. Babu, S.A.K.I., et al.: Minimizing delay of ships in bulk terminals by simultaneous
ship scheduling, stockyard planning and train scheduling. Maritime Econ. Logist.
17(4), 464–492 (2015). https://doi.org/10.1057/mel.2014.20
6. Belassiria, I., Mazouzi, M., Elfezazi, S., Torbi, I.: A heuristic methods for stacker
and reclaimer scheduling problem in coal storage area. In: 2019 International Col-
loquium on Logistics And Supply Chain Management (LOGISTIQUA) (2019)
7. Boland, N., Gulczynski, D., Savelsbergh, M.: A stockyard planning problem. EURO
J. Transp. Logist. 1(3), 197–236 (2012). https://doi.org/10.1007/s13676-012-0011-
z
8. Hanoun, S., Khan, B., Johnstone, M., Nahavandi, S., Creighton, D.: An effec-
tive heuristic for stockyard planning and machinery scheduling at a coal handling
facility. In: 2013 11th IEEE International Conference On Industrial Informatics
(INDIN), pp. 206–211 (2013)
Stockyard Space Allocation with Mist Cannons and Energy Expenditure 169
9. Hu, D., Yao, Z.: Stacker-reclaimer scheduling in a dry bulk terminal. Int. J. Com-
put. Integr. Manuf. 25(11), 1047–1058 (2012)
10. Iris, Ç., Lam, J.S.L.: A review of energy efficiency in ports: operational strategies,
technologies and energy management systems. Renew. Sustain. Energy Rev. 112,
170–182 (2019)
11. Lam, J.S.L., Notteboom, T.: The greening of ports: a comparison of port manage-
ment tools used by leading ports in Asia and Europe. Transp. Rev. 34(2), 169–189
(2014)
12. Menezes, G.C., Mateus, G.R., Ravetti, M.G.: A branch and price algorithm to
solve the integrated production planning and scheduling in bulk ports. Eur. J.
Oper. Res. 258(3), 926–937 (2017). https://doi.org/10.1016/j.ejor.2016.08.073
13. Ouhaman, A.A., Benjelloun, K., Kenne, J.P., Najid, N.: The storage space allo-
cation problem in a dry bulk terminal: a heuristic solution. In: 21st IFAC World
Congress on Automatic Control - Meeting Societal Challenges, vol. 53, pp. 10822–
10827 (2020). https://doi.org/10.1016/j.ifacol.2020.12.2868
14. Robenek, T., Umang, N., Bierlaire, M., Ropke, S.: A branch-and-price algorithm to
solve the integrated berth allocation and yard assignment problem in bulk ports.
Eur. J. Oper. Res. 235(2), 399–411 (2014). https://doi.org/10.1016/j.ejor.2013.08.
015
15. Schulte, F., Voß, S., Ascencio, L.M., González-Ramı́rez, R.G.: Directions for sus-
tainable ports in Latin America and the Caribbean. Int. J. Transp. Econ. 43(3),
315–337 (2016)
16. Sun, D., Meng, Y., Tang, L., Liu, J., Huang, B., Yang, J.: Storage space allocation
problem at inland bulk material stockyard. Transp. Res. Part E: Logist. Transp.
Rev. 134 (2020). https://doi.org/10.1016/j.tre.2020.101856
17. Tang, L., Sun, D., Liu, J.: Integrated storage space allocation and ship scheduling
problem in bulk cargo terminals. IIE Trans. 48(5), 428–439 (2016). https://doi.
org/10.1080/0740817x.2015.1063791
18. Tang, X., Jin, J.G., Shi, X.: Stockyard storage space allocation in large iron ore
terminals. Comput. Industr. Eng. 164, 107911 (2022). https://doi.org/10.1016/j.
cie.2021.107911
19. Unsal, O., Oguz, C.: An exact algorithm for integrated planning of operations in
dry bulk terminals. Transp. Res. Part E: Logist. Transp. Rev. 126, 103–121 (2019).
https://doi.org/10.1016/j.tre.2019.03.018
20. van Vianen, T., Ottjes, J., Lodewijks, G.: Simulation-based rescheduling of the
stacker-reclaimer operation. J. Comput. Sci. 10, 149–154 (2015). https://doi.org/
10.1016/j.jocs.2014.06.004
Planning LNG Annual Delivery Programs
with Speed Optimization and Multiple
Loading Ports
1 Introduction
The earth contains enormous amounts of natural gas, which are often located far
from where the gas is needed. For long distances the gas must be transported by
sea. To make transportation volume-efficient and safe, the natural gas is cooled
down and condensed to its liquid form, i.e., into Liquefied Natural Gas (LNG),
at liquefaction plants, before being loaded onto specially designed LNG vessels
for transportation to its destination where the LNG is boiled to natural gas at
a regasification port.
The demand for LNG has steadily been increasing and today there are liqui-
dation and regasification ports organized for shipping of LNG in large parts of
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 170–184, 2023.
https://doi.org/10.1007/978-3-031-43612-3_10
Planning LNG Annual Delivery Programs 171
the world. Countries in some areas, especially the Middle East, USA, and Aus-
tralia, are net exporters of LNG, while countries in areas including China, Japan,
and Europe are net importers. Large geographical distances between suppliers
and customers create a high demand for shipping LNG by sea.
The average charter rate (rental price) for a 160 000 cubic meter LNG vessel
in 2021 was around 89 000 USD/day, up from around 59 300 USD/day in 2020 [4].
However, the spot charter rate reached as high as 400 000 USD/day during 2022,
mainly due to political tensions related to Russia being at war and cutting off
gas supplies to Europe, as well as the sabotage of two pipelines in the Baltic Sea
[3]. Although this is an unusual situation, it highlights the current importance
of optimizing LNG transportation to maximize the utilization of the vessel fleet.
In this paper, we study a special case of the maritime inventory routing
problem for an LNG producer, which is the planning of so-called Annual Delivery
Programs (ADPs). We denote this planning problem as the LNG-ADP, in which
the producer must fulfill a series of long-term contracts with customers all over
the world while also having the option to sell LNG in the spot market. The
producer is in charge of the inventory of LNG at its liquefaction plant(s), the
loading port with a limited number of berths, and the routing and scheduling
of a heterogeneous fleet of LNG vessels. The LNG-ADP describes the customer
deliveries of LNG cargoes for the next 12 months. In the LNG-ADP, the LNG
producer aims at maximizing the profits, which consists of the revenue from the
sales of LNG less the sailing and vessel chartering costs.
The LNG-ADP is, due to its importance, well studied. An important pioneer-
ing study was published by Rakke et al. [10], who proposed a Rolling Horizon
Heuristic (RHH) for this planning problem. Later, Stålhane et al. [11] used a
construction and improvement heuristic that makes a set of solutions with a
greedy insertion procedure, improves them with a first-descent neighborhood
search and/or branch-and-bound on a mathematical formulation. Al Haidous et
al. [1] base their model on the same assumptions as [10], but consider the spe-
cific case of using a homogeneous fleet of LNG-delivery vessels, with the goal of
minimizing fleet size. Andersson et al. [2] also model the LNG-ADP problem in
the same way as [10], but add four families of valid inequalities to improve the
lower bounds on the problem’s optimal value.
The study by Mutlu et al. [8] stands out by including two extensions: 1) the
production of LNG is a decision variable and not an input parameter, and 2)
allowing split deliveries, where a vessel can unload partial cargoes in more than
one unloading port. Li et al. [6] were the first to include transshipment in the
LNG-ADP problem, motivated by the Yamal LNG case where a transshipment
port helps the producer to avoid longer-than-necessary voyages with ice-breaking
LNG vessels, as normal LNG vessels can take over the LNG and deliver it to the
customers. Li et al. [6] introduce a continuous-time formulation for this problem,
while Li et al. [5] later proposes a time-discrete model for the same problem. Both
models are embedded into and solved with an RHH.
In this paper, we expand previous models of the LNG-ADP by including opti-
mization of the vessels’ sailing speeds. In contrast to currently existing models
172 H. V. Haug et al.
from the literature, our models also allows for multiple loading ports, which is
becoming relevant for some producers as the industry develops. Hence, we refer
to our version of the LNG-ADP with Speed Optimization and Multiple Load-
ing Ports (LNG-ADP-SO-MLP). Speed optimization in ship routing has been
widely studied, e.g., [9,12], although not in the context of LNG-ADP planning.
We propose a novel Mixed Integer Programming (MIP) model for the LNG-ADP-
SO-MLP. We generate a number of realistic instances based on data from two
LNG producers and show that our MIP model can be efficiently solved with low
optimality gaps for instances with a planning horizon of up to 180 days using
a commercial MIP-solver. Furthermore, the results show that including speed
optimization in the planning can improve the profits and make the planning
more flexible.
The remaining of this paper is organized as follows. Section 2 gives a formal
definition of the LNG-ADP-SO-MLP, while our MIP model for this problem is
introduced in Sect. 3. The computational study is presented in Sect. 4, before we
conclude in Sect. 5.
2 Problem Definition
The LNG-ADP-SO-MLP considers an LNG producer’s planning of the supply
of LNG to customers over a planning horizon with a given vessel fleet. The
producer controls one or more loading (liquefaction) ports where the LNG is
produced (liquefied). Each loading port has a given daily production, which
may vary throughout the year. Furthermore, each loading port has a given set
of berths from which the vessels load LNG. The number of available berths may
also vary over time, e.g., due to maintenance. In addition, each loading port has
storage tanks with a given maximum and minimum storage capacity.
There is a given set of customers, which either require LNG Delivered Ex-Ship
(DES), where the LNG producer transports the cargo to the customer with its
own vessels, or Free-On-Board (FOB), where the customers pick up the cargo
at an agreed loading port with their own vessels. Each DES contract has an
associated unloading port, where the vessels unload the LNG cargoes. Since the
customer decides when it wants the LNG delivered, we assume there is always
available inventory capacity and berth capacity at the unloading ports. Both
DES and FOB customers define their demand in advance for specific subsets
of the planning horizon, hereby referred to as partitions. A customer typically
has annual, quarterly, and/or monthly partitions with an associated delivery
requirement. The customers’ required contract volumes are satisfied by one or
more deliveries in each partition. For a DES customer, the minimum demand
within each partition must be satisfied, and a revenue is associated with deliveries
above this minimum up to a specified maximum demand within the partition.
For FOB contracts, each delivery must be the size of the vessel picking it up. In
addition to these contract deliveries, it is also possible to have optional one-time
deliveries of spot cargoes with associated customer destinations and revenues.
These have to be delivered either using the LNG producer’s vessels (like DES)
or by using the customer’s own vessel (like FOB).
Planning LNG Annual Delivery Programs 173
Each vessel in the LNG producer’s fleet has a given fuel consumption depend-
ing (non-linearly) on the sailing speed, as well as a maximum and minimum
sailing speed. The distances between ports are known in advance. If the LNG
producer is short on vessels, additional vessels can be chartered at a given daily
rate. Each vessel has a specific loading capacity and is always loaded up to
capacity due to physical limitations. It is also required to unload the entire load
at the unloading port, except for a minimum tank volume share that is needed
to keep the tanks cool due to the boil-off effect, which means that a fraction
of the LNG that is transported will evaporate during a voyage, typically at a
rate of 0.10–0.12% per day [7]. The actual delivery volume of each delivery is
therefore defined by the capacity of the vessel and the boil-off, which depends
on the sailing distance. Each vessel has a given start position at the beginning
of the planning horizon. A vessel becomes available for a new voyage when it
returns to a loading port, and may therefore become available some days into
the beginning of the planning horizon.
Some vessels might be scheduled for maintenance during the planning hori-
zon. This must take place at a pre-defined maintenance port and start within a
given time window. After maintenance, the vessel has to go through a purge and
a cool-down procedure, which must be done at a loading port and is assumed to
take a given amount of time. All vessels are assumed to have the same specified
time associated with loading and unloading (i.e., one day).
The main decisions of the LNG-ADP-SO-MLP are where to send each vessel
at what times and at what speeds, i.e., the vessel routes and schedules, while
making sure that all customer delivery requirements are satisfied and that the
inventory levels at the loading ports are kept within its limits at all times. Not
all vessels can visit all ports, and some vessels cannot serve all customers due to
their properties not being compatible with the customer’s unloading port. Also,
some vessels are restricted to serving specific customer contracts. Furthermore,
no more vessels than there are available berths can load on a given day at a
given production port. Additional decisions include how much spot cargo to sell
within certain demand and revenue specifications.
The planning objective of the LNG-ADP-SO-MLP is to maximize the gross
margin less the transportation costs (hereby referred to as the “profit”). The
revenue comes from spot cargoes transported DES and FOB, from long-term
contracted LNG deliveries (DES and FOB), and lastly from the value of the
remaining LNG at the end of the planning period. The costs are the (speed-
dependent) sailing costs for the producer’s fleet of vessels, as well as the char-
tering costs. The resulting plan is an Annual Delivery Program, describing the
routes and schedules for the vessels in the fleet over the given planning horizon.
3 Mathematical Model
Fig. 1. Illustration of an arc ((i, t), (j, t )) and the vessel activities it represents
in (i, t) = (0,|T | + 1), where |T | is the length of the planning horizon. These arcs
also have 0 associated costs.
Since there is always an option to either shut down production or accept
potential additional spot contracts for the LNG producer throughout the year, an
assumption about defining an artificial spot FOB contract is made. This artificial
spot FOB contract has zero revenue and allows surplus LNG to potentially be
sold throughout the planning period. Each cargo loaded by the artificial spot
FOB contract in the final ADP represents a possibility for the LNG producer to
find a corresponding real-life spot contract for the given cargo’s pick-up interval.
Following this, we generate the network that is used as input to the MIP
model. Note that each arc has a calculated cost and also represents a given
associated sailing speed, and hence fuel cost. As an example, consider the two
arcs ((i, t), (j, t )) and ((i, t), (j, t + 1)), the latter arc allows for spending an
additional day than the former on the sailing between the ports i and j. This
allows for sailing at a lower speed and sailing cost.
3.2 Notation
Sets
V Set of producer-operated vessels that are available during the planning horizon
VM Set of vessels that require maintenance during the planning horizon, V M ⊂ V
Vi Set of vessels that can serve node i, Vi ∈ V
N Set of ports
NL Set of loading ports, N L ⊂ N
NU Set of unloading ports, N U ⊂ N
NS Set of spot unloading ports, N S ⊂ N
NM Set of maintenance ports, N M ⊂ N
176 H. V. Haug et al.
Parameters
S
Cvitjt Sailing cost of each feasible arc ((i, t), (j, t )) for vessel v
C
Citj Costs of using a charter vessel to deliver a cargo at port j, loading the cargo at
loading port i at time t
NvST ART Start port of vessel v
TvST ART First time period where vessel v is available to be scheduled
O
Tvij Operational time associated with sailing from port location i to port location j for
vessel v
C
Tij Sailing time for a charter vessel sailing from loading port i to unloading port j
TfOF
i
OB
Operational time associated to port location j for FOB cargo f
Lv Capacity of vessel v
C
L Upper limit for capacity of a charter vessel
LC Lower limit for capacity of a charter vessel
LFf
OB
Loading quantity of FOB cargo f
D jp Maximum demand of unloading port j in partition p
D jp Minimum demand of unloading port j in partition p
RfSF OB Revenue per volume unit of LNG loaded for FOB spot contract f
RfU F OB Revenue per volume unit of LNG loaded for long-term FOB contract f
DES
Rjt Revenue per volume unit of LNG for delivering DES contract to customer j at time t
RiEN D Unit value of LNG left in storage tanks at loading port i at the end of the planning
horizon
Bjt Berth capacity at port j at time t
QP
it Produced quantity of LNG in loading port i in time period t
Variables
xvitjt 1 if vessel v sail arc ((i, t), (j, t )), and 0 otherwise
zf t 1 if a cargo from FOB contract f is done loading in time period t , and 0 otherwise
witj 1 if a charter vessel starts sailing from loading port i at time t to deliver a cargo at j,
and 0 otherwise
gitj Amount loaded by a charter vessel in loading port i at time t to deliver in
unloading port j
sit Remaining storage at loading port i at the end of time period t
DES
maxz = Rjt Lv (1 − (t − t)E)xvitjt
v∈V ((i,t),(j,t ))∈AU
v
DES C
DES
+ Rj,t+T C (1 − Tij E)gitj + Rjt Lv (1 − (t − t)E)xvitjt
ij
i∈N L t∈T L j∈N U v∈V ((i,t),(j,t ))∈AS
v
DES C
U F OB F OB
+ Rj,t+T C (1 − Tij E)gitj + Rf Lf zf t
ij
i∈N L t∈T L j∈N S i∈N L f ∈F U t ∈T
i
SF OB F OB
EN D
+ Rf Lf zf t + Ri si,|T |
i∈N L f ∈F S t ∈T i∈N L
i
S
C
− Cvitjt xvitjt − Citj witj
v∈V ((i,t),(j,t ))∈Av i∈N L t∈T j∈N U
(1)
sit = si,t−1 + QP
it − Lv xvitjt − gitj
j∈N U t ∈T v∈Vi j∈N U
(3)
− LF
f
OB
zf t , i ∈ N L , t ∈ T L \{1}
f ∈Fi t ∈TfF OB
Si ≤ sit ≤ Si , i ∈ N L, t ∈ T L (4)
Maintenance and Vessel Flow Constraints. Constraints (5) state that the
vessels in need of maintenance perform this exactly one time. Constraints (6)
ensure that each vessel either starts at its start position at its first available day
or is not used at all. Constraints (7) are vessel flow conservation constraints.
xvitjt = 1, v ∈ VM
(5)
((i,t),(j,t ))∈AM
v
|T |
t
−1
xvitjt = xvjt it , v ∈ V, j ∈ N , t ∈ T (7)
i∈N t=0 i∈N t=t +1
Djp ≤ Lv (1 − (t − t)E)xvitjt
v∈Vi i∈N L t∈Tv t ∈Tjp
+ gitj (1 − TijC E) ≤ Djp , j ∈ N U ∪ N S, p ∈ Pj
i∈N L t∈(Tjp −Tij
C)
(8)
Constraints (9) and (10) are the FOB contract constraints. Constraints (9)
make sure that each long-term FOB cargo contract is fulfilled once within its time
window, while Constraints (10) make sure each that spot FOB cargo contract is
fulfilled at most once in the relevant time window, except for the artificial spot
FOB contracts (contract 1), which can be fulfilled as many times as wanted.
zf t = 1, j ∈ N L , f ∈ FjU
F OB
(9)
t ∈Tf
zf t ≤ 1, j ∈ N L , f ∈ FjS \{1}
(10)
t ∈TfF OB
Planning LNG Annual Delivery Programs 179
Berth and Charter Constraints. Constraints (11) are the berth capacity
constraints. Constraints (12) ensure that if a charter vessel loads a cargo, the
amount loaded is bounded within an upper and lower limit. This allows for some
flexibility in the capacities of the charter vessels.
t +T O
vij
xvitjτ + wjt j
v∈V P i∈N t∈T τ =t +1 j ∈N U
(11)
t +TfOF OB
j
+ zf τ ≤ Bjt , j ∈ N L, t ∈ T U
f ∈Fj τ =t +1
C
LC witj ≤ gitj ≤ L witj , i ∈ N L, t ∈ T L, j ∈ N U ∪ N S (12)
Variable Domains. Constraints (13) to (17) specify the domains of our three
binary variables xvitjt , zf t and wvitjt as well as the continuous variables gitj and
sit . The binary variable zf t is only defined for loading ports and the relevant
time periods. The binary decision variable for using a charter vessel, variable
witj , is only defined for arcs that go from a loading node to an unloading node.
gitj ≥ 0, i ∈ N L, t ∈ T L, j ∈ N U ∪ N S (16)
4 Computational Study
The MIP model (including the network generation) was implemented in Python
and solved by Gurobi (version 10.0.0) on a Dell Optiplex 7780 computer with an
Intel Core i7-10700 @ 2.90 GHz. In the following, we describe the test instances
used in Sect. 4.1, before we present the computational results in Sect. 4.2 and
provide some managerial insights in Sect. 4.3.
180 H. V. Haug et al.
A number of realistic test instances are generated based on data for two differ-
ent LNG producers, one with only one liquefaction (loading) port, and one with
two. Table 1 presents the different configurations of test instances, varying in the
number of loading- and unloading ports |N L | and |N U |, long-term FOB con-
tracts |F U |, vessels |V|, and time periods |T |. Three different test instances were
generated for each configuration, slightly differing in the number of unloading
ports (customer ports) and long-term FOB contracts.
The black horizontal line in Table 1 splits the configurations into two sets,
where the upper set of configurations (with a prefix ‘n’) is based on data for the
LNG producer with one loading port, while the set at the bottom (with a prefix
’a’) is based on data from the producer with two loading ports. The leftmost
column represents the configuration ID, where each of the numbers indicates the
corresponding size of the set on the corresponding position in the instance ID.
For example, configuration n-2-6-12-11-60 has two loading ports, six unloading
ports, 12 FOB long-term contracts, 11 vessels, and 60 time periods of one day.
Table 1. Overview of the different test instances’ problem sizes, differing in the number
of loading ports |N L |, average number of unloading ports |N U |, average number of
long-term FOB contracts |F U |, number of vessels |V| and number of time periods |T |.
Configuration ID |N L | |N U | |F U | |V| |T |
n-1-2-2-18-30 1 2 2 18 30
n-1-6-16-18-45 1 6 16 18 45
n-1-6-16-18-60 1 6 16 18 60
n-1-6-10-18-75 1 6 20 18 75
n-1-7-21-18-90 1 7 21 18 90
n-1-7-27-23-120 1 7 23 23 120
n-1-7-40-23-180 1 7 40 23 180
n-1-17-75-23-365 1 17 75 23 365
a-1-6-12-11-60 1 6 12 11 60
a-2-6-12-11-60 2 6 12 11 60
a-2-6-38-15-180 2 6 38 15 180
Table 2. Summary of the configurations’ optimality gaps and objective function values
averaged over three instances for each configuration.
Even though one should be careful with comparisons like this, the low gaps
from running our model indicates that our modeling approach is efficient and
can be used to find good solutions for all practical purposes. It should be noted
though that we were not able to find feasible solutions for the largest test instance
with 365 days planning horizon. However, the results show that our model could
be embedded in a rolling horizon heuristic, e.g., like in [5,10], for obtaining
practical solutions also for instances with longer planning horizons.
182 H. V. Haug et al.
We know that each of the two cases in Table 3 has the same lower demand
limits and LNG prices regardless of speed configuration. Therefore, the increase
in DES revenue comes from delivering amounts of LNG that are closer to the
upper demand limits of the customers. Overall, the DES revenue (the first two
terms) increase by 3.1 mill. USD compared to the configuration without speed
optimization, which is an increase of 0.34%. This is possible including speed
optimization allows for sailing at higher speeds so that the producer can deliver
somewhat larger volumes of LNG to the customers. The higher sailing speeds
also results in higher fuel costs for the own vessels (increase of 12.9%). Charter
costs decrease by 35.7% in the case of speed optimization since the producer’s
own fleet is able to transport more due to the higher sailing speeds. This indicates
that if the charter rates were to increase, speed optimization could turn out even
more valuable than it does in the current test instances.
It should be noted that our test instances do not include spot contracts, nor
does our model place a value on artificial FOB. The effect of speed optimization
is expected to become even stronger if these elements are included since there
would be more potential to exploit additional revenue sources. The additional
cost that any extra revenue would imply is often relatively low, so exploiting
more revenue opportunities is likely to be beneficial.
Figure 2 shows the distribution of the speeds of the vessels in the optimal
solution of instance n-1-7-27-23-120-a. The left violin plot is the case with
Planning LNG Annual Delivery Programs 183
Fig. 2. Distribution of vessel speeds with- and without speed optimization for instance
n-1-7-27-23-120-a. The central horizontal line denotes the average speed of the ves-
sels.
speed optimization, while the right plot is the case with default speeds. The plots
show that the speed optimization model uses the whole spectrum of allowed
vessel speeds, with a slight peak at around 14 and 15 knots. The average is
slightly lower than the default service speeds, which are concentrated around
16 and 17 knots. This would imply a lower fuel cost per distance traveled on
average.
5 Concluding Remarks
We have considered the planning Annual Delivery Programs (ADPs) in the Liq-
uefied Natural Gas (LNG) industry, in which an LNG producer must fulfill a
series of long-term contracts with customers all over the world with a given
fleet of LNG vessels. We have expanded previous definitions and models for this
problem by including speed optimization and the possibility of having multiple
loading ports, which is becoming relevant for some producers as the industry
develops. We denote this planning problem as the LNG-ADP with Speed Opti-
mization and Multiple Load Ports (LNG-ADP-SO-MLP).
Our contribution to the literature is a rich formulation of the LNG-ADP
planning problem, modeled as a mixed-integer programming (MIP) model. The
model is designed to handle each leg of a voyage separately (because of the
multiple loading ports), rather than a round trip (as in the previous models
in the literature). This also allows for efficient modeling of vessel maintenance,
184 H. V. Haug et al.
as the vessel can sail directly to a maintenance port after unloading without
returning to a loading port.
We used a commercial MIP-solver to test our model on a number of realistic
instances for two different LNG producers (one with two loading ports). The
computational study showed that the commercial solver was able to find feasible
solutions with less than 1% optimality gaps within one hour for instances with a
planning horizon of up to 180 days. It was also shown that including speed opti-
mization brings some extra flexibility and increases the profit. The commercial
solver did not find a feasible solution for the largest instance with a full planning
horizon of a year. However, the promising results for the instances with shorter
planning horizons show that our model can be embedded into a Rolling Hori-
zon Heuristic (RHH) in order provide practical solutions also for instances with
longer planning horizons.
References
1. Al-Haidous, S., Msakni, M.K., Haouari, M.: Optimal planning of liquefied natural
gas deliveries. Transp. Res. Part C: Emerg. Technol. 69, 79–90 (2016)
2. Andersson, H., Christiansen, M., Desaulniers, G., Rakke, J.G.: Creating annual
delivery programs of liquefied natural gas. Optim. Eng. 18(1), 299–316 (2017)
3. FreightWaves: LNG shipping rates ‘shooting for the stars’ at $500,000 per day.
https://www.freightwaves.com/news/lng-shipping-rates-shooting-for-the-stars-
at-500000-per-day
4. GIIGNL: GIIGNL Annual Report 2022. The International Group of Liquefed Natu-
ral Gas Importers, Neuilly-sur-Seine, France. https://giignl.org/resources2/ (2022)
5. Li, M., Fagerholt, K., Schütz, P.: Maritime inventory routing with transshipment:
the case of Yamal LNG. Flex. Serv. Manuf. J. (2022). https://doi.org/10.1007/
s10696-022-09476-5
6. Li, M., Schütz, P.: Planning annual LNG deliveries with transshipment. Energies
13(6), 1490 (2020)
7. Msakni, M.K., Haouari, M.: Short-term planning of liquefied natural gas deliveries.
Transp. Res. Part C: Emerg. Technol. 90, 393–410 (2018)
8. Mutlu, F., Msakni, M.K., Yildiz, H., Sönmez, E., Pokharel, S.: A comprehensive
annual delivery program for upstream liquefied natural gas supply chain. Eur. J.
Oper. Res. 250(1), 120–130 (2016)
9. Norstad, I., Fagerholt, K., Laporte, G.: Tramp ship routing and scheduling with
speed optimization. Transp. Res. Part C: Emerg. Technol. 19(5), 853–865 (2011)
10. Rakke, J., et al.: A rolling horizon heuristic for creating a liquefied natural gas
annual delivery program. Transp. Res. Part C: Emerg. Technol. 19(5), 896–911
(2011)
11. Stålhane, M., Rakke, J.G., Moe, C.R., Andersson, H., Christiansen, M., Fagerholt,
K.: A construction and improvement heuristic for a liquefied natural gas inventory
routing problem. Comput. Ind. Eng. 62(1), 245–255 (2012)
12. Wen, M., Ropke, S., Petersen, H.L., Larsen, R., Madsen, O.B.G.: Full-shipload
tramp ship routing and scheduling with variable speeds. Comput. Oper. Res. 70,
1–8 (2016)
Tramp Ship Routing with Bunker
Optimization and Flexible Cargo
Quantities: Case from Dry Bulk Shipping
1 Introduction
Maritime transportation is the backbone of international trade, with more than
80% of the total volume traded internationally [12]. One important segment is
dry bulk shipping, which concerns transporting raw materials shipped in bulk,
such as iron ore, coal, and grain. In this paper, we study a real-life tramp ship
routing and scheduling problem in which a dry bulk shipping operator is to
deploy its vessel fleet to maximize the profit for the entire fleet. To do so, the
operator has to select from sets of mandatory contracted cargoes and optional
spot cargoes, potentially paying a cost for not servicing a contracted cargo. In dry
bulk shipping, an operator can often choose the amount of cargo to transport
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 185–201, 2023.
https://doi.org/10.1007/978-3-031-43612-3_11
186 S. Omholt-Jensen et al.
within a predetermined interval. Thus, the operator also needs to decide how
much of each cargo to transport, which can be challenging given that the vessels
may carry multiple cargoes simultaneously. Furthermore, as the ships eventually
run out of bunker (fuel), the operator must choose where to bunker and the
quantity of bunker to purchase. We denote this planning problem as the Tramp
Ship Routing and Scheduling Problem with Bunker Optimization (TSRSPBO).
There have been significant contributions to the academic field of tramp
ship routing and scheduling over recent years, summarized in the review papers
[6,9,11]. The aspect of flexible cargo quantities in ship routing is considered by
[2,7,8], while bunker optimization was included by [10,13]. However, none of
the previous studies integrates these two important aspects. Furthermore, the
latter two studies do only consider full shiploads, meaning that a vessel can have
at most cargo on board at the same time, in contrast to the problem we con-
sider in this paper. Hence, to our knowledge, the TSRSPBO studied here is the
first to integrate both flexible cargo quantities and bunker optimization. Fur-
thermore, by allowing vessels to detour to refuel bunker and the simultaneous
service of multiple cargoes, the TSRSPBO realistically represents our case com-
pany Western Bulk’s operational environment. We propose an arc flow model
for the TSRSPBO, and to solve instances of larger sizes, we propose a solution
method based on a path flow model with routes (columns) generated a priori.
For each of these routes, we have to determine the optimal cargo and bunker
quantities along the route, which we do by solving a linear programming model.
By solving test instances based on real-life data, we show how our models and
proposed solution method may act as an aid to chartering managers in dry bulk
shipping companies.
The remaining of this paper is organized as follows. Section 2 provides a
definition of the TSRSPBO, while Sect. 3 presents its arc flow model. Then, in
Sect. 4, we propose the path flow solution method. The computational study is
provided in Sect. 5, while concluding remarks are given in Sect. 6.
2 Problem Definition
This section formally defines the TSRSPBO, which is relevant to many shipping
companies operators, e.g., in the dry bulk shipping industry. The TSRSPBO
considers a ship operator controlling a given heterogeneous fleet of vessels with
differing cargo and bunker carrying capacities, as well as initial positions and
bunker levels. Each vessel has a given sailing speed and fuel consumption.
The vessel fleet is set up to transport a set of cargoes for the following plan-
ning horizon, e.g., the next two months. Each cargo is specified by its cargo
type, pickup and delivery ports and time windows, and freight rate. A cargo
can either be a contractual mandatory cargo or a spot cargo. A contractual
cargo usually stems from a long term Contract of Affreightment (CoA), and is
therefore denoted a CoA cargo. A spot cargo is an optional cargo for which the
ship operator not yet has signed a contract. In dry bulk shipping it is common
that the cargo contracts offer some flexibility to the ship operator regarding the
Tramp Ship Routing with Bunker Optimization 187
This section presents the arc flow (AF) model for the TSRSPBO. We call this
model an AF model because we here use arc flow variables specifying which arcs
the vessels traverse within a network representation of the problem. Section 3.1
describes the modeling assumptions and notation, while Sect. 3.2 presents the
AF model.
188 S. Omholt-Jensen et al.
In Sect. 2, it was stated that there is a delivery time window for each cargo. In
reality, it is often only a pickup time time window, while it is specified that deliv-
ery should occur at the ”earliest convenience”. Thus, to exclude the possibility
of a vessel carrying a cargo for too long time, artificial delivery time windows are
constructed by multiplying the direct sailing time between the cargo’s pickup
and delivery ports and port time with a factor, e.g., 1.5. This design choice
allows detours to purchase bunker and pick up other cargoes while ensuring the
delivery of all cargoes within a reasonable time.
Furthermore, we assume that the time it takes to load bunker is not depen-
dent on the amount. This assumption is in line with [13]. A fixed bunker time
is therefore added to the sailing time for trips arriving at a bunker port. At
the beginning of the planning period, vessels might be in port or at sea as they
operate 24/7. We assume that each vessel becomes available at a given port, rep-
resented by an origin node, corresponding to the discharge port of their ongoing
voyage. After completing the last port call in the planning horizon, a vessel ends
its route by traveling to an artificial destination node at zero cost and time.
The vessel’s bunker load at this destination node is considered a resource, so the
difference in bunker load between the origin and destination node is priced at
an average price for all the bunker ports. The value of this additional bunker is
reflected in the model’s objective.
The notation and formulation presented here is based on the arc flow formu-
lation presented in [4,5,13]. Let the fleet of vessels be represented by the set V.
There are N cargoes available for transport in the following planning horizon,
where the sets of pickup and delivery nodes are given by N P = {0, . . . , N } and
N D = {N + 1, . . . , 2N }, respectively. These sets allow each cargo i to be repre-
sented by a pickup node i ∈ N P and a delivery node (N + i) ∈ N D . As such, the
set of all cargo-related nodes is represented as N = N P ∪ N D . The set of pickup
nodes N P is partitioned into N P = N C ∪ N O , where N C and N O denote the
mandatory contracted and optional spot cargoes, respectively. A vessel v has a
node of origin o(v), while it is assigned an artificial destination node d(v). Due to
cargo capacity, depth, and cargo handling equipment restrictions, some vessels
in V and nodes in N may be incompatible. Thus, the set of nodes a vessel v may
visit is represented as Nv ⊆ N ∪ {o(v), d(v)}. (Nv , Av ) defines a base network
for each vessel v, where the set of arcs Av ⊆ {(i, j)|i ∈ Nv , j ∈ Nv } represents
all arcs traversable by vessel v with respect to time, capacity and bunker con-
sumption. The pickup and delivery nodes compatible with vessel v are defined
as NvP = N P ∩ Nv and NvD = N D ∩ Nv .
The next step is extending the base network (Nv , Av ) to allow vessel visits
at bunker nodes (ports). Let Bv be the set of bunker nodes that vessel v may
visit. The set of base nodes Nv for vessel v is extended to include the nodes in
Bv such that N̂v = Nv ∪ Bv . Furthermore, the set of base arcs Av is extended
by adding all feasible arcs connecting nodes in Nv \d(v) with nodes in Bv . As
such, the extended set of arcs is defined as Âv = Av ∪ AB v , where Av is the set
B
Tramp Ship Routing with Bunker Optimization 189
of arcs for vessel v connecting bunker option nodes Bv to the nodes in Nv . The
extended cargo-bunker network is thereby (N̂v , Âv ).
S
Let Tijv be the sailing time between nodes i and j for vessel v. If node j is a
S
bunker node, a fixed time is added to Tijv , signifying the time spent bunkering.
The time required to load or discharge one unit of cargo i with vessel v is
Q
defined as Tiv . The variable voyage cost Cijv accounts for the costs of visiting
node i and the sailing from node i to node j for vessel v. Note that the cost
of the purchased bunker is not included in this parameter as the purchase of
bunker is modeled separately. Instead, let Pi denote the cost of purchasing one
unit of bunker available in bunker node i. The bonus bunker remaining at the
destination node d(v) is rewarded by P per unit (calculated as the mean of all
available bunker node prices). The voyage charter cost of servicing a CoA cargo
i by a spot ship is denoted CiS . Furthermore, the bunker consumption for vessel
v traversing arc (i, j) ∈ Âv is denoted BijvS
. The bunker consumed per time unit
P
in port i by vessel v is given by Biv .
The quantity of cargo that can be transported is flexible within interval
Qi , Qi . There is a per unit revenue Ri for transporting cargo i ∈ N P and
a time window T iv , T iv for when service must start at node i (strengthened
for each vessel v). Additionally, the bunker level on board each vessel v must
be within a lower limit B v and an upper limit B v to ensure realistic and safe
operation. Finally, the cargo carrying capacity of vessel v is denoted Kv .
The binary arc flow variable xijv is assigned the value 1 if vessel v traverses
arc (i, j) ∈ Âv , and 0 otherwise. The variable tiv denotes the time vessel v begins
service at node i. The quantity of cargo i transported by vessel v is represented
by qiv , and the quantity of bunker purchased by vessel v at bunker node i is
given by the variable biv . The total cargo load on board a vessel v when it leaves
C B
node i is represented by the variable liv . Similarly, the variable liv represents the
total bunker load on board vessel v just after completing service at node i. To
represent whether a CoA cargo is serviced by a spot ship, the binary variable zi
is assigned the value 1, or 0 otherwise. Finally, the binary variable yi is assigned
a value of 1 if the optional spot cargo i is serviced, and 0 otherwise.
Objective Function
max Ri qiv − Cijv xijv − CiS zi
v∈V i∈N P v∈V (i,j)∈Âv i∈N C
(1)
− Pi biv + P · (ld(v)v
B
− Bv0 )
v∈V i∈Bv v∈V
The objective function (1) maximizes the total profit. The first term specifies
the revenue generated for both contracted and optional cargoes, while the second
190 S. Omholt-Jensen et al.
term specifies the variable voyage costs. The third term represents the voyage
charter costs associated with servicing CoA cargoes with spot ships. The fourth
term describes the cost associated with purchasing bunker, while the last term
calculates the value of the bonus bunker remaining at the destination node. Note
that if the bunker level at the destination is higher than at the origin, this term
will be positive.
Constraints (2) ensure that all mandatory CoA cargoes are transported by a
vessel in the predefined fleet or by a spot ship. In contrast, Constraints (3) state
that all spot cargoes may be transported. Constraints (4) ensure that all vessels
begin their routes from their origin node and travel to a pickup node, a bunker
option node, or directly to their destination node. Constraints (5) denote the
flow conservation, while Constraints (6) force each vessel’s route to end at its
artificial destination node. Constraints (7) ensure that the same vessel services
both the pickup and the cargo’s corresponding delivery node.
Temporal Constraints
Q
xijv tiv + Tiv S
qiv + Tijv − tjv ≤ 0 v ∈ V, (i, j) ∈ Âv
|i ∈ NvP (8)
Q
xijv tiv + Tiv S
qi−N,v + Tijv − tjv ≤ 0 v ∈ V, (i, j) ∈ Âv
|i ∈ NvD (9)
xijv tiv + S
Tijv − tjv ≤ 0 v ∈ V, (i, j) ∈ Âv
|i ∈
/ NvP ∧ i ∈
/ NvD (10)
Q
+i,v − tN +i,v ≤ 0 v ∈ V, i ∈ NvP
S
tiv + Tiv qiv + Ti,N (11)
Tramp Ship Routing with Bunker Optimization 191
Constraints (8)–(9) specify the time progression for the pickup and delivery
nodes, respectively. Note, however, that the cargo quantity variable qiv is only
defined for pickup nodes. Constraints (10) describe a similar time flow for origin,
bunker, and destination nodes as there is no cargo loading or unloading at these
nodes. Constraints (11) are precedence constraints forcing the pickup node to
be serviced before its corresponding delivery node. Finally, Constraints (12) are
the time window constraints.
Cargo Constraints
C
xijv liv + qjv − ljv
C
=0 v ∈ V, (i, j) ∈ Âv
|j ∈ NvP (13)
C
xi,N +j,v liv − qjv − lN
C
+j,v = 0 v ∈ V, (i, N + j) ∈ Âv
|j ∈ NvP (14)
C
xijv liv − C
ljv =0 v ∈ V, (i, j) ∈ Âv
|j ∈ Bv (15)
C
lo(v),v =0 v∈V (16)
qiv ≤ liv
C
≤ Kv xijv v ∈ V, i ∈ NvP (17)
j∈N̂v
0 ≤ lN
C
+i,v ≤ (Kv − qiv ) xN +i,jv v ∈ V, i ∈ NvP (18)
j∈N̂v
Qi xijv ≤ qiv ≤ Qi xijv v ∈ V, i ∈ NvP (19)
j∈N̂v j∈N̂v
Constraints (13)–(15) represent the cargo load balance constraints for pickup,
delivery and bunker nodes, respectively. Constraints (16) initialize the cargo load
on board the vessels to zero. Constraints (17) and (18) describe the cargo load
capacity restrictions at pickup nodes and delivery nodes, respectively. Finally,
Constraints (19) ensure that the cargo loaded at pickup port i is within the
minimum and maximum cargo load limits. The summations in Constraints (17)–
(19) are introduced to strengthen the formulation.
Bunker Constraints
P Q
B
xijv liv − Bijv
S
− Bjv Tjv qjv − ljv
B
=0 v ∈ V, (i, j) ∈ Âv
|j ∈ NvP (20)
P Q
B
xijv liv − Bijv
S
− Biv Tjv qj−N,v − ljv
B
=0 v ∈ V, (i, j) ∈ Âv
|j ∈ NvD (21)
192 S. Omholt-Jensen et al.
B
xijv liv − Bijv
S
+ bjv − ljv
B
= 0 v ∈ V, (i, j) ∈ Âv
|j ∈ Bv (22)
B
xijv liv − Bijv
S
− ljv
B
= 0 v ∈ V, (i, j) ∈ Âv
|j ∈ {d(v)} (23)
B
=
lo(v)v Bv0 v ∈ V (24)
P Q
S
B v + Bijv + Bjv Tjv qjv xijv ≤ liv
B
≤ Bv xijv v ∈ V, i ∈ N̂v (25)
j∈N̂v j∈N̂v
biv ≤ B v − B v xijv v ∈ V, i ∈ Bv (26)
j∈N̂v
Constraints (20)–(23) describe the bunker balance constraints for the pickup,
delivery, bunker and destination nodes, respectively. Constraints (24) specify
the bunker level on board vessel v at the beginning of the planning period. Con-
straints (25) make sure that the bunker loads on board the vessels are kept within
its limits, while Constraints (26) ensure that the amount bunker purchased is
feasible. The summations in (25) and (26) are again included to strengthen the
formulation.
Variable Domains
In the above arc flow model, the balance constraints for time, cargo, and
bunker are non-linear. The same is the case for Constraints (18), handling the
discharge load capacities of the vessels, as well as for Constraints (25), controlling
the bunker load capacities of the vessels. These constraints are all linearized,
using standard linearization techniques with big-M constraints (e.g., [4]), to make
the model solvable by a commercial MIP-solver.
Tramp Ship Routing with Bunker Optimization 193
max Psv xsv − CiS zi (33)
v∈V s∈Sv i∈N C
Aisv xsv + zi = 1 i ∈ NC (34)
v∈V s∈Sv
Aisv xsv − yi = 0 i ∈ NO (35)
v∈V s∈Sv
xsv = 1 v∈V (36)
s∈Sv
cargo. Constraints (34) state that each contracted cargo is either transported by
a vessel in the fleet or by a spot ship, similar to Constraints (2) in the arc flow
model. Constraints (35) specify that optional cargoes may be serviced by a vessel
in the fleet, similar to Constraints (3). Constraints (36) ensure that each vessel
is assigned exactly one schedule. Finally, Constraints (37) define the domains of
the binary schedule assignment variables.
First, all routes are generated using a depth-first search for each vessel. Here,
we use the minimum cargo quantities Qi to ensure that all feasible routes are
generated, and we also make sure that they adhere to the vessel-specific con-
straints in the arc flow model. Given the set of all feasible routes, we solve a
Vessel Scheduling Problem (VSP) to determine the optimal amounts of bunker
and the cargo quantities along the route (together with timing of the activities).
It should be noted that, since the route is now given, the VSP does not
have any binary variables and is a Linear Programming (LP) model, which is
considerably easier to solve than the whole AF model. Solving the VSP for each
feasible route for each vessel yields the set of optimal schedules Sv for each vessel
v. The objective function value associated with each optimal schedule s is the
vessel-specific optimal profit Psv for route s used in the path flow formulation.
5 Computational Study
The arc flow and the path flow models were implemented in C++ and solved by
Gurobi (version 10.0.0). All of the computational experiments were performed on
a MacBook Pro with an Apple M1 Pro processor. In the following, we describe
the test instances used in Sect. 5.1, before we present the computational results
in Sect. 5.2 and provide some managerial insights in Sect. 5.3.
This section explains how real-life cargo and voyage data from the industry
partner, Western Bulk1 , was compiled into suitable test instances. In addition
to Western Bulk’s operational data, routing and port information were pro-
vided by Maritime Optima2 , a maritime data analytics company. The remaining
input data needed to generate realistic test instances was either gathered or con-
structed from open sources, estimated from available data, or randomly sampled
from reasonable uniform intervals.
We generated a set of test instances of different size classes, as summarized
in Table 1. Within each class, we generated five instances. For each test instance,
1
www.westernbulk.com.
2
www.maritimeoptima.com.
Tramp Ship Routing with Bunker Optimization 195
the number of spot cargoes was fixed to be twice the number of CoA cargoes.
As such, for a test instance with 15 cargoes, there are five CoA cargoes and 10
spot cargoes. This is representative for Western Bulk’s operational environment.
The cargoes’ MoLOO limits were estimated by creating a ±10% interval
from the actual quantities of historical transported cargoes. The origin ports of
the vessels were randomly drawn from a set of the most frequent delivery ports
visited by Western Bulk’s vessels. A final group of parameters was randomly
sampled from sensible intervals. These include the time when vessels first become
available to transport cargoes, T o(v)v , the beginning of the time windows for
pickup ports, T iv , i ∈ NvP , and the initial bunker level on board the vessels, Bv0 .
For pickup ports, the upper time windows were constructed by adding ten days
to the start, which corresponds to what is typical in Western Bulk’s operation.
The results for the arc flow model are summarized in Table 2. The column Solved
shows the number of test instances within each class that were solved to opti-
mality for the AF model within the maximum allowed running time, which is set
to one hour. The Time column shows the average solution times in seconds. The
BestBoundAF column displays the best dual bound found when solving the AF
model, averaged over the test instances in each class. Similarly, the ObjValsAF
column states the best primal bounds, i.e., the average profit of the obtained
solutions. The Gap column shows the average percentage difference between
the best primal and dual bounds.
Table 2 shows that only the instances of the first two classes were solved to
optimality within one hour. For the C9V3B4 class, two out of five test instances
were solved to optimality. For the larger instances, the AF model could not
provide an optimal solution within one hour, and the gaps are high. It should
be noted that introducing bunker optimization to the tramp ship routing and
scheduling problem significantly increases the problem complexity. To illustrate
this, the arcs going to and from the bunker nodes constitute as much as 62.5%
of the total number of arcs (on average across all test instances).
196 S. Omholt-Jensen et al.
Table 2. Arc flow (AF) model results averaged over instances in each instance class
The results of the path flow solution method are summarized in Table 3,
where the Solved column shows the number of test instances within each class
that were solved to optimality within one hour. The NR column displays the
average number of routes generated for instances within each class. The column
Time shows the solution times in seconds, wheremost of the time is spent in the
generation of the routes and solving the LPs to determine the optimal bunker
amounts and cargo quantities. The NSS and NSC columns state the average
number of spot ships used and the number of spot cargoes transported in the
optimal solutions, respectively. Furthermore, the ObjValTOT column states the
optimal objective values, i.e., the profit, averaged over the test instances in each
class. Finally, the Δ column reports the average percentage improvements in the
objective value found when solving the path flow model compared to the best
objective value found when solving the AF model (after one hour).
Table 3. Path flow model results averaged over the instances in each instance class
In general, the path flow solution method clearly outperforms the AF model,
both with respect to solution time and quality. As we can see from Table 3, the
path flow solution method is able to solve all except for three of the largest
instances. Nevertheless, the run time grows quickly along with the number of
feasible routes/columns when the instance size increases. Eventually, this also
results in memory problems, and the largest instances solved here represent more
Tramp Ship Routing with Bunker Optimization 197
or less the limit of the path flow solution method. For even larger test instances,
either heuristics or more advanced exact methods will be needed.
Our model offers potential for a wide range of analyses that can help managers
in making the addressed decisions. In this section, we concentrate on exploring
how the negotiation of flexible cargo quantities and bunker purchase discounts
may impact profits. There is surely a large range of further factors that could
be analysed, such as fleet diversity, fleet age, or repositioning flexibility (see [1])
but we leave this to future research also as some of these issues may require
extensions of our proposed model.
Fig. 1. The effect of utilizing flexible cargo quantities for a test instance in C15V5B10
For large parts of Fig. 1, there is a (close to) linear profit increase from
increased flexibility. This comes simply from the possibility of transporting larger
quantities of the same cargoes. However, when the MoLOO limits are increased
from ±3% to ±4% and from ±24% to ±25%, we notice two jumps in the profit.
This is because the additional flexibility makes it possible to reduce the quantities
for some cargoes, and hence be able to transport additional spot cargoes. This
shows that increased contract flexibility in cargo quantities may have a huge
impact on profitability. These results are in accordance with [8].
198 S. Omholt-Jensen et al.
Fig. 2. Bunker prices and total number of bunker port visits in test instances C15V5B10
Tramp Ship Routing with Bunker Optimization 199
Fig. 3. Averaged effects of procuring discounted bunker for test instances C15V5B10
Figure 3 shows the increase in profit averaged over the five test instances
in C15V5B10 at different bunker discount rates. The bunker discount rates are
applied one at a time to each of the bunker ports. As expected, increasing the
discount rate leads to increased profit. A bunker price discount at the ports of
Singapore, Al Fujayrah and/or Kandia has the highest impact on the profits.
Their impact on profits is in line with Fig. 2, as these ports are among the most
frequently visited ports. As such, if Western Bulk has to choose which port they
should focus their efforts on procuring a discounted bunker purchase contract,
Fig. 3 suggests either one of these. For the previously discussed unattractive
ports Gladstone and Port Elizabeth, even a bunker price discount of 20% has
(almost) no impact on the solutions. Interestingly, Hong Kong, a port that was
never visited in Fig. 2, displays a greater impact on the average profit for the
test instances than Zhoushan Port, which was visited five times even without a
bunker discount. The explanation is that Hong Kong is more centrally located in
the Indo-Pacific region than Zhoushan Port. Whereas Zhoushan Port is located
further north, closer to Shanghai, Hong Kong lies further south, closer to Sin-
gapore. As such, decreasing the price of bunker in Hong Kong can lead to more
port visits than lowering the bunker price in Zhoushan. Bunker purchase man-
agers in Western Bulk may therefore leverage the information provided in Fig. 3
during bunker purchase negotiations.
6 Concluding Remarks
We have considered a real rich ship routing problem arising in the dry bulk ship-
ping segment, a problem we have denoted Tramp Ship Routing and Scheduling
Problem with Bunker Optimization (TSRSPBO). The TSRSPBO includes, in
200 S. Omholt-Jensen et al.
addition to the routing and scheduling of the given vessels in the fleet, deci-
sions about how much to bunker (refuel) in which ports. Furthermore, the cargo
quantities to be transported in the TSRSPBO are given in an interval (i.e.,
MoLOO limits), which means that determining the optimal transport quantities
also becomes a decision. We presented an arc flow model for the TSRSPBO, but
the commercial MIP solver is only able to solve tiny instances of the problem,
partly because the size of the network increases drastically with the inclusion of
the bunker ports. Hence, we have proposed a path flow solution method based
on a Dantzig-Wolfe decomposition of the problem. This solution method consists
of a path flow model using all feasible vessel routes (columns) as input. For each
such route, we determine the optimal amounts of both the bunker and cargoes
along the route by solving an LP problem.
The solution method was tested on a set of realistic instances generated
based on data from Western Bulk, a dry bulk shipping operator. The computa-
tional study suggests that the path flow solution method outperforms solving the
arc flow model by a commercial solver and that it is able to consistently solve
instances of up to 30 cargoes, 10 vessels, and 10 bunker ports within 30 min.
Furthermore, additional tests show how different MoLOO limits and bunker dis-
counts in different ports may affect the ship operator’s profit. Hence, the ship
operator may use these insights in the negotiation of freight contracts and to
determine for which ports it is most beneficial to negotiate bunker price dis-
counts.
References
1. Bai, X., Cheng, L., Çağatay, I.: Data-driven financial and operational risk man-
agement: empirical evidence from the global tramp shipping industry. Transp. Res.
Part E: Logistics Transp. Rev. 158, 102617 (2022). https://doi.org/10.1016/j.tre.
2022.102617
2. Brønmo, G., Christiansen, M., Nygreen, B.: Ship routing and scheduling with flex-
ible cargo sizes. J. Oper. Res. Soc. 58(9), 1167–1177 (2007). https://doi.org/10.
1057/palgrave.jors.2602263
3. BunkerEx: BunkerEx — Bunker Prices — Bunker Broker United Kingdom (2022).
https://www.bunker-ex.com. (accessed 2022-11-11)
4. Christiansen, M., Fagerholt, K.: Chapter 13: ship routing and scheduling in indus-
trial and tramp shipping. In: Toth, P., Vigo, D. (eds.) Vehicle Routing, pp. 381–408.
Society for Industrial and Applied Mathematics, Philadelphia, PA (2014). https://
doi.org/10.1137/1.9781611973594.ch13
5. Christiansen, M., Fagerholt, K., Nygreen, B., Ronen, D.: Chapter 4 maritime trans-
portation. In: Barnhart, C., Laporte, G. (eds.) Handbooks in Operations Research
and Management Science, Transportation, vol. 14, pp. 189–284. Elsevier (2007).
https://doi.org/10.1016/S0927-0507(06)14004-9
6. Christiansen, M., Fagerholt, K., Nygreen, B., Ronen, D.: Ship routing and schedul-
ing in the new millennium. Eur. J. Oper. Res. 228(3), 467–483 (2013). https://
doi.org/10.1016/j.ejor.2012.12.002
7. dos Santos, P.T.G., Kretschmann, E., Borenstein, D., Guedes, P.C.: Cargo routing
and scheduling problem in deep-sea transportation: case study from a fertilizer
Tramp Ship Routing with Bunker Optimization 201
1 Introduction
Digital twins have evolved as one of the main drivers for digital transformation
in major seaports and are seen as key technology of Industry 4.0 applications. In
industry, digital twin technologies are recognized as one of the most important
areas of investment for terminal operators in the coming years [27]. The main
reason is that they can be implemented in various forms and support many use
cases to enable automation and improve productivity, efficiency, resilience and
sustainability in terminals and seaports.
According to Vanderhorn et al. [39] a digital twin is a “virtual representa-
tion of a physical system (and its associated environment and processes) that is
updated through the exchange of information between the physical and virtual
system.” As such, it is comprised of a set of technologies to gather (near) real-
time data from infrastructure and superstructure and its environment, associate
it with related port processes, visualize and analyze it to make decisions having
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 202–218, 2023.
https://doi.org/10.1007/978-3-031-43612-3_12
Digital Twins in Seaports: Current and Future Applications 203
2 Related Literature
case of fleet management. In this case, the focus is on historical data, which
serves as a basis for analyzing bottlenecks and technical problems in order to
avoid them during planning [17]. Another example in this context can be found
in the Port of Livorno, Italy, where advanced technologies, such as 3D LIDAR
sensors and Wide Dynamic Range (WDR) cameras, are used for the localization
of cargo boxes and forklifts. Based on the planned tasks, the forklift driver is
guided within the terminal by wearing AR glasses.3 An example of a digital twin
in multi-purpose terminals is the Port of Oulu, Finland, where both vessel- and
terminal related information are represented in a geographic information system
(GIS) in order to increase situational awareness for improving the efficiency and
safety in the terminals. The digital twin provides a number of tools enabling
planning of port infrastructure elements, such as industrial lighting or cameras
of the monitoring system [5].
Looking at the application area of container terminals, the digital twin at the
Port of Qingdao realizes a wide variety of use cases. The status of vehicles along
with respective processes are represented in a complex 3D visualization. Not
only the positions and possible technical problems are displayed based on (near)
real-time GNSS and IoT data, but also simulations and optimization methods
for path planning and forecasts of the future handling rate based on ML are inte-
grated. As required for digital twins, decisions of the digital twin are propagated
to the control units of the AGVs [43]. Another focus is on various visualizations
for the operating status or the system behavior, as well as the structured imple-
mentation [42]. Similar applications can be found in the Port of Shanghai and
Dalian, China, where a very detailed digital twin supporting decision making for
automated and semi-automated terminals has been developed [10,21]. Several
digital twins apply big data technologies, such as Hadoop, to process various
sources of data. In the Port of Valencia, for instance, key performance indicators
are calculated based on operational data, AIS (automatic identification system),
and weather data. Another advanced digital twin for a container terminal in the
Port of Hamburg, Germany, is detailed in Sect. 4.
The analysis of the digital twins in Table 1 reveals that data sources, process-
ing techniques, user interfaces and twin integration vary across the different
application areas. For example, some applications rely solely on IoT data, while
others use many data sources and establish advanced decision support. Simi-
larly, some use cases require a high level of detail, while others can operate at
a more abstract level. Although the described applications give a good idea of
the use cases, the following section will take a generalized look at the different
layers of implementations and their maturity. The following analysis focuses on
the technical layers described in Sect. 2 which were developed by Yang et al. [43]
and extended by a sixth layer of self-adaption in Redelinghuys et al. [28].
3
See, e.g., https://www.ericsson.com/en/blog/2020/12/digital-twins-port-
operations.
208 J. Neugebauer et al.
The physical resource layer is present in all digital twins listed in Table 1,
albeit to very different degrees. In the Port of Anzio, Italy, only certain aspects
regarding energy production and consumption are represented while the digital
twin in Shanghai includes models of vehicles, infrastructure, weather and other
factors influencing terminal operations. Interacting mechanisms as well as the
current state of each object are taken into account and thus show the physical
resources in more detail. Across the layer, it is important to note that the level
of detail and the number of objects being represented are almost unlimited.
The trade off between the complexity and utility of displaying detailed physical
resources and the associated cost of implementing them must be considered.
This layer describes only the resource and its general attributes, excluding more
advanced process definitions or rules and more extensive environmental and
business-related information.
To model additional aspects besides the physical objects, especially environ-
mental aspects and business processes (as required by digital twins by Van Der
Horn [39]), the virtual resource layer is used. Similar to the previous level, the
more detailed and extensive the contextual information, the better and more
accurate the digital twin. During the interviews, the person in charge at the
ports of Antwerp-Bruges mentioned their goal to build the port’s nervous sys-
tem, including things like smell, which was an example for air quality sensors
and sight for example, realized with cameras and various visualizations. Other
environmental factors are derived from IoT and external data, which included,
for example, infrastructure conditions and vessel tracking. Examples for com-
bining physical objects with business processes in which they interact are given
in the case study presented in Sect. 4.
Data connectivity as a layer of digital twins is described in detail in the Port
of Qingdao. In order to use all the information and connect the representation
of physical resources and their behavior as well as 3D models, the network,
protocols, interfaces and general data exchange must be defined and managed.
This is critical for establishing an integration and is the focus of many digital twin
projects. For the Port of Singapore, for example, an open source framework was
used to model the digital twin in order to ensure future ease of integration and
extensibility, and all of the ports interviewed cited the specification of interfaces
as one of the biggest challenges in development. The data connection layer is
directly linked to the twin data layer, which includes the storage and processing
of data, but also advanced data processing techniques such as big data analytics
and other data mining techniques. For example, in the interview with the Port of
Hamburg, it was mentioned that traffic information is combined with simulation
data to be used in various applications. Another example is a BIM infrastructure
model, used for the before-mentioned Köhlbrand bridge, which was then used in
the application service layer to predict damage and schedule maintenance.
The application service layer includes decision support, using advanced meth-
ods, such as simulation modeling, optimization approaches and ML, but also
monitoring and business intelligence dashboards for supporting human decision
making. Examples from the digital twins include the actual traffic management
Digital Twins in Seaports: Current and Future Applications 209
service and its visualization implemented at the Port of Livorno but also the
analysis and optimization of operations for the container terminal at the Port
of Mawan. Note that although all projects listed in Table 1 incorporate this
layer, albeit with varying degrees of detail, not all feed these insights back into
the physical domain in an automated or semi-automated and (near) real-time
manner.
The self-adaptation layer can be realized when data from the lower layers are
considered on the application layer to adapt to certain situations and propagate
decisions back to the physical resources so that they can adapt accordingly. If
the self-adaptation layer is implemented in an architecture for cross-layer infor-
mation and the various layers are implemented with a high level of detail, it
is assumed that there is a huge potential for automation. In addition, a large
number of use cases can then be supported. This is also the goal in the presented
case study in the following section.
However, only a part of the data comes directly from the physical devices. In
the first phase, the project focuses on collecting data from SCs and gantry cranes.
This is achieved by connecting the equipment’s programmable logic controller
(PLC), sensors, and GNSS though an edge processor to an on-premises message
broker, which is communicating with a cloud message broker using the MQTT
(Message Queuing Telemetry Transport) protocol, as shown in Fig. 2.
Digital Twins in Seaports: Current and Future Applications 211
The virtual resource layer focuses mainly on building cycles of SCs and gantry
cranes. A cycle contains all equipment activities for moving a container from
one position (e.g., in the yard) to another position (e.g., at the quay). In some
situations also dual cycles need to be considered, meaning that two cycles are
considered together, such as when loading and unloading a full container from a
vessel in two subsequent moves. Note that one cycle can also contain more than
one container move (e.g., twin moves). For building cycles, from the edge device
containing PLC, GNSS and sensor data are combined with TOS data in order
to relate the IoT data with the business process context. The contextual data is
given by a job instruction from the TOS, representing all information to conduct
the movement of a container, such as the start and end positions, container
information, planned schedules, etc., resulting from the first planning phases
(e.g., vessel planning). The combination enables situational awareness about the
behavior and conditions of terminal equipment in relation to certain operations,
time, and other environmental impacts, such as weather if provided. Moreover, it
supports multiple use cases and has implications for all areas of operations since
it allows to analyze operations in detail and use extracted insights to improve the
planning and enhance the transparency for terminal operators and customers.
Added to this is ecologically relevant information such as fuel consumption or
tire wear, which can also be used later to monitor the total GHG emissions,
not only in general, but also the consumption of individual containers during
their lifetime at the terminal. In the project, the data from the equipment is
further used to enable predictive maintenance. As shown in Fig. 3, the IoT and
contextual data is shown when selecting physical objects in the digital twin. Note
that this is an interim result; later, extensive data dashboards and predictions
are included to better monitor conditions and key performance indicators.
Data connectivity, especially in a project of this size, with a multitude of
stakeholders, with data sources from various types of equipment from diffe-
rent suppliers and different information systems (e.g., TOS, ERP) in numer-
ous formats, becomes a complex task and thus requires standardized interfaces.
Currently one major problem with respect to the equipment is that suppliers
are not using the same definitions for measuring certain aspects, e.g., the opera-
tional time of a SC. In one case the engine-on time is used, in the other case, the
212 J. Neugebauer et al.
working hours of the driver are used. At the EUROGATE container terminals,
SCs of different suppliers are in use, making it difficult to compare metrics.
The Terminal Industry Committee 4.0 (TIC 4.0) is working on data stan-
dards for container terminals and includes crane manufacturers, TOS suppliers
and more (see [36]). A number of publications has been done for standardizing
the syntax and semantics for different equipment data sources [35]. In the project,
TIC 4.0 standards are used to assemble position data and technical information
from SCs (e.g., tire pressure, oil temperature, etc.) and gantry cranes, send them
in (near) real-time to message brokers and combine them with standardized job
instructions filled with information from the TOS to build respective cycles.
One major advantage of the proposed digital twins is that the standardization
allows to connect information systems (e.g., TOS) and equipment from differ-
ent suppliers/providers without changing the core implementation by talking
or translating into the same language. Therefore, all higher level applications,
including advanced decision support functionality, can read and write data in
this predefined format and solutions can be developed without spending time
on interface definition and in an agile manner. As equipment suppliers have just
started with implementing TIC 4.0 compliant data for newer equipment, it is
still necessary to transform the equipment data in TIC messages. As shown in
Fig. 2, a special edge processor has been implemented in the project to trans-
form all data records before forwarding them to the cloud. The (near) real-time
data connectivity is enabled by a scalable message broker system that connects
edge devices from physical objects in on-premises networks with the cloud-based
streaming processors and the twin data layer.
The gathered data is processed in several data processing stages for multiple
purposes in the twin data layer, which is build with AWS cloud services. This
layer includes a cloud-based data lake and big data technologies to prepare the
Digital Twins in Seaports: Current and Future Applications 213
data for the higher level applications. At both the data connectivity and twin
data layers, some data streams need to be processed in (near) real time (e.g.,
objects for (near) real-time visualization). This affected the cloud services and
technologies chosen. A list of all applications realized for the digital twin at the
CTH is almost impossible to compile and is also not the goal of this work. Rather,
those that show the decision area of the digital twin are discussed. Currently the
terminal planning is realized with rule-based approaches using the TOS. For
dispatching the SCs, for example, a score for each container is calculated based
on several weighted factors, such as the shortest distance from a theoretical
position of an SC to its planned position. This is a rather greedy approach that
does not consider the combination of different moves.
Therefore, the goal is to improve this planning with more advanced algo-
rithms based on the vast availability of granular data, which will be discussed in
future works. Starting from the integration of accurate positions based on GNSS
data, the distance-based calculations can be replaced by those that include, for
example, the estimated actual distances traveled by the SCs. In the project, a
ML model for predicting travel times will be implemented. In order for these pre-
dictions to be as accurate as possible and thus represent the physical resources
most accurately, weather information is collected by sensors on the container
terminal. This provides information on precipitation, wind speed and direction,
current visibility and much more. Weather conditions also have an impact on
planned operations and maintenance activities, which may need to be postponed
due to several weather conditions. Thus, weather predictions also serve for early
warnings so that plannings can be adjusted accordingly. In addition, water levels
and more as well as arrival time predictions of vessels from external sources are
integrated.
As a second stage a simulation will provide a feedback loop for optimiza-
tion methods and will allow for a more in-depth analysis of manual decisions.
Through simulating past shifts with new conditions (e.g., more SCs being used)
lessons learned can be taken for improving the current situation. By simulating
the next shift, the future performance of the terminal can be evaluated without
interrupting ongoing operations. In the final phase of realizing the described dig-
ital twin, a simulation-based optimization is implemented for the dispatching of
SCs and container storage allocation, using all available information, integrating
ML, and applying operations research methods to find an optimized plan and
evaluate it through simulation. By taking into account granular data from the
past, the simulation helps to validate the plan in a more accurate and complex
representation of the container terminal. The results are subsequently reported
to the TOS and passed on, e.g., to SC drivers, who then take an optimized route
better utilizing available resources, avoiding unproductive moves, waiting times,
and minimizing the environmental impact. Moreover, the solution can lead to a
greater operational resilience as unforeseen events can be simulated and emer-
gency plans or more resilient plans can be assessed in advance. Since this system
controls for the movement of the vehicles, it is also easier to use automated
equipment in the future.
214 J. Neugebauer et al.
– Choosing and integrating suitable IoT devices and data processing technolo-
gies, also regarding implementation and operation costs
– Ensuring data quality, standardization, and security
– Scaling and adapting to changing needs
– Interacting with other systems and stakeholders (e.g., port community sys-
tems)
– Complying with ethical and legal regulations (e.g., the General Data Protec-
tion Regulation (GDPR) in Europe)
– Estimating and managing implementation costs and time
– Avoiding model bias and prejudice
– Expanding the scope to other parts of the supply chain
– Anticipating long-term impacts and dependencies on the digital twin
This paper has examined how digital twins can be used in seaports and
container terminals to optimize and improve terminal planning and operations
as well as maintenance. To this end, a review of existing publications on the
topic was conducted and a number of port cases were discussed to assess the
current state of research and practice. The interviewed experts expressed a need
for more standardization, further collaboration, and more empirical evidence
to support the adoption and evaluation of digital twins in this domain. Some
possible solutions or recommendations to address the challenges and limitations
of digital twins are:
Regarding the latter aspect, we see that in industry the question whether
digital twins are just costly toys or valuable support tools still arises.4 In this
context, this work has shown the possibilities of a digital twin on different levels
and with different degrees of complexity. Furthermore, it has been shown that an
in-depth requirements analysis in terms of valuable use cases for the individual
port or terminal is decisive to gain value (as further discussed in [22]).
4
See, e.g., http://www.bpoports.com/BPC/agenda-3.pdf.
Digital Twins in Seaports: Current and Future Applications 215
As digital twins become more mature and widely adopted, we expect to see
more benefits and opportunities for seaports and container terminals in two main
areas. The first area is expanding the scope and impact of digital twins to other
parts of the supply chain. Digital twins can enable a seamless integration and
coordination of port operations with other actors and processes in the supply
chain, such as shippers, carriers, logistics providers, customs authorities, and
end customers. This can improve the visibility, resilience, and efficiency of the
entire supply chain, as well as reduce costs, risks, and environmental impacts.
Digital twins can also support the development of new supply chain concepts
and solutions, such as synchromodality or the circular economy. The second area
is creating new business models and value propositions for port services and
products. Digital twins can enable ports and terminals to offer new services and
products that leverage their data and insights to create value for their customers
and partners. For example, ports and terminals may provide data-driven services
such as (near) real-time monitoring, forecasting, optimization, or simulations of
port operations and processes to their customers. They can also integrate data-
enabled objects such as “smart” containers that can communicate and interact
with the digital twin (see, e.g., [14]). These services and products can create new
revenue streams and competitive advantages for worldwide ports and container
terminals.
References
1. Agostinelli, S., Cumo, F., Nezhad, M.M., Orsini, G., Piras, G.: Renewable energy
system controlled by open-source tools and digital twin model: zero energy port
area in Italy. Energies 15(5), 1817 (2022). https://doi.org/10.3390/en15051817
2. Agostinelli, S., Neshat, M., Majidi Nezhad, M., Piras, G., Astiaso Garcia, D.:
Integrating renewable energy sources in Italian Port areas towards renewable
energy communities. Sustainability 14(21), 13720 (2022). https://doi.org/10.3390/
su142113720
3. Ali-Tolppa, J., Kajo, M.: Mobility and QoS prediction for dynamic coverage opti-
mization. In: Proceedings of the IEEE/IFIP Network Operations and Management
Symposium (NOMS), pp. 1–2. IEEE, Budapest, Hungary (2020). https://doi.org/
10.1109/NOMS47738.2020.9110396
4. Battilani, C., Galli, G., Arecco, S., Casarino, B., Granero, A., Lavagna, K., Varna,
R., Ventura, M., Revetria, R., Damiani, L.: Business process re-engineering in pub-
lic administration: the case study of Western Ligurian sea port authority. Sustain.
Futures 4, 100065 (2022). https://doi.org/10.1016/j.sftr.2022.100065
5. Blare.Tech: Increasing port operations efficiency and safety thanks to the full situ-
ational awareness through the use of digital twin (2023). https://www.blare.tech/
project/port-of-oulu-digital-twin
6. Cai, J., Li, G., Zhang, L.: Study on the construction of digital twin-oriented oper-
ation ecosystem model of large-scale comprehensive port. In: Yang, M., Lanzinha,
J.C., Samui, P., Bao, X., Hu, J. (eds.) Advances in Transdisciplinary Engineering,
pp. 1280–1285. IOS Press (2022). https://doi.org/10.3233/ATDE220998
7. Cavalli, L., et al.: Addressing efficiency and sustainability in the port of the future
with 5G: the experience of the Livorno port. A methodological insight to measure
216 J. Neugebauer et al.
22. Neugebauer, J., Heilig, L., Voß, S.: Anforderungsanalyse zur Umsetzung eines
digitalen Zwillings im Containerterminal. HMD Praxis der Wirtschaftsinformatik
60(1), 110–131 (2023). https://doi.org/10.1365/s40702-022-00941-1
23. Neugebauer, J., Heilig, L., Voß, S.: Digital twins in the context of seaports and
terminal facilities. In: Flexibil Services and Manufacturing, pp. 1–57 (2023)
24. Özkanli, A., DeniZhan, B.: Digitalization roadmap for Turkish seaports. Eur. J.
Sci. Technol. 358–363 (2020). https://doi.org/10.31590/ejosat.araconf46
25. Pagano, P., Antonelli, S., Tardo, A.: C-ports: a proposal for a comprehensive stan-
dardization and implementation plan of digital services offered by the “ort of the
future” Comput. Ind. 134, 103556 (2022). https://doi.org/10.48550/arXiv.2104.
13175
26. Pavlić Skender, H., Ribarić, E., Jović, M.: An overview of modern technologies in
leading global seaports. Pomorski Zbornik 59(1), 35–49 (2020). https://doi.org/
10.18048/2020.59.02. https://hrcak.srce.hr/249260
27. Port Technology: PSA highlights digital twin as biggest emerging technology
trend (2021). https://www.porttechnology.org/news/psa-highlights-digital-twin-
as-biggest-emerging-technology-trend/
28. Redelinghuys, A.J.H., Basson, A.H., Kruger, K.: A six-layer architecture for the
digital twin: a manufacturing case study implementation. J. Intell. Manuf. 31(6),
1383–1402 (2019). https://doi.org/10.1007/s10845-019-01516-6
29. Rost, P., et al.: Customized industrial networks: network slicing trial at Ham-
burg seaport. IEEE Wirel. Commun. 25(5), 48–55 (2018). https://doi.org/10.1109/
MWC.2018.1800045
30. Sarabia-Jacome, D., Lacalle, I., Palau, C.E., Esteve, M.: Enabling industrial data
space architecture for seaport scenario. In: Proceedings of the IEEE 5th World
Forum on Internet of Things (WF-IoT), p. 101–106. IEEE, Limerick, Ireland
(2019). https://doi.org/10.1109/ACCESS.2019.2963283
31. Sarabia-Jacome, D., Palau, C.E., Esteve, M., Boronat, F.: Seaport data space for
improving logistic maritime operations. IEEE Access 8, 4372–4382 (2020). https://
doi.org/10.1109/ACCESS.2019.2963283
32. Simoni, M., Schiavone, F., Risitano, M., Leone, D., Chen, J.: Group-specific busi-
ness process improvements via a port community system: the case of Rotterdam.
Prod. Plan. Control 33(4), 371–385 (2022). https://doi.org/10.1080/09537287.
2020.1824029
33. Tardo, A., Pagano, P., Antonelli, S., Rao, S.: Addressing digitalization though out
a prototyping framework for agile smart services development: the case of Livorno
Port. J. Phys: Conf. Ser. 2311(1), 012007 (2022). https://doi.org/10.1088/1742-
6596/2311/1/012007
34. Taylor, S.J.E., Johansson, B., Jeon, S., Lee, L.H., Lendermann, P., Shao, G.: Using
simulation and digital twins to innovate: are we getting smarter? In: Proceedings of
the Winter Simulation Conference (WSC), pp. 1–13. IEEE, Phoenix, USA (2021).
https://doi.org/10.1109/WSC52266.2021.9715535
35. Terminal Industry Committee 4.0: Publications of the Terminal Industry Commit-
tee 4.0 (2023). https://tic40.org/publications/
36. Terminal Industry Committee 4.0: TIC4.0 Members (2023). https://tic40.org/tic4-
0-members/
37. Tesse, J., Baldauf, U., Schirmer, I., Drews, P., Saxe, S.: Extending internet of things
enterprise architectures by digital twins exemplified in the context of the Hamburg
port authority. In: Proceedings of the 27th Americas Conference on Information
Systems (AMCIS), pp. 1–11. Montreal, Canada (2021)
218 J. Neugebauer et al.
1 Introduction
most frequent accident was the collision with infrastructure and bridges. This
type accounted for 38–40% of all accidents. The second most frequent type of
accident was the collision between ships (18–19%). Due to increasing low water
periods caused by climate change, low water levels will lead to smaller vessels
and, therefore, an increase in transportation costs [13]. In order to make accu-
rate predictions, it is critical to consider and account for exceptions in the inland
waterway network.
Intermodal connectivity and transport efficiency in inland waterways are cru-
cial to model cost competitiveness compared with other transportation modes
such as road and rail [21]. Waterway transportation has an irreplaceable key
position in the entire transportation development process. It is also one of the
important modes of transportation to have a significant ecological impact [12].
The maritime operations in a port involve many parties, including pilots,
tug boats, boatmen, agents, supervision agencies such as customs and police,
stevedores and others. The arrival of a ship triggers activities at all these parties,
who then determine the performance of the port as a whole by each contributing
their specialist activity [19]. The need for reliable tools to verify and ensure
the accuracy of the estimated time of arrival (ETA) information provided by
ships as they approach ports has never been more critical than it is today [3,
6]. This paper establishes a groundwork for future research by demonstrating
the advantages of using neural networks (NN) with simple input parameters
for ETA prediction in inland waterway transportation. The development and
testing of various NN architectures make it possible to consider external factors
such as weather forecasts, river depth forecasts, and infrastructural factors when
predicting ETA.
In Sect. 2, this paper presents the related research. The methodology used is
delineated, and the process of data preparation is provided in Sect. 3. Moving on
to Sect. 4, the training of our neural network model is detailed, along with the
accompanying results. Finally, the conclusion of the paper offers the limitations
and potential future research in Sect. 5.
2 Related Research
The utilisation of Automatic Identification System (AIS) data for ETA predic-
tion in seagoing vessels is a widely researched topic. It has been shown in various
research work [11,15,23] that data-driven algorithms achieve higher accuracy in
terms of the time of ETA error. The main goal of this research work is to improve
ETA time calculation to improve efficiencies in port operations. Valero [18] pre-
dicts ETA times to improve short sea shipping. El Mekkaoui [8] focuses more
on the improvements of predictions for bulk ports, and Pani [16], and Yu [23]
focus on container and transhipment terminals. The applied methods in overseas
shipping research used vary from machine learning algorithms such as NNs [2,7],
random forest [23], reinforcement learning [17], bayesian learning [15] and deep
learning [8,11] to pathfinding algorithms [1].
There are significant differences in IWT when estimating the ETA for seago-
ing vessels. Water levels and weather conditions are crucial factors that can
Neural Network ETA Prediction in Inland Waterway Transport 221
affect the vessel’s speed and route. Also, ports are only sometimes fixed, and
routes may need to be adjusted accordingly, making ETA prediction more com-
plex. Limited research is carried out to address ETA predictions in the context
of IWT specifically. Zhong et al. [25] proposed a deep learning method based
on bi-directional long short-term memory recurrent neural networks (BLSTM-
RNNs) for restoring AIS trajectory data and applied it to inland ship trajec-
tory restoration. The paper focused on ETA prediction in IWT. Noman et al.
[14] developed a data-driven approach for ETA prediction using gradient boost-
ing decision trees (GBDT), multi-layer perceptron neural networks (MLP), and
gated recurrent unit NNs (GRU) algorithms trained on past inland waterway
AIS data. The approach was tested for both natural and artificial waterways,
and the results showed that the GRU model outperformed the other models in
accuracy and efficiency. Overall, both papers focused on ETA prediction in the
context of IWT and proposed different machine-learning algorithms to address
this problem. Zhong et al. [25] focused on restoring AIS trajectory data, while
Noman et al. [14] focused on developing a data-driven approach for ETA predic-
tion. Two closely related papers have been identified. The paper of Xie and Liu
[22] proposed a deep learning model based on long short-term memory networks
(LSTMs) for vessel traffic flow prediction in inland waterways. The model was
designed to predict a wide range of traffic flow aspects, including short-term,
long-term, and the influence of water level factors. Yu et al. [24] explored deep
learning approaches for AIS data association in the context of maritime domain
awareness. The paper presented two methods for inferring ship association prob-
ability. One predicts the ship’s position before computing association probability,
while the other compute association probability directly using only longitude,
latitude, and time.
incorporates data from the Wuhan Yangtze River, including water levels that
fluctuate during flood periods throughout the year. This model does not consider
water levels. The author optimises the model instead of comparing the prediction
outputs to average calculations.
3 Methodology
This work’s objective is to predict the duration it takes for ships to travel from
the initial location A to the final location B, with a route passing through several
segments s. An A* algorithm is implemented to determine the segments travelled
from A to B, followed by the typical industry practice of computing the average
travel time. Afterwards, a NN is designed and trained to compare the model’s
predictions with the computed average. Our methodology and data preparation
process are outlined in Sect. 3, while the training of our NN model and the
corresponding results are presented in Sect. 4.
The maritime domain utilises the AIS to enhance ship traffic safety through real-
time broadcasting of vessel information, including identity, speed, location, and
course. For this study, prefiltered AIS data was downloaded from a cooperation
partner via an API call. The received data contains the position data of the ships,
including the following information, MMSI (Maritime Mobile Service Identities,
the unique ship number), Status of the ship (underway, moored, etc.), speed over
ground, course over ground, true heading, time of last update, ETA, destination
(manual input from the skipper), data source, maximum draught of the ship, new
streaming update and location (latitude and longitude). The data were filtered
based on the “Moored” status and vessels, not in motion, requiring a speed over
ground greater than zero.
Segments with a radius of 2.5 km are generated to cover the entire area.
The position data, consisting of start latitude, longitude, and end latitude and
longitude, is mapped to start node ID and end node ID. The position data is then
looped through to eliminate multiple position data for each segment, keeping
only the data closest to the midpoint of the segment. The resulting table is
referred to as segment crossings. The crossing duration is computed from the
start and end times of each segment crossing. For each segment, the number
of ships crossing it is calculated. Segments with less than 50 appearances and
trip durations below 50 s are excluded. From the resulting ship-crossing data,
the stop time and destination were dropped from the dataset. The dataset was
further enriched by adding this additional data, such as compass bearing. Any
rows with missing values were removed to ensure data integrity and accuracy in
the analysis. A NN model is created to predict the duration to travel from one
segment to another.
To improve the reliability of predictions for IWT, it is advisable to make
predictions for individual segments and obtain accurate predictions for complete
Neural Network ETA Prediction in Inland Waterway Transport 223
ship trips that traverse multiple sections. Therefore, developing and incorporat-
ing trip-level prediction models in addition to segment-level models is recom-
mended. Consequently, an algorithm was created to identify the travels of a ship
and add a trip ID to the dataset. The code iterates over all segment’s crossings
and adds a new value, “trip id”. For each row, it assigns a trip ID based on the
MMSI (an identifier for a vessel), the start time, and the start and end nodes. If
the MMSI changes or the time difference between consecutive rows is more than
1000 s seconds, or the start node of the current row is not the same as the end
node of the previous row, then it increments the trip ID counter. The following
Fig. 1 shows a sample trip in the dataset.
Similar to Derrow-Pinion [5], the Average Travel Times (ATT) between each
pair of cells are calculated based on the actual times ds for a ship-crossing s.
The ATT is calculated by the sum of the duration of the ship-crossing ds of the
number of start and end node pairs N and then divided by N .
N
i=1 ti
AT T = (1)
N
about the characteristics of the features. To ensure that the features are organ-
ised in the correct format for training the model, train features are transposed
and described. Using z-score normalisation in the previous approach could have
been more effective in producing accurate results. This paper utilised min-max
normalisation to scale the data between 0 and 1, preserving the original range.
The equation for min-max normalisation is as follows:
x − min(x)
x = (2)
max(x) − min(x)
x is the original value of the data, min(x) is the minimum value of the data,
max(x) is the maximum value of the data, and x is the normalized value of x
between 0 and 1. The numerator, (x − min(x)), subtracts the minimum value
from the original value to measure the distance between the original value and the
minimum value of the data. The denominator, (max(x)−min(x)), calculates the
range of the data by subtracting the minimum value from the maximum value.
By dividing the distance between the original value and the minimum value by
the range of the data, the resulting value is normalised between 0 and 1.
A validation set that is not used for training but to evaluate the model’s
performance on unseen data is created. 80 % of the data are used for training,
20 % are used for validating the model. The actual travel duration was excluded
from both the train and validation datasets.
The used input parameters are the following: the id of the start segment,
the id of the end segment, the Status of the ship (underway, moored, reserved),
Course over ground, True heading, Time of last update, ETA, Destination (man-
ual input from the skipper), Data Source, Maximum draught of the ship, new
streaming update, location (latitude and longitude), compass bearing and the
actual travel time from origin segment to destination segment.
Several correlations can be observed among the provided input parameters.
The course over ground and true heading is typically closely related, representing
the direction of ship movement. The time of last update and the new streaming
update are correlated, with the new streaming update expected to have a more
recent timestamp. The maximum draught of the ship and moored status might
be correlated, as the draught becomes less critical when the ship is stationary.
Furthermore, a correlation exists between the specified destination and the esti-
mated ETA, as the ETA reflects the projected time of reaching the destination.
The created model design is adapted from Chondrodima [4] using similar
parameters for performance evaluation and hyper-parameter selection. This is
a sequential NN model, an artificial deep-learning architecture. The sequential
model is a linear stack of layers, where the output of one layer serves as the
input for the next layer sequentially. The model has three layers, the first dense
layer with 30 output units and 510 trainable parameters. The activation func-
tion is a ReLu function-the second dense layer (hidden layer) with 30 output
units. The activation function is a ReLu function. The third and final dense
layer, with one output unit, represents the model’s output prediction. It has 31
trainable parameters. The activation function is a ReLu function. The model
Neural Network ETA Prediction in Inland Waterway Transport 225
has 1,471 parameters, all of which are trainable. The parameters of the model
are adaptively adjusted during training based on the patterns and information
contained in the training data. This adaptive learning process allows the model
to learn the appropriate representations and relationships in the data, enabling
it to make better predictions on new, unseen dataThese parameters are adjusted
during training to optimise the model’s performance on the given task. The
model has no non-trainable parameters, meaning all parameters are updated
during training. Figure 2 shows the design of the NN.
Fig. 2. NN with three layers, an input layer, two hidden layers and one output layer
Similar to the work of Xiu [22], the root two widely used criteria are adopted
to measure the error of the predicted data, they are the Root Mean Squared Error
(RMSE) and the Mean Relative Error (MRE). These criteria are commonly used
to evaluate the accuracy of predictive models.
4 Results
A case study was conducted to demonstrate the higher accuracy of neural
network-based ETA prediction for IWT. The code was implemented in Python
within a Google Colab environment, utilising an Intel Xeon CPU @2.20 GHz, 25
GB RAM, a Tesla K80 accelerator, and 12 GB GDDR5 VRAM.
The following section provides an overview of the case study data and an eval-
uation of the model training results. Subsequently, the predictions are compared
with the average results of the entire network and two specific areas.
The resulting area contains the busy area in Duisburg, a crossing close to
Doornenburg, the border crossing in Bimmen, and a straight line without any
stops between Emmerich and Wesel. After data preparation, 82 segments and
152,650 rows of ship-crossing data were obtained. Figure 3 illustrates the remain-
ing segments post-data filtering.
For comparison, two different approaches were employed. The first approach, the
Overall NN (ONN), involved using the entire network to train the model. The
second approach Individual NN (INN), utilised individually trained models for
each start and end node pair. The training and validation are made for the same
in the dataset, allowing for a comprehensive assessment of the performance of
each method.
The model was trained using data from January to March, and the training
process involved 100 epochs. The history epoch loss graph shows the model’s
MAE change over successive epochs and is illustrated in the following Fig. 4.
Similarly, the validation loss graph shows the change in loss on a separate
validation set during training. The validation graph provides insights into how
well the model generalises to new data. Since the validation loss remains consis-
tently low with the training loss, the model performs well on both the training
and validation data. After the 20th epoch, the ONN model’s learning rate is not
improving significantly. For the different INN models, the learning drops at the
80th epoch.
Xie [22] conducted predictions on a more granular level. The evaluation met-
rics for the models are crucial, and the Root Mean Square Error (RMSE) for
the ONN model is 5.8%, and the Mean Relative Error (MRE) is 12.32 %. For
Neural Network ETA Prediction in Inland Waterway Transport 227
(a) Loss function for ONN (b) 10 sample loss functions for INNs
Fig. 4. Loss and Validation Loss Function shows the differences between ONN and
INN
the INN model, the RMSE is 25%, and the MRE is 35.8%. However, the model
trained in this study produced similar results for a one-day prediction.
NNs are commonly evaluated by randomly partitioning data into training, val-
idation, and testing sets. However, this section uses historical data to predict
future data. Although the predicted data is also historical, it represents the
future concerning the training data used. A future dataset containing 59,000
entries of start and end segment information for April was obtained and prepro-
cessed for analysis.
t = (s1 , s2 , . . . , sn ) (3)
A trip t contains multiple segment-crossings si , for i = 1 . . . n. Using the struc-
ture of the trip t, the total duration is calculated using the following equation.
Ns
dt = ds (4)
s=1
For all trips Nt in the test dataset, the average AT T is calculated. The following
Eq. 7 shows the prediction error P E for the Average Calculation (AC):
Nt
P EAC = AT T − dt (7)
t=1
P EAC is calculated by substracting the sum of all trip durations dt from the
average travel times.
To create trip duration predictions from the NN, the algorithm iterates over
each segment in the route and predicts the duration of the segment-crossings.
The predicted duration is added to the current timestamp, which serves as an
input to the NN, and this updated timestamp is used to make the next prediction
for the next segment crossings. Predictions are made for ONN, and INN uses the
same methodology. Both the ONN and INN models are employed to generate
the predictions.
Ns
P N Nt = P N Ni (8)
i=1
The prediction duration of the trip for the NN (PNN) is the sum of all
segment crossing predictions.
Nt
i=1 P N Ni
PNN = (9)
Nt
For all trips Nt in the test dataset, the average of PNN is calculated.
Nt
P EN N = P N N − di (10)
i=1
The prediction error for the NN is calculated by subtracting the sum of all
trip durations from the PNN.
The dataset was analysed and segmented into short trips (3 segment cross-
ings), medium trips (9 segment crossings), and long trips (18 segment crossings).
To compare different trip lengths, long trips with more than 18 segment cross-
ings were truncated to 18 segment crossings, ensuring that at least 100 trips
were available for each trip length category.
The results of ETA prediction error using the three different models - AC,
ONN, and INN - on different data sets with varying numbers of segments and
durations. The first column in table 1 indicates the number of crossed segments
Ns per trip t, while the second column represents the duration dt in seconds.
The following three columns show the average predictions made by each model
for the respective data sets. Finally, the last three columns indicate the relative
standard deviation in percentage for each model’s predictions.
Neural Network ETA Prediction in Inland Waterway Transport 229
The results indicate that the NN methods have smaller prediction errors than
the AC method for all three trip length sets. The relative standard deviation is
the smallest for the ONN method for the 3-segment and 9-segment crossings,
while the INN method has the lowest relative standard deviation for the 18-
segment crossing. Overall, the results suggest that the ONN and INN method
are slightly more accurate in predicting segment-crossing duration than the AC
method.
Computational experiments have been performed on two selected regions to
provide an additional comparison, as shown in Fig. 5.
Fig. 5. Area of Kleve (green) and Duisburg (red) for comparison. (Color figure online)
Table 2. Results of the comparison of prediction methods for DUI and KLE
Trip durations are, in general, a bit shorter in the DUI area. The largest
inland port in Europe is located in Duisburg, so the dataset includes shorter
trips that involve docking at specific locations within the port. In the case of
the Klave region, noisy data may be attributed to crossing the border between
Germany and the Netherlands.
5 Conclusion
References
1. Alessandrini, A., Mazzarella, F., Vespe, M.: Estimated time of arrival using his-
torical vessel tracking data. IEEE Trans. Intell. Transp. Syst. 20(1), 7–15 (2019).
https://doi.org/10.3390/app11052429
2. Bautista-Sánchez, R., Barbosa-Santillan, L.I., Sánchez-Escobar, J.J.: Method for
Select Best AIS Data in Prediction Vessel Movements and Route Estimation. Appl.
Sci. 11(5), 2429 (2021). https://doi.org/10.3390/app11052429
3. Bouchard, P., Moros-Daza, A., Voß, S.: Simulation of an AIS system for the port of
Hamburg. In: Mes, M., Lalla-Ruiz, E., Voß, S. (eds.) ICCL 2021. LNCS, vol. 13004,
pp. 21–35. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87672-2 2
4. Chondrodima, E., Georgiou, H., Pelekis, N., Theodoridis, Y.: Particle swarm opti-
mization and RBF neural networks for public transport arrival time prediction
using GTFS data. Int. J. Inf. Manage. Data Insights 2(2), 100086 (2022). https://
doi.org/10.1016/j.jjimei.2022.100086
5. Derrow-Pinion, A., et al.: ETA prediction with graph neural networks in google
maps. In: Proceedings of the 30th ACM International Conference on Information &
Knowledge Management, pp. 3767–3776 (2021). https://doi.org/10.1145/3459637.
3481916
6. El Mekkaoui, S., Benabbou, L., Berrado, A.: Predicting ships estimated time of
arrival based on AIS data. In: Proceedings of the 13th International Conference on
Intelligent Systems: Theories and Applications, pp. 1–6 (2020)
7. El Mekkaoui, S., Benabbou, L., Berrado, A.: Machine learning models for effi-
cient port terminal operations: case of vessels’ arrival times prediction. IFAC-
PapersOnLine 55(10), 3172–3177 (2022)
8. El Mekkaoui, S., Benabbou, L., Caron, S., Berrado, A.: Deep learning-based ship
speed prediction for intelligent maritime traffic management. J. Mar. Sci. Eng.
11(1), 191 (2023). https://doi.org/10.3390/jmse1101019
9. Eurostat: eurostat Statistics explained. https://ec.europa.eu/eurostat/statistics-ex
plained/index.php?title=Inland waterway transport statistics#Eurostat#Statisti
csExplained
10. Fancello, G., Pani, C., Pisano, M., Serra, P., Zuddas, P., Fadda, P.: Prediction
of arrival times and human resources allocation for container terminal. Maritime
Economics & Logistics 13(2), 142–173 (2011). https://doi.org/10.1057/mel.2011.3
11. Huang, C., Huang, Y., Yu, Y., Xiao, B.: Predicting liner arrival time based on deep
learning. In: 2021 IEEE 3rd International Conference on Civil Aviation Safety and
Information Technology (ICCASIT), pp. 1127–1131. IEEE (2021)
12. Jing, M., Zheng, W.: Economic development advantages of low-carbon economy
under waterway transportation. IOP Conf. Ser. Mater. Sci. Eng. 780(6), 062029
(2020). https://doi.org/10.1088/1757-899X/780/6/062029
232 P. Wenzel et al.
13. Jonkeren, O., Rietveld, P., van Ommeren, J., te Linde, A.: Climate change and
economic consequences for inland waterway transport in Europe. Reg. Environ.
Change 14(3), 953–965 (2013). https://doi.org/10.1007/s10113-013-0441-7
14. Noman, A., Heuermann, A., Wiesner, S., Thoben, K.D.: Towards data-driven GRU
based ETA prediction approach for vessels on both inland natural and artificial
waterways. vol. 2021, pp. 2286–2291 (2021). https://doi.org/10.1109/ITSC48978.
2021.956488
15. Ogura, T., Inoue, T., Uchihira, N.: Prediction of arrival time of vessels considering
future weather conditions. Appl. Sci. 11(10), 4410 (2021). https://doi.org/10.3390/
app1110441
16. Pani, C., Fadda, P., Fancello, G., Frigau, L., Mola, F.: A data mining approach
to forecast late arrivals in a transhipment container terminal. Transport 29(2),
175–184 (2014). https://doi.org/10.3846/16484142.2014.930714
17. Park, K., Sim, S., Bae, H.: Vessel estimated time of arrival prediction system based
on a path-finding algorithm. Marit. Transp. Res. 2, 100012 (2021). https://doi.org/
10.1016/j.martra.2021.100012
18. Valero, C.I., Martı́nez, n., Oltra-Badenes, R., Gil, H., Boronat, F., Palau, C.E.:
Prediction of the estimated time of arrival of container ships on short-sea shipping:
a pragmatical analysis. IEEE Latin Am. Trans. 20(11), 2354–2362 (2022)
19. Veenstra, A., Harmelink, R.: On the quality of ship arrival predictions. Marit. Econ.
Logistics 23(4), 655–673 (2021). https://doi.org/10.1057/s41278-021-00187-6
20. Wehrle, R., Wiens, M., Schultmann, F.: A framework to evaluate systemic risks
of inland waterway infrastructure. Prog. Disaster Sci. 16, 100258 (2022). https://
doi.org/10.1016/j.pdisas.2022.100258
21. Wiegmans, B., Konings, R.: Intermodal Inland waterway transport: modelling con-
ditions influencing its cost competitiveness. Asian J. Shipping Logistics 31(2), 273–
294 (2015). https://doi.org/10.1016/j.ajsl.2015.06.006
22. Xie, Z., Liu, Q.: LSTM networks for vessel traffic flow prediction in inland water-
way. In: 2018 IEEE International Conference on Big Data and Smart Computing
(BigComp), pp. 418–425 (2018). https://doi.org/10.1109/BigComp.2018.00068
23. Yu, J., et al.: Ship arrival prediction and its value on daily container terminal oper-
ation. Ocean Eng. 157, 73–86 (2018). https://doi.org/10.1016/j.oceaneng.2018.03.
038
24. Yu, J.Y., Sghaier, M.O., Grabowiecka, Z.: Deep learning approaches for AIS data
association in the context of maritime domain awareness. In: 2020 IEEE 23rd Inter-
national Conference on Information Fusion (FUSION), pp. 1–8 (2020). https://doi.
org/10.23919/FUSION45008.2020.9190283
25. Zhong, C., Jiang, Z., Chu, X., Liu, L.: Inland ship trajectory restoration by recur-
rent neural network. J. Navig. 72(6), 1359–1377 (2019). https://doi.org/10.1017/
S0373463319000316
Vehicle Routing
A Regret Policy for the Dynamic Vehicle
Routing Problem with Time Windows
Peter Dieter(B)
Abstract. In this work, we present a regret policy for the dynamic vehi-
cle routing problem with time windows (DVRPTW) in which customer
order arrivals are revealed over a day and which has been the considered
problem in the EURO Meets NeurIPS 2022 Vehicle Routing Competi-
tion. The problem requires two types of decisions: Dispatching orders
and planning the respective vehicle routes. The objective is to minimize
the travel time while dispatching all orders over the day and adhering to
the respective time windows. The regret policy we present in this paper
focuses on the former decision, namely the dispatching of orders, as the
routing can then be performed by a regular VRPTW solver. The basic
idea of the policy is to calculate a regret value for each order which rep-
resents possible cost savings that we would miss when dispatching the
order immediately. To attain this regret value, we make use of the cus-
tomer distribution from which orders are sampled. If the value is below
a predefined threshold, the order is dispatched and routes are planned
with a state-of-the-art VRPTW solver. Otherwise, the order needs to
be dispatched at a later stage. The proposed policy outperforms several
benchmark policies and ranked 6th place among approximately 50 teams.
Furthermore, the proposed regret policy has a high level of explainability,
is simple to implement, and can be generalized to other problems in the
domain of dynamic vehicle routing.
1 Introduction
efficient planning is difficult since customer orders arrive in the course of a day
and before some decisions are already made and implemented. Next to plan-
ning vehicle tours, it is important to decide which customer orders to dispatch,
especially when orders have time windows. Too early dispatching might lead to
inefficiencies since it limits the possibility of consolidation. Too-late dispatch-
ing on the other hand might lead to limited flexibility in planning tours since
time-windows need to be adhered to. This paper describes a regret policy for
the dynamic vehicle routing problem with time windows (DVRPTW), which
has been the posed problem in the EURO Meets NeurIPS 2022 Vehicle Routing
Competition [9]. In the considered problem, orders are sampled from a known
customer distribution over a day. Once an hour, it needs to be decided which
orders need to be dispatched and which ones to postpone to consolidate with
future requests that may arrive later. After it has been decided which orders
are dispatched, feasible routes for these orders need to be created. Customer
time windows are hard and an unlimited vehicle fleet is assumed. The overall
goal is to minimize the time traveled during all decision epochs. The policy pre-
sented in this paper only focuses on the dispatching decision, as the routing task
is then a standard VRPTW, for which there exists an abundance of literature
[1]. This is done by calculating a regret value for each undue order, i.e., orders
which can be postponed, considering the total set of customers from which orders
are randomly sampled. Furthermore, we describe methods to extend the regret
value with information on time windows and other undue orders. The result-
ing VRPTW instance is then solved by a state-of-the-art hybrid genetic search
(HGS) algorithm developed by [10], which is an extension of the HGS solver
which has been proven successful for other VRP variants [16,17]. We show that
the proposed policy outperforms two benchmark policies. Due to its high level of
explainability, easiness of implementation, and generalizability to other problem
settings, we believe that the proposed policy can serve as a benchmark policy for
other studies on dynamic vehicle routing, in which anticipating future customer
orders is crucial. Furthermore, several possible policy extensions exist and could
be considered in future work. The further structure of the paper is as follows:
Sect. 2 includes related work. The problem is described in Sect. 3. In Sect. 4, we
present the developed regret policy. In Sect. 5, we evaluate the policy by com-
paring it to the two benchmark policies. The paper concludes with a summary
and an outlook in Sect. 6.
2 Related Work
The body of literature on dynamic vehicle routing is vast and covers differ-
ent problem settings as well as various algorithmic contributions. For a general
overview of stochastic dynamic vehicle routing, we refer to [13] and [5]. In this
section, we first examine related work on the concept of regret in dynamic vehi-
cle routing. Second, we present the winning approaches of the EURO Meets
NeurIPS 2022 Vehicle Routing Competition challenge.
A Regret Policy for the DVRPTW 237
To the best of our knowledge, the study of [3] is the first one on dynamic
vehicle routing that used the concept of regret. The authors consider a prob-
lem with the objective to maximize the number of orders served. The problem,
therefore, involves deciding which orders to serve and which ones not to serve,
considering time windows and a limited vehicle fleet. The authors suggest a
multi-scenario approach in which scenarios are created by sampling potential
future orders from a known customer distribution. A regret value for each order
already in the system o is then calculated which is defined as the difference in
objective value between the optimal solution considering all orders (i.e., already
in the system or drawn in the scenario) and the best solution serving request o.
This idea is further developed in [14].
[6] apply a multi-scenario approach for a problem in which the main objective
is to maximize the number of orders served and is more similar to the problem
considered in this paper, as orders can be postponed to later epochs. Rather than
solving a problem for each scenario once and then choosing one of the solutions,
they suggest a branch-and-regret heuristic which identifies structural decisions
(e.g., postponing an order) for each scenario and branches on them until all sce-
nario problems have solutions where all known orders are served according to
the same plan. The concept of regret represents the possible improvement that
might be gained from postponing an order to be dispatched in a later decision
epoch. This is similar to our proposed regret policy. However, our suggested
approach does not require sampling orders to create scenarios but directly uses
the known customer distribution, which allows for faster computations. Further-
more, in contrast to [6], we consider a problem in which all orders need to be
served and the total travel time over all decision epochs must be minimized.
The two most successful approaches to the challenge both applied machine
learning to the problem. In the winning approach, [7] reformulated the problem
into a prize-collecting VRPTW, i.e., a prize is assigned to each order during an
epoch and this prize is collected if an order is visited in the respective epoch.
The authors then apply imitation learning, by learning the prizes for each order
based on solutions with complete information. Instead of learning from solutions
with full information, [11] apply a reinforcement learning approach that learns
how to modify the costs of traversing arcs to an order in a respective epoch.
While the two approaches outperformed our proposed policy (by 4.95% and
2.4%, respectively), the regret policy is easier to implement and offers a higher
degree of explainability.
3 Problem Description
The problem at hand can be defined as a multi-stage VRPTW. We consider a
discrete-time horizon H = {1, ..., n} such that each decision occurs at a discrete
time (decision epoch) h ∈ H. Further, let Gh = (Vh , Ah ) be a complete graph,
where Vh = {0, Nh } is the set of nodes, Nh is a set of order nodes in epoch h
and 0 identifies the depot. The set of arcs is denoted by Ah = {(i, j) : i, j ∈ Vh }.
The time of traversing arc (i, j) is given by cij . Each order i possesses a time-
window {ei , li } where ei denotes the earliest arrival time and li the latest arrival
238 P. Dieter
time. Moreover, service times and demands for order i are given by si and qid ,
respectively. A vehicle from an unlimited fleet is denoted by k ∈ Kh , where Kh
is a set of vehicles in epoch h. The fleet is homogeneous and a vehicle possesses
a capacity of q v . In each decision stage h, a set of new orders Oh arrives. Orders
which arrived before epoch h but have not been dispatched yet are denoted by
Fh and the set of all available orders in epoch h is then given by Nh = Oh ∪ Fh .
Out of these orders, some might be due as dispatching them in a later epoch,
would lead to non-adherence to its time window. We define the set of these due
orders as Dh ⊆ Nh . In the last decision epoch n, all orders are due, so Nn = Dn .
The decision variable xijk denotes if arc (i, j) : i, j ∈ Vh is traversed by vehicle
k and tik denotes the arrival time of vehicle k at order i. A decision vector in
epoch h defined as xh = {xijk : i, j ∈ Vh ; k ∈ Kh }. As xh indicates which orders
are dispatched in h and which are not, the set of not dispatched orders at epoch
h + 1, Fh+1 , is dependent on xh . For example, if we would dispatch all orders in
epoch h, Fh+1 would be an empty set. The problem in epoch h can be formulated
as follows:
minimize cij · xijk + E[Q(xh )] (1)
i∈Vh j∈Vh k∈K
subject to
xjik = 1 ∀i ∈ Dh (2)
j∈Vh k∈Kh
j=i
xijk − xjik = 0 ∀k ∈ Kh ; j ∈ Vh (3)
i∈Vh i∈Vh
qid · xjik ≤ q v ∀k ∈ Kh (4)
j∈Vh i∈Nh
Decision Process (MDP). In such a formulation, a state would contain all orders
revealed but not yet dispatched and the action space would contain all feasible
vehicle tours. A transition between two decision epochs would occur by removing
all dispatched orders from the state and adding all orders that newly entered
the system. The objective is to find a policy that minimizes the expected cost
of serving all orders. For a detailed problem formulation as an MDP, we refer
to [2].
4 Regret Policy
In this section, we first describe the general algorithmic procedure of the regret
policy. We then describe the regret function and explain two extensions of the
basic regret function, namely the incorporation of information about time win-
dows and undue orders. An open-source Python implementation of the policy is
available on GitHub1 .
The motivation behind the proposed regret function is the following: If an undue
order u ∈ Uh is dispatched in epoch h, we can expect that u will be served con-
secutively before or after its closest order d ∈ Dh which is marked as dispatched.
Closest is here defined by the lowest average arc costs from u to d and vice versa.
Let cud be the time (costs) needed to travel from the customer of order u to the
customer of order d. We then approximate that dispatching order u will result
1
https://github.com/PeterDieter/RegretPolicy DVRPTW.
240 P. Dieter
where square brackets are Iverson brackets, i.e., its inner value is 1 if the condi-
tion ( cui +c
2
iu
< cud +c
2
du
) holds and 0 otherwise.
To provide further explanation on the regret function, it is visualized in Fig. 1,
where the regret value for order i is determined. There are two orders which are
marked as due. The distance cid is an approximation of the additional routing
costs that would occur when order i is dispatched together with the two due
A Regret Policy for the DVRPTW 241
orders. However, orders from two closer customers (1 and 2) might be drawn in
the future and 99 orders are expected to be drawn until order i is due. If the
calculated regret value of 1.2 is below the threshold, the order will be dispatched.
Parameter Values
t {0.04, 0.05, 0.06, 0.07, 0.08, 0.09, 0.10}
h1 {0.005, 0.01, 0.02, 0.03, 0.04}
h2 {0.01, 0.004, 0.002, 0.001, 0.0005}
information on the systems’ environment). The time limit for the HGS solver
is set to 10 s for all policies (regret and benchmark policies). Experiments are
performed on an Intel Core i7 with 2.6GHz and 12 GB RAM.
5.2 Results
The results are summarized in Table 2, which shows the improvements of the
policy in the row, compared to the policy in the column. Additionally, the per-
centage of instances in which one policy led to better results is shown in brackets.
Detailed results over all tested instances can be found in the Appendix (Table 4).
On average, the proposed policy leads to improvements of around 8.45% com-
pared to the greedy benchmark and 40.88% compared to the lazy benchmark.
Further, the regret policy always leads to better results in all cases compared
to the greedy benchmark and in 47 out of 50 instances (94%) compared to the
greedy policy. Moreover, we can deduce that a greedy policy performs better
than a lazy policy (improvement of 35.14%).
5.3 Discussion
Firstly, we can see from the results that the regret policy outperforms the two
benchmarks to a large extent. Further, we can see from the results that a greedy
policy performs better than a lazy policy. Therefore, it might be better to attain
some slack in the routing decision compared to trying to maximize the con-
solidation of orders by dispatching them as late as possible. The regret policy
seems to achieve a better balance between those two extremes. In fact, our pro-
posed method can be seen as a combination of these two benchmark policies.
If the threshold value is 0, only due orders will be dispatched (lazy policy), if
the threshold value is infinitely big, all orders will be dispatched (greedy policy).
This perspective contributes to the explainability of the proposed method since
the greediness/laziness of our method is determined by the threshold value and
can be tuned accordingly. In terms of ease of implementation, the regret policy
remains fairly simple compared to the two benchmarks, since standard opera-
tions can be used to calculate the regret. However, this simplicity also comes
with a price, since it is difficult to capture all relevant information in a (simple)
function. In the following Section (6), we mention how the regret function could
be extended and further propose hybridizations of our proposed method with
other methods.
244 P. Dieter
6 Conclusion
In this paper, we present a regret policy for the dynamic vehicle routing problem
with time windows (DVRPTW). In the studied problem, orders are sampled from
a known customer distribution over a day. Once an hour, it needs to be decided
which orders need to be dispatched and which ones to postpone. After it has been
decided which orders are dispatched, feasible routes for these orders need to be
created. The policy presented in this paper considers the first decision stage, i.e.,
determining which orders to dispatch, by anticipating orders that might arrive
in the future.
We first introduce the general procedure of the policy and the basic regret
function, which is applied to determine a regret value for each undue order, i.e.,
orders that can be postponed. This regret value represents possible improvements
that might be gained by postponing the order and consolidating it with future
orders. If the regret value is below a certain threshold, the order is dispatched.
Furthermore, we present the methods applied to incorporate information about
other undue orders and time windows in the regret function. The developed pol-
icy consistently outperforms benchmark policies such as a greedy dispatching
rule which dispatches all orders as soon as possible or a lazy dispatching rule
which dispatches only orders which are due. Due to its high level of explainabil-
ity, ease of implementation, and generalizability to other variants of dynamic
vehicle routing, the proposed regret policy can serve as a benchmark policy for
other work on anticipatory vehicle routing. Moreover, multiple extensions of the
proposed policy could be investigated. Currently, only the closest due order is
regarded to determine the regret value. Future work could look for ways to incor-
porate the entire set of due (or already dispatched) orders to determine regret
values. Furthermore, it could be tried to better account for time windows, e.g., by
immediately constructing an initial route plan in the proposed policy. Another
possible extension is the development of dynamic thresholds. Even though the
time until an order is due is implicitly accounted for in the regret policy, dynamic
thresholds could further improve the policy. For example, it might be beneficial
to release orders more easily when the number of epochs the orders are in the
system is low, as this results in wider time windows for the routing decision,
and consequently, fewer vehicles could be used. Methods such as Value Func-
tion Approximation (VFA) could be used to determine these state-dependent
thresholds. Another possible hybridization of methods would be to reformulate
the problem as a price collection VRPTW and use the proposed regret function
to determine prizes for undue orders.
A Regret Policy for the DVRPTW 245
Appendix
Table 3. Sets, Parameters and Decision Variables
Notation Definition
Sets
H Set of decision epochs, H := {1, ..., n}
Nh Set of all known orders in epoch h
Oh Set of orders that arrived in epoch h
Fh Set of orders that arrived before epoch h but have not been dispatched yet
Dh Set of all orders that are due in epoch h, Dh ⊆ Nh
Vh Set of all nodes in epoch h, where 0 is the Depot, Vh := {0} ∪ Nh
Kh Set of vehicles in epoch h
Parameters
cij Travel time from node i ∈ Vh to node j ∈ Vh
qv Vehicle capacity (same for all vehicles)
qid Demand of order i ∈ Nh
si Service time for serving order i ∈ Nh
[ei , li ] Time window of order i ∈ Nh
Decision Variables
xijk = 1 if vehicle k ∈ Kh travels from node i ∈ Vh to node j ∈ Vh
(0 otherwise)
tik arrival time of vehicle k ∈ Kh at node i ∈ Vh
Table 4. (continued)
References
1. Baldacci, R., Mingozzi, A., Roberti, R.: Recent exact algorithms for solving the
vehicle routing problem under capacity and time window constraints. Eur. J. Oper.
Res. 218(1), 1–6 (2012). https://doi.org/10.1016/j.ejor.2011.07.037, https://www.
sciencedirect.com/science/article/pii/S0377221711006692
2. Baty, L., Jungel, K., Klein, P.S., Parmentier, A., Schiffer, M.: Combinatorial opti-
mization enriched machine learning to solve the dynamic vehicle routing problem
with time windows (2023)
3. Bent, R., Van Hentenryck, P.: Regrets only! Online stochastic optimization under
time constraints. In: Proceedings of the 19th National Conference on Artifical
Intelligence, pp. 501–506. AAAI’04, AAAI Press (2004)
4. Ecker, T., Hans, M., Neuhaus, F., Spielvogel, J.: Same-day delivery: ready for
takeoff, January 2020. https://www.mckinsey.com/industries/retail/our-insights/
same-day-delivery-ready-for-takeoff
5. Hildebrandt, F.D., Thomas, B.W., Ulmer, M.W.: Opportunities for reinforcement
learning in stochastic dynamic vehicle routing. Comput. Oper. Res. 150, 106071
(2023). https://doi.org/10.1016/j.cor.2022.106071
6. Hvattum, L.M., Løkketangen, A., Laporte, G.: A branch-and-regret heuristic for
stochastic and dynamic vehicle routing problems. Networks 49(4), 330–340 (2007).
https://doi.org/10.1002/net.20182
7. Jungel, K., Baty, L., Klein, P.: Structured Learning for the EURO Meets NeurIPS
2022 Vehicle Routing Competition. Technical report (2022)
8. Klapp, M.A., Erera, A.L., Toriello, A.: The dynamic dispatch waves problem for
same-day delivery. Eur. J. Oper. Res. 271(2), 519–534 (2018). https://doi.org/10.
1016/j.ejor.2018.05.032
9. Kool, W., et al.: EURO Meets NeurIPS 2022 Vehicle Routing Competi-
tion (2022). https://euro-neurips-vrp-2022.challenges.ortec.com/assets/pdf/euro-
neurips-vrp-2022-rules-2-aug.pdf. Accessed 15 Nov 2022
10. Kool, W., et al.: Hybrid Genetic Search for the Vehicle Routing Problem with Time
Windows: a High-Performance Implementation. Technical report, 12th DIMACS
Challenge, April 2022
11. Kwon, Y.D., Choo, J., Bae, D., Kim, J., Hottung, A.: Cost Shaping via Reinforce-
ment Learning for Vehicle Routing Problems. Technical report (2022)
12. Ulmer, M.W., Mattfeld, D.C., Köster, F.: Budgeting time for dynamic vehicle rout-
ing with stochastic customer requests. Transp. Sci. 52(1), 20–37 (2018). https://
doi.org/10.1287/trsc.2016.0719
13. Approximate Dynamic Programming for Dynamic Vehicle Routing. ORSIS, vol.
61. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-55511-9
14. Van Hentenryck, P., Bent, R., Upfal, E.: Online stochastic optimization under
time constraints. Ann. Oper. Res. 177, 151–183 (2010). https://doi.org/10.1007/
s10479-009-0605-5
15. Vidal, T.: Hybrid genetic search for the CVRP: open-source implementation and
swap* neighborhood. Comput. Oper. Res. 140, 105643 (2022). https://doi.org/10.
1016/j.cor.2021.105643
16. Vidal, T., Crainic, T.G., Gendreau, M., Lahrichi, N., Rei, W.: A hybrid genetic
algorithm for multidepot and periodic vehicle routing problems. Oper. Res. 60(3),
611–624 (2012). https://doi.org/10.1287/opre.1120.1048
17. Vidal, T., Crainic, T.G., Gendreau, M., Prins, C.: A unified solution framework
for multi-attribute vehicle routing problems. Eur. J. Oper. Res. 234(3), 658–673
(2014). https://doi.org/10.1016/j.ejor.2013.09.045
A Tabu Search Algorithm for the Traveling
Purchaser Problem with Transportation Time
Limit
{pieter.vansteenwegen,dirk.cattrysse}@kuleuven.be
Abstract. This study extends the well-known traveling purchaser problem (TPP)
by considering a transportation time limit of perishable food in cold-chain logistics.
The problem is called the traveling purchaser problem with transportation time
limit (TPP-TTL). The objective of the TPP-TTL is to find a route and procurement
plan for the purchaser to satisfy the demand of a number of product types with
minimum cost. To satisfy the product demand, the purchaser visits a number
of capacitated markets, in which the available amount of products is limited.
Furthermore, since the travel times cause deterioration on the perishable products,
a transportation time limit is taken into account in the TPP-TTL for each product
type. The problem is formulated as a mixed-integer programming model and
solved by using a tabu search (TS) algorithm. In the computational experiments, TS
is carried out for a number of different-sized instances and the results are compared
to the results obtained by GUROBI solver to determine the performance of the
proposed algorithm. The results of the experiments show that the TS is capable to
find many optimal results with less computational time than the GUROBI solver.
1 Introduction
The traveling purchaser problem (TPP) is a well-known combinatorial optimization
problem that was first introduced by Ramesh [17] in 1981. The TPP is a variant of the
traveling salesperson problem (TSP), in which a set of product demands is satisfied from
a number of markets. Distinct from the TSP, the TPP takes into account the decisions
about the selection of the markets to be visited, the number of products purchased from
each market, and the route of the purchaser. The aim of the problem is to satisfy product
demand with minimum total traveling and procurement costs [13].
The original version of the TPP has been extended in many different ways in the
literature. Boctor et al. [5] introduced the uncapacitated TPP, in which the amount of a
product is sufficient to satisfy demand if it is available in a market. Almeida et al. [1]
methodology, which employs an advanced local search procedure to explore the solution
space.
The remainder of the paper is organized as follows. Section 2 introduces the details
of the TPP-TTL and its mathematical formulation. The proposed TS is presented in
Sect. 3. The computational results and performance analysis of the TS are given in
Sect. 4. Finally, the conclusions are given in Sect. 5.
Model
Min cij xij + pik zik (1)
i∈V j∈V k∈K i∈Mk
S.t.
zik = dk k∈K (2)
i∈Mk
ri + si + tij ≤ rj + γ 1 − xij i ∈ M , j ∈ V, i = j (11)
ri + si + tij ≥ rj − γ 1 − xij i ∈ M , j ∈ V, i = j (12)
si = hi zik i∈M (13)
k∈K
zik ≥ 0 k ∈ K, i ∈ Mk (18)
ri ≥ 0 i∈V (19)
si ≥ 0 i∈M (20)
Equation (1) in the proposed model describes the objective function to minimize the
total transportation and procurement cost of the purchaser. Constraints (2) provide that
each product demand is satisfied from the markets. Constraints (3) and (4) ensure that
the purchased amount of a product type at a specific market cannot exceed the available
stock. Constraints (5) and (6) guarantee that the purchaser enters and exits from a market
if a procurement is made from the related market. Constraints (7) and (8) set the initial
and last node of the route as the depot node. Constraints (9)–(12) determine the arrival
time of the purchaser to the visited nodes considering the traveling time between the
locations and service time at markets. These constraints also eliminate the sub-tours by
tracking the arrival time between the visited nodes. Constraints (13) compute the service
time at the visited markets. Constraints (14) guarantee that the travel time and service
time of each product in the vehicle cannot exceed the corresponding time limit. Finally,
constraints (15)–(20) identify the domain of the decision variables.
3 Solution Methodology
Tabu search, which was first introduced by Glover in 1986, is a well-known meta-
heuristic to solve combinatorial optimization problems [9, 10]. The main feature of TS
is the memory mechanism that prevents the search from cycling back to previously
found solutions. The memory structure of TS, called tabu list, guides the search of the
algorithm. The tabu list allows to find unexplored solutions by prohibiting the usage of
recently utilized moves in the tabu list. With the help of this memory structure, TS has
proved to be effective to solve various routing problems [8, 16, 18]. Moreover, TS is
used in [20] to solve TPP, in which two dynamic tabu search strategies are operated.
In this context, TS is employed in this study to find effective results for the TPP-TTL.
Figure 1 presents the pseudo code of the proposed algorithm. The details of the main TS
procedures are described in the following sub-sections.
case the demand is not completely met from the visited markets, an unvisited market with
minimum cost for the selected product is inserted into the first position in the route. This
product selection and adding procedure repeats until all demand is satisfied. Although
the capacity restriction of the markets can be completely satisfied by this procedure, the
transportation time limit of any product may be violated while inserting another product
into the solution. In such a case, a feasible solution is provided by randomly selecting a
product that violates the time restriction and purchasing it from a proper market. Here,
in addition to the visited markets, unvisited markets are also taken into account. Then,
a market is randomly selected from the considered list of markets. If a proper market is
not found, the initial solution generation procedure is restarted in the algorithm. Since
the markets are randomly selected in this procedure, different results can be obtained at
each restart. In this way, at the end of the procedure, a feasible initial solution (X 0 ) is
generated to be improved in the main loop of the TS.
In each iteration of the TS, the algorithm employs two independent local search proce-
dures to improve the existing solutionX. In the first local search procedure, a number
of neighborhoods are generated by applying a procurement-based move operator (Lines
5–7 in Fig. 1). The procurement-based move operators are used to improve the existing
solution by making changes in the procurement plan. To change the procurement plan,
three moves are considered: unvisited market insertion, visited market removal, and
product exchange between the visited markets. For each application of the procurement-
based search, one of the move operators is randomly selected. Based on the selected
operator, all feasible neighbor solutions are generated. Regarding the tabu list criteria of
the
TS, the best alternative
solution X ∗ is selected from a set of candidate neighborhoods
( X 1 , X 2 , X 3 , . . . ).
254 I. Kucukoglu et al.
In the second local search procedure, a set of candidate solutions ( X 1 , X 2 , X 3 , . . . )
are generated based on X ∗ by using a route-change-based move operator (Lines 8–
10 in Fig. 1). Similar to the procurement-based search approach, four different move
operators are used to generate neighborhoods. Four move operators are considered:
market exchange, a market movement to a further position, a market movement to a
backward position, and a sub-path reverse in the route. After all feasible candidate
solutions are generated, the best solution X ∗ , which is not tabu or leads to a new best
solution, is selected to be assigned as the current solution X.
4 Computational Results
In the computational studies, first, the difference between the TPP and TPP-TTL based
on the total cost is analyzed by using the GUROBI solver. Following this analysis, the
performance of the proposed TS is demonstrated by comparing the proposed algorithm
with the GUROBI solver. For these studies, a well-known capacitated TPP benchmark
problem set introduced by Laporte et al. [14] is used. The TPP instances are adapted to
the TPP-TTL by randomly assigning a transportation time limit for each product type.
To generate a random value for the transportation time limits, first, a reference duration
time (RDT ) for the problem is calculated regarding the transportation and service time
of the purchaser. The transportation time of the purchaser is calculated by using the
nearest neighborhood algorithm ignoring the transportation time limit of the products.
On the other hand, the service time is determined regarding the lowest service time for
each product type. After the RDT is obtained for the problem, a transportation time limit
is randomly specified as tlk = 0.50 × RDT + rnd () × RDT where rnd () is a uniformly
A Tabu Search Algorithm for the Traveling Purchaser Problem 255
distributed random number in the interval [0, 1]. In case the generated time limits leads
an infeasible route for the purchaser, the procedure is restarted. Based on this procedure,
27 different-sized TPP-TTL instances are generated where the problem size is specified
by the pair of two problem data: {|V |, |K|} where the value of each parameter is specified
as 10, 20, or 30 for TPP-TTL instance generation.
The proposed TS is implemented for each TPP-TTL instance by setting the Iter Max =
1000 and L = 50. With these parameter settings, ten independent runs are carried out
for each problem. The results of the TS are investigated through the total cost (f ) and
solution time (t) for both the best solution obtained in the runs and the average of the
runs. Here, it should be noted that the f and t are used to represent the GUROBI results
to indicate optimum solution and solution time, respectively. Each TS and GUROBI
computation is carried out on a workstation with Intel® Xeon® CPU E5-2643 v3 CPU
and 64 GB of memory.
To indicate the difference between the TPP and TPP-TTL results based on the total
cost, the model formulation proposed for the TPP-TLL is simply adapted to the TPP by
ignoring the constraints (9)–(14). However, the Miller-Tucker-Zemlin sub-tour elimina-
tion constraints [15] are included in the model to eliminate sub-tours. To find the optimal
solution for TPP and TPP-TTL each instance, the GUROBI is executed without setting
any time limit. Table 1 shows the results of the GUROBI for both the TPP and TPP-TTL
instances, where the percentage gap between the two solutions is computed by using
Eq. (21), where fTPP and fTPP−TTL are the optimum solutions obtained for the TPP and
TPP-TTL instances, respectively.
fTPP−TTL − fTPP
Gap1 % = × 100% (21)
fTPP
The Gap1 % values presented in Table 1 show that up to 15.02% increase on total
cost is observed due to the transportation time limits of the products. Particularly, the gap
between the solution is high when the number of product types is high (i.e., {|V |, |K|} =
{(10, 30), (20, 30), (30, 30)}). On the other hand, the same results are obtained for four
instances. In addition to the comparison based on the total cost, it should be seen from
Table 1 that the solution time of GUROBI for the TPP-TLL is quite high according to the
TPP model computation. Therefore, it should be stated that finding an optimal solution
for the TPP-TTL requires more computational effort with respect to the TPP.
256 I. Kucukoglu et al.
In addition to the comparison of TPP and TPP-TTL, the second part of the compu-
tational experiments is carried out to demonstrate the efficiency of the proposed TS. In
this context, the TS results are compared to the optimal TPP-TTL solutions obtained by
GUROBI. The results of GUROBI and TS, and the comparison are given in Table 2, in
which Gap2 % indicates the percentage gap between the best/average result of the TS
A Tabu Search Algorithm for the Traveling Purchaser Problem 257
and GUROBI result. Gap2 % is calculated by using Eq. (22) where fTS is the best/average
result of the TS and fGUROBI is the GUROBI result.
Problem # GUROBI TS
Best Solution Average Solution
f t f Gap2 % t f Gap2 % t
1 2700 6.41 2700 0.00 0.45 2700.0 0.00 0.53
2 2710 4.57 2710 0.00 0.94 2710.0 0.00 1.12
3 3393 5.60 3393 0.00 1.04 3393.0 0.00 1.32
4 3486 20.79 3486 0.00 0.79 3486.0 0.00 0.72
5 3638 11.43 3638 0.00 0.99 3638.0 0.00 0.92
6 3882 14.74 3882 0.00 0.73 3887.8 0.15 0.86
7 4181 95.95 4181 0.00 0.85 4181.0 0.00 0.93
8 4303 81.55 4303 0.00 1.06 4303.0 0.00 1.15
9 4050 97.71 4050 0.00 1.11 4052.6 0.06 1.09
10 3048 138.24 3048 0.00 1.15 3048.0 0.00 1.12
11 3530 180.33 3530 0.00 1.23 3530.0 0.00 1.29
12 3991 157.71 3991 0.00 1.32 3991.0 0.00 1.31
13 4344 470.27 4344 0.00 1.22 4348.9 0.11 1.19
14 4245 444.65 4245 0.00 1.41 4245.0 0.00 1.38
15 3760 350.35 3760 0.00 1.38 3771.3 0.30 1.41
16 4274 678.94 4274 0.00 1.44 4276.5 0.06 1.35
17 6855 340.95 6855 0.00 1.51 6855.0 0.00 1.49
18 5597 906.16 5597 0.00 1.47 5601.2 0.08 1.53
19 3467 451.66 3467 0.00 1.62 3467.0 0.00 1.55
20 3684 440.06 3684 0.00 1.58 3684.0 0.00 1.49
21 4871 545.06 4871 0.00 1.69 4872.1 0.02 1.71
22 4788 881.62 4788 0.00 1.55 4795.6 0.16 1.43
23 7063 956.97 7063 0.00 1.78 7077.9 0.21 1.74
24 5867 1118.01 5867 0.00 1.71 5869.7 0.05 1.73
25 5250 955.05 5250 0.00 1.83 5275.3 0.48 1.79
26 6572 884.27 6572 0.00 1.81 6593.3 0.32 1.83
27 7507 858.87 7507 0.00 1.93 7542.1 0.47 1.88
Average 4484 411.03 4484 0.00 1.32 4488.7 0.09 1.33
258 I. Kucukoglu et al.
fTS − fGUROBI
Gap2 % = × 100% (22)
fGUROBI
Based on the best-found result of the TS given in Table 2, the optimal solution is
provided for each instance by the proposed algorithm. Furthermore, the average result
of the TS is the same with the optimal solution for 14 of 27 instances. For the remaining
instances, the highest percentage gap is 0.48%. For the overall performance of the TS, the
average percentage gap of the proposed algorithm is 0.09% with respect to the optimal
solution. Furthermore, the average computational time of the TS is 1.33 s, while the
average computational time of the GUROBI is 411.03 s. Regarding the solution quality
and computational effort of the proposed algorithm, it can be stated that the TS is capable
to find the optimal or near-optimal solution for the TPP-TTL in much less computational
time.
5 Conclusions
In this study, the traveling purchaser problem with transportation time limit, which is an
extension of the well-known traveling purchaser problem, is proposed by considering
the transportation restrictions of perishable food in cold-chain logistics. The TPP-TTL
extends the capacitated version of the TPP by including a transportation time limit for
each product type in the vehicle. In this context, any product cannot be kept in the vehicle
for more than a specified time limit. Regarding both the market capacity and product
transportation restrictions, the TPP-TTL is formulated as a mixed-integer programming
model, in which the objective is to minimize the total transportation and procurement
cost of the purchaser. To solve the problem, a tabu search algorithm is introduced. The
proposed algorithm employs a number of procurement-based and route-change-based
local search moves to generate neighborhoods in each iteration. The performance of the
proposed algorithm is tested on different-sized TPP-TTL problems generated by using
a well-known TPP benchmark problem set. The TS is compared to the GUROBI solver
to identify the solution quality of the proposed algorithm. Results of the computations
show that the proposed TS outperforms the GUROBI solver by finding optimal results
in much less computational time.
For future research, this study can be extended with considering different real-life
restrictions of cold chain logistics. First, the deterioration cost of perishable products due
to the traveling in a vehicle can be integrated. On the other hand, the procurement plan can
consider multiple trips by the purchaser, where each trip can be done with a different type
of vehicle (i.e., temperature controlled vehicle, traditional vehicle). With this assumption,
the quality level of some products that are more sensitive to temperature change may be
transported with a temperature-controlled vehicles with additional cooling cost. Another
practical assumption for the TPP-TTL may be the proportional transportation time limit
for the product at different markets. This assumption may be valid when a product
is produced in different time periods in different markets. Considering the quality of
a product at each market depends on the production date, the purchaser may need to
include this assumption for its procurement and route plan.
A Tabu Search Algorithm for the Traveling Purchaser Problem 259
References
1. Almeida, C.P., Gonçalves, R.A., Goldbarg, E.F., Goldbarg, M.C., Delgado, M.R.: An exper-
imental analysis of evolutionary heuristics for the biobjective traveling purchaser problem.
Ann. Oper. Res. 199(1), 305–341 (2012)
2. Batista-Galvan, M., Riera-Ledesma, J., Salazar-Gonzalez, J.J.: The traveling purchaser prob-
lem, with multiple stacks and deliveries: a branch-and-cut approach. Comput. Oper. Res. 40,
2103–2115 (2013)
3. Bianchessi, N., Irnich, S., Tilk, C.: A branch-price-and-cut algorithm for the capacitated
multiple vehicle traveling purchaser problem with unitary demand. Discret. Appl. Math. 288,
152–170 (2021)
4. Bianchessi, N., Mansini, R., Speranza, M.G.: The distance constrained multiple vehicle
traveling purchaser problem. Eur. J. Oper. Res. 235, 73–87 (2014)
5. Boctor, F.F., Laporte, G., Renaud, J.: Heuristics for the traveling purchaser problem. Comput.
Oper. Res. 30(4), 491–504 (2003)
6. Cheaitou, A., Hamdan, S., Larbi, R., Alsyouf, I.: Sustainable traveling purchaser problem
with speed optimization. Int. J. Sustain. Transp. 15(8), 621–640 (2021)
7. Choi, M.J., Lee, S.H.: The multiple traveling purchaser problem for maximizing system’s
reliability with budget constraints. Expert Syst. Appl. 38(8), 9848–9853 (2011)
8. Gendreau, M., Hertz, A., Laporte, G.: A Tabu search heuristic for the vehicle routing problem.
Manag. Sci. 40(10), 1276–1290 (1994)
9. Glover, F.: Tabu search – Part I. ORSA J. Comput. 1(3), 190–206 (1989)
10. Glover, F.: Tabu search – Part II. ORSA J. Comput. 2(1), 4–32 (1990)
11. Gouveia, L., Paias, A., Voß, S.: Models for a traveling purchaser problem with additional
side-constraints. Comput. Oper. Res. 38, 550–558 (2011)
12. Kucukoglu, I.: The traveling purchaser problem with fast service option. Comput. Oper. Res.
141, 105700 (2022)
13. Manerba, M., Mansini, R., Riera-Ledesma, J.: The traveling purchaser problem and its
variants. Eur. J. Oper. Res. 259(1), 1–18 (2017)
14. Laporte, G., Riera-Ledesma, J., Salazar-González, J.-J.: A branch-and-cut algorithm for the
undirected traveling purchaser problem. Oper. Res. 51(6), 940–951 (2003)
15. Miller, C.E., Tucker, A.W., Zemlin, R.A.: Integer programming formulation of traveling
salesman problems. J. ACM 7(4), 326–329 (1960)
16. Qiu, M., Zhuo, F., Eglese, R., Qiong, T.: A Tabu search algorithm for the vehicle routing
problem with discrete split deliveries and pickups. Comput. Oper. Res. 100, 102–116 (2018)
17. Ramesh, T.: Traveling purchaser problem. Opsearch 18(1–3), 78–91 (1981)
18. Taillard, E., Badeau, P., Gendreau, M., Guertin, F., Potvin, J.-Y.: A Tabu search heuristic for
the vehicle routing problem with soft time windows. Transp. Sci. 31(2), 170–186 (1997)
19. Tsang, Y.P., Choy, K.L., Wu, C.H., Ho, G.T.S., Lam, H.Y., Tang, V.: An intelligent model for
assuring food quality in managing a multi-temperature food distribution centre. Food Control
90, 81–97 (2018)
20. Voß, S.: Dynamic tabu search strategies for the traveling purchaser problem. Ann. Oper. Res.
63, 253–275 (1996)
GRASP Solution Approach
for the E-Waste Collection Problem
1 Introduction
With the rapid advancement of technology and the continuous development of
production and industrial design, electronic and electrical devices have become
ubiquitous in lives, providing convenience and comfort on a daily basis. However,
this progress has also resulted in a significant drawback-the ever-shortening life
cycle of electrical and electronic devices. Consequently, there has been a marked
increase in the annual amount of e-waste - discarded electrical and electronic
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 260–275, 2023.
https://doi.org/10.1007/978-3-031-43612-3_16
GRASP Solution Approach for the E-Waste Collection Problem 261
devices and their components which are not reused. According to UNITAR,
in 2019, the world generated a staggering 53.6 Megatons of e-waste, of which
only 17.35% was properly managed in an environmentally sound manner, while
the fate of the remaining e-waste was unknown [1]. The collection of e-waste
plays a pivotal role in its overall management and treatment cycle, typically
involving the use of heavy vehicles and differing from conventional waste man-
agement practices. If not adequately planned, the e-waste collection process can
create numerous challenges for urban areas, such as straining the urban trans-
port system, contributing to increased air pollution, and burdening the e-waste
management companies. Unfortunately, the literature on optimizing the e-waste
collection process remains relatively limited. In the field of operations research,
the e-waste collection problem can be effectively modeled as a variant of the Vehi-
cle Routing Problem (VRP), which is extensively studied and widely applied in
the realm of logistics and supply chain management. To gain a comprehensive
understanding of the development of VRP models, methods, and applications,
interested readers are encouraged to refer to the work by Braekers et al. [2].
We provide a summary and highlight relevant literature on various algorithms
proposed to address different variants of the VRP in the context of e-waste man-
agement. Yao et al. [13] focused on two subproblems: optimizing the location of
Waste of Electric and Electronic Equipment (WEEE) transit sites and the VRP.
They proposed an exact method using the quadratic optimization and an ant
colony algorithm to solve instances in the context of Shanghai (China). Mar-
Ortiz et al. [11] formulated the WEEE collection problem as the VRP with
Split Loads and Time Windows and proposed a Greedy Randomized Adaptive
Searching Procedure (GRASP) metaheuristic to solve a large dataset in the con-
text of Galicia (Spain). Król et al. [10] presented a model for e-waste collection
at both doorsteps and collection points within a single day. The problem is
formulated as the VRP with intermediate facilities, considering heterogeneous
vehicles, breaks between shifts, and regional cost differences. A multiple neigh-
borhood search algorithm was proposed to tackle this problem. Pourhejazy et
al. [12] formulated the actual waste-collecting scheme as a Capacitated General
Routing Problem with Time Windows and adopted Tabu Search to handle it.
They considered population density to separate requests from on-call customers
and also accounted for a scheme of heterogeneous vehicles.
To the best of our knowledge, none addresses the waste collecting with dis-
tinct periods within distinct days in the schedule. In the most recent research
[7], a Heterogeneous VRP with Multiple Time Windows (HVRP-MTW) model
is introduced to optimize e-waste collection vehicle routing. CPLEX is employed
to solve the Mixed Integer Linear Programming formulation of the problem.
However, CPLEX is inefficient in solving practical-scale instances due to a sig-
nificant amount of time to handle the problem. In this work, we propose an
efficient approach based on the GRASP metaheuristic to obtain solutions for
larger instances within reasonable computational times. The remainder of this
article is organized as follows. Section 2 provides details of the problem descrip-
tion and the proposed algorithm. In Sect. 3, we discuss our experimental setup,
262 A. Gunawan et al.
2 Methodology
2.1 Problem Description
The problem addressed arises from the e-waste collection process in the context
of Singapore, which was modeled as a Mixed Integer Linear Program (MILP)
in [7]. E-waste public drop-off bins (e-bins) have been set up across Singapore.
Unlike conventional waste, which is collected daily, the e-bins only require col-
lection when the amount of e-waste reaches a specified level. Every day, the
system updates information about the e-bins that need to be collected, allow-
ing staff to carry out the collection task. Additionally, companies also cater to
on-demand e-waste collection requests from households and businesses. For each
on-demand request, customers are charged a fee. Moreover, the on-demand col-
lection requests need to be completed within a specific time frame specified by
the customer when making the request through a mobile application, creating
time windows.
Each customer’s request includes the preferred collection days and time win-
dows, resulting in a multiple time window problem. If the vehicle arrives before
the lower bound of the time window, an idle cost will be incurred. Conversely,
a penalty cost will be imposed if the vehicle arrives after the upper bound of
the time window, with the penalty cost typically being greater than the idle cost
due to its impact on the customer satisfaction. The company typically employs a
heterogeneous fleet of vehicles with two different types. These vehicles have vary-
ing capacities and operating costs per unit of time. Different collection points
have different amounts of e-waste that need to be collected and require a certain
amount of time for processing. Figure 1 illustrates a solution of the described
HVRP-MTW. The problem involves planning the routes for the existing vehi-
cles within a specific collection period, with the objective of maximizing profits
from the e-waste collection process.
To establish initial routes that satisfy the capacity constraint, the Clarke
and Wright savings heuristic is utilized [3]. However, the traditional Clarke and
Wright procedure is ineffective when dealing with a heterogeneous vehicle fleet.
In the merge step, the heuristic tends to combine all subtours until the total
demand reaches the capacity of the largest vehicle in the fleet. However, relying
solely on the use of large vehicles is not an optimal approach for solving the
heterogeneous fleet problem. To address this issue, we have adopted the Realistic
Opportunity Savings - 𝛾 (𝑅𝑂𝑆 − 𝛾) [6] to calculate the savings of the Clarke
and Wright heuristic in our solution construction phase: 𝑠𝑖 𝑗 = 𝑠𝑖 𝑗 + (1 − 𝛾)𝑡 𝑖 𝑗 .
Where 𝛾 is a route-shape parameter with the value varying from 0.0 to 3.0 [6].
By varying the value of 𝛾, the composition of the mixed fleet can be altered.
For instance, when 𝛾 = 0, the solution tends to favor larger vehicles, while
increasing 𝛾 to 2.5 results in the utilization of more small vehicles in the initial
solution. Therefore, 𝛾 serves as a diversification factor in creating a mixed fleet
initial solution. 𝑠𝑖 𝑗 are the savings calculated in Realistic Opportunity Savings
(ROS): 𝑠𝑖 𝑗 = 𝑠¯𝑖 𝑗 + 𝛿(𝜃)𝐹 (𝑃(𝑧 𝑖 + 𝑧 𝑗 ) − 𝑧 𝑖 − 𝑧 𝑗 ). The parameter 𝛿(𝜃) is a binary
indicator of whether the vehicle threshold 𝜃 has been exceeded. This threshold
is calculated by the formula 𝜃 = 𝑃(𝑧𝑖 + 𝑧 𝑗 ) − 𝑃(max(𝑧 𝑖 , 𝑧 𝑗 )), where 𝑃(𝑧) denotes
the capacity of the smallest vehicle that can serve a demand of 𝑧. If the vehicle
threshold is not exceeded, then 𝛿(𝜃) is set to 0. Conversely, when 𝜃 > 0, indicating
that the threshold has been exceeded, 𝛿(𝜃) is set to 1. 𝐹 (𝑧) in our study is
the operation cost rate of the largest vehicle that has a capacity less than or
264 A. Gunawan et al.
Algorithm 2 presents the Clarke and Wright savings heuristic with the
𝑅𝑂𝑆 − 𝛾 approach. First, the algorithm creates individual routes for each node
(e-bins and households) (Line 2). Then, it calculates the savings for each pair of
nodes using the 𝑅𝑂𝑆 − 𝛾 approach and sorts the savings list in the descending
order (Lines 3–4). Starting with the top of the sorted savings list, the algorithm
examines the savings 𝑠𝑖 𝑗 for two distinct routes, 𝑣 𝑖 and 𝑣 𝑗 . If it is possible to
merge these two routes (i.e., 𝑖 and 𝑗 are endpoints of their respective subtours,
and the merged subtour containing both 𝑖 and 𝑗 has a total demand not greater
than the capacity of the largest vehicle), the algorithm combines both by remov-
ing the arcs (𝑖, 0) and ( 𝑗, 0) and adding a new arc (𝑖, 𝑗). The merging process
continues until no further pairs of routes is feasible to merge, and the merged
routes are returned at the end of the algorithm (Lines 5–12).
Repair Procedure. The initial solution generated by ROS-𝛾 satisfies the vehi-
cle capacity constraint but it does not take the collection period into account.
To address this issue, Algorithm 3 proposes a repair procedure. It is initiated
with an initial solution, denoted as 𝑆. To ensure that all routes in 𝑆 comply with
the collection period constraint, the algorithm checks the routes violating the
collection day and stores them in the infeasible routes set, denoted as Ω (Line
2). To identify the violated nodes in each infeasible route, we select a collection
day for this route and then remove the nodes which are not available in the
selected collection day. This step is done by the following procedures. The algo-
rithm chooses the day with the highest customer availability as collection day
𝑘 𝜔 of an infeasible route 𝜔. If there are more than one day with same number of
highest customer availability, we choose collection day 𝑘 𝜔 randomly from these
days 𝐾 𝜔 (Lines 3 - 12).
GRASP Solution Approach for the E-Waste Collection Problem 265
The repair procedure then examines each node 𝑗 in the infeasible route 𝜔
to determine its availability on day 𝑘 𝜔 . Nodes that are unavailable on day 𝑘 𝜔
are recorded in a set, denoted as 𝜋 𝑤 ∈ Π, where Π is the set that stores all
nodes that violate the collection day of all infeasible routes. To repair the infea-
sible routes in 𝑆, the algorithm employs five repair operators (Lines 13–29). The
repair operators have a primary focus on enhancing the solution feasibility while
minimizing modifications to the initial solution’s route structure. Additionally,
these operators are designed to address varying levels of complexity in handling
infeasible solutions. Ideally, the repair procedure aims to rectify an infeasible
solution using a single swap operator, thus reducing the algorithm’s time com-
plexity. Below, we provide a breakdown of the five repair operators and their
respective details:
– R1: This operator is implemented by swapping a violated node 𝑖 from the
set of violated nodes 𝜋 𝜔 in route 𝜔 with another violated node 𝑗 from the
set of violated nodes 𝜋 𝜔 in another infeasible route 𝜔 ∈ Ω. However, for
this operator to be applied, it is necessary for the collection day 𝑘 𝜔 of the
infeasible route 𝜔 to be compatible with the available period 𝐻 𝑗 of node 𝑗. If
the node 𝑗 does not exist, meaning that R1 can only deal with several violated
nodes, the operator will return False, and we need to employ additional repair
operators to resolve the remaining violated nodes in 𝜔 (Line 15).
– R2: The R2 operator is implemented by selecting a violated node 𝑖 from the
set of violated nodes 𝜋 𝜔 in an infeasible route 𝜔 and swapping it with another
node 𝑗 ∉ 𝜋 𝜔 of another infeasible route 𝜔 . Similar to R1, the R2 operator
requires the available period 𝐻 𝑗 of node 𝑗 to be compatible with the collection
day 𝑘 𝜔 of infeasible route 𝜔. If the node 𝑗 does not exist, the operator returns
False, indicating that additional repair operators are required to address the
remaining violated nodes (Line 17).
– R3: This operator operates on the same principles as R1 and R2, but instead
of selecting a swap node 𝑗 from an infeasible route 𝜔 , it selects 𝑗 from a
feasible route 𝑣 ∉ Ω. Once 𝑗 has been selected, R3 swaps the violated node
𝑖 ∈ 𝜋 𝜔 with 𝑗, provided that the available period 𝐻 𝑗 is compatible with the
collection day 𝑘 𝜔 of route 𝜔. If there is no node 𝑗 in any feasible route 𝑣
available on 𝑘 𝜔 , the operator returns False, indicating that additional repair
operators are required to address the remaining violated nodes (Line 19).
– R4: The R4 operator inserts the violated node 𝑖 ∈ 𝜋 𝜔 into a feasible route
𝑣 ∉ Ω, provided that there is at least one day where the available day of
node 𝑖, 𝐻𝑖 , matches the collection period 𝐾 𝑣 of feasible route 𝑣. If there is no
feasible route 𝑣 with a suitable collection period to insert the violated node
𝑖, the operator returns False (Line 21).
– R5: The R5 operator repairs the current infeasible solution by creating a new
single trip with the violated node 𝑖 ∈ 𝜋 𝜔 and then deleting 𝑖 from the current
infeasible route 𝜔 (Line 23).
The initial solution then satisfies both capacity and collection period con-
straints. To diversify the initial solution, we randomly select a collection day in
feasible collection day for each route. Moreover, to determine the departure time
266 A. Gunawan et al.
Local Search. Once the initial solution has been obtained, the following local
search operators are applied to intensify the search for the optimal solution:
– N1 - Intra-relocation: The N1 operator explores the neighborhood solu-
tion by removing customer 𝑖 from route 𝑣 𝑖 and then reinserting 𝑖 at another
position in the same route 𝑣 𝑖 .
– N2 - Inter-relocation: The N2 operator removes customer 𝑖 from its current
route 𝑣 𝑖 and reinserts 𝑖 to a new route 𝑣 𝑗 provided that there is at least one
day in the available period of 𝑖 that coincides with the collection day of 𝑣 𝑗 .
Additionally, the total capacity of 𝑣 𝑗 after the reinsertion must not exceed
the capacity of the largest vehicle used.
GRASP Solution Approach for the E-Waste Collection Problem 267
– N11 - combine tours: The N11 operator selects two random tours 𝑣 𝑖 and 𝑣 𝑗
that have a combined total capacity that is less than the maximum capacity
of the largest vehicle in the fleet. It then combines these tours to create a new
tour. Two neighborhoods are created by the N11 operator, and the one with
the higher profit is selected.
All local search operators are applied in the first-improving strategy, as shown
in Algorithm 4.
Algorithm 5. Path-relinking
1: Input: 𝑆𝑖 , 𝑆𝑔
2: 𝑆 ← 𝑆𝑖 ;
3: 𝑓 (𝑆 ∗ ) ← max { 𝑓 (𝑆𝑖 ), 𝑓 (𝑆𝑔 ) };
4: 𝑆 ∗ ← arg max { 𝑓 (𝑆𝑖 ), 𝑓 (𝑆𝑔 ) };
5: Symmetric difference, Δ(𝑆, 𝑆𝑔 ) ← 𝑖 : 𝑆 [𝑖 ] ≠ 𝑆𝑔 [𝑖 ] ;
6: while Δ(𝑆, 𝑆𝑔 ) ≠ ∅ do
7: An index 𝑡 ∈ Δ(𝑆, 𝑆𝑔 ) where 𝑆 [𝑡 ] ≠ 𝑆𝑔 [𝑡 ] ;
8: Select a node 𝑖 = 𝑆 [𝑡 ] and 𝑗 = 𝑆 [𝑆𝑔 [𝑡 ] ] ;
9: 𝑆 ←Swap two selected nodes, swap(𝑆𝑖 , 𝑖, 𝑗 );
10: if valid(𝑆 ) = True then
11: if 𝑓 (𝑆 ) > 𝑓 (𝑆 ∗ ) then
12: 𝑆∗ ← 𝑆 ;
13: 𝑓 (𝑆 ∗ ) ← 𝑓 (𝑆 ) ;
14: end if
15: end if
16: end while
17: Return: 𝑆 ∗ , 𝑓 (𝑆 ∗ ) ;
as the current best solution 𝑆 ∗ (Lines 3–4). Then, Line 5 computes the symmetric
difference Δ between two solutions 𝑆 and 𝑆 𝑔 , which stores the index where nodes
in 𝑆 differ from 𝑆 𝑔 . The procedure performs the swap operator to relink the
solution 𝑆 to 𝑆 𝑔 (Lines 6–9). An example for a solution with eight nodes is
illustrated in Fig. 2. If the explored solution 𝑆 in the relinking process has a
higher profit than the current best solution 𝑆 ∗ , it becomes the best solution
in the following relinking path. There is a validity step in this procedure to
avoid partial solutions created in the relinking process (Line 10). The procedure
terminates when 𝑆 reaches 𝑆 𝑔 (i.e., Δ(𝑆, 𝑆 𝑔 ) = ∅), and it returns the best solution
in the relinking process 𝑆 ∗ .
elite solution 𝑆 (Lines 21–22). The better solution achieved from forward and
backward PR becomes the current solution 𝑆, and if it is not in 𝑃, it is added
to the pool (Lines 23–30). Next, the current solution 𝑆 is compared with the
best solution 𝑆 ∗ . If 𝑆 is better than 𝑆 ∗ , it becomes the best solution 𝑆 ∗ for the
following iteration (Lines 32–34). After 𝑖 𝑚𝑎𝑥 iterations, the algorithm returns
the best found solution 𝑆 ∗ (Line 37).
3 Computational Results
No Name N1 N2 𝑉1 𝑉2 No Name N1 N2 𝑉1 𝑉2
1 C1_6_1_01 4 4 4 4 19 R1_6_1_01 15 15 8 8
2 C1_6_1_02 4 8 6 6 20 R1_6_1_02 20 20 10 10
3 C1_6_1_03 4 12 8 8 21 R1_6_1_03 25 25 13 13
4 C1_6_1_04 8 4 3 3 22 R1_6_1_04 30 30 15 15
5 C1_6_1_05 12 4 4 4 23 R1_6_1_05 35 35 18 18
6 C1_6_1_06 6 6 3 3 24 R1_6_1_06 40 40 20 20
7 C1_6_1_07 8 8 4 4 25 R1_6_1_07 45 45 23 23
8 C1_6_1_08 10 10 5 5 26 R1_6_1_08 50 50 25 25
9 C1_6_1_09 12 12 6 6 27 R1_6_1_09 55 55 28 28
10 C2_6_1_01 4 4 2 2 28 R2_6_1_01 15 15 8 8
11 C2_6_1_02 4 8 3 3 29 R2_6_1_02 20 20 10 10
12 C2_6_1_03 4 12 4 4 30 R2_6_1_03 25 25 13 13
13 C2_6_1_04 8 4 3 3 31 R2_6_1_04 30 30 15 15
14 C2_6_1_05 12 4 4 4 32 R2_6_1_05 35 35 18 18
15 C2_6_1_06 6 6 3 3 33 R2_6_1_06 40 40 20 20
16 C2_6_1_07 8 8 4 4 34 R2_6_1_07 45 45 23 23
17 C2_6_1_08 10 10 5 5 35 R2_6_1_08 50 50 25 25
18 C2_6_1_09 12 12 6 6 36 R2_6_1_09 55 55 28 28
The exact solutions are obtained by IBM ILOG CPLEX Optimization Studio
- Academic version 12.10.0.0 using branch and bound methods on a personal
computer with Intel Core i7-12700 2.10 GHz, 16 GB RAM, 1000 GB HDD +
500 GB SSD, Windows 10 Education version 22H2 64-bit. The time limits are
set to 4 h per instance with the mathematical model adopted from [7]. Following
a series of tuning experiments, we have determined the maximum number of
iterations to 𝑖 𝑚𝑎𝑥 = 5000. For the PR procedure, we set a pool size of 𝜆 = 20
and a diversification coefficient of 𝜇 = 0.007. The GRASP-PR algorithm was
implemented by using Python 3.11.
Table 2 provides the results obtained by both GRASP-PR and CPLEX in
terms of solution quality and computational time. The bold numbers are optimal
solutions obtained by CPLEX, only 4 out of 36 instances. The remaining ones are
the best found so far or unsolved. Even for small instances like C1_6_1_01 and
C2_6_1_01, CPLEX requires 1200 and 2160 s seconds, respectively, to obtain
results. On the other hand, GRASP-PR can generate solutions within signifi-
cantly shorter computational times. For example, GRASP-PR can achieve an
optimal solution for instance C1_6_1_01 in 0.54 s. The maximum computa-
272 A. Gunawan et al.
tional time required by GRASP-PR for all instances is 982.15 s, with an average
running time of 296.55 s per instance.
In terms of solution quality, unlike CPLEX, GRASP-PR can successfully
solve all test instances and reach optimal solutions for instances solved by
CPLEX optimally. Moreover, for the remaining instances where CPLEX fails
to achieve optimality, the best found solution by GRASP-PR in 10 runs outper-
forms CPLEX in 7 out of 13 instances. For instance C1_6_1_03, the solution
found by GRASP-PR can improve the profit in CPLEX’s solution by 29.27%,
while for a larger instance R2_6_1_01 with 30 customers, the improvement is
as high as 75.85%. However, GRASP-PR shows limitations in its ability to scale
up to larger problem instances. The solutions obtained by GRASP-PR for the
same instances in different runs become inconsistent when the size of the problem
increases. Specifically, for the two largest instances in our test set, R2_6_1_08
and R2_6_1_09, with 100 and 110 customers, respectively, the average objec-
tive value of ten runs is almost 1.5 times worse than the objective value of the
best-found solution.
We also compare the 2-opt and 3-opt local search operators in terms of solu-
tion quality. We created two alternative scenarios: (i) the GRASP-PR meta-
heuristic without the 2-opt operator, and (ii) the GRASP-PR metaheuristic
without the 3-opt operator. Results are summarized in Table 3, together with
those of the original algorithm in terms of solution quality and computational
times.
We observe that both scenarios produce worse solutions than those of the
original GRASP-PR metaheuristic. This outcome is expected since the HVRP-
MTW represents a tightly constrained problem, and reducing the local search
operator leads to a decrease in the search space, thereby making it more chal-
GRASP Solution Approach for the E-Waste Collection Problem 273
No Instance GRASP-PR i ii
Best Avg. 10 CPU (s) Best Avg. 10 CPU (s) Best Avg. 10 CPU (s)
1 C1_6_1_01 377 377 0.54 377 373.3 0.54 377 377 0.49
2 C1_6_1_02 –222 –222 51.17 –222 –228.4 51.65 –222 –222 48.35
3 C1_6_1_03 –749 –823.7 83.30 –1045 –1086.9 83.40 –1052 –1101.2 74.60
4 C1_6_1_04 1191 1190.5 47.97 1191 1187.9 49.41 1191 1187.8 42.77
5 C1_6_1_05 1884 1815.7 77.84 1850 1786.6 73.53 1884 1776.9 60.26
6 C1_6_1_06 –77 –102.7 52.17 –77 –104.1 49.84 –77 –99.8 45.63
7 C1_6_1_07 799 629.1 79.53 799 669.3 80.41 751 648.5 67.08
8 C1_6_1_08 614 528 115.77 596 507.5 118.63 592 513.6 101.48
9 C1_6_1_09 1963 1790.4 147.68 1932 1732.9 141.90 1863 1690.2 125.71
10 C2_6_1_01 –306 –306 29.64 –306 –306 30.71 –306 –306 25.65
11 C2_6_1_02 –595 –595 40.25 –595 –595 42.10 –595 –595 32.43
12 C2_6_1_03 –939 –1020.5 68.36 –957 –1034.5 65.50 –1019 –1083.1 55.09
13 C2_6_1_04 670 646.9 59.00 669 630.9 59.26 670 592.6 54.39
14 C2_6_1_05 861 415.6 103.51 516 367.4 104.50 595 377.5 86.38
15 C2_6_1_06 246 181.5 41.52 189 161.9 41.05 258 152.8 35.58
16 C2_6_1_07 428 281.3 97.14 303 229.7 93.32 363 286.8 82.50
17 C2_6_1_08 60 –163.4 111.40 5 –190.8 105.90 –24 –269.9 91.48
18 C2_6_1_09 371 93.9 146.04 328 44.6 141.26 196 22.2 119.51
4 Conclusions
In this study, we propose a GRASP with Path-Relinking to address the e-waste
collection route planning problem modelled as Heterogeneous Vehicle Routing
Problem with Multiple Time Windows (HVRP-MTW). To evaluate the pro-
posed algorithm, named GRASP-PR, we conducted comparative experiments
274 A. Gunawan et al.
against a commercial solver. The results show that GRASP-PR can effectively
handle HVRP-MTW instances with up to 110 customers within reasonable com-
putational times. The comparison revealed that GRASP-PR outperforms the
commercial solver in large-scale instances and can generate optimal solutions
for small instances. However, the metaheuristic approach of GRASP-PR has its
limitation in convergence when the problem size increases. For future research,
the integration of advanced extensions with a learning process such as fixed set
search [8,9] is considered to enhance the convergence and performance of the
current metaheuristic.
References
1. Baldé, C., D’Angelo, E., Deubzer, V.L.O., Kuehr, R.: Global transboundary e-
waste flows monitor - 2022, United Nations Institute for Training and Research
(UNITAR), December 2022. https://ewastemonitor.info/gtf-2022/. Accessed 26
June 2023
2. Braekers, K., Ramaekers, K., Van Nieuwenhuyse, I.: The vehicle routing problem:
state of the art classification and review. Comput. Ind. Eng. 99, 300–313 (2016)
3. Clarke, G., Wright, J.W.: Scheduling of vehicles from a central depot to a number
of delivery points. Oper. Res. 12(4), 568–581 (1964)
4. Feo, T.A., Resende, M.G.: A probabilistic heuristic for a computationally difficult
set covering problem. Oper. Res. Lett. 8(2), 67–71 (1989)
5. Glover, F.: Tabu search and adaptive memory programming-advances, applications
and challenges. In: Barr, R.S., Helgason, R.V., Kennington, J.L. (eds.) Interfaces
in Computer Science and Operations Research. Operations Research/Computer
Science Interfaces Series, vol. 7, pp. 1–75. Springer, Boston, MA (1997). https://
doi.org/10.1007/978-1-4615-4102-8_1
6. Golden, B., Assad, A., Levy, L., Gheysens, F.: The fleet size and mix vehicle routing
problem. Comput. Oper. Res. 11(1), 49–66 (1984)
7. Gunawan, A., Nguyen, M.P.K., F. Yu, V., Nguyen, D.V.A.: The heterogeneous
vehicle routing problem with multiple time windows for the e-waste collection
problem. In: submitted to IEEE CASE 2023 (accepted) (2023)
8. Jovanovic, R., Sanfilippo, A.P., Voß, S.: Fixed set search applied to the clique
partitioning problem. Eur. J. Oper. Res. 309(1), 65–81 (2023)
9. Jovanovic, R., Tuba, M., Voß, S.: Fixed set search applied to the traveling sales-
man problem. In: Blesa Aguilera, M.J., Blum, C., Gambini Santos, H., Pinacho-
Davidson, P., Godoy del Campo, J. (eds.) HM 2019. LNCS, vol. 11299, pp. 63–77.
Springer, Cham (2019). https://doi.org/10.1007/978-3-030-05983-5_5
10. Król, A., Nowakowski, P., Mrówczyńska, B.: How to improve WEEE management?
Novel approach in mobile collection with application of artificial intelligence. Waste
Manag. 50, 222–233 (2016)
11. Mar-Ortiz, J., González-Velarde, J.L., Adenso-Díaz, B.: Designing routes for
WEEE collection: the vehicle routing problem with split loads and date windows.
J. Heuristics 19, 103–127 (2013)
GRASP Solution Approach for the E-Waste Collection Problem 275
12. Pourhejazy, P., Zhang, D., Zhu, Q., Wei, F., Song, S.: Integrated e-waste trans-
portation using capacitated general routing problem with time-window. Transp.
Res. Part E Logist. Transp. Rev. 145, 102169 (2021)
13. Yao, L., He, W., Li, G., Huang, J.: The integrated design and optimization of a
WEEE collection network in Shanghai, China. Waste Manag. Res. 31(9), 910–919
(2013)
The Heterogeneous Fleet
Risk-Constrained Vehicle Routing
Problem in Humanitarian Logistics
1 Introduction
During crises and disasters, humanitarian assistance is essential for alleviating
the suffering of those in need. This involves transporting goods such as food,
medicine, and in increasing numbers cash-based interventions [9]. However, these
movements are not without risks. People may resort to violent behavior in des-
perate times with a lack of resources such as food, water, or medication. For
instance, one week after the 2023 Turkey-Syria earthquake, multiple incidents of
violence and armed looting were reported. Furthermore, in conflict zones, rebel-
lious groups can also be interested in relief goods. Facing the attacks in multiple
cities in Sudan in the spring of 2023, with explosions, gunfire, and looting in the
streets, the World Food Programme had to suspend one of its largest food dis-
tribution missions. Moreover, during the civil war in South Sudan (2013–2020),
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 276–291, 2023.
https://doi.org/10.1007/978-3-031-43612-3_17
The Heterogeneous Fleet Risk-Constrained VRP in Humanitarian Logistics 277
combined with hunger and malnutrition in the country, dozens of incidents were
reported each month, including murders of aid workers. To avoid risks on the
roads, costly airlifts by helicopters and aircraft are often performed, which are
mainly suitable for large shipments. Unmanned Aerial Vehicles (UAVs) can pro-
vide a flexible and efficient alternative for smaller-scale distributions of aid orga-
nizations. Humanitarian cargo UAVs are not necessarily bound to fly between
airports, have the ability to deliver supplies to numerous locations spread over
an area, and can visit hard-to-reach places quickly [17].
Literature offers various secure routing approaches for high-value goods, often
referred to as cash-in-transit (CIT) problems. The transportation of valuable
relief goods in disaster areas can face comparable risks as cash transport. In most
CIT problems, the risk of vehicle attacks is mitigated by diversifying routing
plans through time inconsistency or path inconsistency [7]. However, random-
ization of routing plans is less suitable for humanitarian operations, as these
often deal with urgent, variable or one-off demands, sparse road networks, and
opponents that tend to operate opportunistically. An alternative approach is the
risk-constrained vehicle routing problem (RCVRP) introduced by Talarico et al.
[22], which limits the risk exposure in routes. Despite these approaches in CIT
literature, security issues are seldom considered in humanitarian research, yet
being a top concern for humanitarian organizations [3].
This paper investigates the problem of transporting high-value goods with a
heterogeneous fleet (i.e., trucks and UAVs) in high-risk disaster environments.
The following contributions are made in this paper. (i) We formulate the hetero-
geneous fleet risk-constrained vehicle routing problem (HFRCVRP) including
indirect losses of an attack. (ii) We propose an adaptive large neighborhood
search (ALNS) algorithm with specific operators to solve the HFRCVRP in a
reasonable time, as the HFRCVRP may need to be solved daily or suddenly after
a disaster. (iii) The methods are tested and results are analyzed using real-world
data from South Sudan.
The remainder of this paper is organized as follows. Related literature on the
HFRCVRP is reviewed in Sect. 2. In Sect. 3, we describe the problem and formu-
late the model. Section 4 introduces our ALNS heuristic. Section 5 describes the
real-world case study of South Sudan, the experimental design, and the results.
Finally, the conclusions and future research directions are stated in Sect. 6.
2 Related Work
In this section, we first describe the risk-constrained vehicle routing problem
(RCVRP) and then review literature on variants of the RCVRP that are pro-
posed in recent years. Talarico et al. [22] formally introduce the RCVRP. The
expected loss along edge (i, j) is the product of the probability of being robbed
pij (i.e., risk probability) and the loss Di of valuables transported from node i.
In the case of collecting cash, Di will increase every stop, in the case of deliveries,
Di will decrease at a stop. As the probability of being robbed along an edge is
low, Talarico et al. [22] simplify the expected loss associated with route r by
taking the sum of expected losses of robbery on each arc (i, j):
278 R.M. van Steenbergen et al.
pij · Dir (1)
(i,j)∈r
ping arcs and the spread arrival times (i.e., route predictability). Allahyari et
al. [1] apply the same risk index in a secure pickup and delivery problem with
time windows and include facility location decisions as well. To solve realistic
instances, they propose an adaptive large neighborhood search (ALNS) algo-
rithm, which outperforms an exact method considering a computational time
limit, and minimizes the risk index, depot costs, vehicle costs, and travel costs.
A two-echelon variant of the RCVRP is presented by Fallahtafti et al. [6], which
includes facility location decisions for multiple depots. A multi-objective simu-
lated annealing approach provides the most promising results in a computational
study to minimize depot costs, transportation costs, and risk exposure. Kian et
al. [11] consider an RCVRP for cash-based interventions in the Syrian refugee
crisis. They extend the works on facility location decisions and multi-echelon
problems by introducing temporary facilities and mobile depots. The goal is to
maximize the number of reached beneficiaries within a limited time period while
minimizing logistics costs. A hierarchical metaheuristic is developed to obtain
efficient solutions. An overview of all the works and their characteristics is pre-
sented in Table 1.
Table 1. Overview of literature regarding the RCVRP. (MO = multi-objective, MD
= multi-depot, ME = multi-echelon, LR = location-routing, TD = time-dependent
risk, PD = pickup and delivery, TW = time windows, OL = overlapping links, MP =
multi-period, HF = heterogeneous fleet)
road damage, are often ignored in the literature. These losses are considered
to not be under the control of the transportation company [6,22]. Nonetheless,
they can significantly impact operations and could still be mitigated by adapting
decisions (e.g., avoiding them by using UAVs). The risk of vehicle failure is con-
sidered by [21], showing similarities with indirect losses, as vehicle failure is also
unrelated to the amount of cargo. We address this gap in literature by including
indirect losses by estimating the value of the indirect costs of an attack. Further-
more, we formulate the heterogeneous fleet RCVRP, propose an adaptive large
neighborhood search heuristic with specific operators to solve the problem, and
analyze the problem in a humanitarian context in South Sudan.
Besides the direct consequences of a robbery (i.e., the loss of valuable goods),
indirect losses may also occur, such as required vehicle repairs due to damages,
emotional/physical abuse of employees, or administrative work (e.g., insurance).
We include these indirect losses that occur irrespective of the amount of lost
valuables in Dijr . The objective function of the HFRCVRP is to find routes
that minimize the total transportation costs and the total expected loss:
min (cijk xijkr + pijk Dijkr ) (2)
k∈K r∈N (i,j)∈A
– Random removal. Randomly selects q locations that are removed from the
routes and added to Φ.
– Distance removal. Selects a random location and removes it together with
its q − 1 nearest neighbors based on Euclidean distance.
– String removal. This operator proposed in [4] removes adjacent sequences of
locations (i.e., strings) in the routes. First, a seed location is selected, followed
by the removal of a string that encompasses the seed location. Thereafter, a
string is removed from another route that contains a location adjacent to the
seed location.
– Route removal. Randomly selects an existing route of one vehicle type
and completely destroys it. Specifically useful for heterogeneous fleets, as this
allows to mix up the locations of one specific vehicle type.
– Worst (cost/risk) removal. These operators remove the location that
results in the largest decrease in the objective value. To include some ran-
domness in the worst removal, e.g., to avoid the same location removed mul-
tiple times and to avoid local optima, the removal value is randomized, by
multiplying the value by a random number. This process is repeated until q
locations are removed. Two additional variants are developed for the HFR-
CVRP; the worst cost removal only considers the transportation costs and
the worst risk removal only considers the expected loss of the objective value.
– (Random) greedy repair. The greedy repair operators insert the removed
locations iteratively. In each iteration, the location with the smallest increase
of the objective is inserted in the best possible position, which is either in an
existing route or in a new route. For the random variant, the greedy repair is
randomized by multiplying the insertion value with a random number, aiming
to overcome the myopic behavior of the greedy repair [18].
– (Random) regret repair. The regret repair operators insert locations with
the highest regret. Regret is the difference in the insertion value between
inserting the location at its best position and at another position. When
the other position is the second-best position, it is denoted as 2-regret. In
each iteration of the regret operator, the location with the highest 2-regret is
inserted at its best position. Hence, we insert the location that we would most
regret if it later could not be inserted at its best position. For the random
variant, the regret is multiplied by a random number.
The Heterogeneous Fleet Risk-Constrained VRP in Humanitarian Logistics 283
The choice of destroy and repair operators in each iteration of ALNS is based
on the roulette-wheel selection principle (e.g., [12,18]). To select the different
operators, each operator k has a weight wi , i ∈ {1, 2, .., k} and we select the
operator j with probability:
wj
k (3)
i=1 wi
The weights of the operators are all initialized with a weight of 1 and updated
throughout the search. The entire search is divided into a number of segments.
Throughout each segment, we keep track of the performance of the operators
by assigning scores σ1 , σ2 , and σ3 . At the start of each segment, the score is set
to zero for each operator. If the operators find a new best solution, the score
is increased by σ1 . If operators find a solution that is better than the current
solution, the score is increased by σ2 . If the solution is worse than the current
solution but is accepted, the score is increased by σ3 . The latter is included to
encourage operators that diversify the search and explore new sections of the
solution space [18]. At the end of a segment, weights are updated according to
their scores. Let wij be the weight of operator i in segment j, πi be the score
of operator i, and θi be the number of times heuristic i is used in segment j. A
reaction factor γ controls the weight update:
πi
wi,j+1 = wij (1 − γ) + γ (4)
θi
In this section, we describe the case, the instances, and the experimental settings
which we use to solve the HFRCVRP, and clarify the tuning of the parameters
of the ALNS.
284 R.M. van Steenbergen et al.
A civil war, a widespread scarcity of food, and multiple floods have plagued
South Sudan for years, especially in 2017. According to the humanitarian snap-
shots of the United Nations Office for the Coordination of Humanitarian Affairs
(OCHA) in 2017, Unity was the state with severe food scarcity, the most inter-
nally displaced people, and the highest violence rate in South Sudan. Inci-
dents in Unity included violence against humanitarian personnel (also murders),
ambushes and looting, active hostilities, violence against humanitarian assets,
and operational interference and restrictions of movement. In 2017, the OCHA
reported 182 incidents in Unity, with 6 murders of aid workers among them.
Tonnes of nutritional supplies, essential medicines, and utilities were stolen and
in November 2017 food assistance was suspended in the south of Unity due to
ongoing fights in November 2017. In this study, we focus on this southern part,
containing the districts of Koch, Leer, and Mayiendit. Based on assessments from
the International Organization of Migration (IOM), we obtain a set of 125 loca-
tions where internally displaced people rely on food assistance or health support
(see Fig. 1). The depot is situated in the village of Leer, where a humanitarian
hub and an airstrip are located. The locations contain between 19 and 2,579
individuals, with a total number of 42,654 individuals in the area.
Fig. 1. Map of the operating area; the colors give a sense of the three districts and the
size of the locations scales according to the number of individuals.
The Heterogeneous Fleet Risk-Constrained VRP in Humanitarian Logistics 285
Speed (km/h) Costs (e/h) Capacity Qk Indirect loss (e) Risk factor
Truck 37.5a 70 6,000 10,000 0.01
UAV 120 250 200 0 0.00
a
On average, based on the obtained distances and travel times from ArcGIS
For all experiments, we consider that each individual requires 0.5kg of goods,
with a value of 2 euro per kg.
The ALNS framework contains several parameters that can be tuned according
to the problem setting, but once they are tuned on one instance, results tend to
be robust for a wide range of instances [18]. We tune our parameters on instance
7, the largest and most complex instance. To find suitable parameters, we varied
the parameters and chose the setting that showed the best performance. The
number of locations to remove each iteration q (i.e., the degree of destruction)
is picked from the uniform interval [0.1n, 0.2n], where n is the total number of
locations. The level of randomization of the costs for the initial construction, the
worst removal operators, and the randomized repair operators is set to [0.9, 1.1].
Regarding the scoring and weight updating of the operators, we divide the search
into segments of 50 iterations. In each iteration, weights get a score based on
their performance and the scores σ1 , σ2 , and σ3 , which are set to 30, 10, and
5, respectively. The reaction factor γ for weight updating is set to 0.1. The
acceptance threshold T of RRT is a function of the best solution f ∗ , with initial
threshold T start = 0.05f ∗ . The threshold decays linearly throughout the search
until T = 0.
The results obtained by the ALNS heuristic for all variants on the instances
are presented in Table 4, and were obtained within minutes on a 4-core 1.90
GHz computer with 16GB RAM. Without considering risks, UAVs are only
applied in three subsets and allocated to a total of five locations. About 1.1%
in transportation costs is saved in these instances compared to truck-only sce-
narios. When including the expected loss in the objective, UAVs are deployed
The Heterogeneous Fleet Risk-Constrained VRP in Humanitarian Logistics 287
in all instances visiting on average 19% of the locations, improving the objec-
tive of transportation costs and expected loss by 7.1% on average compared to
only trucks. Additionally, transportation costs per location decrease by 30% on
average for trucks when UAVs are deployed, showing that UAVs visit locations
further away from the depot. These relatively remote locations are also those
that result in a high exposure to risk for trucks, due to their longer travel times.
When comparing the bi-objective results with the results of the single cost
objective, we observe that for the truck-only instances, the transportation costs
increase by 11.9% and the expected loss decreases by 13.4% on average. For the
heterogeneous fleet instances, the effect is about four times as large, with an
average increase of 53.0% in costs, and a 47.7% average decrease of expected
loss, resulting in an overall decrease of 14.8% in the objective value.
Table 4. Results for all experiments, where each instance stands for the case subset
(1–7), truck-only (T) or heterogeneous fleet with trucks and UAVs (HF), and a single
objective based on costs (C) or bi-objective based on costs and risks (CR). The lowest
total costs and expected losses per subset are marked in bold.
Instance Truck costs UAV costs Expected loss Total # truck # UAV
locations locations
1-T-C 1824.2 0 4198.1 6022.2 31 0
1-T-CR 1850.8 0 3168.2 5019.0 31 0
1-HF-C 1824.2 0 4198.1 6022.2 31 0
1-HF-CR 622.9 2927.4 1076.8 4627.2 20 11
2-T-C 413.2 0 722.2 1135.4 54 0
2-T-CR 417.6 0 706.6 1124.2 54 0
2-HF-C 343.9 67.7 648.4 1059.9 51 3
2-HF-CR 269.9 204.8 484.7 959.4 42 12
3-T-C 837.1 0 1523.3 2360.4 40 0
3-T-CR 843.9 0 1453.2 2297.2 40 0
3-HF-C 762.7 54.9 1678.8 2496.5 39 1
3-HF-CR 445.7 946.2 733.8 2125.8 32 8
4-T-C 2186.7 0 4440.9 6627.6 85 0
4-T-CR 2224.2 0 3832.8 6057.0 85 0
4-HF-C 2186.7 0 4440.9 6627.6 85 0
4-HF-CR 896.8 3109.0 1591.3 5597.0 65 20
5-T-C 2283.9 0 4403.7 6687.6 71 0
5-T-CR 2294.9 0 3933.1 6228.0 71 0
5-HF-C 2214.1 54.9 4220.1 6489.1 70 1
5-HF-CR 1315.0 2078.8 2402.3 5796.2 63 8
6-T-C 1052.8 0 2474.0 3526.8 94 0
6-T-CR 1064.3 0 1910.1 2974.4 94 0
6-HF-C 1052.8 0 2474.4 3527.2 94 0
6-HF-CR 1034.5 72.4 1848.3 2955.2 92 2
7-T-C 2481.7 0 5194.7 7676.4 125 0
7-T-CR 2524.1 0 4398.7 6922.8 125 0
7-HF-C 2481.7 0 5194.7 7676.4 125 0
7-HF-CR 1481.3 2311.2 2831.8 6624.3 105 20
288 R.M. van Steenbergen et al.
For the largest instance with all districts, we perform a sensitivity analysis
to compare the separate objectives of transportation costs and the expected loss
with different weights, see Fig. 2. We weigh the transportation costs by ξ and
the expected loss by 1 − ξ in the objective, with different values of ξ ∈ [0, 1]. When
the transportation costs weigh 0.6 or higher, UAVs are not deployed, and trans-
portation costs and expected loss only slightly change. When risk becomes more
important, UAVs are increasingly deployed. Expected loss can be eliminated by
increasing transportation costs by a factor of five (i.e., from = 0.6 for the case
without UAVs to = 0.1 where UAVs fully diminish the risk exposure). The
changes in truck routes are minimal when risk becomes more important, which
can be explained by the indirect loss. Multiple shorter routes that decrease the
expected direct loss causes not only higher transportation costs but also increase
the expected indirect loss due to more time on the road. This allows for fewer
possibilities in reducing risks with only trucks.
Fig. 2. Results for case subset 7 for both trucks only and trucks & UAVs scenarios
with different weights for transportation costs and expected loss.
6 Conclusions
Risks in disaster areas can have a significant impact on the operations of human-
itarian organizations. In this work, we presented a heterogeneous fleet RCVRP
with the consideration of indirect losses (loss irrespective of the amount of load
carried, such as material- or emotional damage) to mitigate these impacts. We
developed an ALNS metaheuristic with specific destroy and repair operators
to solve this problem variant effectively. We tested this method on real-world
instances from high-risk areas in South Sudan.
The results show that both the minimization of expected losses and the use
of UAVs increase transportation costs, but simultaneously decrease the exposure
to risks significantly. This enables safer operations for aid workers of humani-
tarian organizations. Hence, UAVs provide a valuable alternative in high-risk
environments, with an average decrease of the objective value of 7.1% in the
The Heterogeneous Fleet Risk-Constrained VRP in Humanitarian Logistics 289
case of equal consideration of transportation costs and expected loss. UAVs are
especially valuable in situations in which locations are scattered and relatively
far away from humanitarian hubs. These locations cause long travel times and
high exposure to risks for road transport. Hence, UAVs can take over the most
challenging locations. In the largest instance, 16% of the locations were visited
by UAVs.
The inclusion of indirect losses for a more accurate consideration of risks
resulted in less flexibility in mitigating risks with trucks. Only minimal changes in
routes and objective value were found for different weights of the transportation
costs and the expected loss. For the heterogeneous fleet variant, risks could be
eliminated entirely in the most extreme variants by increasing transportation
costs by a factor of five. However, when the focus was mainly on transportation
costs, UAVs were not deployed.
In future work, we aim to test the ALNS framework on benchmark instances
and compare it with exact approaches to evaluate the performance of each
method. Furthermore, we suggest introducing prioritization based on the vulner-
abilities of specific beneficiaries, as the delivery of goods to certain groups might
be more important than to others. This could be done by explicitly considering
deprivation costs that can occur when beneficiaries have to wait when deliveries
cannot be performed due to attacks. Facility location decisions could also be
included, including setup costs for an operating base of UAVs.
References
1. Allahyari, S., Yaghoubi, S., Van Woensel, T.: A novel risk perspective on location-
routing planning: an application in cash transportation. Transp. Res. Part E Logist.
Transp. Rev. 150, 102356 (2021). https://doi.org/10.1016/j.tre.2021.102356
2. Allahyari, S., Yaghoubi, S., Van Woensel, T.: The secure time-dependent vehicle
routing problem with uncertain demands. Comput. Oper. Res. 131, 105253 (2021).
https://doi.org/10.1016/j.cor.2021.105253
3. Besiou, M., Van Wassenhove, L.N.: Humanitarian operations: a world of opportu-
nity for relevant and impactful research. Manuf. Serv. Oper. Manag. 22(1), 135–145
(2020). https://doi.org/10.1287/msom.2019.0799
4. Christiaens, J., Vanden Berghe, G.: Slack induction by string removals for vehicle
routing problems. Transp. Sci. 54(2), 417–433 (2020). https://doi.org/10.1287/
trsc.2019.0914
5. Dueck, G.: New optimization heuristics: the great deluge algorithm and the record-
to-record travel. J. Comput. Phys. 104(1), 86–92 (1993). https://doi.org/10.1006/
jcph.1993.1010
6. Fallahtafti, A., Ardjmand, E., Young Ii, W.A., Weckman, G.R.: A multi-objective
two-echelon location-routing problem for cash logistics: a metaheuristic app-
roach. Appl. Soft Comput. 111, 107685 (2021). https://doi.org/10.1016/j.asoc.
2021.107685
7. Fröhlich, G.E., Gansterer, M., Doerner, K.F.: Safe and secure vehicle routing: a
survey on minimization of risk exposure. Int. Trans. Oper. Res. (2022). https://
doi.org/10.1111/itor.13130
290 R.M. van Steenbergen et al.
24. Talarico, L., Springael, J., Sörensen, K., Talarico, F.: A large neighbourhood meta-
heuristic for the risk-constrained cash-in-transit vehicle routing problem. Comput.
Oper. Res. 78, 547–556 (2017). https://doi.org/10.1016/j.cor.2016.04.003
25. Tikani, H., Setak, M., Demir, E.: A risk-constrained time-dependent cash-in-transit
routing problem in multigraph under uncertainty. Eur. J. Oper. Res. 293(2), 703–
730 (2021). https://doi.org/10.1016/j.ejor.2020.12.020
26. Vincent, F.Y., Jewpanya, P., Redi, A.P., Tsao, Y.C.: Adaptive neighborhood sim-
ulated annealing for the heterogeneous fleet vehicle routing problem with multiple
cross-docks. Comput. Oper. Res. 129, 105205 (2021). https://doi.org/10.1016/j.
cor.2020.105205
The Snow Grooming Routing Problem
1 Introduction
There are 5360 km of cross-country skiing tracks in Norway’s top ten most pop-
ular ski facilities. Municipalities, ski clubs and commercial actors in these areas
spend more than 50 million NOK in total each season to maintain their track
networks [3].
According to Statistics Norway [13], 34266 km of tracks were planned to be
groomed in 2022. Extrapolating the costs from the popular facilities results in a
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 292–306, 2023.
https://doi.org/10.1007/978-3-031-43612-3_18
The Snow Grooming Routing Problem 293
rough estimate of 300 million NOK spent every year grooming Norwegian track
networks. These networks are complex, consisting of wide and narrow track
segments for the different styles of cross-country skiing. Popular routes must
be prioritized and groomed more often than peripheral segments solely used by
enthusiasts to maintain a certain standard across the whole network. Grooming
requests from local ski clubs, winter sport events and competitions must be
handled and included in the planning of the operations. These events often set
high standards for the snow grooming, requiring the grooming to be done within
specific time windows. Requests vary from being weekly requests known from
the start of the season to urgent short-term requests. The fact that requested
segments are mandatory complicates the planning.
In 2023, there were more than 700 snowcats grooming track networks and
alpine slopes in Norway [12]. Popular areas have fleets of snowcats grooming
simultaneously to meet demand. Statistics Norway show an increase in cabins
and holiday homes in Norway, where an expected consequence is an increase in
demand for cross-country tracks [14]. Current track networks will therefore prob-
ably meet an increase in skier traffic, requiring additional networks or expansion
of the current ones to be practical. This will further increase the complexity of
the grooming operations. New technology with GPS tracking of snowcats and
websites like skisporet.no has made it easier for skiers to find freshly groomed
tracks, but for the facilitators of the networks, helpful tools for planning groom-
ing operations is hard to find.
The problem of planning the grooming operations, the Snow Grooming Rout-
ing Problem (SGRP) is a rich arc routing problem with profits, see for example
[1,6] and [2] for recent surveys and a book on many aspects of arc routing
problems. The SGRP has multiple depots and a heterogeneous fleet. The track
network consists of wide and narrow segments, which are either requested, pri-
oritized, or regular. A wide segment can be groomed by one large snowcat or two
small snowcats. Requested segments have to be served within a given time win-
dow while the others can be groomed without regard to time. The optional seg-
ments are diversified by priority based on popularity for skiers but are groomed
without precedence to ensure an efficient route through the network. Arc rout-
ing problems for snow plowing and road gritting have similar attributes, see for
example [7–11], and [5] but this constitutes the first known attempt of solving
the Snow Grooming Routing Problem.
This paper aims to give a formal description of the Snow Grooming Routing
Problem (SGRP), formulate the problem as a mathematical model, and use the
model to optimize the snow grooming operations on a set of test instances based
on real-life data. First, we give some background to snow grooming and different
aspects of planning snow grooming operations in Sect. 2. The problem description
follows in Sect. 3 before the mathematical formulation in Sect. 4. Results from
testing the model on five instance are presented in Sect. 5. We conclude the paper
with a short discussion in Sect. 6.
294 H. Andersson et al.
2 Background
Networks of ski tracks vary in size and complexity from place to place. Some areas
have huge and intricate networks, while others have only a couple of closed loops
of tracks. The complexity comes from different track widths, the combination of
prioritized and requested track segments, track crossings, bridges, lakes, swamps
and road crossings. Small networks are usually groomed by a single snowcat,
while large networks have a larger fleet and potentially multiple depots. Norway
has hundreds of track networks, and Fig. 1 shows an overview of these in south-
ern Norway. Figure 2 is a more detailed map of the track network in Bymarka,
Trondheim. This is the main network in the Trondheim area and is the basis
for the problem description and the test instances. The network is operated by
Trondheim Bydrift and consists of more than 100 km of tracks. Three snowcats
in two depots are used to groom the tracks.
Track networks with stadiums and tracks meant for competitions are often
very dense, which means that they have many intersections and relatively short
distances between them. Competition tracks are mostly loops stretching 3, 5 or
10 km, and should be groomed in a single route so that skiers can use them
at once. The grooming of dense networks is often more challenging to plan
because of the numerous possibilities, while scattered networks with fewer seg-
ments between intersections and longer track segments pose fewer options.
The Snow Grooming Routing Problem 295
There are many stakeholders with different agendas regarding the snow
grooming operation, the track network and the surrounding nature. The gen-
eral public is the largest stakeholder, visiting popular cabins serving food as
well as the outskirts of the track network. They are to some extent interested
in quantity over quality when it comes to grooming, and that the track net-
works are large enough. The second largest stakeholder is the ski clubs and
high-schools for athletes. They normally have a set of routes that they regularly
use for organized training activities, and are mostly interested in top quality for
these specific routes. They also hand in grooming requests for special events such
as competitions. The municipalities want to facilitate for an active population,
while at the same time preserve the regional nature and minimize local pollution
in the area.
Decisions about which segments of the network to prioritize early in the
season are made in collaboration with the stakeholders. On a daily basis, it is
up to the snowcat operators to determine which tracks to groom and when to
groom them. Planning longer time-spans is difficult and not robust enough, due
to aspects like weather changes and short-term grooming requests.
During periods of consistent whether a weekly schedule for the network can
be followed, but with an occurrence of precipitation or a change in temperature,
the schedule is of little use. A snowfall resets the problem and all previously
groomed segments will have to be groomed again. Since consistent weather is
the exception for most of the season, the operators experience much overtime.
In some municipalities, the grooming of the tracks can be requested in
advance by stakeholders, either as a repetitive request throughout the season
or as a one-time request on an occasional basis. These requested tracks must be
of a certain standard at a given time of day. To meet the quality requirements,
the requested tracks must be groomed within a given time window. The time
window is set based on weather forecasts and projected traffic of skiers in the
network, a rule of thumb is that the grooming is finished an hour before the start
of the event. The total duration of the time window depends on the length of
the requested route. The possibility of requesting the grooming of certain tracks
results in a further complication of the route planning.
There are some segments that are prioritized, typically popular roundtrips
or routes leading to serviced cabins. Other segments are used less, but still they
have to be groomed regularly because of snowfalls. The goal is, therefore, to
ensure top track quality for the most popular tracks, while at the same time
not neglecting the other segments, substantiating the importance of planning.
Track networks with multiple depots are faced with a more complicated planning
problem. Coordination of routes to minimize overlapping and non-groomed track
segments is obtainable, but hard to plan since the route selection of the operators
is often done in a myopic way with incomplete information of the plans of the
other operators.
With different sizes of snowcats and different widths of tracks, comes the
problem of assigning the right snowcat to the right track segment. Large snowcats
are typically too wide to traverse narrow tracks with only classic skiing, meaning
296 H. Andersson et al.
they can only be used on wide tracks (for both classic and skate). The small
snowcats are used for the narrow tracks, but can also be delegated to grooming
wide tracks. For a small snowcat to groom a wide track, it must traverse it twice.
3 Problem Description
The Snow Grooming Routing Problem (SGRP) is an extension of the arc routing
problem with profits. The problem arises in cross-country skiing facilities around
the world when scheduling the grooming of the track networks. The network is
a set of tracks (edges) and intersections (nodes) between them. Each edge has
a given length and topography and is classified according to width and priority.
The width is either wide or narrow. An edge can be without priority, prioritized
or requested. A requested edge must be groomed and there is a time window
specifying when the edge can be groomed. The characteristics of a track segment
can only change at the intersections. In general, the network is non-directional,
meaning that it does not matter in which direction a snowcat traverses an edge.
A network of ski tracks generally contains many intersections and relatively few
track segments between every pair of intersections.
On narrow tracks, a small snowcat can choose to groom two classic sections
or a combination of classic and skate. This is a dynamic setting that the operator
can set with negligible setup time and cost. For wide track segments, a small
snowcat will have to traverse the segment twice to be able to groom the whole
width. The solution is often to acquire a larger snowcat for networks with many
wide segments, which only have to traverse these segments once. These large
snowcats cannot traverse the narrower segments. Every intersection poses the
possibility of a change in track type, complicating the route planning and choice
of snowcat type for the route. Different snowcats have similar attributes, apart
from width and range. They operate at identical speeds under the same weather
conditions. Within each snowcat size, there are no significant differences. There-
fore, all track segments of the same type can be groomed by the same snowcats.
Figure 3 is a snippet of the larger track network in Bymarka, Trondheim (left),
which is presented in the form of a graph (right). Each segment is represented
by an edge in the graph.
When a track segment is groomed, there is no added benefit by grooming
it over again, and it may even be undesirable for the skiing conditions during
periods of consistent weather conditions. The problem aims to determine how
the track network should be groomed. That is, which snowcats should groom
which track segments, and at what time. The purpose is to ensure the effective
use of the available resources.
4 Mathematical Formulations
Before presenting the notation and the mathematical model, we state some
important assumptions:
The Snow Grooming Routing Problem 297
Fig. 3. Small part of the track network in Bymarka, Trondheim (left) and the corre-
sponding graph (right). Nodes are intersections and edges are segments. The priorities
are shown: dotted (no priority), dashed (prioritized), and solid (requested). Wide seg-
ments are marked with thick lines.
– Costs of grooming operations are not included in the model. Describing the
problem for daily operations, long-term overhead costs like inventory and
wages are considered sunk.
– The problem considers a single day. The weather conditions are known and
consistent during the day.
– The working hours are fixed, imposed as an upper bound with no possibility
of overtime.
We denote the track segments as edges and the intersections as nodes. The
problem is formulated over a graph G = (N , E) where N is the set of nodes and
E the set of edges. We also introduce A as the set of arcs to track directions,
each edge corresponds to two arcs. A subset E W ⊆ E of the edges are wide
edges, another subset E P ⊆ E of the edges are prioritized edges, and a third
subset E R ⊆ E of the edges are requested edges that must be groomed within a
given time window. We also define K to be the set of snowcats, and denote the
set of small and large snowcats as KS and KL , respectively.
We use (ij) to denote the edge connecting node i and j, with i < j. When
the direction the edge is traversed matter, we use ij to denote the arc starting
in node i and ending in node j. The time used to traverse edge (ij) is T(ij) and
its length is L(ij) . We use [T (ij) , T (ij) ] to denote the time window defined for
the requested edge (ij). The time available for snowcat k is TkK and the node
where it must start and end its route is denote i(k). We introduce a reward
R(ij) for grooming edge (ij) and a cost C per length unit groomed to penalize
unnecessary grooming.
298 H. Andersson et al.
We use a leg-based formulation and introduce the index set L = {0, . . . , L+1}
for the set of legs where L is an upper bound on the number of arcs in the route
of a snowcat. The binary variable xkijl is 1 if snowcat k grooms arc ij on leg
l, and 0 otherwise. The binary variable v(ij) is 1 if edge (ij) is groomed, and 0
otherwise, and wk(ij)l is 1 if edge (ij) ∈ E R is groomed by snowcat k on leg l.
The time variable tkl is the time snowcat k starts leg l and τk(ij)l is the time
when edge (ij) ∈ E R is groomed by snowcat k on leg l.
With this notation, the SGRP can be formulated as follows:
max z = R(ij) v(ij) − C L(ij) (xkijl + xkijl ) (1)
(ij)∈E k∈K (ij)∈E l∈L
The objective function (1) maximizes the reward from groomed edges and penal-
izes the traversal of an edge more than once.
Constraints (2) and (4) state that snowcat k must start and end its route in
its depot i(k). Node 0 is an artificial starting node with T(0i(k)) = L(0i(k)) = 0.
Constraints (3) ensure the flow of snowcats through the nodes. They state that
if snowcat k enters node j, it must exit the same node on its next leg.
v(ij) ≤ (xkijl + xkjil ) (ij) ∈ E \ E W (5)
k∈KS l∈L
1
v(ij) ≤ (xkijl + xkjil ) + (xkijl + xkjil ) (ij) ∈ E W (6)
2
k∈KL l∈L S
k∈K l∈L
Constraints (5) and (6) keep track of if an edge (ij) is groomed or not. The right
side of the constraints contains both arc ij and arc ji, so the model becomes non-
directional. Constraints (6) also state that for wide edges, a small snowcat must
groom the edge twice, while large snowcats only need one traversal. Constraints
(5) ensure that only small snowcats can groom narrow edges.
Constraints (7) link the x-variable with the w-variable, so that a requested edge
is considered groomed independent of which direction it is traversed. Constraints
The Snow Grooming Routing Problem 299
(8) state that the prioritized track segments must be groomed, either once by
a large snowcat or twice by a small snowcat. Constraints (9) state that narrow
requested edges must be groomed by a small snowcat.
tk,l+1 ≥ tkl + T(ij) (xkijl + xkijl ) k ∈ K, l ∈ L (10)
ij∈E
Constraints (10) update the starting time for snowcat k on leg l+1 by connecting
it with the starting time on leg l.
τk(ij)l ≥ tkl − M (1 − wk(ij)l ) k ∈ K, (ij) ∈ E R , l ∈ L (11)
R
τk(ij)l ≤ tkl + M (1 − wk(ij)l ) k ∈ K, (ij) ∈ E , l ∈ L (12)
R
T (ij) ≤ τk(ij)l ≤ T (ij) k ∈ K, (ij) ∈ E , l ∈ L (13)
Constraints (11) and (12) link the τ -variable and the t-variable. Constraints (13)
make sure that the requested edges are groomed within their time window.
xkijl ∈ {0, 1} k ∈ K, ij ∈ A, l ∈ L (14)
R
wk(ij)l ∈ {0, 1} k ∈ K, (ij) ∈ E , l ∈ L (15)
v(ij) ∈ {0, 1} (ij) ∈ E (16)
0 ≤ tkl ≤ TkK k ∈ K, l ∈ L (17)
R
τk(ij)l ≥ 0 k ∈ K, (ij) ∈ E , l ∈ L (18)
Constraints (14)–(16) are binary restrictions, while constraints (17) set the
bound on the time used. Constraints (18) ensure non-negativity. Variables xkijl
and wk(ij)l for large snowcats are not generated for narrow edges.
5 Results
We have collected data for Bymarka, Trondheim and created 5 instances. Rel-
evant statistics for each instance are presented in Table 1. The table shows the
300 H. Andersson et al.
name of the instance, the total length of the network in km, the number of
nodes, the number of edges, the number of wide edges, the number of priori-
tized edges, the number of requested edges and the number of small and large
snowcats respectively. The solution time is set to 7200 s for all instances. The
optimization model is implemented in the programming language Mosel [4] and
the models are solved using Xpress-Optimizer Version 9.0. All software is run on
Lenovo ThinkSystem SD530 computers with the following specifications.
Since each network is connected and there are positive revenues for all edges,
we can assume that each network will be completely groomed given enough time.
Our first test is therefore to run the model on each network without considering
the time windows and the available time for different values on the number of legs
L. A lower bound on the number of legs needed can be calculated as |E| + |E W |
for the instances with one small snowcat. The results are presented in Table 2.
The table shows the name of the instance, the maximum number of legs for each
snowcat, the objective value of the best feasible solution, the dual bound, the
number of edges groomed, and the number of edges traversed.
A first observation when studying the results in Table 2 is that the problem
is very complex and most instances are not solved to proven optimality. Still,
the gaps are low in all cases and we believe that valid concussions can be drawn
based on the results.
If we analyze Leinstrand, the lower bound is 42 and we see that L = 40 is
clearly not enough to groom all edges. Even though L = 50 is higher than the
lower bound, we are still not able to groom all edges. The solutions for L = 60
and L = 70 show that 53 edges (excluding the artificial edges to and from node
0) must be traversed to groom the full network.
The results are the same for the other cases with one small snowcat. The
lower bound on SkistuaV is 45 and 57 edges are needed to completely groom
The Snow Grooming Routing Problem 301
the network and the lower bound for SkistuaO is 111 and the number of edges
needed is 127. On the instances with more than one snowcat, the lower bound
on the number of edges is |E|, the small snowcat grooms all narrow edges and
the large snowcat all wide edges, but how to set L based on this is an open
question. The results in Table 2 show that L = 50 is enough for Nilsbyen. For
N&L, L = 90 gives the best results.
Since restricting L causes changes to the solution that do not have to do
with the planning problem in itself, we continue the computational study with
L large enough to not affect the optimal solution of the problem. Still, since the
number of variables and constraints depends on L, keeping it low is preferable.
Next, we analyse the effect of the available time and time windows. We test
this on the Leinstrand instance where the snowcat spends 164.8 min to groom the
complete network. We set the available time to 120 min and test without time
windows. In the solution from Table 2 we note that about half of the requested
edges are groom in the beginning of the route and the rest at the end of the
route. We therefore also test with three different time windows; [0, 60], [30, 90],
and [60, 120]. The results of these tests are presented in Table 3. The table shows
the name of the instance, the maximum number of legs for the snowcat, the
302 H. Andersson et al.
Fig. 4. Two examples of time constrained solutions for the Leinstrand instance. With-
out time windows (left) and with time windows [30, 90] (right). The edges that are not
groomed are shown in orange. The priorities are shown: dotted (no priority), dashed
(prioritized), and solid (requested). Wide segments are marked with thick lines. (Color
figure online)
available time for the snowcat, the time windows [T (ij) , T (ij) ], which are the
same for all requested edges, the objective value of the best feasible solution, the
dual bound, the number of edges groomed, the number of edges traversed, and
the time the snowcat uses to groom.
It is clear that the number of edges that are groomed decreases. In Fig. 4, the
edges that are not groomed are marked in orange. The broad, prioritized edges
are all groomed. The difference between the solutions without time windows and
with early or late time windows is small. In the solution without time windows,
almost all requested arcs are groomed in the first hour, and due to symmetry
it is likely that there is an alternative solution where the remaining edges can
also be groomed within the first hour. Since there are only edges, a route can
be traversed in opposite direction without changing the cost. This means that
a feasible route for the case with time windows [0, 60] can become a feasible
route for the case with time windows [60, 120] by traversing it in the opposite
direction. The solution for the time windows [30, 90] is worse. The snowcat cannot
start grooming the requested edges and must return to do this later. This gives a
situation where the farthest edges cannot be groomed. Not only is the number of
edges lower, but the length of the edges not groomed is also longer. In the time
restricted case without time windows, 8.2 km of the network is not groomed,
while the corresponding number of the case with the time windows [30, 90] is
10.1 km.
In the last analysis, we study the effect of coordination on the two snowcats
instances Nilsbyen and N&L. The variable restrictions (17) set an upper bound
The Snow Grooming Routing Problem 303
TkK on the available time for snowcat k. This models the case when one driver
per snowcat grooms the track. Another interesting situation is when there is one
driver that operates both snowcats and has an upper bound T K on the total
available time. To model this, we introduce the constraint
tk,L+1 ≤ T K (19)
k∈K
To test coordination together with time windows, we introduce the binary vari-
able o, which is 1 if the small snowcat is operated first and 0 otherwise. The
sequencing of the snowcats is modeled by the following constraints
where k S and k L are the indices for the small and large snowcat respectively. If
the fleet of snowcats is larger, a more complex formulation of the sequencing is
needed. Here, we limit the time using the variable bounds
We run Nilsbyen and N&L with less available time than the time used in
the solution from Table 2. We then take the same total available time, but let
the model decide on how to divide the time between the snowcats. We also test
with even less available time to see if coordination can compensate for less time.
The results from these tests are given in Table 4. The table shows the name of
the instance, the maximum number of legs for the snowcat, the available time
for each snowcat when there is no coordination, the total available time when
there is coordination, the time windows [T (ij) , T (ij) ], which are the same for all
requested edges, the objective value of the best feasible solution, the dual bound,
the number of edges groomed, the number of edges traversed, and the time spent
by the small and large snowcat respectively.
304 H. Andersson et al.
it before it has to return to the depot. When going back and forth on the same
edges, the advantage of the large snowcat is lost.
The results are similar for the N&L instance, the case with a total available
time is better than the case with individual available times. This is especially
true for the 90 + 90 case compared with the 180 case. Figure 6 illustrates this
case. When the total time can be distributed, more time can be given to the
large snowcat so that it can groom all prioritized edges.
Fig. 6. A comparison between solutions without and with coordination for the N&L
instance. In the left figure, each snowcat has 90 min available, in the right figure the
total available time is 180 min. The route of the small snowcat is marked in orange and
the route of the large snowcat is marked in green. The priorities are shown: dotted (no
priority), dashed (prioritized), and solid (requested). Wide segments are marked with
thick lines. (Color figure online)
6 Conclusion
We have introduced a new problem to the arc routing society, the Snow Groom-
ing Routing Problem (SGRP). The SGRP is a rich arc routing problem with
profits including multiple depots, heterogeneous vehicles, time windows, and
edges that can be serviced in different ways. A mixed-integer linear formulation
of the problem is given and tested on five different instances based on real-
life data. The computational study highlights interesting findings regarding the
effect of restricted time and time windows, and coordination. The problem is
complex and hard to solve and developing both efficient heuristics and other
exact solution methods are interesting avenues for future research. The SGRP
studied here is a static version of the problem, extending it to a dynamic prob-
lem where both the initial condition of the tracks and the weather are included,
which is an interesting topic for future research.
306 H. Andersson et al.
References
1. Corberán, A., Eglese, R., Hasle, G., Plana, I., Sanchis, J.M.: Arc routing problems:
a review of the past, present, and future. Networks 77(1), 88–115 (2021)
2. Corberán, A., Laporte, G.: Arc Routing. Society for Industrial and Applied Math-
ematics, Philadelphia (2015)
3. Dagens Næringsliv: Norges beste langrennsløyper (2013). https://www.dn.no/
trening/norges-beste-langrennsomrader/1-1-5010947. Accessed 13 Apr 2023
4. Fair Isaac Corporation: FICOXpress
R Optimization Help (2023). https://www.
fico.com/fico-xpress-optimization/docs/latest/overview.html. Accessed 02 Feb
2023
5. Gundersen, A.H., Johansen, M., Kjær, B.S., Andersson, H., Stålhane, M.: Arc
routing with precedence constraints: an application to snow plowing operations.
In: ICCL 2017. LNCS, vol. 10572, pp. 174–188. Springer, Cham (2017). https://
doi.org/10.1007/978-3-319-68496-3 12
6. Mourão, M.C., Pinto, L.S.: An updated annotated bibliography on arc routing
problems. Networks 70(3), 144–194 (2017)
7. Perrier, N., Langevin, A., Amaya, C.A.: Vehicle routing for urban snow plowing
operations. Transp. Sci. 42(1), 44–56 (2008)
8. Perrier, N., Langevin, A., Campbell, J.F.: A survey of models and algorithms for
winter road maintenance. Part I: system design for spreading and plowing. Comput.
Oper. Res. 33(1), 209–238 (2006)
9. Perrier, N., Langevin, A., Campbell, J.F.: A survey of models and algorithms for
winter road maintenance. Part II: system design for snow disposal. Comput. Oper.
Res. 33(1), 239–262 (2006)
10. Perrier, N., Langevin, A., Campbell, J.F.: A survey of models and algorithms for
winter road maintenance. Part III: vehicle routing and depot location for spreading.
Comput. Oper. Res. 34(1), 211–257 (2007)
11. Perrier, N., Langevin, A., Campbell, J.F.: A survey of models and algorithms for
winter road maintenance. Part IV: vehicle routing and fleet sizing for plowing and
snow disposal. Comput. Oper. Res. 34(1), 258–294 (2007)
12. Skisporet: Skisporet - om oss (2023). https://www.skisporet.no/about. Accessed
13 Apr 2023
13. Statistisk Sentralbyrå: Tabell 12593: Tilrettelegging for friluftsliv (2022). https://
www.ssb.no/statbank/table/12593/. Accessed 13 Apr 2023
14. Statistisk Sentralbyrå: Bygningsmassen (2023). https://www.ssb.no/
bygningsmasse. Accessed 13 Apr 2023
A Constraint Programming Model
for the Vehicle Routing Problem
with Multiple Time Windows
Florian Linß(B)
1 Introduction
Solomon instances [14]. These benchmark instances have since served as a stan-
dard for evaluating future work for the VRPMTW. Other state-of-the-art heuris-
tics include the adaptive large neighborhood search (ALNS) proposed by [8], the
genetic VNS introduced by [3], the adaptive variable neighborhood search sug-
gested by [5], and the ALNS presented by [13]. More recently, [15] extended
the VRPMTW to the vehicle routing problem (VRP) with availability profiles,
where successful delivery to customers depends on the probability of delivery
within specific time windows. The proposed ALNS heuristic was also evaluated
for the VRPMTW and produced new best-known solutions.
Regarding exact methods, different mixed integer programming (MIP) for-
mulations exist in the related literature. Still, these are mainly problem defini-
tions and have rarely been evaluated as exact approaches for the VRPMTW. [2]
introduced a three-index (vehicle flow) formulation where each vehicle’s flow is
formulated separately. However, since the vehicles are homogeneous, differentia-
tion between vehicles is not necessary. Thus, the formulation can be strengthened
to a two-index formulation introduced by [13], where the flow is formulated aggre-
gated for all vehicles. The authors implemented both MIP formulations using a
standard solver and showed that the latter formulation performed significantly
better. Nevertheless, there were already instances with only 15 customers that
could not be solved optimally. In addition to the comprehensive MIP formula-
tions, [4] presented a column generation approach and a post-optimization VNS
heuristic. For this approach, the pricing problem - which involves the determi-
nation of an elementary shortest path with multiple time windows and capacity
constraints - is solved using a standard solver. The authors were able to optimally
solve instances with up to 17 customers. The rarity of exact approaches for the
VRPMTW contrasts with the adoption of exact approaches for the VRPTW.
As pointed out by [1], many exact methods are available for the latter. In recent
research, [10] suggested that constraint programming (CP) is a viable alterna-
tive to more traditional MIP models and solvers in scheduling problems. This
finding is relevant to the VRP context, as considering time windows and service
times shares similarities to scheduling. [11] previously demonstrated the suit-
ability, who applied CP to solve the traveling salesman problem with multiple
time windows. Additionally, [12] employed a column generation approach for the
VRPTW and integrated CP for solving the pricing problem.
This paper is the first approach that applies a CP formulation in the context
of the VRPMTW. In addition, we show that fixing variables and additional
interval variables improve the performance of the CP model. The CP model
outperforms a comparable MIP formulation regarding the number of instances
optimally solved, the time required to prove optimality, and remaining gap after
a given runtime. We achieve proven optimality for almost all instances with 20
customers and for more than half of the instances with 25 customers. The work is
structured as follows. In Sect. 2, the CP formulation is presented. Computation
tests are discussed in Sect. 3. Finally, a conclusion is presented in Sect. 4.
A CP Model for VRP with Multiple Time Windows 309
1
https://github.com/google/or-tools.
310 F. Linß
The arc set is defined as A = {(i, j)|i = j, i, j ∈ V}. Each arc (i, j) ∈ A
is associated with a non-negative travel time τij and travel costs γij . All time
windows ending before the earliest possible start of service and starting after the
latest possible end of service can be removed from Ti of customer i. The earliest
possible start of service is defined as α01 + τ0i and the latest possible end can be
calculated by T − σi − τi0 . We modify the new first and last time windows as
follows: αi1 = max{αi1 , τ0i } and βiθi = min{βiθi , T − σi − τi0 }.
The vehicle set K denotes a homogeneous fleet of K vehicles with a freight
capacity of κ and fixed vehicle costs γ f . Following the definition in [13], the
VRPMTW is considered in two variants, which differ in their objective function.
The first variant aims to minimize vehicle and travel costs. In contrast, the
second variant additionally considers waiting times in case a vehicle arrives at
a customer when there is no time window open. The costs for waiting time
are described by γ w per time unit. To distinguish between both variants, the
parameter B is used to differentiate between them: B = 1 indicates waiting
times are considered, and B = 0 means they are not.
Table 2 summarizes the definition of the decision variables. The vehicle flow
in the graph is described by the boolean variable xij indicating whether an arc
(i, j) ∈ A is used or not. Additionally, the boolean variable yik defines the assign-
ment of customer i to a vehicle k. For convenience and efficient determination
of the number of vehicles, the variable vi defines the vehicle index for each cus-
tomer i, and the variable r denotes the number of vehicles used. The variable
si describes the start time of a service at a vertex i. The time windows for
each vertex are expressed by specifying multiple disjoint domains in the variable
definition. With this notation, we state a CP model for the VRPMTW:
min γ f r + γij xij + B γ w wi (1)
(i,j)∈A j∈C
The objective function (1) minimizes total costs consisting of total fixed vehi-
cle costs, total travel costs, and total waiting costs in case of B = 1. The vehicle
flow is modeled in constraint (2) using the global function MultipleCircuit
implemented in Google OR-tools. The constraint ensures that every customer
vertex has exactly one incoming and one outgoing arc. Additionally, vertex 0
has the same number of incoming and outgoing arcs, and cycles not including
the depot are prohibited. The global functions ExactlyOne in constraints (3)
ensure that exactly one variable of a set of boolean variables must be true. This
relation ensures that only one vehicle is assigned to each customer.
Constraint (4) connects the number of vehicles used with the maximum vehi-
cle index. Constraint (5) guarantees that the number of vehicles used corresponds
to the sum of outgoing arcs from the depot. Constraints (6) ensure that each
vehicle’s capacity is respected. Constraints (7) set vehicle index variables vi
based on binary variables yik . Constraints (8) enforce that the head and tail of a
traveled arc connecting customers have the same vehicle index. The scheduling
of services is respected in constraints (9). If an arc xij is used, the service at its
tail j must start at the arrival time si + σi + τij plus a potential waiting time
wj .
Constraints (10) defines that if two customers are assigned to two different
routes, i.e., x0i ∧ x0j , their vehicle indices must differ, too. Similarly, we specify
the vehicle index of a customer cannot be k if the customer is not assigned
to vehicle k, see constraints (11). Since all vehicles are homogeneous, the first
customer is assigned to the first vehicle by the symmetry breaking condition (12).
312 F. Linß
Table 3 summarizes the definition of the additional variables. Since each vehi-
cle starts at the depot, separate interval variables are defined for each vehi-
cle. More precisely, the integer variable s0k is introduced to define the start
time of a vehicle k at the depot. With integer variables t0k and a0k denoting
the travel time arrival time at the first customer, S0k is defined as
to and
IntervalVar s0k , t0k , a0k . Since each customer is visited by one vehicle, exactly
one interval variable is present per customer. Therefore, optional interval vari-
ables are introduced for each vehicle and customer. Their presence depends
on the assignment of customers to vehicles. The variables ti and ai are intro-
duced as travel to and arrival time at the next vertex for each customer i.
Depending on whether a customer is assigned to a vehicle, the interval variable
optIntervalVar (yik , [si , σi + ti , ai )) is only present for vehicle k if yik is true.
In Fig. 1, the definitions above are illustrated by the interval variables of an
example route, see Fig. 1b. The gantt chart in Fig. 1a shows each vertex’s service
time and travel time to the next vertex. Since the start times are only defined
within the time windows, only an overlap of the intervals has to be prevented
for a feasible schedule. This can be enforced efficiently with global constraints
like NoOverlap. The corresponding constraints are listed below.
A CP Model for VRP with Multiple Time Windows 313
Fig. 1. The schedule of an example route illustrates the definition of interval variables.
The length and the end of the optional interval variables of the customers
are defined with the constraints (13) and (14). If j is visited after i, ti is equal
to the travel time τij and the service start sj at j must be after ai . Similarly,
the length and the end of the interval variable at the depot are described by
constraints (15) and (16). If j is the first customer visited by vehicle k, t0k must
be equal to the travel time τ0j . Since each vehicle can start every time within
the planning horizon, the first customer’s service start sj equals a0k , and no
waiting times occur. Finally, overlapping of the intervals can be prevented for
each vehicle with the global constraint NoOverlap, see constraint (17).
314 F. Linß
3 Computational Tests
3.1 Experimental Design
We apply the MIP formulation introduced in [13] to validate and assess the CP
model. We use the CP-SAT solver of Google OR-Tools (9.4) to solve the CP
model and Gurobi (10.0.0) to solve the MIP formulation. Both approaches are
implemented in C++ using MSVC v143. All tests were run with a limit of 8
parallel threads on an AMD EPYC 7513 with 2.6 GHz clock speed and 64 GB
RAM. Each instance is run three times with different seed values to account for
the variability in solver performance. Apart from that, both solvers were run
with their default parameters. The runtime is limited to 3,600 s for each run.
We use the small to medium-sized instances provided by [13] in our experi-
ments. These instances were derived from the instances presented in [2]. Specif-
ically, the first 10, 15, 20, 25, and 50 customers from the instances cm101,
cm105, cm201, cm205, rm101, rm105, rm201, rm205, rcm101, rcm105, rcm201,
and rcm205 were selected to create a total of 60 instances. The instances are
classified according to three metrics for the customers’ location: clustered (cm),
random (rm), and random clustered (rcm). Additionally, the first digit of the
instance name indicates whether the vertices have short (1) or long (2) time
windows. Since the CP model is only defined for integer domains, all parameter
values were scaled with 104 and rounded.
3.2 Results
Model Comparison. To evaluate the impact of fixing the variables and adding
interval variables, we solve all instances using the base formulation without
model strengthening (CPBase ) and with model strengthening (CPFull ). In pre-
liminary tests, we observed that CPFull had difficulties finding a feasible solution
for large instances. Therefore, CPBase was solved for maximal 60 s as a starting
solution and passed to CPFull .
The results are shown in Table 4. The average values are grouped by their
instance size, i.e., the number of customers. Apart from the count of optimally
solved runs out of the 36 runs (12 instances × 3 runs per instance), the remaining
gaps and runtime are computed considering the average of all runs. Analyzing the
optimal objective function values shows the validity of the CP model compared
to the MIP model. We were able to prove optimality for the first time in a total
of 18 instances. For this, we refer to Table 6 in the Appendix for more detailed
results for all instances.
A CP Model for VRP with Multiple Time Windows 315
Although all models can optimally solve the small instances with 10 cus-
tomers, a comparison between MIP and CPBase reveals that the latter performs
better regarding average gap and runtime over all instances. More considerable
differences can be seen for CPFull . For example, instances with up to 15 cus-
tomers are solved within a few seconds, and, unlike other formulations, all but
one of the 20 customer instances can be solved optimally. However, as the num-
ber of customers exceeds 25, the increasing complexity of the problem becomes
apparent in CPFull . This is reflected by higher runtimes and decreased instances
for which optimality can be proven. Despite these challenges, CPFull is still the
most effective formulation for larger instances.
Comparing the two objective functions shows that including waiting costs
makes the optimality proof more difficult since the runtimes are more extensive
as only minimizing vehicle and travel costs. In summary, the CP model is clearly
better than the MIP formulation due to the use of variable interval variables.
Therefore, we will refer to the CPFull model as the CP model in the following.
one–third of the vehicle capacity for both routes in the optimal solution. The
fixed costs in the benchmark instances are very high. Therefore, a solution with
fewer vehicles leads to lower costs [2]. However, the time windows of the depot
and the customers do not permit a feasible sequence with only one vehicle, even
though the volume capacity is sufficient to visit all customers using one vehicle.
Fig. 2. Optimal solution of instance rm101 with 10 customers presented by the sched-
ules and the volume utilization of the routes.
Fig. 3. Comparison of MIP and CP solver according to the average gap and runtime.
The instances are divided into a set of 44 instances, where the lower bound for the
number of required vehicles calculated by volume approximation (LBVol ) equals the
lower bound calculated by the CP solver (LBCP ) and a set of 16 instances, where
LBVol < LBCP .
also be halved. It becomes evident that the CP model can determine the LB for
the number of vehicles much better than the MIP. As a result, the CP model
leads to better solutions overall because the number of vehicles greatly influences
the objective function due to high fixed costs.
Table 5. Average remaining gap G in [%] after 3.600 s, average runtimes (T) in [s], and
the average number of vehicles used (V) grouped by problem variant and time window
lengths per instance metric (20 instances × 3 runs) are displayed.
Interestingly, for both problem variants, the random clustered instance with
short time windows can be solved easier than solving the remaining instances.
318 F. Linß
4 Conclusion
This paper studies the vehicle routing problem with multiple time windows
(VRPMTW). The goal is to visit all customers with multiple time windows and
start and end within the depot time window. Two variants of the VRPMTW
are considered with different objective functions: the first variant aims to mini-
mize vehicle and travel costs. In contrast, the second variant considers waiting
times in case a vehicle arrives at a customer when there is no time window
open, in addition to vehicle and travel costs. We developed a constraint pro-
gramming (CP) model with additional model strengthening by fixing variables
and variable interval variables to address this problem. Experimental tests on
benchmark instances reveal that model strengthening improves the performance
of the CP model. As a result, the CP model performs considerably better than
a comparable mixed–integer programming (MIP) model. In total, we solved 18
instances optimally for the first time. We achieve proven optimality for almost
all instances with 20 customers and more than half with 25 customers. When
reaching the time limit, the gap values are halved compared to the MIP model.
In addition, we performed a more detailed analysis to better explain the differ-
ence in performance between the two models and to account for the influence of
instance characteristics on the CP model’s solution performance. The feasibility
of the routes is defined by two factors: the respect to the vehicle capacity and the
feasible sequence so that each customer is visited one of the time windows. The
CP model is particularly well suited for instances where customer time windows
constrain the number of required routes in addition to the vehicles’ capacity. This
way, solutions with too few vehicles can be excluded more quickly. Nevertheless,
instances with longer time windows and, thus, more possible combinations of
customers are challenging to solve.
Observations reveal that in instances with 50 customers, there are large dif-
ferences in the solutions found by the CP solver between different runs on the
same instance. To improve the upper bound in such instances, embedding other
solution methods, such as heuristics, is a promising approach. Another extension
involves the integration of the CP model into a sophisticated solution approach.
The results regarding the lower bound of the required vehicles motivate using
the CP model to solve the traveling salesman problem with multiple time win-
dows as a feasibility problem on subsets of customers. Efficient feasibility checks
A CP Model for VRP with Multiple Time Windows 319
through the CP model can determine that these customers must be served by at
least two vehicles if their combination is infeasible. This information can be used
to create cutting planes, e. g. cover inequalities in a branch-and-cut approach.
Appendix
Table 6. Detailed results of the best runs of MIP and CP for both problem variants
according to objective value (λ), remaining gap (G) in [%] after the time limit 3,600 s
(TL), and runtime (T) in [s].
Inst. C B =0 B =1
MIP CP MIP CP
λ G T λ G T λ G T λ G T
cm101 10 485.3 0.0 3174 485.3 0.0 9.9 490.8 0.0 1662 490.8 0.0 62
cm105 10 486.0 0.0 58 486.0 0.0 0.7 488.5 0.0 38 488.5 0.0 4.8
cm201 10 833.1 0.0 0.5 833.1 0.0 0.3 834.9 0.0 0.4 834.9 0.0 0.5
cm205 10 834.9 0.0 0.5 834.9 0.0 0.3 835.3 0.0 0.3 835.3 0.0 0.8
rcm101 10 566.0 0.0 0.4 566.0 0.0 0.3 566.0 0.0 0.2 566.0 0.0 0.4
rcm105 10 566.8 0.0 1.2 566.8 0.0 0.3 569.5 0.0 0.2 569.5 0.0 0.8
rcm201 10 1137.8 0.0 1.0 1137.8 0.0 0.2 1137.8 0.0 0.2 1137.8 0.0 0.3
rcm205 10 1137.8 0.0 0.4 1137.8 0.0 0.2 1137.8 0.0 0.1 1137.8 0.0 0.3
rm101 10 598.2 0.0 54 598.2 0.0 0.5 606.7 0.0 30 606.7 0.0 3.5
rm105 10 595.4 0.0 125 595.4 0.0 1.9 595.4 0.0 40 595.4 0.0 27
rm201 10 1173.0 0.0 0.1 1173.0 0.0 0.2 1189.1 0.0 1.3 1189.1 0.0 0.9
rm205 10 1173.0 0.0 0.1 1173.0 0.0 0.2 1173.0 0.0 0.1 1173.0 0.0 0.3
Mean 10 799.0 0.0 285 799.0 0.0 0.2 802.1 0.0 148 802.1 0.0 0.5
cm101 15 537.8 0.0 9.6 537.8 0.0 3.9 555.3 0.0 263 555.3 0.0 148
cm105 15 538.1 0.0 4.1 538.1 0.0 1.6 538.1 0.0 4.2 538.1 0.0 6.0
cm201 15 870.6 0.0 3.3 870.6 0.0 1.1 873.7 0.0 4.6 873.7 0.0 5.9
cm205 15 883.2 0.0 10 883.2 0.0 4.1 883.2 0.0 4.8 883.2 0.0 7.8
rcm101 15 586.9 0.0 2.3 586.9 0.0 0.7 589.3 0.0 1.0 589.3 0.0 2.0
rcm105 15 588.5 0.0 3.2 588.5 0.0 1.1 594.5 0.0 1.2 594.5 0.0 4.6
rcm201 15 1153.6 0.0 3.9 1153.6 0.0 0.4 1153.6 0.0 0.5 1153.6 0.0 0.8
rcm205 15 1153.6 0.0 28 1153.6 0.0 0.4 1156.2 0.0 0.9 1156.2 0.0 2.4
rm101 15 878.2 19.8 TL 878.2 0.0∗ 86 883.8 21.3 TL 883.8 0.0∗ 86
rm105 15 661.5 0.0 2.9 661.5 0.0 1.3 664.9 0.0 1.1 664.9 0.0 4.2
rm201 15 1229.2 0.0 0.3 1229.2 0.0 0.3 1237.2 0.0 1.4 1237.2 0.0 1.6
rm205 15 1229.2 0.0 0.4 1229.2 0.0 0.6 1229.2 0.0 0.2 1229.2 0.0 0.6
Mean 15 859.2 1.7 306 859.2 0.0 2.9 863.2 1.8 324 863.2 0.0 11
cm101 20 801.7 27.7 TL 801.7 0.0∗ 1236 839.0 29.1 TL 832.0 7.2 TL
cm105 20 801.1 28.0 TL 801.1 0.0∗ 558 802.6 28.5 TL 802.6 0.0∗ 798
cm201 20 1599.0 44.3 TL 1599.0 0.0∗ 65 1599.0 44.3 TL 1599.0 0.0∗ 68
cm205 20 1607.2 44.5 TL 948.7 6.7 TL 1608.7 44.4 TL 948.7 0.0∗ 2059
rcm101 20 883.7 0.0 6.4 883.7 0.0 1.7 887.3 0.0 1.7 887.3 0.0 4.5
rcm105 20 885.2 0.0 4.2 885.2 0.0 2.4 895.8 0.0 6.3 895.8 0.0 9.4
rcm201 20 1219.4 0.0 6.2 1219.4 0.0 0.8 1219.4 0.0 0.9 1219.4 0.0 1.2
(continued)
320 F. Linß
Table 6. (continued)
Inst. C B =0 B =1
MIP CP MIP CP
λ G T λ G T λ G T λ G T
rcm205 20 1219.4 0.0 6.3 1219.4 0.0 0.7 1229.6 0.0 194 1229.6 0.0 111
rm101 20 922.4 21.5 TL 922.4 0.0∗ 76 938.1 22.8 TL 938.1 0.0∗ 103
rm105 20 906.2 21.2 TL 906.2 0.0∗ 76 906.2 21.5 TL 906.2 0.0∗ 74
rm201 20 1262.3 0.0 0.7 1262.3 0.0 0.7 1280.4 0.0 98 1280.4 0.0 17
rm205 20 1262.3 0.0 0.5 1262.3 0.0 1.0 1262.3 0.0 0.7 1262.3 0.0 1.6
Mean 20 1114.2 15.6 1802 1059.3 0.6 443 1122.4 15.9 1825 1066.8 0.6 543
cm101 25 847.2 5.5 TL 825.2 6.3 TL 888.8 8.1 TL 876.3 11.7 TL
cm105 25 823.6 2.0 TL 823.6 5.8 TL 825.2 2.3 TL 825.2 6.0 TL
cm201 25 1629.8 44.6 TL 1626.9 0.0∗ 195 1629.9 44.7 TL 1628.7 0.0∗ 378
cm205 25 1619.7 44.2 TL 1023.3 11.7 TL 1624.8 44.3 TL 1624.8 44.5 TL
rcm101 25 895.0 0.0 5.6 895.0 0.0 3.3 898.6 0.0 4.8 898.6 0.0 5.4
rcm105 25 898.3 0.0 5.5 898.3 0.0 6.3 912.4 0.0 35 912.4 0.0 50
rcm201 25 1226.1 0.0 20 1226.1 0.0 1.2 1235.7 0.0 546 1235.7 0.0 23
rcm205 25 1226.1 0.0 24 1226.1 0.0 1.2 1255.2 1.7 TL 1255.2 2.3 TL
rm101 25 1185.8 35.7 TL 1185.8 19.8 TL 1209.3 36.8 TL 1209.3 21.5 TL
rm105 25 960.1 21.2 TL 960.1 0.0∗ 66 976.2 22.8 TL 976.2 0.0∗ 231
rm201 25 1316.9 0.0 13 1316.9 0.0 2.3 1342.0 0.8 TL 1342.0 0.0∗ 243
rm205 25 1313.3 0.0 1.6 1313.3 0.0 1.6 1315.7 0.0 1.4 1315.7 0.0 5.3
Mean 25 1161.8 12.8 1806 1110.1 3.6 1213 1176.2 13.5 2449 1175.0 7.2 1562
cm101 50 1868.3 27.8 TL 1683.7 21.5 TL 1833.1 25.4 TL 1913.7 31.0 TL
cm105 50 1657.5 18.4 TL 1650.0 19.7 TL 1658.1 17.5 TL 1687.1 21.4 TL
cm201 50 3234.6 46.8 TL 3230.0 23.9 TL 3263.6 46.5 TL 3247.4 24.3 TL
cm205 50 2567.5 32.3 TL 2524.5 30.9 TL 2559.3 31.7 TL 2530.7 31.1 TL
rcm101 50 1525.8 0.0 261 1525.8 0.0 232 1537.3 0.0 440 1537.3 0.0∗ 980
rcm105 50 1548.5 0.0 1363 1548.5 0.0 557 1566.0 1.1 TL 1568.2 2.1 TL
rcm201 50 2398.4 43.3 TL 2398.4 42.9 TL 2448.2 44.0 TL 2423.0 43.5 TL
rcm205 50 2398.4 43.4 TL 2398.4 42.9 TL 2430.5 43.5 TL 2431.1 43.7 TL
rm101 50 1885.5 29.6 TL 2063.6 26.7 TL 2109.7 36.8 TL 1922.5 21.4 TL
rm105 50 1797.0 26.4 TL 1794.2 15.8 TL 1793.7 25.9 TL 1806.2 16.4 TL
rm201 50 2488.6 42.3 TL 2485.7 41.2 TL 2664.3 45.1 TL 2540.5 42.5 TL
rm205 50 2478.1 41.7 TL 1486.7 1.7 TL 2479.7 41.0 TL 1487.0 1.7 TL
Mean 50 2154.0 29.3 3136 2065.8 22.3 3068 2195.3 29.9 3337 2091.2 23.2 3390
Note. Start solution time is excluded for CP. Bold indicates the minimum gap per
instance and variant. If both methods prove optimality, bold indicates minimum
runtime. ∗ denotes instances that the CP model solved optimally for the first time.
References
1. Baldacci, R., Mingozzi, A., Roberti, R.: Recent exact algorithms for solving the
vehicle routing problem under capacity and time window constraints. Eur. J. Oper.
Res. 218(1), 1–6 (2012). https://doi.org/10.1016/j.ejor.2011.07.037
2. Belhaiza, S., Hansen, P., Laporte, G.: A hybrid variable neighborhood tabu search
heuristic for the vehicle routing problem with multiple time windows. Comput.
Oper. Res. 52, 269–281 (2014). https://doi.org/10.1016/j.cor.2013.08.010
3. Belhaiza, S., M’Hallah, R., Brahim, G.B.: A new hybrid genetic variable neigh-
borhood search heuristic for the vehicle routing problem with multiple time win-
dows. In: 2017 IEEE Congress on Evolutionary Computation (CEC). IEEE (2017).
https://doi.org/10.1109/cec.2017.7969457
A CP Model for VRP with Multiple Time Windows 321
4. Bogue, E.T., Ferreira, H.S., Noronha, T.F., Prins, C.: A column generation and
a post optimization VNS heuristic for the vehicle routing problem with multiple
time windows. Optim. Lett. 16(1), 79–95 (2020). https://doi.org/10.1007/s11590-
019-01530-w
5. Hoogeboom, M., Dullaert, W., Lai, D., Vigo, D.: Efficient neighborhood evaluations
for the vehicle routing problem with multiple time windows. Transp. Sci. 54(2),
400–416 (2020). https://doi.org/10.1287/trsc.2019.0912
6. de Jong, C., Kant, G., Van Vlient, A.: On finding minimal route duration in the
vehicle routing problem with multiple time windows. Manuscript, Department of
Computer Science, Utrecht University, Holland (1996)
7. Laborie, P., Rogerie, J., Shaw, P., Vilı́m, P.: Reasoning with conditional time-
intervals. Part II: an algebraical model for resources. In: FLAIRS Conference, pp.
201–206 (2009)
8. Larsen, R., Pacino, D.: Fast delta evaluation for the vehicle routing problem with
multiple time windows (2019). https://doi.org/10.48550/ARXIV.1905.04114
9. Liu, C., Aleman, D.M., Beck, J.C.: Modelling and solving the senior transportation
problem. In: van Hoeve, W.-J. (ed.) CPAIOR 2018. LNCS, vol. 10848, pp. 412–428.
Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93031-2 30
10. Naderi, B., Ruiz, R., Roshanaei, V.: Mixed-integer programming vs. constraint
programming for shop scheduling problems: new results and outlook. INFORMS
J. Comput. (2023). https://doi.org/10.1287/ijoc.2023.1287
11. Pesant, G., Gendreau, M., Potvin, J.Y., Rousseau, J.M.: On the flexibility of con-
straint programming models: from single to multiple time windows for the traveling
salesman problem. Eur. J. Oper. Res. 117(2), 253–263 (1999). https://doi.org/10.
1016/s0377-2217(98)00248-3
12. Rousseau, L.M., Gendreau, M., Pesant, G., Focacci, F.: Solving VRPTWs with
constraint programming based column generation. Ann. Oper. Res. 130(1–4), 199–
216 (2004). https://doi.org/10.1023/b:anor.0000032576.73681.29
13. Schaap, H., Schiffer, M., Schneider, M., Walther, G.: A large neighborhood search
for the vehicle routing problem with multiple time windows. Transp. Sci. 56(5),
1369–1392 (2022). https://doi.org/10.1287/trsc.2021.1120
14. Solomon, M.M.: Algorithms for the vehicle routing and scheduling problems with
time window constraints. Oper. Res. 35(2), 254–265 (1987). https://doi.org/10.
1287/opre.35.2.254
15. Voigt, S., Frank, M., Fontaine, P., Kuhn, H.: The vehicle routing problem with
availability profiles. Transp. Sci. (2022). https://doi.org/10.1287/trsc.2022.1182
A Variable Neighborhood Search Algorithm
for the Truck-Drone Routing Problem
Abstract. The rise in e-commerce sales has increased the demand for logistics
companies to handle the problem of last-mile delivery, which is complex and
incurs large operational expenses. The use of drones in the development of modern
distribution networks opens new avenues for improving existing delivery systems.
The integration of drones with traditional delivery trucks that form the truck-drone
systems are being proposed by logistics firms for use in delivery procedures. In
this paper, we consider a single truck and drone delivery system and address the
associated routing problem. As a result, a variable neighborhood search algorithm
is developed and tested on small and medium instances. The algorithm shows
its potential ability to handle instances in a short time. Finally, the introduced
truck-drone delivery system with multiple visits per drone dispatch generates
cost-savings in comparison with the case of a single visit per delivery trip.
1 Introduction
The rapid evolution of e-commerce has intensified the need for logisticians to resolve the
problem of conveying goods from transportation hubs to their final destinations, known as
the Last-Mile Delivery problem (LMD) (Boyer et al. 2009). It is the last leg of a product’s
journey, nevertheless the most challenging, with the highest transportation cost across
all distribution networks (Gevaers et al. 2009). The inclusion of new technologies such
as drones may help in overcoming these challenges.
Drones are now widely used in a variety of applications, including logistics, health-
care, and precision agriculture (Scott and Scott 2017; Mogili and Deepak 2018; Murray
and Chu 2015). In this context, major drone manufacturers such as Zipline, Wing, Prime
Air, Flirtey, and others have expressed interest in incorporating drone technologies into
their operations (Wood 2019). For instance, Prime Air, developed by Amazon, can satisfy
customers’ orders within a 16 km radius of a fulfillment center by navigating using the
onboard global positioning system (Lavars 2013). Similarly, Zipline drones have made
over 13,000 deliveries across Rwanda, demonstrating their humanitarian capabilities by
transporting blood for transfusions (Dragolea 2019).
Although drones have shown potential in improving delivery operations, they are
limited by their technical characteristics such as payload capacity and flying range. As a
result, drones and delivery trucks can work in tandem to provide a more flexible delivery
system, resulting in a truck-drone delivery system.
Truck-drone routing problems are viewed as extended versions of the classical Vehi-
cle Routing Problems (VRP), which calls for the development of efficient solution meth-
ods to address the NP-hard nature of the related routing problem. Therefore, this study
considers a specific setting of the truck-drone delivery system and proposes a solution
method based on the variable neighborhood search (VNS) algorithm to solve the routing
considerations. The main contribution of this work is as follows.
1. Developing an efficient VNS algorithm for a truck-drone delivery system where the
drone is able to perform multiple visits per dispatch.
2. Conducting computational experiments to assess the algorithm’s performance using
small and medium instances.
The remainder of the paper is structured as follows. Section 2 presents a review
of the truck-drone routing problems-related literature. Section 3 provides the proposed
model description while Sect. 4 demonstrates the solution approach. The computational
analysis is given in Sect. 5 and finally, Sect. 6 concludes and provides future research
prospects.
2 Related Literature
The exploration of truck-drone systems is definitely appealing to academics and practi-
tioners. According to UPS, truck-drone systems might save up to $50 million per year
by reducing every driver’s route by one mile every day (UPS 2017). Amazon has also
been granted a patent for a truck-drone system in which the truck functions as a mobile
facility for operating and repairing the drone (Wohlsen 2014).
The truck-drone problems-related literature has gained significant attention due to
the various complexities associated with modeling and solving optimization problems.
Interested readers may refer to (Madani and Ndiaye 2022; Chung et al. 2020; Boysen et al.
2021) reviews, among others, to understand the current state-of-the-art of truck-drone
routing problems.
Researchers’ efforts are devoted to studying the logistical uses of drones in enhanc-
ing the delivery processes and tackling the LMD challenges. These studies take into
consideration the operational limitations of a drone and the routing optimization of a
combined truck-drone system. In this context, Murray and Chu (2015) are among the first
to investigate the flying side-kick traveling salesman problem (FSTSP), which considers
a set of customers to be served once by either a truck or a drone, with the truck only
fulfilling customer deliveries that the drone cannot serve. The FSTSP has been extended
and solved in different ways by (Jeong et al. 2019; Windras Mara et al. 2022; Liu et al.
2022; Boccia et al. 2021; de Freitas and Penna 2018; Dell’Amico et al. 2019; Murray
and Raj 2020), among others. For instance, Dell’Amico et al. (2019) proposed a new for-
mulation for the FSTSP that uses two and three-indexed formulations and strengthened
it by a set of valid inequalities and finally solved using branch-and-cut. This approach
yielded solutions in a relatively shorter time frame. Murray and Raj (2020) considered a
fleet of heterogeneous drones, each of which varies in terms of weight, volume capacity,
and endurance.
324 B. Madani and M. Ndiaye
Murray and Chu (2015) suggested the parallel drone scheduling traveling salesman
problem (PDTSP) as another form of the truck-drone delivery problem. In particular, a
truck and one or more drones perform delivery services independently, departing and
returning to the depot simultaneously. The objective is to reduce the time required to
service all customers and allow all vehicles to return to the depot, which was solved
using exact solution methods and a saving heuristic. Lei and Chen (2022) designed an
improved VNS algorithm that is based on reduced VNS for solving the PDSTP. The
algorithm includes 9 neighborhood structures covering operators such as relocate, 1-1
exchange, 2-2 exchange, and other operators tailored to the specific problem.
In a different approach, Agatz et al. (2018) introduced the traveling salesman prob-
lem with a drone (TSP-D), in which the drone must follow the truck’s road network and
may be launched and returned to the same location, which is not allowed in the FSTSP.
The objective of the TSP-D is to minimize the time required to serve all customers or
minimize the arrival time to the depot. The VRP with a drone (VRP-D) considers the
shared delivery operations between multiple trucks and multiple drones visiting a single
customer location per dispatch. Wang et al. (2017) introduced the VRP-D which deals
with multiple trucks and drones with the goal of minimizing the completion time, in
which the drones can be dispatched from and picked up only at the depot and customers’
nodes. The analysis was conducted on several worst-case scenarios, from which they
propose bounds on the best possible savings in terms of completion time when using
trucks and drones instead of trucks only in delivering parcels to customers. Further devel-
opment of this research was studied by Poikonen et al. (2017), where they extended the
worst-case bounds to more generic distance/cost metrics. Also, they explicitly consider
the limitation of battery life and cost objectives. Sacramento et al. (2019) investigated
the VRP-D with each truck equipped with a single drone and developed an adaptive large
neighborhood search algorithm (ALNS). Kuo et al. (2022) studied a similar problem as
Sacramento et al. (2019) but considered the time-window aspect and developed a VNS
solution.
In the above studies, trucks and drones share the responsibility of fulfilling cus-
tomer deliveries, while permitting the launching/collecting of drones to take place at the
customer nodes visited by the truck.
Another truck-drone configuration is to limit the delivery role to the drone while the
truck acts as a moving warehouse for launching, collecting, and refilling with the drone.
In such cases, the drone’s launching and collection operations occur at non-customer
nodes such as drone stations/hubs, nodes along the truck route, and others. This system
configuration can maximize the potential use of drones as they are not restricted to
being launched from customer nodes which may not have all the proper facilities for
launching/collecting the drone. It is also important to stress that few studies limited the
launching/collection to non-customer nodes (Karak and Abdelghany 2019; Poikonen
and Golden 2020a, b; Marinelli et al. 2018; Carlsson and Song 2018; Wang and Sheu
2019). In addition, the majority of the literature considers a single drone capacity while
it would be interesting to investigate the case of allowing the drone to perform multiple
visits per dispatch.
A Variable Neighborhood Search Algorithm 325
Therefore, this study considers the second configuration of the truck-drone deliv-
ery system while considering drone characteristics such as payload capacity and flying
endurance.
3 Model Description
Given a single truck equipped with a single drone, the task is to deliver packages to
a given set of customers. Each customer must be visited exactly once by the drone
operating in conjunction with the truck while the truck acts as a platform for refilling,
launching, and collecting the drone. The truck with its drone must depart and return to a
single depot. The launching and collecting operations can take place at a set of potential
locations. The objective is to determine the sequencing of the deliveries by drone while
minimizing the total traveling cost of the delivery system. Several assumptions are taken
into account and are described below.
• Drones can handle multiple deliveries per dispatch, according to their maximum
payload capacity Q and flying endurance E.
• Customer demands are based on the weight of the packages.
• Trucks are assumed to have a large enough capacity to transport both parcels and
drones during the whole operation.
• The drone may be launched/collected from the depot or at any of the launch-
ing/collecting potential locations.
• The drone can be collected at the same location from where it was launched or at any
other location.
• The truck must precede the drone at the collection location for safety purposes. In
other words, the arrival time of the truck at the collection location must be before that
of the drone.
Figure 1 demonstrates the delivery operation performed by the truck and the drone.
The first step in developing the VNS algorithm is to examine the VNS literature in
the context of truck-drone routing models. Table 1 summarizes the literature on VNS for
solving the routing considerations of truck-drone systems. It can be clearly observed that
only a few research have adopted the VNS algorithm. This necessitates the use of VNS
in such problems to investigate its capabilities in the context of truck drone problems. In
a study conducted by Kuo et al. (2022), the VNS proves its ability in providing quality
solutions for problems related to truck-drone routing compared to other metaheuristics
such as the ALNS. Therefore, it is worth exploring the adoption of VNS in this study.
The structure of the VNS algorithm is divided into three elements, namely, the
initial solution, neighborhood structures, and finally local search strategy. Next, each of
the three elements is presented in detail.
A Variable Neighborhood Search Algorithm 327
The initial solution procedure is based on selecting the launching and collecting nodes
of each drone trip according to the drone’s characteristics (payload capacity and flying
endurance). The approach of the initial solution chooses the depot as the launching node
of the first drone’s delivery trip and assigns a customer node. If no customer node exists
within the flight endurance of the drone from the depot, the next potential location is
chosen, and so on. Following the selection of the launching point, and assuming that
the payload limit has not been exceeded, a customer node can be added to the drone’s
delivery trip, or the current drone trip is terminated. After that, the collecting location
of the drone is selected according to the drone’s flying endurance limit. A new drone
delivery trip can be initiated, and the same steps are repeated until all customers are
served by the drone.
In this algorithm, eight neighborhood structures (Kmax = 8), which are a combination
of traditional neighborhoods and problem-specific neighborhoods, are used. It is worth
noting that some of these neighborhoods correspond to the well-known VRP local search
operators. The list of neighborhoods is presented below.
• N 1 : Select a random customer node from two drone delivery trips and swap them.
This neighborhood is similar to 1-1 exchange operators where the positions of a pair
of nodes are swapped.
• N 2 : Select a random drone trip and split it into two delivery trips. In particular, a
random trip is chosen, and a new stopping location is added at random between the
initial launching and collecting nodes, splitting the initial trip.
• N 3 : Select two random drone delivery trips and combine them into a single delivery
trip.
• N 4 : Remove a random customer node from a drone delivery trip and insert it in
another drone trip. This move is related to relocate operator in that a node is injected
from its original position and relocated to another.
• N 5 : Select two random drone delivery trips and swap their order in the list of drone
trips.
• N 6 : Select a random drone trip and swap the launching and collecting locations. If
they are similar, this neighborhood is neglected.
• N 7 : Select a random drone trip and swap two customer nodes. This is comparable to
the 1-1 exchange operator, except it occurs within the same route (exchange intra-
route).
• N 8 : Select a random drone trip where the launching and collecting nodes are different
and make the launching location similar to the collecting one.
5 Computational Experiments
In this section, the performance of the proposed VNS algorithm is assessed using
instances derived from Sacramento et al. (2019). We first present the description of
instances and the inputs used in the algorithm followed by the performance evaluation
and additional analysis.
The algorithm is coded using the C++ programming language; all the runs and
experiments were performed on a 64-bit operating system with an x64-based processor
that is Intel(R) Core™ i7-8565U CPU @1.80 GHz and 8 GB memory.
a maximum payload capacity Q of 5 kg (∼10 lbs) and a flying range E of 30 min, which
are the typical characteristics of a drone. Finally, the drone’s speed is set to 50 mph
(Sacramento et al. 2019).
The instances used in the VNS performance evaluation are based on Sacramento
et al. (2019) instances, which are formed on a grid size of 2d × 2d where d is 5, 10, 20,
30, or 40 and follow a uniform distribution. Finally, customer packages are set to weigh
less than 5 kg and follow a uniform distribution.
Given that Sacramento et al. (2019) presented the VRP-D where the launch-
ing/collecting nodes are customer nodes, we added additional nodes representing the
potential launching/collecting location to each instance. We followed the p-median app-
roach to generate the potential locations used as possible launching/collecting nodes.
The number of potential locations is selected based on the number of customer nodes in
each instance size. In this analysis, we chose 30 instances of size 6, 10, 12, 20, and 50
customer nodes to apply the modification on. The modified instances are named S[no.
potential locations (including depot)].[no. customer nodes].[grid dimension].[instance
number].
To assess the performance of the VNS algorithm, we use modified instances and obtain
the solution for each. Each instance is computed 10 times (10 runs) and the average C Avg
and best C best solutions are recorded. In addition, the average computational time is docu-
mented t Avg in seconds. Finally, the maximum number of iterations (maximumiteration)
is fixed to 100 for all instances. Table 2 summarizes the results of the 30 modified
instances.
The results show that the VNS generates solutions for instances with up to 50 cus-
tomer nodes in less than 1 s. Add to that, the algorithm provides relatively consistent
results given the difference between C best and C Avg in each instance. In terms of compu-
tational time, the suggested approach outperforms the previous VNS algorithms found
in truck-drone literature (Lei and Chen 2022; de Freitas and Penna 2018 2020; Schermer
et al. 2019; Kuo et al. 2022). Nevertheless, comparing to other heuristics based on com-
putational time is insufficient and requires evaluation based on solution quality. Indeed,
the suggested algorithm must be benchmarked against other algorithms or exact solution
methods to assess its solution quality, which will be part of our future work.
5.3 Drone Capacity Impact: Single vs. Multiple Visits per Drone Dispatch
The ability of the drone to perform multiple visits may have a significant impact on the
delivery cost.
According to Meng et al. (2023), allowing the drone to visit several customers in
one route can bring cost savings.
In this part, we analyze the impact of the drone’s payload capacity by preventing
the drone from visiting more than a single customer per dispatch. The analysis is per-
formed on S(3.6.5.1), S(5.10.5.1), S(6.12.5.1), S(8.20.5.1), and S(10.50.10.3) instances.
Table 3 compares the objective function value for both cases using the above-mentioned
330 B. Madani and M. Ndiaye
instances. The findings indicate that enabling the drone to service multiple customers
can better leverage the drone’s flying endurance, resulting in a shorter distance traveled
by the truck to retrieve the drone, and thus generating cost-savings.
A Variable Neighborhood Search Algorithm 331
Table 3. Performance evaluation of the VNS algorithm based on the drone’s payload capacity
Table 4. Performance evaluation of the VNS algorithm based on the drone’s flying range
This paper proposed a VNS algorithm to solve the routing considerations of a truck-drone
delivery system. In particular, a single truck functioning as a moving hub is in charge
of operating the drone from potential truck locations while the drone is responsible for
serving multiple customers per dispatch. The suggested approach has demonstrated its
ability to solve small to medium instances in a short time.
This research can be further explored in a variety of ways. To begin, the truck-drone
delivery system could look into using multiple trucks in tandem with multiple drones to
service customers. Second, other aspects of the problem, such as time window and energy
332 B. Madani and M. Ndiaye
consumption, may be examined. Third, a mathematical formula for the given problem
description may be presented. Fourth, the suggested algorithm must be compared to
other solutions generated by exact solution methods and/or metaheuristic solutions to
better assess its quality. Finally, extensive computational analysis of a large number of
instances of various sizes may be performed.
References
Agatz, N., Bouman, P., Schmidt, M.: Optimization approaches for the traveling salesman problem
with drone. Transp. Sci. 52, 965–981 (2018). https://doi.org/10.1287/trsc.2017.0791
Boccia, M., Masone, A., Sforza, A., Sterle, C.: A column-and-row generation approach for the
flying sidekick travelling salesman problem. Transp. Res. Part C 124 (2021). https://doi.org/
10.1016/j.trc.2020.102913
Boyer, K.K., Prud’homme, A.M., Chung, W.: The last mile challenge: evaluating the effects of
customer density and delivery window patterns. J. Bus. Logist. 30(1), 185–201 (2009). https://
doi.org/10.1002/j.2158-1592.2009.tb00104.x
Boysen, N., Fedtke, S., Schwerdfeger, S.: Last-mile delivery concepts: a survey from an operational
research perspective. OR Spectr. 43(1), 1–58 (2021). https://doi.org/10.1007/s00291-020-006
07-8
Campbell, J.F., Sweeney, D., Zhang, J.: Strategic design for delivery with trucks and drones.
Technical report (2017)
Carlsson, J.G., Song, S.: Coordinated logistics with a truck and a drone. Manag. Sci. 64, 4052–4069
(2018). https://doi.org/10.1287/mnsc.2017.2824
Chung, S.H., Sah, B., Lee, J.: Optimization for drone and drone-truck combined operations: a
review of the state of the art and future directions. Comput. Oper. Res. 123 (2020). https://doi.
org/10.1016/j.cor.2020.105004
de Freitas, J.C., Penna, P.H.V.: A randomized variable neighborhood descent heuristic to solve the
flying sidekick traveling salesman problem. Electron. Notes Discret. Math. 66, 95–102 (2018).
https://doi.org/10.1016/j.endm.2018.03.013
de Freitas, J.C., Penna, P.H.V.: A variable neighborhood search for flying sidekick traveling
salesman problem. Int. Trans. Oper. Res. 27(1), 267–290 (2020). https://doi.org/10.1111/itor.
12671
Dell’Amico, M., Montemanni, R., Novellani, S.: Drone-assisted deliveries: new formulations for
the flying sidekick traveling salesman problem. Optim. Lett. (2019). https://doi.org/10.1007/
s11590-019-01492-z
Dragolea, N.: 9 Drones that will revolutionize Healthcare (2019). http://www.doctorpreneurs.com/
9-drones-that-will-revolutionise-healthcare/. Accessed 26 Feb 2021
Gevaers, R., Van de Voorde, E., Vanelslander, T.: Characteristics of innovations in last-mile
logistics-using best practices, case studies and making the link with green and sustainable
logistics. Association for European Transport and Contributors (2009)
Gu, R., Poon, M., Luo, Z., Liu, Y., Liu, Z.: A hierarchical solution evaluation method and a hybrid
algorithm for the vehicle routing problem with drones and multiple visits. Transp. Res. Part C
141 (2022). https://doi.org/10.1016/j.trc.2022.103733
Jeong, H.Y., Song, B.D., Lee, S.: Truck-drone hybrid delivery routing: payload-energy dependency
and No-Fly zones. Int. J. Prod. Econ. 214, 220–233 (2019). https://doi.org/10.1016/j.ijpe.2019.
01.010
Karak, A., Abdelghany, K.: The hybrid vehicle-drone routing problem for pick-up and delivery
services. Transp. Res. Part C 102, 427–449 (2019). https://doi.org/10.1016/j.trc.2019.03.021
A Variable Neighborhood Search Algorithm 333
Kuo, R.J., Lu, S.-H., Lai, P.-Y., Mara, S.T.W.: Vehicle routing problem with drones considering
time windows. Expert Syst. Appl. 191 (2022). https://doi.org/10.1016/j.eswa.2021.116264
Lavars, N.: Amazon reveals Prime Air drone capable of 30 minute deliveries (2013). https://new
atlas.com/amazon-prime-air-delivery-drone/29982/. Accessed 20 Feb 2021
Liu, Z., Li, X., Khojandi, A.: The flying sidekick traveling salesman problem with stochastic travel
time: a reinforcement learning approach. Transp. Res. Part E: Logist. Transp. Rev. 164 (2022).
https://doi.org/10.1016/j.tre.2022.102816
Lei, D., Chen, X.: An improved variable neighborhood search for parallel drone scheduling trav-
eling salesman problem. Appl. Soft Comput. 127 (2022). https://doi.org/10.1016/j.asoc.2022.
109416
Madani, B., Ndiaye, M.: Autonomous vehicles delivery systems classification: introducing a tsp
with a moving depot. In: 2019 8th International Conference on Modeling Simulation and
Applied Optimization (2019). https://doi.org/10.1109/ICMSAO.2019.8880379
Luo, Q., Wu, G., Mao, Ni., Shi, J.: Hybridization of variable neighborhood search and simulated
annealing for the collaborative truck-drone routing problem. In: Wu, M., Niu, Y., Gu, M.,
Cheng, J. (eds.) ICAUS 2021. LNEE, vol. 861, pp. 1505–1515. Springer, Singapore (2022).
https://doi.org/10.1007/978-981-16-9492-9_149
Madani, B., Ndiaye, M.: Hybrid truck-drone delivery systems: a systematic literature review. IEEE
Access 10, 92854–92878 (2022). https://doi.org/10.1109/ACCESS.2022.3202895
Marinelli, M., Caggiani, L., Ottomanelli, M., Dell’Orco, M.: En route truck-drone parcel delivery
for optimal vehicle routing strategies. IET Intell. Transp. Syst. 12(4), 253–261 (2018). https://
doi.org/10.1049/iet-its.2017.0227
Meng, S., Guo, X., Li, D., Liu, G.: The multi-visit drone routing problem for pickup and delivery
services. Transp. Res. Part E: Logist. Transp. Rev. 169, 102990 (2023). https://doi.org/10.1016/
j.tre.2022.102990
Mladenović, N., Hansen, P.: Variable neighborhood search. Comput. Oper. Res. 24(11), 1097–1100
(1997). https://doi.org/10.1016/S0305-0548(97)00031-2
Mogili, U.M.R., Deepak, B.B.V.L.: Review on application of drone systems in precision agri-
culture. Procedia Comput. Sci. 133, 502–509 (2018). https://doi.org/10.1016/j.procs.2018.
07.063
Murray, C.C., Chu, A.G.: The flying sidekick traveling salesman problem: optimization of drone-
assisted parcel delivery. Transp. Res. Part C 54, 86–109 (2015). https://doi.org/10.1016/j.trc.
2015.03.005
Murray, C.C., Raj, R.: The multiple flying sidekicks traveling salesman problem: parcel delivery
with multiple drones. Transp. Res. Part C 110, 368–398 (2020). https://doi.org/10.1016/j.trc.
2019.11.003
Poikonen, S., Golden, B.: Multi-visit drone routing problem. Comput. Oper. Res. 113 (2020a).
https://doi.org/10.1016/j.cor.2019.104802
Poikonen, S., Golden, B.: The mothership and drone routing problem. INFORMS J. Comput. 32,
249–262 (2020b). https://doi.org/10.1287/ijoc.2018.0879
Poikonen, S., Wang, X., Golden, B.: The vehicle routing problem with drones: Extended models
and connections. Networks 70(1), 34–43 (2017). https://doi.org/10.1002/net.21746
Sacramento, D., Pisinger, D., Ropke, S.: An adaptive large neighborhood search metaheuristic for
the vehicle routing problem with drones. Transp. Res. Part C 102, 289–315 (2019). https://doi.
org/10.1016/j.trc.2019.02.018
Salama, M.R., Srinivas, S.: Collaborative truck multi-drone routing and scheduling problem:
package delivery with flexible launch and recovery sites. Transp. Res. Part E 164 (2022).
https://doi.org/10.1016/j.tre.2022.102788
Salhi, S.: Heuristic Search: The Emerging Science of Problem Solving. Springer, Heidelberg
(2017)
334 B. Madani and M. Ndiaye
Schermer, D., Moeini, M., Wendt, O.: A hybrid VNS/Tabu search algorithm for solving the vehicle
routing problem with drones and en route operations. Comput. Oper. Res. 109 (2019). https://
doi.org/10.1016/j.cor.2019.04.021
Scott, J., Scott, C.: Drone delivery models for healthcare. In: Proceedings of the 50th Hawaii
International Conference on System Sciences (2017)
UPS tests drone delivery system. http://www.businessinsider.com/ups-tests-drone-delivery-sys
tem-2017-2. Accessed 15 Mar 2021
Wang, X., Poikonen, S., Golden, B.: The vehicle routing problem with drones: several worst-case
results. Optim. Lett. 11(4), 679–697 (2017). https://doi.org/10.1007/s11590-016-1035-3
Wang, Z., Sheu, J.-B.: Vehicle routing problem with drones. Transp. Res. Part B 122, 350–364
(2019). https://doi.org/10.1016/j.trb.2019.03.005
Windras Mara, S.T., Rifai, A.P., Sopha, B.M.: An adaptive large neighborhood search heuristic
for the flying sidekick traveling salesman problem with multiple drops. Expert Syst. Appl. 205
(2022). https://doi.org/10.1016/j.eswa.2022.117647
Wohlsen, M.: The next big thing you missed: amazon’s delivery drones could work—they just
need trucks. https://www.wired.com/2014/06/the-next-big-thing-you-missed-delivery-drones-
launched-from-trucks-are-the-future-of-shipping/. Accessed 10 Mar 2021
Wood, L.: $91.5 Billion Autonomous Last Mile Delivery Market, 2030. https://www.prnews
wire.com/news-releases/91-5-billion-autonomous-last-mile-delivery-market-2030--300895
603.html. Accessed 10 Mar 2021
Traffic and Transport
Prediction and Analysis of Transit Ferries
Travel Time: An Open Data-Based Case
Study
1 Introduction
Public transportation plays an increasingly important role in modern transporta-
tion systems and has emerged as one of the most crucial modes of transportation
due to its positive environmental and cost impacts. One of the topics that have
attracted considerable attention in recent years is travel time (TT) prediction,
which refers to the estimation of the time required to traverse a road segment
or a route under current or future traffic conditions. TT prediction is essen-
tial for various applications in intelligent transportation systems, such as route
scheduling [5] and hence can lead to provide a better service to the passengers.
Public transport modes can be divided into buses (including trolleybuses),
rail (including metros and trams), and ferries. While most of the research on
delay and travel time predictions for transit focuses on buses and rail services
(as evidenced by the fact that all papers included in [8] are about these modes), it
is important to note that ferry transit is also a significant mode of transportation.
For instance, the Sydney Ferry network in Australia carries an estimated 1.25
million passengers each month, and the Hamburg ferry system in Germany serves
approximately nine million passengers annually. Therefore, providing accurate
estimates of travel time for ferries should also be given importance, just like
buses and rail transit systems.
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 337–349, 2023.
https://doi.org/10.1007/978-3-031-43612-3_21
338 M. Sarhani et al.
Analyzing and predicting the travel time of a ferry is a challenging task due to
several factors that can affect it. That is, ferries are ships, watercraft, or amphibi-
ous vehicles that transport passengers, and as such, their travel time can be influ-
enced by a range of factors. Nevertheless, despite the difficulties in predicting ship
travel times, there have been significant advances in this area in recent years.
Indeed, accurate prediction of ship travel time is crucial for the maritime industry
as it helps in obtaining more precise estimations of the estimated time of arrivals.
The latter has a crucial impact on overall planning, as indicated in [20].
In particular, the use of machine learning (ML) in the transportation sec-
tor and maritime industry has gained considerable momentum due to the large
amount of data generated by this transportation mode, and recent advancements
in ML algorithms have made it possible to effectively analyze and make real-time
predictions based on this data [4]. Indeed, ML algorithms such as support vec-
tor machines (SVM), random forests (RF), gradient boosting (GB), and neural
networks (NN) are examples of ML algorithms that can be used to predict vessel
arrival times and address the challenges in this area. These predictions take into
account factors such as voyage information, vessel performance data, and weather
conditions. This information enables organizations to make more informed deci-
sions and reduces uncertainty about the travel time of ships. Additionally, there
have been recent developments in ML, such as the analysis of feature importance,
which aim to improve model interpretability and understand the impact of differ-
ent features.
In this paper, our goal is to compare different ML algorithms to determine the
most accurate model for estimating and predicting the travel time of ferry transit
using various data sources. We will also evaluate the impact of different features
and highlight the most important ones. These factors can have significant impacts
on transit service quality, reliability, efficiency, and customer satisfaction.
The remainder of this paper is structured as follows: In the next section, we
provide a brief literature review of the topic. Section 3 describes the data used
and the various features employed in addition to the prediction approach. The
experiments are presented in Sect. 4. Finally, we conclude with a discussion of
the results and provide future research directions.
2 Literature Review
At the beginning of this section, we would like to note that we have not found
any previous work that focuses on predicting ferry delays or TT, or analyzing
the impact of different factors on these outcomes. To our knowledge, the main
work that focuses on ferries, proposed by [1], is interested in the observed waiting
time for ferry passengers. More information about the waiting time optimization
problem can be found in [6]. [1]’s results indicate a positive relationship between
waiting time and headway. The related literature review to our paper has primar-
ily focused on ship TT prediction and understanding the factors that impact it.
In this brief literature review, we highlight the different factors affecting transit
or ship TT and delays and discuss some prediction methods.
Prediction and Analysis of Transit Ferries Travel Time 339
Feature Description
GTFS static Trip ID A specific trip identification
Route ID A route is a group of trips that are displayed to riders as a single service
Service ID Identifies a set of dates when service is available for one or more routes
Shape ID Identifies a geospatial shape that describes the vehicle travel path for a trip
Trip headsign Text that appears on signage identifying the trip’s destination to riders
Direction ID Indicates the direction of travel for a trip
Route direction Planned arrival time at a specific stop for a specific trip
GTFS real-time Vehicle ID Internal identification for the vehicle
Vehicle Route ID Route of the vehicle (used for merging GTFS real-time with GTFS static)
Vehicle Trip ID Trip of the vehicle (used for merging GTFS real-time with GTFS static)
Vehicle Trip Start Time Start time of the trip of the vehicle (used for merging GTFS real-time with GTFS static)
Vehicle Trip Start Date Start date of the trip of the vehicle (used for merging GTFS real-time with weather and GTFS static)
Weekday Weekday of the trip (extracted from the previous feature)
Latitude Latitude for the vehicle position
Longitude Longitude for the vehicle position
Bearing Bearing for the vehicle position
Weather Temperature Average temperature of the day
Dew Point the temperature the air needs to be cooled to in order to achieve a relative humidity
Humidity Average humidity of the day
Wind Speed Average wind speed of the day
Pressure Average pressure of the day
The data used in this study covered the period from March 5, 2023, to April
7, 2023. Specifically, the data encompasses the time between March 5 and March
7, as well as April 4 to April 7. We made an effort to include all possible data
during these days, considering various weekdays and different weather conditions,
342 M. Sarhani et al.
4 Experiment
In this section, we define the experimental setup of our case study, then present
and discuss the results.
error (MSE) and the root mean square error (RMSE) and, in addition, the mean
absolute percentage error (MAPE).
We will begin by displaying the correlation plot, as depicted in Fig. 2
Based on Fig. 2, we can observe that most features exhibit a moderate or
low correlation (between -0.8 and 0.8), with the exception of weather-related
features such as dew point and precipitation. We will discuss this later.
GB RF DNN SVM
2
R 0.9334 0.9428 0.6924 0.1664
MAE 0.0061 0.0102 0.092 0.429
RMSE 0.008 0.0132 0.163 0.508
MAPE 0.0989 0.106 0.418 0.760
We can draw a preliminary conclusion from Table 2 that the predictive ability
of ML algorithms varies significantly. The best results are obtained using GB,
which is slightly better than RF. Their performance is superior to that of SVM
and DNN. It should be noted that these results are specific to the given case
study and the chosen dataset. However, we can conclude that RF and GB are
effective approaches for this case. These methods have also been adopted to
assess the feature importance, as shown below.
Prediction and Analysis of Transit Ferries Travel Time 345
Feature RF GB Permutation
Vehicle ID 0.00001 0.00001 0.00061
Vehicle Bearing 0.00003 0.00004 0.000105
Trip ID 0.02048 0.01582 0.025763
Trip Start Time 0.21010 0.21140 0.173025
Trip Start Date 0.20701 0.22431 0.18900
Route ID 0.00563 0.00437 0.069827
service ID 0.00000 0.00000 0.00000
shape ID 0.00000 0.00000 0.00000
Trip_headsign 0.00000 0.00000 0.00000
Direction_id 0.00000 0.00000 0.00000
Route_direction 0.00000 0.00000 0.00000
Weekday 0.03302 0.04368 0.04900
Hour 0.04004 0.04769 0.03529
Latitude 0.11721 0.13140 0.11521
Longitude 0.03560 0.04310 0.00000
Temperature 0.06835 0.03690 0.07
Dew Point 0.05735 0.05067 0.06456359
Humidity 0.0813 0.05098 0.0685
Wind Speed 0.18709 0.16185 0.1734
Pressure 0.06582 0.06377 0.06456359
Furthermore, Fig. 4 displays a bar plot of the Shapley values [14] for the 12
most significant features (which correspond to two of the third used features).
Shapley values have gained popularity for this purpose [18]. They represent the
346 M. Sarhani et al.
5 Conclusion
Travel time prediction for transit has recently become a recurring topic in the
intelligent transportation literature. Transit travel time is influenced by several
factors. This study proposes a systematic approach to predicting ferry transit
travel time using publicly available timetables, real-time transit service feed data
in the GTFS format, and weather data. In this paper, we merged different data
sources to estimate travel time and showed how open data can be used to enhance
the information provided to passengers. We compared various machine learning
techniques for prediction and found that gradient boosting and random forest
are the most suitable methods for the provided case study. Regarding the factors
affecting travel time, the trip time and date were shown to be the most important
features. Additionally, weather features such as wind speed have a significant
impact.
Our proposed approach based on machine learning integrates heterogeneous
data sources and can therefore be useful for a wide range of agencies worldwide.
Thanks to the increasingly standardized data, it can be easily adopted by many
cities.
The findings and accurate travel time predictions can be used for early detec-
tion of delays as in [13]. For this purpose, more information about the planned
travel times is needed. In our data, we relied solely on actual travel times, but
incorporating both actual and planned travel times is necessary for delay analy-
sis. Additionally, travel time prediction can be used to obtain a better estimation
of arrival travel time and improve service, as mentioned earlier [11]. The same
methodology and features can be utilized for predicting both delays and esti-
mated travel times. These topics are gaining increasing interest due to their
importance in improving transit service, and acquiring more data is essential for
addressing them effectively.
Future research should prioritize the acquisition of additional datasets and
the exploration of alternative case studies. In fact, the availability of open data
for ferry services is still limited compared to other transit services. To yield
more comprehensive results, it is advisable to employ longer data periods for a
more thorough evaluation of various factors (e.g., precipitation as no rain was
recorded at that period). Additionally, beyond the findings presented in Fig. 2, it
is essential to gain a deeper understanding of the relationships between different
factors (e.g., whether wind speed has a positive or negative effect on travel time
and to what extent it is linked to the date of the trip). For assessment, other
measures such as adjusted R2 can be used to detect the best combination of
features for the prediction. The goal of this study is to lay the groundwork for
further research in this crucial field.
348 M. Sarhani et al.
References
1. Andersen, S.N., Tørset, T.: Waiting time for ferry services: empirical evidence from
Norway. Case Stud. Transp. Policy 7(3), 667–676 (2019). https://doi.org/10.1016/
j.cstp.2019.04.006
2. Bauer, D., Tulic, M.: Travel time predictions: should one model speeds or travel
times? Eur. Transp. Res. Rev. 10, 1–12 (2018). https://doi.org/10.1186/s12544-
018-0315-7
3. Brownlee, J.: https://machinelearningmastery.com/calculate-feature-importance-
with-python/. Accessed 27 Feb 2022
4. Cammin, P., Sarhani, M., Heilig, L., Voß, S.: Applications of real-time data to
reduce air emissions in maritime ports. In: Marcus, A., Rosenzweig, E. (eds.) HCII
2020. LNCS, vol. 12202, pp. 31–48. Springer, Cham (2020). https://doi.org/10.
1007/978-3-030-49757-6_3
5. Daduna, J.R., Trépanier, M., Voß, S.: From computer-aided transit scheduling to
systems and surveys in public transport. Public Transp. 14(1), 1–3 (2022). https://
doi.org/10.1007/s12469-022-00302-7
6. Daduna, J., Voß, S.: Practical experiences in schedule synchronization. In: Daduna,
J.R., Branco, I., Paixão, J.M.P. (eds.) Computer-Aided Transit Scheduling. LNE,
vol. 430, pp. 39–55. Springer, Heidelberg (1995). https://doi.org/10.1007/978-3-
642-57762-8_4
7. Filtz, E., de la Cerda, E.S., Weber, M., Zirkovits, D.: Factors affecting ocean-going
cargo ship speed and arrival time. In: Persson, A., Stirna, J. (eds.) CAiSE 2015.
LNBIP, vol. 215, pp. 305–316. Springer, Cham (2015). https://doi.org/10.1007/
978-3-319-19243-7_29
8. Ge, L., Sarhani, M., Voß, S., Xie, L.: Review of transit data sources: potentials,
challenges and complementarity. Sustainability 13(20), 11450 (2021). https://doi.
org/10.3390/su132011450
9. Ge, L., Voß, S., Xie, L.: Robustness and disturbances in public transport. Public
Transp. 14(1), 191–261 (2022). https://doi.org/10.1007/s12469-022-00301-8
10. Gkiotsalitis, K., Cats, O.: Public transport planning adaption under the COVID-
19 pandemic crisis: literature review of research needs and directions. Transp. Rev.
41(3), 374–392 (2020). https://doi.org/10.1080/01441647.2020.1857886
11. Heilig, L., Stahlbock, R., Voß, S.: From digitalization to data-driven decision mak-
ing in container terminals. In: Böse, J.W. (ed.) Handbook of Terminal Planning.
ORSIS, pp. 125–154. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-
39990-0_6
12. Hörcher, D., Graham, D.J., Anderson, R.J.: Crowding cost estimation with large
scale smart card and vehicle location data. Transp. Res. Part B: Methodol. 95,
105–125 (2017). https://doi.org/10.1016/j.trb.2016.10.015
13. Kim, S., Kim, H., Park, Y.: Early detection of vessel delays using combined histor-
ical and real-time information. J. Oper. Res. Soc. 68(2), 182–191 (2017). https://
doi.org/10.1057/s41274-016-0104-4
14. Marcílio-Jr, W.E., Eler, D.M.: Explaining dimensionality reduction results using
Shapley values. Expert Syst. Appl. 178, 115020 (2021). https://doi.org/10.1016/
j.eswa.2021.115020
15. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn.
Res. 12, 2825–2830 (2011)
16. Petersen, N.C., Rodrigues, F., Pereira, F.C.: Multi-output bus travel time predic-
tion with convolutional LSTM neural network. Expert Syst. Appl. 120, 426–435
(2019). https://doi.org/10.1016/j.eswa.2018.11.028
Prediction and Analysis of Transit Ferries Travel Time 349
17. Taghipour, H., Parsa, A.B., Mohammadian, A.K.: A dynamic approach to pre-
dict travel time in real time using data driven techniques and comprehensive
data sources. Transp. Eng. 2, 100025 (2020). https://doi.org/10.1016/j.treng.2020.
100025
18. Wagner, F., et al.: Using explainable machine learning to understand how urban
form shapes sustainable mobility. Transp. Res. Part D: Transp. Environ. 111,
103442 (2022). https://doi.org/10.1016/j.trd.2022.103442
19. Yan, R., Wang, S., Zhen, L., Laporte, G.: Emerging approaches applied to mar-
itime transport research: past and future. Commun. Transp. Res. 1, 100011 (2021).
https://doi.org/10.1016/j.commtr.2021.100011
20. Yu, J., Voß, S.: Towards just-in-time arrival for container ships by the integration
of prediction models. In: 56th Hawaii International Conference on System Sciences
(2023). https://hdl.handle.net/10125/102778
21. Zissis, D., Xidias, E.K., Lekkas, D.: Real-time vessel behavior prediction. Evolving
Syst. 7, 29–40 (2016). https://doi.org/10.1007/s12530-015-9133-5
A Bi-Objective Column Generation
Approach for Real-World Rolling Stock
Circulation Planning Problems
1 Introduction
In real-world situations, decision makers often aim to minimize both the use
of traction units and empty run kilometers, as they are major cost drivers. How-
ever, it is generally difficult to assign specific cost values to both aspects, which
are necessary when integrating them into one objective function. This is due
to the fact that they have different priorities at different planning stages. The
contribution of this work is the development of a bi-objective column generation
approach for the RSCP, whose solution front allows an evaluation of the rela-
tion between the two objectives and enables the decision makers to choose the
currently most suitable circulation plan.
The remainder of this paper is organized as follows. Section 2 provides a
brief literature review focusing on different models and approaches for RSCPs.
Section 3 describes the real-world problem and its particularities verbally and
mathematically. The bi-objective column generation approach is introduced in
Sect. 4, and the results for real-world use cases are presented in Sect. 5. Finally,
Sect. 6 concludes this work and gives a summary and possibilities for future
research.
2 Literature Review
Many authors presented publications on the topic of RSCPs. First, the articles
can be distinguished by the transport mode in rail passenger transport, e.g., [2],
and rail freight transport, e.g., [13]. The next possible differentiation is based
on which kind of railway vehicle the particular authors study. [13] deal with
the assignment of locomotives to scheduled trips, [2] consider railcars, and [7]
examine the routing of carriages.
An RSCP is usually formulated as a mixed integer problem. Most publica-
tions provide an arc-flow formulation in a time-space graph, e.g., [2]. Another
option for modeling an RSCP is a Set Covering Problem (SCP) formulation as
in [10]. The third formulation can be referred to as hyperarc-flow formulation in
a hypergraph [11]. The difference from regular graphs is that in hypergraphs, an
edge can join any number of nodes, not only two.
In order to generate optimal circulations for railway vehicles, appropriate
evaluation criteria for the solutions must be established in advance. Most objec-
tive functions target the efficiency of the formed plans as in [2]. For example,
the total costs incurred from allocating the vehicles, the driven kilometers, or
the number of necessary railway vehicles are minimized. Other frequently used
objectives can be combined into the quality of service, e.g., [7]. An example of
this is the minimization of kilometers with seat shortages. The third group of
criteria is intended to ensure highly robust solutions. For example, [12] inter-
actively use optimization and simulation to focus on robustness. The criteria
considered in this paper are the number of traction units and the number of
empty run kilometers. Both are evaluated in [6] with the difference that costs
are given to the criteria to combine them in one objective function. However, in
practice this is often impossible since the decision makers can hardly quantify
those values and thus want different solutions with various ratios between both
352 P. Päprer et al.
objectives. This relationship has not been extensively studied in the literature
so far.
Because there is an enormous diversity in the practical requirements of the
particular use case, the applied methods to obtain reasonable solutions differ
strongly. As a result, they can be distinguished into integer programming meth-
ods [5], heuristics [2], and column generation [1].
3 Problem Description
3.1 Real-World Rolling Stock Circulation Planning
The studied problem is based on a practical RSCP from the Austrian Federal
Railways (ÖBB). The first sub-task of rolling stock circulation planning, i.e., the
rolling stock assignment, is already given. This means that the traction unit type
for every trip is known. Only the second sub-task, the train routing, needs to be
solved here. Consequently, circulations must be created for a homogeneous set
of vehicles to cover all scheduled trips requiring this type. The focus lies hereby
on traction units, so carriages are not considered.
The railway network and a train schedule for a specific planning horizon are
given and form the basis for the modeling. Every trip from the schedule has an
origin station, a destination station, a start time, and an end time. The traction
units are allowed to perform empty runs. These are unproductive trips that do
not appear in the timetable and only serve to transfer a traction unit between
two locations. To calculate the duration and distance for possible empty runs,
the duration and distance between specific network points are used. Based on
these fundamentals, a verbal formulation of the real-world circulation problem
is given below, showing the objective function and relevant constraints.
There are two objectives when performing the circulation planning for the
ÖBB. Besides the number of used traction units, minimizing empty run kilo-
meters is important. Since the traction units are the most expensive resources
of a railway company, purchasing new vehicles represents considerable acquisi-
tion costs. Therefore, circulation plans should strive for maximum efficiency in
using the existing traction units, with the aim of minimizing the total number of
units required. As the second part of the objective function, a small number of
empty run kilometers is desired. Because traction unit drivers are an expensive
resource, too, the ÖBB aims their drivers to perform as many productive, i.e.,
planned, trips as possible. Empty runs only cost money but do not generate any
profit, so they should be avoided whenever possible.
There are several practical requirements in the real-world circulation prob-
lem. First, all trips from the previously planned timetable must be covered by
at least one traction unit. Second, the planners aim at a round or cyclic plan-
ning solution. This means, in every station, the number of traction units at the
beginning must equal the number of traction units at the end. Such a station-
wise balanced circulation plan automatically puts the whole system back in its
initial state. It is possible to perform the same circulation plan again in the next
Column Generation Approach for Circulation Planning 353
period. A third restriction limits the available traction units. Finally, all circula-
tions must be spatially and temporally feasible. This means that after covering
one trip, the next trip has to start later than the arrival of the previous one.
Moreover, if the next trip has a different origin station, the time between both
trips must be greater than the empty run needed to come to this station.
The basic idea of the depicted SCP formulation is to have a set T of the
trips to be covered and another set P of feasible circulations. The aim is now
to identify the smallest or cost-minimal sub-collection of the circulation set to
cover all elements of the trip set.
min cp · xp (1)
p∈P
subject to ap,t · xp ≥ 1 ∀t ∈ T (2)
p∈P
bp,s · xp = 0 ∀s ∈ S (3)
p∈P
xp ≤ n (4)
p∈P
xp ∈ {0, 1} ∀p ∈ P (5)
Equation (1) is the objective function of the model formulation and minimizes
the total cost. This happens by summing up the cost of all used circulations.
354 P. Päprer et al.
Note that the term cost can represent both objectives and is defined explicitly
in Sect. 4. Constraints (2) use the parameter ap,t to guarantee that every trip is
assigned to at least one circulation. At this point, the character of the problem
in the sense of an SCP formulation becomes clear because there is the possibility
of over-covering. The constraints (3) ensure a station-wise balanced rolling stock
circulation planning. These restrictions count the circulations which start (+1)
and/or end (−1) at the specific station and provide a balanced overall plan
(balance = 0). The methodology employed in this paper for modeling station-
wise balanced plans draws from [1]. Constraint (4) ensures that the number of
used traction units does not exceed the number of available ones. Finally, the
decision variables have a binary character, as modeled in Eqs. (5). With this 0–1
integer linear programming model, the problem can be solved optimally when
the set of all feasible circulations is built before. Since this is inefficient for large
instances, a column generation procedure is applied.
4 Solution Approach
4.1 General Procedure
To solve the RSCP, column generation is used. The basic procedure of this
solution method is described in [4]. It consists of a constructive heuristic, a
Master Problem (MP), a Pricing Problem (PP), and an integrality step. The
general process of the particular algorithm is depicted in Fig. 1. The picture
shows the interaction of the four critical components in the algorithm.
circulations from the current circulation pool in the MP, the PP tries to construct
new improving columns, i.e., circulations.
The applied column generation terminates if one of the following conditions
is met: the best-found circulation in the PP has non-negative reduced cost, or
the total number of 200 iterations is reached, or the objective value of the MP is
improved less than 0.1 % in 20 consecutive iterations. Since the MP works with
continuous variables, it is necessary to convert the found solution into an integer
one after that.
min cp · xp + (us + vs ) + w · cpunish (6)
p∈P s∈S
subject to ap,t · xp ≥ 1 ∀t ∈ T (7)
p∈P
bp,s · xp ≤ 0 + us ∀s ∈ S (8)
p∈P
bp,s · xp ≥ 0 − vs ∀s ∈ S (9)
p∈P
xp ≤ n + w (10)
p∈P
xp ∈ R+
0 ∀p ∈ P (11)
us , vs ∈ R+
0 ∀s ∈ S (12)
w∈ R+
0 (13)
Given this information, the equation for the reduced cost of a variable can be
derived. The formula always starts with the cost of a new variable. Besides this
term, every restriction of the rRMP appears in the equation by its calculated
dual value and the decision variable coefficient. Note that parameters ap,t and
Column Generation Approach for Circulation Planning 357
bp,s from the MP are now variables. Now, the PP aims to find the variable with
the lowest reduced cost. Consequently, the equation for the reduced cost has to
be formulated as follows.
cp = cp − πt · ap,t − σsl · bp,s − σsg · bp,s − λ (14)
t∈T s∈S s∈S
The connection between an empty run start node and a trip node is only
possible when the empty run’s destination station is the same as the trip’s origin
358 P. Päprer et al.
station. Analogously, a trip can only be followed by an empty run end node when
the trip’s destination station equals the empty run’s origin station. Furthermore,
the trip’s start and end times must be considered when connecting empty run
nodes and trips. Sometimes a trip starts right after the beginning of the planning
period. In this case, inserting an empty run before the trip is impossible. The
network representation resulting from this process is depicted in Fig. 2.
Now, the PP can be formulated as a shortest path problem [10], where the
goal is to find a path from source i to sink j that generates the lowest reduced
cost. For this purpose, a variable xt,t indicates whether node t is followed by node
t . This variable is created for every directed edge of the network. Additionally,
some other sets of nodes are needed. V s is the subset of the node’s successors
in the network, and V p is the subset of the node’s predecessors. W combines
the scheduled trips T and the empty runs R. The following constraints of the
respective shortest path problem can find the cost-minimal circulation, i.e., the
circulation with the lowest reduced cost.
min cp (15)
⎧
⎪
⎨ 1 if t = i
subject to xt,t − xt ,t = 0 if t ∈ W ∀t ∈ V (16)
⎪
⎩
t ∈V s t ∈V p −1 if t = j
xt,t = ap,t ∀t ∈ T (17)
t ∈V s
xi,t − xt,j = bp,s ∀s ∈ S (18)
t∈Wsori t∈Wsdes
The first group of constraints (16) are the typical flow conservation con-
straints of the shortest path problem. Only one outgoing arc is allowed for the
source, and there will be exactly one ingoing arc for the sink. All other nodes
must have an equal number of ingoing and outgoing arcs. The next two con-
straints (17) and (18) are the linking constraints between the variables xt,t ,
ap,t , and bp,s . If an arc is used, the respective trip is part of the circulation, and
hence, ap,t needs to be 1 (17). For defining the bp,s variable, it is crucial to deter-
mine whether a station is used as an origin and/or destination station. For this
purpose, a set of all trips or empty runs starting at the particular station (Wsori )
and ending there (Wsdes ) is formed. With the help of these two sets, the balance
for every station can be calculated, which is then equal to the bp,s variable of the
station s (18). The corresponding circulation is added to the circulation pool if
the minimum reduced cost is negative.
Column Generation Approach for Circulation Planning 359
Because two objective criteria are considered for the circulation plans, two col-
umn generation variants are created, each focusing on a different objective.
They differ in formulating the objective function of the MP and the formu-
lation of the reduced cost for the PP. We apply the ε-constraint method to the
column generation approach to provide a front of different solutions. The general
procedure of that method can be found in [8]. The developed method is depicted
in Fig. 3. It starts with two independent runs of both column generation variants.
CG-TU will minimize the number of traction units T Umin leading to an upper
bound of empty run kilometers ERmax because of neglecting this target. In con-
trast, CG-ER minimizes the number of empty run kilometers ERmin , providing
an upper bound for the used traction units T Umax . Between those two extreme
points, other solutions are calculated. Given two objective functions, one has
to be chosen as the single objective making the other a constraint. Solving the
column generation variant CG-TU is much more time-consuming because many
solutions have the same objective value, i.e., number of traction units. For this
reason, the second, more runtime-efficient column generation variant, CG-ER, is
taken. The other objective of minimizing the number of traction units becomes
a constraint, which is already part of the model as constraint (10). Starting from
the CG-TU solution, CG-ER is run. The right-hand side of the capacity con-
straint starts with the minimal number of traction units T Umin and is increased
step by step until the highest number of traction units, i.e., the one of the inde-
pendent CG-ER run T Umax , is reached. This means, the algorithm can use more
traction units, gradually reducing the number of empty run kilometers.
5 Computational Results
To test the column generation, it is applied to two real-world use cases given by
the ÖBB (see Table 3). Some descriptive parameters are observed there to get an
overview of the use cases. Furthermore, the graph in the bottom row visualizes
the use case, where every node belongs to one station. The edges represent the
scheduled trips in the style of an undirected graph. The thicker the line, the
more often a trip occurs over the planning horizon. For this purpose, the trips
are sorted into five classes whose borders are set by quantiles of the absolute
frequencies.
The linear and integer programming formulations of the RSCP are imple-
mented with Gurobi 10.0.1 using its C# API. C# is also used for the implemen-
tation of the proposed constructive heuristic and the column generation frame-
work. All tests are run on an AMD EPYCTM 7513 32-Core Processor using up
to 24 physical cores and 128 GB RAM. The relevant parameters for the column
generation algorithm are set to the following values: cpunish is 100,000, and n is
500.
To evaluate the proposed column generation approach, a distinction is made
between the results of the constructive heuristic, CG-TU, and CG-ER. For all
three solution types as well as both use cases, Table 4 shows the number of
traction units, the empty run kilometers, the best rRMP value, and the run-
time in seconds. The number of infeasibilities for the constructive heuristic only
refers to the station balancing constraints and the capacity of traction units. An
overused traction unit is one infeasibility. In addition, whenever there is a differ-
ence between departing and ending traction units for a station, this difference
is added to the number of infeasibilities. The best rRMP value represents the
best real solution before setting all variables to binary or integer. Consequently,
Column Generation Approach for Circulation Planning 361
it refers to the traction unit number in CG-TU and empty run kilometers in
CG-ER.
real-world problem, the number of traction units and the number of empty run
kilometers. This means no weights are given to the objectives to obtain one single
best solution but a set of non-dominated solutions are determined. Thereby, a
solution S1 dominates another solution S2 if it is not worse than S2 in both
objectives and better in at least one objective [14]. Figure 4 visualizes the ten
non-dominated solutions obtained by our method for use case A. It can be seen
that a hyperbolic solution front is created. In the top left corner is a solution with
a small number of traction units but many empty run kilometers. The opposite
can be found in the bottom right corner, a solution with very low empty run
kilometers and many traction units.
The assumption that the two objective criteria influence each other can be
strengthened. The improvement of one criterion leads, ceteris paribus, to a dete-
rioration of the other criterion. If up to 44 % more traction units are gradually
deployed, the number of empty kilometers can be progressively reduced to 29 %.
Now, it is up to the decision maker to decide which solution he/she chooses.
The same procedure of multi-objective column generation is applied to the
large use case B. The resulting front of the solutions is shown in Fig. 5. Interest-
ingly, allowing more traction units substantially decreases empty run kilometers
between 73 and 79 traction units. The decision maker should consider using
more traction units here to take the sizeable empty run advantages. Using more
than 79 traction units also improves the empty run kilometers. However, the
improvements are significantly smaller compared to the initial savings.
The figures illustrate the benefits of the approach presented. It helps to cre-
ate solution fronts and allows an analysis of the relationship between the two
objectives. This is very important for decision makers: they can focus on one
direction or the other at different planning stages.
Column Generation Approach for Circulation Planning 363
Acknowledgements. This work was funded in the course of the project VIPES (FFG
project number 893963) by the Federal Ministry for Climate Protection, Environment,
Energy, Mobility, Innovation and Technology (BMK) as part of the call for proposals
Mobilität der Zukunft. FFG is the central national funding organization and strength-
ens Austria’s innovative power. We also want to thank Matthias Wastian, Senior Data
Scientist at dwh GmbH, for providing all the necessary information to formulate the
model and use cases to run the algorithm.
References
1. Bach, L., Gendreau, M., Wøhlk, S.: Freight railway operator timetabling and engine
scheduling. Eur. J. Oper. Res. 241(2), 309–319 (2015). https://doi.org/10.1016/j.
ejor.2014.08.036
364 P. Päprer et al.
2. Cacchiani, V., Caprara, A., Toth, P.: An effective peak period heuristic for rail-
way rolling stock planning. Transp. Sci. 53(3), 746–762 (2019). https://doi.org/10.
1287/trsc.2018.0858
3. Caprara, A., Kroon, L., Monaci, M., Peeters, M., Toth, P.: Passenger railway opti-
mization. In: Handbooks in Operations Research and Management Science. vol.
14, pp. 129–187 (2007). https://doi.org/10.1016/S0927-0507(06)14003-7
4. Desrosiers, J., Lübbecke, M.E.: A primer in column generation. In: Desaulniers,
G., Desrosiers, J., Solomon, M.M. (eds.) Column Generation, pp. 1–32. Springer,
Boston (2005). https://doi.org/10.1007/0-387-25486-2_1
5. Fioole, P.J., Kroon, L., Maróti, G., Schrijver, A.: A rolling stock circulation model
for combining and splitting of passenger trains. Eur. J. Oper. Res. 174(2), 1281–
1297 (2006). https://doi.org/10.1016/j.ejor.2005.03.032
6. Frisch, S., Hungerländer, P., Jellen, A., Primas, B., Steininger, S., Weinberger,
D.: Solving a real-world locomotive scheduling problem with maintenance con-
straints. Transp. Res. Part B: Methodological 150, 386–409 (2021). https://doi.
org/10.1016/j.trb.2021.06.017
7. Lingaya, N., Cordeau, J.F., Desaulniers, G., Desrosiers, J., Soumis, F.: Operational
car assignment at via rail Canada. Transp. Res. Part B: Methodological 36(9),
755–778 (2002). https://doi.org/10.1016/S0191-2615(01)00027-3
8. Mavrotas, G.: Effective implementation of the ε-constraint method in multi-
objective mathematical programming problems. Appl. Math. Comput. 213(2),
455–465 (2009). https://doi.org/10.1016/j.amc.2009.03.037
9. Neufeld, J.S., Scheffler, M., Tamke, F., Hoffmann, K., Buscher, U.: An efficient
column generation approach for practical railway crew scheduling with attendance
rates. Eur. J. Oper. Res. 293(3), 1113–1130 (2021). https://doi.org/10.1016/j.ejor.
2020.12.058
10. Nishi, T., Ohno, A., Inuiguchi, M., Takahashi, S., Ueda, K.: A combined column
generation and heuristics for railway short-term rolling stock planning with regular
inspection constraints. Comput. Oper. Res. 81, 14–25 (2017). https://doi.org/10.
1016/j.cor.2016.11.025
11. Reuther, M., Schlechte, T.: Optimization of rolling stock rotations. In: Borndörfer,
R., Klug, T., Lamorgese, L., Mannino, C., Reuther, M., Schlechte, T. (eds.) Hand-
book of Optimization in the Railway Industry. ISORMS, vol. 268, pp. 213–241.
Springer, Cham (2018). https://doi.org/10.1007/978-3-319-72153-8_10
12. Rößler, M., et al.: Simulation and optimization of traction unit circulations. In:
2020 Winter Simulation Conference (WSC), pp. 90–101. IEEE (2020). https://doi.
org/10.1109/WSC48552.2020.9383926
13. Scheffler, M., Neufeld, J.S., Hölscher, M.: An MIP-based heuristic solution app-
roach for the locomotive assignment problem focussing on (dis-) connecting pro-
cesses. Transp. Res. Part B: Methodological 139, 64–80 (2020). https://doi.org/
10.1016/j.trb.2020.05.020
14. Zitzler, E., Thiele, L., Laumanns, M., Fonseca, C.M., Da Fonseca, V.G.: Per-
formance assessment of multiobjective optimizers: an analysis and review. IEEE
Trans. Evol. Comput. 7(2), 117–132 (2003). https://doi.org/10.1109/TEVC.2003.
810758
An Effective Matheuristic Approach
for Robust Bus Driver Rostering
with Uncertain Daily Working Hours
1 Introduction
The driver rostering problem (DRP) is a critical problem in the transporta-
tion industry, particularly for bus companies that must efficiently create sched-
ules for their drivers. The problem involves assigning drivers to shifts or duties
over a specific period, such as a week or a month, while satisfying various con-
straints, including the availability of drivers, work regulations, and customer ser-
vice requirements. The DRP is a complex and challenging optimization problem
that has received significant attention from researchers in the fields of operations
research, computer science, and transportation.
In recent years, researchers have developed various approaches to solve the
DRP, including exact methods, heuristics, and metaheuristics. Exact methods
are designed to solve the DRP to optimality but are computationally expensive
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 365–380, 2023.
https://doi.org/10.1007/978-3-031-43612-3_23
366 A. Nourmohammadzadeh and S. Voß
and may not be practical for large-scale problems. Heuristics are fast and effi-
cient but do not guarantee optimal solutions. Metaheuristics are a compromise
between exact methods and heuristics, offering the ability to find near-optimal
solutions in a reasonable amount of time. Metaheuristics are particularly useful
for solving large-scale problems with complex constraints. Besides these methods,
combining elements of mathematical programming techniques with heuristics or
metaheuristics has shown promise under the acronym matheuristics [15,16].
Matheuristics are able to find high-quality solutions in a relatively short
amount of time, making them a popular choice for real-world applications where
time and computational resources are limited. They are particularly effective
when traditional optimization algorithms fail to find a solution or when a solution
is required within a tight time frame. These methods are also highly adaptable
and can be tailored to specific problem structures, making them a flexible tool
for a wide range of optimization problems.
There are several related works which investigate the bus driver rostering
problem (BDRP) with different assumptions and objectives. [21] presents a case
study on the driver rostering problem for a Finnish bus transportation com-
pany. The authors use a mathematical programming model to create an initial
solution, and then applied a local search heuristic to improve the solution. The
results show that the proposed approach significantly improves the quality of
the rosters compared to the manual process used by the company. [20] proposes
a bi-objective evolutionary heuristic to the problem. The proposed method con-
siders two objectives: minimizing the number of drivers needed and minimizing
the number of working hours. The authors tested their approach on real-world
data from a Portuguese bus company and demonstrated that their approach
could provide high-quality solutions in reasonable computation times. [14] used
a genetic algorithm to solve the balanced bus crew rostering problem, which is a
variant of the BDRP where each bus trip requires two drivers to work together.
The authors showed that their approach could generate high-quality solutions
and outperformed other approaches from the literature.
[3] proposes a column generation metaheuristic for the DRP. The authors
use a decomposition approach that allows for the creation of a large number of
rosters in a short time while still maintaining quality. [9] propose an integrated
and sequential solution method for the cyclic BDRP. The authors use a combi-
nation of column generation and branch-and-price. Their test instances are from
a Canadian bus company.
[17] describes a heuristic method for solving the problem, which involves
assigning drivers to shifts while taking into account their preferred days off. The
proposed method uses multi-commodity flow models to decompose the problem
into smaller subproblems, and then, uses a fixing process to optimize the solu-
tions. [24] presents a mathematical model for solving the integrated driver roster-
ing problem in public bus transport. The paper proposes a duty-block network
approach that divides the scheduling process into two stages: duty assignment
and block construction. The proposed model is compared with a traditional
linear programming model, and the results show that the duty-block network
Bus Driver Rostering with Uncertain Daily Working Hours 367
8% in 2022. Furthermore, the survey noted a lack of young drivers, with only
3% of bus and coach drivers under the age of 25, contributing to a demographic
crisis in the industry. Additionally, gender diversity in the profession is limited,
as less than 3% of truck drivers and only 12% of bus and coach drivers in Europe
are women. The primary cause of driver shortages, as reported by road transport
operators, is a scarcity of skilled drivers in various regions.
Some previous attempts in right-hand side uncertainties are for example: [22],
which discusses the bi-level knapsack problem with stochastic right-hand sides,
exploring the optimal solution to this problem using mathematical modeling
techniques, and [13], which addresses the maximization problem of monotone
submodular functions under an uncertain knapsack capacity. [18] explores the
complexity of 2-stage robust linear programming with right-hand side uncer-
tainty and its applications. The paper provides useful insights for researchers
and practitioners dealing with uncertainty in linear programming problems. The
same author researches into the complexity of two-stage robust linear program-
ming problems with right-hand side uncertainty in [19]. It is proven that the
problem becomes NP-hard when the uncertainty set is defined as an ellipsoid,
which is a more general set than the polyhedral set considered in the previous
paper. The paper provides a detailed proof of the complexity result and dis-
cusses its implications for practical applications. Closely related to our paper
are also issues to solve integrated vehicle and crew scheduling problems with
simple uncertainty measures in [10,11] regarding the number of vehicles.
A robust optimization approach is chosen to deal with right-hand side uncer-
tainties in our work. The concept of robustness is essential to address uncer-
tainties inherent in the system. The motivation behind employing robust opti-
mization techniques is to ensure that the proposed solutions remain of high
quality and effective even when faced with various input variations or scenar-
ios. By considering the uncertainties in the inputs, the objective is to develop
strategies or models that can withstand these uncertainties and still perform
optimally. Robustness measurement is a way to quantify the ability of a solution
or approach to maintain its effectiveness and quality under different conditions
or uncertainties. It allows for a more comprehensive evaluation and comparison
of different strategies, taking into account their performance across a range of
possible scenarios.
Due to the complexity of this problem, a matheuristic approach comprising
the particle swarm optimization (PSO) [12] and partial optimization metaheuris-
tic under special intensification conditions (POPMUSIC) [23] using consecutive
mathematical optimization of sub-models of the problem, is devised. A set of
benchmark instances and a peer algorithm from the literature are chosen to test
the ability of our solution methodology.
The remainder of this paper is organized as follows. In Sect. 2, the problem
is described and its mathematical model is presented. Section 3 explains our
solution methodologies. The computational experiments are covered in Sect. 4.
Finally, Sect. 5 is dedicated to the conclusions and directions to extend the work.
Bus Driver Rostering with Uncertain Daily Working Hours 369
2 Problem Description
In the BDRP, there is a set of duties which have to be performed with a set of
drivers within a planning horizon. Each duty has a duration and each driver has
a set of desired duties. For the assignment of each duty to each driver, a specific
cost incurs. There is also a cost corresponding to the assignment of a duty which
is not included in the set of desired duties of the driver. The objective of this
version of the BDRP is to minimize a weighted sum of the assignment costs
and drivers’ dissatisfaction. A big challenge is that the maximum time of the
availability of each driver is not certain during each day. In other words, drivers
can be unavailable in some periods but may make up the missing hours on other
days. Therefore, the daily working time of drivers may become smaller or larger
than the considered standard 8 h. Drivers do not usually work for more than a
maximum number of consecutive days without taking any day off in between.
This is regarded as another major restriction in rostering.
In the context of the BDRP, an additional restriction has been introduced to
ensure efficient scheduling of duties while considering driver preferences. This
constraint, referred to as the “no early duty after late duty” constraint, is
designed to avoid assigning an early duty to a driver who has done a late duty on
the previous day. The early duties are those which are done before 11:00 while the
late duties start after 18:00. By incorporating the “no early duty after late duty”
constraint into the BDRP, the scheduling algorithm can optimize the assignment
of duties while respecting the drivers’ preferences and promoting their well-being
by acknowledging the need for appropriate rest and recovery periods. This con-
straint enhances the fairness and efficiency of the rostering process, contributing
to improved driver satisfaction and overall operational performance.
The rostering constraints are respecting the maximum daily working hours
of drivers as well as the maximum number of consecutive working days, which
is here assumed to be 5. The notations used in the model are given in Table 1.
The mathematical formulation of our BDRP is as follows:
M in w1 cd,u + w2 xd,u (1)
d u∈U d u∈P
/ d
xd,u = 1 u∈U (2)
d
xd,u Hu ≤ M DW Hd,t d ∈ D, t ∈ T (3)
u∈Ut
xd,u ≤ 1 − xd,u d ∈ D, t ∈ T (4)
u∈Ul ∩Ut u∈Ue ∩Ut+1
ad,t |T | ≥ xd,u d ∈ D, t ∈ T (5)
u∈Ut
t+5
ad,t ≤ 5 d ∈ D, 1 ≤ t ≤ |T | − 5 (6)
t
xd,u , ad,t ∈ {0, 1} d ∈ D, u ∈ Ut , t ∈ T (7)
370 A. Nourmohammadzadeh and S. Voß
Parameters Explanation
T The set of days of the planning horizon; t ∈ T
D The set of drivers; d ∈ D
U The set of all duties; u ∈ U
Ut The set of duties to be assigned to the drivers on day t; u ∈ Ut
Ue The set of early duties; u ∈ Ue
Ul The set of late duties; u ∈ Ul
Hu The duration of duty u in hours
M DW Hd,t The maximum daily working hours of driver d on day t (an uncertain
parameter)
Pd The duties which driver d desires to do
w1 The weight of the total assignment cost for the bus company
w2 The weight related to the importance of driver satisfaction for the
company
cd,u The assignment cost of duty u to driver d
Variables Explanation
xd,u Binary, =1, if driver d is assigned to duty u
ad,t Binary, =1, if driver d is active on day t
In the objective function (1), we calculate the weighted sum of the assignment
costs and the number of undesired duties assigned to the drivers. Constraint (2)
enforces each duty to be assigned to only one driver. Respecting the maximum
working days of each driver is implied by Constraint (3). Constraint 4 prevents
the assignment of an early duty to a driver if they were assigned a late duty on the
previous day. This is an important constraint in our model because M DW Hd,t
are uncertain parameters and appear on the right-hand side. Constraint (5)
activates variable ad,t if driver d works on day t. Constraint (6) avoids exceeding
the maximum consecutive working days of each driver. The variable domains are
expressed by (7).
3 Solution Methodology
where vi (t) is the current velocity of particle i, pbesti is the personal best position
of the particle, and gbest is the best position of all particles in the swarm. The
Bus Driver Rostering with Uncertain Daily Working Hours 371
values r1 and r2 are random numbers between 0 and 1, and w is the inertia
weight that controls the balance between exploration and exploitation.
Then, the positions are updated by:
wheref (s) and f (s ) are the objective values of the current and the neighboring
solution, respectively. T is the current temperature. We begin from an initial
temperature T0 and reduce it after each iteration by dividing it by CT . When
we move to a new solution, searching the remaining neighboring solutions of the
iteration is continued around it. The termination condition of the SA is passing
a number (M ISA) of consecutive iterations without observing any improvement
in the objective value.
Following the algorithms’ procedure, a proportion of the best particles
(P M B) and a proportion of other particles (P M O) are also improved according
to the POPMUSIC framework. The latter group is selected again by the roulette
wheel method. In the POPMUSIC, solutions are divided into K parts; then, each
time, one of these parts (called sseed ) is selected at random and an area includ-
ing it and the D adjacent parts to it called altogether A is processed by solving
the mathematical model of the problem. So, we solve each time a sub-model of
the problem by fixing all variables except for those existing in A at their values
in the incumbent solution. The Gurobi solver is used to solve the sub-models.
The POPMUSIC process on a solution stops when no more improvement can be
done by choosing any part as sseed .
The whole matheuristic terminates if the number of consecutive stagnant
PSO iterations, which are iterations without any improvement in the objective,
exceeds a limit (M CU I). This algorithm has several hyper-parameters, which
have to be tuned properly in the beginning with an appropriate method. The
pseudocode of this matheuristic algorithm is presented in Algorithm 1.
The solutions are encoded as strings. Each cell of the string belongs to a duty
and is filled with the label of one driver. However, as PSO works usually with
continuous domains, the cell contents are real numbers in the interval [0,|D|].
To convert the real numbers to the driver labels, assuming that cc is the cell
content, we calculate cc + 1. Figure 1 shows the encoding of a simple BDRP
with 5 drivers and 10 duties.
372 A. Nourmohammadzadeh and S. Voß
The constraints of the model are handled by generating the variables’ values
in their feasible interval. For this sake, the variables are filled with a feasible
value one by one in the ascending order of the duties. When a variable is filled,
its value is entered in the corresponding constraints and determines the feasible
intervals for the next variables.
As the right-hand side of Constraint 3 is uncertain, the problem is solved once
based on each combination of M DW Hd,t values. Hence, we have one solution
and objective value for each scenario. The solution corresponding to the worst
objective value is considered as the robust solution because no scenario can lead
to any objective value worse than that. This is in accordance with the definition
of robustness presented in [4].
374 A. Nourmohammadzadeh and S. Voß
4 Computational Experiments
In this section, firstly, the used test instances, and then, the computational
settings are explained. The important step of parameterization of the algorithm
is declared next. Finally, the obtained results are presented and analyzed.
Instance MP MH S OA BVF
Obj Time Obj Time Obj Time Obj Time
48-75-6 1.00 12154.21 1.32 42.10 1.17 45.20 1.04 62.19 8.86 × 108
52-73-6 1.00 10534.53 1.26 37.15 1.06 48.62 1.02 66.73 8.92 × 108
52-75-6 1.00 11543.14 1.28 40.21 1.11 46.71 1.03 65.21 6.32 × 108
393-45-37 1.00 11397.19 1.31 41.64 1.33 182.27 1.07 63.31 4.19 × 108
392-45-37 1.00 13249.95 1.33 44.11 1.13 178.42 1.03 71.09 0.63 × 109
397-40-37 1.00 11864.44 1.25 39.45 1.33 202.82 1.08 68.91 4.07 × 108
96-70-8 1.00 10764.16 1.23 35.20 1.29 188.90 1.07 64.15 2.33 × 108
87-70-8 1.00 14177.99 1.29 41.72 4.10 219.62 1.18 72.63 2.08 × 107
89-70-8 2.65 43200 1.20 34.39 1.41 295.83 1.09 59.87 8.16 × 108
221-45-30 3.33 43200 1.23 36.58 1.23 478.62 1.05 62.42 1.42 × 1011
211-45-34 1.00 15692.70 1.35 45.82 2.16 296.08 1.21 70.10 6.84 × 1010
214-45-34 1.00 16480.40 1.24 43.17 2.25 352.52 1.19 67.06 6.21 × 1010
The PSO-SA (MH) and MMASPara (S) can find solutions slightly distant
from the bests but in much shorter execution times compared to MP. On the
other hand, our matheuristic spends a little more time than MH but achieves
solutions of considerably better quality. It is also perceived that the POPMUSIC
plays an important role in the success of our matheuristic since the solution
quality significantly drops when we exclude it and use only PSO-SA (MH).
The proportional objective values of all runs are illustrated for the four
examined algorithms as well as the model without uncertainties solved by our
matheuristic (non-robust: shown as NR) as box plots in Fig. 3. It is evident that
the four quartiles related to our approach are lower or better than the three first
and this indicates its satisfactory performance and efficiency. In addition, we see
that the costs of solving the robust model are not high because the box plot of
NR does not considerably differ from OA.
Fig. 3. Box plots of proportional objective values based on all runs of all instances
MP MH S OA
MP – 1.58 × 10−4 2.61 × 10−4 2.71 × 10−4
MH 1.58 × 10−4 – 3.73 × 10−5 3.12 × 10−5
−4 −5
S 2.61 × 10 3.73 × 10 – 4.90 × 10−3
OA 2.71 × 10−4 3.12 × 10−5 4.90 × 10−3 –
378 A. Nourmohammadzadeh and S. Voß
According to the very low p-values, we can reject the null hypothesis claiming
the equality of averages for all pairs in the significance level of 0.01.
Generally, it can be deduced that this robust optimization with regard to the
existence of uncertainties on the right-hand side of the model can be properly
handled by the proposed matheuristic. So, we are able to provide driver rosters
that remain effective even when the daily working hours of drivers change, which
is very likely to happen over the course of the planning horizon.
5 Conclusions
In this work, we addressed the important problem of driver rostering of bus
companies with the consideration of uncertain daily working hours of drivers.
Concerning the fact that it is an uncertainty in right-hand side parameters of the
corresponding mathematical model, a robust optimization problem which is less
focused in the literature was surveyed. A matheuristic algorithm is proposed con-
sisting of proper exploration and exploitation with PSO, SA and POPMUSIC.
The results indicate that the combination of metaheuristics and mathematical
programming techniques can provide a strong optimization power in a short
time, which cannot be found in pure approaches of these types.
For extending this research, we can recommend applying the proposed
method to other similar assignment problems. Another direction is to enhance
the performance of the method by adding extra effective operators to it or to
try other concepts of embedding mathematical optimization in metaheuristics.
Investigation into fast and efficient parameter tuning can also be of great value.
References
1. Current traffic situation. https://www.der-metronom.de/fahrplan/aktuelle-
verkehrslage/. Accessed 20 June 2023
2. New survey reveals shocking shortage of bus drivers. https://www.unitetheunion.
org/news-events/news/2021/november/new-survey-reveals-shocking-shortage-of-
bus-drivers/. Accessed 20 June 2023
3. Barbosa, V., Alvelos, F., Respı́cio, A.: Bus driver rostering by column genera-
tion metaheuristics. In: Fonseca, R.J., Weber, G.-W., Telhada, J. (eds.) Compu-
tational Management Science. LNEMS, vol. 682, pp. 225–231. Springer, Cham
(2016). https://doi.org/10.1007/978-3-319-20430-7 29
4. Ben-Tal, A., Nemirovski, A.: Robust solutions of uncertain linear programs. Oper.
Res. Lett. 25(1), 1–13 (1999). https://doi.org/10.1016/s0167-6377(99)00016-4
5. Bergmann, B., Hommel, G.: Improvements of general multiple test procedures for
redundant systems of hypotheses. In: Bauer, P., Hommel, G., Sonnemann, E. (eds.)
Multiple Hypothesenprüfung / Multiple Hypotheses Testing, vol. 70, pp. 100–115.
Springer, Heidelberg (1988). https://doi.org/10.1007/978-3-642-52307-6 8
Bus Driver Rostering with Uncertain Daily Working Hours 379
6. Blickle, T., Thiele, L.: A comparison of selection schemes used in evolutionary algo-
rithms. Evol. Comput. 4(4), 361–394 (1996). https://doi.org/10.1162/evco.1996.4.
4.361
7. Box, G.E.P., Draper, N.R.: Response Surfaces, Mixtures, and Ridge Analyses, 2nd
edn. Wiley, Hoboken (2007)
8. Derrac, J., Garcı́a, S., Molina, D., Herrera, F.: A practical tutorial on the use of
nonparametric statistical tests as a methodology for comparing evolutionary and
swarm intelligence algorithms. Swarm Evol. Comput. 1(1), 3–18 (2011). https://
doi.org/10.1016/j.swevo.2011.02.002
9. Er-Rbib, S., Desaulniers, G., Elhallaoui, I., Munroe, P.: Preference-based and cyclic
bus driver rostering problem with fixed days off. Public Transp. 13(2), 251–286
(2021). https://doi.org/10.1007/s12469-021-00268-y
10. Ge, L., Kliewer, N., Nourmohammadzadeh, A., Voß, S., Xie, L.: Revisiting the
richness of integrated vehicle and crew scheduling. Public Transp. (2022). https://
doi.org/10.1007/s12469-022-00292-6
11. Ge, L., Nourmohammadzadeh, A., Voß, S., Xie, L.: Robust optimization for inte-
grated vehicle and crew scheduling based on uncertainty in the main inputs. In:
The Fifth Data Science Meets Optimisation Workshop at IJCAI-22. Vienna (2022).
https://sites.google.com/view/ijcai2022dso/. Accessed 30 Apr 2023
12. Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of
ICNN’95 - International Conference on Neural Networks, vol. 4, pp. 1942–1948
(1995). https://doi.org/10.1109/ICNN.1995.488968
13. Kawase, Y., Sumita, H., Fukunaga, T.: Submodular maximization with uncertain
knapsack capacity. arxiv (2018). https://arxiv.org/pdf/1803.02565
14. Ma, J.H., Liu, T., Guan, W.: Solving balanced bus crew rostering problem with
genetic algorithm. Appl. Mech. Mater. 361–363, 2070–2074 (2013). https://doi.
org/10.4028/www.scientific.net/AMM.361-363.2070
15. Maniezzo, V., Boschetti, M.A., Stützle, T.: Automatic design for matheuristics.
In: Matheuristics. EURO Advanced Tutorials on Operational Research, pp. 35–57.
Springer, Cham (2021). https://doi.org/10.1007/978-3-030-70277-9 2
16. Maniezzo, V., Stützle, T., Voß, S. (eds.): Matheuristics: Hybridizing Metaheuristics
and Mathematical Programming. Springer, New York (2010). https://doi.org/10.
1007/978-1-4419-1306-7
17. Mesquita, M., Moz, M., Paias, A., Pato, M.: A decompose-and-fix heuristic based
on multi-commodity flow models for driver rostering with days-off pattern. Eur. J.
Oper. Res. 245(2), 423–437 (2015). https://doi.org/10.1016/j.ejor.2015.03.030
18. Minoux, M.: On 2-stage robust LP with RHS uncertainty: complexity results and
applications. J. Global Optim. 49(3), 521–537 (2011). https://doi.org/10.1007/
s10898-010-9645-2
19. Minoux, M.: Two-stage robust LP with ellipsoidal right-hand side uncertainty is
NP-hard. Optim. Lett. 6(7), 1463–1475 (2012). https://doi.org/10.1007/s11590-
011-0341-z
20. Moz, M., Respı́cio, A., Pato, M.V.: Bi-objective evolutionary heuristics for bus
driver rostering. Public Transp. 1(3), 189–210 (2009). https://doi.org/10.1007/
s12469-009-0013-x
21. Nurmi, K., Kyngäs, J., Post, G.: Driver rostering for a Finnish bus transportation
company. In: IAENG Transactions on Engineering Technologies, vol. 7, pp. 15–35.
World Scientific, Singapore and Hackensack NJ (2012). https://doi.org/10.1142/
9789814390019 0002
380 A. Nourmohammadzadeh and S. Voß
22. Özaltın, O.Y., Prokopyev, O.A., Schaefer, A.J.: The bilevel knapsack problem with
stochastic right-hand sides. Oper. Res. Lett. 38(4), 328–333 (2010). https://doi.
org/10.1016/j.orl.2010.04.005
23. Taillard, É.D., Voß, S.: POPMUSIC – partial optimization metaheuristic under
special intensification conditions. In: Ribeiro, C., Hansen, P. (eds.) Essays and
Surveys in Metaheuristics, vol. 15, pp. 613–629. Springer, Boston (2002). https://
doi.org/10.1007/978-1-4615-1507-4 27
24. Xie, L., Suhl, L.: A duty-block network approach for an integrated driver rostering
problem in public bus transport. In: Longhurst, J., Brebbia, C.A. (eds.) Urban
Transport XVIII, pp. 295–306. WIT Transactions on The Built Environment, WIT
Press Southampton, UK (2012). https://doi.org/10.2495/UT120261
25. Xie, L., Kliewer, N., Suhl, L.: Integrated driver rostering problem in public bus
transit. Procedia. Soc. Behav. Sci. 54, 656–665 (2012)
26. Xie, L., Merschformann, M., Kliewer, N., Suhl, L.: Metaheuristics approach for
solving personalized crew rostering problem in public bus transit. J. Heuristics
23(5), 321–347 (2017). https://doi.org/10.1007/s10732-017-9348-7
27. Xie, L., Suhl, L.: Cyclic and non-cyclic crew rostering problems in public bus
transit. OR Spectrum 37(1), 99–136 (2015). https://doi.org/10.1007/s00291-014-
0364-9
Beyond Cargo Hitching: Combined People
and Freight Transport Using Dynamically
Configurable Autonomous Vehicles
1 Introduction
Dynamically Configurable Autonomous Vehicles (DCAVs) are a new class of
autonomous vehicles using a separable design concept consisting of drive and
transport units. The drive unit (also called the carrier, driveboard, or chassis)
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 381–395, 2023.
https://doi.org/10.1007/978-3-031-43612-3_24
382 J. J. A. Kortekaas et al.
By separating drive and transport units, providers can increase the utilization
of self-driving carriers (presumably, the most expensive assets) while increasing
profits, adapting to changing demand profiles such as people and freight on the
operational level. Typically, vehicle acquisition is a long-term investment, which
makes deciding on adequate fleet size and mix a complex strategical decision
[2,7]. Therefore, relying on a DCAV fleet may also mitigate losses due to dras-
tic demand shifts. For example, the COVID outbreak led to a surge in delivery
services while social distancing decreased mobility, exposing the fragility of trans-
portation systems based on single-purpose vehicles [12]. Ultimately, investing in
additional freight modules to accommodate a higher freight demand is expected
to be considerably cheaper than investing in entirely new freight vehicles.
Combined people and freight transportation systems have been gaining atten-
tion in the literature [3,9]. For example, solution approaches have been designed
for settings where people and parcel share rides on taxis (see, e.g., [8]) and
compartmentalized autonomous vehicles (see e.g., [1,13]). However, there is no
study on optimizing the operations of a DCAV-based transportation system
that dynamically switches transport units to fulfill heterogeneous demand. Only
concept designs have been evaluated (see, e.g., [10,11,15]) and presented by com-
panies such as the Continental BEE [4], the Mercedes Benz Vision Urbanetic [5]
and the Toyota e-palette [14].
Combined Transport Using Dynamically Configurable Autonomous Vehicles 383
changing modules for each trip. Even though demand fluctuates throughout the
day, the carrier is being used to serve requests for either passengers or parcels,
resulting in high carrier utilization.
We benchmark the performance of the DCAV fleet (consisting of carriers and
passenger and freight modules) with a typical single-purpose fleet (consisting of
dedicated passenger and freight vehicles). We consider carriers and dedicated
vehicles of modular and single-purpose fleets have to start and end their route
at a central depot. Vehicles are not allowed to arrive early and wait at a pickup
location since requests are made on-demand, and there is little room for parking
in city centers. However, parking locations are available for vehicles to wait until
a pickup is requested; these locations can also be used for modular vehicles to
switch modules. In contrast, modules do not have to start or end their route
at the depot. They can also start and end their route from a parking location
and do not have to return to the same location as where they originated from.
We assume there is no limit to the number of times a module can be switched,
except occupied modules, which are not allowed to be switched.
2 Problem Description
Fig. 3. Difference between single-purpose and modular DCAV fleets when servicing
the same passenger and freight request set.
3 Problem Formulation
The DCAVPDP is defined on a directed graph G = (V, A), consisting of vertices
V and arcs A, and requires synchronisation in time and space of the movement
of carriers and modules. We consider a grid service area where a total number
of nr requests is generated. The set of all vehicles is indicated by F , which is a
collection of all vehicles that are available to the model. The set of all vehicles is
further separated in the sets Fa , Fq , and Fm . All motorized vehicles in the model
are indicated by Fa , which is a combination of all passenger and freight vehicles
and the carriers. Fq is the set of all vehicles that have load-carrying capability,
which consists of the passenger and freight vehicles and the passenger and freight
modules. The final set regarding vehicles is Fm , which is a set of all modules in
the model. Where V k are all reachable vertices for vehicle k, VPk is a set of all
reachable pickup vertices for vehicle k ∈ F .
The set of traversable arcs Ak is further specified in the following sets: Am C,
APD , and AkW . The set Am
k
C describes all traversable arcs module m ∈ Fm can
travel along when being coupled to carrier c ∈ C, since modules are not capa-
ble of traveling between real-world locations without the assistance of a carrier.
However, modules are allowed to travel to couple vertices on their own to allow
for the distribution of modules across all parking locations. The number of park-
ing locations in the model is specified by np and f and 0 correspond to the
start and final vertex respectively. This ensures a module can be coupled at all
available parking locations on the grid after being decoupled from a carrier. AkPD
386 J. J. A. Kortekaas et al.
Table 1. Sets, parameters, and variables of the DCAVPDP and dedicated people and
freight integrated problem.
Sets
C, Mp , Mf Carriers and passenger and freight modules (modular fleet)
Kp , K f Passenger and freight vehicles (dedicated fleet)
F = Kp ∪ Kf ∪ C ∪ Mp ∪ Mf . Set of all vehicles (fleet)
Fa = Kp ∪ Kf ∪ C. Set of powered vehicles (autonomous)
Fq = Kp ∪ Kf ∪ Mp ∪ Mf . Set of vehicles with load carrying capability
Fm = Mp ∪ Mf . Set of all modules
Vk Set of all reachable vertices for vehicle k ∈ F
VPk Set of all reachable pickup vertices for vehicle k ∈ F
VP = k∈F Vpk
k
Vpark Set of all vertices where vehicle k is allowed to wait
Ak Set of all traversable arcs for vehicle k ∈ F
AmC Set of arcs module m ∈ Fm can only traverse while spatially synchronised with a
compatible carrier c ∈ C
AkPD Set of traversable arcs between request pairs for vehicle k ∈ F
AkW Set of arcs for vehicle k ∈ F where waiting is not allowed at the arrival vertex
Parameters
Vehicles
nk Number of available vehicles with load carrying capability
nc Number of available carriers
Qk Capacity of vehicle k ∈ F
vk Speed of vehicle k ∈ F
Requests
nr Number of requests
qi Demand of request at vertex i ∈ VP
ei Earliest pick up time of request at vertex i ∈ VP
δi Maximum waiting time of request at vertex i ∈ VP
Δi Maximum extra ride time of request at vertex i ∈ VP
Model
0 Start vertex for all vehicles
f Final vertex for all vehicles
np Number of parking locations
si Service time at vertex i ∈ V
di,j Travel distance from vertex i to vertex j
ti,j Travel time from vertex i to vertex j
wm Weight of module travel distances
wrejection Requests rejection penalty
S Simulation start time
H Simulation horizon
Variables
xki,j (Binary) 1 if vehicle k traverses arc (i, j), 0 otherwise
τik Arrival time of vehicle k at vertex i
ωik Load of vehicle k after vertex i
rik Ride time of request i on vehicle k
Combined Transport Using Dynamically Configurable Autonomous Vehicles 387
is a set of traversable arcs for vehicle k ∈ F that are composed of a pickup and
delivery pair. Finally, AkW is a set of traversable arcs for vehicle k ∈ F on which
arriving early is not allowed since vehicles are not allowed to wait at request
locations.
The total number of requests in the model is specified by nr . Each request
has a pickup vertex i ∈ VP , which has a demand size of qi . The corresponding
delivery vertex has a demand size of −qi , effectively emptying the vehicle again.
The pickup time window of request i is determined by the earliest pickup time
ei of the request and its maximum waiting time δi . The earliest pickup times of
requests are being generated during simulation horizon H, starting from simula-
tion start time S. Requests do not have a set delivery time window, instead, each
request has a maximum extra ride time Δi . The destination of a request has to
be reached within the sum of the travel time and the maximum extra ride time.
Additionally each vertex i ∈ V has a service duration of si seconds. nk specifies
the number of available vehicles in the model that have a load-carrying capa-
bility (passenger and freight vehicles as well as passenger and freight modules),
while nc specifies the number of carriers available to the model. All vehicles have
a set capacity of Qk and an average speed of v k . Table 1 shows an overview of
all sets and parameters used in the mixed-integer linear programming model.
The formulation of the MILP is as follows:
Minimize:
xki,j di,j + wm xki,j di,j
k∈Fa (i,j)∈Ak k∈Fm (i,j)∈Ak
− xk0,f + (1 − xki,j ) ∗ wrejection (1)
k∈Fa i∈VP k∈Fq (i,j)∈Ak
Subject to:
xki,j ≤ 1 ∀i ∈ VP (2)
k∈Fq (i,j)∈Ak
xki,h − xkh,j = 0 ∀(i, j) ∈ AkPD , ∀k ∈ Fq (3)
(i,h)∈Ak (h,j)∈Ak
xk0,j = 1 ∀k ∈ F (4)
0,j∈Ak
xki,f = 1 ∀k ∈ F (5)
i,f∈Ak
xkh,i − xki,j = 0 ∀i ∈ V k \{0, f}, ∀k ∈ F (6)
(h,i)∈Ak (i,j)∈Ak
xm
i,j − xki,j = 0 ∀(i, j) ∈ Am
C , ∀m ∈ Fm (7)
k∈C
388 J. J. A. Kortekaas et al.
The objective function (1) aims to minimize the travel distance of all motor-
ized vehicles, as well as the total number of motorized vehicles used to find an
optimal solution. Secondly, the travel distance of modules is also minimized,
however, it is deemed less important and is offset by a factor of wm . To allow
the model to reject requests a penalty of wrejection is added if a pickup vertex
is not visited. Constraints (2) to (6) are the general routing constraints, and
(7) to (9) are spatial and temporal synchronisation constraints to couple the
movement of carriers and modules. The time window constraints are described
by constraints (10) to (13) and subsequently constraints (14) and (15) are the
ride time constraints. Finally, constraints (16) to (18) are the vehicle load con-
straints. Equations (19) to (22) describe the decision variables in the model.
Constraint (2) ensures that every request’s pickup vertex is visited by at most
one vehicle, however, every vertex does not need to be visited. A vehicle that
has visited the pickup vertex of a request must also travel to the request’s des-
tination vertex, which is ensured by (3). Constraints (4) and (5) guarantee that
every vehicle starts it’s route at the start vertex and ends at the final vertex
respectively. To preserve flow conservation, constraint (6) enforces every vehicle
to leave the same vertex it has entered. Constraint (7) is a spatial synchronisation
constraint to ensure modules are only able to traverse arcs between two different
real-world locations (Am C ) while being coupled with a carrier. Constraints (8)
and (9) are temporal synchronisation constraints that impose identical arrival
Combined Transport Using Dynamically Configurable Autonomous Vehicles 389
times of coupled carriers and modules. The arrival time of vehicles is defined by
constraint (10). Constraint (11) ensures that a request is picked up after their
earliest pick-up time and within their maximum waiting time. A vehicle should
arrive at the destination vertex of a request pair after the pickup vertex has been
visited, which is guaranteed by (12). Vehicles are not allowed to arrive early and
wait at request pickup locations, constraint (13) ensures that for the set of arcs
on which waiting is not allowed (AkW ) vehicles won’t arrive before the earliest
pickup time. Constraint (14) defines the ride time of a request, and (15) makes
sure the maximum extra ride time of a request is not exceeded. Constraint (16)
defines the vehicle load and (17) enforces the load of a vehicle not to exceed the
maximum capacity of the vehicle. Constraint (18) guarantees that vehicles are
empty when arriving at parking or decoupling vertices, to ensure no occupied
vehicles or modules are parked or decoupled. Finally, constraints (19) to (22)
define the model’s decision variables, which are the traversed arcs, arrival time,
vehicle load and ride time variables respectively.
4 Numerical Study
In this section, the configuration of the MILP for the different instances will be
discussed, as well as the generation of the passenger and freight request data.
Next, the general parameters of the model will be given, followed by a discussion
of key performance indicators of the transport systems.
We test the model on two fleet configurations (viz., single-purpose and DCAV
fleets) using the same request data. We consider four different demand scenarios
throughout the simulation horizon H:
Values nk and nr are divided equally between passenger and freight vehicles
and requests, respectively. Additionally, for each module in the model, there is a
carrier available to be used, thus, the number of carriers nc is equal to the number
of vehicles nk . In the worst-case scenario, the DCAV fleet could behave similarly
to the single-purpose fleet by assigning a module to each carrier and having the
same routes as the single-purpose vehicles. The parameter groups (H, nk , nn )
are as follows: (1, 2, 4), (1, 2, 8), (1, 2, 12), (1, 4, 4), (1, 4, 8), (1, 4, 12), (1, 6, 4),
(1, 6, 8), (1, 6, 12), (8, 2, 12), (8, 2, 16), (8, 2, 24), (8, 4, 12), (8, 4, 16), (8, 4, 24).
An overview of the 720 instances generated can be found in Fig. 4.
Fig. 4. Instances for the DCAVPDP and the benchmark single-purpose problem (ded-
icated vehicles for each commodity).
time between opposing corners of the grid is 800 s. Passenger and freight vehicles
and modules have a capacity Qk = 2 for both the single-purpose and the DCAV
fleet, respectively. Typically, passengers are more sensitive to waiting times and
extra ride times, this is reflected in the model by giving more strict values to
passenger requests. The maximum waiting time δi for passenger requests is 300
s and the maximum extra ride time Δi is 900 s. For freight requests, δi = 900
s and Δi = 1800 s. Regarding the service duration si , we assume a vehicle has
to wait 120 s when servicing passengers and 300 s when picking up a parcel.
Besides, we consider the modular DCAV fleet has an additional service duration
si = 120 s to switch (i.e., couple or decouple) modules. Finally, rejecting requests
incurs a penalty wpenalty = 105 and the traveling of modules is weighted down
using wm = 10−3 .
Our numerical study aims to assess whether a fleet of DCAVs improves vehicle
utilization and occupancy while maintaining an equivalent service quality com-
pared to a traditional single-purpose fleet. In order to highlight the differences
between these two fleets, we consider the following fleet management perfor-
mance metrics:
The numerical study has been performed on an AMD Ryzen 5 3600 4Ghz CPU
with access to 32GB of RAM. Python version 3.8.8 and commercial solver Gurobi
Optimizer 9.1.2 were used to implement the DCAVPDP. All instances were run
to optimality.
392 J. J. A. Kortekaas et al.
Fig. 5. Average fleet size and distance traveled for all fleet types across the four demand
distribution scenarios.
Table 2. Performance difference of the modular DCAV fleet compared to the single-
purpose fleet. The final column shows the number of ran instances that show better
performance for the DCAV fleet.
Performance Indicator Avg. Diff BLS PPS FPS COS #DCAV best
Utilization rate (on-duty) 4.82% −4.80% 3.82% −1.14% 24.36% 241/360
Utilization rate (total) 23.75% 2.90% 16.82% 7.78% 74.71% 326/360
Occupancy rate −1.30% −1.49% −0.47% −2.35% −0.97% 240/360
Service rate −0.08% −0.15% 0.00% −0.15% 0.00% 329/360
Avg. pickup time −9.70% −22.23% −10.27% −8.17% 3.28% 219/360
Avg. ride times −4.57% −9.73% −1.72% −8.15% −1.51% 330/360
Fleet capacity 7.58% 13.6% 5.75% 9.74% 1.81% 266/360
Fleet size −18.77% −4.80% −17.62% −8.24% −42.60% 346/360
Distance traveled −1.52% −1.27% −1.32% −1.21% −2.38% 218/360
6 Conclusion
This study has presented a MILP formulation for the Dynamically Configurable
Autonomous Vehicle Pickup and Delivery Problem (DCAVPDP). The model pro-
vides the first quantitative method investigating the benefits of a DCAV-based
transport system, which leverages automation and a novel automotive modu-
lar design concept to service multiple demand types by dynamically configuring
vehicle upper compartments. We compare the performance of a single-purpose
fleet consisting of dedicated passenger and freight vehicles and a DCAV fleet con-
394 J. J. A. Kortekaas et al.
References
1. Beirigo, B.A., Schulte, F., Negenborn, R.R.: Integrating people and freight
transportation using shared autonomous vehicles with compartments. IFAC-
PapersOnLine 51, 392–397 (2018). https://doi.org/10.1016/j.ifacol.2018.07.064
2. Beirigo, B.A., Schulte, F., Negenborn, R.R.: A learning-based optimization app-
roach for autonomous ridesharing platforms with service-level contracts and on-
demand hiring of idle vehicles. Transp. Sci. 56(3), 677–703 (2022). https://doi.
org/10.1287/trsc.2021.1069
3. Cheng, R., Jiang, Y., Nielsen, O.A.: Integrated people-and-goods transportation
systems: from a literature review to a general framework for future research, pp.
1–24 (2023). https://doi.org/10.1080/01441647.2023.2189322
4. Continental: Continental launches bee in the urban world of the future (2017).
https://www.continental.com/en/press/press-releases/bee-96328. Accessed 28
June 2023
5. Daimler: Vision urbanetic (2019). https://media.mercedes-benz.com/article/
c518445b-9050-42ec-9be1-3446e1bf3973. Accessed 28 June 2023
6. Drexl, M.: On some generalized routing problems. Ph.D. thesis, Faculty of Busi-
ness and Economics, RWTH Aachen (2007). http://publications.rwth-aachen.de/
record/62536
7. Hörl, S., Ruch, C., Becker, F., Frazzoli, E., Axhausen, K.: Fleet operational policies
for automated mobility: a simulation assessment for Zurich. Transp. Res. Part C:
Emerg. Technol. 102, 20–31 (2019). https://doi.org/10.1016/j.trc.2019.02.020
8. Li, B., Krushinsky, D., Reijers, H.A., Van Woensel, T.: The share-a-ride problem:
people and parcels sharing taxis. Eur. J. Oper. Res. 238(1), 31–40 (2014). https://
doi.org/10.1016/j.ejor.2014.03.003
9. Mourad, A., Puchinger, J., Chu, C.: A survey of models and algorithms for optimiz-
ing shared mobility. Transp. Res. Part B: Methodol. 123, 323–346 (2019). https://
doi.org/10.1016/j.trb.2019.02.003
10. Münster, M., et al.: U-shift vehicle concept: modular on the road. In: Bargende,
M., Reuss, H.-C., Wagner, A. (eds.) 21. Internationales Stuttgarter Symposium.
Combined Transport Using Dynamically Configurable Autonomous Vehicles 395
Berlin School of Economics and Law, Badensche Straße 52, 10825 Berlin, Germany
{daduna,Daniel.Philipp}@hwr-berlin.de
Abstract. In the context of cohesion policy, one of the main points is the divergent
development between urban and rural areas. One critical problem is the undersup-
ply of rural areas due to low demand potential. In addition, there are inadequate
public transport services for many residents who, for various reasons, do not
have access to private motorized transport. With technical advances in the field of
road transport vehicles and comprehensive digitization, autonomous driving has
opened up opportunities in recent years for development of new (flexible) forms of
service based on quasi-individual mobility. The technical foundations required in
this regard as well as the organizational and legal framework conditions are high-
lighted and critically analyzed. Using the example of health care in rural areas, a
mobility concept is outlined that offers a suitable solution in conjunction with a
hierarchically centralized location structure of medical care facilities. This app-
roach is also suitable for reducing further structural deficits and supporting the
creation of equal living conditions on a broader level, also with a view to broad
participation in social and community life in rural areas.
1 Introduction
With the Second Industrial Revolution [17, 96], profound changes began (especially in
the central European regions and North America) in the settlement and economic struc-
tures that had until then been largely dominated by agriculture and handicraft businesses
The starting point for industrial developments was, in addition to new technologies, espe-
cially innovations in the transport sector. Thus, with the emergence of rail transport ([99]
74–86; [103] 565–573; [54] 234–244) for the first time an efficient transport mode was
available for freight and passenger transport, the network design and operation of which
were largely independent of geographical, topographical and weather-related influences.
Very quickly, the (technical) performance capabilities as well as cost reductions of about
90% became apparent in comparison to the previously existing modes of land and water
transport ([102] 95–96).
This was associated with the emergence of industrial areas and later also of service
centers with a constantly increasing demand for labor, which had to be met mainly from
rural residents or by immigration. This led to an extreme population growth in urban
regions ([54] 135–136), which inevitably brought about changes in social, economic
and work structures. An essential prerequisite, however, was a fundamental change in
agriculture ([46] 635–654). Significant increases in production with a lower demand for
labor due to efficiency improvements on the agricultural estates and the transition from
subsistence farms, which were widespread at the time, to commercial farming ([54]
141–157) or, through corresponding imports, to ensure the food basis in urban regions.
These developments have led to a divergence of living conditions between urban and
rural areas over many years, which has resulted in considerable inequalities in social par-
ticipation [68] and economic structures. This situation has been and can still be observed
worldwide, with varying degrees of intensity. However, it is politically undesired, espe-
cially because of the inherent potential for social conflict, so that suitable concepts
and approaches have been discussed for some time with the objective of eliminating
structural disproportions and enabling equal living conditions. Thus, one of the prior-
ity objectives in the European Union (EU) is the implementation of a cohesion policy
[27], increasingly also in the context of national measures of a regional policy [19]. For
Germany, this has been written down in a legally binding way in Sect. 1 of the Regional
Planning Act (ROG) for several years. One of the measures in this regard is to ensure
that mobility for residents living in rural areas meets their needs, which is currently
inadequate, ([6] 29–32). The emerging transition to autonomous driving in road traffic
[79] and the possibility of a quasi-individualization of mobility in public transport (see
Sect. 3) is a development step to enable profound changes.
In this paper, we show the extent to which new forms of mobility can be used in rural
areas, to sustainably improve existing structural problems in health care, since these are
of major importance for the population there. First, some aspects of spatial planning, in
rural areas are considered. Subsequently, the operational and technical design options
for improving mobility through the expansion of local and regional public transport
will be addressed, with a focus on autonomous driving vehicles. In the following, it
will be analyzed to what extent these measures can guarantee demand-oriented access
to health care services for all population groups in rural areas. A key point here is the
extent to which decentralized structures are desirable and feasible on the one hand, and
centralization is necessary on the other, also from an economic and technological point
of view. Finally, expectations and further development are discussed.
associated with increasing distances for parts of the inhabitants, depending on the loca-
tion of the respective higher-level locations. With regard to accessibility, the availability
of sufficient mobility, not only in the local area, is therefore absolutely mandatory.
The problem of service provision in rural areas is clearly shown by the structural
allocation of areas at the European level, starting from the NUTS classification (Level
3: Small regions for specific diagnoses) ([87] 13). This shows that densely populated
as well as rural regions with aggregation approaches clearly dominate the European
settlement structures. However, there is a certain problem with this (relatively rough)
classification, which does not reflect the significant regional specific differentiation. In
addition, there are also a number of different classification concepts, also for Germany.
Moreover, it must be taken into account that the framework conditions in the respec-
tive counties differ greatly in some cases, depending on demographic developments as
well as on local labor market situation and economic framework conditions ([6] 30–33).
In addition, there are regional location factors, especially with regard to dominant forms
of economic use. An additional influencing factor is the spatial closeness of a county to
(industrial) agglomerations or densification spaces, which can have inducing centrality
effects. In order to ensure an adequate service structure for the population in rural areas,
the availability of sufficient mobility is absolutely necessary. However, there are con-
siderable deficits in this regard. One major reason for this is the technical developments
in the field of individual mobility since the mid-1950s ([59, 97] 30–32). The steadily
growing availability of private cars has led to reductions in public transport, as demand
for it has become increasingly low. In 2017, the modal split in rural areas with small town
and village structures was only 5%, compared to 70% for private motorized transport
([75], 40). Also, more than one third of all families here own at least two cars (30% have
two cars, 6% three and more) ([75], 35).
This shows that a not inconsiderable share of the population in rural areas has no
or only limited access to mobility [58]. Essentially, these are people without a driver’s
license (especially young people), people who cannot drive a vehicle due to their health
situation or age, and people who do not own a car for financial reasons [68, 90] An
Impact of Public Transport Development on Health Care Services 399
expansion of the actually only rudimentary public transport service is not economically
feasible due to the subsidies required for this [35]. This means that ultimately there is
no sufficient mobility available and thus the framework conditions are not at all in place
to be able to guarantee even approximately equivalent living conditions.
For some years now, developments toward autonomous driving have been achieved
in the motor vehicle sector, also in conjunction with significant advances in digitization.
Based on the current status, this technology will be fully ready for the market before the
end of this decade, as shown by the ongoing research work worldwide as well as many
test projects and initial implementations [67]. On this basis, there is the possibility of
significantly and sustainably improving public transport in rural areas with new services
and thus coming more closely to the goal of equal living conditions.
The known public transport services with minibuses and standard buses cannot be an
adequate solution for operating in rural areas, both in terms of suitable structures and, in
particular, for financial reasons. In their current structures, they cannot ensure sufficient
quality of service ([6, 78] 29–32). For this reason, there have already been attempts
several decades ago to develop better alternatives based on demand-oriented services
for rural areas [9, 21, 66], but also in the suburban transition zones [86, 100]. A number
of concepts have been developed, not only in Germany, and have also been tested in
some cases, but often not implemented in regular operations.
The main reasons for this were the complexity of operational monitoring and control
as well as the inadequate communication capabilities between provider and customer.
In addition, there were labor law restrictions regarding to the driving personnel, which
led to limitations in the operational planning of vehicle and duty schedules and also to
considerable costs [66]. The necessary political support from local authorities was not
always available due to the introduction of competing services for the already econom-
ically weakened public transport companies owned by the municipalities. In addition,
there was resistance from the trade unions due to the flexible working time models
associated with the modified forms of service [66].
For a number of years, therefore, considerable effort has been made to develop
on-demand services with autonomous driving vehicles [67]. This also applies to broad
research projects in the field of general public road traffic, such as the Niedersachsen
Test Field project [3]. Standard legal framework conditions as a basis for the certification
and operation of such vehicles have now been laid down in the Road Traffic Act and in
the Autonomous Vehicles Approval and Operation Regulation of June 24, 2022.
Significant advantages for a service design based on such innovative operating con-
cepts in public transport are the elimination of costs for the driving personnel as well as the
partly (time-related) very restrictive labor law limitations. Added to this are far-reaching
developments in vehicle technology, widespread digitization in operational monitoring
and control as well as regarding information management. On this basis, more and more
projects have been initiated worldwide [1, 42], where autonomous driving plays a sig-
nificant role. One project in Germany is the public transport service with minibuses in
400 J. R. Daduna and D. Philipp
the area of Bad Birnbach [7]. In the following, the development of autonomous driving
is outlined first and then the design options are discussed.
The most important basis for the definition of autonomous driving is a classification
based on the J 3016 standard of SAE International [88]. This includes six levels from
SAE Level 0 to SAE Level 5, which define the different functions between the driver and
the automated system (0: No form of automation, 1: Use of automated driver assistance
systems (ADAS), 2: Partial automation, 3: Conditional automation, 4: High automation,
5: Full automation (autonomous driving). The final objective is to bring vehicles with
level 5 into service [52] that means, all vehicle movements are controlled and monitored
by automats. The essential aspect, which is to be seen as the basis of a quasi-individual
mobility, is that point-to-point connections are possible, in which a vehicle can drive
directly to a customer depending on demand.
The main foundations have been laid in recent years as part of the development
of ADAS components [1, 4], that are already being used in practice. These include
supporting functions such as the automatic speed adaptation, lane departure warning,
adaptive cruise control, blind spot information system, collision avoidance, moving
object detection, intelligent speed adoption and traffic sign recognition. In addition,
there are ad hoc communication networks such as vehicle-to-vehicle (V2V) or vehicle-
to-infrastructure (V2I) networks for a more efficient and environmentally oriented traffic
flow management and to increase traffic safety [8, 107].
Automated detection systems installed at specific positions on the vehicle are used
to continuously record the surrounding data. These include optical systems (cameras),
radar-based sensors, and lightwave-based 3D techniques (e.g., 3D laser systems and
LiDAR systems) [1, 47, 108]. The destination positioning as well as the routing are based
on data from (digital) Geographic Information Systems (GIS) [48, 50] in connection with
Global Navigation Satellite Systems (GNSS), such as the US American Differential
Global Positioning System (DGPS) or the system Galileo developed in the EU [45, 48,
50]. Applications of autonomous driving in open area are also already possible at a
contemporary time [80]. Based on the current state of development and the first steps
toward realization, it can be assumed that the essential prerequisites for autonomous
driving are in place in view of the technical developments. Deficits exist primarily in
some necessary adjustments to the legal framework and also in the not always existing
social acceptance, in spite of all the significant advantages.
On the basis of autonomously driving vehicles, completely new design options arise
for public transport, especially for services in rural areas [15, 71, 94]. In addition to
the available technical bases, it is the replacement of the driver by intelligent automats
that enables the independence from, in particular, labor law restrictions and drastically
reduces the previously incurred personnel costs. The essential step here is a demand-
oriented 24/7 service, through which quasi-individual mobility can be provided in rural
Impact of Public Transport Development on Health Care Services 401
areas. However, some legal issues need to be clarified regarding the integration of these
services into the legal framework of the Passenger Transportation Act (PBefG) [83].
In the foreground are four basic types of services that can be applied here, point-to-
point connections, on-demand based line and line band services as well as area services.
For point-to-point connections, services can also be provided by robotaxis [43, 55] and in
the framework of free-floating carsharing services [49] while in the other cases the focus
will be on the use of minibuses and midibuses [7, 74]. In addition, ridesharing concepts
[33, 64] can also be integrated, especially with regard to local feeder transport. However,
the introduction of these services cannot imply that the existing ones are replaced, as it
has been formulated, for example, by [31] with the convergence theory. In fact, they are
a necessary supplement for a service in the area, but not a substitution of the existing
regional bus services and of services in the regional rail passenger transport, which will
continue to be absolutely necessary within the framework of commuter services due to
the demand structures. However, exceptions exist [91] if demand on certain lines or line
sections does not allow for economic operation and cannot be expected to do so in the
future.
This means that in order to strengthen public transport in rural areas, it is absolutely
necessary to combine the existing scheduled with possible on-demand services in order
to be able to guarantee an adequately efficient public transport there. The main objective
must be an extension of the service, especially in regions where it has not been feasible
so far due to the existing framework conditions. With the new forms of service, suitably
adapted organizational and operational structures are also required. The essential aspects
in this context are outlined in a rough framework in the following section.
also ideological) point of view may not play a role in this context, because philosophical
brain games cannot ultimately replace decisions and their implementation, nor can they
cause damage.
the year 2020, 922 cases are indicated in the overview for Rostock, whereas the least
frequented clinics each have less than 10 treatments to report.
In addition to the burden of high costs, there is an increasing shortage of specialists
and expertise to secure adequate care for patients. A look at the development of the
overall situation of nursing staff in the nephrology sector in relation to the increasing
numbers of elderly people shows a clear discrepancy. At the same time, there are fewer
and fewer classical nephrology nursing staff in relation to the patients to be cared for,
also because of the now increasing retirement of veteran nursing staff [84]. As there will
tend to be more and more patients requiring renal replacement therapy such as dialysis
due to demographic change, nationwide care is a problem.
Radiological Methods of Treatment. According to the German Radiology Network,
about 6,800 of the licensed physicians in Germany have completed training as special-
ists in radiology/diagnostic radiology and also work in this field. With about 2% of all
licensed physicians, they form one of the smaller groups of specialists. Approximately
3,100 radiologists are active in the established sector, with further dovetailing between
the outpatient and inpatient sectors. Looking at the different practice sizes, it becomes
clear that the number of radiology individual practices is declining. Of the approximately
1,100 radiology centers in Germany, fewer than 300 are still run as individual practices.
This is a significant decrease compared to the approximately 400 individual practices in
2016 [76], thus a trend toward the necessary centralization is evident here. The remain-
der are already group practices. Around 600 radiologists work in group practices with
two partners and just under 900 radiologists are organized in group practices with 3–5
partners. With almost 350 radiologists, a not insignificant proportion is accounted for
large group practices with six or more partners. Such large practices can be compared to
medium-sized companies. They often have up to 50 employees and operate equipment
worth well over 10 million euros in some cases. Private practice radiology is currently in a
state of flux, and consolidation is taking place with the formation of large, supra-regional
practice conglomerates [76].
Nationwide radiological care without teleradiology is already hard to implement
today [69]. One of the main reasons for this success lies in the working methodology
in the field of radiology. For example, an essential part of the work of radiologists is
predominantly the interpretation of medical images, which are produced by medical-
technical assistants and transmitted digitally by the equipment, be it conventional X-ray
machines, CT or MRI devices, to be viewed and evaluated on a suitable monitor. Here it
becomes clear that radiological care, in comparison to other medical disciplines, depends
less on the universal availability of specialists, but rather on the availability of sufficiently
trained (nursing) staff and the necessary technical equipment. Few radiological proce-
dures, such as interventional radiology, conventional X-ray fluoroscopy or sonography,
absolutely require the on-site presence of radiologists [69]. Since the number of radio-
logical examinations in Germany is growing by about 10% annually, but only 2–3% of
newly trained radiologists are coming on board [81], we already see modern approaches
to solutions here. However, the basis for this is the provision of technical equipment
both in the field of data collection as also in data processing and transmission. Another
essential point is to ensure data security both for the transmission and the archiving of
406 J. R. Daduna and D. Philipp
data. Combined with the growing need for medical-technical assistants [25], this form
of universal care is facing problems.
For example, if one considers the imaging diagnostic procedures performed in MV
alone, the extent of the demand becomes apparent. The Federal Health Report con-
tains approx. 6,794,000 CT examinations and approx. 1,919,000 MRI examinations of
full inpatients in hospitals for 2021. Compared to 2020, this is an increase of about
321,000 CT examinations and 50,000 MRI examinations in one year. This might hint
that the importance of imaging diagnostics, especially CT examinations, has increased
in Germany in the recent years. The path already taken towards partial centralization
of radiological treatments in centers or larger practices offers the possibility of exploit-
ing the synergies associated with the increasing size of the practices in terms of staff
deployment, equipment procurement, service contracts, etc. [76].
Skin Cancer Detection. Malignant neoplasms of the skin (black skin cancer) are among
the most common types of cancer worldwide. The frequency is increasing rapidly, but
it is easily treatable if detected early [24]. Routine diagnostics for melanoma detec-
tion include full-body visual inspection, often supplemented by dermoscopy, which can
significantly increase the diagnostic accuracy of experienced dermatologists [24]. A pro-
cedure that is additionally offered in some practices and clinics is Total Body Digital
Photography (TBDP) in combination with digital dermoscopy. TBDP is also used in
smartphone apps that serve the self-assessment of the skin by the user. The problem here
is the lack of consistency in the image and light quality of the images used [65].
In recent decades, several non-invasive methods have been developed for the ex-
amination of suspicious pigmented lesions that have the potential to allow improved
and, in some cases, automated assessment of these lesions. Currently, the most promising
approach may be the analysis of routine macroscopic and dermoscopic images by AI [51].
In the classification of pigmented skin lesions based on macroscopic and dermoscopic
images, AI, especially in the form of neural networks, has achieved diagnostic accuracy
comparable to dermatologists in numerous experimental studies. Prerequisites for the
transfer of such diagnostic systems into dermatological routine are the comprehensibility
of the system decisions by the user as well as a uniformly high performance of the
algorithms on image data from other clinics and practices.
As seen in recent studies, computer-aided diagnosis systems unfold the greatest
benefit primarily as assistance systems [12, 101]. The physician/machine combination
achieves the best results and AI-based diagnostic systems help to record morphological
features quickly, quantitatively, objectively and reproducibly and could thus, in addition
to medical experience, provide a more objective basis for analysis. Looking at the num-
bers delivered by the National Association of Statutory Health Insurance Physicians
(KBV) from 2020 illustrates the problem. If one compares, for example, the density
of dermatologists in the spatial planning regions of Mecklenburgische-Seenplatte (MS),
Mittleres Mecklenburg/Rostock (MM), Vorpommern (VP), Westmecklenburg (WM) and
Berlin, at first glance the picture is homogeneous; MS has 6.1 dermatologists per 100,000
inhabitants, MM 7.6, VP 7.4, WM 5.2 and Berlin 7.6. However, if one converts the physi-
cian density to the area, different ratios result. Berlin has 0.3125 dermatologists per km2 ,
while the region of MS has 0.0029 dermatologists per km2 (MM 0.009; VP 0.0048; WM
0.0034). This means that, calculated on the area, dermatologist care in Berlin is 109 times
Impact of Public Transport Development on Health Care Services 407
higher than in the area of MS. This implies significantly longer journeys for the patients
concerned, while mobility deficits are also evident here.
Ongoing Hospital Reform. In 2018, the G-BA defined minimum structural require-
ments for emergency care in three levels for the first time. This was an important first
step towards a nationwide uniform definition of care structures and the creation of trans-
parency. In December 2022 the Government Commission for Modern and Needs-Based
Hospital Care has presented a reform proposal for hospital remuneration and a fixed
definition of structural prerequisites in the sense of a minimum required hospital-wide
structural quality [14]. The reform proposals of the government commission refer to
inpatient and emergency care. Nevertheless, it also responds to problems, experiences
and developments that can also be found in the outpatient sector. The politically forced
restructuring of the hospital landscape, insofar as it is implemented, leads to a further
centralization of hospital locations. The three examples described above are primar-
ily long-term plannable and recurring treatments. Nevertheless, the coming structural
change is an opportunity to further dovetail outpatient and inpatient care.
40 km it is additionally six and at r = 30 km again nine more (see Table 1). Only in the
western and southern peripheral areas are there smaller gaps, but these can be covered
in cooperation with the neighboring federal states of Schleswig-Holstein (Lübeck) and
Niedersachsen (Lüneburg) in the west and Brandenburg (Prenzlau) in the south. In the
area of the Baltic Sea coast, on the other hand, complete coverage is required, due among
other things to the geographic conditions, so that the Fischland-Darß-Zingst peninsula
and the island of Rügen can be included in the coverage.
This solution does not include the existing settlement structures in detail, the distance-
based accessibility of the locations within the existing transport networks and also
Impact of Public Transport Development on Health Care Services 409
(regional political) influences. In this respect, the outlined structure must be understood
as an initial basis for discussion, which must be specified more precisely at the political
level and the responsible technical levels.
References
1. Ainsalu, J., et al.: State of the art of automated buses. Sustainability 10(9), 3118 (2018)
2. Arzt, C., Kleemann, S., Plappert, C., Rieke, R., Zelle, D.: Datenverarbeitung und Cyber-
sicherheit in der Fahrzeugautomatisierung - Rechtliche und technische Anforderungen im
Verbund. Zeitschrift für IT-Recht und Recht der Digitalisierung 25(7), 593–614 (2022)
3. Asbach, L., Ortgiese, M.: Automatisierte Fahrzeuge in Europa sicher betreiben. Interna-
tionales Verkehrswesen 75(1), 22–26 (2023)
4. Ayyasamy, S.: A comprehensive review on advanced driver assistance system. J. Soft
Comput. Paradigm 4(2), 69–81 (2022)
5. Avermann, N., Schlüter, J.: Determinants of customer satisfaction with a true door-to-door
DRT service in rural Germany. Res. Transp. Bus. Manag. 32, 100420 (2019)
6. Baba, L., Wilbert, K.: Soziale Stadt-und Ortsentwicklung in ländlichen Räumen. Bundesin-
stitut für Bau-, Stadt-und Raumforschung (BBSR) (2022)
7. Barillère-Scholz, M., Büttner, C., Becker, A.: Mobilität 4.0: Deutschlands erste autonome
Buslinie in Bad Birnbach als Pionierleistung für neue Verkehrskonzepte. In: Riener, A.,
Appel, A., Dorner, W., Huber, T., Kolb, J.C., Wagner, H. (Hrsg.) Autonome Shuttlebusse
im ÖPNV, pp. 15–22. Springer, Heidelberg (2020). https://doi.org/10.1007/978-3-662-594
06-3_2
8. Bento, L.C., Parafita, R., Rakha, H.A., Nunes, U.J.: A study of the environmental impacts
of intelligent automated vehicle control at intersections via V2V and V2I communications.
J. Intell. Transp. Syst. 23(1), 41–59 (2019)
9. Berg, J., Ihlström, J.: The importance of public transport for mobility and everyday activities
among rural residents. Soc. Sci. 8, 58 (2019)
10. Bernhard, C., Oberfeld, D., Hoffmann, C., Weismüller, D., Hecht, H.: User acceptance of
automated public transport - valence of an autonomous minibus experience. Transp. Res. F
70, 109–123 (2020)
11. Bissell, D., Birtchnell, T., Elliott, A., Hsu, E.L.: Autonomous automobilities - the social
impacts of driverless vehicles. Curr. Sociol. 68(1), 116–134 (2020)
12. Brinker, T.J., Schlager, G., French, L.E., et al.: Computerassistierte Hautkrebsdiagnose.
Hautarzt 71, 669–676 (2020)
13. Busse, R., Berger, E.: Vom planerischen Bestandsschutz zum bedarfsorientierten Kranken-
hausangebot. In: Klauber, J., Geraedts, M., Friedrich, J., Wasem, J. (Hrsg.) Krankenhaus-
Report 2018 - Schwerpunkt Bedarf und Bedarfsgerechtigkeit, pp. 149–170. Schattauer,
Stuttgart (2018)
Impact of Public Transport Development on Health Care Services 411
14. Busse, R., Karagiannidis, C., Augurzky, B., Schmitt, J., Bschor, T.: Der Vorschlag der
Regierungskommission für eine grundlegende Reform der Krankenhausvergütung. In:
Klauber, J., Geraedts, M., Friedrich, J., Wasem, J. (Hrsg.) Krankenhaus-Report 2023 - Schw-
erpunkt - Personal, pp. 267–280. Springer, Heidelberg (2023). https://doi.org/10.1007/978-
3-662-66881-8_17
15. Chen, X., Ouyang, Y., Arian, E., Yang, H., Ba, X.: Modeling and testing autonomous and
shared multimodal mobility services for low-density rural areas. Civil Engineering Studies,
Illinois Center for Transportation, Series No. 22-013 (2022)
16. Christaller, W.: Die zentralen Orte in Süddeutschland. Gustav Fischer, Jena (Neudruck 1968:
Wissenschaftliche Buchgesellschaft, Darmstadt) (1933)
17. Coluccia, D.: The second industrial revolution (late 1800s and early 1900s). In: Zanda,
G. (ed.) Corporate Management in a Knowledge-Based Economy, pp. 52–64. Palgrave
Macmillan, London (2012)
18. Cramer, T.: Strafrechtsdogmatische Auflösung dilemmatischer Situationen beim autonomen
Fahren - Entwicklung innovativer Leitlinien für die Programmierung von Notstandsalgorith-
men in Lebensnotstandssituationen. Nomos, Baden-Baden (2022)
19. Crescenzi, R., Fratesi, U., Monastiriotis, V.: Back to the member states? Cohesion Policy
and the national challenges to the European Union. Reg. Stud. 54(1), 5–9 (2020)
20. Daduna, J.R.: Impacts of deregulation on planning processes and information management
design in public transit. In: Voß, S., Daduna, J.R. (eds.) Computer-aided scheduling of public
transport, pp. 429–441. Springer, Berlin (2001). https://doi.org/10.1007/978-3-642-56423-
9_24
21. Daduna, J.R.: Evolution of public transport in rural areas - new technologies and digitization.
In: Marcus, A., Rosenzweig, E. (eds.) HCII 2020. LNCS, vol. 12202, pp. 82–99. Springer,
Cham (2020). https://doi.org/10.1007/978-3-030-49757-6_6
22. Daskin, M.S.: A maximum expected covering location model - formulation, properties and
heuristic solution. Transp. Sci. 17(1), 48–70 (1983)
23. Danielzyk, R., Priebs, A.: Die Sicherung der Daseinsvorsorge durch Klein-und Mittelstädte
als Beitrag des zentralörtlichen Systems zu gleichwertigen Lebensverhältnissen. Europa
Reg. 26(3), 7–21 (2020)
24. de Buhr, Y., Bunde, H., Großmann, E., et al.: Prävention von Hautkrebs. Onkologe 28, 49–54
(2022)
25. Deutscher Bundestag: Reformbedarf bei Medizinisch-Technischen Assistenten in Kranken-
häusern und Arztpraxen. Drucksache 19/24629 (2020)
26. destatis - Statistisches Bundesamt: Verkehr - Verkehrsunfälle - Fachserie 8 Reihe 7 (2022)
27. Di Caro, P., Fratesi, U.: One policy, different effects - estimating the region-specific impacts
of EU cohesion policy. J. Reg. Sci. 62(1), 307–330 (2022)
28. Drabinski, T.: Nephrologische Versorgung in Deutschland - Bestandsaufnahme, Vergütung
und Datengrundlage. Schmidt & Klaunig, Kiel (2014)
29. Eckhardt, J., Lauhkonen, A., Aapaoja, A.: Impact assessment of rural PPP MaaS pilots. Eur.
Transp. Res. Rev. 12(1), 1–14 (2020)
30. Einig, K., Zaspel-Heisters, B.: Das system Zentraler Orte in Deutschland. In: Greiving,
S., Flex, F. (Hrsg.) Neuaufstellung des Zentrale-Orte-Konzepts in Nordrhein-Westfalen,
pp. 3–19. Akademie für Raumforschung und Landesplanung (ARL) Hannover (2016)
31. Enoch, M.P.: How a rapid modal convergence into a universal automated taxi service could
be the future for local passenger transport. Technol. Anal. Strateg. Manag. 27(8), 910–924
(2015)
32. Enquete-Kommission Zukunft der medizinischen Versorgung in Mecklenburg-
Vorpommern: Abschlussbericht. Stellungnahmen 7. Sitzung. Kommissionsdrucksache
7/46 (2021)
412 J. R. Daduna and D. Philipp
33. Farhan, J., Chen, T.D.: Impact of ridesharing on operational efficiency of shared autonomous
electric vehicle fleet. Transp. Res. C 93, 310–321 (2018)
34. Frank, J., Ryll, A.: Demographischer Wandel, Morbidität und Inanspruchnahme - Zur
Projektion des ambulanten Leistungsvolumens. Gesundheitswesen 77(12), 932–938 (2015)
35. Gail, M.M., Götz, G., Herold, D., et al.: Staatliche Eingriffe zur Erfüllung der Daseinsvor-
sorge - Der öffentliche Personennahverkehr. In: Gail, M.M., Götz, G., Klotz, P.A., et al.
(Hrsg.) Staatliche Eingriffe in die Preisbildung - Darstellung, Erklärung und Wirkungs-
analyse, pp. 49–66. Springer, Wiesbaden (2023). https://doi.org/10.1007/978-3-658-406
74-5_5
36. Geissler, A., Wörz, M., Busse, R.: Deutsche Krankenhauskapazitäten im internationalen
Vergleich. In: Klauber, J., Geraedts, M., Friedrich, J. (Hrsg.) Krankenhaus-Report 2010,
pp. 25–40. Schattauer, Stuttgart (2010)
37. Gill, T.: Ethical dilemmas are really important to potential adopters of autonomous vehicles.
Ethics Inf. Technol. 23(4), 657–673 (2021). https://doi.org/10.1007/s10676-021-09605-y
38. Girdhar, M., Hong, J., Moore, J.: Cybersecurity of autonomous vehicles - a systematic
literature review of adversarial attacks and defense models. IEEE Open J. Veh. Technol.
(2023, in press)
39. Greiving, S., Flex, F., Terfrüchte, T.: Vergleichende Untersuchung der Zentrale-Orte-
Konzepte in den Ländern und Empfehlungen zu ihrer Weiterentwicklung. Raumforsch.
Raumordn. 73(4), 285–297 (2015)
40. Haag-Weber, M.: Peritonealdialyse. Dialyse aktuell 21(06), 290–295 (2017)
41. Häckl, D., Kossack, N., Schoenfelder, T.: Prävalenz, Kosten der Versorgung und Formen des
dialysepflichtigen chronischen Nierenversagens in Deutschland - Vergleich der Dialysever-
sorgung innerhalb und außerhalb stationärer Pflegeeinrichtungen. Das Gesundheitswesen
83(10), 818–828 (2021)
42. Hänsch, R., Hoelzmann, J., Mielke, S.: Flächendeckend Rufbusse? Eine Untersuchung zur
Verbreitung alternativer Mobilitätskonzepte. Der Nahverkehr 37(1+2), 39–43 (2019)
43. Hallewell, M., et al.: Deriving user experience design dimensions for future autonomous
taxi interface design. J. Usability Stud. 17(4), 140–163 (2022)
44. Hansson, S.O., Belin, M.Å., Lundgren, B.: Self-driving vehicles - an ethical overview. Philos.
Technol. 34, 1383–1408 (2021)
45. Hein, G.W.: Status, perspectives and trends of satellite navigation. Satell. Navig. 1(1), 1–12
(2020). https://doi.org/10.1186/s43020-020-00023-x
46. Henning, F.W.: Handbuch der Wirtschafts-und Sozialgeschichte Deutschlands, Band 2,
Deutsche Wirtschafts- und Sozialgeschichte im 19. Jahrhundert. Schöningh, Paderborn
(1996)
47. Holzhüter, H., Bödewadt, J., Bayesteh, S., Aschinger, A., Blume, H.: Technical concepts of
automotive LiDAR sensors - a review. Opt. Eng. 62(3), 031213 (2023)
48. Hou, Z., Zhou, F.: Assessing the performance of precise point positioning (PPP) with the
fully serviceable multi-GNSS constellations. Remote Sens. 15, 807 (2023)
49. Illgen, S., Höck, M.: Establishing car sharing services in rural areas - a simulation-based
fleet operations analysis. Transportation 47(2), 811–826 (2020)
50. Jing, H., Gao, Y., Shahbeigi, S., Dianati, M.: Integrity monitoring of GNSS/INS based
positioning systems for autonomous vehicles - state-of-the-art and open challenges. IEEE
Trans. Intell. Transp. Syst. 23(9), 14166–14187 (2022)
51. Jutzi, T., Krieghoff-Henning, E.I., Brinker, T.J.: Künstliche Intelligenz auf dem Vormarsch
- Hohe Vorhersage-genauigkeit bei der Früherkennung pigmentierter Melanome. Laryngo-
Rhino-Otologie 84–91 (2022)
52. Khan, M.A., Sayed, H.E., Malik, S., et al.: Level-5 autonomous driving - Are we there yet? A
review of research literature. ACM Comput. Surv. (CSUR) 55(2), 1–38 (2022)
Impact of Public Transport Development on Health Care Services 413
53. Khan, S.K., Shiwakoti, N., Stasinopoulos, P., Warren, M.: Modelling cybersecurity regula-
tions for automated vehicles. Accid. Anal. Prev. 186, 107054 (2023)
54. Kiesewetter, H.: Industrielle Revolution in Deutschland - Regionen als Wachstumsmotoren.
Franz Steiner, Stuttgart (2004)
55. Kim, S., Chang, J.J.E., Park, H.H., et al.: Autonomous taxi service design and user
experience. Int. J. Hum.-Comput. Interact. 36(5), 429–448 (2020)
56. Kleemann, S., Arzt, C.: Das Gesetz zum autonomen Fahren in Deutschland - Automatisiertes
Fahren findet seinen Weg im StVG. RAW Recht-Automobil-Wirtschaft 99–195 (2021)
57. Knauff, M.: Verfassungs-, europa-und personenbeförderungsrechtliche Aspekte des Vor-
rangs der Eigenwirtschaftlichkeit. In: Knauff, M. (Hrsg.) Vorrang der Eigenwirtschaft-
lichkeit im ÖPNV, pp. 11–28. Nomos, Baden-Baden (2017)
58. Knierim, L., Schlüter, J.C.: The attitude of potentially less mobile people towards demand
responsive transport in a rural area in central Germany. J. Transp. Geogr. 96, 103202 (2021)
59. Kopper, C.: Der Durchbruch des PKW zum Massenkonsumgut 1950–1964. Jahrbuch für
Wirtschaftsgeschichte 51(1), 19–36 (2010)
60. López-Lambas, M.E., Alonso, A.: The driverless bus - an analysis of public perceptions and
acceptability. Sustainability 11(18), 4986 (2019)
61. Lucifora, C., Grasso, G.M., Perconti, P., Plebe, A.: Moral dilemmas in self-driving cars.
Rivista Internazionale di Filosofia e Psicologia 11(2), 238–250 (2020)
62. Lütge, C., Kriebitz, A., Max, R.: Ethische und rechtliche Herausforderungen des autonomen
Fahrens. In: Mainzer, K. (Hrsg.) Philosophisches Handbuch Künstliche Intelligenz, pp. 1–18.
Springer, Wiesbaden (2019). https://doi.org/10.1007/978-3-658-23715-8_30-1
63. Lutz, L.S.: Haftung am Beispiel automatisierter Fahrzeuge. In: Hartmann, M. (Hrsg.) KI &
Recht kompakt, pp. 117–142. Springer, Berlin (2020). https://doi.org/10.1007/978-3-662-
61700-7_3
64. Manik, D., Molkenthin, N.: Topology dependence of on-demand ride-sharing. Appl. Netw.
Sci. 5, 1–16 (2020)
65. Marek, A.J., Chu, E.Y., Ming, M., Khan, Z., Kovarik, C.L.: Impact of a smartphone appli-
cation on skin self-examination rates in patients who are new to total body photography: a
randomized controlled trial. J. Am. Acad. Dermatol. 79(3), 564–567 (2018)
66. Meyer, M.: Bedarfsorientierte ÖPNV-Bedienung in der Fläche - Erfahrungen mit neuar-
tigen Betriebsformen in der Bundesrepublik Deutschland und in den Niederlanden.
Verkehr+Technik 41(7), 280–289 (1988)
67. Michalik, D., Kohl, P., Krause, M., Kummert, A.: Virtueller Demonstrator zur Unterstützung
der Einführung des autonomen Fahrens am Beispiel automatisierter Shuttlebusse. In: Proff,
H. (Hrsg.) Transforming mobility - What next? - Technische und betriebswirtschaftliche
Aspekte, pp. 399–418. Springer, Wiesbaden (2022). https://doi.org/10.1007/978-3-658-
36430-4_24
68. Milakis, D., van Wee, B.: Implications of vehicle automation for accessibility and social
inclusion of people on low income, people with physical and sensory disabilities, and
older people. In: Antoniou, C., Efthymiou, D., Chaniotakis, E. (eds.) Demand for Emerging
Transportation Systems, pp. 61–73. Elsevier, Amsterdam (2020)
69. Möller, T.B.: Teleradiologie in Deutschland. In: Fischer, F., Krämer, A. (Hrsg.) eHealth in
Deutschland, pp. 295–305. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-
49504-9_15
70. Nakoinz, O.: Zentralität - Theorie, Methoden und Fallbeispiele zur Analyse zentraler Orte.
Edition Topoi, Berlin (2019)
71. Nanchen, B., Ramseyer, R., Grèzes, S., et al.: Perceptions of people with special needs
regarding autonomous vehicles and implication on the design of Mobility as a Service to
foster social inclusion. Front. Hum. Dyn. 3, 751258 (2022)
414 J. R. Daduna and D. Philipp
72. Naveen, M.K., Lokesh, M., Varma, K.N.J.L., Teja, P.S.V.: A review on autonomous vehicles
and its components. J. Pharm. Negative Results 13(7), 6916–6922 (2022)
73. Neubauer, G., Beivers, A.: Zur Situation der stationären Versorgung: Optimierung unter
schwierigen Rahmenbedingungen. In: Klauber, J., Geraedts, M., Friedrich, J. (Hrsg.)
Krankenhaus-Report 2010: Schwerpunkt Krankenhausversorgung in der Krise?, pp. 3–11.
Schattauer, Stuttgart (2010)
74. Ng, M.T., Mahmassani, H.S.: Autonomous minibus service with semi-on-demand routes in
Grid Networks. Transp. Res. Rec. 2677(1), 178–200 (2023)
75. Nobis, C., Kuhnimhof, T.: Mobilität in Deutschland - MiD Ergebnisbericht. (Bundesminister
für Verkehr und digitale Infrastruktur) (FE-Nr. 70904/15) Bonn, Berlin (2018)
76. Notogamiprodjo, M., Remplik, P., Schricke, U.: Radiologische Praxis - Terra incognita.
Radiologe 61, 671–675 (2021)
77. Othman, K.: Public acceptance and perception of autonomous vehicles - a comprehensive
review. AI Ethics 1(3), 355–387 (2021)
78. Pangbourne, K.: Challenge, coordination, and collaboration for effective rural mobility solu-
tions. In: Amaral, A.M., Barreto, L., Baltazar, S., Silva, J.P., Gonçalves, L. (eds.) Implications
of Mobility as a Service (MaaS) in Urban and Rural Environments - Emerging Research and
Opportunities, pp. 83–108. IGI Global, Hershey (2020)
79. Parekh, D., et al.: A review on autonomous vehicles - progress, methods and challenges.
Electronics 11(14), 2162 (2022)
80. Patoliya, J., Mewada, H., Hassaballah, M., Khan, M.A., Kadry, S.: A robust autonomous
navigation and mapping system based on GPS and LiDAR data for unconstraint environment.
Earth Sci. Inf. 15, 2703–2715 (2022)
81. Quinsten, A.S., Bontke, B., Saur, A.: Radiologie im Wandel - Remote Scanning als Lösung
gegen den Fachkräftemangel. Radiopraxis 15(02), 62–71 (2022)
82. Rechel, B., Grundy, E., Robine, J.M., et al.: Ageing in the European Union. Lancet
381(9874), 1312–1322 (2013)
83. Saxinger, A.: Genehmigungsvoraussetzungen und -wirkungen von Linienbedarfsverkehren.
In: Knauff, M. (Hrsg.) Neuerungen im Rechtsrahmen für den ÖPNV, pp. 75–93. Nomos,
Baden-Baden (2022)
84. Schäfer, C.: Werden alle Möglichkeiten genutzt? Dialyse aktuell 26(9), 385 (2022)
85. Schäffner, V.: Wenn Ethik zum Programm wird - Eine risikoethische Analyse moralischer
Dilemmata des autonomen Fahrens. Zeitschrift für Ethik und Moralphilosophie 3(1), 27–49
(2020)
86. Schlüter, J., Bossert, A., Rössy, P., Kersting, M.: Impact assessment of autonomous demand
responsive transport as a link between urban and rural areas. Res. Transp. Bus. Manag. 39,
100613 (2021)
87. Schmidt-Seiwert, V., Binot, R., Kiel, L., et al.: Die Unterschiede bestimmen die Vielfalt in
Europa - Ein Atlas ausgewählter Aspekte der räumlichen Strukturen und Entwicklungen.
Bundesinstitut für Bau-, Stadt- und Raumforschung (BBSR), Bonn (2019)
88. Schreurs, M.A., Steuwer, S.D.: Autonomous driving - political, legal, social, and sustain-
ability dimensions. In: Maurer, M., Gerdes, J.C., Lenz, B., Winner, H. (eds.) Autonomous
Driving - Technical, Legal and Social Aspects, pp. 149–171. Springer, Heidelberg (2016).
https://doi.org/10.1007/978-3-662-45854-9_8
89. Schreyögg, J.: Verbesserung der Personalallokation durch Strukturwandel. In: Klauber, J.,
Wasem, J., Beivers, A., Mostert, C. (Hrsg.) Krankenhaus-Report 2023 - Schwerpunkt Per-
sonal, pp. 197–211. Springer, Heidelberg (2023). https://doi.org/10.1007/978-3-662-66881-
8_13
90. Shrestha, B.P., Millonig, A., Hounsell, N., McDonald, M.: Review of public transport needs
of older people in European context. J. Popul. Ageing 10, 343–361 (2017)
Impact of Public Transport Development on Health Care Services 415
91. Sieber, L., Ruch, C., Hörl, S., Axhausen, K.W., Frazzoli, E.: Improved public transportation
in rural areas with self-driving cars - a study on the operation of Swiss train lines. Transp.
Res. A 134, 35–51 (2020)
92. Siegel, J., Pappas, G.: Morals, ethics, and the technology capabilities and limitations of
automated and self-driving vehicles. AI Soc. 1–14 (2021)
93. Simmler, M.: Maschinenethik und strafrechtliche Verantwortlichkeit. In: Bendel, O. (Hrsg.)
Handbuch Maschinenethik, pp. 453–469. Springer, Wiesbaden (2019). https://doi.org/10.
1007/978-3-658-17483-5_30
94. Sörensen, L., Bossert, A., Jokinen, J.P., Schlüter, J.: How much flexibility does rural public
transport need? - Implications from a fully flexible DRT system. Transp. Policy 100, 5–20
(2021)
95. Spindler, G.: Haftung für autonome Systeme. In: Arroyo Vendrell, T., Gsell, B., Kindl, J.
(Hrsg.) Die Schadensersatzhaftung, pp. 139–184. Nomos, Baden-Baden (2022)
96. Stearns, P.N.: The Industrial Revolution in World History, 4th edn. Westview Press, Boulder,
Co. (2020)
97. Südbeck, T.: Motorisierung, Verkehrsentwicklung und Verkehrspolitik in der Bundesrepublik
Deutschland der 1950er Jahre. (Steiner) Stuttgart (1994)
98. Tansel, B.C., Francis, R.L., Lowe, T.J.: State of the art—location on networks: a survey. Part
I: The p-center and p-median problems. Manag. Sci. 29(4), 482–497 (1983). https://doi.org/
10.1287/mnsc.29.4.482
99. Taylor, G.R.: The Transportation Revolution 1815–1860. Rinehart, New York (1951)
100. Thao, V.T., Imhof, S., von Arx, W.: Demand responsive transport - new insights from peri-
urban experiences. Travel Behav. Soc. 31, 141–150 (2023)
101. Tschandl, P., Codella, N., Akay, B.N., et al.: Comparison of the accuracy of human readers
versus machine-learning algorithms for pigmented skin lesion classification - an open, web-
based, international, diagnostic study. Lancet Oncol. 20(7), 938–947 (2019)
102. van Suntum, U.: Verkehrspolitik. (Vahlen) München (1986)
103. Voigt, F.: Verkehr - Die Entwicklung des Verkehrssystems (Band 2, Teil 1). Duncker &
Humblot, Berlin (1965)
104. Wachinger, L.: Mittel in Kooperationen und Verbünden. In: Knauff, M. (Hrsg.) Rechtsan-
wendung und Finanzierung im ÖPNV, pp. 193–204. Nomos, Baden-Baden (2020)
105. Wieringa, F.: Die Zukunft der Nierenersatztherapie. Nephrologie aktuell 27(1), 26–30 (2023)
106. Ye, H., Kim, H.: Locating healthcare facilities using a network-based covering location
problem. GeoJournal 81, 875–890 (2016)
107. Ye, L., Yamamoto, T.: Evaluating the impact of connected and autonomous vehicles on
traffic safety. Physica A 526, 121009 (2019)
108. Zhou, J.: A review of LiDAR sensor technologies for perception in automated driving. Acad.
J. Sci. Technol. 3(3), 255–261 (2022)
Ridesharing in Rural Areas
with Autonomous Electric Vehicles
and Interrelated Trips
Abstract. Passenger cars are responsible for 44% of the greenhouse gas
emissions within Europe’s transportation sector. Especially rural areas
are still fundamentally dependent on private motor vehicles, which is why
in particular the younger and older population in such regions is limited
in their mobility. The purpose of this paper is to determine the poten-
tial of shared, electric, autonomous vehicles to close the aforementioned
mobility gap in an eco-friendly manner. It provides recommendations for
the implementation of the associated transport services in terms of vehi-
cle battery size and charging station infrastructure. For this purpose,
we present a model for the electric autonomous dial-a-ride problem with
interrelated trips (e-ADARP-IT), which is solved by a commercial solver
and through a Variable Neighborhood Search heuristic. Computational
experiments reveal the superior performance of the heuristic. It is shown
that large shares of requested trips can be served efficiently at low opera-
tional cost. Results further indicate that expanding vehicle battery capac-
ity seems more relevant than expanding the number of charging stations,
even though both developments clearly have to go hand in hand.
1 Introduction
The transport sector is responsible for more than a quarter of the EU’s green-
house gas emissions. 44% of its share is generated by passenger vehicles and,
consequently, has a crucial impact on achieving the 1.5◦ C target of the Paris
Climate Agreement [13]. The combination of the three innovations of electrifi-
cation, autonomous driving, and shared mobility has the disruptive potential to
significantly change the personal mobility sector [44]. Together they form the
potential for shared autonomous electric vehicles (SAEVs). With the help of
SAEVs, greenhouse gas emissions per kilometer can be reduced by over 90% and
mobility cost reach 0.16–0.26 e/km, where both these performance measures lie
significantly below those of private vehicles with combustion engines [4].
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 416–434, 2023.
https://doi.org/10.1007/978-3-031-43612-3_26
Ridesharing with Autonomous Electric Vehicles and Interrelated Trips 417
The route planning problem associated with SAEV services is the dial-a-ride
problem (DARP), a combinatorial optimization problem. The DARP consists of
customer requests for transportation from given origin locations to given desti-
nation locations, which have to be scheduled on a finite number of vehicles. The
objective could be to maximize a service quality measure or to minimize trans-
portation costs [5]. SAEV services as considered in this paper combine multiple
DARP-variants within one problem: The dial-a-ride problem with interrelated
trips (DARP-IT), which synchronizes customer requests that are related to each
other by means of a common arrival time of multiple customers or a round trip
with a desired stay time at a location [27], and the electric autonomous dial-a-ride
problem (e-ADARP) that deals with electric and autonomous vehicles including
battery management and recharging [5]. This article combines these two vari-
ants, resulting in the electric autonomous dial-a-ride problem with interrelated
trips (e-ADARP-IT).
Most studies on DARPs for autonomous and electric vehicles focused on
urban areas [4,5,30]. In contrast to this, [27] considered the DARP-IT for rural
areas, where interrelated trips are of particular importance as customers also
demand guaranteed return trips, synchronized arrivals for joint meetings, or
other complex service patterns. The paper at hand also focuses on rural areas
and interrelated trips. Mobility in such areas is characterized by relatively long
transport distances and a very limited availability of public transport options.
This effects that the population so far relies on privately owned vehicles, which
heavily restricts those people that do not own or cannot operate a car on their
own, such as young or elderly people, see [27]. We, therefore, explore the potential
of SAEV in rural areas by combining the approaches of [27] for the ADARP-
IT and of [5] for the e-ADARP within a holistic model and solution approach.
The advances over [27] lie in the integration of battery management into the
dial-a-ride problem with interrelated trips (ADARP-IT) and thus bridges the
gap between these two research directions in the domain of dial-a-ride prob-
lems. A heuristic solution is adopted from [27] and compared to the solutions
derived from solving the optimization model directly. In our experiments, we
then focus on a rural area in the federal state of Schleswig-Holstein, Germany.
The associated test instances vary in terms of the number of customer requests,
the vehicles being available and their battery capacity, as well as the number
of charging stations. The influence of the number of charging stations and the
size of the battery capacity is then analyzed w.r.t. the service quality for the
interrelated trips in this rural area setting.
Therefore, the implementation of an autonomous ridesharing system in rural
areas can provide several advantages, for instance by offering mobility services
to people that cannot operate a vehicle themselves, by avoiding operational
limitations such as personnel costs and service time constraints of conventional
mobility services, and by allowing for more efficient vehicle sharing as well as
a provision of services during off-peak hours. As rural areas generally lack an
offer of public transport options, such a system can close the gap between the
demand for mobility and the supply of mobility services.
418 M. Soth et al.
2 Literature Review
SAEV services combine features of electrification, autonomous driving, and
shared mobility, where autonomous vehicles offer economic and service time
advantages due to not having a driver [20]. This typically allows for uninter-
rupted service provision and greater flexibility in the deployment [5]. Thereby
’shared mobility’ refers to the on-demand use of vehicles by multiple customers
in contrast to privately owned vehicles that are exclusively used by their own-
ers. Shared mobility can take various forms such as ridesharing, carsharing, or
bike-sharing, and allows for cost savings, reduced greenhouse gas emissions, and
less reliance on vehicle ownership [43].
The route planning problem of shared autonomous vehicles (SAVs) is treated
within the DARP. The combinatorial optimization problem minimizes the trans-
portation cost and/or maximizes the service quality [5].
[36] classify the literature on SAV research based on booking type and sharing
system. The booking type divides into reservation-based and on-demand-based
services. In the latter, the customer can book his/her trip in real-time, and those
requests have to be integrated into the ongoing vehicle service process, see e.g.
[25]. The use of priority rules has been established for this purpose by [15].
In contrast, in reservation-based services, customer requests are known before
the vehicle routes are to be determined. This means that the DARP is solved
using the data is available up to that time, which typically leads to a static
perspective on this optimization problem, see [24] and [5]. Due to the longer
distances and longer relocation times of vehicles in rural areas, reservation-based
services appear suitable for mobility systems in such regions [27].
The mobility systems for SAVs can be divided into ridesharing, carshar-
ing, and mixed systems based on the way vehicles are shared in the operations
[36]. In carsharing, customers typically use a car one after the other, such that
requests are serviced in a sequential manner. In ridesharing, several customers
that demand transportation from/to similar locations might jointly use the same
vehicle. In this context, [9] investigate how to identify suitable meeting points
to which customers can walk over short distance from their actual origin loca-
tion or desired destination location. They incorporate this feature into a DARP
in order to reduce detouring of vehicles while offering attractive service routes
to customers. In [29], the authors focus on the dial-a-ride problem in an urban
road network with ride-sharing by autonomous taxis under consideration of traf-
fic congestion. They propose a non-linear integer programming model embedded
within a rolling horizon framework to optimize the routing of the taxis while
Ridesharing with Autonomous Electric Vehicles and Interrelated Trips 419
In DARPs, customers place requests for being transported from a specified start-
ing point (pick-up location) to a specified destination point (drop-off location)
[8]. In classical variants of the DARP, each customer is assumed to place one
such request, without and interrelations among these requests existing, see [7].
We call such requests, conventional requests.
In our work, we also consider more complex trip request types, where two
or more trips are interrelated with each other. Such complex customer requests
were introduced by [27] and are distinguished into requests with serial relations
and requests with parallel relations. Serial relations are characterized by an
outbound and a return trip, combined with a stay time at the destination of the
outbound trip. The drop-off location of the first trip and the pick-up location of
the second trip are, thus, identical. Such trips play a role if a customer demands
a guaranteed return trip, which is particularly relevant in urban area settings.
The challenge in route planning is to schedule the drop-off of the first trip and
the pick-up of the second trip such that the desired stay time is guaranteed.
The second type of complex requests are those with parallel relations, which can
be further subdivided into parallel drop relations and parallel pick relations. In
this paper, only parallel drop relations are considered for reasons of brevity. A
characteristic of such a parallel request is that two or more trips share the same
drop-off location, for example as the involved customers want to meet there. The
complexity of parallel drop requests results from the need to have customers
arrive at the common drop-off location at approximately or exactly the same
time. As is described by [27], these mentioned customer request types can be
flexibly combined in a modular way, resulting in even more complex mobility
request types.
Ridesharing with Autonomous Electric Vehicles and Interrelated Trips 421
Table 1. Notation
Sets
K = {1, ..., m} Set of vehicles
O = {o1 , ..., om } Set of origin locations of vehicles
R = {1, ..., n} Set of trips
P = {p1 , ..., pn } Set of pick-up locations
D = {d1 , ..., dn } Set of drop-off locations
CS = {cs1 , ..., csl } Set of charging station locations
V = O∪P ∪D∪CS Set of all locations
Si ⊂ R Set of trips that serially follow trip i
Gi ⊂ R Set of trips with parallel drop-off relation with trip i
Parameters
P Ck Passenger capacity of vehicle k ∈ K
bk Energy storage capacity of vehicle k s battery
ok Origin location of vehicle k
pi Pick-up location of trip i ∈ R
di Drop-off location of trip i
qi Number of passengers boarding or disembarking at location i ∈ V
si Service time for passenger boarding and disembarking at location i
ei Earliest starting time for the service at location i
li Latest starting time for the service at location i
ūi Maximum travel time for trip i
δimin , δimax Minimum / maximum stay time at drop-off location of serial related trip i
ωimax Maximum waiting time for parallel related services of trip i
ti,j Vehicle travel time from location i to location j
ci,j Vehicle travel cost from location i to location j
βi,j Vehicle battery consumption for traveling from location i to j
αcs Charging rate at charging station cs ∈ CS (in kWh per minute)
γ Desired battery charging level
r Objective weight for maximizing the number of fulfilled trips
Decision variables
yi 1 if trip i ∈ R is served, 0 otherwise
xki,j 1 if vehicle k drives directly from i ∈ V to j ∈ V , 0 otherwise
zik 1 if vehicle k ends its tour at location i, 0 otherwise
Tik Arrival time of vehicle k at location i
Lki Number of passengers on vehicle k after leaving location i
Bik Battery charge level of vehicle k when arriving at location i
k
Ccs Charging time of vehicle k at charging station cs
3.2 Notation
We use the following notation for modelling the e-ADARP-IT (see also summary
in Table 1): The mobility service provider has a fleet of vehicles K = {1, ..., m},
where vehicle k ∈ K has a passenger capacity P C k , a battery capacity bk ,
and starts its operations from the origin location ok . These locations form set
422 M. Soth et al.
k
the charge level of vehicle k’s battery when arriving at location i, variable Ccs
denotes the charging time of vehicle k at charging station cs ∈ CS.
xkok ,j = 1 k ∈ K (2)
j∈V
xuok ,j = 0 k ∈ K (3)
j∈V u∈K
u=k
xkj,i − xki,j = 0 k ∈ K, i ∈ P (4)
j∈V j∈V
j=i j=i
xkj,i − xki,j − zik = 0 k ∈ K, i ∈ D ∪ CS (5)
j∈V j∈V
j=i j=i
zik = 1 k ∈ K (6)
i∈D∪{ok }
xkpi ,j − xkj,di = 0 k ∈ K, i ∈ R (7)
j∈V j∈V
j=pi j=di
xkpi ,j ≥ yi i∈R (8)
j∈V k∈K
0 ≤ Bik ≤ bk k ∈ K, i ∈ V (22)
Bokk = bk k∈K (23)
k ∈ K, i ∈ V \CS,
Bjk ≤ Bik − βi,j + bk (1 − xki,j ) (24)
j ∈ V \{ok }, i = j
k ∈ K, i ∈ V \CS,
Bjk ≥ Bik − βi,j − bk (1 − xki,j ) (25)
j ∈ V \{ok }, i = j
k ∈ K, j ∈ V,
Bjk ≤ Bcs
k
+ αcs · Ccs
k
− βcs,j + bk (1 − xkcs,j ) (26)
cs ∈ CS, j = cs
k ∈ K, j ∈ V,
Bjk ≥ Bcs
k
+ αcs · Ccs
k
− βcs,j − bk (1 − xkcs,j ) (27)
cs ∈ CS, j = cs
k
Ccs ≥ 0 k ∈ K, cs ∈ CS (28)
bk · γ ≥ k
Bcs + αcs · k
Ccs k ∈ K, cs ∈ CS (29)
k ∈ K, cs ∈ CS,
Tik ≤ Tcs
k
+ tcs,i + Ccs
k
+ Mcs,i
k
(1 − xkcs,i ) i ∈ D∪P ∪CS ∪{ok }, (30)
i = cs
k ∈ K, cs ∈ CS,
Tik ≥ Tcs
k
+ tcs,i + Ccs
k
− Mcs,i
k
(1 − xkcs,i ) i ∈ D∪P ∪CS ∪{ok }, (31)
i = cs
xki,j , zik ∈ {0, 1} k ∈ K, i, j ∈ V (32)
yi ∈ {0, 1} i∈R (33)
According to Constraints (2) and (3), each vehicle starts its tour at its own
origin location ok . Constraints (4) ensure the vehicle flow at pick-up locations.
Constraints (5) define this for drop-off locations and charging stations. This
constraint provides flexibility for the continuation of a route, as a vehicle k
may continue its tour to some other location j (xkij = 1) or end its tour at the
current location i (zik = 1). Constraints (6) then state that each vehicle route
has a defined end location. This location can also be the origin location ok if
the vehicle did not move at all (zokk = 1). Constraints (7) ensure that both the
pickup and the drop-off of a request are performed by one and the same vehicle.
Constraints (8) ensure that a request is considered served, only if the pickup
location has actually been visited by a vehicle. Constraints (9) avoid that this
requirement is fulfilled via trivial sub-cycles.
Ridesharing with Autonomous Electric Vehicles and Interrelated Trips 425
Constraints (10) to (13) address the time components of the solution. Con-
straints (10) propagate the vehicle arrival time from one visited location to the
next, where the value Mi,j is set to max(0, li + si + tij − ei ) as in [5]. Con-
straints (11) denote that a drop-off location di must be visited after the asso-
ciated pick-up location pi . Constraints (12) ensure the preset time windows for
visiting the locations. With i = ok and a corresponding value for eok , the time
at which vehicle k becomes available for the service system can be considered
in this constraint, too. The maximum travel duration of a trip is bounded by
Constraints (13).
Capacity restrictions (14) and (15) derive the number of customers Lkj
onboard vehicle k after leaving a visited location j. Constraints (16) respect
the vehicle capacity and Constraints (17) ensure that all vehicles are empty at
their origin locations and while charging.
From Constraints (18), complex requests are always served completely or not
at all, but never partially. More precisely, all interrelated trips j ∈ Si ∪ Gi are
served if trip i is served. Constraints (19) and (20) focus on serially related trips.
They ensure the minimum and maximum customer stay times between drop-off
and subsequent pick-up, where Md i ,pj = max(ldi +δimin −epj , lpj −edi −δimax , 0),
see [27]. Note that the trips in a serial request can be served by different vehicles
k and u (k = u). For example, vehicle k can drop off the customer at drop-
off location di and another vehicle u can later collect the customer at pick-up
location pj for the subsequent trip. Constraints (21) focus on the second type
of complex requests, the parallel related trips. The restriction synchronizes the
arrival times of parallel related trips i and j at the drop-off locations, where
Md i ,dj = max(ldj − edi − wimax , 0). For this, trip j ∈ Gi at the drop-off location
dj = di must not occur later than wimax time units after trip i. By setting wimax
and wjmax , flexibility or strictness can be reflected for these trips. For example, if
wimax = wjmax = 0, the customers must arrive at the drop-off location at exactly
the same time.
Constraints (22) to (31) are for battery management. According to Con-
straints (22), the battery level of a vehicle k must not be negative nor greater
than the battery capacity bk at any location i. Constraints (23) define that the
battery of vehicle k is fully charged at the origin location ok . The depletion of the
battery level when going from some location i (which is not a charging station)
to a subsequent location j is computed by Constraints (24) and (25). Constraints
(26) and (27) calculate the battery status of a vehicle that just visited a charging
station cs before going to a subsequent location j. These constraints take care of
the charged quantity αcs · Ccs k
. Charging times Ccs k
are non-negative according to
Constraints (28). Additionally, through Constraint (29), charging of a vehicle can
be restricted to not exceed a target level γ. This helps, for example, to proactively
avoid increasing charging times when approaching the maximum battery level.
Furthermore, the charging times of a vehicle at a station cs are included in the
propagation of the arrival time at the subsequently visited location i through
426 M. Soth et al.
4 Computational Experiments
Section 4.1 describes the used test instances. Section 4.2 emphasizes a comparison
of the solution approaches being available as well as the effects of battery capacity
and charging station infrastructure on the solution quality.
maximum waiting time wimax for each other is randomly selected from the time
interval U [1, 10] minutes. For trips with serial relations, it is assumed that each
customer request consists of two trips, one outbound and one return trip, where
the drop-off location of the outbound trip is identical to the pick-up location of
the return trip. The trips are associated with a stay time at the destination. The
least stay time δimin is drawn from the distribution U [20, 60]. The maximum
stay time δimax is by 10 min greater than δimin . The maximum travel time ui
for each trip i is 150% of the direct travel time. The examined planning period
of each test instance comprises 12 h. It is assumed that customer requests are
evenly distributed over the planning horizon.
We assume a homogeneous vehicle fleet. Each vehicle has a capacity of PCk =
6 passengers, which is in line with ridesharing provider MOIA [42]. The vehicles’
origin locations are randomly selected from the set of relevant locations within
the considered area. The number of customers boarding or alighting at a location
is qi = 1/-1 for all requests. Customers can be picked up and dropped off within
negligible time (si = 0). Each vehicle has a battery capacity of bk = 40 kWh and
is fully charged at the beginning of the planning horizon. The vehicles constantly
consume 0.206 kWh per kilometer driven. All vehicles generate a travel cost of
0.1 monetary units per kilometer. The objective function primarily maximizes
the number of served trips and secondarily minimizes the total cost. To ensure
this, we set an objective weight of r = 50. Note that complex requests are only
accepted if all involved trips can be served.
All charging stations support fast charging, meaning that a vehicle can be
charged up to 80% within 40 min. This percentage value is also used for the target
charge level γ in Constraint (29). During the charging process, the vehicles are
supplied with a charging quantity of αcs = 0.8 kWh per minute at all stations.
The locations of the charging stations are determined using a grid heuristic
[34]. Figure 2 illustrates the distribution of charging stations in the studied area
for a total of 8 stations.
Using the above configurations, we employ eight sets of test instances for our
experiment, each involving four individual instances. Table 2 provides an overview
of these instance sets, illustrating that the number of trips per instance ranges
428 M. Soth et al.
from 10 to 50, with 2 to 5 vehicles, and 4 to 16 charging stations. The requests are
a mix of conventional and complex requests. The abbreviations ’CLE’ and ’MOI’
that are added to the name of the last four instance sets refer to two mobility
service providers operating in Germany, from which particular battery capacity
values of vehicles have been taken as is described in the next subsection.
The VNS heuristic has been implemented in Python 3.7 and all computational
experiments were performed on a computer with a 3.2 GHz CPU and 32 GB mem-
ory. For the exact solution of the optimization model, the MIP solver CPLEX 12.9
was applied with a maximum runtime per test instance of ten hours (36,000 s).
The GAP s in Table 3 show that the CPLEX solver is hardly able to find opti-
mal solutions. Actually, it only solves two out of the instances with 10 requests
and one with 25 requests to optimality. Furthermore, even for the integer feasible
solutions obtained for the other instances, the acceptance rate drops significantly
the larger these instances are. The VNS algorithm produces comparable results
for the very small instances but within negligible computation time. For the
instances with 25 and more requests, it clearly outperforms CPLEX with much
larger acceptance rates that increase by up to 47% points. At the same time, the
operational costs per served trip are much lower (see column ΔCo), indicating
that VNS not just manages to serve many more requests than CPLEX but also
in a way more efficient fashion and within lower CP U times. Based on the VNS
430 M. Soth et al.
results, SAEVs can serve a large portion of the diverse mobility demands in rural
areas, with an average acceptance rate of 86% over all considered test instances.
We next address the question of whether SAEV providers should prioritize
a high-density charging infrastructure or vehicles with large battery capacities.
Two strategies are developed for investigating this. The first strategy uses vehi-
cles with a battery capacity of 40 kWh and a larger number of charging stations
(12 and 16). The battery capacity is derived from the vehicles used by the ride-
pooling provider CleverShuttle, which is why the test instances are marked with
the abbreviation ‘CLE’ (MIX50V5CS12CLE and MIX50V5CS16CLE) [6]. For
the setting with 12 charging stations, these instances are identical to the ones
reported in the last row of Table 3. In contrast, the second strategy uses vehi-
cles with a larger battery capacity of 86 kWh and fewer charging stations (4
and 8). The battery capacity is similar to those of vehicles used by ridesharing
provider MOIA [42]. The corresponding test instances are, thus, identified by
abbreviation ‘MOI’ (MIX50V5CS4MOI and MIX50V5CS8MOI).
Table 4 reveals the effects of battery capacity and charging infrastructure on
solution quality. The CPLEX solver cannot reach optimality within the given
runtime for any solution of these test instances. Its results all have low to mod-
erate acceptance rates and very high GAP s. Best results with an acceptance
rate of 59.0% and the only GAP below 100% are obtained for test instances
MIX50V5CS4MOI, clearly indicating that the larger the number of charging sta-
tions gets, the more CPLEX struggles. If we consider the test instances in ascend-
ing order of the number of charging stations, the negative impact of this number
on the performance of CPLEX becomes very clear. The average acceptance rate
of 59.0% for MIX50V5CS4MOI strictly decreases to 47.5% (MIX50V5CS8MOI)
to 37.0% (MIX50V5CS12CLE) and to 20.5% (MIX50V5CS16CLE). At the same
time, the GAP s increase strictly from 85.9% up to 604.5%.
In contrast, the acceptance rates of the VNS solutions are above 80% for
all test instances, even reaching 98% for instance set MIX50V5CS4MOI. At the
same time, the cost per served trip is up to 42.8% below the cost of the CPLEX
solutions, confirming the superior performance of VNS under all experimental
settings investigated here. However, VNS also exhibits high computation times,
especially for the CLE-instances with many charging stations. Based on the
results of the VNS, it can be seen that the test instances of the vehicles with a
higher battery capacity (MOI) provide results with better solution quality com-
pared to the CLE-instances. The average acceptance rate of the MOI-instances
is about 15% higher than for the CLE-instances. For the CLE-instances with the
lower battery capacities, more charging processes (about 40) take place in the
solutions (not shown in the table). For the MOI-instances MIX50V5CS4MOI
and MIX50V5CS8MOI, just eight and two charging processes are required,
respectively. Furthermore, for both strategies the number of charging processes
decreases as the number of charging stations increases. From this, the average
cost per trip served decreases for both strategies as the number of charging
stations increases, which reveals the relevance of dense charging infrastructure.
In summary, it can be stated that the number of charging stations has no
substantial impact on the acceptance rate but on the cost per served trip. The
battery capacity, on the other hand, has a clear positive influence on the accep-
tance rate and, thus, a significant impact on the quality of the solutions. This
finding is consistent with [30], who identified battery range as a critical compo-
nent of SAEV service systems. Hence, based on the computational experiments,
a strategy focusing on large battery capacity seems more advantageous with
regards to the service rate of mobility requests, whereas additional charging
stations can help reducing the operational cost of the system.
5 Conclusions
This paper has presented the electric autonomous dial-a-ride problem with inter-
related trips (e-ADARP-IT). The associated mixed integer linear optimization
model includes, in addition to the properties of the classical dial-a-ride prob-
lem, synchronization constraints on the interrelated trips and constraints on the
battery charging management of electric, autonomous vehicles. A variable neigh-
borhood search (VNS) algorithm has been applied for the efficient solution of
larger instances of the e-ADARP-IT. For the computational experiments, test
data from a rural area in the German federal state of Schleswig-Holstein was
used. The results show that small test instances with ten requests can to some
extent be solved optimally using CPLEX. With increasing problem size, the
solution quality obtained by CPLEX decreases drastically, such that the share
of served trips drops to 37% for instances with 50 requests. The limitations of
the performance of CPLEX increase with a larger number of charging stations.
In contrast, the VNS heuristic performs consistently good over all instance sets.
It finds solutions that efficiently serve up to 99% of the trip requests at low
operational cost. The experiments furthermore show that the battery capacity
of vehicles is a crucial issue in SAEV service systems. Larger batteries allow to
serve a much higher share of trip requests. Clearly, sufficient charging infrastruc-
ture needs to be available, too. Nevertheless, increasing the number of charging
stations beyond that point merely reduced the operational service cost in our
experiments but did not result in larger acceptance rates for the trip requests.
Altogether, this paper contributes to identifying the potential of electric
autonomous vehicles for closing mobility gaps in rural areas. It also opens up
432 M. Soth et al.
opportunities for further research. The VNS has proven to be suitable for solving
the e-ADARP-IT but it might be too slow for usage in large service systems.
There are also further aspects at the operational level that require more research.
As an example, if charging stations are occupied by other vehicles, it needs to
be decided whether the SAEVs accept a waiting time or are rerouted to another
charging station. A further area of future research could be to combine the ride
sharing approach with public transport services. Here, an integration of school
transport would be possible as well as an offer as feeder for rail commuter lines.
References
1. Arslan, O., Yildiz, B., Karaşan, O.E.: Minimum cost path problem for plug-in
hybrid electric vehicles. Transp. Res. Part E Logist. Transp. Rev. 80(1), 123–141
(2015)
2. Baldacci, R., Battarra, M., Vigo, D.: Valid inequalities for the fleet size and mix
vehicle routing problem with fixed costs. Netw. Int. J. 54(4), 178–189 (2009)
3. Barco, J., Guerra, A., Muñoz, L., Quijano, N.: Optimal routing and scheduling of
charge for electric vehicles: a case study. Math. Probl. Eng. 2017, 1–6 (2017)
4. Bauer, G.S., Greenblatt, J.B., Gerke, B.F.: Cost, energy, and environmental impact
of automated electric taxi fleets in Manhattan. Environ. Sci. Technol. 52(8), 4920–
4928 (2018)
5. Bongiovanni, C., Kaspi, M., Geroliminis, N.: The electric autonomous dial-a-ride
problem. Transp. Res. Part B Methodol. 122, 436–456 (2019)
6. Clevershuttle: CleverShuttle fährt in Dresden On-Demand mit MOBIshuttle.
https://www.clevershuttle.de/presse/clevershuttle-dresden-mobishuttle. Accessed
04 Apr 2023
7. Cordeau, J.F.: A branch-and-cut algorithm for the dial-a-ride problem. Oper. Res.
54(3), 573–586 (2006)
8. Cordeau, J.F., Laporte, G.: The dial-a-ride problem (DARP): variants, modeling
issues and algorithms. 4OR - Q. J. Belg. Fr. Ital. Oper. Res. Soc. 1(2), 89–101
(2003)
9. Cortenbach, E.: The dial-a-ride problem with meeting points: a problem formula-
tion for shared demand responsive transit. Master’s thesis, University of Twente
(2023). http://essay.utwente.nl/94545/. Accessed 25 June 2023
10. Czioska, P., Kutadinata, R., Trifunović, A., Winter, S., Sester, M., Friedrich, B.:
Real-world meeting points for shared demand-responsive transportation systems.
Public Transp. 11, 341–377 (2019)
11. Desaulniers, G., Errico, F., Irnich, S., Schneider, M.: Exact algorithms for electric
vehicle-routing problems with time windows. Oper. Res. 64(6), 1388–1405 (2016)
12. Doppstadt, C., Koberstein, A., Vigo, D.: The hybrid electric vehicle - traveling
salesman problem. Eur. J. Oper. Res. 253(3), 825–842 (2016)
13. Earl, T.: Roadmap to decarbonising european cars (2018). https://www.
transportenvironment.org/sites/te/files/publications/2050_strategy_cars_
FINAL.pdf. Accessed 27 Mar 2023
Ridesharing with Autonomous Electric Vehicles and Interrelated Trips 433
14. Emadi, A.: Transportation 2.0. IEEE Power Energy Mag. 9(4), 18–29 (2011)
15. Fagnant, D.J., Kockelman, K.M.: Dynamic ride-sharing and fleet sizing for a system
of shared autonomous vehicles in Austin, Texas. Transportation 45(1), 143–158
(2018)
16. Ángel Felipe, Ortuño, M.T., Righini, G., Tirado, G.: A heuristic approach for the
green vehicle routing problem with multiple technologies and partial recharges.
Transp. Res. Part E Logist. Transp. Rev. 71, 111–128 (2014)
17. Genikomsakis, K., Mitrentsis, G.: A computationally efficient simulation model for
estimating energy consumption of electric vehicles in the context of route planning
applications. Transp. Res. Part D Transp. Environ. 50, 98–118 (2017)
18. Geofabrik GmbH: Openstreetmap data extracts. https://download.geofabrik.de/.
Accessed 04 Apr 2023
19. Goeke, D., Schneider, M.: Routing a mixed fleet of electric and conventional vehi-
cles. Eur. J. Oper. Res. 245(1), 81–99 (2015)
20. Greenblatt, J.B., Shaleen, S.: Automated vehicles, on-demand mobility, and envi-
ronmental impacts. Current Sustain. Renew. Energy Rep. 2(3), 74–81 (2015)
21. Hansen, P., Mladenović, N.: Variable neighborhood search. In: Glover, F., Kochen-
berger, G.A. (eds.) Handbook of Metaheuristics, pp. 145–184. Springer, US,
Boston, MA (2003). https://doi.org/10.1007/0-306-48056-5_6
22. Hiermann, G., Puchinger, J., Ropke, S., Hartl, R.F.: The electric fleet size and mix
vehicle routing problem with time windows and recharging stations. Eur. J. Oper.
Res. 252(3), 995–1018 (2016)
23. Häll, C.H., Lundgren, J.T., Voß, S.: Evaluating the performance of a dial-a-ride
service using simulation. Public Transp. 7, 139–157 (2015)
24. Hyland, M.F., Mahmassani, H.S.: Taxonomy of shared autonomous vehicle fleet
management problems to inform future transportation mobility. Transp. Res. Rec.
J. Transp. Res. Board 2653(1), 26–34 (2017)
25. Hyland, M.F., Mahmassani, H.S.: Dynamic autonomous vehicle fleet operations:
optimization-based strategies to assign AVs to immediate traveler demand requests.
Transp. Res. Part C Emerg. Technol. 92, 278–297 (2018)
26. Jäger, B., Agua, F.M.M., Lienkamp, M.: Agent-based simulation of a shared,
autonomous and electric on-demand mobility solution. In: 2017 IEEE 20th Inter-
national Conference on Intelligent Transportation Systems (ITSC), pp. 250–255
(2017)
27. Johnsen, L.C., Meisel, F.: Interrelated trips in the rural dial-a-ride problem with
autonomous vehicles. Eur. J. Oper. Res. 303(1), 201–219 (2022)
28. Keskin, M., Çatay, B.: Partial recharge strategies for the electric vehicle routing
problem with time windows. Transp. Res. Part C Emerg. Technol. 65, 111–127
(2016)
29. Liang, X., de Almeida Correia, G.H., An, K., van Arem, B.: Automated taxis’
dial-a-ride problem with ride-sharing considering congestion-based dynamic travel
times. Transp. Res. Part C Emerg. Technol. 112, 260–281 (2020)
30. Loeb, B., Kockelman, K.M., Liu, L.: Shared autonomous electric vehicle (SAEV)
operations across the Austin, Texas network with charging infrastructure decisions.
Transp. Res. Part C Emerg. Technol. 89(4), 222–233 (2018)
31. Ma, T.Y.: Two-stage battery recharge scheduling and vehicle-charger assignment
policy for dynamic electric dial-a-ride services. PLoS ONE 16(5), 1–27 (2021)
32. Mankowska, D.S., Meisel, F., Bierwirth, C.: The home health care routing and
scheduling problem with interdependent services. Health Care Manag. Sci. 17(1),
15–30 (2014)
434 M. Soth et al.
33. Masmoudi, M.A., Hosny, M., Demir, E., Genikomsakis, K.N., Cheikhrouhou, N.:
The dial-a-ride problem with electric vehicles and battery swapping stations.
Transp. Res. Part E Logist. Transp. Rev. 118, 392–420 (2018)
34. Mattfeld, D.C., Vahrenkamp, R.: Logistiknetzwerke: Modelle für Standortwahl und
Tourenplanung, 2nd edn. Springer Gabler, Wiesbaden (2014). https://doi.org/10.
1007/978-3-8349-6912-5
35. Montoya, A., Guéret, C., Mendoza, J.E., Villegas, J.G.: The electric vehicle routing
problem with nonlinear charging functions. Transp. Res. Part B Methodol. 103,
87–110 (2017)
36. Narayanan, S., Chaniotakis, E., Antoniou, C.: Shared autonomous vehicle services:
a comprehensive review. Transp. Res. Part C Emerg. Technol. 111, 255–293 (2020)
37. Nord, S.: Kreise und Städte in Schleswig-Holstein im Vergleich: Bevölkerung 2018
(2019). https://www.statistik-nord.de/fileadmin/Dokumente/NORD.regional/
Schleswig-Holstein.regional/Band_1_-_Bevoelkerung/SH_regional_Band_1_
2018.pdf. Accessed 04 Apr 2023
38. Open Source Routing Machine: OSRM (2020). https://project-osrm.org/.
Accessed 04 Apr 2023
39. Pelletier, S., Jabali, O., Laporte, G.: Charge sheduling for electric freight vehicles.
Transp. Res. Part B Methodol. 115, 246–269 (2018)
40. Pelletier, S., Jabali, O., Laporte, G., Veneroni, M.: Battery degradation and
behavour for electric vehicles: review and numerical analyses of several models.
Transp. Res. Part B Methodol. 103, 158–187 (2017)
41. Schneider, M., Stenger, A., Goeke, D.: The electric vehicle-routing problem with
time windows and recharging stations. Transp. Sci. 48(4), 500–520 (2014)
42. Schwarzer, C.M.: Volkswagen-Ridesharing: Moin, MOIA! MOIA startet im
April den größten elektrischen Ridesharing-Dienst Europas (2019). https://www.
electrive.net/2019/03/14/volkswagen-ridesharing-moin-moia/. Accessed 04 Apr
2023
43. Shaheen, S., Chan, N.: Mobility and the sharing economy: potential to facilitate
the first- and last-mile public transit connections. Built Environ. 42(4), 573–588
(2016)
44. Sprei, F.: Disrupting mobility. Energy Res. Soc. Sci. 37, 238–242 (2018)
Combinatorial Optimization
Operational Integration of Supply Chain
Activities with Earliness and Tardiness
Considerations
1 Introduction
Today, the management of supply chain operations is a trending topic for many businesses
where competition is of utmost importance. Thus, it is inevitable to provide effective
coordination between the production and distribution phases of the supply chain. Fulfill-
ing customer demands with the required quantity on time becomes an extremely difficult
task, especially for companies which apply the zero-inventory model and make-to-order
businesses. The integrated approach is also beneficial for time management of time-
sensitive or perishable products. Considering all these aspects, operational integration
for the two main stages is carried out by making decisions simultaneously for production
scheduling and vehicle routing. According to the integrated approach, as the output of
either does not affect the other, the decisions on production and distribution stages will
not limit each other.
Recently, there has been a remarkable growth in the online shopping sector [2]. Due
to the abundance of competitors in the market and the ease of communication between
customers and companies, firms are under increasing pressure to ensure customer sat-
isfaction. For this reason, they have turned to customer-oriented performance measures
rather than classical measures such as cost and profit. According to the current appli-
cations, customers know when they can receive the orders and even specify their own
preferences regarding delivery time, which is often handled by time windows in the
vehicle routing literature.
A current survey, which covers the enlarged literature on integrated production
scheduling and outbound vehicle routing problems, can be found in [4]. In the survey,
the literature is classified in terms of the number of operations, the number of vehicles,
and the number of trips. According to this classification, the problem under study can
be represented in a section of parallel machines, several vehicles, and single trips, as in
the studies of [3, 5, 6, 11, 12, 18].
Although there are several criteria (i.e., objectives) identified by researchers, in gen-
eral, objectives can be divided into two main groups: cost and service. In addition,
sometimes multiple criteria can be handled simultaneously as a summation of objectives
in a single form [5, 9, 21, 22] or multi-objective structure [7, 14, 23]. While many criteria
that are related to the problem under study have been defined in the literature [4], the
objective of this study can be expressed as minimizing the sum of total earliness and
tardiness. Ullrich [18] considers a variant of the integrated problem with the objective of
minimizing the total tardiness and proposes a genetic algorithm to produce optimal or
near optimal solutions in a reasonable amount of time. Wu et al. [20] formulate a math-
ematical model for a production scheduling-based routing problem with time windows
and setup times for minimizing total tardiness. Hou et al. [10] address an integrated
problem where the production stage has multiple factories, and each factory has multi-
ple machines based on the flow shop environment. For the objective of minimizing total
weighted earliness and tardiness, they propose a swarm intelligence approach which is
called brainstorm optimization algorithm.
In the integrated problem under investigation, the distribution phase has a homoge-
neous fleet that can only be utilized once, while the production environment consists
of identical parallel machines and each job has a single operation that needs to be per-
formed on any machines. It is very important for customer satisfaction to deliver orders
according to the predefined time windows. In addition, vehicles that arrive earlier must
wait idle until the lower time bound of any customer. So, the objective is to minimize
the sum of total earliness and tardiness. After formulating the problem, we propose a
metaheuristic approach, Iterated Local Search (ILS), to obtain good quality solutions in
a reasonable time.
The contribution of the paper can be summarized as follows: (i) The sum of the
total earliness and tardiness is studied simultaneously for the first time for the integrated
problem at hand. (ii) A new mixed integer programming formulation is developed. (iii)
A metaheuristic algorithm, Iterated Local Search (ILS), is presented, and a perturbation
mechanism is proposed for the only production environment adapted for the integrated
problem.
The remainder of the study is organized as follows: In Sect. 2, we define the MIP
formulation for the integrated problem with an illustrative example. While in Sect. 3,
the proposed ILS structure is given, in Sect. 4, computational experiments are discussed
Operational Integration of Supply Chain 439
based on the different problem parameters. Finally, in Sect. 5, conclusions and future
directions are discussed.
2 Problem Definition
This section is dedicated to two subsections including the MIP model and an illustrative
example for visual presentation.
The assumptions of the problem can be summarized as follows: There is a single pro-
duction facility in which a set of identical parallel machines is used to produce customer
orders. After the production of a particular batch is completed, a limited number of
vehicles is delivering the consolidated orders to associated customers by considering the
time window of each customer. Vehicles can be used only once (i.e., multiple use is not
allowed). Vehicle capacities are homogeneous, and the capacity of a vehicle is enough to
serve all customers. As a result, there is no requirement to use all vehicles in the system.
The sequence of production and distribution does not have to be identical. The objective
is to minimize the sum of total earliness and tardiness penalties. The notations used in
the MIP formulation are shown in Table 1.
Based on the formal definition and notations, we formulate an MIP model for the
problem as follows:
minimize Ej + Tj (1)
j∈NC
Subject to;
Wjm ≤ 1 ∀m ∈ M (2)
j∈NC
Zji + Lj = 1 ∀j ∈ NC (3)
i∈NC
Wjm + Zij = 1 ∀j ∈ NC (4)
m∈M i∈NC
Cj ≥ pj − H 1 − Wjm ∀j ∈ NC ; m ∈ M (5)
Cj ≤ pj + H 1 − Wjm ∀j ∈ NC ; m ∈ M (6)
Ci − Cj + HZij + H − pi − pj Zji ≤ H − pj ∀i, j ∈ NC ; i = j (7)
X0jk ≤ 1 ∀k ∈ K (8)
j∈NC
440 E. Yağmur and S. E. Kesen
Xijk = 1 ∀j ∈ NC (9)
k∈K i∈N
Operational Integration of Supply Chain 441
Xujk = Xjik ∀j ∈ N ; ∀k ∈ K (10)
u∈N i∈N
Fuj − Fji = dj ∀j ∈ NC (11)
u∈N i∈N
dj Xijk ≤ Fij ∀i, j ∈ N ; i = j (12)
k∈K
Q − di + Xijk ≥ Fij ∀i, j ∈ N ; i = j (13)
k∈K
⎛ ⎞
Rjk = ⎝ Xjik ⎠ + Xj0k ∀j ∈ NC ; ∀k ∈ K (14)
i∈NC
Ci − Cj ≤ H 1 − Gij ∀i, j ∈ NC ; i = j (15)
Vk ≥ Ci − H 1 − Xijk ∀i, j ∈ N ; i = j; ∀k ∈ K (17)
Vk ≤ Cj + H (1 − (Rjk − AX 3jik ∀j ∈ NC ; ∀k ∈ K
i∈N C
(18)
+ Gij + Gji − 1 ))
i∈NC
Vk − Yj + s0 + t0j ≤ H 1 − X0jk ∀j ∈ NC ; ∀k ∈ K (19)
Yj − Vk − s0 − t0j ≤ H 1 − X0jk ∀j ∈ NC ; ∀k ∈ K (20)
Yi∗ − Yj + si + tij ≤ H 1 − Xijk ∀i, j ∈ NC ; i = j (21)
k∈K
Yj − Yi∗ − si − tij ≤ H 1 − Xijk ∀i, j ∈ NC ; i = j (22)
k∈K
Tj ≥ Yj∗ − bj ∀j ∈ NC (23)
Ej ≥ aj − Yj ∀j ∈ NC (24)
Yj − aj ≤ H 1 − Oj ∀j ∈ NC (25)
aj − Yj ≤ HOj ∀j ∈ NC (26)
442 E. Yağmur and S. E. Kesen
Yj∗ = Yj − AX 4j + aj Oj ∀j ∈ NC (27)
The objective function minimizing the sum of total earliness and tardiness is given in
(1). It is determined whether a job is the successor of any job on the same machine or the
first/last job to be processed on any machine by Cons. (2)–(4). While Cons. (5) and (6)
are used to determine the production completion time of the first jobs on each machine,
Cons. (7) defines the production completion time for predecessor and successor jobs
on each machine. Cons. (8) identifies the first customer on any tour. Cons. (9) and (10)
guarantee that each customer must be visited exactly once, and the number of entering
and leaving arcs must be the same for any node, respectively. Cons. (11)–(13) are based on
a single-commodity flow formulation described by [8]. This set of constraints guarantee
a connected tour. So, we define Fij as a continuous variable indicating the amount of load
flowing between nodes i and j. In order to assure subtour elimination in the formulation,
we impose tight bounds on the flow variables Fij in addition to a set of flow conservation
constraints. Cons. (11) and (12) indicate that the total amount of load on any vehicle
decreases as it delivers the order to the related customer. According to Cons (13), the total
amount of load on any vehicle should not exceed its capacity. While Cons. (14) defines
the variable of Rjk , , Cons. (15) and (16) define the variable of Gij . . According to Cons.
(17) and (18), each vehicle must wait in production facility until the production process
of the batch assigned completes. Cons. (19) and (20) specify the arrival time to the first
customer on any tour. Cons. (21) and (22) state the arrival time to the two consecutive
customers on any tour. Cons. (23) and (24) calculate tardiness and earliness, respectively,
if either of them occurs. Cons. (25) and (26) define the variable of Oj . . According to
Cons. (27), the service starting time of any customer is equal to the maximum value of
the lower bound of the time windows and arrival time for that customer.
As can be seen, the developed model is not linear because it consists of the product
of two variables. So, while we use the Cons. (28)–(30) for AX 1ijk , AX 2ij and AX 3ijk
where both x and y are binary variables, Cons. (31)–(33) are used for the linearization
of AX 4j where x is a binary variable and y is a non-negative continuous variable, and M
is a sufficiently large number.
z≤x (28)
z≤y (29)
z ≥x+y−1 (30)
z ≤ xM (31)
z≤y (32)
z ≥ y − (1 − x)M (33)
Operational Integration of Supply Chain 443
Based on the formal definition given in the previous section, we present an illustrative
problem for visualizing the integrated approach. In the example, whose parameters are
listed in Table 2, there are nine customers, four identical parallel machines, and two
homogenous vehicles with enough capacity to serve all customers. Table 1 contains the
descriptions of parameters listed in the first row of Table 2.
j pj sj aj bj tij 0 1 2 3 4 5 6 7 8 9
0 – 19 – – 0 0 3 13 5 18 10 12 22 18 10
1 39 1 69 75 1 3 0 13 7 19 12 10 23 16 9
2 20 9 55 61 2 13 13 0 14 30 19 22 11 19 5
3 91 5 128 138 3 5 7 14 0 18 7 15 21 23 12
4 52 2 199 201 4 18 19 30 18 0 14 13 38 31 27
5 85 14 114 135 5 10 12 19 7 14 0 16 26 27 18
6 13 9 108 126 6 12 10 22 15 13 16 0 33 18 18
7 18 9 61 106 7 22 23 11 21 38 26 33 0 29 16
8 2 15 54 81 8 18 16 19 23 31 27 18 29 0 15
9 71 6 132 139 9 10 9 5 12 27 18 18 16 15 0
A feasible solution for the given example is illustrated in Fig. 1. According to Fig. 1,
jobs 7 and 5 are processed on the first machine. Jobs assigned to other machines are
depicted similarly. While customers 2, 7, 8, and 6 are served by the first vehicle, the
second vehicle delivers orders 1, 9, 3, 5, and 4, respectively. Shipping boxes show the
service time spent on each customer. Vehicle one (v1) departs from the depot when the
production process of the first batch completes at time 22, then goes directly from the
depot to customer 2 at time 54. Based on the time window of customer 2, the vehicle must
wait 1 min until the lower bound of the time window (i.e., 55). This process proceeds
in the same way for other customers. Based on the following feasible solution, earliness
occurs for customers 2 and 4, while tardiness occurs for customers 1, 3, 5, 6, and 8, as
seen from Fig. 1. So, the objective value of the problem is equal to the 156 which is a
summation of 6 earliness and 150 tardiness.
444 E. Yağmur and S. E. Kesen
m1 7 5
m2 1 4
m3 8 2 9
m4 6 3
v1 0 2 7 8 6
v2 0 1 9 3 5 4
Fig. 1. A Gantt chart of a feasible solution (also optimal solution) for the illustrated example.
The idea of Iterated Local Search (ILS) as an extension of the local search procedure
was first introduced in [13]. Basically, ILS is formed by incorporating a perturbation
mechanism into the local search process to avoid getting stuck in local optima. ILS has
been used for many combinatorial optimization problems in the literature [15––17, 19].
The pseudo code of the proposed ILS is given in Fig. 2.
In the perturbation process, we use the regeneration operator proposed by [1]. In
the selection process, parents are selected as the current local optimal solution and the
neighbor solution with the best objective. The encoding scheme developed by Afralizad
and Rezaeian [1] is also adapted to the integrated problem. According to the adapted
chromosome, while the first and third rows represent the permutation of the jobs and
customers, respectively, the second and fourth rows demonstrate the assignments of
machines and vehicles for each job and customer in the corresponding position. The
encoding scheme for the illustrated problem is depicted in Fig. 3. In the local search
process, we use swap, insert, and 2-opt operators for both production and distribution
phases so as to consider inter and intra machines and vehicles.
Operational Integration of Supply Chain 445
Job order: 7 5 1 4 8 2 9 6 3
Machine assignment: 1 1 2 2 3 3 3 4 4
Customer order: 2 7 8 6 1 9 3 5 4
Vehicle assignment: 1 1 1 1 2 2 2 2 2
4 Computational Results
seconds for CPLEX and ILS are indicated by the CPU column. The deviation of objec-
−ObjILS
tive values from each other (DEV%) is calculated by ObjCPLEX
ObjCPLEX ×100 if the solution
−ObjCPLEX
found by CPLEX is worse than ILS; otherwise, it is calculated by ObjILSObj ILS
× 100.
We summarize the following remarks from Table 3: CPLEX can find all optimal
solutions for all instances of six and seven customers. As the number of customers
increases, the ability of CPLEX to find optimal solutions declines dramatically. For ten
customers, CPLEX can only find seven optimal solutions out of 81 instances. Like the
first remark, the CPU time also increases drastically when the number of customers
increases from 5 to 10 and reaches to about 10,000 s for the level of 10 customers.
In addition, when the effect of the number of machines and vehicles are examined, we
reach the following results: As the number of machines increases, CPLEX’s ability to find
optimal solutions increases. This is a result of the reduction of the tardiness component in
the objective function due to the simultaneous use of resources in the production phase.
Similarly, as the number of vehicles increases, the average objective value decreases.
However, the number of vehicles has not as much an effect as the number of machines
in terms of the number of optimal solutions found and CPU times. As can be seen from
Table 3, CPLEX is able to find the optimal solution in 331 instances out of 486 instances
within 3 h. When the same instances are solved with ILS, it is seen to provide optimal
solutions for 329 instances out of 331 instances whose optimality are proven by CPLEX.
Especially in instances where the number of customers is set to 9 and 10, the solutions
obtained by ILS are of higher quality, and the percent deviation of CPLEX from ILS
(DEV%) reaches to 77%. As a result, ILS has shown superior performance in terms of
both solution quality and solution time.
Table 3. The comparison of CPLEX and ILS based on the number of customers, machines, and
vehicles.
Operational Integration of Supply Chain 447
References
1. Afzalirad, M., Rezaeian, J.: Resource-constrained unrelated parallel machine scheduling
problem with sequence dependent setup times, precedence constraints and machine eligi-
bility restrictions. Comput. Ind. Eng. 98, 40–52 (2016). https://doi.org/10.1016/j.cie.2016.
05.020
2. Alzoubi, H., Alshurideh, M., Kurdi, B., Alhyasat, K., Ghazal, T.: The effect of e-payment
and online shopping on sales growth: evidence from banking industry. Int. J. Data Netw. Sci.
6(4), 1369–1380 (2022). https://doi.org/10.5267/j.ijdns.2022.5.014
3. Belo-Filho, M., Amorim, P., Almada-Lobo, B.: An adaptive large neighbourhood search for
the operational integrated production and distribution problem of perishable products. Int. J.
Prod. Res. 53(20), 6040–6058 (2015). https://doi.org/10.1080/00207543.2015.1010744
4. Berghman, L., Kergosien, Y., Billaut, J.-C.: A review on integrated scheduling and outbound
vehicle routing problems. Eur. J. Oper. Res. (2023). https://doi.org/10.1016/j.ejor.2022.12.036
448 E. Yağmur and S. E. Kesen
5. Chang, Y.-C., Li, V.C., Chiang, C.-J.: An ant colony optimization heuristic for an integrated
production and distribution scheduling problem. Eng. Optim. 46(4), 503–520 (2014). https://
doi.org/10.1080/0305215X.2013.786062
6. Farahani, P., Grunow, M., Günther, H.-O.: Integrated production and distribution planning for
perishable food products. Flex. Serv. Manuf. J. 24, 28–51 (2012). https://doi.org/10.1007/s10
696-011-9125-0
7. Ganji, M., Kazemipoor, H., Molana, S.M.H., Sajadi, S.M.: A green multi-objective integrated
scheduling of production and distribution with heterogeneous fleet vehicle routing and time
windows. J. Clean. Prod. 259, 120824 (2020). https://doi.org/10.1016/j.jclepro.2020.120824
8. Gavish, B., Graves, S.C.: The travelling salesman problem and related problems (1978). http://
hdl.handle.net/1721.1/5363
9. He, P., Li, K., Kumar, P.R.: An enhanced branch-and-price algorithm for the integrated pro-
duction and transportation scheduling problem. Int. J. Prod. Res. 60(6), 1874–1889 (2022).
https://doi.org/10.1080/00207543.2021.1876941
10. Hou, Y., Fu, Y., Gao, K., Zhang, H., Sadollah, A.: Modelling and optimization of integrated
distributed flow shop scheduling and distribution problems with time windows. Expert Syst.
Appl. 187, 115827 (2022). https://doi.org/10.1016/j.eswa.2021.115827
11. Kesen, S.E., Bektaş, T.: Integrated production scheduling and distribution planning with
time windows. Lean Green Supply Chain Manage: Optim. Models Algorithms, 231–252
(2019).https://doi.org/10.1007/978-3-319-97511-5_8
12. Liu, H., Guo, Z., Zhang, Z.: A hybrid multi-level optimisation framework for integrated
production scheduling and vehicle routing with flexible departure time. Int. J. Prod. Res.
59(21), 6615–6632 (2021). https://doi.org/10.1080/00207543.2020.1821927
13. Lourenço, H.R., Martin, O.C., Stützle, T., Iterated local search. In: Handbook of Metaheuris-
tics, pp. 320–353. Springer, Cham (2003). https://doi.org/10.1007/0-306-48056-5_11
14. Mohammadi, S., Al-e-Hashem, S.M., Rekik, Y.: An integrated production scheduling and
delivery route planning with multi-purpose machines: a case study from a furniture manufac-
turing company. Int. J. Prod. Econ. 219, 347–359 (2020). https://doi.org/10.1016/j.ijpe.2019.
05.017
15. Penna, P.H.V., Subramanian, A., Ochi, L.S.: An iterated local search heuristic for the hetero-
geneous fleet vehicle routing problem. J. Heuristics 19(2), 201–232 (2013). https://doi.org/
10.1007/s10732-011-9186-y
16. Stützle, T.: Applying iterated local search to the permutation flow shop problem. Technical
Report AIDA-98–04, FG Intellektik, TU Darmstadt (1998)
17. Stützle, T.: Iterated local search for the quadratic assignment problem. Eur. J. Oper. Res.
174(3), 1519–1539 (2006). https://doi.org/10.1016/j.ejor.2005.01.066
18. Ullrich, C.A.: Integrated machine scheduling and vehicle routing with time windows. Eur. J.
Oper. Res. 227(1), 152–165 (2013). https://doi.org/10.1016/j.ejor.2012.11.049
19. Vansteenwegen, P., Souffriau, W., Berghe, G.V., Van Oudheusden, D.: Iterated local search for
the team orienteering problem with time windows. Comput. Oper. Res. 36(12), 3281–3290
(2009). https://doi.org/10.1016/j.cor.2009.03.008
20. Wu, G.-H., Cheng, C.-Y., Pourhejazy, P., Fang, B.-L.: Variable neighborhood-based cuckoo
search for production routing with time window and setup times. Appl. Soft. Comput. 125,
109191 (2022). https://doi.org/10.1016/j.asoc.2022.109191
21. Yağmur, E., Kesen, S.E.: A memetic algorithm for joint production and distribution scheduling
with due dates. Comput. Ind. Eng. 142, 106342 (2020). https://doi.org/10.1016/j.cie.2020.
106342
Operational Integration of Supply Chain 449
22. Yağmur, E., Kesen, S.E.: Multi-trip heterogeneous vehicle routing problem coordinated with
production scheduling: memetic algorithm and simulated annealing approaches. Comput. Ind.
Eng. 161, 107649 (2021). https://doi.org/10.1016/j.cie.2021.107649
23. Yağmur, E., Kesen, S.E.: Bi-objective coordinated production and transportation scheduling
problem with sustainability: formulation and solution approaches. Int. J. Prod. Res. 61(3),
774–795 (2023). https://doi.org/10.1080/00207543.2021.2017054
Constrained Multi-agent Path Planning
Problem
1 Introduction
The importance of cooperative path planning for vehicle fleets has been demon-
strated by various applications such as logistics and delivery services [19,26],
ride-sharing services [13,25], and mission planning for autonomous/unmanned
vehicles [4,28]. In general, a multi-agent planner is required for cooperative path
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 450–466, 2023.
https://doi.org/10.1007/978-3-031-43612-3_28
Constrained Multi-agent Path Planning Problem 451
planning of vehicle fleets. This planning problem can be defined as finding a set
of paths that allows a fleet of agents to reach a specified number of targets in the
minimum amount of time. This problem is similar to the classical Multiple Trav-
eling Salesmen Problem (MTSP) [1], and its variants such as the Multi-Agent
Path Planning (MAPP) problem [16]. The classical MTSP is generally defined
as follows. Given n cities (targets) and m salesmen (agents), the aim is to find
m tours (closed paths) starting and ending at a depot (initial position of the
agents) such that each target is visited only once and the total cost of visiting
all targets is minimized. The cost metric can be expressed in terms of distance,
time, etc., [1]. The MAPP problem is similar to the classical MTSP with two
differences:
– Subtours (open paths) are considered in the MAPP problem, so each agent
starts and ends its path at two distinct points.
– Agents can have different initial positions.
Solving the MTSP and the MAPP problem is difficult because of their complex
combinatorial character (NP-hardness). In general, two types of approaches are
used to tackle the MTSP [12]: exact and heuristic-based approaches.
The exact approaches are based on either the transformation of the MTSP to
an equivalent Traveling Salesman Problem (TSP), or relaxing some constraints
of the problem [1,12]. The problem is solved by applying exact methods such
as Branch-and-Bound [6], Cutting Planes [20], and Integer Linear Programming
452 A. Maktabifard et al.
Table 1. Literature instances on the MTSP having similarities to the CMAPP problem
considered in our study
Formulations [17]. The exact approaches are restricted to the MTSPs with rea-
sonable sizes (Euclidean and Non-Euclidean problems up to 100 and 500 cities
(targets), respectively [6]) because their performance is highly dependent on the
size of the problem. Accordingly, the solution runtime encounters an exponen-
tial rise with increasing the problem size [1,12]. Additionally, transforming the
MTSP to an equivalent TSP might result in an even more difficult problem to
solve, especially using the exact approaches [6,20].
The heuristic-based approaches solve the problem by applying approximate
heuristic methods such as Ant Colony Optimization (ACO) [8,12,15], Genetic
Algorithm (GA) [12,15,29,30], Simulated Annealing [24], Neural Networks [23],
and Tabu Search [22]. The heuristic-based approaches can achieve near-optimal
solutions in a reasonable amount of time even for larger problems [12].
The problems tackled in several previous papers on the MTSP have some
similarities to the CMAPP problem considered in our study (see Table 1). These
similarities include considering multiple depots for the MTSP, minimizing the
length (cost) of the longest tour (Min-Max MTSP), and applying additional
constraints.
There are only a few studies on the variants of the MAPP problem. In [9], a
GA was introduced to solve a so-called Subtour problem which is similar to the
MAPP problem with one difference: there is only one agent. This method was
developed for solving the MAPP problem in [10], and a similar method was pro-
posed in [16] to address the MAPP problem. In [27], a heuristic approach based
on a graph simplification method was presented to solve a multi-depot open
Constrained Multi-agent Path Planning Problem 453
tours MTSP which is basically the MAPP problem with Min-Sum optimization
objective.
Concluding the literature review, the MTSP is a well-studied problem, while
there are only a few studies on the MAPP problem. Furthermore, previous stud-
ies have not solved the MAPP problem with our additional constraints.
In this section, the required foundation from Graph theory, and the notations
of the classical MTSP and the MAPP problem are presented. In addition, our
additional constraints for the MAPP problem are defined.
According to Graph theory, a graph is an ordered pair G = (V, E) consisting
of: V = {v1 , ..., vm }, a set of m vertices (nodes), and E = {(vi , vj ) | vi , vj ∈
V, i = j}, a set of edges (links) connecting vertices vi and vj . This type of
graph may be precisely referred to as simple graph which means multiple edges
connecting the same two vertices are not allowed. In the context of the classical
MTSP and the MAPP problem, the targets and the points where the agents
begin their journey, are considered as vertices. In this study, only undirected
graphs are taken into consideration. In an undirected graph, edges are comprised
of unordered pairs of vertices where (vi , vj ) = (vj , vi ). Moreover, if in a graph
all vertices of V are connected to each other, the graph is a complete graph and
it is designated by Km (V ) where m is the number of vertices constituting the
vertex set V .
A path/cycle is a sequence of edges connecting a sequence of vertices. If the
sequence of vertices is composed of distinct vertices, a simple path/cycle is given
with the exception that for simple cycle the starting and ending vertices are
repeated. Here we only consider simple paths/cycles, so henceforth paths/cycles
simply refer to simple paths/cycles. P = (V1 , E1 ) is a path in G = (V, E) if:
V1 = {v1 , ..., vk } ⊂ V , and E1 = {(v1 , v2 ), (v2 , v3 ), ..., (vk−1 , vk )} ⊂ E. This
means a path consisting of k vertices is a sequence of k − 1 edges connecting
these vertices in which each two consecutive edges share a vertex in common.
Likewise, C = (V2 , E2 ) is a cycle in G = (V, E) if: V2 = {v1 , ..., vk } ⊂ V , and
E2 = {(v1 , v2 ), ..., (vk−1 , vk ), (vk , v1 )} ⊂ E. In other words, a cycle comprising
k vertices is a sequence of k edges connecting these vertices in which each two
consecutive edges as well as the first and the last edge share a vertex in common.
Hence, a cycle starts and ends at the same vertex (tour), while a path starts and
ends at different vertices (subtour).
The number of edges in a path or cycle is called the length of that path or
cycle. For G = (V, E), the set of all paths and cycles with length k are designated
by Pk (G) and Ck (G), respectively. A weight (cost) w(vi , vj ) can be assigned to
an edge. If every edge of a graph has a weight, the graph is called a weighted
graph. In a weighted graph if w(vi , vj ) = w(vj , vi ) is valid for every edge, the
weighted graph is called symmetric. In this study, for simplicity, the Euclidean
distance between two vertices of each edge is assigned as the weight of that edge
454 A. Maktabifard et al.
w(vi , vj ) = w(vj , vi ) = |r(vi ) − r(vj )|; note that the problems discussed here
are not restricted to Euclidean distance, so any desired measure (e.g., rectilinear
distance, actual driving distance) can be assigned as the weight of edges. For
a path P ∈ Pk (G), the sum of its edges’ weights is called the total cost of the
path:
k
c(P ) = w(vi , vi+1 ) (1)
i=1
If there is no associated weight to each edge, the total cost of a path or cycle is
simply equal to the length of the path or cycle.
The classical MTSP and the MAPP problem can be formulated on the basis
of the foundation from Graph theory described above. Consider A = {a1 , ..., am }
and T = {t1 , ..., tn } as the set of m agents and n targets, respectively. The agents
and targets are located in Euclidean space. The location of the ith agent and the
j th target is therefore determined by r(ai ) and r(tj ), respectively. The classical
MTSP is formulated as follows. Consider a as the depot for all agents, so ∀ ai = a
and ∀ r(ai ) = r(a). The configuration space of the problem is the complete graph
Kn+1 (V ) with the vertex set V = T ∪ a. Consider Ci as a cycle with length ki
that starts and ends at vertex a (the depot). Let C = {C1 , ..., Cm } be the set
of m cycles Ci with length ki . The aim is to determine C such that each target
is visited only once (visitation only by one agent), and the total cost of C (Eq.
(3)) is minimized (Min-Sum problem).
m
c(C) = c(Ci ) (3)
i=1
The total cost of C is the sum of the lengths (costs) of all m cycles Ci constituting
C (see Eq. (3)).
Likewise, the MAPP problem is formulated as follows. For each agent, the
configuration space of the problem is the complete graph Kn+1 (Vi ) with the
vertex set Vi = T ∪ ai . The Euclidean distance between two vertices of each edge
(e.g., vx and vy ) is assigned as the weight of that edge; w(vx , vy ) = w(vy , vx ) =
|r(vx ) − r(vy )| where vx , vy ∈ Vi . As a result, Kn+1 (Vi ) is a symmetric weighted
graph. Consider Pi as a path with length ki starting at vertex ai . Let P =
{P1 , ..., Pm } be the set of m paths Pi with length ki that each pair of them
do not have any vertex in common (except possible same position for starting
points). The aim is to determine P such that each target is visited only once
(visitation only by one agent), and the cost of the path with the largest cost in
P (Eq. (4)) is minimized.
m
cm (P) = max c(Pi ) (4)
i=1
Constrained Multi-agent Path Planning Problem 455
In this study, a two-step solution based on the method developed in [16] is pre-
sented for the CMAPP problem. The solution method is composed of an initial
solution and a complex solution. The complex solution improves the result of
the initial solution using a GA that applies genetic-like operators and a heuristic
456 A. Maktabifard et al.
method. A solution is a set of paths that is called Plan (P) in this context. The
Plan is optimal if the longest path in the Plan is the shortest possible (Min-Max
problem). Thus, the solution algorithm is as follows:
The initial solution generates the Initial Plan (a starting set of paths). Consider
A = {a1 , ..., am } and T = {t1 , ..., tn } as the set of m agents and n targets,
respectively. A Plan is viable if each pair of paths do not have any vertex in
common (except possible same starting points). For the order of planning, we
assume that the targets are assigned to the agents in the following order: a1 →
a2 → · · · → am . The targets assignment algorithm is as follows:
A viable Plan P = {P1 , ..., Pm } is generated by iterating this algorithm until all
targets would be assigned to the agents.
However, a target selection approach is also required for selecting a target in
every step of the targets assignment algorithm. Therefore, different initialization
methods can be employed by adopting various target selection approaches. In
this study, two initialization methods were employed:
The solution for the non-constrained MAPP problem was also considered in
order to enable us to evaluate the effect of applying our two additional con-
straints that were applied independently of each other. Hence, six different solu-
tion modes were analyzed in this study (see Table 2).
Constrained Multi-agent Path Planning Problem 457
1. Crossover operator:
It is stochastically applied with the probability of Pcrossover . It selects two
paths stochastically either by the best-worst selection with the probability of
Pbest−worst or by random selection with the probability of 1 − Pbest−worst ,
then each of them is randomly cleaved (cleaving position can be different for
each of them) into two parts and the parts are swapped.
2. Mutation operator:
It is stochastically applied with the probability of Pmutation . It randomly
selects two paths and swaps two randomly selected targets between them.
3. Migration operator:
It is stochastically applied with the probability of Pmigration . It randomly
selects two paths Pi and Pj , then it removes a randomly selected target from
458 A. Maktabifard et al.
Pboost = 0.3. Simulations were run for 150000 evolutionary iterations for each
test case by means of MATLAB R2021b. The hardware configuration of the PC
used was as follows. CPU: AMD RyzenTM 5-5500U, RAM: 8 GB.
The computational results are shown in Table 4. Accordingly, if the maximum
number of targets for each agent was restricted:
– in the case of employing Greedy initialization, the average final cost (except
in scenario 1) and the runtime decreased, while the average cost reduction
(except in scenario 1) increased compared to the non-constrained solution
with Greedy initialization.
– in the case of employing Random initialization, the runtime (except in sce-
nario 1) and the average cost reduction decreased, whereas the average final
cost increased compared to the non-constrained solution with Random ini-
tialization.
– in the case of employing Greedy initialization, the average cost reduction and
the average final cost decreased compared to the non-constrained solution
with Greedy initialization. A general trend was not observed regarding the
runtime.
– in the case of employing Random initialization, the average final cost in the
scenarios with more targets (200) and the average cost reduction decreased,
while the average final cost in the scenarios with fewer targets (100) increased
compared to the non-constrained solution with Random initialization. A gen-
eral trend was not observed regarding the runtime.
by solid lines and the longest path is highlighted in all illustrated Plans; and
dashed-line circles depict the maximum range of the agents. Figure 1 shows the
Plans obtained for a test case with 9 agents and 100 targets. In this test case the
solution achieved the largest Δcm (see Eq. (6)) among 100 test cases of scenario
2 in the case of applying Greedy initialization along with the constraint on the
maximum number of targets for each agent. In the absence of any additional
constraints, the cost cm (P) (see Eq. (4)) decreased from 3.16 in the Initial Plan
(Fig. 1(a)) to 1.24 in the Final Plan (Fig. 1(b)). If the maximum number of
targets for each agent was restricted to qi = 12, the cost cm (P) reduced from 3.16
in the Initial Plan (Fig. 1(c)) to 1.17 in the Final Plan (Fig. 1(d)). Lastly, if the
maximum range for each agent was limited to ri = 0.38, the cost cm (P) dropped
from 1.59 in the Initial Plan (Fig. 1(e)) to 1.17 in the Final Plan (Fig. 1(f)).
Furthermore, the Plans generated for a test case with 10 agents and 200 targets
Constrained Multi-agent Path Planning Problem 461
Fig. 1. Initial Plans (left) vs. Final Plans (right) generated by solution modes 1, 2 and
3; scenario 2; the test case in which the solution achieved the largest Δcm in the case
of applying Greedy initialization along with the constraint on the maximum number
of targets for each agent. (Color figure online)
462 A. Maktabifard et al.
Fig. 2. Initial Plans (left) vs. Final Plans (right) generated by solution modes 1, 2 and
3; scenario 4; the test case in which the solution achieved the largest Δcm in the case
of applying Greedy initialization along with the constraint on the maximum range for
each agent. (Color figure online)
Constrained Multi-agent Path Planning Problem 463
are illustrated in Fig. 2. In this test case the solution attained the highest Δcm
among 100 test cases of scenario 4 if Greedy initialization was applied together
with the constraint on the maximum range for each agent. In the absence of
any additional constraints, the cost cm (P) reduced from 2.86 in the Initial Plan
(Fig. 2(a)) to 1.74 in the Final Plan (Fig. 2(b)). If the maximum number of
targets for each agent was limited to qi = 20, the cost cm (P) dropped from 2.86
in the Initial Plan (Fig. 2(c)) to 1.95 in the Final Plan (Fig. 2(d)). Finally, if
the maximum range for each agent was restricted to ri = 0.42, the cost cm (P)
decreased from 2.60 in the Initial Plan (Fig. 2(e)) to 1.52 in the Final Plan
(Fig. 2(f)).
Comparing our developed method with previously proposed methods using
a similar GA for the MAPP problem, our method is novel because despite the
similarity in utilizing the mutation operator and the 2-OPT algorithm in [10],
their initialization phase (initial solution) and crossover operator function dif-
ferently. Moreover, our additional constraints were not applied in [16] and their
initialization phase uses a different targets assignment algorithm. The presented
method can be extended by applying other practical constraints realizing time
windows and priorities [5,13,19,25], conflict-free paths [3,7], and multi-modal
itinerary planning [21].
6 Conclusions
This paper presents a cooperative path planner based on the Multi-Agent Path
Planning (MAPP) problem with additional constraints to model more practi-
cal situations. The proposed solution uses a Genetic Algorithm (GA) applying
genetic-like operators and a heuristic method. It evolves an Initial Plan towards
a near-optimal Multi-Agent Plan by minimizing the length of the longest path.
The applicability of the approximate solution was tested in four random
scenarios. The computational results show that restricting the maximum number
of targets for each agent can mostly improve the performance of the solution if
the Initial Plan is generated by Greedy initialization. Furthermore, on average
the cost of the Final Plan can be decreased by limiting the maximum range
for each agent if the Initial Plan is generated by Greedy initialization. This
trend is observed in the scenarios with more targets (200 in this study) even if
the Initial Plan is generated by Random initialization. These results show that
using a simple yet efficient initial solution, the developed method can potentially
enhance the efficiency of route planning for a fleet of vehicles when the aim is
to optimize open paths (subtours) in a cooperative manner, and the vehicles of
the fleet have a limited operational range or a balance between their workload
is desired. This situation can be observed in a variety of mobility and logistics
services such as crowd-sourced delivery (e.g., TOURMIX), group ride-sharing,
general home delivery, and airport shuttles.
464 A. Maktabifard et al.
References
1. Bektas, T.: The multiple traveling salesman problem: an overview of formulations
and solution procedures. Omega 34(3), 209–219 (2006). https://doi.org/10.1016/
j.omega.2004.10.004
2. Bentley, J.L.: Experiments on traveling salesman heuristics. In: Proceedings of the
First Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 1990), USA,
pp. 91–99. SIAM (1990)
3. Bose, J., Reiners, T., Steenken, D., Voss, S.: Vehicle dispatching at seaport con-
tainer terminals using evolutionary algorithms. In: Proceedings of the 33rd Annual
Hawaii International Conference on System Sciences (HICSS), USA, vol. 2, pp. 1–
10. IEEE (2000). https://doi.org/10.1109/HICSS.2000.926669
4. Chen, J., Ling, F., Zhang, Y., You, T., Liu, Y., Du, X.: Coverage path planning of
heterogeneous unmanned aerial vehicles based on ant colony system. Swarm Evol.
Comput. 69, 101005, 1–11 (2022). https://doi.org/10.1016/j.swevo.2021.101005
5. Fournier, S.M.R., Hülse, E.O., Pinheiro, É.V.: A*-guided heuristic for a multi-
objective bus passenger Trip Planning Problem. Public Transp. 13, 557–578 (2021).
https://doi.org/10.1007/s12469-019-00204-1
6. Gavish, B., Srikanth, K.: An optimal solution method for large-scale multiple trav-
eling salesmen problems. Oper. Res. 34(5), 698–717 (1986). https://doi.org/10.
1287/opre.34.5.698
7. Gawrilow, E., Köhler, E., Möhring, R.H., Stenzel, B.: Dynamic routing of auto-
mated guided vehicles in real-time. In: Krebs, H.-J., Jäger, W. (eds.) Mathematics -
Key Technology for the Future, pp. 165–177. Springer, Heidelberg (2008). https://
doi.org/10.1007/978-3-540-77203-3 12
8. Ghafurian, S., Javadian, N.: An ant colony algorithm for solving fixed destina-
tion multi-depot multiple traveling salesmen problems. Appl. Soft Comput. 11(1),
1256–1262 (2011). https://doi.org/10.1016/j.asoc.2010.03.002
9. Giardini, G., Kalmár-Nagy, T.: Genetic algorithm for combinatorial path planning:
the subtour problem. Math. Probl. Eng. 2011, 483643, 1–31 (2011). https://doi.
org/10.1155/2011/483643
10. Giardini, G., Kalmar-Nagy, T.: Genetic algorithm for multi-agent space explo-
ration. In: Proceedings of AIAA Infotech@Aerospace 2007 Conference and Exhibit
2824, USA, pp. 1–15. AIAA (2007). https://doi.org/10.2514/6.2007-2824
11. Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learn-
ing. Addison-Wesley Publishing Company Inc., Boston (1989)
12. Harrath, Y., Salman, A.F., Alqaddoumi, A., Hasan, H., Radhi, A.: A novel hybrid
approach for solving the multiple traveling salesmen problem. Arab J. Basic Appl.
Sci. 26(1), 103–112 (2019). https://doi.org/10.1080/25765299.2019.1565193
13. Herbawi, W., Weber, M.: The ridematching problem with time windows in dynamic
ridesharing: a model and a genetic algorithm. In: Proceedings of 2012 IEEE
Congress on Evolutionary Computation, USA, pp. 1–8. IEEE (2012). https://doi.
org/10.1109/CEC.2012.6253001
Constrained Multi-agent Path Planning Problem 465
14. Hou, M., Liu, D.: A novel method for solving the multiple traveling salesmen
problem with multiple depots. Chin. Sci. Bull. 57(15), 1886–1892 (2012). https://
doi.org/10.1007/s11434-012-5162-7
15. Jiang, C., Wan, Z., Peng, Z.: A new efficient hybrid algorithm for large scale
multiple traveling salesman problems. Expert Syst. Appl. 139, 112867, 1–11 (2020).
https://doi.org/10.1016/j.eswa.2019.112867
16. Kalmár-Nagy, T., Giardini, G., Bak, B.D.: The multiagent planning problem. Com-
plexity 2017, 3813912, 1–12 (2017). https://doi.org/10.1155/2017/3813912
17. Kara, I., Bektas, T.: Integer linear programming formulations of multiple salesman
problems and its variations. Eur. J. Oper. Res. 174(3), 1449–1458 (2006). https://
doi.org/10.1016/j.ejor.2005.03.008
18. Karabulut, K., Öztop, H., Kandiller, L., Tasgetiren, M.F.: Modeling and opti-
mization of multiple traveling salesmen problems: an evolution strategy approach.
Comput. Oper. Res. 129, 105192, 1–19 (2021). https://doi.org/10.1016/j.cor.2020.
105192
19. Király, A., Abonyi, J.: Redesign of the supply of mobile mechanics based on a novel
genetic optimization algorithm using Google Maps API. Eng. Appl. Artif. Intell.
38, 122–130 (2015). https://doi.org/10.1016/j.engappai.2014.10.015
20. Laporte, G., Nobert, Y.: A cutting planes algorithm for the m-salesmen problem.
J. Oper. Res. Soc. 31(11), 1017–1023 (1980). https://doi.org/10.1057/jors.1980.
188
21. Redmond, M., Campbell, A.M., Ehmke, J.F.: Data-driven planning of reliable
itineraries in multi-modal transit networks. Public Transp. 12, 171–205 (2020).
https://doi.org/10.1007/s12469-019-00221-0
22. Ryan, J.L., Bailey, T.G., Moore, J.T., Carlton, W.B.: Reactive Tabu Search in
unmanned aerial reconnaissance simulations. In: Proceedings of 1998 Winter Sim-
ulation Conference (Cat. No. 98CH36274), USA, vol. 1, pp. 873–879. IEEE (1998).
https://doi.org/10.1109/WSC.1998.745084
23. Somhom, S., Modares, A., Enkawa, T.: Competition-based neural network for the
multiple travelling salesmen problem with minmax objective. Comput. Oper. Res.
26(4), 395–407 (1999). https://doi.org/10.1016/S0305-0548(98)00069-0
24. Song, C.-H., Lee, K., Lee, W.D.: Extended simulated annealing for augmented TSP
and multi-salesmen TSP. In: Proceedings of the International Joint Conference on
Neural Networks (IJCNN), USA, vol. 3, pp. 2340–2343. IEEE (2003). https://doi.
org/10.1109/IJCNN.2003.1223777
25. Tong, Y., Zeng, Y., Zhou, Z., Chen, L., Ye, J., Xu, K.: A unified approach to
route planning for shared mobility. Proc. VLDB Endow. 11(11), 1633–1646 (2018).
https://doi.org/10.14778/3236187.3236211
26. Torabbeigi, M., Lim, G.J., Kim, S.J.: Drone delivery scheduling optimization con-
sidering payload-induced battery consumption rates. J. Intell. Robot. Syst. 97(3),
471–487 (2020). https://doi.org/10.1007/s10846-019-01034-w
27. Wang, X., Liu, D., Hou, M.: A novel method for multiple depot and open paths,
multiple traveling salesmen problem. In: Proceedings of 2013 IEEE 11th Interna-
tional Symposium on Applied Machine Intelligence and Informatics (SAMI), USA,
pp. 187–192. IEEE (2013). https://doi.org/10.1109/SAMI.2013.6480972
28. Yu, H., Meier, K., Argyle, M., Beard, R.W.: Cooperative path planning for tar-
get tracking in urban environments using unmanned air and ground vehicles.
IEEE/ASME Trans. Mechatron. 20(2), 541–552 (2015). https://doi.org/10.1109/
TMECH.2014.2301459
466 A. Maktabifard et al.
29. Yuan, S., Skinner, B., Huang, S., Liu, D.: A new crossover approach for solving
the multiple travelling salesmen problem using genetic algorithms. Eur. J. Oper.
Res. 228(1), 72–82 (2013). https://doi.org/10.1016/j.ejor.2013.01.043
30. Zhou, H., Song, M., Pedrycz, W.: A comparative study of improved GA and PSO
in solving multiple traveling salesmen problem. Appl. Soft Comput. 64, 564–580
(2018). https://doi.org/10.1016/j.asoc.2017.12.031
UAV Path Planning for Area Coverage
and Energy Consumption in Oil and Gas
Exploration Environment
Abstract. This paper proposes a model for unmanned aerial vehicles (UAV) grid-
based coverage path planning, considering coverage completeness and energy con-
sumption in complex environments with multiple obstacles. The work is inspired
by the need for more efficient approaches to oil and gas exploration, but other
application areas where UAVs can be used to explore unknown environments can
also benefit from this work. An energy consumption model is proposed that con-
siders acceleration, deceleration, and turning manoeuvres, as well as the distance
to obstacles, to more accurately simulate the UAV’s movement in different envi-
ronments. Three different environments are modelled: desert, forest, and jungle.
The energy-aware coverage path planning algorithm implemented seeks to reduce
the energy consumption of a single drone while increasing coverage completeness.
The model implementation and experiments were performed in the ROS/Gazebo
simulation software. Obtained results show that the algorithm performs very well,
with the drone able to manoeuvre itself in a combination of hills, valleys, rugged
terrain, and steep topography while balancing coverage and energy consumption.
1 Introduction
Environmental concerns have brought challenges to the oil and gas industry, including
pressure to reduce and stabilise costs for competitive advantage, improve its environ-
mental blueprint, and optimise its performance. In recent years, the production of most
onshore oil fields has declined [12] in part because, after a certain level of recovery, pro-
duction costs do not justify further investment. Oil and gas exploration often takes place
in difficult-to-reach environments where the use of robotic systems can be useful [37].
Petroleum deposits are generated by a natural process that commonly occurs at great
depth and is often poorly understood and predicted by earth scientists. Hence large areas
with significant oil and gas potential remain unexplored [25]. In 2015, Shell Oil decided
to abandon efforts to find and develop hydrocarbon resources in the Chukchi Sea, despite
having spent billions of dollars in exploration, which in the end turned out to be a dry
hole [38]. This type of risk cannot be eliminated, but it can be reduced through tech-
nological innovations. An example of such technology is magnetic exploration which
measures variations in the earth’s magnetic field to identify rocks containing hydrocar-
bon [1]. Aeromagnetic surveys are used to measure magnetic anomalies using low-flying
aircraft carrying a high-precision magnetometer, a sensor for magnetic anomaly detec-
tion [2]. However, aeromagnetic surveying with low-flying aircraft brings challenges in
terms of safety, terrain complexity, quality of data, and costs. An attractive alternative
is to conduct magnetic exploration with UAVs [38]. UAVs can fly close to the surface
at optimal speed with efficient coverage for gathering good-quality magnetic data [42].
UAVs are fast becoming an attractive solution for many scenarios like search and rescue
[19], precision agriculture [31], and delivery systems [26].
Magnetic exploration with UAVs requires effective path planning in unknown envi-
ronments and with possibly multiple obstacles (e.g., trees, large rocks, etc.). The robotic
path planning problem can be divided into two main categories: motion path planning
and coverage path planning. In motion path planning, there is a clear start point and an
end point; the goal is to optimally cover the distance between the start to the endpoint
at minimum cost while avoiding obstacles [35]. Coverage path planning (CPP) finds an
optimal collision-free path that a robot must take to pass over each point in an area of
interest in the given environment [10]. CPP is related to the covering salesman problem
where an agent must travel a minimum-length tour covering the subsets of given cities or
customers [46, 24]. CPP can be offline or online depending on the availability of a priori
information about the area of interest [18], and several algorithms exist to tackle this
problem [40]. In CPP, the area of interest can be decomposed into smaller regions or not
at all. Simple and regular-shaped environments require no decomposition, and simple
geometric patterns such as back-and-forth (BF), zigzag movement, or spiral patterns
can be used to solve the problem [34]. When the exploration area is irregular-shaped
and complex, decomposition can be done in different ways, like exact cellular decom-
position [28], approximate cellular decomposition [8, 21], or grid-based decomposition
[18]. For the scenarios investigated in this paper, grid-based decomposition was used to
ensure that every cell within the area of interest is visited once. This method requires
computational power to represent the cells at higher resolution grids [24]. After the grid
is produced, a traveling salesman algorithm is applied to generate a sequence of nodes
or subregions to visit [4]. The coverage path is generated by connecting these nodes in
sequence using back-and-forth motion perpendicular to the sweep direction from the
start to the goal region [9, 28].
Magnetic surveying with UAVs for oil/gas exploration requires complete coverage of
the area of interest. Performing CPP with UAVs in an environment with multiple obsta-
cles is energy-demanding due to the need for obstacle avoidance, and limited energy
capacity is a feature of UAVs [20]. The approach in this paper minimised energy con-
sumption by limiting accelerations, decelerations, and turning manoeuvres by the drone
while at the same time maximising coverage. Aided by simultaneous localisation and
mapping (SLAM), a model for coverage path-planning in a regular and irregular-shaped
environment is developed in ROS/Gazebo platform. The regular-shaped environment
has a perfect square-shaped setting with no mountains, hills, or valleys. In contrast,
UAV Path Planning for Area Coverage and Energy Consumption 469
the irregular-shaped environment has a complex terrain with no specific shape. Exper-
iments were conducted to study the tradeoff between area coverage and energy con-
sumption. Section 2 outlines related research on coverage path planning. Section 3
describes the methodology and cost function. Section 4 presents the experimental result
and discussions, and concluding remarks are presented in Sect. 5.
2 Related Research
Coverage path planning (CPP) has been extensively studied in the literature but is still
an open problem in robotics [40]. Different approaches have been adopted to classify
the problems into (i) increasing the coverage completeness, (ii) reducing the path over-
lapping, (ii) reducing the energy consumption, (iv) optimising the number of turns, and
(v) reducing the time to completion. A randomised approach does not require sensors
and algorithms for localisation, but such an approach is inefficient when dealing with
large coverage areas requiring more energy and time to complete the coverage [14].
A distributed strategy and model for UAVs oil spill mapping used randomness and
probabilistic guessing to avoid visiting all the cells and hence reduce the total distance
travelled, but coverage completeness decreased [32]. Different geometric patterns and
shapes for coverage patterns have been studied, including back-and-forth (BF), Hilbert
curves, LMAT (chain of equilateral triangles), S curves, and spiral patterns [5]. An opti-
mal line-sweep decomposition path planner to minimise the time required for covering
the area with obstacles in an unknown environment was presented in [4]. The authors
claimed that changing the sweep direction, as shown in Fig. 1, helps in minimising the
number of turns and hence reducing the time to completion. Similarly, reducing the
number of turns using optimal sweep direction and coverage pattern was considered in
[3, 24] in a quest to reduce energy consumption. Energy-aware CPP is an active area of
research aiming at minimum energy consumption and maximum area coverage. Some
studies considered energy optimisation based on the UAV structure, aerodynamic prop-
erties, rotor efficiency, and energy consumed in onboard data processing and controls
[17–44].
Fig. 1. Coverage path-planning with sweep direction changed to reduced turning angles.
for both single and multi-robot coverage [13, 20, 30], and reduce the number of turns [11,
41]. An algorithm for single-robot coverage path planning under constrained energy was
presented in [39]. The robot was aware of its energy limitation, and hence the total dis-
tance traveled to refueling was minimised. Boustrophedon cellular decomposition using
back-and-forth or “ox-plow” motions and Dijkstra’s algorithm were used. Although
reducing the total distance to refueling is important, most of the energy is often con-
sumed during the coverage and turning manoeuvres. Grid-based coverage path planning
exhibits excellent performance in several robotic applications in irregular shapes and no
fly-zones environments [5, 7]. In the work proposed by [43], the irregular-shaped area of
interest was decomposed into regular cells of equal sizes, using an approximate cellular
decomposition technique. The authors introduced a cost function to minimise the num-
ber of turning manoeuvres to save energy but did not take into account important factors
like optimal energy and energy consumption due to accelerations and decelerations. The
energy-aware algorithm proposed by [6] drastically reduced the energy consumption
for the entire coverage. The model was based on optimising the turning manoeuvres,
and avoiding sharp angles while reducing speed, acceleration, and deceleration at the
turning angles. An improved cost function aimed at minimising energy consumption in
an irregular shape was proposed in [7]. The model was good in energy saving, and it may
scale well in a simplistic environment, especially those applications that do not require
a high degree of coverage completeness. Only a single occlusion point was considered
in their experiment, but multiple obstacles can have a significant effect on the coverage
completeness.
Then, optimising the coverage completeness, energy consumption, and distance to
obstacles in complex oil and gas environments with multiple occlusions is an interesting
problem that has not been investigated to the best of our knowledge. The problem
is unique because magnetic survey for oil/gas exploration requires near optimum line
spacing and flight height, and the coverage completeness can be easily affected when
trying to avoid obstacles. The environment is often mixed with regular and irregular
settings of different terrains. In addressing this problem, the following question arises:
Can an intelligent coverage path-planning algorithm that reduces the number of sharp
turns for obstacle avoidance in a highly irregular-shaped environment reduce the amount
of energy consumed by the UAV without reducing the coverage completeness?
3 Proposed Methodology
The proposed model was implemented in the ROS/Gazebo simulation software using
a drone package hector quadrotor noetic [33]. As a proof of concept for the magne-
tometer, the drone is integrated with a magnetic sensor to read the magnetic field of
the Gazebo environment. The modelled exploration environment is rugged terrain with
steep topography, variations in the surface elevations, and thick vegetation cover, as
illustrated in Fig. 2. The environment E is then partitioned into a grid with multiple
cells of equal size Mx × My as illustrated in Fig. 3. Each cell (x, y) has an associated
value u(x, y, t) ∈ [0, 1] that represents the UAV’s uncertainty about the target distri-
bution in the cell at time interval t, for occupancy grid mapping, implemented using the
slam_gmapping package provided in ROS [22]. The distance between each grid cell for
UAV Path Planning for Area Coverage and Energy Consumption 471
cell-to-cell movement can be calculated using Eq. (1) where xi , yi are the target goal
point [27].
m
d (x, y) = |xi − yi | (1)
i=1
Fig. 2. Example of forest environment modeled in the Gazebo simulation software for oil and
gas exploration.
square meter. An obstacle in the form of a tree is shown at the center of the exploration
environment. Three possible waypoints are shown, marked as a, b, and c. Waypoint
(a) maximises coverage in the occluded region but consumes more energy due to the
sharp angles at the path. Waypoint (b) minimises energy consumption due to the reduced
distance by the resultant vector but still consumes energy due to turning manoeuvres.
Waypoint (c) consumes less energy because of the smoothed angle on the waypoint but
with reduced coverage. The problem lies in optimising Delta , the distance between
the drone and the obstacle during the coverage as a constraint to be imposed for obstacle
avoidance by the UAV. Minimising the delta increases the coverage, and maximising it
reduces energy consumption.
Given an initial position (xo , yo ) and target goal points (xi , yi ), a collision-free path
ensures that at no time should the UAV enter into the no-fly-zone region. The equation
for collision avoidance is presented as follows:
∀p ∈ [1 . . . N ], ∀i ∈ [0 . . . T]
xip − xk ≥ (2)
yip − yk ≥ (3)
In the Eqs. (2), (3) above, xip , yip are the drone position in the x and y axis at the T
time instance, and xk , yk are the obstacle position at the x-axis and y-axis, respectively.
Since most obstacles in the forest can be approximated to the shape of a sphere or circle
in 2D, Eqs. (2) and (3) can be written as Eq. (4).
xip − xk sin θ + yip − yk cos θ ≥ (4)
The instantaneous power for the flight scenarios is calculated with Eq. (5), where V
is an instantaneous voltage, and I is the instantaneous current consumed by the drone.
P = VI (5)
UAV Path Planning for Area Coverage and Energy Consumption 473
The total energy consumption can be calculated by integrating the power over the
period of the mission, as shown in the energy Eq. (6), where E is the energy consumed
in joules, P is the instantaneous power consumed in watts, and t is the time interval for
the flight mission.
t t
E= Pdt = VIdt (6)
0 0
The energy consumed when moving at a straight line to cover a distance d at constant
speed v can be calculated with Eq. (7).
d /v d
Ec = ∫ Pc dt = Pc (7)
0 v
The energy consumed for variable speed due to acceleration and deceleration can be
calculated with Eq. (8).
di /vi do /vo
Evar = ∫ Pacc dt = Pv (t2 − t1 ) + ∫ Pdecc dt = Pv (t2 − t1 ) (8)
do /vo di /vi
While the total energy needed for the coverage can be compud with Eq. (12).
0 d /v
m vi
Etotal = Pacc dt + Pdec dt + Pc dt
i=1 0 vi 0
h/vclimb θ/ωi
+ Ph dt + Pturn dt (12)
0 θo/ωo
474 S. S. Maaji and D. Landa-Silva
The UAV energy model, environment model, and coverage path planning approach
were all implemented in the ROS/Gazebo simulation software. Several experiments
were conducted for various types of exploration scenarios. The scenarios consist of a
perfect square regular-shaped environment and an irregular-shaped environment char-
acterised by arbitrarily shaped obstacles. We investigated the tradeoff between coverage
completeness and energy consumption in these environments.
UAV Path Planning for Area Coverage and Energy Consumption 475
4 Experimental Result
Three different environments usually seen in oil/gas exploration were modelled: a desert
with low obstacles, including hills and valleys; a forest with a combination of hills, val-
leys, and moderate vegetation; and a jungle depicted rugged terrain and steep topography.
A survey conducted by [48] shows that flying drones at varying altitudes of 0.5 m, 1.3
m, and 2.2 m above the ground with line spacing of 1 m provides a good result. The
flight altitude was set to 1 m, and the speed was set to v = 2 m/s for the entire experi-
ment. The experiment in a desert environment is considered a benchmark to evaluate the
coverage quality, assuming the drone could successfully cover the cell with minimum
energy consumption without obstacles.
Fig. 4. The red dots represent the area in which the drone performs coverage path planning. The
gray area represents a region in which the UAV is certain there is no obstacle, i.e., path-planning
can be performed on it. The dark edges in the gray regions are obstacle or no-fly zones identified
based on the principle of occupancy grid mapping, and the remaining part of the graph in light
green represents the unexplored area.
To further assess the performance of the algorithm, another experiment was con-
ducted using the same parameters but in a much bigger and irregular map illustrated in
Fig. 5. The area was decomposed into 110 cells of equal size, and the algorithm scales
well in the desert, and also performs well in the forest with few obstacles (Fig. 5b),
as well as in the jungle with more obstacles (Fig. 5c). In the jungle exploration, the
number of uncovered cells was 31 compared to 36 in the forest. In the second sce-
nario, the delta was s et to 0.75 m, and an experiment was also conducted again in a
476 S. S. Maaji and D. Landa-Silva
regular-shaped environment, decomposed into 20 cells of equal size (5 × 4). It was also
run in an irregular-shaped environment decomposed into 110 cells of equal size. Like
the first scenario, three flights were performed to compare the algorithm’s effectiveness
in deserts, forests, and jungles. In the third scenario, the delta was set to 1.0 m, and
the experiment was conducted similarly to scenarios one and two. The regular-shaped
environment was decomposed into 20 cells of equal size (5 × 4). The irregular-shaped
environment was decomposed into 110 cells of equal size. Three flights were performed
to compare the algorithm’s effectiveness in deserts, forests, and jungles. Setting the delta
to 1.0 m makes the coverage very difficult in an obstacle environment. In most cases,
the drone has to abandon the mission because it was trapped for over 8 min facing an
obstacle, i.e., it cannot pass through a narrow path because the delta is too big. Hence,
the energy consumed in the jungle was less than what was consumed in the forest. The
trees in the forest along the way point were dense, and the drone was trapped hence
abandoning the mission before completion.
Fig. 5. Coverage path planning in desert, forest, and jungle for more complex environments.
In some cases, the drone was only able to cover 7 out of the 20 cells in a regular-shaped
forest environment and 29 out of 110 cells in an irregular-shaped forest environment.
Tables 1 and 2 show the average value of the energy consumption and coverage for the
three flights performed in deserts, forests, and jungles.
120
100
Cells(sq meter)
80
60
40
20
0
Desert Forest Jungle Desert Forest Jungle
4000
Energy (Joules) 3500
3000
2500
2000
1500
1000
500
0
Desert Forest Jungle Desert Forest Jungle
5 Conclusion
This paper described a model for UAV Path Planning considering the tradeoff between
area coverage and energy consumption. The work is inspired by magnetic data sur-
veying for oil/gas exploration, where environments are usually unknown and present
multiple obstacles. Simulations were implemented in ROS/Gazebo for three environ-
ments: desert, forest, and jungle. A parameter delta is used to set the constraint for
obstacle avoidance, and experiments with different values (0.5 m, 0.75 m, and 1.0 m)
were conducted. Experimental results show that the drone effectively maneuvered itself
in different environmental settings, achieving high coverage and moderate energy con-
sumption when the delta was set to 0.5 m in a desert environment. As the complexity
of the environment increases, energy consumption increases due to obstacle avoidance
while the coverage optimality reduces. The overall best results were obtained when the
delta was set to 0.5 m, skipping a few cells while performing coverage in forest and
jungle but with added energy consumption compared to exploration in the desert. The
worst coverage result was obtained when the delta was set to 1.0 m in the forest and
the jungle. These results show that the delta value influences energy consumption and
coverage completeness. The larger the delta value, the less energy consumption due to
the smoothed angle and less acceleration and deceleration along the obstacle avoidance
path. The lower the delta value, the more coverage completeness. There was a tradeoff
between the number of cells covered and energy consumption in the occluded environ-
ment, and in some cases, the drone was trapped by the obstacles. Further work will look
at more sophisticated path-planning algorithms and multiple drones.
UAV Path Planning for Area Coverage and Energy Consumption 479
References
1. Abubakar, R., et al.: Formation of magnetic minerals at hydrocarbon-generation conditions.
Mar. Pet. Geol. 68, 509–519 (2015). https://doi.org/10.1016/j.marpetgeo.2015.10.003
2. Aminzadeh, F., Dasgupta, S.: Geophysics for petroleum engineers: chapter 3. Fundamentals
of Petroleum Geophysics, 1st edn, vol. 60, pp. 1–282. Elsevier Inc., UK (2013). ISBN: 978-
0-444-50662-7
3. Araujo, J., Sujit, P., Sousa, J.: Multiple UAV area decomposition and coverage. In: 2013 IEEE
Symposium on Computational Intelligence for Security and Defense Applications (CISDA),
pp. 30–37 (2013). https://doi.org/10.1109/CISDA.2013.6595424
4. Bähnemann, R., Lawrance, N., Chung, J., Pantic, M., Siegwart, R., Nieto, J.: Revisiting bous-
trophedon coverage path planning as a generalized traveling salesman problem. In: Ishigami,
G., Yoshida, K. (eds.) Field and Service Robotics. Springer Proceedings in Advanced
Robotics, vol. 16, pp. 277–290. Springer, Singapore (2021). ISBN: 978-981-15-9459-5.
https://doi.org/10.1007/978-981-15-9460-1_20
5. Cabreira, T.M., Brisolara, L.B., Paulo R, F.J.: Survey on coverage path planning with
unmanned aerial vehicles. Drones 3(1) (2019). https://doi.org/10.3390/drones3010004
6. Cabreira, T., Franco, C., Ferreira, P., Buttazzo, G.: Energy-aware spiral coverage path planning
for UAV photogrammetric applications. IEEE Robot. Autom. Lett. 3(4), 3662–3668 (2018).
https://doi.org/10.1109/LRA.2018.2854967
7. Cabreira, T., Ferreira, P., Di Franco, G. Buttazzo, C.: Grid-based coverage path planning with
minimum energy over irregular-shaped areas with UAVs. In: 2019 International Conference on
Unmanned Aircraft Systems (ICUAS), pp. 758–767, (2019). https://doi.org/10.1109/ICUAS.
2019.8797937
8. Cai, C., Ferrari, S.: Information-driven sensor path planning by approximate cell decomposi-
tion. IEEE Trans. Syst. Man Cybern. Part B 39(3), 672–689 (2009). https://doi.org/10.1109/
TSMCB.2008.2008561
9. Choset, H., Acar, E., Rizzi, A., Luntz, J.: Exact cellular decompositions in terms of critical
points of Morse functions. In: Proceedings ICRA. Millennium Conference. IEEE International
Conference on Robotics and Automation. Symposia Proceedings, vol. 3, pp. 2270–2277, USA
(2000). https://doi.org/10.1109/ROBOT.2000.846365
10. Choset, H., Pignon, P.: Coverage path planning: the boustrophedon cellular decomposition.
In: Field and Service Robotics, pp. 203–209 (1998). ISBN: 978-1-4471-1275-4
11. Choton, C., Prabhakar, P.: Optimal multi-robot coverage path planning for agricultural fields
using motion dynamics. In: 2023 IEEE International Conference on Robotics and Automa-
tion(ICRA), pp. 11817–11823. IEEE (2023). https://doi.org/10.1109/ICRA48891.2023.101
60265
12. Dou, L., et al.: Analysis of the world oil and gas exploration situation in 2021. Pet. Explor.
Dev. 49(5), 1195–1209 (2022). https://doi.org/10.1016/S1876-3804(22)60343-4
13. Duan, L., Wang, J. and Sun, Y.: Multi-robot online complete coverage based on collaboration.
In: Yang, S., Islam, S. (eds.) Web and Big Data. APWeb-WAIM 2022 International Workshops.
APWeb-WAIM 2022. CCIS, vol. 1784, pp. 219–231. Springer, Singapore (2023). https://doi.
org/10.1007/978-981-99-1354-1_19
14. Engelsons, D., Tiger, M., Heintz, F.: Coverage path planning in large-scale multi-floor urban
environments with applications to autonomous road sweeping, In: 2022 International Confer-
ence on Robotics and Automation (ICRA), pp. 3328–3334 (2022). “https://doi.org/10.1109/
ICRA46639.2022.9811941”
15. Franco, C., Buttazzo, G.: Energy-aware coverage path planning of UAVs. In: 2015 IEEE
International Conference on Autonomous Robot Systems and Competitions, pp. 111–117
(2015). https://doi.org/10.1109/ICARSC.2015.17
480 S. S. Maaji and D. Landa-Silva
16. Franco, C., Buttazzo, G.: Coverage path planning for UAVs photogrammetry with energy and
resolution constraints. J. Intell. Robot. Syst. 83(3), 445–462 (2016). https://doi.org/10.1007/
s10846-016-0348-x
17. Frew, E., Argrow, B., Houston, A., Weiss, C., Elston, J.: An energy-aware airborne dynamic
data-driven application system for persistent sampling and surveillance. Procedia Comput.
Sci. 18, 2008–2017 (2013). https://doi.org/10.1016/j.procs.2013.05.370
18. Galceran, E., Carreras, M.: A survey on coverage path planning for robotics. Robot. Auton.
Syst. 61(12), 1258–1276 (2013). https://doi.org/10.1016/j.robot.2013.09.004
19. Gallego, A., Pertusa, A., Gil, P., Fisher, R.: Detection of bodies in maritime rescue operations
using unmanned aerial vehicles with multispectral cameras. J. Field Robot. 36(4), 782–796
(2019). https://doi.org/10.1002/rob.21849
20. Ghaddar, A., Merei, A., Natalizio, E.: PPS: energy-aware grid-based coverage path planning
for UAVs using area partitioning in the presence of NFZs. Sensors 20(13) (2020). https://doi.
org/10.3390/s20133742
21. Guruprasad, K., Ranjitha, T.: CPC algorithm: exact area coverage by a mobile robot using
approximate cellular decomposition. Robotica 39(7), 1141–1162 (2021). https://doi.org/10.
1017/S026357472000096X
22. GvdHoorn, slam_gmapping, gmapping, 2019. http://wiki.ros.org/gmapping. Accessed 07
Mar 2023
23. Jensen-Nau, K., Hermans, R, Leang, T., Leang, K.: Near-optimal area-coverage path planning
of energy-constrained aerial robots with application in autonomous environmental monitor-
ing. IEEE Trans. Autom. Sci. Eng. 18(3), 1453–1468. IEEE (2020). https://doi.org/10.1109/
TASE.2020.301627
24. John, R., Current, Schilling, D.: The covering salesman problem. Transp. Sci. 23(3), 208–213
(1989). https://doi.org/10.1287/trsc.23.3.208
25. Jones, C.M.: The oil and gas industry must break the paradigm of the current exploration
model. J. Pet. Explor. Prod. Technol. 8(1), 131–142 (2017). https://doi.org/10.1007/s13202-
017-0395-2
26. Kuru, K., Ansell, D., Khan, W., Yetgin, H.: Analysis and optimization of unmanned aerial
vehicle swarms in logistics: an intelligent delivery platform. IEEE Access 7, 15804–15831
(2019). https://doi.org/10.1109/ACCESS.2019.2892716
27. Kyaw, P., Paing, A., Thu, T., Mohan, R., Le, A., x Veerajagadheswar, A.: Coverage path plan-
ning for decomposition reconfigurable grid-maps using deep reinforcement learning based
travelling salesman problem. IEEE Access 8, 225945–225956 (2020). https://doi.org/10.1109/
ACCESS.2020.3045027
28. Li, Y., Chen, H., Er, M., Wang, X.: Coverage path planning for UAVs based on enhanced
exact cellular decomposition method. Mechatronics 21(5), 876–885 (2011). https://doi.org/
10.1016/j.mechatronics.2010.10.009
29. Merz, M., Pedro, D., Skliros. V., Bergenhem, C. Himanka, M.: Autonomous UAS-based
agriculture applications: general overview and relevant European case studies, Drones. MDPI
6(5), 128 (2022). https://doi.org/10.3390/drones6050128
30. Mukhamediev, R., et al.: Coverage path planning optimization of heterogeneous UAVs group
for precision agriculture. IEEE Access 11, 5789–5803 (2023). https://doi.org/10.1109/ACC
ESS.2023.3235207
31. Murugan, D., Garg, A., Singh, D.: Development of an adaptive approach for precision agri-
culture monitoring with drone and satellite data. IEEE J. Sel. Top. Appl. Earth Obs. Remote
Sens. 10(12), 5322–5328 (2017). https://doi.org/10.1109/JSTARS.2017.2746185
32. Odonkor, P., Ball, Z., Chowdhury, S.: Distributed operation of collaborating unmanned aerial
vehicles for time-sensitive oil spill mapping, Swarm Evol. Comput. 46, 52–68, (2019). https://
doi.org/10.1016/j.swevo.2019.01.005
UAV Path Planning for Area Coverage and Energy Consumption 481
Michael Eley(B)
1 Introduction
In the flexible job shop problem (FJSP) a given set of jobs has to be scheduled.
Each job consists of an ordered set of operations and for each operation a set
of different machines is available to carry out the required processing. In most
papers that address this classical version of the flexible job shop problem some
given economic or production criteria like cost, total (weighted) tardiness or
makespan have to be optimized while meeting some production constraints like
adhering to the sequence of operations or preventing pre-emption.
In green scheduling this problem formulation is extended by considering addi-
tional aspects like energy cost, energy demand or carbon footprint. These addi-
tional aspects can be integrated by additional constraints or by adopting the
objective function, for example in a multi-objective optimization approach. In this
paper a hierarchical two-objective approach is pursued for a flexible job shop prob-
lem. This approach was motivated by a case study carried out at a mechanical
engineering company. Besides the primary objective function that minimizes the
makespan an additional secondary objective function that minimizes peak energy
demand (peak load) is introduced. We assume that energy demand is determined
solely by electricity demand. Other forms of energy such as thermal energy or com-
pressed air are not considered. As for the classical FJSP, the operations can be car-
ried out on different machines, which, however, have different energy demands.
Obviously, the two objective functions are in conflict with one another. For
example, in order to minimize peak energy demand, it could be advantageous not
to use machines with high energy demand at the same time. However, this can
lead to an increased makespan if machines with high energy demand are not used
in parallel. In order to ensure high capacity utilization and adherence of delivery
dates, makespan minimization is often the most important objective of a company.
The motivation for considering peak load minimization as the secondary
objective is that in the German electricity market bulk consumers for energy
like industrial companies are mainly charged according to their peak energy
demand, as power providers have to guarantee that peak energy demand can
be satisfied. Thus, minimizing peak demand results in lower energy costs for a
company. Merely minimizing the peak energy demand makes no sense from an
economic point of view since the proportion of energy costs in relation to total
costs is low and, as described above, other criteria must also be considered in
production planning.
The remainder of the paper is organized as follows. The relevant literature
to the problem under consideration is reviewed in the next section. In the third
section the problem is formulated as a non-linear integer programming problem.
This problem formulation will later on be used to evaluate the performance
of a solution approach that is presented in section four. The main focus of the
proposed approach is to minimize the peak energy demand for a given makespan.
The problem of minimizing the makespan in a flexible job shop is not addressed
in this paper. We refer to the large number of articles in the literature addressing
that problem (e.g. [1,2]). Results on comprehensive tests on extended benchmark
problems taken from the literature are discussed in section five. The last section
summarizes the main results and puts them into a perspective.
2 Literature Review
Below we review the literature on the flexible job shop problem with respect to
papers dealing with energy aspects. In addition, we will also discuss the literature
on minimizing peak energy demand. As the number of papers addressing this
objective for the flexible job shop problem is rather limited, we will discuss peak
load minimization for different variants of scheduling problems. A comprehensive
survey on the literature on different types of green scheduling problems like single
or parallel machine problems, job shop problems as well as flow shop problems
can be found in the paper of [16].
addressed green scheduling for different variants of the flexible job shop problem.
Surveys on flexible job shop approaches can be found in the papers of [8] or of
[23]. The latter one also includes green scheduling approaches. Besides makespan
optimization or minimization of total tardiness in most approaches minimizing
total energy demand (or total energy cost) is used as the secondary objective in
a multi-objective optimization approach.
Many papers on green scheduling assume that energy demand at a given
time depends on the state of the machine during that time. Therefore, states
like idle, setup, loading, warm-up or processing are considered. For most of these
states, the energy demand is known in advance and constant over time for each
operation. Only total energy demand during idle times is not known in advance as
the length of idle times depends on the result of the scheduling. Thus, in many
approaches the objective of minimizing total energy demand can be reduced
to minimizing idle energy demand or even idle times (cf. the approaches of
[10,15,18,19,21,24,34,39,42–44]).
In some approaches the production speed for the operations is an additional
decision variable, for example in the articles of [42] or [15]. With a higher produc-
tion speed processing times for operations can be reduced, resulting for example
in a lower makespan, but energy demand is increased.
Peak energy demand aspects for different types of scheduling problems have
recently been addressed by several authors. In these approaches, restricting peak
energy demand can be imposed by a constraint that limits a given maximum
value (threshold) for each single period or by the objective function that mini-
mizes peak energy demand.
[4,27] as well as [36] addressed problems from the process industry. Single
and parallel machine scheduling problems were discussed in the articles of [37]
and [3] as well as [40], respectively. [14] as well as [25] addressed the general
flow shop problem, whereas [6] and [12,13] focused on flexible flow shops. Peak
energy demand minimization for job shop problems were discussed by several
authors, e.g. [32,41,45] (reactive scheduling), [22] and [26].
As mentioned in Sect. 1, when peak energy demand aspects are considered,
the energy demand of a machine must be modeled more precisely. The same
applies if energy prices vary over time, e.g. for time-of-use tariffs. Therefore, [38]
proposed using energy load profiles instead of average energy demand values
particularly when considering peak energy demand. Furthermore, [35] recom-
mended a discretization of the energy demand as it avoids the modelling of
complex functions and thus reduces the problem’s complexity.
Thus, it is interesting to see that in most papers energy demand solely
depends on the factors like the operation, the machine, the machine load, the
state of the machine or the chosen speed of the machine, but it is kept con-
stant over the processing time (cf. [3,4,6,12–14,25,26,29,30,37,40]). Only a few
authors consider fluctuating energy demand over time.
Minimizing Peak Energy Demand in Flexible Job Shops 485
[27] were the first authors who examined variable energy demand (steam
energy) in their simulation model for the process industry. [36] consider varying
energy demand during processing a job for a batch production problem in the
process industry. [45] and [22] divided each operation of their job shop schedul-
ing problem into two parts. Energy demand is constant for both parts, but the
energy demand for the first part is higher than for the second part. [32] model
energy demand of machines by an integral of the energy during processing times.
The authors address a reactive job shop scheduling problem for a flexible man-
ufacturing system. [41] approximate the energy profile of a machine for a job
shop problem by a mathematical power series.
In summary, it can be stated that the flexible job shop problem with different
energy-oriented objectives and restrictions has been dealt with in the literature
in recent years. The main focus, however, was on minimizing total energy con-
sumption. There is also extensive literature on the topic of minimizing peak
energy load, with various problems being discussed. In contrast, the intersection
of the two subject areas green scheduling for the flexible job shop problem and
minimization of peak energy load is hardly considered. Furthermore, if any, other
optimization models than the problem presented in Sect. 1 are discussed. This
is surprising insofar as it is precisely in this case that peak load minimization
is promising if the machines that can carry out the processing of the operations
have different energy demands, and if the energy demand also changes over the
course of the processing.
Below we will introduce a new solution approach to the flexible job shop prob-
lem where we model the operation-dependent energy demand of the machines
by step functions. In the objective functions we will consider minimization of the
makespan and peak energy demand. We will implement a hierarchical approach
minimizing in the first stage the time-based objective function and in the second
stage peak energy demand. As far as the author is aware, this question has so
far not been investigated for the flexible job shop problem. Similar optimization
approaches have only been suggested for different scheduling problems. [6] pro-
posed this hierarchical approach for the flow shop problem; [3] addressed it to
the parallel machine problem.
3 Problem Formulation
The problem under consideration consists of n jobs that have to be scheduled on
a set of m machines. We will pursue a hierarchical two-objective optimization
approach. The primary objective function minimizes the makespan. Minimiza-
tion of peak energy demand serves as a secondary objective. Each job j consists
of hj operations. For each operation Oj,h a sub-set of the machines is permitted
to carry out the operations. We assume that each operation can be processed
by only one machine at a time and that each machine can only carry out one
operation at a time. Pre-emption is not allowed.
Before stating the problem formally, we introduce some notation in Table
1. In the problem formulation used here, the use of time-indexed variables is
486 M. Eley
dispensed with. The variables used instead lead to a non-linear constraint on the
one hand. On the other hand, they increase the comprehensibility and make it
clear how the sub-problems are separated from one another within the framework
of the hierarchical solution approach. It should also be noted that the use of
time-indexed variables does not improve the solvability of the problem since the
number of variables increases with the number of periods. For example, in the
test cases used in Sect. 5, this leads to a few million variables.
Table 1. Parameters and decision variables used
N number of jobs
M number of machines
hj number of operations for job j
l maximal number of operations assigned to a machine
T length of planning horizon
L a large
number
1 if operation Oj,h can be performed on machine i,
ai,j,h =
0 otherwise
pi,j,h processing time of operation Oj,h if performed on machine i
Ei,j,h (d) energy demand (wattage) for processing operation Oj,h on
machine i after elapsing of d = 1, ..., pi,j,h periods
Γ makespan
Π peak energy demand
tj,h start time of the processing of operation Oj,h
T Mi,k start
working time of the k-th operation of machine i
1 if machine i is selected for carrying out operation Oj,h ,
yi,j,h =
0 otherwise
1 if operation Oj,h is the k-th operation on machine i,
xi,j,h,k =
0 otherwise
1 if r = T Mi,k ,
wi,k,r =
0 otherwise
min Γ (1)
min Π (2)
The two objective functions (1) and (2) minimize the makespan and the peak
energy demand, respectively, subject to the following constraints.
M
yi,j,h = 1 j = 1, ..., N ; h = 1, ..., hj (3)
i=1
N
hj
xi,j,h,k ≤ 1 i = 1, ..., M ; k = 1, ..., l (5)
j=1 h=1
N
hj
N
hj
xi,j,h,k ≥ xi,j,h,k+1 i = 1, ..., M ; k = 1, ..., l − 1 (6)
j=1 h=1 j=1 h=1
l
xi,j,h,k = yi,j,h i = 1, ..., M ; j = 1, ..., N ; h = 1, ..., hj (7)
k=1
Constraint (3) guarantees that each operation Oj,h is assigned to exactly one
machine. This assignment can only take place if the corresponding operation can
be performed on the considered machine i. This is imposed by constraint (4).
The following constraint (5) limits the number of operations that can be assigned
on the k-th position of a machine to at most one. By constraint (6) the positions
of a machine are assigned sequentially. Note, that this constraint is not required
as one can always find an optimal solution that satisfies this constraint. Never-
theless, this constraint limits the number of possible assignments of operations
to machine positions. Finally, each operation assigned to one machine must also
be assigned to one position in the sequence of operations on that machine; cf.
constraint (7).
M
tj,h + yi,j,h · pi,j,h ≤ tj,h+1 j = 1, ..., N ; h = 1, ..., hj − 1 (8)
i=1
N
hj
T Mi,k + pi,j,h · xi,j,h,k ≤ T Mi,k+1
j=1 h=1
(11)
i = 1, ..., M ; k = 1, ..., l − 1
Constraint (8) ensures that the available start time of operation Oj,h+1 is
greater than or equal to the completion time of the previous operation of that
job j. The following two constraints (9) and (10) guarantee the consistency of
the time variables of operation Oj,h , i.e. tj,h , with the time variables of machine
i, i.e. T Mi,k . Both constraints are only restrictive in the event that the operation
is carried out on the considered machine as the k-th operation, i.e. if xi,j,h,k = 1.
Finally, constraint (11) guarantees that the time span between the starting times
of two consecutive operations on a machine are large enough to carry out the
operation. Again, this constraint is only restrictive if operation Oj,h is the k-th
operation on machine i.
488 M. Eley
M
N
hj
l p
i,j,h
M
tj,hj + yi,j,hj · pi,j,hj ≤ Γ j = 1, ..., N (13)
i=1
xi,j,h,k ∈ {0, 1}
(17)
i = 1, ..., M ; j = 1, ..., N ; h = 1, ..., hj ; k = 1, ..., l
4 Solution Approach
The solution approach pursued in this paper is based on the variable neigh-
borhood search (VNS). VNS, originally proposed by [17,28], is a meta-
heuristic method for solving combinatorial optimizations problems. A pseudo-
code description of the solution approach is shown in Algorithm 1.
In a first step an initial solution is generated by solving a classical FJSP with
respect to makespan minimization. For test runs carried out to assess the quality
of the proposed approach we used known optimal solutions for test cases from
the literature. It goes without saying that the proposed approach also works
when the makespan is not set to its minimal value.
This starting solution is set as the incumbent solution s. Within the proposed
approach a solution s is represented by a string that is a permutation of all
operations supplemented by a feasible machine μ and a starting time tj,h for
N
each operation Oj,h , i.e. si (j, h, μ, tj,h ) for i = 1... j=1 hj . One should keep in
mind that in order to reduce peak energy demand it is not sufficient to consider
only active schedules where all operations are scheduled as early as possible.
Therefore, the starting time of each operation must also be specified in the
solution string. The transformation of a solution string into a solution is carried
out by the local search procedures (cf. Sect. 4.2).
It should be noted that only permutations in which the specified sequence of
operations is adhered to for all jobs represent feasible solutions.
In the VNS approach a predetermined number of iterations imax is carried
out, each of them consisting of a shaking procedure and a local search step. In
every iteration, a random solution s is generated in the current neighborhood
N Hg (s) of s (shaking). Therefore, a total of gmax neighborhoods are available.
It starts in the first iteration with the first neighborhood, i.e. g := 1. Whenever
a solution s is replaced by a new solution the value of g is reset to 1; otherwise g
is increased by 1 in the subsequent iteration, i.e. g := g + 1. If g reaches a value
bigger than a predetermined value gmax then g is set to 1.
A subsequent local search procedure is applied to s transforming this solution
to s . If s is better than s with respect to a specific objective function f (·) or
s meets another acceptance criterion then s replaces s. At the end of the VNS
approach, the best solution sbest found during all iterations is returned as the
final solution.
The implemented neighborhoods, the local search procedures as well as the
acceptance criteria are presented in detail below.
selected from the set of operations that can be processed on at least two different
machines and assigned to new machines. Thereby, all potential operations and all
selectable machines for an operation have the same probability of being selected.
After changing the machine assignment of n operations in the solution string, the
starting times of all operations must be updated. This is done by the procedure
presented in the next section.
The second type of neighborhood (NH2) changes the sequence of operations
in the solution string. Whenever this neighborhood is invoked, up to n oper-
ations are shifted one after another. An operation Oj,h is selected by roulette
wheel selection, where probabilities for selecting an operation are calculated
using values π(j, h). When shifting operations to earlier or later periods in a
classical FJSP while minimizing makespan there is always a direct relationship
to the objective function value as the makespan is also measured in time peri-
ods. Unfortunately, there is no direct relationship between time and objective
function value when minimizing peak energy demand is pursued. For this reason,
we have to employ other mechanisms that detect which operations are difficult
to plan and which are likely to cause high peak energy demands. This selection
mechanism based on the π-values has been shown in test runs to be superior to
neighborhoods in which the operations were selected according to their maximum
energy demand. Variants in which operations were preferably selected that took
place at peak times were also significantly worse. The π(j, h) values are updated
by the local search procedures (cf. explanation below). The new position of the
operation to be shifted is selected randomly following a uniform distribution for
all positions that do not violate the processing sequence of the corresponding
job. After shifting one operation the local search procedure is started in order
to update the starting time of all operations in the solution string.
Because of the high computing effort for the generation of solutions, we have
restricted ourselves to inserting individual operations and have neglected more
complex operators in which several operations swap positions. In addition, we
Minimizing Peak Energy Demand in Flexible Job Shops 491
had to restrict the neighborhood width n to at most 3. It turned out that larger
values for n did not improve the solutions any more while just increasing required
CPU times. Thus, the value of gmax was set to 6. The neighborhoods are applied
in the following order: NH2/1, NH1/1, NH2/2, NH1/2, NH2/3, NH1/3, where
the number after the slash represents the value of the neighborhood width n.
They are invoked in the displayed order. It should be noted that the first two
procedures do not change the sequence of operations on the machines. With all
three local search procedures shifts are carried out in such a way that makespan
of a solution s generated by shaking does not increase. As a result, optimization
is no longer limited to only active schedules.
Smoothing. Starting from a given solution string where the sequence of oper-
ations is already given for every machine, this procedure tries to reduce peak
energy demand by delaying the beginning of operations. Remember that every
operation has been scheduled as early as possible when decoding a solution
string. If, for example, a machine is not used after the completion of an opera-
tion, the start of processing that operation may be delayed in order to reduce
peak energy demand. Figure 1 shows an illustrative example of this situation
with three jobs. Each job consists of up to two operations that are denoted by
Oj,h . The Gantt chart on the left shows the initial situation. Energy demand
for the three jobs and total energy demand for all periods is shown in the table
above the Gantt chart. The Gantt chart on the right shows the situation when
the beginning of the first operation of job 1 on machine 2, i.e. operation O1,1 , is
delayed by three periods. This shift results in a reduction of peak energy demand
from 14 to 10.
492 M. Eley
Fig. 1. Reducing peak energy demand from 14 (left) to 10 (right) by postponing the
start of the first operation of job 1, i.e. operation O1,1 , on machine 2 by three periods
The procedure tries to shift operations one by one beginning with the opera-
tion that determines the makespan, i.e. the operation with the latest completion
time. Thereafter, in every iteration of the smoothing procedure the operation
with the latest completion time that has not yet been considered is identified and
the beginning of that operation is delayed as long as makespan is not increased
or the next operation scheduled on that machine had to be delayed.
Shaving. This local search procedure considers the whole planning horizon.
Starting from the incumbent solution s the procedure tries to shift opera-
tions that are processed during the peak period in order to reduce peak energy
demand. At the same time, however, makespan must not be increased.
Let tpeak denote the peak period of a solution s . Energy demand for every
period t of the planning horizon and therefore the peak period can be calculated
by the left hand side of Eq. (12) as for a given solution s the values of decision
variables xi,j,h,k and wj,k,t−d+1 are known.
Within the procedure the set P contains all operations that are carried out
in solution s during the peak period. In addition, let us define by M (j, h) the
machine an operation Oj,h is assigned to. The operations from set P are exam-
ined one by one and the beginning of processing that operation is gradually
increased by one period. Note that when shifting an operation, it might also be
necessary to shift the succeeding operations belonging to the same job as well as
operations of other jobs that are assigned in a later period to the same machine.
These resulting shifts are also considered when examining the effects on peak
energy demand. Whenever shifting an operation causes the completion of an
operation (either that one or a different one) to be postponed beyond the end
of the specified makespan, the examination of further shifting this operation is
stopped and the starting period of that operation is set to its original period.
A second stopping criterion is a reduction in peak energy demand. In this case
the incumbent solution is replaced by the newly generated solution, the value of
the peak period as well as the set P are updated and the procedure starts again
Minimizing Peak Energy Demand in Flexible Job Shops 493
in trying to postpone the beginning of operations that are processed during the
newly generated peak period. Eventually, the local search procedure terminates
when all operations from set P have been checked without finding any further
improvement.
During the execution of the shaving procedure values π(i, j) for an operation
Oj,h that are used in the second type of neighborhood are updated. At the begin-
ning of the VNS approach all values of π(i, j) are initialized with 1. Whenever
an increase of the peak energy demand is observed during a shift in the shav-
ings procedure then we identify the operations that are processed in the peak
period. For each of these operations it is compared how high the energy demand
was at the peak period before and after the shift. In this way, we can identify
the operations that contribute to the peak energy demand. For all contributing
operations we increase the value π(i, j) of that operation by 1. Thus, in the long
run we can identify operations that are somehow critical and therefore should
be considered during the shaking phase.
5 Computational Results
The use of the proposed approach as part of the above-mentioned case study at
the mechanical engineering company led to a reduction in peak energy demand
of 23.6%. However, this result says little about the performance of the proposed
approach in general. In order to enable a broader evaluation of the proposed app-
roach, modified test cases from the literature were also used (cf. [31]). Therefore,
the test cases were extended by adding randomly generated energy demands for
all operations on their eligible machines. The profiles were generated in such a
way that the difference between the machine with the lowest and the highest
total energy demand for each operation is at most 20%.
Table 2. Aggregated results for the test cases of [5, 7, 11, 20]
test case class # problems # VNS better # ties Min. Avg. Max.
[7] 21 18 0 0.719 0.814 0.902
[11] 7 4 3 0.801 0.841 0.892
[5] 9 8 1 0.719 0.814 0.902
edata set of [20] 51 39 1 0.744 0.863 0.972
rdata set of [20] 45 45 0 0.353 0.398 0.475
vdata set of [20] 40 40 0 0.214 0.242 0.289
The proposed approach of Sect. 4 was coded in Delphi. Each test case was solved
20 times. The number of iterations was set to 20,000 in order to limit the com-
puting effort.
Table 2 summarizes the aggregated results for the different classes of test
cases. The values in columns two through four indicate the number of considered
test cases. In addition, it is indicated how often the proposed approach achieved
a better solution than the benchmark approach and how often both approaches
led to the same results, respectively. In the remaining three columns the best,
average and worst results are displayed that were obtained by applying the
proposed solution approach. The given values represent the share in the objective
function value achieved. For example, if the benchmark approach achieves an
average objective function value of 1000 for all test problems of this class and
the value in the column min is 0.95 then the average of the best objective function
values obtained by the proposed approach was 950. Thus, values in columns 3
to 5 that are below 1 represent improvements obtained by the proposed solution
approach over the benchmark.
The results can be summarized as follows. The proposed algorithm clearly
outperformed the benchmark approach for most of the 173 test cases. With five
test cases, namely three test cases of Dauzère-Pérès and Paulli and one test cases
of Brandimarte and Hurink et al. respectively, both approaches achieved the
same solution. For 14 test cases a better solution than the benchmark approach
could not be found in any of the twenty test runs - these were three test cases of
Chambers and Barnes and eleven test cases from the edata set of Hurink et al.
In the remaining 159 test cases, however, almost consistently better results were
achieved compared to the comparison method, in the majority of cases with a
very clear difference based on the best or worst individual values as well as on
the average values for all 20 test runs.
Of the test cases of [20] the edata test cases were the most difficult ones to
solve. For the rdata and even more for the vdata test cases more machines are
available to be chosen for the operations. Thus, this kind of flexibility can help
to reduce peak energy demand considerably.
496 M. Eley
Finally, further experiments with the proposed approach were carried out to
investigate whether and if so to what extent a relaxation of the strict limitation
of the makespan has an impact on peak energy demand. Therefore, the optimal
value of makespan, i.e. the value of Γ in Eq. (13), was increased in 1% steps
from 1% to 15%. Test runs were carried out for 30 selected test cases from the
edata set of Hurink et al., i.e. abz5 and abz6, la01 to la20, mt06, mt10 and
mt20 and orb1 to orb10. Each test case was solved ten times for each step.
With a larger makespan peak energy demand can be reduced considerably. The
maximum decrease in peak energy demand is almost 20% if at the same time
makespan is increased by 10%. Further reductions in peak energy demand could
not be realized even with higher increases in makespan.
6 Conclusion
In this paper an approach to minimize peak energy demand in a flexible job shop
considering narrow specifications for makespan was presented. In particular, the
last results from Sect. 5.2 show that peak energy demand can be reduced if
the makespan is increased. Obviously, there is a tradeoff between these two
objectives. But, as the results of the test cases also show, there is even a strong
potential for minimizing peak energy demand when the makespan is kept at
its lowest value. The greater the number of alternative machines (with different
energy requirements) available for the processing of individual operations, the
greater this potential is.
Another finding from the computational tests is that a greater reduction in
peak energy demand is possible if both objectives are considered at the same
time. This becomes clear by comparing the results of the proposed algorithm
with the benchmark approach. In the latter approach, the optimal solution
for the objective of minimizing the makespan was first determined and then,
based on this solution, peak energy demand was minimized in the second step.
Although optimal solutions were generated for both subproblems, the approach
presented in Sect. 4 outperformed the benchmark considerably. This is because
the VNS approach can change the assignment of operations to machines, as well
as the order of operations on the machines, provided that the makespan does not
increase. And this additional flexibility is sufficient to reduce the peak energy
demand significantly.
Apart from very energy-intensive industries, with today’s energy prices, many
companies do not view minimizing energy costs as a top priority. The proposed
optimization approach could be of interest to these companies, especially if they
want to cut down peak energy for environmental reasons or if energy prices rise.
Another application would be in the context of a demand side management in
which the local search procedures used in the article can also be used to smooth
the demand.
Improvements would also be possible on the algorithmic side. Due to the very
high computing times that are due to the optimization of the energy requirement,
an obvious possibility would be to consider the parallelization of the calculation.
Minimizing Peak Energy Demand in Flexible Job Shops 497
Acknowledgement. The research was funded by the Bavarian State Ministry for
Economic Affairs, Regional Development and Energy as part of the R&D program
“Information and communication technology” of the Free State of Bavaria, project
title: KAnIS: Cooperative Autonomous Intralogistics Systems.
References
1. Amirghasemi, M.: An effective parallel evolutionary metaheuristic with its appli-
cation to three optimization problems. Appl. Intell. 53, 5887–5909 (2023). https://
doi.org/10.1007/s10489-022-03599-w
2. Amirghasemi, M., Zamani, R.: An effective asexual genetic algorithm for solving
the job shop scheduling problem. Comput. Ind. Eng. 83, 123–138 (2015). https://
doi.org/10.1016/j.cie.2015.02.011, https://www.sciencedirect.com/science/article/
pii/S0360835215000686
3. Artigues, C., Lopez, P., Haı̈t, A.: The energy scheduling problem: industrial case-
study and constraint propagation techniques. Int. J. Prod. Econ. 143(1), 13–23
(2013). https://doi.org/10.1016/j.ijpe.2010.09.030
4. Babu, C.A., Ashok, S.: Peak load management in electrolytic process industries.
IEEE Trans. Power Syst. 23, 399–405 (2008). https://doi.org/10.1109/TPWRS.
2008.920732
5. Brandimarte, P.: Routing and scheduling in a flexible job shop by tabu search.
Ann. Oper. Res. 41(3), 157–183 (1993). https://doi.org/10.1007/BF02023073
6. Bruzzone, A., Anghinolfi, D., Paolucci, M., Tonelli, F.: Energy-aware scheduling for
improving manufacturing process sustainability: a mathematical model for flexible
flow shops. CIRP Ann. 61(1), 459–462 (2012). https://doi.org/10.1016/j.cirp.2012.
03.084
7. Chambers, J., Barnes, J.: Flexible job shop scheduling by tabu search. Technical
report, The University of Texas, Austin, TX, Technical Report Series ORP96-09,
Graduate Program in Operations Research and Industrial Engineering (1996)
8. Chaudhry, I., Khan, A.: A research survey: review of flexible job shop scheduling
techniques. Int. Trans. Oper. Res. 23, 551–591 (2016). https://doi.org/10.1111/
itor.12199
9. Cordeau, J.F., Laporte, G.: A tabu search heuristic for the static multi-vehicle
dial-a-ride problem. Transp. Res. Part B: Methodol. 37, 579–594 (2003). https://
doi.org/10.1016/S0191-2615(02)00045-0
10. Dai, M., Tang, D., Giret, A., Salido, M.: Multi-objective optimization for energy-
efficient flexible job shop scheduling problem with transportation constraints.
Robot. Comput.-Integr. Manuf. 59, 143–157 (2019). https://doi.org/10.1016/j.
rcim.2019.04.006
11. Dauzère-Pérès, S., Paulli, J.: An integrated approach for modeling and solving the
general multiprocessor job-shop scheduling problem using tabu search. Ann. Oper.
Res. 70, 281–306 (1997). https://doi.org/10.1023/A:1018930406487
12. Fang, K., Uhan, N., Zhao, F., Sutherland, J.: A new approach to scheduling in
manufacturing for power consumption and carbon footprint reduction. J. Manuf.
Syst. 30(4), 234–240 (2011). https://doi.org/10.1016/j.jmsy.2011.08.004
13. Fang, K., Uhan, N., Zhao, F., Sutherland, J.: A new shop scheduling approach
in support of sustainable manufacturing. In: Hesselbach, J., Herrmann, C. (eds.)
Glocalized Solutions for Sustainability in Manufacturing, pp. 305–310. Springer,
Berlin (2011). https://doi.org/10.1007/978-3-642-19692-8 53
498 M. Eley
14. Fang, K., Uhan, N., Zhao, F., Sutherland, J.: Flow shop scheduling with peak
power consumption constraints. Ann. Oper. Res. 206(1), 115–145 (2013). https://
doi.org/10.1007/s10479-012-1294-z
15. Gong, G., Deng, Q., Gong, X., Liu, W., Ren, Q.: A new double flexible job-shop
scheduling problem integrating processing time, green production, and human fac-
tor indicators. J. Clean. Prod. 174, 560–576 (2018). https://doi.org/10.1016/j.
jclepro.2017.10.188
16. Grahm, C., Denz, F., Dirr, M., Tuma, A.: Energy efficient scheduling in manu-
facturing companies: a review and research framework. Eur. J. Oper. Res. 248,
744–757 (2016). https://doi.org/10.1016/j.ejor.2015.07.017
17. Hansen, P., Mladenovic, N., Moreno-Perez, J.: Variable neighbourhood search:
methods and applications. Ann. Oper. Res. 175(1), 367–407 (2010). https://doi.
org/10.1007/s10288-008-0089-1
18. He, Y., Li, Y., Wu, T., Sutherland, J.: An energy-responsive optimization method
for machine tool selection and operation sequence in flexible machining job shops.
J. Clean. Prod. 87, 245–254 (2015). https://doi.org/10.1016/j.jclepro.2014.10.006
19. He, Y., Liu, B., Zhang, X., Gao, H., Liu, X.: A modeling method of task-oriented
energy consumption for machining manufacturing system. J. Clean. Prod. 23, 167–
174 (2012). https://doi.org/10.1016/j.jclepro.2011.10.033
20. Hurink, J., Jurisch, B., Thole, M.: Tabu search for the job-shop scheduling problem
with multi-purpose machines. OR Spektrum 15(4), 205–215 (1994). https://doi.
org/10.1007/BF01719451
21. Jin, L., Zhang, C., Fei, X.: Realizing energy savings in integrated process planning
and scheduling. Processes 7, 120 (2019). https://doi.org/10.3390/pr7030120
22. Kemmoe, S., Lamy, D., Tchernev, N.: Job-shop like manufacturing system with
variable power threshold and operations with power requirements. Int. J. Prod.
Res. 55, 1–22 (2017). https://doi.org/10.1080/00207543.2017.1321801
23. Li, X., Xie, J., Peng, K., Li, H., Gao, L.: A review on flexible job shop scheduling.
IET Collaborative Intell. Manuf. 1(3), 63–77 (2019). https://doi.org/10.1049/iet-
cim.2018.0009
24. Liu, Q., Chekem, F., Zhan, M., Shao, X., Ying, B., Sutherland, J.: A hybrid fruit fly
algorithm for solving flexible job-shop scheduling to reduce manufacturing carbon
footprint. J. Clean. Prod. 168, 668–678 (2017). https://doi.org/10.1016/j.jclepro.
2017.09.037
25. Luo, H., Du, B., Huang, G., Chen, H., Li, X.: Hybrid flow shop scheduling con-
sidering machine electricity consumption cost. Int. J. Prod. Econ. 146, 423–439
(2013). https://doi.org/10.1016/j.ijpe.2013.01.028
26. Masmoudi, O., Delorme, X., Gianessi, P.: Job-shop scheduling problem with energy
consideration. Int. J. Prod. Econ. 216, 12–22 (2019). https://doi.org/10.1016/j.
ijpe.2019.03.021
27. Mignon, D., Hermia, J.: Peak utility load reduction in batch processes operated
periodically and under uncertainty. Comput. Chem. Eng. 20(3), 249–263 (1996).
https://doi.org/10.1016/0098-1354(95)00041-0
28. Mladenovic, N., Hansen, P.: A parallel iterated tabu search heuristic for vehicle
routing problems. Comput. Oper. Res. 24(11), 1097–1100 (1997). https://doi.org/
10.1016/S0305-0548(97)00031-2
29. Mokhtari, H., Hasani, A.: An energy-efficient multi-objective optimization for flex-
ible job-shop scheduling problem. Comput. Chem. Eng. 104, 339–352 (2017).
https://doi.org/10.1016/j.compchemeng.2017.05.004
Minimizing Peak Energy Demand in Flexible Job Shops 499
30. Moon, J.Y., Park, J.: Smart production scheduling with time-dependent and
machine-dependent electricity cost by considering distributed energy resources and
energy storage. Int. J. Prod. Res. 52, 3922–3939 (2014). https://doi.org/10.1080/
00207543.2013.860251
31. OR library (2020). http://people.brunel.ac.uk/mastjjb/jeb/info.html
32. Pach, C., Berger, T., Sallez, Y., Bonte, T., Adam, E., Trentesaux, D.: Reactive and
energy-aware scheduling of flexible manufacturing systems using potential fields.
Comput. Ind. 65, 434–448 (2014). https://doi.org/10.1016/j.compind.2013.11.008
33. Pezzella, F., Morganti, G., Ciaschetti, G.: A genetic algorithm for the flex-
ible job-shop scheduling problem. Comput. Oper. Res. 35(10), 3202–3212
(2008). https://doi.org/10.1016/j.cor.2007.02.014, https://www.sciencedirect.
com/science/article/pii/S0305054807000524
34. Piroozfard, H., Wong, K.Y., Wong, W.P.: Minimizing total carbon footprint and
total late work criterion in flexible job shop scheduling by using an improved multi-
objective genetic algorithm. Res. Conserv. Recycl. 128, 267–283 (2018). https://
doi.org/10.1016/j.resconrec.2016.12.001
35. Rager, M., Gahm, C., Denz, F.: Energy-oriented scheduling based on evolutionary
algorithms. Comput. Oper. Res. 54, 218–231 (2015). https://doi.org/10.1016/j.
cor.2014.05.002
36. Seid, E., Majozi, T.: Heat integration in multipurpose batch plants using a
robust scheduling framework. Energy 71, 302–320 (2014). https://doi.org/10.1016/
j.energy.2014.04.058
37. Shrouf, F., Ordieres-Meré, J., Garcı́a-Sánchez, A., Ortega-Mier, M.: Optimizing the
production scheduling of a single machine to minimize total energy consumption
costs. J. Clean. Prod. 67, 197–207 (2014). https://doi.org/10.1016/j.jclepro.2013.
12.024
38. Suwa, H., Samukawa, T.: A new framework of energy-efficient manufacturing sys-
tems based on energy load profiles. Procedia CIRP 41, 313–317 (2016). https://
doi.org/10.1016/j.procir.2015.10.011
39. Wang, H., Jiang, Z., Wang, Y., Zhang, H., Wang, Y.: A two-stage optimization
method for energy-saving flexible job-shop scheduling based on energy dynamic
characterization. J. Clean. Prod. 188, 575–588 (2018). https://doi.org/10.1016/j.
jclepro.2018.03.254
40. Wang, Y.C., Wang, M.J., Lin, S.C.: Selection of cutting conditions for power con-
strained parallel machine scheduling. Robot. Comput.-Integr. Manuf. 43, 105–110
(2017). https://doi.org/10.1016/j.rcim.2015.10.010
41. Weinert, N., Chiotellis, S., Seliger, G.: Methodology for planning and operating
energy-efficient production systems. CIRP Ann. 60(1), 41–44 (2011). https://doi.
org/10.1016/j.cirp.2011.03.015
42. Wu, X., Sun, Y.: A green scheduling algorithm for flexible job shop with energy-
saving measures. J. Clean. Prod. 172, 3249–3264 (2018). https://doi.org/10.1016/
j.jclepro.2017.10.342
43. Zhang, L., Tang, Q., Wu, Z., Wang, F.: Mathematical modeling and evolutionary
generation of rule sets for energy-efficient flexible job shops. Energy 138, 210–227
(2017). https://doi.org/10.1016/j.energy.2017.07.005
44. Zhang, Z., Wu, L., Peng, T., Jia, S.: An improved scheduling approach for mini-
mizing total energy consumption and makespan in a flexible job shop environment.
Sustainability 11(1) (2019). https://doi.org/10.3390/su11010179
45. Zheng, H.Y., Wang, L.: Reduction of carbon emissions and project makespan by a
pareto-based estimation of distribution algorithm. Int. J. Prod. Econ. 164, 421–432
(2014). https://doi.org/10.1016/j.ijpe.2014.12.010
Carbon-Aware Mine Planning
with a Novel Multi-objective Framework
1 Introduction
The five phases of mining present a number of logistical challenges (Fig. 1).
During the planning, implementation, and production phases, logistics manage-
ment must forecast, plan, and schedule tasks from strategic, tactical, and opera-
tional angles. During the planning phase, environmental concerns are addressed
through the environmental impact assessment (EIA) and reclamation plan.
Supported by Enterprise Singapore.
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 500–516, 2023.
https://doi.org/10.1007/978-3-031-43612-3_31
Carbon-Aware Mine Planning with a Novel Multi-objective Framework 501
These are submitted with the mining plan to the respective government author-
ities before the implementation phase [7]. As the mining phases mature, the
mining plans, EIA, and reclamation plans are updated and can become progres-
sively detailed. The reclamation plan is updated and reviewed periodically in
certain jurisdictions such as Western Australia. However, regulatory rigor and
intra-governmental coordination vary across jurisdictions and may be improved,
such as integrating the EIA and reclamation plan processes [12]. These can be
classified as the strategic perspective for mine planning.
Fig. 1. Summary of activities for each mine phase and their influences on costs.
Fig. 2. Block extraction sequence example from year to year following precedence and
resource constraints for MineLib’s Newman1 instance [3]
throughout the value chain of the PCPSP is increasingly necessary and ties in
with schemes such as the emission trading systems (ETS) and emission taxes. In
this paper, we boost previous research on the PCPSP by enhancing the generic
PCPSP formulation [6] with carbon cost. This allows planners to consider the
NPV of carbon cost alongside profit.
Instead of solely maximizing the NPV of profit in the generic PCPSP formu-
lation, we convert it into a multi-objective optimization (MOO) problem with
another objective that minimizes the NPV of carbon cost. Next, we apply the
NSGA-II [5] whereby components of the algorithm - initial solution generation,
crossover and mutation - are tailored to the PCPSP. As far as we know, earlier
works of evolutionary algorithms (EA) to the PCPSP did not propose similar
frameworks. In fact, details on the initial solution generation, crossover, and
mutation are quite lacking. Furthermore, earlier works for the PCPSP employed
EA such as genetic algorithm [1] and differential evolution [8] for a single objec-
tive formulation problem whereas we use NSGA-II for an MOO problem. We
then test variants of our approach with a real-world mine dataset from an oper-
ating mine and benchmark instances from Minelib [6]. It turns out that variants
to the solution generation can improve results. Furthermore, we demonstrate how
our approach can produce approximated Pareto fronts for planners to observe
carbon emissions targets when deciding on mining plans.
2 Related Work
The angles of strategic, tactical, and operational issues are commonly used for
research on the logistics of mining. Lately, qualitative and quantitative research
Carbon-Aware Mine Planning with a Novel Multi-objective Framework 503
estimate of a mine’s value [7]. The mine consists of several components, such
as the pit, dump, stockpiles, processing plants, and heavy machinery. In the
pit, mineral ore deposits are divided into blocks of the same size for modeling
purposes. Each block has a unique value and a set of precedence constraints
based on geology, which affects the overall extraction sequence over time and
how the ore is processed, as illustrated in Fig. 2. After extraction, the block is
transported to processing facilities where it undergoes various treatments, such
as crushing, grinding, and screening, to reduce it according to the requirements.
Then, the material is refined to improve its quality and obtain various desired
end products. Unfortunately, this value chain consumes significant raw materials
(e.g., energy, water, gases) and generates harmful by-products (e.g., water and
air pollution, chemical waste).
Our main aim is to augment the PCPSP formulation with carbon costing. To
measure carbon emissions, we use the metric by Xu et al. [20], which defines
the cost of carbon emissions Ci,e from energy consumption. This cost reflects the
amount of carbon dioxide that is absorbed during ore excavation, processing, and
refining. The formula includes two key quantities: the amount of ore extracted
from the pit and sent for further processing Qi,o , and the amount of waste
material extracted and treated Qi,w . These quantities are multiplied by the
energy consumed using coal to extract one unit tonne of material (either ore
or waste) from the pit em , and the energy consumed to process one unit tonne
of ore ep . These values are then multiplied by the carbon factor of coal fc , the
conversion coefficient of carbon dioxide from carbon fa , and the absorption cost
of carbon dioxide Cc .
is determined by geologists using its ore composition and grade. The summation
of the NPV of profit throughout the mine’s lifespan emphasizes the significance
of extracting blocks with higher value earlier.
(Objective function 1) Z1 = max p̃bdt ybdt (2)
b∈B d∈D t∈T
The second objective function calculates the discounted carbon cost c̃bdrt
from using operational resource r ∈ R to extract or process block b at desti-
nation d ∈ D during period t ∈ T . It is derived using (1+α)
cbdr
t . Practically, this
function can be crucial within the framework of carbon credit trading, a market-
based instrument aimed at reducing carbon dioxide emissions. Under this system,
economies that exceed their allocated emissions can purchase credits from those
that have reduced their emissions below their carbon emission permits.
(Objective function 2) Z2 = min c̃bdrt ybdt (3)
b∈B d∈D r∈R t∈T
Constraint (4) sets conditions for the order in which blocks can be extracted,
and it applies to all blocks and periods. It states that block b must be extracted
in the same or an earlier period than block b, as b is a predecessor of b. This
constraint is decided by mining engineers based on the materials surrounding
the ore, including sand, silt, and clay, as well as the ore’s type and composition.
The latter is determined by geologists.
xbτ ≤ xb τ ∀ b ∈ B, b ∈ Bb , t ∈ T (4)
τ ≤t τ ≤t
506 N. A. B. Azhar et al.
Constraint (6) limits block extraction to only once during the mine’s lifespan.
xbt ≤ 1 ∀ b ∈ B (6)
t∈T
Constraint (7) guarantees that for each time period t, the use of every opera-
tional resource r is within the limits of minimum Rrt and maximum R̄rt . These
resources are managed by mining engineers and technicians, and they include
diggers, haulage trucks, grinders, and processing plants.
Rrt ≤ qbdr ybdt ≤ R̄rt ∀ r ∈ R, t ∈ T (7)
b∈B d∈D
Constraint (8) represents side constraints with lower bound a and upper
bound ā. It can represent various mining situations, including grade constraints.
a ≤ Ay ≤ ā (8)
Finally, constraints (9) and (10) reflect the range of values.
Diverse techniques are available for MOO problems with their own advantages
and disadvantages [11]. Due to the NP-hard nature of the PCPSP, exact methods
such as the MIP and constraint programming [15], are found to be efficient for
smaller instances but become intractable as the instance size increases. Hence,
alternatives have to be explored. One of them is the extension of the genetic
algorithm framework. Genetic algorithms use a population of randomly gener-
ated solutions that are evaluated for improvement at each iteration, making it
possible to converge on the entire Pareto set in one run. Multi-objective evolu-
tionary algorithms (MOEAs) belong to the class of genetic algorithms used for
MOO problems. They utilize additional advanced methods to maintain a varied
population of Pareto optimal solutions throughout the iterations. The MOEAs
are differentiated by their fitness assignment, diversity mechanism, elitism, and
the use of external population [9].
Carbon-Aware Mine Planning with a Novel Multi-objective Framework 507
Fig. 4. Cross-sectional view of mine pit when excavating different target blocks.
4.2 Reproduction
The reproduction of offspring from the initial solution or parent solution consists
of the binary tournament operator, mutation, and crossover. Once offspring solu-
tions are produced, they are again ranked based on the two objective functions
with non-domination sorting, and crowding distance. This reproduction step is
run till the maximum number of generations (i.e., the termination threshold).
The binary tournament operator randomly selects two individuals (or solu-
tions) from a population and chooses the best (based on the rank function of
NSGA-II) for the next generation. This procedure is repeated until the desired
number of individuals for the next generation is obtained. The set of parent
solutions from this step then leads to the crossover operator.
Crossover facilitates the creation of new offspring solutions by combining
genetic material from two parent solutions. The process involves selecting a
crossover point in the parents’ genetic code and exchanging genetic informa-
tion beyond that point to generate two new offspring solutions. The crossover
rate parameter η determines whether this step occurs for the parent pair. A high
Carbon-Aware Mine Planning with a Novel Multi-objective Framework 509
crossover rate reflects a higher chance that crossover occurs. Due to the multi-
period structure of the PCPSP, we design an interdependent-period single-point
crossover that accounts for the entire mine lifespan. It combines two parent solu-
tions by exchanging portions of their periodic components. We use a crossover
cut
ζ ∈ [0, 1], drawn randomly, to determine the crossover index using ζ ∗ t∈T xbt .
This index can fall in any time period. Figure 5 illustrates a crossover between
two feasible solutions. The last step in this illustration is an immediate fix to
prevent block extraction across multiple periods.
Subsequently, mutation introduces a small random alteration to the genetic
code of an individual. In the mutation step, we choose blocks for extraction
(i.e., xb t = 1) with no precedence blocks. The current time period t is mutated
randomly to another time period t ∈ {T \ t }. The mutation step occurs for all
parent pairs, but we used the mutation rate γ to select the percentage of blocks
with no precedence |{b̂}| to mutate. The subset of blocks with no precedence is
selected uniformly using SAM P LE(RAN DOM (0, γ) ∗ |{b̂}|, {b̂}).
Solutions may become infeasible after crossover and mutation. The crossover
step may result in precedence violation whereas the mutation step may result
in resource violation. Hence, the repair operator is invoked. It uses the pre-
computed cone sets that are akin to the initial solution generation.
510 N. A. B. Azhar et al.
5 Experiments
We utilize a small-sized real-world inspired dataset from an operating copper
and gold mine, Wilma1. It was adapted for the generic formulation and mocked
up. We complemented it with benchmark instances from MineLib - Newman1
and Kd. Newman1 is small-sized whilst Kd is medium-sized (Kd is a copper
mine in North America). Table 1 informs the number of blocks |B|, precedence
|Bb |, time periods |T |, destinations |D| and operational resource constraints |R|.
eλpbd
softmax = D (11)
λpbd
d=1 e
We run the two variants three times, each with 100 solutions in a population
and generations ranging from eight to ten. From the six populations of solution
Carbon-Aware Mine Planning with a Novel Multi-objective Framework 511
sets, we derive the true Pareto front of non-dominated solutions. Then, the solu-
tion sets from each population are compared using the ratio of non-dominated
individuals (RNI) [16], distance, and diversity metrics [5]. Firstly, the RNI met-
ric yields the proportion of best-known solutions φ (i.e., solutions that form part
of the Pareto front) that exist in a population Ωj with N solutions:
|φj |
Ratio of non-dominated individuals (RNI) = (12)
N
Secondly, the distance metric assesses how closely a solution j in population
Ωj converges to the true Pareto front. Initially, the metric calculates the mini-
mum Euclidean distance between a solution from population Ωj and all solutions
k from the true front Ωk . This is then averaged across all N solutions. A value
close to zero is desired.
N
j=1 min djk
Distance metric = (13)
N
Finally, the diversity metric evaluates the even spread of solutions over the
true Pareto front using the Euclidean distance. It considers the distance between
successive solutions di , the average distance d¯ of all di , distances between extreme
solutions of the true Pareto-optimal front dk , and distances between extreme
solutions in a population set dj . A value close to one indicates better diversity.
N −1 ¯
dk + dj + i=1 |di − d|
Diversity metric = (14)
dk + dj + (N − 1)d¯
The NSGA-II model for the augmented multi-objective PCPSP was built
using Python. The algorithm was executed on a Linux operating system with
3.5 GHz 3rd generation Intel Xeon Scalable processor, 128 vCPUs and 128 Gb
memory. The initial solution was produced with the upper bound resource mul-
tiplier ρ of 1.2. In the reproduction, the mutation γ and crossover η rates were
set to 0.2 and 0.6, respectively. Lastly, the softmax multiplier λ was 4.
5.2 Results
The performance metrics of RNI, distance, and diversity for the argmax and
softmax variants are summarized in Table 2. Overall for Newman1 and Wilma1
instances, the argmax variants, compared to the softmax variants, generate sets
of solutions that are closer to the true Pareto front (distance metric), but with
less assortment (diversity metric) and lesser individual solutions that form part
of the true Pareto front (RNI). For Newman1, the argmax and softmax average
distance metrics are 0.290 and 0.304, respectively, while for Wilma1, their values
are 0.056 (argmax) and 0.074 (softmax). Next, the diversity metric averages
0.753 (argmax) and 0.830 (softmax) for Newman1, and 0.863 (argmax) and
1.017 (softmax) for Wilma1. Finally, the RNI averages 0.003 (argmax) and 0.007
(softmax) for Newman1, and 0.007 (argmax) and 0.037 (softmax) for Wilma1.
512 N. A. B. Azhar et al.
Fig. 6. Newman1 Pareto front across generations for two NSGA-II variants.
Fig. 7. Wilma1 Pareto front across generations for two NSGA-II variants.
A Pareto optimal front is also produced for each population variant. This
visual aid allows mine planners to appreciate the trade-off between the NPV of
profit and carbon cost for decision making. Figure 6, Fig. 7 and Fig. 8 display
the Pareto front of the population variants across generations for Newman1,
Wilma1, and Kd respectively. The computation time for each population variant
was less than an hour for Newman1. However, when the model is applied to larger
datasets, the computation time increases. Hence, the traversal of the search space
may be improved for this framework to be applied to much larger instances.
514 N. A. B. Azhar et al.
6 Conclusion
In this paper, we implement the NSGA-II model to balance the NPV of profit
with carbon emissions when managing the movement of ores from the mine
pit to the production facilities. Our framework for the open pit mine addresses
sustainability concerns by augmenting the generic PCPSP into a multi-objective
problem that can be catered to diverse environmental concerns. It is applied to
real-world instances of an operating mine and MineLib that have been extended
for carbon emission considerations. Our approach demonstrates the effective
formation of Pareto fronts with profit and carbon cost axes so that mine planners
can consider both aspects concurrently.
Previous work has used the generic PCPSP for carbon emissions trade-off
but through an additional constraint. To the best of our knowledge, this is the
first reformulation into a MOO problem. Future work can focus on improving the
computation time to extend this proof of concept to larger datasets. Additionally,
other MOEAs can be evaluated against the NSGA-II model and the objective
functions can be further expanded for other environmental concerns.
References
1. Alipour, A., khodaiari, A.A., Jafari, A., Tavakkoli-Moghaddam, R.: A genetic algo-
rithm approach for open-pit mine production scheduling. Int. J. Mining Geo Eng.
51(1), 47–52 (2017). https://doi.org/10.22059/ijmge.2017.62152
2. Attari, M.Y.N., Torkayesh, A.E.: Developing benders decomposition algorithm for
a green supply chain network of mine industry: case of Iranian mine industry. Oper.
Res. Perspect. 5, 371–382 (2018). https://doi.org/10.1016/j.orp.2018.11.002
3. Azhar, N.A.B., Gunawan, A., Cheng, S.F., Leonardi, E.: A carbon-aware planning
framework for production scheduling in mining. In: Computational Logistics: 13th
International Conference, ICCL 2022, Barcelona, Spain, 21–23 September 2022,
Proceedings, pp. 441–456. Springer, Heidelberg (2022). https://doi.org/10.1007/
978-3-031-16579-5 30
4. Canales-Bustos, L., Santibañez-González, E., Candia-Véjar, A.: A multi-objective
optimization model for the design of an effective decarbonized supply chain in
mining. Int. J. Prod. Econ. 193, 449–464 (2017). https://doi.org/10.1016/j.ijpe.
2017.08.012
5. Deb, K., Agrawal, S., Pratap, A., Meyarivan, T.: A fast elitist non-dominated sort-
ing genetic algorithm for multi-objective optimization: NSGA-II. In: Schoenauer,
M., et al. (eds.) PPSN 2000. LNCS, vol. 1917, pp. 849–858. Springer, Heidelberg
(2000). https://doi.org/10.1007/3-540-45356-3 83
6. Espinoza, D., Goycoolea, M., Moreno, E., Newman, A.: MineLib: a library of open
pit mining problems. Ann. Oper. Res. 206(1), 93–114 (2012). https://doi.org/10.
1007/s10479-012-1258-3
7. Hustrulid, W.A., Kuchta, M., Martin, R.K.: Open Pit Mine Planning and Design,
Two Volume Set & CD-ROM Pack. CRC Press, Boca Raton (2013)
Carbon-Aware Mine Planning with a Novel Multi-objective Framework 515
8. Khan, A., Niemann-Delius, C.: A differential evolution based approach for the
production scheduling of open pit mines with or without the condition of grade
uncertainty. Appl. Soft Comput. 66, 428–437 (2018). https://doi.org/10.1016/j.
asoc.2018.02.010
9. Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic
algorithms: a tutorial. Reliabil. Eng. Syst. Saf. 91(9), 992–1007 (2006). https://
doi.org/10.1016/j.ress.2005.11.018
10. Luong, N.H., Alderliesten, T., Bel, A., Niatsetski, Y., Bosman, P.A.: Application
and benchmarking of multi-objective evolutionary algorithms on high-dose-rate
brachytherapy planning for prostate cancer treatment. Swarm Evol. Comput. 40,
37–52 (2018). https://doi.org/10.1016/j.swevo.2017.12.003
11. Marler, R.T., Arora, J.S.: Survey of multi-objective optimization methods for engi-
neering. Struct. Multidisc. Optim. 26(6), 369–395 (2004). https://doi.org/10.1007/
s00158-003-0368-6
12. Morrison-Saunders, A., McHenry, M.P., Sequeira, A.R., Gorey, P., Mtegha, H.,
Doepel, D.: Integrating mine closure planning with environmental impact assess-
ment: challenges and opportunities drawn from african and australian practice.
Impact Assess, Project Apprais. 34(2), 117–128 (2016). https://doi.org/10.1080/
14615517.2016.1176407
13. Newman, A.M., Rubio, E., Caro, R., Weintraub, A., Eurek, K.: A review of opera-
tions research in mine planning. Interfaces 40(3), 222–245 (2010). https://doi.org/
10.1287/inte.1090.0492
14. Rimélé, M.A., Dimitrakopoulos, R., Gamache, M.: A stochastic optimization
method with in-pit waste and tailings disposal for open pit life-of-mine produc-
tion planning. Res. Policy 57, 112–121 (2018). https://doi.org/10.1016/j.resourpol.
2018.02.006
15. Soto, R., Crawford, B., Almonacid, B., Johnson, F., Olguı́n, E.: Solving open-pit
long-term production planning problems with constraint programming a perfor-
mance evaluation. In: 2014 9th International Conference on Software Engineering
and Applications (ICSOFT-EA), pp. 70–77. IEEE (2014). https://doi.org/10.5220/
0005093900700077
16. Tan, K.C., Lee, T.H., Khor, E.F.: Evolutionary algorithms for multi-objective opti-
mization: performance assessments and comparisons. Artif. Intell. Rev. 17(4), 251–
290 (2002). https://doi.org/10.1023/a:1015516501242
17. Touzout, F.A., Benyoucef, L.: Multi-objective multi-unit process plan generation in
a reconfigurable manufacturing environment: a comparative study of three hybrid
metaheuristics. Int. J. Prod. Res. 57(24), 7520–7535 (2019). https://doi.org/10.
1080/00207543.2019.1635277
18. Valderrama, C.V., Santibanez-González, E., Pimentel, B., Candia-Vejar, A.,
Canales-Bustos, L.: Designing an environmental supply chain network in the min-
ing industry to reduce carbon emissions. J. Clean. Prod. 254, 119688 (2020).
https://doi.org/10.1016/j.jclepro.2019.119688
19. Wang, X., Gu, X., Liu, Z., Wang, Q., Xu, X., Zheng, M.: Production process
optimization of metal mines considering economic benefit and resource efficiency
using an NSGA-II model. Processes 6(11), 228 (2018). https://doi.org/10.3390/
pr6110228
516 N. A. B. Azhar et al.
20. Xu, X.C., Gu, X.W., Qing, W., Zhao, Y.Q., Wang, Z.K.: Open pit limit opti-
mization considering economic profit, ecological costs and social benefits. Trans,
Nonferrous Metals Soc. China 31(12), 3847–3861 (2021). https://doi.org/10.1016/
s1003-6326(21)65769-2
21. Zeng, L., Liu, S.Q., Kozan, E., Corry, P., Masoud, M.: A comprehensive interdisci-
plinary review of mine supply chain management. Res. Policy 74, 102274 (2021).
https://doi.org/10.1016/j.resourpol.2021.102274
Multi-product Lot-Sizing Problem
with Remanufacturing, Lost Sales
and Sequence-Dependent Changeover
Cost
1 Introduction
The alarming rise of environmental issues and climate change has shifted how
things are done. Industrial companies, which share partial responsibility for the
current environmental crisis, face pressure from governments and consumers
alike to adopt more eco-friendly practices. In Chile, a public law known as The
Extended Liability of the Producer (ERP) has been promoting waste manage-
ment since 2016 to create a circular production model that retains the util-
ity and worth of products and their components. One way to achieve this is
through remanufacturing, which involves transforming waste materials into like-
new products, materials, or substances for the original or other purposes [15].
This research was partially supported by Dicyt projects 062217QV and 062217DG,
Vicerrectorı́a de Investigación, Desarrollo e Innovación, Universidad de Santiago de
Chile.
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 517–532, 2023.
https://doi.org/10.1007/978-3-031-43612-3_32
518 L. Gana et al.
2 Related Works
In the past decade, various efforts have been made to enhance the MILP formu-
lation of single-echelon lot-sizing problems that involve remanufacturing. This
has been achieved through the development of extended reformulations and valid
inequalities in the literature.
520 L. Gana et al.
On the other hand, the time-invariant input parameters are defined as follows:
– αij denotes the quantity of the end-of-life product j required for the finished
product i.
In order to build a mathematical model for the problem, we introduce the fol-
lowing decision variables for time period t ∈ T :
522 L. Gana et al.
The objective function (1a) minimizes the total production cost over the
whole planning horizon, i.e., the sum of the setup, production, inventory hold-
ing, lost sales, and changeover costs. Constraints (1b)–(1c) are the inventory
balance constraints. More specifically, constraints (1b) ensure that any remanu-
factured/finished product is either used to satisfy the demand or kept in stock.
if there are not enough remanufactured products to satisfy the demand, the
unsatisfied demand is lost. Constraints (1c) ensure that end-of-life product are
re-manufactured or kept in stock. Constraints (1d) link the production quan-
tity variables to the setup variables. Constraints (1e) restrict the quantity of
lost sales. Constraints (1f) link the setup variable with the changeover variable,
indicating if a transition occurs in production from one product to another.
Constraints (1g) ensure that a maximum of one product per time period is pro-
duced in the production line. Without loss of generality, we assume that the
initial inventory, I(r)j0 and I(s)i0 for each j ∈ J and i ∈ I, is set to 0. Finally,
Constraints (1h)–(1j) provide the domain of the decision variables.
Multi-product Lot-Sizing Problem with Remanufacturing 523
In order to set the value of each constant Mti using an upper bound on the
quantity that can be processed of each product i at each time period t, this
quantity is limited by two elements: the availability of the end-of-life products
already returned by customers and the future demand for finished products. We
thus define the value of Mti , for each period t and product i, as follows:
⎧ ⎧ ⎫ ⎫
⎨ ⎨ ⎬ ⎬
−1
Mti = min min αij rkj , dik
⎩ j∈J ⎩ ⎭ ⎭
1≤k≤t t≤k≤T
4 Valid Inequalities
We now seek to adapt the single-echelon single-item (k, U ) inequalities intro-
duced in [9] and further investigated in [12] for the case of stochastic demand
and [11] for the case of the remanufacturing system by considering the limited
quantity of returned products available at each time period.
We first recall that a single-echelon single-item (k, U ) is defined as follows:
Proposition 1. Let 1 ≤ k ≤ T be a time period of the planning horizon. Let
U ⊆ {k, ..., T } be a subset of periods and t∗ = max{τ : τ ∈ U } be the last
time period belonging to U . Then, the following inequalities are valid for Prob-
lem (1a)–(1j):
⎛ ⎞
I(s)ik−1 + ⎝ dil ⎠ Yti ≥ (dit − Lit ) ∀i ∈ I (2)
k≤t≤t∗ l∈U :t≤l t∈U
Proof. There is a simple proof to show the validity of the inequalities (2) for
Problem (1a)–(1j). Let us start by summing up constraints (1c) from period k
to period T for any product i ∈ I, obtaining the subsequent inequality:
I(s)iT = I(s)ik−1 + Xti + (−dit + Lit ) (3)
t≤k≤T t≤k≤T
greater of equal to the latter due to the non-negativity of variables Lit . On the
T T T
other hand, we can replace t=k ( l=t dil )Yti by t=k ( l∈U :l≥t dil )Yti on the
left-hand as we must now satisfy the demand in time periods t ∈ U .
524 L. Gana et al.
Note that inequalities (2) can be strengthened by taking into account that
the production of each product i is limited by the minimum availability among
all
Tend-of-life products j returned from period one until period t, i.e., we replace
i
d
l=t l by
⎧ ⎧ ⎫ ⎫
⎨ ⎨ ⎬ ⎬
−1
φit = min min j∈J αij rlj , dil . (6)
⎩ ⎩ ⎭ ⎭
1≤l≤t l∈U :t≤l
(7)
– S = ∅. In this case, τ ∈S Xτi = 0 whereas I(r)j−1 + τ ∈S φjτ Yτi ≥ I(r)j−1 ≥
0. Inequality (8) is trivially respected.
– S = {}. If Yj = 0. There is no production in period so that τ ∈S Xτi =
Xi = 0 whereas I(r)j−1 + τ ∈S φjτ Yτi ≥ I(r)i−1 ≥ 0. Inequality (8) is trivially
respected. If Yi = 1. In this case, the inventory balance constraint (1c) for
period gives:
By noting that, under the current assumptions, φj Yi = rj , we have:
Induction step: Assume that inequality (8) is valid for any S ⊂ {1, ..., t}. We show
that, under this assumption, inequality (8) is valid for any S ⊂ {1, ..., t + 1}. We
again consider two cases:
i
If Yt+1 = 1. Summing up the inventory balance constraints (1c) over periods
to t + 1 and using the non-negativity of variables I(r)j and Xτi gives:
t+1
t+1
I(r)j−1 + rτj − Xτi ≥ 0
τ = τ =
t+1
I(r)j−1 + rτj − Xτi ≥ 0.
τ = τ ∈S
t+1 j
By noting that under the current assumption, φjt+1 Yt+1
i
= τ = rτ and
adding the non-negative terms τ ∈S\{t+1} φjτ Yτi to the left-hand side of the
previous inequality, we obtain:
I(r)j−1 + φjτ Yτi − Xτi ≥ 0.
τ ∈S τ ∈S
This shows that if Inequality (8) is valid for any S ⊂ {1, ..., t}, it is also valid
for any S ⊂ {1, ..., t + 1}, which concludes the proof.
526 L. Gana et al.
We now briefly discuss the resolution of the separation problem for each proposed
valid inequality. We first recall that this problem consists in finding an inequality
˜ I(s),
violated by a given solution (X̃, Ỹ, I(r), ˜ L̃, W̃) of the linear relaxation of
Problem (1a)–(1j) or prove that no such inequality exists.
In the case of the simple (k, U ) inequalities in Proposition 1, an exact separa-
tion algorithm is known to find the most violated inequality that runs in O(T 2 ).
For each period k = 1, ..., T , the algorithm finds the set U of time periods by
inspection. See [9] for further details on the separation algorithm.
Then, for the case of valid inequalities in Proposition 2, the separation is not
trivial as is mentioned in [11], in particular, because the value of each coefficient
φjt simultaneously depends on U and . We thus consider a heuristic separation
algorithm in our computational experiments as is proposed in [11], which con-
sists of finding the set U as is done for inequalities in Proposition 1 and then
computing the coefficient φjt for each given period . The algorithm has a time
complexity of O(T 2 ).
Finally, we note that inequalities in Proposition 3 form an exponential class
of valid inequalities, i.e., their number grows exponentially fast with the num-
ber of time periods T . However, the corresponding separation problem is poly-
nomially solvable by inspection in O(T 2 ). Namely, to find the most violated
valid inequalities (8) by the current linear relaxation, we need to find, for each
period ∈ T , the subset U minimizing the difference between the left and the
right-hand
side of inequality
(8). This amounts to finding the subset S mini-
mizing t∈S φjt Ỹti − t∈S X̃pt . This can be done by inspection of each period
t ∈ {, . . . , T } and adding the time period t to the set S if φjt Ỹti − X̃ti ≤ 0.
We briefly summarize the proposed valid inequalities and their related fea-
tures in Table 1. Column Class indicates whether the number of valid inequali-
ties in the family grows linearly or exponentially fast concerning the number of
periods in the planning horizon. Column Complexity reports the difficulty of the
corresponding separation problem. Column Sep. alg. then indicates whether
the proposed separation problem is solved by an exact or a heuristic algorithm
in our numerical experiments. Column Time reports the asymptotic time com-
plexity of the implemented separation algorithm, and Column Total # reports
the total possible number of valid inequalities that can be generated. Note that
all these features significantly impact the numerical tractability and efficiency of
a class of valid inequalities.
5 Computational Experiments
In this section, we focus on assessing the performance of the proposed valid
inequalities when used within a branch-and-cut algorithm. We compare the per-
formance of this algorithm with one of the generic branch-and-cut algorithms
embedded in a mathematical programming solver.
5.2 Results
Each algorithm is based on the solver GUROBI 9.5.2, and it generates corre-
sponding inequalities through a cutting-plane generation algorithm at the root
node and at intermediate nodes of the branch-and-bound search tree using the
user Callbacks provided by the solver.
All related linear programs and mixed-integer linear programs were solved
using GUROBI 9.5.2 with the solver default settings. The algorithms were imple-
mented in Python 3.9. All tests were run on the computing infrastructure con-
sisting of an Intel(R) Core(TM) i5-10.300H, 2.50 GHz, and 8 GB RAM to solve
each instance. We imposed a time limit of 300 s to find an optimal solution for
each instance.
The corresponding results are displayed in Table 2 for Set 1 instances
and Table 3 for Set 2 instances. Column Method indicates the branch-and-cut
Multi-product Lot-Sizing Problem with Remanufacturing 529
algorithm used to solve the instances. Column R.LPgap reports the gap between
the value of the linear relaxation strengthened by the corresponding valid
inequalities and the best feasible solution found through the branch-and-bound
search. The GRB method reports the gap between the value of the initial linear
relaxation and the best feasible solution found through the branch-and-bound
search. Column R.MIPgap reports the gap between the lower bound at the root
node (after the generation of GUROBI generic cutting planes) and the best
feasible solution found through the branch-and-bound search. Column MIPgap
reports the gap between the best lower bound and the best feasible solution
found through the branch-and-bound search. The average CPU time in seconds
for the cutting-plane generation of each method is reported in column C.Time,
the CPU time in seconds spent at the root node in column R.Time and the aver-
age total CPU time in seconds in column T.Time. Finally, column # Cuts reports
the number of cuts generated by each method. Note that each line corresponds
to the average value of the corresponding 10 instances.
In general, we observe that the customized branch-and-cut algorithms based
on valid inequalities in Proposition 2 and 3 (P 2 + P 3) outperforms the method
GRB and the other methods, providing solutions of better quality with similar
computation times for all group of tested instances. Specifically, the method
P 2 + P 3 is able to almost close the optimality gap at the root node of the
530 L. Gana et al.
these models often serve as a starting point for future research. In our case, the
proposed deterministic approach opens up an interesting discussion about the
returned flows in the textile industry. In a favorable scenario where the returns
are significantly higher than the demand, the deterministic assumption does not
highly impact the production plan since there are enough returned products to
carry out the production plan. Otherwise, as the amount of end-of-life products
available restricts the ability to comply with the planning, it might incur new
costs and lost sales to adjust the production plan. Hence, extending the model
to a stochastic setting is also worth investigating, as demand and end-of-life
product quantities are usually difficult to predict accurately.
References
1. Aishah, S., Ali, S., Ali, F., Talib, M.L., Rozita, W., Din, W.: Valid inequalities for
the capacitated lot-sizing problem in a hybrid manufacturing and remanufacturing
system (2021)
2. Akartunalı, K., Arulselvan, A.: Economic lot-sizing problem with remanufacturing
option: complexity and algorithms. In: Pardalos, P.M., Conca, P., Giuffrida, G.,
Nicosia, G. (eds.) MOD 2016. LNCS, vol. 10122, pp. 132–143. Springer, Cham
(2016). https://doi.org/10.1007/978-3-319-51469-7 11
3. Ali, S.A.S., Doostmohammadi, M., Akartunalı, K., van der Meer, R.: A theoretical
and computational analysis of lot-sizing in remanufacturing with separate setups.
Int. J. Prod. Econ. 203, 276–285 (2018)
4. Brahimi, N., Absi, N., Dauzère-Pérès, S., Nordli, A.: Single-item dynamic lot-sizing
problems: an updated survey. Eur. J. Oper. Res. 263(3), 838–863 (2017)
5. Cunha, J.O., Konstantaras, I., Melo, R.A., Sifaleras, A.: On multi-item economic
lot-sizing with remanufacturing and uncapacitated production. Appl. Math. Model.
50, 772–780 (2017)
6. Goerler, A., Lalla-Ruiz, E., Voß, S.: Late acceptance hill-climbing matheuristic for
the general lot sizing and scheduling problem with rich constraints. Algorithms
13(6), 138 (2020)
7. Goerler, A., Voß, S.: Dynamic lot-sizing with rework of defective items and mini-
mum lot-size constraints. Int. J. Prod. Res. 54(8), 2284–2297 (2016)
8. Guan, Y., Ahmed, S., Nemhauser, G.L.: Cutting planes for multistage stochastic
integer programs. Oper. Res. 57(2), 287–298 (2009)
9. Loparic, M., Pochet, Y., Wolsey, L.A.: The uncapacitated lot-sizing problem with
sales and safety stocks. Math. Program. 89(3), 487–504 (2001)
10. Pochet, Y., Wolsey, L.A.: Polyhedra for lot-sizing with wagner-whitin costs. Math.
Program. 67(1), 297–323 (1994)
11. Quezada, F., Gicquel, C., Kedad-Sidhoum, S.: New valid inequalities for a multi-
echelon multi-item lot-sizing problem with returns and lost sales. In: Mes, M.,
Lalla-Ruiz, E., Voß, S. (eds.) ICCL 2021. LNCS, vol. 13004, pp. 192–207. Springer,
Cham (2021). https://doi.org/10.1007/978-3-030-87672-2 13
12. Quezada, F., Gicquel, C., Kedad-Sidhoum, S., Vu, D.Q.: A multi-stage stochastic
integer programming approach for a multi-echelon lot-sizing problem with returns
and lost sales. Comput. Oper. Res. 116, 104865 (2020)
13. Retel Helmrich, M.J., Jans, R., van den Heuvel, W., Wagelmans, A.P.: Economic
lot-sizing with remanufacturing: complexity and efficient formulations. IIE Trans.
46(1), 67–86 (2014)
532 L. Gana et al.
14. Roshani, A., Paolucci, M., Giglio, D., Demartini, M., Tonelli, F., Dulebenets, M.A.:
The capacitated lot-sizing and energy efficient single machine scheduling problem
with sequence dependent setup times and costs in a closed-loop supply chain net-
work. Ann. Oper. Res. 321(1–2), 469–505 (2023)
15. Suzanne, E., Absi, N., Borodin, V.: Towards circular economy in production plan-
ning: challenges and opportunities. Eur. J. Oper. Res. 287(1), 168–190 (2020)
16. Torkaman, S., Ghomi, S.F., Karimi, B.: Multi-stage multi-product multi-period
production planning with sequence-dependent setups in closed-loop supply chain.
Comput. Ind. Eng. 113, 602–613 (2017)
A Radius-Based Approach
for the Bi-Objective p-Center
and p-Dispersion Problem
1 Introduction
The p-center problem (pC) is an NP-Hard [13] facility location problem (FLP)
that requires location of p centers on a given network such that the maximum
c The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 533–549, 2023.
https://doi.org/10.1007/978-3-031-43612-3_33
534 N. De Walsche et al.
distance between demand nodes and centers is minimized. The problem arises in
emergency logistics for locating ambulances, fire stations, police patrol units and
shelters; military logistics as well as districting. Similar to pC, the p-dispersion
problem (pD) is also NP-hard [9] and requires locating p facilities on a given
network. Unlike pC, the goal is to maximize the minimum distance between
the selected facilities. The optimal solutions to these two problems could be
significantly different as it can be observed in an OR-Library instance in Fig. 1.
In the pC solution of Fig. 1(a), the facilities are more centralized in their clusters
and in the entire network. The pD facilities in Fig. 1(b), however, are more
dispersed and located at the extremes of the network. It is therefore worthwhile
investigating bi-objective approaches if one wishes to maintain a certain level of
balance between the two objectives.
Fig. 1. Optimal solutions for instance pmed1 from OR-Library (p = 5). Square nodes
indicate facilities and circles indicate demand points clustered with the color of their
closest open facility.
2 Literature Review
The seminal paper by [12] introduces the 1-center problem for the purpose of
locating a police station on a given network so that the maximum distance from
this station is minimum. Both absolute and vertex-restricted 1-center problems
are discussed. An absolute center can be placed anywhere on the network whereas
a vertex center has to be located at a vertex. Minieka [16] extends the problem to
the case where p > 1 and provides an exact algorithm solving set covering prob-
lems for a finite number of radius (distance) values. This method has inspired the
most efficient radius formulations and set covering-based algorithms for solving
various p-center variants which have been reviewed by [2]. As [16] proves that
there exists a finite set of points in the given graph which contains an optimal
set of p centers for the absolute p-center problem and [10] provides a method
for generating this finite set, it has been possible to develop mathematical for-
mulations which can tackle both absolute and vertex p-center variants without
further adaptations ([3,6,8] and [14] are a few examples). Although recently
most p-center studies consider only vertex-restricted problems, absolute variants
remain valid for many real-world applications.
A critical and careful review of the pC literature reveals two main categories
of compact mathematical formulations: (1) Mixed-Integer Linear Programming
(MILP) models based on the classical formulation with two-index allocation
variables first provided by [6]. (2) Radius-based Integer Linear Programming
(ILP) formulations which exploit the finite set of distinct distance values between
the centers and the demand nodes. Radius formulations typically utilize either
a telescopic sum of radius values [8] or enforce selection of exactly one radius
value [3]. When the radius set is large, solving these formulations may require
536 N. De Walsche et al.
It becomes clear from Table 1 that developments for both pC and pD are a lot
more advanced than those for BpCD, which is still a relatively recent problem.
While researchers have been able to solve instances with up to a million nodes
for pC and a hundred thousand nodes for pD, for BpCD only instances with up
to 600 nodes could be solved until now. Table 1 also demonstrates that success-
ful approaches for pC and pD make use of radius-based formulations. Finally,
most methods make use of lower- and upper-bound information to at least some
538 N. De Walsche et al.
extent. However, only a few approaches use valid inequalities or dominance rules,
techniques which can sometimes speed up the solution process.
Based on the information from the relevant literature, it is surprising that
Tutunchi and Fathi [20] did not experiment with radius-based formulations to
solve BpCD. It is our main hypothesis that addressing the BpCD by means of
radius-based formulations allows us to increase the size of the problems that
can be efficiently solved. Indeed, we will demonstrate that employing a suitable
mathematical formulation allows us to solve instances which are 3–4 times larger
than those solved using classic two-index MILP formulation.
3 Mathematical Model
Let G be a given undirected network with I = {1, . . . , n} being the set of demand
nodes and J = {1, . . . , m} being the set of potential centers. The BpCD requires
selecting a subset X ⊂ J such that |X| = p under two objective functions:
(1) minimize the maximum distance between the centers and demand points
and (2) maximize the minimum distance between the centers. Let us define dij
as the shortest path distance between points i and j. Let Dc = [dij ] denote
the n × m distance matrix between demand nodes and potential centers and
Dd = [dhj ] denote the m × m distance matrix between potential centers. The pC
can then be formalized as min max min dij whereas pD can be formalized
X⊂J:|X|=p i∈I j∈X
as max min dhj . It is easy to observe that the optimal values of pC and
X⊂J:|X|=p h,j∈X:h=j
pD will be equal to one of the entries in Dc and Dd , respectively. Consider two
ordered sets R = {r1 , . . . , rK } and Q = {q1 , . . . , qL } of distinct distance values
in Dc and Dd , respectively, and let RI and QI be the corresponding index sets.
Recalling the pC formulations of [3], it is then intuitive to associate a binary
variable for each radius value in each set as follows.
uk = 1 if rk , k ∈ RI is the value of pC objective for BpCD, uk = 0 otherwise.
vl = 1 if ql , l ∈ QI is the value of pD objective for BpCD, vl = 0 otherwise.
We may then formulate the BpCD as follows:
min f1 = rk uk (1)
k∈RI
max f2 = q l vl (2)
l∈QI
s.t. yj = p (3)
j∈J
yj ≥ uk , ∀ i ∈ I, k ∈ RI (4)
j∈J:dij ≤rk
uk = 1 (5)
k∈RI
yh + yj ≤ 1 + vl , ∀ h, j ∈ J (6)
l∈QI :dhj ≥ql
A Radius-Based Approach for BpCD 539
vl = 1 (7)
l∈QI
Functions (1) and (2) determine the objective values for pC and pD selected by
Constraints (5) and (7) from R and Q, respectively. Constraints (4) ensure that
each demand point is covered by a center within the selected pC radius. For any
open center pair, Constraints (6) guarantee that the distance between them is
greater than or equal to the selected pD value. Constraints (3) select exactly p
centers and Constraints (8)-(10) are binary restrictions.
However, this BpCD formulation cannot be directly solved by general-
purpose mathematical programming solvers. The existence of two competing
objective functions requires more sophisticated solution techniques and, more
importantly, it requires that we clarify what a good solution is. In this paper,
we will use a terminology based on the work of Erghot et al. [7]. Let us begin by
defining S as the set of all feasible solutions to BpCD and s ∈ S is a solution (an
assignment of facilities). Let us further denote objective value for pC by f1 (s)
and objective value for pD by f2 (s) for solution s. A solution s ∈ S dominates
another solution s ∈ S if one of the following two conditions is true:
these two methods lies on how we approached the solution of subproblems. Algo-
rithm 1 provides an overview of RIA.
In lines 1–2, the pC and pD associated with the BpCD instance are solved to
optimality. This is done by employing the algorithms described in Sects. 4.1 and
4.2, respectively. Their results provide 2 sets of bounds for the subsequent incre-
mental algorithm. Bound [cmin , cmax ] on pC objective, and bound [lmin , lmax ] on
pD objective. Note that cmin is the optimal pC objective and lmax the optimal
pD objective. Meanwhile, cmax is the center value obtained from the optimal pD
solution. Similarly, lmin is the dispersion value obtained from the optimal pC
solution. In other words: the optimal solutions of pC and pD form the extreme
points in our (weakly) non-dominated frontier.
After these extreme points are obtained, RIA continues to the main loop
spanning lines 4–7. In each iteration of this loop, another (weakly) efficient solu-
tion is generated in line 5. The MCD-Algorithm is used to solve a pC formulation
with an added constraint on the minimum dispersion required in the optimal
solution. This dispersion is defined by means of parameter l, which changes in
each iteration of the loop. A full description of the MCD-Algorithm can be found
in Sect. 4.3. The algorithm returns both the optimal pC objective given a min-
imum dispersion l, denoted by CenterV al as well as the dispersion DispV al
associated with this solution. Note that DispV al ≥ l. In line 6, the minimum
center bound cmin is updated to speed up subsequent MCD-Algorithm execu-
tions. The dispersion level l is updated in line 7 with the minimum value in Q
that is larger than DispV al. The main loop is executed as long as l ≤ lmax .
4.1 MC-Algorithm
In order to solve the pC problem, we use formulation MC which is defined as
min (1)
s.t. (3) − (5), (8), (10)
Optimal solutions for MC are obtained by means of the double bound algorithm
DBR2 [3], which decomposes pC into a series of restrictive feasibility subprob-
lems. It then simply searches for the smallest radius with an associated feasible
solution. To speedup solving pC even further, we utilize the quick approximation
algorithms from [3] to obtain lower- and upper-bounds for pC. This in turn can
potentially reduce the number of radii that must be considered by DBR2.
A Radius-Based Approach for BpCD 541
4.2 MD-Algorithm
For solving the pD problem, we utilize model MD which is defined as
max (2)
s.t. (3), (6), (7), (9), (10)
4.3 MCD-Algorithm
The MCD-Algorithm aims solving model MCD which is defined as follows
min (1)
s.t. (3) − (5), (8), (10)
yi + y j ≤ 1 ∀ i, j ∈ J : dij < l, i < j (14)
starting with an index k = 0 and then repeatedly trying other radius values from
the ordered list T until a feasible one is found. Note that k = 0 is associated
with the optimal radius from the previous RIA iteration (current cmin value).
Whenever the k-th radius fails to provide a feasible solution, we increase k by
k ← k + α. Value α denotes the step size. After each k update, we also update
the step size as α ← 2α (initially we set α = 4). Step 1 terminates as soon as a
feasible radius rk ∈ T is found. We will denote by k the last k before we found
a feasible solution (if k = 0 then k = 0).
Step 2 - Reduce Range of Bounds. Once we have determined a feasible
radius rk for the current dispersion level, we try to find a small range of radii
within which the optimal solution exists. This is done by means of a straight-
forward binary search over T . In the binary search, we use two indices: a = k
and b = k. We then proceed by trying the middle point m = (a + b)/2 and
updating either a or b accordingly. This is done so long as a − b > 4.
Step 3 - Find Optimal Value. Finally, we search for the optimal value, which
we know lies within the (index) range [a, b]. Note that ra ∈ T is infeasible from
Steps 1 and 2. Therefore, if we try ra+1 and it turns out to be feasible, we may
quickly end the search (ra+1 is optimal). Otherwise, we try rb−1 and if it is
infeasible, we can also stop the search (rb is optimal). If neither of the previous
situations applies, we check ra+2 . If feasible, then the optimal solution is ra+2 ,
otherwise it must be rb−1 (recall that the range [a, b] has at most five values
because of Step 2).
In order to speed up RIA, we employ matrix reduction rules which can reduce
the size of the models. This is done whenever we solve a feasibility subproblem
to verify whether a given radius is feasible, for example when employing DBR2,
or our special procedure from Sect. 4.3.
Consider an arbitrary iteration of the MC- or MCD-Algorithm where we are
solving a feasibility problem for radius value rk to verify whether it is a feasible
pC objective. Recall that I denotes the set of demand nodes and J the set of
potential facilities. We construct a matrix A0 as follows.
This implies that matrix A0 has a row corresponding to each demand point and
a column corresponding to each potential facility.
Consider an arbitrary iteration of the MD- or MCD-Algorithm, where the
radius value is equal to l for the pD objective. We construct a matrix B 0 as
follows.
B 0 = [b0jm ] where b0jm = 1 if djm ≥ l for j, m ∈ J (16)
This implies that matrix B 0 has a row and column corresponding to each poten-
tial facility. Assuming that Dd is symmetric, B 0 will also be symmetric.
A Radius-Based Approach for BpCD 543
With the matrices defined, we can employ the following reduction rules
sequentially and repetitively, where relevant and potentially beneficial. One of
the consequences of these rules is removing potential facility locations. Suppose
we know that node m can be excluded, we then update J as J \{m} which reduces
the number of y variables. Where applicable, this change can be reflected in the
matrices as follows. Column m from matrix Ah can be removed to obtain Ah+1
and both row and column m from matrix B h can be removed to obtain B h+1 .
Rule 1 - Center Fixing. Applicable to solve MC and MCD. Suppose for any
row i ∈ I and center matrix Ah the following condition holds:
ahij = 1 (17)
j∈J
This implies that demand node i can only be covered by a single facility j 1
within the given radius rk . Therefore, j 1 should be selected as facility location
in every feasible solution of the problem with radius rk . This can be ensured
by setting yj 1 = 1. In BpCD it also implies that any facility m ∈ J for which
bhj1 m = 0 holds should be excluded. Stated differently, because facility j 1 has to
be selected, any facility m within the dispersion threshold l can not be selected.
Rule 2 - Coverage Feasibility. Applicable to solve MC and MCD. Suppose
for any row i ∈ I and center matrix Ah the following condition holds:
ahij = 0 (18)
j∈J
This implies that demand node i cannot be covered by any facility within a given
radius value rk . Therefore, the problem is not feasible for that radius value (rk )
and will have to be increased. In BpCD facility coverage is not only dependant
on rk but also on the dispersion threshold l, consequently feasibility can be
recovered by increasing rk or lowering l.
Rule 3 - Dispersion Suitability. Applicable to solve MD and MCD. Suppose
for some j ∈ J the following condition holds:
bjm < p − 1 (19)
m∈M
This implies that selecting j as a facility will make the problem infeasible for the
given dispersion value (ql ) since it will not be possible to select p−1 other centers
that are distant enough from j. Therefore, one can safely exclude j from J.
Rule 4 - Dominance. Applicable to solve MC and MCD. Suppose for any
column pair j, m ∈ J the following condition holds:
For each i ∈ I, ahij ≥ ahim (20)
The condition implies that j can serve every demand point that m can and maybe
more. Therefore, one can safely exclude m from the set of potential facilities since
every solution that includes m can instead include j.
544 N. De Walsche et al.
For MD and MCD we can similarly state the following. Suppose for any
column pair j, m ∈ J with bjm = 0 the following conditions hold:
The condition implies that any potential facility respecting the dispersion thresh-
old when m is selected will remain a potential facility if j is selected instead.
Therefore, one can safely exclude m from J.
While Condition (20) can be applied to MC, and Condition (21) to MD, both
conditions must be simultaneously verified to exclude m when solving MCD.
5 Computational Study
In this section we evaluate the performance of RIA to solve BpCD using instances
from both the OR-Library [1] and the TSPLIB [18]. We implemented RIA in Java
(OpenJDK 19) using CPLEX 22.1 to solve the mathematical programming for-
mulations. All experiments were conducted in a computer with an AMD Ryzen
7 5800X processor at 3.8GHz with 8 cores, 16 GB of RAM and Windows 10
operating system. Furthermore, we have imposed a time limit of 2h per instance
following the settings of Tutunchi and Fathi [20].
Table 2 reports results over the 40 pmed instances from the OR-Library. Col-
umn Instance provides the name of the instance, while columns |N | and p con-
tain the number of nodes in the input graph and the number of facilities that
should be placed, respectively. Columns TC and TD report the execution time,
in seconds, required to solve the first pC and pD, which provide bounds for the
subsequent incremental algorithm. Finally, columns Iter. and TT otal report the
number of iterations of the incremental algorithm and the total execution time in
seconds to solve the BpCD defined by the given instance. Note that the number
of iterations corresponds to the number of weakly efficient solutions. In other
words: it corresponds to the size of the final pareto front.
The first observation we can make from Table 2 is that RIA was able to solve
every OR-Library instance within less than 1600 s seconds of execution time.
Although a precise comparison is not possible due to technological differences,
this brings a great contrast to the results of Tutunchi and Fathi [20] where
instances with |N | = 300 required 1600 s seconds on average to be solved. Fur-
thermore, solving the initial pC and pD problems never required more than 30 s.
Indeed, most of the time is spent in MCD-Algorithm executions where some dis-
persion limits l lead to difficult subproblems. For example, RIA solved the initial
pC and pD for instance pmed40 in 23 s, while the 19 iterations of the incremental
algorithm required, in total, 1500 s seconds.
Let us now turn our attention to larger instances from TSPLIB to verify
the scalability of RIA. Table 3 reports the results for a set of selected TSPLIB
instances which are instances frequently utilized in the pC and pD literature.
The Euclidean distance values are rounded up to the nearest integer. All of
the selected instances contain |N | > 1000 locations. We solved each instance for
p ∈ {5, 10, 20, 40, 100, 200}. The selection of p values was based on the results
A Radius-Based Approach for BpCD 545
The increased size of the TSPLIB instances poses a challenge to our method.
It is clear from Table 3 that solving the BpCD takes much longer for these
instances, even if the times required to solve the initial pC and pD are still rela-
tively short (most times are under 60 s). Total execution times grow significantly
and in some cases they are close to, or exceed, 1h. These TSPLIB instances also
have larger sets of weakly efficient solutions (Iter.) compared to OR-Library
instances with differences as high as five times. However, this does not seem to
be the cause of increased computation times since larger Iter. values are not
necessarily correlated to longer total execution times.
Another interesting observation from Table 3 is that while solving instances
for p ∈ {5, 10, 20, 40} is difficult, the time to solve the same instance with
p ∈ {100, 200} is often much shorter. This corroborates what Çalık and Tansel [3]
observed for pC, but here we notice the same behavior in the pD and BpCD.
However, while pC and pD can be quickly solved for p ∈ {5, 10}, the same is not
observed when solving BpCD. Indeed, for instance u1817, RIA could not finish
execution for p ∈ {5, 10} while it finished for p = 20. The reason being that some
executions of the MCD-Algorithm took longer for these values of p and certain
dispersion levels, leading RIA to exceed the time limit of 2h.
The positive contribution of the strong lower- and upper-bounds to improve
the efficiency of the radius-based algorithms were confirmed in our computa-
tional study. We also observed a significant speed gain by utilizing the matrix
reduction rules. More specifically, for MCD-Algorithm itself, these rules provided
an average improvement of 22 s and a maximum improvement of 186 s when solv-
ing pmed instances. Employing a list of bitsets for the matrices instead of lists of
Boolean variables played a very important role in this significant improvement
with up to 100 times speed improvement.
6 Conclusion
This study reinforced the efficiency of radius-based formulations and exact
decomposition algorithms based on their restrictive subproblems for solving a
bi-objective location problem where the two objectives have minimax and max-
imin structures (BpCD). The efficient approximation algorithm available for the
p-center (pC) and p-dispersion (pD) problems proved effective in largely reducing
the search spaces for the decomposition algorithms for each individual problem.
Another important improvement component which can be utilized in methods
for solving similar pC and pD variants consists of the matrix reduction rules
which were effectively employed in multiple steps of our radius-based incremen-
tal algorithm (RIA).
Despite our improvements, there are still limitations concerning the size of
BpCD instances which can be effectively solved within a short computation time.
The state-of-the-art methods for both pC and pD can tackle instances with
tens of thousands of nodes. Meanwhile, RIA is the first approach to successfully
breach the one thousand nodes threshold for BpCD, which indicates that there is
a lot of room for additional research in trying to further reduce the gap between
the single- and the bi-objective problems.
548 N. De Walsche et al.
Possible options for future research include exploring a change in the main
formulation within the incremental algorithm. While both our RIA and Tutunchi
and Fathi [20] employed the pC formulation with an additional dispersion con-
straint, how would the performance be impacted if we employed the pD for-
mulation with additional center constraints? Are there instances for which one
formulation outperforms the other? Another open question refers to a capaci-
tated version of BpCD where facilities have an associated maximum capacity.
Although the radius-based formulations are still valid, they require additional
allocation variables which increases the size of the models. Hence, a natural
question appears: can medium- and large-scale instances still be solved exactly
when facilities have capacities?
References
1. Beasley, J.E.: OR-Library: distributing test problems by electronic mail. J. Oper.
Res. Soc. 41(11), 1069–1072 (1990). https://doi.org/10.2307/2582903
2. Çalık, H., Labbé, M., Yaman, H.: p-center problems. Location Sci. 51–65 (2019).
https://doi.org/10.1007/978-3-319-13111-5_4
3. Çalık, H., Tansel, B.C.: Double bound method for solving the p-center location
problem. Comput. Oper. Res. 40(12), 2991–2999 (2013). https://doi.org/10.1016/
j.cor.2013.07.011
4. Contardo, C.: Decremental clustering for the solution of p-dispersion problems to
proven optimality. INFORMS J. Optim. 2(2), 134–144 (2020). https://doi.org/10.
1287/ijoo.2019.0027
5. Contardo, C., Iori, M., Kramer, R.: A scalable exact algorithm for the vertex
p-center problem. Comput. Oper. Res. 103, 211–220 (2019). https://doi.org/10.
1016/j.cor.2018.11.006
6. Daskin, M.: Network and Discrete Location: Models, Algorithms and Applications,
2nd edn. Taylor & Francis, Milton Park (2013)
7. Ehrgott, M., Gandibleux, X., Przybylski, A.: Exact methods for multi-objective
combinatorial optimisation. In: Greco, S., Ehrgott, M., Figueira, J.R. (eds.) Mul-
tiple Criteria Decision Analysis. ISORMS, vol. 233, pp. 817–850. Springer, New
York (2016). https://doi.org/10.1007/978-1-4939-3094-4_19
8. Elloumi, S., Labbé, M., Pochet, Y.: A new formulation and resolution method for
the p-center problem. INFORMS J. Comput. 16(1), 84–94 (2004). https://doi.org/
10.1287/ijoc.1030.0028
9. Erkut, E.: The discrete p-dispersion problem. Eur. J. Oper. Res. 46(1), 48–60
(1990). https://doi.org/10.1016/0377-2217(90)90297-O
10. Garfinkel, R.S., Neebe, A.W., Rao, M.R.: The m-center problem: minimax facility
location. Manage. Sci. 23(10), 1133–1142 (1977). https://doi.org/10.1287/mnsc.
23.10.1133
11. Gonzalez, T.F.: Clustering to minimize the maximum intercluster distance. Theor.
Comput. Sci. 38, 293–306 (1985). https://doi.org/10.1016/0304-3975(85)90224-5
A Radius-Based Approach for BpCD 549
12. Hakimi, S.L.: Optimum locations of switching centers and the absolute centers and
medians of a graph. Oper. Res. 12(3), 450–459 (1964). https://doi.org/10.1287/
opre.12.3.450
13. D Kariv, O., Hakimi, S.L.: An algorithmic approach to network location problems.
Part I: the p-centers. SIAM J. Appl. Math. 37(3), 513–538 (1979). https://doi.
org/10.1137/0137040
14. Kramer, R., Iori, M., Vidal, T.: Mathematical models and search algorithms for
the capacitated p-center problem. INFORMS J. Comput. 32(2), 444–460 (2020).
https://doi.org/10.1287/ijoc.2019.0889
15. Kuby, M.J.: Programming models for facility dispersion: the p-dispersion and max-
isum dispersion problems. Geogr. Anal. 19(4), 315–329 (1987). https://doi.org/10.
1111/j.1538-4632.1987.tb00133.x
16. Minieka, E.: The m-center problem. SIAM Rev. 12(1), 138–139 (1970). https://
doi.org/10.1137/1012016
17. Ravi, S.S., Rosenkrantz, D.J., Tayi, G.K.: Heuristic and special case algorithms for
dispersion problems. Oper. Res. 42(2), 299–310 (1994). https://doi.org/10.1287/
opre.42.2.299
18. Reinelt, G.: TSPLIB-A traveling salesman problem library. ORSA J. Comput.
3(4), 376–384 (1991). https://doi.org/10.1287/ijoc.3.4.376
19. Sayah, D., Irnich, S.: A new compact formulation for the discrete p-dispersion
problem. Eur. J. Oper. Res. 256(1), 62–67 (2017). https://doi.org/10.1016/j.ejor.
2016.06.036
20. Tutunchi, G.K., Fathi, Y.: Effective methods for solving the bi-criteria p-center and
p-dispersion problem. Comput. Oper. Res. 101, 43–54 (2019). https://doi.org/10.
1016/j.cor.2018.08.009
Author Index
A H
Andersson, Henrik 292 Haug, Helle V. 170
Azhar, Nurul Asyikeen Binte 500 Heilig, Leonard 202
Høyland, Sondre 292
B
Bak, Bendegúz Dezső 450 J
Beirigo, Breno A. 381 Jensen, Rune Møller 105
Buscher, Udo 350 Joe, Waldy 18
Johnsen, Lennart C. 416
Jovanovic, Raka 219
C
Çalık, Hatice 533 K
Cammin, Philip 139 Karami, Farzaneh 36
Cattrysse, Dirk 248 Kascak, Nicholas 56
Cheng, Shih-Fen 500 Kerkenhoff, Tim 36
Kesen, Saadettin Erhan 437
D Kortekaas, Joris J. A. 381
Daduna, Joachim R. 396 Krogstad, Jesper Anker 292
Dávila-Gálvez, Sebastián 517 Kucukoglu, Ilker 248
De Walsche, Niels 533
Dieter, Peter 235 L
Lalla-Ruiz, Eduardo 276
E Landa-Silva, Dario 467
Ehsanfar, Ebrahim 36 Lau, Hoong Chuin 18
El Achhab, Abdelhak 337 Leonardi, Erwin 500
El Amrani, Mohammed 337 Li, Mingyu 170
Eley, Michael 482 Linß, Florian 307
Er, Yuyang 18 Liu, Yijia 85
M
F Maaji, Salim Sulaiman 467
Fagerholt, Kjetil 170, 185 Madani, Batool 322
Flatberg, Truls 292 Main, Alastair Ronald 122
Földes, Dávid 450 Maktabifard, Ali 450
Meisel, Frank 185, 416
G Mes, Martijn 276
Gana, Lucas 517
Grbic, Djordje 105 N
Gullhav, Anders N. 292 Ndiaye, Malick 322
Gunawan, Aldy 260, 500 Nestler, Alexander 56
© The Editor(s) (if applicable) and The Author(s), under exclusive license
to Springer Nature Switzerland AG 2023
J. R. Daduna et al. (Eds.): ICCL 2023, LNCS 14239, pp. 551–552, 2023.
https://doi.org/10.1007/978-3-031-43612-3
552 Author Index
R Y
Rodrigues, Filipe 122 Yağmur, Ece 437
Yu, Jingjing 139
S
Sarhani, Malek 337 Z
Sartori, Carlo S. 533 Zhang, Zhiqin 18