Applsci 12 04570 v2
Applsci 12 04570 v2
Applsci 12 04570 v2
sciences
Article
Measures and Methods for the Evaluation of ATO Algorithms
Patrick Bochmann and Birgit Jaekel *
Chair of Traffic Process Automation, Technische Universität Dresden, 01062 Dresden, Germany;
[email protected]
* Correspondence: [email protected]; Tel.: +49-351-463-36786
Abstract: There is increasing interest in automating train operations of mainline services, e.g., to
increase network capacity. Automatic train operation (ATO) is already achieved by several pilot
projects, but is still not implemented on a large scale. Functional, interoperability and performance
tests are necessary before ATO can be introduced generally. Virtual preliminary analysis will con-
tribute to the validation process to ensure a safe and successful implementation. This paper aims to
present an approach that applies to the performance testing of ATO systems. Therefore, methods and
test standards for technologies enabling automatic operation in other transport sectors are reviewed.
The main findings have been adapted, transformed and combined to be used as a general strategy
for virtual performance testing in the railway sector. Specifically, universal performance indicators
commonly used in the railway sector, namely punctuality, accuracy, energy consumption, safety
and comfort, are presented. They are refined by adding sub-indicators specific to the performance
evaluation of ATO algorithms. A layer model for scenario description is adapted from the automotive
sector, as well as the definition of different scenario types. Lastly, factors that can influence the
performance of an ATO algorithm are identified. For demonstration purposes, a simple case study is
conducted. Thereby we exemplarily show-cased the approach for ATO performance testing using a
microscopic train simulator in combination with an ATO algorithm.
Citation: Bochmann, P.; Jaekel, B. Keywords: ATO; simulation; performance evaluation; scenario-based testing
Measures and Methods for the
Evaluation of ATO Algorithms. Appl.
Sci. 2022, 12, 4570. https://doi.org/
10.3390/app12094570 1. Introduction
Academic Editors: Valerio De Automatic Train Operation (ATO) will enable more efficient train traffic by increasing
Martinis and Raimond track capacity, reducing travel time, and lowering energy consumption [1]. All these
Matthias Wüst benefits will become crucial due to a steady increase in global passenger mobility and freight
Received: 11 March 2022
transportation demand and the resulting necessity of infrastructural development [2].
Accepted: 26 April 2022
The development of theoretical and technical principles to automate railway oper-
Published: 30 April 2022
ations is already well advanced. Similar trends can be found in other traffic and trans-
portation sectors that aim to take advantage of vehicle and operational automation to
Publisher’s Note: MDPI stays neutral
realize more efficient and safer transportation. Certain systems automating the operation
with regard to jurisdictional claims in
of trains are already qualified for series production and are used in some special fields of
published maps and institutional affil-
the sector like metro systems. However general automation of mainline operation is still
iations.
not achieved [3].
The impact of the introduction of such a system on the railway operations is going
to be huge. To ensure efficient implementation of ATO, careful testing and analysis of
Copyright: © 2022 by the authors.
the technical components and software used is required and achieved by demonstrating
Licensee MDPI, Basel, Switzerland. functionality and performance. Virtual validation and testing are vital prior to verification
This article is an open access article on tracks and under real-world conditions.
distributed under the terms and Proving the functionality of a system is the responsibility of the manufacturer. Cus-
conditions of the Creative Commons tomers (infrastructure managers and train operators) must decide which system complies
Attribution (CC BY) license (https:// with their requirements and can be purchased. To make this decision, testing the perfor-
creativecommons.org/licenses/by/ mance of available systems is an essential step. Virtual testing can solve this issue efficiently
4.0/). by utilizing a simulation environment.
The general scope of the approach is envisaged to allow a flexible performance analysis
of ATO algorithms. The tests are specifically focused on determining whether an algorithm
fulfils operational requirements. Runtime analysis and software specific questions are not
addressed by the proposed approach since these requirements should be met in general.
In Europe, ATO in combination with the European train control system (ETCS) is
the preferred way. This is referred to as ATO over ETCS. An ATO system shall enable the
train to regulate traction and braking automatically. The ASTRail Consortium, part of the
European Shift2Rail project, was aimed at the identification of general functionalities of an
ATO system. Within the project scope, [4] describes the tasks of an ATO system as follows:
• retrieving necessary data for automatic operation,
• calculation of optimal speed profile,
• journey planning while including restrictions as well as provided trackside data and
defining the resulting trajectory.
Furthermore, based on the systems Grade of Automation (GoA) the following tasks
need to be carried out by the ATO software components:
The use of ATO in different settings leads to the need to incorporate multiple train
characteristics while simultaneously taking different track layouts into account. Challeng-
ing track sections can include gradient changes, varying speed limits, or complex routing.
In combination with changing technical trackside equipment even within a country and
differences in environmental conditions, all these characteristics contribute to a complex
set of external factors [1,5]
The performance of ATO is connected to the combination of algorithms for the two
main tasks of ATO, namely calculating the optimal speed profile and controlling the move-
ment of the train. Different algorithms of each category exist with individual advantages
regarding computational time or accuracy of calculations, or meeting other criteria like
driving comfort and energy consumption in the case of vehicle control, respectively. One
major condition must be met by all algorithms taking into account the real-world applica-
tion. The computational time needs to be sufficiently short. In [6], one to two seconds is
considered acceptable.
It is to be expected that the performance of ATO systems varies due to the use of
different algorithms. Especially under demanding restrictions, differences can be noticeable.
Therefore, the performance evaluation could provide useful information, e.g., for the
question of which system satisfies the requirements of the customer.
By focusing on the core software elements of ATO, this paper develops a structured
approach to enable the virtual performance evaluation of ATO algorithms. The evaluation is
focusing on algorithms for optimizing the speed profile. Commonly used key performance
indicators for railway operations are extended by introducing sub-indicators that serve
as main performance criteria. A structured scenario definition framework is described to
test algorithms with these KPIs. The framework consists of a layer-model for the railway
sector and types of test scenarios adapted from approaches developed in the context of
the automotive sector. Additionally, a list of factors possibly influencing the performance
of ATO algorithms is presented. A simple case study is conducted to demonstrate the
use of the approach. It is not intended to evaluate one specific algorithm in-depth within
this paper.
The remainder of the paper, following this introductory section, is separated into
four parts. After a comprehensive literature review of methods and standards for virtual
testing of systems enabling automatic operation in other traffic sectors conclusions for the
railway sector are drawn. On this basis, an approach is designed which provides concepts
Appl. Sci. 2022, 12, 4570 3 of 24
for measuring the performance of ATO algorithms, a structured design of test scenario
specifications, and the extraction of relevant and challenging test scenarios. Finally, this
approach is used in a specific performance-relevant test case, using a simulator provided
by ProRail.
2. Literature Review
2.1. Research Methodology
Automation is to be found in all traffic modalities. Hereby software plays an important
role in planning and controlling vehicle movements. Methods and approaches for develop-
ing and testing automation systems from other modes of traffic may be transferable and
adaptable to the railway section. Therefore, a literature review was conducted to identify
standards and strategies used in other traffic domains Figure 1.
Figure 1. Visualization of the literature review process including examples for search terms and
identified related terms.
Firstly, splitting the search into different traffic modalities allowed the identification of
standards and norms that outline and give instructions on testing criteria and strategies
for each sector. They form the basis for all subsequently used approaches. Finding these
specific test strategies and methodologies was the main goal of the literature review process.
The general terms test strategies and methodologies can hereby include in which form tests
are performed, which criteria apply, how scenarios for tests were designed or test data was
either extracted or generated.
Already in 2019, ATO over ETCS was implemented in the core of London’s rail
network. The Thameslink program was aiming to enable a high-capacity passenger railway
route by introducing ETCS Level 2 and ATO over ETCS functionalities. To introduce the
system under real operational conditions several stages of testing were used. At first,
a simulator was developed to showcase the technology and increase efficiency in the early
development phase of the system. Furthermore, a system integration laboratory enabling a
simulation of the Thameslink route was arranged. The next step was to conduct field tests
at the ETCS National Integration Facility and finally operational tests on the Thameslink
route itself [9].
In [10], the authors are proposing an operational design domain (ODD) for a Chinese
highspeed main line ATO system. The paper focuses on model validation and is including
a case study of the linkage scenario between train doors and platform doors to extract
system requirements. In this context, the ODD is used for mapping components relevant to
the case study. The scenario model construction is based on an interaction process. Yet, it is
not demonstrated how the ODD contributes to the extraction of the interaction process.
Another example of improving interoperability and additionally maintainability is the
initiative of defining an open-source version of the ETCS called openETCS. Between 2012
and 2015 a comprehensive toolchain for requirements identification, system modelling,
code development, and verification as well as testing and further related development steps
was established ([11]). Performance testing was not the main goal of this initiative rather
than providing a reference system of the ETCS onboard unit (European Vital Computer)
which allows the validation of commercially developed implementations.
Throughout all of these projects, testing, both virtual and on the track, is necessary
and is performed to prove the function of the system. On the other hand, comprehensive
performance tests and the evaluation of different algorithms to determine variances in
performance could not be identified during the literature research. This confirms the goal
of this paper to provide an approach to evaluate and compare the performance of ATO
algorithms in a virtual environment.
While some standards for virtual testing in the railway sector have been identified, it
could not be verified whether the described principles are used for testing ATO systems in
current projects. Generally, the standardization of test methods and the transferability of the
used data formats can contribute to an improvement of interoperability and comparability
of technical systems in the railway sector.
proposed to provide a testing environment where seamless transitions are possible. Using
live data, virtual and constructed entities enable multiple inputs that can be handled in the
same way by the system under test without the need to change the simulation environment.
validation is used to reduce the amount of testing. However, the concept of traceability
of design requirements can add value to the testing of ATO algorithms, while not being
able to relate certain code areas directly to a given requirement, it would instead be useful
to enable traceability of performance changes to the initiating change in the test situation.
As a promising approach, the scenario-based testing strategy was identified
As a promising approach, the scenario-based testing strategy was identified. This
strategy will be the foundation of the approach developed within this paper. The strategy
includes the already mentioned 6-layer model and the different scenario types. Both will be
transferred to the railway sector to make use of the benefits of a structured way to describe
and generate test scenarios. Together with performance criteria for railway operation found
in the literature the basis of a strategy for testing the performance of ATO algorithms
is formed.
Scene: A scene captures the current state of the environment and all dynamic elements as
well as relationships between them for one specific point in time. This includes static
objects (rail network, signs, stations, etc.) and dynamic elements (trains, passengers,
environmental conditions, etc.). Interactions and relations between all elements are
static within a scene.
Scenario: A scenario represents the temporal development of a sequence of scenes. The
timespan of a scenario can vary. How the scenes and their elements change over time
is influenced dynamically by external constraints.
Appl. Sci. 2022, 12, 4570 8 of 24
Table 1. List of KPIs for the evaluation of the performance of ATO algorithms.
The calculation of the KPIs is performed for each track section, defined by a departure
and arrival station (or point). This shall provide an additional grade of detail for the subse-
quent analysis of the measures. A more condensed evaluation is possible by determining
each KPI for the complete journey of the train under test.
The KPI ”Punctuality” seeks a detailed evaluation at multiple locations during a train
journey. The comprehensive analysis of punctuality is of particular importance due to its
significant operational impact. Besides the obvious evaluation points departure and arrival,
passing points are part of the scenario definition. These are characterized by the fact that
the train does not stop at such locations. For each point, a passing time is defined which
allows an evaluation of the punctuality at this specific point of the journey. Emerging
delays can be identified and possibly traced back to their origin. Moreover, it is possible to
assess whether and how well the ATO algorithm can reduce given delays.
Passing points can also be critical timing points (CTP). CTPs are vital points within
a timetable equipped with a time window that is characterized by the earliest and latest
passing time. These time windows define a frame in which a train must pass the CTP in
order to not negatively influence the timetable. Both delays and early arrivals are to be
avoided. As a consequence, they can have a great influence on punctuality as well as on
capacity as explained later.
The accuracy or precision of an ATO system is mainly defined by the accuracy of the
stopping process and the resulting deviation of the specified and actual stopping position.
Appl. Sci. 2022, 12, 4570 9 of 24
A stop is not only necessary at stations but also in front of red signals. Thus, two indicators
are introduced.
If using PSD in higher GoA, the acceptable range of deviation of the stopping position
is significantly lower. In [4], “the need to the ATO function of achieving a centimetre-level
accuracy for specific operations” is outlined. For stopping at stations with PSD a precision
of less than 10 cm is seen as a prerequisite.
As ATO systems shall contribute to a more efficient drive, characterized by reduced
energy demand, measuring the total energy consumption is essential. The total consump-
tion can be separated into different components such as traction and subsystem energy
consumption as well as regenerative braking energy. The most important component of the
overall energy use is the consumption of traction energy which is influenced by multiple
parameters. Those influences can be traction resistance, air resistance, efficiency rates of
system components and others.
In this paper, the energy consumption is calculated based on the data available within
the simulation log files of the simulator. Therefore, the energy consumption is calculated per
time step and summed up for each track section. In general, the following equations apply:
Ftrac = m · ( a − ar ), (1)
n Ftrac,i · vi + Ftrac,i+1 · vi+1
E= ∑ 2
· ∆t. (2)
i =1
where Ftrac is traction force in N, ar is the coasting resistance [m/s2 ] determined by the sim-
ulator based on an empirical Equation (Davis equation), a is the actual resulting acceleration
[m/s2 ] and m is the train mass in kg. In Equation (2), E is the total energy consumption in
[kWh], v is the actual speed in [m/s] and ∆t = ti+1 − ti is the duration of the time step and
n is the number of time steps.
Although ATO does not include the execution of safety-critical tasks, the performance
of the connected algorithms can be evaluated by measuring safety-related indicators.
These measures are well-accepted values and are commonly in use when examining
rail operations.
These values can be compared with a reference run on the evaluated route under
standardized (optimal) conditions, allowing a statement to be made about the quality of
the calculated speed profile. Furthermore, they can also serve as indicators for capacity
analyses, whereby further interpretation is necessary.
The indicators describe the number of approaches toward yellow and red signals as
well as counting the passages of yellow signals and stops at red signals. They provide
information on how unhindered a train can pass a track section. In addition, approaching
a yellow or particularly a red signal is always a situation that involves a higher risk than
passing a signal that indicates to proceed.
By calculating the minimal differences between the actual speed and the ETCS braking
curves, information is obtained about how close the algorithm comes to forced braking
or to what extent speed restrictions are observed. With a continuous evaluation of these
values (for each time step), more in-depth analyses would be possible.
In addition to the aforementioned KPIs, customer satisfaction is also influenced by
other factors related to comfort. One possibility to determine (parts of) this subjective
Appl. Sci. 2022, 12, 4570 10 of 24
feeling is to calculate the physical values of acceleration and jerk. Thereby it is possible to
get a statement about how comfortable or uncomfortable a drive is.
Based on relevant literature [37–39], ranges for the acceleration for the defined comfort
levels are introduced (Table 2). The absolute value of jerk should not exceed 3 sm3 . Braking
and acceleration values shall remain equal to or below 1.5 sm2 to be considered comfortable.
The proposed classification can serve as an indication of how to interpret the measured
values. All defined values must be seen in the context of the concrete application. Note
that comfort is only evaluated in terms of longitudinal motion.
Table 2. Comfort levels for absolute jerk and acceleration values based on the reviewed literature.
Figure 2. Layer model for the railway sector derived from the 6LM in [32].
Layers one, two and three in the 6LM are describing the static elements of a test
case, thereby including spatial information of all important structures and elements which
describe the static scenario components. Analogous to these layers, layers 1 and 2 of the
transferred model were defined, whereby track elements, signs and signals (without the
information, if it is not static) are subject to layer 1 and trackside structures like platforms,
tunnels, bridges, PSD and balises are elements of layer 2.
Layer three in the 6LM describes temporal changes of layers one and two if they are
consistently existing during the whole scenario. This is reasonable for road traffic because
changes to the road marking, lane guidance and connected traffic rule changes (e.g., signs
and traffic lights) can occur frequently. However, changes to the track position of rail
lines during construction are very rare and, if present, are known in advance. The installa-
tion of fences and warning lights can serve as an example of temporal modifications on
layer 2. Such changes will be subject to layer 2 for simplicity reasons, due to their limited
occurrences and often long-term installation.
From layer three upwards dynamic elements of a scenario are implemented which
do or can contain temporal information. Their state does not necessarily need to change
during the scenario but changes can occur, e.g., the sun is shining from the beginning of a
scenario till the end, or clouds are slowly covering the sun and it starts to rain.
Automatic train protection (ATP) is a vital function in train operation because it
influences all operational phases. Layer 3, which was originally containing temporal
changes to layers one and two in the 6LM, is now reserved for the ATP system, e.g., ETCS,
its components and possible interactions. When testing ATO functionalities, influences by
the ATP need to be evaluated. These influences can range from providing new movement
authorities (MA) to intervening when the actual speed is exceeding the ETCS emergency
brake indication speed curve or maximum track speed. As a result, the train might have to
brake to a full stop before continuing its drive.
Layer 4 is similar to layer four in the 6LM. S CHOLTES et al. are referring to it as the
“traffic layer” because it includes “movable objects whose movement could evolve over
time”. Dynamic objects in the railway sector are passengers (on a platform), other trains,
cars driving on level crossings, or other vehicles and objects connected to railway operation
like cranes for loading and unloading freight trains. They might not exist in such a great
amount as in road traffic. Nevertheless, they can play a significant role within a scenario.
Layer 5 copies the equivalent layer of the 6LM. All environment conditions are in-
cluded in this layer. In [10], proposing an ODD for ATO systems of Chinese highspeed
trains the following environmental parameters are conducted: humidity, wind speed and
temperature. Furthermore, the amount of precipitation, as well as the range of vision due
to fog or rainfall, can also be added to the list of environmental parameters.
On Layer 6 all digital information is incorporated, like in the original 6LM. Besides the
signal state, all information received from the ATO trackside component is subject to this
Appl. Sci. 2022, 12, 4570 12 of 24
layer. Examples of such information are journey profile updates with timetable changes,
temporary speed restrictions, disruptions, rerouting, and more. Generally, all transferred
information is modelled by this layer including connections of onboard systems.
Within the railway sector, several simulation environments are available either includ-
ing a proprietary scenario description language and format or making use of standardized
formats in the sector. The latter can also be highly relevant when aiming for a universal
scenario description. Potentially, the layer model can provide a basic structure for a new
scenario description format.
Bringing back the layer model for the railways, elements of the static layers one and
two as well as parts of layers three and six need to be defined as a base scenario before
logical scenarios can be set up. According to the simulator used and which default values
need to be defined, these minimal requirements can be more comprehensive.
Table 3. Definition of entities and their properties of an exemplary logical scenario for a train journey
based on the description of a simple functional scenario. Elements marked with an asterisk (*) do
already have finalized values and are part of the base scenario, showcasing the difference between a
base and a logical scenario.
Figure 3. Two different scopes and the according ways to evaluate the results of concrete
scenario simulations.
scenario must be taken to ensure a level playing field. When testing the algorithms against
the same scenario, it is possible to discover deviations in the handling of challenging
situations. For this purpose, either the performance for the base scenario or the performance
for the same scenario variation is comparable as illustrated in Figure 3.
4. Demonstration of Approach
This section describes a brief demonstration of the outlined approach. The simulation
framework that has been used for an exemplary ATO algorithm evaluation consists of
two components depicted in Figure 4:
Figure 4. Visualization of the simulation framework consisting of two major components (train
simulator and ATO algorithm), their subparts and the data exchange between them.
BEAOnline was originally developed as a driver advice system and based on the offline
functionality of BEA (Basic Energy Analysis), which can be used for energy consumption
estimation [43]. The main functionality of ATO and DAS algorithms, namely the calculation
of the optimal speed curve, coincides. With BEAOnline, a DAS algorithm was refined for
application as part of an ATO software.
The NEO simulator by ProRail allows the microscopic simulation of one (or multiple)
train drives. Thereby, train models and detailed virtual track images are necessary. The sys-
tem is based on the original version named MATRICS following the concept of gaming
simulation. It was used for several projects within ProRail either for demonstration or
research purposes. [44] Furthermore, NEO is capable of generating comprehensive logging
data, which is necessary to calculate the KPIs and thus evaluate the performance of the
algorithm under test. By reading and analyzing all simulation log files, the KPIs for ATO
algorithms are calculated and can be compared with other tests.
The case study provides a brief evaluation of the impact of different timetable defini-
tions including CTP variations to demonstrate the approach. For a complete evaluation of
the given ATO algorithm more profound analyses are necessary.
Different timetables The train equipped with the system under test (BEAOnline ATO
algorithm) is driving from a station on a reference line to a target destination and
needs to stop at one additional station along the line. For this journey, different
timetables are provided, which are characterized by the available running time.
Appl. Sci. 2022, 12, 4570 16 of 24
The timetable can include time windows for critical timing points. All other necessary
definitions are given by the base scenario.
The case study is based on an existing reference route available for the NEO simulator.
This route represents the railway line from Schiphol to Zwolle in the Netherlands. Train
operation is organized and constrained by timetables. Their change causes the algorithm
to adapt the calculations and thus also the optimal speed profile to the newly arising
constraints. Depending on whether the new limitations are tighter or less strict, the perfor-
mance of the ATO algorithm may change. It is intended to investigate to what extent the
algorithm can cope with varying running times set by the timetable.
The main goal of this demonstration is to show how the developed approach can
be applied and whether the defined KPIs allow concluding about the performance of the
algorithm in practice.
Based on the functional scenario two examples of logical scenarios are defined. In
Tables 4 and 5 a description of these logical scenarios is given. For both logical scenarios,
one part of the reference line functions as the corresponding base scenario, while the
baseline scenarios include all the specific characteristics of the section, the two logical
scenarios differ in terms of the concepts used to vary the timetables. As the section from
Zwolle to Almere Centrum includes two CTPs before the stop at Lelystad Centrum, logical
scenario 2 incorporates these as additional constraints. In contrast to that, no CTPs are
defined for the section of the reference line between Schiphol Airport and Almere assigned
to logical scenario 1.
Table 4. Description of the entities and their parameters relevant for testing the influence of dif-
ferent timetables on the ATO performance. Only showing relevant definitions differing from the
base scenario.
Table 5. Description of the entities and their parameters relevant for testing the influence of different
timetables and the variation of TWs of CTPs on the ATO performance. Only showing relevant
definitions differing from the base scenario.
Regarding the defined parameter ranges of the timetable properties, concrete timeta-
bles were defined. They are changing the original reference timetable of the base scenario
in such a way, that the different property parameter values within the defined ranges are
achieved. For example, for a concrete scenario derived from logical scenario one, this
could result in a two-minute reduction in available running time and for a second concrete
scenario, a four-minute reduction. A reasonable gradation of the parameters needs to be
selected according to the evaluation objectives.
Overall, in comparison with a reference drive under base scenario conditions with the
standard timetable, performance changes shall be identifiable if existing.
restrictive boundary conditions in the timetable, important insights were gained about the
simulation environment and the ATO algorithm itself.
Within Figure 5 the KPI values for different concrete scenarios of logical scenario one
are shown exemplarily. Logical and expected results, e.g., punctuality, can be identified.
Other KPIs need to be evaluated in a greater context. For example, the values of the
accuracy KPI vary greatly from one concrete scenario to the other. This variation can be
attributed to inaccuracies in the simulation environment, which resulted in a non-optimal
stopping process. Regarding maximum acceleration and jerk levels, measured values for
the available scenarios are within the defined limits. Energy consumption falls within the
expected ranges as well.
In the case of the capacity evaluation, a measurement area from the beginning of the
track up to a speed restriction area before the station of Amsterdam Zuid was selected.
This measurement allows determining how the algorithm tries to reduce the travel time to
a minimum by following the allowed maximum speed as closely as possible. Within the
base scenario, enough running time is available. Thus, it is not necessary to drive at full
speed leading to a greater area enclosed by the permitted and actual speed curve. When
the algorithm tries to achieve a time-optimal drive, the area enclosed by the two curves
should be minimal, since it tries to maintain the maximum permissible speed as closely
as possible.
How the capacity KPI is evaluated in this approach might not provide all information
important for capacity determination. Because the scope of the simulator is limited to a
microscopic assessment, the capacity evaluation is adapted to this characteristic. If analysis
on a network scale was possible, more general statements potentially of greater relevance
and significance may be drawn.
In Figure 6 visualization of the speed profile including relevant restrictions like the
maximum allowed track speed and ETCS emergency braking curve is displayed. Addition-
ally, track data like station locations and height are given. With this, a more convenient and
easier evaluation of the KPIs is possible by providing context to the plain numbers.
(a) (b)
(c) (d)
Figure 5. Cont.
Appl. Sci. 2022, 12, 4570 18 of 24
(e) (f)
Figure 5. Selection of plots comparing indicator values for concrete scenarios of logical scenario one.
(a) Arrival delay at stations. (b) Stopping accuracy at stations. (c) Energy consumption per track part.
(d) Capacity score. (e) Maximum acceleration per track part. (f) Maximum jerk per track part.
The scenarios are illustrating how the algorithm is reacting to a set of challenging
timetables. This is observable by the change in the KPI values. Additionally, the speed plots
provide viable information to further interpret the KPIs for example for a more in-depth
analysis of weaknesses of the algorithm under test or to understand better the reason for
certain specific behaviours.
Figure 6. Additional plots generated by the evaluation tool for the original base scenario simulation
of logical scenario one.
5. Conclusions
With the approach presented in this paper, two set goals are achieved, namely iden-
tifying a way to efficiently test ATO algorithms in a virtual environment and enabling
the comparison of multiple algorithms based on logical measures. Its application may
allow train operators and infrastructure managers to select the best suiting ATO systems
according to their requirements.
The approach consists of universal KPIs taken from the literature and supplemented
with sub-level indicators to measure the performance of ATO algorithms, a framework
for the structured definition of test scenarios taking advantage of the layer model for the
railway sector as well as different scenario types, and the identification of performance
relevant factors.
The KPIs introduced for this approach can provide valuable information about the
performance of an ATO algorithm in specific situations and scenarios. It proved difficult to
strike a balance between the need for further interpretation of the values and providing
Appl. Sci. 2022, 12, 4570 19 of 24
quantified metrics that are comparable when scenario definitions change. Calculating the
KPIs for each departure and arrival pair results in a more or less extensive list of values,
depending on the length of the scenario. If the KPIs were determined only for the entire
run, a simpler but less detailed overview of the performance for a given scenario would be
possible. Furthermore, it is necessary to provide a range or overview of reference values
for some indicators to be able to classify the measured quantitative values. This proved to
be particularly difficult for the comfort indicator since the classification of these values is
not commonly realized.
Within the approach, different scenario types are introduced. These are helpful to infer
from general test cases to exactly defined, complete, and thus also simulatable scenarios
in a structured manner. However, the proposed approach only abstractly defines those
scenario types. This decision was made due to the specific demands of each simulation
software concerning the scenario definition. If more standardization for the design and
exchange of scenarios is achieved, an improved and standardized concept for delineating
the different types of scenarios will also be beneficial and easier to establish. With the layer
model, a foundation is laid, based on the definition of entities and their properties.
Another critical aspect of the approach is the identification of factors potentially having
an impact on the ATO performance. Their extraction from the literature leads to an unequal
grade of detail. What is meant by this is that one factor can have a variety of impacts on the
performance of ATO systems due to the different effects it induces. Influences of several
causative constituents of railway operation were partially combined, or the causing entity
itself was included as a factor. This was done to achieve the conciseness of the overview
but resulted in redundancies. The list can certainly be extended and umbrella terms might
be divided into more specific impacts on ATO.
During the identification of the factors having a potential impact on the ATO algo-
rithm performance it was ascertained, that the layer model is applicable for defining the
simulation scenario, the entities within those scenarios, and also including the definition
of potential influencing factors of these entities. In contrast to that, it does not provide
a possibility to allocate factors directly related to the train under test, the so-called ego
train. More precisely, taking the impact of a reduced traction or braking power of the train
probably not known to the ATO algorithm as an example can lead to unexpected behaviour
and provoke an unsafe situation. This factor needs to be evaluated as well and therefore is
appending the list of influencing factors (Table A1) by a new row. This row is not assigned
to a layer of the layer model and thus resembles an additional category called Ego Train.
The approach focused on providing universal measures for evaluating and comparing
ATO algorithms. However, it showed to be interesting to analyze the information about the
calculations of the algorithm which are directly provided by the algorithm itself within a
log file.
More importantly, the proposed algorithm evaluation strategy is useful to test the
performance of ATO algorithms, especially due to the structured scenario design frame-
work, allowing an organized investigation of the potential performance influencing factors.
Furthermore, the KPIs seemed to be selected appropriately for the small case study con-
ducted. To fully determine if this is the case when aiming for the comprehensive evaluation
of an ATO algorithm and the comparison of multiple algorithms, additional explorations
are crucial.
The more specific definition of standards can be seen as a further opportunity for
improvement. Here, the definition of a uniform scenario format, for example, based on the
layer model, or the consideration of already existing formats is possible.
In general, the question remains open which level of performance can and needs to
be expected under varying operating conditions. In this context, the KPIs can provide an
important indication of the values to be achieved, but the definition of these criteria for
performance only seems to make sense with simultaneous consideration of the real routes
existing in the network and their characteristics.
For the indicators of comfort, stopping accuracy, and possibly punctuality, generally
applicable values can be defined and have been already partially provided in the scope of
this approach. The evaluation of the energy demand, capacity and the individual indicators
of the safety KPI is route-dependent and will have to be substantiated by empirical values
and feasible targets.
Lastly, if the specifications for ATO over ETCS are finally realized a further analysis of
the interfaces defined within those specifications can be worthwhile. These interfaces have
to certainly include messages, status updates and variables which need to be sent out by the
ATO onboard or trackside unit. Because of the therefore necessary standardization, useful
data might be available generally. This data could allow more extensive ATO performance
analyses but is limited to systems under the ATO over ETCS concept.
Author Contributions: Conceptualization, P.B.; methodology, P.B.; software, P.B.; validation, P.B.;
investigation, P.B.; writing—original draft preparation, B.J. and P.B.; writing—review and editing,
B.J. and P.B.; visualization, P.B.; supervision, B.J. All authors have read and agreed to the published
version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Acknowledgments: We would like to thank Jelle van Luipen from ProRail, the railway infrastruc-
ture manager in The Netherlands, for supporting this work through thorough discussions and the
internship opportunity with ProRail.
Conflicts of Interest: The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
6LM 6-layer model
ADAS Advanced Driver Assistance System
ASAM Association for Standardization of Automation and Measuring Systems
ATO Automatic Train Operation
ATP automatic train protection
AV Automatic vehicle
BEA Basic Energy Analysis
CTP critical timing point
DAS Driver Advisory System
ETCS European Train Control System
GoA Grade of Automation
ISO International Standardization Organization
KPI Key performance indicator
MA Movement Authority
ODD Operational Design Domain
PSD platform screen doors
SAE Society of Automotive Engineers
SBB Schweizerische Bundesbahnen
TSI Technical Specifications for Interoperability
Appl. Sci. 2022, 12, 4570 21 of 24
Table A1. Overview of all factors capable of influencing the performance of an ATO system and
especially the ATO algorithm. Factors are structured by the layers of the layer model of the railway
sector and provide information about the operational phase they are relevant to.
References
1. Tasler, G.; Knollmann, V. The introduction of highly automatic operation—Towards fully automatic train operation. Signal Draht
2018, 110, 6–14.
2. International Energy Agency. The Future of Rail; Technical Report; IEA: Paris, France, 2019.
3. Akbari, M.; Hoogewoonink, B.; Godziejewsk, B.; Rajabalinejad, M. Expediency of ATO in heavy rail: A survey for the Dutch
Railways. MATEC Web Conf. 2020, 314, 1005. [CrossRef]
4. Sirti. D3.2—Automatic Train Operations: Implementation, Operation Characteristics and Technologies for the Railway Field; Technical
Report; ASTRail Consortium, 2019.
5. Kessell, C. Main Line ATO Evaluated. Rail Eng. 2017, 150, 20–24.
6. Yin, J.; Tang, T.; Yang, L.; Xun, J.; Huang, Y.; Gao, Z. Research and development of automatic train operation for railway
transportation systems: A survey. Transp. Res. Part C Emerg. Technol. 2017, 85, 548–572. [CrossRef]
7. Nolte, J.; Wanner, F.; Matthias, M.; Kyburz, M. ATO 2 Basic Phase 1 Abschlussbericht; Technical Report; SBB: Bern, Switzerland,
2019.
8. Nolte, J. ATO2Basic Phase 2 Abschlussbericht; Technical Report; SBB: Bern, Switzerland, 2020.
9. Hartwell, G. A Review of the Thameslink Programme; Technical Report; Institution of Railway Signal Engineers Australasia: London,
UK, 2015.
10. Meng, Z.; Tang, T.; Wei, G.; Yuan, L. Analysis of ATO System Operation Scenarios Based on UPPAAL and the Operational Design
Domain. Electronics 2021, 10, 503. [CrossRef]
11. Karg, S.; Raschke, A.; Tichy, M.; Liebel, G. Model-driven software engineering in the openETCS project. In Proceedings of the
ACMIEEE 19th International Conference on Model Driven Engineering Languages and Systems; Baudry, B., Ed.; ACM: New York, NY,
USA, 2016; pp. 238–248. [CrossRef]
12. Börcsök, J. Funktionale Sicherheit, 4th ed.; VDE Verlag GmbH: Berlin, Germany, 2015.
13. Moy, Y.; Ledinot, E.; Delseny, H.; Wiels, V.; Monate, B. Testing or Formal Verification: DO-178C Alternatives and Industrial
Experience. IEEE Softw. 2013, 30, 50–57. [CrossRef]
14. International Standardization Organization. Road Vehicles; Technical Report ISO 26262; ISO: Geneva, Switzerland, 2018.
15. Ponn, T. How to Define System-Specific Corner Cases for the Type Approval of Automated Vehicles. Ph.D. Thesis, Technische
Universität München, Munich, Germany, 2021.
16. Riedmaier, S.; Ponn, T.; Ludwig, D.; Schick, B.; Diermeyer, F. Survey on Scenario-Based Safety Assessment of Automated Vehicles.
IEEE Access 2020, 8, 87456–87477. [CrossRef]
17. Huang, W.; Wang, K.; Lv, Y.; Zhu, F. Autonomous vehicles testing methods review. In Proceedings of the 2016 IEEE 19th
International Conference on Intelligent Transportation Systems (ITSC), Rio de Janeiro, Brazil, 1–4 November 2016; pp. 163–168.
[CrossRef]
18. Kitajima, S.; Shimono, K.; Tajima, J.; Antona-Makoshi, J.; Uchida, N. Multi-agent traffic simulations to estimate the impact of
automated technologies on safety. Traffic Inj. Prev. 2019, 20, 58–64. [CrossRef] [PubMed]
19. Hilderman, V. DO-178C BEST PRACTICES; Technical Report; AFuzion: Los Angeles, CA, USA, 2020.
20. Schuldt, F.; Saust, F.; Lichte, B.; Maurer, M.; Scholz, S. Effiziente systematische Testgenerierung für Fahrerassistenzsysteme in
virtuellen Umgebungen. In AAET2013—Automatisierungssysteme, Assistenzsysteme und Eingebettete Systeme für Transportmittel.
Institut für Regelungstechnik; TU Braunschweig: Braunschweig, Germany, 2013.
21. Stellet, J.E.; Zofka, M.R.; Schumacher, J.; Schamm, T.; Niewels, F.; Zollner, J.M. Testing of Advanced Driver Assistance Towards
Automated Driving: A Survey and Taxonomy on Existing Approaches and Open Questions. In Proceedings of the 2015 IEEE
18th International Conference on Intelligent Transportation Systems, Gran Canaria, Spain, 15–18 September 2015; pp. 1455–1462.
[CrossRef]
22. Nalic, D.; Mihalj, T.; Bäumler, M.; Lehmann, M.; Bernsteiner, S. Sceneario based testing of automated driving systems: A literature
survey. In Proceedings of the FISITA Web Congress 2020; FISITA: Chennai, India, 2020.
23. Kügler, M.E.; Mumm, N.C.; Holzapfel, F.; Schwithal, A.; Angermann, M. Vision-Augmented Automatic Landing of a General
Aviation Fly-by-Wire Demonstrator. In AIAA Scitech 2019 Forum; American Institute of Aeronautics and Astronautics: Reston,
VA, USA, 2019. [CrossRef]
24. Angermann, M.; Wolkow, S.; Schwithal, A.; Tonhäuser, C.; Hecker, P. High Precision Approaches Enabled by an Optical-Based
Navigation System. In Proceedings of the ION 2015 Pacific PNT Meeting, Honolulu, HI, USA, 20–23 April 2015; pp. 694–701.
25. Karlsson, E.; Schatz, S.P.; Baier, T.; Dörhöfer, C.; Gabrys, A.; Krause, C.; Hochstrasser, M.; Lauffs, P.J.; Mumm, N.C.; Nürnberger,
K.; et al. Development of an Automatic Flight Path Controller for a DA42 General Aviation Aircraft. In Advances in Aerospace
Guidance, Navigation and Control; Doł˛ega, B., Gł˛ebocki, R., Kordos, D., Żugaj, M., Eds.; Springer: Berlin/Heidelberg, Germany,
2018; pp. 121–139. [CrossRef]
26. Theunissen, E.; Kotegawa, T. Applying LVC to testing and evaluation of DAA systems. In Proceedings of the 2017 IEEE/AIAA
36th Digital Avionics Systems Conference (DASC), St. Petersburg, FL, USA, 17–21 September 2017; pp. 1–7. [CrossRef]
27. Bagschik, G.; Menzel, T.; Reschka, A.; Maurer, M. Szenarien für Entwicklung, Absicherung und Test von Automatisierten
Fahrzeugen. 11. Workshop Fahrerassistenzsysteme und Automatisiertes Fahren. 2017; pp. 125–135. Available online: https:
//www.uni-das.de/images/pdf/veroeffentlichungen/2017/13.pdf (accessed on 25 April 2022).
28. ASAM Group. ASAM openSCENARIO; Technical Report; ASAM: Chevy Chase, MD, USA, 2021.
Appl. Sci. 2022, 12, 4570 24 of 24
29. ASAM Group. ASAM XIL; Technical Report; ASAM: Chevy Chase, MD, USA, 2020.
30. PEGASUS Project Office. About PEGASUS. Available online: https://www.pegasusprojekt.de/en/about-PEGASUS (accessed
on 25 April 2021).
31. PEGASUS Project Office. PEGASUS Method; Technical Report; PEGASUS Project: Berlin, Germany, 2019.
32. Scholtes, M.; Westhofen, L.; Turner, L.R.; Lotto, K.; Schuldes, M.; Weber, H.; Wagener, N.; Neurohr, C.; Bollmann, M.H.;
Kortke, F.; et al. 6-Layer Model for a Structured Description and Categorization of Urban Traffic and Environment. IEEE Access
2021, 9, 59131–59147. [CrossRef]
33. Thorn, E.; Kimmel, S.; Chaka, M. A Framework for Automated Driving System Testable Cases and Scenarios; Technical Report; National
Highway Traffic Safety Administration: Washington, DC, USA, 2018.
34. X2Rail1 Consortium. D6.1 Current Test Condition and Benchmarking Report; Technical Report, 2018.
35. Hoffmann, D.W. Software-Qualität; Springer: Berlin/Heidelberg, Germany, 2013. [CrossRef]
36. Ulbrich, S.; Menzel, T.; Reschka, A.; Schuldt, F.; Maurer, M. Defining and Substantiating the Terms Scene, Situation, and Scenario
for Automated Driving. In Proceedings of the 2015 IEEE 18th International Conference on Intelligent Transportation Systems, Las
Palmas, Spain, 15–18 September 2015; pp. 982–988. [CrossRef]
37. Powell, J.P.; Palacín, R. Passenger Stability Within Moving Railway Vehicles: Limits on Maximum Longitudinal Acceleration.
Urban Rail Transit 2015, 1, 95–103. [CrossRef]
38. Liu, K.W.; Wang, X.C.; Qu, Z.H. Research on Multi-Objective Optimization and Control Algorithms for Automatic Train Operation.
Energies 2019, 12, 3842. [CrossRef]
39. Hoberock, L.L. A Survey of Longitudinal Acceleration Comfort Studies in Ground Transportation Vehicles. J. Dyn. Syst. Meas.
Control. 1977, 99, 76–84. [CrossRef]
40. Stanley, P. ETCS for Engineers, 1st ed.; DVV Media Group, Eurail Press: Hamburg, Germany, 2011.
41. Pachl, J. Systemtechnik des Schienenverkehrs, 10th ed.; Springer Fachmedien Wiesbaden and Springer Vieweg: Berlin/Heidelberg,
Germany, 2021.
42. Lübke, D.; Hecht, M. Das System Bahn, 1st ed.; DVV Media Group (Eurailpress): Hamburg, Germany, 2008.
43. Albrecht, T.; Gassel, C.; Binder, A.; van Luipen, J. Dealing with operational constraints in energy efficient driving. In IET
Conference on Railway Traction Systems (RTS 2010); IET: London, UK, 2010; pp. 1–7. [CrossRef]
44. van Luipen, J.; Meijer, S. Uploading to the MATRICS: Combining simulation and serious gaming in railway simulators. In Rail
Human Factors around the World; Wilson, J.R., Mills, A., Clarke, T., Rajan, J., Dadashi, N., Eds.; CRC Press: Boca Raton, FL, USA,
2012; pp. 165–177.