Theses 2
Theses 2
Theses 2
Composition du Jury :
Michel BASSET
Thèse de doctorat
Automation of everyday vehicles becomes a societal, economic and technological issue. This
task of driving automation is suggested as one of the major solutions to make transportation
safer, more comfortable and potentially more environment friendly. Automated systems
are gradually implemented in recent vehicles through Advanced Driver Assistance Systems
(ADAS). However, they are often limited to warning systems and a partial delegation of
control, or only applied under controlled and constrained conditions.
This thesis work is part of the development of a self-driving car in highway environments.
More precisely, it aims to propose a unied architecture of trajectory planner and decision-
maker taking into account the limitations of the environment and the available data within
the current development of sensors technologies (distance limitations, uncertainties).
On the one hand, the method generates sigmoid trajectories in a continuous spatiotempo-
ral representation of the evolution space, which is reduced beforehand by modeling collision-
free intervals in nominal conditions of driving. The sigmoid parameters are subsequently
optimized with a simulated annealing approach that uses the decision-maker algorithm as
the evaluation function for the generated trajectory. It thus makes it possible to elude both
the discretization and position/speed decoupling problems. On the other hand, the aggre-
gation of fuzzy logic and belief theory allows decision making on heterogeneous criteria and
uncertain data. The proposed framework also handles personalization of the vehicle's behav-
ior, depending on the passengers' risk perception and an aggressive or conservative driving
style.
The presented approach was nally evaluated and validated in a simulation environment,
and then in a test vehicle. The planning block was integrated into the existing vehicle's
architecture, interfaced with the localization, obstacles' perception and control blocks.
Je souhaite remercier tout d'abord les professeurs et chercheurs qui ont accepté de participer
à mon jury de thèse. Ainsi, je remercie plus particulièrement mes deux rapporteurs, Michel
Basset et Fawzi Nashahsibi, dont les retours minutieux et experts m'ont permis de pren-
dre conscience du travail réalisé et d'améliorer la qualité et la clarté du manuscrit et de la
soutenance. Merci également à Arnaud de La Fortelle d'avoir accepté la présidence de ce
jury, ainsi qu'à Lydie Nouvelière et Olivier Orla pour les discussions enrichissantes que nous
avons pu mener pendant la soutenance, et dont les questions éclairées ont permis de couvrir
tous les aspects de mon travail et des perspectives à y apporter. J'ai été sincèrement honorée
de votre présence et de l'intérêt que vous avez porté à mes recherches. Je tiens également à
remercier Dominique Gruyer d'avoir initié ce projet de thèse et de m'avoir oert la possibilité
d'eectuer un séjour au laboratoire ACIS de l'Université de Colombie Britannique, ainsi que
Sébastien Glaser pour les discussions autour de la thèse et de m'avoir permis de mener ce
projet à son terme.
vous dire merci pour tout ce que vous avez fait pour moi. Guillaume, tu as su trouver les
mots pour me ramener à mon bureau et me redonner conance quand je pensais avoir tout
perdu. Marc, tu as été un véritable compagnon d'aventure. Tu as été l'une des premières
personnes avec qui j'ai discuté lors de mon entretien à VEDECOM et tu m'as accompagné
jusqu'à la n. Tu as toujours pris le temps de m'écouter et de me conseiller. Tu m'as apporté
ta sincérité et ton respect, qui sont deux valeurs essentielles à mes yeux. Je n'aurais pas pu
espérer mieux que tout ce que tu m'as apporté, et tu m'as aussi fait grandir (une parenthèse
de merci).
Je n'oublie pas les amis, toujours présents pour se changer les idées et qui malgré mon
intermittence, me sont restés dèles et ont supporté mes mines de déterrée après un rendu
d'article. Merci aux copains de Condorcet avec qui nous fêtons aussi nos 10 ans d'amitié,
merci aux copains des copains qui sont devenus des copains, les copains de Tournan-en-Brie,
ceux des Ponts et ceux de Centrale Paris, merci de m'avoir accueillie ! Un grand merci aussi
à ceux dont les sms ont pu rester sans réponse, mais qui ont toujours répondu présents,
Quentin et Soline. Et merci à tous ceux que j'ai pu rencontrer lors de ce doctorat en France
ou à l'étranger.
Je souhaite également remercier ma famille et ma belle-famille pour leur soutien. Mamy,
Papa, Maman et Lili, j'espère que vous avez bien compris ce que je faisais lors de la soutenance
car il n'y en aura pas d'autres ! Je souhaite dédier ce travail à mon grand-père qui n'aura
pas eu le temps de le voir achevé.
Pour terminer, je remercie de tout c÷ur Milan. Comme tu le dis dans tes remerciements,
c'était un sacré dé que de commencer nos thèses en ce même jour du 1er décembre 2015
. Ça l'a été, et pas des moindres. C'était un beau dé à partager avec toi. Tu m'as montré
un soutien sans faille, tu as répondu présent pour les discussions, les réexions, les recherches
bibliographiques à 3h du matin quand j'avais un doute existentiel avant d'envoyer mes articles.
Merci pour tes relectures, tes encouragements, ton réconfort et ta conance. Merci d'avoir
toujours été là, d'avoir tout partagé avec moi, et je suis prête à relever n'importe quel autre
dé à tes côtés.
Contents
1 Introduction 1
1.1 Context of the autonomous vehicle . . . . . . . . . . . . . . . . . . . . . . . . 2
1.1.1 Evolution of the automated vehicle towards the self-driving car . . . . 2
1.1.2 From ADAS to fully autonomous vehicles . . . . . . . . . . . . . . . . 4
1.2 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3.1 A taxonomy of motion planning algorithms . . . . . . . . . . . . . . . 8
1.3.2 A new architecture for decision making and motion generation . . . . 9
1.3.3 A continuous spatiotemporal maneuver generation . . . . . . . . . . . 9
1.3.4 An adaptive framework design for decision making . . . . . . . . . . . 9
1.3.5 Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.4 Structure of the document . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Bibliography 173
List of Figures
3.19 Inuence of trust values on the aggregate function of category 3 for the left
trajectory p1. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
3.20 Decision brackets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
4.1 Illustration of the reachable sets and graph interval with a partitioning space. 86
4.2 Illustration of the two classic architectures for motion generation and decision
making. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
4.3 All-in-one motion planning architecture. . . . . . . . . . . . . . . . . . . . . . 88
4.4 Example of 10 intermediate longitudinal velocity proles. . . . . . . . . . . . 89
4.5 Characterization of the sigmoid function. . . . . . . . . . . . . . . . . . . . . . 91
4.6 Illustration of full lane occupancy obstacles. . . . . . . . . . . . . . . . . . . . 92
4.7 The holistic view of safety. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
4.8 Example of NCNI. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
4.9 Algorithm diagrams. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
4.10 NCNI characterization. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
4.11 Illustration of the sigmoid completion constraint. . . . . . . . . . . . . . . . . 102
4.12 Illustration of the variables search space. . . . . . . . . . . . . . . . . . . . . . 103
4.13 Evaluation function. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
4.14 Inuence of T0 and q on the maximum evaluation value. . . . . . . . . . . . . 105
4.15 Illustration of the numerical scenario. . . . . . . . . . . . . . . . . . . . . . . . 106
4.16 Representation of the black-box evaluation function Jobj . . . . . . . . . . . . 107
2.1 Space and time horizon for the motion planning functions. . . . . . . . . . . . 18
2.2 Comparison table for highway applications of motion planning methods. . . . 44
2.3 Use case references table for highway applications of motion planning methods. 46
Abbreviation
DST Dempster-Shafer Theory
FDST Fuzzy Dempster-Shafer Theory
FIS Fuzzy Inference System
MCDM Multi-Criteria Decision Making
NCNI Non-Collision Nominal Interval
SA Simulated Annealing
Frames
E ego local frame
O obstacle local frame
R road local frame
W world global frame
xviii List of Tables
Functions
Operators
˙ time derivative
¨ time second derivative
∆() dierence operator for variables
end() returns the end value of a set
inf() returns the inferior value of an interval
isT rue() returns the Boolean value (0 : f alse, 1 : true)
l() returns the length of an interval
max() returns the maximum value of a set
mean() returns the mean value of a set
min() returns the minimum value of a set
sum() returns the sum value of a set
sup() returns the superior value of an interval
Fuzzy Operators
Vectors
Variables
α ego direction angle with x-axis
ax ego longitudinal acceleration
ay ego lateral acceleration
b sigmoid shift parameter
bba basic belief assignment
bel belief degree (lower probability)
c sigmoid delay parameter
d Euclidean distance
df,collision front collision distance
dr,collision rear collision distance
df,saf e front safe distance
dr,saf e rear safe distance
dx longitudinal deceleration
dir maneuver direction dir = {Lef t, Straight, Right}
Hp prediction horizon
Ip evaluation index of the pth trajectory
λ sigmoid parameter
mfA (us ) membership degree of the criterion value us in the fuzzy set A
pl plausibility degree (upper probability)
q geometric parameter of the geometric progression
t time variable
T SA temperature
T0 SA initial temperature
Tf SA nal temperature
us value of criterion s
uodA universe of discernment of the fuzzy set A
v velocity
vego ego velocity
vmax maximum road speed limit
vmin minimum road speed limit
vO j Oj velocity
vrel,Oj relative velocity between ego and Oj
vt target velocity
vwish driver's desired velocity
vx ego longitudinal velocity
vy ego lateral velocity
νs weight of the sth criterion
wκ weight of the κth category
Ωh weight of the hth hypothesis
x position along x-axis in the road R or ego E frame
xrel,Oj relative longitudinal position between ego and Oj
y position along y -axis in the road R or ego E frame
yrel,Oj relative lateral position between ego and Oj
ycenterline lane centerline lateral coordinate in R
xx List of Tables
Others
H hypothesis
I interval
Jobj objective function
LW local warning risk level (LWκι for category κ and fuzzy set ι)
O obstacle
OinfIk represents the inferior bound of Ik
OsupIk represents the superior bound of Ik
RA Risk Assessment
Quantities
C cardinal of the categories κ = 1..C
H cardinal of the hypotheses H h = 1..H
J cardinal of the obstacles O j = 1..J
K cardinal of the intervals I k = 1..K
M cardinal of the maneuvers m = 1..M
N cardinal of the ego proles i = 1..N
nLW cardinal of the combined output fuzzy sets l = 1..nLW
P cardinal of the candidate trajectories p = 1..P
S cardinal of the criteria s = 1..S
Chapter 1
Introduction
Contents
1.1 Context of the autonomous vehicle . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.1.1 Evolution of the automated vehicle towards the self-driving car . . . . 2
1.1.1.1 Phase 1: Research projects (1920s-1980) . . . . . . . . . . . 2
1.1.1.2 Phase 2: Consortium projects (1980-2000) . . . . . . . . . . 2
1.1.1.3 Phase 3: Expansion of autonomous projects (2000-present) . 3
1.1.2 From ADAS to fully autonomous vehicles . . . . . . . . . . . . . . . . 4
1.1.2.1 Levels of driving automation for on-road vehicles and adap-
tation of legislations . . . . . . . . . . . . . . . . . . . . . . 4
1.1.2.2 ADAS implemented in today vehicles for highway driving . 5
1.1.2.3 The automotive companies and the emergence of new IT
business . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3.1 A taxonomy of motion planning algorithms . . . . . . . . . . . . . . . 8
1.3.2 A new architecture for decision making and motion generation . . . . 9
1.3.3 A continuous spatiotemporal maneuver generation . . . . . . . . . . . 9
1.3.4 An adaptive framework design for decision making . . . . . . . . . . . 9
1.3.5 Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.4 Structure of the document . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Since the last decade, the development of automated and autonomous vehicles has spread
worldwide among universities, software companies and the automotive sector as one of the
promising advancements in automotive engineering and research. The automation of everyday
vehicles becomes a societal, economic and technological issue. This task of driving automation
is studied in the eld of transportation safety, comfort and environmental care. Automated
systems are gradually implemented in recent vehicles through Advanced Driver Assistance
Systems (ADAS).
This rst chapter presents the evolution of the context of autonomous vehicles in section
1.1, the motivations driving this doctoral work in section 1.2 and the contributions made in
section 1.3. The structure of the document is nally summarized in section 1.4.
2 Chapter 1. Introduction
Carnegie Mellon University's Navlab succeeded in their project No Hands Across Amer-
ica with Navlab 5 [NHA ]. They drove, from Pittsburgh, PA to San Diego, CA, USA, a
steering controlled vehicle, with human-controlled throttle and brake. To quote but a few,
others inuential demonstrated projects were the ARGO project of University of Parma's
VisLab [Broggi et al. 1999], a lane marking- and vehicle detection-based vehicle, which trav-
eled the MilleMiglia in Automatico tour a 2000 km journey along Italian highways , and
the platooning demonstration of automated highway in 1997 at San Diego by the California
Program On Advanced Technology For The Highway (PATH) in collaboration with General
Motors during the National Automated Highway System Consortium (NAHSC) [Rajamani
et al. 2000], see Figure 1.1b.
(a) The radio-operated automobile (b) Demo'97: I-15 lanes in San (c) The VIAC VisLab vehicle.
from the Houdina Radio Control Co. Diego, CA.
1.1.2.1 Levels of driving automation for on-road vehicles and adaptation of leg-
islations
After Nevada, USA, authorized driverless vehicle experimentations on public roads in June
2011, other American states and countries (arranged in alphabetical order: Belgium, China,
France, Germany, Italy, Japan, the Netherlands, Singapore, Spain, Switzerland or United
Kingdom) adopted laws to test autonomous vehicles in trac. In response to these evolu-
tions, the United Nations Economic Commission for Europe (UNECE) amended the Vienna
Convention on Road Trac [United Nations ] to make the use of automation more exible. It
allowed automated driving systems, "provided that these technologies are in conformity with
the United Nations vehicle regulations or can be overridden or switched o by the driver"
[UNECE ].
With such a fast development in the automotive eld, the Society of Automotive Engineers
(SAE) published in 2014 a standard classication for automated and autonomous vehicles
with a 6-level system, detailed in Table 1.1, from 0 (no control but active safety systems) to 5
(no human intervention for driving) [SAE ]. Within this decomposition, the SAE provides a
common taxonomy and denition for automation levels according to the operator, i.e. human
driver or system, for the operational driving task (steering, acceleration, deceleration), the
environment monitoring, the fallback performance, and the driving scenario capability. With
respect to this taxonomy, the automated vehicle applies the levels 1 to 3 and the autonomous
vehicle designates the levels 4 and 5.
1.1. Context of the autonomous vehicle 5
At the moment, main automakers propose level-2 functions for individual vehicles except
for Audi who implemented a level-3 feature for the trac jam driving mode, and a level-4
for autonomous shuttle services, as it is described in the next section.
speed is more stable and limits trac jamming and fuel consumption; see [Wang et al. 2018c]
for a recent review. While interacting with obstacles, a collision avoidance system is applied
to either warn the driver of an imminent collision, or to autonomously take actions, such as
braking, helping the driver to stay safe. The Forward Collision Warning (FCW) and Reverse
Collision Warning (RCW) are visual and acoustic warnings for, respectively, front and rear
safety distances with obstacles. Lane Change Assist (LCA) systems assess the surrounding
trac and warn the driver against impending collision during his/her lane change intent.
Exploiting also the well painted lane marks on highways, systems based on Lane Departure
Warning (LDW) alert if the vehicle is leaving the ego lane, or those on Lane Keeping Assist
(LKA) keep the vehicle within the ego lane marks.
Nevertheless, all those functions are mainly reactive, as perception/action scenarios, and
are used as isolated functions of the system. A systemic point of view including both pre-
dictive and reactive reasoning is still missing. That is part of the challenges for the next
generation of self-driving cars. Moreover, two evolutions are conceivable: incrementally im-
proving ADAS to turn them into self-driving cars, or thinking the self-driving car as a new
system. The rst option allows a progressive integration of the automation levels to the full
automation without steering wheel. The second option considers the vehicle as a new fully
autonomous system, where ADAS would only concern the dynamic control of the vehicle,
warning systems and comfort of passengers. This remains an open question today among the
car-makers and the new actors of IT as detailed in the next subsection.
(a) Audi A8 L. (b) Waymo self-driving cars. (c) Navya autonom shuttle.
1.2 Motivation
The autonomous vehicle is one of the signicant technological evolutions arousing popular
enthusiasm in the past decade. As such, autonomous vehicles are now applied to dierent
kinds of mobility, such as individual vehicles, on-demand robo-taxis, or autonomous shuttles
used for the last kilometer or on limited areas. The development of this technology brings
up potential improvements. First, it tackles the safety issue with systems which have faster
reaction time than human drivers and are potentially able to communicate their intentions
more clearly on the road than human drivers can do to each other. These system capacities
would also help to decrease road congestion. In addition, they come along with a new vision
of transport and relationship to the personal vehicle. This is part of a democratization and an
environmental approach in order to reduce the number of vehicles in trac, particularly in big
cities, and to optimize the existing eet by sharing it between several individuals (on-demand
autonomous vehicles for car-sharing providing robo-taxi features). This would contribute to
save transportation time and money. Furthermore, the social aspect is part of the motivation
for vehicle automation by oering mobility to people with disabilities or diculties to drive.
Among the various components of an autonomous system, the planning part is considered
as the thinking part, responsible for the chosen actions. This part is in charge of making
the link between environmental observations from perception and vehicle's actions, see Fig-
ure 1.3. In spite of the current trendiness of this topic, autonomous projects are not recent
in the robotics literature and many reviewing works exist on motion planning strategies
[Latombe 1991, Laumond et al. 1998, LaValle 2006] or, more recently, on the specic use case
of autonomous driving [Katrakazas et al. 2015, González et al. 2016, Paden et al. 2016]. How-
ever, the standardization of the research to t industrially realistic prototypes of self-driving
cars introduces new concerns such as functional safety, real-time computing, a systemic ap-
proach, or low-cost developments, and specic popular applications for parking, intersection
management, or highway driving. The motivation for highway driving lies in its simple driving
structure and the driver's limited behavior in nominal situations, making it the most reach-
able context for the rst fully autonomous systems in trac. Furthermore, highways seem
to be the rst environment where drivers would be condent driving in a fully autonomous
mode [Payre et al. 2014]. Nonetheless, the high velocity on highway environment is a major
risk factor in case of crash.
The motivation of this Ph.D. work is to propose an approach to systemically treat decision
making and trajectory generation in a dynamic highway environment. In addition, the state
8 Chapter 1. Introduction
of the art lacks a taxonomy based on solutions that can be implemented with the available
resources, but that are exible enough to be able to evolve with future technological advances.
Therefore, we assume that the vehicle's decision making is carried out in an "isolated" vehicle,
without communication with its environment and realistic sensors' capacities for perception,
which leads to a vision at 50 − 100 meters. Even if these perception distances are admissible
for highway behavior, their limitations are a major disadvantage for autonomous vehicles at
high velocities. Moreover, we do not consider the driver in the loop at the core of the decision
making and trajectory generation developments. Thereby, our work belongs to SAE level-4,
with focus on highway driving. Nonetheless, the techniques used for decision making and
trajectory generation are not dependent on the driver actions, and can be integrated without
any intrinsic change in the SAE levels 1 to 3, where the driver will then aect the inputs,
outputs and integration constraints. Consequently, the research question driving this work
is to address the problem of a linear architecture between decision making and trajectory
generation considering the uncertainties of perception, while avoiding an overly conservative
vehicle's behavior.
1.3 Contributions
The problem addressed in this thesis is the motion planning, including the decision making
and motion generation. The main contributions in the present dissertation are synthesized
below.
self-driving cars on highway environment. It follows two purposes: (1) motion planning tech-
niques are highly dependent on the constraints of perception, control, and environment, thus
it is important to refer to an algorithm with the appropriate behavior with relation to the
target application; (2) a systemic treatment of motion planning will lead to develop comple-
mentary meta-algorithms. Finally, the issues of implementing motion planning algorithms on
autonomous vehicle in mixed-human trac infrastructure are overviewed for future research
directions.
deals with vague or imprecise information, handles non-independent variables, and recalls
human evaluation process. The second level combines the categories with the Dempster-
Shafer Theory. It yields both the evaluation of decisions and how it has been obtained, xes
the inuence of categories on the nal decision, and interprets the fuzzy information into
bound estimates. Finally, a linear combination of the beliefs intervals selects the best motion
to follow.
Thus, the proposed framework for decision making simultaneously considers several fac-
tors of risk with their relative importance for human safety. It also handles personalization
of the vehicle's behavior, depending on the passengers' risk perception and an aggressive or
conservative driving style. Real testing validates the algorithm, which is evaluated by comfort
and safety analyses.
1.3.5 Publications
Peer-Reviewed Journals
Title: Perception, information processing and modeling: Critical stages for autonomous
driving applications
Authors: Dominique Gruyer, Valentin Magnier, Karima Hamdi, Laurène Claussmann, Olivier
Orla, and Andry Rakotonirainy
Journal: Annual Reviews in Control
Volume: 44, Pages: 323-341, Year: 2017.
Title: A study on AI-based approaches for high-level decision making in highway autonomous
driving
Authors: Laurène Claussmann, Marc Revilloud, Sébastien Glaser, and Dominique Gruyer
Conference: IEEE International Conference on Systems, Man, and Cybernetics (SMC)
Place: Ban, AB, Canada Date: October 58, 2017. Used in Chapters 2 and 6
Title: Multi-Criteria Decision Making for Autonomous Vehicles using Fuzzy Dempster-
Shafer Reasoning
Authors: Laurène Claussmann, Marie O'Brien, Sébastien Glaser, Homayoun Najjaran, and
Dominique Gruyer
Conference: IEEE Intelligent Vehicles Symposium (IV)
Place: Changshu, Suzhou, China Date: June 26-30, 2018. Used in Chapter 3
The present Ph.D. thesis is organized in six chapters. The remaining chapters are described
as follows:
Chapter 2, State of the Art This chapter presents a literature review of the motion
planning algorithms applied to autonomous vehicles. Motion planning is considered under
the decision making and motion generation functions. The scope of this review focuses on
the highway environment. A taxonomy is proposed in order to highlight the main features
of each algorithm for their application to real-time autonomous systems in complex environ-
ments. Thus, six families are introduced, considering their goal, space-time property and
their mathematical domains. For each family, we review the basic idea, the specicities, the
advantages and drawbacks, the improvements, and the interest for highway planning. At the
end, a comparison table is built to summarize the aforementioned characteristics, intrinsic
and extrinsic limits for a real-time use in highway motion planning.
Chapter 3, Decision making This chapter introduces our work for decision making. The
decision part in motion planning is crucial and deals with many dierent kinds of criteria and
data, such as guaranteeing the safety time-space between the ego vehicle and its environment,
respecting the physical limits of the ego vehicle, taking into account the passengers' comfort,
following the navigation orders, etc. This presents two main issues: the heterogeneous criteria
and the uncertain data. The proposed framework for decision making addresses both by
combining the fuzzy logic approach and the Belief theory.
the prototype vehicle of Institut VEDECOM on a high speed test track with a 2-lane road,
approximatively straight over 1000 meters. The planning algorithms have been integrated
with a controller and perception modules. Two real obstacles (manual-driving) are introduced
in the last scenarios.
Chapter 6, Conclusion, future work and research perspective This nal chapter
presents the conclusions on the main contributions of this Ph.D. work, as well as some possible
future work. Also, perspectives on motion planning research are discussed.
Chapter 2
Self-driving vehicles raises relevant controversies, especially with recent deadly acci-
dents. Nevertheless, they are still popular and attractive thanks to the improvement
they represent to people's way of life (safer and quicker transit, more accessible, com-
fortable, convenient, ecient, and environment friendly). This chapter presents a
review of motion planning techniques over the last decade with focus on highway
planning. In the context of this thesis, motion planning denotes path generation and
decision making. Highway situations limit the problem to high speed and small cur-
vature roads, with specic driver rules, under a constrained environment framework.
Lane change, obstacle avoidance, car following, and merging are the situations ad-
dressed in this work. After a brief introduction to the context of autonomous ground
vehicles, the detailed conditions for motion planning are described. The main algo-
rithms in motion planning, their features, and their applications to highway driving
are reviewed.
Contents
2.1 Considerations for highway motion planning . . . . . . . . . . . . . . . . . . . 15
2.1.1 Terminology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.1.2 Motion planning scheme . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.1.3 Specicities of highway driving . . . . . . . . . . . . . . . . . . . . . . 18
2.1.4 Constraints on highway driving . . . . . . . . . . . . . . . . . . . . . . 19
2.2 Literature review of motion planning techniques applied on highway environment 20
2.2.1 Taxonomy description . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.2.2 Algorithm classication . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.2.2.1 Space conguration . . . . . . . . . . . . . . . . . . . . . . . 23
2.2.2.2 Pathnding algorithms . . . . . . . . . . . . . . . . . . . . . 27
2.2.2.3 Attractive and repulsive forces . . . . . . . . . . . . . . . . . 29
2.2.2.4 Parametric and semi-parametric curves . . . . . . . . . . . . 31
2.2.2.5 Numerical optimization . . . . . . . . . . . . . . . . . . . . . 33
2.2.2.6 Articial intelligence . . . . . . . . . . . . . . . . . . . . . . 34
2.3 Comparison table for highway applications . . . . . . . . . . . . . . . . . . . . 41
2.4 Conclusion and positioning of the research . . . . . . . . . . . . . . . . . . . . 42
The purpose of this work is to build a taxonomy of motion planning algorithms for highway
driving for autonomous vehicles used like the workaday car of the future.
This chapter is organized as follows: Section 2.1 explains the scope of our classication,
related to the perception and control constraints, and driver/highway driving rules, to high-
light the specic situation of highway driving. Section 2.2 describes the state of the art, and
14 Chapter 2. State of the Art
section 2.3 gives a comparison table for highway applications. Finally, section 2.4 introduces
the present work with respect to the existing state of the art.
2.1. Considerations for highway motion planning 15
This section details the terminology related to the motion planning in subsection 2.1.1 and
the motion planning scheme in subsection 2.1.2. The specicities of highway driving and the
constraints on autonomous driving are listed, respectively, in subsections 2.1.3 and 2.1.4.
2.1.1 Terminology
Before dealing with planning algorithms, one needs to dene the wide terminology involved.
The adjective ego relates to the vehicle that is mastered and sensors-equipped. In contrast,
other vehicles are denoted as obstacles. The kinematics of the vehicle are represented by
its states, i.e. its position and orientation, and their time derivatives (position, velocity,
and acceleration, linear and angular). The geometric state space is called the congura-
tion space. The evolution space identies the conguration space-time in which the ego
vehicle can navigate. Both the conguration and evolution spaces are usually divided into
three subspaces, see Figure 2.1: the collision space, in which the ego vehicle collides with
obstacles; the uncertain space, in which there exists a probability for the ego vehicle to be
in collision; and the free space, in which there is no collision. The generic term motion
characterizes the states' evolution over time. Motion can refer in the literature to either free-
space (spatial geometric zones), path (sequence of space-related states in the free space, i.e.
geometric waypoints), trajectory (sequence of spatiotemporal states in the free space, i.e.
time-varying waypoints), maneuver (predened motion, considered as a subspace of paths or
trajectories, i.e. motion primitives), or action/task (symbolic operations of maneuvers). We
distinguish generation, which builds sequences of paths, trajectories, maneuvers, or actions,
from planning, meaning the selection of one sequence among the generated motions. Finally,
the prediction horizon denotes the space or/and time horizon limits for the simulation of
the motion.
Figure 2.1: Evolution space representation: red = collision space, brown = uncertain space, green = free
space. The evolution space decomposition is based on the perception data symbolized by the gray radii.
Scene Representa on
Moon Strategy
Route Planning
Driver
Decision Making
Generaon
Feedback loop
Control
Actua on
collected, but to its quality such as the measures, their uncertainties, and their trustwor-
thiness. Thus, a scene representation is introduced between the perception / localization /
communication and motion strategy blocks, as presented in [Broggi et al. 2014]. This com-
ponent manages sensors data and provides a perception map with obstacles, lanes, trac,
road, and ego information.
On the other hand, the control block is formally fed with the reference motion decided
by motion strategy, and acts on actuators to move the ego vehicle. The motion strategy
block also reacts as a closed-loop system on the control and actuation blocks. In fact, the
information from the sensors on actuators' outputs provides an up-to-date ego state vector,
so that the motion strategy remains accurate.
Another closed-loop system from the perception / localization / communication block
to the motion strategy block is used to inform the ego perception and localization of the
current and future motion. It also conveys the current and future ego motion intentions to
the environment via communication.
Moreover, with semi-autonomous vehicles, the driver will interfere in the low levels of
automation on the actuators pedals and steering, and on the high levels, so the motion
strategy block acts more or less as a co-driver. In that sense, both communication from the
driver to the motion strategy block and the reverse are necessary to warn the driver of the
motion decision and to take into account his/her intention to drive. However, the driver will
not aect the core of the algorithms in the motion strategy block. In fact, the interactions
with the driver in the loop concern the input and output constraints (e.g. the driver wants to
stay on the right lane), and the integration conditions (e.g. replanning if the driver modies
the planned trajectory). Thus, the presented motion planning methods can be extended to
any SAE levels.
The motion strategy decides the most convenient motion according to some chosen criteria
(addressed in subsection 2.1.4). In [Michon 1985], the author names the three levels of
skills and controls: strategical (planning), tactical (maneuvering), and operational (control)
respectively. These three layers are largely exploited in the recent architecture [Broggi
2.1. Considerations for highway motion planning 17
et al.2014, Ziegler et al. 2014]. The tactical part is usually divided into subparts, with at least
one for behavioral planning and another for motion generation [Ziegler et al. 2014, González
et al. 2016, Rodrigues et al. 2016, Gu et al. 2013]. According to the structures previously
described, we distinguish ve main functions of the motion strategy hierarchy, as shown in
Figure 2.3: (i) route planning, (ii) prediction, (iii) decision making, (iv) generation, and (v)
deformation. The space and time horizons for each of the motion planning functions are
summarized in Table 2.1.
scope for
Decision motion planning
Making (iii)
Figure 2.3: Motion planning functions. Motion planning acts as a global, local, and reactive motion strategy.
The route planner (i) is a trip scheduler; it provides a general long-term planning through
the road network from the initial position to the driver's desired destination. This function
is outside the scope of this work; see [Delling et al. 2009] for a related review.
The second function, the prediction step (ii), stores the current and historic dynamics data
to predict the dynamics of all the elements surrounding the ego vehicle. This process allows
to perform long-term risk estimation and dynamic replanning. The road infrastructure, the
route direction, the driving rules, and the lane marking evolution are usually extrapolated
from map information. The prediction of obstacles' behavior is the most critical task of the
prediction function. Although it is necessary for motion planning, it is not explained in this
work; see [Lefèvre et al. 2014] for a corresponding review.
The core functions addressed in this literature review are decision making (iii), generation
(iv), and deformation (v), which we call the motion planning scope. In recent works [González
et al. 2016], the motion planning approaches are organized as follows:
A high-level predictive planning built around three objectives: criteria evaluation, risk
minimization, and constraint submission (see subsection 2.1.4). Those are used for de-
cision making (iii), i.e. to select the best solution out of the candidates' generation (iv).
One either generates a set of motions and then makes a decision as a motion selector,
or denes the behavior to adopt and then generates a motion to t this behavior. This
high-level stage benets from a longer predicted motion but is time-consuming.
A low-level reactive planning deforming the generated motion from the high-level plan-
ning according to a reactive approach, i.e. the deformation function (v). This acts on
a shorter range of actions and thus has a faster computation time.
18 Chapter 2. State of the Art
Table 2.1: Space and time horizon for the motion planning functions.
Lane keeping: The safety space in front of and behind the ego vehicle is guaranteed.
The longitudinal decision is to maintain the desired speed, whereas the lateral decision
keeps the ego trajectory inside the ego lane marks.
Car following: Besides lane following, the ego vehicle must follow the front ego lane
vehicle while maintaining its longitudinal safety distances.
Lane changing: This decision is made under either directional or obstacles' constraints.
The motion planner must ensure that the space in the target lane is sucient and that
the speed is adequate to keep the ego vehicle in a safe state.
Lateral-most lane changing: To ensure uid trac ow and safety, some driving rules
require leftmost or rightmost lane driving. Hence, the ego vehicle always seeks to change
lanes until the lateral-most lane is reached.
Passing: The ego vehicle respects a lane keeping or car following decision while obstacles
are in the adjacent lanes. Keeping lateral security distances is required.
Overtaking: This complex maneuver consists of a lane change, then passing an obstacle,
and nally another lane change to return to the previous ego lane.
Merging: Two actions occur on a highway: entering or exiting the highway, and merging
two rows of vehicles into one. The ego vehicle must adapt its longitudinal and lateral
speeds and distances to ease its way into trac.
2.1. Considerations for highway motion planning 19
Highway toll: The approach decision is to merge into a ctional lane in anticipation
of a toll lane delimitation and to reduce the speed until stopping, whereas the leave
decision is to accelerate to the reference speed and to merge into a real trac lane.
The emergency maneuvers can either be treated from a motion planning point of view
or from the control one. In the rst case, two strategies are highlighted in the literature:
applying reactive methods for a swerve maneuver to limit the collision, e.g. with a potential
eld approach in [Wang et al. 2019] (see subsection 2.2.2.3) or a graph search in [Jafari
et al. 2017] (see subsection 2.2.2.2), and planning an emergency lane maneuver to drive the
vehicle in a priori safe environments, as suggested in [Vanholme et al. 2010] with a predened
emergency lane trajectory. In the second case, the controller strategy aims to maintain the
vehicle stability and minimize the collision states, as developed in [He et al. 2018].
Autonomous vehicles are also studied in platooning. This conguration decreases the
distances between the vehicles and thus increases the capacity of roads. The motion planning
strategy in platoons must be more robust than for individual vehicles in the sense of smoother
acceleration and turns to guarantee the platoon's stability. Platooning is not as interesting
from the perspective of motion planning as from that of control. We therefore do not consider
any specic studies on platooning in this state of the art work. Please refer to [Axelsson 2017]
for more details.
The main dierences between highway, except for platooning, and city driving consist
in a further look-ahead time, with a stronger focus towards the ahead direction of the road,
whereas city driving involves a closer range of prediction but in all the directions. The highway
dynamics is also simpler with lower turn-angle, no reverse, and less braking/acceleration, but
higher and more constant velocity. Besides, vulnerable road users (e.g. pedestrians, motor-
cyclists) are less likely encountered on highways. Thus, even if there are less hazards, the
risk due to high speed is stronger. Moreover, the higher distances imply poorer sensors
capacities. Finally, less trac insures more stable scenario. The algorithms which consider
all these specicities in real-time will be favored for a practical application on highways.
2.2.2.3) and the numerical optimization (see subsection 2.2.2.5). Indeed, the longitudinal and
lateral constraints on the vehicle dynamics are important to maintain passenger's comfort for
high speeds; thus simple dynamics models are sucient. However, this question of degrees
of freedom is a fundamental design parameter in motion planning and control architecture,
and should be addressed to guarantee a safe and drivable motion [Schürmann et al. 2017], as
well as consistency [Polack et al. 2018]. The last dynamic constraint is the evolution of the
ego vehicle in time. To summarize, the authors in [Rodrigues et al. 2016] identify the quality
requirements for the generated motion: feasible, safe, optimal, usable, adaptive, ecient,
progressive, and interactive.
on highway environment
The choice of the motion planning methods depends on the formulation of the motion planning
problem.
Firstly, the problem formulation strongly varies with the inherited data (discrete/continuous,
algebraic/analytic or static/dynamic) for the scene representation. Thus, the sensors' tech-
nologies are of great importance to implement a motion planning algorithm. Even if the
uncertainties are more and more considered in the motion planning scope, too few papers
addressing motion planning tackle the issue of the sensor's architecture and technology. We
invite the readers to refer to [Gruyer et al. 2017] for a corresponding review.
Secondly, motion planning combines ve unavoidable aspects, as seen in the previous
sections: (i) state estimation, (ii) time evolution, (iii) action planning, (iv) criteria optimiza-
tion, and (v) compliance with constraints. How these are handled changes the outlook of the
problem.
Furthermore, two approaches for reviewing motion planning algorithms coexist: distin-
guishing and not distinguishing the driving modes. For the rst, the driving modes and
use cases are separated. For example, in [Bevly et al. 2016], the authors focus on the lane
changing and merging maneuvers on highways. In this thesis, we consider a situation only as
a specic set of criteria and constraints, not as a dierentiating element for the algorithms.
In this way, all methods could be applied to the dierent situations, with limitations to their
proper functioning. Yet, we noticed that the main functions of motion strategy described
in subsection 2.1.2 involve discriminating specicities among motion planning methods, e.g.
the deformation function requires a reactive online real-time method.
In this section, we rst explain our taxonomy in subsection 2.2.1, and then review the most
popular approaches from their founding principles to their advancement for an application
to highway planning since the DARPA Urban Challenge 2007 in subsection 2.2.2. We do not
pretend to make an exhaustive taxonomy as an innite number of ways of treating motion
planning exist.
logic reacve horizon logic reacve horizon logic reacve horizon logic reacve horizon
heurisc geometric heurisc geometric heurisc geometric heurisc geometric
Space con guraon Path ndings Aracve and repulsive forces Curves
set-algorithm
3 set-algorithm set-algorithm set-algorithm set-algorithm
biomimec solve-algorithm 3 3 3
biomimec biomimec
3
2 biomimec solve-algorithm solve-algorithm solve-algorithm biomimec solve-algorithm
2 2 2 2
1
predicve horizon cognive predicve horizon cognive predicve horizon cognive predicve horizon
1
cognive predicve horizon cognive
1 1 1
0 0 0 0 0
logic reacve horizon logic reacve horizon logic reacve horizon logic reacve horizon
logic reacve horizon
heurisc geometric heurisc geometric heurisc geometric heurisc geometric
heurisc geometric AI Logic AI heuriscs AI approximate AI cognive
Figure 2.4: Taxonomy classication. The radar charts show the distribution of each of the families considering the selected attributes. At the center of the
chart, the methods do not suit the corresponding attribute at all (value = 0). In contrast, an outer position of the attribute means it is well adapted to the
use of the method (value = 3). Each value is assigned according rst to the denitions of the families, and second to their occurrence in the literature.
2.2. Literature review of motion planning techniques applied on highway
environment 23
2.2.2 Algorithm classication
For each family, we review the basic unifying idea, the specicities, the advantages/drawbacks,
the evolution, and the interest for highway planning. One notices that the solutions were rst
dealing with a specic method in a specic scientic eld, and then evolved to multi-eld
mixed algorithms.
2. to exclude the points, cells, or lattices in collision with obstacles or not feasible; and
Figure 2.5: Illustrations of space congurations: (a) sampling points, (b) cells decomposition, (c) lattice. The
points / cells / maneuvers are candidate, dismissed and chosen solutions in blue, red, and green.
24 Chapter 2. State of the Art
Sampling-Based Decomposition (Figure 2.5a) Two main routines are used in the
literature to return a set of sampling points. The rst one chooses the points in the evolution
space of the ego vehicle with respect to the kinematic constraints, but with higher calculation
time due to the optimization choice of the samples under these constraints. The second one
picks random points in the evolution space, so that the algorithm is computed faster but
the method is incomplete, not replicable and sensitive to the random points' distribution.
Moreover, the links between two points are not necessarily kinematically feasible. Both
routines are suboptimal and do not guarantee that a solution will be found if one exists, or that
a solution will be returned in a nite computational time. However, sampling congurations
are exible to dynamic replanning and do not require any explicit modeling of the collision
space. They can thus be used for a reactive trajectory deformation.
The most popular random method is the Probabilistic RoadMap (PRM) [Kavraki
et al. 1996]. It uses random samplings picked in the evolution space during the construction
phase. These samplings are connected to their neighbors to create an obstacle-free roadmap,
which is then solved during a second query phase by a pathnding algorithm, e.g. Dijkstra
(see subsection 2.2.2.2) in [Villagra et al. 2012]. In [Li et al. 2014], the authors rst sampled
the conguration space based on a reference path, e.g. the ego lane centerline, selected the
best set of sampling points according to an objective function, and then assigned a speed
prole to the path to respect safety and comfort criteria.
A better strategy is to consider both space and time dimensions in the decomposition.
Dealing with spatiotemporal sampling points makes it possible to obtain a predictive
algorithm, as was done in [Hesse et al. 2010], where points are constrained in a 5-dimensional
evolution space with vehicle position, orientation, velocity, and arrival time. Considering the
drawback of tiny space and space close to obstacles, adaptive samplings can be adapted to
autonomous vehicles, as presented in [Hsu & Sun 2004] for robots from 2 to 8 degrees of
freedom.
These methods are usually preferred for mobile robotics or autonomous vehicles in un-
structured environments. Their use for highly structured highway planning is thus dimin-
ished.
Connected Cells Decomposition (Figure 2.5b) The methods rst decompose the space
into cells using geometry, and then construct an occupancy grid and/or a cells connectivity
graph, see Figure 2.6 for application examples. In the occupancy grid approach, a grid is
generated around the ego vehicle. The information on the obstacles' detection is overlaid on
the grid. In case uncertainties are considered, stochastic weights are added to the cells to
obtain a cost-map representation. The main drawbacks of an occupancy grid are the large
memory requirements and high computation time, the false indicative occupation with moving
obstacles, and a space- and time-varying resolution. In the connectivity graph approach, the
nodes represent the cells, and the edges model the adjacency relationship between cells. The
graph can be interpreted as a path along the edges of the cells or a path to nd inside the
connected cells.
Two strategies are distinguished: those that are based on the obstacles, and those that
are not. The representation of obstacles plays a key role for cells decomposition algorithms.
Obstacles are usually represented as convex polygons [Kuan et al. 1985], rectangular [Liu
et al. 2017a], triangular [Nilsson et al. 2017], circular [Song et al. 2013, Choi 2014, Mouhagir
2.2. Literature review of motion planning techniques applied on highway
environment 25
et al. 2016], ellipsoidal shapes [Claussmann et al. 2015, Wang & Ayalew 2016], based on the
obstacle dimensions and speed; or the entire road lane [Yu et al. 2016]. For non-obstacle-
based representation, the cells' organization can be determined oine, and then lled online.
The grid is rapidly obtained, but does not take advantage of the environment properties. On
the other hand, the obstacle-based representation builds an online grid, which is more com-
putational as replanning is necessary to consider the dynamic of the environment. Whatever
the decomposition, probabilistic occupancy can be applied to dene a more realistic decom-
position proportional to the probability of occupancy, as proposed in [Moras et al. 2011].
The most intuitive non-obstacle-based method is the exact decomposition (Figure 2.6e),
which separates the space with vertical and/or horizontal segments. An egocentric exact de-
composition adapted to the vehicle's size and the obstacle's relative speed is used to exploit
an occupancy grid for a Markov Decision Process in [Mouhagir et al. 2016] and a lane change
ow chart in [Tehrani et al. 2015] (see subsection 2.2.2.6). Reference [Yu et al. 2016] lls the
occupancy grid with a belief state from the semantic lane information. In [Ardelt et al. 2010],
8 overlapping static and dynamic cells are mapped based on the lane conguration and per-
formance of integrated sensors. Considering the nonholonomic behavior of the vehicle, the
curvilinear or polar grids [Moras et al. 2011] provide a more realistic decomposition around
the ego vehicle. However, as non-obstacle-based occupancy grids cannot evolve according to
the obstacles' dynamics, they quickly become obsolete due to the computational cost of a
rened decomposition. Another drawback is the lack of accuracy on obstacles' positions.
To consider environment uncertainties, the Vector Field Histograms (VFH) from
[Borenstein & Koren 1991] statistically models the evolution space with a velocity histogram
occupancy grid as a polar histogram (Figure 2.6d). Reference [Qu et al. 2015] proposes a
constrained VFH to treat kinematic and dynamic constraints to extend to highway applica-
tions. VFH methods are mostly used for deformation function as a reactive planner, as they
are robust to sensors' uncertainties [Lee et al. 2014].
To reduce the search space, decrease the computation time and deal with dynamic obsta-
cles that are either hard to model or not modeled, the Dynamic Window (DW) approach,
26 Chapter 2. State of the Art
introduced in [Fox et al. 1997], reduces the search space into a reachable velocity space within
a short time interval (Figure 2.6f). It is also used as a reactive planner for trajectory deforma-
tion. In [Kang et al. 2015], DW is directly applied on an image from the camera. Reference
[de Lima & Pereira 2013] combines a DW approach for avoiding unmodeled obstacles with
a Velocity Vector Fields (VVF) (see subsection 2.2.2.3). The authors in [Mitsch et al. 2013]
demonstrate safety recommendations for both static and moving obstacles using DW.
The routines of the second type use the obstacles to set the cells decomposition. The
Voronoi decomposition [Kuan et al. 1984] builds cells between particular points, which
represent each obstacle (Figure 2.6b), mainly using the Euclidean bisection (L2 Euclidean
distance norm) or absolute value (L1 Manhattan distance norm). The method is extended
to polygonal obstacles with the generalized Voronoi diagram [Aurenhammer 1991]. The
decomposition generate a non-regular grid and is classically interpreted as the way that is
equidistant from each obstacle (way along the edges), i.e. a skeleton. Reference [Villagra
et al. 2012] uses Voronoi diagrams to reduce the obstacle-free point's space, whereas the
authors in [Lee et al. 2014] propose a real-time algorithm based on one Voronoi cell built
on the next collision point in the ego vehicle trajectory. To increase the distance between
an obstacle and the ego vehicle, weighted [Sugihara 1993] and uncertain [Ok et al. 2013]
Voronoi diagrams have been developed in mobile robotics, but they are not that common for
autonomous vehicles. The major drawbacks of Voronoi decompositions are the heterogeneous
cells sizes, the kinematic feasibility of linking adjacent cells, and the dynamic evolution,
which involves time-consuming replanning. Moreover, the equidistance does not necessarily
guarantee safety. In response to these issues, the approximate methods split up into
thinner cells when the obstacles are closer to obtain a more accurate occupation grid. A
classic approximate approach is the quad tree decomposition, as in [Trepagnier et al. 2011].
Building a dynamic cells decomposition remains the major drawback of the Voronoi and
approximate approaches.
A major improvement is then to add a time dimension to the spatial decomposition.
The visibility decomposition (Figure 2.6a) picks points of interest in the scene represen-
tation, and links them with segments if these do not intersect obstacles [Lozano-Pérez &
Wesley 1979]. They are extended to a path velocity representation in [Kant & Zucker 1986].
The points of interest can be based on the obstacles' vertices [Johnson & Hauser 2013] or
along the road border [Choi 2014]. This implies that the path along the edges can be in
contact with obstacles, which is then adapted to the highway planning problem by space
shifting the trajectories, as in [Johnson & Hauser 2013]. In addition, the driving corridor
representation uses road boundaries and the spatiotemporal positions of obstacles to produce
a set of free spatiotemporal evolutions as corridors [Ziegler et al. 2014]; see Figure 2.6c. From
a static decomposition based on visibility decomposition as graph connectivity, [Choi 2014]
builds a dynamic corridor from Velocity Obstacles (VO) analysis. This spatiotemporal
algorithm returns the set of all the velocities of the ego vehicle that lead to a collision [Fiorini
& Shiller 1998]. A velocity outside the spatiotemporal representation guarantees that there
will be no collision under the hypothesis that the obstacle velocity prediction is correct. The
passage and region channels are then solved with a connectivity graph [Kuan et al. 1985]
or with a space constraint-based optimization method [Gu et al. 2016b]. Reference [Ben-
der et al. 2015] builds the corridors with the homotopic method to enumerate the possible
maneuver variants from a path velocity decomposition and [Altché & De La Fortelle 2017]
2.2. Literature review of motion planning techniques applied on highway
environment 27
proposes a four collision-free cells partitioning to design a spatiotemporal transition graph.
A major drawback of driving corridor algorithms is still their calculation time.
Most static cells decompositions are of little interest in the context of highway planning.
Methods that are only based on obstacle decomposition exclude the road geometry, which is
at the core of highway driving. In contrast, algorithms that benet from both road constraints
and dynamic obstacles are of major use for highway planning.
Trees (RRT see below for details) comprises both set- and solve-algorithms, with motion
generation and selection. The main use of pathnding algorithms is for route planning, but
they adapt well for local planning and applications to highways as predictive algorithms. As
benets, these algorithms are universal and widely used, and solve either known or unknown
environments. The main drawbacks are their dependency on the graph size and complexity,
which aects the choice of the solver, and their need for detailed information on the space
conguration, which makes them slow in vast areas. We will restrict our review to the most
frequently observed algorithms for highway autonomous driving.
Explored States Selected Path
Cs-i Cs-i
Start State Goal State Start State Goal State
Obstacle Obstacle
(a) Dijkstra
(b) A*
Figure 2.7: Illustrations of the processes of (a) Dijkstra, (b) A*, and (c) RRT.
(a) APF in [Wolf & Burdick 2008]. (b) VFF in [Hesse et al. 2010]. O1 (c) Elastic Band Model in
The yellow square denotes the start- and O2 are moving obstacles. [Gehrig & Stein 2007].
ing position and the green circle the
end position. Lower potentials are in
blue, higher in red.
partial potential to distinguish between emergency reactions and preventive actions. The
benet to time consideration with a velocity potential leads the ego vehicle to progress for-
ward smoothly, as emphasized in [de Lima & Pereira 2013] with the use of a Velocity Vector
Field (VVF). Furthermore, APF returns direct control inputs [Gerdes & Rossetter 2001] or
constraints for the optimization solver [Liu et al. 2017a, Rasekhipour et al. 2017]. The rst
drawback for application to vehicle planning is the oscillatory behavior when close to obsta-
cles, but smoothing algorithms overcomes this [Galceran et al. 2015]. The second diculty
is the presence of local minima. Not only is the ego vehicle stuck in a local minimum, but
overcoming this issue impacts the smoothness of the path and calculation time. A trivial so-
lution is to add a heuristic to exit the minima with a randomized path, as is done in [Bounini
et al. 2017], where local minima are also repulsive articial potentials. This tends towards
the elimination of the local minima afterwards. However, local minima might be necessary
in highway planning to keep the ego vehicle safe from inopportune lane changes, as stated in
[Wolf & Burdick 2008].
Extensions to uncertain environments allow to transpose the algorithms from a determin-
istic robotics environment to highway planning. The Virtual Force Field (VFF) introduced
by [Borenstein & Koren 1991] uses the VFH decomposition (see subsection 2.2.2.1) and in-
terprets the probability distribution as potential forces to guide the vehicle along the weakest
grids. This method is adapted to highway driving in [Hesse et al. 2010] to deform a trajectory
previously obtained with sampling points and A* methods; see Figure 2.8b.
Another drawback of the APF highlighted by [Khatib 1986] is the lack of dynamic rea-
soning: namely, only spatial dimensions are used for dynamic obstacle avoidance, so that the
ego vehicle always tried to avoid the obstacle by bypassing it. The elastic band algorithm
[Quinlan & Khatib 1993] models the environment as a spring-mass system, considering N
discrete nodes on which potential forces are applied [Song et al. 2013]; see Figure 2.8c. Ref-
erence [Gehrig & Stein 2007] applies elastic band to car following as a deformation of the
leader vehicle path and [Keller et al. 2014] develops a time-related elastic band framework
with temporal waypoints for lane changing.
Attractive and repulsive forces are widely implemented for reactive planning in all kinds
of robotics. Except the high time calculation, they prove their eciency for highway plan-
ning, thanks to a straightforward integration of the scene representation, provided that the
potential functions are well chosen within the environment.
2.2. Literature review of motion planning techniques applied on highway
environment 31
2.2.2.4 Parametric and semi-parametric curves
Parametric and semi-parametric curves are major geometric methods in path planning algo-
rithms on highway for at least two reasons: (i) the highway roads are built as a succession
of simple and predened curves (line, circle, and clothoid [Baass 1984]); and (ii) a prede-
ned set of curves is easy to implement as candidate solution sets to test. Moreover, as
some of the curve-based algorithms directly take into account the kinematic constraints of
the vehicle, they are widely used to complement other methods. Geometric considerations
are usually separated from the dynamic constraints. For instance, the authors in [Villagra
et al. 2012, Lima et al. 2015b] rst solve a static problem considering the geometry of the
road, and then solve the dynamic dual problem by searching for a speed prole built to adapt
to the curvature prole of the road and to respect the constraints of the dynamic obstacles.
In contrast, [Vanholme et al. 2013] rst xes the speed prole and then deforms the curve
path. Those decoupled approaches could lead to a long time reaction for the vehicle in case of
a blocking situation or replanning. Otherwise, curve planners are well suited to a predictive
approach too, as they dene a motion from a start to a goal point or cell or maneuvers. This
property is also suitable for replanning stages.
(a) Tentacle scheme in [Von Hun- (b) Bézier curve in [Chen et al. 2013]. Pi , Qi are control
delshausen et al. 2008]. points.
We distinguish between two exploitations of curve algorithms. First, the point-free curves
subfamily is used to build kinematically feasible trajectories as a set of candidate solutions
(maneuvers). Second, the point-based subfamily uses curves to t a set of chosen waypoints
(sampling points or cells). The rst routine is a set-algorithm and needs a decision-maker to
return the most convenient maneuver, whereas the second one requires a space decomposition
before tting a path or a trajectory, and is thus a solve-algorithm. Both are considered for
the generation function.
The point-free curves subfamily refers to the principle of lattice, called tentacles algo-
rithm, as introduced in [Von Hundelshausen et al. 2008]; see Figure 2.9a. Instead of a space
decomposition, the tentacles are based on primitives parametric curves, such as lines and
circles, clothoids, and sigmoids. Each tentacle is obtained with dierent lateral, i.e. steering
wheel angle, and/or longitudinal, i.e. speed, parameters. Moreover, as the search space for
solutions is reduced, the computational time is limited compared to the space decomposition
methods. The tentacles can also be calculated oine, as a trajectory data base [Cherubini
et al. 2012].
Reference [Dubins 1957] shows that line and circle paths are the solution for curves
32 Chapter 2. State of the Art
of MPC algorithms is their replanning ability, but they are still too poor for non-convex and
high complexity problems.
Dynamic Programming (DP) draws its ecacy for complex computational problems
by breaking them into simpler subproblems, even with interdependency. The resolution of
each subproblem is combined to nd the global problem solution. This implies that the
description of the problems has good characteristics. In [Choi 2014], the author searches for
the shortest path in a cells decomposition using a pre-calculated DP solution. The authors
in [Gu et al. 2016a] use DP to calculate the optimal cost-to-go value of a set of maneuvers
for dierent speed proles.
Numerical optimization is widely used in motion planning, either to decrease the solving
time of a graph's exploration, or to exploit the mathematical properties of the problem.
These algorithms can be solved by generic numerical resolution tools. The main encounter
frameworks in the literature are CVX and CVXGEN softwares, Gurobi and YALMIP solvers,
Matlab Optimization Toolbox, NPSOL package, and the ACADO Toolkit.
Learning SVM
Game Evoluonary
Human-like Heuris c
Task Agent
Cognive Risk
Neural Network Raonal
Beliefs
Markov
Approximate
Logic
reasonning
Rule-based
Fuzzy Logic FSM
Rules Decision tree
Figure 2.11: Illustration of an FSM for highway [Wang et al. 2015]. Ci , Ei , Fi are specic conditions to
transit between the vehicle behaviors Si .
In case of uncertainty, Bayesian networks using Markov models are employed. They are
statistical representations of causal links, based on probabilistic transitions. They present
a knowledge-based identication step of their parameters to determine the most likely se-
quence of states to the sequence of outputs. Reference [Rodrigues et al. 2016] matches the
most expected obstacles' behavior intention to return the best ego behavior maneuver (stop |
cruise | accelerate | decelerate and turn-right | turn-left | straight). The authors in [Mouhagir
et al. 2016] develop a Markov Decision Process (MDP) on the choice of tentacle trajecto-
ries, and the one in [Zhou et al. 2017] for a lane-staying or -changing decision. The advantage
of MDP is their ability to evaluate several predictions at the same time. Behavior improve-
ments are shown using a Partially Observable Markov Decision Process (POMDP) with a
probability distribution over the set of possible states in [Ulbrich & Maurer 2015, Galceran
et al. 2017, Li et al. 2018].
The AI logic-based methods are mostly suitable in constrained and predictable environ-
ments, such as nominal highway driving. Their intuitive and fast architectures are widely
promoted for use in critical safety environments, where cause-and-eect reasoning is neces-
sary. On the other hand, their lack of autonomy and rigid program structure for an adaptable
and recongurable algorithm are their main disadvantages for use in open environments such
as highways. As the logic-based approaches are straightforward to implement, no specic
framework stands out in the literature.
On the other hand, their main drawbacks are their lack of traceability and the absence of a
systematic design methodology. The authors in [Balal et al. 2016] propose a fuzzy inference
decision system to check whether the ego vehicle has enough time to change lanes based on a
close, medium, or far evaluation of the gaps between the surrounding vehicles; see Figure 2.12.
References [Naranjo et al. 2008] and [Milanés et al. 2011] use fuzzy rules on lateral and an-
gular error, respectively, to propose automatic overtaking maneuvers and lateral following of
a reference map.
Figure 2.12: Illustration of the fuzzy logic decision in [Balal et al. 2016].
Articial Neural Networks (ANN) are a popular form of approximate learning solu-
tions. ANNs imitate the low-level structure of biological neural networks, i.e. connected neu-
rons in overlaid layers. The learning process is based on error retropropagation to adjust the
neural connections' weights with dierent learning strategies, e.g. supervised, unsupervised,
reinforcement learning. They are used for both generation and decision making functions,
as a set- and solve-algorithm in predictive and reactive approaches. Their main drawback
for autonomous driving is the absence of a causal explanation to a solution, while their main
advantage is their ability to learn by training on multidimensional data. Another disadvan-
tage is thus the large amount of data required, which makes them computationally expensive.
Reference [Lefevre et al. 2016] proposes a combination of Hidden Markov Models (HMM) and
Gaussian Mixture Regression to mimic the driver's acceleration and braking model, whereas
[Geng et al. 2016] learns the driving speed proles with a Levenberg-Marquardt algorithm
on a 3-layer ANN with 20 hidden nodes trained on 600 driving trips. The authors in [Re-
hder et al. 2017] imitate driver trajectories for lane changing with Convolutional Neural
Networks (CNN), and [Bojarski et al. 2016] compares the learning stage with human driver
control inputs. Reinforcement learning is used in [Li et al. 2018] to determine the policies
for each agent in the scene representation, or for multi-goal overtaking maneuvers in [Ngai
& Yung 2011] and automated lane change maneuvers in [Wang et al. 2018a]. Some new
promising research is in progress using the deep learning methods [Chen et al. 2015b], or by
integrating the perception and planning blocks in end-to-end learning [Yang et al. 2018].
2.2. Literature review of motion planning techniques applied on highway
environment 39
Neural networks are also extended in case of uncertainty with Bayesian probabilities in belief
networks [Huang et al. 1994]. However, one of the challenges of these methods is ensuring
that they learn in the correct way with the diversity of environments and behaviors of driving.
AI approximate reasoning algorithms seem to be highly promising for the near future.
They rely on logic and statistical bases, which are extended to cognitive properties. They
also provide adaptive reasoning to evolve appropriately to their environment. At the moment,
the main impediment to these algorithms is the lack of feedback in real driving scenarios.
As the real-time implementation of approximate reasoning algorithms is complex, the use of
specic frameworks is favored, e.g. Matlab Fuzzy Logic Toolbox for fuzzy logic approach or
TensorFlow for machine learning.
troduce a Time of Zone Clearance (TZC), combined with the perceived risk on speed,
distance, safety, and comfort, and based on the proposition that risk appears if trajectories
overlap.
Even if compensation risk estimators seem to be more faithful to the human's decision,
the above risk approaches are usually based on one factor of risk, whereas risk warning should
take into account all factors of the scene representation. Reference [Prokhorov 2009] proposes
a two-level risk estimator aected by weather, trac, or road conditions, and then rened
with real-time information about the ego vehicle surroundings. Promising risk assessments
have been developed combining usual risk estimators and belief theory, as proposed in [Daniel
et al. 2013] to relax thresholds applied to fuzzy sets.
Figure 2.13: Illustration of the rst elementary tasks for "passing" in [McKnight & Adams 1970a].
The second approach comprises taxonomic models, which are also popular symbolic
models for human decisions. They identify the sequential driving tasks relations, with behav-
ioral and ability requirements. Those methods call for a logical organization, and represent
operational research task scheduling. They must then be interpreted either as space-time-
action constraints or with a local planner, e.g. the curve planner discussed in subsection
2.2.2.4. One of the disadvantages is their dicult explicit description, as they are either
focused on a driving task, such as lane changing or merging, or hardly exhaustive. How-
ever, they present the advantages of providing an abstract and universal representation and
easy replanning. For example, McKnight and Adams list 45 major driving tasks in [McK-
night & Adams 1970b, McKnight & Adams 1970a, McKnight & Hundt 1971b, McKnight &
Hundt 1971a], decomposed into 1700 elementary tasks; see Figure 2.13. References [Naranjo
et al. 2008, Chen et al. 2015a] dene the sequence of operations to perform an overtaking
maneuver; each action is then validated under a set of numerical criteria to prompt the next
action. The authors in [Bahram et al. 2014] propose a functional architecture of the driving
strategy as a discrete set of behavioral strategies for a specic trac situation.
Third, some learning approaches are inspired by interaction models introduced in game
theory. The idea for highway planning is to consider the moving vehicles as players that
observe each other's actions and consequently react with an appropriate strategy. The main
advantage of this method is that it quickly obtains a trained driver model, starting with a
merely basic one. The main drawback is that it assumes that all the players respect the
rules, and it can therefore lead to unsafe reactions in real-life applications. Moreover, one
should make sure to learn using various behaviors of the players to enrich the knowledge.
In [Li et al. 2018], the models develop more complex strategies as they are trained against
the other behaviors. The authors use POMDP to model the players' knowledge and Jaakola
2.3. Comparison table for highway applications 41
The highway applicability of the previously described methods is summarized in Tables 2.2
and 2.3. We propose to quantify the constraints' assessment with a ` / ∼ / +' scale. We assess
` ' as being highly inappropriate, `' inappropriate, `∼' intermediary, `+' appropriate, and
`++' highly appropriate in Table 2.2. The references are gathered in Table 2.3, and illustrate
the use cases of the families to suggest their situation adaptiveness (collision avoidance,
car following, lane change, merging, overtaking) and their implementation in simulation or
experimentation, specied with the square brackets' superscript and subscript respectively.
To quote but a few, simulators such as CARLA, PreScan, SCANeR studio, TORCS are
developed for autonomous driving.
Unlike the classication in [Paden et al. 2016], we do not focus on the usual points of
comparison, such as completeness, optimality, or time complexity, but on how the algorithms
oer an eective and eciently implementable response to practical applications. Therefore,
one can interpret Table 2.2 as a guide to choose the most appropriate family given the
attributes of the algorithm, and its intrinsic and extrinsic limits. Furthermore, one will
notice that references often apply to dierent families. In that sense, Table 2.2 also helps to
understand the complementary methods for a systemic motion planner design.
Among the characteristics highlighted, the taxonomy criteria can be found in subsection
2.2.1: (i) the type of use of set- and solve-algorithm, (ii) the predictive or reactive horizon,
and (iii) the mathematical domain; and the motion planning functions in subsection
2.1.2 (Figure 2.3).
As intrinsic limits, we distinguish the performance, ease of use, and data analysis. The
performance rst gathers the real-time implementation. In the case of highway motion
(> 17m/s), with an error of a meter, the motion planning algorithm should return a solution
within the order of magnitude of 10ms to be real-time. As the intrinsic cost of each algorithm
strongly relies on the hardware platform, we inform about qualitative, but not quantitative
computation. The other performance requirements are derived from the analysis proposed
in [Khatib & Le Maitre 1978]: robustness of the algorithm to nd a solution despite the
variation of the environment (merging, jam approach); stability to keep the solution despite
environmental changes; adaptability to perform a solution in various conditions, e.g. intro-
ducing new scenarios or criteria; and the feasibility of converging to a solution in a nite
time. The ease of use is dened by the ability to replan in real-time without changing the
structure of the algorithm. The last intrinsic limit relates to the data analysis: the input
type for the scene representation (discrete, sampled, continuous) and the output type of
42 Chapter 2. State of the Art
the algorithm (space, path, trajectory, maneuver, task), along with the ability to deal with
uncertain data.
The extrinsic limits show the algorithm's dependency on high-level sensors of the ego ve-
hicle and/or the infrastructure. This aspect is important to reect the real applicability in a
near or further future. If the algorithm works well only with precise information over a large
time horizon, which does not correspond to the current high-level sensors' properties, it will
be highly inappropriate for sensors constraints. We invite the readers to consider [Gruyer
et al. 2017] as a reference for sensors' properties. As stated before, the specicities of the driv-
ing environment and vehicle kinematics must be considered. We therefore consider whether
the algorithm performs well in complex environments (dense trac, various topologies)
and takes the constraints of the environment and the ego vehicle kinematics into
account.
To conclude, our literature review revealed a considerable amount of algorithms for motion
planning in robotics. The objective of this chapter was to identify the main methods of motion
planning for autonomous vehicles in the context of highway driving, in order to propose new
research contributions. We do not claim to have exhaustively gathered all algorithms used for
this application, but we hope to have shown the diversity and potential of highway planning.
In addition, we propose a novel decomposition of the methods for the state of the art. In
particular, we believe that the radar chart illustration of dierentiating attributes will help
future research to identify and orient the work towards solutions that will best address the
problem.
From this literature review of motion planning algorithms, we were able to identify dif-
ferent areas of improvement to study for the positioning of our research.
The rst area consists in the improvement of the planning architecture, in particular,
the relationship between generation and decision making. In previous works, the functions
of motion generation and decision making are addressed separately or in a sequential way,
applying decision making to decide on a maneuver and then motion generation to properly t
this maneuver, or vice versa. However, this separation may lead to a nal motion that will not
be consistent or optimal with respect to the navigation environment and associated criteria.
For example, during an overtaking maneuver, the return lane change may not be optimal
if there is another obstacle on the arrival lane. In this case, it may be better to generate a
motion to stay on the passing lane for a second passing. We therefore propose a new combined
architecture for decision making and motion generation so that both functions are optimized
in relation to each other. The advantage of this architecture is also its ability to work in
a continuous solution space (no need for discretization). Similar combined approaches also
emerge within the research in perception that uses learning methods. Nevertheless, one of
the major drawbacks of these learning algorithms is the lack of cause and eect explanations
and reproducibility. In fact, they are not acceptable for decision making that involves human
safety. Therefore, we will present a deterministic decision-maker, as well as a deterministic
characterization of the evolution space.
A second area of improvement concerns the workspace in the context of motion generation.
2.4. Conclusion and positioning of the research 43
Indeed, many approaches rely on a discretization of the evolution space. This is the case for
methods of space conguration by sampling, cells decomposition, or pathnding algorithms.
Other methods consider an incomplete evolution space by selecting predened paths, such as
lattice or tentacle approaches. By doing so, these methods do not allow the optimal motion
to be found since the evolution space is truncated. Continuous space approaches such as
homotopy or eld methods have a major disadvantage in terms of computation time and are
therefore forced to be used as a reactive rather than a predictive approach to be embedded in
real time on the vehicle. To address the workspace problem, we consider the navigable space
as a set of intervals. These intervals are previously reduced to a collision-free space, i.e. the
vehicle is in a non-conict situation. The distinction between intervals is made by road lanes
and spaces between obstacles. Finally, the high-level maneuvers are carried out to move from
one interval to another. It should also be noted that the interval delimits a continuous space.
Moreover, the dynamic trajectory is often optimized separately from the static path, as
in curves approaches. This is why we propose to work on the intervals previously discussed
in space and time as a third area of improvement. The prediction of obstacles and the
dynamically feasible motions of the vehicle ego characterize the spatiotemporal intervals.
Finally, with regard to the decision making or selection part, the main drawback of ex-
isting methods concerns the multi-criteria approach, while taking into account uncertainty
and trustworthiness. Many methods correctly manage a multi-criteria decision but it is then
necessary to deal with homogeneous criteria or in limited number for reasonable calculation
time and to avoid combinatorial explosion, as in the case of pathnding algorithms. Logic
algorithms or risk approaches may be too rigid for real life cases and quite complex in man-
aging information from dierent sources. In addition, it is most often a matter of setting
acceptance thresholds, which results in a loss of information for decision making. This fourth
area of improvement is thus addressed by using fuzzy logic to easily combine heterogeneous
criteria and integrating it into a belief theory framework. The advantage of this framework
is to assign criteria into categories corresponding to dierent decision-states, such as safety
around the vehicle, the dynamic limits of the vehicle, passenger comfort or compliance with
driving rules. The categories are then considered as dierent sources of information to obtain
the nal decision. Moreover, the consideration of uncertainties and trustworthiness becomes
possible by using fuzzy logic and the weighting of criteria or categories. This framework also
makes it easy to adapt the vehicle's behavior to the driving style. Nevertheless, the resulting
function for decision making is no longer explicit. It is then impossible to use methods of
numerical optimization from the state of the art. To overcome this, we consider the advantage
of heuristic approaches that allow us to nd a near-optimal solution in a nite time. Besides,
it becomes possible to adjust the calculation time according to the encountered situation,
while guaranteeing to return an acceptable solution.
These identied areas of improvement are explored in the next two chapters. Chapter 3
will detail the contribution to the decision making function. The planning architecture as
well as the space-time evolution space are broached in Chapter 4.
Table 2.2: Comparison table for highway applications of motion planning methods (`−−' very inappropriate, `−' inappropriate,`∼' intermediary, `+' appropriate,
44
`++' very appropriate).
Adaptability
Output type
Constraints
Uncertainty
Type of use
Robustness
Kinematics
Complexity
Input Type
Replanning
constraints
Real-time
Feasibility
Functions
Highway
Stability
Horizon
Domain
Conguration space sampling se P/R G gen ∼ − ∼ ∼ − ++ s sp − ∼ − + ++
Non-obstacle-based cells
Exact decomposition; Polar grid se P/R G gen − ∼ − ∼ ∼ − d sp/pa ∼ ++ − −− −−
Obstacle-based cells
Voronoi ; Approximate se P/R G gen −− ++ ∼ + ∼ −− d sp/pa + − − + −
Lattice
Maximum turn; Curvature Velocity se P G gen ∼ + + + + + c tr/ma −− + − + ++
Pathndings
Dijkstra so P L dec − ∼ + + ++ ∼ s pa − + −− −− −−
A∗ so P H/L dec s pa
Articial forces
APF; VFF; Elastic band se/so R B gen/def − + + + + ++ c sp/pa/tr + + ∼ ++ +
Parametric curves
Line and circle se P/R G gen + − −− + + + s/d/c pa/tr/ma −− + −− + ∼
Semi-parametric curves
Polynomial; Spline; Bézier so P/R G gen ∼ − −− + ++ + s/d/c pa/tr/ma ∼ + + + ++
Legend:
Characteristics se/so = set-/solve-algorithm; P = predictive / R = reactive; G = geometric / H = heuristic / L = logic / C = cognitive / B = biomimetic; gen =
Adaptability
Output type
Constraints
Uncertainty
Type of use
Robustness
Kinematics
Complexity
Input Type
Replanning
constraints
Real-time
Feasibility
Functions
Highway
Stability
Horizon
Domain
Mathematical optimization
LP; NLP; QP; MPC; DP so P H/L gen/dec + + + ++ ∼ + s/d/c pa/tr/ma ∼ + + ++ ++
AI Logic
if-then-rules so P/R L dec ∼ + + − + ∼ d ta − ++ − ++ ∼
AI Heuristic
Agent se/so P/R H/L/B gen/dec − ∼ − ++ ∼ + s/d/c pa/tr/ma + + + ++ +
AI Approximate
Fuzzy logic so P/R H/L/C dec ∼ + + ∼ + ∼ c ma/ta + ++ ∼ ++ +
AI Cognitive
Risk so P/R L/C dec ++ ++ ++ + ++ ++ d ta ∼ −− + ++ ∼
Legend:
Characteristics se/so = set-/solve-algorithm; P = predictive / R = reactive; G = geometric / H = heuristic / L = logic / C = cognitive / B = biomimetic; gen =
generation / dec = decision making / def = deformation.
Data Analysis d = discrete / s = sampled / c = continuous; sp = space / pa = path / tr = trajectory / ma = maneuver / ta = task
45
46 Chapter 2. State of the Art
Table 2.3: Use case references table for highway applications of motion planning methods.
QP; ∗
[Ziegleret al. 2014]E [Qianet al. 2016]S [M illeret al. 2018]∗S
o
MPC; [Limaet al. 2015a]∗S [Lef evreet al. 2016]∗E [W ang&Ayalew 2016]oS
[Altchet al. 2017]∗S [Cardosoet al. 2017]∗E [Liuet al. 2017a]∗S
[N ilssonet al. 2017]oS
DP [Choi 2014]∗S [Guet al. 2016a]∗S
AI Logic
if-then-rules [V anholmeet al. 2013]∗E [Constantinet al. 2014]∗S
[Claussmannet al. 2015]lc o lc
E [N ilssonet al. 2015]S [N ilssonet al. 2016]E
[Liet al. 2017]∗S
FSM [Ardeltet al. 2010]eS [Ziegleret al. 2014]∗E [T ehraniet al. 2015]lc
S
[W ang et al. 2015]∗S
Dynamic Bayesian [U lbrich&M aurer 2015]lc E [M ouhagir et al. 2016]S
o
∗ ∗
[Rodrigueset al. 2016]S [Galceranet al. 2017]E [Zhouet al. 2017]∗S
[Liet al. 2018]∗S
AI Heuristic
Agent [Galceranet al. 2017]∗E [Liet al. 2018]∗S
SVM [Huy et al. 2013]∗S [V allonet al. 2017]lc
S
Evolutionary [Balujaet al. 1997]∗/ [Onievaet al. 2011]lcS [Riaz et al. 2015]ca
S
AI Approximate
Fuzzy logic [N aranjoet al. 2008]oE [M ilanset al. 2011]∗E [Balalet al. 2016]lcS
ANN [N gai&Y ung 2011]oS [Chenet al. 2015b]∗E [Bojarskiet al. 2016]∗E
[Geng et al. 2016]∗S [Lef evreet al. 2016]∗E [Rehderet al. 2017]∗E
[Liet al. 2018]∗S [W ang et al. 2018a]∗S [Y ang et al. 2018]∗E
Belief Network [Huang et al. 1994]∗E
AI Cognitive
Risk [T aylor 1964]∗/ [W ilde 1982]∗/ [Ghummanet al. 2008]caE
[P rokhorov 2009]∗S [Ardeltet al. 2010]eS [Bautinet al. 2010]∗S
[Glaseret al. 2010]∗S [T amkeet al. 2011]∗S [Danielet al. 2013]∗S
[W ang&Ayalew 2016]oS [N aumann&Stiller 2017]∗S
[W agneret al. 2018]caS
Task [M cKnight&Adams 1970b]∗/ [M cKnight&Adams 1970a]∗/
[M cKnight&Hundt 1971b]∗/ [M cKnight&Hundt 1971a]∗/
[N aranjoet al. 2008]oE [Bahramet al. 2014]∗S [Chenet al. 2015a]oS
Game [Liet al. 2018]∗S
Legend: Use case [ ]S = simulation / [ ]E = experimentation; [ ]ca = collision avoidance / [ ]cf = car
following / [ ]lc = lane change / [ ]m = merging / [ ]o = overtaking / [ ]e = emergency / [ ]∗ = no specic use
case.
Chapter 3
This chapter considers the problem of high-level decision process for autonomous
vehicles on highways. The goal is to select a predictive reference trajectory among
a set of candidate ones, issued from a trajectory generator. This selection aims at
optimizing multi-criteria functions, such as safety, legal rules, preferences and com-
fort of passengers, or energy consumption. This work introduces a new framework
for Multi-Criteria Decision Making to provide a risk assessment. The proposed ap-
proach adopts fuzzy logic theory to deal with heterogeneous uncertain criteria and
the Dempster-Shafer Theory to aggregate the fuzzy information while considering
dierent sources of risk. Simulation results using existing datasets are presented on
car following cases, and extended to lane changing situations.
Contents
3.1 Literature review and motivation . . . . . . . . . . . . . . . . . . . . . . . . . 51
3.2 Multi-criteria selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.2.1 Criteria categorization . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.2.2 Category and criterion weight . . . . . . . . . . . . . . . . . . . . . . . 53
3.2.3 Predictive criterion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.2.4 Criterion fuzzy value . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.3 Fuzzy Dempster-Shafer algorithm . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.3.1 Basic concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.3.1.1 Fuzzy Logic . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.3.1.2 Evidential reasoning . . . . . . . . . . . . . . . . . . . . . . 59
3.3.2 Level 1: Fuzzy Inference System . . . . . . . . . . . . . . . . . . . . . 61
3.3.3 Level 2: Risk Assessment . . . . . . . . . . . . . . . . . . . . . . . . . 63
3.3.3.1 Hypotheses . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
3.3.3.2 Evidence and Belief functions . . . . . . . . . . . . . . . . . 64
3.3.3.3 Combining Beliefs . . . . . . . . . . . . . . . . . . . . . . . . 65
3.3.3.4 Risk evaluation . . . . . . . . . . . . . . . . . . . . . . . . . 67
3.3.4 Level 3: Reference trajectory choice . . . . . . . . . . . . . . . . . . . 74
3.4 Simulation results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
3.4.1 Case study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
3.4.2 Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
3.4.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
3.5 Conclusion and future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
50 Chapter 3. Fuzzy Dempster-Shafer Decision Making
The research and development on automated and autonomous vehicles in the last decade
may disrupt future transportation utilities. Even if the main focus remains safety on roads to
decrease human injuries, Chapter 1 shows that these advanced transportation systems also
aim at improving passenger comfort, energy savings, or trac ow. The addition of those
new driving objectives makes it necessary to develop intelligent behaviors with Multi-Criteria
Decision Making (MCDM) to autonomous vehicles. The simplied vision of an intelligent
system based on Figures 2.2 (p. 16) and 2.3 (p. 17) proposes a 2-level motion planning with
a predictive high-level and a reactive low-level. The proposed work focuses on the high-
level planning and more specically on the decision function, through the implementation
of MCDM to decide the future motion of the ego vehicle considering the evolution of the
surroundings and predened goals of the ego vehicle.
This chapter is organized as follows. Previous work and motivation are presented in
section 3.1. Section 3.2 introduces the multi-criteria selection. The decision algorithm is
detailed in section 3.3, and simulation results are discussed in section 3.4. Lastly, section 3.5
presents the conclusion and future work.
3.1. Literature review and motivation 51
Previous works on the high-level decision making process are reviewed in Chapter 2 with a
variety of methods recently applied to autonomous highway driving. These methods cover
a large domain of decision mechanisms such as knowledge-based inference engines, heuris-
tic approaches, approximate reasoning, and human-like models. As stated previously, our
decision-maker will deal with multi-criteria. The specic issue of MCDM is generally solved
in the literature through mathematical optimization [Ali et al. 2013, Qian et al. 2016] and
expert systems [Wardzi«ski 2006, Furda & Vlacic 2010, Chen et al. 2014, Noh & An 2018].
The main advantage of these methods is to threat large sets for candidate solutions with a
predictive application, meaning a long time horizon as explained in subsection 2.1.2 (p. 15).
However, these previous works return criteria evaluations with independently qualied and
quantied numerical values. In real experiments, these values are not straightforward to ob-
tain in the case of autonomous vehicles. For example, it is easy to calculate the ego velocity
to return a velocity value, but it is harder to associate a value to the dangerousness of the
weather conditions.
Moreover, when dealing with vague or imprecise information, linguistic representations of
an event may be interpreted using a fuzzy range. This provides the ability to develop a con-
tinuous decision model which may have been too complex due to an exhaustive qualication
using exact values or too many thresholds boundaries. In that sense, fuzzy logic theory recalls
the human evaluation process [Yager & Filev 1994], and proves its relevance for autonomous
driving as discussed in [Prokhorov 2009, Balal et al. 2016]. Another advantage of fuzzy logic
is the consideration of continuous physical variables with a continuous interpretation.
Besides, it is important for the decision-maker to take into consideration how humans
interpret the evaluation of a driving situation and what the safety warning means for their own
safety. As the human interpretation is typically based on their own cognitive life experiences,
a common scale is needed between the human driver and the machine, so that the decision
taken by the machine is understandable from the driver point of view. A popular scale
in human decision involves risk assessment. Related work on risk estimators in intelligent
transportation systems applies with dierent architectures. The most basic ones use only one
risk indicator based on one factor to a specic event [Lefèvre et al. 2014]. These approaches are
adequate to evaluate the current risk of the situation considering only a rough vehicle collision
warning with static or moving surroundings obstacles. In order to provide a more complete
risk assessment, estimators could rely on a combination of several factors. Thus, vehicle,
driver, and environment data are taken into account to return a suitable risk warning. In
this way, the system acquires omniscience and stability based on dierent sources of risk. For
example in [Prokhorov 2009], the author develops a 2-level risk estimation system. The top-
level estimator is based on the perceived risk aected by weather, trac or road conditions,
i.e. prior knowledge. The low-level estimator is then applied to rene the top-level estimator
with real-time information about the surroundings, such as relative velocities or distances
with obstacles. Another 2-level risk estimator proposition in [Daniel et al. 2013] calculates
a local risk estimator for each category (vehicle, driver, and environment) by separating
the static data (e.g. mileage of the vehicle, experience of the driver, road topology), the
contextual data (e.g. vigilance of the driver, weather conditions) and the variable data (e.g.
accelerations of the vehicle, eye gaze of the driver, Time-To-Collision with obstacles). Each
52 Chapter 3. Fuzzy Dempster-Shafer Decision Making
data is locally fused for each category, dening a local risk. Finally, each category is fused to
dene a global driving risk, as illustrated in Figure 3.1.
Furthermore, it is crucial to also consider the uncertainties of criteria values and the
consistency of the dierent sources of risk. Indeed, if a decision evaluation appears to be fa-
vorable to a candidate trajectory, but the uncertainties on the criteria values used to evaluate
this candidate are strong, the decision might be too risky. In the same way, if the sources for
the evaluation depict dierent conclusions, it may be safer not to make the decision. Thus,
uncertainties and consistency impact the risk evaluation of the decision-maker. In order to
take these into consideration, the proposed approach incorporates evidential reasoning using
the Dempster-Shafer Theory (DST) [Dempster 1967, Shafer 1976]. The DST, or evidential
theory or belief theory, is a mathematical framework based on evidences and plausibility in
order to combine evidence from dierent sources and nd a degree of belief for the consid-
ered event. It has already been adapted for risk assessment on current and predicted driving
situation in [Wardzi«ski 2006, Daniel et al. 2013].
Our motivation in this work is to develop a high-level MCDM without any restriction on
the criteria values, considering both their uncertainties in the form of trustworthiness and
strife with a deterministic and straightforward explainable process. To do so, we propose
a risk assessment based on the Fuzzy Dempster-Shafer Theory (FDST) approach. Previous
works using fuzzy set and evidential theory have been studied in autonomous vehicle deci-
sion making, such as the fusion of crisp objective and fuzzy subjective information in [Liu
et al. 2012], the prediction of road accidents in case of exceeding accelerations in [Gündüz
et al. 2017], or driving scores for usage based insurance application in [Derbel & Landry 2018].
But, to our knowledge, this work has not been extended to a high-level decision-maker in
autonomous vehicles. The presented strategy is based on a 3-level architecture: a set of pre-
dened motions is evaluated with criteria values, which are rst fuzzied in local risk levels
for each category; second the local risk levels of all the categories are combined as dierent
sources of risk; and third, the motion with the best evaluation is selected as the reference mo-
tion over a prediction horizon. The motion can be a set of paths, trajectories or maneuvers,
as long as it can be characterized by a set of criteria. The evaluation is based on a linguistic
description of the criteria using fuzzy logic and risk assessment combining the belief of risk
evidences with FDST. Finally, our algorithm simultaneously considers several factors of risk,
their uncertainties, their consistency and their relative importance for human safety in the
decision-maker.
3.2. Multi-criteria selection 53
Multi-criteria involves processing data from dierent sources, with dierent units and dierent
range values. In order to process these data correctly, they will rst have to be homogenized.
Furthermore, not all these data will be of equal importance for the evaluation of the motion
and the selection of the decision. This section indicates the preprocessing steps necessary to
provide consistent inputs data to our algorithm. Our pretreatment is divided into 4 steps:
the data categorization in subsection 3.2.1, the weight assignment in subsection 3.2.2, the
predictive characterization in subsection 3.2.3, and the transformation of the crispy criteria
values to fuzzy values in subsection 3.2.4.
the driver. In our proposed architecture, the importance of each s criterion (νs ) is implic-
itly included using the fuzzy inference system (see subsection 3.3.2). In addition, we dene
discrete weights wκ for each κ category, as it can be designed in mathematical optimization
approaches, i.e as soft, middle or hard constraints. By denition, the inuence of hard con-
straints cannot be released, whereas it can for soft constraints. Middle constraints inuence
must be specied considering the severity of the situation. Table 3.1 shows some examples of
relative weights with priority to safety. For example, the safety categories are hard whereas
the energy savings category is soft. Another advantage of the multi-criteria selection and
attribution steps is the ability to relax some criteria in case of a conicting situation. If the
ego vehicle encounters an imminent collision, the energy savings and respecting driving rules
do not matter compared to the safety categories.
55
Soft Energy savings -8- mean Longitudinal Acceleration/Deceleration Slow / Medium / High
56 Chapter 3. Fuzzy Dempster-Shafer Decision Making
slow for instance; if it navigates in the mostright lane, it will be 100% safe, in the middle
lane, 50% safe and 50% unsafe.
The predictive criteria and their fuzzy sets are the inputs of the Fuzzy Dempster-Shafer
algorithm presented in the next section.
The decision-maker is based on the previously mentioned criteria s = 1..S in section 3.2. They
represent the driving scene with data issued from exteroceptive (environment information)
and proprioceptive (ego information) sensors. Thus, the decision process is based on imperfect
and heterogeneous information provided by more or less reliable and conicting sources. The
aim of our algorithm is to combine dierent criteria values us with their uncertainties to
yield a risk assessment RA. To do so, we develop the architecture depicted in Figure 3.3 used
for each candidate trajectory p = 1..P . We rst dene in level 1 a Fuzzy Inference System
(FIS) on the criteria s = 1..S to return fuzzy local warning risk levels LWκι for each category
κ = 1..C with a common scale of output fuzzy sets ι. Level 2 applies the Fuzzy Dempster-
Shafer Theory (FDST) to combine the resulting risk levels LWκι , and calculates the belief
and plausibility degree RA[bel, pl]hp of each predened risk hypotheses Hh=1..H . Lastly, level
3 evaluates a risk indicator Ip to determine the best trajectory. This section will rst recall
the basic concept of the two theories of fuzzy logic and evidential reasoning in subsection
3.3.1, and then develop each of the 3-level FDST architecture (level 1 in subsection 3.3.2,
level 2 in subsection 3.3.3, and level 3 in subsection 3.3.4).
57
58 Chapter 3. Fuzzy Dempster-Shafer Decision Making
As an overview, fuzzy logic provides the ability to represent vague and imprecise variables
in a decision making system. Unlike the traditional crisp sets, which have strict boundaries,
i.e. the variable belongs or belongs not to a set so that the membership is binary, fuzzy logic
denes a continuous membership degree mf to a fuzzy set described by a membership function
µ for each fuzzy set. The fuzzy sets can have overlapping boundaries, which means that the
variable can have more than one non-zero membership degree. The input space is then
mapped to the output space using a list of if-then statements called rules. Hence, the rules
do not refer to thresholds but to the corresponding mf from input to output spaces. Various
methods have been used in research to design and optimize µ and determine the systems' rule-
base including expert knowledge [Yager & Filev 1994], clustering [Yager & Fileu 1993], and
adaptive neuro-fuzzy inference systems [Jang 1993]. Dierent types of inference systems also
exist and vary in the way outputs are determined. The most popular ones are the Mamdani
and the Sugeno FIS [Yager & Filev 1994]. The main dierence between them is the output
membership function µout . The Sugeno output membership functions are either linear or
constant, whereas the Mamdani ones are fuzzy sets. In this proposed framework, we want to
fuzzify, homogenize, and combine the chosen criteria values s = 1..S into categories κ = 1..C ,
see Figure 3.3a. We dene the output µout to return local warning risk levels (LWκι ) as fuzzy
sets, thus a Mamdani-type inference is developed. In the rest of the manuscript, µ and mf
designate respectively the membership functions and the membership degrees on the entire
fuzzy set; µA (s) and mfA (us ) are the membership function and the membership degree of
the fuzzy set A for the input criterion s with the criterion value us .
The development of a Mamdani Fuzzy Inference System (FIS) generally includes the
following ve steps ([Yager & Filev 1994]):
• (ii) fuzzication,
First (i), each input and output are expressed in fuzzy sets. Each fuzzy set is represented
by a membership function µ over a universe of discernment uod. The universe of discernment
uod is the range of the variable to analyze. Each input and output has a corresponding
uod. A membership function µ is an arbitrary curve that maps each element of the uod to a
membership value between 0 and 1, called the membership degree mf . Thus, a fuzzy set A is
represented by its membership function µA for all the elements z in its uodA in Equation 3.1:
The second step (ii) of fuzzication consists in associating the input variable value to its
input membership degrees. The membership degrees mf are the intersections between the
membership functions µ and the input variable value us in the uod. Due to the overlapping
boundaries of µ, the fuzzied input value can belong to more than one µ, and thus may have
more than one membership degree mf for each of the corresponding input fuzzy sets.
3.3. Fuzzy Dempster-Shafer algorithm 59
Third (iii), a rule-base describes the relationship between the input fuzzy sets and the
output fuzzy sets. This rule-base is expressed as if-then statements with the logic operator
AN D, OR, N OT . The rules express the relationship between the input fuzzy sets of each
input variable and the output fuzzy sets. The activation of a rule depends on the membership
degree mf (us ) of its input variables us previously calculated in the second step (ii). Here
is an example of a FIS with 2 inputs (2 criteria s = 1, 2) and 1 output (RA); input 1 is
fuzzied with 3 fuzzy sets (small, medium, high), input 2 is fuzzied with 2 fuzzy sets (low,
high), and output is fuzzied with 4 fuzzy sets (very bad, bad, good, very good). The rule
number 1 "If Input 1 is small AN D Input 2 is low, then output RA is good" is activated
if (mf1(small) (u1 ) 6= 0 & mf2(low) (u2 ) 6= 0). In the manuscript, we named IRR the set of
the activated rules and IRRr ≡ ruler (mfs (us )) the activation of rule number r with the
corresponding membership degree mfs (us ) for each input criterion s activating the rule.
After the set of activated rules IRR have been deduced from the membership degree
mf of the input fuzzy sets and the rule-base, the rule-implication calculates the mf of
the fuzzy output of each activated rule. It is decomposed in two steps. The rst step
applies the logic operator of the rules on the membership degrees of the input values. A
classic interpretation of the logic operator AN D, OR, N OT are respectively the minimum,
maximum and additive complement functions. With the previous example, the resulting
value of the rule "If Input 1 is small AN D Input 2 is low, then output RA is good" is equal
to min(mf1(small) (u1 ), mf2(low) (u2 )). The second step is the truncation of the corresponding
output fuzzy set with this resulting value. In our previous example, the membership function
µout(good) of the output fuzzy set good is truncated by the min(mf1(small) (u1 ), mf2(low) (u2 ))
value. In the manuscript, we name ORR the set of the implicated rules and ORRr ≡
trunc(µout , T (IRRr )) the truncated output fuzzy set corresponding to rule number r with T
its logic operator.
The fourth step (iv) aggregates the truncated output fuzzy sets of all the activated rules
ORR, calculated in the implication process. In the manuscript, we named ARR the aggrega-
tion function of the implicated rules. There exist dierent aggregation operator such as the
maximum, the probabilistic or, or the sum of each activated rule's output fuzzy set. With S
the aggregation operator, and r1, r2 the two activated rules, ARR = S(ORRr1 , ORRr2 ).
The nal step (v), defuzzication, provides a single output value from ARR; however, the
proposed architecture uses the aggregation function ARR for the FDST algorithm. Thereby,
each category κ is assigned with a LWκι as the likelihood probability of being in each prede-
ned risk level ι as dened in the rule-base.
assignment bba. The mass of a subset element represents the proportion of the other subsets
asserting that the current state is part of the considered element and no other subset, i.e. the
veracity of a proposition. The nonzero mass sets are called focal elements. The mathematical
formalism is then dened as:
X
m : 2X → [0, 1]; m(∅) = 0; m(A) = 1 (3.2)
A⊆2X
The interest of belief theory is to combine evidences to nd their common part within
the universe. The Dempster's combination rule of evidence for independent sources 1 and 2
is given in [Dempster 1967] by the joint mass m1,2 :
m1,2 (∅) = 0
m1,2 (A) = (m1 ⊕ m2 )(A)
(3.3)
1 X
= m1 (B)m2 (C)
1−K
B∩C=A6=∅
where K represents a measure of conict between the two mass sets: K = B∩C=∅ m1 (B)m2 (C),
P
with B and C respectively the subsets of sources 1 and 2. In practice, the independence of
sources is not always veried, and conicts have dierent interpretation. This can be handled
by extending the original combination rules. A non-exhaustive list of rules for the combina-
tion of evidences is presented in [Sentz et al. 2002].
The use of DST diers from the classic probabilistic theory, in the sense that the exact
probability P of an event A is contained in an interval, bounded by a lower probability called
belief (bel) and an upper probability called plausibility (pl):
The belief represents the sum of the masses of all the subsets of the considered event, i.e.
the sum of the masses of all the focal elements which necessarily imply the desired event.
The plausibility represents the sum of the masses of the sets that intersect the considered
event, i.e. the sum of the masses of all the focal elements that do not necessarily contradict
the desired event. In other words:
X
bel(A) = m(B)
B|B⊆A
X (3.5)
pl(A) = m(B)
B|B∩A6=∅
In our work, the evidences describe driving scene data, i.e the previously dened categories
κ = 1..C in section 3.2. Each element of the universe is interpreted as a local warning risk
level (LWκι ), which ranges over the interval [0, 1], 0 corresponding to a zero risk level and 1
being the highest risk level. The mass is calculated with the output membership function
ARR of the fuzzy logic. More details and examples are given in subsection 3.3.3.
3.3. Fuzzy Dempster-Shafer algorithm 61
For the rst step, input and output membership function µ design, we assume that all
the input and output µ are provided for each criterion s and each local warning risk level
(LWκι ). The heterogeneousness of the criteria s = 1..S results from the dierent fuzzy sets, µ
and universe of discernment uod for each criterion s. On the contrary, the homogeneousness
of the categories κ = 1..C leads to the same uod, µ and fuzzy sets ι to dene the LWκι for all
the categories C .
In our work, we dene 5 µout outputs warning levels to estimate the risk over uodout =
[0, 1]: very low (µout(veryLow) , uodveryLow = [0, 0.25]), low (µout(low) , uodlow = [0, 0.5]),
medium (µout(med) , uodmed = [0.25, 0.75]), high (µout(high) , uodhigh = [0.5, 1]) and very high
(µout(veryHigh) , uodveryHigh = [0.75, 1]), as shown in Figure 3.4. In the example of category 4
Driving rules in Table 3.1, the rst criterion Legal velocity (Lv) has a uod1 = [0, 180]km/h
and four µ (µ1(tooslowV el) , µ1(slowV el) , µ1(medV el) , µ1(toof astV el) ), shown in Figure 3.5a, and the
second criterion Legal lane (Ll) has a uod2 = [0, 1] and two µ (µ2(unsaf eLane) , µ2(saf eLane) ),
shown in Figure 3.5b.
Figure 3.4: Fuzzy outputs µ for the warning levels LWκι such that ∀κ ∈ [1, C], ι ∈
{veryLow, low, med, high, veryHigh} in uod = [0, 1].
The second step of fuzzication consists on dening the membership of each considered
criterion. For example, the category 4 Driving rules has two criteria: Legal velocity (Lv) and
Legal lane (Ll). If the ego vehicle has a velocity of 95km/h and is on the mostleft lane, the
membership degrees to the input fuzzy sets are equal to mf1(slow) = 0.25 and mf1(med) = 0.75
for the criterion 1 Lv and mf2(unsaf e) = 1 for the criterion 2 Ll, with respect to the input
membership functions dening in Figures 3.5a and 3.5b.
Furthermore, we are interested in uncertainties based on a trust value to quantify the
delity of the measurement of an input variable, i.e. to attribute a condence degree to the
variable. This trust value lies between 0 and 1. The closer to 1 the trust value is, the higher
the condence degree of the measurement is. Thus, the degree of membership of each fuzzy
set for an input variable s after the input fuzzication step is weighted by the the importance
62 Chapter 3. Fuzzy Dempster-Shafer Decision Making
(a) Four µ for criterion 1 legal velocity: too (b) Two µ for criterion 2 legal lane: safe
slow (µ1(tooslowV el) ), slow (µ1(slowV el) ), medium (µ2(saf eLane) ) and unsafe (µ2(unsaf eLane) ).
(µ1(medV el) ) and too fast (µ1(toof astV el) ).
of each criterion νs , which is the trust value in our problem, as stated in Equation 3.6:
(
µ0s = νs ∗ µs ,
(3.6)
mfs0 = νs ∗ mfs ,
where µ and mf are the membership function and membership degree before weighting, µ0
and mf 0 the membership function and membership degree after weighting.
In the third step, the rule-base links the input membership functions µs of criteria s = 1..S
to the output ones µout . For example, we assign the rule-base in Table 3.2 for category 4
Driving rules.
Considering the previous example with Lv = 95 km/h and Ll = 1, the rules number 4
and 6 are activated with the inputs mf1 (Lv = 95) and mf2 (Ll = 1) in Equation 3.7:
IRR = IRR4 & IRR6 = rule4 (0.25, 1) & rule6 (0.75, 1) (3.7)
The result of evaluating the rules r through the implication method AN D (equivalent
to T ≡ min function) on IRRr are the truncated output µout for ORRr expressed in Equa-
3.3. Fuzzy Dempster-Shafer algorithm 63
tion 3.8:
The result of the fourth step, evaluating the truncated output membership functions
ORRr for each activated rule r through the aggregation method S ≡ max for ARR, is the
maximum value of the ORR over the output uod expressed by Equation 3.10:
The corresponding gures of IRR, ORR and ARR calculation are shown in Figure 3.6:
Finally, the aggregation output for the local warning risk levels of category 4 is represented
in Figure 3.7 by overlapping the ARR function and the output fuzzy sets in Figure 3.4. The
blue area corresponds to LW4veryLow , the pink area to LW4low , the yellow area to LW4med , the
green area to LW4high and the purple area to LW4veryHigh for the input values Lv = 95 km/h
and Ll = 1.
The covered areas for each of the LWκι are then used as the inputs for the level 2 of our
architecture, explained in details in the next subsection 3.3.3.
Figure 3.7: FIS ARR area outputs for each LW for category 4 Driving rules.
risk hypotheses Hh=1..H . The procedure is: (i) dening H hypotheses, (ii) constructing the
focal elements of evidences and their bba, (iii) combining bba, and (iv) calculating the result-
ing risk assessment RA for each hypothesis h of each trajectory p. In our case, the events are
the dierent local warning risk levels (LWκι ) issued from the focal elements of the evidences,
the evidences are the categories κ = 1..C dened in section 3.2, and the hypotheses Hh=1..H
are a priori dened as the values for the risk assessment (RA) acceptable to non-acceptable,
characterized by the evidential decision brackets RA[belhp , plhp ], see Figure 3.3b.
3.3.3.1 Hypotheses
In this work, belief theory is used to dene a risk assessment RA for each candidate trajec-
tory p. One of the drawbacks of risk assessment is the need for dening an acceptable or
unacceptable risk threshold, which depends heavily on the driver's experience, as analyzed in
[Wardzi«ski 2006]. The use of belief theory with fuzzy sets makes it possible to handle fuzzy
thresholds too. Thus, the advantage of the proposed architecture is to adjust the thresholds'
shapes to a continuous risk tolerance.
Considering the three threat levels in [Noh & An 2018], we represent 3 hypotheses Hh=1..3
for an acceptable level of risk (A), a partially acceptable (PA) and a non-acceptable (NA),
i.e. H = {A, P A, N A}. We consider the respective trapezoidal functions fh depicted in
Figure 3.8, based on the local warning risk levels LWκι dened in the previous level 1 in
subsection 3.3.2. Indeed, we dene as acceptable a very low risk level, 5/6 of a low risk level
and a sixth of medium one, the partially acceptable hypothesis covers the last sixth of low,
medium and the rst half of high, and the non acceptable hypothesis is given by the second
part of high and very high risk levels. This choice has been made considering a conservative
driver prole and the safety rules of the Driver Rules. Therefore, the transition between A
and P A is smooth due to the driver prole, whereas the transition P A to N A is radical due
to the safety approach.
category κ. The LWκι result from the output µ and mf calculated in subsection 3.3.2. For
each category κ, several local warning risk levels may then be concerned.
It was decided here to consider, a priori, all LWκι as having equal probability for each
category κ. However, each category κ is also assessed from the fuzzy combination of criteria
s. The aggregate output of fuzzy rules is therefore interpreted as equivalent to the relative
probability that each local warning risk level LWκι of the category κ is represented. Thus,
the new mass of each warning level ι of each category κ is the product of the uniform
distribution and the surface covered by the ARR function (i.e. the integral value) on the
corresponding LWκι [Jiang et al. 2012]. These new masses are nally normalized in order to
respect Equation 3.2 (subsection 3.3.1.2, p. 60). The bba calculation for a warning level ι for
a category κ over uodout is summarized in Equation 3.11:
Z
1
ι
bbaκ = ARR(LWκι ) d(uodout ), (3.11)
nLW
with nLW the cardinal of the output fuzzy sets, i.e. the number of local warning risk levels.
In the example of category 4 Driving rules, the bba shown in Figure 3.4 are calculated
in Equation 3.12:
Z
veryLow 1
bba4 = ∗ ARR(LW4veryLow ) d(uodout ) = 0.3061,
5
Z
1
bbalow = ∗ ARR(LW4low ) d(uodout ) = 0.1632,
4
5
Z
1
bba4 = ∗ ARR(LW4med ) d(uodout ) = 0.1225,
med
(3.12)
5
Z
1
bbahigh = ∗ ARR(LW4high ) d(uodout ) = 0.2857,
4
5
Z
1
veryHigh
= ∗ ARR(LW4veryHigh ) d(uodout ) = 0.1225.
bba4
5
focal elements of all other evidences. In our problem, this combination approach consists
in calculating the resulting membership functions µ and basic belief assignment bba of each
non-zero local warning risk levels LWκι from each category κ with the non-zero LWκι 0 of all
the other categories.
Besides, in our application, the conict situation leads to inconsistent decisions not to
be ignored. Indeed, if a rst category κ has only one focal element describing a very high
risk level (i.e. only LWκveryHigh 6= 0), and a second category κ0 has its only focal element
describing a very low risk (i.e. only LWκveryLow
0 6= 0), it means that the global estimated risk
is both very high and very low. In this situation, the 2 evidences' conclusions are in conict;
this conict leads to an incoherent global risk level decision making. In order to address
this issue of conict, we apply the conjunctive Yager's combination rule [Sentz et al. 2002]
in Equation 3.13. The dierence here is to not normalize the joint mass with the Dempster's
conict factor K in Equation 3.3, but to translate conict in ignorance. As a consequence, the
value of bel is smaller and the value of pl larger than the one obtained with the Dempster's
rule.
Moreover, we introduce in Yager rule's combination the weight of categories wκ in the
combination of µ, as it has been suggested in [Liu et al. 2012]. The idea is to shift the
resultant membership function in the direction of the highest-weight category. Consequently,
the LW of the highest-weight category support a higher contribution on the calculation of
the resulting beliefs.
Finally, the combination of a category 1 with the focal element LW1ι and a category 2
0
with the focal element LW2ι results in µ1,2(ι,ι0 ) over uod1,2(ι,ι0 ) = [0, 1] and bba1,2(ι,ι0 ) given in
Equation 3.13:
(
µ1,2(ι,ι0 ) = G × min w1 µ1(ι) , w2 µ2(ι0 )
(3.13)
bba1,2(ι,ι0 ) = bba1(ι) × bba2(ι0 )
where G is a normalization factor dened in Equation 3.14, such that the category weights
wκ only inuence the µ representation and not the bba:
max(min(µ1(ι) , µ2(ι0 ) ))
G= . (3.14)
max(min(w1 µ1(ι) , w2 µ2(ι0 ) ))
Figure 3.9 illustrates the calculation of the resulting membership function µ for 2 cate-
gories κ = 1, 2, with w1 = w2 = 1.
Furthermore, in our example, the combination operator min for µκ,κ0 with the intersection
of the focal elements is symmetric thanks to the identical output fuzzy sets LWκι denition
for all the categories κ = 1..C , as illustrated in Figure 3.10. Moreover, the uodout for each
ι ∈ {veryLow, low, med, high, veryHigh} do not overlap for all the combination, as depicted
in Figure 3.10. As a consequence of the min combination operator, the red combinations in the
matrix of Figure 3.10 are equal to 0. Indeed, the min resulting membership function between
µout,low over uodlow = [0, 0.5] and µout,high over uodhigh = [0.5, 1] is equal to 0. Therefore, for
any combination of C categories, the combined µκ,κ0 are calculated for the following fuzzy sets:
{veryLow, veryLow/low, low, low/med, med, med/high, high, high/veryHigh, veryHigh}.
Those fuzzy sets contribute to the belief-plausibility interval of each hypotheses Hh , as de-
tailed in the next subsection 3.3.3.4.
3.3. Fuzzy Dempster-Shafer algorithm 67
For the sake of clarity, we displayed separately the contribution {veryLow, low, med, high,
veryHigh} in Figure 3.11 and {veryLow/low, low/med, med/high, high/veryHigh} in Fig-
ure 3.12. The resulting µout of the example shown in Figure 3.9 is represented in a solid blue
line in the top representation of both gures. The contributions of {veryLow, low, med, high,
veryHigh} and {veryLow/low, low/med, med/high, high/veryHigh} are plotted respectively
in colored dashed lines in the center representations in Figures 3.11 and 3.12. Each con-
tributed mfout,l is normalized, as shown in the colored solid lines in the center represen-
tations in Figures 3.11 and 3.12. The bottom representations report the contribution of
the normalized mfout 0 to the output fuzzy sets {veryLow, low, med, high, veryHigh} and
{veryLow/low, low/med, med/high, high/veryHigh}.
68 Chapter 3. Fuzzy Dempster-Shafer Decision Making
Figure 3.10: Illustration of the µ combination process. The red combinations are equal to 0, as their uod do
not intersect.
To calculate the contribution of fuzzy sets to the hypotheses, we use the generalization of
the Dempster-Shafer Theory to fuzzy sets in [Yen 1990]. It consists in the decomposition of a
fuzzy focal element A as a collection of nonfuzzy α-level subsets. Even if the hypotheses and
the fuzzy sets are represented by simple trapezoidal and triangular shapes in our problem,
we use the α-level subsets for numerical approximation to calculate the surface of arbitrary
functions. An illustration of the contribution of {veryLow, low, med, high, veryHigh} to the
three hypotheses Hh∈{A,P A,N A} and 5 α-level subsets is shown in Figure 3.13.
Finally, the belief and plausibility values for evaluating the risk assessment RA[bel, pl]hp
for each hypothesis Hh∈A,P A,N A of each trajectory p = 1..P are obtained in Equation 3.16:
nX
!
LW X
0
bel (Hh ) = bbal × (αi − αi−1 ) × 0inf fh (z)
z|µl (z)>αi
l=1 αi
nX
! (3.16)
LW X
pl (Hh ) = bba0l × (αi − αi−1 ) × sup fh (z)
l=1 αi z|µ0l (z)>αi
We threat the example of the contribution of veryLow and low on the hypothesis accept-
able HA respectively in Figures 3.14 and 3.15:
3.3. Fuzzy Dempster-Shafer algorithm 69
Figure 3.11: Illustration of the normalization process for the contribution of very low in dark green, low in
light blue, medium in purple, high in pink and very high in black. Top: Output fuzzy sets denition in
dotted lines and the resulting output µout of the combination of the C categories in solid blue line. Center:
Normalization µ0out in solid line of the µout in dashed lines on each output fuzzy set. Bottom: Normalized
contribution of µ0out in solid lines for the output fuzzy sets in dotted lines.
Figure 3.12: Illustration of the normalization process for the contribution of very low/low in light blue,
low/medium in magenta, medium/high in light pink, high/very high in light gray. Top: Output fuzzy sets
denition in dotted lines and the resulting output µout of the combination of the C categories in solid blue
line. Center: Normalization µ0out in solid line of the µout in dashed lines on each output fuzzy set. Bottom:
Normalized contribution of µ0out in solid lines for the output fuzzy sets in dotted lines.
• For the contribution of veryLow, in the α-set α1 , the inf value of the representation
function fA of HA is equal to 1, as well as its sup value. One notices that the inf
and sup values of fA are equal to 1 for all the α-sets α2 , α3 , α4 , α5 . Thus, veryLow
contributes for all the α-level subsets to the belief bel and plausibility pl of hypothesis
70 Chapter 3. Fuzzy Dempster-Shafer Decision Making
• For the contribution of low, in the α-set α1 , the inf value of fA is equal to 0, whereas
its sup value is equal to 1. Consequently, low contributes under the α-set α1 only to
the plausibility value of hypothesis HA . One notices that the inf and sup values of fA
are respectively 0 and 1 for the α-sets α2 , α3 , α4 . For the α5 , both the inf and sup
values of fA are equal to 0. Thus, low contributes for the α-level subsets α1 , α2 , α3 , α4
to the plausibility pl(HA ) of hypothesis HA , but do not contribute to its belief value
according to Equation 3.16, as summarized in Equation 3.17:
inf f (z) = 0, sup fA (z) = 1,
z|µlow (z)>α1 A
z|µlow (z)>α1
inf fA (z) = 0, sup fA (z) = 1,
z|µlow (z)>α2 z|µlow (z)>α2
inf fA (z) = 0, sup fA (z) = 1, (3.18)
z|µlow (z)>α3 z|µlow (z)>α3
inf fA (z) = 0, sup fA (z) = 1,
z|µlow (z)>α4 z|µlow (z)>α4
z|µ inf fA (z) = 0, sup fA (z) = 0.
low(z)>α 5 z|µlow (z)>α5
To conclude, an output fuzzy set will generally contribute to the bel value if its uod of
interest (i.e. uod values for which µ 6= 0) is included in the uod of interest of the represen-
tation function of the hypothesis. Its contribution to the pl value is guaranteed if its uod of
interest intersects the uod of interest of the hypothesis. For example, the output fuzzy sets
contributions by mfout of Figure 3.9 (p. 67) to the 3 hypotheses HA,P A,N A are depicted in
Figure 3.16.
3.3. Fuzzy Dempster-Shafer algorithm 71
Figure 3.14: Illustration of 5 α-level subsets decomposition process to calculate the contribution of veryLow
in dark green line to HA in green line.
72 Chapter 3. Fuzzy Dempster-Shafer Decision Making
Figure 3.15: Illustration of 5 α-level subsets decomposition process to calculate the contribution of low in
light blue line to HA in green line.
3.3. Fuzzy Dempster-Shafer algorithm 73
Figure 3.16: Illustration of the belief and plausibility values for each hypothesis. (a) The small green area is
the contribution to bel(HA ) and the larger green area is the contribution to pl(HA ). (b) The small orange
area is the contribution to bel(HP A ) and the larger orange area is the contribution to pl(HP A ). (c) There is
no contribution to bel(HN A ) and pl(HN A ) with the mfout of this example.
74 Chapter 3. Fuzzy Dempster-Shafer Decision Making
The way to combine the length and average of the interval is part of the limits of a non-
human decision process. It deals with the arbitrary choice: to favor safety or security, i.e. to
assign a conservative or aggressive vehicle's behavior.
In order to validate our decision-maker, we propose a simple case study in simulation. More
complex highway use cases are presented in experimentation in Chapter 5. We use sigmoid
candidate trajectories as inputs in this simulation. The case study is described in subsection
3.4.1, the parameters for FDST are set in subsection 3.4.2, and the simulation results are
depicted in subsection 3.4.3.
3.4. Simulation results 75
b
y(t) = y0 ± , (3.21)
1 + e−λ(x(t)−c)
with b the shift between y0 and the asymptotic end value, which corresponds in our
problem to the target centerline. The sigmoid parameter λ and the distance delay c are
tuning parameters. In our problem, they are used for driver safety and comfort, which
depend on the lateral and longitudinal ego vehicle's velocities (see subsection 4.3.1.3, p. 90
for more details).
We obtain 14 candidate trajectories: 7 for lane changing Left and 7 for car following
maneuvers Straight, see Figure 3.17.
Figure 3.17: Case study description. The ego vehicle is represented in blue. The obstacle is the blue
rectangular with the red front line. The 7 acceleration proles give 14 dotted-dashed trajectories for left and
a a a
straight directions: no acceleration p1 in yellow, p2 : x,max
4
in magenta, p3 : x,max
3
in cyan, p4 : x,max
2
in
dx,max dx,max dx,max
red, p5 : 4
in green, p6 : 3
in blue, p7 : 2
in black, with respectively ax,max = 2 m/s and
2
dx,max = −1.5 m/s2 the maximum acceleration and deceleration of the ego vehicle. The ends of trajectories
are marked with a plus sign. The obstacle trajectory is the dotted-asterisk purple line.
Trajectory p1 p2 p3 p4 p5 p6 p7
Acceleration (m/s2 ) 0 0.5 0.66 1 -0.375 -0.5 -0.75
λ (m−1 ) 0.15 0.13 0.11 0.10 0.17 0.20 0.23
c (m) 40 45 50 55 35 30 25
76 Chapter 3. Fuzzy Dempster-Shafer Decision Making
3.4.2 Parameters
We detail in this part the 2 categories used for this use case: obstacle safety (κ = 3) and
driving rules (κ = 4), see Table 3.1. The membership functions and rule-base of category
3 Obstacle safety are adapted from [O' Brien et al. 2017]. The authors used the dataset
NGSIM and the Insurance Corporation of British Columbia's (ICBC) driving rules [ICB 2015]
to create the input fuzzy sets for the criteria 'Relative velocity' (Rv in km/h), 'Time head-
way' (T h in s) and 'Ego velocity' (Ev in km/h), and the 'warning level' output fuzzy sets
ι∈{veryLow,low,med,high,veryHigh}
LW3 . The µ are plotted in Figure 3.18a and the rule-base is
written in Table 3.4. As we consider in this paper an autonomous vehicle, we shift 1 s less
the membership functions of the 'Time headway' criteria, which was initially designed in
[O' Brien et al. 2017] for the analysis of human drivers. This is justied by the 1 s assessed
to the driver reaction time.
Figure 3.18b shows the FIS for category 4, based on [ICB 2015]. The input fuzzy sets
are 'Legal velocity' (Lv in km/h) and 'Legal lane' (Ll no unit), and the output fuzzy sets
are the same 'warning level' as for category 3. We use a Mamdani FIS with a minimum
AndMethod to the set of rules, and respectively the minimum, maximum, for implication
ORR, aggregation ARR, methods.
In order to show the inuence of the trust and weight consideration, we present 3 scenarios.
Scenario 1 is a nominal situation with sensors' trust values set to 100%. Scenario 2 presents
a nominal situation with a 30% trust value for 'Time headway' and 100% for the two other
criteria of category 3. Lastly, scenario 3 keeps the trust values of scenario 1 in critical
situation, where category 3 is hard and category 4 middle. In nominal situation, we consider
the hard, middle, and soft weights as equivalent, i.e. whard,middle = 1 for our 2 categories 3
and 4. In critical situation, we consider hard weight ten times bigger than middle weight. For
conjunctive combination, we use the inverse of the weight value in the min operator, whereas
for disjunctive combination, we use the weight value in the max operator. As we apply the
conjunctive Yager's rule, we get whard = 1/10 and wmiddle = 1.
The 3 hypotheses for risk assessment with respectively Acceptable hypothesis (A), Par-
tially Acceptable (PA), and Non-Acceptable (NA) are plotted respectively in green, orange,
and red in Figure 3.8 (p. 65). We choose the hypotheses weights ΩA = 2, ΩP A = 1 and
ΩN A = −3 for the risk indicator calculation, and ε = 0.1 for consistency, as belief and
plausibility values expressed on [0, 1]. The highest Ip leads to the best trajectory p.
3.4. Simulation results 77
3.4.3 Results
The criteria values obtained with the corresponding predictive operator (Table 3.1) for our
candidate trajectories are collected in Table 3.5. Considering the input membership functions
of categories 3 and 4, it seems that for category 3, the 'Time headway (Th)' criterion would
be the most dierentiated one, whereas for category 4, 'Legal velocity (Lv)' and 'Legal lane
(Ll)' would both inuence the degree of membership on the output warning risk levels LW .
Table 3.5: Criteria values for categories 3 and 4.
Table 3.6 shows the risk indicator Ip of each straight and left trajectories p = 1..7 for the
3 scenarios.
Table 3.6: Risk indicator Ip .
For scenario 1, the left candidate trajectories scores are close and positive, as their
criteria values through the rule-base mostly lead to low, medium and high risk levels. For
the straight trajectories, collision with the front obstacle is very close for the acceleration
proles as shown in Figure 3.17, where the obstacle trajectory's (purple dotted-asterisk line)
distances with p2, p3, and p4 trajectories (resp. magenta, cyan and red plus-sign lines) are
less than 15 m. This potential collision is expressed with Th criterion smaller than 1 s. On
the other side, straight trajectory p1 has a lower score than left trajectory p1 because of a
smaller Th and category 4 yielding low risk level for straight lane and veryLow for left one.
On the contrary, straight p5, p6, and p7 have a better score than left trajectories as they
remain on the right lane which is safer than changing lane at low velocities. The decision
brackets are depicted in Figure 3.20a. For each hypothesis, the [bel, pl] interval for the 7
left trajectories and 7 straight trajectories are detailed. The analysis of the beliefs intervals
for all the trajectories conrms that the most pronounced risk assessment is for acceptable
(A) and partially acceptable (PA) hypotheses. If we look at the rule-base, category 3 induces
mostly medium, high and veryHigh warning risk levels and category 4 veryLow and low levels.
Thus the similarity with PA hypothesis is higher than the one with A or NA hypotheses. The
decision brackets for straight trajectories p2, p3 and p4 are very close or set to 0, as previously
explained. Only the deceleration trajectories cover the non-acceptable hypothesis (NA), as
category 4 yields med and high risk levels, i.e high and veryHigh LW . Considering criteria
values of category 4, the best trajectory is among the left medium velocities. The values
80 Chapter 3. Fuzzy Dempster-Shafer Decision Making
for category 3 are less discriminative. The best trajectory for scenario 1 in the sense of the
previously dened risk indicator is then Left p2.
During scenario 2 - lower trust, the lower trust value inuences the aggregate output
membership function, as shown in Figure 3.19 for the example of the candidate trajectory left
with no acceleration p1. We notice that inuences of trust in 'Time headway' for category
3 is important for all the warning levels. The corresponding belief intervals are displayed in
Figure 3.20b. We notice that the length of the intervals remains almost the same for all the
candidate trajectories under all hypotheses, but the mean values are smaller for A and PA
hypotheses. Indeed, the Th criterion mostly leads category 3 to veryLow, low and medium
risk levels in this case study. If the criterion is being less considered as its trust is lower, the
bba for those values decrease. Besides, the mean values of A and PA hypotheses are closer
than in scenario 1. The best trajectory for scenario 2 is to go Straight p7.
Figure 3.19: Inuence of trust values on the aggregate function of category 3 for the left trajectory
p1.
Finally, we notice that (i) for most cases, the decision brackets are large, which means
that the fuzzycation is important and the hypotheses may thus be too large, (ii) with close
candidate trajectories, the tested categories return similar results for the 3 scenarios, (iii) the
results are sensitive to rules-base consistency, (iv) the values for the risk indicator are relevant
in the 3 scenarios, and (v) considering criteria trust values and weighted categories shows
sensible changes on the risk assessment. In fact, to observe the respective inuence of trust
value and weight category, the untrusted criteria must inuence the LWci assignment, and the
overlap of the LWci between two categories must coincide with the hypotheses representation
functions transition.
3.5. Conclusion and future work 81
Figure 3.20: For each of the hypotheses (A) in green, (PA) in orange and (NA) in red, the decision brackets
[bel, pl] of each candidate trajectory are plotted.
82 Chapter 3. Fuzzy Dempster-Shafer Decision Making
We presented a new framework for predictive multi-criteria decision making for autonomous
vehicles, which simultaneously considers several factors of risk, their uncertainties, and their
relative importance for human safety. We employed the belief theory for risk assessment and
fuzzy logic for uncertain data with a trustworthiness value. As the fuzzy rules are based
on both trac rules and real data learning, the decision relies on a conservative but active
behavior of the ego vehicle. Moreover, the Fuzzy Dempster-Shafer reasoning avoids both
the defuzzication process for fuzzy logic by using belief theory and the problem of fallacy
of the excluded middle for belief theory by using fuzzy set and appropriate rule-base. We
also introduced the inuence of sensors trust values and relative importance on criteria and
categories to relax if safer or to consider driver preferences to get a exible decision. Another
advantage is the possibility to evaluate any description of motion, i.e. paths, trajectories, or
symbolic maneuvers, as long as they can be characterized through a set of criteria. Lastly, the
proposed framework is scalable and ables to evaluate arbitrary functions, although it requires
ne tuning for fuzzy inference systems and best trajectory denition to respect the problem's
specication. However, the combination of fuzzy logic and evidential reasoning still depends
on a mathematically tuning formalism for the fuzzy membership functions, fuzzy rules and
the hypothesis denition. It also suers from an explicit mathematical function to simply
expressed the decision rules.
Future work will consist in deploying this approach to other use cases with more risk
factors based on more criteria and/or categories. In order to adapt the behavior of the fuzzy
inference systems to a human-in-the-loop behavior, learning techniques can also be considered
for dierent driver prole datasets to form the fuzzy sets. Lastly, if the criterion leading to an
unsafe decision is identied, transposing it into a safe universe will allow to apply preventive
decisions.
After dening the decision making function, one needs to provide the candidate motions
to evaluate as the best one for the reference motion sent to the controller. This is the research
topic of Chapter 4.
Chapter 4
This chapter considers the problem of near-optimal trajectory generation for au-
tonomous vehicles on highways. The goal is to select a predictive reference trajectory
in the free evolution space, while avoiding both generating a pre-calculated set of can-
didate trajectories and decoupling path and velocity optimizations. Moreover, this
trajectory aims at optimizing a decision process based on multi-criteria functions,
which are not straightforward to design and can have a black-box formulation. The
main idea is to use the decision evaluation function in the trajectory generator with
a Simulated Annealing approach. The parameters of a sigmoid trajectory are op-
timized within Non-Collision Nominal Intervals, which are dened as collision-free
intervals under nominal conditions using a velocity-space representation.
Contents
4.1 Literature review and motivation . . . . . . . . . . . . . . . . . . . . . . . . . 85
4.2 All-in-one motion planning architecture . . . . . . . . . . . . . . . . . . . . . . 86
4.3 The reachable space-time as NCNI . . . . . . . . . . . . . . . . . . . . . . . . . 88
4.3.1 Environment description . . . . . . . . . . . . . . . . . . . . . . . . . . 88
4.3.1.1 Road model . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
4.3.1.2 Vehicle model . . . . . . . . . . . . . . . . . . . . . . . . . . 89
4.3.1.3 Trajectory model . . . . . . . . . . . . . . . . . . . . . . . . 90
4.3.1.4 Obstacle prediction . . . . . . . . . . . . . . . . . . . . . . . 91
4.3.2 Non-Collision Nominal Intervals (NCNI) . . . . . . . . . . . . . . . . . 91
4.4 Algorithm architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
4.4.1 Assumptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
4.4.2 Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
4.4.2.1 Initial diagram . . . . . . . . . . . . . . . . . . . . . . . . . 96
4.4.2.2 Decision diagram . . . . . . . . . . . . . . . . . . . . . . . . 96
4.5 Simulated Annealing optimization . . . . . . . . . . . . . . . . . . . . . . . . . 97
4.5.1 Algorithm description . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
4.5.1.1 Architecture choice . . . . . . . . . . . . . . . . . . . . . . . 99
4.5.1.2 Parameters denition . . . . . . . . . . . . . . . . . . . . . . 99
4.5.1.3 Variables search space . . . . . . . . . . . . . . . . . . . . . 101
4.5.2 Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
4.5.2.1 Objective function . . . . . . . . . . . . . . . . . . . . . . . 104
4.5.2.2 Convergence . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
4.5.2.3 Temperature sensitivity . . . . . . . . . . . . . . . . . . . . . 105
Chapter 4. Optimized Trajectory Planning within Non-Collision Nominal
84 Intervals
4.5.2.4 Initial variables sensitivity . . . . . . . . . . . . . . . . . . . 105
4.6 Numerical example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
4.6.1 Scenario description . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
4.6.2 SA parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
4.6.3 Maneuvers evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
4.7 Conclusion and future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
Two approaches are distinguished for trajectory generation in autonomous vehicles: dis-
cretization and decoupling. The rst one generates a set of candidate trajectories called
tentacles, introduced in [Von Hundelshausen et al. 2008], as further described in subsection
2.2.2.4 (p. 31). The generation is made a priori, based on predened geometric curves, such
as line and circle [Cherubini et al. 2012], clothoid [Chebly et al. 2017] or sigmoid [Cesari
et al. 2017]. The completion of each candidate trajectory is then tested and scored against
the objective function of the decision subpart in order to dene the most appropriate one.
The second approach is based on static and dynamic decoupling. A spatial path solution
is rst dened on a static decomposition of the space, for example tting particular points
with polynomials [Xu et al. 2012] or Bézier curves [Lattarulo et al. 2017]. This representation
uses the topography of the road, such as lane marking, road shape, map information, and
static obstacle avoidance. The path is then adjusted to the dynamic obstacles by choosing
a speed/acceleration prole [Liu et al. 2017b] to respect the speed limits and avoid dynamic
obstacles. Conversely, dening rst the optimal velocity prole dynamically and then the
static path is also possible [Wang et al. 2018b].
From these two representations emerge two main issues. Indeed, the discretization of the
candidate solutions requires either testing a large number of candidate proles to approach the
optimal one, or to make a choice a priori on the characteristics of the solution. This problem is
studied with the continuous optimization strategies mentioned in subsection 2.2.2.5 (p. 33),
such as gradient descent [Xu et al. 2012], Linear Programming (LP) [Plessen et al. 2017],
Mixed-Integer Programming (MIP) [Wang et al. 2018b], or Model Predictive Control (MPC)
[Cesari et al. 2017]. The drawback of such strategies is their need for an explicit objective
function. Similarly, the decoupling approach does not allow exploring all motions over the
space-time prediction horizon, as it optimizes each dimension separately. This has previously
been addressed through the Velocity Obstacles (VO) representation [Fiorini & Shiller 1998],
the Inevitable Collision States (ICS) [Fraichard & Asama 2004], reachable sets [Söntges & Al-
tho 2018] (see Figure 4.1a), or the graph interval formulation [Altché & De La Fortelle 2017]
(see Figure 4.1b). By combining these ideas, we propose the Non-Collision Nominal Intervals
(NCNI), which return, based on a velocity representation, the predictive collision-free inter-
vals for nominal maneuvers. NCNI are inspired from the complementary space of ICS, and
are characterized as spatiotemporal intervals in order to dene the reachable space-time for
navigation and the corresponding graph of maneuvers.
The last motivation which drives our research is the optimization of the trajectory, eval-
uated with the previously developed decision-maker in Chapter 3. The main issue is the
optimization of the objective function that uses the Fuzzy Dempster-Shafer Theory (FDST).
However, this function can not be expressed explicitly as a simple mathematical function.
Indeed, we consider our decision-maker as a black-box function. Moreover, we want to treat
a continuous solution space in order to tackle the two main issues from trajectory genera-
tion mentioned in the previous paragraphs. Thus, there exists a huge number of potential
solutions. Black-box objective functions with a huge number of potential solutions and no a
priori knowledge are usually optimized using metaheuristics [Gendreau & Potvin 2010]. The
advantages of metaheuristics approaches are that they do not require a formalized problem,
and do not suer from combinatorial explosion. Moreover, the heuristic property tackles the
Chapter 4. Optimized Trajectory Planning within Non-Collision Nominal
86 Intervals
(a) Reachable sets representation from [Söntges & (b) Partitioning space from [Altché &
Altho 2018]. Up: Constant velocity prediction of De La Fortelle 2017]. The colored space in-
the three surrounding obstacles. Down: The grey dicates the relative position of the ego vehicle
area corresponds to the free-space where the ego ve- respectively to obstacles 1, 2 and 3 (b=behind,
hicle can navigate. f=front, l=left, r=right).
Figure 4.1: Illustration of the reachable sets and graph interval with a partitioning space.
local minima issue. Nevertheless, the main drawbacks of these methods are the need for
a ne problem-adaptive tuning, their non-deterministic resolution, and a trade-o between
optimality, completeness and execution time. In order to choose the most adapted algorithm
among the large variety of metaheuristics approaches, we list our problem properties, based
on the decision-maker developed in Chapter 3, and the motivations for the trajectory planner:
• there is no a priori on the function, which can have local minima: a solution to escape
local minima is required (even if a local minimum might be accepted);
• the algorithm is required to run in real-time: the solutions with memory, e.g. Tabu
search, are dismissed.
As reviewed in [Wolpert & Macready 1997], the Simulated Annealing (SA) algorithm tackles
these properties issues, while guaranteeing theoretical convergence.
The strategy that we propose is the combination of NCNI and SA algorithm to address
the discretization and decoupling issues. This approach results in a new architecture for
trajectory generation and decision making, detailed in the next section.
As presented in the previous section, two strategies occur for combining motion generation
and decision making, which are the two functions in the scope of our work.
In the rst strategy represented in Figure 4.2a, the trajectory (or more generally motion)
generation is rst addressed, and then the decision making. To do so, the evolution space
4.2. All-in-one motion planning architecture 87
is usually discretized in order to propose predened trajectories. Thus, the decision making
function acts as a trajectory selector among the previously predened trajectories to return
the one which has the best score following some criteria. For example in Figure 4.2a, four
predened trajectories (considering both position and velocity) are drawn. The decision-
maker is based on three criteria to select the nal choice. As trajectory 3 has the best score,
it becomes the reference trajectory to follow by the controller.
In the second strategy represented in Figure 4.2b, the decision making is rst applied,
and then the trajectory generation. The decision-maker considers a maneuver selection with
an orientation and a velocity prole. In the case of highway driving, the maneuver can be
reduced to stay on its lane or a left/right lane change, and the velocity prole to maintain
speed, accelerate or brake. A trajectory is then generated in the evolution space, which
corresponds to the selected maneuver. To do so, the trajectory is usually decoupled in a static
prole (position) and a dynamic one (velocity). This two-dimension optimization problem
is solved by optimizing rst the static prole and then the dynamic one, or vice-versa. For
example in Figure 4.2b, the decision-maker selects a left lane change with acceleration. The
ve corresponding way-points are generated to dene the maneuver in static, and a velocity
prole with acceleration is optimized for the dynamic part of the maneuver. Both static and
dynamic parts form the reference trajectory.
(a) Architecture with discretization issue. (b) Architecture with decoupling issue.
Figure 4.2: Illustration of the two classic architectures for motion generation and decision making.
The reachable space-time is dened as the evolution space fullling the following conditions:
(i) it must be consistent with the physical limitations of the vehicle, (ii) it is represented with
position and velocity information, (iii) it is predicted along a space and time horizon, and
(iv) it is collision-free over the prediction horizon or, if a collision has been predicted along
the prediction horizon, it can be avoided through a nominal maneuver, e.g. braking or a lane
changing. The environment is described in subsection 4.3.1 and the NCNI in 4.3.2.
of the rst adjacent lanes only values for the lane change behavior modeling, more lanes can
be considered to locate the ego vehicle on the road for route planning.
In our work, we use a road-aligned coordinate frame under the approximation of straight
lines for highways (the radius of curvature is higher than 1000 m [Reif & Dietsche 2010]).
Nevertheless, dierent methods exist to transform the road coordinate from a global frame to
a road-aligned coordinate frame, for example authors in [Altché & De La Fortelle 2017] apply
a local Frenet frame, and authors in [Hudecek & Eckstein 2014] rst decompose the road into
the three segments straights, curves and arcs, and then transform the segment-aligned world
coordinates into segment-aligned road coordinates.
Figure 4.4: Example of 10 intermediate longitudinal velocity proles with xed acceleration and
deceleration respectively at ax = 2m/s2 and dx = −1m/s2 , bounded velocities vmax = 36.11m/s
(130km/h) and vmin = 22.22m/s (80km/h), and the initial ego velocity v0 = 30m/s (108km/h).
Chapter 4. Optimized Trajectory Planning within Non-Collision Nominal
90 Intervals
The vehicle model used in this work is summarized in Equations 4.1a and 4.1b:
ẍ(t) = ax ,
ẋ(t) = vx (t) = ax t + v0 ,
ax 2
x(t) = t + v0 t + x0 ,
2
(4.1a)
ÿ(t) = ay ,
ẏ(t) = vy (t),
α = arctan( vy ),
vx
under the constraints
ax ∈ [dx,max , ax,max ],
v ∈ [v , v
x min max ],
(4.1b)
ay ∈ [dy,max , ay,max ],
α ∈ [−αmax , αmax ].
q
The notation vego is the global ego velocity, expressed as vego = vx2 + vy2 . As the lateral
velocity vy is usually very small compared to the longitudinal velocity vx on highways, we
approximate vego ≈ vx . Moreover, the condition on the maximum angle αmax is also less
restrictive than the one on the maximum lateral acceleration ay,max on highways, as given in
[Reif & Dietsche 2010]. We thus consider only the constraint on ay,max in the next sections.
• The lane keeping maneuver consists in following the center of the current lane: y(t) =
ycenterline .
• The lane changing maneuver consists in a lane shift from the current lane to the target
lane centerline. Based on the literature review in Chapter 2, the sigmoid function has
been chosen. Even if the clothoids, lines and circles are the usual shape for highway
structure, the mathematical calculation of a clothoid is iterative and leads to a higher
calculation time than the one of a sigmoid. Besides, the properties of the sigmoid func-
tion make it easy to use, as it is bounded, symmetric, dierentiable, and monotonic. We
thus approximate the lane change behavior with sigmoids in Equation 4.2, represented
in Figure 4.5.
b
y(t) = y0 ± −λ(x(t)−c)
, (4.2)
1+e
with b the shift between y0 and the asymptotic end value, which corresponds in our
problem to the target centerline. The sigmoid parameter λ and the distance delay c
are the variables to optimize in our study. They are used for driver safety and comfort,
4.3. The reachable space-time as NCNI 91
which depend on the lateral and longitudinal ego vehicle velocities. The maximum slope
of the curve is mathematically dened to bλ/4 at x(t) = c, as illustrated in Figure 4.5.
Figure 4.5: Characterization of the sigmoid. For illustration, x ∈ [0, 100], b = 1, λ = 0.15, and c = 50. Both
increasing and decreasing sigmoid functions are plotted.
Figure 4.6: Illustration of full lane occupancy obstacles. The striped area shows the front collision space for
obstacle O1 in yellow and obstacle O2 in green, and the rear collision space for O2 in orange. The reminding
rear space of O1 denes the space for interval I1 , the reminding rear space of O2 the space for interval I2 and
the reminding front space of O2 the space for interval I3 .
Figure 4.7: The holistic view of safety. Source Swedish Transport Administration 2010 .
NCNI consist in spatiotemporal intervals, in which the ego vehicle navigates without
collision under nominal conditions. We dene 3 possible characterizations. First, the ego
vehicle passes in front of the obstacle, which is thus considered as a rear obstacle, even if its
initial position was in front of the ego vehicle, as obstacle O3 in Figure 4.8. A rear obstacle
yields a lower bound of distance and velocity, under which a collision is encountered. Second,
the ego vehicle passes behind the obstacle, which is thus considered as a front obstacle, even
if its initial position was behind the ego vehicle, as obstacle O5 in Figure 4.8. A front obstacle
yields an upper bound of distance and velocity, over which there is a collision. Last, if there
is no obstacle, an ego phantom vehicle marks respectively the lower and upper bounds, with
the minimum and maximum velocity of the ego vehicle. This ego phantom vehicle represents
a hypothetic obstacle at the sensors' limit perception range. These bounds delimit the NCNI
between the adjacent obstacles, as depicted in Figure 4.8.
The NCNI characterize where, when, and how the ego vehicle can maneuver along the pre-
diction horizon HP . A classication of the perceived obstacles Oj=1..J within the perception
area returns the intervals to consider in the ego adjacent lanes. For example in Figure 4.8,
the perception area is represented in purple dotted line, 7 obstacles are detected, which form
infI supI
K = 6 intervals Ik=1..K . Each interval Ik is dened by a lower Oj k and an upper Oj 0 k
bound due to a rear obstacle Oj and a front obstacle Oj 0 . In Figure 4.8, I2 is delimited by
4.4. Algorithm architecture 93
Figure 4.8: Example of a 3-lane road with 7 obstacles around the ego vehicle in the perception area, which
dene 6 NCNI.
infI supI
obstacles O2 and O3 . Thus, I2 is associated with (O2 2 , O3 2 ). The open intervals (e.g.
intervals I4 and I6 in Figure 4.8) are bounded with the minimum and maximum dynamics of
the ego phantom vehicle. The ego rear obstacles are not considered as interval bounds, as the
ego vehicle is not allowed to reverse on highway in nominal driving conditions. In Figure 4.8,
lane/car following will consist in nding the non-collision range of velocities inside interval
I1 . A left lane change is only possible if there exists at least one velocity prole to reach
intervals I2 or I3 from interval I1 , respectively I4 , I5 or I6 from interval I1 for a right lane
change.
A spatiotemporal interval is thus dened by a lower and upper bound in distance and
velocity. A denition is given in Equation 4.3:
Ik,d (pi )
Ik =
ˆ , (4.3)
Ik,v (pi ) i=1,N
where pi is an ego prole, N is the total number of ego proles, Ik,d denes the interval Ik in
distance in Equation 4.4a, respectively Ik,v in velocity in Equation 4.4b:
where dinf , dsup , are the inferior, superior, distance values, and vinf , vsup are the inferior,
superior velocity values to characterize the free space of interval Ik for the ego prole pi (see
subsection 4.4.2 for more details on how to obtain the inf and sup values).
To conclude, the NCNI provide information on longitudinal and lateral position and
velocity constraints to avoid a collision.
This section details the assumptions in subsection 4.4.1 and the algorithm diagrams used to
dene the NCNI before the SA-search for the reference trajectory in subsection 4.4.2.
4.4.1 Assumptions
The following assumptions are necessary to delimit the nominal use of the proposed motion
planner.
Chapter 4. Optimized Trajectory Planning within Non-Collision Nominal
94 Intervals
Assumption 1 In the ego lane, only the closest front obstacle is considered, as overtaking
this obstacle will lead to a replanning step.
At least 2 maneuvers are necessary to pass in front of the initial closest front obstacle in the
ego lane. However, one of the strategy of our motion planner is to replan if the situation
changes signicantly, e.g. a lane change or acceleration.
Assumption 2 If the velocities of the closest front obstacles in each considered left, ego, and
right lane within the perception area are below the minimum speed limit, the situation is not
nominal and is out of the scope of this motion planner.
We assign a minimum velocity to navigate on the road in nominal conditions (see subsection
4.3.1.1). If all the closest front obstacles of each lane are below this speed limits, the ego
vehicle can not stay without a collision on any of the ego lane, adjacent right or adjacent left,
as its minimum velocity is higher than the one of the obstacles, and will lead to a collision.
This assumption can be relaxed in some cases, e.g., if the ego vehicle has time and distance
for an adjacent lane change rst and then to reconsider the situation with the new adjacent
lanes.
Assumption 3 Only the ego, adjacent left and right lanes are considered to dene the NCNI.
Indeed, if an obstacle in the second left lane plans to navigate to the ego lane, it will necessarily
pass through the adjacent left lane rst.
The NCNI are considered only in the ego and adjacent left and right lanes, but further
detected obstacles are analyzed for obstacles prediction, specially the ones which plan to
navigate on the NCNI's lanes are considered (see subsection 4.3.1.4). There is no theoretical
limitation to consider more lanes, except for real-time calculation.
Assumption 4 If there is a front collision in one of the directions, the corresponding prole
is discarded in this direction.
Assumption 5 If there is a rear collision in one of the directions all along the prediction
horizon, the corresponding prole is discarded in this direction.
Assumption 6 Rear obstacles in the ego lane are not considered in a general framework.
This assumption can be relaxed in case of specic driver rules, e.g., to pull back the ego
vehicle in the mostright lane.
We will notice in the experimentation Chapter 5 that Assumptions 4 and 5 are too con-
servative, and might be relaxed considering an anticipated horizon.
4.4.2 Architecture
The proposed algorithm has three main parts. First, the ego velocity/distance proles and
obstacle proles are generated, as described in subsections 4.3.1.2 and 4.3.1.4. Second, the
intervals are dened and characterized by comparing the ego and obstacles proles. Last,
the remaining collision-free proles in the sense of the Assumptions 4 and 5 are gathered
through the intervals to dene possible maneuvers and associated time-space limitations.
The architecture of the algorithm is displayed in Figure 4.9.
4.4. Algorithm architecture
95
Figure 4.9: Algorithm diagrams.
Chapter 4. Optimized Trajectory Planning within Non-Collision Nominal
96 Intervals
4.4.2.1 Initial diagram
The initial diagram consists in dening the predicted ego and obstacles motions with the
models described respectively in subsections 4.3.1.2 and 4.3.1.4 (step 0), and generating the
Non-Collision Nominal Intervals (NCNI) in step 0bis.
Although rst and last moments of collision could be evaluated analytically, as it has been
done in [Jula et al. 2000] using simple vehicle's models, our approach discretizes a sample
of N target velocities vt within the road speed limits [vmin , vmax ]. N longitudinal velocity
vi=1:N and position pi=1:N proles are then calculated for the ego vehicle over the prediction
horizon HP . One notices that this discretization step is only used to dene the collision
limits and the interval bounds, but does not stand for the candidate solution to the reference
trajectory, which is calculated in the continuous interval previously dened, see subsection
4.5.1.3.
The longitudinal velocity and position proles, with indication of the direction, of the
J obstacles Oj=1:J within the perception area are generated in vOj=1..J ,dir , pOj=1..J ,dir . As
detailed in subsection 4.3.2 (p. 91), the space between the obstacles form the NCNI. The
intervals Ik are described with their inferior bound based on an obstacle j , Ojinf , their superior
bound based on an obstacle j 0 , Ojsup
0 , and their direction lane, dir ∈ {Straight, Lef t, Right}:
for the ego prole vi , vi0 . The lower (resp. upper) velocity bound is the minimum (resp.
maximum) of the vinf (resp. vsup ) of each existing ego prole: vt ∈ [vi , vi0 ]. The position of
the lane shift for each existing ego prole, i.e. the distance delay parameter c of the sigmoid,
lies within the minimum and maximum, dinf and dsup , from the initial interval Ik and the
target one Ik0 , as expressed in Equation 4.6:
c ∈ [max(dpinf
n
I
, dpinf
n
I
); min(dpsup
n
I
, dpsup
n
I 0
)] (4.6)
k k0 k k
as illustrated in Figure 4.10d with I1 and I2 for ego proles vi , pi in blue and vi0 , pi0 in purple,
i.e. n = i, i0 .
Then, SA optimization, using the evaluation function of Chapter 3 (Jobj,m = Im ), is
applied (step 4) in order to return a near-optimal trajectory solution for each remaining
maneuver m. The nal step 5 selects the near optimal trajectory solution which has the
best evaluation function values Im . This trajectory is the reference trajectory sent to the
controller. Next section details the SA algorithm.
Figure 4.10: Characterization of the intervals I1 and I2 for a left change maneuver. The collisions are depicted
in red, the eliminations in dark red. (a) Ego I1 upper bound: The ego proles pmax , pi , pi0 are in collision with
supI
O1 1 , they are discarded for the direction ego (Assumption 4). (b) Left I2 upper bound: The pmax prole
sup inf
collides with O3 I2 . (c) Left I2 lower bound: Both pmin , pi0 are in collision with O2 I2 . pmin always collides
along HP , so the prole is discarded (Assumption 5). (d) Left change maneuver I1→2 : As pmax collides both
with I1 and I2 , it is discarded. Only pi,i0 remain with [dinf ; dsup ]I1→2 (pi ) and [dinf ; dsup ]I1→2 (pi0 ).
4.5. Simulated Annealing optimization 99
probability). The lower the temperature is, the less worse solutions are accepted (smaller
probability). With a too low, respectively too high, temperature value, the SA becomes like
stochastic hill climbing, respectively random search. Besides, the positive aspect in accepting
worse solution is a more extensive search for the global optimal solution, as well as the ability
to jump out of local optima.
The SA works as follows: (0) selects an initial candidate solution and xes an initial
temperature; (1) selects a random solution close to the current one; (2) evaluates this solution
and decides to accept or reject it if either it is a better local optimum to the objective
function, or it has a non-zero probability of acceptance as a worse solution; (4) decreases the
temperature parameter to decrease the probability of acceptance for a worse solution.
The drawback of SA is the need to be empirically adjusted for each problem. The tuning
of the annealing schedule concerns both the resolution scheme and the denition of the
parameters. More information on the dierent areas of tuning can be found in [Aarts &
Van Laarhoven 1985]. We detail the adaptation of the SA to our problem considering the
architecture choice in subsection 4.5.1.1, the cooling parameters in subsection 4.5.1.2, and
the variables search space in subsection 4.5.1.3.
Acceptance probability function: We use the initial Metropolis rules with the Boltzmann
probabilities to randomly accept a worse solution: P(T ) = e(−∆(Jobj )/T ) .
Cooling schedule: The cooling schedule is the iterative function to decrease the temper-
ature value. Dierent strategies exist such as linear, geometric, logarithmic or adaptive
cooling schedules. The convergence is guaranteed if the temperature decreases in a loga-
rithmic way [Hajek 1988]. However, in practice, the geometric progression, Tn = q·Tn−1 ,
is faster and returns satisfying results. The geometric parameter q is calculated accord-
ing to the nal acceptance probability Pf or temperature value Tf : Pf = e(−Zf /Tf ) ,
Tf = q N · T0 , with Zf a characteristic value at the end of the N iteration process.
Initial solution ξ0 : The initial solution is the current trajectory of the ego vehicle,
4.5. Simulated Annealing optimization 101
respecting Equation 4.11 (p. 102): vt = min(max(v0 , vmin ), vmax ), c = (dinf + dsup )/2,
λ = 4/c.
Neighborhood search: The function generates uniformly random numbers for the cre-
ation of new candidate solutions. The main point is to dene the length of the interval
for the neighborhood search. Indeed, if the search is made in the entire space, the rea-
soning is the one of random search. If the search space is too small, the algorithm will
frequently get trapped in a local minimum too quickly. Moreover, if the length of the
interval is xed, the methods is close to a local random search. In order to maintain
the idea of the annealing process, the length of the search interval is converted in a
decreasing function. We choose arbitrarily a proportionally decreasing interval length
with the temperature: (ub − lb)iter = Titer /T0 · (ub − lb)0 .
Stop criterion: The stop criterion is either xed as a maximum number of iterations,
or when the temperature is below a predened Tf , or at a steady state of the evolution
function, i.e. ∆(Jobj ) ≤ ε.
d2 y dy 2
2d y
= ax + v x (4.7)
dt2 dx dx2
ax λ v 2 λ2
= y(b − y) + x 2 y(b − y)(b − 2y) (4.8)
b b
≤ ay,max (4.9)
Equation 4.7 is obtained according to Schwarz's theorem and chain rule formula for computing
the derivative of the composition of two or more functions with continuous second-order
derivatives functions. Equation 4.8 uses the derivative properties of the sigmoid function.
The two terms in Equation 4.8 are bounded separately, so that the maximum value for
λ respecting Equation 4.9 is approximated by Equation 4.10. More details are written in
Annexe A.2 (p. 167).
√ √
−ax b/4 + ∆ a2x b2 2 3 2
λmax = √ ,∆ = + v bay,max (4.10)
3 2 16 9 x
9 vx b
Thus, the variables search space of ξ for Ik→k0 is illustrated in Figure 4.12 and dened in
Equation 4.11:
vt ∈ [vinf , vsup ]Ik→k0
c ∈ [dinf,vt , dsup,vt ]Ik→k0 (4.11)
4
λ ∈ [ , λmax,vt ]
c
Figure 4.11: Illustration of the sigmoid completion constraint. For λ = 0.15, x = [0, 100], the sigmoid in blue
with c = 50 is complete at 99.4%, whereas the one in orange with c = 90 is incomplete at 81.76%.
4.5. Simulated Annealing optimization 103
Figure 4.12: Illustration of the variables search space. The maneuver interval in (a) returns the velocity space
in (b) to select the target velocity vt1 . The selected target velocity vt1 is used to dene the c-space interval
in (c). The selected ct1 is used to dene the λ-space interval in (d), in which λt1 is selected.
4.5.2 Evaluation
We evaluate the performance of the algorithm by analyzing the convergence to a near-global
optimum and the sensitivity to T0 and q , as well as to the initial variables ξ0 = (vt , c, λ)0 . The
algorithm is run 10 times for each performance test. The evaluation tests are performed for
a representative case study of a left lane change maneuver with one obstacle in the ego front
lane, with a relative velocity vrel,O1 = −5 m/s and a relative distance drel,O1 = 90 m. The
Chapter 4. Optimized Trajectory Planning within Non-Collision Nominal
104 Intervals
ego initial velocity is v0 = 30 m/s, and 15 vt sampled within [vmin=22.22, vmax=36.11] m/s
are chosen.
Figure 4.13 displays the evaluation of this black-box objective function. The choice of vt
inuences all 3 criteria, while λ and c have an eect on Th . One notices that the maximum of
the evaluation function in yellow leads to a higher time headway, a negative relative velocity
(i.e. the ego vehicle is navigating faster), and an ego velocity inside the speed limits of the
road. Besides, the discontinuities from dark blue to green evaluation function at Th ≈ 2 s
is due to the rule-base in Table 3.2. Indeed, the warning levels to pass from Th in unsafe
distance to Th in safe distance for the same relative velocity and ego velocity membership
are not smooth. For example, if relative velocity belongs to negVelocity and ego velocity to
fastVelocity, the warning risk level, which results from the rule-base, is denoted as very high
for unsafe time headway (rule number 39) and low for safe time headway (rule number 42).
Figure 4.13: Evaluation function over the range of 3 variables: relative velocity vrel , time headway Th and
ego longitudinal velocity vx .
4.5. Simulated Annealing optimization 105
4.5.2.2 Convergence
The convergence analysis yields the convergence speed and optimal evaluation value Im
displayed in Table 4.1. One notices that the convergence speed is fast, as the evaluation
function displayed in Figure 4.13 shows similar values for large ranges of variables. This is
due to a conservative evaluation for decision making. Moreover, the optimal values over the
10 runs present a very low standard deviation σ , which indicates that despite the heuristic
calculation, the algorithm always nds a solution close to the global optimum value.
We demonstrate our SA-optimized trajectory generator on the scenario of Figure 4.8 (p. 93).
We detail the scenario values in subsection 4.6.1, the SA parameters in subsection 4.6.2, and
the maneuvers evaluation in subsection 4.6.3.
Moreover, to validate our approach in real-time, 28 computations of the described scenario
are performed on an embedded computer (2.10 GHz Intel Core i7-3612QM CPU, 8 GB RAM)
running a Visual C/C++ Solution File for Matlab/Simulink Coder. The mean ± standard
deviation/minimum/maximum values for the calculation time are 72 ± 10/61/99 ms, which
is satisfyingly fast for a real-time predictive motion planner.
4.6.2 SA parameters
We use the black-box decision function in Chapter 3 with the 3 criteria (relative velocity vrel ,
time headway Th and ego velocity vx ) recalled in section 4.5.2.1 and displayed in Figure 4.13.
The range of the criteria are vrel ∈ [−8, 14] m/s, Th ∈ [0, 3.36] s, and vx ∈ [22.22, 30.16] m/s.
A discretized zoom of the evaluation function is depicted in Figure 4.16. One should notice
that all the combination of the ranges of criteria values are not feasible, which implies that the
maximum value of the evaluation function depicted in Figure 4.16 is not necessary reached
during the simulation.
• Left lane change: to pass from interval 1 to interval 2, I1→2 , or from interval 1 to
interval 3, I1→3 ;
• Right lane change: to pass from interval 1 to interval 4, I1→4 , or from interval 1 to
interval 5, I1→5 , or from interval 1 to interval 6, I1→6 .
In this case study, I1→3 does not allow a left change maneuver, as O3 is driving at the
initial ego velocity, and thus, the distance to reach a higher vt is too low with O1 for a lane
change within the prediction horizon. There is no maneuver for I1→4 neither, as O4 , which
is behind the ego vehicle at the initialization, is driving at the minimum bound velocity of
the ego vehicle. No maneuver are allowed for I1→6 as O1 and O6 have the same velocity and
Chapter 4. Optimized Trajectory Planning within Non-Collision Nominal
108 Intervals
are close to each other, Equation 4.5 is not respected for any of the velocity prole. Only
maneuvers I1→1 , I1→2 and I1→5 remains.
The index proles and velocity bounds for each maneuver, as well as the optimal parame-
ters and evaluation, are summarized in Table 4.3. The corresponding Im lies in [−0.42, 6.00].
The optimization process is thus to nd a trajectory with the maximum value of Im = 6.
Table 4.3: Maneuvers bounds and evaluation.
• I1→1 : The target velocity proles 1 to 7 are valid for staying in the ego lane behind
obstacle O1 without collision. The maneuver is thus to break, which is logic as O1 has
a smaller velocity than the initial ego one set to 30 m/s. The minimum allowed target
velocity is 22.22 m/s and the maximum one is 28.17 m/s. With the evaluation function
Jobj in Figure 4.16, one notices that a relative velocity of ±5 m/s has no inuence.
Moreover, T h is determined with the minimum value along the prediction horizon. In
fact, p = 1..7 are constant and decreasing velocity proles, thus their T h are the same
and equal to T h at the beginning of the simulation. Thereby, only the ego velocity
modies the evaluation function, which is higher with higher ego velocity. Finally, the
best target velocity is 28.17 m/s.
• I1→2 : Maneuver I1→2 is only possible with p = 9 velocity prole at 30.16 m/s. It has
a very low evaluation value due to the small Th to insert between O2 and O3 .
• I1→5 : The target velocities 2 to 5 are allowed for I1→5 , i.e., the target velocity is in
[23.21, 26.19] m/s. As for I1→1 , the higher ego velocity is chosen: vt = 26.19 m/s. It
has a smaller evaluation value than I1→1 as the distances to O5 and O6 , as well as vt ,
are smaller than the ones with O1 , and vrel is higher. But it has a better evaluation
value than I1→2 , as O5 has a smaller velocity and thus a bigger time headway than the
left lane change with O2 .
Finally, the best maneuver with the maximum Im value is to stay in the ego lane I1→1
and adapt the ego velocity for car following at vt = 28.17 m/s.
This chapter presents an all-in-one architecture for trajectory generation and decision mak-
ing, based on a black-box objective function. The method addresses both the problem of
4.7. Conclusion and future work 109
The proposed architecture, as well as the Fuzzy Dempster-Shafer (FDS) decision making
and the Simulated Annealing (SA) optimized trajectory generation, are implemented and
validated in real experiments in the next chapter.
Chapter 5
Contents
5.1 Experimental environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
5.1.1 Environment description . . . . . . . . . . . . . . . . . . . . . . . . . . 113
5.1.1.1 Test tracks . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
5.1.1.2 Ego vehicle prototype . . . . . . . . . . . . . . . . . . . . . . 113
5.1.1.3 Obstacles vehicles . . . . . . . . . . . . . . . . . . . . . . . . 114
5.1.1.4 Reference frames . . . . . . . . . . . . . . . . . . . . . . . . 114
5.1.2 Architecture description . . . . . . . . . . . . . . . . . . . . . . . . . . 115
5.1.2.1 Operating diagram . . . . . . . . . . . . . . . . . . . . . . . 116
5.1.2.2 Perception block . . . . . . . . . . . . . . . . . . . . . . . . 118
5.1.2.3 Planning block . . . . . . . . . . . . . . . . . . . . . . . . . 118
5.1.2.4 Control block . . . . . . . . . . . . . . . . . . . . . . . . . . 120
5.1.2.5 Data sources . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
5.2 Experimental scenarios and results . . . . . . . . . . . . . . . . . . . . . . . . . 127
5.2.1 Scenario 1: Road following, Lane keeping and Lane changing . . . . . 128
5.2.2 Scenario 2: Vehicle following . . . . . . . . . . . . . . . . . . . . . . . 131
5.2.3 Scenario 3: Overtaking one front obstacle . . . . . . . . . . . . . . . . 137
5.2.4 Scenario 4: Overtaking two front obstacles . . . . . . . . . . . . . . . . 142
5.2.5 Scenario 5: Overtaking in front and behind a left obstacle . . . . . . . 146
5.2.5.1 Scenario 5.1: Overtaking a slower ego front vehicle with a
slower rear left vehicle . . . . . . . . . . . . . . . . . . . . . 146
5.2.5.2 Scenario 5.2: Overtaking a slower ego front vehicle with a
blocking left vehicle . . . . . . . . . . . . . . . . . . . . . . . 150
5.2.6 Algorithm performances . . . . . . . . . . . . . . . . . . . . . . . . . . 153
5.3 Conclusion and future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155
This chapter presents the results of the experimental tests performed to validate the
motion planning approach proposed in this thesis. The experimental platform is the proto-
type vehicle of Institut VEDECOM navigating on the Versailles-Satory test tracks, France.
The validation considers for dierent scenarios, from a simple lane following to a double
lane changing with one or two obstacles. All the scenarios had been previously tested in
simulation. Our solution generates a trajectory for the studied situations of lane follow-
ing, car following, lane changing and overtaking with dynamic obstacles, without any prior
knowledge. The developed algorithm has been integrated with localization, communication,
obstacles perception, and control modules.
112 Chapter 5. Experimentation and Results
This chapter is structured as follows: First, section 5.1 describes the experimental envi-
ronment used to test our motion planning algorithm. The tested scenarios with results are
presented in section 5.2. The conclusions of the performed experiments are summarized in
section 5.3.
5.1. Experimental environment 113
This section describes the experimental environment and architecture, respectively in sub-
sections 5.1.1 and 5.1.2, used to test and implement our motion planning algorithm.
Figure 5.1: Versailles-Satory high speed tracks, 78000 Versailles, France. [credits: GoogleMaps]
The road input for our algorithm is modeled oine as a 2-lane straight road, 1.2 km long,
with a constant lane width equal to 3.7 m and constant road and speed limits.
• Ultra High Frequency (UHF) communication module to receive localization data from
other vehicles.
The algorithms run on an embedded computer (2.10 GHz Intel Core i7-3612QM CPU 8
GB RAM).
• World frame W : Geographic coordinate system dened by the latitude, longitude and
orientation to the East. The (latitude, longitude) coordinates are then transformed into
a 2-D frame (x, y)W .
5.1. Experimental environment 115
• Road frame R : As specied in subsection 5.1.1.1, the road used for the tests can be
approximated as straight with a constant lane width. The road frame is therefore
created by arbitrarily setting a point (x0,R , y0,R ) = (0, 0) on the right mark at the be-
ginning of the demonstration portion, which corresponds to the W -coordinate latitude
Lat0 = 48.78513748◦ , longitude Long0 = 2.100552545◦ , orientation θ0 = 158.40035◦ .
The x-orientation of the road frame is aligned with the mostright lane mark. The
y -coordinate is positive in the direction of the mostleft lane mark.
• Ego frame E : This frame is xed in the middle of the rear axle tree, with the x-direction
following the orientation of the ego vehicle and the y -direction to the left side of the
ego vehicle. The angle with R is dened by α in Figure 5.3.
• Obstacle frame O : It has the same properties as E by replacing the ego vehicle with
an obstacle vehicle.
The coordinate transformations for each of the frames are the following:
• W → R: (
xR = xW cos(−θ0 ) + yW sin(−θ0 )
(5.1)
yR = −xW sin(−θ0 ) + yW cos(−θ0 )
• R → E: (
xE = xR cos(α) + yR sin(α)
(5.2)
yE = −xR sin(α) + yR cos(α)
• O → E: (
xrel,Oj E = xOj ,R − xego,R
(5.3)
yrel,Oj E = yOj ,R − yego,R
Ego data The ego data longitudinal and lateral positions and heading angle are acquired
by the IXBLUE ATLANS-C system (SocketAtlans and AtlansDecoder blocks) in the world
frame W . They are vectorized into 1 ego vector (Ego_Vect) and transformed through
AtlansToXYV_1 block into the road frame R as a vector with the longitudinal and lateral
positions and longitudinal velocity χego = (xego , yego , vego )R , which is then sent to the motion
planner block.
Communication obstacle data The longitudinal and lateral positions and heading angle
acquired from the IXBLUE ATLANS-C system for the rst obstacle O1 are sent to the
ego vehicle using the UHF communication module (SerialPort_UHF, Stream2Text_1, and
Vedecom_UHF_Decoder_1). As for the ego data, this information is transformed through
AtlansToXYV_2 block from W into R as χO1 = (xO1 , yO1 , vO1 )R , and sent to the block
Obst_Vectorizer.
LIDAR detection obstacle data The data for the second obstacle O2 are obtained from
the LIDAR sensors with a 4 layers Ibeo fusion box (FusionBox and IbeoEcuDecoder_1
blocks). The 3D LIDAR data are XYZ point clouds obtained in the ego frame E . Only
data from the second bottom layer are selected (custom_cutter_1) in order to send 2D data
XY to the in-house algorithm VEDECOM_DeTroLi_Transdev_1. DetroLi algorithm detects
obstacles as points clouds and returns ego relative bounding box around obstacles in E co-
ordinate systems. The obstacle of interest O2 is then manually selected in DetroliFilter_1
and transformed into W coordinates using the ego coordinates in W . Finally, its coordinates
in W are transformed through AtlansToXYV_3 block into R as χO2 = (xO1 , yO1 , vO1 )R , and
sent to the Obst_Vectorizer block.
Obstacles The obstacles vectors χO1 and χO2 are vectorized (Obst_Vectorizer), with a
simulated third obstacle (Obst) not used in our experimentations, into 1 output obstacle
vector, which is sent to the motion planner.
Planning The developed motion planner (Planner) with the decision-maker and trajec-
tory generator uses χego , χO1 , χO2 , and the tuning parameter (PP_Params) to calculate the
reference trajectory over the prediction horizon Hp in E : χref = (x(t), y(t), v(t))E ,(t=1..Hp ) .
5.1. Experimental environment
Figure 5.4: RTMaps diagram. The ego, obstacle communication, and obstacle perception data are issued respectively from the green, blue, and purple areas.
All the obstacles data are gathered in the gray area to return one obstacle vector. The motion planner is then activated in the orange area and returns the
reference trajectory to the controllers in the yellow area.
117
118 Chapter 5. Experimentation and Results
Control The reference trajectory is sent to the lateral angle controller (Control_Angle) and
the reference velocity vector is sent to the longitudinal velocity controller (Control_speed).
The reference velocity is sent to the controller block if (Condition_Velocity) its velocity
value at 1.5 s ahead (see subsection 5.1.2.4 for more details) is higher than the threshold
condition (Cond_1 equal to 3.6 km/h). Then the Control_speed block calculates the accel-
eration and breaking torques to send to the actuators (PilotageProvel_1) according to the
control reference velocity value, the current ego velocity and the current breaking pressure,
issued from the Controller Area Network (CAN bus: CANPeak44_CanV and CANDecoder_1),
otherwise the value 1 m/s is sent. The reference trajectory and the current ego velocity
are sent to the lateral angle controller block to return the steering angle command. If
(Condition_Lateral) the current ego velocity is higher than 7.2 km/h, this steering an-
gle command is sent to the actuators (PilotageProvel_1), otherwise the value 0 rad is sent.
The condition values are used at the initialization step to activate the motion planner only
at nominal driving conditions.
Figure 5.5: DeTroLi viewer. The ego vehicle is the gray rectangle with red edges, the tracked obstacle is
number 722.
and obstacles vector χOj = (xOj , yOj , vOj )E . It uses these information to return the reference
trajectory over the predicted horizon Hp , χref = (x(t), y(t), v(t))E ,(t=1..Hp ) to the control
block. The trajectory is planned over a time horizon Hp = 10 s with a time step ∆t = 0.05 s,
generating 200 coordinates in E starting at the current state of the ego vehicle. As explained
in subsection 5.1.1.3, we consider maintaining direction and velocity for obstacles prediction.
The pseudo-code of our algorithm is presented in Algorithms 4, 5 and 6. In addition
to the assumptions for the nominal use in subsection 4.4.1 (p. 93), we dened the following
experimental assumptions for the integration of our algorithm:
Assumption 7 The motion planner is triggered every 60 ms, which corresponds to the max-
imum calculation time in simulation, or if an obstacle enters the safety space as dened in
Figure 5.6.
Assumption 8 The motion planner calculates a new trajectory as soon as one of the sur-
rounding obstacles enters the safety space around the ego vehicle, as illustrated in Figure 5.6.
Assumption 9 The motion planner is in a default mode if it can not nd a valid solution
or if there is an obstacle detected inside its collision space, as illustrated in Figure 5.6.
The default mode is a non-nominal mode. In our experimentations, the default reference
trajectory corresponds to stopping in the right road limit with a constant longitudinal de-
celeration dx = −0.7 m/s2 , i.e. χref,def ault = (0, 0, 0). This default trajectory is thus easily
identiable by the driver.
Assumption 10 While the ego vehicle changes lane, the motion planner does not make any
decision.
This assumption has to be removed in the future, so that the motion planner is reactive to
its surroundings.
120 Chapter 5. Experimentation and Results
Assumption 11 If there is no obstacle, the two authorized maneuvers are either to maintain
direction and velocity or to move to the right lane according to the French Driver Rules.
Figure 5.6: Safety and collision spaces around the ego vehicle.
For the experimentation, we use the Mamdani Fuzzy Inference System (FIS) from category
3 Obstacle safety, in Table 3.1 (p. 55). Its properties are written in Table 5.1, its membership
functions µ are plotted in Figure 5.7, its output values are plotted against 2 outputs in
Figure 5.22, and its rule-base is depicted in Table 3.4 (p. 78).
Based on the simulation tests in section 4.6, the simulated annealing (SA) algorithm
implemented for the experimental results and presented in Algorithm 3 (p. 101) has the
following parameters: Niter = 200, T0 = 0.6213, Tf = 0.01, and P0 = 20% with Z0 = 1.
Lateral controller The lateral control is a proportional gain (P) on the lateral gap egap
and the cap angle αcap to return the steering angle command αc . The control low is expressed
in Equation 5.4.
pref and pref −1 . The target point ptarget is interpolated within [pref −1 pref ]. The lateral
distance between ptarget and y -axis in E is equal to egap and the angle formed by the segment
[pref −1 pref ] with the x-axis in E is equal to αcap expressed in Equation 5.5:
!
ypref ,E − ypref −1 ,E
αcap = atan . (5.5)
xpref ,E − xpref −1 ,E
else
Discard direction;
Calculate the best evaluation indicator among the remaining directions;
Return best trajectory;
else
Update the current lane change trajectory;
/* 2. Check trajectory safety */
/* 2.1 Stay within road limits condition */
/* 2.2 Exist trajectory condition */
if no 2.1 || no 2.2 then
Return default trajectory;
else
Return the best trajectory;
/* 3. Transform the road frame trajectory in the ego frame trajectory
*/
Transform the reference trajectory R → E ;
124 Chapter 5. Experimentation and Results
Type: Mamdani
AndMethod: min
OrMethod: max
ImplicationMethod: min
AggregationMethod: max
Inputs: 3
Outputs: 1
Rules: 48
To validate our approach, we consider the following highway driving situations among the
ones listed in subsection 2.1.3 (p. 18): lane keeping, car following, lane changing, lateral-
most lane changing, passing, and overtaking. We distinguish 6 scenarios to propose realistic
experimental conditions:
• Scenario 1: Road following, Lane keeping and Lane changing in subsection 5.2.1;
• Scenario 5: Overtaking the front obstacle with merging (1) in front of, and (2) behind,
the left obstacle in subsection 5.2.5.1, respectively 5.2.5.2.
For each scenario, we rst present its goals and constraints, and then analyze the experi-
mental data in order to evaluate and validate our algorithm. We run each scenario multiple
times to verify its repeatability. As each scenario was actually repeatable, only one run per
scenario is analyzed in the following. The algorithm performances are also given in subsection
5.2.6.
The experimental results are presented in two gures: the ego trajectories and limits, and
the relative velocities and positions with the motion planner behavior. The rst gure is split
into four plots:
(a) shows the reference χref , ego χego , and obstacles χO1 , χO2 trajectories in space and
time in the road frame R ;
(b) gives the evolution of the velocities with time: the driver desired velocity vwish , the
road speed minimum vmin and maximum vmax limits, the reference velocity vref from
χref , the ego velocity vego , and the obstacles velocities vO1 , vO2 ;
(c) displays the ego accelerations evolution in time with the longitudinal ax and lateral
ay ego accelerations. The attainable longitudinal accelerations given in [Reif & Di-
etsche 2010] lie within the range of 0.4 − 1.4 m/s2 . We thus considered a reference
longitudinal acceleration, respectively deceleration, value of ax,ref = 0.9 m/s2 , respec-
tively dx,ref = −0.7 m/s2 . In [Reif & Dietsche 2010], the range for lateral acceleration
lies within [−2, 2] m/s2 . These limits are represented with dy,lim and ay,lim ;
(d) provides the time evolution of the steering angle and the yaw rate.
(a) shows the relative velocities between the ego vehicle and O1 , vrel,O1 , and O2 , vrel,O2 .
(b) gives the longitudinal relative distances between the ego vehicle and O1 , xrel,O1 , and
O2 , xrel,O2 . The ego vehicle presents safe distances with obstacle Oj , if their relative
longitudinal distance xrel,Oj in front, respectively rear, is bigger than the front safe
128 Chapter 5. Experimentation and Results
distance df,saf e , respectively the rear safe distance dr,saf e . Similarly, the ego vehicle
is in collision with obstacle Oj , if their relative longitudinal distance xrel,Oj in front,
respectively rear, is smaller than the front collision distance df,collision , respectively the
rear collision distance dr,collision . Overcoming the safe distances leads the triggering of
the motion planner. Overcoming the collision distances leads the default mode.
(c) displays the lateral relative distances between ego vehicle and O1 , yrel,O1 , and O2 ,
yrel,O2 .
(d) plots the motion planner ags to verify a situation of collision, the decisions of lane
changing and navigating back to the right lane, if there is no solution, and if the ego
vehicle is in default mode navigating o the road limits.
Figure 5.11a shows that the reference trajectory χref is well followed by the ego vehicle
χego both in space and time. The longitudinal distance delay is less than 2 m (data points 1
and 2), the lateral error less than 20 cm (data points 1 and 2), and the time delay less than
200 ms. Moreover, one notices an oset to the right side of the road on the lateral position.
This is due to the deformation of the road with curved borders. This is further conrmed in
Figure 5.11d, where the mean steering angle is shifted to a positive value. In fact, the angle's
control compensates the rightward road camber by pushing the vehicle to the left side.
5.2. Experimental scenarios and results 129
In Figure 5.11b, the initial vwish,t0 is xed at 10 m/s, then decreases to vmin = 9 m/s
at t2 = 12.84 s, and increases at vmax = 12 m/s at t3 = 26.1 s. Its value exceeds the speed
limits at t4 = 57.06 s with vwish,t4 = 15 m/s. Conversely, it is below the speed limits at
t5 = 69.36 s with vwish,t5 = 6 m/s. The reference velocity vref follows the changes of vwish ,
while maintaining the reference velocity within the speed limits at t4 and t5 . Besides, vref is
well tracked by the controller, as vego ≈ vref along the entire scenario. There are response
delays around 100 ms in acceleration (data points 3) and 350 ms in braking (data points 4),
as well as an overshoot up to 1.7 m/s in acceleration (data point 5) for the velocity tracking.
This is due to the fact that the tracked velocity is taken as the value of vref 1.5 s ahead in
order to prevent oscillations, which causes a time delay in the controller response, and to the
algorithm latency and the dynamic response of the vehicle. Nonetheless, these discrepancies
(response delay and overshoot) remain acceptable. Furthermore, to avoid an important gap
to the velocity controller, vref follows the overshoot of vego to decrease it to vwish with a
feasible braking action.
As noticed in Figure 5.11c, the longitudinal acceleration is bigger than the one xed by
the reference trajectory ax,ref , but remains under the maximum comfort value set to 3 m/s2
by [Reif & Dietsche 2010]. However, no braking action is clearly observed; the vehicle is
coasting. The dierence in acceleration is mainly attributed to the fact that the control is
performed on the velocity and not on the acceleration, and to the latency of the dynamics
of the ego vehicle and the implemented architecture. The braking control has a very limited
action in our scenario because the test vehicle had a braking control based on large step
commands. As our motion planner tried to follow the dynamic of the vehicle, its braking
reference was too smooth to be considered by the controller. This control law also explains
why overshoots are perceived during acceleration but not during the braking phases.
Finally, the road camber compensation is observed in the rightward-shifted mean steering
angle values in Figure 5.11d.
The second situation analyses a simple lane change to the right lane without obstacle.
Figure 5.12a shows the lane change trajectory in time for a reference velocity vref = 9.92 m/s.
The time delay along the trajectory is very short, less than 200 ms at the road centerline,
which corresponds in a distance delay less than 1.6 m (data points 1) and a slight overshoot
of 0.25 m to the right at the end of the lane change (data point 2). Figure 5.12b conrms
the proper tracking of vref . Figures 5.12c and d show the feasible lateral dynamics of the
ego vehicle, where ay is within [dy,lim , ay,lim ] and both the steering angle and yaw rate are
reasonable.
To summarize, the performances of both the motion planner and the controllers are eval-
uated good enough for the validation of our algorithms. Besides, the controller performances
stated in this rst scenario are expected for all scenarios. Since the control design is not in
the scope of the present work, we do not pretend to provide a better controller for the next
scenarios. Please remind the following performances of the controllers:
• the response is stable for both the velocity and angle controllers;
• the steady-state error is very low: the velocity controller has no signicant error
(< 0.3 m/s) and less than 20 cm for the lateral controller (which is due to the road
camber);
130 Chapter 5. Experimentation and Results
Figure 5.11: Trajectory, velocity and dynamics analysis. In (a), the reference trajectory χref is in dashed
line, the ego trajectory χego in solid line and the obstacles trajectories χO1 , χO2 in dashed-dotted lines, if
they exist; in (b) the driver wish velocity vwish is in cyan, the road minimum vmin and maximum vmax speed
limits in dotted and dashed red, the reference velocity vref in solid green, the ego velocity vego in solid blue,
and the obstacles velocities vO1 and vO2 in dashed orange and purple, if they exist; in (c) the ego longitudinal
and lateral accelerations ax and ay are in solid blue and red, the reference longitudinal acceleration and
deceleration ax,ref and dx,ref in dotted and dashed-dotted cyan, and the lateral acceleration and deceleration
limits ay,lim and dy,lim in dotted and dashed-dotted magenta; in (d) the steering angle is in blue and the yaw
rate in orange. [Scenario 1]
• the overshoot on the longitudinal velocity is between 5−30%, and 0−10% for the lateral
position, which represents up to 1.5 m/s in velocity and 25 cm in lateral position;
Figure 5.12: Trajectory, velocity and dynamics analysis for a lane changing. [Scenario 1]
1.5 s ahead for the velocity control and the angle control in order to avoid oscillations
and over-reaction if the tracking was done on the next time-step value.
regardless the velocity of our vehicle vego and the relative speed with obstacles vrel,Oj . As the
maximum allowed velocity in autonomous mode is 50 km/h = 13.9 m/s, this collision distance
is overestimated with respect to the 2 s rule. In this scenario, we xed a safety distance twice
bigger than the collision distances, i.e. df,saf e = 2 ∗ df,collision and dr,saf e = 2 ∗ dr,collision .
The front moving obstacle O1 has its initial velocity v01 ,t0 < vego,t0 . Along the scenario,
we modied vO1 in order to check that the ego vehicle adapts its velocity to the front vehicle.
At the beginning, the obstacle is far enough, out of the safety space of the ego vehicle. The
ego vehicle is in the situation of road following. At t1 , the obstacle enters the safety zone
of the ego vehicle. The ego vehicle has to slow down and adapts its velocity to the one of
the obstacle vehicle vO1 ,t1 . At t2 , the obstacle velocity is reduced to vO1 ,t2 < vO1 ,t1 . At t4 ,
the obstacle accelerates above the maximum speed limit, vO1 ,t4 > vmax . The ego vehicle can
then accelerate and reach its wish velocity vwish .
This scenario is a tedious task for the motion planner for two reasons: (i) in this situation,
the motion planner is triggered at each iteration as the obstacle permanently enters the safety
space, (ii) the range of action is very limited to respect the three decision criteria (higher
speed, low relative speed, high time headway). We present a nominal behavior of the motion
planner and a more critical one, for which we show the limits of our algorithm.
Nominal case: At the beginning of the scenario, the longitudinal relative distance xrel,O1
between the ego vehicle and O1 is bigger than the front safe distance df,saf e (data points
1 in Figures 5.14a and 5.15b). O1 enters the safety space at t1 = 22.44 s (data point 2 in
Figure 5.15b). Consequently, the ag collision is activated (data point 2 in Figure 5.15d) and
the motion planner sends a decelerating vref , so that the ego vehicle brakes (data point 2 in
Figure 5.14b). The ego velocity vego then decreases until it reaches the velocity which allows
to maintain the relative distance xrel,O1 above the collision distance df,collision (data area 3 in
Figures 5.15a and b). The vref prole is the best compromise of the 3 decision criteria: time
headway Th , relative velocity vrel,O1 , and ego velocity vego , see subsection 5.1.2.3. Logically,
vref follows vO1 as depicted in data area 3 in Figure 5.14b.
At t2 = 50 s, O1 increases its velocity up to 10 m/s. Its dynamics can be followed by
the ego vehicle, thus the motion planner adapts vref to vO1 (data area 4 in Figure 5.14b).
When vO1 decreases at t3 = 64 s, the motion planner sends a decreasing vref too (data area
5 in Figure 5.14b). As the dynamics of O1 is steeper than that of the ego vehicle, one notices
that xrel,O1 decreases too (data area 5 in Figure 5.15b). When O1 increases its velocity at
5.2. Experimental scenarios and results 133
t4 = 83 s over the maximum speed limit vmax = 12 m/s, the ego vehicle can also accelerate
to its initial wish velocity vwish = 11 m/s. If the obstacle velocity is over the maximum speed
limit vmax (at t5 = 85 s), vref plateaus at vwish = 11 m/s, as seen in Figure 5.14b. O1 leaves
the safety space at t6 = 89.58 s (data points 6 in Figures 5.15b and d).
Moreover, the reference trajectory is well followed in Figure 5.14a. One notices a 10 cm-
shift to the right due to the road camber. Slight overshoots (≈ 1m/s) on the velocity control
are shown in Figure 5.14b, as explained in scenario 1.
In this scenario, the motion planner has enough time and distance to adapt well to
the obstacle dynamics. In Figure 5.15d, one notices that the no solution ag is always
inactivated, even if the collision ag is activated.
Finally, the longitudinal and lateral accelerations and angle constraints for both safety
and comfort are respected as shown in Figures 5.14c and d.
Critical case: In the critical case, the minimum speed limit vmin is set to 8 m/s, which is
the velocity of O1 at the beginning of the scenario. As a result, there are very small velocity
(< 1 m/s) and distance (< 50 m) ranges for the ego vehicle to maintain xrel,O1 > df,collision
(Figure 5.16b). Besides, the motion planner only selects a velocity prole that does not
present any collision state along the prediction horizon Hp . Consequently, if the relative
velocity and distance do not allow the ego vehicle to reach a collision-free velocity prole
before O1 enters the collision space xrel,O1 < df,collision , none of the allowed velocities are
safe. This inconvenience is emphasized with the poor braking capacity of the ego vehicle in
our experiments. In this critical case, we thus observe that O1 enters df,saf e at t1 = 20.46 s
and df,collision after t2 = 36.18 s (data points 1 and 2 in Figures 5.16b and d). The motion
planner has diculty maintaining this collision distance, as a safe velocity prole does not
always exist. This is represented by the no solution ag in area 2 in Figure 5.16d. In
practice, this situation leads to the default mode, with an emergency trajectory tending
towards yref,R = 0 m with a reference velocity prole decreasing to 0 m/s. However, as
soon as the collision distance is recovered, the motion planner replans a collision-free velocity
prole. O1 leaves the safety space at t3 = 74.58 s (data points 3 in Figures 5.16b and d).
To conclude, the condition of collision-free ego prole along the prediction horizon may be
too strong for real life application, as observed in the critical case. Furthermore, the collision
distance should be taken proportional to the ego and relative velocities in future work.
134 Chapter 5. Experimentation and Results
Figure 5.14: Trajectory, velocity and dynamics analysis. The position outliers of O1 are due to errors in the
communication blocks. [Scenario 2]
5.2. Experimental scenarios and results 135
Figure 5.15: Relative longitudinal velocity, longitudinal and lateral positions and ag analysis. The position
outliers of O1 are due to errors in the communication blocks. In (a), the relative velocities vrel,O1 and vrel,O2
are in orange and purple, if they exist; in (b), the longitudinal relative distances xrel,O1 and xrel,O2 are in
orange and purple, if they exist, the safe front df,saf e and rear dr,saf e distances in dotted red and yellow, the
front df,collision and rear dr,collision collision distances in dashed red and yellow; in (c), the lateral relative
distances yrel,O1 and yrel,O2 are in orange and purple, if they exist; in (d) the motion planner ags denote
collision in blue, lane changing in red, right lane decision in yellow, no solution in purple and navigating o
the road limits in green. [Scenario 2]
136 Chapter 5. Experimentation and Results
Figure 5.16: Relative longitudinal velocity, longitudinal and lateral positions and ag analysis in critical case.
The position errors of O1 are due to errors in the communication blocks. [Scenario 2]
5.2. Experimental scenarios and results 137
When O1 is detected below the safety distance df,saf e at t1 = 12.6 s (data point 1 in Fig-
ure 5.20b), the motion planner sends a True collision ag value (data point 1 in Figure 5.20d).
Then, a new reference trajectory χref is calculated. As the left lane is free of obstacle, the
best trajectory in order to respect the 3 decision criteria is a left lane change maneuver, as
indicated by the lane change ag in Figure 5.20d (data point 1) and illustrated with χref
in Figure 5.19a (data area 1). This decision is made immediately at t2 = t1 = 12.6 s. The
selected reference velocity for this maneuver is set at vref = 11 m/s (data point 1 in Fig-
ure 5.19b). One notices that vref is not equal to vwish . This is due to the fact that the fuzzy
logic returns a at evaluation function as explained in Figure 5.22 and the Simulated Anneal-
ing architecture returns the rst global minimum encountered. Indeed, the FIS presents a lot
of constant warning areas. This means that the evaluation function will also presents these
constant areas which are similar to local minima. Moreover, the evaluation of two solutions
in the constant areas will return the same value. This can be an inconvenient when searching
for the globally optimal solution. A set of close trajectories (i.e. for which the variables
present little variation from one trajectory to another) is evaluated as near-optima. How-
ever, the non-uniqueness of a solution is still acceptable in our experiments for autonomous
driving, because only the obstacle safety category is considered here. This particularity will
also inuence the neighborhood search in the simulated algorithm method: (i) it is important
that the neighborhood search is large at high temperature, so that candidate solutions from
dierent constant areas can be tested, and (ii) it appears useless to continue the SA search
in a small neighborhood over a constant warning areas. The integration of the category 5
Reference wishes from Table 3.1 (p. 55) in the decision-maker can solve this issue.
Besides, this rst reference lane change is well followed (Figure 5.19a), and the collision
distances are maintained as long as O1 is on the ego lane (Figures 5.19a and 5.20b).
When the lane change is over at t3 = 20.1 s (data points 2 in Figures 5.20c and d),
the reference behavior consists in maintaining vwish in the straight direction (data area 2 in
Figure 5.19a and data point 2 in Figure 5.19b). Then, once the ego vehicle has passed O1
(data points 3 in Figure 5.20b and c), the right choice ag is activated at t4 = 39.78 s
(data points 3 in Figure 5.20d), which means that the motion planner is triggered to return
a reference trajectory to navigate in the right lane. As vego > vO1 , the best maneuver is a
rightward lane change ahead of O1 . This is decided and initiated at time t4 . Indeed, the
motion planner has enough rear distance (xrel,O1 > dr,collision ) to maneuver and navigate
back on the right lane at t5 = 48.96 s (data points in Figures5.20b, c and d). This second
lane change is also well followed (data area 4 in Figure 5.19a).
Moreover, the lateral dynamics limits are respected for both the acceleration and angle
in Figures 5.19c and d. In Figure 5.19d, one observes the left deviation of the steering angle
when the ego vehicle is on the right lane (data areas 5 and 6) and the right deviation when
the ego vehicle is on the left lane (data area 7) due to the road camber.
In Figure 5.21, we notice that the calculation times for the 2 maneuvers (data points 1 for
left lane changing and 2 for right lane changing) are below 50 ms, which proves the real-time
eectiveness of our algorithm.
To conclude, we notice that the fuzzy logic tuning is important to return the expected
behavior (e.g. driver wishes). Our evaluation function, as represented in Figure 4.13 (p. 104),
presents at areas. This means that several dierent trajectories have the same evaluation
function. According to the architecture of the Simulated Annealing algorithm, the output
5.2. Experimental scenarios and results 139
Figure 5.20: Relative longitudinal velocity, longitudinal and lateral positions and ag analysis. [Scenario 3]
Figure 5.21: Calculation time. The peaks 1 and 2 correspond to the time step at which the motion planner
is triggered. The two other peaks are outliers in the motion planner proceedings. [Scenario 3]
5.2. Experimental scenarios and results 141
Figure 5.22: FIS output membership degree for the 3 criteria. Top: variations of relative velocity and time
headway with xed vego = 30 km/h. Middle: variations of relative velocity and ego velocity with xed
Th = 2 s. Bottom: variations of time headway and ego velocity with xed vrel = 0 km/h. The darker and
higher warning is, the riskier it is, and vice versa.
142 Chapter 5. Experimentation and Results
value is the rst trajectory with the best evaluation function. Thus, it might be interesting to
test our algorithm when considering the category 5 Reference wishes to verify that the best
velocity corresponds to the wish velocity vwish when this latter features a global optimum
too.
O1 is detected by the LIDAR and O2 communicates its positions and velocity (see sub-
section 5.1.2.2 for more details). As the LIDAR obstacle detection is stable within a distance
of 50 m, the initial vwish is set to 8 m/s (data point 1 in Figure 5.25b), df,saf e = 50 m,
df,collision = 20 m, dr,saf e = 37.5 m, dr,collision = 15 m in order to have time for manually
selecting O1 in DeTroLi algorithm. O1 is thus detected with xrel,O1 = 58.1 m (data point 0 in
Figure 5.26b). When xrel,O1 is below df,saf e at t1 = 50.28 s (data points 1 in Figures 5.26b, c
5.2. Experimental scenarios and results 143
and d), the motion planner calculates a new reference trajectory χref , and decides a left lane
change with vref = 8.359 m/s (data area 2 in Figure 5.25a and data point 2 in Figure 5.25b).
This lane change is very aggressive with only 30 m to nd the sigmoid distance delay c. We
observe that χego has an overshoot of 40 cm and 4.5 m longitudinal error, which corresponds
to 720 ms time delay.
At the end of left lane change t3 = 55.44 s (data points 3 in Figures 5.26c and d), we
decide to change vwish at 11 m/s (data point 3 Figure 5.25b).
When the ego vehicle reaches the left lane, O2 becomes visible. The motion planner
determines that there is not enough space to return to the right lane and discards the right
lane change maneuver. In Figure 5.26d, the right choice ag is not activated.
When the ego vehicle has passed O2 at t4 = 71.1 s (data point 4 in Figure 5.26b), the
motion planner searches and nds a trajectory to go back to the right lane in front of O2
(data points 4 in Figures 5.26d), with vref = 11.28 m/s (data areas 4 in Figures 5.25a and
b). The right lane change ends at t5 = 80.58 s with xrel,O2 = 48.76 m bigger than dr,collision
(data points 5 in Figures 5.26b, c and d).
Finally, the longitudinal and lateral accelerations and angle constraints for both safety
and comfort are respected (Figures 5.25c and d).
144 Chapter 5. Experimentation and Results
Figure 5.26: Relative longitudinal velocity, longitudinal and lateral positions and ag analysis. [Scenario 4]
146 Chapter 5. Experimentation and Results
• If O2 is behind the ego vehicle and its relative velocity vrel,O2 and longitudinal dis-
tance xrel,O2 are sucient for the ego vehicle to pass in front of it and behind O1 , see
subsection 5.2.5.1;
• If O2 is behind the ego vehicle but vrel,O2 and xrel,O2 are not sucient for the ego
vehicle to pass in front of it, or if O2 is in front but it blocks a lane changing for the ego
vehicle because vrel,O2 and xrel,O2 are not advantageous in comparison to stay in the
ego lane. In this situation, the ego vehicle has to wait for sucient vrel,O2 and xrel,O2
in order to make a lane change and pass O1 behind O2 , see subsection 5.2.5.2.
For both scenario the evolution space is decomposed into three intervals: straight front
I1 , left front O2 for I2 and left rear O2 for I3 for the rst lane change, see Figure 5.27a;
straight front I1 , right front O1 for I2 and right rear O1 for I3 for the second lane change to
go to the mostright lane after overtaking O1 , see Figure 5.27b.
5.2.5.1 Scenario 5.1: Overtaking a slower ego front vehicle with a slower rear
left vehicle
This rst maneuver is depicted in Figure 5.28. O1 is in front ego lane and O2 is rear on
the left lane. When O1 enters the safety area at t1 , the motion planner is triggered. As
vwish > vO1 and vwish > vO2 , the best decision is to pass O1 in front of O2 , whose relative
velocity and position guarantee rear safety. This decision is initiated at t2 . The lane change
ends at t3 . The ego vehicle passes O1 until it can go back into the right lane in front of O1 ,
as the relative velocity and distance with O2 are sucient not to be in collision.
O1 is detected at t1 = 18.3 s and is already below df,saf e (data points 1 in Figures 5.30b
and d). The motion planner is triggered. However, vO1 is calculated equal to 0 m/s (data
points 1 in Figures 5.29b and 5.30a) and xrel,O1 < df,saf e . Consequently, there is no solution:
no velocity prole for lane following, nor place for lane changing; χref is the default trajectory
5.2. Experimental scenarios and results 147
(data area 2 in Figures 5.29a and b) and the ag offroad is activated (data point 2 in
Figure 5.30d). Nonetheless, as xrel,O1 is still below df,saf e , the ag collision is still activated
(data areas 2 in Figures 5.30b and d). Thus, the motion planner is triggered every time-step
to nd the best χref . For t = [18.36, 18.78] s, χref is staying on the ego lane. Indeed, vO1 is
over the maximum speed limit vmax (data area 3 in Figure 5.29b). Staying on the ego lane
or left lane changing are equivalent.
At t2 = 18.84 s, the motion planner nds a better evaluation function Ip for lane chang-
ing with vref = 11.12 m/s (data points 4 in Figures 5.29b and 5.30d). The ego vehicle
overtakes O1 in front of O2 while maintaining the collision distances, respectively df,collision
and dr,collision (data area 4 in Figure 5.29a and data points 4 in Figures 5.30b and c). When
on the left lane, vref = vwish = 11 m/s (data point 0 in Figure 5.29b). When O2 is behind
the ego vehicle on its ego lane, it will no longer be analyzed by the motion planner according
to Assumption 6 (p. 94).
Once O1 is overtaken, the motion planner suggests to go back into the right lane with
vref = 11.28 m/s at t3 = 28.62 s while maintaining dr,saf e (data area and points 5 and 6 in
Figures 5.29a and b and 5.30b and d).
Finally, the longitudinal and lateral accelerations and angle constraints for both safety
and comfort are respected as shown in Figure 5.29c and d.
148 Chapter 5. Experimentation and Results
Figure 5.30: Relative longitudinal velocity, longitudinal and lateral positions and ag analysis. [Scenario 5-1]
150 Chapter 5. Experimentation and Results
5.2.5.2 Scenario 5.2: Overtaking a slower ego front vehicle with a blocking left
vehicle
This last scenario in Figure 5.31 is tested with O1 in front ego lane and O2 in front left
lane behind O1 . The minimum speed limit vmin is set to 6 m/s as the vehicles are close to
each other. When O1 enters the safety area at t1 , the motion planner is triggered. The best
decision is rst to follow O1 . Indeed, O2 is closer to the ego vehicle. The evaluation of any
left lane change is worse than for staying trajectory. However, at t3 , O2 accelerates to pass
O1 . Thus, the left lane change is free of collision and has a better evaluation than staying
behind O1 . Consequently, the motion planner decides to overtake O1 behind O2 . The lane
change ends at t4 . The ego vehicle passes O1 and goes back into the right lane in front of O1 .
O1 is below df,saf e at t1 = t2 = 20.52 s (data points 1 in Figures 5.33b and d). The motion
planner is triggered. As vO2 ≈ vO1 and xrel,O1 > xrel,O2 , the left lane change maneuver has
a lower evaluation function Ip than staying in the ego lane. However, the ego vehicle is
still in collision with O1 (data area 2 in Figure 5.33d). Consequently, the motion planner
applies a car following behavior with O1 and decreases vref to adapt to vO1 (data area 3 in
Figures 5.32b and 5.33a).
When O2 increases its velocity at t3 = 38.34 s, the left lane changing maneuver becomes
safe and has a higher evaluation function Ip . The motion planner sends a reference left lane
change with vref = 8.246 m/s (data area and points 4 in Figures 5.32a, b and 5.33d). We
observe that χego has an overshoot of 57 cm and 4.2 m longitudinal error, which corresponds
to 660 ms time delay. This is due to the small lane change space behind O1 . One notices
that the ag collision is activated when the lane changing is completed (data point 5 in
Figure 5.33d). This is due to O2 in the safety space of the ego vehicle (data points 5 in
Figures 5.33b and c). Nonetheless, χref is still a lane following, as O2 is driving faster than
the ego vehicle (data area 5 in Figure 5.32a).
Once O1 is overtaken at t4 = 55.2 s (data point 6 in Figure 5.33b), the motion planner
suggests to go back into the right lane (data point 6 in Figure 5.33d). The right lane change
starts at t5 = 55.74 s with vref = 10.58 m/s (data area and points 7 in Figures 5.32a and b
and 5.33d), and safely ends at t6 = 65.52 s (data points 8 in Figures 5.33b and d).
Finally, the longitudinal and lateral accelerations and angle constraints for both safety
and comfort are respected as shown in Figures 5.32c and d.
5.2. Experimental scenarios and results 151
Figure 5.33: Relative longitudinal velocity, longitudinal and lateral positions and ag analysis. [Scenario 5-2]
5.2. Experimental scenarios and results 153
The convergence iterations histogram for all the scenario is plotted in Figure 5.34. We
also display the convergence of the evaluation function for scenarios 3, 4 and 5-1 in Fig-
ure 5.35. The maximum number of iteration for the Simulated Annealing was xed at 200
for real-time integration. We notice that the algorithm converges in less than 200 iterations,
which validates our algorithm tuning and our approach. With the chosen parameters, the
convergence value at 5% is around 120 iterations. However, if there is a small variables search
space, the convergence is faster as the corresponding evaluation function is probably within
a at area. This is the case at the end of the scenario 2, which leads to the smaller peak in
Figure 5.34 around 70 iterations.
Figure 5.34: Histogram of the calculation time for all the scenarios.
154 Chapter 5. Experimentation and Results
Figure 5.35: Convergence to the near-optimal value of the evaluation function Jobj for Simulated Annealing.
For each scenario, the top, middle, and bottom gures respectively show the convergence for the left lane
change, straight following, and right lane change.
5.3. Conclusion and future work 155
The experimental tests made it possible to demonstrate that the algorithm works as expected
in all the described scenarios. The unied architecture of both the decision making and tra-
jectory generation gives real-time computation for continuous optimization of the reference
trajectory. Moreover, considering real obstacles localization with communication and percep-
tion algorithms shows the robustness of our algorithm to noisy inputs, which is promising to
extend our work to real applications. Furthermore, even if the scenarios have been staged,
the algorithm was not xed on particular scenario values, and was able to adapt to the real
behavior of the obstacles. The use of a simple dynamic model for the ego vehicle motion has
also been validated with the use of a simple controller and a satisfying reference trajectory
tracking in velocity and position (small overshoot and steady error).
However, the tests highlight some aspects of the methodology that could be blocking, in
the sense of an overly conservative behavior for the implementation of the proposed algorithm
in real life. The main changes to be made to the algorithm in future work are as follows:
• Having a continuous fuzzy inference system (FIS): The at area of the evaluation func-
tion suggests that several dierent combinations of criteria will have the same risk
indicator. This is not a problem for the motion planner itself, but it can lead to a
non-eective optimization process. Testing a continuous fuzzy system could increase
the ecacy of the optimization process and provide a more intuitive decision process.
• Relaxing Assumptions 4 and 5 (p. 94) to discard a prole having a collision along the
prediction horizon: The experimental results show that the collision test becomes too
conservative with a large prediction horizon (Hp > 5s). On one hand, it may be possible
to replan after a lane change completion for the front collision test (Assumption 4). On
the other hand, the lane change back is far away from what can be observed in real life
(Assumption 5). Besides, an anticipation horizon can be considered, which would be
smaller than the prediction horizon and correspond to the minimum safe horizon for
the ego vehicle to reduce or increase its velocity in order to stay safe.
• Relaxing Assumption 6 (p. 94) not to consider the ego lane rear obstacles: In fact,
the Driver Rules recommend to perceive and analyze its environment before acting.
Therefore, the prediction of the rear obstacle has to be considered before making a
decision.
• Relaxing Assumption 10 (p. 119) freezing the lane change maneuver: A lane change
maneuver might not be safe along the navigation, and the motion planner should be
able to react to any surroundings change.
• Calculating the safety distances based on the ego and relative velocities to adapt the
safety risk to the context. For instance, the safety distances at low velocities are smaller
than the one at high velocities, similarly with a small relative velocity and a higher one.
• Considering more complex obstacles' prediction and testing the algorithm with uncer-
tainties (see section 6.2 for more details).
• Parallelizing the algorithm to decrease the calculation time below 50 ms, and thus being
able to consider more intervals.
• Dealing with trac behavior to observe the integration of our ego vehicle in trac.
The rst aspect is to test the impact of our decision in trac and to notice the limit
of good working for our algorithm. The second one is to test the interaction of many
autonomous vehicles that use our motion planning solution. In particular, to verify the
robustness of the decision-maker in case of aggressive or conservative fuzzy sets.
Chapter 6
Contents
6.1 Conclusions on our contributions . . . . . . . . . . . . . . . . . . . . . . . . . . 158
6.2 Future work regarding our contributions . . . . . . . . . . . . . . . . . . . . . 159
6.3 Future research directions in motion planning for autonomous vehicles . . . . . 161
6.3.1 Data management . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
6.3.2 Adaptive mobility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
6.3.3 Validation and evaluation . . . . . . . . . . . . . . . . . . . . . . . . . 162
This last chapter summarizes and concludes the developments presented along this manuscript
in section 6.1 and proposes future work and perspectives regarding our approach in section
6.2. Section 6.3 discusses more generally the gaps to be lled in motion planning by the next
autonomous driving car generation.
158 Chapter 6. Conclusion and Future Work
In this thesis, we proposed a motion planner algorithm for autonomous vehicles in a highway
environment, integrating a unied decision-maker and trajectory generator able to handle
multi-criteria consideration and oering a continuous optimization of a spatiotemporal evo-
lution space. This manuscript details the dierent developments resulting from our research,
from the study of the state of the art to the experimentations of our proposed strategy.
The study of the literature, presented in Chapter 2, has evidenced four problems in
research on motion planning. The rst one (i) is the use of separated or sequential architecture
of the two studied motion planning functions decision making and trajectory generation
either by rst applying decision making to decide on a maneuver and then proceeding to
motion generation to properly t this maneuver, or vice versa. Treating these two tasks
in a separate fashion may indeed lead to a nal motion that is not consistent or optimal
with respect to the evolution space. The second issue (ii) concerns the discrete handling of
the navigation area. Indeed, if the discretization is too small, the evolution space may be
truncated; conversely, if the discretization is too high, the calculation time may be too long.
The third area of improvement (iii) is that trajectories are usually obtained by separately
optimizing velocity and position. A spatial path solution is dened on a static decomposition
of the space, the path is adjusted to the dynamic obstacles by choosing a velocity/acceleration
prole, or reverse by dening the optimal velocity prole dynamically and then the static
path. These rst three aspects may harm the consistency of the motion planner and most
often reduce the solution-space. Finally, the last point to be considered (iv) concerns the
treatment of multi-criteria decision making without context. By dening the nature of a
criterion in the sense of its value and its evolution in a context, it is necessary to provide the
human passengers with an understandable reasoning on the contribution of each criterion,
in particular regarding the context. For instance, at low velocities, a small time headway is
acceptable, whereas it is not at high velocities.
In order to address the issues evidenced in Chapter 2, we rst investigated a strategy to
incorporate the nature of the criteria into multi-criteria decision making, and then developed
a unied architecture for decision-maker and trajectory generator.
In Chapter 3, we focused on the decision making function. The main objectives were to
propose a deterministic multi-criteria approach considering the uncertainties of perception,
while avoiding an overly conservative vehicle's behavior. This presents two main issues: the
heterogeneous criteria and the uncertain data. The proposed framework for decision making
addresses both by combining the fuzzy logic approach and the Belief theory. Using the fuzzy
logic to translate the specic criteria scales into a risk scale, which is common to all the
criteria, we were able to consider heterogeneous criteria without loosing their nature, and
to combine them into a human readable risk assessment, thus proposing a solution to the
literature problem (iv). Moreover, the criteria combination follows a rule-base, which allows
for a decision-reasoning that is easily understandable and traceable for humans, and also
for the interpretation of the context before making the decision. Besides, tuning the fuzzy
set now oers the possibility to adapt the risk scale to the passengers of the vehicle. As all
the criteria may not have the same importance regarding the passenger' behavior, we also
used a belief theory framework within the Fuzzy Dempster-Shafer Theory. The advantage
of this framework is its ability to assign criteria to dierent categories corresponding to
6.2. Future work regarding our contributions 159
dierent decision-states, such as safety or comfort. The categories are then considered as
diverse sources of information to obtain the nal decision. Thus, the combination of the
categories returns two major information for decision making: the probability to be in the
state and the belief in actually being in this state. Moreover, the consideration of uncertainties
and trustworthiness becomes possible by using fuzzy logic and weighting the criteria and
categories. Finally, the Fuzzy Dempster-Shafer reasoning avoids both the defuzzication
process for fuzzy logic by using belief theory and the problem of fallacy of the excluded
middle for belief theory by using fuzzy set and appropriate rule-base.
In Chapter 4, we rst studied the problem of the sequential motion planning architecture
for the literature problem (i), proposing a unied approach so that both functions of motion
generation and decision making are optimized in a combined manner. To do so, we devel-
oped an all-in-one architecture, within which the decision making algorithm from Chapter 3
corresponds to the evaluation function used in the optimization approach for trajectory gen-
eration. At the end, this architecture returns a near-optimal reference trajectory to follow.
The main advantage is thus the joint processing of decision making and trajectory generation
resulting in a consistent motion planner. Another advantage of this architecture is its ability
to work in a continuous trajectory solution space, which eludes the issue of trajectory dis-
cretization (literature problem (ii)). Secondly, we presented a deterministic and continuous
characterization of the evolution space, considering it as a set of intervals. We introduced the
Non-Collision Nominal Intervals (NCNI), which consist in a collision-free space delimited by
road lanes and spaces between obstacles. The NCNI contain all the spatiotemporal motions
allowed for the ego vehicle in the evolution space along the prediction horizon. The high-level
maneuvers are carried out to move from one interval to another. The position and velocity
prediction of obstacles and the dynamically feasible motions of the ego vehicle characterize
the spatiotemporal intervals. By optimizing a trajectory inside these intervals, we also avoid
the decoupling issue in motion generation (literature problem (iii)). Lastly, in order to uti-
lize the implicit evaluation function of the decision-maker developed in Chapter 3, we have
opted for a metaheuristics approach with a Simulated Annealing algorithm, that allow us to
nd a near-optimal solution in a nite time. Furthermore, it oers the possibility to adjust
the calculation time according to the encountered situation, while guaranteeing to return an
acceptable solution.
In Chapter 5, we successfully demonstrated the eectiveness of our approach implemented
in an autonomous prototype vehicle. The motion planning algorithm has been integrated with
the controller and perception modules. The tested scenarios highlighted the real-time and
robustness performances of our work with two manually-driven obstacles.
The research solutions presented in this thesis have shown encouraging results. The following
future work will enrich the reection carried out to eectively bring these ideas into a fully
autonomous vehicle application.
Extending the highway environment Even if we tested some merging use cases with
the 2-obstacle scenarios, future work will consist in addressing highway ramp-entrances and
160 Chapter 6. Conclusion and Future Work
exits. This environment requires a socially cooperative driving: either the ego vehicle enters
the highway and merges into the current trac, or the ego vehicle is in trac and adjusts
its own behavior to facilitate the merging of the incoming vehicles. Besides, considering this
road context will also change the nominal conditions of driving, otherwise the ego vehicle will
be confronted with a blockage action. For instance, the relative safe distances or ego velocity
for merging can be reduced. Both situations of merging refer to the ability to plan adequate
courtesy behaviors, which may lead to the creation of a new courtesy category in our work.
Dealing with uncertainty From a theoretical standpoint, the current development of our
Fuzzy Dempster-Shafer approach can accommodate trustworthiness. Additional research is
needed to build a Fuzzy Dempster-Shafer reasoning which also considers uncertainty as input
of the fuzzy logic. Moreover, considering several obstacles' predictions is necessary, as the
predicted behaviors are only accurate for a short time horizon. One of the direction could be
the addition of weights on the criteria according to the probability and dangerousness of the
predicted obstacle's behavior, as it has been done with the data trustworthiness.
Improving the fuzzy relationships The denition of the fuzzy sets and the associated
rule-base is of major importance in our decision making process. We noticed the two fol-
lowing issues: (i) the consistency of the rule-base, and (ii) the constant warning areas in the
evaluation function. The rst improvement requires further research to make the rule-base
coherent and consistent in order to prevent non-intuitive results and missing association of
the fuzzy sets. The second improvement consists in studying the coherence and inuence of
the fuzzy set membership functions on the evaluation function. For example, a trapezoidal
membership function will return a at evaluation function, which results in an area with local
minima. The non-uniqueness of a solution is acceptable for motion planning in the case of
autonomous vehicles, but it inuences the optimization process, which can become inecient.
Dealing with ethics behavior In order to adapt the behavior of the fuzzy inference
systems to a human-in-the-loop behavior, learning techniques can also be considered for
dierent driver prole datasets to form the fuzzy sets. This improvement is useful for at least
two reasons: (i) reproducing a human behavior leads to a better acceptation in case of a mix
human- and robot-driver, (ii) learning the fuzzy sets will consist in adapting the risk levels
to the context, and thus integrating somehow a cognitive reasoning.
6.3. Future research directions in motion planning for autonomous vehicles 161
vehicles
This section spotlights some critical and forthcoming issues of research in the wider scope of
motion planning for autonomous vehicles in general.
Safety The safety considerations impose the need for intelligent enough algorithms to dis-
tinguish a permanent or temporary danger, such as the front vehicle suddenly stopping, or
another one cutting in.
From a motion planning perspective, the ego vehicle is safe if it is not in conict with
its environment. However, this safety space highly depends on the safety capacity of robot
driving, which can be dierent from a human one. As stated in [Altho & Lösch 2016],
longitudinal distance and speed controls are generally faster in autonomous driving, and so
longitudinal safety space could be smaller. In contrast, lateral distance control is generally
more stable with human drivers; thus, lateral safety space must be larger. Furthermore, this
safety space has to be sucient against the unexpected challenges of the environment (e.g.
unexpected braking from leading vehicle, unforeseen road's dead end). The formal methods
are thus used to mathematically prove safety properties [Pek et al. 2017a, Pek et al. 2017b]
in any conditions including the most critical ones.
162 Chapter 6. Conclusion and Future Work
Moreover, at the moment, automakers are developing automated vehicles with human
takeover and human-robot mixed driving. This implies a possibly shared decision making
between automated vehicles and drivers. The rst problem is retaining a stable and predictive
decision in case the driver takes the control back [Merat et al. 2014]. The second one is
that the decisions of the human driver and the machine may contradict each other [Da Lio
et al. 2015]. The third safety consideration is of human drivers' behaviors towards robot
drivers in the sense of respect, acceptability, and predictability [Verberne et al. 2012], or
comfort [Elbanhawi et al. 2015].
Route context The route planner is well studied with a trip scheduler. However, there
are very few examples in the literature of integrating these data for decision and generation
functions. In fact, route planning constrains the evolution space, especially on highways;
e.g. a right-hand highway exit involves the ego vehicle to navigate to the rightmost lane.
These requirements imply respecting constraints in actions, distances, and time to follow the
instructions of the navigator.
Transition stability The variety of the encountered situations often requires the use of
dierent combined motion planning algorithms. In this case, we need to know whether
the resulting architecture is stable, reliable and robust, or whether it is necessary to add
a high-level supervisor to validate the coherence of observations, decisions and actions. In
[Broggi et al. 2013b], the authors propose a behavior paradigm analogous to the FSM decision
to implement an architecture able to switch maneuver modes from manual to autonomous
driving.
Evaluation The validation of motion planning algorithms mostly relies on extensive simula-
tion and experimental testing results, where a human driver evaluates the action according to
6.3. Future research directions in motion planning for autonomous vehicles 163
a personal reference index, such as safety, smoothness, or operation time [Tehrani et al. 2015].
There is a lack of formal analysis and evaluation methods to hierarchically classify algorithms'
performances. This would change thanks to the open source scenario library developed by
[Altho et al. 2017] to propose a benchmark for motion planning algorithms, or with the Key
Performance Indicators (KPIs) proposed by [Quilbeuf et al. 2018].
Ethics As soon as robotic systems interact with human beings, questions of ethics in
decision-makers arise. Reference [Goodall 2014] proposes an ethical vehicle deployment strat-
egy in a hybrid rational and AI approach, especially for critical safety situations. The authors
in [Thornton et al. 2017] show how to incorporate ethics into decisions based on ethical frame-
works, such as deontology as rules constraining the actions and navigation of the system,
consequentialism as cost-based construction for the objective function, or as morality to
determine the dierent costs of the system's behaviors.
Algorithm relaxation The strict respect and relaxation of driving rules are key points to
ensure safety in any case for an autonomous vehicle. Indeed, even hard constraints might be-
come dangerous in critical cases [Broggi et al. 2014]. Reference [Morignot & Nashashibi 2013]
introduces an ontology for trac rules in unusual situations with knowledge-based inference
engines to avoid blockage situations and to preserve safety with lane crossing or excess speeds.
Appendix A
The t-parameterized equation of the sigmoid function, with x(t) the abscissa value and y(t)
the ordinate value, is given in Equation A.1:
b
y(t) = , (A.1)
1+ e−λ(x(t)−c)
where b is the shift value, λ is the sigmoid parameter, and c an abscissa delay of the inection
point.
In our problem, b, λ and c are constant values, and x 7→ x(t) is at least a twice dieren-
tiable function over R. Consequently, y is obviously at least a twice dierentiable function of
the time variable t.
The rst-order time derivative of x is written dx
dt , vx .
2
The second-order time derivative of x is written ddt2x = dv
dt , ax .
x
The rst-order and second-order sigmoid derivatives with respect to x are respectively
expressed in Equations A.2 and A.3:
dy −b(−λ)e−λ(x−c) y λ
= = λy(1 − ) = y(b − y). (A.2)
dx (1 + e−λ(x−c) )2 b b
!
λ
d2 y d b y(b − y)
=
dx2 dx dx
λ dy dy
= (b − y) − y
b dx dx (A.3)
λ λ 2 λ 2
= y(b − y) − y (b − y)
b b b
λ 2
= 2 y(b − y)(b − 2y).
b
The rst-order time derivative of the sigmoid function represented in Equation A.1 is
calculated using the chain rule in Equation A.4:
dy dy dx
=
dt dx dt
dy
= vx (A.4)
dx
λ
= y(b − y)vx .
b
166 Appendix A. Lateral acceleration limits calculation
The second-order time derivative of the sigmoid function based on Equation A.4 is calcu-
lated using the chain rule in Equation A.5:
d2 y
d dy
=
dt2 dt dt
d dy dx
=
dt dx dt
(A.5)
d dy dx dy d2 x
= +
dt dx dt dx dt2
2
d y dy
= vx + ax .
dtdx dx
d2 y
Using the Schwarz's Theorem, we express dtdx in Equation A.6 :
d2 y
d dy
=
dtdx dt dx
d dy
=
dx dt
d dy dx (A.6)
=
dx dx dt
d2 y dx dy d2 x
= +
dx2 dt dx dxdt
d2 y dy d2 x
= vx + .
dx2 dx dxdt
d2 x
Using the Schwarz's Theorem, we express dxdt in Equation A.7 :
d2 x
d dx d dx d
= = = (1) = 0. (A.7)
dxdt dx dt dt dx dt
d2 y d2 y dy
2
= vx + ax
dt dtdx dx
d2 y dy d2 x
dy
= 2
vx + vx + ax
dx dx dxdt dx
(A.8)
d2 y 2 dy
= v + ax
dx2 x dx
λ2 λ
= 2 y(b − y)(b − 2y)vx2 + y(b − y)ax .
b b
d2 y ax λ vx2 λ2
= y(b − y) + y(b − y)(b − 2y) . (A.9)
dt2 b b2
A.2. Resolution of the second-order time derivative constraints 167
We want to nd the bounds values of λ in order to respect the maximum lateral acceleration
condition in Equation 4.9 (p. 102), as recalled in Equation A.10:
ax λ v 2 λ2
y(b − y) + x 2 y(b − y)(b − 2y) ≤ ay,max (A.10)
b b
We write:
ax λ v 2 λ2
f (y) = y(b − y) + x 2 y(b − y)(b − 2y),
b b
ax λ
f1 (y) = y(b − y), (A.11)
b
v 2 λ2
f2 (y) = x 2 y(b − y)(b − 2y),
b
with y ∈ [0, b], by denition of b in Equation A.1.
Equation A.10 is then equivalently reformulated as Equation A.12:
Then, the two terms f1 (y) and f2 (y) of Equation A.12 are bounded separately.
y 0 b
2 b
0
f1 (y) + 0
f1 (y) 0 % f1 (y = 2b ) & 0
√ √
y 0 y1 = 3− 3
6 b y2 = 3+ 3
6 b b
0
f2 (y) + 0 0 +
f2 (y) 0 % f2 (y1 ) & f2 (y2 ) % 0
√ √ √
The maximum value of f2 is obtained for y = 6 b:
3− 3
f2 (y = 3− 3
6 b) = 18 vx bλ .
3 2 2
168 Appendix A. Lateral acceleration limits calculation
g(λ) ≤ 0 (A.18)
λ 0 λ2 +∞
g(λ) −ay,max 0 +
−ax b
√ √
+ ∆ a2x b2
with ay,max ≥ 0, λ2 = 4√
3 2
, and ∆ = 16 + 2 3 2
9 vx bay,max > 0.
9 x
v b
Finally, the maximum value for λ that fullls the condition of Equation 4.9 is given in
Equation A.19:
√ √
−ax b/4 + ∆ a2x b2 2 3 2
λmax = √ , ∆= + v bay,max . (A.19)
3 2 16 9 x
9 vx b
Appendix B
Résumés en français
Le chapitre d'état de l'art détaille les considérations spéciques à la recherche d'un plan-
icateur de mouvement dans un environnement autoroutier. La terminologie propre à la
planication de mouvement, les diérents niveaux de planication ainsi que les spécicités et
contraintes inhérentes au contexte autoroutier sont présentés.
Une revue de littérature est également proposée dans ce chapitre. Les algorithmes des
fonctions de prise de décision, génération de mouvement et déformation sont analysés vis-à-
vis de leur application à un environnement autoroutier. Trois axes sont étudiés : l'utilisation
de l'algorithme, la nature des sorties qu'il renvoie et son orientation mathématique. Les
algorithmes existants sont alors rassemblés en 6 familles. A l'issue de l'état de l'art, un
tableau de comparaison est proposé de manière à mettre en évidence l'algorithme le mieux
adapté au problème déni.
Pour nir, les conclusions de ce travail bibliographique font ressortir le besoin d'une
méthode de planication de mouvements explicable et déterministe (réplicabilité), intuitive
pour l'humain (acceptabilité), cohérente avec l'environnement d'application (adaptabilité et
faisabilité en temps réel), et contextuelle. Ce travail de recherche se positionne donc sur une
170 Appendix B. Résumés en français
Ce chapitre traite du problème de prise de décision pour les véhicules autonomes sur au-
toroute. L'objectif est de sélectionner une trajectoire de référence sur un horizon prédictif.
La trajectoire sélectionnée est obtenue à partir d'une optimisation multi-critères. Ces critères
peuvent par exemple correspondre à la sécurité avec les obstacles, au respect des règles de
conduite, au confort des passagers, à la consommation énergétique, etc.
Ce travail introduit un nouveau cadre de décision proposant une évaluation de risque.
L'architecture est présentée en trois niveaux. A partir des mesures capteurs et de la pré-
diction de l'environnement sur un horizon de prédiction, pour chaque trajectoire à évaluer,
on obtient les valeurs des critères. Le premier niveau utilise une approche par logique oue
pour agréger ces critères de décision hétérogènes et incertains. En sortie de ce niveau, chaque
trajectoire candidate se voit attribuer un niveau d'avertissement local. La théorie des croy-
ances est utilisée au deuxième niveau pour combiner les niveaux d'avertissement locaux en
les interprétant comme diérentes sources de risque pour la prise de décision. A l'issue de ce
niveau, chaque trajectoire candidate est associé à un intervalle de risque. Enn, le troisième
niveau consiste à évaluer l'intervalle de risque an de dénir la trajectoire la moins risquée
comme trajectoire de référence. Des résultats en simulation sont nalement exposés dans ce
chapitre pour illustrer le raisonnement appliqué.
Le chapitre 5 montre les expérimentations implémentées sur véhicule réel. Celles-ci se sont
déroulées sur la piste de vitesse du site de Versailles-Satory. Le véhicule test est instru-
menté par une localisation centrale inertielle et GPS RTK, un module de communication
Ultra Haute Fréquence (UHF) et une ceinture de 5 LIDARs pour la détection d'obstacles.
La cartographie est prédénie. Deux véhicules obstacles complètent les moyens expérimen-
taux. Le premier, disposant également d'une centrale inertielle et GPS RTK, communique
sa position par communication UHF. Le second obstacle est perçu par les LIDARs. Les in-
formations de localisation et de perception des obstacles sont envoyées au planicateur qui
décide de la trajectoire de référence à suivre par le contrôleur. Ce dernier se compose d'un
contrôleur proportionnel-intégral sur la consigne de vitesse, et d'un contrôleur proportionnel
sur la consigne de position, en considérant l'angle de déviation.
Cinq scénarios ont été eectués pour ces expérimentations. Pour chacun des scénarios,
le suivi de la trajectoire de référence, les limites de la dynamique du véhicule et les critères
de sécurité de distance aux obstacles sont étudiés. L'analyse des résultats montrent la re-
productibilité de l'algorithme, une trajectoire de référence réalisable et correctement suivie
(faible dépassement et retard), ainsi que le respect des limites dynamiques du véhicules et
des distances de sécurité aux obstacles. De plus, l'algorithme est temps réel avec un temps
de calcul de 60 ms avec le paramétrage proposé.
Ce dernier chapitre fait état des conclusions du travail de thèse mené ainsi que des perspectives
à développer. Des conclusions et perspectives plus générales sur le problème de planication
sont également proposées.
Bibliography
[Aarts & Van Laarhoven 1985] E. H. Aarts et P. J. Van Laarhoven. Statistical cooling: A
general approach to combinatorial optimization problems. Philips J. Res., vol. 40,
no. 4, pages 193226, 1985. (p. 99)
[Altché & De La Fortelle 2017] F. Altché et A. De La Fortelle. Partitioning of the free space-
time for on-road navigation of autonomous ground vehicles. In IEEE Annual Conf.
on Decision and Control (CDC), 2017. (p. 26, 46, 85, 86 and 89)
[Altho & Dolan 2014] M. Altho et J. M. Dolan. Online verication of automated road
vehicles using reachability analysis. IEEE Trans. on Robotics, vol. 30, no. 4, pages
903918, 2014. (p. 161)
[Altho & Lösch 2016] M. Altho et R. Lösch. Can automated road vehicles harmonize with
trac ow while guaranteeing a safe distance? In IEEE Int. Conf. on Intelligent
Transportation Systems (ITSC), 2016. (p. 161)
[Altho & Mergel 2011] M. Altho et A. Mergel. Comparison of Markov chain abstraction
and Monte Carlo simulation for the safety assessment of autonomous cars. IEEE
Trans. on Intelligent Transportation Systems, vol. 12, no. 4, pages 12371247, 2011.
(p. 161)
[Baass 1984] K. Baass. Use of clothoid templates in highway design. In Transportation Forum,
volume 1, pages 4752, 1984. (p. 31)
[Balal et al. 2016] E. Balal, R. L. Cheu et T. Sarkodie-Gyan. A binary decision model for
discretionary lane changing move based on fuzzy inference system.
Transportation
Research Part C: Emerging Technologies, vol. 67, pages 4761, 2016. (p. 38, 47
and 51)
[Bender et al. 2015] P. Bender, Ö. S. Tas, J. Ziegler et C. Stiller. The combinatorial aspect of
motion planning: Maneuver variants in structured environments. In IEEE Intelligent
Vehicles Symposium (IV), 2015. (p. 26 and 46)
[Borenstein & Koren 1991] J. Borenstein et Y. Koren. The vector eld histogram-fast obstacle
avoidance for mobile robots. IEEE Trans. on Robotics and Automation, vol. 7, no. 3,
pages 278288, 1991. (p. 25 and 30)
[Bounini et al. 2017] F. Bounini, D. Gingras, H. Pollart et D. Gruyer. Modied articial po-
tential eld method for online path planning applications. In IEEE Intelligent Vehicles
Symposium (IV), 2017. (p. 28, 29, 30 and 46)
[Chebly et al.2015] A. Chebly, G. Tagne, R. Talj et A. Charara. Local trajectory planning and
tracking of autonomous vehicles, using clothoid tentacles method. In IEEE Intelligent
Vehicles Symposium (IV), 2015. (p. 32 and 46)
[Chebly et al. 2017] A. Chebly, R. Talj et A. Charara. Maneuver Planning for Autonomous
Vehicles, with Clothoid Tentacles for Local Trajectory Planning. In IEEE Int. Conf.
on Intelligent Transportation (ITSC), 2017. (p. 85)
[Chen et al. 2013] J. Chen, P. Zhao, T. Mei et H. Liang. Lane change path planning based
on piecewise Bezier curve for autonomous vehicle. In IEEE Int. Conf. on Vehicular
Electronics and Safety (ICVES), 2013. (p. 31, 33 and 46)
[Chen et al. 2014] J. Chen, P. Zhao, H. Liang et T. Mei. A Multiple Attribute-based Decision
Making model for autonomous vehicle in urban environment. In IEEE Intelligent
Vehicles Symposium (IV), 2014. (p. 51)
176 Bibliography
[Chen et al.2015a] C. Chen, A. Gaschler, M. Rickert et A. Knoll. Task planning for highly
automated driving. In IEEE Intelligent Vehicles Symposium (IV), 2015. (p. 40 and 47)
[Chen et al. 2015b] C. Chen, A. Se, A. Kornhauser et J. Xiao. Deepdriving: Learning aor-
dance for direct perception in autonomous driving.In IEEE Int. Conf. on Computer
Vision, 2015. (p. 38 and 47)
[Choi 2014] J. Choi. Kinodynamic motion planning for autonomous vehicles. International
Journal of Advanced Robotic Systems, vol. 11, no. 6, page 90, 2014. (p. 25, 26, 32,
34, 46 and 47)
[Connell & La 2017] D. Connell et H. M. La. Dynamic path planning and replanning for
mobile robots using RRT. In IEEE Int. Conf. on Systems, Man, and Cybernetics
(SMC), 2017. (p. 29)
[Da Lio et al. 2015] M. Da Lio, F. Biral, E. Bertolazzi, M. Galvani, P. Bosetti, D. Windridge,
A. Saroldi et F. Tango. Articial co-drivers as a universal enabling technology for
future intelligent vehicles and transportation systems. IEEE Trans. on Intelligent
Transportation Systems, vol. 16, no. 1, pages 244263, 2015. (p. 162)
[de Lima & Pereira 2013] D. A. de Lima et G. A. S. Pereira. Navigation of an autonomous car
using vector elds and the dynamic window approach. Journal of Control, Automation
and Electrical Systems, vol. 24, no. 1-2, pages 106116, 2013. (p. 26, 30 and 46)
[Derbel & Landry 2018] O. Derbel et R. J. Landry. Belief and fuzzy theories for driving
behavior assessment in case of accident scenarios. Int. J. Automotive Technology,
vol. 19, no. 1, pages 167177, 2018. (p. 52)
[Dijkstra 1959] E. Dijkstra. A note on two problems in connexion with graphs. Numerische
Mathematik, vol. 1, no. 1, pages 269271, 1959. (p. 28)
[Dubins 1957] L. E. Dubins. On curves of minimal length with a constraint on average cur-
vature, and with prescribed initial and terminal positions and tangents. American
Journal of mathematics, vol. 79, no. 3, pages 497516, 1957. (p. 31)
[Eglese 1990] R. Eglese. Simulated annealing: a tool for operational research. European
Journal of Operational Research, vol. 46, no. 3, pages 271281, 1990. (p. 99)
[EPC ] Directive 2010/40/EU of the European Parliament and of the Council of 7 July
2010. https://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:
207:0001:0013:EN:PDF accessed 2019-03-19. (p. 2)
[Fiorini & Shiller 1998] P. Fiorini et Z. Shiller. Motion planning in dynamic environments
using velocity obstacles. The International Journal of Robotics Research, vol. 17, no. 7,
pages 760772, 1998. (p. 26 and 85)
[Fox et al.1997] D. Fox, W. Burgard et S. Thrun. The dynamic window approach to collision
avoidance. IEEE Robotics & Automation Magazine, vol. 4, no. 1, pages 2333, 1997.
(p. 26)
[Fraichard & Asama 2004] T. Fraichard et H. Asama. Inevitable collision statesA step
towards safer robots? Advanced Robotics, vol. 18, no. 10, pages 10011024, 2004.
(p. 85)
178 Bibliography
[Furda & Vlacic 2010] A. Furda et L. Vlacic. Multiple criteria-based real-time decision mak-
ing by autonomous city vehicles. IFAC Proceedings Volumes, vol. 43, no. 16, pages
97102, 2010. (p. 51, 53, 54 and 74)
[Gehrig & Stein 2007] S. K. Gehrig et F. J. Stein. Collision avoidance for vehicle-following
systems. IEEE Trans. on Intelligent Transportation Systems, vol. 8, no. 2, pages
233244, 2007. (p. 30 and 46)
[Gendreau & Potvin 2010] M. Gendreau et J.-Y. Potvin. Handbook of metaheuristics, vol-
ume 2. Springer, 2010. (p. 85)
[Geng et al. 2016] X. Geng, H. Liang, H. Xu, B. Yu et M. Zhu. Human-driver speed prole
modeling for autonomous vehicle's velocity strategy on curvy paths. In IEEE Intelligent
Vehicles Symposium (IV), 2016. (p. 38 and 47)
[Goodall 2014] N. J. Goodall. Ethical decision making during automated vehicle crashes.
Transportation Research Record, vol. 2424, no. 1, pages 5865, 2014. (p. 163)
[Green 1925] H. Green. Radio-Controlled Automobile. Radio News, pages 592656, Novembre
1925. (p. 2)
Bibliography 179
[Gu et al.2013] T. Gu, J. Snider, J. Dolan et J.-w. Lee. Focused trajectory planning for
autonomous on-road driving. In IEEE Intelligent Vehicles Symposium (IV), 2013.
(p. 17, 27, 32, 33, 46 and 47)
[Gu et al. 2016a] T. Gu, J. M. Dolan et J.-W. Lee. On-road trajectory planning for general
autonomous driving with enhanced tunability. In Intelligent Autonomous Systems,
volume 13, pages 247261. Springer, 2016. (p. 34 and 47)
[Gu et al.2016b] T. Gu, J. M. Dolan et J.-W. Lee. Runtime-bounded tunable motion planning
for autonomous driving. In IEEE Intelligent Vehicles Symposium (IV), 2016. (p. 26,
27, 28 and 46)
[Hajek 1988] B. Hajek. Cooling schedules for optimal annealing. Mathematics of Operations
Research, vol. 13, no. 2, pages 311329, 1988. (p. 100)
[Hart et al.1968] P. E. Hart, N. J. Nilsson et B. Raphael. A formal basis for the heuristic de-
termination of minimum cost paths. IEEE Trans. on Systems Science and Cybernetics,
vol. 4, no. 2, pages 100107, 1968. (p. 28 and 33)
[Hesse et al. 2010] T. Hesse, D. Hess et T. Sattel. Motion planning for passenger vehicles-
force eld trajectory optimization for automated driving. In Int. Conf. on Robotics
and Applications, 2010. (p. 24, 28, 30 and 46)
[Hsu & Sun 2004] D. Hsu et Z. Sun. Adaptively combining multiple sampling strategies for
probabilistic roadmap planning. In IEEE. Conf. on Robotics, Automation and Mecha-
tronics, 2004. (p. 24)
[Hudecek & Eckstein 2014] J. Hudecek et L. Eckstein. Improving and simplifying the gener-
ation of reference trajectories by usage of road-aligned coordinate systems. In IEEE
Intelligent Vehicles Symposium (IV), 2014. (p. 32 and 89)
[Huy et al. 2013] Q. Huy, S. Mita, H. T. N. Nejad et L. Han. Dynamic and safe path plan-
ning based on support vector machine among multi moving obstacles for autonomous
vehicles. IEICE Trans. on Information and Systems, vol. 96, no. 2, pages 314328,
2013. (p. 33, 37, 46 and 47)
[Hwan Jeon et al. 2013] J. Hwan Jeon, R. V. Cowlagi, S. C. Peters, S. Karaman, E. Frazzoli,
P. Tsiotras et K. Iagnemma. Optimal motion planning with the half-car dynamical
model for autonomous high-speed driving. In IEEE American Control Conference
(ACC), 2013. (p. 29 and 46)
[ICB 2015] Learn to drive smart: Your guide to driving safely. Insurance Corporation of
British Columbia (ICBC), North Vancouver, B.C., Canada, 2015. (p. 76)
[Jang 1993] J.-S. Jang. ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans.
Systems, Man, and Cybernetics, vol. 23, no. 3, pages 665685, 1993. (p. 58)
[Johnson & Hauser 2013] J. Johnson et K. Hauser. Optimal longitudinal control planning
with moving obstacles. In IEEE Intelligent Vehicles Symposium (IV), 2013. (p. 26
and 46)
[Kant & Zucker 1986] K. Kant et S. W. Zucker. Toward ecient trajectory planning: The
path-velocity decomposition. The international journal of robotics research, vol. 5,
no. 3, pages 7289, 1986. (p. 26)
[Katrakazas et al. 2015] C. Katrakazas, M. Quddus, W.-H. Chen et L. Deka. Real-time mo-
tion planning methods for autonomous on-road driving: State-of-the-art and future
research directions.
Transportation Research Part C: Emerging Technologies, vol. 60,
pages 416442, 2015. (p. 7)
Bibliography 181
[Khatib 1986] O. Khatib. Real-time obstacle avoidance for manipulators and mobile robots.
In Autonomous Robot Vehicles, pages 396404. Springer, 1986. (p. 29 and 30)
[Ko & Simmons 1998] N. Y. Ko et R. G. Simmons. The lane-curvature method for local obsta-
cle avoidance. In IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, volume 3,
pages 16151621, 1998. (p. 27)
[Kuan et al. 1984] D. T. Kuan, R. A. Brooks, J. C. Zamiska et M. Das. Automatic path plan-
ning for a mobile robot using a mixed representation of free space.
In IEEE Computer
Society Conference on Articial Intelligence Applications, 1984. (p. 26)
[Kuan et al. 1985] D. Kuan, J. Zamiska et R. Brooks. Natural decomposition of free space for
path planning. In IEEE Int. Conf. on Robotics and Automation, 1985. (p. 24 and 26)
[Latombe 1991] J.-C. Latombe. Robot motion planning. Springer Science & Business Media,
1991. (p. 7)
[LaValle 1998] S. M. LaValle. Rapidly-exploring random trees: A new tool for path planning.
1998. (p. 29)
[LaValle 2006] S. M. LaValle. Planning algorithms. Cambridge university press, 2006. (p. 7
and 27)
[Lee et al.2014] U. Lee, S. Yoon, H. Shim, P. Vasseur et C. Demonceaux. Local path planning
in a complex environment for self-driving car.
In IEEE Int. Conf. on Cyber Technology
in Automation, Control, and Intelligent Systems (CYBER), 2014. (p. 25, 26 and 46)
182 Bibliography
[Li et al. 2014] X. Li, Z. Sun, Q. Zhu et D. Liu. A unied approach to local trajectory planning
and control for autonomous driving along a reference path.In IEEE Int. Conf. on
Mechatronics and Automation (ICMA), 2014. (p. 24, 33, 46 and 47)
[Li et al.2017] N. Li, H. Chen, I. Kolmanovsky et A. Girard. An Explicit Decision Tree Ap-
proach for Automated Driving. In ASME Dynamic Systems and Control Conference,
2017. (p. 35 and 47)
[Liu et al. 2012] Y. Liu, W.-L. Huang, T. Sun et F. Zhu. AGV decision making subsystem
based on modied Dempster-Shafer evidence theory and fuzzy logic.In IEEE Int. Conf.
on Vehicular Electronics and Safety (ICVES), 2012. (p. 52, 53 and 66)
[Liu et al.2017a] C. Liu, S. Lee, S. Varnhagen et H. E. Tseng. Path planning for autonomous
vehicles using model predictive control. In IEEE Intelligent Vehicles Symposium (IV),
2017. (p. 24, 30, 33, 46 and 47)
[Liu et al. 2017b] C. Liu, W. Zhan et M. Tomizuka. Speed prole planning in dynamic en-
vironments via temporal optimization. In IEEE Intelligent Vehicles Symposium (IV),
2017. (p. 85)
[McKnight & Adams 1970a] A. J. McKnight et B. B. Adams. Driver Education Task Anal-
ysis. Volume II: Task Analysis Methods. 1970. (p. 40 and 47)
[McKnight & Adams 1970b] A. McKnight et B. Adams. Driver education task analysis. Vol-
ume I: Task descriptions. 1970. (p. 40 and 47)
[McKnight & Hundt 1971a] A. J. McKnight et A. G. Hundt. Driver Education Task Analysis.
Volume IV: The Development of Instructional Objectives. 1971. (p. 40 and 47)
[McKnight & Hundt 1971b] A. McKnight et A. Hundt. Driver education task analysis. Vol-
ume III: Instructional Objectives. 1971. (p. 40 and 47)
[Michon 1985] J. A. Michon. A critical view of driver behavior models: what do we know,
what should we do? In Human behavior and trac safety, pages 485524. Springer,
1985. (p. 16 and 39)
[Miller et al. 2018] C. Miller, C. Pek et M. Altho. Ecient mixed-integer programming for
longitudinal and lateral motion planning of autonomous vehicles. In IEEE Intelligent
Vehicles Symposium (IV), 2018. (p. 33 and 47)
[Mitsch et al. 2013] S. Mitsch, K. Ghorbal et A. Platzer. On provably safe obstacle avoidance
for autonomous robotic ground vehicles. In Robotics: Science and Systems (RSS),
2013. (p. 26 and 46)
[Moras et al. 2011] J. Moras, V. Cherfaoui et P. Bonnifait. Credibilist occupancy grids for
vehicle perception in dynamic environments. In IEEE Int. Conf. on Robotics and
Automation (ICRA), 2011. (p. 25 and 46)
[Naumann & Stiller 2017] M. Naumann et C. Stiller. Towards cooperative motion planning
for automated vehicles in mixed trac. IEEE Int. Conf. on Intelligent Robots and
Systems (IROS) Workshops, 2017. (p. 39 and 47)
[Noh & An 2018] S. Noh et K. An. Decision-Making Framework for Automated Driving in
Highway Environments. IEEE Trans. Intelligent Transportation Systems, vol. 19,
no. 1, pages 5871, 2018. (p. 51 and 64)
[O' Brien 2017] M. O' Brien, K. Neubauer, J. Van Brummelen et H. Najjaran. Analysis
et al.
of Driving Data for Autonomous Vehicle Applications. In IEEE Int. Conf. on Systems,
Man, and Cybernetics (SMC), 2017. (p. 76)
[Quinlan & Khatib 1993] S. Quinlan et O. Khatib. Elastic bands: Connecting path planning
and control. In IEEE Int. Conf. on Robotics and Automation (ICRA), 1993. (p. 30)
[Rehder et al. 2017] E. Rehder, J. Quehl et C. Stiller. Driving Like a Human: Imitation
Learning for Path Planning using Convolutional Neural Networks.In IEEE Int. Conf.
on Intelligent Robots and Systems (IROS) Workshops, 2017. (p. 38 and 47)
[Reichardt & Shick 1994] D. Reichardt et J. Shick. Collision avoidance in dynamic environ-
ments applied to autonomous vehicle guidance on the motorway. In IEEE Intelligent
Vehicles Symposium (IV), 1994. (p. 29 and 46)
[Shafer 1976] G. Shafer. A mathematical theory of evidence, volume 42. Princeton university
press, 1976. (p. 52 and 59)
[Song et al. 2013] X. Song, H. Cao et J. Huang. Vehicle path planning in various driving situ-
ations based on the elastic band theory for highway collision avoidance.
Proceedings of
the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering,
vol. 227, no. 12, pages 17061722, 2013. (p. 25, 30 and 46)
[Söntges & Altho 2018] S. Söntges et M. Altho. Computing the drivable area of au-
tonomous road vehicles in dynamic road scenes. IEEE Trans. on Intelligent Trans-
portation Systems, vol. 19, no. 6, pages 18551866, 2018. (p. 85 and 86)
[Stentz 1994] A. Stentz. Optimal and ecient path planning for partially-known environ-
ments. In IEEE Int. Conf. on Robotics and Automation (ICRA), 1994. (p. 29)
[Tamke 2011] A. Tamke, T. Dang et G. Breuel. A exible method for criticality as-
et al.
sessment in driver assistance systems.
In IEEE Intelligent Vehicles Symposium (IV),
2011. (p. 39 and 47)
[Taylor 1964] D. Taylor. Drivers' galvanic skin response and the risk of accident. Ergonomics,
vol. 7, no. 4, pages 439451, 1964. (p. 39 and 47)
[Ulbrich & Maurer 2015] S. Ulbrich et M. Maurer. Towards tactical lane change behavior
planning for automated vehicles. In IEEE Int. Conf. on Intelligent Transportation
Systems (ITSC), 2015. (p. 36, 47 and 161)
[Ulmer 1994] B. Ulmer. Vita ii-active collision avoidance in real trac. In IEEE Intelligent
Vehicles Symposium (IV), 1994. (p. 2)
[Wagner et al. 2018] S. Wagner, K. Groh, T. Kühbeck, M. Dörfel et A. Knol. Using Time-
to-React based on Naturalistic Trac Object Behavior for Scenario-Based Risk As-
sessment of Automated Driving. In IEEE Intelligent Vehicles Symposium (IV), 2018.
(p. 39 and 47)
[Wang & Ayalew 2016] Q. Wang et B. Ayalew. Obstacle ltering algorithm for control of an
autonomous road vehicle in public highway trac. In ASME Dynamic Systems and
Control Conference, 2016. (p. 25, 33, 39 and 47)
[Wang et al. 2017] W. Wang, C. Liu et D. Zhao. How much data are enough? A statistical
approach with case study on longitudinal driving behavior. IEEE Trans. on Intelligent
Vehicles, vol. 2, no. 2, pages 8598, 2017. (p. 161)
[Wardzi«ski 2006] A. Wardzi«ski. The role of situation awareness in assuring safety of au-
tonomous vehicles. Computer Safety, Reliability, and Security, pages 205218, 2006.
(p. 51, 52 and 64)
[Wilde 1982] G. J. Wilde. The theory of risk homeostasis: implications for safety and health.
Risk analysis, vol. 2, no. 4, pages 209225, 1982. (p. 39 and 47)
[Wolf & Burdick 2008] M. T. Wolf et J. W. Burdick. Articial potential functions for highway
driving with collision avoidance. In IEEE Int. Conf. on Robotics and Automation
(ICRA), 2008. (p. 29, 30 and 46)
190 Bibliography
[Wolpert & Macready 1997] D. H. Wolpert et W. G. Macready. No free lunch theorems for
optimization. IEEE Trans. on Evolutionary Computation, vol. 1, no. 1, pages 6782,
1997. (p. 86)
[Xu et al.2012] W. Xu, J. Wei, J. M. Dolan, H. Zhao et H. Zha. A real-time motion planner
with trajectory optimization for autonomous vehicles. In IEEE Int. Conf. on Robotics
and Automation (ICRA), 2012. (p. 27, 32, 33, 46, 47 and 85)
[Xu et al.2014] W. Xu, J. Pan, J. Wei et J. M. Dolan. Motion planning under uncertainty
for on-road autonomous driving. In IEEE Int. Conf. on Robotics and Automation
(ICRA), 2014. (p. 161)
[Yager & Fileu 1993] R. R. Yager et D. P. Fileu. Learning of fuzzy rules by mountain clus-
tering. In Applications of Fuzzy Logic Technology, volume 2061, pages 246255. In-
ternational Society for Optics and Photonics, 1993. (p. 58)
[Yager & Filev 1994] R. R. Yager et D. P. Filev. Essentials of fuzzy modeling and control.
New York, vol. 388, 1994. (p. 51, 56 and 58)
[Yao et al.2012] W. Yao, H. Zhao, F. Davoine et H. Zha. Learning lane change trajectories
from on-road driving data. In IEEE Intelligent Vehicles Symposium (IV), 2012. (p. 27
and 46)
[Yen 1990] J. Yen. Generalizing the Dempster-Schafer theory to fuzzy sets. IEEE Trans.
Systems, Man, and Cybernetics, vol. 20, no. 3, pages 559570, 1990. (p. 68)
[Yu 2016] C. Yu, V. Cherfaoui et P. Bonnifait. Semantic evidential lane grids with prior
et al.
maps for autonomous navigation. In IEEE Int. Conf. on Intelligent Transportation
Systems (ITSC), 2016. (p. 25, 46 and 161)
[Zadeh 1965] L. A. Zadeh. Information and control. Fuzzy sets, vol. 8, no. 3, pages 338353,
1965. (p. 56)
[Ziegler & Stiller 2009] J. Ziegler et C. Stiller. Spatiotemporal state lattices for fast trajectory
planning in dynamic on-road driving scenarios.
In IEEE/RSJ Int. Conf. on Intelligent
Robots and Systems (IROS), 2009. (p. 27, 32 and 46)
Title : Motion Planning for Autonomous Highway Driving: A Unified Architecture for Decision-Maker and Tra-
jectory Generator
Keywords : Self-driving cars, Motion planning, Decision making, Trajectory generation, Intelligent Transporta-
tion Systems
Abstract : This thesis work is part of the development the evaluation function for the generated trajectory. It
of a self-driving car in highway environments. More thus makes it possible to elude both the discretiza-
precisely, it aims to propose a unified architecture of tion and position/speed decoupling problems. On the
trajectory planner and decision-maker taking into ac- other hand, the aggregation of fuzzy logic and be-
count the limitations of the environment and the avai- lief theory allows decision making on heterogeneous
lable data within the current development of sensors criteria and uncertain data. The proposed framework
technologies (distance limitations, uncertainties). also handles personalization of the vehicle’s behavior,
On the one hand, the method generates sigmoid tra- depending on the passengers’ risk perception and an
jectories in a continuous spatiotemporal representa- aggressive or conservative driving style.
tion of the evolution space, which is reduced befo- The presented approach was finally evaluated and va-
rehand by modeling collision-free intervals in nomi- lidated in a simulation environment, and then in a test
nal conditions of driving. The sigmoid parameters are vehicle. The planning block was integrated into the
subsequently optimized with a simulated annealing existing vehicle’s architecture, interfaced with the lo-
approach that uses the decision-maker algorithm as calization, obstacles’ perception and control blocks.
Université Paris-Saclay
Espace Technologique / Immeuble Discovery
Route de l’Orme aux Merisiers RD 128 / 91190 Saint-Aubin, France