0% found this document useful (0 votes)
55 views17 pages

DB Report Springer Paper

The document describes a method to emulate emergency vehicle navigation in mixed traffic environments using graph prediction and simulation. A graph neural network algorithm called GraphSAGE is used to learn representations of relationships between nodes in a graph based on traffic data, and predict link states. A custom action policy function then uses the predicted link states to determine lane changes, speeds, and accelerations for the emergency vehicles in a large-scale traffic simulation, allowing them to navigate more efficiently compared to baseline approaches. The method aims to better simulate real-world complex traffic than reinforcement learning alone by incorporating graph neural networks to model the intricate relationships between vehicles.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
55 views17 pages

DB Report Springer Paper

The document describes a method to emulate emergency vehicle navigation in mixed traffic environments using graph prediction and simulation. A graph neural network algorithm called GraphSAGE is used to learn representations of relationships between nodes in a graph based on traffic data, and predict link states. A custom action policy function then uses the predicted link states to determine lane changes, speeds, and accelerations for the emergency vehicles in a large-scale traffic simulation, allowing them to navigate more efficiently compared to baseline approaches. The method aims to better simulate real-world complex traffic than reinforcement learning alone by incorporating graph neural networks to model the intricate relationships between vehicles.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

The Report is Generated by DrillBit Plagiarism Detection Software

Submission Information

Author Name Shubham


Title Emulating Emergency Vehicle
Paper/Submission ID 1125724
Submitted by [email protected]
Submission Date 2023-11-23 12:24:22
Total Pages 13
Document type Article

Result Information

Similarity 10 %
1 10 20 30 40 50 60 70 80 90

Sources Type Report Content


Words < Quotes
Internet 14, 4.92%
2.83% 5.41%

Journal/
Publicatio
n 7.17% Ref/Bib
11.66%

Exclude Information

Quotes Not Excluded


References/Bibliography Not Excluded
Sources: Less than 14 Words Similarity Not Excluded
Excluded Source 0%
Excluded Phrases Not Excluded

A Unique QR Code use to View/Download/Share Pdf File


DrillBit Similarity Report

A-Satisfactory (0-10%)
B-Upgrade (11-40%)

10 28 A C-Poor (41-60%)
D-Unacceptable (61-100%)
SIMILARITY % MATCHED SOURCES GRADE

LOCATION MATCHED DOMAIN % SOURCE TYPE

1 docview.dlib.vn Publication
1

2 Student Thesis Published in HAL Archives Publication


1

3 mdpi.com Internet Data


1

4 www.emerald.com Internet Data


<1

5 www.mdpi.com Internet Data


1

6 www.dx.doi.org Publication
<1

7 www.dx.doi.org Publication
<1

8 docview.dlib.vn Publication
<1

9 A decision model for energy companies that sorts projects, classifies the Publication
<1
projec by Oliveira-2015

10 SpringerBriefs in Applied Sciences and Technology Fluid Flow and He, Publication
<1
by Vadasz, Peter- 2016

11 ijcsit.com Publication
<1

12 IEEE 2015 IEEE 18th International Conference on Intelligent Transpor Publication


<1
by

13 www.dx.doi.org Publication
<1
14 IEEE 2019 28th International Conference on Computer Communication Publication
<1
an

15 arxiv.org Internet Data


<1

16 matjournals.in Publication
<1

17 mdpi.com Internet Data


<1

18 Academic Skills, Community Engaged, Leadership, and Global Themed Publication


<1
First-Year Sem by Boettler-2020

19 Age differences in neural correlates of feedback processing after Publication


<1
economic decis by Fernandes-2018

20 Analysis of water security and source preferences in rural Tanzania by Publication


<1
Ngasala-2018

21 biomedcentral.com Internet Data


<1

22 Clarification of and comment on Erlandson etal Life on the Edge by Publication


<1
Lore-2009

23 repository.uclawsf.edu Publication
<1

24 Singularly perturbed implicit control law for linear time-varying delay Publication
<1
MIMO sys by Puga-2015

25 Thermodynamic Optimization of Three-Fluid Cross-Flow Heat Publication


<1
Exchanger U, by Jyothiprakash, K.H.-

26 Thesis Submitted to Shodhganga Repository Publication


<1

27 vdocuments.mx Internet Data


<1

28 www.science.gov Internet Data


<1
Emulating Emergency Vehicle Navigation in Mixed
Traffic Environment Using Graph Prediction and
Simulation

Shubham S1, Amogh N Rao1, Shuchith B U1, Siddarth M P1, and Bhaskarjyoti Das1
11 16
Department of Computer Science and Engineering,
PES University, Bengaluru, Karnataka, India
[email protected] , [email protected] , [email protected] ,
[email protected] , [email protected]

Abstract. Emergency vehicles (EVs) must move through areas with mixed traf-
fic as fast as they can in order to deliver aid in a timely manner. The dynamic
and unpredictable nature of traffic makes this a difficult task. In some bottle-
neck situations, such as highway ramping and intersections, reinforcement
learning (RL) has been used to simulate traffic and improve EV navigation.
However, the complexity of real-world traffic environments is not fully cap-
tured by RL-based 1 simulations, which are frequently
8 restricted to a small num-
ber of entities. In this paper, we propose a novel method that makes use of
GraphSAGE and a custom action policy function to simulate EV navigation in
mixed-traffic environments using SUMO simulation. A graph neural network
(GNN) algorithm called GraphSAGE can be used to learn node representations,
or the representations of the relationships between nodes in a graph, through
training. The custom action policy function uses the link states that we predict
using GraphSAGE to determine the EVs' next course of action and makes deci-
sions about lane changes, speed adjustments, and acceleration based on the pre-
dicted link states, vehicle types, and relative positions of vehicles. We demon-
strate that, when compared to baseline approaches, our approach can signifi-
cantly improve the navigation efficiency of EVs. Our findings demonstrate how
well graphs model and simulate intricate traffic situations. Our simulation can
be used to investigate how various traffic situations and vehicle kinds affect EV
navigation.

Keywords: GNN, GraphSAGE, mixed traffic environment, custom action,


SUMO simulation, Dynamic Graphs

1 Introduction

The integration of autonomous vehicles (AVs) into our traffic networks is happening
quite quickly. Although completely autonomous driving is still a ways off, in the near
future, traffic will be composed of both human-driven and autonomous vehicles. The
unpredictable behavior of other drivers, the restricted sensor range, and the intricate
2

traffic networks present a variety of difficulties for emergency vehicles (EVs) operat-
ing in this mixed traffic environment. In situations where there is mixed traffic, EV
navigation must be both safe and effective in order to quickly assist individuals in
need. However, because traffic is dynamic and complex, this task is difficult.
Learning representations of nodes in a graph is possible with the use of machine
learning models called graph neural networks (GNNs). The efficacy of GNNs has
been demonstrated in numerous traffic network-related tasks, including congestion
prediction and traffic forecasting. As well as learning representations of EVs that
capture their objectives and current state, GNNs can be used to model the traffic net-
work's overall state. Equipped with this data, an action policy function that is compre-
hensive in nature can be developed to assist EVs in safely and efficiently navigating
mixed1 traffic environments.
In this paper, we propose an innovative method that uses GNNs for efficient EV
navigation in mixed-traffic environments. Our method tackles the problem of sensor
failures in real-life situations by utilizing GraphSAGE for link prediction. Additional-
ly, we create a unique action policy function that considers the EV's current condition
as well as the types of vehicles in the network and the traffic network's overall state.
We assess our suggested method on the generated environment and demonstrate that,
in terms of both navigation efficiency and safety, it performs noticeably better than
baseline methods.

2 Related Work

2.1 Simulation Environment

A diverse range of simulation environments has been employed to assess and validate
autonomous driving systems in mixed traffic environments. The most prevalent
choice is the SUMO platform [1][2][3][4][9], which provides a realistic traffic simu-
lation framework for modeling various road networks and the interaction between
autonomous and human-driven vehicles. Additionally, the CARLA simulation [15]
has been used to train and evaluate the DiGNet system, offering high-fidelity scenari-
os across complex maps, including urban, rural, and highway environments.
The robustness and generalizability of the suggested autonomous driving systems
are greatly enhanced by the versatility and depth of these simulation platforms. How-
ever, rather than concentrating on particular use cases, the majority of research studies
currently in existence assess the overall performance of autonomous driving systems.
The lack of research in this area restricts the development and assessment of custom-
ized solutions for particular traffic situations, like emergency vehicle navigation.

2.2 Graph Representation and GNNs

Graph representation and the application of Graph Neural Networks (GNN) emerge as
pivotal elements in enhancing decision-making for autonomous vehicles. Several
papers leverage graph-based models to represent vehicle interactions and properties.
In [1], every vehicle is depicted as a node, and interactions as edges in an undirected
3

network, forming the basis for the proposed modular framework's state space. In [3],
the Graphic Convolution Q network (GCQ) integrates Deep Q Network (DQN) and
Graph Convolutional Neural Network (GCN) to facilitate cooperative lane change
decisions, using graph-based techniques to aggregate collaborative sensing data. The
adoption of GNN is further exemplified in [5][6][13][15], where graph structures
capture spatial and temporal relationships among vehicles, enabling efficient deci-
sion-making in various cooperative and mixed-traffic scenarios. These approaches
underscore the efficacy of graph-based representations and GNN architectures in
modeling complex interactions and improving the decision-making capabilities of
autonomous vehicles.

2.3 Action Policy and outcome

The proposed methodologies in the surveyed papers exhibit a variety of action poli-
cies designed to optimize autonomous vehicle decision-making. Reinforcement
Learning (RL) algorithms, such as Deep Q-Learning [3], Curriculum through Self-
Play [2], and Policy-based training methods [9][13], are recurrent themes. The utiliza-
tion of advanced RL techniques, like Duelling Double DQN [1], Curriculum through
Self-Play [2], and LSTM-Q [3], highlights the importance of effective learning strate-
gies in achieving superior outcomes in interactive traffic scenarios. The incorporation
of novel approaches, such as the Multi-Agent Reinforcement Learning (MARL)
framework with Connected Automated Vehicle Graph [4] and the DiGNet system
[15], showcases the diversity of methodologies aimed at improving efficiency, safety,
and cooperation in autonomous driving.
However, lane changes are rarely taken into consideration, and the majority of cur-
rent action spaces are restricted to simple movements like acceleration and decelera-
tion. In addition, the number of vehicles that could be simulated was constrained by
the complexity of RL algorithms. This limitation is addressed by our proposed custom
action policy function, which allows complex maneuvers such as lane changes based
on the current graph state. This enables us to model more realistic traffic scenarios
and simulate a larger network.

3 Simulation Environment

Three simulation scenarios—highway merge shown in Fig.1, intersection shown in


Fig. 2, and citywide scenarios shown in Fig. 3 are used to assess the efficacy of our
suggested method for emergency vehicle (EV) navigation in mixed traffic environ-
ments. All the scenarios are based on real-world traffic situations with human vehi-
cles, autonomous vehicles, and emergency vehicles. For all scenarios, the complex
traffic network is modeled using SUMO, a well-known and reliable simulation plat-
form.
4

Fig. 1. Merge Network

Fig. 2. Intersection Network

Fig. 3. City Network


5

The IDM3 algorithm, which accurately replicates human travel behaviour, is used
to model human-driven vehicles. Our proposed action policy function is used by
autonomous vehicles to receive actions and also applies the IDM3 algorithm. This
combination ensures safe and effective navigation by enabling autonomous vehicles to
make decisions based on the action policy23 function. Emergency vehicles are able to
move through the traffic network faster because they are given priority over other
vehicles. By setting priorities, emergency vehicles can get to their destinations faster
and possibly save more lives.
7
Three metrics are taken into consideration in order to fully assess the effectiveness
of our suggested approach: overall traffic waiting time, depart delay, and navigation
waiting time. The effectiveness of the navigation strategy is evaluated by measuring
the amount of time an emergency vehicle waits while attempting to reach its
destination, a measure known as navigation waiting time. Depart delay determines the
congestion level, which assesses how well the action policy function performs in
averting collisions and guaranteeing smooth travel of traffic. The effect of the
navigation strategy on the larger traffic system is measured by overall traffic waiting
time, which is determined by the overall flow and congestion of the traffic network.
We compare two baseline approaches: Baseline 1 uses the IDM3 algorithm only
and does not use any custom action policy, while Baseline 2 uses a custom action
policy function that uses simulation obtained by the traCI library instead of
GraphSAGE to predict links.

4 Methodology

The overall architecture of the simulation's interaction with the GraphSAGE algo-
rithm is presented in Fig. 4. It can be essentially divided into three primary compo-
nents: Graph representation, GraphSAGE and link prediction, and custom action poli-
cy function.

Fig. 4. City Network


6

4.1 Graph Representation

At each timestep of the simulation, the vehicles present are identified and catego-
rized as either HV i.e. Human Vehicle, AV i.e. Autonomous Vehicle, or EV i.e.
Emergency Vehicles. The TraCI library is used to extract their attributes, such as
position, speed, and lane information. This library makes communication with the
SUMO traffic simulation software easier. Next, using the extracted vehicle data, a
graph is built, with nodes standing in for the vehicles. Based on the position data of
the two vehicles, an edge is established by calculating the distance between them
using the formula mentioned in Eq. 1. An edge is added to the graph if this distance
drops below a set threshold.

푑 = √ (푥2 − 푥1 )2 − (푦2 − 푦1 )2 (1)

Edge formation is limited to interactions between AVs and HVs, EVs and HVs, or
AVs and EVs in order to better represent real-world scenarios. Connections between
HVs themselves are not included in this. This distinction arises from the fact that HVs
are treated as passive actors in the simulation, with decision-making powers exclu-
sively assigned to AVs and EVs. The graph better captures the dynamics of traffic
scenarios involving AVs, EVs, and HVs by restricting edge formation to these inter-
actions.
The graph representation consists of two components. One is the Node or Vehicle
features matrix which is essentially made up of features list of individual vehicles in
the environment.

푉푇 = [푣푡1 , 푣푡2 , 푣푡3 , … . 푣푡푖 , … … 푣푡푛 ]푇 (2)

Here in Eq. 2. viT represents the feature matrix of vehicle vi. The second compo-
nent of the graph representation is edge list E. It is a two-dimensional array consisting
of vehicle IDs as values.

푖 푣 푘ꢁ
퐸 = ꢀ 푣푗 ⋯ ⋯ (3)
푣 ⋯⋯ 푣푙

In Eq. 3. vi and vj are connected by an edge and similarly vehicles vk and vl also
have an edge connecting them.

4.2 GraphSAGE and Link Prediction

When 9an emergency vehicle needs to get to its destination fast and effectively, its
route needs to be optimized. This is known as emergency vehicle routing. The process
of optimizing emergency response techniques can be further enhanced by link predic-
tion, which forecasts the possibility of links or interactions between vehicles. Real-
world applications of sensors include gathering vehicle location data, which is subse-
quently utilized to connect vehicles based on their proximity to one another. But mal-
7

functioning sensors can cause gaps in data, which means there won't be any connec-
tions between the cars. The decision-making process for emergency vehicles can be
improved by using link prediction techniques to fill up these gaps by forecasting the
missing edges.
To predict the missing links between vehicles, we employ a specialized model
called GraphSageNet. This model comprises two main components:

GraphSAGE. This component creates the node embeddings for cars. In this instance,
the traffic network's vehicles' attributes and interactions with one another are captured
by the node embeddings produced by the GraphSAGE model.

LinkPredictorNN. This 2 component forecasts whether or not there are links connect-
ing the cars. It creates a link prediction score for each pair of nodes based on the node
embeddings that GraphSAGE generates. The likelihood of a link between the two
related nodes increases with the link prediction score.
25
The GraphSAGE model consists of two layers of SAGEConv, which is a variant
of GraphSage graph neural network (GNN), that aggregates information from neigh-
boring nodes to generate node representations.

퐺푇 = ∅ꢀ푟푎푝ℎ푆푎ꢁ푒 (푉푡, 퐸) (4)

Here in Eq. 4. GT refers to the node embeddings of vehicle nodes given by


GraphSage module. The LinkPredictionNN(LNN) consists of two fully connected
layers. The first layer takes the output-dimensional node embeddings from the
GraphSAGE model as input and produces hidden-dimensional representations. The
second layer takes the hidden-dimensional representations as input and produces a
single output value, which represents the probability of a link existing between the
two corresponding nodes.
푃푇 = 휎퐿푁푁(퐺푇 ) (5)

And in Eq. 5. PT refers to prediction scores given by the LinkPredictionNN mod-


ule. In order to identify possible links, the LinkPredictionNN efficiently learns pat-
terns in the informative node embeddings created by the GraphSAGE model, which
captures the interactions between vehicles. Making use of the advantages of both the
LinkPredictionNN and the GraphSAGE model, the combined GraphSageNet model
predicts the missing links between vehicles with high accuracy.
The GraphSageNet model is trained using a dataset obtained from the SUMO (Simu-
lation of Urban MObility) software, utilizing the TraCI (Traffic Control Interface)
API. The dataset represents a traffic network extracted from OpenStreetMap (OSM)
data. A snapshot of the traffic network taken at a particular moment in time is used to
train the model. The training graph's edges are constructed using the same process as
described in the section on graph representation. GraphSageNet is used in the actual
simulation process to forecast the presence of any potential missing links after the
traffic graph has been constructed. Then, in the simulated environment, the Custom
8

Action policy is applied to make well-informed decisions for both autonomous and
emergency vehicles.

4.3 Custom Action Policy

The main element in charge of choosing actions based on the traffic graph's current
state is the custom action policy function. The presence of edges in the graph is nec-
essary for it to function. The action policy function is triggered to determine a score
20
for every vehicle when there are sufficient edges in the graph. Depending on the type
of vehicle and the characteristics of its surrounding nodes and edges, this score is the
basis for decision-making.
For emergency vehicles (EVs), a high score is assigned, prompting an adjustment
of their speed towards their maximum allowable limit. This guarantees that electric
vehicles (EVs) can quickly and effectively navigate the traffic network to get to their
destinations.
The action policy function for autonomous vehicles (AVs) uses neighborhood data
to decide what speed modifications are appropriate. The action policy function tells an
AV to switch lanes to make room for an EV if they are in the same lane. When the
autonomous vehicle (AV) is in a separate lane, it is advised to reduce speed to prevent
possible collisions.
The custom action policy function uses the vast amount of information found in the
traffic graph to plan an orderly and effective vehicle movement that guarantees emer-
gency vehicles can navigate through mixed traffic situations in a timely and safe
manner.

5 Results and Analysis


19 1
Unlike previous studies using Reinforcement Learning to select actions, our method
does not require the high24 processing power of GPU hardware to model the network.
Extensive computations are 10not necessary because our custom action policy function
generates concise actions based on the traffic graph's current state.
2 We examined three separate cases for each of the three simulation environ-
ments in order to assess the efficacy of our methodology. Using no algorithmic inter-
vention, the first case simulates a scenario and acts as a baseline. The second example
builds
11 the graph with our custom action policy function applied, utilizing simulation
attributes that are taken from SUMO. In the final scenario, link states are inferred
using GraphSAGE, and our custom action policy function is employed based on the
resulting graph. The experimental parameters for the three scenarios merge, intersec-
tion, and city network scenarios are presented in Table. 1.
9

Table 1. Parameter Settings

Scenarios
Parameters
Merge Intersection City

Number of HVs 23 26 47

Number of AVs 14 16 26

Number of EVs 3 2 7

Speed limit of EVs and AVs 140 km/h 140 km/h 120 km/h

Speed limit of HVs 120 km/h 120km/h 100 km/h

Number of Lanes 2 2 <3

5.1 Merge

The relatively sparse distribution of cars in the merge scenario suggests that individu-
al vehicle driving behavior has little effect on the overall flow of traffic. This finding
seemingly clarifies the performance patterns shown in Table. 2. Even though the im-
provement in performance over the baseline is not very large, it is still noteworthy.
Furthermore, the GraphSAGE-based approach's performance is close to that of
the condition where the graph was created with simulation attributes.

Table 2. Performance on Merge

Parameters
Simulation Scenario
Emergency Average Wait-
Depart Delay
Waiting Time ing Time

Baseline 7.2s 1.2s 12.8s

Graph made by simulation


5.5s 0s 10.2s
attributes

Graph made by GraphSAGE 6.1s 0.3s 11.1s


10

5.2 Intersection
28
The optimization effect attained in the intersection scenario was more noticeable than
that seen in the merge scenario, as clearly seen in Table. 3. This discrepancy is proba-
bly due to the increased mutual influence between cars in the intersection situation,
where their movements and interactions have a bigger impact on the direction of traffic
flow as a whole. Interestingly, the simulation-based approach's performance and the
GraphSAGE-based approach's closely match which highlights how well GraphSAGE
captures the complex dynamics of the intersection scenario.

Table 3. Performance on Intersection

Parameters
Simulation Scenario
Emergency Wait- Average Waiting
Depart Delay
ing Time Time

Baseline 9.3s 2.4s 15.8s

Graph made by simulation


3.6s 0.2s 3.2s
attributes

Graph made by GraphSAGE 3.8s 0.5s 3.1s

5.3 City Network


As can be seen from Table. 4, the custom action policy function significantly outper-
forms the baseline when combined with the graph for the city network. Furthermore,
the simulation-based approach and the GraphSAGE-based approach for graph con-
struction closely match, indicating GraphSAGE's strong prediction abilities. Due to
computational constraints, large-scale traffic simulations are just not possible, so this is
a significant improvement over RL-based action systems.

Table 4. Performance on City Network

Parameters
Simulation Scenario
Emergency Average Waiting
Depart Delay
Waiting Time Time

Baseline 12.2s 2.6s 20.8s

Graph made by simulation


4.8s 0.4s 8.1s
attributes

Graph made by GraphSAGE 4.9s 0.8s 9.6s


11

13
The outcomes show that the use of a graph in conjunction with a custom action policy
function has greatly increased the simulation's efficiency. Moreover, GraphSAGE is an
effective way for predicting links for graphs in such forms of traffic networks and can
be used efficiently in these kinds of situations. The train-test curve for GraphSAGE, as
depicted in Fig 5. illustrates the loss and highlights its low value.

Fig. 5. Train Test curve for GraphSAGE

6 Conclusions and Future Scope


14
This paper uses GraphSAGE and a custom action policy function to present a novel
approach to EV navigation in mixed-traffic environments. By capturing the dynamic
and unpredictable nature of traffic, our method
21 outperforms baseline approaches in
enabling EVs to navigate. EV navigation efficiency has been significantly improved by
integrating GraphSAGE for graph representation and link prediction with a custom
action policy function for EV navigation optimisation. This reveals the potential of
graph-based traffic modelling and prediction methods in practical traffic control
situations, especially for vital applications such as emergency vehicle navigation.
The suggested method has a number of benefits over conventional RL-based
techniques. GraphSAGE enables more precise link prediction and EV navigation by
efficiently17
capturing the intricate spatial relationships present in the traffic network.
Second, these relationships are taken into account by the custom action policy
function, which optimises traffic flow for EVs and results in more effective navigation.
Third, the SUMO simulation framework's integration of GraphSAGE and the custom
action policy function enables large-scale traffic simulations, getting around the
computational constraints of RL-based techniques.
12

22
All things considered, our research shows the potential of graph-based traffic
modelling and prediction methods in practical traffic management applications,
especially for crucial situations such as electric vehicle (EV) navigation. Our suggested
method provides a viable means of enhancing the effectiveness of EV navigation and
guaranteeing prompt
27 assistance delivery during emergencies.Future research will
examine how various graph neural network architectures 18affect our method's
effectiveness, create a more complex action policy function that accounts for other
variables like pedestrians and road conditions, and test our method on a bigger and
more varied dataset of traffic scenarios. These efforts will improve the efficacy of our
suggested methodology and expand its suitability to an expanded array of actual traffic
situations.

References
1. Liu, Qi, et al. "Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent
Decision-Making in Mixed Traffic Environments." arXiv preprint arXiv:2201.12776
(2022).
2. G. -P. Antonio and C. Maria-Dolores, "Multi-Agent Deep Reinforcement Learning to
Manage Connected Autonomous Vehicles at Tomorrow's Intersections," in IEEE Transac-
tions on Vehicular Technology, vol. 71, no. 7, pp. 7033-7043, July 2022, doi:
10.1109/TVT.2022.3169907.
3. Dong, Jiqian, Sikai Chen, Paul Young Joun Ha, Yujie Li, and Samuel Labi. "A drl-based
multiagent cooperative control framework for cav networks: a graphic convolution q net-
work." arXiv preprint arXiv:2010.05437 (2020).
4. T. Shi, J. Wang, Y. Wu, L. Miranda-Moreno, and L. Sun, "Efficient Connected and Auto-
mated Driving System with Multi-agent Graph Reinforcement Learning," IEEE Transac-
tions on Intelligent Transportation Systems, doi: 10.1109/TITS.2021.3067931, 2021.
5. Klimke, Marvin, Benjamin Völz, and Michael Buchholz. "Automatic Intersection Man-
agement in Mixed Traffic Using Reinforcement Learning and Graph Neural Networks."
arXiv preprint arXiv:2301.12717 (2023).
6. Klimke, Marvin, Benjamin Völz, and Michael Buchholz. "Cooperative Behavior Planning
for Automated Driving using Graph Neural Networks." In 2022 IEEE Intelligent Vehicles
Symposium (IV), pp. 167-174. IEEE, 2022.
7. Klimke, Marvin, et al. "An Enhanced Graph Representation for Machine Learning Based
Automatic Intersection Management." 2022 IEEE 25th International Conference on Intel-
ligent Transportation Systems (ITSC). IEEE, 2022.
8. J. Dong et al., "Spatio-weighted information fusion and DRL-based control for connected
autonomous vehicles," 2020 IEEE 23rd International Conference on Intelligent Transpor-
tation Systems (ITSC), Rhodes, Greece, 2020, pp. 1-6, doi:
10.1109/ITSC45102.2020.9294550.
9. Gao, Xin, et al. "Multi-agent decision-making modes in uncertain interactive traffic sce-
narios via graph convolution-based deep reinforcement learning." Sensors 22.12 (2022):
4586.
10. Liu, Qi, Xueyuan Li, Yujie Tang, Xin Gao, Fan Yang, and Zirui Li. "Graph Reinforcement
Learning-Based Decision-Making Technology for Connected and Autonomous Vehicles:
Framework, Review, and Future Trends." Sensors 23, no. 19 (2023): 8229.
13

11. Lu, Yuhuan, Wei Wang, Xiping Hu, Pengpeng Xu, Shengwei Zhou, and Ming Cai. "Vehi-
cle trajectory prediction in connected environments via heterogeneous context-aware graph
convolutional networks." IEEE Transactions on Intelligent Transportation Systems (2022).
12. Liu, Tao, Aimin Jiang, Jia Zhou, Min Li, and Hon Keung Kwan. "GraphSAGE-Based Dy-
namic Spatial–Temporal Graph Convolutional Network for Traffic Prediction." IEEE
Transactions on Intelligent Transportation Systems (2023).
13. Shang, Yigeng, Zhigang Hao, Chao Yao, and Guoliang Li. "Improving Graph Neural
Network Models in Link Prediction Task via A Policy-Based Training Method." Applied
Sciences 13, no. 1 (2022): 297.
14. Rathore, Heena, and Henry Griffith. "GNN-RL: Dynamic Reward Mechanism for Con-
nected Vehicle Security using Graph Neural Networks and Reinforcement Learning." In
2023 IEEE International Conference on Smart Computing (SMARTCOMP), pp. 201-203.
IEEE, 2023.
15. Cai, Peide, Hengli Wang, Yuxiang Sun, and Ming Liu. "DiGNet: Learning scalable self-
driving policies for generic traffic scenarios with graph neural networks." In 2021
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 8979-
8984. IEEE, 2021.

You might also like