Training Seminar Report 20117119
Training Seminar Report 20117119
Training Seminar Report 20117119
28/08/2023
Submitted by
CANDIDATE’S DECLARATION
I declare that the work carried out in this report entitled “Traffic Graph
Convolutional Recurrent Neural Network with LSTM and Transformers” is
presented on behalf of the fulfilment of the course MIN-499 submitted to the
Mechanical and Industrial Engineering Department, Indian Institute of
Technology Roorkee.
I further certify that the work presented in this report has not been submitted
anywhere for any kind of certification or award of any other degree/diploma.
Date: 28/08/2023
Place: Roorkee, India Shreya Singh
3
Tables of content:
1. Acronyms.............................................................................................4
2. Acknowledgement...............................................................................5
3. Abstract................................................................................................6
4. Introduction..........................................................................................7
5. Theoretical Background.......................................................................7
6. Objectives.............................................................................................8
7. Motivation ...........................................................................................8
8. Problem Statement...............................................................................9
9. Model architecture… ..........................................................................9
10.Methods..............................................................................................10
10.1. LSTM Model .............................................................................10
10.2.Transformers ...............................................................................11
10.3. Graph Convolutional Networks… .............................................12
11.Data set Analysis ................................................................................13
11.1. Study Area… .............................................................................13
11.2. Data Format ...............................................................................14
12.Methodology.......................................................................................15
12.1. TGC-LSTM……………………………………………………15
12.2. Transformers…………………………………………………..15
13.Results.................................................................................................16
14.Conclusion...........................................................................................18
15.References...........................................................................................19
4
Acronyms
Acknowledgment
I would like to extend my sincere gratitude to all those who have contributed to
the success of this project. Firstly, I would like to express my thanks to
Professor Neetesh Kumar Sir (CSE Dept. IIT Roorkee) for his guidance,
insights, and unwavering support throughout the project. His expertise and
mentorship have been instrumental in shaping our project.
We also want to thank Ms. Nisha Singh Chauhan Ma’am (PhD student at IIT
Roorkee)for her valuable insights in completing the project.
I am grateful to have had the opportunity to undertake this project.
6
Abstract
4. Introduction
5. Theoretical background
6. Objectives
7. Motivation
We learn the traffic network as a graph and define a traffic graph convolution
operation to capture spatial features from the traffic network. The traffic graph
convolution incorporates the adjacency matrix and the proposed free-flow
reachable matrix to extract localized features from the graph. We propose a
traffic graph convolutional LSTM neural network and Transformers to forecast
network-wide traffic states.
We also design two regularization terms on the TGC weights and TGC features,
respectively, that can be added to the model’s loss function to help the learned
TGC weight to be more stable and interpretable.
By evaluating on real-world traffic datasets, our approach is proved to be
superior to the compared baseline models. In addition, the learned TGC weight
can help to identify the most influential roadways, and thus, enhance the
interpretability of the proposed model.
9
8. Problem statement
9. Model Architecture
fi
10
10. Methods
1. LSTM model:
2. Transformers Model :
To enrich the feature space, the features extracted from different orders
(from 1 to K) of traffic graph convolution with respect to Xt are
concatenated together as a vector defined as follows
1. Study area:
2. Data Format:
fi
.
15
13. Methodology
1. TGC-LSTM: Given the traffic state data xt and the graph related matrices as input, the
process of generating the final output ht after t steps of iteration.
The minibatch gradient descent process and the backpropagation-based
parameter updating process.
16
14. Results
K L1-Norm L2-Norm
LSTM 2.98492495211943 0.148709757091356
LSGC-LSTM 1 4.26501764010497 0.292260112906983
LSGC-LSTM 2 4.14056689761565 0.31158408806262
LSGC-LSTM 3 4.1438384588285 0.321233727728875
LSGC-LSTM 4 4.20053284966008 0.283045182342036
LSGC-LSTM 5 4.11924856439494 0.293083399091012
GC-LSTM 1 3.0253767472294 0.174000397808078
GC-LSTM 2 2.76522413252443 0.129908970591243
GC-LSTM 3 2.66644555451595 0.119873028215239
GC-LSTM 4 2.64367515846024 0.123952958671975
GC-LSTM 5 3.89547943482374 0.140417291946409
14. Conclusion
References