Smart City Traffic Flow and Signal Optimization Using STGCN-LSTM and PPO Algorithms
Smart City Traffic Flow and Signal Optimization Using STGCN-LSTM and PPO Algorithms
ABSTRACT Urban traffic congestion remains a critical challenge for smart city development, necessitating
innovative approaches to improve traffic flow and reduce delays. This study presents a novel framework that
integrates the Spatiotemporal Graph Convolutional Network-Long Short-Term Memory (STGCN-LSTM)
model for traffic flow prediction with the Proximal Policy Optimization (PPO) algorithm for dynamic traffic
signal control. The STGCN-LSTM model captures complex spatiotemporal dependencies, achieving an R2
of 0.904 on the METR-LA dataset. Extensive experiments and ablation studies highlight the complementary
strengths of STGCN and LSTM, with the hybrid model outperforming standalone variants. The PPO
algorithm dynamically adjusts signal timings, reducing vehicle waiting times by 30% and increasing traffic
throughput by 15%. Incorporating external factors, such as weather and holidays, enhances the framework’s
robustness in dynamic conditions, including adverse weather and traffic surges. GPU acceleration ensures
scalability, enabling deployment in large-scale urban networks efficiently. This framework demonstrates
significant potential to address urban congestion, reduce carbon emissions by 12%, and support sustainable
urban development. Future research will explore edge computing, multi-agent reinforcement learning, and
real-time data integration to further enhance scalability and adaptability.
INDEX TERMS Long short-term memory (LSTM), intelligent transportation systems, proximal policy
optimization (PPO), spatio-temporal graph convolutional networks (STGCN), traffic flow prediction.
2024 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License.
15062 For more information, see https://creativecommons.org/licenses/by/4.0/ VOLUME 13, 2025
T. Lin, R. Lin: Smart City Traffic Flow and Signal Optimization Using STGCN-LSTM and PPO Algorithms
emergencies [7]. Furthermore, while state-of-the-art meth- handling stationary data, it struggles with the non-linear and
ods like Convolutional Neural Networks-Long Short-Term dynamic nature of urban traffic [11], [12]. Machine learning
Memory (CNN-LSTM) and Transformer architectures have techniques, such as Support Vector Regression (SVR) and
advanced spatio-temporal modeling, their high computa- Random Forest (RF), have also been introduced to enhance
tional demands present barriers to real-time deployment in prediction accuracy. However, these methods are inherently
large-scale urban environments [8]. limited in capturing the spatial and temporal dependencies
This study proposes a novel hybrid framework to address across complex urban environments [13].
these challenges. The framework integrates STGCN and Advancements in machine learning and neural networks
LSTM networks for traffic flow prediction. Additionally, have revolutionized traffic modeling, with Graph Neu-
it employs PPO to dynamically optimize traffic signal con- ral Networks (GNNs) and Transformer-based architectures
trol. This hybrid approach captures intricate spatio-temporal emerging as leading tools in this domain [14].
dependencies and dynamically adjusts signal timings to
reduce congestion and enhance overall traffic efficiency [9]. A. TRAFFIC FLOW PREDICTION RESEARCH
To further improve adaptability, the framework incorporates 1) TRADITIONAL APPROACHES
external factors such as weather conditions, holidays, and The ARIMA model has demonstrated strong performance in
unplanned events, enabling it to perform effectively across time series analysis, particularly with stationary linear data.
diverse urban scenarios [10]. However, it struggles with the non-linear and dynamic char-
This study introduces an innovative framework integrating acteristics of urban traffic patterns, limiting its applicability
STGCN with LSTM networks for traffic flow prediction in real-world scenarios [15]. To address these limitations,
and PPO for dynamic signal control. The key contributions models such as SVR and RF have been introduced to improve
are: prediction accuracy. While effective in capturing some rela-
tionships, these methods remain inadequate for modeling the
A. UNIFIED SPATIOTEMPORAL FRAMEWORK
spatial dynamics of urban traffic [16].
The integration of STGCN and LSTM models both spatial
and temporal dependencies. The proposed method addresses
2) ADVANCES IN DEEP LEARNING
the limitations of prior approaches and enhances prediction
accuracy in urban traffic networks. The advent of deep learning has marked a significant mile-
stone in traffic flow prediction. LSTM networks effectively
B. DYNAMIC SIGNAL OPTIMIZATION capture both short- and long-term dependencies in time series
The PPO algorithm enables real-time traffic signal control. data, making them ideal for traffic prediction tasks [17].
It adapts to changing traffic conditions, reducing vehicle However, LSTM focuses primarily on temporal dependencies
waiting times and increasing throughput. and lacks the ability to model spatial relationships between
road segments. To overcome this limitation, researchers
C. ENHANCED ADAPTABILITY have developed hybrid models that integrate spatio-temporal
External factors such as weather and holidays are incorpo- features. One prominent example is the STGCN, which
rated, improving the model’s robustness under real-world combines Graph Convolutional Networks (GCNs) to capture
traffic variations. spatial dependencies with LSTM or temporal convolu-
tion layers to model temporal dynamics. This combination
D. SCALABLE DEPLOYMENT has significantly improved prediction accuracy, particu-
The framework leverages GPU acceleration and supports larly in complex and interconnected urban transportation
edge computing. It ensures scalability and computational systems [18].
efficiency for large-scale urban traffic systems.
This study provides a scalable and adaptive solution for 3) EMERGING TECHNOLOGIES: TRANSFORMER MODELS
modern urban traffic challenges. It demonstrates potential to AND CNN-LSTM
improve traffic flow, reduce congestion, and support smart Transformer models have recently demonstrated outstand-
city sustainability goals. ing performance in spatio-temporal sequence prediction.
Originally developed for natural language processing, these
II. LITERATURE REVIEW models utilize a self-attention mechanism to effectively cap-
Urban transportation management has been extensively stud- ture long-range dependencies [19]. In traffic flow forecasting,
ied, leading to the development of various models and spatio-temporal Transformers excel at capturing both spatial
methods aimed at optimizing traffic flow, reducing conges- and temporal dependencies simultaneously, offering scalable
tion, and improving mobility. The Autoregressive Integrated solutions for complex traffic networks. The CNN-LSTM
Moving Average (ARIMA) model has been widely used to hybrid model has also been explored for its ability to combine
analyze and predict traffic dynamics. Similarly, agent-based the strengths of Convolutional Neural Networks (CNN) for
microscopic simulations have been employed to model traffic spatial feature extraction and LSTM for modeling temporal
behavior at the individual level. While ARIMA is effective for dynamics. This approach has shown promise in high-density
Spatio-Temporal Graph Convolutional Networks (STGCN) Through LSTM, the model can remember and update traf-
with Long Short-Term Memory (LSTM) networks. This fic flow features from past time steps and, combined with
hybrid approach simultaneously addresses both temporal and spatial features, predict future traffic flow. The final model
spatial dependencies in traffic flow. The analysis utilizes the structure, by integrating the strengths of both STGCN and
METR-LA [23] dataset, which contains traffic data from sen- LSTM, captures both spatial and temporal features, improv-
sors in the Los Angeles metropolitan area collected between ing the accuracy of traffic flow prediction.
March 2012 and June 2012. This dataset includes variables
such as vehicle speed, traffic flow, and occupancy, offering 3) DESIGN DETAILS OF STGCN AND LSTM
comprehensive insights into spatio-temporal traffic patterns. In this study, a hybrid model combining STGCN and LSTM is
By leveraging these data, the model captures diverse spatio- employed to capture the spatiotemporal dependencies of traf-
temporal dependencies, enhancing its generalization ability fic flow. Specifically, STGCN extracts spatial features using
across different urban environments. graph convolution layers, while LSTM models the temporal
dynamics of traffic flow. The table 1 below outlines the design
1) STGCN MODEL details of the STGCN and LSTM components:
STGCN is a model that combines GCN with temporal con-
volution, specifically designed to handle spatio-temporal TABLE 1. Design details of STGCN and LSTM components.
sequence data. In an urban traffic network, traffic flow can
be represented as node data on a graph, where each node
represents a road segment or intersection, and edges denote
spatial connections between different road segments. Given
the adjacency matrix A and degree matrix D of the urban
traffic network, the basic operation of graph convolution can
be expressed as:
Here, ft , it and ot represent the forget gate, input gate, and 1) REINFORCEMENT LEARNING FRAMEWORK
output gate, respectively; Ct is the memory cell, and ht is the The traffic signal control problem can be modeled as a
hidden state at the current time step. Wf , Wi , Wo and WC are Markov Decision Process (MDP), consisting of a state space,
the weight matrices. action space, and reward function:
1. State Space S The PPO agent interacts with a custom-built traffic simula-
The state St consists of traffic flow, vehicle waiting tor that replicates traffic flows, vehicle behaviors, and signal
times, lane occupancy, and other information at the current timings. The initial state S0 includes the number of vehicles
intersection: in each lane, their waiting times, and the current signal phase.
2. Execute Signal Control Action
St = {qt−1 , wt , ot , nt } Based on the current state St , the policy network selects an
where qt−1 is the flow from the previous time step, wt is the action At , such as extending the green light or adjusting the
current average vehicle waiting time, ot is the lane occupancy, yellow light duration. This action is executed in the simula-
and nt is the number of vehicles entering the intersection. tion, and the signal timings are adjusted.
2. Action Space A 3. Calculate Reward After the action is executed, the sys-
The action At represents the selection of different signal tem observes the new traffic state St+1 , and the reward Rt
timing schemes. For simplicity, each action can be defined as is calculated based on the vehicle waiting time, throughput,
a fixed timing ratio for the traffic signal: and speed, as described in the reward function. The reward
At ={extend green light,extend red light,extend yellow provides feedback to the agent about the effectiveness of the
light} chosen action.
3. Reward Function R The reward function is designed to 4. Policy Update Using PPO The collected state, action,
minimize vehicle waiting time and improve traffic flow. Let and reward data are used to update the policy and value
Wt be the total vehicle waiting time; the reward function can networks. The PPO algorithm adjusts the policy parameters
be defined as: to maximize the expected reward while limiting the size of
policy updates to ensure stability.
Rt = −Wt 5. Iterate and Converge:
This process is repeated for multiple training iterations.
Additional penalty or reward terms can be introduced to As the agent continuously interacts with the simulation,
consider vehicle speed and throughput: it learns to optimize signal timings based on evolving traffic
Rt = −(Wt +·Ct − ·Vt ) conditions. The training process continues until the policy
converges to an optimal solution, where further improve-
where Ct is the vehicle throughput, Vt is the average vehicle ments in vehicle flow are minimal.
speed, and α and β are adjustment parameters.
1) HYPERPARAMETER TUNING
2) PPO ALGORITHM During the training process, several hyperparameters are
PPO is an improved policy gradient method that ensures tuned to optimize the PPO algorithm’s performance:
stability during training by limiting the update step size.
1. Discount Factor γ : Controls the importance of future
Compared to traditional Q-learning or Deep Q-Networks
rewards. A value of γ =0.99 was chosen to strike
(DQN), PPO is better suited for optimization problems in
a balance between immediate and long-term reward
high-dimensional action spaces, making it ideal for complex
maximization.
traffic signal control scenarios.
2. Clipping Parameter ϵ: Set to 0.2 to limit the policy
In PPO, the objective is to maximize the following loss
update step size and ensure stable convergence.
function:
h i 3. Learning Rate: The learning rate is dynamically
L CLIP (θ ) = Et min rt (θ )Ât , clip (ri (θ ), 1 − ε, 1 + ε) Ât adjusted using an adaptive optimization algorithm,
starting at 3 × 10−4 , and gradually decaying as training
where rt (θ) = ππθ θ (α(αt |St |St )t ) is a clipping parameter to limit the progresses.
old
step size and ensure stability during training. 4. Batch Size and Training Iterations: The model is
By optimizing this objective, PPO gradually improves the trained with a batch size of 64, and the policy is updated
signal control strategy, significantly reducing vehicle waiting every 2048 timesteps. Training continues until the aver-
times. age reward stabilizes.
By employing these strategies, the PPO algorithm is
C. MODEL TRAINING AND DEPLOYMENT capable of learning effective signal control strategies that
To deploy the PPO algorithm in traffic signal control, the dynamically adapt to real-world traffic scenarios, signifi-
model undergoes training within a custom-designed com- cantly reducing congestion and vehicle waiting times.
puter simulation environment, replicating realistic traffic In summary, the PPO-based traffic signal control strategy
conditions. The simulation environment is designed to mimic offers a robust and scalable solution for intelligent traffic
real-world traffic patterns by modeling intersections, vehicle management. By training within a simulated environment, the
movements, lane occupancy, and traffic signals. The training agent learns optimal signal timings that can later be deployed
process involves several steps: in real-world traffic systems to reduce congestion, enhance
1. Simulated Traffic Environment Initialization throughput, and improve overall traffic efficiency.
D. MODEL IMPLEMENTATION AND OPTIMIZATION such as road connectivity and traffic flow across dif-
STRATEGY ferent intersections.
The intelligent optimization of urban traffic management is 2) LSTM Implementation: LSTM layers model long- and
achieved by combining the hybrid STGCN-LSTM model short-term temporal dependencies in the traffic flow.
for traffic flow prediction and the PPO-based reinforcement The spatial features output by STGCN are then fed into
learning algorithm for signal control optimization. the LSTM layers, which use memory cells to capture
the temporal dynamics over time. This integration helps
1) DATA PREPROCESSING AND STGCN-LSTM MODEL the model predict future traffic flow more accurately by
Data preprocessing and model implementation are crucial accounting for both spatial and temporal changes.
to achieving efficient and accurate traffic flow prediction.
This study leverages spatio-temporal traffic data, collected 2) TRAFFIC SIGNAL CONTROL OPTIMIZATION STRATEGY
at 5-minute intervals from multiple sensors, to capture the In traffic signal control optimization, a reinforcement learn-
complex dependencies inherent in traffic patterns. Key vari- ing algorithm based on PPO is designed to dynamically adjust
ables such as vehicle speed, flow, and occupancy are utilized signal timings in response to changing traffic conditions.
to inform the prediction model. This strategy aims to reduce vehicle wait times and improve
overall traffic efficiency.
a: DATA PREPROCESSING
a) Missing Value Handling: Missing values are handled a: PPO ALGORITHM FRAMEWORK
using forward filling to ensure data continuity. This diagram illustrates how the PPO algorithm functions
b) Outlier Detection: Z-score methods are applied to in the context of traffic signal control. The agent observes
detect and remove outliers, ensuring data quality. the current traffic conditions (state space), selects an action
c) Normalization: MinMaxScaler is used to scale the traf- (such as adjusting the signal timing), and receives feedback
fic flow data to a range between 0 and 1, eliminating the (reward function) based on the impact of its action on traffic
effect of varying data magnitudes on model training. efficiency. This process iterates continuously to improve the
traffic signal control strategy.
b: STGCN-LSTM MODEL IMPLEMENTATION
This diagram illustrates how STGCN extracts spatial infor-
mation from the traffic network, which is then passed into the
LSTM layers to model temporal dependencies. By integrating
these two types of features, the model improves prediction
accuracy and handles complex spatio-temporal relationships
in traffic data.
where wi represents the waiting time for each Assumption: Each unit of prediction error corresponds to
vehicle. an increase of 0.1 seconds in average waiting time. This
Assumption: Each unit of prediction error corresponds to assumption helps translate prediction errors into practical
an increase of 0.1 seconds in vehicle waiting time. This impacts on traffic delays.
assumption translates the model’s prediction errors into prac-
tical waiting times, helping to assess how well the model can d: VEHICLE THROUGHPUT
reduce congestion. This metric measures the number of vehicles passing through
an intersection within a given unit of time. An increase in
e: FUEL CONSUMPTION throughput indicates improved traffic capacity due to opti-
This metric evaluates the total fuel consumed by vehicles mized signal control strategies.
during a specific period. Lower fuel consumption indicates
better traffic flow. The formula is: E. EXPERIMENTAL DESIGN AND BASELINE MODEL
Xn SELECTION
Fuel Consumption = FC i
i=1 In this study, to thoroughly evaluate the performance of
where FC i is the fuel consumption for each vehicle. the proposed STGCN-LSTM model, we first analyzed four
Assumption: Each unit of prediction error is assumed to commonly used traffic flow prediction models: ARIMA, Sup-
result in an additional 0.05 liters of fuel consumption per port Vector Machine (SVM), CNN-LSTM hybrid model, and
vehicle. This assumption helps in correlating the predic- Transformer model. These four models represent a range of
tion accuracy with overall fuel efficiency in urban traffic approaches, from traditional statistical methods to modern
management. deep learning techniques, each with its own strengths and
weaknesses.
f: CARBON EMISSIONS (CO2 EMISSIONS)
This metric calculates the amount of CO2 emitted by vehicles 1) BASELINE MODELS FOR TRAFFIC FLOW PREDICTION
due to traffic congestion and delays. The formula is: In the traffic flow prediction task, we benchmarked the pro-
Xn posed model against four commonly used methods: ARIMA,
CO2 Emissions = Ei
i=1 SVM, CNN-LSTM Hybrid Model, and Transformer. Each
where Ei represents the CO2 emissions produced by each model was trained and evaluated on the same dataset, using
vehicle. the same metrics, to ensure fair comparison.
Assumption: Each liter of fuel consumed produces
2.3 kilograms of CO2 emissions. This assumption links a: ARIMA MODEL
fuel consumption to carbon emissions, helping to quantify The ARIMA model is a classical time-series forecasting
the environmental impact of prediction errors and traffic method that excels at handling stationary time-series data.
inefficiencies. It performs well in short-term predictions but struggles
with nonlinear and dynamic traffic patterns, especially when
2) EVALUATION METRICS FOR TRAFFIC SIGNAL external factors such as weather and holidays are involved.
OPTIMIZATION Due to its wide application in time-series analysis, ARIMA
The performance of traffic signal optimization was evaluated was selected as a representative of traditional statistical
using the following traffic efficiency metrics: methods.
FIGURE 6. Reward progression and rolling average of rewards during PPO Fig. 8 compares MAE and RMSE for traffic flow pre-
training.
dictions in commercial (left) and residential (right) areas,
showing the influence of weather data on model performance.
In commercial areas, the integration of weather data led to
3) IMPACT ON TRAFFIC DELAY DISTRIBUTION AND FLOW a 14.3% reduction in MAE (from 4.0512 to 3.4684) and a
Fig. 7 examines the impact of the PPO algorithm on traffic 15.0% reduction in RMSE (from 9.2034 to 7.8187). Simi-
delay distribution (left) and traffic flow over time (right). larly, residential areas experienced a 16.2% reduction in MAE
The delay distribution indicates that over 80% of vehicles (from 4.2201 to 3.5373) and an 11.9% reduction in RMSE
experience delays of less than 15 seconds, demonstrating the (from 9.5821 to 8.4422).
algorithm’s effectiveness in minimizing waiting times across These results demonstrate that incorporating weather
intersections. Simultaneously, the consistent and stable traffic data significantly enhances prediction accuracy, making the
flow observed over time further validates the PPO algorithm’s STGCN-LSTM model more robust under adverse conditions
robustness in managing and optimizing traffic in dynamic such as extreme heat or cold. This improvement is evident
urban environments. across both commercial and residential environments.
2) IMPACT OF HOLIDAYS
Incorporating holidays as a binary variable further improved
prediction accuracy. In commercial areas, MAE decreased by
5.09%, while in residential areas, it dropped by 6.86%. These
enhancements illustrate the model’s capability to capture
holiday-specific traffic patterns, such as increased congestion
in commercial zones and shifts in residential travel behavior,
thereby improving adaptability to varying traffic conditions.
fuel consumption, and carbon emissions compared to non- larger and more complex networks. Meanwhile, ARIMA and
holiday conditions. Similarly, in residential areas, the model Transformer models show relatively stable execution times
achieves 90% accuracy during adverse weather conditions, across different urban scales.
effectively adjusting to varying environmental factors. These
findings highlight the model’s robustness and adaptability in 2) PERFORMANCE UNDER DIFFERENT TRAFFIC CONDITIONS
diverse urban scenarios. Fig. 11 illustrates prediction errors across different city sizes,
showing that the STGCN-LSTM model exhibits higher errors
4) QUANTITATIVE ANALYSIS OF EXTERNAL FACTORS’ during peak traffic compared to the Transformer model.
IMPACT However, it performs reliably under general traffic condi-
Quantitative analysis reveals that incorporating external fac- tions, highlighting the need for further optimization in highly
tors improves the model’s prediction accuracy by an average dynamic environments.
of 15%, reduces vehicle waiting times by 10%, and decreases
fuel consumption by 7% during peak periods. These improve-
ments highlight the importance of adapting to dynamic traffic
conditions, reinforcing the model’s applicability in real-world
urban environments.
In summary, the integration of external factors significantly
enhances the model’s predictive accuracy and adaptability
across commercial and residential areas. These improve-
ments enable more effective traffic management strategies,
reducing waiting times, fuel consumption, and emissions,
and demonstrating the model’s potential for deployment in
complex traffic scenarios.
FIGURE 12. Traffic prediction error and travel time analysis by model.
to 25 seconds, compared to 35 seconds for Q-learning improves the model’s adaptability. This enhancement reduces
and 45 seconds for Fixed-Time Control. Finally, the PPO the MAE by 17%, demonstrating its ability to handle dynamic
algorithm achieves the highest average speed (35 km/h), sig- traffic patterns like peak-hour congestion and holiday surges.
nificantly surpassing Q-learning (30 km/h) and Fixed-Time
Control (25 km/h). 2) TRAFFIC SIGNAL OPTIMIZATION
The PPO algorithm effectively optimizes traffic signal con-
trol, reducing vehicle waiting times by 30% and increasing
traffic throughput by 15%. The incorporation of external fac-
tors enhances robustness, reducing average waiting times by
an additional 9% during complex conditions. The algorithm’s
ability to coordinate signal timings across multiple intersec-
tions highlights its scalability and suitability for large urban
networks.
4) PRACTICAL APPLICATIONS
The proposed framework addresses critical challenges in
traffic management. Simulations demonstrate a 33% reduc-
tion in vehicle waiting times, a 17% increase in through-
put, and a 12% reduction in carbon emissions. These
improvements underline its scalability, sustainability, and
potential for deployment in smart city infrastructures.
By enhancing urban traffic flow, the framework contributes
to achieving sustainable development goals in modern urban
environments.
VI. DISCUSSION
FIGURE 13. Comparison of travel time, wait time, and speed by control
method.
The proposed STGCN-LSTM framework, integrated with
a PPO-based signal optimization strategy, offers a compre-
These results clearly demonstrate that the PPO algorithm hensive solution to the challenges of urban traffic man-
outperforms traditional methods in terms of efficiency, adapt- agement. This section synthesizes the results, discusses the
ability, and overall traffic optimization. advantages of the proposed methods compared to exist-
ing approaches, identifies limitations, and outlines future
E. EXPERIMENTAL CONCLUSIONS research directions.
The experimental results confirm the effectiveness of the
STGCN-LSTM model and PPO algorithm in addressing traf- A. RESULTS SYNTHESIS AND IMPLICATIONS
fic flow prediction and signal optimization challenges in 1) TRAFFIC FLOW PREDICTION
smart city scenarios. Key findings are summarized as follows: The STGCN-LSTM model demonstrated superior perfor-
mance in traffic prediction, achieving an R2 of 0.904 on
1) TRAFFIC FLOW PREDICTION the METR-LA dataset and reducing the MAE by 20% com-
The STGCN-LSTM model successfully captures spatiotem- pared to Transformer-based methods. These results underline
poral dependencies in urban traffic data, achieving an R2 of its ability to capture spatiotemporal dependencies in urban
0.904 on the METR-LA dataset. The ablation study shows traffic data effectively. However, its performance on noisier
that combining STGCN and LSTM significantly enhances datasets, such as the Traffic dataset (R2 = 0.454), high-
performance compared to reduced variants. The inclusion lights the need for advanced preprocessing and noise-resilient
of external factors, such as weather and holidays, further techniques.
ensuring its successful deployment in real-world smart city methods and prevent unauthorized commercial use. However,
environments. the code may be made publicly available once the patent
application process is completed. In the meantime, qualified
VII. CONCLUSION researchers may contact the corresponding author to request
This study introduced a novel framework combining the access to the code. For further inquiries, please contact the
STGCN-LSTM model for traffic flow prediction with the corresponding author.
PPO algorithm for dynamic traffic signal optimization. The
STGCN-LSTM model effectively captures spatiotemporal REFERENCES
dependencies in urban traffic data, while the PPO algorithm [1] M. U. Tariq, ‘‘Smart transportation systems: Paving the way for sustainable
dynamically adjusts signal timings to optimize traffic flow. urban mobility,’’ in Contemporary Solutions for Sustainable Transporta-
tion Practices, S. Munuhwa, Ed., Hershey, PA, USA: IGI Global, 2024,
Together, these components address critical challenges in pp. 254–283, doi: 10.4018/979-8-3693-3755-4.ch010.
urban traffic management, including congestion and ineffi- [2] Ž. Majstorović, L. Tišljarić, E. Ivanjko, and T. Carić, ‘‘Urban traffic signal
ciency, and offer a robust solution for modern smart city control under mixed traffic flows: Literature review,’’ Appl. Sci., vol. 13,
no. 7, p. 4484, Apr. 2023, doi: 10.3390/app13074484.
applications. [3] C. Creß, Z. Bing, and A. C. Knoll, ‘‘Intelligent transportation sys-
Experimental results validated the framework’s effective- tems using roadside infrastructure: A literature survey,’’ IEEE Trans.
ness, achieving a 30% reduction in vehicle waiting times Intell. Transport. Syst., vol. 25, no. 7, pp. 6309–6327, Jul. 2024, doi:
10.1109/TITS.2023.3343434.
and a 15% improvement in traffic throughput, significantly [4] Y. Cui and D. Lei, ‘‘Design of highway intelligent transportation system
outperforming traditional methods such as fixed-time con- based on the Internet of Things and artificial intelligence,’’ IEEE Access,
trol and Q-learning. Furthermore, the inclusion of external vol. 11, pp. 46653–46664, 2023, doi: 10.1109/ACCESS.2023.3275559.
[5] Y. Xu, X. Cai, E. Wang, W. Liu, Y. Yang, and F. Yang, ‘‘Dynamic traf-
factors, such as weather and holidays, enhanced the sys- fic correlations based spatio-temporal graph convolutional network for
tem’s robustness, reducing waiting times by an additional urban traffic prediction,’’ Inf. Sci., vol. 621, pp. 580–595, Apr. 2023, doi:
9% under adverse conditions. The framework also demon- 10.1016/j.ins.2022.11.086.
strated environmental benefits, achieving a 12% reduction in [6] Y. Shi, Y. Liu, Y. Qi, and Q. Han, ‘‘A control method with reinforcement
learning for urban un-signalized intersection in hybrid traffic environ-
carbon emissions, supporting sustainable urban development ment,’’ Sensors, vol. 22, no. 3, p. 779, Jan. 2022, doi: 10.3390/s22030779.
objectives. [7] R. A. Khalil, Z. Safelnasr, N. Yemane, M. Kedir, A. Shafiqurrahman, and
To enhance the framework’s scalability and adaptabil- N. Saeed, ‘‘Advanced learning technologies for intelligent transportation
systems: Prospects and challenges,’’ IEEE Open J. Veh. Technol., vol. 5,
ity, future research will focus on addressing challenges in pp. 397–427, 2024, doi: 10.1109/OJVT.2024.3369691.
large urban networks and resource-constrained environments. [8] A. Khan, M. M. Fouda, D.-T. Do, A. Almaleh, and A. U. Rahman,
Techniques such as model pruning and quantization will be ‘‘Short-term traffic prediction using deep learning long short-term mem-
ory: Taxonomy, applications, challenges, and future trends,’’ IEEE Access,
explored to improve computational efficiency and enable vol. 11, pp. 94371–94391, 2023, doi: 10.1109/ACCESS.2023.3309601.
deployment on edge devices. Additionally, the integration [9] T. Zhang, J. Xu, S. Cong, C. Qu, and W. Zhao, ‘‘A hybrid method
of real-time data sources, including incident reports, public of traffic congestion prediction and control,’’ IEEE Access, vol. 11,
pp. 36471–36491, 2023, doi: 10.1109/ACCESS.2023.3266291.
transport information, and social events, will further enhance [10] W. Guo, W. Li, Z. Zhang, L. Zhang, L. Li, and D. Li, ‘‘Scalable
the system’s robustness. These advancements aim to extend multi-objective optimization for robust traffic signal control in uncertain
the framework’s applicability, ensuring its effectiveness in environments,’’ 2024, arXiv:2409.13388.
[11] Q. Ji, X. Wen, J. Jin, Y. Zhu, and Y. Lv, ‘‘Urban traffic control meets
diverse and dynamic urban scenarios. decision recommendation system: A survey and perspective,’’ IEEE/CAA
J. Autom. Sinica, vol. 11, no. 10, pp. 2043–2058, Oct. 2024, doi:
ETHICS STATEMENT 10.1109/JAS.2024.124659.
[12] X. Xu, X. Jin, D. Xiao, C. Ma, and S. C. Wong, ‘‘A hybrid autore-
This research did not involve any studies with human partic- gressive fractionally integrated moving average and nonlinear autore-
ipants or animals. gressive neural network model for short-term traffic flow prediction,’’
J. Intell. Transport. Syst., vol. 27, no. 1, pp. 1–18, Jan. 2023, doi:
10.1080/15472450.2021.1977639.
CONFLICT OF INTEREST STATEMENT
[13] O. Idemudia, J. O. Ehiorobo, C. O. Izinyon, and I. Ilaboya, ‘‘Evaluating
The authors declare no conflict of interest. the performance of random forest, decision tree, support vector regression
and gradient boosting for streamflow prediction,’’ CTU J. Innov. Sustain.
Develop., vol. 16, no. 2, pp. 116–130, Jul. 2024, doi: 10.22144/CTU-
DATA AVAILABILITY STATEMENT
JOISD.2024.297.
The dataset used in this study is the METR-LA dataset, [14] L. Waikhom and R. Patgiri, ‘‘A survey of graph neural networks in various
which is publicly available. The METR-LA dataset contains learning paradigms: Methods, applications, and challenges,’’ Artif. Intell.
Rev., vol. 56, no. 5, pp. 6295–6364, Jul. 2023, doi: 10.1007/s10462-022-
traffic data from the Los Angeles metropolitan area, span- 10321-2.
ning March 2012 to June 2012. It includes spatiotemporal [15] P. Patil, ‘‘A comparative study of different time series forecasting methods
traffic patterns, such as speed, flow, and occupancy data. for predicting traffic flow and congestion levels in urban networks,’’ Int.
J. Inf. Cybersecur., vol. 6, no. 1, pp. 1–20, 2022. [Online]. Available:
The dataset can be downloaded from the following link:
https://publications.dlpress.org/index.php/ijic/article/view/6
https://github.com/liyaguang/DCRNN. [16] S. Bilotta, E. Collini, P. Nesi, and G. Pantaleo, ‘‘Short-term prediction of
Due to the ongoing patent application related to the algo- city traffic flow via convolutional deep learning,’’ IEEE Access, vol. 10,
rithms and methods used in this study, the relevant code is pp. 113086–113099, 2022, doi: 10.1109/ACCESS.2022.3217240.
[17] Z. Chen, M. Ma, T. Li, H. Wang, and C. Li, ‘‘Long sequence time-series
currently not publicly available. The purpose of the patent forecasting with deep learning: A survey,’’ Inf. Fusion, vol. 97, Sep. 2023,
application is to protect the uniqueness of the algorithms and Art. no. 101819, doi: 10.1016/j.inffus.2023.101819.
[18] N. Hu, D. Zhang, K. Xie, W. Liang, and M.-Y. Hsieh, ‘‘Graph learning- RONGLIANG LIN (Member, IEEE) received the
based spatial–temporal graph convolutional neural networks for traffic bachelor’s degree in naval combat command and
forecasting,’’ Connection Sci., vol. 34, no. 1, pp. 429–448, Dec. 2022, doi: fire control systems from the Naval University of
10.1080/09540091.2021.2006607. Engineering, in 1994.
[19] R. Kumar, J. Mendes-Moreira, and J. Chandra, ‘‘Spatio-temporal parallel He is currently a Senior Engineer and a Lecturer
transformer based model for traffic prediction,’’ ACM Trans. Knowl. Dis- with the Department of Electronic Information
covery Data, vol. 18, no. 9, pp. 1–25, Nov. 2024, doi: 10.1145/3679017. Engineering, Zhanjiang University of Science and
[20] J. Guo, Q. Xiong, J. Chen, E. Miao, C. Wu, Q. Zhu, Z. Yang, and
Technology, specializing in electronic information
J. Chen, ‘‘Study of static thermal deformation modeling based on a
systems. With more than 29 years of service in the
hybrid CNN-LSTM model with spatiotemporal correlation,’’ Int. J. Adv.
Manuf. Technol., vol. 119, nos. 3–4, pp. 2601–2613, Mar. 2022, doi:
Chinese Navy, he made significant contributions
10.1007/s00170-021-08462-9. to enhancing the performance of various command systems and optimiz-
[21] S. Mohamad Alizadeh Shabestary and B. Abdulhai, ‘‘Adaptive traffic ing operational procedures. His extensive technical experience informs his
signal control with deep reinforcement learning and high dimensional research focus on traffic flow control and traffic signal optimization. During
sensory inputs: Case study and comprehensive sensitivity analyses,’’ IEEE his military career, he held several key roles: appointed as a member of the
Trans. Intell. Transport. Syst., vol. 23, no. 11, pp. 20021–20035, Nov. 2022, Navy’s Equipment Repair Price Expert Advisory Group, in 2013, where
doi: 10.1109/TITS.2022.3179893. he analyzed cost efficiencies of various technological systems; a Special
[22] Y. Bie, Y. Ji, and D. Ma, ‘‘Multi-agent deep reinforcement learning Equipment Technical Support Expert by the Navy’s Equipment Department,
collaborative traffic signal control method considering intersection het- focusing on the integration of advanced technologies for system improve-
erogeneity,’’ Transport. Res. C, Emerg. Technol., vol. 164, Jul. 2024, ments, in 2014; and an Equipment Repair Capability Evaluation Expert
Art. no. 104663, doi: 10.1016/j.trc.2024.104663. for naval missile systems, providing insights into system reliability and
[23] C. Chen, K. K. Yeo, Z. Li, X. Yu, Y. Feng, and C. Kang, ‘‘Gated resid- performance, in 2016. He has authored 22 research articles as the first
ual networks for multivariate time series,’’ IEEE Trans. Neural Netw. author. After retiring from the military in 2019, he continued his role as a
Learn. Syst., vol. 32, no. 5, pp. 2050–2062, May 2021. [Online]. Available: Senior Engineer and a Lecturer with Zhanjiang University of Science and
https://github.com/liyaguang/DCRNN
Technology, focusing on equipment maintenance, command systems, and
artificial intelligence applications in intelligent manufacturing and traffic
management.
Mr. Lin is a Senior Member of China Society of Naval Architects and
Marine Engineers (CSNAME). He is dedicated to the application of machine
TUXIANG LIN received the bachelor’s degree learning, with a particular focus on urban traffic systems.
in electrical engineering and information
systems from FOM Hochschule (FOM University
of Applied Sciences for Economics and Manage-
ment), in collaboration with Shenyang University,
China, in 2023. He is currently pursuing the
master’s degree in information and communi-
cation engineering with Technische Universität
Darmstadt, Germany.
He has a solid academic foundation in power
electronics and intelligent traffic systems. He holds certifications, including
the National Computer Rank Examination Level 2 Certificate and the
National Information Security Certificate (Level 1). Additionally, he has
filed two patent applications with China National Intellectual Property
Administration—one as the first inventor and the other as the second
inventor, both currently under review. He is fluent in Chinese and English and
conversant in German, demonstrating strong multilingual communication
skills. Looking ahead, he aims to contribute to the telecommunications
industry. He seeks to engage in technical roles focused on cybersecurity and
communication technologies. Alternatively, he intends to pursue a Ph.D. for
further academic research in related fields.