1. Introduction
The road system is an important infrastructural element of China’s economic and social systems. As of the end of 2020, the national road mileage had reached 5.19 million kilometers. Therefore, the road system bears a huge amount of traffic pressure throughout the country and plays an important role in the rapid development of China’s economy and residents’ lives [
1]. According to the traffic analysis report of major cities in China in 2021 released by Gaode Map [
2], with the rapid development of the economy and improvements in the social consumption levels of residents, the growth rate of car ownership remains high. According to the report [
3], managing and alleviating traffic congestion in the road system, and strengthening the control capabilities of traffic management departments over roads in major cities, has become one of the key tasks in digital transportation construction.
An efficient traffic management system can promote the development of transportation, logistics, and various economic sectors, and has certain significance for the coordinated and balanced development of various regions and functions in a city. With the vigorous development of the intelligent transportation industry, the popularity of intelligent transportation systems is also increasing, and traffic control has also been a hot issue in recent years. Accurate and fast traffic prediction is the data-based foundation for achieving traffic control, and the accuracy of traffic congestion prediction results will directly affect the effectiveness of traffic control [
4]. For travelers, traffic congestion prediction data can be used to formulate reasonable and convenient travel route planning and travel time planning. For traffic managers in major cities, traffic congestion prediction [
5] can help the traffic management department grasp the traffic operation status in real time, control the traffic system reasonably and efficiently, improve the operational efficiency of the traffic system, and reduce and ease traffic congestion.
The main characteristics of the traffic speed prediction problem include: (1) The input variables and output variables have a nonlinear mapping relationship; (2) at the same time, the speed time series has a high degree of periodicity; and (3) there is a certain correlation between the speeds of different road sections. At present, in the field of traffic prediction, the main models used for prediction are divided into three categories. The first category comprises the models based on statistical theory, which assume that the future data to be predicted has the same characteristics as that of the past data [
6]. This type of model mainly includes the ARIMA prediction model [
7] for time-series analysis, Kalman filter analysis [
8], etc. The second category of models contains deep learning models, which are becoming more and more widely used in machine learning. In addition to being applied in research on image recognition, speech recognition, and other aspects, the long short-term memory model (LSTM) is widely used in the field of traffic prediction because of its superior accuracy in time-series prediction [
9].The third category of models is made up of integration models [
10], which improve the prediction accuracy of a single model by combining and integrating multiple different basic learners of different or the same types of model [
11].
The theme of this article is traffic speed prediction. This article references several papers from the Symmetry journal. Ge H et al. studied traffic flow prediction on highways [
12]. Alajali, W et al. studied traffic prediction using decision trees [
13]. Xing, Ban et al. proposed a symmetric extreme learning machine cluster fast learning method for traffic prediction [
14]. These papers have provided enlightenment for the research of this article.
Currently, research on traffic speed prediction mainly adopts statistical learning methods, deep learning methods, and ensemble model methods. These methods are limited by their insufficient prediction accuracy and cannot comprehensively consider the nonlinear, periodic, and spatial characteristics of speed prediction. The optimization problem corresponding to the training process of existing neural network models is a non-convex optimization problem, and the parameters often converge to a local optimum point. This leads to insufficient prediction accuracy. Currently, linear statistical learning models can only describe linear mapping relationships, which is insufficient for prediction problems. However, the polynomial model proposed in this paper can solve both problems simultaneously.
Because polynomial functions can better approximate and fit most nonlinear functions, the parameter solving problem of polynomial functions can usually obtain the global optimal solution through optimization algorithms [
15,
16]. Approximating the nonlinear relationship of speed prediction using polynomial functions can improve the accuracy of speed prediction to a certain extent. Therefore, this article will propose a new traffic speed prediction model based on polynomial functions.
When studying the prediction of traffic speed on roads, we have gained inspiration from research related to traffic flow prediction, which suggests that considering the characteristics of space or different regions may be very beneficial [
17,
18]. However, due to the presence of a large number of spatial features and corresponding features from different regions, considering the integration of all of these features would make our model exceptionally large. Therefore, we need to consider screening and selecting features to ensure that our model can accurately predict traffic speed to the maximum extent possible under controllable conditions. Based on this reality, we will propose a new traffic speed feature selection method [
19], which is based on the maximum mutual information coefficient.
At the same time, considering the periodic variation of road traffic speed time series can improve model prediction accuracy [
20]. We will also introduce periodic trend features to model the periodic characteristics of the time series, in order to further improve the accuracy and robustness of the prediction model.
2. Literature Review
The prediction of road traffic speed is a typical traffic prediction problem. There are currently few documents in the field of road traffic speed prediction, so this article draws on the literature surrounding several other issues in traffic prediction. At present, many scholars have conducted research in the field of traffic prediction [
21,
22].
The travel time problem aims to estimate the travel time for a given OD input, which consists of a starting point, an ending point, and a departure time. In the field of travel time prediction, the main research methods include K-nearest neighbor regression [
23], multi-layer perceptron, ResNet [
24], long-short term memory network, and convolutional neural network [
25]. The travel demand problem aims to predict the future traffic demand in each area of the city. In the field of travel demand forecasting, the main methods include multi-layer perceptron [
26], convolutional neural network and recurrent neural network [
27,
28], graph convolutional network, and graph attention mechanism [
29,
30]. The regional traffic flow problem is defined as predicting the future traffic flow between regions. In the field of regional traffic forecasting, the main methods include the historical average, ARIMA, integrated methods [
31,
32], convolutional neural network [
33], and long and short-term memory network [
34]. In the field of network flow prediction, the main methods currently used include autoencoder [
35], recurrent neural network [
36], and meta learning [
37].
In the field of speed prediction, the current mainstream methods include the historical average and ARMA [
31]. In addition to the historical average method, some scholars currently use deep learning methods to study the problem of road traffic speed prediction, including convolutional neural network, recurrent neural network, and feedforward neural network [
38,
39], long short-term memory network, and graph convolutional neural network [
40]. Some scholars [
39] divide the traffic map into several rectangular grids, then calculate the average speed of each grid, and treat the data at a time point as an image or two-dimensional matrix. Then, they use convolutional neural network-related methods to extract features and use them for final speed prediction. Some scholars have used a bidirectional LSTM model with both forward and backward dependencies to consider the temporal characteristics of speed time series for road traffic speed prediction. Some scholars [
39] combine CNN and RNN, first using CNN to capture the relevant features of road structure in speed sequences, and then using RNN to consider multiple factors for speed time-series prediction. Some scholars [
40] have also introduced more data and factors, including geographic information attributes, road network structure, and social information, to improve the accuracy of speed prediction.
3. Method
This section will first discuss a prediction model based on polynomial functions, which can fit the nonlinear mapping of speed series. Second, this section will discuss the parameter training or parameter solving method of the model. The optimization problem of this model is a convex optimization problem. Therefore, the training of this model is relatively effective. Third, this section will discuss a spatial feature selection method based on the maximum mutual information coefficient. Fourth, this section will discuss a periodic feature extraction method based on the PCA method. Fifth, this section will discuss evaluation metrics for testing. Finally, this section will discuss how to combine polynomial functions, spatiotemporal features, and periodic features into a fusion model.
3.1. Prediction Model
Polynomial functions can provide a good approximation for most nonlinear functions, and can fit most nonlinear relationships. The training process of polynomial regression is a convex optimization problem that can obtain a global optimal solution [
41,
42]. This article will combine polynomial functions with Lasso regression to construct a polynomial regression model. A road speed prediction model is porposed based on polynomial regression. In addition, this paper will prove the improvement in prediction accuracy brought about by the spatial selection method and periodic feature modeling method based on a polynomial regression model. The input features of polynomial regression are as follows:
where
i is the road ID,
t is the time ID,
num is the total number of considered road sections,
T is the maximum time lag considered,
Tk is the maximum time lag considered for other regions,
is the basic variable part for predicting the
i-th road section and the
t-th time moment,
is the other road section feature part of the input,
is the periodic feature part, and
is the input of the merged model.
The features of the road section are the speed of the past T time intervals of the road section, the speed of the past Tk time intervals of the other road sections (how to select these features will be explained later), and the periodic feature is the speed of the periodic trend at the same time (how to extract the periodic trend will be introduced later). The features of the road segment itself, the features of other road segments, and periodic features are combined to form the final feature set.
The input components are recorded as follows:
where
Si,t,m is the
m-th component of the input variable,
i is the segment ID,
t is the time subscript, and
m is the input component subscript. The polynomial regression model in this paper is as follows:
where
is the bias term,
is the
m-th coefficient of the first-order component,
is the coefficient of the second-order component,
is the coefficient of the third order component,
is the input of the model,
is the output of the model (the traffic speed of the
i-th section at the
t-th moment), and
M is the number of variables input into the model. Due to differences in the input of prediction models for different road sections, different models are established for different road sections.
3.2. Model Solution
Solving model parameters is equivalent to optimizing the following optimization problems:
where
fi is the aforementioned polynomial regression model, and
is the model parameter. The first part of the optimization objective is error minimization, and the second part is a norm regularization term. Because 1-norm regularization can adjust most parameters to 0, using 1-norm regularization can automatically filter high dimensional features. The gradient descent method is used to solve the model, and the overall steps of the Algorithm 1 are as follows [
43].
Algorithm 1: Solving parameters of polynomial regression models. |
Input: Training set for traffic speed dataset, regularization parameter , parameters of gradient descent method threshold . Output: Model parameter . |
Step 1. Calculate J(S,ω) based on historical data |
Step 2. Calculate the partial derivative of each component of the parameter:
where
|
Step 3. According to the partial derivative and gradient, record the gradient as g, update the model parameters, and the calculation formula is as follows: |
Step 4. If then algorithm returns to step 1. If then algorithm stop iteration. |
3.3. Spatiotemporal Feature Extraction
The maximum mutual information coefficient method [
44] is a method to analyze the correlation between two variables. It helps to find the nonlinear correlation between two variables, so it can be widely used in variable selection in predictive models.
The basic principle of the maximum mutual information coefficient method is based on information theory. In information theory, information can be measured by entropy. The greater the entropy, the greater the amount of information. The maximum mutual information coefficient method measures the correlation between two variables by calculating the mutual information between them. Mutual information refers to the sharing degree of information between two variables. The greater the mutual information between two variables, the stronger the correlation between them.
where
Si,t is the traffic speed of the
i-th section at the
t-th time point. The maximum mutual information coefficients of section
i and section
j are defined as follows:
where
B is the hyper parameter that limits the number of grids. The
MIC values of the target road segment and candidate feature road segments are calculated, and road segments with
MIC values greater than a threshold are used as input variables for the prediction model. If there are too many road sections greater than the threshold, the first
K road sections with the highest nonlinear correlation coefficient are introduced as input features. If the number of road sections greater than the threshold is less than
K, other road sections are not considered when establishing the model.
3.4. Periodic Feature Extraction
Due to the strong periodicity of road speeds, there is a strong similarity in the speed curves of different days. This article is based on principal component analysis [
45] to extract the periodic patterns of velocity curves. Firstly, a data matrix is constructed, where
represents the road traffic speed for the
i-th section,
t-th time period, and
d-th day. Thus:
The sample data in data set
S is standardized according to the following formula:
where
μd is the mean value of day
d, and
σd is the standard deviation of day
d. The covariance matrix is calculated according to the data matrix
S:
All the eigenvalues of the covariance matrix
D are found, and these eigenvalues are arranged from largest to the smallest. Then, the eigenvectors corresponding to the first
K features are selected and arranged in rows to form the transformation matrix
W. The dimension of the data is reduced using the transformation matrix
W. In the process of using the PCA algorithm, the fractional matrix
SC and coefficient matrix
C are obtained. In order to obtain the final trend, the first
K columns of the
SC matrix and the first
K rows of the coefficient matrix
C are taken, and then matrix multiplication is performed on these two matrices to obtain the results of the PCA periodic rule. The calculation formula is as follows:
It is worth noting that when extracting cycle features, the time series is divided into two parts: weekday and weekend, extracting weekday cycle trends and weekend trends, respectively. When predicting the traffic speed on weekdays, this article uses the periodic trend of weekdays as a feature. When predicting the traffic speed on weekends, this article uses the periodic trend of holidays as a feature. Due to the significant difference in speed between non-peak and peak periods, the periodic trend feature proposed in this paper can effectively address this issue.
3.5. Evaluating Indicator
The evaluation index adopted in this paper is the percentage of mean absolute (MAPE). The MAPE value of the
i-th road is calculated as follows:
where
Sri,t is the observed value of road traffic speed,
Si,t is the predicted value of road traffic speed, and
T is the number of time points. In addition, the improvement amount of model 2 relative to model 1 is defined as follows:
where
MAPE1 and
MAPE2 are the percentage of the mean absolute of model 1 and model 2, respectively.
IMP1,2 represents the improvement amount of the model.
3.6. Model Integration
This article combines the polynomial model, spatial feature selection method, and periodic trend extraction to form a spatiotemporal periodic polynomial model. Firstly, based on the spatial feature filtering method, the road sections which are useful for traffic prediction are selected, and the speed values of these road sections are taken as inputs for the model. Secondly, the periodic trend extraction method is used to extract the periodic trend, and the velocity values at the same time in different periods are taken as inputs for the model. Then, the time, space, and periodic features are used as inputs for the polynomial model. Finally, the model in this article is trained using historical data and evaluated on the test set.
4. Numerical Testing
This section will conduct numerical experiments in traffic speed prediction based on real data sets. First, this section will introduce the description of the data set. Second, this section will show the spatial correlation between different roads. Third, this section will show the network structure of spatial correlation analysis. Fourth, this section will conduct a periodic analysis of speed time series. Fifth, this section will analyze the prediction accuracy of the proposed method and show the improvement in prediction accuracy brought about by polynomial functions, spatial features, and periodic features. Sixth, this section will compare the prediction accuracy of the proposed method with several deep learning models, statistical learning models, and ensemble models. Finally, this section will analyze time complexity and spatial complexity.
4.1. Data Set
The data set consisted of 214 anonymous road sections (mainly main roads and expressways) in Guangzhou [
46], with measured vehicle speed data spanning two months (1 August 2016 to 30 September 2016), with a time window of 10 min. The vehicle speed data set contained a total of 1,855,589 vehicle speed records. The fields it contained were as follows: (1) Road_id: The number of the road section, for example, “1” represents the first road section; (2) Day_id: The number of the day, where the number “1” represents 1 August 2016, the number “2” represents 2 August 2016, and so on, and the number “61” represents 30 September 2016; (3) Time_id: The number of the time window, where the number “1” represents 00:00:00–00:10:00, and the number “2” represents 00:10:00–00:20:00; and (4) Speed: Vehicle speed value (in km/h). The speed data in this article were collected through induction coils installed on the road. The speed of each road segment every 10 min was the average speed of passing vehicles measured by the induction coil. The top 10 data of the road speed dataset are shown in
Table 1.
The dataset used in this article did not contain specific location information for road segments, only the IDs of road segments. However, the method in this article considered the correlation between different road segments through the MIC coefficient, which to some extent made up for this problem.
Figure 1 and
Figure 2 show the speed variation curves of multiple road sections contained in the dataset of this article, as well as the speeds at different road sections at the same time. As can be seen from
Figure 1, there was a strong periodic characteristic in the speed time series. As can be seen from
Figure 2, there were significant differences in the traffic speed at different road sections at the same time.
Figure 3 shows the variation pattern of traffic speed on sections 1–9 at different times on different days. As can be seen from the figure, there was a strong periodicity in the time of each road segment. The model in this article significantly improved the accuracy of model prediction by considering the periodic characteristics of time. At the same time, as can be seen from the figure, there was also a strong correlation between the time series of different road segments. The model in this article significantly improved the accuracy of model prediction by considering spatial characteristics.
4.2. Spatial Feature Analysis
Based on the calculation method of maximum mutual information coefficient mentioned above, it was used to calculate the nonlinear correlation between different regions. For data analysis, relevant data from the top 10 roads in the sample were selected for processing. By calculating the correlation between these data, the display results shown in
Figure 4 were obtained. As can be seen from
Figure 4, there was a strong correlation between some road segments. Therefore, considering spatial feature selection improved the prediction accuracy of the model.
4.3. Dependency Network Structure
Figure 5 shows the dependency relationship between different roads. For each dependent variable, nodes with a MIC value greater than 0.8 were selected, and the connection was treated as having a dependency relationship, while the other nodes were treated as having no relationship. The network is visualized in
Figure 5.
From the graph, it can be seen that there was a certain degree of nonlinear correlation between the traffic speeds in different regions. Therefore, when predicting the traffic speed of a region, introducing speed data from other regions can effectively improve the prediction accuracy.
4.4. Periodicity Analysis
In this study, a statistical graph of the first road section was plotted to demonstrate the variation pattern of road vehicle speed from 8 August 2016 to 12 August 2016. Specifically, the data was processed at 10 min intervals, using different date IDs to distinguish different traffic data, and displaying the data in different colors. The horizontal axis represented the time ID, while the vertical axis represented the vehicle speed. In
Figure 6, the curves of different date IDs show similar characteristics, and overall, it can be seen that there was a significant periodic trend in road traffic speed on weekdays.
Using the cycle extraction method from the previous section, it was first necessary to collect data on the target road to obtain data on travel time and distance. On this basis, the periodic pattern of the target road was extracted by processing the data. For road 1, after data processing and period mining, a daily periodic speed curve and a speed curve from 8 August to 12 August were obtained, as shown in
Figure 7. The green curve displays a periodic speed curve, while the red point represents the speed curve from 8 August to 12 August. From
Figure 7, it can be seen that the cycle law extraction method proposed in this article can efficiently and accurately extract the cycle law of road traffic speed, providing an important data foundation for subsequent predictions of road traffic speed.
4.5. Prediction Accuracy
In this experiment, the data from the previous 48 days (1 August 2016 to 17 September 2016) was taken as the training set, and the data from the following 13 days (18 September 2016 to 30 September 2016) was taken as the test set. According to our experience, the order of the polynomial model can lead to overfitting if it is too large, and it can be equivalent to a linear model if it is too small. In this article, the order of the polynomial model was set to three. When calculating cycle trends, they were divided into weekdays and weekends. The time lag length of the road itself was set to five. The nonlinear correlation coefficient threshold was set to 0.8. The maximum number of nodes considered was three. If the number of independent variables was greater than three, the three roads with the highest coefficients were considered. If the number of independent variables greater than the threshold was less than three, only the univariate model was considered.
Figure 8 shows the comparison between the predicted and observed values of the nonlinear spatial period prediction model on the test set. The data points in the figure represent observed values and predicted values. The closer their values are to the straight line
y =
x, the better the prediction accuracy. It can be seen from the figure that the method proposed in this article can effectively achieve traffic speed prediction.
In this section, this article will compare the prediction accuracy improvement brought about by spatiotemporal feature extraction methods and periodic feature extraction methods compared to univariate models. The polynomial model in
Table 2 only considers the time features of a single variable. The spatiotemporal polynomial model considers spatiotemporal features, while the periodic polynomial model considers periodic features. The spatiotemporal periodic polynomial model considers both spatiotemporal and periodic features.
Table 2 shows the comparison of prediction accuracy between univariate models and models considering spatial features and periodic features. As can be seen from
Table 2, introducing periodicity features can effectively improve the prediction accuracy on most road segments. At the same time, introducing spatial features can also enhance the model features. A model that considers both periodicity and spatial features can achieve better results.
The model evaluation criteria and model improvement calculation formula in
Section 3.5 was used to calculate the improvement in prediction accuracy of the periodic model and spatial model compared to the univariate model. The results are shown in
Figure 9 and
Figure 10. For most roads, spatial models have been shown to improve prediction accuracy. Similarly, for most roads, periodic models also improve the prediction accuracy of the model.
From
Table 2 and
Figure 9 and
Figure 10, it can be seen that the predictive accuracy of the univariate model in this article can meet practical needs. For all roads, after introducing spatial features, the prediction accuracy was significantly improved, with an average optimization of 1.41%. For all roads, after introducing periodic features, the prediction accuracy was again significantly improved, with an average optimization of 1.5%. For all roads, the model incorporating periodic trend extraction and the maximum mutual information coefficient exhibited an additional 2.24% improvement in accuracy compared to the univariate polynomial regression model.
4.6. Model Comparison
To further verify the accuracy of our method, we compared our model with deep learning models, machine learning models, and ensemble learning models. The assessed deep learning models include feedforward neural networks (FNN) [
47] and long short-term memory networks (LSTM) [
48].
The neurons of the FNN are arranged in layers, and each neuron is only connected to the neurons in the previous layer. It receives the output of the previous layer and outputs it to the next layer, without feedback between layers. It is one of the most widely used and rapidly developing artificial neural networks. The hyperparameters and options for the FNN were set as follows: the FNN consisted of four fully connected layers and four relu activation layers. The number of neurons in the hidden layer was set to 64, the loss function was MAE, and the optimizer used an Adam optimizer.
LSTM is a special RNN network designed to solve long-term dependencies. This network has been used to solve a variety of problems and is still widely used today. The hyperparameters and options for LSTM were set as follows: The number of hidden layer units in the LSTM model was set to 500. The input was a time series with a time length of 5 and a feature dimension of 214 (number of road segments), and the output was a vector with a number of 214 elements. The activation function was the relu function, the loss function was the MAE, and the optimizer was the Adam optimizer.
The assessed machine learning models included robust linear regression (RLR) [
49] and K-nearest neighbor (KNN) [
50]. RLR is a form of regression analysis that aims to overcome some limitations of traditional parametric and non-parametric methods, and aims to be less affected by violations of regression assumptions in the underlying data generation process. The KNN algorithm is an instance-based machine learning method that can be used for classification and regression problems. The idea of it is very simple, but it has shown excellent accuracy in practice. The fitting weight function of the RLR assessed here was bissquare, and the penalty constant was set to 10-3. The number of neighboring points in KNN was 5, with uniform weights and Minkowski distance used as the distance indicator.
The ensemble learning models compared here include random forest [
51] and adaboost models [
52]. Random forest refers to a predictor that uses multiple trees to train and predict samples. Adaboost is an iterative algorithm, the core idea of which is to train different prediction periods (weak predictors) for the same training set, and then combine these weak predictors to form a stronger final predictor (strong predictor). In the random forest model, the number of learners was set to 30, the maximum depth of the decision tree was set to 5, and the segmentation metric was set to squared error. The base learner of the adaboost model was set to a decision tree, the maximum depth of the decision tree was set to 5, the segmentation metric was set to squared error, and the number of learners was set to 30.
All models were tested on 214 road sections. In the experiment, the data from the first 48 days was used as the training set (1 August 2016 to 17 September 2016), and the data from the last 13 days was used as the test set (18 September 2016 to 30 September 2016). This paper used MAPE as the evaluation indicator, and the comparison results are shown in
Table 3. As can be seen from
Table 3, the MAPE value of the FNN model was 7.07%, the MAPE value of the LSTM model was 9.18%, the result of the RLR model was 8.11%, the prediction result of the KNN was 7.38%, the result of random forest was 7.12%, and the result of adaboost was 7.44%. Our model achieved a MAPE value of 6.65%, which was lower than all the comparison models. From the table, it can be seen that the model in this paper has improved prediction accuracy by 5.94% compared to the FNN model.
4.7. Time Complexity and Space Complexity
To analyze the time complexity of the model more comprehensively, the following aspects were analyzed: the time spent on extracting features from the entire dataset, the time spent on training the model on the entire dataset, and the time required for making a prediction with the model. To analyze the space complexity of the model, the memory size occupied by the model parameters was used. The main frequency of the computer was 1.9 Hz. The memory size was 16 GB, and the operating system was 64-bit. The results are shown in
Table 4. From the table, it can be seen that the method in this article can efficiently complete training and testing using current computers and configurations.
5. Conclusions
This paper proposed a new polynomial model integrating temporal, spatial, and periodic features for road traffic speed prediction. The method proposed in this paper was shown to effectively improve the prediction accuracy of traffic speed. Aiming at the problem of nonlinear modeling of traffic speed, a road speed prediction method based on a polynomial regression model was proposed. According to our research, we integrated the spatial features of traffic speed into our model. In order to obtain a model that considered both spatial features and did not contain too many variables, the spatial feature selection method of traffic speed was studied, and a spatial feature selection method based on the maximum mutual information coefficient was proposed. Given the strong periodicity of road traffic speed, according to our research, introducing periodicity features into a traffic speed prediction model can effectively improve prediction accuracy. Therefore, we also analyzed the periodic mode of traffic speed and introduced periodic features. In numerical experiments, the prediction accuracies of a univariate polynomial model, a spatiotemporal polynomial model, a periodic polynomial model, and a spatiotemporal periodic polynomial model were evaluated.
Compared with the univariate polynomial model, the prediction accuracy of the spatio-temporal polynomial model was optimized by 1.41% on average. The prediction accuracy of the periodic polynomial model was optimized by 1.5% on average, and the accuracy of the spatiotemporal periodic polynomial model was improved by 2.24% on average. The test results showed that the three methods proposed in this paper can effectively improve prediction accuracy. In addition, our model was also compared with FNN, LSTM, RLR, and KNN regression. Our model was shown to be superior to the existing general machine learning model and deep learning model. The test results showed that the prediction accuracy of this model was 5.94% better than that of the most advanced FNN.
The spatiotemporal periodic polynomial model in this article can achieve better results on data with complex mapping relationships, strong periodicity, and strong spatial correlation. Therefore, this method can provide more accurate road traffic speed prediction results, which has important practical significance for the implementation of intelligent traffic management systems and a reduction in traffic congestion. At present, the method in this article can only predict the road traffic speed in the next 10 min. In subsequent research, we will further discuss how to predict the road traffic speed for a longer period of time.