Pearson Correlation Coefficient-Based Performance Enhancement of Broad Learning System For Stock Price Prediction
Pearson Correlation Coefficient-Based Performance Enhancement of Broad Learning System For Stock Price Prediction
net/publication/359328516
Article in IEEE Transactions on Circuits and Systems II: Express Briefs · May 2022
DOI: 10.1109/TCSII.2022.3160266
CITATIONS READS
43 517
5 authors, including:
Choujun Zhan
South China Normal University
102 PUBLICATIONS 1,144 CITATIONS
SEE PROFILE
All content following this page was uploaded by Choujun Zhan on 26 June 2022.
Abstract—Accurate prediction of a stock price is a challeng- system [2]–[4]. However, the non-linearity, high noise and
ing task due to the complexity, chaos, and non-linearity nature of volatility of the stock market make the prediction of financial
financial systems. In this brief, we proposed a multi-indicator fea- systems take higher risks than to other complex systems [5]. In
ture selection method for stock price prediction based on Pearson
correlation coefficient (PCC) and Broad Learning System (BLS), addition, external factors such as national macro adjustments,
named the PCC-BLS framework. Firstly, PCC was used to select changes from the political situation and investor psychology
the input features from 35 features, including original stock make the prediction of the financial system extremely diffi-
price, technical indicators, and financial indicators. Secondly, cult [6], [7]. Data-driven methods show great potential for the
these screened input features were used for rapid information analysis and control of such complex systems [8], [9].
feature extraction and training a BLS. Four stocks recorded on
the Shanghai Stock Exchange or Shenzhen Stock Exchange were With the rapid development of data acquisition, storage
adopted to evaluate the performance of the proposed method. In and machine learning and big data techniques, researchers
addition, we compared the forecasting results with ten machine are beginning to use machine learning methods to try to
learning methods, including Support Vector Regression (SVR), capture nonlinear patterns of financial systems. Fischer and
Adaptive Boosting (Adaboost), Bootstrap aggregating (Bagging), Krauss predicted the S&P 500 trend from the up-down sig-
Random Forest (RF), Gradient Boosting Decision Tree (GBDT),
Multi-layer Perceptron (MLP), Convolutional Neural Network nals generated by LSTM [10]. Deep Neural Networks (DNNs)
(CNN), and Long Short-Term Memory (LSTM), Gated Recurrent also has shown remarkable results in machine learning prob-
Unit (GRU) and Broad Learning System (BLS). Among all algo- lems [11]–[13]. A hybrid model based on Naive Bayes
rithms used in this brief, the proposed model showed the best classifier and LSTM was used to predict the opening price
performance with the highest model fitting ability. of listed companies in China [14]. Thakkar et al. proposed
Index Terms—Broad learning system, machine learning, a method based on Pearson Correlation Coefficient (PCC) to
Pearson correlation coefficient, time series forecasting, complex initialize the edge weights of neurons between the input and
system. hidden layers of the vanilla neural network (VNN) [15]. In
conclusion, machine learning methods are good candidates for
volatility financial time series forecasting [16].
I. I NTRODUCTION Recent researches indicate that feature selection is essen-
ORECASTING stock prices is of significant importance
F for analyzing financial systems, as the stock market
reflects the economic situation of a country, even the world.
tial for improving the accuracy of stock market prediction.
Several feature selection approaches are taken to effectively
filter redundant or irrelevant features [17], [18]. Ni et al.
Accurate prediction of the stock market allows investors and hybridized the fractal feature selection method and Support
other stakeholders to understand the movement of the financial Vector Machine (SVM) to predict the daily trend of Shanghai
market. Effective trading strategies are then adopted to achieve Stock Exchange Index [19]. Yang et al. filtered the training
greater profits as well as returns with less risk [1]. In general, features by the Maximum Information Coefficient (MIC) and
the stock market is often analyzed by researchers as a complex built an ensemble forecasting model for stock price movement
prediction based on Support Vector Machine (SVM), RF, and
Manuscript received February 7, 2022; accepted March 11, 2022. Date Adaboost [20]. Huang and Tsai established a hybrid SOFM-
of publication March 17, 2022; date of current version May 3, 2022. This
work was supported in part by the Natural Science Foundation of Guangdong SVR based on filter feature selection to improve prediction
Province, China, under Grant 2020A1515010761, and in part by the Key Areas accuracy and predict the fluctuation of the Taiwan Index
Research and Development Program of Science and Technology Program (FITX) [21].
of Guangzhou under Grant 202103010005. This brief was recommended by
Associate Editor H. Li. (Corresponding author: Choujun Zhan.) This brief proposed a novel framework for forecasting the
Guanzhi Li, Aining Zhang, Qizhi Zhang, and Choujun Zhan are with the movement of complex financial systems called the Pearson
School of Computing, South China Normal University, Guangzhou 510641, correlation coefficient and Broad Learning System (PCC-BLS)
China (e-mail: [email protected]).
Di Wu is with the Department of ICT and Natural Science, Norwegian framework. The proposed method was used to predict the close
University of Science and Technology, 7491 Trondheim, Norway (e-mail: price of a stock listed in China for the day ahead. For verifying
[email protected]). the performance of the proposed model, ten machine learn-
Color versions of one or more figures in this article are available at
https://doi.org/10.1109/TCSII.2022.3160266. ing models were adopted for comparison. Experimental results
Digital Object Identifier 10.1109/TCSII.2022.3160266 indicate that the PCC-BLS shows more accurate results with
1549-7747
c 2022 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on June 19,2022 at 12:02:08 UTC from IEEE Xplore. Restrictions apply.
2414 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—II: EXPRESS BRIEFS, VOL. 69, NO. 5, MAY 2022
TABLE I
I NPUT F EATURES FOR M ACHINE L EARNING M ODELS
II. M ETHODOLOGY
A. Broad Learning System
Broad Learning System (BLS), based on the random vector
functional link neural network (RVFLNN), is a flat network
proposed by Chen and Liu [22]. The framework of a typical
BLS is shown in Fig. 1. The input features are first transformed
into mapped features by mapping functions and stored in the
feature nodes, which would be extended to enhancement nodes
through nonlinear activation functions and random weights. In
BLS, all the outputs of the enhancement and feature nodes are
connected to the output layer, and the connection weights are
derived by ridge regression of the pseudoinverse.
Considering the time series forecasting problem, the training
data set is represented by {(X, Y) | X ∈ RN×D , Y ∈ RN×1 }.
The mapping function first transforms the training samples
into mapping feature spaces in BLS. For n feature mappings,
each generates k nodes and this procedure can be expressed
as the following equation:
Zi = φ XWei + βei , i = 1, . . . , n, (1)
where weights Wei and bias term βei are randomly gener-
ated matrices with the proper dimensions. φ(·) is usually the
transformation of the activation function. The feature space of
training samples are defined as Z n = [Z1 , Z2 , . . . , Zn ] and the
the highest model fitting ability in terms of stock prediction. ith group of enhancement nodes is denoted as follow:
The main contributions of this brief are as follows.
• A large-scale dataset, containing multiple factors that Hj = ξ(Z n Whj + βhj ), j = 1, 2, . . . , m, (2)
affect the closing price, has been collected. The collection where weights Whj and bias term βhj are also generated
of features involves technical and financial indicators in randomly, and ξ(·) represents the activation function. And
addition to the original features (shown in Table I). we denote the outputs of the enhancement layer by Hm
• PCC is adopted for feature selection analysis to improve
[H1 , H2 , . . . , Hm ]. Therefore, the output Y of BLS can be
BLS. The PCC-BLS model is proposed for forecasting represented as:
the closing price of listed companies in China.
• Ten machine learning methods, including ensemble learn- Y = [Z1 , Z2 , . . . , Zn | H1 , H2 , . . . , Hm ]W m
ing, RF, standard BLS and etc., have been adopted to = Zn | Hm W m
predict the closing prices of the adopted stock price and
= AW m , (3)
compare the performance with the proposed model.
The rest of this brief is organized as follows: In Section II, where A = |
[Z n H m ]. Wm
is the output weight connecting the
the proposed Pearson correlation coefficient (PCC) and Broad feature and enhancement nodes to the output layer. W m can
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on June 19,2022 at 12:02:08 UTC from IEEE Xplore. Restrictions apply.
LI et al.: PCC-BASED PERFORMANCE ENHANCEMENT OF BLS 2415
TABLE II
be rapidly approximated by the ridge regression. SZ.000776: T HE I NPUT F EATURES OF THE PCC-BLS M ODEL
−1
W m = λI + AT A AT Y, (4)
where λ denotes a regularisation parameter to balance the
effect of the error term.
BLS allows output weights to be obtained quickly by pseu-
doinversion, which not only ensures efficient modeling but
also avoids structural redundancy. Therefore, the BLS provides TABLE III
SH.600019: T HE I NPUT F EATURES OF THE PCC-BLS M ODEL
an alternative approach to a deep learning framework, which
is typically subject to time-consuming training of numerous
parameters in a layer.
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on June 19,2022 at 12:02:08 UTC from IEEE Xplore. Restrictions apply.
2416 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—II: EXPRESS BRIEFS, VOL. 69, NO. 5, MAY 2022
TABLE V
SH.600837: T HE I NPUT F EATURES OF THE PCC-BLS M ODEL
TABLE IX
SH.600050: T HE E VALUATION VALUE OF D IFFERENT M ODELS
TABLE VI
SZ.000776: T HE E VALUATION VALUE OF D IFFERENT M ODELS
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on June 19,2022 at 12:02:08 UTC from IEEE Xplore. Restrictions apply.
LI et al.: PCC-BASED PERFORMANCE ENHANCEMENT OF BLS 2417
R EFERENCES
[1] W. Lu, J. Li, J. Wang, and L. Qin, “A CNN-BiLSTM-AM method
for stock price prediction,” Neural Comput. Appl., vol. 33, no. 10,
pp. 4741–4753, 2021.
[2] C. K. Tse, J. Liu, and F. C. M. Lau, “A network perspective of the stock
market,” J. Empirical Financ., vol. 17, no. 4, pp. 659–667, 2010.
[3] X. F. Liu and C. K. Tse, “A complex network perspective of world
stock markets: Synchronization and volatility,” Int. J. Bifurcation Chaos,
vol. 22, no. 6, 2012, Art. no. 1250142.
[4] G. Chen, Y. Lou, and L. Wang, “A comparative study on controllability
robustness of complex networks,” IEEE Trans. Circuits Syst. II, Exp.
Briefs, vol. 66, no. 5, pp. 828–832, May 2019.
[5] M. Small and C. K. Tse, “Determinism in financial time series,” Stud.
Nonlinear Dyn. Econom., vol. 7, no. 3, pp. 1–29, 2003.
[6] J. Dong, W. Dai, Y. Liu, L. Yu, and J. Wang, “Forecasting chinese
stock market prices using baidu search index with a learning-based data
collection method,” Int. J. Inf. Technol. Decis. Making, vol. 18, no. 5,
pp. 1605–1629, 2019.
[7] D. Zhang and S. Lou, “The application research of neural network and
BP algorithm in stock price pattern classification and prediction,” Future
Gener. Comput. Syst., vol. 115, pp. 872–879, Feb. 2021.
[8] Q. Li et al., “Integrating reinforcement learning and optimal power dis-
patch to enhance power grid resilience,” IEEE Trans. Circuits Syst. II,
Exp. Briefs, vol. 69, no. 3, pp. 1402–1406, Mar. 2021.
[9] X. Zhang et al., “Braess paradox and double-loop optimization method
to enhance power grid resilience,” Rel. Eng. Syst. Safety, vol. 215, Nov.
2021, Art. no. 107913.
[10] T. Fischer and C. Krauss, “Deep learning with long short-term memory
networks for financial market predictions,” Eur. J. Oper. Res., vol. 270,
no. 2, pp. 654–669, 2018.
[11] B. N. G. Koneru and V. Vasudevan, “Sparse artificial neural networks
using a novel smoothed LASSO penalization,” IEEE Trans. Circuits Syst.
II, Exp. Briefs, vol. 66, no. 5, pp. 848–852, May 2019.
[12] L. Bai, Y. Zhao, and X. Huang, “A CNN accelerator on FPGA using
depthwise separable convolution,” IEEE Trans. Circuits Syst. II, Exp.
Briefs, vol. 65, no. 10, pp. 1415–1419, Oct. 2018.
[13] A. R. Mohamed, L. Qi, Y. Li, and G. Wang, “A generic nano-Watt power
fully tunable 1-D Gaussian kernel circuit for artificial neural network,”
IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 67, no. 9, pp. 1529–1533,
Sep. 2020.
[14] Q. Zhuge, L. Xu, and G. Zhang, “LSTM neural network with emo-
tional analysis for prediction of stock price,” Eng. Lett., vol. 25, no. 2,
pp. 167–175, 2017.
[15] A. Thakkar, D. Patel, and P. Shah, “Pearson correlation coefficient-
based performance enhancement of vanilla neural network for stock
trend prediction,” Neural Comput. Appl., vol. 33, pp. 1–16, Jul. 2021.
[16] B. Lim and S. Zohren, “Time-series forecasting with deep learn-
ing: A survey,” Philos. Trans. Roy. Soc. A, vol. 379, no. 2194, 2021,
Art. no. 20200209.
[17] C.-F. Tsai and Y.-C. Hsiao, “Combining multiple feature selection meth-
ods for stock prediction: Union, intersection, and multi-intersection
approaches,” Decis. Support Syst., vol. 50, no. 1, pp. 258–269, 2010.
[18] D. Liu, C. K. Tse, and X. Zhang, “Robustness assessment and enhance-
ment of power grids from a complex network’s perspective using
decision trees,” IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 66, no. 5,
pp. 833–837, May 2019.
[19] L.-P. Ni, Z.-W. Ni, and Y.-Z. Gao, “Stock trend prediction based on
fractal feature selection and support vector machine,” Expert Syst. Appl.,
Fig. 2. Actual and predicted price of the four experimental stocks using vol. 38, no. 5, pp. 5569–5576, 2011.
PCC-BLS. [20] J. Yang, R. Rao, P. Hong, and P. Ding, “Ensemble model for stock price
movement trend prediction on different investing periods,” in Proc. 12th
Int. Conf. Comput. Intell. Security (CIS), 2016, pp. 358–361.
[21] C.-L. Huang and C.-Y. Tsai, “A hybrid SOFM-SVR with a filter-
based feature selection for stock market forecasting,” Expert Syst. Appl.,
that PCC-BLS provided more accurate forecasting of future vol. 36, no. 2, pp. 1529–1539, 2009.
values than other prediction models mentioned above. [22] C. L. P. Chen and Z. Liu, “Broad Learning System: An effective and
However, in this brief, we are short on the practical appli- efficient incremental learning system without the need for deep architec-
ture,” IEEE Trans. Neural Netw. Learn. Syst., vol. 29, no. 1, pp. 10–24,
cation of the proposed model. In the future, we will adopt Jan. 2018.
an actual stock investment strategy and evaluate the practi- [23] K. Pearson, “Contributions to the mathematical theory of evolution,”
cal application of the model through returns. Future research Philos. Trans. Roy. Soc. London A, vol. 185, pp. 71–110, Dec. 1894.
[24] A. Thakkar and K. Chaudhari, “Fusion in stock market prediction: A
will also focus on the comparison of different feature selection decade survey on the necessity, recent developments, and potential future
methods. directions,” Inf. Fusion, vol. 65, pp. 95–107, Jan. 2021.
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on June 19,2022 at 12:02:08 UTC from IEEE Xplore. Restrictions apply.
View publication stats