Machine Learning Ensemble Approach For Ionosphere and Space Weather Forecasting With Uncertainty Quantification
Machine Learning Ensemble Approach For Ionosphere and Space Weather Forecasting With Uncertainty Quantification
Machine Learning Ensemble Approach for Ionosphere and Space Weather Forecasting with
Uncertainty Quantification
Authorized licensed use limited to: SRM Institute of Science and Technology Kattankulathur. Downloaded on January 28,2025 at 08:37:12 UTC from IEEE Xplore. Restrictions apply.
crease for the correct ones. The final outcome represents the
weighted estimations from all iterations. Gradient boost-
ing offers a generalization of boosting to arbitrary differen-
tiable objective functions, where a tree is fitted to the gradi-
ent. XGBoost (eXtreme Gradient Boosting) [9] is an opti-
mized gradient boosting algorithm that uses regularization
to avoid overfitting. A detailed explanation of the meth-
ods and data sets can be found in [10]. These three learn-
ing algorithms were individually trained on the three data
sets, i.e. nine models were developed for each ionospheric
region, resulting in a total of 27 models. The developed
models were then combined into a super-ensemble model,
which provides average forecasts of all models within the
ensemble (Figure 2). Ensemble modeling is a standard ap-
proach for terrestrial weather forecasting. The standard de-
viation of ensemble members with respect to the ensemble
mean, known as ensemble spread, provides an estimate of
the forecasting uncertainties. A large (small) spread indi-
cates a low (high) confidence in the forecast.
(https://cddis.nasa.gov/archive/gnss/products/ionex). In
order to model the VTEC temporal and seasonal depen-
dencies, hour of day and day of year (DOY) were added as
features to the model. Exponential moving average (EMA)
and time derivatives of VTEC are additional features. The
exponential moving average gives higher weights on the
latest data values, and is, therefore, more sensitive to recent
values and recent value changes than a simple moving
average. The training data cover the period from January
2015 to December 2016, while the testing is performed for
the subsequent year 2017. Three data sets are prepared:
1) Original data (D1); 2) Daily differences for both inputs Figure 2. Overview of the developed ML models, trained
and outputs (D2); 3) Input data combining the first and on different data sets using tree-based learning algorithms.
the second data sets, while the output corresponds to the Afterward, they are combined into a super-ensemble model
original VTEC (D3). that provides the VTEC forecast and its uncertainty.
Authorized licensed use limited to: SRM Institute of Science and Technology Kattankulathur. Downloaded on January 28,2025 at 08:37:12 UTC from IEEE Xplore. Restrictions apply.
timated, and on September 9 (beginning of the recovery
phase), when the VTEC forecast is overestimated. In the
main storm phase, GIM VTEC is mostly within the ensem-
ble spread. The ensemble mean is slightly overestimated
during the main phase. During the recovery phase (Septem-
ber 10 - 13), the mid-latitude GIM VTEC is within the
ensemble spread and corresponds to the ensemble mean.
For the low-latitude region, the differences between the en-
semble mean and GIM VTEC are larger during daytime on
September 8 and 9. An overview of the RMSE of ML mod-
els within the ensemble, the ensemble mean and the mean
variance (2σ ) is provided in Table 1. The super-ensemble
model provides the optimal results.
Authorized licensed use limited to: SRM Institute of Science and Technology Kattankulathur. Downloaded on January 28,2025 at 08:37:12 UTC from IEEE Xplore. Restrictions apply.
feature, while blue stands for small ones. The x-axis corre- vier, Amsterdam, Netherlands, 22, May 2018, ISBN:
sponds to Shapley values that characterize the average im- 9780128117880.
pact on the model output. Positive Shapley values mean
that a feature increases the VTEC forecast, while negative [2] J. C. Uwamahoro, J. B. Habarulema “Modelling to-
values represent a feature impact that decreases the VTEC tal electron content during geomagnetic storm condi-
forecast. The larger the absolute Shapley values, the higher tions using empirical orthogonal functions and neu-
impact of a feature on the VTEC forecast. The most impor- ral networks,” Journal of Geophysical Research:
tant variables for the mid-latitude VTEC forecast are the Space Physics, 120, 3, December 2015, pp. 794–795,
current VTEC, hour of day, F10.7 solar activity index, ex- doi:10.1002/2015JA021961.
ponentially moving VTEC average and VTEC daily differ- [3] R. Tang, F. Zeng, Z. Chen “The comparison of pre-
ence. Other important variables are DOY, geomagnetic in- dicting storm-time ionospheric tec by three meth-
dices Kp and Dst, sunspot number and daily Dst difference. ods: Arima, lstm, and seq2seq,” Atmosphere, 11, 25,
March 2020, doi:10.3390/atmos11040316.
4 Conclusion
[4] A. V. Zhukov, Y. V. Yasyukevich, A. E. Bykov “Cor-
Today’s society relies on space- and ground-based techno- rection to: Gimli: Global ionospheric total electron
logical infrastructures, which are not adequately protected content model based on machine learning,” GPS So-
against space weather. Therefore, accurate forecasting and lutions, 25, 21, 22, November 2020, pp. 794–795,
early-warning systems of space weather events are urgently doi:10.1007/s10291-020-01063-1.
needed. This paper presents a novel model for forecasting
[5] J. H. Friedman “Greedy function approximation: A
VTEC and space weather effects in the ionosphere based
gradient boosting machine,” The Annals of Statis-
on ML and ensemble modeling. To get the most out of
tics, 29, 5, 1, October 2001, pp. 1189–1232,
the ML model, it is necessary to prepare relevant data and
doi:10.1214/aos/1013203451.
useful features that enhance learning, particularly of space
weather events. Also, it is critical to quantify the uncer- [6] R. Natras, M. Schmidt “Machine Learning Model
tainty of weather forecasts, especially when forecasting ex- Development for Space Weather Forecasting in the
treme weather events. In terrestrial weather forecasting, Ionosphere,” Proceedings of the CIKM 2021 Work-
this is typically achieved with ensemble forecasting sys- shops: 1st Workshop on Complex Data Challenges
tems. Following that approach, ML models were devel- in Earth Observation (CDCEO’21), CEUR-WS.org,
oped using different learning algorithms and different data 3052, 12, December 2021, http://ceur-ws.org/Vol-
sets and then combined via ensemble modeling to produce 3052/short10.pdf
a super-ensemble model with more reliable results that can
quantify their uncertainties. During the severe geomagnetic [7] L. Breiman “Random forests,” Machine Learn-
storm, the forecast of the super-ensemble ML model is less ing, 45, 1, 1, October 2001, pp. 5–32,
confident than after the storm. Most of the time, the mean doi:10.1023/A:1010933404324.
and the spread of the ensemble corresponds to the refer-
[8] Y. Freund, R. E. Schapire “A decision-theoretic
ence VTEC values. The largest differences occur during
generalization of on-line learning and an applica-
the initial storm phase and at the beginning of the recovery
tion to boosting,” Journal of Computer and Sys-
phase. The changes in the interplanetary and geomagnetic
tem Sciences, 55, 1, 1, August 1997, pp. 119–139,
fields were short-term and insufficient information on these
doi:10.1006/jcss.1997.1504.
changes was provided in the input data from a day before.
However, during the main storm phase, the reference VTEC [9] T. Chen, C. Guestrin “Xgboost: A scalable tree
does not deviate much from the ensemble spread. After the boosting system,” Proceedings of the 22nd ACM
storm, the ensemble spread is narrower and the ensemble SIGKDD international conference on knowledge dis-
mean agrees with the reference VTEC. covery and data mining, 1, August 2016, pp. 785–794,
doi:10.1145/2939672.2939785.
5 Acknowledgments
[10] R. Natras, B. Soja, M. Schmidt “Ensemble Machine
Learning of Random Forest, AdaBoost and XGBoost
We acknowledge the use of NASA/GSFC OMNIWeb ser-
for Forecasting Vertical Total Electron Content of the
vice for OMNI data, and the University of Bern for GIM
Ionosphere,” J Geod, submitted 13, December 2021
data. This study was funded by Research Grants - Doctoral
(under review).
Programmes in Germany from the German Academic Ex-
change Service (DAAD). [11] S. M. Lundberg, S. Lee “A Unified Approach to
Interpreting Model Predictions,” Advances in Neural
References Information Processing Systems, 30, 2017, pp. 4765-
4774, http://papers.nips.cc/paper/7062-a-unified-
[1] E. Camporeale, S. Wing, and J. Johnson, “Ma- approach-to-interpreting-model-predictions.pdf.
chine learning techniques for space weather,” Else-
Authorized licensed use limited to: SRM Institute of Science and Technology Kattankulathur. Downloaded on January 28,2025 at 08:37:12 UTC from IEEE Xplore. Restrictions apply.