Journal of Hydrology
Research papers
This manuscript was handled by G. Syme, Editor-in-Chief

Keywords:
Machine learning
Groundwater aquifer potential
Multi-adaptive-boosting-logistic-regression
GIS

Machine learning and data-driven models have achieved a favorable reputation in the field of advanced geospatial modeling, particularly for models of groundwater aquifer potential over large areas. Such models built using standalone machine learning techniques retain some uncertainty, including errors associated with the modeling process, sampling approach, and input hyper-parameters. Some of these techniques cannot be applied in data-scarce regions because high bias and variance can lead to oversimplification. Therefore, in the current study, we developed and validated a novel ensemble multi-adaptive boosting logistic regression (MABLR) model for groundwater aquifer potential mapping. This model was validated in a large area of the Gyeongsangbuk-do basin in South Korea and the results were compared to those of different types of machine learning models including multilayer perceptron (MLP), logistic regression (LR), and support vector machine (SVM) models. A forward stepwise LR technique was implemented to assess the importance of contributing morphological factors; we found 15 factors that contributed significantly: topographic wetness index (TWI), topographic roughness index (TRI), stream power index (SPI), topographic position index (TPI), multi-resolution valley bottom flatness (MVBF), slope, aspect, slope length (LS), distance from the river, distance from the fault, profile curvature, plane curvature, altitude, land use/land cover (LULC), and geology. We optimized the MABLR model using a fuzzy logic supervised (FLS) approach with 184 iterations and then validated the results using accuracy assessment metrics including the κ coefficient, root-mean-square error (RMSE), the receiver operating characteristic (ROC) curve, and the precision-recall curve (PRC). Our model had the best predictive performance among the models tested, with higher overall goodness-of-fit and validation values according to the κ coefficient (0.819 and 0.781, respectively), ROC (0.917 and 0.838), and PRC (0.931 and 0.872). Our experimental results demonstrate that MABLR is more effective at reducing bias and variance error than other constituent machine learning methods.
are tools often used for mapping groundwater. Although these methods provide detailed recognition of subsurface hydrogeological structures (Helaly, 2017), they can be time-consuming and costly (Nampak et al., 2014). The development of geographic information systems (GIS), statistical techniques, machine learning models, and remote sensing data has led to advances in groundwater potential analyses (Yin et al., 2018). GIS and remote sensing technology have been used as spatial research tools in numerous environmental applications including hydrological studies and natural hazard risk assessments (Mojaddadi et al., 2017; Rizeei et al., 2018a,b; Rizeei et al., 2016, 2018c). Recently, machine learning and data mining methods have been implemented in many groundwater studies due to their ability to recognize patterns within inventory datasets and nonlinear relationships between parameters (Naghibi et al., 2018).

Numerous forms of GIS-based and machine learning models have been applied to groundwater potential mapping, including multi-criteria decision analysis (Kaliraj et al., 2014; Pradhan, 2010), frequency ratio (Guru et al., 2017; Oh et al., 2011; Rahmati et al., 2016), Dempster-Shafer theory (Rahmati and Melesse, 2016), weights-of-evidence modelling (Corsini et al., 2009; Ghorbani Nejad et al., 2017), Self-learning Random Forests (Sameen et al., 2018), logistic regression (Ozdemir, 2011; Rizeei et al., 2018a), decision tree (Chenini et al., 2010), evidential belief function (Mogaji et al., 2015), the logistic model tree (Rahmati et al., 2018), certainty factor (Razandi et al., 2015), analytical hierarchy process (Adiat et al., 2012; Yin et al., 2018), the statistical index (Falah et al., 2017), multivariate adaptive regression spline (Zabihi et al., 2016), index of entropy (Al-Abadi and Shahid, 2015), boosted regression tree (Naghibi et al., 2016), multivariate adaptive regression splines (Rahmati et al., 2019), artificial neural network model (Corsini et al., 2009), and aquifer sustainability factor (Smith et al., 2010).

In most cases, statistical and machine learning models perform well; however, if the training sample size is inadequate, these models tend to oversimplify reality. Sources of uncertainty in groundwater modeling include the modeling process, input parameters, and sampling approach (Refsgaard et al., 2007). The ensemble evidential belief function (Mohammady et al., 2012) and tree-based models were proposed to create groundwater potential maps (Naghibi et al., 2019). The development of ensemble models has allowed the integration of a base-learner approach with a prime algorithm to achieve more robust models that can be applied over large study areas, where data coverage can be inconsistent (Naghibi et al., 2017). However, the application of hybrid models should be explored for different regions to determine the optimum model in terms of accuracy, robustness, overfitting, and sensitivity to scarce data (Rahmati et al., 2018).

To reduce these modeling uncertainties, we coupled a multi-adaptive boosting hybrid model (MultiAdaBoosting), based on a decision-committee technique that combines adaptive boosting (AdaBoost) with wagging, with logistic regression (LR), a robust model with strict expectations prior to training (Pradhan, 2010), to develop the ensemble MABLR model. Although MultiAdaBoosting is one of the powerful
input. Pixels with a value of zero are assigned to flat regions.

TPI indicates the position of each cell, and is calculated as follows (De Reu et al., 2013; Guisan et al., 1999):

$$ \mathrm{TPI} = \frac{E_{pixel}}{E_{surrounding}} \tag{1} $$

where $E_{pixel}$ is the altitude of the cell and $E_{surrounding}$ is the mean altitude of the neighboring pixels. High TPI values indicate upper slopes, while low TPI values indicate lower slopes, where the groundwater aquifer potential is high. The MVBF index links the size and flatness of valley bottoms, which was incorporated into the algorithm by reducing the slope threshold. A value of zero specifies erosional terrain with a lower possibility of a groundwater aquifer, while values above 1 indicate areas of deposition with a more productive groundwater aquifer.

The MVBF index reflects the valley bottom characteristics of flatness and lowness. Flatness is measured using the inverse of the slope, and lowness is measured using ranking elevation with respect to a circular surrounding area. These two measures, both scaled from 0 to 1, are combined by multiplication and can be interpreted as fuzzy set membership functions (Gallant and Dowling, 2003; Kaufmann, 1975). LS is a combination of slope gradient (S) and slope length (L). We adopted an extensively used method for calculating LS, as follows:

$$ \mathrm{LS} = \left( \frac{A_s}{22.13} \right)^{0.4} \left( \frac{\sin\beta}{0.0896} \right)^{1.3} \tag{2} $$

where $A_s$ is the accumulated flow of the unit stream power theory, which considers sediments and water, and $\beta$ is the slope in degrees.
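As an illustration of how Eqs. (1) and (2) could be evaluated on gridded data, the following minimal Python sketch may help. It assumes that Eq. (1) is the ratio of the cell altitude to the mean altitude of its eight neighbors and that Eq. (2) takes the accumulated flow and the slope in degrees. The function names `tpi_ratio` and `ls_factor` and the 3 × 3 altitude grid are hypothetical and are not taken from the study.

```python
import math


def tpi_ratio(dem, row, col):
    """Eq. (1) for one interior cell: cell altitude divided by the mean
    altitude of its eight neighbors (assumed neighborhood definition)."""
    neighbours = [
        dem[r][c]
        for r in range(row - 1, row + 2)
        for c in range(col - 1, col + 2)
        if (r, c) != (row, col)
    ]
    return dem[row][col] / (sum(neighbours) / len(neighbours))


def ls_factor(flow_acc, slope_deg):
    """Eq. (2): LS from accumulated flow and slope given in degrees."""
    beta = math.radians(slope_deg)  # math.sin expects radians
    return (flow_acc / 22.13) ** 0.4 * (math.sin(beta) / 0.0896) ** 1.3


# Hypothetical 3 x 3 altitude grid (m) with a small peak in the center.
dem = [
    [100.0, 100.0, 100.0],
    [100.0, 110.0, 100.0],
    [100.0, 100.0, 100.0],
]

print("TPI at center:", tpi_ratio(dem, 1, 1))
print("LS at reference values:", ls_factor(22.13, math.degrees(math.asin(0.0896))))
```

Under these assumptions, a cell higher than its surroundings yields a ratio above 1, and LS equals 1 when the accumulated flow is 22.13 and sin β is 0.0896, which are the normalizing constants in Eq. (2).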
In general, low LS values are more likely to indicate a productive groundwater aquifer.

SPI and TWI are water-related parameters calculated as follows (Gokceoglu et al., 2005):

$$ \mathrm{SPI} = A_s \tan\beta \tag{3} $$

$$ \mathrm{TWI} = \ln\left( \frac{A_s}{\tan\beta} \right) \tag{4} $$

where $A_s$ is the catchment area or flow accumulation (m² m⁻¹) and $\beta$ is the local slope gradient measured in degrees. SPI indicates the erosive power of water flow. TWI represents the effects of topography on runoff generation and the amount of flow accumulation at any location within the river catchment (Gokceoglu et al., 2005). The accuracy of a topographic index can be estimated with respect to grid spacing and terrain roughness by comparing the relationship between the topographic index surface and reference data.

TRI is another morphological parameter widely used in groundwater analyses; it is calculated in this study as follows:

$$ \mathrm{TRI} = \mathrm{Abs}\left( \max{}^{2} - \min{}^{2} \right) \tag{5} $$

where max and min represent the largest and smallest values of cells in the nine-cell rectangular neighborhood of altitude values.

LULC types are also primary factors that strongly contribute to groundwater potential modeling. A detailed understanding of LULC is of great significance for environmental and natural hazard studies (Rizeei et al., 2016). Lithology and geology are also important parameters used to detect sensitive groundwater aquifer areas. Soil type
directly affects the drainage process via characteristics such as texture, permeability degree, and structure. Lithological information regarding the permeability of rocks is also required. The study area contained rocks from 142 different lithology classes.

Variation in the factors contributing to the behavior and activity of groundwater causes ambiguity in the overlaying process. Therefore, all factors were normalized to a common scale in the feature raster before overlaying (Youssef et al., 2015; Mojaddadi et al., 2017; Fanos and Pradhan, 2019).

2.2. Model optimization

2012). The FLS evaluated hyper-parameters through a search run iteratively from a random vertex, which calculated the ideal value within the available domain. After all runs were assessed, the optimal hyper-parameter configuration was selected within 184 iterations according to evaluation metrics. The optimal hyper-parameters for all proposed models are summarized in Table 1.

2.3. Theory of the LR, SVM, MLP, and MABLR models

LR is a widely used multivariate statistical model that can be applied to continuous or discrete data of any distribution or raster format
(Lee and Sambath, 2006a,b). It was proposed by McFadden (1974) to measure the probability of occurrence depending on contributing parameters. LR can be used to evaluate relationships between a binary dependent variable and nominal or scalar independent variables (Shirzadi et al., 2012).

SVM was designed on the basis of statistical learning theory to minimize operational uncertainty (Yao et al., 2008). This process converts nonlinear structures into linear structures through hyperplane creation (Tehrany et al., 2014). A separating hyperplane is created in the original space of n coordinates between points of two different categories (Marjanović et al., 2011). The hyperplane separates training datasets based on a kernel function of the SVM. Support vectors are the training points closest to the ideal hyperplane. The goal of the SVM model is to recognize the ideal separating hyperplane.

The MLP algorithm is a feed-forward artificial neural network (NN) that uses nodes linked by input signals and numeric weights to produce layers that receive, process, and display output (Harun et al., 2010). Back-propagation is applied to reduce errors accumulated via the repetitive approach. NNs have successfully been utilized in remote sensing applications. Limitations of the MLP model include high computational costs and overlearning (Mia and Dhar, 2016).

MultiAdaBoosting merges AdaBoost with wagging to produce a decision-committee model (Webb, 2000; Bui et al., 2016) that reduces both variance and bias. Although it cannot be applied to committees of < 10 members, MultiAdaBoosting exhibits greater error reduction than other related committee algorithms (Kotsiantis et al., 2007). In comparison, MABLR uses LR for classifier-based learning to generate decision committees with less error than either wagging or MultiAdaBoosting, even for a large cross-section of datasets (Webb, 2000). MABLR is more efficient than MultiAdaBoosting due to its matching parallel execution algorithms. The steps of MABLR implementation are shown in Fig. 4.

All classifiers determined by wagging are independent from all others, permitting parallel multiplication and creating uncertainty in the MultiAdaBoosting model at the sub-committee class. MABLR improves error reduction compared to other approaches, including bagging decision trees, wagging, and MultiAdaBoosting, particularly at ∊t < 10, when variance is amplified, thus reducing the frequency at which the central tendency is created and therefore reducing its ability to contribute to uncertainty.

2.4. Evaluation methods

The following evaluation metrics were applied to assess the accuracy of groundwater potential models: RMSE, κ coefficient, ROC, and PRC.

The κ coefficient measures the overall accuracy of the model among all correctly assigned samples on the diagonal of the error matrix allocated by the full dataset (Ridd and Liu, 1998). The κ coefficient is calculated as follows:

$$ K = \frac{M \sum_{i=1}^{r} x_{ii} - \sum_{i=1}^{r} x_{i+} x_{+i}}{M^{2} - \sum_{i=1}^{r} x_{i+} x_{+i}} \tag{6} $$

where $r$ is the total number of rows in the error matrix, $x_{ii}$ is the number of observations in row $i$ and column $i$, $x_{i+}$ and $x_{+i}$ are the marginal totals of row $i$ and column $i$, and $M$ is the total number of observations.

ROC curves are designed to evaluate and visualize the performance of an analytical model; they plot sensitivity, or the true positive rate (TP), associated with a decision threshold on the y-axis against 1 − specificity, or the false positive rate (FP), on the x-axis (Fawcett, 2006). Sensitivity and specificity represent the probabilities that positive and negative pixels, respectively, are classified correctly. The area under the ROC curve estimates the overall accuracy of the model (Nampak et al., 2014; Pradhan, 2010). However, evaluation of the model solely by visual interpretation of ROC can be misleading; thus, the precision-recall curve (PRC) is a complementary evaluation metric that is useful for imbalanced datasets. The PRC shows the relationship between the positive predictive value (PPV), or precision, and sensitivity for all possible pixels, from which TP and FP can be calculated. The PRC graph is plotted as PPV against sensitivity: the x-axis represents recall or sensitivity, and the y-axis represents precision. Each point on the PRC graph thus represents a selected cut-off. A perfect model will have areas under the ROC curve and PRC of 1, whereas a value approaching 0 indicates an inaccurate model.

RMSE is used to evaluate differences between the observed sample values and predicted model values. RMSE is the square root of the second sample moment of the deviations, that is, the quadratic mean of the deviations between observed and predicted values (Hyndman and Koehler, 2006). RMSE was calculated as follows:

$$ \mathrm{RMSE} = \sqrt{ \frac{ \sum_{i=1}^{n} \left( X_{test} - X_{train} \right)^{2} }{ n } } \tag{7} $$

where $X_{test}$ is the set of testing values and $X_{train}$ is the set of training values at $i$.
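The metrics in Eqs. (6) and (7), together with a single point on the PRC, can be sketched in a few lines of Python. This is an illustrative sketch only: the function names `kappa`, `rmse`, and `precision_recall`, the 2 × 2 error matrix, and the sample values are hypothetical and are not results from this study. The sketch assumes a square error matrix with predicted classes in rows and observed classes in columns, and it treats scores at or above the cut-off as positive.

```python
import math


def kappa(matrix):
    """Eq. (6): kappa coefficient from a square r x r error matrix."""
    r = len(matrix)
    m = sum(sum(row) for row in matrix)              # total observations M
    diagonal = sum(matrix[i][i] for i in range(r))   # sum of x_ii
    row_totals = [sum(matrix[i]) for i in range(r)]  # x_i+
    col_totals = [sum(matrix[j][i] for j in range(r)) for i in range(r)]  # x_+i
    chance = sum(row_totals[i] * col_totals[i] for i in range(r))
    return (m * diagonal - chance) / (m * m - chance)


def rmse(observed, predicted):
    """Eq. (7): square root of the mean squared deviation."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def precision_recall(labels, scores, cutoff):
    """One PRC point: (precision, recall) at a chosen cut-off."""
    tp = sum(1 for y, s in zip(labels, scores) if s >= cutoff and y == 1)
    fp = sum(1 for y, s in zip(labels, scores) if s >= cutoff and y == 0)
    fn = sum(1 for y, s in zip(labels, scores) if s < cutoff and y == 1)
    # Assumed convention: precision is 1.0 when nothing is predicted positive.
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    recall = tp / (tp + fn)
    return precision, recall


# Hypothetical inputs for illustration only.
error_matrix = [[20, 5], [10, 15]]
print("kappa:", kappa(error_matrix))
print("RMSE:", rmse([1, 0, 1, 1], [0.5, 0.5, 0.5, 0.5]))
print("PRC point:", precision_recall([1, 1, 0, 0, 1], [0.9, 0.4, 0.6, 0.2, 0.8], 0.5))
```

Sweeping the cut-off over all observed scores and collecting the resulting (recall, precision) pairs would trace the full PRC; the same sweep with true and false positive rates would trace the ROC curve.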
Fig. 5. Groundwater aquifer potential maps calculated by a) LR, b) MLP, c) SVM, and d) MABLR models.
In particular, MABLR results indicated that locations with the highest groundwater aquifer potential were mainly situated in the western and southwestern regions of the study area (Fig. 5). By contrast, very low groundwater potential was assigned to eastern and northwestern regions of the study area. Among a total of 59 productive wells, 47 were assigned to very high and high groundwater aquifer potential zones, indicating the high precision of the MABLR model.

The MABLR model was used to extract the degree of contribution of each factor (Fig. 6). The most effective parameter in determining groundwater potential was SPI, with a weight of 4.844. Variation in SPI can directly increase or decrease groundwater potential. Plane curvature and MVBF were the second and third most influential factors, with weights of 4.315 and 4.240, respectively. These factors significantly affected runoff behavior and further delineated areas of groundwater concentration. Other hydrological and morphological factors including TPI, TRI, and LS also contributed greatly to groundwater potential zones, with weights of 3.076, 3.039, and 2.537, respectively. Altitude, index, TWI, and profile curvature made moderate contributions
Fig. 6. Weights assigned to each contributing factor by the MABLR model.
Table 2
The results of goodness-of-fit and validation evaluation of all the applied models.
Goodness of fit Validation
goodness-of-fit and validation assessment, respectively. SVM placed third among all accuracy assessment metrics. SVM RMSE values indicated the highest error in terms of success rate (0.3119), and the second lowest error in terms of validation rate (0.3217).

LR showed the lowest accuracy among all models, with κ coefficient, ROC, and PRC values of 0.5569, 0.822, and 0.8685, respectively, in terms of goodness-of-fit and 0.5401, 0.745, and 0.8116 in terms of validation. RMSE values indicated that LR had a slightly lower error than the SVM in terms of success rate (0.2937) but the highest error among all models in terms of validation rate (0.4704).

In general, all models examined in this study had an acceptable amount of uncertainty and high goodness-of-fit. The well-calibrated ensemble MABLR model exhibited the highest performance for modeling groundwater aquifer potential.

4. Conclusion

Sustainable groundwater aquifer management requires precise modeling to accurately and reliably simulate conditions in nature. Modeling groundwater aquifer potential is a delicate process involving the estimation of several morphological and hydrological parameters. Several techniques have been proposed for groundwater potential mapping; however, not all can be applied in data-scarce regions where bias and variance are high, as they tend toward oversimplification.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This research was supported by the Basic Research Project of the Korea Institute of Geoscience and Mineral Resources (KIGAM) and Science and Technology Internationalization Project (NRF-2016K1A3A1A09915721) funded by the Ministry of Science and ICT. The research is supported by the Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), University of Technology Sydney under grant numbers: 323930, 321740.2232335; 321740.2232424 and 321740.2232357.

Appendix A. Supplementary data

Supplementary data to this article can be found online at https://
