LR4 Groundwater Aquifer Potential Modeling Using An Ensemble Multi-Adoptive
LR4 Groundwater Aquifer Potential Modeling Using An Ensemble Multi-Adoptive
LR4 Groundwater Aquifer Potential Modeling Using An Ensemble Multi-Adoptive
Journal of Hydrology
journal homepage: www.elsevier.com/locate/jhydrol
Research papers
A R T I C LE I N FO A B S T R A C T
This manuscript was handled by G. Syme, Machine learning and data-driven models have achieved a favorable reputation in the field of advanced geos-
Editor-in-Chief patial modeling, particularly for models of groundwater aquifer potential over large areas. Such models built
Keywords: using standalone machine learning techniques retain some uncertainty, including errors associated with the
Machine learning modeling process, sampling approach, and input hyper-parameters. Some of these techniques cannot be applied
Groundwater aquifer potential in data-scarce regions because high bias and variance can lead to oversimplification. Therefore, in the current
Multi-adaptive-boosting-logistic-regression study, we developed and validated a novel ensemble multi-adaptive boosting logistic regression (MABLR) model
GIS for groundwater aquifer potential mapping. This model was validated in a large area of the Gyeongsangbuk-do
Optimization
basin in South Korea and the results were compared to those of different types of machine learning models
including multiple-layer perception (MPL), logistic regression (LR), and support vector machine (SVM) models.
A forward stepwise LR technique was implemented to assess the importance of contributing morphological
factors; we found 15 factors that contributed significantly: topographic wetness index (TWI), topographic
roughness index (TRI), stream power index (SPI), topographic position index (TPI), multi-resolution valley
bottom flatness (MVBF), slope, aspect, slope length (LS), distance from the river, distance from the fault, profile
curvature, plane curvature, altitude, land use/land cover (LULC), and geology. We optimized the MABLR model
using a fuzzy logic supervised (FLS) approach with 184 iterations and then validated the results using accuracy
assessment metrics including the κ coefficient, root-mean-square error (RMSE), receiver operating characteristics
(ROC), and the precision-recall curve (PRC). Our model had superior predictive performance among the models
tested, with higher overall goodness-of-fit and validation values according to the κ coefficient (0.819 and 0.781,
respectively), ROC (0.917 and 0.838), and PRC (0.931 and 0.872). Our experimental results demonstrate that
MABLR is more effective at reducing bias and variance error than other constituent machine learning methods.
⁎
Corresponding authors at: Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of
Technology Sydney, NSW 2007, Australia (B. Pradhan). Geoscience Platform Division, Korea Institute of Geoscience and Mineral Resources (KIGAM), Gajeong-dong
30, Yuseong-gu, Daejeon 305-350, Republic of Korea (S. Lee).
E-mail addresses: [email protected] (B. Pradhan), [email protected] (S. Lee).
https://doi.org/10.1016/j.jhydrol.2019.124172
Received 3 July 2019; Received in revised form 30 August 2019; Accepted 23 September 2019
Available online 23 September 2019
0022-1694/ © 2019 Elsevier B.V. All rights reserved.
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
are tools often used for mapping groundwater. Although these methods regression spline (Zabihi et al., 2016), index of entropy (Al-Abadi and
provide detailed recognition of subsurface hydrogeological structures Shahid, 2015), boosted regression tree (Naghibi et al., 2016), multi-
(Helaly, 2017), they can be time-consuming and costly (Nampak et al., variate adaptive regression splines (Rahmati et al., 2019), artificial
2014). The development of geographic information systems (GIS), neural network model (Corsini et al., 2009), and aquifer sustainability
statistical techniques, machine learning models, and remote sensing factor (Smith et al., 2010).
data have led to advances in groundwater potential analyses (Yin et al., In most cases, statistical and machine learning models perform well;
2018). GIS and remote sensing technology have been used as spatial however, if the training sample size is inadequate, these models tend to
research tools in numerous environmental applications including hy- oversimplify reality. Different sources of uncertainties related to
drological studies and natural hazard risk assessments (Mojaddadi groundwater modeling can include the modeling process, input para-
et al., 2017; Rizeei et al., 2018a,b; Rizeei et al., 2016, 2018c). Recently, meters, and sampling approach (Refsgaard et al., 2007). The ensemble
machine learning and data mining methods have been implemented in evidential belief function (Mohammady et al., 2012) and tree-based
many groundwater studies due to their ability to recognize patterns model was proposed to create the groundwater potential map (Naghibi
within inventory datasets and nonlinear relationships between para- et al., 2019). The development of ensemble models has allowed the
meters (Naghibi et al., 2018). integration of a base-learner approach with a prime algorithm to
Numerous forms of GIS-based and machine learning models have achieve more robust models that can be applied over large study areas,
been applied for groundwater potential application, including multi- where data coverage can be inconsistent (Naghibi et al., 2017). How-
criteria decision analysis (Kaliraj et al., 2014; Pradhan, 2010), fre- ever, the application of hybrid models should be explored for different
quency ratio (Guru et al., 2017; Oh et al., 2011; Rahmati et al., 2016), regions to determine the optimum model in terms of accuracy, ro-
Dempster-Shafer theory (Rahmati and Melesse, 2016), weights-of-evi- bustness, overfitting, and sensitivity to scarce data (Rahmati et al.,
dence modelling (Corsini et al., 2009; Ghorbani Nejad et al., 2017), 2018).
Self-learning Random Forests (Sameen et al., 2018), logistic regression To reduce these modeling uncertainties, we coupled a multi-adap-
(Ozdemir, 2011; Rizeei et al., 2018a), decision tree (Chenini et al., tive boosting hybrid model (MultiAdaBoosting) based on a decision-
2010), evidential belief function (Mogaji et al., 2015), the logistic committee technique that combines adaptive boosting (AdaBoost) with
model tree (Rahmati et al., 2018) certainty factor (Razandi et al., wagging, with logistic regression (LR), a robust model with strict ex-
2015), analytical hierarchy process (Adiat et al., 2012; Yin et al., 2018), pectations prior to training (Pradhan, 2010), to develop the ensemble
the statistical index (Falah et al., 2017), multivariate adaptive MABLR model. Although, MultiAdaBoosting is one of the powerful
2
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
3
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
input. Pixels with a value of zero are assigned to flat regions. aquifer.
TPI indicates the position of each cell, and is calculated as follows The MVBF index reflects the valley bottom characteristics of flatness
(De Reu et al., 2013; Guisan et al., 1999): and lowness. Flatness is measured using the inverse of the slope, and
lowness is measured using ranking elevation with respect to a circular
Epixel
TPI = surrounding area. These two measures, both scaled from 0 to 1, are
Esurrounding (1) combined by multiplication and can be interpreted as fuzzy set mem-
bership functions (Gallant and Dowling, 2003; Kaufmann, 1975). LS is a
where Epixel is the altitude of the cell and Esurrounding is the mean altitude
combination of slope gradient (S) and slope length (L). We adopted an
of the neighboring pixels. High TPI values indicate upper slopes, while
extensively used method for calculating LS, as follows:
low values of TPI show lower slopes where the potential of the
groundwater aquifer is high. The MVBF index links between size and A 0.4 sinβ ⎞1.3
flatness of valley bottoms, which was incorporated into the algorithm LS = ⎛ s ⎞ ⎛
⎝ 22.13 ⎠ ⎝ 0.0896 ⎠ (2)
by reducing the slope threshold. Zero value specifies erosional terrain
with less possibility of groundwater aquifer, while values above 1 in- where A is the accumulated flow of the unit stream power theory,
dicating areas of deposition with much productive groundwater which considers sediments and water, and β is the slope in degrees.
4
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
Fig. 3. (continued)
Basically, a low value of LS is more probable for a productive topographic index can be estimated with respect to grid spacing and
groundwater aquifer. terrain roughness by comparing the relationship between the topo-
SPI and TWI are water-related parameters calculated as follows graphic index surface and reference data.
(Gokceoglu et al., 2005): TRI is another morphological parameter widely used in ground-
water analyses; it is calculated in this study as follows:
SPI = Astanβ , (3)
TRI = Abs(max2 − min2), (5)
As ⎞
TWI = ln ⎛⎜ ⎟,
where max and min represent the largest and smallest values of cells in
⎝ tanβ ⎠ (4)
nine rectangular neighborhoods of altitude values.
where As is the catchment area or flow accumulation (m2 m−1) and β is LULC types are also primary factors that strongly contribute to
the local slope gradient measured in degrees. SPI indicates the erosive groundwater potential modeling. A detailed understanding of LULCs
power of water flow. TWI represents the effects of topography on runoff bears extreme significance for environmental and natural hazards
generation and the amount of flow accumulation at any location within (Rizeei et al., 2016). Lithology and geology are also important para-
the river catchment (Gokceoglu et al., 2005). The accuracy of a meters used to detect sensitive groundwater aquifer areas. Soil type
5
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
Fig. 3. (continued)
directly affects the drainage process via characteristics such as texture, 2012). The FLS evaluated hyper-parameters by a search run iteratively
permeability degree, and structure. Lithological information regarding from a random vertex that calculated the ideal value among the
the permeability of rocks is also required. The study area contained available domain. After all runs were assessed, the optimal hyper-
rocks from 142 different lithology classes. parameter configuration was selected within 184 iterations according
Variation in the factors contributing to the behavior and activity of to evaluation metrics. The optimal hyper-parameters for all proposed
groundwater cause ambiguity in the overlaying process. Therefore, all models are summarized in Table 1.
factors were normalized to a common scale in the feature raster before
overlaying (Youssef et al., 2015; Mojaddadi et al., 2017; Fanos and 2.3. Theory of the LR, SVM, MLP, and MABLR models
Pradhan, 2019).
LR is a widely used multivariate statistical model that can be ap-
2.2. Model optimization plied to continuous or discrete data of any distribution or raster format
6
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
(Lee and Sambath, 2006a,b). It was proposed by McFadden (1974) to 2.4. Evaluation methods
measure probability of occurrence depending on contributing para-
meters. LR can be used to evaluate relationships among binary depen- The following evaluation metrics were applied to assess the accu-
dent variables over nominal and scalar values of independent variables racy of groundwater potential models: RMSE, κ coefficient, ROC, and
(Shirzadi et al., 2012). PRC.
SVM was designed on the basis of statistical learning theory to The κ coefficient measures the overall accuracy of the model among
minimize operational uncertainty (Yao et al., 2008). This process con- all correctly assigned samples on a diagonal basis in the error matrix
verts nonlinear structures into linear structures according to hyperplane allocated by the full dataset (Ridd and Liu, 1998). The κ coefficient is
creation (Tehrany et al., 2014). A separate hyperplane is created for the calculated as follows:
original space with n coordinates among points within two different r r
M ∑i = 1 x ii − ∑i = 1 xi + x + i
categories (Marjanović et al., 2011). The hyperplane separates training K= r
datasets based on a kernel function of the SVM. Support vectors are M2 − ∑i = 1 xi + x + i (6)
recognized as neighboring training vertices of the ideal hyperplane. The
where r reflects the total number of rows in the error matrix, xii is
goal of the SVM model is to recognize the ideal separating hyperplane
observation i, xi and x + 1 are the minimal totals, and M is the set of
range.
observations. ROC curves are designed to evaluate and visualize the
The MLP algorithm is a feed-forward artificial neural network (NN)
performance of an analytical model; they indicate sensitivity or a true
that uses nodes linked by input signals and numeric weights to produce
positive rate (TP) associated with a decision threshold on the y-axis,
layers that receive, process, and display output (Harun et al., 2010).
and specificity or false positive rate (FP) on the x-axis (Fawcett, 2006),
Back-propagation is applied to reduce errors accumulated via the re-
thus representing the positive and negative probability, respectively,
petitive approach. NNs have successfully been utilized in remote sen-
that a pixel is classified correctly. The area under the ROC curve esti-
sing applications. Limitations of the MLP model include high compu-
mates the overall accuracy of the model (Nampak et al., 2014; Pradhan,
tational costs and overlearning (Mia and Dhar, 2016).
2010). However, evaluation of the model solely by visual interpretation
MultiAdaBoosting merges AdaBoost with wagging to produce a
of ROC can be misleading; thus, the precision-recall curve (PRC) is a
decision-committee model (Webb, 2000; Bui et al., 2016) that reduces
complementary evaluation metric that is useful for imbalanced data-
both variance and bias. Although it cannot be applied for committees
sets. The PRC shows the correlation between the positive predictive
of < 10 members, MultiAdaBoosting exhibits greater error reduction
value (PPV) or precision and sensitivity for all possible pixels, from
than all other relative committed algorithms (Kotsiantis et al., 2007). In
which TP and FP can be calculated. The PRC graph can be plotted by
comparison, MABLR uses LR for classifier-based learning to generate
dividing sensitivity by PPV. The x-axis represents recall or sensitivity,
decision committees with less error than either wagging or Multi-
and the y-axis represents precision. Each point on the PRC graph thus
AdaBoosting, even for a large cross-section of datasets (Webb, 2000).
represents a selected cut-off. A perfect model will have a ROC and PRC
MABLR is more efficient than MultiAdaBoosting due to its matching
of 1, whereas a value approaching 0 indicates an inaccurate model.
parallel execution algorithms. The steps of MABLR implementation are
RMSE is used to evaluate differences between the observed sample
shown in Fig. 4.
values and predicted model values. RMSD is the square root of the
All classifiers determined by wagging are independent from all
second trial or the quadratic mean of the deviations from observed
others, permitting parallel multiplication and creating uncertainty in
values to predicted values (Hyndman and Koehler, 2006). RMSE was
the MultiAdaBoosting model at the sub-committee class. MABLR im-
calculated as follows:
proves error reduction compared to other approaches, including bag-
ging decision trees, wagging, and MultiAdaBoosting, particularly at n
∑i = 1 (Xtest − Xtrain )2
∊t < 10, when variance is amplified, thus reducing the frequency at RMSE =
n (7)
which the central tendency is created and therefore reducing its ability
to contribute to uncertainty. where Xtest is the set of testing values and Xtrain is the set of training
values at i.
7
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
Fig. 5. Groundwater aquifer potential maps calculated by a) LR, b) MLP, c) SVM, and d) MABLF models.
In particular, MABLR results indicated that locations with the groundwater potential was SPI, with a weight of 4.844. Variation in SPI
highest groundwater aquifer potential were mainly situated in the can directly increase or decrease groundwater potential. Plane curva-
western and southwestern regions of the study area (Fig. 5). By con- ture and MVBF were the second and third most influential factors, with
trast, very low groundwater potential was assigned to eastern and weights of 4.315 and 4.240, respectively. These factors significantly
northwestern regions of the study area. Among a total of 59 productive affected runoff behavior and further delineated areas of groundwater
wells, 47 were assigned to very high and high groundwater aquifer concentration. Other hydrological and morphological factors including
potential zones, indicating the high precision of the MABLR model. TPI, TRI, and SL also contributed greatly to groundwater potential
The MABLR model was used to extract the degree of contribution of zones, with weights of 3.076, 3.039, and 2.537, respectively. Altitude,
each factor (Fig. 6). The most effective parameter in determining index, TWI, and profile curvature made moderate contributions
Fig. 6. The assigned weightage to each contributing factor by the MABLR model.
8
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
Table 2
The results of goodness-of-fit and validation evaluation of all the applied models.
Goodness of fit Validation
9
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
goodness-of-fit and validation assessment, respectively. SVM placed Declaration of Competing Interest
third among all accuracy assessment metrics. SVM RMSE values in-
dicated the highest error in terms of success rate (0.3119), and the The authors declare that they have no known competing financial
second lowest error in terms of validation rate (0.3217). interests or personal relationships that could have appeared to influ-
LR showed the lowest accuracy among all models, with κ coeffi- ence the work reported in this paper.
cient, ROC, and PRC values of 0.5569, 0.822, and 0.8685, respectively,
in terms of goodness-of-fit and 0.5401, 0.745, and 0.8116 in terms of Acknowledgements
validation. RMSE values indicated that LR had a slightly higher success
rate (0.2937) than the SVM and the worst performance among all This research was supported by the Basic Research Project of the
models (0.4704). Korea Institute of Geoscience and Mineral Resources (KIGAM) and
In general, all models examined in this study had an acceptable Science and Technology Internationalization Project (NRF-
amount of uncertainty and high goodness-of-fit. The well-calibrated 2016K1A3A1A09915721) funded by the Ministry of Science and ICT.
ensemble MABLR model exhibited the highest performance for mod- The research is supported by the Centre for Advanced Modelling and
eling groundwater aquifer potential. Geospatial Information Systems (CAMGIS), University of Technology
Sydney under grant numbers: 323930, 321740.2232335;
321740.2232424 and 321740.2232357.
4. Conclusion The English in this document has been checked by at least two
professional editors, both native speakers of English. For a certificate,
Sustainable groundwater aquifer management requires precise please see: http://www.textcheck.com/certificate/189N3i.
modeling to accurately and reliably simulate conditions in nature.
Modeling groundwater aquifer potential is a delicate process involving Appendix A. Supplementary data
the estimation of several morphological and hydrological parameters.
Several techniques have been proposed for groundwater potential Supplementary data to this article can be found online at https://
mapping; however, not all can be applied in data-scarce regions where doi.org/10.1016/j.jhydrol.2019.124172.
bias and variance are high, as they tend toward oversimplification.
Although, MultiAdaBoosting is one of the powerful adaptive boosting References
classifiers that can classify multiple classes even on complex recogni-
tion problems, yet it is sensitive to the existence of the outliers in the Adiat, K., Nawawi, M., Abdullah, K., 2012. Assessing the accuracy of GIS-based ele-
dataset which is very common in groundwater domain, as well as mentary multi criteria decision analysis as a spatial prediction tool–a case of pre-
dicting potential zones of sustainable groundwater resources. J. Hydrol. 440, 75–89.
overfitting problems. Therefore, we proposed the ensemble MABLR, Aghdam, I.N., Varzandeh, M.H.M., Pradhan, B., 2016. Landslide susceptibility mapping
which reduces bias and variance in the dataset in the Gyeongsangbuk- using an ensemble statistical index (Wi) and adaptive neuro-fuzzy inference system
do basin of South Korea. The integrated MultiAdaBoosting with the (ANFIS) model at Alborz Mountains (Iran). Environ. Earth Sci. 75 (7), 553. https://
doi.org/10.1007/s12665-015-5233-6.
actual function of LR caused less sensitivity on outliers, training dis- Al-Abadi, A.M., Shahid, S., 2015. A comparison between index of entropy and catastrophe
tribution that resulted in a tangible reduction of overfitting problem theory methods for mapping groundwater potential in an arid region. Environ. Monit.
with less dependency on modification of hyper-parameters. Assess. 187 (9), 576.
Bui, D.T., Ho, T.-C., Pradhan, B., Pham, B.-T., Nhu, V.-H., Revhaug, I., 2016. GIS-based
Several contributing factors were assessed using a dataset of specific modeling of rainfall-induced landslides using data mining-based functional trees
capacity and transmissivity for 169 well locations. Initially, we applied classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks. Environ.
a forward stepwise LR algorithm to identify 15 significantly con- Earth Sci. 75 (14), 1101. https://doi.org/10.1007/s12665-016-5919-4.
Botzen, W., Aerts, J., Van den Bergh, J., 2013. Individual preferences for reducing flood
tributing morphological factors: TWI, TRI, SPI, TPI, MRVBF, slope, as-
risk to near zero through elevation. Mitig. Adapt. Strat. Gl. 18 (2), 229–244.
pect, SL, distance from the river, distance from fault, profile curvature, Chenini, I., Mammou, A.B., El May, M., 2010. Groundwater recharge zone mapping using
plane curvature, altitude, LULC, and geology. Then we developed a new GIS-based multi-criteria analysis: a case study in Central Tunisia (Maknassy Basin).
robust ensemble method, coupling LR with the MultiAdaBoosting Water Resour. Manage. 24 (5), 921–939.
Corsini, A., Cervi, F., Ronchetti, F., 2009. Weight of evidence and artificial neural net-
technique to construct the MABLR model, which showed higher per- works for potential groundwater spring mapping: an application to the Mt. Modino
formance than other well-known machine learning methods including area (Northern Apennines, Italy). Geomorphology 111 (1–2), 79–87.
MPL, SVM, and standalone LR. We applied FLS to successfully retrieve De Reu, J., Bourgeois, J., Bats, M., Zwertvaegher, A., Gelorini, V., De Smedt, P., Chu, W.,
Antrop, M., Maeyer, P.D., Finke, P., Meivenne, M.V., Verniers, J., Crombe, P., 2013.
optimal hyper-parameter values for the implemented models. The Application of the topographic position index to heterogeneous landscapes.
model results showed that MABLR had the best accuracy and efficiency Geomorphology 186, 39–49.
based on evaluation by RMSE, κ coefficient, ROC, and PRC. The most Dehnavi, A., Aghdam, I.N., Pradhan, B., Varzandeh, M.H.M., 2015. A new hybrid model
using step-wise weight assessment ratio analysis (SWARA) technique and adaptive
influential contributing factors were identified as SPI, plan curvature, neuro-fuzzy inference system (ANFIS) for regional landslide hazard assessment in
and MRVBF. Visual interpolation of high groundwater aquifer potential Iran. Catena 135, 122–148. https://doi.org/10.1016/j.catena.2015.07.020.
areas showed that they were located in low-elevation zones near riv- Falah, F., Ghorbani Nejad, S., Rahmati, O., Daneshfar, M., Zeinivand, H., 2017.
Applicability of generalized additive model in groundwater potential modelling and
erbanks whereas low potential areas were located in high-elevation comparison its performance by bivariate statistical methods. Geocarto Int. 32 (10),
areas with steep slopes. Our results will be valuable for evaluating 1069–1089.
groundwater studies and successive model development to further re- Fanos, A.M., Pradhan, B., 2019. A spatial ensemble model for rockfall source identifica-
tion from high resolution LiDAR data and GIS. IEEE Access. 7, 74570–74585. https://
duce uncertainties and consider the morphological factors that influ-
doi.org/10.1109/ACCESS.2019.2919977.
ence the precision of groundwater potential modeling. The main barrier Fawcett, T., 2006. An introduction to ROC analysis. Pattern Recogn. Lett. 27 (8),
of this research was using the contributing factors with a moderate 861–874.
spatial resolution, which reduced the quality of groundwater mapping. Gallant, J.C., Dowling, T.I., 2003. A multiresolution index of valley bottom flatness for
mapping depositional areas. Water Resour. Res. 39 (12).
Thus, it is suggested to use the 1-meter spatial resolution to leverage the Ghorbani Nejad, S., Falah, F., Daneshfar, M., Haghizadeh, A., Rahmati, O., 2017.
final map precision. Since the proposed model has the capability of Delineation of groundwater potential zones using remote sensing and GIS-based data-
modeling the functions with scare input data, it is also recommended driven models. Geocarto Int. 32 (2), 167–187.
Gokceoglu, C., Sonmez, H., Nefeslioglu, H.A., Duman, T.Y., Can, T., 2005. The 17 March
being experimented on other probability application such as landslide 2005 Kuzulu landslide (Sivas, Turkey) and landslide-susceptibility map of its near
that has a smaller number of inventory datasets. However, the proposed vicinity. Eng. Geol. 81 (1), 65–83.
model should be implemented in multiple regions to test its transfer- Guisan, A., Weiss, S.B., Weiss, A.D., 1999. GLM versus CCA spatial modeling of plant
species distribution. Plant Ecol. 143 (1), 107–122.
ability and reliability before it can be applied to assess the vulnerability Guru, B., Seshan, K., Bera, S., 2017. Frequency ratio model for groundwater potential
of wells.
10
H.M. Rizeei, et al. Journal of Hydrology 579 (2019) 124172
mapping and its sustainable management in cold desert, India. J. King Saud Univ. Sci. Pradhan, B., 2010. Remote sensing and GIS-based landslide hazard analysis and cross-
29 (3), 333–347. validation using multivariate logistic regression model on three test areas in
Harun, N., Dlay, S.S., Woo, W.L., 2010. Performance of keystroke biometrics authenti- Malaysia. Adv. Space Res. 45 (10), 1244–1256.
cation system using multilayer perceptron neural network (MLP NN), Pradhan, B., Lee, S., 2010. Regional landslide susceptibility analysis using back-propa-
Communication Systems Networks and Digital Signal Processing (CSNDSP), 2010 7th gation neural network model at Cameron Highland, Malaysia. Landslides 7 (1),
International Symposium on. IEEE. pp. 711–714. 13–30. https://doi.org/10.1007/s10346-009-0183-2.
Helaly, A.S., 2017. Assessment of groundwater potentiality using geophysical techniques Rahmati, O., Melesse, A.M., 2016. Application of Dempster-Shafer theory, spatial analysis
in Wadi Allaqi basin, Eastern Desert, Egypt-Case study. NRIAG J. Astron. Geophys. 6 and remote sensing for groundwater potentiality and nitrate pollution analysis in the
(2), 408–421. semi-arid region of Khuzestan. Iran. Sci. Total Environ. 568, 1110–1123.
Hong, H., Liu, J., Zhu, A.X., Shahabi, H., Pham, B.T., Chen, W., Pradhan, B., Tien Bui, D., Rahmati, O., Moghaddam, D.D., Moosavi, V., Kalantari, Z., Samadi, M., Lee, S., Tien Bui,
2017. A novel hybrid integration model using support vector machines and random D., 2019. An automated python language-based tool for creating absence samples in
subspace for weather-triggered landslide susceptibility assessment in the Wuning groundwater potential mapping. Remote Sens. 11 (11), 1375.
area (China). Environ. Earth. Sci. 76, 652. https://doi.org/10.1007/s12665-017- Rahmati, O., Naghibi, S.A., Shahabi, H., Bui, D.T., Pradhan, B., Azareh, A., Melesse, A.M.,
6981-2. 2018. Groundwater spring potential modelling: comprising the capability and ro-
Hosmer Jr, D.W., Lemeshow, S., Sturdivant, R.X., 2013. Applied Logistic Regression. John bustness of three different modeling approaches. J. Hydrol. 565, 248–261.
Wiley & Sons, pp. 398. Rahmati, O., Pourghasemi, H.R., Melesse, A.M., 2016. Application of GIS-based data
Hyndman, R.J., Koehler, A.B., 2006. Another look at measures of forecast accuracy. Int. J. driven random forest and maximum entropy models for groundwater potential
Forecast. 22 (4), 679–688. mapping: a case study at Mehran Region, Iran. Catena 137, 360–372.
Jothibasu, A., Anbazhagan, S., 2016. Modeling groundwater probability index in Razandi, Y., Pourghasemi, H.R., Neisani, N.S., Rahmati, O., 2015. Application of analy-
Ponnaiyar River basin of South India using analytic hierarchy process. Model. Earth tical hierarchy process, frequency ratio, and certainty factor models for groundwater
Syst. Environ. 2 (3), 109. potential mapping using GIS. Earth Sci. Inform. 8 (4), 867–883.
Kaliraj, S., Chandrasekar, N., Magesh, N., 2014. Identification of potential groundwater Refsgaard, J.C., van der Sluijs, J.P., Højberg, A.L., Vanrolleghem, P.A., 2007. Uncertainty
recharge zones in Vaigai upper basin, Tamil Nadu, using GIS-based analytical hier- in the environmental modelling process–a framework and guidance. Environ. Model.
archical process (AHP) technique. Arab. J. Geosci. 7 (4), 1385–1401. Softw. 22 (11), 1543–1556.
Kaufmann, A., 1975. Introduction to the Theory of Fuzzy Subsets. Academic Pr, pp. 2. Ridd, M.K., Liu, J., 1998. A comparison of four algorithms for change detection in an
Kotsiantis, S.B., Zaharakis, I., Pintelas, P., 2007. Supervised machine learning: a review of urban environment. Remote Sens. Environ. 63 (2), 95–100.
classification techniques. Emerg. Artificial Intell. Appl. Comput. Eng. 160, 3–24. Rizeei, H.M., Azeez, O.S., Pradhan, B., Khamees, H.H., 2018a. Assessment of groundwater
Kumar, P., Bansod, B.K., Debnath, S.K., Thakur, P.K., Ghanshyam, C., 2015. Index-based nitrate contamination hazard in a semi-arid region by using integrated parametric
groundwater vulnerability mapping models using hydrogeological settings: a critical IPNOA and data-driven logistic regression models. Environ. Monit. Assess. 190 (11),
evaluation. Environ. Impact Assess. 51, 38–49. 633.
Lee, S., Sambath, T., 2006a. Landslide susceptibility mapping in the Damrei Romel area, Rizeei, H.M., Pradhan, B., Saharkhiz, M.A., 2017. Surface Runoff Estimation and
Cambodia using frequency ratio and logistic regression models. Environ. Geol. 50 (6), Prediction Regarding LULC and Climate Dynamics Using Coupled LTM, Optimized
847–855. ARIMA and Distributed-GIS-Based SCS-CN Models at Tropical Region, GCEC 2017.
Lee, S., Sambath, T., 2006b. Landslide susceptibility mapping in the Damrei Romel area, Springer, pp. 1103–1126.
Cambodia using frequency ratio and logistic regression models. Environ. Geol. 50, Rizeei, H.M., Pradhan, B., Saharkhiz, M.A., 2018b. An integrated fluvial and flash pluvial
847–855. model using 2D high-resolution sub-grid and particle swarm optimization-based
Liao, S.-H., Chu, P.-H., Hsiao, P.-Y., 2012. Data mining techniques and applications–A random forest approaches in GIS. Complex Intell. Syst. 1–20.
decade review from 2000 to 2011. Expert Syst. Appl. 39 (12), 11303–11311. Rizeei, H.M., Pradhan, B., Saharkhiz, M.A., 2018c. Surface runoff prediction regarding
Manap, M.A., Sulaiman, W.N.A., Ramli, M.F., Pradhan, B., Surip, N., 2013. A knowledge- LULC and climate dynamics using coupled LTM, optimized ARIMA, and GIS-based
driven GIS modeling technique for groundwater potential mapping at the Upper SCS-CN models in tropical region. Arab. J. Geosci. 11 (3), 53.
Langat Basin, Malaysia. Arab. J. Geosci. 6 (5), 1621–1637. Rizeei, H.M., Saharkhiz, M.A., Pradhan, B., Ahmad, N., 2016. Soil erosion prediction
Marjanović, M., Kovačević, M., Bajat, B., Voženílek, V., 2011. Landslide susceptibility based on land cover dynamics at the Semenyih watershed in Malaysia using LTM and
assessment using SVM machine learning algorithm. Eng. Geol. 123 (3), 225–234. USLE models. Geocarto Int. 31 (10), 1158–1177.
McFadden, D., 1974. Conditional logit analysis of qualitative choice behavior. In: Sameen, M.I., Pradhan, B., Lee, S., 2018. Self-learning random forests model for mapping
Zarembka, P. (Ed.), Frontiers in Econometrics. Academic Press, New York, pp. groundwater yield in data-scarce areas. Nat. Resour. Res. 1–19.
105–142. Shirzadi, A., Saro, L., Joo, O.H., Chapi, K., 2012. A GIS-based logistic regression model in
Mia, M., Dhar, N.R., 2016. Prediction of surface roughness in hard turning under high rock-fall susceptibility mapping along a mountainous road: salavat Abad case study,
pressure coolant using Artificial Neural Network. Measurement 92, 464–474. Kurdistan, Iran. Nat. Hazards. 64, 1639–1656.
Mogaji, K., Lim, H., Abdullah, K., 2015. Regional prediction of groundwater potential Smith, A.J., Walker, G., Turner, J., 2010. Aquifer Sustainability Factor: A Review of
mapping in a multifaceted geology terrain using GIS-based Dempster-Shafer model. Previous Estimates. International Association of Hydrogeologists (AIH) and the
Arab. J. Geosci. 8 (5), 3235–3258. Geological Society of Australia (GSA), pp. EP104589.
Mojaddadi, H., Pradhan, B., Nampak, H., Ahmad, N., Ghazali, A.H.B., 2017. Ensemble Tehrany, M.S., Pradhan, B., Jebur, M.N., 2013. Spatial prediction of flood susceptible
machine-learning-based geospatial approach for flood risk assessment using multi- areas using rule based decision tree (DT) and a novel ensemble bivariate and mul-
sensor remote-sensing data and GIS. Geomat. Nat. Haz. Risk. 8 (2), 1080–1102. tivariate statistical models in GIS. J. Hydrol. 504, 69–79.
https://doi.org/10.1080/19475705.2017.1294113. Tehrany, M.S., Pradhan, B., Jebur, M.N., 2014. Flood susceptibility mapping using a novel
Mojaddadi Rizeei, H., Pradhan, B., Saharkhiz, M.A., 2019. Urban object extraction using ensemble weights-of-evidence and support vector machine models in GIS. J. Hydrol.
Dempster Shafer feature-based image analysis from worldview-3 satellite imagery. 512, 332–343.
Int. J. Remote Sens. 40 (3), 1092–1119. Tong, D., Murray, A.T., 2012. Spatial optimization in geography. Ann. Assoc. Am. Geogr
Mohammady, M., Pourghasemi, H.R., Pradhan, B., 2012. Landslide susceptibility map- 102 (6), 1290–1309.
ping at Golestan Province, Iran: a comparison between frequency ratio, Dempster- Webb, G.I., 2000. Multiboosting: a technique for combining boosting and wagging. Mach.
Shafer, and weights-of-evidence models. J. Asian Earth Sci. 61 (15), 221–236. Learn. 40 (2), 159–196.
https://doi.org/10.1016/j.jseaes.2012.10.005. Woo, M.W., Daud, W.R.W., Tasirin, S.M., Talib, M.Z.M., 2007. Optimization of the spray
Naghibi, S.A., Moghaddam, D.D., Kalantar, B., Pradhan, B., Kisi, O., 2017. A comparative drying operating parameters—A quick trial-and-error method. Dry Technol. 25 (10),
assessment of GIS-based data mining models and a novel ensemble model in 1741–1747.
groundwater well potential mapping. J. Hydrol. 548, 471–483. https://doi.org/10. Yao, X., Tham, L., Dai, F., 2008. Landslide susceptibility mapping based on support vector
1016/j.jhydrol.2017.03.020. machine: a case study on natural slopes of Hong Kong, China. Geomorphology 101
Naghibi, S.A., Pourghasemi, H.R., Dixon, B., 2016. GIS-based groundwater potential (4), 572–582.
mapping using boosted regression tree, classification and regression tree, and random Yin, H., Shi, Y., Niu, H., Xie, D., Wei, J., Lefticariu, L., Xu, S., 2018. A GIS-based model of
forest machine learning models in Iran. Environ. Monit. Assess. 188 (1), 44. potential groundwater yield zonation for a sandstone aquifer in the Juye Coalfield,
Naghibi, S.A., Dolatkordestani, M., Rezaei, A., Amouzegari, P., Heravi, M.T., Kalantar, B., Shangdong, China. J. Hydrol. 557, 434–447.
Pradhan, B., 2019. Application of rotation forest with decision trees as base classifier Youssef, A.M., Al-Kathery, M., Pradhan, B., 2015. Landslide susceptibility mapping at Al-
and a novel ensemble model in spatial modeling of groundwater potential. Environ. Hasher area, Jizan (Saudi Arabia) using GIS-based frequency ratio and index of en-
Monit. Assess. 191 (4), 248. tropy models. Geosci. J. 19 (1), 113–134. https://doi.org/10.1007/s12303-014-
Naghibi, S., Vafakhah, M., Hashemi, H., Pradhan, B., Alavi, S., 2018. Groundwater aug- 0032-8.
mentation through the site selection of floodwater spreading using a data mining Zabihi, M., Pourghasemi, H.R., Pourtaghi, Z.S., Behzadfar, M., 2016. GIS-based multi-
approach (case study: Mashhad plain, Iran). Water 10 (10), 1405. variate adaptive regression spline and random forest models for groundwater po-
Nampak, H., Pradhan, B., Manap, M.A., 2014. Application of GIS based data driven tential mapping in Iran. Environ. Earth Sci. 75 (8), 665.
evidential belief function model to predict groundwater potential zonation. J. Hydrol. Zare, M., Pourghasemi, H.R., Vafakhah, M., Pradhan, B., 2013. Landslide susceptibility
513, 283–300. mapping at Vaz Watershed (Iran) using an artificial neural network model: a com-
Oh, H.-J., Kim, Y.-S., Choi, J.-K., Park, E., Lee, S., 2011. GIS mapping of regional prob- parison between multilayer perceptron (MLP) and radial basic function (RBF) algo-
abilistic groundwater potential in the area of Pohang City, Korea. J. Hydrol. 399 rithms. Arab. J. Geosci. 6 (8), 2873–2888.
(3–4), 158–172. Zhang, Y., Maxwell, T., Tong, H., Dey, V., 2010. Development of a supervised software
Ozdemir, A., 2011. GIS-based groundwater spring potential mapping in the Sultan tool for automated determination of optimal segmentation parameters for ecognition.
Mountains (Konya, Turkey) using frequency ratio, weights of evidence and logistic ISPRS TC VII Symposium – 100 Years ISPRS, Vienna, Austria.
regression methods and their comparison. J. Hydrol. 411 (3–4), 290–308.
11