Hydrological Processes - 2021 - Refsgaard - Hydrological Process Knowledge in Catchment Modelling Lessons and
Hydrological Processes - 2021 - Refsgaard - Hydrological Process Knowledge in Catchment Modelling Lessons and
Abstract
Hydrological process knowledge has advanced significantly during the past six decades. During the same
period catchment models have undergone major developments including simple black box models, lumped
conceptual models, hydrological response unit models, spatially distributed process-based models and,
recently, the emergence of machine learning hybrid models. This development has been enabled by
improved understanding of hydrological processes together with ever increasing computer power and
improved availability and accessibility of data. During the first couple of decades, a key assumption
motivating the development towards increasing complexity of model codes was that more detailed process
description would lead to more accurate model simulations and enable prediction of impacts from human
activities that previous models were not able to provide. Subsequently, scientific tests showed that this is
very often not the case, leading towards a recognition of the importance of careful model evaluation
accounting for key uncertainties in data, model parameters and model conceptual understanding. We have
reviewed 54 model studies from the past 60 years and characterized them with respect to model type,
spatial discretization and model evaluation techniques. This showed clear development trends and
different strategies for enhancing hydrological process knowledge in models. In addition, we present a case
study, where we use two models for the same catchment. The models are identical except for the spatial
discretisation of 100m and 500m, respectively. The two models have an apparent equal performance
measured against standard calibration metrics, but nevertheless show large differences when considering
detailed process information such as partitioning of streamflow components and water table depth
patterns, that was not considered during the model calibration process. The paper discusses perspectives
for enhancing hydrological process knowledge in future catchment modelling concluding that the
emergence of big data is likely to become a major game changer.
Key words
catchment model, hydrological process understanding, spatial discretization, model evaluation, big data,
machine learning
Correspondence
Jens Christian Refsgaard, Department of Hydrology, Geological Survey of Denmark and Greenland, Øster
Voldgade 10, 1350 Copenhagen, Denmark.
Email: [email protected]
This article has been accepted for publication and undergone full peer review but has not been
through the copyediting, typesetting, pagination and proofreading process which may lead to
differences between this version and the Version of Record. Please cite this article as doi:
10.1002/hyp.14463
(Bergström & Forsman, 1973) and NAM (Nielsen & Hansen, 1973). The conceptual models were based on
sound hydrological process knowledge, although point scale process equations and data could not directly
be exploited because the computational unit of the models was the entire catchment. The next step
towards development of model types that could encapsulate more hydrological field data and process
knowledge were triggered by a blueprint from Freeze and Harlan (1979) and the development of the first
spatially distributed process-based (or physics-based) model codes, e.g. SHE (Abbott, Bathurst, Cunge,
O’Connell & Rasmussen, 1986a,b), IHDM (Beven, Calver & Morris, 1987) and THALES (Grayson, Moore &
McHahon, 1992).
During the first couple of decades of development towards ever increasing model code complexity, a key
hypothesis has been that more detailed process description with improved hydrological knowledge would
lead to more accurate model simulations and enable prediction of impacts from human activities that
previous models were not able to provide (Abbott, et al, 1986a). In the absence of hard facts from suitable
tests, the scientific discussion was to a large extent based on expectations and qualitative arguments and
the hypothesis was questioned (Beven, 1989). Since then, several studies have documented that this
hypothesis cannot be confirmed for cases, where the simulation target is confined to river discharge
(Perrin, Michel & Andreassian, 2001; Reed et al., 2004; Refsgaard & Knudsen, 1996). Instead of complex
models with sophisticated process descriptions, large data requirements and many parameters several
authors argued that models should be parsimonious, i.e., they should be as simple as possible and include a
minimum of parameters requiring calibration (Beven, 1989; Bergström, 1991; Jakeman & Hornberger,
1993; Perrin, Michel & Andreassian, 2003).
In spite of these apparent shortcomings and arguments for a different strategy, the developments towards
more detailed spatial representation and more sophisticated process descriptions continued, e.g., MIKE
SHE (Graham & Butts, 1995), SWAT (Arnold, Srinivasan, Muttiah & Williams, 1998), ParFlow-CLM (Kollet &
Maxwell, 2008), HydroGeoSphere (Therrien, McLaren, Sudicky & Park, 2010), mHM (Samaniego, Kumar &
Attinger, 2010). This resulted in model applications that are able to exploit almost all existing data on
climate, soil, vegetation, geology, topography and streams and hereby contribute to improved credibility
among stakeholders (De Lange et al., 2014; Højberg, Troldborg, Stisen, Christensen & Henriksen, 2013) as
well as models capable of assessing impacts of human activities such as hydropower development
(Refsgaard et al., 1998), groundwater abstraction (Sophocleous et al., 1999) and changes in climate and
land use (Mango, Melesse, McClain,Gann & Setegn, 2011).
Recent developments in data acquisition such as new geophysical techniques (Auken, Boesen &
Christiansen, 2017), new earth observation techniques (McCabe et al., 2017) and citizen data collection (Le
Coz et al., 2016) open a whole new chapter in hydrological modelling with easy access to an until now
unprecedented amount and type of data. Together with ever increasing computer power, this provides an
Altogether, the ever-increasing computer power, the improved understanding of hydrological processes
and the improved availability and accessibility of spatial data have enabled and motivated the development
of increasingly complex hydrological models with finer and finer spatial resolutions (Liu & Gupta, 2007). At
the same time, it has been widely recognized that use of sophisticated, high-resolution models in itself is no
guarantee for more accurate simulations. As an example, improved availability of high-resolution spatial
data from satellites has made it possible to test how well spatially distributed models are able to reproduce
Accepted Article
observed spatial patterns. The tests revealed that models performing well against observations of
streamflow and groundwater heads often were not able to simulate spatial patterns in land surface
temperature and evapotranspiration particularly well (Stisen et al., 2018; Demirel et al., 2018), implying
that the internal process representation or parameterization is deficient and that it therefore could be
argued that the streamflow simulations to some extent provided the right answer for the wrong reasons
(Kirchner, 2006).
This illustrates that theoretical process descriptions are not a sufficient measure of hydrological process
knowledge. A second, and equally important component, is model evaluation to account for model
parameter and conceptualization uncertainty as well as catchment specific information. In this respect,
factors contributing to constraining the reliability of complex models are mismatch of spatial scales (Beven,
1995; Blöschl & Sivapalan, 1995), overparameterization leading to equifinality (Beven, 2006), lack of good
quality data and inadequate model calibration. Consequently, a proper evaluation of model simulations and
the associated uncertainties are crucial for the credibility of hydrological model simulations (Klemes 1986;
Beven, 1989; Refsgaard 1996; Beven 2002; Jakeman, Letcher & Norton, 2006; Refsgaard, van der Sluijs,
Højberg & Vanrolleghem, 2007) and hence for enhancing our hydrological process knowledge.
Our three basic hypotheses are that i) a lot of hydrological process understanding has not been utilised in
state-of-the-art catchment modelling until now, ii) currently, hydrological modelling is at a development
stage where a relatively larger share of the existing hydrological process knowledge can be exploited, and
iii) such a development will call for further enhancing hydrological process understanding and model
evaluation strategies.
The objectives of the present study are therefore: i) to learn about the major drivers and obstacles to
utilising hydrological process representation in catchment modelling; and ii) to analyse and discuss
perspectives for enhancing hydrological process knowledge in future catchment modelling.
Methodology
Literature review
The literature review focuses on development of hydrological models during the past 60 years. The review
does not aim at providing a comprehensive review of hydrological modelling literature. Instead, we use
only a few selected references attempting to describe key model developments, seen from a hydrological
process knowledge perspective.
In this paper we use the term hydrological process knowledge as comprising two elements: i) theoretical
process understanding in terms of equations and literature knowledge on parameter values, and ii)
catchment specific information in terms of data, observations, and conceptual understanding. By combining
Our analysis focus has been on how the level of process knowledge has evolved during the past 60 years.
Accepted Article
Catchment models can also be classified into continuous simulation models and event models (Singh,
1995). Because event models are mainly relevant for flood simulation and often dominated by uncertainty
on initial conditions, we have chosen to focus on continuous simulation models and, with a few exceptions,
only include model studies with simulation periods longer than one year. Similarly, we have confined our
analysis to catchment models and excluded point scale models and global hydrological models.
To support the review, we define four proxies (code class, N, Δx, evaluation level - see below) for the
degree of hydrological knowledge embedded in a hydrological model.
Numerous hydrological model codes have been developed and a large number of classification schemes
have been proposed such as Woolhiser (1973), Singh (1995), Refsgaard (1996), Todini (2007), Pechlivanidis,
Jackson, McIntyre & Wheater (2011), Paniconi & Putti (2015), Hrachowitz & Clark (2017). In this paper we
use a classification scheme illustrated in Figure 1 that reflects the degree of hydrological process
understanding that can be built into the model code.
Black box models (BB) are empirical with mathematical equations and coefficients that are
developed and assessed purely from analyses of concurrent input and output time series without
considering hydrological processes understanding.
Lumped models (LU) include some concepts reflecting hydrological process understanding such as
root zone capacity, overland flow and baseflow, that are upscaled to represent hydrological
response at catchment scale. These models have often been denoted conceptual models, but we
do not prefer this term here, because the term conceptual is generally used to describe the
understanding of the system (Refsgaard & Henriksen, 2004; Hrachowitz & Clark, 2017) and other
hydrological model types also include conceptual understanding.
Process-based models (PB) apply point scale (or very small scale) equations of hydrological
processes and a spatially distributed representation of the catchment. These models have often
been denoted physically-based.
Hydrological Response Unit (HRU) models group areas with the same combination of some key
characteristics such as soil type, vegetation, topography and climate forcing within a catchment
The upper row of three boxes in Figure 1 (BB, LU, PB) are basic categories. The names of these classes
should be considered relative rather than absolute. For example, there are elements of lumping in all
process-based models, because the spatial representation will always be larger than point scale (Beven,
1989). In addition, many model codes do not fit nicely into one of these three boxes, because they include
components from more than one class. The commonly used HRU model codes constitute hybrids of LU and
PB with some process descriptions fitting into the LU box and others into the PB box. Finally, to provide
more clarity in our description and discussion of model developments in the remaining part of this paper
we have chosen to make separate classes for some of the most commonly used coupled model codes
including groundwater, water quality and atmospheric process descriptions.
We are aware that, even with these extensions to the three basic classes, some model codes cannot really
be fitted into one of our proposed classes. All classification schemes have some shortcomings, and so does
our proposed scheme, which has been chosen to support our objectives of discussing how the use of
hydrological process understanding in hydrological models has evolved during the past decades.
Model evaluation
This proxy is a measure of the extent to which catchment data and observations have been incorporated in
the modelling process to evaluate the reliability of the model. We define four levels with increasing
amounts of model evaluations (i.e. runs) to characterise this:
Accepted Article
Single (Level 1). In this case the hydrological model has only a single deterministic configuration for
a single catchment. The parameter values have been assessed from literature (no calibration) or
from relatively simple manual adjustments based on a single objective function (typically with
discharge as calibration target).
Multiple (Levels 2-4). In this case multiple evaluations have been performed with respect to three
aspects i) multiple data types used in objective functions for calibration; ii) multiple parameter sets;
and iii) multiple model structures. The level reflects how many of the three aspects have been
considered as multiple. As an example, a model study using one parameter set (no parameter
uncertainty), calibration only against discharge and three model codes will have level 2, while a
model using several parameter sets, that have been estimated through calibrated against both
discharge and soil moisture data and evaluated for three alternative model codes will have level 4.
Case study
The case study has been designed to investigate the importance of spatial discretization as one of the
potential obstacles in utilizing hydrological process representation in catchment modelling. We use two
models for the same catchment. The models are identical except for the spatial discretisation, which differs
by a factor of five in each horizontal direction. We then analyse how the hydrological process knowledge of
the two models differ due to the differences in spatial representation. First, we evaluate the performance
of the two models using standard metrics focusing on the accuracy of simulated streamflow and
groundwater heads when compared to field data used for calibration and validation tests. Next, we
evaluate how the two models simulate shallow groundwater depth as well as partitioning of streamflow
into baseflow, drainflow and overland flow, which are variables that have not been subject to calibration.
Finally, we analyse how the differences in spatial discretisation leads to differences in smaller scale
hydrological processes and discuss to which extent this may limit the model applicability at scales smaller
than those targeted by the calibration data.
Literature review
History of model code development
The description is divided into five periods, where we focus on what we perceive as dominating trends
within each of these periods. We acknowledge that many trends that dominate in one period typically start
with experimental developments in an earlier period and continue with consolidating developments in later
periods. Thus, many developments are overlapping a couple of periods. Hence, the timing of what is
described as developments in these periods should be seen as indicative only.
knowledge and therefore belong to the Black box (BB) class (Figure 1).
More sophisticated BB models were subsequently developed using statistical techniques. Examples of such
models include i) gauge to gauge correlation methods (Searcy, 1960); ii) Antecedent Precipitation Index
(WMO, 1994); and iii) Autoregressive Integrated Moving Average models (Box & Jenkins, 1970). Some of
these methods were originally developed for hand calculations while others were supported by mainframe
computer calculations.
These BB models have been widely used since then, both as stand-alone tools and as components
integrated in more sophisticated models. In addition, a large range of new BB-models have been developed
subsequently by use of first hydroinformatics and, in recent years, machine learning techniques (see
below).
The emergence of new sophisticated PB codes, coupled models and new parametrization schemes
(2000-2015)
Accepted Article
Facilitated by increasing computer power and improved numerical techniques a number of advanced PB
models were developed during the first decade of the millennium, e.g. INHM (VanderKwark & Loague,
2001), CATHY (Bixio et al., 2002; Camporese, Paniconi, Putti & Orlandini, 2010), GEOtop (Rigon, Bertoldi &
Over, 2006), ParFlow (Kollet & Maxwell, 2006) and HydroGeoSphere (Therrien et al., 2010).
Compared to one of the most advanced PB codes commonly used also before 2000 (MIKE SHE) these new
codes had more advanced numerical solvers and in some respects more detailed process descriptions, e.g.
3D instead of 1D Richards’ equation for variably saturated-unsaturated zone flow. Furthermore, some of
the codes were developed with couplings to land surface schemes and numerical weather models, e.g.
ParFlow-Noah-WRF (Maxwell et al., 2011).
Another development during this period was the development and widespread use of water quality
modules coupled to HRU codes. Examples of such HRU-WQ codes include SWAT (Arnold et al., 1998) and
HYPE (Lindström et al., 2010).
A larger focus on scalability and spatial pattern consistency paved the way for fundamentally different
parametrization schemes in PB models, such as the multiscale parameter regionalization scheme (MPR)
(Samaniego et al., 2010). By linking parameter values to attainable bio-geo-physical properties in adequate
spatial resolutions, this approach enabled seamless parametrization and better scaling properties of fluxes
(Kumar, Samaniego & Attinger, 2013). These developments have since been taken up by a range of model
codes and facilitated continental and global scale modelling (Mizukami et al., 2017).
Recent efforts have demonstrated that hydrological process understanding can be consolidated with
machine learning algorithms. This builds a novel class of process- or knowledge-guided machine learning
models (ML-HM) that, for instance, use simulation results from hydrological models as input to the
machine learning model (Koch, Berger, Henriksen & Sonnenborg, 2019; Konapala, Kao, Painter & Lu, 2020),
studies, which advanced the scientific frontier of hydrological modelling at the time of publishing, are
shown in Table 1 together with the four proxies. The table has been confined to include models of the LU,
PB and HRU classes, while the older BB and the very recent ML-HM models are not included. Based on the
models listed in this table Figures 2 and 3 show the developments in number of spatial calculation elements
(N) and their size (Δx), while Figure 4 shows the development in the evaluation level.
To prepare Table 1 we have made literature searches and attempted to select an ensemble of models that
can characterise the development of hydrological models. Any such selection comprises elements of
subjectivity and, given the self-citations in Table 1, we cannot claim to have avoided the same bias as when
hydrologists select a model code, namely that prior knowledge and legacy may play a role (Addor &
Melsen, 2018). The model studies listed in Table 1 should therefore not be considered one by one as the 54
most important model studies during the past 60 years. However, we would argue that the ensemble of the
54 studies reflects a reasonably representative sample of key model studies during this period.
The relationship between hydrological models and hydrological process knowledge is dual. On the one
hand, more and more advanced process descriptions have been built into model codes and more and
better local data have become available for modelling. On the other hand, more sophisticated
methodologies for evaluating parameter values and reliability of model simulations have contributed to
enhancing hydrological process knowledge for numerous specific catchments and as such also generically
for hydrology.
The two most important external factors that have enabled creation of enhanced hydrological process
knowledge during the past decades are increased computer power and better availability of data. In the
hydrological scientific community these new opportunities have been exploited by use of different
strategies. Looking at the proxies listed in Table 1 and shown in Figures 2-4, some key development tracks
appear:
Process descriptions in model codes. The level of theoretical process understanding imbedded in
the model codes increases progressively from BB to LU and PB model classes. From Figures 2-3 it is
seen that the first LU models were developed in the 1960s, while the HRU models appeared in the
1980s. The first PB models with continuous simulations of full catchments appeared in the 1990s,
while the first PB models appearing in the 1970s and 1980s due to computational constraints were
confined to simulation of single events and/or hillslopes. This development reflects that it has been
gradually possible to enhance the level of theoretical process descriptions in new emerging PB
model codes.
Spatial discretisation. The number of spatial grids (N) in model studies has increased significantly
over time from a single unit in the 1960s to more than 10 million units in some recent modelling
studies (Figure 2). This increase in N can clearly be seen both for HRU and PB models, while the
VIC model grids (Δx) in general decreased while the number of grid cells (N) increased over time.
Based on this, Melsen et al. (2016) argued that the process-based evaluation should keep pace with
the increased spatial resolution, which requires new and spatially distributed data sources for
model evaluation.
Evaluation. Figure 4 shows how the level of evaluation has gradually increased. During the first
couple of decades, hydrological modelling was dominated by deterministic, single models with
limited focus on uncertainties in data, parameters and model structures. This changed during the
1990s after the call for the hydrological modelling community to seriously consider uncertainty
(Beven, 1989) and when computer technology allowed execution of many model runs. Most recent
model studies in Table 1 perform evaluation at level 2 or higher implying that multiple data types
and/or uncertainties in parameter values and model structures are considered. Among the level
2/3/4 studies the most frequent factor contributing to increasing the evaluation level is multiple
calibration targets (23 studies) followed by multiple parameter values (11 studies) and multiple
model structures (7 studies). This reflects that multiple calibration targets are common in coupled
models (e.g. discharge and groundwater heads) and that parameter uncertainty is generally easier
to handle and better supported by built-in software tools than model structural uncertainty.
Recently, we have, however, seen an increasing attention on model structural uncertainty with
development of software platforms such as MMS (Leavesley, Markstrom, Restrepo & Viger, 2002),
SUMMA (Clark et al., 2015) and RAVEN (Craig et al., 2020). It may be noted that the only level 4
study included in Table 1 deals with modelling of karst hydrology, where uncertainties are
recognized as being very large (Hartmann et al., 2013). As comprehensive evaluation requires some
kind of ensemble modelling it can only be made at the cost of significantly reduced N, which can be
seen in Table 1.
Case study
Outline and scope of case study
Our case study builds upon a national scale coupled groundwater-surface water model based on the MIKE
SHE code (PB-GW). The model is initially set up and calibrated at a grid resolution of 500m for the entire
land phase of Denmark (43,000 km2). The calibration targets are primarily based on 305 discharge stations
and 24,000 groundwater monitoring wells. In order to map shallow groundwater table dynamics and meet
stakeholder requirements, a 100m version of the model was developed in parallel. Therefore, all model
parametrizations of the 500m model are applied in 100m resolution. After model calibration at 500m
resolution, the model is run in 100m resolution with full parameter transfer and model performance is
compared to the 500m model version utilized during parameter optimization. The national model is run in
observational dataset.
In order to investigate if different spatial resolutions affect the simulated hydrological processes, despite
the overall similarity in performance, we show simulated streamflow and its components for an arbitrary
branch in the DK5 model domain (Figure 6). Overall, the simulated discharge is very comparable between
the 100m and the 500m model. However, the streamflow components vary considerably. Baseflow is the
dominating source in both models, but more pronounced in the 500m model. The reduced baseflow of the
100m model is substituted with an increased drain flow and overland flow with respect to the 500m model.
The overland flow is relatively small in the 100m model and two orders of magnitude less in the 500m
model. This again underlines that standard hydrological data, such as discharge observations, cannot
discriminate between the two models even though they provide different process knowledge. The
apparent differences in simulated streamflow components will have implications for solute transport and
e.g. in-stream nitrate loads.
In order to further investigate the spatially explicit hydrological process knowledge provided by the 100m
and 500m models, we show the average simulated water levels in Figure 5. At regional scale, the two
models resemble each other visually. First when zooming at local scale, covering a few km2, differences in
simulated water levels emerge. The uppermost water table, which is represented by the water level maps
in Figure 5, is topography driven. Small scale variations in topography yield water level heterogeneity, such
as surficial water levels in valley bottoms and deeper water levels for smaller topographical features. This
heterogeneity can effectively be simulated by an increase in spatial resolution, from 500m to 100m,
without recalibrating the model parameters.
As traditional hydrological observations cannot discriminate between the two models, alternative
observations need to be consulted for evaluating. For this we processed a land surface temperature (LST)
dataset which can be utilized as a proxy for soil wetness conditions and thereby shallow water levels. The
process of evaporative cooling typically yields a lower LST for wet than for dry soils. The thermal sensors of
the Landsat satellites provide LST estimates at 30m spatial resolution, which can potentially explain
variability of water table depth for areas with homogenous land use. In order to evaluate the hydrological
process knowledge gained by the abovementioned models (500m and 100m) we have generated a long-
term (1997-2019) average LST map using Landsat data obtained from the Analysis Ready Data (ARD)
developed by the Global Land Analysis and Discovery team (GLAD), which is described in full detail by
Potapov et al. (2020).
We clearly observed that grids with a water level close to the surface are characterized with cooler LST,
whereas warmer grids are associated with a deeper water level. This link is only apparent in the 100m
model, while the 500m cannot reflect this. Such local scale differences are not discriminated by the
objective functions used for calibration, or from large scale pattern comparisons, but will have significant
impacts on groundwater surface water interactions and flowpaths. Based on the high-resolution evaluation
Discussion
Drivers and obstacles for utilizing hydrological process understanding in catchment
models
Accepted Article
The process equations in PB models are, in contrary, point scale equations assuming homogeneous
conditions, where catchment heterogeneity only can be represented through variability of parameter
values. This is typically done using field data such as soil types, vegetation classes and geological units and
assign different parameter values to each category. An important limitation of this approach is that use of
effective parameter values for each category does not account for all heterogeneity. It is for instance well
documented that the spatial variability of e.g. soil properties within one standard soil type at field scale is
very high and can significantly influence the water balance and solute transport at this scale (Djuurhus,
Hansen, Schelde & Jacobsen, 1999; Nielsen, Bigger & Erk, 1973). Hence, even with a spatially distributed
approach that enables PB models to explain a large part of the catchment heterogeneity, a considerable
part of the heterogeneity may still not be accounted for, which in some cases leads to performance
problems (Andersen, Refsgaard & Jensen, 2001; Hansen, Refsgaard, Hansen & Ernstsen, 2007). This
obstacle has led to the development of parameter regionalization strategies that link fully distributed
catchment properties, such as soil texture or vegetation indices, to effective model parameters via transfer
functions. The transfer functions have just a few parameters that need to be calibrated and result in
physically meaningful, i.e. seamless, parameter distributions. Such a regionalization is implemented into
mHM (Samaniego et al., 2010), but can in theory be added to any model, which has been demonstrated by
Beck et al. (2016), Mizukami et al., (2017), Demirel et al., (2018).
Another problem is that spatial discretization of a model domain does not allow a model to resolve
variations of e.g. topography and water levels within a computational unit. As illustrated in our case study
this may adversely affect the realism of simulated hydrological processes. While Refsgaard (1996) reported
that a 500m model calibrated against streamflow and groundwater heads for a neighbouring area showed
a deteriorating performance when applied with larger grid sizes (1000m/2000m/4000m), the 100m model
in our case study at a first glance showed similar performance to the 500m model for which the calibration
was made. This indicates on one hand, that a 500 m spatial resolution in our hydrogeological setting may
be sufficient for reproducing streamflow and groundwater heads from deep wells, and that a calibration of
the 100m model against the same targets as used for the 500m model calibration would not significantly
affect the 100m model performance and simulations. The large differences between the 100m and 500m
the point scale require some degree of calibration (Hrachowitz & Clark, 2017).
The possibility to exploit state-of-the-art hydrological process understanding built into a model code to
enhance hydrological process knowledge for a catchment therefore depends not only on model structure
but also on how a model is parameterised, how much field data are available to assess parameter values
and which evaluation methods and calibration techniques are used. Methodologies to support parameter
optimization have developed extensively from the first parameter optimisation attempts using a single
objective function and simple search techniques (Dawdy & O’Donnel, 1965) to gradually more efficient
parameter optimization techniques allowing use of multiple objective functions and evaluation of pareto
fronts between multiple calibration targets (Asadzadeh & Tolson, 2013; Stisen et al., 2018). The use of
multiple objectives originally focused on evaluating several performance metrics from the same
observation data, typically streamflow hydrographs (Yapo, Gupta & Sorooshian, 1998). Recently, more
attention has been given to utilizing multiple types of observational data and exploiting the multitude of
satellite based data available (Madsen, 2003; Nijzink et al., 2018). Particularly, confronting PB models with
spatial pattern information has been subject to development (Dembele, Hrachowitz, Savenije,Mariethoz &
Schaefli, 2020; Koch, Demirel & Stisen, 2018; Zink, Mai, Cuntz & Samaniego, 2018) and have highlighted the
limitations and non-uniqueness of streamflow-based optimization in PB modelling (Demirel et al., 2018).
Due to uncertainties in data, parameter values and model structure as well as mismatch of scales between
observations and models, model simulations will always include uncertainties, which often can be
considerable and crucial for water management practice. Although many practitioners and water managers
struggle with how to cope with uncertainty (Höllermann & Evers, 2019), uncertainty assessment is today a
fundamental element of good practices in both hydrological science and practical water management
(Refsgaard et al., 2013; Poff et al., 2016).
The PB brute force strategy implies further model developments by including more and more detailed
process descriptions and by adopting gradually smaller spatial grids. The basic hypothesis behind this
strategy is that it will lead to models with improved accuracy and predictive capabilities. This was the
dominating strategy until the 1990s and is still used today (e.g. Kollet et al., 2018).
We argue that both strategies can provide valuable insights contributing to enhancing hydrological process
knowledge. Although the brute force studies may not result in more reliable models as such they have
moved the borders for model applications with more sophisticated and detailed process descriptions,
larger number of small-scale computational grids and coupling with other domains. The comprehensive
evaluation studies, on the other hand, are required to fully test the hypotheses that may emerge from the
brute force studies, to provide documented reliable models and hence to consolidate the process
knowledge. Brute-force modelling applications are often at the limit of what is possible given the available
computational power. Through technological development, a sophisticated PB model that was once used in
a brute-force strategy can later be applied more comprehensively.
In spite of enormous progress in new measurement techniques such as remote sensing, geophysics, and
instruments for ground‐based measurements, the general perception until recently has been, as argued by
Beven (2002), that “catchment hydrology suffers from a measurement problem,” which “is a basic
limitation on the possibility for characterizing a catchment system.” A commonly accepted view has
therefore been that the development of sophisticated model codes has been ahead of the development of
data acquisition techniques and that the lack of data constituted the most important constraint. This view
was supported by the fact that simple LU-models usually turned out to have the same prediction accuracy
as complex PB models, at least for streamflow simulation (Refsgaard and Knudsen, 1996; Reed et al., 2004).
A logical consequence of this is that the parsimonious principle leads to use of robust LU models
(Bergström, 1991, Perrin et al., 2003).
We would argue that the emergence of huge amounts of new high-resolution spatial data that we witness
in these years, and for which we may only have seen the beginning, indicates a new era with
unprecedented new opportunities for hydrological modelling and enhanced hydrological process
knowledge. This is likely to change the balance between the constraining factors, so that lack of data will
not be as important as previously. During the first decades of model development, especially PB models
were typically a step ahead with respect to data availability. This has completely turned around during the
time enhance the hydrological process knowledge, we see a large scope for further developments of
advanced models – and this is likely to be PB and ML-HM models.
According to a quote attributed to Albert Einstein “everything should be made as simple as possible, but no
simpler”. This implies that a model that can be characterised as parsimonious at one point in time may be
considered too simple later on, when more data have become available. We believe that the good old
phrase “we will never have enough data to parameterize our distributed models” might soon be outdated.
This does not imply that additional data may not be useful, but that the benefit of additional data will
decrease. While we consider this realistic for discharge and surface flux aspects of catchment modelling, it
will probably still not apply for contaminant transport studies at small scale nor for studies assessing the
detailed spatial heterogeneity of the subsurface.
In light of the increased process descriptions in PB models and availability of new data types more
advanced model evaluation strategies are emerging. One important element is the increased attention on
evaluating the spatial pattern performance of distributed PB models. Historically, even complex and
distributed PB models have been evaluated solely against catchment aggregated observations (stream
flow) or point observations such as eddy flux data, soil moisture or groundwater levels. This evaluation
approach neglects the ability of the models to reproduce spatial patterns in hydrological states and fluxes.
Improved satellite remote sensing data and the increased focus on seamless model parametrization has
facilitated the development of methods that exploit spatial pattern information in model evaluation (Koch,
Siemann, Stisen & Sheffield, 2016; Stisen et al, 2018) and have highlighted the limitations and non-
uniqueness of streamflow-based optimization in PB modelling (Demirel et al., 2018). Out of this
development a new set of spatial pattern-oriented model evaluation metrics have emerged (Zink et al.,
2018; Koch et al., 2018; Dembele et al., 2020) and applied also for large scale (Koch et al., 2020) and
operational modeling systems (Soltani et al., 2021) as well as for improving process representation and
understanding (Hulsman, Savanije & Hrachowitz, 2021). Another model evaluation approach facilitated by
improved model complexity and data availability is the tracer-aided models (Birkel & Soulsby, 2015;
Sprenger, Tetzlaff, Buttle, Laudon & Soulsby., 2018) which exploit the increasing availability of isotope
tracer data to close the information gap on runoff generation processes. This approach has enabled a move
beyond simple hydrograph separation and runoff calibration to gain insight into flow partitioning and
transit times at catchment scale (Smith, Tetzlaff, Kleine, Maneta & Soulsby, 2021). Coupling this to
integrated groundwater surface water models seems to be an interesting avenue for future developments
and research (Sterte, Lidman, Lindborg, Sjöberg & Laudon, 2021).
understanding. Due to the limitations of state-of-the-art hydrological models to exploit available big data
sources we see a clear need to further the advancement of ML-HM. While different strategies have been
reported (Koch et al., 2019; Read et al., 2019; Konapala et al., 2020; Kraft et al., 2020; Hoedt et al., 2021),
formalized approaches to introduce process understanding into ML models have not yet been developed.
Recent literature indicates that ML-HM leverage hydrological predictions at improved spatial scales and
overall accuracy with respect to state-of-the-art hydrological models. However, it remains unresolved to
what degree ML-HM can generate novel hydrological process understanding. So far ML-HM are typically
trained to predict a single variable of the hydrological cycle, which is a drawback when compared to most
conventional hydrological models that simulate a broad range of hydrological states and fluxes jointly. Also,
it remains unresolved to what degree ML-HM can be utilized for hypothesis testing and scenario analysis,
since the capability of ML-HM models to extrapolate well beyond the variability contained in the training
data has not yet been properly examined.
As an example, our experience when moving from 2D to 3D groundwater modelling was that it improved
the communication with water managers who could recognize geological data from boreholes in the model
representation. Therefore, they considered the 3D model to be more reliable than a 2D model, even if the
test results showed similar performance. Our general understanding from two decades of work with a
national water resources model in Denmark is that non-modellers perceive a model as being more credible
if they can find their own data in the model (Højberg et al., 2013). The possibility to recognise own data can
sometimes be seen as a test to pass before stakeholders open their mind and become willing to learn from
Conclusions
Much of the hydrological process knowledge, both in terms of theoretical knowledge on processes and
specific field data, has been derived from studies at small (point or plot) spatial scales. Hydrological
Accepted Article
catchment models have until now not been able to operate at sufficiently small spatial scales to exploit this
knowledge. The vast difference in hydrological simulations of flow components between the 100m and the
500m models in our case study, is a clear example of this barriere. The two main obstacles in this regard
have during the past decades been the limitations in computer power and the lack of high-resolution data
for many catchment properties and hydrological variables.
The emergence of huge amounts of high-resolution good quality spatial data in these years combined with
the ongoing rapidly increase of computational power, are in our view likely within the next decade to
remove this historical barrier and bridge the gap between local scale hydrological process knowledge and
catchment modelling.
To make this happen and in this way enhance hydrological process knowledge, there is a need to develop
both improved hydrological model codes and improved evaluation strategies. We argue that model codes
with best possibilities to exploit all the new data would be spatially high-resolution process-based models
combined with machine learning techniques. In addition, new evaluation strategies need to include
calibration methodologies to better simulate spatial patterns and multiple hydrological variables.
Acknowledgements
We appreciate the valuable review comments and suggestions from James Kirchner and Hoshin Gupta.
ORCID
Jens Christian Refsgaard - http://orcid.org/0000-0003-0420-349X
References
Abbaspour, K.C., Rouholahnejad, E., Vaghefi, S., Srinivasan, R., Yang, H., & Kløve, B. (2015). A continental-scale
hydrology and water quality model for Europe: Calibration and uncertainty of a high-resolution large-scale SWAT
model. Journal of Hydrology, 524, 733-752.
Abbott, M.B., Bathurst, J.C., Cunge, J.A., O'Connel, P.E., & Rasmussen, J. (1986a). An introduction to the European
Hydrological System - Systeme Hydrologique Européen "SHE", 1: History and philosophy of a physically-based
distributed modelling system. Journal of Hydrology, 87, 45-59.
recharge in the Upper Mississippi river basin. Journal of Hydrology, 227, 21–40.
Auken, E., Boesen, T., & Christiansen, A.V. (2017). A review of airborne electromagnetic methods with focus on
geotechnical and hydrological applications from 2007 to 2017. Advances in Geophysics, 58, 47-93.
Asadzadeh, M., & Tolson, B. (2013). Pareto archived dynamically dimensioned search with hypervolume-based
selection for multi-objective optimization. Engineering Optimization, 45(12), 1489-1509.
Bartosova, A., Capell, R., Olesen, J.E., Jabloun, M., Refsgaard, J.C., Donelly, C., Hyytiäinen, K., Pihlainen, S., Zandersen,
M., & Arheimer, B. (2019). Future socioeconomic conditions may have a larger impact than climate change on
nutrient loads to the Baltic Sea. Ambio, 48(11), 1325-1336.
Bathurst, J.C. (1986). Physically-based distributed modelling of an upland catchment using the Systeme Hydrologique
Europeén. Journal of Hydrology, 87, 103-123.
Beck, H.E., van Dijk, A.I.J.M., de Roo, A., Miralles, D.G., McVicar, T.R., Schellekens, J., & Bruijnzeel, L.A. (2016). Global-
scale regionalization of hydrologic model parameters. Water Resources Research, 52, 3599–3622.
Becker, A. (1977). The integrated hydrological catchment model EGMO. Hydrological Sciences Journal, 22(1), 145-151.
Bergström, S. (1991). Principles and confidence in hydrological modelling. Nordic Hydrology, 22, 123-136.
Bergström, S., Forsman, A. (1973). Development of a conceptual deterministic rainfall-runoff model, Nordic Hydrology,
4, 147-170.
Best, M.J., Abramowitz, G., Johnson, H.R., Pitman, A.J., Balsamo, G., Boone, A., Cuntz, M., Decharme, B., Dirmeyer,
P.A., Dong, J., Ek, M., Guo, Z., Haverd, V., Van Den Hurk, B.J.J., Nearing, G.S., Pak, B., Peters-Lidard, C., Santanello
Jr, J.A., Stevens, J.L., & Vuichardi, N. (2015). The Plumbing of Land Surface Models: Benchmarking Model
Performance. Journal of Hydrometeorology, 16, 1425-1442.
Beven, K. (1989). Changing ideas in hydrology – the case of physically based models. Journal of Hydrology, 105, 157-
172.
Beven, K. (1995). Linking parameters across scales: subgrid parameterizations and scale dependent hydrological
models. Hydrological Processes, 9, 507-525.
Beven, K. (2002). Towards an alternative blueprint for a physically based digitally simulated hydrologic response
modelling system. Hydrological Processes, 16(2), 189–206.
Beven, K. (2006). A manifesto for the equifinality thesis. Journal of Hydrology, 320, 18–36.
Beven, K., Calver, A., & Morris, E.M. (1987). The Institute of Hydrology distributed model. Institute of Hydrology
Report 98, Wallingford UK.
Beven, K.J., Kirkby, M.J. (1979). A physically based, variable contributing area model of basin hydrology. Hydrological
Sciences Bulletin, 24, 43-69.
Birkel, C., Soulsby, C. (2015). Advancing tracer-aided rainfall–runoff modelling: a review of progress, problems and
unrealised potential. Hydrological Processes, 29(25), 5227-5240.
Bixio, A.C., Gambolati, G., Paniconi, C., Putti, M., Shestopalov, V.M., Bublias, V.N., Bohuslavsky, A.S., Kasteltseva, N.B.,
& Rudenko, Y.F. (2002). Modeling groundwater-surface water interactions including effects of morphogenetic
depressions in the Chernobyl exclusion zone, Environmental Geology, 42(2–3), 162–177.
Blöschl, G., & Sivapalan, M. (1995). Scale issues in hydrological modelling: a review. Hydrological Processes, 9, 251-
290.
Box, G.E.P., & Jenkins, G.M. (1970). Time series analysis, forecasting and control. Holden-Day Inc., San Francisco.
Chaney, N.W., Metcalfe, P., & Wood, E.F. (2016). HydroBlocks: a field-scale resolving land-surface model for
application over continental extents. Hydrological Processes, 30, 3543-3559.
Clark, M.P., Slater, A.G., Rupp, D.E., Woods, R.A., Vrugt, J.A., Gupta, H.V., Wagener, T., & Hay, L.E. (2008). Framework
for Understanding Structural Errors (FUSE): A modular framework to diagnose differences between hydrological
models. Water Resources Research, 44, W00B02.
Clark, M.P., Nijssen, B., Lundquist, J.D., Kavetski, D., Rupp, D.E., Woods, R.A., Freer, J.E., Gutmann, E.D., Wood, A.W.,
Brekke, L.D., Arnold, J.R., Gochis, D.J., & Rasmussen, R.M. (2015) A unified approach for process-based
hydrological modeling: 1. modeling concept. Water Resources Research, 51, 2498–2514.
Craig, J.R., Brown, G., Chlumsky, R., Jenkinson, R.W., Jost, G., Lee, K., Mai, J., Serrer, M., Sgro, N., Shafii, M., Snowdon,
A.P., & Tolson, B.A. (2020). Flexible watershed simulation with the Raven hydrological modelling framework.
Environmental Modelling & Software, 129, 104728.
Crawford, N.H., Linsley, R.K. (1966). Digital simulation in hydrology, Stanford Watershed Model IV, Department of Civil
Engineering, Stanford University, Technical Report 39.
Cuntz, M., Mai, J., Samaniego, L., Clark, M., Wulfmeyer, V., Branch, O., Attinger, S., & Thober, S. (2016). The impact of
standard and hard-coded parameters on the hydrologic fluxes in the Noah-MP land surface model. Journal of
Geophysical Research: Atmospheres, 121, 10,676–10,700.
Das, N.N., Mohanty, B.P., Cosh, M.H., & Jackson, T.J. (2008). Modeling and assimilation of root zone soil moisture
using remote sensing observations in Walnut Gulch watershed during SMEX04. Remote Sensing of Environment,
112, 415-429.
Dawdy, D.R., O’Donnel, T. (1965) Mathematical models of catchment behaviour. Journal of the Hydraulics Division,
ASCE, 91(4), 123-137.
De Lange, W.J., Prinsen, G.F., Hoogewoud, J.C., Veldhuizen, A.A., Verkaik, J., Oude Essink, G.H.P., van Walsum, P.E.V.,
Delsman, J.R., Hunink, J.C., Massop, H.T.L., & Kroon, T. (2014). An operational, multi-scale, multi-model system
for consensus-based, integrated water management and policy analysis: The Netherlands Hydrological
Instrument. Environmental Modelling & Software, 59, 94-108.
De Schepper, G., Therrien, R., Refsgaard, J.C., He, X., Kjaergaard, C., & Iversen, B.V. (2017). Simulating seasonal
variations of tile drainage discharge in an agricultural catchment. Water Resources Research, 53(5), 3896-3920.
Dembélé, M., Hrachowitz, M., Savenije, H.H.G., Mariéthoz, G., & Schaefli, B. (2020). Improving the Predictive Skill of a
Distributed Hydrological Model by Calibration on Spatial Patterns With Multiple Satellite Data Sets. Water
Resources Research, 56(1), e2019WR026085.
Demirel, M.C., Mai, J., Mendiguren, G., Koch, J., Samaniego, L., & Stisen, S. (2018). Combining satellite data and
appropriate objective functions for improved spatial pattern performance of a distributed hydrologic model.
Hydrology and Earth System Sciences, 22, 1299–1315.
Djuurhus, J., Hansen, S., Schelde, K., & Jacobsen, O.H. (1999). Modelling mean nitrate leaching from spatially variable
fields using effective parameters. Geoderma, 87, 261-279.
Ehlers, L.B., Refsgaard, J.C., & Sonnenborg, T.O. (2019). Observational and predictive uncertainties for multiple
variables in a spatially distributed hydrological model. Hydrological Processes, 33, 833-848.
Fang, X., Pomeroy, J.W., Westbrook, C.J., Guo, X., Minke, A.G., & Brown, T. (2010). Prediction of snowmelt derived
streamflow in a wetland dominated prairie basin. Hydrology and Earth System Sciences, 14, 991–1006.
Freeze, R.A., Harlan, R.L. (1969). Blueprint for a physically-based digitally-simulated hydrologic response model.
Journal of Hydrology, 9, 237-258.
Goderniaux, P., Brouyère, S., Fowler, H.J., Blenkinsop, S., Therrien, R., Orban, P., & Dassargues, A. (2009). Large scale
surface-subsurface hydrological model to assess climate change impacts on groundwater reserves. Journal of
Hydrology, 373, 122–138.
Graham, D.N., Butts, M.B. (2005). Flexible integrated watershed modelling with MIKE SHE. In: Singh VP, Frevert DK
(Eds) Watershed Models. CRC Press, Chapter 10.
Grayson, R.B., Moore, I.D., & McHahon, T.A. (1992). Physically based hydrologic modelling, 1. A terrain-based model
for investigative purposes. Water Resources Research, 28(10), 2639-2658.
Hansen, J.R., Refsgaard, J.C., Hansen, S., & Ernstsen, V. (2007). Problems with heterogeneity in physically based
agricultural catchment models. Journal of Hydrology, 342 (1-2), 1-16.
Hartmann, A., Wagener, T., Rimmer, A., Lange, J., Brielmann, H., & Weiler, M. (2013). Testing the realism of model
structures to identify karst system processes using water quality and quantity signatures. Water Resources
Research, 49, 3345–3358.
Henriksen, H.J., Troldborg, L., Nyegaard, P., Sonnenborg, T.O., Refsgaard, J.C., & Madsen, B. (2003). Methodology for
construction, calibration and validation of a national hydrological model for Denmark. Journal of Hydrology, 280,
52-71.
Henriksen, H.J., Kragh, S.J., Gotfredsen, J., Ondracek, M., van Til, M., Jakobsen, A., Schneider, R.J.M., Koch, J.,
Troldborg, L., Rasmussen, P., Pasten-Zapata, E., & Stisen, S. (2020). Udvikling af landsdækkende
modelberegninger af terrænnære hydrologiske forhold i 100m grid ved anvendelse af DK-modellen
(Development of nationwide model simulations of shallow hydrogeology in a 100 m grid with the DK model).
Geological Survey of Denmark and Greenland.
Hoedt, J.P., Kratzert, F., Klotz, D., Halmich, C., Holzleitner, M., Nearing, G., Hochreiter, S., & Klambauer, G. (2021). MC-
LSTM: Mass-conserving LSTM. arXiv:2101.05186v2.
Höllermann, B., & Evers, M. (2019). Coping with uncertainty in water management: Qualitative system analysis as a
vehicle to visualize the plurality of practitioners' uncertainty handling routines. Journal of Environmental
Management, 235, 213-223.
Hrachowitz, M., & Clark, M.P. (2017). HESS Opinions: The complementary merits of competing modelling philosophies
in hydrology. Hydrology and Earth System Sciences, 21, 3953-3873.
Hulsman, P., Savenije, H.H.G., & Hrachowitz, M. (2021). Learning from satellite observations: increased understanding
of catchment processes through stepwise model improvement. Hydrology and Earth System Sciences, 25, 957-
982.
Højberg, A.L., Troldborg, L., Stisen, S., Christensen, B.B.S., & Henriksen, H.J. (2013). Stakeholder driven update and
improvement of a national water resources model. Environmental Modelling & Software, 40, 202–213.
Jain, S.K., Storm, B., Bathurst, J.C., Refsgaard, J.C., & Singh, R.D. (1992). Application of the SHE to Catchments in India -
Part 2: Field Experiments and Simulation Studies with the SHE on the Kolar Subcatchment of the Narmada River.
Journal of Hydrology, 140, 25-47.
Jakeman, A.J., & Hornberger, G.M. (1993). How much complexity is warranted in a rainfall-runoff model? Water
Resources Research, 29(8), 2637-2649.
Jakeman, A.J., Letcher, R.A., & Norton, J.P. (2006). Ten iterative steps in development and evaluation of environmental
models. Environmental Modelling & Software, 21, 602–614.
Poff, N.L., Brown, C.M., Grantham, T., Matthews, J.H., Palmer, M.A., Spence, C.M., Wilby, R.L., Haasnoot, M.,
Mendoza, G.F., Dominique, K.C., & Baeza, A. (2016). Sustainable water management under future uncertainty
with eco-engineering decision scaling. Nature Climate Change, 6, 25–34.
Potapov, P., Hansen, M.C., Kommareddy, I., Kommareddy, A., Turubanova, S., Pickens, A., Adusei, B., Tyukavina, A., &
Ying, Q. (2020) Landsat Analysis Ready Data for Global Land Cover and Land Cover Change Mapping. Remote
Sensing, 12(3), 426.
Porter, J.W., & McHahon, T.W. (1971). A model for simulation of streamflow data from climatic records. Journal of
Hydrology, 13, 297-324.
Read, J.S., Jia, X., Willard, J., Appling, A.P., Zwart, J.A., Oliver, S.K., Karpatne, A., Hansen, G.J.A., Hanson, P.C., Watkins,
W., Steinbach, M., & Kumar, V. (2019). Process-guided deep learning predictions of lake water temperature.
Water Resources Research, 55, 9173-9190.
Reed, S., Koren, V., Smith, M., Zhang, Z., Moreda, F., & Seo, D.J. (2004). Overall distributed model intercomparison
project results. Journal of Hydrology, 298, 27-60.
Refsgaard, J.C. (1996). Terminology, modelling protocol and classification of hydrological model codes. In: Abbott, MB
and Refsgaard JC (Eds.) Distributed Hydrological Modelling. Kluwer Academics Publishers, 17-39.
Refsgaard, J.C., & Hansen, E. (1982). A distributed groundwater/surface water model for the Suså catchment. Part 1:
Model description. Nordic Hydrology, 13, 299-310.
Refsgaard, J.C., & Knudsen, J. (1996). Operational validation and intercomparison of different types of hydrological
models. Water Resources Research, 32 (7), 2189-2202.
Refsgaard, J.C., Sørensen, H.R., Mucha, I., Rodak, D., Hlavaty, Z., Bansky, L., Klucovska, J., Topolska, J., Takac, J., Kosc, V.,
Enggrob, H.G., Engesgaard, P., Jensen, J.K., Fiselier, J., Griffioen, J., & Hansen, S. (1998). An integrated model for the
Danubian Lowland – methodology and applications. Water Resources Management, 12, 433-465.
Refsgaard, J.C., Henriksen, H.J. (2004). Modelling guidelines – terminology and guiding principles. Advances in Water
Resources, 27(1), 71-82.
Refsgaard, J.C., van der Sluijs, J.P., Højberg, A.L., & Vanrolleghem, P.A. (2007). Uncertainty in the environmental
modelling process – A framework and guidance. Environmental Modelling & Software, 22, 1543-1556.
Refsgaard, J.C., Arnbjerg-Nielsen, K., Drews, M., Halsnæs, K., Jeppesen, E., Madsen, H., Markandya, A., Olesen, J.E.,
Porter, J.R., & Christensen, J.H. (2013). The role of uncertainty in climate change adaptation strategies – A Danish
water management example. Mitigation and Adaptation Strategies for Global Change, 18(3), 337-359.
Refsgaard, J.C., Højberg, A.L., He, X., Hansen, A.L., Rasmussen, S.H., & Stisen, S. (2016). Where are the limits of model
predictive capabilities? Hydrological Processes, Keith Beven Tribute, 30(26), 4956-4965.
Rigon, R., Bertoldi, G., & Over, T.M. (2006). GEOtop: A Distributed Hydrological Model with Coupled Water and Energy
Budgets. Journal of Hydrometeorology, 7, 371-388.
Rockwood, D.M. (1964). Streamflow synthesis and reservoir regulation. US Army Engineer Division, North Pacific,
Portland Oregon. Engineering Studies Project 171, Technical Bulletin No. 22
Samaniego, L., Kumar, R., & Attinger, S. (2010). Multiscale parameter regionalization of a grid-based hydrologic model at
the mesoscale. Water Resources Research, 46, W05523.
Searcy, J.K. (1960). Graphical correlation of gaging-station records. Manual of hydrology: Part 1. General surface water
techniques. Geological Survey Water-Supply Paper 1541-C. USGS.
Shen, C. (2018). A Transdisciplinary Review of Deep Learning Research and Its Relevance for Water Resources
Scientists. Water Resources Research, 54, 8558-8593.
numerical modeling for basin-wide water management: The case of the Rattlesnake Creek basin in south-central
Kansas. Journal of Hydrology, 214, 179-196.
Speers, D.D. (1995). SSARR model. In Singh VJ (Ed) Computer models in watershed hydrology. Water Resources
Publication, 367-394.
Sprenger, M., Tetzlaff, D., Buttle, J., Laudon, H., & Soulsby, C. (2018). EcH2O-iso 1.0: water isotopes and age tracking in
a process-based, distributed ecohydrological model. Geoscientific Model Development, 11, 3045-3069.
Srinivasan, R., Ramanarayanan, T.S., Arnold, J.G., & Bednarz, S.T. (1998). Large area hydrologic modeling and
assessment - Part 2: Model application. Journal of the American Water Resources Association, 34, 91-101.
Stephenson, G.R., & Freeze, R.A. (1974). Mathematical simulation of subsurface flow contributions to snowmelt
runoff, Reynolds Creek watershed, Idaho. Water Resources Research, 10(2), 284-294.
Sterte, E.J., Lidman, F., Lindborg, E., Sjöberg, Y., & Laudon, H. (2021). How catchment characteristics influence
hydrological pathways and travel times in a boreal landscape. Hydrology and Earth System Sciences, 25, 2133–
2158.
Stisen, S., Koch, J., Sonnenborg, T.O., Refsgaard, J.C., Bircher, S., Ringgaard, R., & Jensen, K.H. (2018) Moving beyond
runoff calibration - Multi-variable optimization of a surface-subsurface-atmosphere model. Hydrological
Processes, 32(17), 2654-2668.
Sugawara, M. (1967). The flood forecasting by a series storage type model. IAHS Publication no. 85, 1-6.
Therrien, R., McLaren, R.G., Sudicky, E.A., & Park, Y.J. (2010). HydroGeoSphere: A Three-Dimensional Numerical Model
Describing Fully-Integrated Subsurface and Surface Flow and Solute Transport, 453 pp., Aquanty Inc., Waterloo,
Ontario, Canada.
Todini, E. (2007). Hydrological catchment modelling: past, present and future. Hydrological Earth System Sciences,
11(1), 468-482.
Tyralis, H., Papacharalampous, G., & Langousis, A. (2019). A Brief Review of Random Forests for Water Scientists and
Practitioners and Their Recent History in Water Resources. Water, 11, 910.
Van Der Knijff, J.M., Younis, J., De Roo, A.P.J. (2010). LISFLOOD: a GIS‐based distributed model for river basin scale
water balance and flood simulation. International Journal of Geographical Information Science, 24(2), 189-212.
VanderKwark, J.E., & Logue, K. (2001). Hydrologic-response simulations for the R-5 catchment with a comprehensive
physics-based model. Water Resources Research, 37(4), 999-1013.
Vazquez, R.F., Beven, K., & Feyen, J. (2009). GLUE based assessment on the overall predictions of a MIKE SHE application.
Water Resources Management, 23, 1325–1349.
Vrugt, J.A., Diks, C.G.H., Gupta, H.V., Bouten, W., & Verstraten, J.M. (2005). Improved treatment of uncertainty in
hydrologic modeling: Combining the strengths of global optimization and data assimilation. Water Resources
Research, 41, W01017.
Western, A.W., Grayson, R.B., & Green, T.R. (1999). The Tarrawarra project: high resolution spatial measurement,
modelling and analysis of soil moisture and hydrological response. Hydrological Processes, 13, 633-652.
WMO. (1975). Intercomparison of conceptual models used in operational hydrological forecasting. WMO Operational
Hydrology Report No 7, WMO No 429, World Meteorological Organization, Geneva.
WMO. (1988). Intercomparison of models for snowmelt runoff. WMO Operational Hydrology Report No 23, WMO No
646, World Meteorological Organization, Geneva.
WMO. (1992). Simulated real-time intercomparison of hydrological models. WMO Operational Hydrology Report No
38, WMO No 779, World Meteorological Organization, Geneva.
H
MAE de. [m] 4.43 4.76
KGE [-] 0.64 0.64
Q
WBE [%] 4.31 2.49
MAE sh. [m] 2.99 3.32
H
MAE de. [m] 4.01 4.32
KGE [-] 0.66 0.67
Q
WBE [%] 3.69 -0.43