CE3013 Lecture Note 5-1 Hydrological Analysis
CE3013 Lecture Note 5-1 Hydrological Analysis
Prof. R. L. H. L. Rajapakse
Hydraulic and Water Resources Engineering Group
Department of Civil Engineering
January 2024
Module Outline
2
Module Outline
3
Module Outline
4
Module Outline
5
Program Outcomes
6
Module Outline/Timetable
7
Weekly Action Plan
8
Weekly Action Plan
9
5-1. Hydrological Analysis
5.1.1 Hydrologic Models and Classification based on
Randomness
Deterministic models/approaches (e.g. Rainfall Runoff Analysis)
▪ Analysis of hydrological processes using deterministic approaches
▪ Hydrological parameters are based on physical relations of the various
components of the hydrologic cycle.
▪ Do not consider randomness; a given input produces the same output.
▪ Therefore, the outcome can be predicted in advance.
Stochastic models/approaches (e.g. Flood Frequency Analysis)
▪ Probabilistic description and modeling of hydrologic phenomena
▪ Statistical analysis of hydrologic data.
▪ A number expressing the ratio of favorable outcome to the all possible outcomes
= 13/52 = 0.25 = 25 %
13
• P (Getting a 3 after rolling a dice) = 1/6
5-1. Hydrological Analysis
5.1.3 Additional Definitions
b). Random Variable
What is a Random Variable?
▪ Random variable: A quantity used to represent probabilistic uncertainty
• Incremental precipitation
• Instantaneous streamflow
• Wind velocity
14
5-1. Hydrological Analysis
5.1.3 Additional Definitions
c). Sampling Terminology
▪ Sample: A finite set of observations x1, x2,….., xn of the random variable (sample
readings or measurements)
• A sample comes from a hypothetical infinite population possessing constant
statistical properties
▪ Sample space: Set of possible samples that can be drawn from a population
▪ Event: Subset of a sample space
◼ Example
• Population: Streamflow
• Sample space: Instantaneous streamflow, Annual maximum streamflow, Daily
average streamflow
• Sample: 100 observations of annual maximum streamflow
• Event (Selected): Daily average streamflow > 100 cfs 15
5-1. Hydrological Analysis
5.1.3 Additional Definitions
d). Types of Sampling
▪ Stratified sampling: Population is divided into groups, and then a random sampling is used
• Pick a streamflow value from annual maximum series
▪ Uniform sampling: Data are selected such that the points are uniformly far apart in time or space
• Pick streamflow values measured on Monday midnight
16
5-1. Hydrological Analysis
5.1.3 Additional Definitions
e). Descriptive/Summary Statistics
Probability Density Function
Also called descriptive statistics
If x1, x2, …xn is a sample then:
1 n
Mean X = xi m for population
n i =1
(xi − X )
1 n
Variance S =
2
s2 for population
n − 1 i =1
▪ Spatial Mean
• Mean areal precipitation
Normal Distribution
▪ Standard deviation (or σ) is a measure of how dispersed the data is in relation to the mean.
▪ A normal distribution is a type of continuous probability distribution in which most data points cluster
toward the middle of the range, while the rest taper off symmetrically toward either extreme.
▪ Skewness is a measure of the asymmetry of a distribution. A distribution is asymmetrical when its
left and right side are not mirror images. A distribution can have right (or positive), left (or negative),
or zero skewness (Normal distribution).
5-1. Hydrological Analysis
5.1.3 Additional Definitions
h). Graphical Display of Data
▪ A graphical or graph-based approach allows for a comprehensible interpretation of flood
dynamics based on historical data.
▪ This method proves to be a useful tool to gain insight in a flood event or a series of events.
▪ Graphical representation helps to identify and locate entities within the data set (floods,
droughts, etc.) and describe their evolution throughout the time series.
▪ The different approaches under graphical representation are:
• Time Series Plots
A time series chart/graph/plot is a data visualization tool that
illustrates data points at successive intervals of time and can be
used to visualize trends in counts or numerical values over time.
Use of Log axis for y-axis ? What are the benefits ?
Trend
Mean
Probability Density Function of a Normal Distribution N(0, σ2) Boxplot and Probability Density Function
of a Normal Distribution N(0, σ2)
1 P (Q ≤ 50,000) = 0.8
.
0.8
P (Q ≤ 25,000) = 0.4
Probability
0.6
0.4
0.2
0
0 100 200 300 400 500 600
Annual m ax flow (103 cfs)
25
How do you produce and interpret the CDF?
5-1. Hydrological Analysis
5.1.3 Additional Definitions
h). Graphical Display of Data
Probability Density Function (PDF) and Cumulative Distribution Function (CDF)
Continuous form of probability mass function is called probability density function (PDF).
PDF is the first derivative of a cumulative distribution function.
▪ How do you plot the PDF and CDF?
▪ What is represented by the area under PDF probability?
5-1. Hydrological Analysis
5.1.4 Data Requirements in Hydrology
- Complete data sets (Missing data => Problematic; Need replacing)
- Reliability (Accuracy)
Assumption that the sample come
- Representative (Sufficiently long length of record) from the same population (data set)
- Homogeneous (Question of stationarity; E.g. Climate Change => Is the past a
good indicator of future ?
- Data checking is an essential step in Hydrological Analysis
5.1.5 Hydrologic Data Series/Timeseries
River/Streamflow (Discharge) Hydrograph
27
5.1.4 Hydrologic Data Series/Timeseries
Annual series
- If the time interval length is taken as one year,
then the extreme value series is called an annual
series
- Annual Maximum Series (Fig. c)
- If the largest annual values are selected
- Annual Minimum Series 28
- If the smallest annual values are selected
5-1. Hydrological Analysis
5.1.5 Concept of Return Period
Return Period, also known as Recurrence Interval or Repeat Interval, is an average time or an
estimated average time between events such as earthquakes, floods, landslides, or river
discharge flows of similar magnitude to occur.
Therefore, Return Period describes the expected (mean) time (usually in years) between the
exceedance (or equaling) of a particular extreme threshold.
It is a statistical measurement typically based on historic data over an extended period and is
used usually for risk analysis.
Deflection Point
Deflection Point
No flow period
33
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Example
Date of Daily Average Sorted Flow
Rank
Exceedance Daily Average Flow (m3/s)
Observation Flow (m3/s) Rates (m3/s) Probability (%)
400
Day 01 376.2 376.2 1 3.2
Day 01
Day 02
Day 03
Day 04
Day 05
Day 06
Day 07
Day 08
Day 09
Day 10
Day 11
Day 12
Day 13
Day 14
Day 15
Day 16
Day 17
Day 18
Day 19
Day 20
Day 21
Day 22
Day 23
Day 24
Day 25
Day 26
Day 27
Day 28
Day 29
Day 30
Day 12 94.0 129.4 12 38.7
Day 13 70.9 117.9 13 41.9
Day 14 71.8 114.9 14 45.2
Day 15 69.8 110.5 15 48.4
Day 16 66.0 94.0 16 51.6 FDC - Sorted Flow Rates (m3/s)
Day 17 62.0 90.4 17 54.8 400
Day 18 52.6 71.8 18 58.1
Daily Average Flow (m3/s)
350
Day 19 52.4 70.9 19 61.3
300
Day 20 46.6 69.8 20 64.5
Day 21 42.7 66.0 21 67.7 250
Day 22 44.5 62.0 22 71.0 200
Day 23 47.0 52.6 23 74.2 y = -89.79ln(x) + 445.29
Day 24 35.6 52.4 24 77.4 150 R² = 0.9763
Day 25 50.6 50.6 25 80.6 100
Day 26 90.4 47.0 26 83.9
50
Day 27 129.4 46.6 27 87.1
Day 28 250.8 44.5 28 90.3 0
Day 29 175.1 42.7 29 93.5 0 20 40 60 80 100
Day 30 143.4 35.6 30 96.8 Probability of Exceedance (%) 34
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Interpreting FDC
Shape: Represents the characteristics of a drainage basin, thus the shape of FDC can be used
to compare characteristics of one basin with those of another.
• Steep slope throughout - Highly variable flow
• Curve with flat slope - Presence of surface/groundwater storage
• Flat slope at lower end - Large perennial storage
• Sharp drop - Low/negligible storage
▪ Flat curve: River with only few floods; little variation in flow regime. Dry flow is supplied by
base-flows/groundwater flows.
▪ Steep curve: Frequent floods and dry periods; little storage from groundwater.
--------------------------------------------------------------------------------------------------
▪ Other parameters:
• Mean: Area under portion of curve divided by the percent of time of that portion will
give mean flow during that duration.
• Median: Curve value at 50 % of the time.
35
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Interpreting FDC
Regulated
Flow
Natural Flow
Perennial River
Intermittent/Ephemeral River
36
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Ex.1: Calculation and Plotting FDC
Ex. 1: The measured flow frequency distribution for a stream over a period of 4 years
is as shown below. Use the data given to plot FDC and determine HF, MF, LF values.
37
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Ex.1: Calculation and Plotting FDC
Example: Calculation and Plotting FDC & Identification of Flow Thresholds
LF or EF
10.0% 90.0% 38
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Ex. 2: Hydrograph Method
39
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Solution: Hydrograph Method
40
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Solution: Hydrograph Method
1
2
9
10
11
12
41
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Solution: Hydrograph Method
42
5-1. Hydrological Analysis
5.1.6 Flow duration curve (FDC) – Solution: Hydrograph Method
43
5-1. Hydrological Analysis
5.1.6 Flow Duration Curve (FDC) – Solution: Hydrograph Method
Applications: The information derived from the FDC can be used for the estimation of
the mean hydropower energy that can be generated by using the streamflow.
44
5-1. Hydrological Analysis
5.1.7 Statistical Analysis in Hydrometeorology
▪ Statistical methods provide effective tools for the analysis of changes in
hydrometeorological variables and extreme hydrological/meteorological events,
as well as the correlation between different variables.
▪ Statistical methods have a long history in the analysis of hydrological data for
designing, planning, infilling, forecasting, and specifying better models to
assess scenarios of land use and climate change in catchments.
• Regression analysis
• Mass curve analysis
• Flood frequency analysis
• Many more (including BigData, Data Clouds, Climate Change
Modelling/GCM/RCM) which are beyond the scope of this class!
45
5-1. Hydrological Analysis
5.1.7 Statistical Analysis – Linear Regression
▪ A technique to determine the relationship between two random
variables.
• Relationship between discharge and velocity in a stream
▪ Fit the model (find β0 and β1) such at the sum of the squares of the vertical
deviations is minimum
▪ Alternatively, you can use Tools => Data Analysis => Regression
48
5-1. Hydrological Analysis
5.1.7 Statistical Analysis – Coefficient of determination (R2)
It is the proportion of observed y variation that can be explained by the simple linear regression model
and defined as:
16 X Y ෝ𝒊 𝟐
𝒚𝒊 − 𝒚 ഥ 𝟐
𝒚𝒊 − 𝒚
SSE
R2 = 1−
Dependent Variable (y)
2 3 0.36 38.44
SST 12
where, 4 7 0.36 4.84
SSE = ( yi − yˆ i ) 2 8
6 10 0.64 0.64
8 11 1.00 3.24
Error sum of squares
4 y = 1.4x + 0.8 10 15 0.04 33.64
SST = ( yi − y ) 2
R² = 0.9703
0 𝑦ത = 9.2 𝑆𝑆𝐸 = 2.4 𝑆𝑆𝑇 = 80.8
Total sum of squares, 0 4 8 12
Ybar is the mean of yi Independent Variable (x)
50
5-1. Hydrological Analysis
5.1.8 More about Time Series Sampling - Precision and Accuracy
▪ Precision: How close the measurements are to each other
▪ Accuracy: How close the measurements are to the true value
51
5-1. Hydrological Analysis
5.1.9 Data Collection and Analysis of Hydrologic Extremes
▪ Hydrological extremes are defined as unusually high or low magnitude events associated with
the occurrence, movement and distribution of water, such as floods, storms, hurricanes,
erosion/landslides and droughts (i.e. excessive, immoderate, inordinate, extravagant, exorbitant,
or simply extreme means going beyond a normal limit).
▪ Hydrological extremes usually result from a combination of compounding interacting physical
processes that occur across multiple spatial and temporal scales.
▪ Rainfall/streamflow extremes derived from continuous or partial time series are used to
determine design flood magnitudes and frequencies for water resources planning and
development purposes.
▪ Such extremes are
used to develop
Extreme value
functions and
national and regional
level Rainfall
Intensity-Duration-
Frequency curves
which are used for
design flood
estimation.
52
5-1. Hydrological Analysis
5.1.9 Data Collection and Analysis of Hydrologic Extremes
Traditional Methods for Estimating Hydrologic Extremes
Step 1: Select Extreme Event from Each Historical Year
50,000
45,000
Snohomish Basin
40,000
Streamflow (cfs)
35,000
30,000
25,000
20,000
15,000
10,000
5,000
0
10
19
28
37
46
55
64
73
82
91
1
289
307
325
343
361
100
109
118
127
136
145
154
163
172
181
190
199
208
217
226
235
244
253
262
271
280
298
316
334
352
Day of the Water Year (1 = Oct 1 in SL)
53
5-1. Hydrological Analysis
5.1.9 Data Collection and Analysis of Hydrologic Extremes
Step 2: Rank Extreme Events for all Years and Estimate Quantiles
100,000
90,000
80,000
Streamflow (cfs)
70,000
Median
60,000 1999 Annual
50,000
40,000
30,000
20,000
10,000
0
0.01
0.04
0.07
0.11
0.14
0.17
0.20
0.24
0.27
0.30
0.34
0.37
0.40
0.43
0.47
0.50
0.53
0.57
0.60
0.63
0.66
0.70
0.73
0.76
0.80
0.83
0.86
0.89
0.93
0.96
0.99
Probability of Exceedance (Fraction)
54
5-1. Hydrological Analysis
5.1.9 Data Collection and Analysis of Hydrologic Extremes
▪ For climate change experiments, GEV is a good choice since the true nature of
the future probability distributions is essentially unknown. However, it turns out
that the choice of distribution is not very critical in terms of the evaluating the
sensitivity to warming and/or precipitation change.
55
5-1. Hydrological Analysis
5.1.9 Data Collection and Analysis of Hydrologic Extremes
Note that any return interval can be estimated. E.g. One could provide an
estimate of the “5,000 year flood”.
Q100 Q100
Flow (cfs)
Flow (cfs)
Q50 Q50
Q20 Q20
Q100 Q100
Flow (cfs)
Flow (cfs)
Q50 Q50
Q20 Q20
E.g.