Verification of Ensemble Forecasts - A Survey: Laurence J. Wilson Meteorological Service of Canada Montreal, Quebec

This document surveys methods for verifying ensemble forecasts, including verifying the ensemble distribution, individual members, and probability forecasts derived from the ensemble. It discusses scores like the ranked probability score (RPS) and continuous ranked probability score (CRPS) to verify the ensemble distribution, as well as rank histograms to assess calibration. Methods for verifying individual members and the ensemble mean are also outlined. The document concludes by noting reliability diagrams and receiver operating characteristic (ROC) curves can verify probability forecasts from the ensemble.


Verification of Ensemble Forecasts - A

Survey

Laurence J. Wilson
Meteorological Service of Canada
Montreal, Quebec
Outline
• The ensemble verification problem
– Attributes applied to the ensemble distribution
• Verification of the ensemble distribution
– Wilson 1999
– RPS and CRPS
– Rank Histogram
• Verification of individual ensemble members
• Verification of probability forecasts from the
ensemble
– Reliability tables
– The ROC
Verification of the ensemble
• Problem:
– how to compare a distribution with an observation
• The concept of “consistency”:
– For each possible probability distribution f, the a posteriori
verifying observations are distributed according to f in
those circumstances when the system predicts the
distribution f. (Talagrand)
– similar to reliability
• The concept of “non-triviality”
– the EPS must predict different distributions at different
times
Strategy for ensemble verification
Ensemble verification - distribution
Ensemble verification - 500 mb
Ensemble verification - 500 mb
Comments on “Wilson” score
• Sensitive both to “nearness” of the ensemble mean
and to ensemble spread
• Verifies the distribution only in the vicinity of the
observation; variations outside the window have
no impact
• Believed to be strictly proper - shown empirically
• Related to the Brier Score for a single forecast:
Sc = 1 − BS
• Can account for forecast “difficulty” by choosing
window based on climatological variance
Verification of approximations to the EPS
distribution
• The Ranked Probability Score (RPS)

RPS = \frac{1}{K-1} \sum_{i=1}^{K} \left( \sum_{n=1}^{i} P_n - \sum_{n=1}^{i} O_n \right)^2

– discrete form; choose categories, then sample the
distribution according to those categories
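As a sketch, the discrete RPS can be computed from cumulative sums. The helper name `rps` is illustrative, assuming `p` holds the forecast probabilities per ordered category and `o` is 1 for the observed category and 0 elsewhere:

```python
import numpy as np

def rps(p, o):
    """Ranked Probability Score: squared differences of the
    cumulative forecast and observed distributions, summed over
    the K categories and normalized by K - 1."""
    p = np.asarray(p, dtype=float)
    o = np.asarray(o, dtype=float)
    K = len(p)
    return np.sum((np.cumsum(p) - np.cumsum(o)) ** 2) / (K - 1)

# Three ordered categories; the observation falls in the middle one.
print(rps([0.2, 0.5, 0.3], [0, 1, 0]))  # ≈ 0.065
print(rps([0.0, 1.0, 0.0], [0, 1, 0]))  # 0.0 (perfect forecast)
```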
• Continuous RPS (CRPS)

CRPS(P, x_a) = \int_{-\infty}^{\infty} \left[ P(x) - P_a(x) \right]^2 \, dx

where P_a(x) is the step function jumping from 0 to 1 at the
observed value x_a
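For an ensemble treated as an empirical step CDF, the integral above has a closed form, CRPS = E|X − x_a| − ½ E|X − X′| over the members. A minimal sketch (the function name `crps_ensemble` is an assumption, not from the slides):

```python
import numpy as np

def crps_ensemble(members, obs):
    """CRPS of an equally weighted ensemble against a scalar
    observation, via the identity
        CRPS = mean|X - obs| - 0.5 * mean|X - X'|,
    which equals the integral of [P(x) - P_a(x)]^2 dx when P is
    the empirical step CDF of the members."""
    x = np.asarray(members, dtype=float)
    term1 = np.mean(np.abs(x - obs))
    term2 = 0.5 * np.mean(np.abs(x[:, None] - x[None, :]))
    return term1 - term2

print(crps_ensemble([0.0, 1.0], 0.5))  # ≈ 0.25
```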
CRPS example

[Figure: CDFs of the forecast (ensemble) and the observed value
(step function); y-axis: probability, 0 to 0.9; x-axis: X. The
CRPS integrates the squared difference between the two curves.]
Rank Histogram (Talagrand Diagram)
• Preparation
– order the members of the ensemble from lowest to
highest - identifies n+1 ranges including the two
extremes
– identify the location of the observation, tally over a
large number of cases
• Interpretation
– Flat indicates ensemble spread about right to represent
uncertainty
– U-shaped - ensemble spread too small
– dome-shaped - ensemble spread too large
– asymmetric - over- or under-forecasting bias
– This is NOT a true verification measure
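The preparation steps above can be sketched as a tally of observation ranks. The name `rank_histogram` is illustrative; for simplicity ties count the member as lying above the observation (a real implementation would randomize ties):

```python
import numpy as np

def rank_histogram(ens, obs):
    """Tally, over many cases, which of the n+1 ranges defined by
    the n ordered ensemble members contains the observation."""
    ens = np.asarray(ens, dtype=float)   # shape (n_cases, n_members)
    obs = np.asarray(obs, dtype=float)   # shape (n_cases,)
    n = ens.shape[1]
    # Rank = number of members strictly below the observation (0..n),
    # so sorting the members explicitly is not needed.
    ranks = np.sum(ens < obs[:, None], axis=1)
    return np.bincount(ranks, minlength=n + 1)

# Two cases, 3 members each -> 4 bins; flatness over many cases
# suggests the spread is about right.
print(rank_histogram([[1, 2, 3], [1, 2, 3]], [0.5, 2.5]))  # [1 0 1 0]
```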
Rank Histogram example
Rank Histogram
Verification of individual members
• Preferred over verification of the ensemble mean
for comparison with the operational model
• Unperturbed control
– compare with the full-resolution model
• Best and worst member
– an a posteriori verification - less useful to forecasters
– select over a forecast range or individually at each range
• Methods
– all that apply to continuous fields: RMSE, MAE, bias,
anomaly correlation, etc.
– preferable to verify against observations rather than an analysis
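The standard continuous scores listed above can be sketched in a few lines; the helper name `member_scores` is an assumption:

```python
import numpy as np

def member_scores(fcst, obs):
    """Bias, MAE and RMSE of one member (e.g. the unperturbed
    control) against matched observations."""
    err = np.asarray(fcst, dtype=float) - np.asarray(obs, dtype=float)
    return {"bias": float(err.mean()),
            "mae": float(np.abs(err).mean()),
            "rmse": float(np.sqrt((err ** 2).mean()))}

print(member_scores([2.0, 2.0], [1.0, 3.0]))
# {'bias': 0.0, 'mae': 1.0, 'rmse': 1.0}
```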
The Ensemble mean
• Popular, because it scores well under quadratic scoring rules
• Should NOT be compared to individual outcomes:
– different sampling distribution
– not a trajectory of the model
Verification of probability forecasts from the
Ensemble
• Same as verification of any probability forecasts
• Reliability Table (with unconditional
distribution of forecasts) + ROC (with
likelihood diagram) sufficient for complete
diagnostic verification
• Reliability table: distribution conditioned on the forecast
• ROC: distribution conditioned on the observation
• Attributes:
• reliability
• sharpness
• resolution
• discrimination
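Each point of the empirical ROC comes from turning the probability forecast into a yes/no decision at one threshold; sweeping thresholds traces the curve. A sketch with an assumed helper name `roc_points`:

```python
import numpy as np

def roc_points(prob, event):
    """(false alarm rate, hit rate) pairs, one per distinct
    probability threshold, for an event observed (1) or not (0)."""
    prob = np.asarray(prob, dtype=float)
    event = np.asarray(event, dtype=int)
    n_yes = max(int(np.sum(event == 1)), 1)
    n_no = max(int(np.sum(event == 0)), 1)
    pts = []
    for t in np.unique(prob):
        warn = prob >= t  # forecast "yes" at this threshold
        hr = float(np.sum(warn & (event == 1))) / n_yes
        far = float(np.sum(warn & (event == 0))) / n_no
        pts.append((far, hr))
    return pts

# Perfectly discriminating two-case example.
print(roc_points([0.1, 0.9], [0, 1]))  # [(1.0, 1.0), (0.0, 1.0)]
```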
ROC - ECMWF Ensemble Forecasts
Temperature 850 mb anomaly < -4 C (vs. analysis)

[Figure: ROC curves (hit rate vs. false alarm rate) for the 96 h,
144 h and 240 h forecasts, Europe analysis 2000, against the
no-skill diagonal, with likelihood diagrams (number of cases vs.
forecast probability, stratified by observed yes/no) at each range.]

Az and DA by forecast range:
96 h: Az = 0.900, DA = 1.812
144 h: Az = 0.831, DA = 1.357
240 h: Az = 0.725, DA = 0.844
ROC Issues
• Empirical vs. fitted
• No. points needed to define the ROC
• ROC and value (“potential value”)
ROC - threshold variation
(Wilson, 2000)

[Figure: ROC - Summer 97, Europe. Hit rate vs. false alarm rate
for day 3 precipitation thresholds of 1, 2, 5 and 10 mm.]

[Figure: ROC - Summer 97, Europe. Day 3 curves for the 1 mm and
10 mm thresholds with the no-skill line and the single (HR, FAR)
points for the 1 mm and 10 mm deterministic forecasts.
Az: d3 1 mm = 0.866, d3 10 mm = 0.851;
s: d3 1 mm = 1.221, d3 10 mm = 1.096.]
Summary
• Verification of the ensemble distribution - depends
on how it is to be used by the forecaster
• Two aspects: verification of distribution vs.
verification of probabilities from the distribution
• Several measures shown, characteristics identified
• Sufficiency of Reliability table and ROC graph for
diagnostic verification of probability forecasts
