0% found this document useful (0 votes)

39 views49 pages

Stats and Data Analysis

Statistics and robust methods are useful for proficiency testing in several ways: 1) Finding the consensus value and its uncertainty by using the robust mean, which is less influenced by outliers than the regular mean. 2) Assessing participant performance using a z-score based on the robust mean and a fitness-for-purpose standard deviation, providing information about how well results meet intended uses. 3) Evaluating test materials for sufficient homogeneity and stability by applying robust statistics, which are better suited for datasets that may contain outliers or stragglers.

Uploaded by

Eman Yahia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views49 pages

Stats and Data Analysis

Uploaded by

Eman Yahia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

Statistics and Data Analysis

in
Proficiency Testing
Michael Thompson
School of Biological and Chemical Sciences
Birkbeck College (University of London)
Malet Street
London WC1E 7HX
[email protected]
Organisation of a proficiency test

“Harmonised Protocol”. Pure Appl Chem. 2006, 78, 145-196.

Where do we use statistics in
proficiency testing?

• Finding a consensus and its uncertainty to

use as an assigned value
• Assessing participants’ results
• Assessing the efficacy of the PT scheme
• Testing for sufficient homogeneity and
stability of the distributed test material
• Others
Criteria for an ideal scoring
method

• Adds value to raw results.

• Easily understandable, based on the
properties of the normal distribution.
• Has no arbitrary scaling transformation.
• Is transferable between different
concentrations, analytes, matrices, and
measurement principles.
How can we construct a score?
• An obvious idea is to utilise the properties
of the normal distribution to interpret the
results of a proficiency test.
BUT…
We do not make
any assumptions
about the actual
data.
Example dataset A
• Determination of protein nitrogen in a meat
product.
A weak scoring method

z 
x x s

x 2.126
s 0.077
• On average, slightly more than 95% of laboratories
receive z-score within the range ±2.
Robust mean and standard
deviation

ˆrob , 
ˆrob
• Robust statistics is applicable to datasets that
look like normally distributed samples
contaminated with outliers and stragglers (i.e.,
unimodal and roughly symmetric.
• The method downweights the otherwise large
influence of outliers and stragglers on the
estimates.
• It models the central ‘reliable’ part of the dataset.
Can I use robust estimates?

Skewed

Bimodal

Heavy-tailed

Measurement axis
x T 
x1 x 2  xn 
Huber’s H15
Set 1 k 2, p 0, 
ˆ0 median, 0 
ˆ 1.5 MAD

xi if 
ˆp kˆp xi 
ˆp k
ˆp
~ 
xi 
ˆ p kp
ˆ if xi ˆp kˆp
ˆ
p k ˆp if xi 
ˆp kˆp


ˆ mean ( ~
xi )
p
1

ˆ2p 1 f ( k ) var( ~
 xi )

If not converged, p p 1
References: robust statistics

• Analytical Methods Committee,

Analyst,1989, 114, 1489
• AMC Technical Brief No 6, 2001
(download from www/rsc.org/amc)
• P J Rousseeuw, J. Chemomet, 1991, 5, 1.
Is that enough?

z 
x 
ˆrob 
ˆrob


ˆrob 2.128


ˆrob 0.048

• On average, slightly less than 95% of

laboratories receive a z-score between ±2.
What more do we need?
• We need a method that evaluates the data
in relation to its intended use, rather than
merely describing it.
• This adds value to the data rather than
simply summarising it.
• The method is based on fitness for
purpose.
Fitness for purpose

• Fitness for purpose occurs when the uncertainty

of the result uf gives best value for money.
• If the uncertainty is smaller than uf , the analysis
may be too expensive.
• If the uncertainty is larger than uf , the cost and
the probability of a mistaken decision will rise.
Fitness for purpose
• The value of uf can sometimes be estimated
objectively by decision theoretic methods, but is
most often simply agreed between the
laboratory and the customer by professional
judgement.
• In the proficiency test context, uf should be
determined by the scheme provider.

Reference: T Fearn, S A Fisher, M Thompson,

and S L R Ellison, Analyst, 2002, 127, 818-824.
A score that meets all of the
criteria
• If we now define a z-score thus:
z 
x 
ˆrob p where p u f
we have a z-score that is both robustified against
extreme values and tells us something about fitness
for purpose.
• In an exactly compliant laboratory, scores of 2<|z|<3
will be encountered occasionally, and scores of |z|>3
rarely. Better performers will receive fewer of these
extreme z-scores.
Example data A again
• Suppose that the fitness for purpose criterion set
for the analysis is an RSD of 1%. This gives us:
p 0.01 2.1 0.021
Finding a consensus from
participants’ results

• The consensus is not theoretically the best

option for the assigned value but is usually
the only practicable value.
• The consensus is not necessarily identical
with the true value. PT providers have to
be alert to this possibility.
What is a ‘consensus’?
• Mean? - easy to calculate, but affected by
outliers and asymmetry.
• Robust mean? - fairly easy to calculate, handles
outliers but affected by asymmetry.
• Median? - easy to calculate, more robust for
asymmetric distributions, but larger standard
error than robust mean.
• Mode? - intuitively good, difficult to define,
difficult to calculate.
The robust mean as consensus

• The robust mean provides a useful consensus

in the great majority of instances, where the
underlying distribution is roughly symmetric
and there are 0-10% outliers.
• The uncertainty of this consensus can be
safely taken as

u
xa 
ˆrob n
When can I use robust estimates?

Skewed

Bimodal

Heavy-tailed

Measurement axis
Skewed distributions
• Skews can arise when the participants’
results come from two or more
inconsistent methods.
• They can also arise as an artefact at low
concentrations of analyte as a result of
data recording practice.
• Rarely, skews can arise when the
distribution is truly lognormal.
Possible use of a trimmed data
set?
Can I use the mode?
How many modes? Where are they?
The normal kernel density for
identifying a mode
n
x xi 
y  
1

nh i 1  h 
where Φis the standard normal density,
exp( a / 2)
2
(a) 
2

AMC Technical Brief No. 4

A normal kernel
A kernel density
Another kernel density
Graphical representation of sample data
Kernel density of the aflatoxin data
Uncertainty of the mode
• The uncertainty of the consensus can be
estimated as the standard error of the
mode by applying the bootstrap to the
procedure.
• The bootstrap is a general procedure
based on resampling for estimating
standard errors of complex statistics.
• Reference: Bump-hunting for the proficiency
tester – searching for multimodality. P J
Lowthian and M Thompson, Analyst, 2002,127,
1359-1364.
The normal mixture model
m m
f ( y ) p j f j ( y ), p j 1
j 1 j
1

exp( ( y j ) / 2 2 2
f j ( y) 
2

AMC Technical Brief No 23, and AMC Software.

Thompson, Acc Qual Assur, 2006, 10, 501-505.
Mixture models found by the maximum
likelihood method (the EM algorithm)
• The M-step
n
pˆ j   Pˆ( j y i ) / n
i 1
n n
̂j  y i Pˆ
( j yi ) Pˆ
( j yi )
i 1 i 1

2
n m

j
ˆ
1i 
1

2 ˆ
  ( yi j ) P( j yi )
ˆ Pˆ( j y )i

• The E-step
m
Pˆ
( j yi ) pˆj f j ( yi ) pˆj f j ( yi )
j
1
Kernel density and fit of 2-component
normal mixture model
Kernel density and variance-inflated
mixture model
Useful References
• Mixture models
M Thompson. Accred Qual Assur. 2006, 10, 501-505.
AMC Technical Brief No. 23, 2006. www/rsc.org/amc

• Kernel densities
B W Silverman, Density estimation for statistics and data
analysis. Chapman and Hall, London, 1986.
AMC Technical Brief, no. 4, 2001 www/rsc.org/amc

• The bootstrap
B Efron and R J Tibshirani, An introduction to the
bootstrap. Chapman and Hall, London, 1993
AMC Technical Brief, No. 8, 2001 www/rsc.org/amc
Conclusions—scoring

• Use z-scores based on fitness for

purpose.
• Estimate the consensus as the robust
mean and its uncertainty as  ˆrob n
if the dataset is roughly symmetric.
• If the dataset is skewed and plausibly
composite, use kernel density methods
or mixture models
Homogeneity testing
• Comminute and mix bulk material.
• Split into distribution units.
• Select m>10 distribution units at random.
• Homogenise each one.
• Analyse 2 test portions from each in
random order, with high precision, and
conduct one-way analysis of variance on
results.
Design for homogeneity testing

MSB MSW
san  MSW , ssam 
2
Problems with simple ANOVA
based on testing
H 0 : sam 0
• Analytical precision too low—method
cannot detect consequential degree of
heterogeneity.

• Analytical precision too high—method

finds significant degree of heterogeneity
that may not be consequential.

(Everything is heterogeneous!)
“Sufficient homogeneity”:
original definition

• Material passes homogeneity test if

ssam L 0.3p

• Problems are:
– ssam may not be well estimated;
– too big a probability of rejecting
satisfactory test material.
Fearn test
• Test H 0 : sam
2
L
2
by rejecting when

2
ssam 
L m 1
2 2

2
san Fm1,m 1
m 1 2

Ref: Analyst, 2001, 127, 1359-1364.

Problems with homogeneity
data
• Problems with data are common:
e.g., no proper randomisation, insufficient
precision, biases, trends, steps,
insufficient significant figures recorded,
outliers.
• Laboratories need detailed instructions.
• Data need careful scrutiny before
statistics.
• HP1 is incorrect in saying that all outlying
data should be retained.
General references
• The Harmonised Protocol (revised)
M Thompson, S L R Ellison and R Wood
Pure Appl. Chem., 2006, 78, 145-196.
• R E Lawn, M Thompson and R F Walker,
Proficiency testing in analytical chemistry. The
Royal Society of Chemistry, Cambridge, 1997.
• ISO Guide 43. International Standards
Organisation, Geneva, 1997.

Develop A Quality by Design Approach For Analytical Method Development
No ratings yet
Develop A Quality by Design Approach For Analytical Method Development
104 pages
Analytical Chem
No ratings yet
Analytical Chem
188 pages
Punch Inspection
No ratings yet
Punch Inspection
5 pages
Unit V Statistical Data Analysis
No ratings yet
Unit V Statistical Data Analysis
72 pages
Useful Statistics Pre Course Guide V2.4
No ratings yet
Useful Statistics Pre Course Guide V2.4
40 pages
Iso 13528
No ratings yet
Iso 13528
65 pages
L2 - Statistical Measurement Sig - Figure
No ratings yet
L2 - Statistical Measurement Sig - Figure
22 pages
Leveraging Lookups and Subsearches
100% (2)
Leveraging Lookups and Subsearches
72 pages
Xử Lý Số Liệu Trong Phân Tích Dược - 01
No ratings yet
Xử Lý Số Liệu Trong Phân Tích Dược - 01
109 pages
1 - Introduction To Analytical Chemistry LAB
No ratings yet
1 - Introduction To Analytical Chemistry LAB
9 pages
Review of Basic Concepts/Solutions and Their Concentrations
No ratings yet
Review of Basic Concepts/Solutions and Their Concentrations
109 pages
Professional Education - Drill 6 - Part 1
100% (2)
Professional Education - Drill 6 - Part 1
4 pages
Assumption - 16 - Oct18
No ratings yet
Assumption - 16 - Oct18
48 pages
Starting Points in Data Analysis: January 21, 2020
No ratings yet
Starting Points in Data Analysis: January 21, 2020
32 pages
APP601S Chapter 4 - Data Handling in Anal Chem
No ratings yet
APP601S Chapter 4 - Data Handling in Anal Chem
42 pages
Chap 2
No ratings yet
Chap 2
15 pages
CC Quality Control
No ratings yet
CC Quality Control
8 pages
1039 Chemometrics
No ratings yet
1039 Chemometrics
19 pages
Chapter 2 - Data Analysis II
No ratings yet
Chapter 2 - Data Analysis II
56 pages
Brochure - Titrator - T50
No ratings yet
Brochure - Titrator - T50
2 pages
Quality Control and Reference Values
No ratings yet
Quality Control and Reference Values
38 pages
Notes On Statistics and Data Quality For Analytical Chemists
No ratings yet
Notes On Statistics and Data Quality For Analytical Chemists
6 pages
5.Evaluation of Analytical Data-đã chuyển đổi
No ratings yet
5.Evaluation of Analytical Data-đã chuyển đổi
25 pages
đánh giá dữ liệu phân tích
No ratings yet
đánh giá dữ liệu phân tích
14 pages
Part2 Statistics
No ratings yet
Part2 Statistics
55 pages
Inicial Apu Aps - 231107 - 232152
No ratings yet
Inicial Apu Aps - 231107 - 232152
138 pages
11.a C Nalysis Strategy - 04 - 06 - 2023
No ratings yet
11.a C Nalysis Strategy - 04 - 06 - 2023
20 pages
RMP470S Lecture 7 - One-Dimensionalstatistics
No ratings yet
RMP470S Lecture 7 - One-Dimensionalstatistics
27 pages
Air Master Catalog
100% (2)
Air Master Catalog
191 pages
Basic Statistical Techniques in Analytical Chemistry Presentation.
No ratings yet
Basic Statistical Techniques in Analytical Chemistry Presentation.
31 pages
PSY417 Week02
No ratings yet
PSY417 Week02
38 pages
Clinical Chemistry Quality Control, Quality Assessment and Statistics 2
No ratings yet
Clinical Chemistry Quality Control, Quality Assessment and Statistics 2
50 pages
Unit 4
No ratings yet
Unit 4
14 pages
APP601S Chapter 4 - Data Handling in Analytical Chem
No ratings yet
APP601S Chapter 4 - Data Handling in Analytical Chem
42 pages
Database Design Using Entity-Relationship Diagrams (3rd Edition, CRC Press) Sikha Saha Bagui Download
No ratings yet
Database Design Using Entity-Relationship Diagrams (3rd Edition, CRC Press) Sikha Saha Bagui Download
53 pages
Data Comes in Different Formats Time Histograms Lists But . Can Contain The Same Information About Quality
No ratings yet
Data Comes in Different Formats Time Histograms Lists But . Can Contain The Same Information About Quality
64 pages
Medical Statistics New
No ratings yet
Medical Statistics New
46 pages
Process Structure and SPC
No ratings yet
Process Structure and SPC
50 pages
Lecture 1 - Data Analysis
No ratings yet
Lecture 1 - Data Analysis
28 pages
Chapt 6 - Statistic Data
No ratings yet
Chapt 6 - Statistic Data
82 pages
Bif601 Final Term Handouts
No ratings yet
Bif601 Final Term Handouts
18 pages
6600 Chapter Summaries SKELETON
No ratings yet
6600 Chapter Summaries SKELETON
20 pages
Top 10 Statistical Analysis Topics Based On Your Data and Requirements
No ratings yet
Top 10 Statistical Analysis Topics Based On Your Data and Requirements
7 pages
Grade 5 Math Bow Q1
No ratings yet
Grade 5 Math Bow Q1
4 pages
One Dimensional Statistics
No ratings yet
One Dimensional Statistics
21 pages
Week 11
No ratings yet
Week 11
22 pages
AMC TB206 tcm18-25922
No ratings yet
AMC TB206 tcm18-25922
2 pages
JR Inter Maths 1A AP EM 01022025
No ratings yet
JR Inter Maths 1A AP EM 01022025
11 pages
BSChem-Statistics in Chemical Analysis PDF
No ratings yet
BSChem-Statistics in Chemical Analysis PDF
6 pages
Stats Notes
No ratings yet
Stats Notes
16 pages
Infineon-AN50987 Getting Started With I2C in PSoC 1-ApplicationNotes-V07 00-En
No ratings yet
Infineon-AN50987 Getting Started With I2C in PSoC 1-ApplicationNotes-V07 00-En
28 pages
Minitab 16: ANOVA, Normality, Tukey, Control Charts
No ratings yet
Minitab 16: ANOVA, Normality, Tukey, Control Charts
63 pages
Reliability Distribution 1
No ratings yet
Reliability Distribution 1
41 pages
Guideline For Final Year Project - Research Supervision: Faculty of Business, Accountancy and Management
No ratings yet
Guideline For Final Year Project - Research Supervision: Faculty of Business, Accountancy and Management
71 pages
Statistical Data Treatment: - Part 1 (Manual Calculations)
No ratings yet
Statistical Data Treatment: - Part 1 (Manual Calculations)
51 pages
Estimation: Large Characteristic of A Population Based On Its Sample
No ratings yet
Estimation: Large Characteristic of A Population Based On Its Sample
19 pages
Week 8
No ratings yet
Week 8
13 pages
Prof Test
No ratings yet
Prof Test
29 pages
A Critical View On ISO Standard 13528: Wim Coucke EQALM Symposium, Dublin, 20th of October 2017
No ratings yet
A Critical View On ISO Standard 13528: Wim Coucke EQALM Symposium, Dublin, 20th of October 2017
22 pages
Assembly Procedure 24M
No ratings yet
Assembly Procedure 24M
21 pages
WK 8 - Method Evaluation and Quality Control PDF
No ratings yet
WK 8 - Method Evaluation and Quality Control PDF
8 pages
24 25ost
No ratings yet
24 25ost
4 pages
Combine PT - MU
No ratings yet
Combine PT - MU
2 pages
Lab 4 .
No ratings yet
Lab 4 .
6 pages
Elog Guide: December 2006
No ratings yet
Elog Guide: December 2006
55 pages
DWDM
No ratings yet
DWDM
18 pages
Statistical Analysis
No ratings yet
Statistical Analysis
16 pages
Optimal Capital Allocation
No ratings yet
Optimal Capital Allocation
37 pages
Data Screening and Psychometrics
No ratings yet
Data Screening and Psychometrics
7 pages
Fluid Power - 2
No ratings yet
Fluid Power - 2
11 pages
Moisture in The Atmosphere
No ratings yet
Moisture in The Atmosphere
43 pages
Outline and Equation Sheet For M E 345: Every Additive Term in An Equation Must Have The Same Dimensions
No ratings yet
Outline and Equation Sheet For M E 345: Every Additive Term in An Equation Must Have The Same Dimensions
7 pages
Proficiency Testing Technical Brief 18A - tcm18 214885
No ratings yet
Proficiency Testing Technical Brief 18A - tcm18 214885
2 pages
Stakeholder
No ratings yet
Stakeholder
9 pages
High Temperature Scale
No ratings yet
High Temperature Scale
51 pages
Notes 1
No ratings yet
Notes 1
76 pages
Market Structure
No ratings yet
Market Structure
14 pages
2018 Howland Et Al. Quantifying The Effects of Erosion On Archaeological Sites With Low-Altitude Aerial Photography, Structure From Motion, and GIS
No ratings yet
2018 Howland Et Al. Quantifying The Effects of Erosion On Archaeological Sites With Low-Altitude Aerial Photography, Structure From Motion, and GIS
9 pages
Sewing Symbols in Tailoring
No ratings yet
Sewing Symbols in Tailoring
12 pages
Effective Crisis Management Academyof Management Executive
No ratings yet
Effective Crisis Management Academyof Management Executive
12 pages
(09.10.30) Dsme H-4453-4 Magnetic Compass (Final)
No ratings yet
(09.10.30) Dsme H-4453-4 Magnetic Compass (Final)
18 pages
Third Year Civil Engg. 3rd Year Scheme Syllabus 2018-19 PDF
No ratings yet
Third Year Civil Engg. 3rd Year Scheme Syllabus 2018-19 PDF
24 pages
RES320 - Preisinger, Carrie FINAL EXAM
100% (1)
RES320 - Preisinger, Carrie FINAL EXAM
5 pages
2022 EMDAT Report
No ratings yet
2022 EMDAT Report
8 pages
LGC Proficiency Testing Catalogue 201
No ratings yet
LGC Proficiency Testing Catalogue 201
1 page
Fluid Focus Lens PDF
No ratings yet
Fluid Focus Lens PDF
25 pages
IEEE 2010-Libre
No ratings yet
IEEE 2010-Libre
6 pages
Grinding System and Circuit of VRM Process Data Plant Data
67% (6)
Grinding System and Circuit of VRM Process Data Plant Data
58 pages
Community Resilience - Responding To and Recovering From Disasters Together
No ratings yet
Community Resilience - Responding To and Recovering From Disasters Together
5 pages
الكروت
No ratings yet
الكروت
2 pages
Disaster Recovery Via Social Capital: News & Views
No ratings yet
Disaster Recovery Via Social Capital: News & Views
2 pages
4 Discuss The Concept of Community-Based Disaster Management and Highlight Its Principles and Challen
No ratings yet
4 Discuss The Concept of Community-Based Disaster Management and Highlight Its Principles and Challen
2 pages
Beneficiation of Ajabanoko Iron Ore Deposit, Kogi State, Nigeria Using Magnetic Methods
No ratings yet
Beneficiation of Ajabanoko Iron Ore Deposit, Kogi State, Nigeria Using Magnetic Methods
3 pages
Maintenance Schedules / Maintenance Parts
100% (1)
Maintenance Schedules / Maintenance Parts
29 pages
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Exercises of Statistical Inference
From Everand
Exercises of Statistical Inference
Simone Malacrida
No ratings yet

Stats and Data Analysis

Uploaded by

Stats and Data Analysis

Uploaded by

Statistics and Data Analysis

“Harmonised Protocol”. Pure Appl Chem. 2006, 78, 145-196.

• Finding a consensus and its uncertainty to

• Adds value to raw results.

• Analytical Methods Committee,

• On average, slightly less than 95% of

• Fitness for purpose occurs when the uncertainty

Reference: T Fearn, S A Fisher, M Thompson,

• The consensus is not theoretically the best

• The robust mean provides a useful consensus

AMC Technical Brief No. 4

AMC Technical Brief No 23, and AMC Software.

• Use z-scores based on fitness for

• Analytical precision too high—method

• Material passes homogeneity test if

ssam L 0.3p

Ref: Analyst, 2001, 127, 1359-1364.

You might also like