Statistics - Review 26 Oct 2022
Statistics - Review 26 Oct 2022
Arrive on time
Bring a functional calculator (that is approved by Exam Secretary)
Use 5-10 minutes to check all questions; you are allowed to ask questions
related to the examination paper only in the first 30 minutes
Reserve 10 minutes for checking your answers at the end of the examination
1 minute for 1 mark in answering questions
Uncertainties in
Numerical Computations: Calibration curve:
intensity
Mass Volume d3
d2
𝑑𝑑12 + 𝑑𝑑22 + 𝑑𝑑32 + 𝑑𝑑42
d1 = minimum
uncertainty
concentration
Computed density = ? ± ? g/mL Conc. = ? ± ? M
Topic Chapter
Errors in chemical analysis: systematic and random errors 5, 6
Statistical data treatment and evaluation: confidence intervals, t-
7
test, F-test, Q-test
Quality assurance: sampling, standardization, and calibration 8
Chapter 6D, Skoog and West’s Fundamentals of Analytical Chemistry, 9th edition, Brooks/Cole (2014)
Chem 2241
Take-home message: Tools of Analytical Chemistry
2016 CHEM2241 8
Chemical analysis may involve several steps. The random error of
each step contributes to the overall error of the measurement.
2016 CHEM2241 9
Example of error propagation
(Mixed operations)
2016 CHEM2241 10
Applications of statistics to Analytical Chemistry
1. Establish confidence limits
2. Determine if two means differ (t test)
3. Calibration curves (least-squares methods)
4. Determine if precision of two sets of
measurements differ (F test)
5. Determine if an outlier can be discarded (Q test)
2016 CHEM2241 11
Statistics: some basic…..
Gaussian Distribution
µ = population mean (average)
−(𝑥𝑥−𝜇𝜇)2 /2𝜎𝜎2
𝑒𝑒
𝑦𝑦 =
𝜎𝜎 2𝜋𝜋
2016 CHEM2241 12
Gaussian distribution
𝜇𝜇 = 𝑥𝑥̅ ± 𝑧𝑧𝑧𝑧
Confidence Limits
50% Data (confidence level) that 95% confidence level that the true
the true mean μ lies within ±0.67 σ mean μ lies within ±1.96 σ.
frequency
frequency
Case II: Decide whether two sets of replicate measurements give “the same” or “different” results,
within a stated confidence level.
Case III. Comparing Individual Differences: we use two methods to make single measurements on
several different samples. Do the two methods give the same answer “within experimental error”?
Confidence Limits
50% confidence level that the true 95% confidence level that the true
mean μ lies within ±0.67 σ mean μ lies within ±1.96 σ.
(±0.67 σ = confidence interval)
For a small set of data, confidence limits of the mean are given by
For N measurements, confidence limit CL for µ is
𝑡𝑡𝑡𝑡
𝜇𝜇 = 𝑥𝑥̅ ±
𝑁𝑁
t is used instead of z to account for larger uncertainties in the estimation of σ for a small set of data.
No confidence intervals adjustment when you know that s as a good estimate of σ even for a small sample size;
use z.
2/16/2015 CHEM2241 IKC 19
Confidence Interval
Calculating Confidence Intervals
The carbohydrate content of a glycoprotein (a protein with sugars attached to it) is found to be
12.6, 11.9, 13.0, 12.7, and 12.5 wt% (g carbohydrate/100 g glycoprotein) in replicate analyses.
Find the 50% and 90% confidence intervals for the carbohydrate content.
50% and 90% confidence intervals for the same set of random data. Filled squares are the
data points whose confidence interval does not include the true population mean of 10 000.
2016 CHEM2241 22
Case 1. Comparing a Measured Result with a “Known” Value
In a new method for determining selenourea in water the following values were obtained
Example for tap water samples spiked with 50.0 ng/ml of selenourea (Standard Reference Material
sample certified by the National Institute of Standards and Technology): 50.4, 50.7, 49.1,
49.0, 51.1 ng/ml. Is there any evidence of systematic error (Within 95% CI)?
Mean = 50.06 ;
standard deviation = 0.956
𝑥𝑥̅ − 𝜇𝜇
x = 99.93%, s = 0.022, N = 4
𝑡𝑡 = 𝑁𝑁 µ = 100.00%
𝑠𝑠
t = 6.4
Unknown sample
Concentration
y = 1450.8x + 0.0002
-
2/16/2015 R² = 0.99997 CHEM2241
Page 29IKChu
• the
2
Standardization andabout
standard deviation Calibration (Refer
the regression sr =to
S yy − m
“Fitting SStraight-Line
xx
− ( n∑ x)
b r 2
n∑
• the mean value of x the
y for
2
points.
i i