0% found this document useful (0 votes)
135 views

Harris' Chapter 4: Statistics

This document summarizes key statistical concepts covered in Chapter 4 of Harris' textbook including: the Gaussian distribution, standard deviation, confidence intervals, comparison of means using t-tests, outlier testing using Q-tests, and linear regression analysis. Examples are provided for calculating statistics like mean, standard deviation, and confidence intervals for data on the number of M&Ms in a bag. Procedures are outlined for performing t-tests to compare data sets, Q-tests to identify outliers, and using linear regression to fit a calibration curve.

Uploaded by

karolinebrito
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
135 views

Harris' Chapter 4: Statistics

This document summarizes key statistical concepts covered in Chapter 4 of Harris' textbook including: the Gaussian distribution, standard deviation, confidence intervals, comparison of means using t-tests, outlier testing using Q-tests, and linear regression analysis. Examples are provided for calculating statistics like mean, standard deviation, and confidence intervals for data on the number of M&Ms in a bag. Procedures are outlined for performing t-tests to compare data sets, Q-tests to identify outliers, and using linear regression to fit a calibration curve.

Uploaded by

karolinebrito
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 22

Harris Chapter 4: Statistics

Gaussian Distribution & Random Error s Standard Deviation & Confidence Intervals s Comparison of Means s Q-Test s Calibration Curves and Least Squares Regression
s

Click to edit Master subtitle style

The mean is sometimes called the arithmetic average. For analytical chemistry, we are normally calculating the mean of 3 or more points in a set of data
s

Reduced random or indeterminate error

The closer the mean is to the true (sometimes called accepted or known) value, the more accuracy there is

s s

Standard deviation (s or STDEV) represents how closely data is scattered about the mean. n-1 is called the degrees of freedom A smaller standard deviation means there is less variability in your data, or that your results are more precise.
s

You have better precision

Standard Deviation (s) %RSD = x 100 Mean (x)

What is the mean, standard deviation and percent relative standard deviation (%RSD) for blue M&Ms in little bags of chocolate M&Ms

The Gaussian Curve


s

A distribution of events occurring with variations that are due to random errors (indeterminate error) Used to evaluate the distribution of random variation about a mean value Key to the mathematical definitions of mean and standard deviation

Confidence Intervals
s

Utilize students t tests. These were originally developed by a brewer for Guinness. Confidence intervals allow on to specify how confident or certain they are that the true value for a set of data lies within a specific range.

s s s s

The confidence interval is labeled () s is the standard deviation n is the number of measurements t is a statistical students t value When appropriate, confidence intervals are expressed with units and are sometimes expressed in sentence form.

Confidence Intervals are a statistical tool, and a tool only! They do not supersede an analysts experience or knowledge about the analysis or sample, but can provide complimentary information. What is the 95% confidence interval for the number of M&Ms in a little bag of chocolate M&Ms?

Comparison of Data with ttests.


s

Case 1: we can just see if the known or true value falls within the confidence interval.
s

A little bag of chocolate M&Ms is supposed to have 24 candies in it? Do our results include the known value at the 90% confidence interval? Problem 4-14 as an example

Case 2: Comparison of 2 data sets.


s

s s s s

Compare 2 sets of data Calculate a standard deviation for each Calculate a t value (tcalculated) Compare tcalc to ttable (from standard table)
s

IF tcalc > ttable the values are statistically different at the indicated confidence interval or level IF tcalc < ttable the values are statistically similar at the indicated confidence level

Q-test for Outliers


s

A Q-test is used to deal with outliers (suspect or spurious) in data sets


s

It is only a statistical test, the chemist must use their experience. For example, if you know you made up 5 solutions for analysis, but you diluted #3 over the mark in the volumetric flask, then you shouldnt consider this in the mean, but perhaps discard that data. You would do this, in this case, even if the Q-test indicated otherwise!

gap Q = range

How to perform a Q-test.


s s

Write the data out in order (like on a number line) Identify the outlier in question. You can use a Qtest only to remove 1 data point from a set. Identify the gap and the range. Calculate Q (Qcalc) Qtable (table values) include the suspect point If Qcalc > Qtable then you can discard the data point at the indicated confidence level. Lets do problem 4-22.

s s s s

Linear Regression
s

Used to prepare a best fit straight line to a set of data.


Widely used for calibration curves, where we relate signal (y axis-what you measured) to concentration or amount (x axis) Signal = slope x concentration + intercept R2 is the correlation coefficient squared. The closer it is to 1, the closer the line lies to the points.
s

s s

We will largely use the regression functions in EXCEL (most calculators do linear regression too)

Lets do problem 4-29.

You might also like