Basic Terms in Statistical Calculations
Basic Terms in Statistical Calculations
Statistics
Definition
Example
A chemist analyzes a sample of water for lead content using atomic absorption
spectroscopy. The results are:
Using statistical methods, the chemist calculates the mean (10.5 ppm) and standard
deviation (0.3 ppm) of the results. This information helps the chemist understand the
accuracy and precision of the measurements.
Chemometrics
Definition
Example
Definition
Error in analytical chemistry refers to the difference between the measured value and the
true value of a sample's composition or property.
Types of error
Also called non random error and occur when something is wrong with measurements.
Scheme of experiment
Skillful handling
Random error
Often called accidental error which represents the experimental uncertainty that causes in
any measurement
Mean
- Explanation: The mean is calculated by adding up all the values and dividing by the
number of values.
- Example: A set of exam scores is 80, 70, 90, 85, 75. The mean score is (80 + 70 + 90 + 85 +
75) / 5 = 80.
Significance
It represents the central or typical value of a dataset.
It is used to summarize large amounts of data with a single number.
It’s useful in comparing different datasets or groups.
It helps in making predictions when data is symmetrically distributed.
Mode
- Explanation: The mode is the value that appears most often in the dataset.
- Example: A set of favorite colors is red, blue, red, green, red, blue. The mode is red
because it appears most frequently.
Median
- Explanation: The median is the middle value when the data is arranged in order from
smallest to largest.
- Example:
- A set of exam scores is 70, 75, 80, 85, 90. The median score is 80 because it is the middle
value.
When dealing with an even number of values, the median is calculated as the average of
the two middle values.
Example
Key differences:
Precision
It refers to the degree of agreement among the replicate measurements of the same
quantity.
Analyst 1
2.63
2.62
2.62
More precision
Analyst 2
2.60
2.71
2.65
Less presion
Accuracy
It refers to the difference between the measured value and the ture value( accepted value)
Deviation
- *Definition*: Deviation is the difference between an individual data point and the mean of
the dataset.
- *Mathematical Example*: Consider a dataset with values 2, 4, 6, 8, 10. The mean is 6. The
deviation of the value 4 is 4 – 6 = -2.
Standard Deviation
1. Calculate the deviations: (2-6) = -4, (4-6) = -2, (6-6) = 0, (8-6) = 2, (10-6) = 4
5. Calculate the standard deviation: √8 ≈ 2.83 1. A low SD means most values are close to the
mean (less variation).
The standard deviation is approximately 2.83. 2. A high SD means values are more spread out from
the mean (more variation).
3. It helps to understand the consistency of the data.
Variance 4. It is essential in probability and inferential
statistics, especially in normal distribution.
Definition
Mathematical Example
1. Calculate the deviations: (2-6) = -4, (4-6) = -2, (6-6) = 0, (8-6) = 2, (10-6) = 4
The variance is 8.
Definition
The Relative Standard Deviation (RSD) or Coefficient of Variation (CV) is a measure of the
relative spread of a dataset. It is calculated as the ratio of the standard deviation to the
mean, expressed as a percentage.
Mathematical Example
Consider a dataset with values 2, 4, 6, 8, 10. The mean is 6 and the standard deviation is
2.83.
Interpretation
The RSD or CV is used to compare the variability of different datasets. A lower RSD or CV
indicates less variability, while a higher RSD or CV indicates more variability.