Week 7

The document discusses scientific measurement techniques, particularly focusing on small data sizes where the t-distribution is applicable instead of the normal distribution. It provides an example of calculating the t-value to determine if a particle's mass is less than a specified value using limited measurements. Additionally, it explains the construction of box-and-whisker plots to visualize data spread and identify outliers.

Uploaded by

rp21ms106


Elements of Scientific Measurement

(continued)

1 When the data size is small

The above procedures apply to situations where the data size is reasonably
large (at least 25), without which the Central Limit Theorem would not be
applicable. But there are situations in which it is difficult (or very expensive)
to obtain many data points. What to do in such cases?
We have seen earlier that if the number of samples is sufficiently large,
the sampling distribution of the mean follows a normal distribution. In that
case, we defined a quantity
$$ z = \frac{\bar{x} - \mu}{\sigma_{\bar{x}}} = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}, $$

which followed a normal distribution. Then we used the z-table to obtain
the probability of getting a z value at least that large (or that small).
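
As a rough illustration of the z-table lookup, the sketch below computes z and the corresponding tail probability with Python's standard library. The numbers (n, x_bar, mu, sigma) are hypothetical values chosen only for this illustration; they do not come from the notes.

```python
import math
from statistics import NormalDist

# Hypothetical illustrative values (not from the notes).
n = 36          # number of data points, large enough for the normal approximation
x_bar = 49.2    # sample mean
mu = 50.0       # assumed population mean
sigma = 2.4     # population standard deviation

# z = (x_bar - mu) / (sigma / sqrt(n))
z = (x_bar - mu) / (sigma / math.sqrt(n))

# Probability of getting a z value at least this small (lower tail),
# which is what one would otherwise read off the z-table.
p_lower = NormalDist().cdf(z)

print(f"z = {z:.2f}, P(Z <= z) = {p_lower:.4f}")
```

For an upper-tail question (a z value at least that large), one would use 1 - NormalDist().cdf(z) instead.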
Where the data size n is small, the sampling distribution of the mean
would not follow a normal distribution. But it follows a different distribution,
which is called the t-distribution, whose characteristics can then be used to
derive meaningful results. In this case, the quantity t is defined the same
way:
$$ t = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} $$

Then one can use the t-table to derive similar conclusions.

Let us illustrate this with an example.
Example 1: A scientist could take only 9 measurements on the mass of a
particle, and the measured values were 16.2, 19.7, 21.8, 15.6, 19.0, 18.7, 16.9,
21.7 and 20.2 (in suitable units). Do the data provide sufficient evidence to
say that the mass of the particle is less than 21? Here “sufficient evidence”
implies that the probability that the statement is wrong is less than 0.01 or
1%.
Solution: From the data, we find that the sample mean is x̄ = 18.87, and
the sample standard deviation is s = 2.2583. The prediction we have to test
is that the population mean µ < 21. This is the same as checking the odds
of getting x̄ = 18.87 or below if the value of µ were 21. So our approach will
be to assume µ = 21 and to check the probability of getting x̄ = 18.87 or
below. If the probability is less than 0.01, there will be less than 1% chance
of making an error.

Using the data, we get
$$ t = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} \approx \frac{\bar{x} - \mu}{s / \sqrt{n}} = \frac{18.87 - 21}{2.2583 / \sqrt{9}} = -2.83 $$

We need to look at the t-table in Table 1 to locate the threshold value
of t that has a significance level of 1%. The columns are arranged according
to the “significance level” (which is the area under the t-distribution curve
beyond that value of t). In this case, we are looking for a 1% significance
level in one tail, since we are concerned only with values of x̄ below µ.
The rows are arranged according to the degree of freedom (DoF), which
is one less than the number of data points, i.e., n − 1. Here the number of
data points is 9. Therefore the degree of freedom is n − 1 = 8. For this
degree of freedom and a one-tailed significance level of 1%, we find t = 2.896.
(The entry 3.355 for 8 degrees of freedom corresponds to 1% shared between
the two tails, i.e., 0.5% in each tail.) Therefore the probability of getting
a t value higher than 2.896 is 1%. Since the t-distribution is symmetrical
about zero, the probability of getting a t value below −2.896 is also 1%.
The value of t we got in our case is −2.83, which is above −2.896. This
implies that if the mean is µ = 21, the probability of getting x̄ = 18.87 or
lower is more than 1%. Thus, from the data, if we state that the population
mean µ < 21, there will be more than 1% chance of committing an error. The
data therefore do not provide sufficient evidence at the 1% level. □
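
The calculation in Example 1 can be checked numerically. The sketch below is not part of the original notes: it recomputes x̄, s and t from the nine measurements and, instead of a table lookup, estimates the probability of a t value at or below the computed one by integrating the t-distribution density with Simpson's rule. The helper names t_pdf and t_cdf are made up for this illustration.

```python
import math
import statistics


def t_pdf(x, dof):
    """Density of Student's t-distribution with `dof` degrees of freedom."""
    coeff = math.gamma((dof + 1) / 2) / (math.sqrt(dof * math.pi) * math.gamma(dof / 2))
    return coeff * (1 + x * x / dof) ** (-(dof + 1) / 2)


def t_cdf(t, dof, steps=2000):
    """P(T <= t): Simpson's rule on [0, |t|] plus symmetry about zero.

    `steps` must be even for Simpson's rule.
    """
    a = abs(t)
    h = a / steps
    total = t_pdf(0.0, dof) + t_pdf(a, dof)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, dof)
    area = total * h / 3  # area under the density between 0 and |t|
    return 0.5 + area if t >= 0 else 0.5 - area


masses = [16.2, 19.7, 21.8, 15.6, 19.0, 18.7, 16.9, 21.7, 20.2]
mu0 = 21.0                       # value of the population mean assumed for the test
n = len(masses)

x_bar = statistics.mean(masses)  # sample mean
s = statistics.stdev(masses)     # sample standard deviation (divides by n - 1)
t_value = (x_bar - mu0) / (s / math.sqrt(n))
p_value = t_cdf(t_value, n - 1)  # probability of t at or below t_value if mu = mu0

print(f"mean = {x_bar:.2f}, s = {s:.4f}, t = {t_value:.2f}, P(T <= t) = {p_value:.4f}")
print("sufficient evidence at 1%" if p_value < 0.01 else "not sufficient evidence at 1%")
```

Run as a script, this should report t close to −2.83 and a one-sided probability a little above 0.01, in line with the conclusion that the chance of error exceeds 1%.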

2 Box and whisker plots

You may notice that a plot of the experimental results showing the error bars
does not give the information about the spread of the data obtained. In some
applications where such information is essential, one prefers a different way
of presenting the results. This is called a ‘box-and-whisker’ plot; a typical
representation is shown in Fig. 1.

Figure 1: A typical box and whisker plot (axes labelled ‘Parameter’ and ‘Variable’)

In producing such a plot, the data are first arranged in ascending order.
The minimum value and the maximum value thus obtained give the
extremities of the ‘whiskers’ of the plot. Then one has to obtain the median,
which is simply the middle value. If the number of data points is odd,
the middle number is easy to identify. If there are an even number of data
points, two numbers will appear in the middle, and one has to take the mean
of these two numbers. This median gives the mid-point of the plot, called
the second quartile, or Q2 (see Fig. 2).

Figure 2: The ranges in a box and whisker plot (marked, from left to right:
Minimum, Q1, Q2 (Median), Q3, Maximum; the box from Q1 to Q3 spans the
interquartile range)

Then one has to obtain the median of the data points below Q2. That
gives another value, called the first quartile or Q1. Similarly, one obtains
the median of the data points above Q2, which gives the third quartile, or
Q3. The range between Q1 and Q3 is called the interquartile range (IQR),
which is plotted as a box. The range between the minimum and Q1, and
that between Q3 and the maximum, are each plotted as a ‘whisker’. Therefore, the
representation of a typical data set would look like Fig. 2. One characteristic
feature of such a plot is that about 25% of the data lie in each of the four ranges
shown in the plot.
Sometimes one gets some data points that lie far outside the natural
range of the data. These are called ‘outliers’. The box plot also enables
one to identify and present the outliers. The usual convention is that data
points lying more than 1.5 times the interquartile range beyond either end of
the box are called outliers. Therefore, one can identify the ‘reasonable’ range
of the data as that between (Q1 − 1.5 × IQR) and (Q3 + 1.5 × IQR), and any data
point falling outside this range may be suspected to be an outlier (see footnote 1).
Such outliers may result from experimental or observational errors but
may also result from some phenomenon not yet discovered. That is why one
cannot simply ignore an outlier or delete it from a data set. Outliers have to
be faithfully presented in a research paper, though you may ignore these in
further analysis of the data.
Footnote 1: Before we conclude that such a point is indeed an outlier, some more
tests would be required.

Example 2:
Consider the following data set:
17.2, 15.9, 16.7, 18.3, 15.0, 19.3, 20.2, 16.3, 17.9, 15.3, 10.1, 19.1, 18.2
Obtain the box and whisker plot.
Solution:
Arranging the data in ascending order, we get
10.1, 15.0, 15.3, 15.9, 16.3, 16.7, 17.2, 17.9, 18.2, 18.3, 19.1, 19.3, 20.2
It has 13 data points, which is an odd number. Hence, the 7th data point,
17.2, is the median.
There are 6 data points on each side of the median, which is an even
number. So we get Q1 by taking the mean of the 3rd and 4th entries, which
gives Q1 = 15.6. Similarly, we get Q3 as the mean of the 10th and 11th entries,
which gives Q3 = 18.7.

Therefore, the box and whisker plot becomes as shown in Fig. 3.

Figure 3: The box and whisker plot for the whole data set given in the
Example (marked values: 10.1, 15.6, 17.2, 18.7, 20.2; axis from 10 to 20)

Now let us see if any data point can be identified as an outlier. The IQR is
18.7 − 15.6 = 3.1. Going below the lowest point of the box by 1.5×IQR gives
10.95. We see that there is one data point below that value. Therefore we
can suspect that this point is an outlier and set the end of the whisker at the
last data point above 10.95. This value is 15.0. Going above Q3 by 1.5×IQR
gives 23.35. This is above the highest point of the data set. Thus there is no
outlier on the higher side. The resulting plot, excluding the outlier, is shown
in Fig. 4.
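
The steps of this example can be reproduced with a few lines of Python. This is only a sketch using the standard library, following the median-of-halves rule and the 1.5 × IQR convention described above; the variable names are chosen for this illustration.

```python
import statistics

data = [17.2, 15.9, 16.7, 18.3, 15.0, 19.3, 20.2,
        16.3, 17.9, 15.3, 10.1, 19.1, 18.2]

ordered = sorted(data)
n = len(ordered)
half = n // 2

q2 = statistics.median(ordered)          # median (second quartile)
lower = ordered[:half]                   # data points below the median
# For odd n the median itself is left out of both halves.
upper = ordered[half + 1:] if n % 2 else ordered[half:]
q1 = statistics.median(lower)            # first quartile
q3 = statistics.median(upper)            # third quartile

iqr = q3 - q1
low_limit = q1 - 1.5 * iqr               # lower end of the 'reasonable' range
high_limit = q3 + 1.5 * iqr              # upper end of the 'reasonable' range

# Points outside the limits are suspected outliers; the whiskers end at
# the extreme data points that still lie inside the limits.
outliers = [x for x in ordered if x < low_limit or x > high_limit]
inside = [x for x in ordered if low_limit <= x <= high_limit]
whisker_low, whisker_high = inside[0], inside[-1]

print(f"Q1 = {q1:.1f}, Q2 = {q2:.1f}, Q3 = {q3:.1f}, IQR = {iqr:.1f}")
print(f"limits: {low_limit:.2f} to {high_limit:.2f}")
print(f"suspected outliers: {outliers}, whiskers: {whisker_low} to {whisker_high}")
```

Note that library routines for quartiles may use interpolation rules that differ from the median-of-halves rule used here, so their Q1 and Q3 can differ slightly for the same data.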

Figure 4: The box and whisker plot excluding the outlier (marked values:
15.0, 15.6, 17.2, 18.7, 20.2; the outlier at 10.1 is shown separately; axis
from 10 to 20)

Table 1: The t-table.
