1) S - Data Collection
1) S - Data Collection
Data Collection
Twitter: @Owen134866
www.mathsfreeresourcelibrary.com
Prior Knowledge Check
1) Find the mean, median, mode 3) Rebecca records the shoe size,
and range of these data sets x, of the female students in her
year. The results are in the table.
a) 1, 3, 4, 4, 6, 7, 8, 9, 11 5.89, 6, 4, 10
Number of
b) 20, 18, 17, 20, 14, 23, 19, 16
students,
18.38, 18.5, 20, 9 35 3
2) Here is a question from a 36 17
questionnaire surveying TV viewing 37 29
habits. 38 34
Overlapping category
How much TV do you watch? Nothing above 4 39 12
0-1 Hours No time period etc
1-2 Hours Find the mean shoe size.
3-4 Hours
37.37
Give 2 criticisms and write an
improved version of the question.
Teachings for Exercise 1A
Data Collection
You need to understand a
number of key statistical terms
1A
Data Collection
You need to understand a
number of key statistical terms
Advantages Disadvantages
Time consuming
Cannot be used if the sampling
Census Completely accurate result process would render the items
unusable
Processing a lot of data takes a
long time
Less time-consuming Data might not be accurate
Sample Fewer responses needed Sample might not be properly
Less data to process representative of the population
1A
Data Collection
You need to understand a number
of key statistical terms
1A
Teachings for Exercise 1B
Data Collection
You need to know about simple,
systematic and stratified
sampling
1B
Data Collection
You need to know about simple,
systematic and stratified
sampling
1B
Data Collection
You need to know about simple,
systematic and stratified
sampling
A yacht club with 100 members are They could assign each member a
listed alphabetically in the club’s number from 1-100. Then they could
membership book. The committee
wants to take a sample of 12 generate 12 numbers and choose the
members to fill in a questionnaire. members that were assigned those
numbers.
a) Explain how they could use a
random number generator to
generate the sample
They could write all the
members’ names on hats, and
b) Explain how they could use a then draw out 12 members to
lottery system to generate the
sample
make up the sample
1B
Data Collection Total workers
= 300
You need to know about simple, Age Quantity
systematic and stratified sampling
18-32 75 20
33-47 140
A factory manager wants to find
out what his workers think of the 48-62 85
canteen facilities. He decides to
give a questionnaire to a sample of
80 workers. It is believed that As a fraction, workers are from 18-32
different age groups will have
different opinions. We need the same fraction, but of 80
workers to be selected…
The table to the right shows the
number of workers in each age 75
× 80
bracket. 300
¿ 20
a) What sampling method should
be used? Stratified Sampling
b) How many workers should be
selected from each age
bracket?
1B
Data Collection Total workers
= 300
You need to know about simple, Age Quantity
systematic and stratified sampling
18-32 75 20
A factory manager wants to find
33-47 140 37
out what his workers think of the 48-62 85 23
canteen facilities. He decides to
give a questionnaire to a sample of 140 85
80 workers. It is believed that × 80 × 80
different age groups will have 300 300
different opinions.
¿ 37.3 ¿ 22.7
The table to the right shows the
¿ 37 ¿ 23
number of workers in each age
bracket.
Advantages Disadvantages
Free of bias
Simple random Easy and cheap to implement
Not suitable for a large population or
sample size
sampling Every unit has an equal chance of
A sampling frame is needed
selection
1B
Teachings for Exercise 1C
Data Collection
You need to know about non-random
sampling
1C
Data Collection
You need to know about non-random
sampling
Advantages Disadvantages
1C
Teachings for Exercise 1D
Data Collection
There are various different types of
data which can be used in statistics
1D
Data Collection
There are various different types of
data which can be used in statistics
Class Boundaries
These are the maximum
Length of wing Number of and minimum values that
(mm) butterflies, f belong in a group
30-31 2
32-33 25
Midpoint
This is the mean of the
34-36 30 class boundaries
37-39 13
Class width
This is the difference
between the upper and
lower class boundaries
1D
Data Collection
There are various different types of
data which can be used in statistics Is the length Qualitative or Quantitative?
Quantitative
Midpoint = 35mm
1E
Data Collection
Daily mean wind
You need to be able to answer exam Daily mean cloud
questions based on large amounts of direction and
windspeed cover
real data that you will be given
This is measured in Measured in ‘oktas’, or
knots according to the eighths of the sky
The different sets of data recorded beaufort scale (more on covered by cloud
are as follows: the next slide)
1E
Data Collection
You need to be able to answer exam
questions based on large amounts of
real data that you will be given Camborne
1E
Data Collection
Hurn
You need to be able to answer exam
questions based on large amounts of
real data that you will be given
1E
Data Collection
Hurn
You need to be able to answer exam
questions based on large amounts of
real data that you will be given
1E
Data Collection
Hurn
You need to be able to answer exam
questions based on large amounts of
real data that you will be given
1E
Data Collection
Hurn
You need to be able to answer exam
questions based on large amounts of
real data that you will be given
c) State, with a reason, whether your Perth is in Australia, which is south of the UK, and
answer to b) supports this statement its median rainfall was higher. However, taking a
small sample from a single location is each country
means there is not enough data to support the
statement.
1E