Module 03 Inferential Statistics
Module 03 Inferential Statistics
Sampling
Gathering information about an entire population often costs too
much or is virtually impossible. Instead, we use a sample of the
population. A sample should have the same characteristics as the
population it is representing. Most statisticians use various methods of
random sampling in an attempt to achieve this goal. This section will
describe a few of the most common methods. There are several
different methods of random sampling. In each form of random
sampling, each member of a population initially has an equal chance of
being selected for the sample. Each method has pros and cons. The
easiest method to describe is called a simple random sample. Any
group of n individuals is equally likely to be chosen as any other group
of n individuals if the simple random sampling technique is used. In
other words, each sample of the same size has an equal chance of
being selected.
Besides simple random sampling, there are other forms of
sampling that involve a chance process for getting the sample. Other
well-known random sampling methods are the stratified sample, the
cluster sample, and the systematic sample.
Problem 1
A study is done to determine the average tuition that San Jose State
undergraduate students pay per semester. Each student in the
following samples is asked how much tuition they paid for the Fall
semester. What is the type of sampling in each case?
Problem 2
First, we use convenience sampling and survey ten students from a first
term organic chemistry class. Many of these students are taking first
term calculus in addition to the organic chemistry class. The amount of
money they spend on books is as follows:
$128; $87; $173; $116; $130; $204; $147; $189; $93; $153
The second sample is taken using a list of senior citizens who take P.E.
classes and taking every fifth senior citizen on the list, for a total of ten
senior citizens. They spend:
$50; $40; $36; $15; $50; $100; $40; $53; $22; $22
$180; $50; $150; $85; $260; $75; $180; $200; $200; $150
Problem
Be aware that as you take data, your data may vary somewhat
from the data someone else is taking for the same purpose. This is
completely natural. However, if two or more of you are taking the same
data and get very different results, it is time for you and the others to
reevaluate your data-taking methods and your accuracy.
Variation in Samples
It was mentioned previously that two or more samples from the
same population, taken randomly, and having close to the same
characteristics of the population will likely be different from each other.
Suppose Doreen and Jung both decide to study the average amount of
time students at their college sleep each night. Doreen and Jung each
take samples of 500 students. Doreen uses systematic sampling and
Jung uses cluster sampling. Doreen's sample will be different from
Jung's sample. Even if Doreen and Jung used the same sampling
method, in all likelihood their samples would be different. Neither
would be wrong, however.
Think about what contributes to making Doreen’s and Jung’s
samples different.
If Doreen and Jung took larger samples (i.e. the number of data
values is increased), their sample results (the average amount of time a
student sleeps) might be closer to the actual population average. But
still, their samples would be, in all likelihood, different from each other.
This variability in samples cannot be stressed enough.
Size of a Sample
Be aware that many large samples are biased. For example, call-in
surveys are invariably biased, because people choose to respond or not.