Sampling and Data Collection
Sampling and Data Collection
UC:
Plano
de
Dissertação/Plano
de
Projecto/Plano
de
Estágio
Tema
2:
Amostragem
e
Recolha
de
Informação
Theme
2:
Sampling
and
Data
Collec=on
2016/2017
Maria
Eduarda
Silva:
[email protected]
Pedro
Campos:
[email protected]
1
Overview
• Data
Collec=on:
Introduc=on
• Sampling
– Probabilis=c
– Non
probabilis=c
• Sample
survey/ques=onnaires
– Sources
of
informa=on
– How
to
make
a
ques=onnaire
2
Research
Process
(Where
are
we
now?)
Mo=va=on
Formula=on
and
problem
clarifica=on
Literature
Review
Approach
and
strategy
Data
Collec@on,
Sampling,
Surveys,
Ques@onnaires
Data
Analysis
(quan=ta=ve
and
Qualita=ve)
Final
Report
3
Research
Process
• PROBLEM
STATEMENT,
PURPOSES,
BENEFITS
• THEORY,
ASSUMPTIONS,
BACKGROUND
LITERATURE
• VARIABLES
AND
HYPOTHESES
•
DATA
COLLECTION
•
DATA
ANALYSIS
• FINAL
REPORT
and
CONCLUSIONS
4
Inference
– How
will
I
get
the
data
I
need
to
test
my
hypothesis?
– What
tools
or
devices
will
I
use
to
make
or
record
observa=ons?
6
Data
Collec@on
(examples)
• Census
• Survey
• Observa=onal
study:
recording
well-‐defined
events
• Retrospec=ve
studies:
obtaining
relevant
data
from
management
informa=on
systems.
• Prospec=ve
studies/Experiments/Clinical
trials
7
Data
Collec@on
Can
you
observe
(sample)
all
the
popula=on?
Usually
not!
– Impossible
– Imprac=cal
8
Data
collec@on
• Observe
the
whole
popula=on:
CENSUS
Too many Resources (time, money) and too
much workload!
Na=onal
Census
every
10
years
(in
many
countries)!
9
Portuguese Census
• 22
000
people
involved
• 1200
training
courses
• 17000
training
hours
• More
than
10
million
People
• 3
500
000
families
• 4
500
000
households
• 2
900
000
buildings
Sampling
• A sample is “a smaller (but hopefully
representative) collection of units from a
population used to determine truths about that
population” (Field, 2005)
• Why to sample?
– Resources (time, money) and workload
– Gives results with known accuracy that can be
calculated mathematically
11
Sampling
Steps
of
sampling
design
p
Define
the
target
popula=on
p
Iden=fy
the
sampling
frame
p
Define
sampling
unit
p
Choose
a
sampling
technique
p
Define
sample
dimension
p
Select
sample
elements
p
Collect
data
p Review
sampling
process
12
Sampling:
Define
the
target
popula=on
13
Sampling:
Iden=fy
the
sampling
frame
14
Sampling
Frame
• Ideally,
the
sampling
frame
perfectly
coincides
with
the
target
popula=on.
•
Some=mes
the
frame
will
be
larger
or
smaller,
depending
on
prac=cal
ways
of
gelng
in
touch
with
each
member
of
the
sample.
• The
sampling
frame
must
be
representa=ve
of
the
popula=on
• Selec=on/coverage
bias
15
Sampling
Overlape
Target
between
the
Popula=on
two
pops
Sampling
Frame
Sample
16
Sampling
• Sampling
unit-‐
the
elements
of
the
sampling
frame
that
are
chosen
for
the
sample
using
a
sampling
procedure.
Examples:
– Persons
– Households
– Companies
• Sample:
set
of
sampling
units
–
you
may
or
not
observe
all
of
them
17
Sampling
techniques
• Probability
Sampling
Scheme
in
which
every
unit
in
the
popula=on
has
a
known
probability
of
being
selected.
When
every
element
of
the
popula=on
does
have
the
same
probability
of
being
selected
is
known
as
a
equal
probability
selec0on,
EPS
design
• Non
Probability
Sampling
Scheme
in
which
some
elements
have
no
chance
of
selec=on
18
Sampling
techniques
19
Simple
Random
Sampling
In
a
simple
random
sample,
every
group
of
individuals
in
the
popula=on
has
an
equal
chance
of
becoming
the
sample.
A
simple
random
sample
is
the
equivalent
of
pulng
everyone's
name
from
the
popula=on
into
a
hat
and
drawing
your
sample
from
that.
20
Simple
Random
Sampling
• Applicable
when
popula=on
is
small,
homogeneous
&
readily
available
• All
subsets
of
the
frame
are
given
an
equal
probability.
Each
element
of
the
frame
thus
has
an
equal
probability
of
selec=on.
• It
provides
for
greatest
number
of
possible
samples.
This
is
done
by
assigning
a
number
to
each
unit
in
the
sampling
frame.
• A
table
of
random
number
or
loaery
system
is
used
to
determine
which
units
are
to
be
selected
21
Simple
Random
Sampling
• Es=mates
are
easy
to
calculate.
Disadvantages
• If
sampling
frame
large,
this
method
imprac=cable.
• Minority
subgroups
of
interest
in
popula=on
may
not
be
present
in
sample
in
sufficient
numbers
for
study.
22
Systema=c
Sampling
• Arrange
the
target
popula=on
according
to
some
ordering
scheme
and
then
selec=ng
elements
at
regular
intervals
through
that
ordered
list.
• Random
start
followed
with
the
selec=on
of
every
kth
element,
k=(popula=on
size/sample
size).
23
Systema=c
Sampling
• Star=ng
randomly
chosen
from
within
the
first
to
the
kth
element
in
the
list.
• Example:
select
every
10th
name
from
the
telephone
directory.
• It
is
not
'simple
random
sampling'
because
different
subsets
of
the
same
size
have
different
selec=on
probabili=es
-‐
e.g.
the
set
{4,6,8,…,998}
has
a
one-‐in-‐
two
probability
of
selec=on,
but
the
set
{4,5,7,…998}
has
zero
probability
of
selec=on.
24
Systema=c
Sampling
• ADVANTAGES:
– Sample
easy
to
select
in
lists
and
datasets
– Suitable
sampling
frame
can
be
iden=fied
easily
– Sample
evenly
spread
over
en=re
reference
popula=on
• DISADVANTAGES:
– Sample
may
be
biased
if
hidden
periodicity
in
popula=on
coincides
with
that
of
selec=on.
– Difficult
to
assess
precision
of
es=mate
from
one
survey.
25
Stra=fied
Sampling
• par==on
the
popula=on
into
groups
(strata)
• obtain
a
simple
random
sample
from
each
group
(stratum)
• collect
data
on
each
sampling
unit
that
was
randomly
sampled
from
each
group
(stratum)
• Stra@fied
sampling
works
best
when
a
heterogeneous
popula=on
is
split
into
fairly
homogeneous
groups.
26
Draw
a
sample
from
each
stratum
27
Cluster
Sampling
• divide
the
popula=on
into
groups
(clusters)
(geographically)
• obtain
a
simple
random
sample
from
all
possible
clusters.
• obtain
data
on
every
sampling
unit
in
each
of
the
randomly
selected
clusters.
NOTE:
• Clusters
are
microcosms,
rather
than
subsec=ons,
of
the
popula=on.
28
Mul=stage
Sampling
The
above
methods
are
combined
in
different
stages.
For
example,
the
Labour
Force
Survey
is
a
mix
of
Simple
and
Stra=fied
Sampling
in
different
stages.
Stage
1:
stra=fied
sample
(Regions
NUTS
2)
Stage
2:
simple
random
sample
(Households
in
small
territorial
units)
29
Sampling
techniques
31
Convenience
Sampling
• May
prove
to
be
effec=ve
during
• During
explora=on
stage
of
the
research
area
• When
conduc=ng
pilot
data
collec=on
in
order
to
iden=fy
and
address
shortcomings
associated
with
ques=onnaire
design.
32
Snowball
Sampling
• Snowball
sampling
is
where
research
par=cipants
recruit
other
par=cipants
for
a
test
or
study.
It
is
used
where
poten=al
par=cipants
are
hard
to
find.
33
Snowball
Sampling
• Advantages:
– It
allows
for
studies
to
take
place
where
otherwise
it
might
be
impossible
to
conduct
because
of
a
lack
of
par=cipants.
– Snowball
sampling
may
help
you
discover
characteris=cs
about
a
popula=on
that
you
weren’t
aware
existed.
For
example,
the
casual
illegal
downloader
vs.
the
for-‐profit
downloader.
• Disadvantages:
– It
us
usually
impossible
to
determine
the
sampling
error
or
make
inferences
about
popula=ons
based
on
the
obtained
sample.
34
Quota
Sampling
• A
sampling
method
of
gathering
representa=ve
data
from
a
group-‐
ensures
that
sample
group
represents
certain
characteris=cs
of
the
popula=on
chosen
by
the
researcher.
• Quota
sampling:
– controlled
-‐
restric=ons
in
order
to
limit
researcher’s
choice
of
samples
– uncontrolled-‐
resembles
convenience
sampling
35
Prac=ce
1
(Sampling)
Find
the
alterna=ve
solu=ons
to
select
data
in
the
following
situa=ons
(for
each
one
define
target
popula0on,
sampling
frame,
sampling
unit
and
sampling
method):
1. The
Sunday
Times
is
trying
to
define
the
archetypal
reader
of
the
Monday
supplement
(about
music
and
culture);
2. The
WRS.com
company
(food
retail
business)
needs
to
perform
a
market
segmenta=on
of
their
clients,
based
on
their
loyalty
card.
36
Determinants
for
sample
size
38
Sources
of
Informa@on
Sources
of
informa=on
Register-‐based Survey-‐based
39
Data
collec@on
Sources
of
Informa@on
Sources
of
informa=on
Register-‐based Survey-‐based
40
What
is
a
ques@onnaire?
41
How
to
make
a
ques@onnaire?
42
How
to
make
a
ques@onnaire?
Rules
43
How
to
make
a
ques@onnaire?
Rules
44
How
to
make
a
ques@onnaire?
Rules
45
How
to
make
a
ques@onnaire?
Rules
46
How
to
make
a
ques@onnaire?
Rules
47
How
to
make
a
ques@onnaire?
Rules
48
Pretes@ng
a
Ques@onnaire
49
Pretes@ng
a
Ques@onnaire
51
Assessing
a
Ques@onnaire
• Step
1-‐Background
Purpose,
objec=ves,
research
ques=ons,
and
hypothesis
of
the
proposed
research
are
examined.
Define
Target
Popula=on
and
sampling
frame
53
Assessing
a
Ques@onnaire
54
Assessing
a
Ques@onnaire
55
Assessing
a
Ques@onnaire
56
Assessing
a
Ques@onnaire
57
Reliability
and
Validity
Source:
hap://www.socialresearchmethods.net/kb/
relandval.php
58
Measurement
Scales
5
0
1
7
(neither
good
nor
(poor
quality)
(bad
quality)
(good
quality)
bad)
62
Measurement
Scales
63
Measurement
Scales
• Likert
Scale
Each statement is assigned a numerical score ranging from 1 to
5. It can also be scaled as -2 to +2.
-‐2 -‐1 0 1 2
1
5
4
3
2
Strongly
Strongly
agree
Agree
Indifferent
Disagree
disagree
64
Measurement
Scales
1
7
Unpleasant
2
3
4
5
6
Pleasant
Submissive
Dominant
65
Prac@ce
2
(ques@onnaires)
66
New
Ways
of
collec@ng
data
The
“tradi@onals”
PAPI
(Paper
Assisted
Personal
Interview)
CAPI
(Computer
Assisted
Personal
Interview)
CATI
(Computer
Assisted
Telephone
Interviewing)
The
new
ones
CAWI
(Computer
Assisted
Web
Interviewing)
Social
Media
Website
Ac@vity
Tracking
customers
in
stores
67
New
Ways
of
collec@ng
data
Retrieved from :
http://www.greenbookblog.org/2014/09/29/the-top-20-
emerging-methods-in-market-research-a-grit-sneak-peek/
68
References
• Andrew
Hill,
Maria
Manuela
Magalhães
Hill
(2008),
Inves=gação
por
Ques=onário
,
2ª
Ed,
Edições
Sílabo.
• Elizabeth
Reis,
Raúl
Moreira
(1993),
Pesquisa
de
Mercados,
Sílabo
Gestão,
1.ª
Edição.
• Louis
Cohen,
Lawrence
Manion
and
Keith
Morrison,
(2005),
Research
Methods
in
Educa=on
,
Fiwh
edi=on
•
Norland-‐Tilburg,
E.
V.
(1990).
Controlling
error
in
evalua=on
instruments.
Journal
of
Extension,
[On-‐line],
28(2).
Available
at
h>p://www.joe.org/joe/1990summer/>2.html
• William
M.K.
Trochim,,
(2006)
.
The
Research
Methods
Knowledge
Base,
2nd
Edi=on.
Internet
WWW
page,
at
URL:
<
hap://www.socialresearchmethods.net/kb/>
(version
current
as
of
October
20,
2006).
69