0% found this document useful (0 votes)

53 views20 pages

Data Collection and Sampling

Uploaded by

imtiazquazi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views20 pages

Data Collection and Sampling

Uploaded by

imtiazquazi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 20

Chapter Five

Data Collection and Sampling

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1

Recall…
Statistics is a tool for converting data into information:
Statistics

Data Information

But where then does data come from? How is it gathered?

How do we ensure its accurate? Is the data reliable? Is it
representative of the population from which it was drawn?
This chapter explores some of these issues.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.2

Methods of Collecting Data…
There are many methods used to collect or obtain data for
statistical analysis. Three of the most popular methods are:
• Direct Observation
• Experiments, and
• Surveys.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.3

Surveys…
A survey solicits information from people; e.g. Gallup polls;
pre-election polls; marketing surveys.

The Response Rate (i.e. the proportion of all people selected

who complete the survey) is a key survey parameter.

Surveys may be administered in a variety of ways, e.g.

•Personal Interview,
•Telephone Interview, and
•Self Administered Questionnaire.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.4

Questionnaire Design…
Over the years, a lot of thought has been put into the science
of the design of survey questions. Key design principles:
1. Keep the questionnaire as short as possible.
2. Ask short, simple, and clearly worded questions.
3. Start with demographic questions to help respondents get
started comfortably.
4. Use dichotomous (yes|no) and multiple choice questions.
5. Use open-ended questions cautiously.
6. Avoid using leading-questions.
7. Pretest a questionnaire on a small number of people.
8. Think about the way you intend to use the collected data
when preparing the questionnaire.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.5
Sampling…
Recall that statistical inference permits us to draw
conclusions about a population based on a sample.

Sampling (i.e. selecting a sub-set of a whole population) is

often done for reasons of cost (it’s less expensive to sample
1,000 television viewers than 100 million TV viewers) and
practicality (e.g. performing a crash test on every
automobile produced is impractical).

In any case, the sampled population and the target

population should be similar to one another.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.6

Sampling Plans…
A sampling plan is just a method or procedure for
specifying how a sample will be taken from a population.

We will focus our attention on these three methods:

•Simple Random Sampling,

•Stratified Random Sampling, and
•Cluster Sampling.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.7

Simple Random Sampling…
A simple random sample is a sample selected in such a way
that every possible sample of the same size is equally likely
to be chosen.

Drawing three names from a hat containing all the names of

the students in the class is an example of a simple random
sample: any group of three names is as equally likely as
picking any other group of three names.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.8

Simple Random Sampling…
Example 5.1: A government income tax auditor must choose
a sample of 40 of 1,000 returns to audit…

Extra #’s may be used if duplicate random numbers are generated

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.9

Stratified Random Sampling…
A stratified random sample is obtained by separating the
population into mutually exclusive sets, or strata, and then
drawing simple random samples from each stratum.

Strata 1 : Gender Strata 2 : Age Strata 3 : Occupation

Male < 20 professional
Female 20-30 clerical
31-40 blue collar
41-50 other
51-60
> 60
We can acquire about the total population,
make inferences within a stratum
or make comparisons across strata

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.10

Stratified Random Sampling…
After the population has been stratified, we can use simple
random sampling to generate the complete sample:

If we only have sufficient resources to sample 400 people total,

we would draw 100 of them from the low income group…

…if we are sampling 1000 people, we’d draw

50 of them from the high income group.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.11

Cluster Sampling…
A cluster sample is a simple random sample of groups or
clusters of elements (vs. a simple random sample of
individual objects).

This method is useful when it is difficult or costly to develop

a complete list of the population members or when the
population elements are widely dispersed geographically.

Cluster sampling may increase sampling error due to

similarities among cluster members.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.12

Sample Size…
Numerical techniques for determining sample sizes will be
described later, but suffice it to say that the larger the sample
size is, the more accurate we can expect the sample estimates
to be.

Sampling and Non-Sampling Errors…
Two major types of error can arise when a sample of
observations is taken from a population:
sampling error and nonsampling error.

Sampling error refers to differences between the sample and

the population that exist only because of the observations
that happened to be selected for the sample.

Nonsampling errors are more serious and are due to

mistakes made in the acquisition of data or due to the sample
observations being selected improperly.

Sampling Error…
Sampling error refers to differences between the sample and
the population that exist only because of the observations
that happened to be selected for the sample.

Another way to look at this is: the differences in results for

different samples (of the same size) is due to sampling error:

E.g. Two samples of size 10 of 1,000 households. If we

happened to get the highest income level data points in our
first sample and all the lowest income levels in the second,
this delta is due to sampling error.

Sampling Error…
Sampling error refers to differences between the sample and
the population that exist only because of the observations
that happened to be selected for the sample.

Increasing the sample size will reduce this type of error.

Nonsampling Error…
Nonsampling errors are more serious and are due to
mistakes made in the acquisition of data or due to the sample
observations being selected improperly. Three types of
nonsampling errors:

Errors in data acquisition,

Nonresponse errors, and
Selection bias.

Note: increasing the sample size will not reduce this type of
error.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.17
Errors in data acquisition…
…arises from the recording of incorrect responses, due to:

— incorrect measurements being taken because of faulty equipment,

— mistakes made during transcription from primary sources,
— inaccurate recording of data due to misinterpretation of terms, or
— inaccurate responses to questions concerning sensitive issues.

Nonresponse Error…
…refers to error (or bias) introduced when responses are not
obtained from some members of the sample, i.e. the sample
observations that are collected may not be representative of
the target population.

As mentioned earlier, the Response Rate (i.e. the proportion

of all people selected who complete the survey) is a key
survey parameter and helps in the understanding in the
validity of the survey and sources of nonresponse error.

Selection Bias…
…occurs when the sampling plan is such that some members
of the target population cannot possibly be selected for
inclusion in the sample.

[Ebooks PDF] download Statistics Using Stata An Integrative Approach Sharon Lawner Weinberg full chapters
100% (1)
[Ebooks PDF] download Statistics Using Stata An Integrative Approach Sharon Lawner Weinberg full chapters
55 pages
Methods of Data Collection
No ratings yet
Methods of Data Collection
19 pages
Chapter Two: Data Collection and Sampling
No ratings yet
Chapter Two: Data Collection and Sampling
21 pages
Chapter 05
No ratings yet
Chapter 05
18 pages
Data Collection and Sampling
No ratings yet
Data Collection and Sampling
19 pages
Chapter 5
No ratings yet
Chapter 5
22 pages
Chapter 05
No ratings yet
Chapter 05
18 pages
Data Collection and Sampling
No ratings yet
Data Collection and Sampling
16 pages
Lecture - 5 - Start
No ratings yet
Lecture - 5 - Start
167 pages
Sampling Theory And: Its Various Types
No ratings yet
Sampling Theory And: Its Various Types
46 pages
Chapter 7 BRM
No ratings yet
Chapter 7 BRM
51 pages
4 - Sampling and Sample Size - SFB
No ratings yet
4 - Sampling and Sample Size - SFB
52 pages
Statistics For Management and Economics 11th Edition Ch.5
No ratings yet
Statistics For Management and Economics 11th Edition Ch.5
19 pages
Chapter 1
No ratings yet
Chapter 1
26 pages
Module 9
No ratings yet
Module 9
13 pages
Sampling Theory Ppt 1 1
No ratings yet
Sampling Theory Ppt 1 1
41 pages
Final Sampling
No ratings yet
Final Sampling
5 pages
SAMPLING
No ratings yet
SAMPLING
5 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
18 pages
Sampling Technique Full-1
No ratings yet
Sampling Technique Full-1
22 pages
C1 STS
No ratings yet
C1 STS
3 pages
Sampling 1
50% (2)
Sampling 1
31 pages
Economics CH 3 Economics Census Methods Notes
No ratings yet
Economics CH 3 Economics Census Methods Notes
40 pages
Chapter 4 Data Collection and Sampling Method
No ratings yet
Chapter 4 Data Collection and Sampling Method
24 pages
Levels of Measurement: Study
No ratings yet
Levels of Measurement: Study
13 pages
Population and Sample
No ratings yet
Population and Sample
2 pages
Lecture 3 - Types of Data, Data Collection and Sampling: Notes
No ratings yet
Lecture 3 - Types of Data, Data Collection and Sampling: Notes
5 pages
Unit IV
No ratings yet
Unit IV
27 pages
Chapter 4 Data Collection and Sampling Method
No ratings yet
Chapter 4 Data Collection and Sampling Method
25 pages
3 Sampling and Data Gathering Techniques
No ratings yet
3 Sampling and Data Gathering Techniques
38 pages
7 - MMS RM
No ratings yet
7 - MMS RM
93 pages
Sample - Is The Subset of The Entire Population
No ratings yet
Sample - Is The Subset of The Entire Population
6 pages
3T2324 Module 2 - 3
No ratings yet
3T2324 Module 2 - 3
43 pages
Lecture PPT Unit 3- Sampling Design Data Collection (1)
No ratings yet
Lecture PPT Unit 3- Sampling Design Data Collection (1)
47 pages
Lecture 5 Statistics
0% (1)
Lecture 5 Statistics
52 pages
MRM Mod 3
No ratings yet
MRM Mod 3
121 pages
Module On Data MGT
No ratings yet
Module On Data MGT
32 pages
Module 1 - Section 3 - Data Collection
No ratings yet
Module 1 - Section 3 - Data Collection
11 pages
Sampling (Method)
No ratings yet
Sampling (Method)
31 pages
Sts 212 Scientific Data 2024
No ratings yet
Sts 212 Scientific Data 2024
66 pages
unit 2 Probability theory
No ratings yet
unit 2 Probability theory
7 pages
Data Science Q&A - Latest Ed (2020) - 2 - 2
No ratings yet
Data Science Q&A - Latest Ed (2020) - 2 - 2
2 pages
Probability and Statistics Lesson 1 2
No ratings yet
Probability and Statistics Lesson 1 2
47 pages
Inferential Statistics
No ratings yet
Inferential Statistics
169 pages
Chapter 2 Sampling Technques
No ratings yet
Chapter 2 Sampling Technques
6 pages
Research Methods and Sampling Design: (Document Subtitle)
No ratings yet
Research Methods and Sampling Design: (Document Subtitle)
11 pages
Sampling and Simulation Modi
No ratings yet
Sampling and Simulation Modi
48 pages
Sampling
100% (2)
Sampling
24 pages
Statistics in Public Administration: Introduction and Data Collection
100% (1)
Statistics in Public Administration: Introduction and Data Collection
44 pages
PME Lec1. Sampling 13dec
No ratings yet
PME Lec1. Sampling 13dec
48 pages
Population Sample: Sampling and Methods Sampling Method Refers To The Way That Observations Are Selected From A
No ratings yet
Population Sample: Sampling and Methods Sampling Method Refers To The Way That Observations Are Selected From A
23 pages
Copy of Data-Management
No ratings yet
Copy of Data-Management
101 pages
Lecture 2
No ratings yet
Lecture 2
65 pages
Day 4 Data Collection Methods-1
No ratings yet
Day 4 Data Collection Methods-1
25 pages
Business Research Method: Unit 4
No ratings yet
Business Research Method: Unit 4
17 pages
Chapter One: What Is Statistics?
No ratings yet
Chapter One: What Is Statistics?
17 pages
Unit 4 BRM
No ratings yet
Unit 4 BRM
17 pages
Lecture 1 (Sampling)
No ratings yet
Lecture 1 (Sampling)
6 pages
Sampling - MBA - B - Section
No ratings yet
Sampling - MBA - B - Section
46 pages
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
The 3rd WICE Full Paper Template
No ratings yet
The 3rd WICE Full Paper Template
19 pages
Two-Sample Tests of Hypothesis: Mcgraw Hill/Irwin
No ratings yet
Two-Sample Tests of Hypothesis: Mcgraw Hill/Irwin
14 pages
Module 5 Experimental Designs and Significance Testing PDF
No ratings yet
Module 5 Experimental Designs and Significance Testing PDF
28 pages
Solutions Solutions: Simple Comparative Experiments Simple Comparative Experiments
No ratings yet
Solutions Solutions: Simple Comparative Experiments Simple Comparative Experiments
30 pages
STAT 3008 Outline
No ratings yet
STAT 3008 Outline
4 pages
Chi-Squar Test - Shahida Jahfar Rashka
No ratings yet
Chi-Squar Test - Shahida Jahfar Rashka
20 pages
Cross-Validation, Regularization, and Principal Components Analysis (PCA)
No ratings yet
Cross-Validation, Regularization, and Principal Components Analysis (PCA)
47 pages
Experiments With One Factor (Ch.3. Analysis of Variance - Anova)
No ratings yet
Experiments With One Factor (Ch.3. Analysis of Variance - Anova)
34 pages
Estimator Properties
No ratings yet
Estimator Properties
17 pages
Non-Probability Sampling
No ratings yet
Non-Probability Sampling
12 pages
MODULE 4 - Variability
No ratings yet
MODULE 4 - Variability
17 pages
Latih Tubi Mat Tam t5 Set 6
No ratings yet
Latih Tubi Mat Tam t5 Set 6
3 pages
Training - Data Science
No ratings yet
Training - Data Science
38 pages
Les8e PPT Study 07 02
No ratings yet
Les8e PPT Study 07 02
46 pages
Answer: The Mean Daily Wage Is Rs.145.2
No ratings yet
Answer: The Mean Daily Wage Is Rs.145.2
7 pages
Pengantar Statistik Industri: Debrina Puspita Andriani E-Mail
No ratings yet
Pengantar Statistik Industri: Debrina Puspita Andriani E-Mail
40 pages
StatProb Lesson 4
No ratings yet
StatProb Lesson 4
47 pages
Jerome en
No ratings yet
Jerome en
10 pages
Data Preprocessing: L1+ Freq
No ratings yet
Data Preprocessing: L1+ Freq
13 pages
Data Mining: Exploring Data Data Mining: Exploring Data: Lecture Notes For Chapter 3 Lecture Notes For Chapter 3
No ratings yet
Data Mining: Exploring Data Data Mining: Exploring Data: Lecture Notes For Chapter 3 Lecture Notes For Chapter 3
34 pages
DOST AI - Coding Exercises v0.1
No ratings yet
DOST AI - Coding Exercises v0.1
12 pages
Chapter 10 Guided Notebook
No ratings yet
Chapter 10 Guided Notebook
21 pages
LESSON 9 Sampling Distribution Activity 13
No ratings yet
LESSON 9 Sampling Distribution Activity 13
17 pages
Rajni Sh
No ratings yet
Rajni Sh
6 pages
Statistical Methods For The Social And Behavioural Sciences A Modelbased Approach First Edition David B Flora instant download
No ratings yet
Statistical Methods For The Social And Behavioural Sciences A Modelbased Approach First Edition David B Flora instant download
80 pages
Example Problems - Econometrics
No ratings yet
Example Problems - Econometrics
8 pages
Distribution Tables
No ratings yet
Distribution Tables
6 pages
Problems With Econometric Models Heteros
No ratings yet
Problems With Econometric Models Heteros
10 pages
A Hierarchical Bayesian Analysis of Hors
No ratings yet
A Hierarchical Bayesian Analysis of Hors
13 pages

Data Collection and Sampling

Uploaded by

Data Collection and Sampling

Uploaded by

Chapter Five

Data Collection and Sampling

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1

But where then does data come from? How is it gathered?

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.2

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.3

The Response Rate (i.e. the proportion of all people selected

Surveys may be administered in a variety of ways, e.g.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.4

Sampling (i.e. selecting a sub-set of a whole population) is

In any case, the sampled population and the target

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.6

We will focus our attention on these three methods:

•Simple Random Sampling,

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.7

Drawing three names from a hat containing all the names of

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.8

Extra #’s may be used if duplicate random numbers are generated

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.9

Strata 1 : Gender Strata 2 : Age Strata 3 : Occupation

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.10

If we only have sufficient resources to sample 400 people total,

…if we are sampling 1000 people, we’d draw

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.11

This method is useful when it is difficult or costly to develop

Cluster sampling may increase sampling error due to

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.12

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.13

Sampling error refers to differences between the sample and

Nonsampling errors are more serious and are due to

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.14

Another way to look at this is: the differences in results for

E.g. Two samples of size 10 of 1,000 households. If we

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.15

Increasing the sample size will reduce this type of error.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.16

Errors in data acquisition,

— incorrect measurements being taken because of faulty equipment,

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.18

As mentioned earlier, the Response Rate (i.e. the proportion

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.19

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 5.20

You might also like