0% found this document useful (0 votes)

73 views

Creative Commons Attribution-Noncommercial-Sharealike License

Uploaded by

ahmed22gouda22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

73 views

Creative Commons Attribution-Noncommercial-Sharealike License

Uploaded by

ahmed22gouda22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 50

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License.

Your use of this material constitutes acceptance of that license and the conditions of use of materials on this site.

Copyright 2009, The Johns Hopkins University and Saifuddin Ahmed. All rights reserved. Use of these materials permitted only in accordance with license rights granted. Materials provided AS IS; no representations or warranties provided. User assumes all responsibility for use, and all liability related thereto, and must independently review all materials for accuracy and efficacy. May contain materials owned by others. User is responsible for obtaining permissions for use from third parties as needed.

Methods in Sample Surveys Simple Random Sampling Systematic Sampling Lecture 2

Saifuddin Ahmed, MBBS, PhD Biostatistics Department School of Hygiene and Public Health Johns Hopkins University

Sampling Procedure
Sampling involves two tasks:
How to select the elements? How to estimate the population characteristics from the sampling units?

We employ some randomization process for sample selection so that there is no preferential treatment in selection which may introduce selectivity bias.

Simple Random Sampling (SRS)

Simplest sample design Each element has an equal probability of being selected from a list of all population units (sample of n from N population). SRS are EPSEM samples
Equal Probability of SElection Method Equal Probability of Selection [of Element] Method

Two type of SRS

SRS with replacement (SRSWR) SRS without replacement (SRSWOR)

SRS with replacement (SRSWR)

One unit of element is randomly selected from population is the first sampled unit Then the sampled unit is replaced in the population The second sample is drawn with equal probability The procedure is repeated until the requisite sample units n are drawn The probability of selection of an element remains unchanged after each draw The same units could be selected more than once

SRS without replacement (SRSWOR)

Unlike SRSWR, once an element is selected as a sample unit, will not be replaced in the population pool The selected sample units are distinct

Obviously, in practice there is no reason of interviewing the same individual twice! [Except in reliability or validation studies] Why study simple random sampling with replacement (SRSWR)?

Mathematical properties
Number of possible samples (of N elements): SRSWR: SRSWOR: Nn
N N Cn = n

SRSWR:

Example: 2 elements from 4 (ABCD) (How many ways we can draw 2 elements from a population of size 4?) With SRSWR: 42 = 16 AA, AB, AC, AD, BA, BB, BC, BD, CA, CB, CC, CD, DA, DB, DC, DD = 16 samples

SRSWOR:
Mathematically, AA, AB, AC, AD, BA, BB, BC, BD, CA, CB, CC, CD, DA, DB, DC, DD

N N Cn = n

N! 4! = =6 ( N n)! n! 2!2!

SRSWOR:
Mathematically, AB, AC, AD, BA, BC, BD, CA, CB, CD, DA, DB, DC,

N N Cn = n

N! 4! = =6 ( N n)! n! 2!2!

SRSWOR:
Mathematically, AB, AC, AD, BA, BC, BD, CA, CB, CD, DA, DB, DC,

N N Cn = n

N! 4! = =6 ( N n)! n! 2!2!

SRSWOR:

N N Cn = n

N! 4! = =6 Mathematically, ( N n)! n! 2!2! AB, AC, AD BC, BD CD = 6 samples

Sample selection probability

all the samples are equally likely to be selected under SRS, then the probability of selecting each sample of size n is p(s):

SRSWR:

1/N

1 1 1 from, . ..... = 1 / N n N N N
n! ( N n)! 1 1 1 . ...... .n! = N N 1 N n +1 N!

SRSWOR:

N 1/ n

from,

What is the probability that the sample AB will be selected?

SRSWR = 1/16 SRSWOR = 1/6

Element selection probability in sample

(inclusion probability):
Let i is the inclusion probability for unit i, Then i = p(s) =
SRSWR: n/N From:
1 1 1 1 n for every n + + .... + = = N N N N N
N n 1 n 1 N 1 1 = + . + ....... + . N N N 1 N N n N

SRSWOR:

n/N

From:

What is the probability that element A will be selected?

AA, AB, AC, AD, BA, BB, BC, BD, CA, CB, CC, CD, DA, DB, DC, DD
SRSWR= 8/16=1/2

What is the probability that element A will be selected?

AA, AB, AC, AD, BA, BB, BC, BD, CA, CB, CC, CD, DA, DB, DC, DD AB, AC, AD BC, BD CD
SRSWR= 8/16=1/2

SRSWOR= 3/6=1/2 Element selection probability: n/N [In this example: two elements sampled from a population size of 4; each element has a probability=n/N=2/4=1/2 for selection]

So, inclusion probability is same under SRSWR and SRSWOR

f = n/N is called sampling fraction

In practice SRSWR is not attractive: we do not want to interview same individuals more than once. But in mathematical term it is simpler to relate the sample to population by SRSWR. SRSWOR provides two additional advantages:
elements are not repeated variance estimation is smaller than SRSWR with same sample size.

Variance of y under SRSWR:

S2 Var ( y ) = n

Variance of y under SRSWOR:

2 S N n 2 Var ( y ) = E ( y ) = n N 1

S2 N n = n N when N is very l arg e subtracting 1 may not have any effect

n N n The quantity =1 is called finite population correction, or fpc. N N

A comparison: Population parameter Var (X) =

Sample estimate by SRSWR

2 x Var ( x ) = n

Sample estimate by SRSWOR

2 x Var ( x ) = n

Nn N 1

SRS though attractive for its simplicity, the design is not usually used in the sample survey in practice for several reasons:
Lack of listing frame: the method requires that a list of population elements be available, which is not the case for many populations. Problem of small area estimation or domain analysis: For a small sample from a large population, all the areas may not have enough sample size for making small area estimation or for domain analysis by variables of interest. Not cost effective: SRS requires covering of whole population which may reside in a large geographic area; interviewing few samples spread sparsely over a large area would be very costly.

Why do we use SRS?

We use SRS as a scale, and use it to compare the efficiency of the other sample designs. We express this comparison by design effect (deff) .

Compare a comparison of SRSWOR to SRSWR

varwor ( x ) N n n deff wor ( x ) = = 1 ( if we assume N is l arg e ) varwr ( x ) N 1 N

The term (1-n/N) is called finite population correction and is the multiplying factor to convert SRSWR variance to SRSWOR variance.

This finite correction term is always less than 1 (except when n~0!), and suggests that deffsrswor is always less than 1. That is why SRSWOR is more efficient than SRSWR. Deff is extensively used in the design based analysis to examine the efficiency of the estimates.

Proportions from a Simple Random sampling

Let us examine a population characteristic of interest which is dichotomous, e.g., children immunized,
1 Yi ={ 0 if immunized if not immunized

The total number with response 1 (yes/positive attribute) in the population:

T = Yi
i =1

The proportion with response 1 (yes/positive attribute) in the population:

1 N T P = Yi = N i =1 N

The proportion of sample having the characteristics is:

1 n p = yi n i =1

The sample variance of proportion is:

s2 = npq where q =1 p n 1

The sample proportion p is an unbiased estimator of the population proportion P.

An unbiased estimator of Var(p) from simple random sampling:

pq with replacement n 1

var(p) = or

pq N n pq n = 1 without replacement n 1 N n 1 N Usually in practice, Var(p) is approximated as: var(p) =

var(p) pq n (1 ) n N

Implementation of SRS sampling:

1. Listing (sampling) Frame 2. Random number table (from published table or computer generated) 3. Selection of sample

Computer generated random numbers: (STATA output)

832645 708009 305761 536405 217862 844905 670523 334920 885588 458268 277649 977683

573158 285644 995036 504168 782003 296231 707073 023934 384435 058670 076177 759956

467460 727733 740619 750032 409660 103727 049209 808901 129958 888935 482951 553916

838921 343305 054728 367682 155199 053603 830572 740693 303040 064613 876389 983998

171721 539264 746425 626278 129514 562252 337034 170372 264636 661404 898190 331578

152885 907568 713746 855480 484511 219726 716264 095017 858065 411861 927367 981306

Automated random sample selection

all standard statistical software have commands for automated random sample selection. Example:
STATA: sample 10 {10% sample (fraction) selection randomly} sample 10, count

{draws a sample of 10 units}

SPSS: sample .10 Sample 10%

sample 150 FROM 1000 {exact no. of cases selected}

Systematic Sampling

systematic sampling, either by itself or in combination with some other method, may be the most widely used method of sampling.

Levy and Lemeshow, 1999

In simple random sampling we want that the samples should be distributed randomly.

In reality the random selection may be like this:

In systematic sampling we force to select samples evenly from the list (sampling frame):

First, let us consider that we are dividing the list evenly into some blocks.

In systematic sampling we may force to select samples evenly from the list (sampling frame):

First, let us consider that we are dividing the list evenly into some blocks.

Then, we select a sample element from each block.

In systematic sampling, only the first unit is selected at random, The rest being selected according to a predetermined pattern. to select a systematic sample of n units, the first unit is selected with a random start r from 1 to k sample, where k=N/n sample intervals, and after the selection of first sample, every kth unit is included where 1 r k.

An example: Let N=100, n=10, then k=100/10. Then the random start r is selected between 1 and 10 (say, r=7). So, the sample will be selected from the population with serial indexes of: 7, 17,27,.........,97 i.e., r, r+k, r+2k,......., r+(n-1)k

[What could be done if k=N/n is not an integer?

Selection of systematic sampling when sampling interval (k) is not an integer

Consider, n=175 and N=1000. So, k=1000/175 = 5.71 One of the solution is to make k rounded to an integer, i.e., k=5 or k=6. Now, if k=5, then n=1000/5=200; or, If k=6, then n=1000/6 = 166.67 ~ 167.

Which n should be chosen?

Solution if k=5 is considered, stop the selection of samples when n=175 achieved. if k=6 is considered, treat the sampling frame as a circular list and continue the selection of samples from the beginning of the list after exhausting the list during the first cycle. An alternative procedure is to keep k non-integer and continue the sample selection as follows:

Let us consider, k=5.71, and r=4. So, the first sample is 4th in the list. The second = (4+5.71) =9.71 ~9th in the list, the third =(4+2*5.71) =15.42 ~ 15th in the list, and so on. (The last sample is: 4+5.71*(175-1) = 997.54 ~ 997th in the list).

Note that, k is switching between 5 and 6.

Advantages: Systematic sampling has many attractiveness: 1. 2. 3. Provides a better random distribution than SRS Simple to implement May be started without a complete listing frame (say, interview of every 9th patient coming to a clinic).

4. With ordered list, the variance may be smaller than SRS (see below for exceptions)

Disadvantages:

Periodicity (cyclic variation)

linear trend

Proposed solution: Circular systematic sampling,

In this case, the end of list is connected to the beginning of the list, making the list circular. This allows the random start r to start between 1 to n (1<r<n), rather than between 1 to k as in the linear systematic sampling.

Example: Say we want to take a sample of size 10 from a population of 100. We will select the first sample randomly, say, 85th element. So, our sample will consist of the following elements: 85, 95, 5, 15, 25, 35, 45, 55, 65, 75 Not without criticism.

When to use systematic sampling?

Even preferred over SRS When no list of population exists When the list is roughly of random order Small area/population

In case of systematic sampling, the selection probability of each element in the population is the same, i.e., epsem

How efficient is this sampling scheme?

This depends on how the elements are listed: randomly arranged or sorted in a particular fashion by a variable. If the listing is randomly ordered, you may view this as an SRS. Unfortunately this statement is oversimplified. Another factor will affect systematic sampling: intraclass correlation.

DK-The British Medical Association-Complete Family Health Guide
90% (10)
DK-The British Medical Association-Complete Family Health Guide
992 pages
Handbook of Market Segmentation
100% (4)
Handbook of Market Segmentation
252 pages
Medical Devices by Facility
100% (1)
Medical Devices by Facility
60 pages
Sampling
No ratings yet
Sampling
7 pages
Sampling Lecture
No ratings yet
Sampling Lecture
10 pages
Lecture 2 SRS
No ratings yet
Lecture 2 SRS
25 pages
Statistics Population Sample Randomly Probability Probability
No ratings yet
Statistics Population Sample Randomly Probability Probability
7 pages
Chapter 2b March2020 Systematic
No ratings yet
Chapter 2b March2020 Systematic
34 pages
Ens185 Part 2
No ratings yet
Ens185 Part 2
91 pages
Week9 BAM
No ratings yet
Week9 BAM
26 pages
Simple Random Sampling Without Replacement (SRSWOR)
No ratings yet
Simple Random Sampling Without Replacement (SRSWOR)
23 pages
Homogeneity of The Population: No Less Than 30 Elements
No ratings yet
Homogeneity of The Population: No Less Than 30 Elements
2 pages
LECTURE 3
No ratings yet
LECTURE 3
8 pages
Types of Random Sampling Techniques
80% (10)
Types of Random Sampling Techniques
3 pages
Types of Random Sampling Techniques
No ratings yet
Types of Random Sampling Techniques
3 pages
Lesson 1. Simple Random Sampling
No ratings yet
Lesson 1. Simple Random Sampling
23 pages
Sampling 1 PDF
No ratings yet
Sampling 1 PDF
8 pages
Sampling Unit 8
No ratings yet
Sampling Unit 8
7 pages
Chapter2 Sampling Simple Random Sampling PDF
No ratings yet
Chapter2 Sampling Simple Random Sampling PDF
23 pages
Introduction To Probabilistic Sampling
No ratings yet
Introduction To Probabilistic Sampling
39 pages
Chapter2 Sampling Simple Random Sampling
No ratings yet
Chapter2 Sampling Simple Random Sampling
24 pages
Simple Random Sampling
No ratings yet
Simple Random Sampling
10 pages
Probability and Non Probability Sampling
100% (1)
Probability and Non Probability Sampling
7 pages
SP Sampling Lect 4
No ratings yet
SP Sampling Lect 4
13 pages
Types of Sampling Design
No ratings yet
Types of Sampling Design
8 pages
B.SC (CS With AI) Unit - 1
No ratings yet
B.SC (CS With AI) Unit - 1
19 pages
slidesc53_1_2 statistics
No ratings yet
slidesc53_1_2 statistics
27 pages
Ss Notes
No ratings yet
Ss Notes
34 pages
notes on sample survey
No ratings yet
notes on sample survey
34 pages
A sample design of simple random sampling is one of the most basic and widely used sampling methods in statistics
No ratings yet
A sample design of simple random sampling is one of the most basic and widely used sampling methods in statistics
8 pages
Session 9
No ratings yet
Session 9
29 pages
Sampling
No ratings yet
Sampling
86 pages
Lect8 Math231
No ratings yet
Lect8 Math231
30 pages
Sampling Techniques
No ratings yet
Sampling Techniques
3 pages
Sampling Distribution: Estimation and Testing of Hypothesis
No ratings yet
Sampling Distribution: Estimation and Testing of Hypothesis
34 pages
Study Population and Sampling
No ratings yet
Study Population and Sampling
22 pages
7.1 Basic Concepts
No ratings yet
7.1 Basic Concepts
28 pages
Chapter 5 Statistics
No ratings yet
Chapter 5 Statistics
11 pages
Systematic Sampling
No ratings yet
Systematic Sampling
18 pages
Sampling Design: Basic Concepts and Procedure: Sampling Frame. Known. Random Samples
No ratings yet
Sampling Design: Basic Concepts and Procedure: Sampling Frame. Known. Random Samples
18 pages
Sampling Methods: Random Selection or Chance. in Order To Have A Random Selection Method, You Must Set Up Some
No ratings yet
Sampling Methods: Random Selection or Chance. in Order To Have A Random Selection Method, You Must Set Up Some
14 pages
SP Sampling Lect 5
No ratings yet
SP Sampling Lect 5
28 pages
Formalizing The Concepts: Simple Random Sampling: Juan Muñoz Kristen Himelein March 2013
No ratings yet
Formalizing The Concepts: Simple Random Sampling: Juan Muñoz Kristen Himelein March 2013
25 pages
Self Assessment Tool-1
No ratings yet
Self Assessment Tool-1
4 pages
Demo Classes
No ratings yet
Demo Classes
23 pages
Research Methods (Sampling)
No ratings yet
Research Methods (Sampling)
21 pages
Stat 115 - Chapter 1
No ratings yet
Stat 115 - Chapter 1
156 pages
Sampling For Research Methods For Midwife
No ratings yet
Sampling For Research Methods For Midwife
123 pages
Unit 2
No ratings yet
Unit 2
15 pages
9 Sample Design
No ratings yet
9 Sample Design
42 pages
2012 Sampling Theory Art
No ratings yet
2012 Sampling Theory Art
87 pages
Sampling: Click at Http://goo - gl/7Dztn
No ratings yet
Sampling: Click at Http://goo - gl/7Dztn
8 pages
Design Summary - Survey Sampling
No ratings yet
Design Summary - Survey Sampling
4 pages
Sampling
No ratings yet
Sampling
5 pages
CH 7
No ratings yet
CH 7
17 pages
Stat II Chapter One
No ratings yet
Stat II Chapter One
5 pages
Lec. Note E4
No ratings yet
Lec. Note E4
5 pages
Sampling PDF
No ratings yet
Sampling PDF
120 pages
9 Sampling
No ratings yet
9 Sampling
5 pages
Statistics II Essentials
From Everand
Statistics II Essentials
Emil Milewski
2.5/5 (1)
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
Measurement of Length - Screw Gauge (Physics) Question Bank
From Everand
Measurement of Length - Screw Gauge (Physics) Question Bank
Mohmmad Khaja Shareef
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Ecriture Fraction 1
No ratings yet
Ecriture Fraction 1
1 page
American Statistical Association, Taylor & Francis, Ltd. Journal of The American Statistical Association
No ratings yet
American Statistical Association, Taylor & Francis, Ltd. Journal of The American Statistical Association
2 pages
The Econometric Society, Wiley Econometrica: This Content Downloaded From 193.227.1.29 On Sat, 23 Jul 2016 23:17:22 UTC
No ratings yet
The Econometric Society, Wiley Econometrica: This Content Downloaded From 193.227.1.29 On Sat, 23 Jul 2016 23:17:22 UTC
22 pages
OpenAir Calculating Utilization Services Company
No ratings yet
OpenAir Calculating Utilization Services Company
8 pages
Selling Skills Module Final
No ratings yet
Selling Skills Module Final
11 pages
Efficient Optimal Surface Detection: Theory, Implementation and Experimental Validation
No ratings yet
Efficient Optimal Surface Detection: Theory, Implementation and Experimental Validation
8 pages
11 Chapter Xi: Multi-Objective Programming
No ratings yet
11 Chapter Xi: Multi-Objective Programming
23 pages
Pharmaceutical Representatives: An Evidence-Based Review With Suggested Guidelines For Clerkships
No ratings yet
Pharmaceutical Representatives: An Evidence-Based Review With Suggested Guidelines For Clerkships
5 pages
Triumph Hotel & Conference Centre
No ratings yet
Triumph Hotel & Conference Centre
13 pages
Sample Size Workshop 1
No ratings yet
Sample Size Workshop 1
51 pages
The Philosophy Behind Sampling in Educational Research: Review
No ratings yet
The Philosophy Behind Sampling in Educational Research: Review
5 pages
Creative Commons Attribution-Noncommercial-Sharealike License
No ratings yet
Creative Commons Attribution-Noncommercial-Sharealike License
50 pages
Session Five: Random Sampling
No ratings yet
Session Five: Random Sampling
7 pages
3 14 Recommendation For Promotion
No ratings yet
3 14 Recommendation For Promotion
2 pages
Mean and Variance Estimation
No ratings yet
Mean and Variance Estimation
2 pages
Expected Value of The Sample Variance
No ratings yet
Expected Value of The Sample Variance
2 pages

Creative Commons Attribution-Noncommercial-Sharealike License

Uploaded by

Creative Commons Attribution-Noncommercial-Sharealike License

Uploaded by

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License.

Methods in Sample Surveys Simple Random Sampling Systematic Sampling Lecture 2

Simple Random Sampling (SRS)

Two type of SRS

SRS with replacement (SRSWR)

SRS without replacement (SRSWOR)

N! 4! = =6 Mathematically, ( N n)! n! 2!2! AB, AC, AD BC, BD CD = 6 samples

Sample selection probability

What is the probability that the sample AB will be selected?

Element selection probability in sample

What is the probability that element A will be selected?

What is the probability that element A will be selected?

So, inclusion probability is same under SRSWR and SRSWOR

f = n/N is called sampling fraction

Variance of y under SRSWR:

Variance of y under SRSWOR:

S2 N n = n N when N is very l arg e subtracting 1 may not have any effect

n N n The quantity =1 is called finite population correction, or fpc. N N

A comparison: Population parameter Var (X) =

Sample estimate by SRSWR

Sample estimate by SRSWOR

Why do we use SRS?

Compare a comparison of SRSWOR to SRSWR

Proportions from a Simple Random sampling

The total number with response 1 (yes/positive attribute) in the population:

The proportion with response 1 (yes/positive attribute) in the population:

The proportion of sample having the characteristics is:

The sample variance of proportion is:

The sample proportion p is an unbiased estimator of the population proportion P.

An unbiased estimator of Var(p) from simple random sampling:

pq N n pq n = 1 without replacement n 1 N n 1 N Usually in practice, Var(p) is approximated as: var(p) =

Implementation of SRS sampling:

Computer generated random numbers: (STATA output)

Automated random sample selection

{draws a sample of 10 units}

sample 150 FROM 1000 {exact no. of cases selected}

Levy and Lemeshow, 1999

In reality the random selection may be like this:

Then, we select a sample element from each block.

[What could be done if k=N/n is not an integer?

Selection of systematic sampling when sampling interval (k) is not an integer

Which n should be chosen?

Note that, k is switching between 5 and 6.

Periodicity (cyclic variation)

Proposed solution: Circular systematic sampling,

When to use systematic sampling?

How efficient is this sampling scheme?

You might also like