Chapter 1: Sampling

Lassad El Moubarki
Tunis Business School

October 6, 2023

L. El Moubarki Sampling October 6, 2023 1 / 62

Outline
1 The questionnaire
    Definition
    Responses quantification
2 Generalities about Sampling
    Sampling vs census
    Why sample?
3 Probability sampling techniques
    Simple random sampling (SRS)
    Systematic sampling
    Sampling with probability proportional to unit size
    Stratified random sampling
    Cluster sampling
    Multistage sampling
4 Non-probability methods
    Judgmental sampling
    Quota method
    Route sampling
5 Sample adjustment
    Adjustment according to a quantitative variable
    Adjustment by weight
    Other method

The questionnaire

A questionnaire is a research instrument consisting of a series of questions for the purpose of
gathering information from respondents.
Advantages
    Questionnaires are cheap and require less effort than verbal surveys.
    They have standardized answers, which makes the responses simple to analyse.
Disadvantages
    Standardized answers may frustrate respondents. Questionnaires are also sharply limited by
    the fact that respondents must be able to read the questions and respond to them. Thus, for
    some demographic groups, conducting a survey by questionnaire may not be feasible.

Types of responses

One of the more difficult skills in data analysis is deciding which statistical models and tests to
use in a particular situation. The appropriate statistical model often depends on the type of the
variable. Statistical variables can be classified as follows.
Categorical (Qualitative)
    Categorical nominal: the categories have no natural order.
        Example: Are you married?
    Categorical ordinal: the categories have a natural order.
        Example: Which honours grade (mention) did you obtain in your baccalaureate?
Quantitative (Numerical)
    Continuous: responses are numbers that can be broken into finer and finer units; the
    possible values fill an interval.
        Example: What is your height?
    Discrete: responses take separate, countable values, typically whole numbers.
        Example: How many children do you have?
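
As an illustration only, the four variable types above could be encoded in R roughly as follows. The data and the variable names (married, mention, height, children) are hypothetical and not taken from the course; the point is that the type chosen for a variable determines which operations make sense on it.

```r
# Hypothetical answers from five respondents (illustrative data only)
married  <- factor(c("yes", "no", "no", "yes", "no"))          # categorical nominal
mention  <- factor(c("good", "fair", "very good", "fair", "good"),
                   levels = c("fair", "good", "very good"),
                   ordered = TRUE)                               # categorical ordinal
height   <- c(1.72, 1.65, 1.80, 1.58, 1.75)                      # continuous (metres)
children <- c(0L, 2L, 1L, 3L, 0L)                                # discrete (counts)

table(married)           # nominal: only frequencies are meaningful
mention < "very good"    # ordinal: order comparisons are also meaningful
mean(height)             # quantitative: arithmetic is meaningful
table(children)          # discrete: counts of each whole-number value
```

An unordered factor such as married would return NA with a warning on an order comparison, which is one way R enforces the nominal/ordinal distinction.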

Rensis Likert Scale

A Likert scale is a psychometric scale commonly used in research that employs
questionnaires. It is widely used in marketing.
When responding to a Likert item, respondents specify their level of agreement or
disagreement on a symmetric agree-disagree scale for a series of statements. Thus, the
range captures the intensity of their feelings for a given item.
A scale can be created as the simple sum or average of questionnaire responses over the
set of individual items (questions).
In so doing, Likert scaling assumes that the distances between consecutive choices (answer
options) are equal.
To allow a neutral answer, the number of choices must be odd.
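
A minimal sketch of how a scale score could be computed as the sum or average over items, assuming a 5-point coding (1 = strongly disagree, 5 = strongly agree, 3 = neutral). The response matrix and its row and column names are hypothetical.

```r
# Hypothetical answers of 4 respondents to 3 Likert items,
# coded 1 = strongly disagree ... 5 = strongly agree (3 = neutral)
responses <- matrix(c(4, 5, 3,
                      2, 1, 2,
                      3, 3, 3,
                      5, 4, 5),
                    nrow = 4, byrow = TRUE,
                    dimnames = list(paste0("resp", 1:4), paste0("item", 1:3)))

scale_sum  <- rowSums(responses)    # scale score as the simple sum of the items
scale_mean <- rowMeans(responses)   # scale score as the average of the items

print(cbind(responses, sum = scale_sum, mean = scale_mean))
```

Summing or averaging the codes is only meaningful under the equal-distance assumption stated above.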

Example: Website user survey (source Wikipedia)

Sampling vs census

Census and sampling are two different techniques used for collecting survey data about a
reference population.
In a census, all the members of the population are enumerated.
Sampling, on the other hand, is a widely used statistical method in which a subset of the
reference population is selected, usually at random.
A census implies complete enumeration of the study objects, whereas sampling is the
enumeration of only the subgroup of elements chosen for participation.
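
The contrast can be sketched in R on simulated data. The population below, its size N, the sample size n and the seed are all hypothetical choices made for illustration; in a real survey the census value is usually unknown, which is why a sample is drawn.

```r
set.seed(123)                                        # arbitrary seed, for reproducibility
N <- 5000                                            # hypothetical population size
population <- round(runif(N, min = 20, max = 200))   # hypothetical values of the variable

census_mean <- mean(population)      # census: every unit of the population is enumerated

n <- 100                             # hypothetical sample size
sampled <- sample(population, n)     # sampling: only n units are enumerated
sample_mean <- mean(sampled)         # estimate of the population mean

c(census = census_mean, sample = sample_mean)
```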

Why sample?

We need to sample when a census is unachievable or difficult to perform. Often, a census is:
Expensive.
Time-consuming.
Technically difficult to perform.

Importance of sample size

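Simulation of the law of large numbers in R: flipping a fair coin n times.

n <- 2000                 # number of coin flips
flips <- numeric(n)       # outcomes: 0 = tails, 1 = heads
p_hat <- numeric(n)       # running estimate of p = P(heads)
for (i in 1:n) {
  flips[i] <- sample(c(0, 1), 1)      # flip the coin once
  p_hat[i] <- mean(flips[1:i])        # proportion of heads after i flips
  # redraw the curve at each step to animate the convergence towards 0.5
  plot(1:i, p_hat[1:i], main = i, col = 2, type = "l", xlim = c(1, n), ylim = c(0, 1))
}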
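
To put numbers on the effect of sample size, the short sketch below compares one simulated estimate of p with the theoretical standard error of a sample proportion, sqrt(p(1 - p)/n), for a few sample sizes. The sizes and the seed are arbitrary choices, not taken from the course.

```r
set.seed(1)                          # arbitrary seed, for reproducibility only
p <- 0.5                             # fair coin
sizes <- c(10, 100, 1000, 10000)     # sample sizes to compare (arbitrary)

# One simulated estimate of p for each sample size
p_hat <- sapply(sizes, function(n) mean(rbinom(n, size = 1, prob = p)))

# Theoretical standard error of the sample proportion
se <- sqrt(p * (1 - p) / sizes)

print(data.frame(n = sizes, p_hat = p_hat, se = se))
```

The standard error shrinks like 1/sqrt(n): multiplying the sample size by 100 divides it by 10.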

[Figure: convergence as a function of n. Vertical axis: estimates of p (ticks from 0.0 to 0.5); horizontal axis: n (0 to 2000).]

Probability vs non-probability sampling

Probability sampling is a sampling method in which every subject of the population has a
known, non-zero chance of being included in the sample.
Non-probability sampling is a sampling method in which the chance that a given
individual is included in the sample is not known.
In practice, we use non-probability sampling methods when we do not have the list of all
contacts or identifiers of the population subjects (the sampling frame).
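
As a sketch of why the frame matters: when a frame is available, a probability sample can be drawn directly from it and each unit's inclusion probability is known. The frame of student identifiers below is hypothetical, as are N, n and the seed; the draw shown is a simple random sample without replacement.

```r
set.seed(42)                                  # arbitrary seed, for reproducibility only
frame <- sprintf("student_%04d", 1:1000)      # hypothetical sampling frame (N = 1000)
n <- 50                                       # hypothetical sample size

srs <- sample(frame, n)                       # simple random sample, without replacement
incl_prob <- n / length(frame)                # inclusion probability of each unit: n / N

head(srs)
incl_prob
```

Without a frame there is nothing to pass to sample(), and the inclusion probabilities cannot be computed.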