Week 4 Lecture
Week 4 Lecture
In research, collecting data can either be done in the entire population or the subset
of this population called sample. If a researcher opts to use sample rather than a
population, he must take considerations on the number of samples and how these
samples can be chosen out of his target population.
Sampling - Is the act, process, or technique of selecting a representative part of a
population for the purpose of determining parameters or characteristics of whole
population.
Population - includes all its elements from a set of data. The size of the population is the
number of observations in the population. For example, if ABS-CBN network has 11,000
employees having the required blood type in a certain study, then we have a population of
size 11,000.
Sample - consists of one or more data drawn from the population. It is a subset, or an
incomplete set taken from a population of objects or observations. Taking samples instead
of the population is less time-consuming and cost-effective. Although sampling has
advantages, it can also be a source of bias and inaccuracy.
Random Sampling - is a sampling method of choosing representatives from the
population wherein every sample has an equal chance of being selected. Accurate data
can be collected using random sampling techniques.
“Parameter and statistics are usually computed through different measures of central
tendency (Mean, Median, Mode) and variability (Range, Standard deviation, Variance). “
If the mean of a sample drawn from a population is calculated, this value is used as a
statistic, or an estimate of the population mean.
The mean is the sum of the data divided by the number of data. The mean is used to
describe where the set of data tends to concentrate at a certain point. Population mean
is the mean computed based on the elements of the population or data. The symbol µ (read
as “myu”) is used to represent population mean. To compute for the population mean, we
simply add all the data (X) and then, divide it by the number of elements in the population
(N).
𝛴𝑋
We apply the formula: 𝜇= 𝑁
Variance and standard deviation determine how to spread or to scatter each data on the
set from the mean. Standard deviation is simply the square root of the variance.
Population variance is the computed variance of the elements of the population. The
symbol 𝜎2 (read as “sigma squared”) is used to represent population variance. To compute
for the population variance,
2 ∑(𝑋−𝜇 )2
we apply the formula: 𝜎 = 𝑁
Population standard deviation is the computed standard deviation of the elements of the
population. The symbol 𝜎 (read as “sigma”) is used to represent population standard
deviation. To compute for the population standard deviation,
∑(𝑥−𝜇)2
we use the formula: 𝜎= √
𝑁
The sample mean is the average of all the data of the samples. The symbol 𝑥̅ (read as “x
bar”) is used to represent the sample mean. To compute for the sample mean, we simply
add all the data and divide it by the number of elements in the sample (n).
𝛴𝑥
We apply the formula: 𝑥̅ = 𝑛
Sample variance is the computed variance of the elements of the sample. s2 is used to
represent sample variance. To compute for the sample variance,
2 ∑(𝑥−𝑥̅ )2
we apply the formula: 𝑠 = 𝑛−1
Sample standard deviation is the computed standard deviation of the elements of the
sample. s is used to represent sample standard deviation. To compute for the sample
standard deviation,
𝛴(𝑥−𝑥̅ )2
we use the formula: 𝑠 = √
𝑛−1
Sampling error – is the error resulting from using a sample to estimate a population
characteristic. It is the difference between sample measure and the corresponding
population measure due to the fact that the sample is not a perfect representation of the
population.
𝑆𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑒𝑟𝑟𝑜𝑟 = 𝑥̅ − 𝜇
Example: The given table shows the monthly income (in thousands of pesos) of the officers
in a government office. Suppose that the sample size is 3, find the mean, variance, and
standard deviation of the sampling distribution of the sample mean.
Mean of the Population = Mean of the Sampling Distribution of the Sample Mean
WEEK 4: DEFINING SAMPLING DISTRIBUTION OF THE SAMPLE MEAN FOR
NORMAL POPULATION WHEN VARIANCE IS KNOWN ABD UNKNOWN
𝑥̅ −𝜇 𝑥̅ −𝜇
z-distribution 𝑧= 𝜎 t-distribution𝑡 = 𝑠
√𝑛 √𝑛
n≥30 n<30
Example 1: Consider a population consisting of 1,2,3,4,5,8,9, and 11. Sample of size are
drawn from this population.
Answer:
Given: N = 9, n = 3
Sampling Distribution when variance is KNOWN.
𝜎
Standard error formula: 𝜎𝑥̅ =
√𝑛
Example 2: Given the population mean of 12, and a sample standard deviation of 3 in
sample size of 125.
Answer:
Given: µ = 12, sample SD = 3, and n = 3
Sampling Distribution when variance is UNKNOWN.
𝑠
Standard error formula: 𝑠𝑥̅ =
√𝑛