0% found this document useful (0 votes)
4 views

Week 4 Lecture

Uploaded by

keratin923
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

Week 4 Lecture

Uploaded by

keratin923
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

STATISTICS AND PROBABILITY

WEEK 4: RANDOM SAMPLING

In research, collecting data can either be done in the entire population or the subset
of this population called sample. If a researcher opts to use sample rather than a
population, he must take considerations on the number of samples and how these
samples can be chosen out of his target population.
Sampling - Is the act, process, or technique of selecting a representative part of a
population for the purpose of determining parameters or characteristics of whole
population.
Population - includes all its elements from a set of data. The size of the population is the
number of observations in the population. For example, if ABS-CBN network has 11,000
employees having the required blood type in a certain study, then we have a population of
size 11,000.
Sample - consists of one or more data drawn from the population. It is a subset, or an
incomplete set taken from a population of objects or observations. Taking samples instead
of the population is less time-consuming and cost-effective. Although sampling has
advantages, it can also be a source of bias and inaccuracy.
Random Sampling - is a sampling method of choosing representatives from the
population wherein every sample has an equal chance of being selected. Accurate data
can be collected using random sampling techniques.

Types pf Random Sampling


a. A Simple random sampling is the most basic random sampling wherein each
element in the population has an equal probability of being selected. They are
usually represented by a unique identification number that is written on equal-sized
and shaped papers and then selection of samples is possible through the lottery
method. Random numbers selected to decide which elements are included as the
sample. The drawing number of paper is based on the desired number of samples.
b. Systematic random sampling is a random sampling that uses a list of all the
elements in the population and then elements are being selected based on the kth
consistent intervals. To get the kth interval, divide the population size by the sample
size.
c. Stratified random sampling is a random sampling wherein the population is
divided into different strata or divisions. The number of samples will be
proportionately picked in each stratum that is why all strata are represented in the
samples.
d. Cluster sampling is a random sampling wherein population is divided into clusters
or groups and then the clusters are randomly selected. All elements of the clusters
randomly selected are considered the samples of the study.
Probability sampling - The sampling techniques that involve random selection. Likewise, simple
random, systematic, and stratified and cluster sampling are all probability sampling techniques.
There are also sampling techniques that do not involve random selection of data. They are called
non-probability sampling.

WEEK 4: PARAMETER AND STATISTICS

Parameter - is a measure that is used to describe the population.


Statistic - is a measure that is used to describe the sample.

“Parameter and statistics are usually computed through different measures of central
tendency (Mean, Median, Mode) and variability (Range, Standard deviation, Variance). “
If the mean of a sample drawn from a population is calculated, this value is used as a
statistic, or an estimate of the population mean.

The mean is the sum of the data divided by the number of data. The mean is used to
describe where the set of data tends to concentrate at a certain point. Population mean
is the mean computed based on the elements of the population or data. The symbol µ (read
as “myu”) is used to represent population mean. To compute for the population mean, we
simply add all the data (X) and then, divide it by the number of elements in the population
(N).
𝛴𝑋
We apply the formula: 𝜇= 𝑁
Variance and standard deviation determine how to spread or to scatter each data on the
set from the mean. Standard deviation is simply the square root of the variance.
Population variance is the computed variance of the elements of the population. The
symbol 𝜎2 (read as “sigma squared”) is used to represent population variance. To compute
for the population variance,
2 ∑(𝑋−𝜇 )2
we apply the formula: 𝜎 = 𝑁
Population standard deviation is the computed standard deviation of the elements of the
population. The symbol 𝜎 (read as “sigma”) is used to represent population standard
deviation. To compute for the population standard deviation,

∑(𝑥−𝜇)2
we use the formula: 𝜎= √
𝑁
The sample mean is the average of all the data of the samples. The symbol 𝑥̅ (read as “x
bar”) is used to represent the sample mean. To compute for the sample mean, we simply
add all the data and divide it by the number of elements in the sample (n).
𝛴𝑥
We apply the formula: 𝑥̅ = 𝑛
Sample variance is the computed variance of the elements of the sample. s2 is used to
represent sample variance. To compute for the sample variance,

2 ∑(𝑥−𝑥̅ )2
we apply the formula: 𝑠 = 𝑛−1
Sample standard deviation is the computed standard deviation of the elements of the
sample. s is used to represent sample standard deviation. To compute for the sample
standard deviation,

𝛴(𝑥−𝑥̅ )2
we use the formula: 𝑠 = √
𝑛−1
Sampling error – is the error resulting from using a sample to estimate a population
characteristic. It is the difference between sample measure and the corresponding
population measure due to the fact that the sample is not a perfect representation of the
population.
𝑆𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑒𝑟𝑟𝑜𝑟 = 𝑥̅ − 𝜇

WEEK 4: SAMPLING DISTRIBUTION OF THE SAMPLE MEAN and MEAN, VARIANCE


and STANDARD DEVIATION OF THE SAMPLING DISTRIBUTION OF THE SAMPLE
MEAN

Sampling distribution of the sample mean is a frequency distribution using the


computed sample mean from all the possible random samples of a particular sample size
taken from the given population.

Steps to follow in making a sampling distribution of the sample mean:


1. Find the total number of possible samples by applying combination using N
(population size) and n (sample size).
2. List all the possible random samples and solve for the sample mean of each set of
samples.
3. Write all the means of each sample in ascending order.
4. Construct the probability distribution table.
5. Use the same table and formula in finding mean and variance of discrete random
variable.

Example: The given table shows the monthly income (in thousands of pesos) of the officers
in a government office. Suppose that the sample size is 3, find the mean, variance, and
standard deviation of the sampling distribution of the sample mean.

N = 5 (there are five values)


n = 3 (sample size)
5C3 = 10 (10 possible samples)

Mean of the Population = Mean of the Sampling Distribution of the Sample Mean
WEEK 4: DEFINING SAMPLING DISTRIBUTION OF THE SAMPLE MEAN FOR
NORMAL POPULATION WHEN VARIANCE IS KNOWN ABD UNKNOWN

Population Variance 𝜎 2 is KNOWN Population Variance 𝜎 2 is UNKNOWN


The population has a mean µ and Sample standard deviation (s)
variance 𝜎 2 Sample variance (s2)
𝜎 𝑠
Standard Error: 𝜎𝑥̅ = Standard Error: 𝑠𝑥̅ =
√𝑛 √𝑛

𝑥̅ −𝜇 𝑥̅ −𝜇
z-distribution 𝑧= 𝜎 t-distribution𝑡 = 𝑠
√𝑛 √𝑛
n≥30 n<30

Example 1: Consider a population consisting of 1,2,3,4,5,8,9, and 11. Sample of size are
drawn from this population.

Answer:
Given: N = 9, n = 3
Sampling Distribution when variance is KNOWN.
𝜎
Standard error formula: 𝜎𝑥̅ =
√𝑛

Example 2: Given the population mean of 12, and a sample standard deviation of 3 in
sample size of 125.
Answer:
Given: µ = 12, sample SD = 3, and n = 3
Sampling Distribution when variance is UNKNOWN.
𝑠
Standard error formula: 𝑠𝑥̅ =
√𝑛

You might also like