Lesson-Ii Stats
Lesson-Ii Stats
2
AND
SAMPLING DISTRIBUTION
A good survey research paper relies on the precision of the methods, and
procedures of conducting the study. This includes reliability of the selected subjects or
respondents of the study, the validity of information gathered out of the distributed
questionnaires, and the accuracy of measurements used in answering the research
questions and other observations. A study which were conducted in the entire
population assures us of 100% reliability since the responses are obtained from all
members of the population. This means that the data were collected by a complete
enumeration method or the so-called census taking. However, it is impossible for many
types of research to conduct a survey to all members of the population especially if the
population size is infinite or finite but very large. To minimize the time and cost involved
in conducting the survey to a large population, it has been accepted that the information
about the population will be based only from a small portion of the population.
On the other hand, considering only the responses of a small portion of the
population may result into some possible biases due to improper selection of the
samples and errors due to the manner of measuring the desired observations since the
selected sample may not have equally represented the characteristics of the entire
population. Hence, it is very important to consider the methods used in selecting the
sample.
General Objectives:
The following terms are defined for us to fully understand the concept of
sampling distribution.
Sample is a subset of a population; hence, the sample must possess the same
characteristics of the population. Sample size is denoted by “n”.
Sampling
Sample
Population Inference
FIGURE 2.1 Population and Sample
.
KEY TERMS
The following terms are defined as guide whenever the terms are used in the
discussion of the entire module.
Parameter is a measurable characteristic of a population, such as population
mean denoted by “µ” (mu) and population standard deviation denoted by “σ”
(sigma).
Statistic is a measurable characteristic of a sample such as sample mean
denoted by “ẋ” (x-bar) or sample standard deviation, denoted by “s”.
Sampling distribution is a probability distribution of statistics. When we say
sampling distribution of the mean, we are referring to the mean values of every
possible samples that can be obtained from the population.
Sampling with replacement is used when a population element can be selected
more than one time. After a person or item is selected, it is returned to the frame
where it has the same probability of being selected again. However, it is
generally considered more desirable to have a sample of different people or
items than to permit repetition of measurements on the same person or item.
Sampling without replacement is used when a population element can be
selected only once. A person or item once selected, it is not returned to the frame
and therefore cannot be selected again.
Standards error refers to the standard deviation of the sampling distribution.
Hence, the standard error of the mean is the standard deviation of the sampling
distribution of the mean.
Variable noted by the letter X and Y, is a characteristic of interest for each person
or thing in a population. It may be qualitative or quantitative.
Data are the actual values of the variable. They may be numbers, or they may be
in words. Datum is a single value.
Example:
Determine what the key terms refer to the following study. We want to know the
average amount of money first year college students spend at ABC College on school
supplies that do not include books. We randomly survey 100 first year students at the
college. Three of those students spent Php500, Php1,000, Php1,500.
Solution:
The Population is all first year students attending ABC College this terms.
The sample could be all students enrolled on one section of a beginning
statistics at ABC College (although this sample may not represent the entire
population).
The parameter is the average (mean) amount of money spent (excluding books)
by first year college students at ABC College this term.
The statistic is the average (mean) amount of money spent (excluding books) by
first year college students in the sample.
The variable could be the amount of money spent (excluding books) by one first
year student. Let X = the amount of money spent (excluding books) by one first
year students at ABC College this term.
The data are the peso amounts spent by the first year students in the sample.
Examples of the data are Php500, Php1,000, Php1,500.
SAMPLE DESIGNS
A sample design can be described by two factors.
1. Sampling method. It is the process of selecting a part from a given whole. The
primary purpose of which is to make a generalization about the (unknown)
characteristics of a whole. It is also central to the study of statistical inference.
The rules and procedures by which some of the elements of the population are
included in the sample is referred to sampling method. Random sampling,
stratified sampling, and cluster sampling are some common sampling method
that will be discussed in the next lesson.
2. Estimator. It refers to the process of calculating sampling statistics. Different
estimators can be used in different sampling methods. For instance, the equation
used in the computation of a mean score with a random sample is different from
the equation for computing a mean score with a stratified sample. The formula to
be used in the standard error may vary form one sampling method to another.
Survey objectives and survey resources are two factors where the “best” sampling
design depend. For example, the most economical design that provides a desired level
of precision may be selected by a researcher. In cases wherein budget is limited, the
researcher must then choose the design that provides the greatest precision without
going over budget.