Unit 2
Unit 2
Arithmetic Mean: The mean of a variable represents its average value. It can be
calculated by using the below formula:
- where X represents the sample mean and fi represents the frequency of an ith observation of
the variable. Mean is the hypothetical value of a variable. It may or may not exist in the dataset.
1. Measures of Central Tendency
The practice of collecting samples and analyzing them to derive some useful
information is called sampling theory. Some important concepts related to sampling
theory are as follows:
• Data: Data refers to the entire set of observations that have been collected.
• Population: An entire group of subjects or objects that are to be studied and
analysed is called population.
• Sample: A sample is a portion or sub-collection of elements that are examined in
order to estimate the characteristics of a population.
• Parameter: A parameter refers to a characteristic of a sample that is generalised for
the population.
• Statistics: It is a branch of mathematics that deals with planning and conducting
experiments, obtaining data, and organising, summarising, presenting, analysing,
interpreting and drawing conclusions based on data.
4. Sampling Theory
Sampling Frame
• Sampling frame refers to the complete list of all the items (everyone and everything)
that must be studied. At first, it would appear that a sampling frame is the same as
population. But, population is general, whereas sampling frame is specific.
• For example, we may define a population as all those individuals who can be
sampled (for example, all the Indian Americans living in Texas, USA), whereas an
exhaustive list of all the Indian Americans living in Texas, USA would be considered
as the sampling frame because it is not necessary that all the Indian Americans living
in Texas, USA would be listed under the list so provided.
• In statistical research, the experimenters require a list of items in order to draw a
sample from it. It must be ensured that the sampling frame is adequate for the needs
of the experimenter.
Sampling Theory
Sampling Methods
In statistics, there are various sampling methods. Sampling methods are
divided into two categories, namely probability sampling and non-
probability sampling.
Probability sampling is the one wherein the sample has a known
probability of being selected.
In non-probability sampling, a sample does not have known probability
of being selected. In probability sampling, we can determine the
probability that each sample will be selected. In addition, we can also
determine which sampling units belong to which sample..
4. Sampling
Theory
Sampling Errors
• Errors that are involved in sampling
are shown in following figure:
5. Hypothesis
Testing
A hypothesis is a statement or a proposed explanation about
one or more populations. A hypothesis statement is usually
associated with the population parameters. A hypothesis can
be tested using a research method.
In hypothesis testing, there are two types of hypotheses,
namely null hypothesis and alternate hypothesis. The null
hypothesis (H0) is the hypothesis to be tested. Alternate
hypothesis (HA) is the hypothesis that must be accepted if the
sample data leads to rejection of H0.
Hypothesis testing, also called significance testing, is a
method which is used to test the hypothesis regarding the
population parameters using the data collected from a
sample. Alternatively, we can say that hypothesis testing is a
method of evaluating samples to learn about the
characteristics of a given population.
Hypothesis Testing
1 2 3 4
Step 1: Identify Step 2: Set the Step 3: Select a Step 4: Make a
the hypothesis to criterion upon random sample decision –
be tested. which the from the Compare the
hypothesis would population and observed value of
be tested. measure the the sample to
sample mean what we expect to
(Compute the test observe if the
statistic). claim we are
testing is true.
Hypothesis Testing