BRM U4
BRM U4
Sampling Methods
There are different methods for selecting a sample from a population, depending on the
research question, the population size, the available resources, and other factors. Some
common sampling methods include simple random sampling, stratified sampling, cluster
sampling, and systematic sampling. Each method has its own strengths and weaknesses, and
the choice of sampling method should be made carefully to ensure that the sample is
representative of the population and that any findings from the sample can be generalized to
the population.
In summary, sampling is the process of selecting a subset of individuals or elements from a
larger population for the purpose of collecting data. The population is the larger group of
individuals or elements that the sample is drawn from. The sample should be representative of
the population, and the sampling frame should include all individuals or elements that make up
the population of interest. There are different methods for selecting a sample from a population,
and the choice of sampling method should be made carefully to ensure that the sample is
representative of the population and that any findings from the sample can be generalized to
the population.
Sampling Error
Sampling error refers to the difference between the characteristics of the sample and the
characteristics of the population. Sampling error occurs because samples are only a subset of
the population, so there is always some degree of uncertainty in the findings from the sample.
For example, if a sample of 500 adults is surveyed to estimate the percentage of adults in a city
who support a new public policy, the sample proportion may be slightly different from the true
population proportion due to sampling error. The degree of sampling error depends on several
factors, including the sample size, the sampling method, and the variability of the population.
Sample Size
Sample size refers to the number of individuals or elements in the sample. The sample size
should be large enough to accurately represent the population while also being feasible in terms
of time and resources. A larger sample size generally reduces sampling error and increases the
precision of estimates from the sample. For example, if a researcher wants to estimate the
proportion of all customers who are satisfied with a product, they might need a larger sample
size to get a more precise estimate than if they were estimating the proportion of all customers
who have returned a product. However, a larger sample size also means higher costs and longer
processing times. Therefore, sample size should be chosen based on the research question, the
population size, and the available resources.
In probability sampling, how large a sample should be is a function of the variation in the
population parameters under study and the estimating precision needed by the researcher. Some
principles that influence sample size include:
• The greater the dispersion or variance within the population, the larger the sample must be to
provide estimation precision.
• The greater the desired precision of the estimate, the larger the sample must be.
• The narrower or smaller the error range, the larger the sample must be.
• The higher the confidence level in the estimate, the larger the sample must be.
• The greater the number of subgroups of interest within a sample, the greater the sample size
must be, as each subgroup must meet minimum sample size requirements.
Characteristics of a good sample
In business research, selecting a good sample is crucial to obtain reliable and valid results that
can inform decision-making. A good sample should possess certain characteristics that ensure
both precision and accuracy of the research findings. Two of the most important characteristics
of a good sample are precision and accuracy.
1. Precision refers to the degree to which the sample accurately reflects the population
from which it was drawn. A precise sample will produce research findings that are
representative of the population and can be confidently generalized to the larger
population. In other words, a precise sample minimizes sampling error, which is the
difference between the characteristics of the sample and the characteristics of the
population. For example, let us say a business wants to conduct market research on
consumer attitudes towards a new product. The population of interest is all consumers
in a particular region. A precise sample would include consumers from all
demographic groups, income levels, and geographic locations within that region to
ensure that the research findings accurately represent the attitudes of the entire
population.
2. Accuracy, on the other hand, refers to the degree to which the sample is free from
bias. A sample is unbiased if all members of the population have an equal chance of
being selected. An accurate sample is important because it ensures that the research
findings are not skewed in favor of one group or another, which could lead to
incorrect conclusions and poor decision-making. For example, let us say a business
wants to conduct research on employee satisfaction with their work environment. An
accurate sample would ensure that all employees in the company have an equal
chance of being selected, regardless of their job title, department, or tenure with the
company. This would help ensure that the research findings accurately reflect the
views of all employees, not just those in a particular department or at a certain level
of seniority. A good sample is one that accurately reflects the population of interest
and provides unbiased information. This is essential for making accurate conclusions
and informed decisions in business research. In addition to precision and accuracy,
there are several other characteristics that contribute to a good sample.
3. Representativeness: A good sample is representative of the population from which
it was drawn. This means that the sample should include individuals or elements from
all demographic groups, regions, and other relevant characteristics that are present in
the population. For example, if a business is conducting market research on a new
product, the sample should include individuals from all income levels, education
levels, and geographic locations within the target market.
4. Randomness: A good sample is selected using a random process. This means that all
individuals or elements in the population have an equal chance of being selected for
the sample. Random sampling ensures that the sample is free from bias and produces
unbiased results. For example, if a business is conducting employee satisfaction
research, a random sample of employees should be selected to ensure that all
employees have an equal chance of being included in the sample.
5. Size: The size of the sample is an important characteristic that affects the precision
of the results. A larger sample size generally produces more precise results because
it reduces the sampling error. However, a larger sample size can also be more
expensive and time-consuming to collect. The appropriate sample size depends on
the research question, the population size, and the available resources.
6. Sampling method: There are several different methods for selecting a sample, each
with its own strengths and weaknesses. The choice of sampling method depends on
the research question, the population size, and the available resources. Some common
sampling methods include simple random sampling, stratified sampling, cluster
sampling, and systematic sampling.
7. Data quality: A good sample produces high-quality data that is accurate, reliable,
and valid. This means that the data should be collected using valid and reliable
measures, and that the data collection process should be standardized to ensure
consistency across all individuals or elements in the sample.
Sampling Design within the Research Process
Types of Sampling Designs
Representation
Introduction to Hypothesis
A hypothesis may be defined as a proposition, or a set of propositions set forth as an explanation
for the occurrence of some specified group of phenomena either asserted merely as a
provisional conjecture to guide some investigation or accepted as highly probable in the light
of established facts. Quite often a research hypothesis is a predictive statement, capable of
being tested by scientific methods, that relates an independent variable to some dependent
variable.
For example, consider statements like the following ones:
“Students who receive counselling will show a greater increase in creativity than students not
receiving counselling” Or “the automobile A is performing as well as automobile B.”
These are hypotheses capable of being objectively verified and tested. Thus, we may conclude
that a hypothesis states what we are looking for and it is a proposition which can be put to a
test to determine its validity.
A type I error appears when the null hypothesis (H0) of an experiment is true, but still, it is
rejected. A Type I error means rejecting the null hypothesis when it is true. It means concluding
that results are statistically significant when they came about purely by chance or because of
unrelated factors. A type I error is often called a false positive (an event that shows that a given
condition is present when it is absent). The probability of making a Type I error is the
significance level, or alpha (α). A type II error appears when the null hypothesis is false but
mistakenly fails to be refused. A Type II error means not rejecting the null hypothesis when it
is actually false. A type II error is also known as false negative. The probability of making a
Type II error is beta (β)
Example: Type I vs Type II error
You decide to get tested for COVID-19 based on mild symptoms. There are two errors that
could potentially occur:
Type I error (false positive): the test result says you have coronavirus, but you do not.
Type II error (false negative): the test result says you do not have coronavirus, but you actually
do.
Now we have a “less than” in the alternative hypothesis. This means that instead of performing
a two-tailed test, we will perform a left-sided one-tailed test.
i) Level of Significance: The level of significance is denoted by the Greek symbol α (alpha).
Therefore, the level of significance is defined as follows:
Significance Level = p (type I error) = α. It is the probability of rejecting the null hypothesis
when it is true. For example, a significance level of 0.05 indicates a 5% risk of concluding that
a difference exists when there is no actual difference. Lower significance levels indicate that
you require stronger evidence before you will reject the null hypothesis.
To test a hypothesis means to tell (based on the data the researcher has collected) whether the
hypothesis seems to be valid. In hypothesis testing the main question is: whether to accept the
null hypothesis or not to accept the null hypothesis? Procedure for hypothesis testing refers to
all those steps that we undertake for making a choice between the two actions i.e., rejection
and acceptance of a null hypothesis. The various steps involved in hypothesis testing are stated
below:
P – Value Approach
What is P-Value?
The P-value refers to a continuous measure of the evidence against a model to determine the
tendency of occurrence of an event or not. It is also a method of examining the level of marginal
significance within a statistical hypothesis representing the tendency of the occurrence of a
given event. The p-value is an alternative to rejection points, it provides a level of significance
in which the null hypothesis should be rejected or ignored. A smaller p-value is an indicator of
strong evidence supporting the alternative hypothesis.
P-value is a statistical measure, that helps to determine whether the hypothesis is correct or not.
P-value is a number that lies between 0 and 1. The level of significance(α) is a predefined
threshold that should be set by the researcher. It is generally fixed as 0.05.
Real World Example of P-Value
For example, an investor affirms that their investment performance equals that of the Standard
& Poor's (S & P) 500 Index. To determine this, a two-tailed test is conducted. The null
hypothesis declares that over a period or time frame, the portfolio's returns will be equivalent
to the S&P 500's returns. The alternative hypothesis, on the other hand, states that the portfolio's
returns did not equal the S&P 500's returns. If a one-tailed test had been conducted, the
portfolio's returns in an alternative hypothesis would have been either less or greater than the
S&P 500's returns. The commonly used p-value is 0.05, if the investor drives at a conclusion
that the p-value is less than 0.05, there is cogent evidence against the null hypothesis. With
this, the investor would ignore the null hypothesis and accept the alternative hypothesis. If the
p-value is greater than 0.05, there is an indication of weak evidence against the conjecture or
assumption. Then, the null hypothesis will be accepted by the investor. In the scenario where
the p-value is 0.001, there is strong evidence against the null hypothesis, and the portfolio's
returns did not equal the S&P 500's returns.