Statistical Inferences, Hypothesis Testing-1
Statistical Inferences, Hypothesis Testing-1
Hypothesis Testing
QTDM-II
Descriptive v/s Inferential Statistics
Simple Random Sampling
Each sample size of say ‗n‘ has an equal chance of getting selected.
Statistics
Descriptive Inferential
Statistics Statistics
Estimation Testing of
Hypothesis
Point Estimation
Interval
Parametric Test Non-parametric
Estimation
Test
Hypothesis Testing
Estimation: When we don‘t have any information about parameter and want to know about that using Statistic (sample)
Point Estimation: A single value calculated from sample that estimates the parameter. There are different methods of
point estimation, namely, moment method, maximum likelihood method, least squares method, etc. With the help of
these methods, broadly population characters are estimated by their sample counterparts.
• Interval estimation: A range of values within which the parameter is expected to fall, with a certain degree of
confidence. It is the determination of a range of values within which the numerical character of the population is very
likely to lie. Interval estimation is made after knowing the sampling distribution of the relevant statistic which is
discussed along with the discussion of testing of hypothesis
Hypothesis Testing
STEPS for Hypothesis Testing
1. Formation of Hypotheses
2. Selection of Test Statistic and its Sampling Distribution
3. Choice of Level of Significance (α): The confidence with which an
experimenter rejects or accepts Null Hypothesis depends on the
significance level adopted. Level of significance is the rejection
region ( which is outside the confidence or acceptance region).The
level of significance, usually denoted by the α.
4. Specification of Critical Region
5. Calculation of the value of Test Statistic under H0
6. Decision to be taken
Step-1: Formation of Hypotheses
Step-1: Formation of Hypotheses
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Step-2: Selection of Test Statistic & its Sampling Distribution
Confidence Coefficient: The confidence coefficient is the confidence level stated as a proportion, rather
than as a percentage. For example, if you had a confidence level of 99%, the confidence coefficient
would be .99.
Step-3: Choice of Level of Significance ()
Step-3: Choice of Level of Significance ()
Rejection Region: A critical region, also known as the rejection region, is a set of values for the test statistic
for which the null hypothesis is rejected. i.e. if the observed test statistic is in the critical region then we
reject the null hypothesis and accept the alternative hypothesis.
Step-3: Choice of Level of Significance ()
Level of Significance: The level of significance is the measurement of the statistical significance. It defines
whether the null hypothesis is assumed to be accepted or rejected. It is expected to identify if the result is
statistically significant for the null hypothesis to be false or rejected. The level of significance is denoted by
the Greek symbol α (alpha). Therefore, the level of significance is defined as follows:
Significance Level = p (type I error) = α
The values or the observations are less likely when they are farther than the mean. The results are written
as ―significant at x%‖. The value significant at 5% refers to p-value is less than 0.05 or p < 0.05. Similarly,
significant at the 1% means that the p-value is less than 0.01.
Step-3: Choice of Level of Significance ()
Level of Significance: The level of significance is the measurement of the statistical significance. It defines
whether the null hypothesis is assumed to be accepted or rejected. It is expected to identify if the result is
statistically significant for the null hypothesis to be false or rejected. The level of significance is denoted by
the Greek symbol α (alpha). Therefore, the level of significance is defined as follows:
Significance Level = p (type I error) = α
The values or the observations are less likely when they are farther than the mean. The results are written
as ―significant at x%‖. The value significant at 5% refers to p-value is less than 0.05 or p < 0.05. Similarly,
significant at the 1% means that the p-value is less than 0.01.
Step-3: Choice of Level of Significance ()
Step-4, 5 and 6
• One way to reduce the probability of making a Type II error is by increasing the sample size.
• Large samples generally permit us to detect even very small differences between the hypothesized values
and the actual population parameters. For a given level of increasing the sample size decreases and
therefore increases the power of the statistical test to detect that the null hypothesis, H0 is false.
• However, there is always a limit, and this affects the decision of how large a sample we can select. For
any given sample size, we must consider the trade-offs between the two possible types of errors.
• Because we can directly control the risk of Type I error, we can reduce this risk by selecting a smaller
value for . For example, if the negative consequences associated with making a Type I error are
substantial, we could select = 0.01 instead of 0.05.
• However, when we decrease , we increase , so reducing the risk of a Type I error results in an
increased risk of a Type II error. However, to reduce we could select a larger value for . Therefore, if
it is important to try to avoid a Type II error, you can select of 0.05 or 0.10 instead of 0.01.
Risks in Decision Making
Doubts