Hypothesis Testing

Uploaded by

Venkata Lokendra

Business Statistics

Agenda - Estimation and Hypothesis Testing - Week 2

1. Sampling and Inference
   a. Simple random samples
   b. Sampling distribution
   c. Central Limit Theorem
2. Estimation
   a. Point estimation
   b. Interval estimation
3. Hypothesis Testing
   a. Introduction
   b. Hypothesis Formulation
4. Basic concepts of Hypothesis Testing
   a. Importance of null
   b. Importance of test statistic
   c. Type I and Type II errors
   d. Hypothesis testing template
5. Performing a Hypothesis Test
   a. Some key ideas
   b. Assumptions
   c. Critical point
   d. Rejection region approach
   e. p-value approach
6. One-Tailed and Two-Tailed Tests
7. Confidence Interval and Hypothesis Test

Sampling and Inference
Revisiting the need for sampling

In many situations, all we have available is a sample of data, and that data is finite.

Until now, the goal was to find ways of describing, summarizing and visualising the sample data only.

Moving ahead, we want to make inferences about the "entire" population using the sample data.

Sampling : Simple Random Sampling

A sampling technique where every item in the population has an equal chance of being selected.

Why are simple random samples important?
Simple random sampling allows all the entities in the population to have an equal chance of being selected, so the sample is likely to be representative of the population.

Sampling Distribution
The sampling distribution of a statistic is the probability distribution of that statistic over many samples of the same size drawn from the population.
For example: the sampling distribution of the mean, the sampling distribution of the variance, etc.
To a great extent, statistical inference techniques are based on the sampling distribution of a statistic.

[Figure: samples of size n are drawn from the population; the distribution of their means forms the sampling distribution.]

Sampling Distribution
Central Limit Theorem

The sampling distribution of the sample mean approaches a normal distribution as the sample size gets bigger, no matter what the shape of the population distribution is.

Assumptions

- Data must be randomly sampled
- Sample values must be independent of each other
- Samples should come from the same distribution
- Sample size must be sufficiently large (n ≥ 30 is a common rule of thumb)
Central Limit Theorem

A larger sample size provides a better estimate of the population mean.

For sample size n = 5, the sample means pile up around the population mean, but with a wide spread.

For sample size n = 30, the sample means cluster much more tightly around the population mean.
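
The effect of sample size on the spread of sample means can be checked with a small simulation. The sketch below is only an illustration: the exponential population, its mean of 1, the number of repeated samples and the random seed are all assumptions chosen for the demo, not values from the slides.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

POPULATION_MEAN = 1.0   # assumed: exponential population with mean 1 (strongly skewed)
NUM_SAMPLES = 2000      # assumed: number of repeated samples per sample size


def sample_means(n, num_samples=NUM_SAMPLES):
    """Draw num_samples samples of size n and return the list of their means."""
    return [
        statistics.fmean(random.expovariate(1 / POPULATION_MEAN) for _ in range(n))
        for _ in range(num_samples)
    ]


# results[n] = (mean of the sample means, standard deviation of the sample means)
results = {}
for n in (5, 30):
    means = sample_means(n)
    results[n] = (statistics.fmean(means), statistics.stdev(means))
    print(f"n={n:>2}: mean of sample means = {results[n][0]:.3f}, "
          f"spread (std dev) = {results[n][1]:.3f}")
```

For both sample sizes the sample means centre on the population mean, but the spread for n = 30 should come out clearly smaller than for n = 5, even though the population itself is far from normal.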
Estimation

Estimation: make an inference about a population parameter based on a sample statistic.

Point Estimation
A single-value estimate of the population parameter.
E.g. The population mean as estimated from the sample mean is $40.

Interval Estimation
A range of values within which the population parameter lies with some (x%) confidence.
E.g. The population mean should lie between $38 and $42, with 95% confidence (x = 95).
Point Estimation
A point estimate of a population parameter is a single value of a statistic

Point estimates vary from sample to sample. Often an interval is used to provide a range of values the
parameter can take, instead of a single point estimate.
Interval estimation - Confidence interval

A confidence interval provides an interval, or a range of values, which is expected to cover the true unknown parameter.

The upper and lower limits of the interval (the confidence limits) are determined using the distribution of the sample mean and a multiplier which specifies the 'confidence' (the confidence level).

[Figure: an estimate with its confidence limits on either side, bracketing the true (unknown) value.]
Confidence Interval for Mean 𝜇

Interpretation of 95% Confidence Interval

- The interpretation of a 95% confidence interval is that, if the process is repeated a large number of times, then the intervals so constructed will contain the true population parameter 95% of the time.
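
The repeated-sampling interpretation can be illustrated with a short simulation. This is a sketch under stated assumptions: it uses the standard known-σ interval x̄ ± z·σ/√n (the slides do not spell out a formula here), and the population mean, standard deviation, sample size and number of repetitions are hypothetical values picked for the demo.

```python
import math
import random
from statistics import NormalDist, fmean


def z_confidence_interval(sample_mean, sigma, n, level=0.95):
    """Known-sigma confidence interval for a mean: sample_mean +/- z * sigma / sqrt(n)."""
    z = NormalDist().inv_cdf(1 - (1 - level) / 2)  # about 1.96 for level = 0.95
    margin = z * sigma / math.sqrt(n)
    return sample_mean - margin, sample_mean + margin


random.seed(1)  # fixed seed so the run is repeatable

# Hypothetical population and sampling setup (not from the slides).
TRUE_MEAN, SIGMA, N, REPEATS = 40.0, 10.0, 50, 2000

covered = 0
for _ in range(REPEATS):
    sample = [random.gauss(TRUE_MEAN, SIGMA) for _ in range(N)]
    low, high = z_confidence_interval(fmean(sample), SIGMA, N)
    if low <= TRUE_MEAN <= high:
        covered += 1

coverage = covered / REPEATS
print(f"Fraction of 95% intervals containing the true mean: {coverage:.3f}")
```

The printed fraction should land close to 0.95. Any single interval either contains the true mean or it does not; the 95% describes the procedure, not one particular interval.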

Why not 100% Confidence Interval?

- A 100% confidence interval will include all possible values.

- Hence there will be no insight into the problem.
Hypothesis Testing
Why Hypothesis?

Estimation
The problem of estimation arises when there is no previous knowledge of the population parameter. The problem is simpler in that case: a random sample is taken, a sample statistic is computed, and an appropriate point and interval estimate is suggested.

Hypothesis Testing
Often the interest is not in the numerical value of the point estimate of the parameter, but in knowing the plausibility of a hypothesis about the population parameter using sample data. Estimation is not enough to arrive at a conclusion in such cases.
What is Hypothesis?

Often we are interested in population parameter(s)

A hypothesis is a conjecture about the population parameter(s)

For example, a bulb manufacturing company is interested in knowing whether the new
manufacturing process improves reliability of the bulbs.

The objective of hypothesis testing is to SET a value for the parameter(s) and perform a statistical TEST to see whether that value is tenable in light of the evidence gathered from the sample.
Overview of Applications

Applications of Hypothesis Testing

- Testing research hypotheses: e.g. a new automobile system increases the mean mpg performance.
- Testing the validity of a claim: e.g. a manufacturer claims that 1L soft drink bottles are filled with an average of at least 0.99L.
- Testing business decisions: e.g. a new online ad has resulted in higher online conversion rates for an E-commerce website.
Stating the Hypothesis
Null and Alternative Hypotheses - Two mutually exclusive statements about the population parameter(s)

Null Hypothesis (H0)
The presumed current state of the matter, or status quo.
E.g. The new process for manufacturing bulbs does not improve reliability.

Alternative Hypothesis (Ha)
The rival opinion, research hypothesis, or an improvement target.
E.g. The new process for manufacturing bulbs improves reliability.
Null & Alternative Formulation : Example

The mean length of lumber is specified to be 8.5 m for a certain building project. A construction engineer wants to make sure that the shipments she received adhere to that specification.

The population parameter about which the hypothesis will be formed is the population mean 𝜇.

The hypotheses are

H0: 𝜇 = 8.5

Ha: 𝜇 ≠ 8.5
Tips to formulate Null & Alternative

Null Hypothesis
- Ask: am I testing a status quo that already exists?
- Negation of the research question
- Always contains equality (=, ≥, ≤)

Alternate Hypothesis
- Ask: am I testing an assumption or claim that is beyond what I know?
- Research question to be proven
- Doesn't contain equality (≠, >, <)
Basic Concepts of Hypothesis Testing
Importance of Null

Null hypothesis is assumed to be true unless reasonably strong evidence to the contrary is
found.

Based on a random sample a decision is made whether there exists reasonably strong
evidence against the null hypothesis.

- Evidence is strong (satisfies the predetermined decision rule): reject the null hypothesis in favour of the alternative hypothesis.
- Evidence is not strong (does not satisfy the predetermined decision rule): fail to reject the null hypothesis.
Importance of Test Statistic
The test statistic is calculated from the sample data and tested against the predetermined
Decision Rule.

The test statistic is a random variable that follows a standard distribution such as Normal,
T, F, Chi-square etc. Sometimes the tests are named after the test statistic

Since hypothesis testing is done on the basis of sampling distribution, the decisions made
are probabilistic.

Hence, it is very important to understand the errors associated with hypothesis testing.
Type I and Type II Errors

                      H0 is True                    H0 is False
Reject H0             Type I Error                  Correct decision
                      Prob = α                      Prob = 1 - β
                      (level of significance)       (power of the test)
Fail to reject H0     Correct decision              Type II Error
                      Prob = 1 - α                  Prob = β
Type I and Type II Errors : Example

Null Hypothesis: The patient doesn't have cancer
Alternate Hypothesis: The patient has cancer

Type I error (false positive): "The patient doesn't have cancer but the doctor says she does"

Type II error (false negative): "The patient does have cancer but the report says she doesn't"
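
The probabilities α, β and 1 - β can be estimated by simulation. The sketch below is illustrative only: it assumes a one-sided z-test with known σ, and the null mean, standard deviation, sample size, alternative mean and number of repetitions are hypothetical numbers, not values from the slides.

```python
import math
import random
from statistics import NormalDist, fmean

random.seed(2)  # fixed seed so the run is repeatable

# Hypothetical setup: H0: mu = 100 versus Ha: mu > 100, with sigma known.
MU0, SIGMA, N = 100.0, 15.0, 36
ALPHA = 0.05
REPEATS = 4000
CRITICAL_Z = NormalDist().inv_cdf(1 - ALPHA)  # one-sided critical value


def rejects_h0(true_mean):
    """Draw one sample from a normal population and apply the one-sided z-test."""
    sample = [random.gauss(true_mean, SIGMA) for _ in range(N)]
    z = (fmean(sample) - MU0) / (SIGMA / math.sqrt(N))
    return z > CRITICAL_Z


def rejection_rate(true_mean):
    """Fraction of repeated samples in which H0 is rejected."""
    return sum(rejects_h0(true_mean) for _ in range(REPEATS)) / REPEATS


type_i_rate = rejection_rate(MU0)   # H0 is true: every rejection is a Type I error
power = rejection_rate(107.0)       # H0 is false (assumed true mean 107): rejections are correct
type_ii_rate = 1 - power            # failures to reject a false H0

print(f"Estimated Type I error rate: {type_i_rate:.3f} (alpha = {ALPHA})")
print(f"Estimated power: {power:.3f}, Type II error rate: {type_ii_rate:.3f}")
```

When H0 is true the rejection rate should sit near α. When H0 is false, the power depends on how far the true mean is from the hypothesised one, so β is not a single fixed number.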
Hypothesis Testing Template

1. Identify the key question: What is the research question that you are trying to answer?
2. Establish the hypotheses: What is the metric of interest? Define the Null and Alternate Hypotheses.
3. Understand and prepare data: What data do you have? Do you understand what it means? Can it be used directly?
4. Identify the right test: Choose the method for testing based on the previous three steps.
5. Check the assumptions: Ensure that the data satisfies the assumptions of the test.
6. Perform the test: Reach a conclusion based on the results (p-value).

Performing a hypothesis test
Some key ideas first

Level of Significance (𝝰)
● Probability of rejecting the null hypothesis when it is true
● Fixed before the hypothesis test

p-value
● Probability of observing the computed test statistic, or more extreme results, under the null hypothesis
● Depends on the sample data. Alpha is pre-fixed, but the p-value depends on the value of the test statistic

Acceptance or Rejection Region
● The total area under the distribution curve of the test statistic is partitioned into acceptance and rejection regions
● Reject the null hypothesis when the test statistic lies in the rejection region; otherwise, fail to reject it
Let’s start simple

Consider the following questions in hypothesis testing:

- What are the null and alternative hypotheses?
- What is an appropriate test statistic?
- What is the preset level of significance?
- How do we check whether the data gives significant evidence against the null hypothesis?

Let's see an example and understand the significance of the above questions.

For simplicity, we will assume that the population standard deviation is known and the sample size is more than 30.
Example
It is known from experience that for a certain E-commerce company the mean delivery time of the products is 5 days with a standard deviation of 1.3 days.

The new customer service manager of the company is afraid that the company is slipping and collects a random sample of 45 orders. The mean delivery time of these orders comes out to be 5.25 days.

Is there enough statistical evidence for the manager's apprehension that the mean delivery time of products is greater than 5 days?

This is a one-tailed test concerning the population mean 𝛍, the mean delivery time of products, with H0: 𝛍 = 5 and Ha: 𝛍 > 5.
First test - z-test for One Mean

Significance of the test: test for the population mean, H0: 𝜇 = 𝜇0

Test statistic distribution: Standard Normal distribution

Assumptions
● Continuous data
● Normally distributed population or sample size > 30
● Known population standard deviation 𝜎
● Random sampling from the population
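
The delivery-time example above can be worked through with this z-test. The sketch below uses the numbers from the example (mean 5 days, standard deviation 1.3 days, 45 orders, sample mean 5.25 days). The significance level of 0.05 is an assumption, since the example does not state one, and the variable names are illustrative.

```python
import math
from statistics import NormalDist

# Values from the delivery-time example.
MU0 = 5.0           # hypothesised mean delivery time under H0 (days)
SIGMA = 1.3         # known population standard deviation (days)
N = 45              # number of sampled orders
SAMPLE_MEAN = 5.25  # observed mean delivery time (days)

ALPHA = 0.05        # assumed significance level (not stated in the example)

# Test statistic: z = (sample mean - mu0) / (sigma / sqrt(n))
standard_error = SIGMA / math.sqrt(N)
z = (SAMPLE_MEAN - MU0) / standard_error

# Rejection region approach for Ha: mu > mu0 (upper tail).
critical_z = NormalDist().inv_cdf(1 - ALPHA)
reject_by_region = z > critical_z

# p-value approach: probability of a z at least this large under H0.
p_value = 1 - NormalDist().cdf(z)
reject_by_p_value = p_value < ALPHA

print(f"z = {z:.3f}, critical value = {critical_z:.3f}, p-value = {p_value:.4f}")
print("Reject H0" if reject_by_p_value else "Fail to reject H0")
```

With these inputs z comes out at roughly 1.29, which is below the one-sided critical value of about 1.645, and the p-value is roughly 0.10. Under the assumed α of 0.05, both approaches give the same decision: fail to reject H0, so the sample does not provide enough evidence that the mean delivery time exceeds 5 days.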
One-tailed and Two-tailed Tests

The alternative hypothesis determines the type of test:

One-tailed test
- Greater than type: Ha: 𝜇 > 𝜇0
- Less than type: Ha: 𝜇 < 𝜇0

Two-tailed test
- Not equal type: Ha: 𝜇 ≠ 𝜇0

The choice of one-tailed vs two-tailed depends on the nature of the problem, not on the sample data!
Difference between One-tailed and Two-tailed Tests

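The test statistic value does not change between a two-tailed and a one-tailed test.

Only the critical value(s) and the p-value associated with the test statistic change.

[Figure: standard normal curves with the rejection region marked; the one-tailed test has a single critical value at 1.645, the two-tailed test has critical values at -1.96 and 1.96.]

- One-tailed test: the difference is not tested on one side, and the hypothesis test has greater power on the other side.
- Two-tailed test: the difference is tested on both sides.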
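
The critical values 1.645 and ±1.96 correspond to a significance level of 0.05 on the standard normal distribution. The sketch below recomputes them and shows how the same test statistic gets different p-values under the two kinds of test. The helper names and the example value z = 1.8 are hypothetical, chosen only for illustration.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1
ALPHA = 0.05


def critical_values(alpha=ALPHA):
    """Critical z value(s) for each form of the alternative hypothesis."""
    return {
        "greater": STD_NORMAL.inv_cdf(1 - alpha),        # reject if z > this
        "less": STD_NORMAL.inv_cdf(alpha),               # reject if z < this
        "two-sided": STD_NORMAL.inv_cdf(1 - alpha / 2),  # reject if |z| > this
    }


def p_value(z, alternative):
    """p-value of an observed z statistic for the given alternative."""
    if alternative == "greater":
        return 1 - STD_NORMAL.cdf(z)
    if alternative == "less":
        return STD_NORMAL.cdf(z)
    if alternative == "two-sided":
        return 2 * (1 - STD_NORMAL.cdf(abs(z)))
    raise ValueError(f"unknown alternative: {alternative!r}")


crit = critical_values()
z = 1.8  # hypothetical observed test statistic
p_one = p_value(z, "greater")
p_two = p_value(z, "two-sided")

print(f"critical values: greater {crit['greater']:.3f}, two-sided ±{crit['two-sided']:.3f}")
print(f"z = {z}: one-tailed p = {p_one:.4f}, two-tailed p = {p_two:.4f}")
```

For a positive z, the two-tailed p-value is twice the upper-tail p-value. With z = 1.8 the one-tailed test rejects H0 at α = 0.05 while the two-tailed test does not, which is why the form of the alternative has to be fixed from the problem before looking at the data.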
