Hypothesis Testing Statistics

Uploaded by

Mervin Arguelles

EXPERIMENTAL STATISTICS - MATH4B

HYPOTHESIS
TESTING
Intended Learning Outcomes
After completing this unit, students should be able to:
•know the basic concepts of statistical hypothesis testing;
•apply the steps in hypothesis testing;
•determine which statistical test is appropriate for a given data set; and
•draw conclusions and interpret the results of the test.
HYPOTHESIS
•A hypothesis is a tentative assertion or statement that is used to explain a phenomenon.

•A statistical hypothesis is a conjecture about a population parameter. This conjecture may or may not be true.
HYPOTHESIS
TESTING
Hypothesis testing is a statistical procedure that uses sample data to decide whether a statistical hypothesis should be rejected.
If the sample data are consistent with the hypothesis, it is not rejected (informally, it is accepted).
If the sample data contradict the hypothesis, it is rejected.
TYPES OF
HYPOTHESES
The null hypothesis, symbolized by H₀, is a statistical
hypothesis that states that there is no difference between
a parameter and a specific value, or that there is no
difference between two parameters.
The alternative hypothesis, symbolized by H₁, is a
statistical hypothesis that states the existence of a
difference between a parameter and a specific value, or
states that there is a difference between two parameters.
EXAMPLES

Conjecture: The average system performance benchmark for a specific compute task is 95.
H₀: The average system performance benchmark for a specific compute task is 95 (μ = 95).
H₁: The average system performance benchmark for a specific compute task is not 95 (μ ≠ 95).

Conjecture: The average system performance benchmark for a specific compute task is lower than 95.
H₀: The average system performance benchmark for a specific compute task is 95 (μ = 95).
H₁: The average system performance benchmark for a specific compute task is lower than 95 (μ < 95).

Conjecture: The average system performance benchmark for a specific compute task is higher than 95.
H₀: The average system performance benchmark for a specific compute task is 95 (μ = 95).
H₁: The average system performance benchmark for a specific compute task is higher than 95 (μ > 95).
EXAMPLE
A researcher hypothesizes that optimizing compiler settings will
improve the execution speed of programs. The average execution
speed of programs in the current setup is 8.6 seconds.
EXAMPLE
An engineer hypothesizes that the mean number of bugs can be
decreased in software development by using automated testing tools
instead of manual testing. The mean number of bugs found per 1000
lines of code is 18.
EXAMPLE
A computer scientist believes that modifying the algorithm for task
scheduling will impact the system's performance. The scientist is
uncertain whether the performance metrics will improve or decline.
Historically, the mean performance score was 73.
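One possible way to state the hypotheses for the three examples above is sketched below. It assumes that the 8.6 seconds in the first example is an execution time, so that "improve" means a smaller mean, and that μ denotes the population mean in each case.

```latex
% Compiler settings (improvement = shorter execution time, an assumption)
\[ H_0: \mu = 8.6 \qquad H_1: \mu < 8.6 \]
% Automated testing tools (claim: fewer bugs per 1000 lines of code)
\[ H_0: \mu = 18 \qquad H_1: \mu < 18 \]
% Task-scheduling algorithm (direction of change unknown)
\[ H_0: \mu = 73 \qquad H_1: \mu \neq 73 \]
```

The first two alternatives are directional, while the third is non-directional because the scientist does not predict whether performance will rise or fall.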
TYPES OF TESTS
A directional or one-tailed test is a test of a statistical hypothesis whose alternative hypothesis uses a < or > symbol.

A non-directional or two-tailed test is a test of a statistical hypothesis whose alternative hypothesis uses the ≠ symbol.

Note: To determine whether a test is directional or non-directional, look at how the alternative hypothesis is stated.
STATISTICAL TEST
uses the data obtained from a sample to make a
decision about whether the null hypothesis should
be rejected

TEST VALUE
the numerical value obtained from a statistical test
TYPES OF ERRORS
A type I error occurs if you reject the null hypothesis
when it is true.
A type II error occurs if you do not reject the null
hypothesis when it is false.
LEVEL OF
SIGNIFICANCE
the maximum probability of committing a type I error;
this probability is symbolized by α (Greek letter alpha).
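The meaning of α can be illustrated with a small simulation. The sketch below repeatedly samples from a population in which H₀ is true and counts how often a two-tailed z test rejects it anyway. The values of MU0, SIGMA, N, and TRIALS are hypothetical and chosen only for illustration.

```python
import random
from statistics import NormalDist, mean

random.seed(0)  # fixed seed so the run is repeatable

ALPHA = 0.05
MU0, SIGMA, N = 95.0, 10.0, 30  # hypothetical population and sample size
TRIALS = 2000                   # hypothetical number of repeated experiments

# Two-tailed critical value for the chosen alpha
z_crit = NormalDist().inv_cdf(1 - ALPHA / 2)

rejections = 0
for _ in range(TRIALS):
    # Draw a sample from a population where H0 (mu = MU0) really is true
    sample = [random.gauss(MU0, SIGMA) for _ in range(N)]
    z = (mean(sample) - MU0) / (SIGMA / N ** 0.5)
    if abs(z) > z_crit:
        rejections += 1  # rejecting a true H0 is a type I error

type_i_rate = rejections / TRIALS
print(f"Observed type I error rate: {type_i_rate:.3f}")
```

The observed rate should land close to ALPHA, which is what "maximum probability of committing a type I error" means in practice.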
CRITICAL VALUE
separates the critical region from the noncritical region. The symbol for critical value is C.V.

CRITICAL OR REJECTION REGION

the range of values of the test value that indicates that there is a significant difference and
that the null hypothesis should be rejected.

NONCRITICAL OR NONREJECTION REGION

the range of values of the test value that indicates that the difference was probably due to
chance and that the null hypothesis should not be rejected.
EXAMPLE
Find the critical value(s) for each situation and draw the appropriate figure, showing the critical region.
a. A left-tailed test with α = 0.10.
b. A two-tailed test with α = 0.02.
c. A right-tailed test with α = 0.005.
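The critical values for the three cases above can be checked numerically. This is a minimal sketch using the standard normal distribution from Python's standard library. The helper name z_critical is hypothetical, and the figures still need to be drawn by hand.

```python
from statistics import NormalDist

std_normal = NormalDist()  # mean 0, standard deviation 1

def z_critical(alpha, tail):
    """Return the critical z value(s) for a 'left', 'right', or 'two' tailed test."""
    if tail == "left":
        return (std_normal.inv_cdf(alpha),)
    if tail == "right":
        return (std_normal.inv_cdf(1 - alpha),)
    if tail == "two":
        z = std_normal.inv_cdf(1 - alpha / 2)  # alpha is split between both tails
        return (-z, z)
    raise ValueError("tail must be 'left', 'right', or 'two'")

a = z_critical(0.10, "left")    # about -1.28
b = z_critical(0.02, "two")     # about -2.33 and +2.33
c = z_critical(0.005, "right")  # about +2.58

for label, values in (("a", a), ("b", b), ("c", c)):
    print(label, [round(v, 2) for v in values])
```

In each case the critical region is the tail area beyond the printed value(s).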
SOLVING HYPOTHESIS-
TESTING PROBLEMS
(TRADITIONAL METHOD)
Step 1 State the hypotheses and identify the claim.
Step 2 Find the critical value(s).
Step 3 Compute the test value.
Step 4 Make the decision to reject or not reject the null hypothesis.
Step 5 Summarize the results.
WHEN TO USE THE Z-TEST?
If the population standard deviation σ is known, or
if the sample size is large (n ≥ 30).
Replace σ by s if σ is unknown but n ≥ 30.
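ST #1: Z-TEST OF ONE-SAMPLE MEAN
When to use? To compare a sample mean and a population mean (x̄ vs μ)
Test Statistic
$$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$$
x̄ = sample mean
μ = hypothesized population mean
σ = population standard deviation
n = sample size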
CRITICAL VALUES OF Z

          One-tailed test   Two-tailed test

α=0.05    ±1.65             ±1.96

α=0.01    ±2.33             ±2.58

EXAMPLE

A researcher wants to investigate whether the mean processing time for a
specific algorithm on a computer cluster is 29 milliseconds. A sample of 30 runs
on different clusters yields a mean processing time of 30.1 milliseconds. Using a
significance level of α = 0.05, test the hypothesis that the mean processing time
is greater than 29 milliseconds. The standard deviation of the processing time
across clusters is 3.8 milliseconds.
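The arithmetic for this example can be sketched as follows, assuming H₀: μ = 29 and H₁: μ > 29 (a right-tailed test). The function name one_sample_z is hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def one_sample_z(sample_mean, mu0, sigma, n):
    """z test value for a one-sample mean with known population sigma."""
    return (sample_mean - mu0) / (sigma / sqrt(n))

ALPHA = 0.05
z = one_sample_z(sample_mean=30.1, mu0=29, sigma=3.8, n=30)

# Right-tailed test: all of alpha sits in the upper tail
z_crit = NormalDist().inv_cdf(1 - ALPHA)
reject_h0 = z > z_crit

print(f"z = {z:.2f}, critical value = {z_crit:.2f}, reject H0: {reject_h0}")
```

Under these assumptions the test value (about 1.59) falls short of the critical value (about 1.65), so the null hypothesis is not rejected.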
EXAMPLE
A researcher asserts that the average response time of a certain web service is
less than 80 milliseconds. They collect a random sample of 36 response times
and record the following values (in milliseconds). (The values have been
rounded to the nearest millisecond.) Is there sufficient evidence to substantiate
the researcher’s assertion at a significance level of α = 0.10? Assume a standard
deviation (σ) of 19.2 milliseconds.
60 70 75 55 80 55
50 40 80 70 50 95
120 90 75 85 80 60
110 65 80 85 85 45
75 60 90 90 60 95
110 85 45 90 70 70
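When the raw data are given, the sample mean has to be computed first. The sketch below does this for the 36 response times above, assuming H₀: μ = 80 and H₁: μ < 80 (a left-tailed test).

```python
from math import sqrt
from statistics import NormalDist, mean

# Response times in milliseconds, copied row by row from the example
times_ms = [
    60, 70, 75, 55, 80, 55,
    50, 40, 80, 70, 50, 95,
    120, 90, 75, 85, 80, 60,
    110, 65, 80, 85, 85, 45,
    75, 60, 90, 90, 60, 95,
    110, 85, 45, 90, 70, 70,
]

MU0, SIGMA, ALPHA = 80, 19.2, 0.10
n = len(times_ms)
x_bar = mean(times_ms)

z = (x_bar - MU0) / (SIGMA / sqrt(n))

# Left-tailed test: all of alpha sits in the lower tail
z_crit = NormalDist().inv_cdf(ALPHA)
reject_h0 = z < z_crit

print(f"n = {n}, mean = {x_bar}, z = {z:.2f}, critical value = {z_crit:.2f}")
print("Reject H0:", reject_h0)
```

Here the sample mean is 75 ms and the test value is about -1.56, which lies below the critical value of about -1.28, so the data support the claim at α = 0.10.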
EXAMPLE

The Philippine Computer Society reports that the average cost of developing a
new software system is ₱1,236,400. To investigate whether the average
development cost differs at a specific software development company, a
researcher selects a random sample of 35 software projects and finds that the
average cost of development is ₱1,304,497. The standard deviation of the
population is ₱162,786. At a significance level of α = 0.01, can it be concluded
that the average cost of software development at the particular company differs
from ₱1,236,400?
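A sketch of the calculation for this two-tailed case, assuming H₀: μ = 1,236,400 and H₁: μ ≠ 1,236,400. All amounts are in pesos.

```python
from math import sqrt
from statistics import NormalDist

MU0 = 1_236_400    # reported average development cost
X_BAR = 1_304_497  # sample average cost
SIGMA = 162_786    # population standard deviation
N = 35
ALPHA = 0.01

z = (X_BAR - MU0) / (SIGMA / sqrt(N))

# Two-tailed test: alpha is split equally between the two tails
z_crit = NormalDist().inv_cdf(1 - ALPHA / 2)
reject_h0 = abs(z) > z_crit

print(f"z = {z:.2f}, critical values = ±{z_crit:.2f}, reject H0: {reject_h0}")
```

The test value (about 2.47) lies inside ±2.58, so at α = 0.01 there is not enough evidence that the company's average cost differs from the reported figure.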
ST # 2: T-TEST OF ONE-SAMPLE MEAN
When to use? To compare a sample mean and a population mean (x̄ vs μ) when
σ is unknown and
n is small (n < 30).
THE T DISTRIBUTION IS SIMILAR TO THE STANDARD
NORMAL DISTRIBUTION IN THE FOLLOWING WAYS.
1. It is bell-shaped.
2. It is symmetric about the mean.
3. The mean, median, and mode are equal to 0 and are located at the
center of the distribution.
4. The curve never touches the x axis.
THE T DISTRIBUTION DIFFERS FROM THE STANDARD
NORMAL DISTRIBUTION IN THE FOLLOWING WAYS.
1. The variance is greater than 1.
2. The t distribution is a family of curves based on the degrees of
freedom, which is a number related to sample size.
3. As the sample size increases, the t distribution approaches the
normal distribution. The t test is defined next.
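The slide's own formula did not survive extraction. The standard one-sample t test value, written with the same symbols used for the z test and with s denoting the sample standard deviation, is:

```latex
\[
  t = \frac{\bar{x} - \mu}{s / \sqrt{n}}, \qquad \text{d.f.} = n - 1
\]
```

It has the same form as the z test value, with s in place of σ.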
EXAMPLE

Find the critical t value for α = 0.05 with d.f. = 16 for a right-tailed t test.
Find the critical t value for α = 0.01 with d.f. = 22 for a left-tailed t test.
Find the critical values for α = 0.10 with d.f. = 18 for a two-tailed t test.
Find the critical value for α = 0.05 with d.f. = 28 for a right-tailed t test.
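These values are normally read from a t table. As a cross-check, the sketch below approximates them using only the standard library: it integrates the t density with Simpson's rule and then searches for the critical value by bisection. The function names are hypothetical, and a statistics package would do this more directly.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)

def t_cdf(x, df, steps=2000):
    """P(T <= x), via Simpson's rule on [0, |x|] and symmetry about 0."""
    a = abs(x)
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x >= 0 else 0.5 - area

def t_critical(alpha, df, tail):
    """Critical t value; for tail='two' the positive value of the ± pair is returned."""
    p = 1 - alpha / 2 if tail == "two" else 1 - alpha
    lo, hi = 0.0, 100.0
    for _ in range(60):  # bisection on the monotone CDF
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < p:
            lo = mid
        else:
            hi = mid
    t = (lo + hi) / 2
    return -t if tail == "left" else t

t1 = t_critical(0.05, 16, "right")  # table value  1.746
t2 = t_critical(0.01, 22, "left")   # table value -2.508
t3 = t_critical(0.10, 18, "two")    # table value ±1.734
t4 = t_critical(0.05, 28, "right")  # table value  1.701

print(round(t1, 3), round(t2, 3), round(t3, 3), round(t4, 3))
```

Note that for the two-tailed case α is halved before the lookup, which is why α = 0.10 with d.f. = 18 uses the 0.05 one-tail column.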
EXAMPLE

A software reliability investigation claims that the average number of
software defects reported per week at a technology firm in Metro
Manila is 16.3. A random sample of 10 weeks had a mean number of
17.7 defects reported. The sample standard deviation is 1.8. Is there
enough evidence to reject the investigator’s claim at α = 0.05?
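A sketch of the computation, assuming H₀: μ = 16.3 (the claim) and H₁: μ ≠ 16.3, so the test is two-tailed. The critical value is hard-coded from a standard t table rather than computed.

```python
from math import sqrt

x_bar, mu0, s, n = 17.7, 16.3, 1.8, 10

t = (x_bar - mu0) / (s / sqrt(n))
df = n - 1

# Table value for a two-tailed test with alpha = 0.05 and d.f. = 9
T_CRIT = 2.262
reject_h0 = abs(t) > T_CRIT

print(f"t = {t:.2f}, d.f. = {df}, critical values = ±{T_CRIT}, reject H0: {reject_h0}")
```

The test value (about 2.46) exceeds 2.262, so under these assumptions there is enough evidence to reject the claim that the mean is 16.3.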
EXAMPLE

A computer science researcher claims that the average daily rate of
freelance software developers in tech companies in Metro Manila is
less than ₱3,000. A random sample of eight tech companies is
selected, and the daily rates (in pesos) are shown. Is there enough evidence to
support the researcher's claim at α = 0.10?
3000 2600 3000 2500 4000 2500 3000 2500
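With raw data and an unknown σ, both the sample mean and the sample standard deviation come from the data. This sketch assumes H₀: μ = 3000 and H₁: μ < 3000 (left-tailed), with the critical value taken from a standard t table.

```python
from math import sqrt
from statistics import mean, stdev

rates = [3000, 2600, 3000, 2500, 4000, 2500, 3000, 2500]  # daily rates in pesos
mu0 = 3000

n = len(rates)
x_bar = mean(rates)
s = stdev(rates)  # sample standard deviation (divides by n - 1)

t = (x_bar - mu0) / (s / sqrt(n))
df = n - 1

# Table value for a left-tailed test with alpha = 0.10 and d.f. = 7
T_CRIT = -1.415
reject_h0 = t < T_CRIT

print(f"mean = {x_bar}, s = {s:.2f}, t = {t:.3f}, d.f. = {df}, reject H0: {reject_h0}")
```

The test value (about -0.626) does not fall below -1.415, so the sample does not provide enough evidence to support the claim.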
ST # 3: Z-TEST FOR INDEPENDENT
SAMPLE MEANS
When to use? To compare the means from two
independent samples (x̄₁ vs x̄₂)
Test Statistic
$$z = \frac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}}$$
Under H₀, μ₁ − μ₂ = 0.

x̄₁ = mean of the 1st sample
x̄₂ = mean of the 2nd sample
n₁ = size of the 1st sample
n₂ = size of the 2nd sample
σ₁² = variance of the 1st population
σ₂² = variance of the 2nd population
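[Figure: two panels, "Difference is not significant" and "Difference is significant"]
THESE TESTS CAN ALSO BE ONE-TAILED, USING THE
FOLLOWING HYPOTHESES:
Right-tailed: H₀: μ₁ = μ₂ and H₁: μ₁ > μ₂
Left-tailed: H₀: μ₁ = μ₂ and H₁: μ₁ < μ₂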
EXAMPLE

In a comparative study between two tech hubs in the Philippines, it
was found that the average hourly rate for software developers in
Manila is ₱1,500 and the average rate in Cebu is ₱1,300. Assume that
the data were obtained from two samples of 50 software developers
each, and that the standard deviations of the populations are ₱200
and ₱180, respectively. At a significance level of α = 0.05, can it be
concluded that there is a significant difference in the rates between
Manila and Cebu?
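A sketch of the two-sample computation, assuming H₀: μ₁ = μ₂ and H₁: μ₁ ≠ μ₂ (two-tailed), with Manila as sample 1 and Cebu as sample 2.

```python
from math import sqrt
from statistics import NormalDist

x1, sigma1, n1 = 1500, 200, 50  # Manila
x2, sigma2, n2 = 1300, 180, 50  # Cebu
ALPHA = 0.05

# Standard error of the difference between two independent sample means
std_error = sqrt(sigma1**2 / n1 + sigma2**2 / n2)
z = (x1 - x2) / std_error

z_crit = NormalDist().inv_cdf(1 - ALPHA / 2)
reject_h0 = abs(z) > z_crit

print(f"z = {z:.2f}, critical values = ±{z_crit:.2f}, reject H0: {reject_h0}")
```

The test value (about 5.26) is far beyond ±1.96, so the difference in rates is significant at α = 0.05.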
EXAMPLE

A researcher hypothesizes that the average number of programming
languages taught in computer science departments at colleges for
males is greater than the average number of programming languages
taught for females. A sample of the number of programming
languages offered by colleges is shown. At a significance level of α =
0.10, is there enough evidence to support the claim? Assume
σ₁ = σ₂ = 3.3.
ST # 4: T-TEST FOR INDEPENDENT
SAMPLE MEANS
When to use? To compare the means from two
independent samples (x̄₁ vs x̄₂) when the population variances are unknown and not assumed to be equal
Test Statistic
$$t = \frac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}}}$$
x̄₁ = mean of the 1st sample
x̄₂ = mean of the 2nd sample
n₁ = size of the 1st sample
n₂ = size of the 2nd sample
s₁² = variance of the 1st sample
s₂² = variance of the 2nd sample
where the degrees of freedom are equal to the smaller of n₁ − 1 and n₂ − 1.
EXAMPLE

The average number of lines of code written per project in tech
startups in Metro Manila is 191. The average number of lines of code
written per project in tech startups in Cebu City is 199. Assume the
data were obtained from two samples with standard deviations of 38
and 12 lines of code, respectively, and sample sizes of 8 and 10,
respectively. Can it be concluded at a significance level of 0.05 that the
average number of lines of code written per project in the two cities is
different? Assume the populations are normally distributed.
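A sketch for this example, assuming unequal variances, H₀: μ₁ = μ₂ and H₁: μ₁ ≠ μ₂, and the conservative degrees of freedom (the smaller of n₁ − 1 and n₂ − 1). The critical value is taken from a standard t table.

```python
from math import sqrt

x1, s1, n1 = 191, 38, 8   # Metro Manila
x2, s2, n2 = 199, 12, 10  # Cebu City

# Standard error using each sample's own variance (no pooling)
std_error = sqrt(s1**2 / n1 + s2**2 / n2)
t = (x1 - x2) / std_error
df = min(n1 - 1, n2 - 1)

# Table value for a two-tailed test with alpha = 0.05 and d.f. = 7
T_CRIT = 2.365
reject_h0 = abs(t) > T_CRIT

print(f"t = {t:.3f}, d.f. = {df}, critical values = ±{T_CRIT}, reject H0: {reject_h0}")
```

The test value (about -0.573) lies well inside ±2.365, so there is not enough evidence of a difference between the two cities.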
ST # 4: T-TEST FOR INDEPENDENT
SAMPLE MEANS
When the variances are assumed to be equal, this
formula, which pools the two sample variances, is used.
Test Statistic
$$t = \frac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}\,\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}$$
x̄₁ = mean of the 1st sample
x̄₂ = mean of the 2nd sample
n₁ = size of the 1st sample
n₂ = size of the 2nd sample
s₁² = variance of the 1st sample
s₂² = variance of the 2nd sample
where the degrees of freedom are equal to n₁ + n₂ − 2.
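ST # 5: T-TEST FOR DEPENDENT
SAMPLE MEANS
When to use? To compare the means from two
dependent (paired) samples (x̄₁ vs x̄₂)
Test Statistic
$$t = \frac{\bar{D} - \mu_D}{s_D / \sqrt{n}}$$
D = difference between the two sets of values (one difference per pair)
D̄ = mean of the differences
s_D = standard deviation of the differences
μ_D = hypothesized mean difference (0 under H₀)
n = sample size (number of pairs)
df = n − 1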
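The mechanics of the dependent-samples test can be sketched as follows. The before and after values are hypothetical numbers made up purely to illustrate the steps. They are not taken from any example in these slides.

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical paired measurements (illustration only)
before = [10, 12, 9, 14, 11]
after = [12, 13, 9, 17, 12]

# One difference per pair: D = first value minus second value
diffs = [b - a for b, a in zip(before, after)]

n = len(diffs)
d_bar = mean(diffs)  # mean of the differences
s_d = stdev(diffs)   # sample standard deviation of the differences

# Under H0 the hypothesized mean difference is 0
t = (d_bar - 0) / (s_d / sqrt(n))
df = n - 1

print(f"D = {diffs}, mean D = {d_bar}, s_D = {s_d:.3f}, t = {t:.3f}, d.f. = {df}")
```

The resulting t is then compared with the critical value for n − 1 degrees of freedom, exactly as in the one-sample t test.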
EXAMPLE

A data scientist wishes to investigate if a machine learning
algorithm can affect the accuracy of predicting customer
churn. Six datasets were preprocessed, and then the algorithm
was applied for a 6-week period. The results are shown in the
table. (Accuracy level is measured as a percentage.) Can it be
concluded that the accuracy level has changed at a
significance level of 0.10? Assume the variable is
approximately normally distributed.
EXAMPLE
Bank Deposits
A sample of nine local banks shows their deposits (in billions of
dollars) 3 years ago and their deposits (in billions of dollars) today. At
α = 0.05, can it be concluded that the average deposits for the
banks are greater today than they were 3 years ago?
ST # 6: Z-TEST FOR INDEPENDENT
PROPORTIONS
When to use? To compare the proportions from two
independent samples
Test Statistic:
$$z = \frac{p_1 - p_2}{\sqrt{\dfrac{p_1 q_1}{n_1} + \dfrac{p_2 q_2}{n_2}}}$$
p₁ = proportion from the 1st sample
p₂ = proportion from the 2nd sample
n₁ = size of the 1st sample
n₂ = size of the 2nd sample
q₁ = 1 − p₁
q₂ = 1 − p₂
EXAMPLE
Vaccination Rates in Nursing Homes
In the nursing home study mentioned in the chapter-opening
Statistics Today, the researchers found that 12 out of 34 small nursing
homes had a resident vaccination rate of less than 80%, while 17 out
of 24 large nursing homes had a vaccination rate of less than 80%. At
α = 0.05, test the claim that there is no difference in the proportions
of the small and large nursing homes with a resident vaccination rate
of less than 80%.
EXAMPLE
Texting While Driving
A survey of 1000 drivers this year showed that 29% of them
send text messages while driving. Last year a survey of 1000 drivers
showed that 17% of them sent text messages while driving. At
α = 0.01, can it be concluded that there has been an increase in the
proportion of drivers who text while driving?
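Both proportion examples can be worked with one small helper. This sketch assumes the unpooled standard error suggested by the q₁ and q₂ definitions on the slide. Many textbooks instead pool the two samples into a single proportion, which gives a somewhat different test value. The function name two_proportion_z is hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def two_proportion_z(p1, n1, p2, n2):
    """z test value for two independent proportions (unpooled standard error)."""
    q1, q2 = 1 - p1, 1 - p2
    return (p1 - p2) / sqrt(p1 * q1 / n1 + p2 * q2 / n2)

std_normal = NormalDist()

# Nursing homes: H0: p1 = p2, H1: p1 != p2 (two-tailed, alpha = 0.05)
z_homes = two_proportion_z(12 / 34, 34, 17 / 24, 24)
crit_homes = std_normal.inv_cdf(1 - 0.05 / 2)
reject_homes = abs(z_homes) > crit_homes

# Texting: this year is sample 1; H0: p1 = p2, H1: p1 > p2 (right-tailed, alpha = 0.01)
z_text = two_proportion_z(0.29, 1000, 0.17, 1000)
crit_text = std_normal.inv_cdf(1 - 0.01)
reject_text = z_text > crit_text

print(f"Nursing homes: z = {z_homes:.2f}, critical = ±{crit_homes:.2f}, reject H0: {reject_homes}")
print(f"Texting: z = {z_text:.2f}, critical = {crit_text:.2f}, reject H0: {reject_text}")
```

Under these assumptions both null hypotheses are rejected: the nursing-home proportions differ (z about -2.87), and the proportion of drivers who text has increased (z about 6.44).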