0% found this document useful (0 votes)

86 views34 pages

Chapter 1

The document discusses hypothesis testing using Python and an example involving analyzing data from the Stack Overflow Developer Survey 2020. It introduces hypothesis testing concepts like the null and alternative hypotheses, test statistics like the z-score, bootstrap distributions, standard error, and p-values. The example tests whether the proportion of data scientists who started programming as children is greater than the hypothesized value of 35% using these techniques. A z-score of 4.001497129152506 and p-value of 3.1471479512323874e-05 are calculated, providing evidence to reject the null hypothesis at a significance level of 0.05.

Uploaded by

Komi David ABOTSITSE

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

86 views34 pages

Chapter 1

Uploaded by

Komi David ABOTSITSE

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

Hypothesis tests and

z-scores
HYPOTHESIS TESTING IN PYTHON

James Chapman
Curriculum Manager, DataCamp
A/B testing
In 2013, Electronic Arts (EA) released
SimCity 5

They wanted to increase pre-orders of the

game

They used A/B testing to test different

advertising scenarios

This involves splitting users into control and

treatment groups

1 Image credit: "Electronic Arts" by majaX1 CC BY-NC-SA 2.0

HYPOTHESIS TESTING IN PYTHON

Retail webpage A/B test
Control: Treatment:

HYPOTHESIS TESTING IN PYTHON

A/B test results
The treatment group (no ad) got 43.4% more purchases than the control group (with ad)
Intuition that "showing an ad would increase sales" was false

Was this result statistically significant or just chance?

Need EA's data to determine this

Techniques from Sampling in Python + this course to do so

HYPOTHESIS TESTING IN PYTHON

Stack Overflow Developer Survey 2020
import pandas as pd
print(stack_overflow)

respondent age_1st_code ... age hobbyist

0 36.0 30.0 ... 34.0 Yes
1 47.0 10.0 ... 53.0 Yes
2 69.0 12.0 ... 25.0 Yes
3 125.0 30.0 ... 41.0 Yes
4 147.0 15.0 ... 28.0 No
... ... ... ... ... ...
2259 62867.0 13.0 ... 33.0 Yes
2260 62882.0 13.0 ... 28.0 Yes

[2261 rows x 8 columns]

HYPOTHESIS TESTING IN PYTHON

Hypothesizing about the mean
A hypothesis:

The mean annual compensation of the population of data scientists is $110,000

The point estimate (sample statistic):

mean_comp_samp = stack_overflow['converted_comp'].mean()

119574.71738168952

HYPOTHESIS TESTING IN PYTHON

Generating a bootstrap distribution
import numpy as np
# Step 3. Repeat steps 1 & 2 many times, appending to a list
so_boot_distn = []
for i in range(5000):
so_boot_distn.append(
# Step 2. Calculate point estimate
np.mean(
# Step 1. Resample
stack_overflow.sample(frac=1, replace=True)['converted_comp']
)
)

1 Bootstrap distributions are taught in Chapter 4 of Sampling in Python

HYPOTHESIS TESTING IN PYTHON

Visualizing the bootstrap distribution
import matplotlib.pyplot as plt
plt.hist(so_boot_distn, bins=50)
plt.show()

HYPOTHESIS TESTING IN PYTHON

Standard error
std_error = np.std(so_boot_distn, ddof=1)

5607.997577378606

HYPOTHESIS TESTING IN PYTHON

z-scores
value − mean
standardized value =
standard deviation
sample stat − hypoth. param. value
z=
standard error

HYPOTHESIS TESTING IN PYTHON

sample stat − hypoth. param. value
z=
standard error
stack_overflow['converted_comp'].mean()

119574.71738168952

mean_comp_hyp = 110000

std_error

5607.997577378606

z_score = (mean_comp_samp - mean_comp_hyp) / std_error

1.7073326529796957

HYPOTHESIS TESTING IN PYTHON

Testing the hypothesis
Is 1.707 a high or low number?
This is the goal of the course!

HYPOTHESIS TESTING IN PYTHON

Testing the hypothesis
Is 1.707 a high or low number?
This is the goal of the course!

Hypothesis testing use case:

Determine whether sample statistics are close to or far away from expected (or
"hypothesized" values)

HYPOTHESIS TESTING IN PYTHON

Standard normal (z) distribution
Standard normal distribution: normal distribution with mean = 0 + standard deviation = 1

HYPOTHESIS TESTING IN PYTHON

Let's practice!
HYPOTHESIS TESTING IN PYTHON
p-values
HYPOTHESIS TESTING IN PYTHON

James Chapman
Curriculum Manager, DataCamp
Criminal trials
Two possible true states:
1. Defendant committed the crime

2. Defendant did not commit the crime

Two possible verdicts:

1. Guilty

2. Not guilty

Initially the defendant is assumed to be not guilty

Prosecution must present evidence "beyond reasonable doubt" for a guilty verdict

HYPOTHESIS TESTING IN PYTHON

Age of first programming experience
age_first_code_cut classifies when Stack Overflow user first started programming
"adult" means they started at 14 or older

"child" means they started before 14

Previous research: 35% of software developers started programming as children

Evidence that a greater proportion of data scientists starting programming as children?

HYPOTHESIS TESTING IN PYTHON

Definitions
A hypothesis is a statement about an unknown population parameter

A hypothesis test is a test of two competing hypotheses

The null hypothesis (H0 ) is the existing idea

The alternative hypothesis (HA ) is the new "challenger" idea of the researcher

For our problem:

H0 : The proportion of data scientists starting programming as children is 35%

HA : The proportion of data scientists starting programming as children is greater than 35%

1"Naught" is British English for "zero". For historical reasons, "H-naught" is the international convention for
pronouncing the null hypothesis.

HYPOTHESIS TESTING IN PYTHON

Criminal trials vs. hypothesis testing
Either HA or H0 is true (not both)
Initially, H0 is assumed to be true

The test ends in either "reject H0 " or "fail to reject H0 "

If the evidence from the sample is "significant" that HA is true, reject H0 , else choose H0

Significance level is "beyond a reasonable doubt" for hypothesis testing

HYPOTHESIS TESTING IN PYTHON

One-tailed and two-tailed tests
Hypothesis tests check if the sample statistics
lie in the tails of the null distribution

Test Tails
alternative different from null two-tailed
alternative greater than null right-tailed
alternative less than null left-tailed

HA : The proportion of data scientists starting

programming as children is greater than 35%

This is a right-tailed test

HYPOTHESIS TESTING IN PYTHON

p-values
p-values: probability of obtaining a result,
assuming the null hypothesis is true

Large p-value, large support for H0

Statistic likely not in the tail of the null
distribution
Small p-value, strong evidence against H0
Statistic likely in the tail of the null
distribution
"p" in p-value → probability

"small" means "close to zero"

HYPOTHESIS TESTING IN PYTHON

Calculating the z-score
prop_child_samp = (stack_overflow['age_first_code_cut'] == "child").mean()

0.39141972578505085

prop_child_hyp = 0.35

std_error = np.std(first_code_boot_distn, ddof=1)

0.010351057228878566

z_score = (prop_child_samp - prop_child_hyp) / std_error

4.001497129152506

HYPOTHESIS TESTING IN PYTHON

Calculating the p-value
norm.cdf() is normal CDF from scipy.stats .

Left-tailed test → use norm.cdf() .

Right-tailed test → use 1 - norm.cdf() .

from scipy.stats import norm

1 - norm.cdf(z_score, loc=0, scale=1)

3.1471479512323874e-05

HYPOTHESIS TESTING IN PYTHON

Let's practice!
HYPOTHESIS TESTING IN PYTHON
Statistical
significance
HYPOTHESIS TESTING IN PYTHON

James Chapman
Curriculum Manager, DataCamp
p-value recap
p-values quantify evidence for the null hypothesis
Large p-value → fail to reject null hypothesis

Small p-value → reject null hypothesis

Where is the cutoff point?

HYPOTHESIS TESTING IN PYTHON

Significance level
The significance level of a hypothesis test (α) is the threshold point for "beyond a
reasonable doubt"

Common values of α are 0.2 , 0.1 , 0.05 , and 0.01

If p ≤ α, reject H0 , else fail to reject H0

α should be set prior to conducting the hypothesis test

HYPOTHESIS TESTING IN PYTHON

Calculating the p-value
alpha = 0.05
prop_child_samp = (stack_overflow['age_first_code_cut'] == "child").mean()
prop_child_hyp = 0.35
std_error = np.std(first_code_boot_distn, ddof=1)

z_score = (prop_child_samp - prop_child_hyp) / std_error

p_value = 1 - norm.cdf(z_score, loc=0, scale=1)

3.1471479512323874e-05

HYPOTHESIS TESTING IN PYTHON

Making a decision
alpha = 0.05
print(p_value)

3.1471479512323874e-05

p_value <= alpha

True

Reject H0 in favor of HA

HYPOTHESIS TESTING IN PYTHON

Confidence intervals
For a significance level of α, it's common to choose a confidence interval level of 1 - α

α = 0.05 → 95% confidence interval

import numpy as np
lower = np.quantile(first_code_boot_distn, 0.025)
upper = np.quantile(first_code_boot_distn, 0.975)
print((lower, upper))

(0.37063246351172047, 0.41132242370632466)

HYPOTHESIS TESTING IN PYTHON

Types of errors
Truly didn't commit crime Truly committed crime
Verdict not guilty correct they got away with it
Verdict guilty wrongful conviction correct

actual H0 actual HA

chosen H0 correct false negative

chosen HA false positive correct

False positives are Type I errors; false negatives are Type II errors.

HYPOTHESIS TESTING IN PYTHON

Possible errors in our example
If p ≤ α, we reject H0 :

A false positive (Type I) error: data scientists didn't start coding as children at a higher rate

If p > α, we fail to reject H0 :

A false negative (Type II) error: data scientists started coding as children at a higher rate

HYPOTHESIS TESTING IN PYTHON

Let's practice!
HYPOTHESIS TESTING IN PYTHON

Analyzing Politics - Rationality, Behavior, and Institutions - Shepsle, Kenneth A - Bonchek, Mark S
No ratings yet
Analyzing Politics - Rationality, Behavior, and Institutions - Shepsle, Kenneth A - Bonchek, Mark S
484 pages
CodeTantra Python Long
No ratings yet
CodeTantra Python Long
50 pages
Sample Statement of Purpose
50% (2)
Sample Statement of Purpose
7 pages
A Pragmatic Perspective of Measurement Readable PDF Download
100% (11)
A Pragmatic Perspective of Measurement Readable PDF Download
16 pages
Teaching in Nursing A Guide For Faculty 6ed EPubazw3 PDF
No ratings yet
Teaching in Nursing A Guide For Faculty 6ed EPubazw3 PDF
324 pages
Management 9th Edition Daft Fast Access
No ratings yet
Management 9th Edition Daft Fast Access
308 pages
Introduction To Psychology 12th Edition Kalat Test Bank Available Instantly
No ratings yet
Introduction To Psychology 12th Edition Kalat Test Bank Available Instantly
315 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
54 pages
15 Statistical Hypothesis Tests in Python (Cheat Sheet)
No ratings yet
15 Statistical Hypothesis Tests in Python (Cheat Sheet)
11 pages
Hypothesis Testing - Intro - Summer 2025
No ratings yet
Hypothesis Testing - Intro - Summer 2025
59 pages
Dominant Paradigm - Everett Rogers
80% (5)
Dominant Paradigm - Everett Rogers
43 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
22 pages
Inferential Statistics
No ratings yet
Inferential Statistics
48 pages
Hypothesis Testing in ML
No ratings yet
Hypothesis Testing in ML
3 pages
AP Shah Ads Notes PT 1
No ratings yet
AP Shah Ads Notes PT 1
177 pages
Z Test and T Test
No ratings yet
Z Test and T Test
15 pages
Angular JS Lab Manual-1
0% (1)
Angular JS Lab Manual-1
38 pages
Unit 1 Lesson 3
No ratings yet
Unit 1 Lesson 3
6 pages
Lesson - 4.1 - Hypothesis Testing - Analyze - Phase
No ratings yet
Lesson - 4.1 - Hypothesis Testing - Analyze - Phase
81 pages
1.hypothesis Testing Fundamentals
No ratings yet
1.hypothesis Testing Fundamentals
34 pages
11 Statistical Tests
No ratings yet
11 Statistical Tests
24 pages
Calibration Certificate Compression Machine C0065-22 (2000 KN)
No ratings yet
Calibration Certificate Compression Machine C0065-22 (2000 KN)
1 page
5.convergence Informatics Week5
No ratings yet
5.convergence Informatics Week5
41 pages
Introduction To Hypothesis Test in R
No ratings yet
Introduction To Hypothesis Test in R
103 pages
Chemistry Online Lab List
100% (1)
Chemistry Online Lab List
2 pages
Lesson 6 Hypothesis Testing A
No ratings yet
Lesson 6 Hypothesis Testing A
38 pages
Lecture 07 (Part I) Hypothesis Testing
No ratings yet
Lecture 07 (Part I) Hypothesis Testing
62 pages
Ba3 2019 PDF
No ratings yet
Ba3 2019 PDF
372 pages
Cse Machine Learning Lab Manual
No ratings yet
Cse Machine Learning Lab Manual
22 pages
A Complete Guide To Hypothesis Testing For Data Scientists Using Python - by Rashida Nasrin Sucky - Oct, 2020 - Towards Data Science
No ratings yet
A Complete Guide To Hypothesis Testing For Data Scientists Using Python - by Rashida Nasrin Sucky - Oct, 2020 - Towards Data Science
14 pages
Chapter 4
No ratings yet
Chapter 4
43 pages
Chapter 3
No ratings yet
Chapter 3
34 pages
Hypothesis Testing in Python
No ratings yet
Hypothesis Testing in Python
149 pages
Hypothesis
No ratings yet
Hypothesis
44 pages
UE23MA242A Unit-2 Class-25 26 Hypothesis and Inference
No ratings yet
UE23MA242A Unit-2 Class-25 26 Hypothesis and Inference
29 pages
Chapter 2
No ratings yet
Chapter 2
41 pages
Hypothesis Testing Statistics
No ratings yet
Hypothesis Testing Statistics
59 pages
Maths 9709 Paper 1 - Trigonometry
No ratings yet
Maths 9709 Paper 1 - Trigonometry
105 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
40 pages
Hypothesis Testing With Z Tests
No ratings yet
Hypothesis Testing With Z Tests
32 pages
CTP MFT
No ratings yet
CTP MFT
30 pages
AP SHAH ADS Notes Mod 2 p2
No ratings yet
AP SHAH ADS Notes Mod 2 p2
50 pages
Evaluation - Statistical Significance Testing
No ratings yet
Evaluation - Statistical Significance Testing
42 pages
SBP RM Coursework 31052025
No ratings yet
SBP RM Coursework 31052025
21 pages
Chapter 2 T Test
No ratings yet
Chapter 2 T Test
42 pages
Chapter 3
No ratings yet
Chapter 3
31 pages
Chapter 3
No ratings yet
Chapter 3
41 pages
Stats Final Review
No ratings yet
Stats Final Review
11 pages
Hypthesis
No ratings yet
Hypthesis
17 pages
Unit 4 Statistical Testing and Modeling in R
No ratings yet
Unit 4 Statistical Testing and Modeling in R
25 pages
Chapter 5
No ratings yet
Chapter 5
35 pages
027 Result Btech 7th Sem Jan 2023
No ratings yet
027 Result Btech 7th Sem Jan 2023
239 pages
Statistical Analysis Presentation
No ratings yet
Statistical Analysis Presentation
11 pages
RM Presentation
No ratings yet
RM Presentation
19 pages
Doing Better Statistics in Humancomputer Interaction Cairns PDF Download
No ratings yet
Doing Better Statistics in Humancomputer Interaction Cairns PDF Download
79 pages
ENAT Class Grade 3 PostTest Grade3A Eleanor Filipino
No ratings yet
ENAT Class Grade 3 PostTest Grade3A Eleanor Filipino
29 pages
Tips For Testing in Python 1646539645
No ratings yet
Tips For Testing in Python 1646539645
23 pages
Hypothesis
No ratings yet
Hypothesis
18 pages
Business Accounting Project
No ratings yet
Business Accounting Project
13 pages
KAN Pd-01.05 (Sep 2019) Guidelines On Forensic Science Laboratories
No ratings yet
KAN Pd-01.05 (Sep 2019) Guidelines On Forensic Science Laboratories
40 pages
Unit Ii DS LM
No ratings yet
Unit Ii DS LM
20 pages
Hands On With Probability and Statistical
No ratings yet
Hands On With Probability and Statistical
9 pages
Summary
No ratings yet
Summary
40 pages
Jacqueline Kiszka: Atmospheric Science Student
No ratings yet
Jacqueline Kiszka: Atmospheric Science Student
1 page
Stats - Hypothesis - Testing - Ipynb at Main Pik1989 - Stats GitHub
No ratings yet
Stats - Hypothesis - Testing - Ipynb at Main Pik1989 - Stats GitHub
10 pages
Machine Learning Techniques Lab: Session: 2023-24, Even Semester
No ratings yet
Machine Learning Techniques Lab: Session: 2023-24, Even Semester
20 pages
Hypothesis Tesing
No ratings yet
Hypothesis Tesing
30 pages
T Test
No ratings yet
T Test
3 pages
Masters Thesis Michel Foucault On Bio Po
No ratings yet
Masters Thesis Michel Foucault On Bio Po
106 pages
1 221 Measurement Lab
No ratings yet
1 221 Measurement Lab
4 pages
Chapter 2
No ratings yet
Chapter 2
67 pages
Lecture8 PDF
No ratings yet
Lecture8 PDF
64 pages
Basic Concepts of Hypothesis Testing Discussion
No ratings yet
Basic Concepts of Hypothesis Testing Discussion
46 pages
Hypotesis Testing Chapter1
No ratings yet
Hypotesis Testing Chapter1
32 pages
Questionnaire Organizing
No ratings yet
Questionnaire Organizing
3 pages
Unit 4 Part 2
No ratings yet
Unit 4 Part 2
24 pages
Chapter 1
No ratings yet
Chapter 1
25 pages
17 Statistical Hypothesis Tests in Python (Cheat Sheet)
No ratings yet
17 Statistical Hypothesis Tests in Python (Cheat Sheet)
44 pages
Lab 5
No ratings yet
Lab 5
7 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
35 pages
Bjsports 2018 March 52 6 387 Inline Supplementary Material 3
No ratings yet
Bjsports 2018 March 52 6 387 Inline Supplementary Material 3
97 pages
Hypothesis Testing in Machine Learning Using Python - by Yogesh Agrawal - 151413
No ratings yet
Hypothesis Testing in Machine Learning Using Python - by Yogesh Agrawal - 151413
15 pages
Power BI Case Study Meta Data Sheet-2
No ratings yet
Power BI Case Study Meta Data Sheet-2
1 page
Chapter 1
No ratings yet
Chapter 1
16 pages
2650imguf UPPSC-AE EE Book-List 2025
No ratings yet
2650imguf UPPSC-AE EE Book-List 2025
3 pages
Edu 580
No ratings yet
Edu 580
5 pages
Cybernetics and Second-Order Cubernetics Francis Heylighen
No ratings yet
Cybernetics and Second-Order Cubernetics Francis Heylighen
15 pages
Chapter 3
No ratings yet
Chapter 3
15 pages
Chapter 3
No ratings yet
Chapter 3
7 pages
Unit 3 (Hypothesis Testing)
No ratings yet
Unit 3 (Hypothesis Testing)
40 pages
Russian Index of Science Citation Overvi
No ratings yet
Russian Index of Science Citation Overvi
11 pages
Chapter 3
No ratings yet
Chapter 3
16 pages
Chapter 3
No ratings yet
Chapter 3
12 pages
Chapter 1
No ratings yet
Chapter 1
10 pages
Chapter 1
No ratings yet
Chapter 1
9 pages
Chapter 3
No ratings yet
Chapter 3
7 pages
Universiti Teknologi Mara (Uitm) Shah Alam, Malaysia
No ratings yet
Universiti Teknologi Mara (Uitm) Shah Alam, Malaysia
4 pages
ss2 Frame Work
No ratings yet
ss2 Frame Work
1 page
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet

Chapter 1

Uploaded by

Chapter 1

Uploaded by

Hypothesis tests and

They wanted to increase pre-orders of the

They used A/B testing to test different

This involves splitting users into control and

1 Image credit: "Electronic Arts" by majaX1 CC BY-NC-SA 2.0

HYPOTHESIS TESTING IN PYTHON

HYPOTHESIS TESTING IN PYTHON

Was this result statistically significant or just chance?

Need EA's data to determine this

Techniques from Sampling in Python + this course to do so

HYPOTHESIS TESTING IN PYTHON

respondent age_1st_code ... age hobbyist

[2261 rows x 8 columns]

HYPOTHESIS TESTING IN PYTHON

The mean annual compensation of the population of data scientists is $110,000

The point estimate (sample statistic):

HYPOTHESIS TESTING IN PYTHON

1 Bootstrap distributions are taught in Chapter 4 of Sampling in Python

HYPOTHESIS TESTING IN PYTHON

HYPOTHESIS TESTING IN PYTHON

HYPOTHESIS TESTING IN PYTHON

HYPOTHESIS TESTING IN PYTHON

z_score = (mean_comp_samp - mean_comp_hyp) / std_error

HYPOTHESIS TESTING IN PYTHON

HYPOTHESIS TESTING IN PYTHON

Hypothesis testing use case:

HYPOTHESIS TESTING IN PYTHON

HYPOTHESIS TESTING IN PYTHON

2. Defendant did not commit the crime

Two possible verdicts:

Initially the defendant is assumed to be not guilty

HYPOTHESIS TESTING IN PYTHON

"child" means they started before 14

Previous research: 35% of software developers started programming as children

Evidence that a greater proportion of data scientists starting programming as children?

HYPOTHESIS TESTING IN PYTHON

A hypothesis test is a test of two competing hypotheses

The null hypothesis (H0 ) is the existing idea

For our problem:

H0 : The proportion of data scientists starting programming as children is 35%

HYPOTHESIS TESTING IN PYTHON

The test ends in either "reject H0 " or "fail to reject H0 "

Significance level is "beyond a reasonable doubt" for hypothesis testing

HYPOTHESIS TESTING IN PYTHON

HA : The proportion of data scientists starting

This is a right-tailed test

HYPOTHESIS TESTING IN PYTHON

Large p-value, large support for H0

"small" means "close to zero"

HYPOTHESIS TESTING IN PYTHON

std_error = np.std(first_code_boot_distn, ddof=1)

z_score = (prop_child_samp - prop_child_hyp) / std_error

HYPOTHESIS TESTING IN PYTHON

Left-tailed test → use norm.cdf() .

Right-tailed test → use 1 - norm.cdf() .

from scipy.stats import norm

HYPOTHESIS TESTING IN PYTHON

Small p-value → reject null hypothesis

Where is the cutoff point?

HYPOTHESIS TESTING IN PYTHON

Common values of α are 0.2 , 0.1 , 0.05 , and 0.01

If p ≤ α, reject H0 , else fail to reject H0

HYPOTHESIS TESTING IN PYTHON

z_score = (prop_child_samp - prop_child_hyp) / std_error

p_value = 1 - norm.cdf(z_score, loc=0, scale=1)

HYPOTHESIS TESTING IN PYTHON

p_value <= alpha

HYPOTHESIS TESTING IN PYTHON

α = 0.05 → 95% confidence interval

HYPOTHESIS TESTING IN PYTHON

chosen H0 correct false negative

chosen HA false positive correct

HYPOTHESIS TESTING IN PYTHON

If p > α, we fail to reject H0 :

HYPOTHESIS TESTING IN PYTHON

You might also like