
Lecture 03. Statistical Inference

The document discusses statistical inference concepts including normal distribution, standard normal distribution, central limit theorem, point estimation, interval estimation, confidence intervals, hypothesis testing, type I and type II errors. Examples are provided to illustrate key concepts.


Fundamentals of Data Analytics

Lecture 03. Statistical Inference


Instructional Team
About this Course
- Probability
- Statistics
- Hands-on programming skills
- Meet your instructors

- Vinh Dang (PhD.), Data Scientist, Trusting Social
- Thuy Nguyen (M.Sc.), Data Analyst, VNG
- Sang Nguyen (M.Sc.), Data Scientist, FE Credit
- Huy Pham (PharmB), R&D Officer, OPC
Content
➔ Recall: Normal Distribution, the CLT & Sampling Distribution
➔ Point Estimation & Interval Estimation
➔ Hypothesis Testing
Normal Distribution
❖ A continuous probability distribution characterized by a symmetric, bell-shaped
curve.
X ∼ N(µ, σ²)
Standard Normal Distribution
❖ The Standard Normal Distribution Z, obtained by standardizing X with
Z = (X - µ) / σ, is the normal distribution with parameters µ = 0 and σ = 1
Z ∼ N(0, 1)
Normal Probabilities
Probability that X takes on values between a and b
P(a ≤ X ≤ b)
Steps to calculate Normal Probability:
- Calculate standardized Z-score by transforming X using Z = (X - μ) / σ
- Use the standard normal N(0,1) table or Z-table (www.z-table.com)

Example:
Let X equal the weight of a randomly selected infant. Assume X ~ N(3000, 1000²), i.e. µ = 3000 and σ = 1000 grams.
- What is the probability that a randomly selected infant has weight below 3500?
- What is the probability that a randomly selected infant has weight above 5000?
- What is the probability that a randomly selected infant has weight between
2500 and 4000?
Normal Probabilities
Example:
Let X equal the weight of a randomly selected infant. Assume X ~ N(3000, 1000²), i.e. µ = 3000 and σ = 1000 grams.
- What is the probability that a randomly selected infant has weight below 3500?
P(X ≤ 3500) = P(Z ≤ (3500-3000)/1000) = P(Z ≤ 0.5) = 0.6915

- What is the probability that a randomly selected infant has weight above 5000?
P(X ≥ 5000) = P(Z ≥ (5000-3000)/1000) = P(Z ≥ 2) = 1 - P(Z ≤ 2) = 0.0228

- What is the probability that a randomly selected infant has weight between
2500 and 4000?
P(2500 ≤ X ≤ 4000) = P(-0.5 ≤ Z ≤ 1) = P(Z ≤ 1) - P(Z ≤ -0.5) = 0.8413 - 0.3085 = 0.5328
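The three calculations above can be reproduced in a few lines of Python. Rather than reading a Z-table, this sketch computes the standard normal CDF from the error function in the standard library (`scipy.stats.norm.cdf` would work equally well):

```python
from math import erf, sqrt

def phi(z):
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

mu, sigma = 3000, 1000  # infant weights, X ~ N(3000, 1000^2)

p_below = phi((3500 - mu) / sigma)                             # P(X <= 3500)
p_above = 1 - phi((5000 - mu) / sigma)                         # P(X >= 5000)
p_between = phi((4000 - mu) / sigma) - phi((2500 - mu) / sigma)  # P(2500 <= X <= 4000)

print(round(p_below, 4))    # ~0.6915
print(round(p_above, 4))    # ~0.0228
print(round(p_between, 4))  # ~0.5328
```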
Population vs Sample
Sampling Distribution
The distribution of a statistic over all possible random samples of a given size
drawn from the same population.

Example:
- Consider a population that follows the normal distribution N(μ, σ²)
- Repeatedly take samples of a given size from this population
- Calculate the mean for each sample – this statistic is called the sample mean
- The distribution of these means is "sampling distribution of the sample mean"
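The procedure in the example can be sketched with a short simulation (the population parameters μ = 50, σ = 10 and sample size n = 30 are hypothetical choices for illustration):

```python
import random
from statistics import mean, stdev

random.seed(42)
mu, sigma, n = 50, 10, 30  # hypothetical population parameters and sample size

# Repeatedly draw samples of size n and record each sample mean
sample_means = [mean(random.gauss(mu, sigma) for _ in range(n))
                for _ in range(5000)]

# The sampling distribution of the mean centres on mu,
# with spread close to sigma / sqrt(n) (the standard error)
print(round(mean(sample_means), 1))
print(round(stdev(sample_means), 2))
```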
The Central Limit Theorem
Central Limit Theorem
If a random sample of size n is drawn from any population with mean μ and
standard deviation σ, the distribution of the sample mean X̄ approaches a normal
distribution with mean μ and standard deviation σ_X̄ = σ/√n (the standard error)
as the sample size increases
X̄ ~ N(μ, σ²/n)

As the sample size n increases, the distribution of sample means concentrates
around the population mean μ (i.e., the standard error of the mean
σ_X̄ = σ/√n gets smaller).
The Central Limit Theorem
Range of Sample Means
If we know μ and σ, the CLT allows us to predict the range of sample means for
samples of size n
[ μ - z*σ/√n, μ + z*σ/√n]

Example:
Within what interval would we expect GMAT sample means to fall for samples of n =
5 applicants? The population is approximately normal with parameters μ = 520.78
and σ = 86.80, so the predicted range for 95 percent of the sample means is
[ 520.78 - 1.96*86.80/√5, 520.78 + 1.96*86.80/√5 ] ≈ [444.7, 596.9]
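The GMAT example can be computed directly (using n = 5 as stated in the example):

```python
from math import sqrt

mu, sigma, n, z = 520.78, 86.80, 5, 1.96  # n = 5 per the example; z for 95%

half_width = z * sigma / sqrt(n)  # z * standard error
lo, hi = mu - half_width, mu + half_width

# 95% of sample means should fall in roughly [444.7, 596.9]
print(round(lo, 2), round(hi, 2))
```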
Estimation
❖ Point Estimation
❖ Interval Estimation
❖ Mean (μ) vs Proportion (π)
■ With known σ
■ With unknown σ
➢ Difference in Mean (μ1 - μ2)
➢ Difference in Proportion (π1 - π2)
❖ Sample size
Point Estimation
A point estimate is a single statistic, determined from a sample, that is used to
estimate the corresponding population parameter.

Example:
A sample mean x̄ calculated from a random sample x1, x2, ..., xn is a point
estimate of the unknown population mean μ.
Interval Estimation
An interval estimate is a range of values for a parameter: a point estimate plus
and minus a margin that expresses the uncertainty or variability associated with
the estimate
estimate ± (critical value of z or t) × (standard error)

Example:
Given a data set, we might conclude that the population mean falls somewhere between 10 and 100 (10 < μ < 100).
Confidence Interval for Mean
A 100(1 − α)% confidence interval for µ, the population mean, is given by the
interval estimate
x̄ - zα/2*σ/√n ≤ μ ≤ x̄ + zα/2*σ/√n
when the population variance is known

Interpretation of CI
❖ In repeated sampling, a 100(1 − α)% confidence interval is a range of values that
you can be 100(1 − α)% certain contains the true mean of the population
❖ This is not the same as a range that contains 95% of the values
Derivation of Confidence Interval (CI) for Mean

The confidence level (1 - α) indicates how confident we are that the population
mean lies within the indicated confidence interval

P(μ - zα/2*σ/√n ≤ x̄ ≤ μ + zα/2*σ/√n) = 1 - α
P(x̄ - zα/2*σ/√n ≤ μ ≤ x̄ + zα/2*σ/√n) = 1 - α
P(L ≤ μ ≤ U) = 1 - α

Example:
If the confidence level is 0.95, then zα/2 = 1.96.
We can say that we are 95% confident that
the population mean lies within the interval
x̄ - 1.96*σ/√n ≤ μ ≤ x̄ + 1.96*σ/√n
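The "in repeated sampling" interpretation can be checked with a quick simulation: build many 95% intervals from fresh samples and count how often they cover the true mean (the population values μ = 100, σ = 15 and sample size n = 25 here are hypothetical):

```python
import random
from math import sqrt
from statistics import mean

random.seed(0)
mu, sigma, n, z = 100, 15, 25, 1.96  # known-sigma case, 95% confidence

trials = 2000
covered = 0
for _ in range(trials):
    xbar = mean(random.gauss(mu, sigma) for _ in range(n))
    lo = xbar - z * sigma / sqrt(n)
    hi = xbar + z * sigma / sqrt(n)
    covered += lo <= mu <= hi  # does this interval contain the true mean?

# In repeated sampling, about 95% of the intervals contain mu
print(round(covered / trials, 2))
```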
Summary of Confidence Interval (CI)
Summary of Confidence Interval (CI) for Mean

Summary of Confidence Interval (CI) for Difference of Mean


Estimating Proportion π
A 100(1 − α)% confidence interval for the population proportion π is
p ± zα/2 * √( p(1 - p) / n )
Where:
❖ p is the Sample proportion
❖ zα/2 is the Critical value for Confidence level (1 - α) in Standard normal table
❖ n is the Sample size
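A minimal sketch of the proportion interval, with hypothetical data (88 successes in n = 200, so p = 0.44, at 95% confidence):

```python
from math import sqrt

p, n, z = 0.44, 200, 1.96  # hypothetical sample proportion, size, and z for 95%

se = sqrt(p * (1 - p) / n)       # standard error of the sample proportion
lo, hi = p - z * se, p + z * se  # p ± z * se
print(round(lo, 3), round(hi, 3))
```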
Sample size determination for a mean
Suppose we wish to estimate a population mean with a maximum allowable margin
of error of ± E. Setting E = zα/2*σ/√n and solving for n gives
n = (zα/2*σ / E)²

What if we don't know σ? How to estimate σ:
❖ Take a Preliminary Sample
→ Take a small sample to estimate σ
❖ Assume Uniform Population
→ Estimate upper and lower limits a and b and set σ = √( (b - a)² / 12 )
❖ Assume Normal Population
→ Estimate upper and lower bounds a and b, and set σ = (b - a) / 6
❖ Poisson Arrivals
→ In the special case when λ is a Poisson arrival rate, σ = √λ
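A minimal sketch of the sample-size calculation, with hypothetical values σ = 15 and a desired margin of error E = 2 at 95% confidence:

```python
from math import ceil

z, sigma, E = 1.96, 15, 2  # hypothetical: sigma ~ 15, want margin of error ±2

# n = (z * sigma / E)^2, rounded UP so the margin is not exceeded
n = ceil((z * sigma / E) ** 2)
print(n)  # 217
```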
Sample size determination for a proportion
Suppose we wish to estimate a population proportion with a maximum allowable
margin of error of ± E. Setting E = zα/2*√(π(1 - π)/n) and solving for n gives
n = (zα/2 / E)² * π(1 - π)

What if we don't know π? How to estimate π:
❖ Assume that π = 0.5 (the most conservative choice)
❖ Take a Preliminary Sample
→ Take a small sample to estimate π
❖ Use a Prior Sample or Historical Data
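With the conservative guess π = 0.5 and a hypothetical margin of error of ±3 percentage points at 95% confidence:

```python
from math import ceil

z, E = 1.96, 0.03  # 95% confidence, hypothetical margin of error ±0.03
pi = 0.5           # conservative guess when pi is unknown

n = ceil((z / E) ** 2 * pi * (1 - pi))
print(n)  # 1068, the classic opinion-poll sample size
```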
Type I Error & Type II Error

                    | H0 is True         | H0 is False
Reject H0           | Type I error (α)   | Correct decision
                    | (false positive)   |
Fail to Reject H0   | Correct decision   | Type II error (β)
                    |                    | (false negative)
❖ If we choose α = .05, we expect to commit a Type I error about 5 times in 100


❖ Depending on the situation, one error is more important than the other
❖ There is trade-off between Type I and Type II error
❖ The larger critical value needed to reduce α makes it harder to reject H0, thereby
increasing β
Example:
• A doctor who is conservative about admitting patients with symptoms of heart attack to the ICU (reduced β) will
admit more patients with no heart attack (increased α).
• More sensitive airport weapons detectors (reduced β) will inconvenience more safe passengers (increased α).
Type I Error & Type II Error
The examples below come from Top 45 Data Scientist Interview Questions:
False Positive is more important than False Negative:
An e-commerce site runs a marketing campaign that gives a $1000 gift voucher to customers who purchase at least $10,000 worth
of items. The marketing team sends the voucher mail directly to 100 randomly chosen customers (without checking the minimum
purchase condition), assuming a profit of at least 20% on items sold above $10,000. The issue arises if we send $1000 gift
vouchers to customers who have not actually purchased anything but are marked as having made $10,000 worth of purchases.

False Positive is less important than False Negative:

Assume an airport has received high-security threats. The security team identifies whether a particular passenger may be a
threat based on certain characteristics. Due to a shortage of staff, they decide to scan only passengers predicted as
risk positives by their model. What will happen if a true threat is flagged as non-threat by the airport's model?

False Positive is equally important as False Negative:

In the banking industry, giving loans is the primary source of making money, but if the repayment rate is poor you will not
make a profit and may instead risk huge losses. Banks don't want to lose good customers, and at the same time they don't
want to acquire bad customers. In this scenario, both false positives and false negatives are very important to measure.
One-sample Hypothesis Test Steps for a Single Mean
Basic Steps
❖ Define the null hypothesis, H0
❖ Define the alternative hypothesis, Ha, where Ha is usually of the form “not H0”
❖ Define the Type I error (the probability of falsely rejecting the null)
❖ Calculate the test statistic
❖ Calculate the p-value (the probability of getting a result as extreme as, or more
extreme than, the observed one if the null hypothesis is true)
❖ If p-value ≤ α, reject H0. Otherwise, fail to reject H0

Hint for stating the null hypothesis:

- H0 should always contain a statement of equality. Another way of thinking of it is that the null
hypothesis is a statement of "no difference," while Ha is the claim we are trying to find evidence in
favor of
- The null hypothesis is sometimes a statement of what you expect to happen in the experiment
Approaches to two-sided hypothesis testing
Using Confidence Interval - CI
❖ Create a 100(1 − α)% CI for the population parameter
❖ If the CI does not contain the null-hypothesis value, reject the null hypothesis
❖ If the CI contains the null-hypothesis value, fail to reject the null hypothesis

Example:
Let x̄ equal the mean weight of a random sample of 10 infants, with observed sample mean 2500 grams. The population
follows a normal distribution with standard deviation 1000 grams.

Question: Is the mean birth weight in this population different from 3000 grams?
Answer: With 95% confidence, we have:
x̄ - 1.96*σ/√n ≤ μ ≤ x̄ + 1.96*σ/√n
Or 2500 - 1.96*1000/√10 ≤ μ ≤ 2500 + 1.96*1000/√10
1880 ≤ μ ≤ 3120 (approximately)
Since 3000 lies inside the interval, we cannot say that the true mean is different from 3000.
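The CI approach can be sketched directly with the numbers from the example:

```python
from math import sqrt

xbar, sigma, n, mu0, z = 2500, 1000, 10, 3000, 1.96  # example values

lo = xbar - z * sigma / sqrt(n)
hi = xbar + z * sigma / sqrt(n)
reject = not (lo <= mu0 <= hi)  # reject H0 only if mu0 falls outside the CI

print(round(lo), round(hi), reject)  # 1880 3120 False
```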
Approaches to two-sided hypothesis testing
Using Critical Value - CV
❖ Calculate the critical value zc (CV) for the specified α
❖ Compute the test statistic zobs (TS)
❖ Reject the null hypothesis if |TS| > |CV|; fail to reject the null if |TS| ≤ |CV|

Example:
Let x̄ equal the mean weight of a random sample of 10 infants, with observed sample mean 2500 grams. The population
follows a normal distribution with standard deviation 1000 grams.

Question: Is the mean birth weight in this population different from 3000 grams?
Answer: With significance level α = 0.05, we have
- zc = 1.96 (recall that 2 × P(Z > |zc|) = 0.05)
- zobs = (x̄ - μ0)/(σ/√n) = (2500 - 3000)/(1000/√10) ≈ -1.58
Because |zobs| < |zc|, we cannot say that the true mean is different from 3000.
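The critical-value approach can be sketched as follows, with the numbers from the example:

```python
from math import sqrt

xbar, sigma, n, mu0 = 2500, 1000, 10, 3000  # example values
z_crit = 1.96                               # two-sided critical value, alpha = 0.05

z_obs = (xbar - mu0) / (sigma / sqrt(n))    # standardized test statistic
reject = abs(z_obs) > z_crit

print(round(z_obs, 2), reject)  # -1.58 False
```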
p-value
The p-value for a hypothesis test is the probability of obtaining a value of the test
statistic as extreme as, or more extreme than, the observed test statistic when the
null hypothesis is true.

❖ The rejection region is determined by the desired level of significance α, the
probability of committing a Type I error (i.e., of falsely rejecting the null)
❖ Reporting the p-value associated with a test gives an indication of how common
or rare the computed value of the test statistic is, given that the null
hypothesis is true
One-sample Hypothesis Test for a Single Mean
Example:

Assume a chair manufacturing process with normally distributed chair heights:

- a known standard deviation of 5cm
- a sample of 10 chairs
- a sample mean of 37.5cm

Question: Is the mean chair height in this production line different from 40cm?
One-sample Hypothesis Test for a Single Mean
❖ Set up a two-sided test of
- H0: mean = 40cm
- Ha: mean ≠ 40cm
❖ Let the Type I error rate be α = 0.05
- Calculate the test statistic
z = (x̄ - μ0)/(σ/√n) = (37.5 - 40)/(5/√10) ≈ -1.58
- What does this mean? Our observed mean is 1.58 standard errors below the
hypothesized mean
- The test statistic is the standardized value of our data assuming the null
hypothesis is true
- Question: if the true mean is 40cm, is our observed sample mean of 37.5cm
“common” or is this value unlikely to occur?
One-sample Hypothesis Test for a Single Mean
- Calculate the p-value to answer our question:
p-value = 2 × P(Z ≤ -1.58) ≈ 0.11
- If the true mean is 40cm, our data, or data more extreme than ours, would
occur in about 11 out of 100 studies (of the same size, n=10)
- General guideline: if the p-value is less than or equal to α, reject
the null hypothesis
- Conclusion: since the p-value (0.11) exceeds our chosen α = 0.05, we fail to
reject the null hypothesis
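The full chair-height test can be sketched end to end, computing both the test statistic and the two-sided p-value from the standard normal CDF (via the error function, so no Z-table is needed):

```python
from math import erf, sqrt

def phi(z):
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

xbar, mu0, sigma, n, alpha = 37.5, 40, 5, 10, 0.05  # example values

z_obs = (xbar - mu0) / (sigma / sqrt(n))  # standardized test statistic
p_value = 2 * phi(-abs(z_obs))            # two-sided p-value

print(round(z_obs, 2), round(p_value, 2))  # -1.58 0.11
print("reject H0" if p_value <= alpha else "fail to reject H0")
```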
Summary of One-sample Hypothesis Testing
Summary of One-sample Hypothesis Testing for One Mean

Summary of One-sample Hypothesis Testing for One Proportion


Reference
1. Doane, David P., and Lori E. Seward - Applied statistics in business and economics
2. Wasserman, Larry - All of statistics: a concise course in statistical inference
3. http://www-hsc.usc.edu/~eckel/biostat2/slides/lecture4.pdf
4. http://dsearls.org/courses/M120Concepts/ClassNotes/Statistics/530G_Derivation.htm
5. https://www.graphpad.com/guides/prism/7/statistics/stat_more_about_confidence_interval.htm?toc=0&printWindow
