Introduction to Statistics Part IV: Statistical Inference
Achim Ahrens, Anna Babloyan, Erkal Ersoy
September 2015
Outline
1. Descriptive statistics
   - Sample statistics (mean, variance, percentiles)
   - Graphs (box plot, histogram)
   - Data transformations (log transformation, unit of measure)
   - Correlation vs. causation
2. Probability theory
   - Conditional probabilities and independence
   - Bayes' theorem
3. Probability distributions
   - Discrete and continuous probability functions
   - Probability density function & cumulative distribution function
   - Binomial, Poisson and normal distribution
   - E[X] and V[X]
4. Statistical inference
   - Population vs. sample
   - Law of large numbers
   - Central limit theorem
   - Confidence intervals
   - Hypothesis testing and p-values
Introduction
Recall that in the last lecture we assumed that we know the probability distribution of the random variable in question as well as the parameters of the distribution (e.g. µ and σ² for the normal distribution). Under these assumptions, we were able to obtain the probability that the random variable takes values within a particular interval (e.g. P(X ≤ 8)).
[Figure: normal density curves f(x) for N(0, 1), N(0, 2) and N(0, 3), plotted for x from −8 to 8.]
What if we don’t know µ?
Population vs. sample
Suppose we are interested in the distribution of heights in the UK. The residents of the UK are the population; the parameter µ is the true average height of UK residents and σ² the true variance.
If we were to measure the height of all UK residents, we would conduct a census. However, measuring the height of every individual is hardly feasible, or feasible only at an exorbitant cost. Instead, we can randomly select a sample from the population and make inferences from the sample to the population.
In particular, we can use the sample statistics (e.g. sample mean and sample variance) to make inferences about the true, but unknown, population parameters (µ and σ²).
Population vs. sample
We randomly select a sample from the UK population and measure the
heights of the individuals in the sample.
Simple random sample
A simple random sample is one in which each individual in the population has an equal chance of being chosen.
Since the draws are random, the height of the first, second, third, . . . nth
selected individual is random, too. That is, X1 , X2 , . . . , Xn are random
variables.
I.I.D.
Suppose we draw n items (X1, X2, ..., Xn) at random from the same population. Since X1, X2, ..., Xn are drawn from the same population, they are identically distributed. Furthermore, since the realisation of Xi does not depend on the realisation of Xj (for i, j = 1, ..., n; i ≠ j), they are independently distributed. We say that X1, X2, ..., Xn are independently and identically distributed (i.i.d.).
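These two definitions can be sketched in a few lines of Python. The population below is simulated, an assumption purely for illustration; in practice it would be every UK resident, which is exactly what we cannot enumerate.

```python
import random

random.seed(7)

# A stand-in population of heights in cm (hypothetical; the real
# population would be every UK resident).
population = [random.gauss(170, 10) for _ in range(100_000)]

# Simple random sample: each individual has the same chance of
# being chosen; random.sample draws without replacement.
sample = random.sample(population, 10)

# Each draw comes from the same population, so the X_i are
# identically distributed; for a large population, draws without
# replacement are approximately independent, so the X_i are
# (approximately) i.i.d.
print(sample)
```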
Population vs. sample
Now, we draw a sample (n = 10, heights in cm):
182 197 183 171 171 162 152 157 192 174
Given this sample, what is our best guess about µ? It’s just the sample
mean.
$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i = \frac{1}{10}(182 + \cdots + 174) = 174.1$$
The sample mean is an unbiased and consistent estimator of the
unknown population mean µ.
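As a minimal check of this computation (Python standard library, heights as listed above):

```python
import statistics

# The n = 10 sampled heights (cm) from above.
heights = [182, 197, 183, 171, 171, 162, 152, 157, 192, 174]

# Our point estimate of the population mean mu: the sample mean.
xbar = statistics.fmean(heights)
print(xbar)  # -> 174.1
```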
Unbiasedness vs. consistency
To understand unbiasedness, note that the sampling distribution of x̄ is
centered at µ. When we repeatedly sample (more on this in a bit), x̄ is
sometimes above the true value of the parameter µ and sometimes below
it. However, the key aspect here is that there is no systematic tendency
to overestimate or underestimate the true parameter. This makes x̄ an
unbiased estimator of the parameter µ.
CLT in Action
[Figures: histograms (density scale) of the simulated sample means after 100, 5,000 and 10,000 repetitions.]
Central limit theorem
[Figure: histogram (density scale) of the 10,000 simulated sample means.]
The mean of $\bar{x}^{(1)}, \bar{x}^{(2)}, \ldots, \bar{x}^{(10{,}000)}$ is 170.0007 and the standard deviation is 0.8139225.
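A sketch of such a repeated-sampling experiment. The population parameters µ = 170, σ = 10 and sample size n = 100 are assumptions for illustration; the slides' own simulation settings are not shown, so the numbers differ from the 0.8139 above.

```python
import random
import statistics

random.seed(42)

# Assumed population parameters and sample size (illustrative only).
MU, SIGMA, n = 170, 10, 100
reps = 10_000

# Draw `reps` independent samples of size n; record each sample mean.
sample_means = [
    statistics.fmean(random.gauss(MU, SIGMA) for _ in range(n))
    for _ in range(reps)
]

# The sampling distribution of xbar centres on mu, with
# standard deviation sigma / sqrt(n) = 1 here.
print(statistics.fmean(sample_means))
print(statistics.stdev(sample_means))
```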
The mean and the standard deviation of x̄
If x̄ is the mean of a random sample of size n drawn from a large population with mean µ and standard deviation σ, then the mean of the sampling distribution of x̄ is µ and its standard deviation is σ/√n. More formally, the central limit theorem makes this precise.
Short digression: The expected value of X̄
$$\bar{X} = \frac{1}{N}\sum_{i=1}^{N} X_i = \frac{1}{N}X_1 + \frac{1}{N}X_2 + \frac{1}{N}X_3 + \cdots + \frac{1}{N}X_N$$
From last lecture, we know that the expectation of a sum is the
sum of the expectations and thus:
$$\begin{aligned}
E[\bar{X}] = E\left[\frac{1}{N}\sum_{i=1}^{N} X_i\right] &= E\left[\frac{1}{N}X_1\right] + E\left[\frac{1}{N}X_2\right] + \cdots + E\left[\frac{1}{N}X_N\right] \\
&= \frac{1}{N}E[X_1] + \frac{1}{N}E[X_2] + \cdots + \frac{1}{N}E[X_N] \\
&= \frac{1}{N}\mu + \frac{1}{N}\mu + \cdots + \frac{1}{N}\mu \\
&= \mu
\end{aligned}$$
Short digression: The variance of X̄
$$\begin{aligned}
V[\bar{X}] = V\left[\frac{1}{N}\sum_{i=1}^{N} X_i\right] &= V\left[\frac{1}{N}X_1 + \frac{1}{N}X_2 + \cdots + \frac{1}{N}X_N\right] \\
&= \frac{1}{N^2}V[X_1] + \frac{1}{N^2}V[X_2] + \cdots + \frac{1}{N^2}V[X_N] \\
&= \frac{1}{N^2}\sigma^2 + \frac{1}{N^2}\sigma^2 + \cdots + \frac{1}{N^2}\sigma^2 \\
&= \frac{\sigma^2}{N}
\end{aligned}$$
where the second step uses the independence of the Xi, so all covariance terms vanish. This result tells us that the variance of the sample mean decreases as the sample size increases.
Making statistical inferences
Confidence intervals
[Figure: standard normal density N(0, 1) with the central 68% and 95% areas around the mean marked.]
Making statistical inferences
Confidence intervals
As discussed earlier, the sample mean x̄ is an appropriate estimator of the unknown population mean µ because it is an unbiased estimator of µ and it approaches the true population parameter as the sample size increases. We have also mentioned, however, that this estimate varies from sample to sample. So, how reliable is this estimator? To answer this question, we need to consider the spread as well. From the central limit theorem (CLT), we know that if the population mean is µ and the standard deviation is σ, then repeated samples of n observations should yield a sample mean x̄ with the following distribution: $\bar{X} \sim N\!\left(\mu, \frac{\sigma^2}{n}\right)$.
Confidence Interval
A confidence interval with confidence level C consists of two parts:
1. An interval obtained from the data, of the form
   estimate ± margin of error
2. A chosen confidence level C, which gives the probability that the calculated interval will contain the true parameter value.
Confidence Intervals
Calculating the interval
Example
Suppose a student measuring the boiling temperature of a certain liquid observes the readings (in degrees Celsius) 102.5, 101.7, 103.1, 100.9, 100.5, and 102.2 on 6 different samples of the liquid. He calculates the sample mean to be 101.82. If he knows that the standard deviation for this procedure is 1.2 degrees, what is the confidence interval for the population mean at a 95% confidence level?
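The arithmetic for this example can be sketched as follows, with z* = 1.96 as the standard normal critical value for 95% confidence and σ treated as known:

```python
import math
import statistics

readings = [102.5, 101.7, 103.1, 100.9, 100.5, 102.2]
sigma = 1.2     # known standard deviation of the procedure
z_star = 1.96   # standard normal critical value for C = 95%

xbar = statistics.fmean(readings)
margin = z_star * sigma / math.sqrt(len(readings))

# 95% CI: estimate +/- margin of error
print(f"{xbar:.2f} +/- {margin:.2f}")                 # -> 101.82 +/- 0.96
print(f"({xbar - margin:.2f}, {xbar + margin:.2f})")  # -> (100.86, 102.78)
```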
Confidence Intervals
Behaviour of confidence intervals
Confidence intervals get smaller as:
1. The number of observations, n, increases
2. The level of confidence decreases
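Both effects can be read directly off the margin-of-error formula z*·σ/√n. The σ below is illustrative; the z* values are the standard normal critical values for each confidence level.

```python
import math

sigma = 2.0  # illustrative population standard deviation

# 1. Larger n shrinks the margin (95% confidence, z* = 1.96);
#    quadrupling n halves the margin of error.
for n in (10, 40, 160):
    print(n, round(1.96 * sigma / math.sqrt(n), 2))

# 2. Lower confidence shrinks the margin (n = 10 fixed):
for z_star, level in ((2.576, "99%"), (1.96, "95%"), (1.645, "90%")):
    print(level, round(z_star * sigma / math.sqrt(10), 2))
```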
Tests of Significance
Why we need them (and some terminology)
Null hypothesis, H0
The statement or hypothesis being tested in a significance test is called the null hypothesis, normally denoted H0. A significance test assesses the evidence for and against this hypothesis and allows us to either reject or fail to reject H0.
Consider the following example to see how we can put these to use.
Tests of Significance
Example: Are the bottles being filled as advertised?
Suppose we are appointed as inspectors at an Irn Bru factory here in Scotland.
We have data on past production and observe that the distribution of the
contents is normal with standard deviation of 2ml. To assess the bottling
process, we randomly select 10 bottles, measure their contents and obtain the
following results:
502.9 499.8 503.2 502.8 500.9 503.9 498.2 502.5 503.8 501.4
For this sample of observations, the mean content, x̄, is 501.94 ml. Is this sample mean far enough from 500 ml to provide convincing evidence that the mean content of all bottles produced at the factory differs from the advertised amount of 500 ml?
Tests of Significance
P-values
Consider our earlier example about the Irn Bru factory, where we
calculated the z-statistic to be 3.07 using our sample of size
n = 10, standard deviation of 2 ml and the sample mean
x̄ = 501.94:
$$z = \frac{\bar{x} - \mu}{\sigma/\sqrt{n}} = \frac{501.94 - 500}{2/\sqrt{10}} = 3.07$$
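A sketch of this calculation, together with the corresponding two-sided p-value from the standard normal distribution (via the complementary error function, which the standard library provides):

```python
import math

xbar, mu0, sigma, n = 501.94, 500.0, 2.0, 10

# z-statistic: how many standard errors xbar lies from mu0.
z = (xbar - mu0) / (sigma / math.sqrt(n))
print(round(z, 2))  # -> 3.07

# Two-sided p-value: P(|Z| >= z) under H0, using the erfc
# relation for the standard normal CDF.
p = math.erfc(abs(z) / math.sqrt(2))
print(round(p, 4))
```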
Tests of Significance
Calculating p-values
Now that we have obtained the p-value, we need to decide what level of significance to use in our test. The significance level determines how much evidence we require to reject H0, and is usually denoted by the Greek letter alpha, α.
If we choose α = 0.05, rejecting H0 requires evidence so strong that it would
happen no more than 5% of the time if H0 is true. If we choose α = 0.01, we
require even stronger evidence against H0 to be able to reject it: evidence
against H0 would need to be so strong that it would happen only 1% of the
time if H0 is true.
Tests of Significance
P-values and statistical significance
Statistical significance
If the p-value we calculate is smaller than our chosen α, we reject H0 at significance level α.
For example, rejecting H0 at α = 0.01 suggests that there is very strong evidence against the null hypothesis: if H0 were true, a sample mean at least as extreme as the one observed would occur no more than 1% of the time.
Tests of Significance
One- and two-sided alternative hypotheses
Tests of Significance
Summary 1/2
I Significance tests allow us to formally assess the evidence
against a null hypothesis (H0 ) provided by data. This way, we
can judge whether the deviations from what the null
hypothesis suggests are due to chance.
I When stating hypotheses, H0 is usually a statement that no
effect exists (e.g. all bottles at a factory are filled with a mean
quantity of 500 ml). The alternative hypothesis, Ha , on the
other hand, suggests that a parameter differs from its null
value in either direction (two-sided alternative) or in a specific
direction (one-sided alternative).
I The test itself is conducted using a test statistic. The
corresponding p-value is calculated assuming H0 is true, and it
indicates the probability that the test statistic will take a value
at least as "surprising" as the observed one.
Tests of Significance
Summary 2/2
Tests of Significance
Standard error
Tests of Significance
z versus t distribution
Tests of Significance
t distributions
Confidence Intervals
...using t distributions
And formally,
t Confidence Interval
A level C confidence interval for a population mean µ is
$$\bar{x} \pm t^* \times \frac{s}{\sqrt{n}}$$
where t* is the critical value with area C between −t* and t* under the t(n − 1) density curve, and n − 1 is the degrees of freedom.
Confidence Intervals
...using t distributions
Example
Here are monthly dollar amounts for phone service for a random sample of 8 households: 43, 47, 51, 36, 50, 42, 37, 41. We would like to construct a 95% CI for the average monthly expenditure, µ.
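A sketch of this computation. The critical value t*(df = 7) ≈ 2.365 for 95% confidence is hard-coded, since the Python standard library has no t-distribution quantile function:

```python
import math
import statistics

amounts = [43, 47, 51, 36, 50, 42, 37, 41]  # monthly $ amounts, n = 8

xbar = statistics.fmean(amounts)
s = statistics.stdev(amounts)  # sample standard deviation (n - 1 divisor)
t_star = 2.365                 # t*(df = 7) for C = 95% (hard-coded)

margin = t_star * s / math.sqrt(len(amounts))
print(f"({xbar - margin:.2f}, {xbar + margin:.2f})")  # -> (38.71, 48.04)
```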
Hypothesis tests
...using t distributions
Example
Suppose that the overall U.S. average monthly expenditure for phone service is $49. Is the sample mean, x̄, of 43.5 different from the national average of $49?
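The corresponding one-sample t test can be sketched as follows, with the 5% two-sided critical value t*(df = 7) ≈ 2.365 again hard-coded; the sample mean and standard deviation are recomputed from the data above:

```python
import math
import statistics

amounts = [43, 47, 51, 36, 50, 42, 37, 41]
mu0 = 49.0  # hypothesised national average under H0

xbar = statistics.fmean(amounts)
s = statistics.stdev(amounts)

# t-statistic: distance of xbar from mu0 in estimated standard errors.
t = (xbar - mu0) / (s / math.sqrt(len(amounts)))
print(round(t, 2))  # -> -2.85

# Two-sided test at the 5% level: reject H0 if |t| > t*(7) ~= 2.365.
print(abs(t) > 2.365)  # -> True
```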