0% found this document useful (0 votes)

9 views

Lecture 7

Uploaded by

fyfd

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Lecture 7

Uploaded by

fyfd

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 82

Statistical inference: CLT,

confidence intervals, p-
values
Statistical Inference
The process of making
guesses about the truth
from a sample. Sample statistics

n
x
̂  X n  i 1
n
n

Truth (not  (x  X 2
i n)
ˆ 2 s 2  i 1
n 1
observable)
Sample *hat notation ^ is often used to indicate

Population (observation)
“estitmate”

parameters
N N

x  (x   )
i
2

  i 1  2  i 1
N N
Make guesses
about the whole
population
Statistics vs. Parameters
 Sample Statistic – any summary measure calculated
from data; e.g., could be a mean, a difference in means
or proportions, an odds ratio, or a correlation coefficient

E.g., the mean vitamin D level in a sample of 100 men is 63
nmol/L

E.g., the correlation coefficient between vitamin D and
cognitive function in the sample of 100 men is 0.15

 Population parameter – the true value/true effect in

the entire population of interest

E.g., the true mean vitamin D in all middle-aged and older
European men is 62 nmol/L

E.g., the true correlation between vitamin D and cognitive
function in all middle-aged and older European men is 0.15
Examples of Sample
Statistics:
Single population mean
Single population proportion
Difference in means (ttest)
Difference in proportions (Z-test)
Odds ratio/risk ratio
Correlation coefficient
Regression coefficient
…
Example 1: cognitive
function and vitamin D
 Hypothetical data loosely based on [1]; cross-
sectional study of 100 middle-aged and older
European men.
 Estimation: What is the average serum vitamin
D in middle-aged and older European men?
 Sample statistic: mean vitamin D levels
 Hypothesis testing: Are vitamin D levels and
cognitive function correlated?
 Sample statistic: correlation coefficient between
vitamin D and cognitive function, measured by the
Digit Symbol Substitution Test (DSST).

1. Lee DM, Tajar A, Ulubaev A, et al. Association between 25-hydroxyvitamin D levels and cognitive performance
in middle-aged and older European men. J Neurol Neurosurg Psychiatry. 2009 Jul;80(7):722-9.
Distribution of a trait:
vitamin D

Right-skewed!
Mean= 63 nmol/L
Standard deviation = 33
nmol/L
Distribution of a trait: DSST

Normally distributed
Mean = 28 points
Standard deviation = 10 points
Distribution of a statistic…
 Statistics follow distributions too…
 But the distribution of a statistic is a
theoretical construct.
 Statisticians ask a thought experiment: how
much would the value of the statistic fluctuate
if one could repeat a particular study over and
over again with different samples of the same
size?
 By answering this question, statisticians are
able to pinpoint exactly how much uncertainty
is associated with a given statistic.
Distribution of a statistic
 Two approaches to determine the
distribution of a statistic:
 1. Computer simulation

Repeat the experiment over and over again
virtually!

More intuitive; can directly observe the behavior
of statistics.
 2. Mathematical theory

Proofs and formulas!

More practical; use formulas to solve problems.
Example of computer
simulation…
 How many heads come up in 100
coin tosses?
 Flip coins virtually
 Flip a coin 100 times; count the number
of heads.
 Repeat this over and over again a large
number of times (we’ll try 30,000
repeats!)
 Plot the 30,000 results.
Coin tosses…

Conclusions:
We usually get
between 40 and 60
heads when we flip
a coin 100 times.
It’s extremely
unlikely that we will
get 30 heads or 70
heads (didn’t
happen in 30,000
experiments!).
Distribution of the sample
mean, computer simulation…
 1. Specify the underlying distribution of
vitamin D in all European men aged 40 to 79.

Right-skewed

Standard deviation = 33 nmol/L

True mean = 62 nmol/L (this is arbitrary; does not
affect the distribution)
 2. Select a random sample of 100 virtual men
from the population.
 3. Calculate the mean vitamin D for the
sample.
 4. Repeat steps (2) and (3) a large number of
times (say 1000 times).
 5. Explore the distribution of the 1000 means.
Distribution of mean
vitamin D (a sample
statistic)
Normally distributed!
Surprise!
Mean= 62 nmol/L (the true
mean)
Standard deviation = 3.3
nmol/L
Distribution of mean
vitamin D (a sample
statistic)
 Normally distributed (even though
the trait is right-skewed!)
 Mean = true mean
 Standard deviation = 3.3 nmol/L
 The standard deviation of a statistic is
called a standard error
s
 The standard error of a mean =
n
If I increase the sample
size to n=400…
Standard error = 1.7 nmol/L
s 33
 1.7
n 400
If I increase the variability of
vitamin D (the trait) to
SD=40…
Standard error = 4.0 nmol/L

s 40
 4.0
n 100
Mathematical Theory…
The Central Limit
Theorem!
If all possible random samples, each of size n, are
taken from any population with a mean  and a
standard deviation , the sampling distribution of
the sample means (averages) will:

x The mean of the sample means.

x The standard deviation of the sample means.

Also called “the standard error of the mean.”
Mathematical Proof
(optional!)
If X is a random variable from any distribution with known
mean, E(x), and variance, Var(x), then the expected
value and variance of the average of n observations of
X is:

n n

x i  E ( x) nE ( x)
E ( X n ) E ( i 1 )  i 1  E ( x )
n n n
n n

x i  Var ( x) nVar ( x) Var ( x)

Var ( X n ) Var ( i 1 )  i 1 2  2

n n n n
Computer simulation of the C
(this is what we will do in lab next Wednesday!)

1. Pick any probability distribution and specify a mean

and standard deviation.
2. Tell the computer to randomly generate 1000
observations from that probability distributions
E.g., the computer is more likely to spit out values with
high probabilities
3. Plot the “observed” values in a histogram.
4. Next, tell the computer to randomly generate 1000
averages-of-2 (randomly pick 2 and take their average)
from that probability distribution. Plot “observed”
averages in histograms.
5. Repeat for averages-of-10, and averages-of-100.
Uniform on [0,1]: average
of 1
(original distribution)
Uniform: 1000 averages
of 2
Uniform: 1000 averages
of 5
Uniform: 1000 averages of
100
~Exp(1): average of 1
(original distribution)
~Exp(1): 1000 averages
of 2
~Exp(1): 1000 averages
of 5
~Exp(1): 1000 averages of
100
~Bin(40, .05): average
of 1
(original distribution)
~Bin(40, .05): 1000
averages of 2
~Bin(40, .05): 1000
averages of 5
~Bin(40, .05): 1000 averages
of 100
The Central Limit
Theorem:
If all possible random samples, each of size n, are
taken from any population with a mean  and a
standard deviation , the sampling distribution of
the sample means (averages) will:

1. have mean:  x 

2. have standard deviation: x 
n
3. be approximately normally distributed regardless of the shape
of the parent population (normality improves with larger n)
Central Limit Theorem
caveats for small samples:
 For small samples:
 The sample standard deviation is an imprecise
estimate of the true standard deviation (σ); this
imprecision changes the distribution to a T-
distribution.

A t-distribution approaches a normal distribution for large n
(100), but has fatter tails for small n (<100)
 If the underlying distribution is non-normal, the
distribution of the means may be non-normal.

More on T-distributions next week!!

Summary: Single
population mean (large n)
 Hypothesis test:
observed mean  null mean
Z
s
n

 Confidence Interval
s
confidence interval observed mean Z/2 * ( )
n
Single population mean
(small n, normally
distributed trait)
 Hypothesis test:
observed mean  null mean
Tn  1 
s
n

 Confidence Interval
s
confidence interval observed mean Tn  1,/2 * ( )
n
Examples of Sample
Statistics:
Single population mean
Single population proportion
Difference in means (ttest)
Difference in proportions (Z-test)
Odds ratio/risk ratio
Correlation coefficient
Regression coefficient
…
Distribution of a correlation
coefficient?? Computer
simulation…
 1. Specify the true correlation coefficient
 Correlation coefficient = 0.15
 2. Select a random sample of 100 virtual
men from the population.
 3. Calculate the correlation coefficient for
the sample.
 4. Repeat steps (2) and (3) 15,000 times
 5. Explore the distribution of the 15,000
correlation coefficients.
Distribution of a
correlation coefficient…

Normally distributed!
Mean = 0.15 (true correlation)
Standard error = 0.10
Distribution of a
correlation coefficient in
general…
 1. Shape of the distribution

Normally distributed for large samples

T-distribution for small samples (n<100)
 2. Mean = true correlation
coefficient (r) 2
1 r
 3. Standard error 
n
Many statistics follow
normal (or t-distributions)
…
 Means/difference in means
 T-distribution for small samples
 Proportions/difference in
proportions
 Regression coefficients
 T-distribution for small samples
 Natural log of the odds ratio
Estimation (confidence
intervals)…
 What is a good estimate for the
true mean vitamin D in the
population (the population
parameter)?
 63 nmol/L +/- margin of error
95% confidence interval
 Goal: capture the true effect (e.g.,
the true mean) most of the time.
 A 95% confidence interval should
include the true effect about 95%
of the time.
 A 99% confidence interval should
include the true effect about 99%
of the time.
Recall: 68-95-99.7 rule for normal distributions! These is a
95% chance that the sample mean will fall within two
standard errors of the true mean= 62 +/- 2*3.3 = 55.4 nmol/L
to 68.6 nmol/L
Mean - 2 Std error=55.4 Mean Mean + 2 Std error =68.6

To be precise,
95% of
observations fall
between Z=-1.96
and Z= +1.96 (so
the “2” is a
rounded number)
…
95% confidence interval
 There is a 95% chance that the sample
mean is between 55.4 nmol/L and 68.6
nmol/L
 For every sample mean in this range,
sample mean +/- 2 standard errors will
include the true mean:

For example, if the sample mean is 68.6
nmol/L:

95% CI = 68.6 +/- 6.6 = 62.0 to 75.2

This interval just hits the true mean, 62.0.
95% confidence interval
 Thus, for normally distributed statistics,
the formula for the 95% confidence
interval is:
 sample statistic  2 x (standard error)
 Examples:
 95% CI for mean vitamin D:

63 nmol/L  2 x (3.3) = 56.4 – 69.6 nmol/L
 95% CI for the correlation coefficient:

0.15  2 x (0.1) = -.05 – .35
Simulation of 20 studies of
100 men…
Vertical line indicates the true mean (62)

95% confidence
intervals for the mean
vitamin D for each of
the simulated studies.

Only 1 confidence
interval missed the true
mean.
Confidence Intervals give:
*A plausible range of values for a
population parameter.
*The precision of an estimate.(When
sampling variability is high, the
confidence interval will be wide to reflect
the uncertainty of the observation.)
*Statistical significance (if the 95% CI
does not cross the null value, it is
significant at .05)
Confidence Intervals
The value of the statistic in my
sample (eg., mean, odds ratio,
etc.)
point estimate  (measure of how
confident we want to be)  (standard
error)
From a Z table or a T table,
depending on the sampling
distribution of the statistic.

Standard error of the statistic.

Common “Z” levels of
confidence
 Commonly used confidence levels
are 90%, 95%, and 99%
Confidence
Z value
Level

80% 1.28
90% 1.645
95% 1.96
98% 2.33
99% 2.58
99.8% 3.08
99.9% 3.27
99% confidence
intervals…
 99% CI for mean vitamin D:

63 nmol/L  2.6 x (3.3) = 54.4 – 71.6
nmol/L
 99% CI for the correlation coefficient:

0.15  2.6 x (0.1) = -.11 – .41
Testing Hypotheses
 1. Is the mean vitamin D in middle-
aged and older European men
lower than 100 nmol/L (the
“desirable” level)?
 2. Is cognitive function correlated
with vitamin D?
Is the mean vitamin D
different than 100?
 Start by assuming that the mean =
100
 This is the “null hypothesis”
 This is usually the “straw man” that
we want to shoot down
 Determine the distribution of
statistics assuming that the null is
true…
Computer simulation
(10,000 repeats)…

This is called the

null distribution!

Normally
distributed
Std error = 3.3
Mean = 100
Compare the null
distribution to the
observed value…
What’s the
probability of
seeing a sample
It didn’t happen
mean of 63
in 10,000
nmol/L if the true
simulated
mean is 100
studies. So the
nmol/L?
probability is less
than 1/10,000
Compare the null
distribution to the
observed value…

This is the p-
value!
P-value <
1/10,000
Calculating the p-value
with a formula…
Because we know how normal curves work, we can exactly calculate the
probability of seeing an average of 63 nmol/L if the true average weight is
100 (i.e., if our null hypothesis is true):

63  100
Z 11 .2
3.3
Z= 11.2, P-value << .0001
The P-value
P-value is the probability that we would have seen our
data (or something more unexpected) just by chance if
the null hypothesis (null value) is true.

Small p-values mean the null value is unlikely given

our data.

Our data are so unlikely given the null hypothesis

(<<1/10,000) that I’m going to reject the null
hypothesis! (Don’t want to reject our data!)
P-value<.0001 means:

The probability of seeing what you saw or something

more extreme if the null hypothesis is true (due to
chance)<.0001

P(empirical data/null hypothesis) <.0001

The P-value
 By convention, p-values of <.05 are
often accepted as “statistically
significant” in the medical literature; but
this is an arbitrary cut-off.

 A cut-off of p<.05 means that in about 5

of 100 experiments, a result would
appear significant just by chance (“Type
I error”).
Summary: Hypothesis
Testing
The Steps:
1. Define your hypotheses (null, alternative)
2. Specify your null distribution
3. Do an experiment
4. Calculate the p-value of what you
observed
5. Reject or fail to reject (~accept) the null
hypothesis
Hypothesis Testing
The Steps:
1. Define your hypotheses (null, alternative)
 The null hypothesis is the “straw man” that we are trying to shoot down.
 Null here: “mean vitamin D level = 100 nmol/L”
 Alternative here: “mean vit D < 100 nmol/L” (one-sided)
2. Specify your sampling distribution (under the null)
 If we repeated this experiment many, many times, the mean vitamin D
would be normally distributed around 100 nmol/L with a standard error
33
of 3.3 100
3.3

3. Do a single experiment (observed sample mean = 63 nmol/L)

4. Calculate the p-value of what you observed (p<.0001)
5. Reject or fail to reject the null hypothesis (reject)
 Confidence intervals give the same
information (and more) than
hypothesis tests…
Duality with hypothesis
tests.
95% confidence interval Null value

50 60 70 80 90 100

Null hypothesis: Average vitamin D is 100 nmol/L

Alternative hypothesis: Average vitamin D is not 100
nmol/L (two-sided)
P-value < .05
Duality with hypothesis
tests.
99% confidence interval Null value

50 60 70 80 90 100

Null hypothesis: Average vitamin D is 100 nmol/L

Alternative hypothesis: Average vitamin D is not 100
nmol/L (two-sided)
P-value < .01
2. Is cognitive function
correlated with vitamin D?
 Null hypothesis: r = 0
 Alternative hypothesis: r  0
 Two-sided hypothesis
 Doesn’t assume that the correlation
will be positive or negative.
Computer simulation
(15,000 repeats)…

Null distribution:
Normally
distributed
Std error = 0.1
Mean = 0
What’s the probability of
our data?

Even when the true

correlation is 0, we get
correlations as big as
0.15 or bigger 7% of the
time.
What’s the probability of
our data?

This is a two-sided
hypothesis test, so “more
extreme” includes as big or
bigger negative correlations
(<-0.15).

P-value = 7% + 7% = 14%
What’s the probability of
our data?

Our results could have

happened purely due to a
fluke of chance!
Formal hypothesis test
 1. Null hypothesis: r=0
 Alternative: r  0 (two-sided)
 2. Determine the null distribution

Normally distributed

Standard error = 0.1
 3. Collect Data, r=0.15
 4. Calculate the p-value for the data:
 Z= 0.15  0 Z of 1.5 corresponds to a
1.5 two-sided p-value of 14%
.1
 5. Reject or fail to reject the null (fail to reject)
Or use confidence interval
to gauge statistical
significance…
 95% CI = -0.05 to 0.35
 Thus, 0 (the null value) is a
plausible value!
 P>.05
Examples of Sample
Statistics:
Single population mean
Single population proportion
Difference in means (ttest)
Difference in proportions (Z-test)
Odds ratio/risk ratio
Correlation coefficient
Regression coefficient
…
Example 2: HIV vaccine
trial
 Thai HIV vaccine trial (2009)
 8197 randomized to vaccine

 8198 randomized to placebo

 Generated a lot of public discussion

about p-values!
51/8197 vs. 75/8198
=23 excess infections in the
placebo group.
=2.8 fewer infections per
1000 people vaccinated

Source: BBC news, http://news.bbc.co.uk/go/pr/fr/-/2/hi/health/8272113.stm

Null hypothesis
 Null hypothesis: infection rate is
the same in the two groups
 Alternative hypothesis: infection
rates differ
Computer simulation
assuming the null (15,000
repeats)…

Normally distributed,
standard error = 11.1
Computer simulation
assuming the null (15,000
repeats)…

If the vaccine is
completely
ineffective, we
could still get
23 excess
infections just
by chance.

Probability of
23 or more
excess
infections =
0.04
How to interpret p=.04…
 P(data/null) = .04
 P(null/data) .04

 P(null/data)  22%
*estimated using Bayes’ Rule
(and prior data on the vaccine)

*Gilbert PB, Berger JO, Stablein D, Becker S, Essex M, Hammer SM, Kim JH, DeGruttola VG.
Statistical interpretation of the RV144 HIV vaccine efficacy trial in Thailand: a case study for
statistical issues in efficacy trials. J Infect Dis 2011; 203: 969-975.
Alternative analysis of the
data (“intention to treat”)
…
 56/8202 (6.8 per 1000) infections
in the vaccine group versus
76/8200 (9.3 per 1000)
Computer simulation
assuming the null (15,000
repeats)…

Probability of
20 or more
excess
infections =
0.08

P=.08 is only
slightly different
than p=.04!
Confidence intervals…
 95% CI (analysis 1): .0014 to .0055

 95% CI (analysis 2): -.0003

to .0051

 The plausible ranges are nearly

identical!

Statistical Inference: CLT, Confidence Intervals, P-Values
No ratings yet
Statistical Inference: CLT, Confidence Intervals, P-Values
82 pages
Statistical Inference: CLT, Confidence Intervals, P-Values
No ratings yet
Statistical Inference: CLT, Confidence Intervals, P-Values
82 pages
Point Estimation
No ratings yet
Point Estimation
7 pages
Foundations of Statistical Inference
No ratings yet
Foundations of Statistical Inference
22 pages
2 Hypothesis Testing
No ratings yet
2 Hypothesis Testing
22 pages
Chapter 8 Sampling and Estimation
No ratings yet
Chapter 8 Sampling and Estimation
14 pages
13 Final Review
No ratings yet
13 Final Review
32 pages
Types of Statistics
No ratings yet
Types of Statistics
7 pages
Chapter 05 W7 L1 Random Sample 2015 UTP C5
No ratings yet
Chapter 05 W7 L1 Random Sample 2015 UTP C5
8 pages
One sample inf
No ratings yet
One sample inf
9 pages
Sampling Distribution and Simulation in R
No ratings yet
Sampling Distribution and Simulation in R
10 pages
Chapter-8-Estimation & Hypothesis Testing
No ratings yet
Chapter-8-Estimation & Hypothesis Testing
12 pages
Chapter 1: Descriptive Statistics: Example 1: Making Steel Rods
No ratings yet
Chapter 1: Descriptive Statistics: Example 1: Making Steel Rods
20 pages
Chapter 1 - Descriptive Statistcs - L1 - Jan 2024
No ratings yet
Chapter 1 - Descriptive Statistcs - L1 - Jan 2024
13 pages
Week 1.: "All Models Are Wrong, But Some Are Useful" - George Box
No ratings yet
Week 1.: "All Models Are Wrong, But Some Are Useful" - George Box
7 pages
Chapter-8-Estimation & Hypothesis Testing
100% (1)
Chapter-8-Estimation & Hypothesis Testing
12 pages
Chapter 4 Data Description
No ratings yet
Chapter 4 Data Description
44 pages
Introduction To Inferential Statistics Sampling Distributions
No ratings yet
Introduction To Inferential Statistics Sampling Distributions
21 pages
Statistical Inference Point Estimators Estimating The Population Mean Using Confidence Intervals
No ratings yet
Statistical Inference Point Estimators Estimating The Population Mean Using Confidence Intervals
40 pages
Business Modelling Confidence Intervals: Prof Baibing Li BE 1.26 E-Mail: Tel 228841
No ratings yet
Business Modelling Confidence Intervals: Prof Baibing Li BE 1.26 E-Mail: Tel 228841
11 pages
Chapter 1 Statistics Review Sept20
No ratings yet
Chapter 1 Statistics Review Sept20
11 pages
POINT INTERVAL Estimates
No ratings yet
POINT INTERVAL Estimates
48 pages
Chemometrics
No ratings yet
Chemometrics
201 pages
Statics Chapter 8 88
No ratings yet
Statics Chapter 8 88
12 pages
Applied Maths-Unit5
No ratings yet
Applied Maths-Unit5
4 pages
Introduction To Statistics Part IV: Statistical Inference: Achim Ahrens Anna Babloyan Erkal Ersoy
No ratings yet
Introduction To Statistics Part IV: Statistical Inference: Achim Ahrens Anna Babloyan Erkal Ersoy
44 pages
Chapter 5 Statistics
No ratings yet
Chapter 5 Statistics
11 pages
Mean, Standard Deviation, and Counting Statistics
No ratings yet
Mean, Standard Deviation, and Counting Statistics
2 pages
02data Part2
No ratings yet
02data Part2
34 pages
sample distribution
No ratings yet
sample distribution
8 pages
Statistical Inference
No ratings yet
Statistical Inference
15 pages
Chap1SamplingDistributions
No ratings yet
Chap1SamplingDistributions
14 pages
Sampling Distribution
No ratings yet
Sampling Distribution
41 pages
Chapter 7_Point Estimation of Parameters and Sampling Distributions
No ratings yet
Chapter 7_Point Estimation of Parameters and Sampling Distributions
39 pages
Hypothesis Testing 23.09.2023
No ratings yet
Hypothesis Testing 23.09.2023
157 pages
Q3 Random Variables and Probability Distribution
No ratings yet
Q3 Random Variables and Probability Distribution
12 pages
Lecture 2.2 - Statistics - Desc Stat and Distrib
No ratings yet
Lecture 2.2 - Statistics - Desc Stat and Distrib
48 pages
Chapter 7 - Sampling Distributions CLT
No ratings yet
Chapter 7 - Sampling Distributions CLT
17 pages
FRM Part 1: Basic Statistics
No ratings yet
FRM Part 1: Basic Statistics
28 pages
ECON 361: Income & Inequality: Lecture 2: Review of Statistics
No ratings yet
ECON 361: Income & Inequality: Lecture 2: Review of Statistics
279 pages
Week+7+and+8+31+Aug+to+18+Sept+Sampling+distributions
No ratings yet
Week+7+and+8+31+Aug+to+18+Sept+Sampling+distributions
6 pages
Basic Concepts of Inference: Corresponds To Chapter 6 of Tamhane and Dunlop
No ratings yet
Basic Concepts of Inference: Corresponds To Chapter 6 of Tamhane and Dunlop
40 pages
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
No ratings yet
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
4 pages
Sampling Distributions: IPS Chapter 5
No ratings yet
Sampling Distributions: IPS Chapter 5
52 pages
Confidence interval and credintial interval
No ratings yet
Confidence interval and credintial interval
15 pages
Emgt 512 SP 2024
No ratings yet
Emgt 512 SP 2024
156 pages
BDU Biometrics
No ratings yet
BDU Biometrics
122 pages
of Bootstrap by Spida - 2010
No ratings yet
of Bootstrap by Spida - 2010
80 pages
SPC Training
No ratings yet
SPC Training
78 pages
Week 5 - Result and Analysis 1 (UP)
No ratings yet
Week 5 - Result and Analysis 1 (UP)
7 pages
Lecture 4 PDF
No ratings yet
Lecture 4 PDF
21 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
24 pages
Chapter 4A: Inferences Based On A Single Sample: Confidence Intervals
No ratings yet
Chapter 4A: Inferences Based On A Single Sample: Confidence Intervals
88 pages
Mathematical Foundations of Computer Science
No ratings yet
Mathematical Foundations of Computer Science
24 pages
Random Errors in Chemical Analysis: CHM028 Analytical Chemistry For Teachers
No ratings yet
Random Errors in Chemical Analysis: CHM028 Analytical Chemistry For Teachers
38 pages
Topic07 Wrriten
No ratings yet
Topic07 Wrriten
23 pages
3-Measures of Dispersion
No ratings yet
3-Measures of Dispersion
33 pages
MATH 403 Engineering Data Analysis 95 132
No ratings yet
MATH 403 Engineering Data Analysis 95 132
38 pages
Chapter 6
No ratings yet
Chapter 6
37 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Chapter 12 Applications: Terrain Mapping and Analysis
No ratings yet
Chapter 12 Applications: Terrain Mapping and Analysis
13 pages
PHP Set - 1: B. Program or Sequence of Instruction That Is Interpreted or Carried Out by Another Program
No ratings yet
PHP Set - 1: B. Program or Sequence of Instruction That Is Interpreted or Carried Out by Another Program
9 pages
Cebu
100% (1)
Cebu
76 pages
Pseudo Code and Flow Chart
0% (1)
Pseudo Code and Flow Chart
25 pages
7 Cladogramsandgenetics
No ratings yet
7 Cladogramsandgenetics
4 pages
Compare ISO 9001 AS9100c
100% (1)
Compare ISO 9001 AS9100c
1 page
4 Evaluation New PrEN 1591
No ratings yet
4 Evaluation New PrEN 1591
4 pages
Early - ID Issue - Paper2
No ratings yet
Early - ID Issue - Paper2
6 pages
ColaMid HPC
No ratings yet
ColaMid HPC
2 pages
Edgar Morin, Restricted Complexity, General Complexity
No ratings yet
Edgar Morin, Restricted Complexity, General Complexity
25 pages
Registration Card Details - ICSE 2026 (For The Children Presently Studying in Grade 9)
No ratings yet
Registration Card Details - ICSE 2026 (For The Children Presently Studying in Grade 9)
1 page
Memory - and - Nostalgia Svetlana Boyhm PDF
100% (1)
Memory - and - Nostalgia Svetlana Boyhm PDF
11 pages
Pengendalian Pencemaran
No ratings yet
Pengendalian Pencemaran
204 pages
Argumentative Essay Assessment Rubric
100% (1)
Argumentative Essay Assessment Rubric
1 page
Final Edited
No ratings yet
Final Edited
40 pages
2-Circle Drawing Algorithms
No ratings yet
2-Circle Drawing Algorithms
4 pages
Data Similarity
0% (1)
Data Similarity
18 pages
Facilitation of Training
No ratings yet
Facilitation of Training
5 pages
5100 HD
No ratings yet
5100 HD
25 pages
1st Quarter Week 8 WLP
No ratings yet
1st Quarter Week 8 WLP
15 pages
Speaking Naturally
100% (18)
Speaking Naturally
128 pages
Demo 1
No ratings yet
Demo 1
9 pages
The Sierra Leone Energy Sector: Prospects & Challenges
No ratings yet
The Sierra Leone Energy Sector: Prospects & Challenges
10 pages
Poly Vinyl Chloride
100% (2)
Poly Vinyl Chloride
16 pages
Activity 5. On Rosss Prima Facie Duty
No ratings yet
Activity 5. On Rosss Prima Facie Duty
3 pages
Ra Cocnats
No ratings yet
Ra Cocnats
13 pages
SANGEETHA
No ratings yet
SANGEETHA
32 pages
Anatomia Colenquima Stress PDF
No ratings yet
Anatomia Colenquima Stress PDF
16 pages
Q3 Sci 6 Quiz 1
No ratings yet
Q3 Sci 6 Quiz 1
2 pages
DLL All Subjects 2 q2 w9 d2
No ratings yet
DLL All Subjects 2 q2 w9 d2
10 pages

Lecture 7

Uploaded by

Lecture 7

Uploaded by

Statistical inference: CLT,

 Population parameter – the true value/true effect in

x The mean of the sample means.

x The standard deviation of the sample means.

x i  Var ( x) nVar ( x) Var ( x)

1. Pick any probability distribution and specify a mean

More on T-distributions next week!!

Standard error of the statistic.

This is called the

Small p-values mean the null value is unlikely given

Our data are so unlikely given the null hypothesis

The probability of seeing what you saw or something

P(empirical data/null hypothesis) <.0001

 A cut-off of p<.05 means that in about 5

3. Do a single experiment (observed sample mean = 63 nmol/L)

Null hypothesis: Average vitamin D is 100 nmol/L

Null hypothesis: Average vitamin D is 100 nmol/L

Even when the true

Our results could have

 8198 randomized to placebo

 Generated a lot of public discussion

Source: BBC news, http://news.bbc.co.uk/go/pr/fr/-/2/hi/health/8272113.stm

 95% CI (analysis 2): -.0003

 The plausible ranges are nearly

You might also like