Point Estimation (Module 2)
Statistics (MAST20005) & Elements of Statistics (MAST90058)
Semester 2, 2018
Contents
1 Estimation & sampling distributions
2 Estimators
3 Method of moments
4 Maximum likelihood
On a particular street, we measure the time interval (in minutes) between each car that passes:
2.55 2.13 3.18 5.94 2.29 2.41 8.72 3.71
We believe these follow an exponential distribution:
Xi ∼ Exp(λ)
What can we say about λ?
Can we approximate it from the data?
Yes! We can do it using a statistic. This is called estimation.
Distributions of statistics
For a sample of size n = 100 from Exp(λ):
X_{(1)} ∼ Exp(100λ),    Σ_{i=1}^{100} Xi ∼ Gamma(100, λ)
How to estimate?
Suppose we parameterise the exponential by its mean, θ = E(Xi) = 1/λ, so that sd(Xi) = θ as well. Can we use the sample mean, X̄, as an estimate of θ? Yes!
Can we use the sample standard deviation, S, as an estimate of θ? Yes!
Will these statistics be good estimates? Which one is better? Let’s see. . .
We need to know properties of their sampling distributions, such as their mean and variance.
Note: we are referring to the distribution of the statistic, T , rather than the population distribution from which we
draw samples, X.
For example, it is natural to expect that:
• E(X̄) ≈ µ (sample mean ≈ population mean)
• E(S²) ≈ σ² (sample variance ≈ population variance)
Let’s see for our example:
[Figure: Left: sampling distribution of X̄. Right: sampling distribution of S². Vertical dashed lines mark the true values, E(X) = 5 and var(X) = 5² = 25.]
• Should we use X̄ or S to estimate θ? Which one is the better estimator?
• We would like the sampling distribution of the estimator to be concentrated as closely as possible around the true value θ = 5.
• In practice, for any given dataset, we don’t know which estimate is the closest, since we don’t know the true
value.
• We should use the one that is more likely to be the closest.
• Simulation: consider 250 samples of size n = 100 and compute:
x̄1 , . . . , x̄250 ,
s1 , . . . , s250
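A minimal R sketch of this simulation (the exponential rate 1/5, chosen so that θ = E(Xi) = 5, and the seed are assumptions):

set.seed(1)                                              # assumed seed, for reproducibility
theta <- 5                                               # true mean of the exponential
samples <- replicate(250, rexp(100, rate = 1 / theta))   # 250 samples of size n = 100
x.bar <- colMeans(samples)                               # the 250 sample means
s <- apply(samples, 2, sd)                               # the 250 sample standard deviations
summary(x.bar); sd(x.bar)
summary(s); sd(s)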
> summary(x.bar)
Min. 1st Qu. Median Mean 3rd Qu. Max.
3.789 4.663 4.972 5.015 5.365 6.424
> sd(x.bar)
[1] 0.4888185
> summary(s)
Min. 1st Qu. Median Mean 3rd Qu. Max.
3.502 4.473 4.916 5.002 5.512 7.456
> sd(s)
[1] 0.7046119
From our simulation, sd(X̄) ≈ 0.49 and sd(S) ≈ 0.70. So, in this case it looks like X̄ is superior to S.
2 Estimators
Definitions
• A parameter is a quantity that describes the population distribution, e.g. µ and σ² for N(µ, σ²).
• The parameter space is the set of all possible values that a parameter might take, e.g. −∞ < µ < ∞ and
0 ≤ σ < ∞.
• An estimator (or point estimator) is a statistic that is used to estimate a parameter. It refers specifically to the
random variable version of the statistic, e.g. T = u(X1 , . . . , Xn ).
• An estimate (or point estimate) is the observed value of the estimator for a given dataset. In other words, it is
a realisation of the estimator, e.g. t = u(x1 , . . . , xn ), where x1 , . . . , xn is the observed sample (data).
• ‘Hat’ notation: If T is an estimator for θ, then we usually refer to it by θ̂ for convenience.
Examples
Sample mean
X̄ = (1/n)(X1 + X2 + ⋯ + Xn) = (1/n) Σ_{i=1}^{n} Xi
Properties:
• E(X̄) = µ
• var(X̄) = σ²/n
Sample variance
S² = (1/(n − 1)) Σ_{i=1}^{n} (Xi − X̄)²
Properties:
• E(S²) = σ²
• var(S²) = (a messy formula)
Often used to estimate the population variance: σ̂² = S².
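As a quick check in R, the built-in var() uses the n − 1 denominator (here applied to the car-interval data from the start of the module):

x <- c(2.55, 2.13, 3.18, 5.94, 2.29, 2.41, 8.72, 3.71)   # observed time intervals
n <- length(x)
var(x)                                  # sample variance, S^2 (divides by n - 1)
sum((x - mean(x))^2) / (n - 1)          # the same value, computed directly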
Sample proportion
For a discrete random variable, we might be interested in how often a particular value appears. Counting this gives
the sample frequency:
freq(a) = Σ_{i=1}^{n} I(Xi = a)
freq(a) ∼ Bi(n, p), where p = Pr(Xi = a)
Divide by the sample size to get the sample proportion. This is often used as an estimator for the population proportion:
p̂ = freq(a)/n = (1/n) Σ_{i=1}^{n} I(Xi = a)
Note:
• The sample pmf and the sample proportion are the same thing: both estimate the probability of a given event or set of events.
• The pmf is usually used when the interest is in many different events/values, and is written as a function, e.g.
p̂(a).
• The proportion is usually used when only a single event is of interest (getting heads for a coin flip, a certain
candidate winning an election, etc.).
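In R, the sample proportion is simply the mean of an indicator; a small sketch (the 0/1 data vector below is hypothetical):

x <- c(1, 0, 0, 1, 1, 0, 1, 1)          # hypothetical sample (e.g. coin flips, 1 = heads)
a <- 1
freq.a <- sum(x == a)                   # sample frequency of the value a
p.hat <- freq.a / length(x)             # sample proportion; equivalently mean(x == a)
p.hat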
If the sample is drawn from a normal distribution, Xi ∼ N(µ, σ 2 ), we can derive exact distributions for these statistics.
Sample mean:
X̄ ∼ N(µ, σ²/n)
Sample variance:
S² ∼ (σ²/(n − 1)) χ²_{n−1}
E(S²) = σ²,    var(S²) = 2σ⁴/(n − 1)
χ²_k is the chi-squared distribution with k degrees of freedom (more details in Module 3).
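A small simulation sketch to check these results (the values µ = 0, σ = 2, n = 10 and the number of replicates are assumptions):

mu <- 0; sigma <- 2; n <- 10
sims <- replicate(10000, {x <- rnorm(n, mu, sigma); c(mean(x), var(x))})
mean(sims[1, ]); var(sims[1, ])   # compare with mu and sigma^2 / n
mean(sims[2, ]); var(sims[2, ])   # compare with sigma^2 and 2 * sigma^4 / (n - 1)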
Bias
Consider an estimator θ̂ of θ.
• If E(θ̂) = θ, the estimator is said to be unbiased
• The bias of the estimator is, E(θ̂) − θ
Examples:
• The sample variance is unbiased for the population variance, E(S 2 ) = σ 2 . (problem 5 in week 3 tutorial)
• What if we divide by n instead of n − 1 in the denominator?
E(((n − 1)/n) S²) = ((n − 1)/n) σ² < σ²  ⇒  biased!
In general, if θ̂ is unbiased for θ, then it will usually be the case that g(θ̂) is biased for g(θ).
Unbiasedness is not preserved under transformations.
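A quick simulation sketch of this bias (the N(0, σ² = 4) population and n = 10 are assumptions):

sigma2 <- 4; n <- 10
v1 <- replicate(10000, var(rnorm(n, 0, sqrt(sigma2))))   # S^2, divides by n - 1
v2 <- (n - 1) / n * v1                                   # divides by n instead
mean(v1)   # approximately sigma2 = 4 (unbiased)
mean(v2)   # approximately (n - 1) / n * sigma2 = 3.6 (biased downwards)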
Challenge problem
Is the sample standard deviation, S = √(S²), biased for the population standard deviation, σ?
Choosing between estimators
• Evaluate and compare the sampling distributions of the estimators.
• Generally, prefer estimators that have smaller bias and smaller variance (how these are traded off can depend on the aim of your problem).
• Sometimes, we only know asymptotic properties of estimators (will see examples later).
Note: this approach to estimation is referred to as frequentist or classical inference. The same is true for most of the
techniques we will cover. We will also learn about an alternative approach, called Bayesian inference, later in the
semester.
Take a random sample of size n from the uniform distribution with pdf:
f(x) = 1   (θ − 1/2 < x < θ + 1/2)
Can you think of some estimators for θ? What is their bias and variance?
Take a random sample of size n from the shifted exponential distribution, with pdf:
f(x | θ) = e^{−(x−θ)}   (x > θ)
Equivalently:
Xi ∼ θ + Exp(1)
Can you think of some estimators for θ? What is their bias and variance?
3 Method of moments
Method of moments (MM)
• Idea:
– Make the population distribution resemble the empirical (data) distribution. . .
– . . . by equating theoretical moments with sample moments
– Do this until you have enough equations, and then solve them
• Example: if E(X̄) = θ, then the method of moments estimator of θ is X̄.
• General procedure (for r parameters):
1. X1, . . . , Xn i.i.d. f(x | θ1, . . . , θr).
2. The kth theoretical moment is E(X^k).
3. The kth sample moment is Mk = (1/n) Σ_{i=1}^{n} Xi^k.
4. Equate E(X^k) = Mk for k = 1, . . . , r and solve for θ1, . . . , θr.
Remarks
• An intuitive approach to estimation
• Can work in situations where other approaches are too difficult
• Usually biased
• Usually not optimal (but may suffice)
• Note: some authors use a ‘bar’ (θ̄) or a ‘tilde’ (θ̃) to denote MM estimators rather than a ‘hat’ (θ̂). This helps to distinguish different estimators when comparing them to each other.
For instance, equating the first two moments to estimate a mean and variance gives µ̃ = M1 = X̄ and σ̃² = M2 − M1² = ((n − 1)/n) S².
Note:
• This is not the usual sample variance!
• σ̃² = ((n − 1)/n) S²
• This one is biased: E(σ̃²) = ((n − 1)/n) σ² ≠ σ².
Example: for a sample from a Gamma distribution with shape α and scale θ, we have E(X) = αθ and var(X) = αθ². Equating these with their sample counterparts:
X̄ = αθ   and   S² = αθ²
Solving these gives:
θ̃ = S²/X̄   and   α̃ = X̄²/S²
Note:
• This is an example of using S² instead of M2.
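A brief R sketch of these method-of-moments estimates (the simulated Gamma sample, with shape α = 2 and scale θ = 3, is an assumption):

x <- rgamma(200, shape = 2, scale = 3)   # simulated data; true alpha = 2, theta = 3
theta.mm <- var(x) / mean(x)             # theta-tilde = S^2 / x-bar
alpha.mm <- mean(x)^2 / var(x)           # alpha-tilde = x-bar^2 / S^2
c(alpha.mm, theta.mm)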
4 Maximum likelihood
Example: Bernoulli distribution. Suppose X1, . . . , Xn are i.i.d. Bernoulli(p) and we observe the values x1, . . . , xn (each 0 or 1).
• Regard the sample x1, . . . , xn as known (since we have observed it) and regard the probability of the data as a function of p.
• When written this way, this is called the likelihood of p:
L(p) = L(p | x1, . . . , xn)
     = Pr(X1 = x1, . . . , Xn = xn | p)
     = p^{Σxi} (1 − p)^{n − Σxi}
• It is usually easier to maximise the logarithm of the likelihood, the log-likelihood. The final answer (the maximising value of p) is the same, since the logarithm is a strictly increasing function, so any value that maximises the log-likelihood also maximises the likelihood.
• Putting x = Σ_{i=1}^{n} xi, so that x is the number of 1’s in the sample,
ln L(p) = x ln p + (n − x) ln(1 − p)
• Find the maximum of this log-likelihood with respect to p by differentiating and equating to zero:
∂ ln L(p)/∂p = x/p − (n − x)/(1 − p) = 0
Solving gives p̂ = x/n, the sample proportion of 1’s.
[Figure: log-likelihood curves ln L(p) for n = 100, with x = 50, x = 40 and x = 80.]
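A minimal R sketch that draws curves like those in the figure (how the original figure was produced is an assumption):

n <- 100
p <- seq(0.01, 0.99, by = 0.01)
loglik <- function(x, p) x * log(p) + (n - x) * log(1 - p)   # Bernoulli log-likelihood
plot(p, loglik(50, p), type = "l", ylim = c(-300, -50), ylab = "log likelihood(p)")
lines(p, loglik(40, p), lty = 2)
lines(p, loglik(80, p), lty = 3)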
Example: Exponential distribution with mean λ, i.e. f(x | λ) = (1/λ) e^{−x/λ}. Differentiating the log-likelihood and equating to zero:
∂ ln L(λ)/∂λ = −n/λ + (Σ_{i=1}^{n} xi)/λ² = 0
This gives: λ̂ = X̄.
[Figures: ‘Log-likelihood curve’ — the curve log(L) for one observed sample, with its maximising value (the MLE) marked; ‘Log-likelihood curves’ — the corresponding curves for many repeated samples, each with its maximising value marked.]
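A short R sketch of such a curve for the exponential (mean λ) example; the simulated sample is an assumption:

x <- rexp(30, rate = 1 / 5)                      # assumed sample of size 30, true mean 5
loglik <- function(lam) -length(x) * log(lam) - sum(x) / lam
lam <- seq(2, 12, by = 0.1)
plot(lam, sapply(lam, loglik), type = "l", xlab = "lambda", ylab = "log(L)")
abline(v = mean(x), lty = 2)                     # the MLE, lambda-hat = x-bar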
Example: Normal distribution, N(θ1, θ2), with both the mean θ1 and the variance θ2 unknown. The log-likelihood is:
ln L(θ1, θ2) = −(n/2) ln(2πθ2) − (1/(2θ2)) Σ_{i=1}^{n} (xi − θ1)²
Take partial derivatives with respect to θ1 and θ2 .
∂ ln L(θ1, θ2)/∂θ1 = (1/θ2) Σ_{i=1}^{n} (xi − θ1)
∂ ln L(θ1, θ2)/∂θ2 = −n/(2θ2) + (1/(2θ2²)) Σ_{i=1}^{n} (xi − θ1)²
Set both of these to zero and solve. This gives θ̂1 = x̄ and θ̂2 = (1/n) Σ_{i=1}^{n} (xi − x̄)². The maximum likelihood estimators are therefore:
θ̂1 = X̄,    θ̂2 = (1/n) Σ_{i=1}^{n} (Xi − X̄)² = ((n − 1)/n) S²
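A short R sketch checking these formulas numerically with optim() (the simulated sample and the log-variance parameterisation are assumptions):

x <- rnorm(50, mean = 10, sd = 2)                  # assumed sample
negloglik <- function(par) {                       # par = c(theta1, log(theta2)), keeps theta2 > 0
  -sum(dnorm(x, mean = par[1], sd = sqrt(exp(par[2])), log = TRUE))
}
fit <- optim(c(0, 0), negloglik)                   # numerical minimisation
c(fit$par[1], exp(fit$par[2]))                     # numerical MLEs of theta1 and theta2
c(mean(x), (length(x) - 1) / length(x) * var(x))   # closed-form MLEs for comparison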
[Figure: Left: normal Q-Q plot of a sample (sample quantiles against theoretical quantiles). Right: contour plot over the parameters (µ, σ).]
Take a random sample of size n from the shifted exponential distribution, with pdf:
f(x | θ) = e^{−(x−θ)}   (x > θ)
Equivalently:
Xi ∼ θ + Exp(1)
Derive the MLE for θ. Is it biased? Can you create an unbiased estimator from it?
Invariance property
Suppose we know θ̂ but are actually interested in φ = g(θ) rather than θ itself. Can we estimate φ?
Yes! It is simply φ̂ = g(θ̂). For example, since the MLE of the exponential mean λ is λ̂ = X̄, the MLE of the rate 1/λ is 1/X̄.
This is known as the invariance property of the MLE. In other words, the MLE of a transformed parameter is the transform of the MLE.
Consequence: MLEs are usually biased since expectations are not invariant under transformations.