Estimation

Uploaded by

Filimon Cheneke

Statistical Estimation

Desta M. (MPH)

Objective
• To know methods and principles of drawing
conclusions about a larger group (or population)
based on samples taken from that population

Introduction
• Descriptive statistics help investigators describe
and summarize data.

• Probability and sampling distribution concepts are
needed to evaluate data using statistical
methods.

• Inferential statistics are the statistical methods
used to draw conclusions from a sample and
make inferences about the entire population.

• The two primary methods for making inferences
are estimation and hypothesis testing.
Statistical Estimation
• Estimation is the process of determining a likely value
for a variable in the survey population, based on
information collected from the sample.

• Estimation is the use of sample statistics to estimate
population parameters.

• For example, a sample survey could be used to
produce any of the following statistics:

– estimates of the proportion of smokers among
all people aged 15 to 24 in the population;
– the mean level of a certain enzyme among
healthy men.
Parameter Estimations
• Population parameter: a numerical characteristic (usually unknown)
of the distribution of the variable of interest in a
population, such as μ, σ or π.

• Sample statistic: an estimate of a population
parameter obtained from a sample, such as x̄, s or p.
Example:
• A sample survey revealed:
― Proportion of smokers among a certain group of
population aged 15 to 24.
― Mean of SBP among sampled population
― Prevalence of HIV among people involved in the
study

→ The next question is what we can predict
about the characteristics of the
population from which the sample was
drawn.
Types of Estimates
• Point estimation: a single numerical value is used to
estimate the corresponding population parameter.
– x̄ is an estimator of the population mean μ.
– s is an estimator of the population standard deviation σ.
– p is an estimator of the population proportion π.

• The point estimate always lies within the interval
estimate built around it.
Point Estimation …
• From a single sample we can calculate a sample
statistic to estimate a single parameter (a point
estimate).
• The point estimate for the population mean μ is

$\bar{x} = \dfrac{\sum_{i=1}^{n} x_i}{n}$

• The point estimate for the population proportion π
is given by

$p = \dfrac{x}{n}$

• where x is the total number of successes (events) and n is the sample size.
Mean … Example
• A simple random sample (SRS) of 16 apparently healthy subjects yielded
the following values of urine excreted (mg per
day):
0.007, 0.03, 0.025, 0.008, 0.03, 0.038, 0.007,
0.005, 0.032, 0.04, 0.009, 0.014, 0.011, 0.022,
0.009, 0.008

• Compute the point estimate of the population mean.

If x_1, x_2, ..., x_n are the n observed values, then

$\bar{x} = \dfrac{\sum_{i=1}^{n} x_i}{n} = \dfrac{0.295}{16} \approx 0.01844$
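The arithmetic in this example can be checked with a short Python sketch. The variable names are illustrative only; the 16 values are copied from the slide.

```python
# Urine excretion values (mg per day) for the 16 subjects, copied from the slide.
values = [
    0.007, 0.03, 0.025, 0.008, 0.03, 0.038, 0.007, 0.005,
    0.032, 0.04, 0.009, 0.014, 0.011, 0.022, 0.009, 0.008,
]

n = len(values)      # sample size
total = sum(values)  # sum of the observations
x_bar = total / n    # point estimate of the population mean

print(f"n = {n}, sum = {total:.3f}, mean = {x_bar:.5f}")
# prints: n = 16, sum = 0.295, mean = 0.01844
```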
Proportion … Example
• In a survey of 300 automobile drivers in one city,
123 reported that they wear seat belts regularly.

• Estimate the rate of seat belt use in the city.

• Answer: p = 123/300 = 0.41 = 41%
Interval estimation
• Interval estimation is a statement that a population
parameter has a value lying between two specified limits.

• The value of a sample statistic varies from sample to
sample, so reporting only a single value as the estimate
of the parameter is generally not enough.

• We need to take into account the sample-to-sample
variation of the statistic.
• A confidence interval defines an interval within which
the true population parameter is likely to fall (interval
estimate).
Interval estimation …

• Interval estimate (confidence interval): consists
of two numbers, a lower limit and an upper limit,
which serve as the bounding values within which
the parameter is expected to lie with a certain
degree of confidence.
• An interval estimate:
– takes into consideration variation in sample
statistics from sample to sample
– provides a range of values based on observations
from one sample
– gives information about closeness to the unknown
population parameter
– is stated in terms of probability
– is never 100% sure
Interval estimation …
• Two questions arise when we put bounds on our point
estimate to reflect our level of confidence:
– How wide does the bracket have to be?
– What is our tolerance of error
(variability, not mistake)?
• Scientists usually accept a 5% chance that the
range will not include the true population value.
– This range or interval is called the 95%
confidence interval.
• However, 90% and 99% confidence intervals are
sometimes used.
Factors Affecting Interval Width
• Level of confidence

– A 90% CI is narrower than a 95% CI, since we are
only 90% certain that the interval includes the
population parameter.

– A 99% CI is wider than a 95% CI; the extra width
means that we can be more certain that the
interval will contain the population parameter.
Factors Affecting Interval Width …

• To obtain higher confidence from the same
sample, we must be willing to accept a larger margin of
error (a wider interval).

• For a given confidence level (e.g. 90%, 95%, 99%), the
width of the confidence interval depends on the standard
error of the estimate, which in turn depends on the:

A. Sample size: the larger the sample size, the
narrower the confidence interval and the more
precise our estimate.
Factors Affecting Interval Width …
• You can make the precision as high as you
want by taking a large enough sample.
• The margin of error decreases as √n
increases.

B. Standard deviation: the more the
variation among the individual values, the
wider the confidence interval and the less
precise the estimate.
• As sample size increases, the standard error (SD/√n) decreases;
the SD itself does not shrink with sample size.
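As a rough illustration of these factors, the sketch below computes the margin of error z × SD/√n of a z-based interval for a mean at several confidence levels and sample sizes. The function name `margin_of_error` and the value SD = 20 are assumptions chosen for illustration, not values from the slides.

```python
from math import sqrt
from statistics import NormalDist


def margin_of_error(sd, n, confidence=0.95):
    """Half-width z * sd / sqrt(n) of a z-based confidence interval for a mean."""
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value
    return z * sd / sqrt(n)


sd = 20.0  # hypothetical standard deviation, for illustration only
for confidence in (0.90, 0.95, 0.99):
    for n in (25, 100, 400):
        moe = margin_of_error(sd, n, confidence)
        print(f"confidence = {confidence:.2f}, n = {n:3d}, margin of error = {moe:.2f}")
```

In this sketch, quadrupling n halves the margin of error, and raising the confidence level widens it, which is the pattern the bullets describe.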
Confidence Intervals for:

• A single population mean

• A single population proportion
1) C.I. for a population mean
(normally distributed)
A) Known variance (large sample size)

• A 100(1-α)% C.I. for μ is

$\bar{x} \pm z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}}$

• α is chosen by the researcher; the most common
values of α are 0.05, 0.01, 0.001 and 0.1.
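• 100(1-α)% CI for μ when σ is known (sampling
from a normal population, or a large sample):

$\left(\bar{x} - z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}},\ \bar{x} + z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}}\right)$

• The 95% confidence interval is interpreted as follows:
under the conditions assumed for the
underlying distribution, you are 95% confident that
the interval contains the true population mean μ.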
Example
• A physical therapist wished to estimate, with 99%
confidence, the mean maximal strength of a
particular muscle in a certain group of individuals.

• He assumes that strength scores are approximately
normally distributed with a variance of 144.
• A sample of 15 subjects who participated in the
experiment yielded a mean of 84.3.

• Solution: σ = √144 = 12, n = 15 and z for 99% confidence is 2.58, so

$84.3 \pm 2.58 \times \dfrac{12}{\sqrt{15}} = 84.3 \pm 8.0$

⇒ We are 99% confident that the population mean
is between 76.3 and 92.3.
Example
• Data on systolic blood pressure from 199 patients
give a mean value of 125.8 mmHg. Let us
assume that the standard deviation for this
patient population is known to be 20 mmHg.

• Construct a 95 percent confidence interval for
the population mean.
• Solution
• α = 0.05 ⇒ z_{α/2} = 1.96

$125.8 \pm 1.96 \times \dfrac{20}{\sqrt{199}} = 125.8 \pm 2.8$

• The 95% CI is (123.0, 128.6) mmHg.

• We are 95% sure that the average systolic blood
pressure for similar patients is between 123.0 and
128.6 mmHg.
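The systolic blood pressure interval can be reproduced with a minimal Python sketch of the known-σ formula mean ± z × σ/√n. The helper name `z_interval` is hypothetical; the inputs (125.8, 20, 199) are the figures given in the example.

```python
from math import sqrt
from statistics import NormalDist


def z_interval(mean, sigma, n, confidence=0.95):
    """Confidence interval for a mean when the population SD (sigma) is known."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # about 1.96 for 95%
    margin = z * sigma / sqrt(n)
    return mean - margin, mean + margin


# Figures from the systolic blood pressure example.
lower, upper = z_interval(mean=125.8, sigma=20, n=199, confidence=0.95)
print(f"95% CI: ({lower:.1f}, {upper:.1f}) mmHg")
# prints: 95% CI: (123.0, 128.6) mmHg
```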
B) Unknown variance (small sample size,
n < 30)
A 100(1-α)% C.I. for μ is

$\bar{x} \pm t_{\alpha/2,\,n-1}\,\dfrac{s}{\sqrt{n}}$

• The t distribution density curve is bell-shaped and
symmetrical about zero.
• There is a different curve for each df (i.e. each sample size),
and for very large df the curve is very close to Z.
The Z-test is applied when:
• the distribution is normal and the population
standard deviation σ is known, or

• the sample size n is large (n ≥ 30) and σ is unknown
(taking s as the estimator of σ).
But what happens when n < 30 and σ is unknown?

• We use a t-distribution, which depends on the number of
degrees of freedom (df).

• The distribution is symmetrical, bell-shaped and similar to
the normal, but more spread out.

• The sample standard deviation is used as an estimate of σ
(the standard deviation of the population, which is unknown)
and appears to be a logical substitute.

• For large sample sizes (n ≥ 30), the t and Z curves are so
close together that it does not much matter which you use.
Degrees of Freedom
• Defined as the number of values that are free
to vary after imposing a certain restriction on the
data.

Example: If 3 scores have a mean of 10, how many of
the scores can be freely chosen?

Solution: The first and the second scores can be
chosen freely (e.g., 8 and 12, 9 and 5, 7 and 15).

But the third score is then fixed, because the three scores
must sum to 30 (10, 16 and 8, respectively).

• Hence, there are two degrees of freedom.
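A small Python sketch of this example: once the mean is fixed at 10, the third score is fully determined by the first two. The function name `third_score` is hypothetical.

```python
def third_score(first, second, mean=10, n=3):
    """Return the only value the last score can take once the mean is fixed."""
    required_total = mean * n  # three scores with mean 10 must sum to 30
    return required_total - first - second


# The freely chosen pairs from the example, each forcing a single third score.
for first, second in [(8, 12), (9, 5), (7, 15)]:
    print(first, second, third_score(first, second))
```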
• Example
• In a study of preeclampsia, Kaminski and Rechberger
found the mean systolic blood pressure of 10 healthy,
nonpregnant women to be 119 with a standard
deviation of 2.1.
A. What is the estimated standard error of the
mean?

B. Construct the 99% confidence interval for the
mean of the population from which the 10
subjects may be presumed to be a random
sample.

C. What is the precision of the estimate?

D. What assumptions are necessary for the validity
of the confidence interval you constructed?
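Solution:
A. Estimated standard error = s/√n = 2.1/√10 ≈ 0.664
B. With df = 9, t = 3.250, so the 99% CI is
119 ± 3.250 × 0.664 = 119 ± 2.16, i.e. (116.84, 121.16)
C. Precision = 3.250 × 0.664
≈ 2.16
D. The population is normally distributed, and the
10 subjects represent a random sample from this
population.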
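The parts of this example can be checked with a short Python sketch. The critical value 3.250 is taken from the precision calculation on the slide (the t-table value for df = 9 at 99% confidence) rather than computed, since the Python standard library has no t distribution; the variable names are illustrative.

```python
from math import sqrt

n = 10          # sample size
mean = 119.0    # sample mean systolic blood pressure
s = 2.1         # sample standard deviation
t_crit = 3.250  # t-table value for df = n - 1 = 9, 99% confidence (from the slide)

se = s / sqrt(n)                # A. estimated standard error of the mean
precision = t_crit * se         # C. precision (margin of error)
lower = mean - precision        # B. lower limit of the 99% CI
upper = mean + precision        # B. upper limit of the 99% CI

print(f"SE = {se:.3f}")
print(f"precision = {precision:.2f}")
print(f"99% CI: ({lower:.2f}, {upper:.2f})")
# prints: SE = 0.664, precision = 2.16, 99% CI: (116.84, 121.16) on three lines
```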
2) C.I. for a population proportion
(large sample size)

• A 100(1-α)% C.I. for π is

$p \pm z_{\alpha/2}\sqrt{\dfrac{p(1-p)}{n}}$
• Example
• A research study obtained data regarding sexual
behavior from a sample of unmarried men and
women between the ages of 20 and 44 residing
in geographic areas characterized by high rates
of sexually transmitted diseases and admission to
drug programs. Forty percent of 1229
respondents reported that they never used a
condom.

• Construct a 95 percent confidence interval for
the population proportion never using a condom.
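A worked sketch of this interval, assuming the large-sample formula p ± z√(p(1-p)/n) with z = 1.96 (the same form used in the seat belt example), would run as follows:

```latex
p = 0.40, \qquad n = 1229
\\
SE = \sqrt{\frac{p(1-p)}{n}} = \sqrt{\frac{0.40 \times 0.60}{1229}} \approx 0.014
\\
p \pm 1.96 \times SE = 0.40 \pm 0.027 \;\Rightarrow\; (0.37,\ 0.43)
```

Under these assumptions, between about 37% and 43% of the population would be estimated never to use a condom, with 95% confidence.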
Example

• In a survey of 300 automobile drivers in one city,
123 reported that they wear seat belts regularly.
Estimate the rate of seat belt use in the city and the 95%
confidence interval for the true population
proportion.

• Answer: p = 123/300 = 0.41 = 41%, n = 300.

The 95% CI for the rate of seat belt use in the city is

$p \pm z\sqrt{\dfrac{p(1-p)}{n}} = 0.41 \pm 1.96\sqrt{\dfrac{0.41 \times 0.59}{300}} = (0.35,\ 0.47)$
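The seat belt interval can be reproduced with a minimal Python sketch of the large-sample interval p ± z√(p(1-p)/n). The helper name `proportion_interval` is hypothetical; the counts 123 and 300 come from the example.

```python
from math import sqrt
from statistics import NormalDist


def proportion_interval(successes, n, confidence=0.95):
    """Large-sample confidence interval for a population proportion."""
    p = successes / n                                   # point estimate
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # about 1.96 for 95%
    margin = z * sqrt(p * (1 - p) / n)
    return p, p - margin, p + margin


# Seat belt survey: 123 of 300 drivers wear seat belts regularly.
p, lower, upper = proportion_interval(123, 300)
print(f"p = {p:.2f}, 95% CI: ({lower:.2f}, {upper:.2f})")
# prints: p = 0.41, 95% CI: (0.35, 0.47)
```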
