0% found this document useful (0 votes)

33 views9 pages

One Sample Inf

Uploaded by

burukg473

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views9 pages

One Sample Inf

Uploaded by

burukg473

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Lecture notes on Biostatistics and ED One Sample Inference

ONE SAMPLE INFERENCE

ESTIMATION AND HYPOTHESIS TESTING
 Inference is the process of making interpretations or conclusions from sample data for the
totality of the population.
 It is only the sample data that is ready for inference.
 In statistics there are two ways though which inference can be made.
 Statistical estimation
 Statistical hypothesis testing.

Inference Analyzed
Population
Data

Numerical
Sample
data

Data analysis is the process of extracting relevant information from the summarized data.
Statistical Estimation
This is one way of making inference about the population parameter where the investigator
does not have any prior notion about values or characteristics of the population parameter.
There are two ways estimation.
1) Point Estimation
It is a procedure that results in a “single value as an estimate for a parameter.
2) Interval estimation
It is the procedure that results in the interval of values as an estimate for a parameter,
which is interval that contains the likely values of a parameter. It deals with identifying
the upper and lower limits of a parameter. The limits by themselves are random variable.
Definitions
Confidence Interval: An interval estimate with a specific level of confidence
Confidence Level: The percent of the time that the true value will lie in the interval
estimate given.
Consistent Estimator: An estimator which gets closer to the value of the parameter as the
sample size increases.
Degrees of Freedom: The number of data values which are allowed to vary once a
statistic has been determined.
Estimator: A sample statistic which is used to estimate a population parameter. It must be
unbiased, consistent, and relatively efficient.
Estimate: Is the different possible values which an estimator can assumes.

Page 1 of 9
Lecture notes on Biostatistics and ED One Sample Inference

Interval Estimate: A range of values used to estimate a parameter.

Point Estimate: A single value used to estimate a parameter.
Point and Interval estimation of the population mean: µ
a. Point Estimation
Another term for statistic is point estimate, since we are estimating the parameter value. A
point estimator is the mathematical way we compute the point estimate. For instance, sum of
xi over n is the point estimator used to compute the estimate of the population means,

 xi
 .That is X  is a point estimator of the population mean.
n
b. Confidence interval estimation of the population mean
Although X possesses nearly all the qualities of a good estimator, because of sampling error,
we know that it's not likely that our sample statistic will be equal to the population parameter,
but instead will fall into an interval of values. We will have to be satisfied knowing that the
statistic is "close to" the parameter. That leads to the obvious question, what is "close"?

We can phrase the latter question differently: How confident can we be that the value of the
statistic falls within a certain "distance" of the parameter? Or, what is the probability that the
parameter's value is within a certain range of the statistic's value? This range is the confidence
interval.
The confidence level is the probability that the value of the parameter falls within the range
specified by the confidence interval surrounding the statistic.
There are different cases to be considered to construct confidence intervals.
Case 1: If sample size is large or if the population is normal with known variance
Recall the Central Limit Theorem, which applies to the sampling distribution of the mean of a
sample. Consider samples of size n drawn from a population, whose mean is  and standard
deviation is  with replacement and order important. The population can have any frequency
distribution. The sampling distribution of X will have a mean  x   and a standard

deviation  x  , and approaches a normal distribution as n gets large. This allows us to
n
use the normal distribution curve for computing confidence intervals.
X 
Z  has a normal distribution with mean  0 and var iance  1
 n
   X  Z n
 X , where  is a measure of error.
  Z n

- For the interval estimator to be good the error should be small. How it be small?

Page 2 of 9
Lecture notes on Biostatistics and ED One Sample Inference

 By making n large
 Small variability
 Taking Z small
- To obtain the value of Z, we have to attach this to a theory of chance. That is, there is an area of
size 1   such that
P (  Z 2  Z  Z 2 )  1  
Where   is the probability that the parameter lies outside the int erval
Z 2  s tan ds for the s tan dard normal var iable to the right of which
 2 probability lies, i.e P ( Z  Z 2 )   2

X 
 P( Z 2   Z 2 )  1  
 n
 P( X  Z 2  n    X  Z 2  n)  1  

 ( X  Z 2  n , X  Z 2  n ) is a1001   % conifidence int erval for 

But usually 
2
is not known, in that case we estimate by its point estimator S 2

 ( X  Z 2 S n , X  Z 2 S n ) is a1001   % conifidence int erval for 

Here are the Z values corresponding to the most commonly used confidence levels.

100(1   ) %   2 Z 2
90 0.10 0.05 1.645
95 0.05 0.025 1.96
99 0.01 0.005 2.58

Case 2: If sample size is small and the population variance,  2 is not known.
X 
t has t distribution with n  1 deg rees of freedom.
S n

Page 3 of 9
Lecture notes on Biostatistics and ED One Sample Inference

 ( X  t 2 S n , X  t 2 S n ) is a1001   % conifidence int erval for 

The unit of measurement of the confidence interval is the standard error. This is just the
standard deviation of the sampling distribution of the statistic.
Examples:
1. From a normal sample of size 25 a mean of 32 was found .Given that the population
standard deviation is 4.2. Find
a) A 95% confidence interval for the population mean.
b) A 99% confidence interval for the population mean.
Solution:
a)
X  32,   4.2, 1    0.95    0.05,  2  0.025
 Z 2  1.96 from table.
 The required int erval will be X  Z 2  n
 32  1.96 * 4.2 25
 32  1.65
 (30.35, 33.65)

X  32,   4.2, 1    0.99    0.01,  2  0.005

 Z 2  2.58 from table.
 The required int erval will be X  Z 2  n
 32  2.58 * 4.2 25
 32  2.17
 (29.83, 34.17)

2. A drug company is testing a new drug which is supposed to reduce blood pressure. From
the six people who are used as subjects, it is found that the average drop in blood pressure
is 2.28 points, with a standard deviation of .95 points. What is the 95% confidence interval
for the mean change in pressure?

Page 4 of 9
Lecture notes on Biostatistics and ED One Sample Inference

Solution:

X  2 .28 , S  0 .95 , 1    0 .95    0 .05 ,  2  0 .025

 t 2  2 .571 with df  5 from table .
 The required int erval will be X  t 2 S n
 2 .28  2 .571 * 0 .95 6
 2 .28  1 .008
 (1 .28 , 3 .28 )
That is, we can be 95% confident that the mean decrease in blood pressure is between 1.28 and 3.28
points.

Hypothesis Testing
This is also one way of making inference about population parameter, where the investigator has
prior notion about the value of the parameter.
Definitions:
 Statistical hypothesis: is an assertion or statement about the population whose plausibility is
to be evaluated on the basis of the sample data.
 Test statistic: is a statistics whose value serves to determine whether to reject or accept the
hypothesis to be tested. It is a random variable.
 Statistic test: is a test or procedure used to evaluate a statistical hypothesis and its value
depends on sample data.
There are two types of hypothesis:
Null hypothesis:
- It is the hypothesis to be tested.
- It is the hypothesis of equality or the hypothesis of no difference.
- Usually denoted by H0.
Alternative hypothesis:
- It is the hypothesis available when the null hypothesis has to be rejected.
- It is the hypothesis of difference.
- Usually denoted by H1 or Ha.
Types and size of errors:
- Testing hypothesis is based on sample data which may involve sampling and non
sampling errors.
- The following table gives a summary of possible results of any hypothesis test:

Page 5 of 9
Lecture notes on Biostatistics and ED One Sample Inference

Decision
Reject H0 Don't reject H0
H0 Type I Error Right Decision
Truth
H1 Right Decision Type II Error

- Type I error: Rejecting the null hypothesis when it is true.

- Type II error: Failing to reject the null hypothesis when it is false.
NOTE:
1. There are errors that are prevalent in any two choice decision making problems.
2. There is always a possibility of committing one or the other errors.
3. Type I error (  ) and type II error (  ) have inverse relationship and therefore, can not
be minimized at the same time.
 In practice we set  at some value and design a test that minimize  . This is because a type I
error is often considered to be more serious, and therefore more important to avoid, than a
type II error.

General steps in hypothesis testing:

1. Specify the null hypothesis (H0) and the alternative hypothesis (H1).
2. Specify the significance level, 
3. Identify the sampling distribution (if it is Z or t) of the estimator.
4. Identify the critical region.
5. Calculate a statistic analogous to the parameter specified by the null hypothesis.
6. Making decision.
7. Summarization of the result.

Hypothesis testing about the population mean,  :

Suppose the assumed or hypothesized value of  is denoted by  0 , then one can formulate two
sided (1) and one sided (2 and 3) hypothesis as follows:

1. H 0 :   0 vs H1 :    0
2. H 0 :   0 vs H1 :    0
3. H 0 :   0 vs H1 :    0

Case 1: When sampling is from a normal distribution with  known

- The relevant test statistic is

X  0
Z cal 
 n

Page 6 of 9
Lecture notes on Biostatistics and ED One Sample Inference

- After specifying  we have the following regions (critical and acceptance) on the standard
normal distribution corresponding to the above three hypothesis.

Summary table for decision rule:

H0 Reject H0 if Accept H0 if Inconclusive if
  0 Z cal  Z 2 Z cal  Z 2 Z cal  Z 2 or Z cal   Z 2

  0 Z cal   Z Z cal   Z Z cal   Z

  0 Z cal  Z Z cal  Z Z cal  Z

Case 2: When sampling is from a normal distribution with  unknown and small sample size
2

- The relevant test statistic is

X  0
t cal  ~ t with n  1 deg rees of freedom.
S n
- After specifying  we have the following regions on the student t-distribution
corresponding to the above three hypothesis.
H0 Reject H0 if Accept H0 if Inconclusive if
  0 tcal  t 2 tcal  t 2 t cal  t 2 or tcal  t 2

  0 t cal  t t cal  t t cal  t

  0 tcal  t tcal  t tcal  t

Case 3: When sampling is from a non- normally distributed population or a population

whose functional form is unknown.
- If a sample size is large one can perform a test hypothesis about the mean by using:

X  0
Z cal  , if  2 is known.
 n
X  0
 , if  2 is unknown.
S n
- The decision rule is the same as case I.
Examples:
1. Test the hypotheses that the average height content of containers of certain lubricant is 10 liters if
the contents of a random sample of 10 containers are 10.2, 9.7, 10.1, 10.3, 10.1, 9.8, 9.9, 10.4,
10.3, and 9.8 liters. Use the 0.01 level of significance and assume that the distribution of contents
is normal.

Solution:

Page 7 of 9
Lecture notes on Biostatistics and ED One Sample Inference

Let   Population mean. ,  0  10

Step 1: Identify the appropriate hypothesis
H 0 :   10 vs H1 :   10
Step 2: select the level of significance,   0.01 ( given)
Step 3: Select an appropriate test statistics
t- Statistic is appropriate because population variance is not known and the sample size is
also small.
Step 4: identify the critical region.
Here we have two critical regions since we have two tailed hypothesis
The critical region is tcal  t0.005 (9)  3.2498
 (3.2498, 3.2498) is accep tan ce region.
Step 5: Computations:
X  10.06, S  0.25
X   0 10.06  10
 t cal    0.76
S n 0.25 10
Step 6: Decision
Accept H0 , since tcal is in the acceptance region.
Step 7: Conclusion
At 1% level of significance, we have no evidence to say that the average height content of
containers of the given lubricant is different from 10 litters, based on the given sample data.

2. The mean life time of a sample of 16 fluorescent light bulbs produced by a company is computed to
be 1570 hours. The population standard deviation is 120 hours. Suppose the hypothesized value for
the population mean is 1600 hours. Can we conclude that the life time of light bulbs is decreasing?
(Use   0.05 and assume the normality of the population)
Solution:
Let   Population mean. ,  0  1600
Step 1: Identify the appropriate hypothesis
H 0 :   1600 H1 :   1600
vs
Step 2: select the level of significance,   0.05 ( given)
Step 3: Select an appropriate test statistics
Z- Statistic is appropriate because population variance is known.
Step 4: identify the critical region.
The critical region is Z cal   Z 0.05  1.645
 (1.645, ) is accep tan ce region.
Step 5: Computations:

Page 8 of 9
Lecture notes on Biostatistics and ED One Sample Inference

X   0 1570  1600
Z cal    1.0
 n 120 16
Step 6: Decision
Accept H0, since Zcal is in the acceptance region.
Step 7: Conclusion
At 5% level of significance, we have no evidence to say that that the life time of light bulbs is
decreasing, based on the given sample data.

Exercise: It is known in a pharmacological experiment that rats fed with a particular diet over a
certain period gain an average of 40 gms in weight. A new diet was tried on a sample of 20 rats
yielding a weight gain of 43 gms with variance 7 gms. Test the hypothesis that the new diet is an
improvement assuming normality.

Page 9 of 9

The Impact of Social Media On Language Use Among Nigeria Youths Edited Copy 111
100% (1)
The Impact of Social Media On Language Use Among Nigeria Youths Edited Copy 111
61 pages
Chapter Two (Estimation and Hypothesis Testing)
No ratings yet
Chapter Two (Estimation and Hypothesis Testing)
20 pages
Estimation
No ratings yet
Estimation
44 pages
Chapter-8-Estimation & Hypothesis Testing
100% (1)
Chapter-8-Estimation & Hypothesis Testing
12 pages
中国股票市场的ESG责任投资研究赵斯彤
No ratings yet
中国股票市场的ESG责任投资研究赵斯彤
177 pages
1 EC108 Estimation and Confidence Interval
No ratings yet
1 EC108 Estimation and Confidence Interval
125 pages
Stat-II CH-TWO
No ratings yet
Stat-II CH-TWO
68 pages
Estimation
No ratings yet
Estimation
53 pages
Lecture 4-Statistical Inferences
No ratings yet
Lecture 4-Statistical Inferences
118 pages
SDO Navotas Sci7 Q1 Lumped - FV
No ratings yet
SDO Navotas Sci7 Q1 Lumped - FV
50 pages
4 Inferentials
No ratings yet
4 Inferentials
53 pages
Chapter 4 - BUSINESS STATISTICS
No ratings yet
Chapter 4 - BUSINESS STATISTICS
14 pages
6 Estimation
No ratings yet
6 Estimation
65 pages
7 Estimation
No ratings yet
7 Estimation
108 pages
Estimation and CI
No ratings yet
Estimation and CI
87 pages
Statistics For Manangement II
No ratings yet
Statistics For Manangement II
28 pages
Chapter Two-Four
No ratings yet
Chapter Two-Four
118 pages
Chapter Two
No ratings yet
Chapter Two
154 pages
Chapter 2
No ratings yet
Chapter 2
30 pages
Ch-1.Ppt Business Statx
No ratings yet
Ch-1.Ppt Business Statx
66 pages
Lecture 8
No ratings yet
Lecture 8
85 pages
University of Gondar College of Medicine and Health Science Department of Epidemiology and Biostatistics
No ratings yet
University of Gondar College of Medicine and Health Science Department of Epidemiology and Biostatistics
119 pages
Chapter 6
No ratings yet
Chapter 6
43 pages
Chapter-8-Estimation & Hyp
No ratings yet
Chapter-8-Estimation & Hyp
42 pages
Chapter 7estimation
No ratings yet
Chapter 7estimation
44 pages
Estimation and Sample Size Determination
No ratings yet
Estimation and Sample Size Determination
37 pages
Basiouni - Abdullah Innovation in E-Business Model
No ratings yet
Basiouni - Abdullah Innovation in E-Business Model
282 pages
Business Statistics CH 2
No ratings yet
Business Statistics CH 2
49 pages
CH II - Statistical Estimations
No ratings yet
CH II - Statistical Estimations
17 pages
Module 5
No ratings yet
Module 5
67 pages
Economics Research Methods I
No ratings yet
Economics Research Methods I
91 pages
Unit-3 (Estimation)
No ratings yet
Unit-3 (Estimation)
16 pages
Chapter 8
No ratings yet
Chapter 8
19 pages
America Again Reprint Stephen Colbert PDF Download
No ratings yet
America Again Reprint Stephen Colbert PDF Download
77 pages
Pre-Activity Metabo-Herb P. Abreu
100% (1)
Pre-Activity Metabo-Herb P. Abreu
2 pages
Chapter 8
No ratings yet
Chapter 8
42 pages
Estimation and Confidence Intervals
No ratings yet
Estimation and Confidence Intervals
28 pages
Inferential Statistics
No ratings yet
Inferential Statistics
119 pages
Session: 27: Topic
No ratings yet
Session: 27: Topic
62 pages
Chapter-7-Estimation & Hypothesis Testing
No ratings yet
Chapter-7-Estimation & Hypothesis Testing
15 pages
Chapter-8-Estimation & Hypothesis Testing
No ratings yet
Chapter-8-Estimation & Hypothesis Testing
14 pages
CH 2
No ratings yet
CH 2
20 pages
PLU Quantitative Techniques 3
No ratings yet
PLU Quantitative Techniques 3
17 pages
Statistical Inference
100% (1)
Statistical Inference
33 pages
Chapter Two
No ratings yet
Chapter Two
28 pages
Ce (PC) 602
No ratings yet
Ce (PC) 602
21 pages
Chapter 7 Estimation
No ratings yet
Chapter 7 Estimation
35 pages
Estimation by Confidence Interval
No ratings yet
Estimation by Confidence Interval
13 pages
Biostat Inferential Statistics
No ratings yet
Biostat Inferential Statistics
62 pages
Rws 6th
No ratings yet
Rws 6th
76 pages
Interval Estimation
100% (1)
Interval Estimation
42 pages
Estimation
No ratings yet
Estimation
11 pages
Chapter 8 Estimation & Hypothesis Testing Copy Copy1
No ratings yet
Chapter 8 Estimation & Hypothesis Testing Copy Copy1
11 pages
Cash Flows and Accrual Accounting in Predicting Future Cash Flows
No ratings yet
Cash Flows and Accrual Accounting in Predicting Future Cash Flows
210 pages
ABU Buad 837 Summary
No ratings yet
ABU Buad 837 Summary
14 pages
STA630 Quiz
No ratings yet
STA630 Quiz
54 pages
Han 1989
No ratings yet
Han 1989
8 pages
Buss. Stat CH-2
100% (2)
Buss. Stat CH-2
13 pages
ch.8 Inference
No ratings yet
ch.8 Inference
10 pages
Minitab
No ratings yet
Minitab
19 pages
A Hermeneutic Critique On George Steiners Hermene
No ratings yet
A Hermeneutic Critique On George Steiners Hermene
17 pages
Chapter 1
No ratings yet
Chapter 1
24 pages
Chapter-8-Estimation & Hypothesis Testing
No ratings yet
Chapter-8-Estimation & Hypothesis Testing
12 pages
Flipped Notes 7 Estimation
No ratings yet
Flipped Notes 7 Estimation
36 pages
Statistical Inferenace 1
No ratings yet
Statistical Inferenace 1
9 pages
Chapter 2 Statistics Estimation Final
No ratings yet
Chapter 2 Statistics Estimation Final
13 pages
Chapter 8
No ratings yet
Chapter 8
21 pages
Maths Statistics Coursework Sample
100% (2)
Maths Statistics Coursework Sample
7 pages
Estimation in Statistics
100% (1)
Estimation in Statistics
4 pages
Business Research Methods Lecutre Notes ALL UNITS
No ratings yet
Business Research Methods Lecutre Notes ALL UNITS
75 pages
2006 Geog090 Week06 Lecture01 CentralLimitTheorem
No ratings yet
2006 Geog090 Week06 Lecture01 CentralLimitTheorem
37 pages
R1Q1 Handbook
No ratings yet
R1Q1 Handbook
31 pages
Assessment of Related Learning Experience (RLE) : Basis For A Proposed Dedicated Education Unit Model (DEU)
No ratings yet
Assessment of Related Learning Experience (RLE) : Basis For A Proposed Dedicated Education Unit Model (DEU)
17 pages
Integrative Taxonomy - A Multisource Approach To Exploring Biodiversity
No ratings yet
Integrative Taxonomy - A Multisource Approach To Exploring Biodiversity
21 pages
Introduction To Estimation: OPRE 6301
100% (1)
Introduction To Estimation: OPRE 6301
18 pages
Midterm Wife Cheat Sheet
No ratings yet
Midterm Wife Cheat Sheet
3 pages
41pmt Da 15243 Bsg09315 Chekwoti
No ratings yet
41pmt Da 15243 Bsg09315 Chekwoti
8 pages
(16133641 - Cognitive Linguistics) Space-To-Time Mappings and Temporal Concepts PDF
No ratings yet
(16133641 - Cognitive Linguistics) Space-To-Time Mappings and Temporal Concepts PDF
46 pages
Activity 3
No ratings yet
Activity 3
4 pages
Statics Chapter 8 88
No ratings yet
Statics Chapter 8 88
12 pages
Introduction To Theory
No ratings yet
Introduction To Theory
35 pages
Exploratory Research
No ratings yet
Exploratory Research
3 pages
Advancing in Debate - Skills and Concepts
100% (4)
Advancing in Debate - Skills and Concepts
26 pages
Scientific Method of Research
No ratings yet
Scientific Method of Research
9 pages
Estimation
No ratings yet
Estimation
10 pages
Changing Trends in FMCG Industry In: India
No ratings yet
Changing Trends in FMCG Industry In: India
10 pages
Midterm Study Notes Psy1101
No ratings yet
Midterm Study Notes Psy1101
16 pages
Chapter 17 Confidence Interval
0% (1)
Chapter 17 Confidence Interval
3 pages
STATS Exam Questions!
No ratings yet
STATS Exam Questions!
3 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet

One Sample Inf

Uploaded by

One Sample Inf

Uploaded by

Lecture notes on Biostatistics and ED One Sample Inference

ONE SAMPLE INFERENCE

Interval Estimate: A range of values used to estimate a parameter.

 ( X  Z 2  n , X  Z 2  n ) is a1001   % conifidence int erval for 

 ( X  Z 2 S n , X  Z 2 S n ) is a1001   % conifidence int erval for 

 ( X  t 2 S n , X  t 2 S n ) is a1001   % conifidence int erval for 

X  32,   4.2, 1    0.99    0.01,  2  0.005

X  2 .28 , S  0 .95 , 1    0 .95    0 .05 ,  2  0 .025

- Type I error: Rejecting the null hypothesis when it is true.

General steps in hypothesis testing:

Hypothesis testing about the population mean,  :

Case 1: When sampling is from a normal distribution with  known

- The relevant test statistic is

Summary table for decision rule:

  0 Z cal   Z Z cal   Z Z cal   Z

- The relevant test statistic is

  0 t cal  t t cal  t t cal  t

Case 3: When sampling is from a non- normally distributed population or a population

Let   Population mean. ,  0  10

You might also like