0% found this document useful (0 votes)

29 views7 pages

Chapter 8 and 9

This document provides an overview of estimation and hypothesis testing. It defines key terms like parameters, statistics, point estimation, and interval estimation. It explains how to construct confidence intervals for the population mean and outlines the steps for hypothesis testing, including defining null and alternative hypotheses, determining significance levels, calculating test statistics, and making conclusions. Examples are provided for hypothesis testing of claims about population means where the test statistics are z-scores or t-statistics depending on sample size. The document then previews the topic of the next chapter which will cover simple linear regression and correlation analysis involving bi-variate data.

Uploaded by

Ellii YouTube channel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views7 pages

Chapter 8 and 9

Uploaded by

Ellii YouTube channel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Chapter 8

Estimation and Hypothesis Testing

The most important objective of statistical analysis is to draw inferences about the population
using sample information. The process of generalizing from sample to the population is known
as Statistical Inference.

A summary measure that describes any given characteristic of the population is known as
Parameter. Eg: population mean (µ), population variance (δ2), population standard deviation (δ),
population proportion (P), population moments (µr) are parameters.
The summary measure that describes the characteristic of the sample is known as Statistic.
Eg: sample mean ( X̄ ), sample variance (S2), sample standard deviation (S), sample proportion
(p), sample moments (
X̄ r ) are Statistics.

Statistical inference generally takes one of the two forms, namely, estimation of the population
parameter and testing of hypothesis.
For the purpose of general discussion, a population parameter is denoted by θ and the
^
corresponding statistic byθ . As already stated the parameter θ is unknown. The value of the
^
statistic θ is computed from the random sample taken from the population.
^
The statistic θ intended for estimating a parameter θ is called an Estimator ofθ . The specific
numerical value of an estimator calculated from the sample is called the Estimate.
The process of obtaining an estimate of the unknown value of a parameter by a statistic is called
Estimation. There are two types of estimations. One is the point estimation and the other is
interval estimation.
Point Estimation
It is the process of obtaining a single sample value (point estimate) that is used to estimate the
desired population parameter. The estimator is known as point estimator.
Eg: X̄ is a point estimate of µ.
S is a point estimate ofδ
The best estimator should be highly reliable and have such desirable properties as unbiasedness,
consistency, efficiency and sufficiency. These criteria are described as follows:
1. Unbiasedness: An estimator is a random variable since it is always a function of the
sample values. The expected value of the sample statistic is considered to be an unbiased
^
estimator if it equals the population parameter which is being estimated. This means E( θ
)=θ .

1
2. Consistency: It refers to the effect of sample size on the accuracy of the estimator. A
statistic is said to be consistent estimator of the population parameter if it approaches the
^
parameter as the sample size increases, i.e. θ →θ as n→N.
3. Efficiency: An estimator is considered to be efficient if its value remains stable from
sample to sample. The best estimator would be the one which would have the least
variance from sample to sample. From the three point estimators of central tendency,
namely the mean, median and mode, the mean is considered the least variant and hence a
better estimator.
4. Sufficiency: An estimator is said to be sufficient if it uses all the information about the
population parameter contained in the sample. For example, the statistic mean uses all the
sample values in its computation while median and mode do not. Hence the mean is a
better estimator in this sense.
Interval Estimation
Point estimator has some drawbacks. First a point estimator from the sample may not exactly
locate the population parameter (i.e. the value of the point estimator is not likely to be exactly
equal to the value of the parameter) resulting in some margin of uncertainty. If the sample value
is different from the population value, the point estimator does not indicate the extent of the
possible error. Secondly a point estimate does not specify as to how confident we can be that the
estimate is close to the parameter it is estimating. That is we cannot attach any degree of
confidence to such an estimate as to what extent it is closer to the value of the parameter.
Because of these limitations of point estimation, interval estimation is considered desirable. The
interval estimation involves the determination of an interval (a range of values) within which the
population parameter must lie with a specified degree of confidence. It is the construction of an
interval on both sides of the point estimate within which we can reasonably confident that the
true parameter will lie.

Interval Estimation for the Population Mean (µ)

If the probability of rejecting true hypothesis is given, then it is denoted by α and it is called level
δ δ
of significance. The (1-α) 100% confidence interval for µ is ( X̄ -Zα/2 √ n , X̄ +Zα/2 √ n )=(L,U)
δ
Zα/2 √ n is the maximum error of the estimate (the maximum difference between the point
estimate of a parameter and the actual value of a parameter).

Ex: - Haramaya University wishes to estimate the average age of students who graduate with
B.Sc. degree. A random sample of 625 graduating students showed that the average age was 24
with a standard deviation of 5 years. Construct the 95% confidence interval for the true average
age of all such graduating students at the university and interpret it.
Hypothesis Testing

2
A statistical hypothesis is a conjecture (an assumption) about a population parameter which may
or may not be true. Hypothesis testing is a statistical procedure which leads to take a decision
about such an assumption for the population parameter being correct or not, by using data
obtained from the sample.
In hypothesis testing, the researcher must define the population under study, state the particular
hypothesis that will be checked, give the significance level, select sample from the population,
perform calculations required for statistical test and reach conclusion.
It is already expressed that a statistical hypothesis may or may not true. For each situation, there
two types of statistical hypotheses.
1. Null Hypothesis (H0):- is a statistical hypothesis that states there is no difference
between a parameter and a specific value or hypothesized value. H 0:µ=µ0 where µ is the
population mean and µ0 is the hypothesized mean
2. Alternative Hypothesis (H1):- is a statistical hypothesis that states there exists a
difference between a parameter and a specific value or hypothesized value.
H1: µ≠µ0 H1: µ<µ0 H1: µ>µ0
Errors in Hypothesis Testing
1. Type I error: is an error occurred if one rejects the null hypothesis which is actually
true.
2. Type II error: is an error occurred if one failed to reject the null hypothesis which is
actually false.

The maximum probability of committing type I error is called the level of significance and
denoted by α (alpha).

Hypothesis testing for the population mean

1. State both hypotheses; the null and the alternative hypotheses. The hypotheses may be
either of the three.
H0:µ=µ0 H0:µ=µ0 H0:µ=µ0
H1: µ≠µ0 H1: µ<µ0 H1: µ>µ0
Two tailed test left tailed test right tailed test
2. Determine the level of significance α and obtain the tabulated (critical) value. For two
tailed test the critical value is Zα/2 (tα/2) and for left tailed -Zα (-tα) and right tailed Zα (tα).
3. Use the appropriate test statistic.
X̄− μ
t=
S
Use t statistic if n is small, √n ~t α (n−1) .
X̄−μ
Z=
δ
If n is large use Z statistic, √ n ~N (0, 1).

3
4. Define the critical (rejection) region.
5. If the value of the test statistic falls in the critical region (rejection region), reject the null
hypothesis; otherwise accept it.
6. Make a decision.
EX:
1. A research repots that the average salary of veterinarians is more than $42000. A sample
of 30 veterinarians has a mean salary of $43260. Test the reports claim. Assume the
population standard deviation is $5230.
2. A national magazine claims that the average college students watches less television than
the general public. The national average is 29.4 hours per week, with a standard deviation
2 hours. A random sample of 25 college students has a mean of 27 hours. Test the claim.
Assume normality.
3. A merchant believes that the average age of customers who purchase a certain brand of
wears is 13 years of age. A random sample of 35 customers had an average age of 15.6
years. At α=0.01, should this conjecture be rejected. The standard deviation of the
population is 1year.

Chapter 9
Simple Linear Regression and Correlation
In the previous chapters we have been dealing with a single variable. In this chapter we will deal
with a bi-variate data i.e. data involving two variables. In this section we will deal with the
problem of predicting the average value of one variable in terms of known values of the other
variable(s).

Regression may be defined as the estimation or prediction of the unknown value of one variable
from the known values of one or more variables. The variable whose values are to be estimated
or predicted is known as dependent or explained variable while the variable which are used in
determining the value of the dependent variable are called independent or predictor variables.

The regression study that involves only two variables is called simple regression and the
regression analysis that studies more than two variables is called multiple regression. If the
relation ship between the two variables can be described by a straight line then the regression is
known as linear regression other wise it is called non-linear.

 The regression analysis involving only two variables and having a linear relationship is
called Simple Linear Regression. This linear relationship between the two variables is
represented by a straight line.

Regression Line (Line of Regression): is the line that gives the best estimate of one variable for
any given value of another variable. The regression line which is used to predict the values of Y

4
for any given value of X is called regression line of Y on X. similarly the regression line which is
used to predict the values of X for any given value of Y is called regression line of X on Y.

Regression Equation: is a mathematical equation that defines the relationship between two
variables.

Regression of Y on X
Model: Y= α + βX + Є
Where Y is the dependent variable
X is the dependent variable
α is constant term(intercept)
β is slope(change in Y for a unit change in X)
Є is the error term
To estimate the regression coefficients (α and β), the procedure is minimizing the sum of the
^
squares of the errors. Let the estimated model be Y = a + bX. Then, from sample data the values
of a (estimate of α) and b (estimate of β) can be obtained as follows:
n ∑ XY −∑ X ∑ Y
b= n ∑ X 2−( ∑ X )2 and a= Ȳ -b X̄ .

Interpretation of the slope (b)

1. If b is positive, there is a direct relationship between the two variables.
2. If b is zero, there is no linear relationship between the two variables.
3. If b is negative, there is an indirect relationship between the two variables.

Correlation
Most of the variables in economics and business area show relationship. For example, price and
supply, income and expenditure, advertizing expenditure and sales. Thus in order to know the
degree or direction of such a relationship between variables, correlation analysis is important.
Correlation is a mathematical tool desired towards measuring the degree of the relationship
(degree of association) between the variables. Correlation that involves only two variables is
called simple correlation and which involves more than two variables is called multiple
correlations.
Covariance is a measure of the joint variation in two variables, i.e. it measures the way in which
the values of the two variables vary together. If the covariance is zero, there is no linear
relationship between the two variables. If it is negative, there is an indirect linear relationship
between them. If the covariance is positive, there is a direct linear relationship between the
variables.

5
Pearson’s coefficient of correlation (r)
Pearson’s coefficient of correlation (r) is used to measure the strength of the linear relationship
between two variables.
The population correlation coefficient is denoted by ρ and the sample correlation coefficient is
denoted by r.
n ∑ XY −∑ X ∑ Y
√ ∑ X 2−( ∑ X )2 √ n ∑ Y 2−( ∑ Y )2
r= n

The value of r is always in between -1 and 1.

Interpretation of r
 If the value of r is -1 or 1, there is perfect negative or perfect positive linear relationship
between the variables.
 If the value of r is approximately -1 or 1, there is a strong negative or strong positive
linear relationship between the variables.
 If r is -0.5 (or approximately -0.5) or 0.5 (or approximately 0.5), there is moderate
negative or moderate positive linear relationship between the variables.
 If r¿ 0, there is no linear relationship.

Coefficient of determination (r2)

It is the proportion of the variation in the dependent variable which is explained by the
independent variable, in the regression model. It is the square of the correlation coefficient.

Ex:
1. Given the following data on supply (X) and sales (Y) of a certain commodity

Supply 60 62 6 70 7 75 71
(X) 5 3
Sales (Y) 10 11 1 15 1 19 14
3 6

a. Estimate the regression equation supply on sales.

b. Interpret the estimated coefficients (the slope and intercept).
c. Calculate the correlation coefficient between supply and sales, and interpret it.
d. Find the coefficient of determination and interpret it.
e. Predict the amount of sales of the commodity if the supply amount is 80.

6
2. The following summary results are obtained from price and demand of a commodity
∑price=30 ∑demand=40 ∑(price)(demand)=214
2 2
∑(price) =220 ∑(demand) =340 n=5
a. Identify the dependent and independent variable.
b. Estimate the regression equation.
c. Interpret the estimated coefficients.
d. Calculate the correlation coefficient between price and demand, and interpret it.
e. Find the coefficient of determination and interpret it.

2
2
S
3. Given n=25, X̄ =3.95, Ȳ =2.03, S x =85.35, S y =98.75, xy =90
a. Fit the regression equation Y on X.
b. Interpret the estimated coefficients.
c. Calculate the correlation coefficient and interpret it.
d. Find the coefficient of determination and interpret it.

Data Visualization Notes Ou
No ratings yet
Data Visualization Notes Ou
125 pages
Chapter Two (Estimation and Hypothesis Testing)
No ratings yet
Chapter Two (Estimation and Hypothesis Testing)
20 pages
7 Estimation
No ratings yet
7 Estimation
108 pages
STS 201 Week 6 Lecture Note
No ratings yet
STS 201 Week 6 Lecture Note
35 pages
BUS51A Lecture12
No ratings yet
BUS51A Lecture12
47 pages
Mba 2
No ratings yet
Mba 2
28 pages
Lecture 4-Statistical Inferences
No ratings yet
Lecture 4-Statistical Inferences
118 pages
Estimation and Hypothesis Testing
No ratings yet
Estimation and Hypothesis Testing
44 pages
Unit-1 Introduction To SI 1
No ratings yet
Unit-1 Introduction To SI 1
52 pages
2.parameter Estimation
No ratings yet
2.parameter Estimation
59 pages
Hypothesis Notes 1
No ratings yet
Hypothesis Notes 1
88 pages
22-Intro To Inference For Decision Making-19-03-2024
No ratings yet
22-Intro To Inference For Decision Making-19-03-2024
15 pages
Statssss
No ratings yet
Statssss
31 pages
Research Methodology and Biostatistics Part II 2
No ratings yet
Research Methodology and Biostatistics Part II 2
45 pages
Chapter 5 Infernece Concerning Mean - 081fa6ed Cdae 4e5f Afc5 E9ab156488e0
No ratings yet
Chapter 5 Infernece Concerning Mean - 081fa6ed Cdae 4e5f Afc5 E9ab156488e0
47 pages
Chapter 8
No ratings yet
Chapter 8
29 pages
Session On Confidence Interval
No ratings yet
Session On Confidence Interval
13 pages
CH 8
No ratings yet
CH 8
20 pages
BBA IV Business Statistics
No ratings yet
BBA IV Business Statistics
270 pages
Review of Statistics
No ratings yet
Review of Statistics
36 pages
Statistical Inference
No ratings yet
Statistical Inference
29 pages
Estimation and Sample Size Determination
No ratings yet
Estimation and Sample Size Determination
37 pages
Unit 6.
No ratings yet
Unit 6.
37 pages
Group 5
No ratings yet
Group 5
20 pages
Chapter 7estimation
No ratings yet
Chapter 7estimation
44 pages
Chapter 8
No ratings yet
Chapter 8
45 pages
Procedure of Testing Hypothesis
100% (1)
Procedure of Testing Hypothesis
5 pages
Unit V Estimation
No ratings yet
Unit V Estimation
33 pages
Statistical Inference - Part1.4
No ratings yet
Statistical Inference - Part1.4
28 pages
Inferential Statistics: Estimation Hypothesis Testing
No ratings yet
Inferential Statistics: Estimation Hypothesis Testing
59 pages
Chapter 6
No ratings yet
Chapter 6
43 pages
Sampling Theory
No ratings yet
Sampling Theory
7 pages
DV Unit 1&2 Notes
No ratings yet
DV Unit 1&2 Notes
50 pages
Statistics 2 Chapter Two
No ratings yet
Statistics 2 Chapter Two
14 pages
University of Gondar College of Medicine and Health Science Department of Epidemiology and Biostatistics
No ratings yet
University of Gondar College of Medicine and Health Science Department of Epidemiology and Biostatistics
119 pages
Business Statistics CH 2
No ratings yet
Business Statistics CH 2
49 pages
4 Inferentials
No ratings yet
4 Inferentials
53 pages
Stat
67% (3)
Stat
70 pages
Chapter 8
No ratings yet
Chapter 8
19 pages
Chapter-7-Estimation & Hypothesis Testing
No ratings yet
Chapter-7-Estimation & Hypothesis Testing
15 pages
Tinu
No ratings yet
Tinu
5 pages
Chapter 8
No ratings yet
Chapter 8
21 pages
Inferential Statistics
No ratings yet
Inferential Statistics
6 pages
Chapter 8
No ratings yet
Chapter 8
42 pages
Elementary-statistics-Group-4 20250402 132652 0000
No ratings yet
Elementary-statistics-Group-4 20250402 132652 0000
31 pages
Chapter Two Stat II
No ratings yet
Chapter Two Stat II
20 pages
BBIO105 Statistics Handouts Module 5A (Introduction To Inferential Statistics & Estimation)
No ratings yet
BBIO105 Statistics Handouts Module 5A (Introduction To Inferential Statistics & Estimation)
8 pages
Ch-1.Ppt Business Statx
No ratings yet
Ch-1.Ppt Business Statx
66 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
6 pages
Normal Distribution
No ratings yet
Normal Distribution
8 pages
Chapter 8 Estimation & Hypothesis Testing Copy Copy1
No ratings yet
Chapter 8 Estimation & Hypothesis Testing Copy Copy1
11 pages
03 Inferential Statistics2025
No ratings yet
03 Inferential Statistics2025
38 pages
Inferential Statistics
No ratings yet
Inferential Statistics
119 pages
Estimation in Statistics
100% (1)
Estimation in Statistics
4 pages
Ed Inference1
No ratings yet
Ed Inference1
20 pages
1 Review of Basic Concepts - Interval Estimation
No ratings yet
1 Review of Basic Concepts - Interval Estimation
4 pages
Estimation 1920
No ratings yet
Estimation 1920
51 pages
Statistics Esrimation and Hypothesis
No ratings yet
Statistics Esrimation and Hypothesis
13 pages
Uppen FP Series FP 2400Q Service Manual
No ratings yet
Uppen FP Series FP 2400Q Service Manual
47 pages
5 6089131777291453670
100% (1)
5 6089131777291453670
70 pages
9a BMGT 220 S.I. Theory of Estimation
No ratings yet
9a BMGT 220 S.I. Theory of Estimation
5 pages
E-Commerce & Business Communication Ebook (SEM 4)
No ratings yet
E-Commerce & Business Communication Ebook (SEM 4)
87 pages
OOPM UNIT 1 (Cse)
No ratings yet
OOPM UNIT 1 (Cse)
64 pages
Module 3 Joint Arrangements
No ratings yet
Module 3 Joint Arrangements
19 pages
TLE7 - 8-ICT-PROGRAMMING FOR ROBOTICS Q1 M1 W1 - noAK
No ratings yet
TLE7 - 8-ICT-PROGRAMMING FOR ROBOTICS Q1 M1 W1 - noAK
16 pages
The Physics of Clinical MR Taught Through Images, 5th Edition Educational Ebook Download
100% (15)
The Physics of Clinical MR Taught Through Images, 5th Edition Educational Ebook Download
15 pages
Tewodros Tesfahun
No ratings yet
Tewodros Tesfahun
155 pages
Intro CH 4 Theory of Production and Cost
0% (1)
Intro CH 4 Theory of Production and Cost
53 pages
Research Proposal
100% (1)
Research Proposal
17 pages
Project: K028-Fayyhealth-Polyclinic Risk Assessment For Slab Coring Work
No ratings yet
Project: K028-Fayyhealth-Polyclinic Risk Assessment For Slab Coring Work
5 pages
Ductility Factor - Article368966 - Structuraldesigncodesofaustraliaandnewzealand - Manuscript
No ratings yet
Ductility Factor - Article368966 - Structuraldesigncodesofaustraliaandnewzealand - Manuscript
16 pages
Chapter-6 and 7
100% (1)
Chapter-6 and 7
16 pages
Chapter 2 Opaud
No ratings yet
Chapter 2 Opaud
5 pages
Modul Session 12 Akuntasi Feb
No ratings yet
Modul Session 12 Akuntasi Feb
26 pages
BARTEC Engineers Manual
No ratings yet
BARTEC Engineers Manual
12 pages
Unit 1 Meaning of OB Meaning & Importance of OB: Understanding Organisational Behaviour
No ratings yet
Unit 1 Meaning of OB Meaning & Importance of OB: Understanding Organisational Behaviour
27 pages
x100 Pad 2 User Manual PDF
No ratings yet
x100 Pad 2 User Manual PDF
29 pages
Complete Mesocolic Excision and Extent of Lymphadenectomy For The Treatment of Colon Cancer
No ratings yet
Complete Mesocolic Excision and Extent of Lymphadenectomy For The Treatment of Colon Cancer
14 pages
Journal of Oral Health and Dentistry Research (ISSN: 2583-522X) Case Report The in Uence of The Pulp On The Periodontium: A Viewpoint
No ratings yet
Journal of Oral Health and Dentistry Research (ISSN: 2583-522X) Case Report The in Uence of The Pulp On The Periodontium: A Viewpoint
11 pages
In-Line Mixing
No ratings yet
In-Line Mixing
9 pages
C++ With Visual Basic
No ratings yet
C++ With Visual Basic
10 pages
Acc100 Lecture Notes Ch9
No ratings yet
Acc100 Lecture Notes Ch9
32 pages
Previews 2034814 Pre
No ratings yet
Previews 2034814 Pre
7 pages
Chapter 5 Elementary Probability2
No ratings yet
Chapter 5 Elementary Probability2
13 pages
Coldrinks Project
No ratings yet
Coldrinks Project
23 pages
Zok The Armenian Dialect of Agulis
No ratings yet
Zok The Armenian Dialect of Agulis
19 pages
612 D Fig702 Flanged y Type Strainer Ul
No ratings yet
612 D Fig702 Flanged y Type Strainer Ul
2 pages
Basic Stat Assignment
No ratings yet
Basic Stat Assignment
2 pages
Aqautec Ocean Parts Manual
No ratings yet
Aqautec Ocean Parts Manual
4 pages
GDC BCP Template
No ratings yet
GDC BCP Template
53 pages
Designation
No ratings yet
Designation
12 pages
Pressure Volume Curve 2005
No ratings yet
Pressure Volume Curve 2005
22 pages
CV Thabet English
No ratings yet
CV Thabet English
2 pages
Psychology Chapter 1
No ratings yet
Psychology Chapter 1
2 pages
CBR Proposal
No ratings yet
CBR Proposal
14 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet

Chapter 8 and 9

Uploaded by

Chapter 8 and 9

Uploaded by

Chapter 8

Estimation and Hypothesis Testing

Interval Estimation for the Population Mean (µ)

Hypothesis testing for the population mean

Interpretation of the slope (b)

The value of r is always in between -1 and 1.

Coefficient of determination (r2)

a. Estimate the regression equation supply on sales.

You might also like