Acc117 Chapter5 Report
Acc117 Chapter5 Report
INFERENTIAL
STATISTICS
CHAPTER 5
POINT ESTIMATE AND
INTERVAL ESTIMATE
CHAPTER 5
POINT ESTIMATE Search
Using the results of the central limit theorem, we can say that for a
reasonably large sample size, we can state the following:
95 percent of the sample means selected from a population will
be within 1.96 standard deviations of the population mean, 𝜇.
99 percent of the sample means selected from a population will
be within 2.58 standard deviations of the population mean, 𝜇.
CONFIDENCE INTERVAL ESTIMATION FOR LARGE SAMPLES
Suppose that we have a confidence level of 95 percent. Because it refers to the middle 95 percent of the
observations, we divide the bell curve into two at its center, with a z score of 0.
Because it is symmetrical, both divisions are mirror images.
Because the sum of probabilities under the entire bell curve is equal to 1, each division has a sum of
probabilities equal to 0.5.
We have 0.025 at both ends since the final 5 percent is equally split between the two tails.
CONFIDENCE INTERVAL ESTIMATION FOR LARGE SAMPLES
Using the standard normal table, locate 0.025 (can be found in the standard normal table for negative z scores).
It corresponds to a z score of -- 1.96.
Therefore, the area under the bell curve from z score of -- 1.96 to 0 is 0.475.
Correspondingly, we do the same thing for the other half to the right of z score 0. (mirror image)
Using the standard normal table, locate 0.975 (can be found in the standard normal table for positive z scores).
It corresponds to a z score of 1.96.
Therefore, the area under the bell curve from the z score of 0 to 1.96 is also 0.475.
Determining z for Different
Confidence Level
The z score for a corresponding
CONFIDENCE
0.9 0.95 0.99 0.999 =1-∝ confidence level is important in
LEVEL
constructing the confidence
SIGNIFICANCE
LEVEL
0.1 0.05 0.01 0.001 =∝ interval.
Z SCORE (LEFT-
-1.28 -1.64 -2.33 -3.09 =normsinv (∝)
It is also important that we are
TAILED)
Z SCORE (TWO-
availability of the population
±1.64 ±1.96 ±2.58 ±3.29 =±normsinv (∝/2)
TAILED)
standard deviation to know which
formula to use in constructing the
confidence interval.
Determining z for Different
Confidence Level
Let us say the sample size is less than 30, and we do not know what the
population standard deviation is. We are unable to apply the z
distribution or the conventional normal distribution in this situation.
We would instead apply the t distribution.
Standard Normal Value (z score) when According to the presumption that the
Equation 4.4 population standard deviation is population of interest has a normal
known, regardless of n distribution, the t distribution is also
described as follows:
Standard Normal Value (z score) when There is also a family of t distributions.
Equation 4.5 z population standard deviation is While all t distributions are centered
unknown and n≥30 at mean 0, their standard deviation
differs according to sample size. A
t distribution (t statistic) when smaller sample size has a higher
Equation 5.3 population standard deviation is standard deviation.
unknown and n<30
Compared to the standard normal
Confidence interval for the population distribution, the t distribution is more
Equation 5.4 mean when n<30 and population spread out and flatter at the center.
standard deviation is not available
t Distribution Table
CONFIDENCE INTERVAL ESTIMATION FOR PROPORTION
Note that σp is the standard error of the proportion that measures the variability in the
sampling distribution of the sample proportion.
Using Equation 5.7, we restate Equation 5.6 into Equation 5.8, which is the formula to
construct a confidence interval for a population proportion.
FINITE POPULATION CORRECTION FACTOR
CHAPTER 5
HYPOTHESIS TESTING
1 2 3 4 5
To begin hypothesis testing, state the hypothesis being tested. This statement is
called null hypothesis, denoted by Ho read as “H not” or “H subzero”.
The null hypothesis is a hypothesis that indicates “no change”, “no difference”,
current or reported condition.
Hence, it is always stated in equality terms.
It is also stated in the negative because you have to be able to prove something is
indeed true.
Thus, the null hypothesis is what we test with statistics.
We either reject or fail to reject the null hypothesis
STEP 1: State the null and alternative hypothesis
When you have empirical evidence to reject the null hypothesis, you need to have an
alternative. This is where the alternative hypothesis comes into play.
The alternative hypothesis states what you will choose if the null hypothesis is
rejected.
Formally, it is a statement that is not rejected if the sample data provides sufficient
empirical evidence to reject the null hypothesis.
It is denoted by H1 read as “H sub one”/
If the null hypothesis is called the test statement, the alternative hypothesis is
called the research statement.
STEP 2: Select a significance level (α)
Once you have stated the null and alternative hypotheses, you need to decide on
your significance level, denoted by α.
In hypothesis testing, the significance level is the probability of rejecting the null
hypothesis when it should not have been rejected.
Also called the significance level the risk level because it is the risk you take of
rejecting the null hypothesis when it should not have been rejected.
STEP 2: Select a significance level (α)
There are various statistics, which include z, t, F, and χ2 (chi squared). Each has its
specific use given the nature of the hypothesis, data distribution, and the testing
procedure itself.
STEP 4: Formulate a decision rule
CHAPTER 5
From our discussion of constructing a
confidence interval, we have to use a
specific formula depending on our
HYPOTHESIS TEST data’s distribution, sample size, level
of measurement, and availability of
ABOUT A population parameters.
POPULATION MEAN The same is true for implementing
statistical hypothesis testing for a
(ONE-SAMPLE, population mean using one sample
only.
z Test) Here we test whether sample data
results are less than or greater than
the hypothesized population
parameter value.
STEP 1: STATE THE NULL AND ALTERNATIVE HYPOTHESES
Alternative
Two-tailed test
hypothesis
STEP 2: SELECT THE LEVEL OF SIGNIFICANCE (α)
90% 10%
95% 5%
99% 1%
STEP 3: SELECT A TEST STATISTIC
Compare the test statistic to a t-distribution to calculate the probability of observing the test
statistic under the null hypothesis .
Compare the P-value to the significance level to decide whether to reject the null hypothesis
A one-sample proportion z-test is a
statistical test that compares the
proportion of a sample to a
HYPOTHESIS TEST ABOUT theoretical proportion.
Make a decision
A two-sample z-test is a statistical
hypothesis test that compares the
HYPOTHESIS TEST means of two independent samples
to determine if they come from the
ABOUT A POPULATION same population.
MEAN (INDEPENDENT
TWO-SAMPLE, Z TEST)
When to use a two-sample z-test
The two samples must be independent
The sample size for each sample must be large enough (𝑛≥30)
The population standard deviations for both samples are unknown, but the
sample containing at least 30 observations was randomly selected.
A two-sample t-test, also called an
independent samples t-test, is a
HYPOTHESIS TEST statistical test that compares the
means of two independent groups.
ABOUT A POPULATION
It is used to determine if there's a
MEAN (INDEPENDENT statistically significant difference
TWO-SAMPLE, T TEST) between the two groups.
When to use an independent two-sample, t test
To use the t statistic to test for the difference between two means, the following
assumptions have to be satisfied:
The two population standard deviations are equal (i.3., equal variances).
METHOD A METHOD B
A researcher wants to compare
the mean test scores of two
85 78
groups of students who were
taught using different teaching 90 82
methods (Method A and Method B).
The researcher collects the 88 80
following data:
92 85
87 83
I L L U S T R A T I O N
We can also conduct statistical
hypothesis testing for a proportion for
two samples, which is similar to a
one-sample hypothesis test.
HYPOTHESIS TEST A hypothesis test for two proportions
ABOUT A PROPORTION compares the proportions of two
independent samples.
(Two-Sample, t test) In implementing a statistical
hypothesis test of proportions for two
samples, we will continue to apply
the five-step hypothesis testing
procedure.
Hypothesis Test about a Proportion (Two-Sample, t test)
Equation 5. 18 represents the pooled proportion who has the trait in the combined samples, which is the
pooled estimate of the population proportion. We then substitute Equation 5.18 into Equation 5.19.