Statistical Analysis in Finance Session 4: Hypothesis Testing
Statistical Analysis in Finance Session 4: Hypothesis Testing
Session 4:
Hypothesis Testing
www.cranfield.ac.uk/som
Content :
Sessions 1 & 2: Probability and Probability Distributions
Session 3: Sampling and Estimations
SESSION 4: HYPOTHESIS TESTING
Session 5: Problem Solving
Sessions 6 & 7: Regression Analysis
Session 8: Regression Models with Dummy Variables
Sessions 9 &10: Problem Solving and Exam Revision
2
Statistical Analysis in Finance
Reading:
Statistical Techniques in Business and Economics
(17/E) by Douglas A. Lind, William G. Marchal and
Samuel A. Wathen 2017. McGraw-Hill. Chapters
Chapters 10, 11 and 12.
What is a Hypothesis?
8
Step1: Stating the Hypothesis (cont’d)
10
Step1: Stating the Hypothesis (cont’d)
11
ü H0: m = 606.4
H1: m ≠ 606.4
13
14
Step1: Stating the Hypothesis (cont’d):
Example 3
• It is generally believed that at least 0.6 of the residents
of Cranfield town are happy with their lives. A sociologist
is concerned about the lingering economic crisis and
wants to determine whether the crisis has adversely
affected the happiness level in Cranfield town. Specify
the null and the alternative hypotheses to determine if
the sociologist ‘s concern is valid.
ü H0: p ≥ 0.6
H1: p < 0.6
15
16
Example 4: Door Delivery
• In this case the builder has accepted more than 6% faulty doors.
He committed type II error not rejecting when it is false.
18
Step 2: Specifying the significance level
(cont’d)
• Type I Error
• Defined as the probability of rejecting the
null hypothesis when it is actually true.
• This is denoted by the Greek letter “a”
• Also known as the significance level of a
test
• Type II Error
• Defined as the probability of failing to reject
the null hypothesis when it is actually false.
• This is denoted by the Greek letter “β”
19
Z-distribution
T-distribution
22
Step 4: Stating the decision rule
23
24
Step 4: Stating the decision rule: Critical
value approach (cont.)
26
Step 4: Stating the decision rule:
p-Value approach
30
Hypothesis Setups for Testing a single
Mean, µ
31
• The value of the test statistics for the hypothesis test of the
population mean, μ, when the population standard deviation
σ is known is computed as:
33
The waiting time for customers at Burger King Restaurants follows a normal
distribution with a population standard deviation of 1 minute. At the Victoria Road
Burger King, the quality-assurance department sampled 50 customers and found
that the mean waiting time was 2.75 minutes. At the .05 significance level, can we
conclude that the mean waiting time is less than 3 minutes?
Step 4: Formulate the decision rule by first determining the critical values of z.
35
Example 5 (Cont.)
Compute z:
x# − µ 2.75 - 3
z= = = -1.768
σ/ n 1/ 50
Decision: Reject Ho, because -1.768 does fall in the rejection region, we
decide to reject H0.
X -µ
t n -1 =
s/ n
where,
t n -1 = t - statistic with n - 1 degrees of freedom
X = sample mean
µ = the hypothesized value of the population mean
s = sample standard deviation
37
Example 6
• You are analysing Etihad Equity fund, a midcap growth fund, that has
been in existence for 24 moths. During this period, it has achieved a
mean monthly return of 1.5% with a sample standard deviation of monthly
returns of 3.6%.
• Given its level of its systematic (market) risk and according to a pricing
model, this equity fund was expected to have earned a 1.10% mean
monthly return during that time period. Assuming that returns are
normally distributed, are the actual results consistent with a population
mean monthly return of 1.10%.
39
40
Comparing two populations
– Some Examples
42
Comparing two population means (Cont.)
X1 - X 2
z =
s 12 s 22
+
n1 n2
164 - 159
z= = 1.85
2 2
12.5 9.25
+
35 30 45
Example 7: Solution
46
Comparing population means:
Equal but unknown population standard
deviations (the Pooled t-test)
48
Example 8
The realized mean monthly return on a sample the FTSE All Share index
in the 1990s appears to have been substantially different from the mean
return on a sample of the FTSE All Share index in the 1980s. Was the
difference statistically significant? Assuming equal population variance,
determine whether the null hypothesis is rejected or not at the 0.05 level
of significance.
1. Formulate the null and alternate hypothesis
2. Assuming equal population variance, determine whether the null
hypothesis is rejected or not at the 0.05 level of significance.
Example 8: Solution
52
Comparing population means: Unequal and
unknown population standard deviations
54
Characteristics of F-Distribution
1. There is a “family” of F
Distributions. A particular member
of the family is determined by two
parameters: the degrees of
freedom in the numerator and the
degrees of freedom in the
denominator.
2. The F distribution is continuous
3. F value cannot be negative.
4 . The F distribution is positively
skewed.
5. It is asymptotic. As F ® ¥ the
curve approaches the X-axis but
never touches it. 55
For example:
• If you wished to test whether the mean returns earned by
two investment strategies were equal over a study period.
The observations here are dependent in the sense there is
one observation for each strategy in each month, and both
strategies are likely to be related to some common market
risk factors.
57
d
t=
sd / n
There are n - 1 degrees of freedom
d is the mean of the difference between the paired observations.
s d is the standard deviation of the differences between the paired observations.
n is the number of paired observations.
• The statistical test requires that both populations are normally distributed
or the sample size of the sample n ≥ 30.
58
Example 9
Example 9: Solution
• We reject the null hypothesis and we can conclude that the difference in
mean returns was clearly statistically significant.
• The FTSE-10 and the FTSE-100 are not independent samples; in general,
the correlation of returns on the FTSE-10 and FTSE-100 should be
positive. Because the samples are dependent, a paired comparison test
was appropriate.
60
Test Statistic for testing a single population
proportion
• A Proportion is the fraction or percentage that indicates the part of the population
or sample having a particular trait of interest.
• The sample proportion is denoted by p and is found by x/n
• Assumptions in testing a population proportion using the z-distribution are met (in
the slides of the last lecture).
• The test statistic is computed as follows:
Hypothesized
Sample proportion population proportion
p -p
z =
p (1 - p )
n
Sample size 61
EXAMPLES
• The vice president of human resources wishes to know whether there
is a difference in the proportion of hourly employees who miss more
than 5 days of work per year at the Atlanta and the Houston plants.
63
64