Chapter 4
Chapter 4
STATISTICS FOR
BUSINESS AND SOCIAL
SCIENCES
CHAPTER 4:
HYPOTHESES
TESTING
HYPOTHESIS TESTING
DEFINITION
TERM DESCRIPTION
Hypothesis Statement about the value of the population parameter.
Null hypothesis, H0 Claim (or statement) about a population parameter that
assumed to be true until it is declared false.
Statistical hypothesis that states that there is no difference
between a parameter and specific value, or that there is no
difference between two parameters.
𝐇𝟎 : 𝛍 = 𝛍𝟎
where μ is the parameter and μ0 is a single number.
Alternative Hypothesis opposite to H0 and this hypothesis will be
hypothesis, H1 accepted if H0 is rejected. It is also known as the research
hypothesis.
Statistical hypothesis that states the existence of a
difference between a parameter and specific value, or that
there is a difference between two parameters.
The alternative hypothesis can take two form:
a) Non-directional
𝐇𝟏 : 𝛍 ≠ 𝛍𝟎 (two tailed-test)
b) Directional
𝐇𝟏 : 𝛍 > 𝛍𝟎 (right tailed-test)
𝐇𝟏 : 𝛍 < 𝛍𝟎 (left tailed-test)
Significance level The probability of rejecting the null hypothesis it is true. This
level is represented by the symbol α (alpha). The level of
confidence that correspond to α is 1 - α. Thus, a confidence
level of 95% corresponds to α of 5%.
Test statistic Single number calculated from the sample data as a basis in
deciding to reject or not to reject the null hypothesis.
The formula for the test statistic:
MEANS VARIANCE
1 sample 2
1 sample 2 samples
samples
Is variance known or
n≥30? Is variance known or
n≥30?
YES NO
YES NO
Critical value Value of the test statistic that separates the non-rejection region
(NRR) from the rejection region (RR)
0.10 – we have
some evidence
that is not true
HYPOTHESIS TESTING
CRITICAL VALUE APPROACH P - VALUE METHOD
- find critical value from statistical table - use p - value from minitab output
State the null and alternative State the null and alternative
hypothesis hypothesis
Assumption:
1. The population is normally distributed.
2. The sample is independent.
with df = n-1
A report claims that the mean starting salary of engineering graduates from bachelor degree is more than RM2250.
Test at 5% level of significance whether the data support the claim
EXAMPLE 3
The voltage readings of two types of altimeters are recorded for a period of 13 days. The means and variances of the
readings are:
Type I Type 2
Sample mean 40.33 42.54
Sample variance 1.54 2.96
Assume that the voltage readings are normally distributed for both types of altimeters. A researcher claims that the
variance of the voltage reading of type 2 altimeter is different from 4. Test whether the claim is correct at 5%
significance level.
EXAMPLE 4
Environmental testing is one of the attempts to test a component under conditions that closely simulates the
environment in which the component will be used. An electrical component is to be used in two different locations in
Country A. Before environmental testing can be conducted, it is necessary to determine the soil composition in these
localities. These data are obtained on the percentages of SiO2 by weight of the soil.
Soil Type I Soil Type 2
Number of sample 13 11
Sample mean 64.94 57.06
Sample variance 9 7.29
Assume the data is normally distributed for both populations. A researcher claims that the percentage of SiO2 in Soil
Type 1 is 64. Do the data contradict the researcher‟s claim? Test at the 1% significance level.
EXAMPLE 5
A set of facilitation tools to help with data analysis for problem solving is being developed a group of statisticians at
UiTM. In order to test the effectiveness of these tools, a group of research officers were asked to analyze and produce a
built – in report for a set of data on the computer. Twelve equally capable research officers were randomly selected and
six were randomly assigned a standard procedure to complete the task. The other six were asked to do the task using
the developed facilitation tools. The response measured was the time to completion (in minutes). The data collected
are shown below.
Group 1 (standard procedure) 61 69 68 74 58 63
Group 2 (facilitation tool) 32 42 40 34 38 33
Assume that the population distributions are normal. The statisticians claim that the average time to complete the task
using the developed facilitation tools is significantly less than 50 minutes. Do they have sufficient evidence to support
this claim at the 5% level of significance?
SPSS OUTPUT
EXAMPLE 6
A manufacturer claims that the average capacity for a certain type of battery is 140 ampere – hours. An agency wishes
to test the credibility of the manufacturer‟s claim and measures the capacity of ten randomly selected batteries from a
current production batch. The results, in ampere - hours are as follows:
One-Sample Statistics
N Mean Std. Deviation Std. Error Mean
Battery_capacity 10 139.290 1.6360 .5174
One-Sample Statistics
N Mean Std. Deviation Std. Error Mean
Thicknesses 10 9.990 .3348 .1059
a) Construct a 95% confidence interval for the mean thickness of steel bolts. Does the interval indicate that the mean
thickness of steel bolts is more than 9 mm?
b) Hence, perform a test of hypotheses at 5% significance level to test whether the industry is able to produce steel
bolts with mean that more than 9mm.
EXAMPLE 8
A food researcher claimed that the percentage starch content of a certain type is normally distributed with mean, μ
equals to 20%. In order to assess the mean value of the starch content a random sample of twelve potatoes is selected
and their starch content measured. The percentages of starch contents obtained were as follows:
One-Sample Statistics
N Mean Std. Deviation Std. Error Mean
Percentage 12 20.675 1.5130 .4368
One-Sample Test
Test Value = 20
98% Confidence Interval of the
Sig. (2- Mean Difference
t df
tailed) Difference
Lower Upper
Percentage 1.545 11 .151 .6750 -.512 1.862
Based on the p – value in the computer output, is there any evidence that the average percentage starch content of
potatoes is significantly different from 20 percent? Use α = 0.01.
EXAMPLE 9
A gas company claimed that the length of time between a gas leak being reported and an engineer arriving to
investigate the report has a mean of 113 minutes. An analysis has been conducted and the results are reposted in the
following tables.
One-Sample Statistics
Std. Std. Error
N Mean Deviation Mean
Time 8 130.25 50.230 17.759
One-Sample Test
Test Value = 113
95% Confidence Interval
Sig. (2- Mean of the Difference
t df tailed) Difference Lower Upper
Time .971 7 .364 17.250 -24.74 59.24
Test using the 5% level significance whether there is any evidence to support Azlan‟s belief.
TUTORIAL 1
1. At canon Food Corporation, it took an average of 50 minutes for new workers to learn a food processing job.
Recently the company installed a new food processing machine. The supervisor at the company wants to find if the
mean time taken by new workers to learn the food processing procedure on this new machine is different from 50
minutes. A sample of 40 workers showed that it took on average, 47 minutes for them to learn the food processing
procedure on the new machine with standard deviation of 7 minutes. At α = 0.01, test that the mean learning time
for the food processing procedure on the new machine is different from 50 minutes.
2. A study claims that all adults spend an average of 14 hours or more on chores during a weekend. A researcher
wanted to check if this claim is true. A random sample of 200 adults taken by this researcher showed that these
adults spend an average of 13.75 hours on chores during a weekend with a standard deviation of 3 hours. Test that
all adults spend less than 14 hours on chores during a weekend at 5% significance level.
3. The mayor of a large city claims that the average net worth of families living in this city is at least RM3,000. A
random sample of 100 families selected from this city produced a mean net worth RM2,880 with a standard
deviation of RM800. Using the 2.5% significance level, can you conclude that the mayor‟s claim is false?
4. According to report, there were 8.1 million unemployed people aged 18 years and over in August 2015. The
average duration of unemployment for these people was 16.3 weeks. Suppose that a recent random sample of 400
unemployed aged 18 years and over gave mean duration of unemployment of 16.9 weeks with a standard deviation
of 4.2 weeks. Test at the 2% significance level whether the current mean duration of unemployment exceeds 16.3
weeks.
5. According to an estimate, the average age of motorcycle owners was 38.1 years in 2014. A recent random sample
of 700 motorcycle owners yielded a mean age of 37 years with a standard deviation of 8 years. Testing at 1%
significance level, can you conclude that the current mean age of motorcycle owners is less than 38.1 years?
HYPOTHESIS TESTING FOR A DIFFERENCE BETWEEN TWO MEANS
(INDEPENDENT SAMPLE)
An Independent Sample t-test compares the means of two independent groups and normally
distributed.
a) The null hypothesis would be that the means are the same. (μ1 = μ2)
b) A low p-value indicating a sufficiently large difference between groups would suggest that
you reject the null hypothesis and conclude that the two groups are significantly different.
Two-tailed H1: μ1 ≠ μ2
Restaurant A 15 17 15 18 16 18
Restaurant B 12 14 13 15 15 19 13
a) Construct a 90% confidence interval for the ratio between the two variances. Explain whether the variances are
equal.
b) Using the result in part (a), determine if there are differences in the mean prices of set lunch in Restaurant A and
Restaurant B. Test at 10% level of significance.
EXAMPLE 11
The weights in kilograms (kg) of a sample of bricks produced by company A and company B are shown below.
a) Construct a 95% confidence interval for the ratio between the two variances. Justify whether the variances are equal.
b) Using the result in (a) test whether there are differences in the mean of the weight of bricks produced by company
A and company B. Use 10% level of significance.
EXAMPLE 12
To evaluate the performance of inspectors in a new company, 13 novice inspectors were chosen to evaluate 200
finished products. The same 200 products were evaluated by 13 experienced inspectors. The table below lists the
number of inspection errors made by each inspector.
Novice inspectors 30 45 35 31 26 33 40 29 36 21 20 48 41
Experienced inspectors 31 19 15 18 25 24 19 10 28 20 17 21 20
a) Construct a 90% confidence interval for the ratio between the two variances. Hence, explain whether the variances
are equal.
b) Based on the result in (a), can we conclude there are differences in the mean of the number of inspection errors for
both groups of inspectors? Use α = 0.10.
EXAMPLE 13
Manufacturing Company X and Y produce standard screws for a furniture company. A random sample of 9 screws is
selected from each company and the length of the screws is measured. The results, in cm, are as follows:
Assume both populations have equal variances, can we conclude that there are differences in the mean length of
screws produced by Company X and Y? Test at 10% level if significance.
EXAMPLE 14
Steel rods from company A and B are randomly selected and the length of the rods is measured. A sample of size nine
from company A gives a sample mean of 145 cm and estimated standard deviation of 9 cm. Meanwhile a sample of
size 13 from company B gives a sample mean of 152 cm and estimated standard deviation of 11 cm. The length of steel
rods is approximately normally distributed.
a) Construct a 95% confidence interval for the ratio between the two variances. Explain why we can assume that the
variances are equal.
b) Using the result on the variances in part (a), test at 5% significance level whether the mean length of steels rods by
Company A is longer than Company B.
SPSS OUTPUT
EXAMPLE 15
Manufacturing Company X and Y produce standard screws for a furniture company. A random sample of 9 screws is
selected from each company and the length of the screws is measured. The results (in cm) are as follows:
a) State the null and alternative hypothesis for the above test.
b) Based on the result, what is the assumption for the variances of the effective life between two temperatures? Use α
= 0.05.
c) Using the p – value, do the data provide sufficient evidence to indicate that there is a significant difference in
effective life between the two temperatures?
d) State the 95% confidence interval of the mean difference in effective life between the two temperatures. Does the
interval further prove the conclusion in (c)?
EXAMPLE 17
A researcher is interested in comparing the sodium content (in grams) in real cheese and substitute cheese. The data
for the two random samples are shown in the following table.
Group Statistics
Std. Std. Error
Cheese N Mean Deviation Mean
Sodium_content Real 8 193.12 132.151 46.722
Substitute 8 231.25 76.614 27.087
a) Based on the p – value in the Levene‟s Test, what can be concluded about the equality of variances?
b) Show that the test statistic value is – 0.032.
c) State the 99% confidence interval of the difference in the mean hardness of metals between Metal X and Metal Y.
d) Using the p – value, do the data provide sufficient evidence whether there is a significant difference in the mean
hardness metal? Does the confidence interval in (c) consistent with your answer in (d)? Give a reason to support
your answer.
EXAMPLE 19
An investigation was conducted into the dust content in the flue gases of two types of solid fuel boilers. Thirteen
boilers of Type X and nine boilers of Type Y were used under identical fuelling and extraction conditions. Over a similar
period, the following quantities (grams) of dust were deposited in similar traps inserted in each of the 22 flues. Assume
that these independent samples came from a normal population. The results are shown in the table.
Group Statistics
Type N Mean Std. Deviation Std. Error Mean
Dust_deposit X 13 63.831 10.6307 2.9484
Y 9 52.889 9.0044 3.0015
a) Using Levene‟s test, determine whether there is any difference between the variances of the population at 5% level
of significance.
b) Find a 95% confidence interval for the difference between the mean dust deposit of Type X and Type Y.
c) Is there any difference in the mean dust deposit between Type X and Type Y? Test at 5% significance level.
EXERCISE 2
1. The monthly income of a sample male and female tourist guides is summarized in the table below.
Is there any evidence at 10% significance level to indicate that female tourist guides earn more than male guides?
Assume that the populations are normally distributed.
2. The following information was obtained from two independent samples selected from two normally distributed
populations with unknown but equal standard deviation. Test at 1% significance level if ≠ .
Sample 1 27 39 25 33 21 35 30 26 25 31 35 30 28
Sample 2 24 28 23 25 24 22 29 26 29 28 19 29
3. A consumer agency wanted to estimate the difference in the mean amounts of caffeine in two brands of coffee. The
agency took a sample of 15 one – pound jars of Brand I coffee that showed the mean amount of caffeine in these
jars to be 80 mg per jar with a standard deviation of 5 mg. Another sample of 12 one – pound jars of Brand II coffee
gave a mean amount of caffeine equal to 77 mg per jar with a standard deviation of 6 mg. Test at the 1%
significance level whether the mean amount of caffeine are different for those two brands. Assume that the two
populations are normally distributed and that the standard deviations of the two populations are equal.
4. A random sample of 20 cleaning persons produced the mean hourly earnings of RM10.60 with a standard deviation
of RM1.02. A random sample of 25 technicians gave the mean hourly earnings of RM11.57 with a standard deviation
of RM1.34. Assume that the hourly earnings of both groups are normally distributed with different population
standard deviations. Using 5% significance level, can you conclude that the mean hourly earnings of all cleaning
persons are lower than those of all technicians in this state?
EXERCISE 2
5. A study is conducted to compare the effect of stress in the form of noise on the ability to perform a simple task by
50 subjects. The first group of 25 subjects acted as a control, while the second group of 25 was the experimental
group. Both group subjects had to perform the task but only the experimental group subjects had to perform the
task with loud music played on. The time to finish the task was recorded for each subject and the following
summary was obtained.
Control Experimental
Number of samples 25 25
Sample mean 15 minutes 23 minutes
Sample standard deviation 4 minutes 10 minutes
Assume that the time to finish the task follows a normal distribution for both populations.
a) Construct a 95% confidence interval for the ratio between the two variances. Explain why we can assume that
the variances are equal.
b) Using the result on the variances in part (a), test at α = 0.05 whether the mean time to finish the task for the
experimental group is higher than the control group.
6. The average salary offered to college students who graduated in 2014 was RM4,373 to Civil Engineering (EC) and
RM4,029 to Computer Science (CS). Assume that these means are based on samples of 900 EC students and 1,200
CS students and that the sample standard deviations for the two samples are RM220 and RM195, respectively. Test
at the 2.5% significance level if the mean salary offered to college students who graduated in 2014 with EC is higher
than that for CS. Assuming that the populations are approximately normally distributed.
7. A company is considering installing new machines to assemble its product. The company is considering two types
of machines but it will buy only one type. The company selected eight assembly workers and asked them to use
these two types of machines to assemble products. The following table gives the time taken (in minutes) to
assemble one unit of the product on each type of machine for each of these eight workers.
Machine I 23 26 19 24 27 22 20 18
Machine II 21 24 23 25 24 28 24 23
Test at 5% significance level whether the mean times taken assemble a unit of the product are different for the
two types of machines.
HYPOTHESIS TESTING FOR A DIFFERENCE BETWEEN TWO MEANS
(DEPENDENT SAMPLE)
The dependent t-test or paired sample t-test is used to evaluate that are related to each other.
For example, data from a group of students who are given the pre-test and post-test would be analyzed using a paired
sample t-test.
Assumption:
1. The paired differences are normally distributed.
2. The paired differences represent a random sample from the population.
Two-tailed H0: μD ≠ D0
Test at 5% level of significance to determine whether the mean weight loss for all persons due to this special exercise
program is greater than zero.
EXAMPLE 21
A private agency claims that the crash course it offers significantly increases the writing speed of secretaries. The
following table gives the scores of eight secretaries before and after they attended this course.
Before 81 75 89 91 65 70 90 64
After 97 72 93 110 78 69 115 72
Using the 5% significance level, can you conclude that attending this course increases the writing speed?
EXAMPLE 22
A company wanted to know if attending a course on “how to be a successful salesperson” can increase the average
sales of its employees. The company sent six of its salesperson to attend this course. The following table gives the one
– week sales of these salespersons before and after they attended this course.
Before 12 18 25 9 14 16
After 18 24 24 14 19 20
Using the 1% significance level, can you conclude that the mean weekly sales for all salespersons increase as a result of
attending this course? Assume that the population of paired differences has a normal distribution.
SPSS OUTPUT
EXAMPLE 23
A company wanted to know if attending a course on “how to be successful salesperson” can increase the average sales
of its employees. The company sent six of its salespersons to attend this course. The following table gives the one-
week sales of these salespersons before and after they attended this course.
Using p – value in the SPSS output, test whether attending this course increases the mean score of employees.
EXAMPLE 25
A researcher wanted to find the effect of a special diet on systolic blood pressure. She selected a sample of seven
adults and put them on this dietary plan for three months. The following table gives the systolic blood pressures of
these seven adults before and after the completion of this plan.
Paired Samples Statistics
Mean N Std. Deviation Std. Error Mean
Pair 1 Before 208.43 7 18.100 6.841
After 203.43 7 21.078 7.967
A researcher wants to determine whether there is a significant difference in the Body Mass Index (BMI) between male
and female. A survey as conducted on 80 patients at Tawakal Health Centre. The collected data analyzed using SPSS.
The partial output indicated in the following table.
Group Statistics
Gender Mean N Std. Deviation Std. Error Mean
Male 27.0375 40 4.41911 0.69872
BMI
Female 24.0175 40 4.00031 0.63251
The quantity of information contained in a sample is affected by various factors that the
experimenter may or may not be able to control.
The analysis of variance is used to determine how different experimental factors affect the
average response.
The total variation in the response measurements is broken into portions that can be attributed to
various factors.
These portions are used to judge the effect of the various factors on the experimental response.
EXPERIMENTAL DESIGN OF ANOVA
Extension of two
Extension of paired-test
independent sample t-test
(two-way ANOVA)
(one-way ANOVA)
Involve only two factors
and their effect on the
response. (two-way ANOVA
with interaction)
BLOCKING EXPERIMENTAL UNIT FACTOR
a technique to include other object on which a
factors in an experiment which an independent variable,X whose
measurement(s) is taken. values are controlled and varied
contribute to undesirable
variation. by the experimenter
REPLICATION LEVEL
a repetition of the basic the intensity setting of a
experiment.
TERM
factor.
RANDOMIZATION
a random process of assigning
treatments to the experimental
units.
TREATMENT/ FACTOR
RESPONSE, Y LEVELS
the variable being specific combination of
measured by the factor levels.
experimenter.
COMPARISON AMONG DESIGNS
Two Factor Factorial
Completely Randomized Completely Randomized Experiment
Design (CRD) Block Design (CRBD)
(a x b Factorial Experiment)
Objective of CRD
To compare three or more treatment means by using analysis of variance (ANOVA) F-test
Limitation
F-test can only show whether or not a difference exists among the three of more means. It cannot reveal where the
difference lies.
ONE-WAY ANOVA (COMPLETELY RANDOMIZED
DESIGN)
Assume comparing k different treatments or k normal populations. Let random samples of size n1, n2, …, nk are drawn
from k populations with mean µ1, µ2, …, µk and common variance σ2.
Group 1
10 students
(TREATMENT 1)
Group 2
Randomly assigned 30 students to 10 students
3 groups (TREATMENT)
(TREATMENT 2)
Group 3
10 students
(TREATMENT 3)
Are the k population means the same or is at least one mean differ from others?
Treatment, TR
(Between treatment)
Error, E
(within treatment)
Total
STEP 1: Hypothesis Statement
A B C D
86 89 100 88
91 98 98 92
88 97 101 94
Based on the above data:
a) Identify the following item:
i. Experimental unit
ii. Treatment
iii. Factor
iv. Response
b) Determine the type of experimental design used in this study.
c) Construct an ANOVA table.
d) Test at 5% level of significance whether the average yields are equal.
The following are the mileages recorded during series of road test on three new models of luxury sedans.
Model
A B C
22 28 29
26 24 32
27 29 28
Method A 25 38 42 65 47 52
Method B 15 21 19 25
Method C 44 39 54 58 73
A team of researchers interest to compare the yield (in kilograms) of four different varieties (A, B, C and D) of a
rambutan tree in Kg Hutan Kampung orchard. The researchers obtain a random sample of four trees of each variety
from the same orchard. The data were analyzed by using IBM SPSS Statistics. The result given as below.
ANOVA
Source Sum of Squares df Mean Square F Sig.
Between Groups R 3 125.729 U 0.293
Within Groups 1083.250 S 90.271
Total 1460.438 T
Complete the ANOVA table by finding the appropriate values (x) and (y). Test at 5% level of significance if
there is any difference among the four types of insulating fluids.
2. An experiment is carried out to examine the safety of compact cars, midsize cars and full – size cars. A
sample of three for each of the types of cars was randomly selected to perform the test. The data on the
mean pressure (Pa) applied to the driver‟s head during a crash test is recorded and it is shown below.
Pressure (Pa)
Compact Car Midsize Car Full – size Car
643 469 484
655 427 456
702 525 402
Qualitative Variable 2
Qualitative Variable 1
1 2 … c
1 O11 O12 … O1C
2 O21 O22 … O2C
R Or1 Or2 … OrC
GENERAL STEPS:
where
FOR 2 x 2 otherwise
contingency table Oij = observed frequencies / counts
Eij = expected frequencies/ counts
At 5% significance level, investigate if there is sufficient evidence to indicate that type of furniture sold and the
location of stores are dependent.
A total of 120 computer defects were recorded and the defects were classified into three types (A, B and C). the
production shift were each computer was manufactured was also identified. The counts are recorded in the
following contingency table. At 2.5% level of significance, is there enough evidence that there is an association
between production shift and type of defects?
Alcohol Consumption
Gender
Low Moderate High
Male 10 9 8
Female 13 16 12
At 5% significance level, can the researcher conclude that alcohol consumption is related to gender?
A researcher wants to know whether hotel guest‟s response is dependent on hotel location. A sample of hotel
guests at three locations (urban, semi – urban, rural) was asked to indicate whether they would return to the
hotel or would not return to the hotel. The following tables provide the result of the analysis done using SPSS
software. Based on the output, answer the following questions:
Response * Location Crosstabulation
Location
urban semiurban rural Total
response yes Count 46 36 12 94
Expected Count 40.2 29.2 24.5 94.0
no Count Y 20 35 86
Expected Count 36.8 26.8 X 86.0
Total Count 77 56 47
Expected Count 77.0 56.0 47.0
Chi-Square Tests
Value df Asymp. Sig. (2-sided)
Pearson Chi-Square Z 2 .000
Likelihood Ratio 18.975 2 .000
Linear-by-Linear Association 11.257 1 .001
N of Valid Cases 180
By pooling a random sample of 316 undergraduate students, a campus press obtains the following frequency
counts regarding students‟s attitude towards a proposed change in dormitory regulations.
Do the data provide sufficient evidence to conclude that gender ad opinions of adults are dependent at 1%
significance level?
2. A sample of 800 workers was asked their marital status (single or married) and their working shift (morning,
afternoon, or night). The information obtained was recorded in the following two-way classification table.
Marital Morning Afternoon Night
status shift shift shift
Single 160 60 80
Married 360 100 40
Test at the 5% significance level whether the marital status and the working shift are related.
3. A well-known hotel is being investigated to determine if its recruitment is gender biased. The following table
shows the classifications of applications for management, secretarial and chef positions according to gender
and the results of the interview.
Job category
Gender
Offered Denied
Male 70 30
Female 50 80
Test the null hypothesis that there is no gender bias in any job category. Use α = 0.01.
4. A sample of employees at a large travel agency was asked to indicate a preference for one of three pension
plans. The results are given in the following table:
Moderate Heavy
Nonsmokers
smokers smokers
Hypertension 21 36 30
No
48 26 19
hypertension
a) Specify the hypotheses.
b) At 5% significance level, can we conclude that there is association between the pension plan selected and
job class?
5. A study is conducted to test for the independence between road accident injuries and type of road. These
data were obtained from records of 500 selected accidents. Based on 5% level of significance, is there any
evidence of an association between the variables?
Type of road
Type of
Federal / Highway
injury State
City
None or minor 60 20 15
Major 100 88 102
Fatal 45 22 48
6. In an experiment to study the dependence of hypertension on smoking habits, the following data were taken
from 180 individuals. Test the hypothesis that the presence or absence of hypertension is independent of
smoking habits. Use 5% level of significance. Pension Plan
Job class Plan Plan Plan
A B C
Supervisor 10 13 29
Clerical 19 80 19
Labor 81 57 22