0% found this document useful (0 votes)
3 views2 pages

Assessment 4

The document outlines an assessment on hypothesis testing covering various statistical tests including Z, t, F, and Chi-square tests. It includes multiple scenarios requiring hypothesis testing to evaluate proportions, means, variances, and independence based on given data. The assessment consists of five main questions, each requiring statistical analysis at a 5% significance level.

Uploaded by

matlabdec12
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views2 pages

Assessment 4

The document outlines an assessment on hypothesis testing covering various statistical tests including Z, t, F, and Chi-square tests. It includes multiple scenarios requiring hypothesis testing to evaluate proportions, means, variances, and independence based on given data. The assessment consists of five main questions, each requiring statistical analysis at a 5% significance level.

Uploaded by

matlabdec12
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Hypothesis Testing (Z, t, F, χ2)

Assessment-4
Total Marks:8

Faculty: Prof.Anitha G
Course code: BMAT202P
Class Ids: CH202425050100, CH2024250500996
Slots: L7+L8 , L37+L38

1. i) Suppose that 12% of apples harvested in an orchard last year was


rotten. 30 out of 214 apples in a harvest sample this year turns out to
be rotten. At .05 significance level, can we reject the null hypothesis
that the proportion of rotten apples in harvest stays below 12% this
year?
ii) Suppose the food label on a cookie bag states that there is at most 2
grams of saturated fat in a single cookie. In a sample of 35 cookies,
it is found that the mean amount of saturated fat per cookie is 2.1
grams. Assume that the population standard deviation is 0.25 grams.
At 0.05 significance level, can we reject the claim on food label?
2. i) A researcher wants to compare the average test scores of students
from two different classes that followed different teaching methods.
Class A had 25 students, while Class B had 30 students. The test
scores for each student were recorded as follows:
Class A (n = 25): 72, 68, 75, 80, 79, 85, 77, 82, 90, 88, 76, 81, 74,
69, 87, 84, 83, 78, 86, 73, 71, 89, 91, 92, 70.
Class B (n = 30): 65, 70, 68, 72, 66, 74, 75, 78, 79, 71, 77, 69, 67,
73, 76, 80, 81, 85, 82, 64, 62, 63, 86, 88, 83, 90, 87, 79, 68, 74.
At a 5% significance level, can we conclude that there is a significant
difference in the mean test scores between the two classes?
ii) The number of company malfunctions per day is recorded for 260
days with the following results.
No. of malfunctions (xi ) 0 1 2 3 4 5
No. of days (fi ) 77 90 55 30 5 3
Test the goodness of fit of an appropriate probability model.

1
3. A fitness coach wants to evaluate the effectiveness of a 6-week high-
intensity workout program in improving endurance. A group of 20 partic-
ipants was tested for the time (in minutes) they could run on a treadmill
before and after completing the program.
• Generate a dataset for 20 participants using random or sample num-
bers, ensuring:
– The “before workout” times are between 8 to 15 minutes.
– The “after workout” times are generally expected to be higher,
between 10 to 20 minutes.
• Conduct a paired t-test at a 5% significance level to determine whether
the training program significantly improved endurance.
• Show the generated dataset (before and after workout times). Com-
pute and report the test statistic and p-value. Provide a conclusion:
Does the data suggest a significant improvement in endurance?
4. A company wants to analyze whether salary variability differs between
the Technology and Healthcare industries. Two independent samples
of salaries (in thousands of dollars) are collected. The goal is to determine
whether the two industries have significantly different salary variances.

• Generate two random datasets, each containing 20 salary values:


– Technology Industry: Salaries should be between 60K to 150K.
– Healthcare Industry: Salaries should be between 50K to 130K.
• Conduct an F-test at a 5% significance level to check if there is a
significant difference in salary variance between the two industries.
• Show the generated salary dataset for both industries.
• Compute and report the F-statistic and p-value.
• Provide a conclusion: Can we reject the null hypothesis and conclude
that salary variability is different between the two industries?

5. A company wants to determine whether Machine Type and Malfunc-


tion Severity are independent attributes. The following contingency
table provides data collected over 200 cases:

Machine Type Minor Moderate Severe Total


Type A 30 50 20 100
Type B 20 40 40 100
Total 50 90 60 200

Perform a Chi-square test for independence at a 5% significance level


to determine whether the Machine Type and Malfunction Severity are
independent.

You might also like