Exam
Exercise 2:
• Step 1: Organize the data. Sort the data in ascending order:
5395, 5538, 5561, 5612, 5710, 5816, 6045, 6106, 6265, 6320, 6341, 6632, 6908, 7220, 7361, 7439
• Step 2: Calculate the five-number summary:
1. Minimum: The smallest value is 5395.
2. Maximum: The largest value is 7439.
3. Median: The median is the middle value of the sorted data. Since there are 16 data points, the median is the average of the 8th and 9th values:
$\text{Median} = \frac{6106 + 6265}{2} = \frac{12371}{2} = 6185.5$
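4. First Quartile (Q1): Q1 is the median of the first half of the data (lower 8 values: 5395, 5538, 5561, 5612, 5710, 5816, 6045, 6106), that is, the average of the 4th and 5th values:
$Q_1 = \frac{5612 + 5710}{2} = \frac{11322}{2} = 5661$
5. Third Quartile (Q3): Q3 is the median of the second half of the data (upper 8 values: 6265, 6320, 6341, 6632, 6908, 7220, 7361, 7439), that is, the average of the 12th and 13th values:
$Q_3 = \frac{6632 + 6908}{2} = \frac{13540}{2} = 6770$
• Five-number summary:
Minimum: 5395, Q1: 5661, Median: 6185.5, Q3: 6770, Maximum: 7439.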
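As a cross-check, the five-number summary can be recomputed directly from the data. The sketch below assumes the median-of-halves convention described above (each quartile is the median of one half of the sorted data); the helper name `five_number_summary` is illustrative only, and other quartile conventions give slightly different values.

```python
from statistics import median

# Data from Exercise 2, as listed in Step 1.
data = [5395, 5538, 5561, 5612, 5710, 5816, 6045, 6106,
        6265, 6320, 6341, 6632, 6908, 7220, 7361, 7439]


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    xs = sorted(values)
    n = len(xs)
    half = n // 2
    lower = xs[:half]            # lower half of the sorted data
    upper = xs[half + n % 2:]    # upper half; the middle value is skipped when n is odd
    return xs[0], median(lower), median(xs), median(upper), xs[-1]


summary = five_number_summary(data)
print(summary)
```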
• Step 3: Calculate other statistical measures:
Mean:
$\text{Mean} = \frac{\text{Sum of data}}{\text{Number of data points}}$
Sum of data = 5395 + 5538 + 5561 + 5612 + 5710 + 5816 + 6045 + 6106 + 6265 + 6320 + 6341 + 6632 + 6908 + 7220 + 7361 + 7439 = 100,269
$\text{Mean} = \frac{100{,}269}{16} \approx 6266.81$
Mode: The mode is the most frequently occurring value. In this dataset, no value repeats, so there is no mode.
Range:
$\text{Range} = \text{Maximum} - \text{Minimum} = 7439 - 5395 = 2044$
• Variance and Standard Deviation: The population variance is calculated as:
$\sigma^2 = \frac{\sum (x_i - \text{Mean})^2}{n}$
• Steps:
Find the squared deviation for each value. For example, for 5395:
$(5395 - 6266.8125)^2 = (-871.8125)^2 \approx 760{,}057.04$ (repeat for all values)
Sum of squared deviations ≈ 6,839,164.44
Variance: $\sigma^2 = \frac{6{,}839{,}164.44}{16} \approx 427{,}447.78$
Standard deviation: $\sigma = \sqrt{427{,}447.78} \approx 653.79$
• Step 4: Shape of the Distribution:
To determine the shape, consider the relationship between the mean, median, and mode:
Mean ≈ 6266.81, Median = 6185.5, no mode.
⁃ The mean is slightly greater than the median, which suggests that the distribution is slightly skewed to the right.
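The descriptive measures in Steps 3 and 4 can be recomputed from the raw data as a check on the hand arithmetic. This is a minimal sketch using Python's standard `statistics` module; it assumes the population formulas (division by n), matching the variance formula used above, and the variable names are illustrative.

```python
from statistics import mean, median, pstdev, pvariance

# Data from Exercise 2.
data = [5395, 5538, 5561, 5612, 5710, 5816, 6045, 6106,
        6265, 6320, 6341, 6632, 6908, 7220, 7361, 7439]

avg = mean(data)                      # sum of the data divided by n
data_range = max(data) - min(data)    # maximum minus minimum
var_pop = pvariance(data)             # population variance (divides by n)
sd_pop = pstdev(data)                 # square root of the population variance
has_mode = len(set(data)) < len(data) # True only if some value repeats

# Rough shape check: mean above the median hints at a right skew.
skew_hint = "right" if avg > median(data) else "left or symmetric"

print(f"sum={sum(data)}, mean={avg:.2f}, range={data_range}")
print(f"variance={var_pop:.2f}, sd={sd_pop:.2f}, mode present={has_mode}")
print(f"skew hint: {skew_hint}")
```

Comparing the mean with the median is only a quick heuristic for skewness; a histogram or box plot of the data would give a fuller picture.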
Exercise 3:
To construct a 99% confidence interval for the population mean, we use the following formula:
$\bar{x} \pm t \cdot \frac{s}{\sqrt{n}}$
Where:
• $\bar{x}$: the sample mean
• $t$: the t-value for the given confidence level and degrees of freedom
• $s$: the sample standard deviation
• $n$: the sample size
We have:
• $\bar{x} = 9.75$
• $t = 2.724$ (for 99% confidence and df = 35)
• $s = 2.39$
• $n = 36$
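Standard error:
$SE = \frac{s}{\sqrt{n}} = \frac{2.39}{\sqrt{36}} \approx 0.3983$
Margin of error:
$ME = t \cdot SE = 2.724 \cdot 0.3983 \approx 1.085$
99% confidence interval:
$\bar{x} \pm ME = 9.75 \pm 1.085 = (8.665, 10.835)$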
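The interval calculation can be sketched in a few lines. The code below takes the t-value 2.724 from the text as a table lookup rather than computing it, since the Python standard library has no t-distribution; variable names are illustrative.

```python
import math

# Values stated in Exercise 3.
x_bar = 9.75     # sample mean
s = 2.39         # sample standard deviation
n = 36           # sample size
t_crit = 2.724   # table value for 99% confidence, df = 35 (taken from the text)

se = s / math.sqrt(n)        # standard error of the mean
margin = t_crit * se         # margin of error
ci = (x_bar - margin, x_bar + margin)

print(f"SE={se:.4f}, ME={margin:.3f}, CI=({ci[0]:.3f}, {ci[1]:.3f})")
```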
Exercise 4:
• Step 1: Identify the null and alternative hypotheses. The hypotheses are:
• Null Hypothesis (H0): μ=14,050 (The mean cost is $14,050.)
• Alternative Hypothesis (Ha): μ≠14,050 (The mean cost is not $14,050.)
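The test statistic is:
$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$
Where:
• $\bar{x} = 13{,}795$ (sample mean)
• $\mu = 14{,}050$ (population mean under $H_0$)
• $\sigma = 2875$ (population standard deviation)
• $n = 500$ (sample size)
$\frac{\sigma}{\sqrt{n}} = \frac{2875}{\sqrt{500}} \approx 128.57$
$z = \frac{13{,}795 - 14{,}050}{128.57} = \frac{-255}{128.57} \approx -1.98$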
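The z statistic can be checked numerically. The sketch below also computes a two-sided p-value from the standard normal distribution via `math.erfc`; the p-value is an addition for illustration and is not part of the original solution, which does not state a significance level at this point.

```python
import math

# Values stated in Exercise 4.
x_bar = 13795    # sample mean
mu0 = 14050      # hypothesized population mean under H0
sigma = 2875     # population standard deviation
n = 500          # sample size

se = sigma / math.sqrt(n)     # standard error of the mean
z = (x_bar - mu0) / se        # z test statistic

# Two-sided p-value: P(|Z| > |z|) for a standard normal Z.
p_two_sided = math.erfc(abs(z) / math.sqrt(2))

print(f"SE={se:.2f}, z={z:.2f}, two-sided p={p_two_sided:.4f}")
```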
Exercise 5:
• Step 1: Identify the null and alternative hypotheses:
Let $\mu_d$ represent the mean difference in vertical jump heights (after − before).
• $H_0: \mu_d = 0$ (The shoes do not improve vertical jump height.)
• $H_a: \mu_d > 0$ (The shoes improve vertical jump height.)
This is a one-tailed test at α = 0.10.
The formula for the paired t-test statistic is:
$t = \frac{\bar{D}}{s_D / \sqrt{n}}$
Where:
• $\bar{D}$ = mean of the differences.
• $s_D$ = standard deviation of the differences.
• $n = 8$ = number of athletes.
$\bar{D} = \frac{\sum D}{n} = \frac{2 + 3 + 0 + 1 + (-2) + 2 + 5 + 3}{8} = \frac{14}{8} = 1.75$
• $(D_i - \bar{D})^2$ = [0.0625, 1.5625, 3.0625, 0.5625, 14.0625, 0.0625, 10.5625, 1.5625]
• $\sum (D_i - \bar{D})^2 = 31.5$
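The remaining steps of the paired t-test follow mechanically from the differences listed above. This is a minimal sketch, assuming the sample standard deviation (division by n − 1) for the differences; the critical value 1.415 is an assumption taken from a standard t table (one-tailed, α = 0.10, df = 7) and does not appear in the original solution.

```python
import math
from statistics import mean, stdev

# Differences (after - before) for the 8 athletes, from Exercise 5.
diffs = [2, 3, 0, 1, -2, 2, 5, 3]

n = len(diffs)
d_bar = mean(diffs)                               # mean difference
sum_sq = sum((d - d_bar) ** 2 for d in diffs)     # sum of squared deviations
s_d = stdev(diffs)                                # sample SD: sqrt(sum_sq / (n - 1))
t_stat = d_bar / (s_d / math.sqrt(n))             # paired t statistic

t_crit = 1.415   # assumption: t-table value, one-tailed, alpha = 0.10, df = 7
reject_h0 = t_stat > t_crit

print(f"d_bar={d_bar}, sum_sq={sum_sq}, s_d={s_d:.4f}, t={t_stat:.3f}")
print("reject H0" if reject_h0 else "fail to reject H0")
```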
Exercise 6:
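$r = \frac{n\sum xy - \sum x \sum y}{\sqrt{[n\sum x^2 - (\sum x)^2][n\sum y^2 - (\sum y)^2]}}$
Where:
• $n = 10$ (number of data points)
• $x$ = hours spent studying
• $y$ = test scores
$r = \frac{10(4093) - (51)(747)}{\sqrt{[10(361) - 51^2][10(57491) - 747^2]}}$
$r = \frac{40930 - 38097}{\sqrt{[3610 - 2601][574910 - 558009]}}$
$r = \frac{2833}{\sqrt{[1009][16901]}}$
$r = \frac{2833}{\sqrt{17053109}} \approx \frac{2833}{4129.54} \approx 0.686$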
For $n - 2 = 8$ degrees of freedom and α = 0.05, the critical t-value is $t_{\text{critical}} = 2.306$.
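The correlation coefficient and its significance test can be reproduced from the summary sums used above. The sketch assumes the usual test statistic for a correlation, $t = r\sqrt{n-2}/\sqrt{1-r^2}$, which the original does not write out; the critical value 2.306 is the one quoted in the text.

```python
import math

# Summary sums from Exercise 6.
n = 10
sum_x, sum_y = 51, 747
sum_xy, sum_x2, sum_y2 = 4093, 361, 57491

numerator = n * sum_xy - sum_x * sum_y
denominator = math.sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
r = numerator / denominator                           # Pearson correlation

# t statistic for H0: no linear correlation, with n - 2 degrees of freedom.
t_stat = r * math.sqrt(n - 2) / math.sqrt(1 - r ** 2)
t_crit = 2.306                                        # two-tailed, alpha = 0.05, df = 8 (from the text)
significant = abs(t_stat) > t_crit

print(f"r={r:.3f}, t={t_stat:.3f}, significant={significant}")
```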
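• Step 3: Find the regression line equation. The regression line equation is:
$y = a + bx$
Where:
• $b = \frac{n\sum xy - \sum x \sum y}{n\sum x^2 - (\sum x)^2}$
• $a = \frac{\sum y - b\sum x}{n}$
$b = \frac{10(4093) - (51)(747)}{10(361) - 51^2} = \frac{2833}{1009} \approx 2.81$
$a = \frac{747 - 2.81(51)}{10} \approx 60.37$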
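The slope and intercept formulas above translate directly into code. In this sketch the `predict` helper is hypothetical and only illustrates how the fitted line would be used. Note that carrying the unrounded slope through gives an intercept near 60.38, while using the rounded slope 2.81 gives 60.37.

```python
# Summary sums from Exercise 6.
n = 10
sum_x, sum_y = 51, 747
sum_xy, sum_x2 = 4093, 361

# Least-squares slope and intercept for y = a + b*x.
b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
a = (sum_y - b * sum_x) / n


def predict(hours):
    """Predicted test score for a given number of study hours (illustrative helper)."""
    return a + b * hours


print(f"b={b:.4f}, a={a:.4f}")
print(f"predicted score for 5 hours: {predict(5):.1f}")
```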
Final Results
1. Scatter Plot: The scatter plot shows a positive correlation between
hours spent studying and test scores.
2. Correlation Coefficient: r ≈ 0.686, significant at α = 0.05.
3. Regression Line: y = 60.37 + 2.81x.
Exercise 7:
4. Critical Value at α = 0.01: From the chi-square table, the critical value for df = 4 and α = 0.01 is approximately 13.28.
After calculating for all cells (details omitted for brevity): $\chi^2 \approx 15.87$
5. Decision Rule:
• If $\chi^2$ > critical value, reject $H_0$.
• Here: $\chi^2 = 15.87 > 13.28$, so $H_0$ is rejected.
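The decision can also be expressed through a p-value. For an even number of degrees of freedom the chi-square upper-tail probability has a closed form, $P(X > x) = e^{-x/2}\sum_{i=0}^{df/2-1}\frac{(x/2)^i}{i!}$, so no statistics library is needed. The sketch below assumes only the statistic 15.87 and df = 4 quoted above; the function name `chi2_sf_even_df` is illustrative.

```python
import math


def chi2_sf_even_df(x, df):
    """Upper-tail probability P(X > x) for a chi-square variable with even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form needs a positive even df")
    half = x / 2
    terms = sum(half ** i / math.factorial(i) for i in range(df // 2))
    return math.exp(-half) * terms


chi2_stat, df, alpha = 15.87, 4, 0.01     # values quoted in Exercise 7
p_value = chi2_sf_even_df(chi2_stat, df)
reject_h0 = p_value < alpha               # equivalent to chi2_stat > critical value

print(f"p-value={p_value:.4f}, reject H0={reject_h0}")
```

Evaluating the same function at the critical value 13.28 returns a tail probability of about 0.01, which is consistent with the table lookup.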
Step 1: Hypotheses
1. Null Hypothesis ($H_0$): The mean costs per ounce for the three stain removal groups (very good, good, and fair) are the same.
$H_0: \mu_{\text{Very Good}} = \mu_{\text{Good}} = \mu_{\text{Fair}}$
2. Alternative Hypothesis ($H_a$): At least one mean cost per ounce is different.
$H_a$: At least one $\mu$ differs.
Step 2: Organize Data
Group the data:
• Very Good: 0.47, 0.49, 0.41, 0.37, 0.48, 0.51 ($n_1 = 6$)
• Good: 0.60, 0.64, 0.58, 0.75, 0.46 ($n_2 = 5$)
• Fair: 0.34, 0.46, 0.44, 0.60 ($n_3 = 4$)
Step 3: Compute Group Means and Overall Mean
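1. Calculate the group means:
• $\bar{X}_{\text{Very Good}} = \frac{0.47 + 0.49 + 0.41 + 0.37 + 0.48 + 0.51}{6} = \frac{2.73}{6} = 0.455$
• $\bar{X}_{\text{Good}} = \frac{0.60 + 0.64 + 0.58 + 0.75 + 0.46}{5} = \frac{3.03}{5} = 0.606$
• $\bar{X}_{\text{Fair}} = \frac{0.34 + 0.46 + 0.44 + 0.60}{4} = \frac{1.84}{4} = 0.46$
2. Calculate the overall mean:
$\bar{X} = \frac{2.73 + 3.03 + 1.84}{15} = \frac{7.60}{15} \approx 0.507$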
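4. Degrees of Freedom:
• $df_B = k - 1 = 3 - 1 = 2$
• $df_W = N - k = 15 - 3 = 12$
• $df_T = N - 1 = 15 - 1 = 14$
5. Mean Squares, with $SSB = \sum_j n_j(\bar{X}_j - \bar{X})^2 \approx 0.07406$ and $SSW = \sum_j \sum_i (x_{ij} - \bar{X}_j)^2 \approx 0.09267$:
• $MSB = \frac{SSB}{df_B} = \frac{0.07406}{2} \approx 0.03703$
• $MSW = \frac{SSW}{df_W} = \frac{0.09267}{12} \approx 0.007723$
6. F-Statistic:
$F = \frac{MSB}{MSW} = \frac{0.03703}{0.007723} \approx 4.8$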
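The one-way ANOVA quantities can be recomputed from the grouped data in Step 2 as a check on the sums of squares. This is a minimal sketch using only the standard library; it stops at the F statistic because the critical value and significance level are not given in this part of the solution.

```python
from statistics import mean

# Cost per ounce by stain-removal rating, from Step 2.
groups = {
    "Very Good": [0.47, 0.49, 0.41, 0.37, 0.48, 0.51],
    "Good": [0.60, 0.64, 0.58, 0.75, 0.46],
    "Fair": [0.34, 0.46, 0.44, 0.60],
}

all_values = [x for g in groups.values() for x in g]
grand_mean = mean(all_values)
k, N = len(groups), len(all_values)

# Between-group sum of squares: group size times squared distance of
# each group mean from the overall mean.
ssb = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values())

# Within-group sum of squares: squared distance of each value from its own group mean.
ssw = sum((x - mean(g)) ** 2 for g in groups.values() for x in g)

msb = ssb / (k - 1)     # mean square between, df = k - 1
msw = ssw / (N - k)     # mean square within, df = N - k
f_stat = msb / msw

print(f"SSB={ssb:.5f}, SSW={ssw:.5f}")
print(f"MSB={msb:.5f}, MSW={msw:.6f}, F={f_stat:.2f}")
```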