0% found this document useful (0 votes)
13 views2 pages

Answer All

The document presents a detailed regression analysis to find the relationship between delivery time and distance, resulting in the regression equation ŷ = 0.889 + 0.00275X, with a coefficient of determination (R²) of approximately 0.869, indicating a strong positive correlation. Additionally, it calculates the sample size needed for estimating a population proportion with a specified margin of error and confidence level, concluding that a sample size of 8741 is required. The analysis demonstrates a significant linear relationship between distance and delivery time, as well as the necessary sample size for accurate estimations.

Uploaded by

charlesmalawo32
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views2 pages

Answer All

The document presents a detailed regression analysis to find the relationship between delivery time and distance, resulting in the regression equation ŷ = 0.889 + 0.00275X, with a coefficient of determination (R²) of approximately 0.869, indicating a strong positive correlation. Additionally, it calculates the sample size needed for estimating a population proportion with a specified margin of error and confidence level, concluding that a sample size of 8741 is required. The analysis demonstrates a significant linear relationship between distance and delivery time, as well as the necessary sample size for accurate estimations.

Uploaded by

charlesmalawo32
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

A) Regression Analysis

Here's the breakdown of the calculations for part A:


1. Find the regression equation (3 marks)
To find the regression equation (which takes the form ŷ = a + bX, where ŷ is the predicted
delivery time, a is the y-intercept, b is the slope, and X is the distance in miles), we need to
calculate the following:
●​ ΣX (sum of distances) = 825 + 215 + 1070 + 550 + 480 + 920 = 4060
●​ Σy (sum of delivery times) = 3.5 + 1.5 + 4 + 2 + 2.5 + 3 = 16.5
●​ ΣX² (sum of squared distances) = 825² + 215² + 1070² + 550² + 480² + 920² = 680625 +
46225 + 1144900 + 302500 + 230400 + 846400 = 3250650
●​ Σy² (sum of squared delivery times) = 3.5² + 1.5² + 4² + 2² + 2.5² + 3² = 12.25 + 2.25 + 16
+ 4 + 6.25 + 9 = 49.75
●​ ΣXY (sum of the product of distances and delivery times) = (825 * 3.5) + (215 * 1.5) +
(1070 * 4) + (550 * 2) + (480 * 2.5) + (920 * 3) = 2887.5 + 322.5 + 4280 + 1100 + 1200 +
2760 = 12550
Number of data points (n) = 6
Now we can calculate the slope (b) and y-intercept (a):
●​ Slope (b): b = (nΣXY - ΣXΣy) / (nΣX² - (ΣX)²) b = (6 * 12550 - 4060 * 16.5) / (6 * 3250650
- 4060²) b = (75300 - 67000) / (19503900 - 16483600) b = 8300 / 3020300 b ≈ 0.00275
●​ Y-intercept (a): a = (Σy - bΣX) / n a = (16.5 - 0.00275 * 4060) / 6 a = (16.5 - 11.165) / 6 a
= 5.335 / 6 a ≈ 0.889
Therefore, the regression equation is: ŷ = 0.889 + 0.00275X
2. Compute the co-efficient of determination for the above data and comment on your
results (2 marks)
The coefficient of determination (R²) measures the proportion of the variance in the dependent
variable (delivery time) that is predictable from the independent variable (distance).
R² = (nΣXY - ΣXΣy)² / [(nΣX² - (ΣX)²)(nΣy² - (Σy)²)] R² = (6 * 12550 - 4060 * 16.5)² / [(6 *
3250650 - 4060²)(6 * 49.75 - 16.5²)] R² = (75300 - 67000)² / [(19503900 - 16483600)(298.5 -
272.25)] R² = (8300)² / [(3020300)(26.25)] R² = 68890000 / 79282875 R² ≈ 0.869
Comment: The coefficient of determination (R²) is approximately 0.869 or 86.9%. This indicates
that about 86.9% of the variation in delivery time can be explained by the variation in the
distance travelled. This suggests a strong positive linear relationship between distance and
delivery time.
3. Calculate the Co-efficient of correlation and comment on your results (2 marks)
The coefficient of correlation (r) measures the strength and direction of the linear relationship
between two variables. It is the square root of the coefficient of determination.
r = ±√R²
Since the slope (b) of the regression equation is positive, the correlation coefficient will also be
positive.
r = √0.869 r ≈ 0.932
Comment: The coefficient of correlation (r) is approximately 0.932. This value is close to +1,
indicating a strong positive linear correlation between the distance travelled and the delivery
time. As the distance increases, the delivery time tends to increase significantly.
4. Estimate the delivery time when the distance travelled is 1,500 miles (3 marks)
We can use the regression equation to estimate the delivery time for a distance of 1,500 miles:
ŷ = 0.889 + 0.00275 * 1500 ŷ = 0.889 + 4.125 ŷ ≈ 5.014 days
Therefore, the estimated delivery time when the distance travelled is 1,500 miles is
approximately 5.014 days.

B) Sample Size Determination


Here's the calculation for part B:
Determine the sample size for this estimation (5 marks)
To determine the required sample size for estimating a population proportion with a specified
margin of error and confidence level, we use the following formula:
n = (Zα/2)² * p * (1 - p) / E²
Where:
●​ n = required sample size
●​ Zα/2 = the z-score corresponding to the desired confidence level
●​ p = estimated proportion of crimes where firearms are used (from previous year's data)
●​ E = desired margin of error
Given:
●​ Margin of error (E) = 0.01
●​ Confidence level = 95%
●​ Previous year's proportion (p) = 0.65
First, find the Zα/2 value for a 95% confidence level. For a 95% confidence level, α = 1 - 0.95 =
0.05, and α/2 = 0.025. The z-score that leaves 0.025 in the upper tail of the standard normal
distribution is approximately 1.96.
Now, plug the values into the formula:
n = (1.96)² * 0.65 * (1 - 0.65) / (0.01)² n = (3.8416) * 0.65 * 0.35 / 0.0001 n = 3.8416 * 0.2275 /
0.0001 n = 0.87406 / 0.0001 n = 8740.6
Since the sample size must be a whole number, we round up to the nearest integer.
Therefore, the required sample size for this estimation is 8741.

You might also like