Chapter 4 Exercises
Chapter 4 Exercises
Section 4.1
Problem 21
The correlation coefficient is an appropriate summary for this scatter plot. This is because, the
Problem 23
This scatter plot portrays a non-linear relationship between the variables. As such, the correlation
Problem 31
Part a)
3.45
3.4
3.35
3.3
1.6 1.65 1.7 1.75 1.8 1.85 1.9 1.95 2 2.05
Price of Eggs
2
R =0.7133 … … ¿ the plot above
KELVEN GUERRERO 2
R=√ 0.7133
Part c)
In a month where the price of eggs is above the average, I would expect the price of milk for that
month to be above the average as well. This is because, there is a positive strong relationship
between the price of eggs and the price of milk, which is depicted by the correlation of 0.8446
Part d)
i) When the price of eggs rises, it causes the price of milk to rise.
Problem 33
Part a)
Weight vs Height
250
240
R² = 0.774914503288335
220
210
200
190
180
70 71 72 73 74 75 76 77 78
Height (inches)
KELVEN GUERRERO 3
2
R =0.7749 … … ¿ the plot above
R=√ 0.7749
Part c)
Since the height and weight of quarterbacks have a strong positive linear relationship with a
correlation of 0.8803, then I would expect a quarterback below average height to have a weight
Part d)
Problem 35
Part a)
KELVEN GUERRERO 4
82
80
78
76
74
72
74 76 78 80 82 84 86 88 90
2
R =0.66 … … ¿ the plot above
R=√ 0.66
Part c)
If the left foot patient is cooler than the average temperature, then I would expect the right foot
patient to be cooler than the average temperature. This is because, the right foot variable has a
strong positive linear relationship with the left foot variable with a correlation coefficient of
0.8124.
Part d)
i. Some patients have warmer feet than others. Those who have warmer feet generally
Section 4.2
Problem 21
Part a)
3.45
3.4
3.35
3.3
1.6 1.65 1.7 1.75 1.8 1.85 1.9 1.95 2 2.05
Price of Eggs
From the scatter plot, the least square regression line is;
y=0.3991 x +2.7592
Where x-variable represents the price of eggs, and y-variable represents the price of milk.
Part b)
Let the change in price of eggs be change in x, and change in price of milk per month be
represented by change in y.
Given;
∆ x=$ 0.25
KELVEN GUERRERO 6
Δy
Slope= =0.3991 … . … ¿ theregression line ( part a )
Δx
Then;
∆ y =0.0998 ≈ 0.10
Therefore, the price of milk for that month will differ by $0.10.
Part c)
Given;
y= ( 0.3991∗1.95 ) +2.7592
y=0.7782+ 2.7592
y=3.5374 ≈ 3.54
Problem 23
Part a)
KELVEN GUERRERO 7
Weight vs Height
250
240
R² = 0.774914503288335
220
210
200
190
180
70 71 72 73 74 75 76 77 78
Height (inches)
From the scatter plot, the least square regression line is;
y=4.9439 x−146.77
Where x-variable represent the height of quarterbacks, and y-variable to represent their weights.
Part b)
It is not possible to interpret the y-intercept. From the regression line equation above, the value
of y-intercept is -146.77. In this case, the y-variable represents the weight of the quarterbacks. As
Part c)
Let the change in heights of two quarterbacks be change in x, and change in weights of two
Given;
KELVEN GUERRERO 8
∆ x=2inches
Δy
Slope= =4.9439 … . … ¿ the regression line
Δx
Then;
∆ y =4.9439 × ∆ x=4.9439 × 2
∆ y =9.8878 ≈ 9.90
Therefore, the weights of the two quarterbacks will differ by 9.90 pounds.
Part d)
Let the heights and weights of the quarterbacks be represented by x and y respectively.
Given;
y= ( 4.9439∗74.5 )−146.77
y=368.3206−146.77
y=221.5506 ≈221.55
Problem 25
Part a)
KELVEN GUERRERO 9
Let the right foot temperature be represented by y-variable, and the left foot temperature be
represented by x-variable.
( x 1 , y 1 ) =( 76 , 81 ) , ( x 2 , y 2 ) =( 89 , 90 )
Δ y y 2− y 1
Slope=m= =
Δ x x 2−x 1
Δ y 90−81 9
m= = = =0.6923
Δ x 89−76 13
Then, we use any of the two data points with the slope (m) in the general equation to evaluate the
y=mx+c
y=0.6923 x +c
c=81−52.6154
c=28.3846
y=0.6923 x +28.3846
Part b)
KELVEN GUERRERO 10
82
80
78
76
74
72
74 76 78 80 82 84 86 88 90
Part c)
Let the change in left foot temperatures of two patients be the change in x, and change in right
Given;
∆ x=2℉
Δy
Slope= =0.593 … . … ¿ the regression line
Δx
Then;
∆ y =0.593 × ∆ x =0.593 ×2
∆ y =1.186 ≈ 1.19
Therefore, the temperatures of the two right feet patients would differ by 1.19 degree
Fahrenheit.
KELVEN GUERRERO 11
Problem 27
Part a)
Let the diastolic blood pressure be represented by y-variable, and the systolic blood pressure be
represented by x-variable.
Δ y y 2− y 1
Slope=m= =
Δ x x 2−x 1
Δy 94−71 23
m= = = =0.4894
Δ x 154−107 47
Then, we use any of the two data points with the slope (m) in the general equation to evaluate the
y=mx+c
y=0.4894 x+ c
c=94−75.3676
c=18.6324
Part b)
Yes, it is possible to interpret the y-intercept. From the regression line equation, the value of the
y-intercept is 18.6324, which indicates the systolic blood pressure value, when the diastolic
pressure is 0 mmHg.
Part c)
Let the change in systolic pressure of two patients be the change in x, and change in diastolic
Given;
∆ x=10 mmHg
Δy
Slope= =0.4894 … .… ¿ the regression line
Δx
Then;
∆ y =4.894 ≈ 4.89
Therefore, the diastolic pressure of the two right feet patients would differ by 4.89 mmHg.
Part d)
Given;
y= ( 0.4894∗125 ) +18.6324
y=61.175+18.6324
y=79.8074 ≈ 79.81
Problem 29
Part a)
y=a+bx
a=49.7124 , b=4.2886
y=49.7124+ 4.2886 x
Part b)
r =0.9186
Part c)
Given;
Exam score = y
KELVEN GUERRERO 14
y=49.7124+ ( 4.2886∗10 )
y=49.7124+ 42.886
y=92.5984
Therefore, the statistic exam score for the student would be 92.5984.
Problem 31
Part a)
In this problem, let the concentration of ozone be represented by variable y, and the
y=33.8127 +1.21015 x
Part b)
Given;
Concentration of Ozone = y
y=33.8127 + ( 1.21015∗21.4 )
y=33.8127 +25.8972
KELVEN GUERRERO 15
y=59.71
Section 4.3
Problem 17
Part a)
Scatterplot
70
50
40
30
20
10
0
0 10 20 30 40 50 60 70 80 90 100
y=19.373+ 0.5101 x
Part b)
Part c)
KELVEN GUERRERO 16
Scatterplot
70
50
40
30
20
10
0
0 10 20 30 40 50 60 70 80 90 100
y=20.91+ 0.523 x
Part d)
Yes, the outlier is influential. The least-squares regression line equation is different with the
outlier point from without the outlier point. The slope and the y-intercept increased slightly upon
Problem 25
Part a)
Let the number of mutants be represented by y-variable, and age be represented by x-variable.
( x 1 , y 1 ) =( 46 , 52 ) , ( x2 , y 2 )=( 61 , 148 )
Δ y y 2− y 1
Slope=m= =
Δ x x 2−x 1
Δ y 148−52 96
m= = = =6.4
Δ x 61−46 15
Then, we use any of the two data points with the slope (m) in the general equation to evaluate the
y=mx+c
y=6.4 x+ c
c=148−390.4
c=−242.4
y=6.4 x−242.4
Part b)
KELVEN GUERRERO 18
300
250
200
f(x) = 3.24988097731904 x − 76.2257526994344
150
R² = 0.1590954257454
100
50
0
40 45 50 55 60 65 70 75 80 85 90
Age
Part c)
Part d)
250
Number of Mutants
200
150
f(x) = 1.2282037845706 x + 31.863173216885
100 R² = 0.0370350247487646
50
0
40 45 50 55 60 65 70 75 80 85 90
Age
KELVEN GUERRERO 19
y=1.2282 x +31.863
Part e)
Yes, the outlier is highly influential. Without the outlier, the y-intercept becomes positive. Also,
Problem 29
Part a)
R² = 0.904261401567418
2000
1500
1000
500
0
0 100 200 300 400 500 600 700
Part b)
From the scatterplot (part a), the value coefficient of determination is R-squared.
2
R =0.9043∗1000=904.3
Part c)
KELVEN GUERRERO 20
300
200
Residuals
100
0
0 100 200 300 400 500 600 700
-100
-200
-300
X Variable 1
Part d)
False.