0% found this document useful (0 votes)
74 views20 pages

Chapter 4 Exercises

The document contains problems analyzing scatter plots and calculating linear regressions. Problem 21 examines a plot of milk and egg prices, finding a linear relationship with a regression equation of y=0.3991x+2.7592. Problem 23 looks at a non-linear quarterback height and weight plot, calculating that a 2 inch height difference corresponds to a 9.9 pound weight difference. Problem 25 begins analyzing a linear foot temperature plot to calculate the slope from two given points.

Uploaded by

kjoel.ngugi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
74 views20 pages

Chapter 4 Exercises

The document contains problems analyzing scatter plots and calculating linear regressions. Problem 21 examines a plot of milk and egg prices, finding a linear relationship with a regression equation of y=0.3991x+2.7592. Problem 23 looks at a non-linear quarterback height and weight plot, calculating that a 2 inch height difference corresponds to a 9.9 pound weight difference. Problem 25 begins analyzing a linear foot temperature plot to calculate the slope from two given points.

Uploaded by

kjoel.ngugi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 20

KELVEN GUERRERO 1

Section 4.1

Problem 21

The correlation coefficient is an appropriate summary for this scatter plot. This is because, the

scatter plot portrays a linear relationship between the variables.

Problem 23

This scatter plot portrays a non-linear relationship between the variables. As such, the correlation

coefficient is not an appropriate summary for the plot.

Problem 31

Part a)

Price of Milk vs. Price of Eggs


3.6

3.55 f(x) = 0.399065279662295 x + 2.75921754862053


R² = 0.713251718613164
3.5
Price of Milk

3.45

3.4

3.35

3.3
1.6 1.65 1.7 1.75 1.8 1.85 1.9 1.95 2 2.05

Price of Eggs

Part b): Coefficient Correlation, R

2
R =0.7133 … … ¿ the plot above
KELVEN GUERRERO 2

R=√ 0.7133

Coefficient correlation , R=0.8446

Part c)

In a month where the price of eggs is above the average, I would expect the price of milk for that

month to be above the average as well. This is because, there is a positive strong relationship

between the price of eggs and the price of milk, which is depicted by the correlation of 0.8446

(in part b above).

Part d)

i) When the price of eggs rises, it causes the price of milk to rise.

Problem 33

Part a)

Weight vs Height
250

240

230 f(x) = 4.94392523364486 x − 146.766355140187


Weight (pounds)

R² = 0.774914503288335
220

210

200

190

180
70 71 72 73 74 75 76 77 78

Height (inches)
KELVEN GUERRERO 3

Part b) Coefficient Correlation, R

2
R =0.7749 … … ¿ the plot above

R=√ 0.7749

Coefficient correlation , R=0.8 803

Part c)

Since the height and weight of quarterbacks have a strong positive linear relationship with a

correlation of 0.8803, then I would expect a quarterback below average height to have a weight

that is below the average weight.

Part d)

i. If a quarterback gains weight, he will grow taller.

Problem 35

Part a)
KELVEN GUERRERO 4

Right Foot Temperature vs Left Foot Temperature


92
90
88
Right Foot Temperatue

86 f(x) = 0.592989985693848 x + 33.7753934191702


84 R² = 0.659998252146163

82
80
78
76
74
72
74 76 78 80 82 84 86 88 90

Left Foot Temperature

Part b) Coefficient Correlation, R

2
R =0.66 … … ¿ the plot above

R=√ 0.66

Coefficient correlation , R=0.8 124

Part c)

If the left foot patient is cooler than the average temperature, then I would expect the right foot

patient to be cooler than the average temperature. This is because, the right foot variable has a

strong positive linear relationship with the left foot variable with a correlation coefficient of

0.8124.

Part d)

i. Some patients have warmer feet than others. Those who have warmer feet generally

have left warmer right feet as well.


KELVEN GUERRERO 5

Section 4.2

Problem 21

Part a)

Price of Milk vs. Price of Eggs


3.6

3.55 f(x) = 0.399065279662295 x + 2.75921754862053


R² = 0.713251718613164
3.5
Price of Milk

3.45

3.4

3.35

3.3
1.6 1.65 1.7 1.75 1.8 1.85 1.9 1.95 2 2.05

Price of Eggs

From the scatter plot, the least square regression line is;

y=0.3991 x +2.7592

Where x-variable represents the price of eggs, and y-variable represents the price of milk.

Part b)

Let the change in price of eggs be change in x, and change in price of milk per month be

represented by change in y.

Given;

∆ x=$ 0.25
KELVEN GUERRERO 6
Δy
Slope= =0.3991 … . … ¿ theregression line ( part a )
Δx

Then;

∆ y =0.3991 ×∆ x=0.3991× 0.25

∆ y =0.0998 ≈ 0.10

Therefore, the price of milk for that month will differ by $0.10.

Part c)

Let the prices of eggs and milk be represented by x and y respectively.

Given;

The price of egg x = $1.95

Linear regression equation; y = 0.3991x + 2.7592

y= ( 0.3991∗1.95 ) +2.7592

y=0.7782+ 2.7592

y=3.5374 ≈ 3.54

Therefore, the price of milk that month would be $3.54.

Problem 23

Part a)
KELVEN GUERRERO 7

Weight vs Height
250

240

230 f(x) = 4.94392523364486 x − 146.766355140187


Weight (pounds)

R² = 0.774914503288335
220

210

200

190

180
70 71 72 73 74 75 76 77 78

Height (inches)

From the scatter plot, the least square regression line is;

y=4.9439 x−146.77

Where x-variable represent the height of quarterbacks, and y-variable to represent their weights.

Part b)

It is not possible to interpret the y-intercept. From the regression line equation above, the value

of y-intercept is -146.77. In this case, the y-variable represents the weight of the quarterbacks. As

such, it is mathematically impossible to have a negative weight, which makes is impossible to

interpret the y-intercept.

Part c)

Let the change in heights of two quarterbacks be change in x, and change in weights of two

quarterbacks be represented by change in y.

Given;
KELVEN GUERRERO 8
∆ x=2inches

Δy
Slope= =4.9439 … . … ¿ the regression line
Δx

Then;

∆ y =4.9439 × ∆ x=4.9439 × 2

∆ y =9.8878 ≈ 9.90

Therefore, the weights of the two quarterbacks will differ by 9.90 pounds.

Part d)

Let the heights and weights of the quarterbacks be represented by x and y respectively.

Given;

The height of a quarterback x = 74.5 inches

Linear regression equation; y = 4.9439x – 146.77

y= ( 4.9439∗74.5 )−146.77

y=368.3206−146.77

y=221.5506 ≈221.55

Therefore, the weight of the quarterback would be 221.55 pounds.

Problem 25

Part a)
KELVEN GUERRERO 9

Let the right foot temperature be represented by y-variable, and the left foot temperature be

represented by x-variable.

Two random points on the data;

( x 1 , y 1 ) =( 76 , 81 ) , ( x 2 , y 2 ) =( 89 , 90 )

General least-square regression equation for two variables;

y=mx+c , where m=slope

Δ y y 2− y 1
Slope=m= =
Δ x x 2−x 1

Δ y 90−81 9
m= = = =0.6923
Δ x 89−76 13

Then, we use any of the two data points with the slope (m) in the general equation to evaluate the

value of the y-intercept (c).

y=mx+c

y=0.6923 x +c

c= y−0.6923 x=81− ( 0.6923∗76 )

c=81−52.6154

c=28.3846

Therefore, the least-square regression line is;

y=0.6923 x +28.3846

Part b)
KELVEN GUERRERO 10

Right Foot Temperature vs Left Foot Temperature


92
90
88
Right Foot Temperatue

86 f(x) = 0.592989985693848 x + 33.7753934191702


84 R² = 0.659998252146163

82
80
78
76
74
72
74 76 78 80 82 84 86 88 90

Left Foot Temperature

Part c)

Let the change in left foot temperatures of two patients be the change in x, and change in right

foot temperatures of the two patients be represented by change in y.

Given;

∆ x=2℉

Δy
Slope= =0.593 … . … ¿ the regression line
Δx

Then;

∆ y =0.593 × ∆ x =0.593 ×2

∆ y =1.186 ≈ 1.19

Therefore, the temperatures of the two right feet patients would differ by 1.19 degree

Fahrenheit.
KELVEN GUERRERO 11

Problem 27

Part a)

Let the diastolic blood pressure be represented by y-variable, and the systolic blood pressure be

represented by x-variable.

Two random points on the data;

( x 1 , y 1 ) =( 107 ,71 ) , ( x 2 , y 2 ) =( 154 , 94 )

General least-square regression equation for two variables;

y=mx+c , where m=slope

Δ y y 2− y 1
Slope=m= =
Δ x x 2−x 1

Δy 94−71 23
m= = = =0.4894
Δ x 154−107 47

Then, we use any of the two data points with the slope (m) in the general equation to evaluate the

value of the y-intercept (c).

y=mx+c

y=0.4894 x+ c

c= y−0.6923 x=94−( 0.4894∗154 )

c=94−75.3676

c=18.6324

Therefore, the least-square regression line is;


KELVEN GUERRERO 12
y=0. 4894 x+ 18.6324

Part b)

Yes, it is possible to interpret the y-intercept. From the regression line equation, the value of the

y-intercept is 18.6324, which indicates the systolic blood pressure value, when the diastolic

pressure is 0 mmHg.

Part c)

Let the change in systolic pressure of two patients be the change in x, and change in diastolic

pressure of the two patients be represented by change in y.

Given;

∆ x=10 mmHg

Δy
Slope= =0.4894 … .… ¿ the regression line
Δx

Then;

∆ y =4894 × ∆ x=0.4894 ×10

∆ y =4.894 ≈ 4.89

Therefore, the diastolic pressure of the two right feet patients would differ by 4.89 mmHg.

Part d)

Let the systolic and diastolic pressures be represented by x and y respectively.

Given;

The systolic pressure, x = 125 mmHg


KELVEN GUERRERO 13

Linear regression equation; y = 0.4894x + 18.6324

y= ( 0.4894∗125 ) +18.6324

y=61.175+18.6324

y=79.8074 ≈ 79.81

Therefore, the diastolic pressure of the patient would be 79.81 mmHg.

Problem 29

Part a)

y=a+bx

a=49.7124 , b=4.2886

The least-squares regression line is;

y=49.7124+ 4.2886 x

Part b)

Correlation coefficient between the score and time = r

r =0.9186

Part c)

Given;

Exam score = y
KELVEN GUERRERO 14

Study time, x = 10 hours

Regression line equation; y=49.7124+ 4.2886 x

y=49.7124+ ( 4.2886∗10 )

y=49.7124+ 42.886

y=92.5984

Therefore, the statistic exam score for the student would be 92.5984.

Problem 31

Part a)

In this problem, let the concentration of ozone be represented by variable y, and the

concentration of NOx be represented by x-variable.

The regression line from the output is given as;

y=33.8127 +1.21015 x

Part b)

Given;

Concentration of Ozone = y

Concentration of NOx, x = 21.4

Regression line equation; y=33.8127 +1.21015 x

y=33.8127 + ( 1.21015∗21.4 )

y=33.8127 +25.8972
KELVEN GUERRERO 15
y=59.71

Therefore, the ozone concentration would be 59.71.

Section 4.3

Problem 17

Part a)

Scatterplot
70

f(x) = 0.523033869115959 x + 20.909730195178


60 R² = 0.916000449780447

50

40

30

20

10

0
0 10 20 30 40 50 60 70 80 90 100

From the scatterplot above, the least-squares regression line is;

y=19.373+ 0.5101 x

Part b)

The outlier point is (52, 33).

Part c)
KELVEN GUERRERO 16

Scatterplot
70

f(x) = 0.523033869115959 x + 20.909730195178


60 R² = 0.916000449780447

50

40

30

20

10

0
0 10 20 30 40 50 60 70 80 90 100

The least-squares regression line without the outlier point is;

y=20.91+ 0.523 x

Part d)

Yes, the outlier is influential. The least-squares regression line equation is different with the

outlier point from without the outlier point. The slope and the y-intercept increased slightly upon

the removal of the outlier point.

Problem 25

Part a)

Let the number of mutants be represented by y-variable, and age be represented by x-variable.

Two random points on the data;

( x 1 , y 1 ) =( 46 , 52 ) , ( x2 , y 2 )=( 61 , 148 )

General least-square regression equation for two variables;


KELVEN GUERRERO 17
y=mx+c , where m=slope

Δ y y 2− y 1
Slope=m= =
Δ x x 2−x 1

Δ y 148−52 96
m= = = =6.4
Δ x 61−46 15

Then, we use any of the two data points with the slope (m) in the general equation to evaluate the

value of the y-intercept (c).

y=mx+c

y=6.4 x+ c

c= y−6.4 x=148−( 6.4∗61 )

c=148−390.4

c=−242.4

Therefore, the least-square regression line is;

y=6.4 x−242.4

Part b)
KELVEN GUERRERO 18

Number of Mutants vs. Age


500
450
400
350
Number of Mutants

300
250
200
f(x) = 3.24988097731904 x − 76.2257526994344
150
R² = 0.1590954257454
100
50
0
40 45 50 55 60 65 70 75 80 85 90

Age

Part c)

The outlier point in the scatterplot is (82, 449).

Part d)

Number of Mutants vs. Age


300

250
Number of Mutants

200

150
f(x) = 1.2282037845706 x + 31.863173216885
100 R² = 0.0370350247487646

50

0
40 45 50 55 60 65 70 75 80 85 90

Age
KELVEN GUERRERO 19

Without the outlier, the least-squares regression line is;

y=1.2282 x +31.863

Part e)

Yes, the outlier is highly influential. Without the outlier, the y-intercept becomes positive. Also,

the both the slope, and y-intercept reduces by almost half.

Problem 29

Part a)

Rental Price vs. Sales Price


3000

2500 f(x) = 3.30087924072444 x + 757.291994159635


Average Rental Price ('000s)

R² = 0.904261401567418

2000

1500

1000

500

0
0 100 200 300 400 500 600 700

Average Sales Price ('000s)

Part b)

From the scatterplot (part a), the value coefficient of determination is R-squared.

2
R =0.9043∗1000=904.3

Part c)
KELVEN GUERRERO 20

X Variable 1 Residual Plot


400

300

200
Residuals

100

0
0 100 200 300 400 500 600 700
-100

-200

-300
X Variable 1

The relationship does not appear to be linear.

Part d)

False.

You might also like