0% found this document useful (0 votes)
22 views13 pages

Statistics Classwork #3-2

This document contains examples and exercises on scatterplots, correlation, and linear regression. Scatterplots are used to examine the relationship between two variables, and the correlation coefficient measures the strength and direction of association between them. A correlation of +1 or -1 indicates a perfect positive or negative linear relationship, while a correlation of 0 indicates no linear relationship between the variables. Linear regression finds the line of best fit that minimizes the residuals or errors between predicted values from the line and actual data points. The slope of the regression line indicates how much on average the dependent variable changes per one-unit increase in the independent variable.

Uploaded by

dc.yinhelen
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views13 pages

Statistics Classwork #3-2

This document contains examples and exercises on scatterplots, correlation, and linear regression. Scatterplots are used to examine the relationship between two variables, and the correlation coefficient measures the strength and direction of association between them. A correlation of +1 or -1 indicates a perfect positive or negative linear relationship, while a correlation of 0 indicates no linear relationship between the variables. Linear regression finds the line of best fit that minimizes the residuals or errors between predicted values from the line and actual data points. The slope of the regression line indicates how much on average the dependent variable changes per one-unit increase in the independent variable.

Uploaded by

dc.yinhelen
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 13

Statistics class work # 3

SCATTERPLOT AND CORRELATION

1. Association:
2. Correlation:

no correlation for r = 0
r=1, perfect positive correlation.
r = .95, very strong positive correlation
r=.66, moderately strong positive correlation.
r =-1, perfect negative correlation
r = -.97, very strong negative correlation.
r=-0.49, negative, little bit strong(almost weak)
3. Match the following graphs with their corresponding correlation coefficient
value:
A.

B
C.

D.
Graph ___ has a correlation coefficient of 0.015 between the two variables.

Graph ___ has a correlation coefficient of -0.996 between the two variables.

Graph ___ has a correlation coefficient of 0.778 between the two variables.

Graph ___ has a correlation coefficient of -0.728 between the two variables.

[Notes: For association/correlation, x=independent variable= explanatory variable


y = dependent variable = response variable, y depends on x.]

4. Scores on Algebra and Calculus obtained by 10 students;

Algebra 17 21 11 16 15 11 24 27 19 8 : x= L1

Calculus 73 66 64 61 70 71 90 68 84 52 : y= L2

a. Draw the scatterplot on your calculator.

[2nd statplot,on, select the 1st picture. Graph or zoom # 9 or 6]

|-------------------------------------------------------------------------x

b. Describe the association(strength, form, direction)


c. Calculate the correlation coefficient. Describe the correlation coefficient

[Stat, calc, LinReg(ax+b) . If you cannot get r, 2nd 0(catalog)

Diagnostic on, enter twice ]

r = correlation efficient =

5. Weights of the trucks in motion and when they are not

moving(static):

Weight in motion (x) Static weight (y)

(in thousands of pounds) (in thousands of pounds)

26 27.9

29.9 29.1

39.5 38

31.6 30.3

36.2 34.5

25.1 27.8

31 29.6

35.6 33.1

40.2 35.5

25.1 27
a. Draw the scatterplot.

---------------------------------------------------------------------------

b. Describe the association.

Find the correlation coefficient.

Explain the correlation.

r=

6. The table below presents the percentage of students who

tested proficient in reading and the percentage who tested

proficient in math for each of the 10 most populous states in

the United States. Compute the least-squares regression line

(line of best fit) for predicting math proficiency from reading proficiency.

Percent Proficient in Reading Percent Proficient in Math

(x) (y)
60 59

73 78

75 70

66 68

75 70

79 77

79 76

73 66

67 64

71 73

a. Draw the scatterplot

b. Association: strong, positive,

c. Find the correlation coefficient: r = 0.8086

Very strong, positive correlation

d. Write the equation of the least square regression line(line of best fit)
e. Draw the regression line on the scatterplot.

Explain slope in context. Slope = a=

f. Explain y-intercept(when x = 0). b = y- intercept when x = 0

b=

g. Predict/estimate the math percent given that reading percent is 73

(x= 73)

yhat =

h. Hence calculate the residual for x = 73. Explain the residual.

Residual= y – yhat for x = 73.

[ Note: This result is prediction, it is extrapolation , it is not exact]

7. Below is a table showing the airfares of flights between


Atlanta and 12 US cities:
Atlanta to: Distance (miles) Fare($)

(x) (y)

Baltimore 568 219

Boston 933 222

Dallas 720 249

Denver 1190 308

Detroit 602 249

Kansas City 683 141

Las Vegas 1719 252

Miami 589 229

Memphis 327 183

Minneapolis 894 209

New Orleans 419 199

Seattle 2150 343

a. What is the explanatory variable and what is the


response variable?

x =explanatory variable=

y = response variable=

b. Draw the scatterplot and describe the association


between Distance and Fare.

c. Find the correlation coefficient and explain.


d. Write the equation of the line of regression. Draw the line

e. State the slope. Explain what it means in the context of the problem.

f. Explain what the y-intercept means in the context of the problem.

y-intercept =b for x=0 , distance = 0

g. Predict/estimate the fare for a 200 mile flight.

yhat for x = 200

h. Find the residual for a flight to New Orleans


Explain the residual.

Distance =x= 419, y=fare=199

yhat =

Residual = y-yhat =

8. Suppose we have the following dataset with the weight and height of seven
individuals:
x= weight, y= height
The line of best fit(linear regression line) is given by: yhat = ax+ b= b + ax
^
height = 32.783 + 0.2001*(weight)
a=
b=
a. Find the slope. Explain the slope

b. What is y- intercept? Explain it.

c. Predict the height for 155 lbs weight


x = 155, yhat=

d. Calculate residual for weight 155 lbs. Explain the residual

Residual = y – yhat

9. Third Exam vs Final Exam scores.


The least squares regression line (line of best-fit) for the third-exam/final-exam
has the equation: x= Third exam score, y= final exam score
^
Final= −173.51+ 4.83 * Third
a=¿
b=¿
a. Find the slope. Explain the slope.
Slope =

b. Predict the final score for third exam score 70.


x= 70, yhat =

c.Calculate residual when third exam score is 70. Explain the residual

You might also like