Unit 2 Notes
Examining Relationships
In statistics, we often want to compare two (or
more) different populations with respect to the
same variable.
Examining Relationships

Often, however, we wish to examine relationships between several variables for the same population.

When we are interested in examining the relationship between two variables, we may find ourselves in one of two situations:

1) We may simply be interested in the nature of the relationship.

2) One of the variables may be thought to explain or predict the other.

Explanatory and Response Variables

In this second case, one of the variables is an explanatory variable (which we denote by X) and the other is a response variable (denoted by Y).

A response variable takes values representing the outcome of a study, while an explanatory variable helps explain this outcome.
Example

Does the caffeine in coffee really help keep you awake? Researchers interviewed 300 adults and asked them how many cups of coffee they drink on an average day, as well as how many hours of sleep they get at night.

The response variable Y is the hours of sleep, while the explanatory variable X is the number of cups of coffee per day.

Example

Are students who excel in English also good at math, or are most people strictly left- or right-brained? A psychology professor locates 450 students at a large university who have taken the same introductory English course and the same math course and compares their percentage grades in the two courses at the end of the semester.

In this case, there is no explanatory or response variable; we are simply interested in the nature of the relationship.
Scatterplots

The best way to display the relationship between two quantitative variables is with a scatterplot.

A scatterplot displays the values of two different quantitative variables measured on the same individuals. The data for each individual (for both variables) appear as a single point on the scatterplot. If there is an explanatory and a response variable, they should be plotted on the x- and y-axes, respectively. Otherwise, the choice of axes is arbitrary.

Example

Consider the relationship between the number of classes a student misses during the term and his or her final exam score. The table on the following page gives the values for both variables for a sample of eight students.
Example

Student   Classes Missed   Exam Score
1         5                60
2         2                95
3         6                73
4         10               56
5         1                81
6         8                45
7         4                82
8         2                78

Example

The scatterplot for these data is shown below:

[Scatterplot of Exam Score vs. Classes Missed]
Scatterplots

We look for four things when examining a scatterplot:

1) Direction
In this case, there is a negative association between the two variables. An above-average number of classes missed tends to be accompanied by a below-average exam score, and vice-versa. If the pattern of points slopes upward from left to right, we say there is a positive association.

Scatterplots

2) Form
A straight line would do a fairly good job approximating the relationship between the two variables. It is therefore reasonable to assume that these two variables share a linear relationship.
Scatterplots

3) Strength
The strength of the relationship is determined by how close the points lie to a simple form such as a straight line. In our example, if we draw a line which roughly approximates the relationship between the two variables, all points will fall quite close to the line. As such, the linear relationship is quite strong.

Scatterplots

3) Strength (cont'd)
Not all relationships are linear in form. They can be quadratic, logarithmic or exponential, to name a few. Sometimes the points appear to be "randomly scattered", in which case many of them will fall far from a line used to approximate the relationship. In this case, we say the linear relationship between the two variables is weak.
Scatterplots

4) Outliers
There are several types of outliers for bivariate data. An observation may be outlying in either the x- or y-directions (or both). Another type of outlier occurs when an observation simply falls outside the general pattern of points, even if it is not extreme in either the x- or y-directions. Some types of outliers have more of an impact on our analysis than others, as we will discuss shortly.

R Code

> missed <- c(5, 2, 6, 10, 1, 8, 4, 2)
> score <- c(60, 95, 73, 56, 81, 45, 82, 78)
> plot(missed, score)
Strength of Linear Relationship

The STAT 1000 and STAT 2000 percentage grades for a sample of students who have taken both courses are displayed in the scatterplot below:

[Scatterplot of STAT 2000 grades vs. STAT 1000 grades]

Strength of Linear Relationship

The scatterplot shows a moderately strong positive linear relationship. Does the relationship for the data in the following scatterplot appear stronger?

[Second scatterplot of STAT 2000 grades vs. STAT 1000 grades, with both axes running from 0 to 140]
Strength of Linear Relationship

It might, but these are the same data; the scatterplots are just constructed with different scales!

[The two scatterplots of STAT 2000 vs. STAT 1000 grades, side by side]

Strength of Linear Relationship

This example shows that our eyes are not the best tools to assess the strength of relationship between two quantitative variables.

Can we find a numerical measure that will give us a concrete description of the strength of a linear relationship between two quantitative variables?

The measure we use is called correlation.
Correlation Coefficient

The correlation coefficient r measures the direction and strength of a linear relationship between two quantitative variables.

Suppose the values of two quantitative variables X and Y have been measured for n individuals. Then

    r = (1/(n − 1)) Σ [(xi − x̄)/sx][(yi − ȳ)/sy] = Σ (xi − x̄)(yi − ȳ) / [(n − 1) sx sy]

(both sums running over i = 1, …, n).

Correlation

We will use the second version of the formula, as it is computationally simpler. To calculate the correlation r:

(i) Calculate x̄, ȳ, sx and sy
(ii) Calculate the deviations xi − x̄ and yi − ȳ
(iii) Multiply the corresponding deviations for x and y: (xi − x̄)(yi − ȳ)
(iv) Add the n products to get Σ (xi − x̄)(yi − ȳ)
(v) Divide this sum by (n − 1) sx sy
Correlation

For the Classes Missed and Exam Score example,

(i) x̄ = 4.75, ȳ = 71.25, sx = 3.1510, sy = 16.3511

xi    yi    (ii) xi − x̄   yi − ȳ    (iii) (xi − x̄)(yi − ȳ)
5     60         0.25      −11.25    −2.8125
2     95        −2.75       23.75    −65.3125
6     73         1.25        1.75     2.1875
10    56         5.25      −15.25    −80.0625
1     81        −3.75        9.75    −36.5625
8     45         3.25      −26.25    −85.3125
4     82        −0.75       10.75    −8.0625
2     78        −2.75        6.75    −18.5625
            sum = 0      sum = 0    (iv) sum = −294.5

(v) r = −294.5 / [(8 − 1)(3.1510)(16.3511)] = −0.8166

Correlation

Note that some software programs display only the value of r². If there is a positive association, r is the positive square root of r², and if there is a negative association, r is the negative square root of r².
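The five steps above can be checked numerically. The following is a sketch in Python (the notes themselves use R); the variable names are ours:

```python
from math import sqrt

x = [5, 2, 6, 10, 1, 8, 4, 2]         # classes missed
y = [60, 95, 73, 56, 81, 45, 82, 78]  # exam scores
n = len(x)

# (i) means and sample standard deviations
xbar, ybar = sum(x) / n, sum(y) / n
sx = sqrt(sum((v - xbar) ** 2 for v in x) / (n - 1))
sy = sqrt(sum((v - ybar) ** 2 for v in y) / (n - 1))

# (ii)-(iv) deviations, their products, and the sum of the products
sum_products = sum((a - xbar) * (b - ybar) for a, b in zip(x, y))

# (v) divide by (n - 1) * sx * sy
r = sum_products / ((n - 1) * sx * sy)
print(sum_products, round(r, 4))  # sum is -294.5; r rounds to -0.8166
```

The rounded result agrees with the R output `cor(missed, score)` shown on the next slide.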
R Code

> cor(missed, score)
[1] -0.8165786

Calculations in R will often differ slightly from our calculations, as R carries more decimal places.

Association vs. Causation

We must be careful when interpreting correlation. Despite the very strong negative correlation, we cannot conclude that missing more classes causes a student's grade to decrease.

There are many other variables that could help explain the strong relationship between Classes Missed and Exam Score. One such variable is the effort of a student.
Lurking Variable

Students who put more effort into the course generally miss fewer classes. We also know that exam scores tend to be higher for more dedicated students.

The effort of a student in this example is known as a lurking variable. A lurking variable is one that helps explain the relationship between variables in a study, but which is not itself included in the study.

Association vs. Causation

Regardless of the existence of identifiable lurking variables, we must remember that correlation measures only the linear association between two quantitative variables. It gives us no information about the causal nature of the relationship.

Association does not imply causation!
Correlation

Some properties of correlation:

Positive values of r indicate a positive association and negative values indicate a negative association.

r falls between −1 and 1, inclusive. Values of r close to −1 or 1 indicate a strong linear association (negative or positive, respectively). A correlation of −1 or 1 is obtained only in the case of a perfect linear relationship, i.e., when all points fall on a straight line. Values of r close to zero indicate a weak linear relationship.

Correlation

Some properties of correlation (cont'd):

r has no units (i.e., it is just a number).

The correlation makes no distinction between X and Y. As such, an explanatory and response variable are not necessary.

Changing the units of X and Y has no effect on the correlation, i.e., it doesn't matter if we measure a variable in pounds or kilograms, feet or metres, dollars or cents, etc.
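The unit-invariance property is easy to verify numerically. A small sketch in Python (the notes use R; the rescaling factors below are arbitrary):

```python
from math import sqrt

def corr(x, y):
    # sample correlation, as defined in the notes
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sx = sqrt(sum((v - xbar) ** 2 for v in x) / (n - 1))
    sy = sqrt(sum((v - ybar) ** 2 for v in y) / (n - 1))
    return sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) / ((n - 1) * sx * sy)

missed = [5, 2, 6, 10, 1, 8, 4, 2]
score = [60, 95, 73, 56, 81, 45, 82, 78]

r = corr(missed, score)
# change of units: measure "classes" as hours of lecture (3 hours per class)
# and the score as a proportion instead of a percentage; r does not change
r_rescaled = corr([3 * m for m in missed], [s / 100 for s in score])
print(round(r, 6), round(r_rescaled, 6))
```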
Correlation

Some properties of correlation (cont'd):

r measures only the strength of a linear relationship. In other cases, it is a useless measure.

Because the correlation is a function of several measures that are affected by outliers, r is itself strongly affected by outliers.

Linear Regression

When a relationship appears to be linear in nature, we often wish to estimate this relationship between variables with a single straight line.

A regression line is a straight line that describes how a response variable Y changes as an explanatory variable X changes. This line is often used to predict values of Y for given values of X.
Linear Regression

Note that with correlation, we didn't require a response variable and an explanatory variable. In regression, we always have an explanatory variable X and a response variable Y.

Given a value of X, we would like to predict the corresponding value of Y. Unless there is a perfect relationship, we won't know the exact value of Y, because Y is a variable.

Regression Line

We will use a sample to estimate the true relationship between the two variables. Our estimate of the "true line" is

    ŷ = b0 + b1x

ŷ is the predicted value of Y for a given value of X. b0 is the intercept of the line and b1 is the slope.

We will use this regression line to make our predictions.
Regression Line

We would like to find the line that fits our data the best. That is, we need to find the appropriate values of b0 and b1.

But there are infinitely many possible lines. Which one is the "best" line?

The line ŷ = b0 + b1x is called the least squares regression line, for obvious reasons.

Regression Line

The line we will use is the line that minimizes the sum of squared deviations in the vertical direction:

    Σ (yi − ŷi)²   (sum over i = 1, …, n)

[Scatterplot showing an observed value yi, its predicted value ŷi on the line, and the vertical deviation between them]
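The least squares criterion above can be illustrated numerically: the b0 and b1 given by the least squares formulas make the sum of squared vertical deviations at least as small as any other line's. A Python sketch using the Classes Missed / Exam Score data (the notes use R):

```python
missed = [5, 2, 6, 10, 1, 8, 4, 2]
score = [60, 95, 73, 56, 81, 45, 82, 78]
n = len(missed)
xbar, ybar = sum(missed) / n, sum(score) / n

# least squares slope and intercept
b1 = sum((x - xbar) * (y - ybar) for x, y in zip(missed, score)) / \
     sum((x - xbar) ** 2 for x in missed)
b0 = ybar - b1 * xbar

def ssq(a0, a1):
    # sum of squared vertical deviations from the line a0 + a1 * x
    return sum((y - (a0 + a1 * x)) ** 2 for x, y in zip(missed, score))

best = ssq(b0, b1)
# no nearby candidate line does better than the least squares line
for d0 in (-5.0, 0.0, 5.0):
    for d1 in (-0.5, 0.0, 0.5):
        assert ssq(b0 + d0, b1 + d1) >= best
print(round(b0, 2), round(b1, 2))
```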
Intercept

The intercept of the regression line, b0, is defined as the predicted value of y when x = 0.

[Scatterplot with the regression line extended back to x = 0]

Coefficient of Determination r²

Some variability in Y is accounted for by the fact that, as X changes, it pulls Y along with it. The remaining variation is accounted for by other factors (which we usually don't know).
Coefficient of Determination r²

If r = −1 or 1, then r² = 1. That is, we can predict Y exactly for any value of X, as regression on X accounts for all of the variation in Y.

If r = 0, then r² = 0, and so regression on X tells us nothing about the value of Y.

Otherwise, r² is between 0 and 1.

Example

Can the monthly rent for an apartment be predicted by the size of the apartment? The size X (in square feet) and the monthly rent Y (in $) are recorded for a sample of ten apartments in a large city. The data are shown below:

X   770   650   925   850   575   860   800   1000   730   900
Y   1270  990   2230  1295  860   1925  1575  1790   1580  1550
Example

The scatterplot for these data is shown below:

[Scatterplot of Rent vs. Size]

Example

We see a strong positive linear relationship between Size and Rent. From the data, we calculate

    x̄ = 806, ȳ = 1506.5, sx = 129.22, sy = 418.51 and r = 0.8031

And so

    b1 = r(sy/sx) = 0.8031(418.51/129.22) = 2.60
    b0 = ȳ − b1x̄ = 1506.5 − 2.60(806) = −589.10
Example

The equation of the least squares regression line is therefore

    ŷ = −589.10 + 2.60x

The line is shown on the scatterplot below:

[Scatterplot of Rent vs. Size with the least squares regression line]

R Code

> Size <- c(770, 650, 925, 850, 575, 860, 800, 1000, 730, 900)
> Rent <- c(1270, 990, 2230, 1295, 860, 1925, 1575, 1790, 1580, 1550)
> lm(Rent ~ Size)
R Code

> plot(Size, Rent)
> abline(lm(Rent ~ Size), col = "red")

Example

The slope b1 = 2.60 tells us that, when the size of an apartment increases by one square foot, we predict the monthly rent to increase by $2.60.

The intercept b0 = −589.10 is statistically meaningless in this case. An apartment cannot have a size of 0 square feet, and a negative rent is impossible.

We also see that r² = (0.8031)² = 0.645, which tells us that 64.5% of the variation in an apartment's monthly rent is accounted for by its regression on size.
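These values can be verified directly from the data. A Python sketch (the notes use R's lm; note that the notes round b1 to 2.60 before computing b0, so the unrounded intercept differs slightly from −589.10):

```python
size = [770, 650, 925, 850, 575, 860, 800, 1000, 730, 900]
rent = [1270, 990, 2230, 1295, 860, 1925, 1575, 1790, 1580, 1550]
n = len(size)
xbar, ybar = sum(size) / n, sum(rent) / n

sxy = sum((x - xbar) * (y - ybar) for x, y in zip(size, rent))
sxx = sum((x - xbar) ** 2 for x in size)
syy = sum((y - ybar) ** 2 for y in rent)

b1 = sxy / sxx                 # slope, approx. 2.60
b0 = ybar - b1 * xbar          # intercept, approx. -589.9 unrounded
r2 = sxy ** 2 / (sxx * syy)    # coefficient of determination, approx. 0.645
print(round(b1, 2), round(b0, 1), round(r2, 3))
```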
Example

We can now use this line to predict the monthly rent for an apartment of a given size. For example, for an 860 square foot apartment, the predicted monthly rent is

    ŷ = −589.10 + 2.60(860) = $1646.90

Predicted Value of Y

We call this the predicted value of Y when X = 860.
Residuals R Code
Note that there is an 860 square foot apartment in
the sample. How does the actual monthly rent for > lm(Rent ~ Size)$residuals
this apartment compare with the predicted rent?
U2-47 U2-48
Residuals Residuals
A positive residual indicates that an observation falls
residual = actual value of y – predicted value of y above the regression line and a negative residual indicates
that it falls below the line. As an example, check that the
residual for the 770 square foot apartment in the sample is
actual equal to –142.90.
Note that it is in fact the sum of squared residuals that
predicted
is minimized in calculating the least squares regression
line.
What if we want to predict the monthly rent for a 1250
square foot apartment? Our predicted value is
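The predictions and residuals quoted above can be checked with the rounded line ŷ = −589.10 + 2.60x. A Python sketch (R returns the residuals directly via lm(Rent ~ Size)$residuals):

```python
b0, b1 = -589.10, 2.60  # rounded coefficients from the notes

def predict(x):
    return b0 + b1 * x

pred_860 = predict(860)          # predicted rent for the 860 sq ft apartment
resid_860 = 1925 - pred_860      # residual = actual - predicted
resid_770 = 1270 - predict(770)  # the -142.90 quoted in the notes
pred_1250 = predict(1250)        # a prediction outside the range of the data
print(pred_860, resid_860, resid_770, pred_1250)
```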
Extrapolation

Mathematically, there is no problem with making this prediction. However, there is a statistical problem.

Our range of values for X is from 575 to 1000 square feet. We have good evidence of a linear relationship within this range of values. However, we have no apartments in our sample as large as 1250 square feet, and so we have no idea whether this relationship continues to hold outside our range of data.

The process of predicting a value of Y for a value of X outside our range of data is known as extrapolation, and should be avoided if at all possible.

Transformations

The values of some explanatory variable X and some response variable Y are measured on a sample of individuals. The data are shown below:

X   2    3    5    8    10   14   15   18   21
Y   88   234  67   228  841  1621 904  1017 2809

X   23   27   32   36   40   45
Y   2154 5327 4118 6715 9063 8664
Transformations

A scatterplot of the data is shown below:

[Scatterplot of Y vs. X]

Transformations

Let us examine instead the relationship between X and the transformed variable Y* = √Y.
Transformations

We fit the least squares regression line to the transformed data:

[Scatterplot of Y* = √Y vs. X with the fitted least squares line]

Transformations

Now suppose we want to predict the value of Y when X = 25. We first find the predicted value of Y* = √Y using the regression line for this transformed relationship:
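Since the fitted output for the transformed data was not preserved in these notes, here is a sketch of the whole procedure in Python; the coefficients it produces are computed from the data above, not quoted from the notes:

```python
from math import sqrt

X = [2, 3, 5, 8, 10, 14, 15, 18, 21, 23, 27, 32, 36, 40, 45]
Y = [88, 234, 67, 228, 841, 1621, 904, 1017, 2809,
     2154, 5327, 4118, 6715, 9063, 8664]
Ystar = [sqrt(v) for v in Y]  # transformed response Y* = sqrt(Y)

n = len(X)
xbar, ybar = sum(X) / n, sum(Ystar) / n
b1 = sum((x - xbar) * (y - ybar) for x, y in zip(X, Ystar)) / \
     sum((x - xbar) ** 2 for x in X)
b0 = ybar - b1 * xbar

ystar_hat = b0 + b1 * 25   # predicted sqrt(Y) at X = 25
y_hat = ystar_hat ** 2     # back-transform by squaring to predict Y itself
print(round(b0, 2), round(b1, 2), round(y_hat))
```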
[Scatterplot of Y vs. X for a data set illustrating several types of outliers]
Outliers

Point #2 is not an outlier in either the x- or y-directions, but falls outside the pattern of points.

[Scatterplot with Point #2 labelled, away from the pattern of the remaining points]

Outliers

A bivariate outlier such as this generally has little effect on the regression line.

[The same scatterplot with the fitted regression line]
Outliers

Point #3 is an outlier in the x-direction. It has a strong effect on the regression line.

[Scatterplot with Point #3 labelled, far to the right of the remaining points]

Influential Observations

An observation is called influential if removing it from the data set would dramatically alter the position of the regression line (and the value of r²).

In the above illustration, Point #3 is an influential observation, which is often the case for outliers in the x-direction.
Influential Observations

In our example, suppose the size of the largest apartment was 1500 square feet instead of 1000 square feet, and the monthly rent was still $1790. The equation of the regression line changes to

    ŷ = 705.6 + 0.936x

Influential Observations

We see that, with the outlier included, the regression line is a less accurate description of the relationship.

[Scatterplot of Rent vs. Size showing the LSR lines with the outlier included and excluded]
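The effect of moving the largest apartment to 1500 square feet can be computed directly. A Python sketch (coefficients computed from the altered data, not quoted from the notes):

```python
def fit(x, y):
    # least squares slope, intercept and r^2
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxy = sum((a - xbar) * (b - ybar) for a, b in zip(x, y))
    sxx = sum((a - xbar) ** 2 for a in x)
    syy = sum((b - ybar) ** 2 for b in y)
    b1 = sxy / sxx
    return b1, ybar - b1 * xbar, sxy ** 2 / (sxx * syy)

rent = [1270, 990, 2230, 1295, 860, 1925, 1575, 1790, 1580, 1550]
size = [770, 650, 925, 850, 575, 860, 800, 1000, 730, 900]
size_out = [770, 650, 925, 850, 575, 860, 800, 1500, 730, 900]  # 1000 -> 1500

b1, b0, r2 = fit(size, rent)
b1_out, b0_out, r2_out = fit(size_out, rent)
# the x-direction outlier flattens the slope and sharply reduces r^2
print(round(b1, 2), round(r2, 3), "->", round(b1_out, 2), round(r2_out, 3))
```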
Least Squares Regression

One property of the least squares regression line is that it always passes through the point (x̄, ȳ).

Consider our previous example for the regression of Rent vs. Size of an apartment. The mean size of the apartments in the sample was 806 square feet. The predicted monthly rent for an apartment of this size is

    ŷ = −589.10 + 2.60(806) = $1506.50

which is exactly equal to the mean monthly rent for the apartments in the sample.

Association vs. Causation

Recall our discussion of association vs. causation. The former does not imply the latter. In the apartment example, there was a strong positive relationship between the size of an apartment and its monthly rent. However, this doesn't mean an apartment being larger causes its monthly rent to be higher. This was an observational study, and so the observed relationship may be due to one or more lurking variables. For example, perhaps apartments in nicer, more expensive parts of the city are larger. Then the neighbourhood where an apartment is located might be a lurking variable.
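This property follows immediately from the definition b0 = ȳ − b1x̄, and is easy to confirm numerically. A Python sketch:

```python
size = [770, 650, 925, 850, 575, 860, 800, 1000, 730, 900]
rent = [1270, 990, 2230, 1295, 860, 1925, 1575, 1790, 1580, 1550]
n = len(size)
xbar, ybar = sum(size) / n, sum(rent) / n

b1 = sum((x - xbar) * (y - ybar) for x, y in zip(size, rent)) / \
     sum((x - xbar) ** 2 for x in size)
b0 = ybar - b1 * xbar   # this definition is exactly why the property holds

# the fitted value at x = xbar recovers ybar (up to rounding error)
print(b0 + b1 * xbar, ybar)
```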
Experiment vs. Observational Study

The best way to avoid lurking variables is to perform an experiment rather than an observational study.

In an experiment, the value of the explanatory variable is randomly "assigned" to the sample units, rather than being simply observed prior to the study.

For example, consider the issue of drug use among teenagers. Does marijuana use cause teenagers to try harder illegal drugs?

Experiment vs. Observational Study

A national drug study examined a sample of American cities. In these cities, the percentage X of teenagers who have tried marijuana, and the percentage Y of teenagers who have tried hard drugs, were recorded. The correlation between X and Y was calculated to be r = 0.85. But this doesn't mean that using marijuana causes teenagers to use hard drugs. (We don't even know if the teens using marijuana are the same ones who are using other drugs.) There are other possible lurking variables that we are not considering. One possible example of a lurking variable is the availability of drugs in different cities. Teenagers in cities where drugs are more easily available may be more likely to try them.
Experiment vs. Observational Study

If we really wanted to know if marijuana use among teens causes hard drug use, we would need to perform an experiment. We would have to get a large number of teenagers who have never tried marijuana or other drugs to volunteer to participate in the study. We would randomly assign half of the volunteers to start smoking marijuana, and the other half would continue not to use it. After two years, we could determine whether each volunteer subsequently used hard drugs. If we still see a strong positive association, we can then say that marijuana use does in fact cause hard drug use.

Experiment vs. Observational Study

The reason for this is that random assignment balances the two groups (those who use marijuana and those who don't) with respect to all possible lurking variables.

For example, some teenagers who live in cities where drugs are easily available will be assigned to use marijuana, while others won't. The same will be true for teenagers who live in cities where drugs are not easily available.
Experiment vs. Observational Study

This example provides a good illustration that it is not always possible to perform an experiment rather than an observational study. It is not realistic to expect to find a group of teenagers who have never tried marijuana who are willing to start using it.

Note however that this doesn't mean observational studies are "bad". We must just remember that association does not imply causation!

Categorical Variables on a Scatterplot

Sometimes a scatterplot may actually be displaying two or more distinct relationships.

For example, the Average Driving Distance X and the Average Score Y are recorded for a sample of professional golfers. (A "drive" is a golfer's first shot on a golf hole.)
Categorical Variables on a Scatterplot

The data are plotted on the scatterplot below. The relationship does not appear to be linear, but…

[Scatterplot of Average Score vs. Average Driving Distance]

Categorical Variables on a Scatterplot

This scatterplot is actually displaying two distinct linear relationships, one for male golfers and one for female golfers.
Categorical Variables on a Scatterplot
This example illustrates that we should be careful
when examining a relationship to make sure that
the data belong to only one population. In this
case, a separate regression line should be fit to the
data for the male and female golfers.
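Fitting a separate line to each group can be sketched as below. The golfer numbers here are hypothetical, invented for illustration (the notes' actual data are not reproduced):

```python
# HYPOTHETICAL per-group golfer data, invented for illustration only
drive_m = [295, 300, 305, 310, 315, 320]          # male golfers, yards
score_m = [71.0, 70.8, 70.5, 70.2, 70.0, 69.7]    # average score
drive_f = [245, 250, 255, 260, 265, 270]          # female golfers, yards
score_f = [72.5, 72.2, 72.0, 71.7, 71.5, 71.2]

def slope(x, y):
    # least squares slope
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    return sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) / \
           sum((a - xbar) ** 2 for a in x)

# a separate fit per group recovers a clear negative (better-score) slope
# within each tour, which a single pooled fit would blur together
print(round(slope(drive_m, score_m), 4), round(slope(drive_f, score_f), 4))
```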