Graph Analysis 1 - Linear Regression
Graph Analysis 1 - Linear Regression
Q.1.
Ten Corvettes between 1 and 6 years old were randomly selected from the classified ads of
The Arizona Republic. The following data were obtained, where x denotes age, in years,
and y denotes price, in hundreds of dollars.
Solution:
A)
For x = 1, y = 291.602 – 27.9029(1) = 263.6991
For x = 3, y = 291.602 – 27.9029(3) = 207.8933
Plot the graph using above calculated points.
C) Describe the apparent relationship between age and price for Corvettes.
The price for Corvettes tends to decrease as they get older (as age increases).
E) Identify outliers.
There do not appear to be any outliers.
F) Use the regression equation to predict the price of a 2-year-old Corvette & a 3-year-old
Corvette.
For a 2-year-old Corvette, y = 291.602 – 27.9029(2) = 235.7962 or $23,579.62.
For a 3-year-old Corvette, y = 291.602 – 27.9029(3) = 207.8933 or $20,789.33.
G) What does the slope of the regression line represent in the context of the data?
The slope of the regression line represents the rate of change of the dependent variable
with respect to the independent variable. It indicates how much the dependent variable is
expected to change for a one-unit increase in the independent variable.
H) What does the y-intercept of the regression line represent in the context of the data?
The y-intercept of the regression line represents the value of the dependent variable when
the independent variable is zero. It is the predicted value of the dependent variable when
the independent variable has no effect.
Solution:
We can see that there is a positive relationship between the monthly e-commerce sales (Y)
and online advertising costs (X).
The positive correlation means that the values of the dependent variable (y) increase when
the values of the independent variable (x) rise.
So, if we want to predict the monthly e-commerce sales from the online advertising costs,
the higher the value of advertising costs, the higher our prediction of sales.
We will use the above data to build our Scatter diagram.
The Scatter plot shows how much one variable affects another. In our example, above
Scatter plot shows how much online advertising costs affect the monthly e-commerce sales.
It shows their correlation.
Linear regression aims to find the best-fitting straight line through the points. The best-fitting
line is known as the regression line.
If data points are closer when plotted to making a straight line, it means the correlation
between the two variables is higher. In our example, the relationship is strong.
The orange diagonal line in diagram 2 is the regression line and shows the predicted score
on e-commerce sales for each possible value of the online advertising costs.
The formula estimates that for each increase of 1 dollar in online advertising costs, the
expected monthly e-commerce sales are predicted to increase by $171.5.