0% found this document useful (0 votes)

47 views6 pages

Tute - 04

A scatter diagram displays the relationship between two variables and is often combined with linear regression analysis. Regression analysis finds the line of best fit that approximates the relationship between the variables in a scatter plot. Simple regression uses one independent variable to explain a dependent variable, while multiple regression uses more than one independent variable. Examples of simple regression include analyzing the relationship between advertising spending and revenue or between drug dosage and blood pressure. Multiple regression can examine relationships like crop yield in relation to fertilizer and water amounts or a basketball player's points scored based on yoga and weightlifting sessions.

Uploaded by

Kamsha Nathan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views6 pages

Tute - 04

Uploaded by

Kamsha Nathan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Name: Kamshagini Nallainathan

CPM: 19601
MC: 94269

1. Describe how a scatter diagram and regression analysis is interrelated?

An incredibly basic statistical tool for displaying the relationship between two variables is a scatter
diagram. To fit a model between the two variables, it is frequently combined with a straightforward
linear regression line. Even though scatterplots can appear to be a mess, occasionally we can spot trends
in the data. The two graphs on the left, for instance, appear to be roughly following a line: the top graph
appears to follow a line with a positive slope, while the bottom graph appears to follow a line with a
negative slope. When we say that the data in a scatterplot appears to follow a trend, what we’re really
saying is that it appears to follow some line, or maybe some other kind of curve, like for example an
exponential curve or sinusoidal curve.

No matter the shape of the curve that the data follows, we call it the approximating curve, and the
process of finding the equation of the approximating curve is called curve fitting. Regression line: As
soon as we saw the plotted points, it was natural for us to begin searching for trends in the scatterplots.
In fact, when using scatterplots, we probably spend most of our time identifying trends. The plot by
itself isn't all that useful, but if we can use the plot to spot a pattern in the data, we might be able to use
that pattern to infer meaning from the data or make predictions about it.

A regression line will be used most of the time to accomplish this. It is the line in a scatterplot that most
effectively depicts the trend in the data. The terms best-fit line, line of best fit, and least-squares all refer
to regression lines. T The regression line is a trend line that we use to simulate a linear trend that we see
in a scatterplot, but we must be aware that not all data will exhibit a linear relationship. The regression
curve would be parabolic, for instance, if the relationship resembled the parabola's curve. We'll
primarily concentrate on linear regression for the remainder of this lesson.

2. Discuss 4 practical situations where regression analysis may be used in business decision-making.

One of the most popular statistical methods is linear regression. The relationship between one or more
predictor variables and a response variable is quantified using this method. Simple linear regression, the
most fundamental type of linear regression, is employed to quantify the relationship between a single
predictor variable and a single response variable. The relationship between multiple predictor variables
and a response variable can be quantified using multiple linear regression if we have more than one
predictor variable.

four different examples of when linear regression is used in real life.

 To comprehend the relationship between advertising expenditure and revenue, businesses

frequently use linear regression.
 revenue = β0 + β1(ad spending)

For instance, they might fit a simple linear regression model with revenue as the response
variable and advertising spending as the predictor variable. The regression model would look
like this: When there are no advertisements, the coefficient β0 would represent the total
expected revenue. When ad spending is increased by one unit, the coefficient β1 would
represent the typical change in total revenue (e.g., one dollar). If β1 is negative, then more
advertising expenditures will result in lower revenue. If β1 is nearly zero, advertising
expenditures have little impact on revenue. And if β1 is positive, it would imply that higher
advertising expenditures are linked to higher revenue. Depending on the value of β1, a company
may decide to either decrease or increase their ad spending.

 Medical researchers often use linear regression to understand the relationship between drug dosage
and the blood pressure of patients.
 blood pressure = β0 + β1(dosage)

For instance, researchers may give patients different dosages of a specific medication and track
how their blood pressure changes. They might use dosage as the predictor variable and blood
pressure as the response variable in a simple linear regression model. The regression model
would look like this: When the dosage is zero, the coefficient β0 would represent the anticipated
blood pressure. The average change in blood pressure when the dosage is increased by one unit
would be represented by the coefficient β1. If β1 is negative, it indicates that a dosage increase
is linked to a drop in blood pressure. A dosage increase is not associated with a change in blood
pressure if β1 is close to zero. If β1 is positive, it would mean that an increase in dosage is
associated with an increase in blood pressure. Depending on the value of β1, researchers may
decide to change the dosage given to a patient.

 Agricultural scientists often use linear regression to measure the effect of fertilizer and water on
crop yields.
 crop yield = β0 + β1(amount of fertilizer) + β2(amount of water)

For instance, researchers may vary the water and fertilizer applications in various fields to
observe the effects on crop yield. They might fit a multiple linear regression model with crop
yield as the response variable and fertilizer and water as the predictor variables. The expected
crop yield in the absence of fertilizer or water would be represented by the coefficient β0 in the
regression model. If the amount of water doesn't change, the coefficient β1 would represent the
typical change in crop yield when fertilizer is increased by one unit. The coefficient β2 would
represent the average change in crop yield when water is increased by one unit, assuming the
amount of fertilizer remains unchanged. Depending on the values of β1 and β2, the scientists
may change the amount of fertilizer and water used to maximize the crop yield.

 Data scientists for professional sports teams often use linear regression to measure the effect that
different training regimens have on player performance.
 points scored = β0 + β1(yoga sessions) + β2(weightlifting sessions)

For instance, data scientists in the NBA may examine how various frequencies of yoga and
weightlifting sessions each week affect a player's point total. With yoga and weightlifting
sessions as the predictor variables and total points earned as the response variable, they could
fit a multiple linear regression model. The regression model would look like this: The coefficient
0 would represent the predicted number of points for a player who does neither yoga nor
weightlifting. If the number of weekly weightlifting sessions stays the same, the coefficient of 1
would represent the average change in points earned when weekly yoga sessions are increased
by one. The coefficient β2 represents the average change in points scored when weekly
weightlifting sessions are increased by one while weekly yoga sessions remain constant.
Depending on the values of β1 and β2, data scientists may advise a player to engage in weekly
yoga and weightlifting sessions to maximize the points scored.

3. What is the difference between a simple regression analysis and a multiple regression analysis,
give examples for the two in a functional form (Describe your independent and dependent
variables clearly?

simple regression analysis Multiple regression analysis

attempts to explain a dependent variable using attempts to use more than one independent
only one independent variable. variable to explain a dependent variable.
Use to identify simple connections between data Use to identify complex connections between
data
Simple regression assumes there is a strong Multiple regression assumes that there is no
relationship between an independent variable strong relationship between each independent
and a dependent variable. variable. It also assumes that there is a
correlation between each independent variable
and the single dependent variable.
It is not a more specific calculation and not better It is a more specific calculation than simple linear
than multiple regression analysis. regression. So, it is often better.
Multiple regression is a narrow class of Multiple regression is a type of broader class
regressions encompassing only linear regressions of regression that includes both linear and
with one variable. nonlinear regressions with multiple explanatory
variables.

E: g

Simple regression analysis

y = bx+ a

Where,

y is a dependent variable we need and to find, x is an independent variable. The constants “a” and
“b” drives the equation. But according to our definition, as the multiple regression takes several
independent variables (x), so for the equation we will have multiple x values too.

If we consider the performance of students decreases only by not having additional technology
support to attend online classes. Then we need to find the relationship between only 2 variables.

P = decreases in student’s performance (dependent variable)

T = not having additional technology support (independent variable)

Interpretation of Function = bx+ ay

b*t +a =bt+a
Multiple regression analysis

A researcher decides to study students’ performance in a university over a period. He observed

that as the lectures proceed to operate online, the performance of students started to decline
as well. The parameters for the dependent variable "decrease in performance" are various
independent variables such as "lack of attention, increased internet addiction, neglecting
studies," and many more.

p= decrease in performance

x1= lack of attention

x2= more internet addiction

x3= neglecting studies

Interpretation of function

y = bx+ a

p= (b* x1) + (b* x2) + (b* x3) + a

4. Regression analysis is the most useful and common method of demand estimation. Do you
agree? Explain.

Yes, I do agree.

In general, there are many ways to estimate the demand of a firm such as consumer surveys,

consumer clinics, market experiments, virtual marketing, and so on.

However, researchers have preferred the regression technique and even complimented the

other techniques mentioned above with regression, in many instances due to their advantages

such as:

 Objective nature: the results from a regression will not differ (drastically) from one
researcher to another and will provide consistent analysis of data.
 Provides more complete information; based on economic theories, regression
analysis can help determine cause-and-effect relations and even prove or disprove
certain prevailing theoretical notions.
 Less costly; conducting regression is less costly and can be done relatively quickly
using basic statistical software.
5. Explain the rationale behind deriving the regression line using the ordinary least squares (OLS)
method?

While the scatter plot provides coordinates for dependent-independent variable combinations,
there may be observations of multiple coordinates that deviate from the average (mean) value of
coordinates. This difference is referred to as the vertical deviation or the error of the observed data
from the calculated regression line.

The basic goal of the ordinary least squares approach is to keep this deviation or error (denoted by e
and in some sources as u) to a minimum.
Since the sum of deviations equals zero (i.e.:∑𝑛 𝑒𝑡 = 0), w e o v e r c o m e t h i s b y t a k i n g
𝑡=1

S q u a r e d d e v i a ti o n s a n d a i m e d t o m i n i m i z e t h e m u s i n g d i ff e r e n ti a ti o n .

6. Fill in the blanks of the following equation and briefly explain what each term represents

Yt = The dependent/endogenous variable

a = Intercept of the function

b = Slope of the function

Xt = The independent variable/exogenous

et = The random error term

7. What are the assumptions of regression analysis?

The assumptions are critical in regression analysis to get unbiased estimates of the slope
coefficient and to be able to verify the significance of the estimates using probability theory.
The assumptions are:

1. The dependent (response) variable and the independent (predictor) variable should have
a linear and additive relationship (s). A linear connection implies that a change in reaction Y
caused by a one-unit change in X1 is constant, independent of X1. An additive connection
implies that X1's influence on Y is unaffected by other factors.

2. No correlation should exist between the residual (error) terms. Autocorrelation is the
absence of this phenomenon.

3. There should be no correlation between the independent variables. The absence of this
phenomenon is referred to as multicollinearity.

4. The variance of the error terms must be constant. This is referred to as homoscedasticity.
Heteroscedasticity is the existence of non-constant variation.

5. The error terms must have a normal distribution. Independence of the observations

8. Develop a hypothetical regression function to determine the sales revenue of a business and
interpret the equation developed (You may also incorporate the error term in your answer).

𝑌̂𝑡=10.4+6.25𝑋𝑡

Where,
Y = Sales Revenue

X = Advertising Expenditure

Interpret the equation.

Considering the given equation (assuming values in Rs. Million):

 When the spending on advertising is dropped to zero, the sales revenue is expected to be
Rs. 10.4 million.
 For each increase in advertising by Rs. 1 million, the sales revenue is expected to rise
by 6.25 million rupees.
 If the advertising expense values observed to obtain this equation are very distant from
Rs. 0, then the value of autonomous sales (Rs. 10.4 million), has no meaning as the
observed advertising expenses are not within, or is even far away from Rs. 0.

Assignment On Regression
100% (1)
Assignment On Regression
11 pages
ML Module3 Regression
No ratings yet
ML Module3 Regression
51 pages
BA3 4 5modules
No ratings yet
BA3 4 5modules
258 pages
Linear Regression
No ratings yet
Linear Regression
16 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
32 pages
Basics of Structural Equation Modeling
100% (2)
Basics of Structural Equation Modeling
328 pages
Business Decision Making II Simple Linear Regression: Dr. Nguyen Ngoc Phan
No ratings yet
Business Decision Making II Simple Linear Regression: Dr. Nguyen Ngoc Phan
69 pages
Module 2 Transcripts - v3
No ratings yet
Module 2 Transcripts - v3
103 pages
Linear Regression
No ratings yet
Linear Regression
3 pages
Lecture 13 BA
No ratings yet
Lecture 13 BA
36 pages
Simple Linear and Logistic Regression
No ratings yet
Simple Linear and Logistic Regression
81 pages
Statistics
No ratings yet
Statistics
1,130 pages
Chapter 3 - Multiple Linear Regressions
No ratings yet
Chapter 3 - Multiple Linear Regressions
30 pages
3-4 CLRM
No ratings yet
3-4 CLRM
87 pages
Module - 05 Statistical Computing and R Programming
No ratings yet
Module - 05 Statistical Computing and R Programming
53 pages
EVSC 445 Week 11
No ratings yet
EVSC 445 Week 11
40 pages
Chapter 14 Simple Linear Regression .
No ratings yet
Chapter 14 Simple Linear Regression .
39 pages
Lecture 8 Linear and Multiple Regression
No ratings yet
Lecture 8 Linear and Multiple Regression
55 pages
Simple Linear Regression Homework Solutions
100% (1)
Simple Linear Regression Homework Solutions
6 pages
(Mathe) Simple Linear Regression and Correlation
No ratings yet
(Mathe) Simple Linear Regression and Correlation
61 pages
Simple Regression and Simple Correlation: MA261 Statistical and Numerical Techniques March 24, 2022
No ratings yet
Simple Regression and Simple Correlation: MA261 Statistical and Numerical Techniques March 24, 2022
52 pages
Linear Regression. Com
No ratings yet
Linear Regression. Com
13 pages
UNIT II Regression
No ratings yet
UNIT II Regression
59 pages
PA
No ratings yet
PA
28 pages
Master of Business Administration (Mba)
100% (1)
Master of Business Administration (Mba)
56 pages
AI Lec23
No ratings yet
AI Lec23
36 pages
Chapter 1
No ratings yet
Chapter 1
24 pages
Module 3
No ratings yet
Module 3
34 pages
Regression
No ratings yet
Regression
14 pages
CH 4 - Correlation and Regression YARA&LAMA
No ratings yet
CH 4 - Correlation and Regression YARA&LAMA
27 pages
Presentation4 - Bivariate Analysis and Simple Linear Regression
No ratings yet
Presentation4 - Bivariate Analysis and Simple Linear Regression
31 pages
Demand Forecasting (For Students) - V6
No ratings yet
Demand Forecasting (For Students) - V6
75 pages
Linear Regression
No ratings yet
Linear Regression
22 pages
Course 10-Part 1
No ratings yet
Course 10-Part 1
32 pages
Data Analytics Regression Unit III
No ratings yet
Data Analytics Regression Unit III
27 pages
Mod 3C
No ratings yet
Mod 3C
36 pages
Corr - Regression Analysis
No ratings yet
Corr - Regression Analysis
19 pages
Day 2-Data Science
No ratings yet
Day 2-Data Science
16 pages
Regression Model and Its Applications
100% (1)
Regression Model and Its Applications
30 pages
Da Unit 3 R22
No ratings yet
Da Unit 3 R22
15 pages
DA-3rd Unit
No ratings yet
DA-3rd Unit
16 pages
Linear Regression
No ratings yet
Linear Regression
19 pages
Noakhali Science and Technology University
No ratings yet
Noakhali Science and Technology University
28 pages
DS Unit-Iv
No ratings yet
DS Unit-Iv
34 pages
Internship Report AIML
No ratings yet
Internship Report AIML
40 pages
REGRESSION and CORRELATION ANALYSIS STA 106 - DR. BASHIRU
No ratings yet
REGRESSION and CORRELATION ANALYSIS STA 106 - DR. BASHIRU
10 pages
Econometrics
No ratings yet
Econometrics
18 pages
Bsacore1 M5 Wed
No ratings yet
Bsacore1 M5 Wed
4 pages
Hanan
No ratings yet
Hanan
9 pages
5 - Part II - Regression Analysis W-Notes
No ratings yet
5 - Part II - Regression Analysis W-Notes
10 pages
Session 15 Regression and Correlation
No ratings yet
Session 15 Regression and Correlation
66 pages
Regression and Introduction To Bayesian Network
No ratings yet
Regression and Introduction To Bayesian Network
12 pages
Regression
No ratings yet
Regression
25 pages
Simple and Multiple Linear Regression
No ratings yet
Simple and Multiple Linear Regression
6 pages
Unit5 R
No ratings yet
Unit5 R
5 pages
ML Exp1 C36
No ratings yet
ML Exp1 C36
13 pages
Regression Analysis 1 2020
No ratings yet
Regression Analysis 1 2020
40 pages
SimpleMultipleLinearRegression FoundationalMathofAI S24
No ratings yet
SimpleMultipleLinearRegression FoundationalMathofAI S24
6 pages
2024 Chapter 1
No ratings yet
2024 Chapter 1
8 pages
Correlation and Regression Analyses
No ratings yet
Correlation and Regression Analyses
8 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
11 pages
Module 11 Unit 2 Simple Linear Regression
No ratings yet
Module 11 Unit 2 Simple Linear Regression
12 pages
Journal
No ratings yet
Journal
12 pages
Solution Manual Elements of Chemical Reaction Engineering 4th Edition WWW - Elsolucionario.org 573 680
No ratings yet
Solution Manual Elements of Chemical Reaction Engineering 4th Edition WWW - Elsolucionario.org 573 680
108 pages
Regression Analysis Linear and Multiple Regression
No ratings yet
Regression Analysis Linear and Multiple Regression
6 pages
Linear Regression
No ratings yet
Linear Regression
10 pages
Tay Thesis 2015 PDF
No ratings yet
Tay Thesis 2015 PDF
253 pages
Factors Affecting The Saving Behaviour of Taj International College Students
No ratings yet
Factors Affecting The Saving Behaviour of Taj International College Students
16 pages
Parametric Models For Regression (Graded)
100% (2)
Parametric Models For Regression (Graded)
6 pages
Mini Proj
No ratings yet
Mini Proj
58 pages
Pareto Analysis Technique
No ratings yet
Pareto Analysis Technique
15 pages
Syba (Applied Statistics General) - 06.072020
No ratings yet
Syba (Applied Statistics General) - 06.072020
5 pages
Solutions To Ch12 Blanchard
No ratings yet
Solutions To Ch12 Blanchard
11 pages
Apps Rating Prediction
No ratings yet
Apps Rating Prediction
51 pages
ML Usar Manual-2
No ratings yet
ML Usar Manual-2
21 pages
Vanshdeep Singh Madan Resume - v3
No ratings yet
Vanshdeep Singh Madan Resume - v3
2 pages
Binary Logistic Regression and Its Application
No ratings yet
Binary Logistic Regression and Its Application
8 pages
Revision Guideline and Solved Problems JAN2018
No ratings yet
Revision Guideline and Solved Problems JAN2018
24 pages
Ambient Turbulence Intensity Calculation For Al-Nasiriyah Province in Iraq
No ratings yet
Ambient Turbulence Intensity Calculation For Al-Nasiriyah Province in Iraq
11 pages
A Primer of Ecological Statistics 2nd Edition Full Text PDF
No ratings yet
A Primer of Ecological Statistics 2nd Edition Full Text PDF
16 pages
Khoshnood Et Al 2025 Immigrant Background and Rape Conviction A 21 Year Follow Up Study in Sweden
No ratings yet
Khoshnood Et Al 2025 Immigrant Background and Rape Conviction A 21 Year Follow Up Study in Sweden
20 pages
Production Planning and Control
No ratings yet
Production Planning and Control
44 pages
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)
X y S R: Name: - Period: - Row: - Date: - Homework 9
No ratings yet
X y S R: Name: - Period: - Row: - Date: - Homework 9
3 pages
Bootstrap Aggregating Multivariate Adaptive Regression Spline For Observational Studies in Diabetes Cases
No ratings yet
Bootstrap Aggregating Multivariate Adaptive Regression Spline For Observational Studies in Diabetes Cases
8 pages
Laporan Hasil Analisis Regresi Dan Uji Pendukungnya: A. Lampiran I. Output Eviews
No ratings yet
Laporan Hasil Analisis Regresi Dan Uji Pendukungnya: A. Lampiran I. Output Eviews
9 pages
College of Computer Studies: Vision: Mission: Program Objectives
No ratings yet
College of Computer Studies: Vision: Mission: Program Objectives
6 pages
1.14 Function Model Construction: Graph The Data and Choose The Regression That Best Fits The Data
No ratings yet
1.14 Function Model Construction: Graph The Data and Choose The Regression That Best Fits The Data
2 pages
Pyoderma Gangrenosum
No ratings yet
Pyoderma Gangrenosum
18 pages
Cars Sales (In 1,000 Units) Price (In Lakh Rupees) Mileage (KM/LTR) Top Speed (KM/HR)
No ratings yet
Cars Sales (In 1,000 Units) Price (In Lakh Rupees) Mileage (KM/LTR) Top Speed (KM/HR)
10 pages

Tute - 04

Uploaded by

Tute - 04

Uploaded by

Name: Kamshagini Nallainathan

1. Describe how a scatter diagram and regression analysis is interrelated?

four different examples of when linear regression is used in real life.

 To comprehend the relationship between advertising expenditure and revenue, businesses

simple regression analysis Multiple regression analysis

Simple regression analysis

P = decreases in student’s performance (dependent variable)

T = not having additional technology support (independent variable)

Interpretation of Function = bx+ ay

A researcher decides to study students’ performance in a university over a period. He observed

x1= lack of attention

x2= more internet addiction

x3= neglecting studies

p= (b* x1) + (b* x2) + (b* x3) + a

consumer clinics, market experiments, virtual marketing, and so on.

Yt = The dependent/endogenous variable

a = Intercept of the function

b = Slope of the function

Xt = The independent variable/exogenous

et = The random error term

7. What are the assumptions of regression analysis?

5. The error terms must have a normal distribution. Independence of the observations

Interpret the equation.

Considering the given equation (assuming values in Rs. Million):

You might also like