0% found this document useful (0 votes)

17 views6 pages

Regression Analysis

Uploaded by

aditidocmoc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views6 pages

Regression Analysis

Uploaded by

aditidocmoc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Study Notes

Regression Analysis
Regression Analysis

Regression

 Regression is considered as analysis of dependence of dependent variable on the

independent variables with an objective to predict the average value of the
dependent variable given a specific value of the independent variable.
 It shows statistical relationship among variables, i.e.; it deals with variables that have
probability distributions (random or stochastic variables).
 It does not imply necessarily imply causation. The researcher specifies Y as
dependent variable based on his knowledge/existing theory. The analysis does not tell
you anything about the causation. And after estimation, by looking at the significance of
᷇ β you are saying that causal relationship is established.

Regression
Analysis of dependence of dependent variable on the independent variables with
an objective to predict the average value of the dependent variable

Types of Regression

Simple Regression Multiple Regression

Analysis is confined to 2 Analysis is confined to more
variables at a time than 2 variables at a time

Linear Regression Non-linear or Curvilinear Regression

Linear in parameters, the β’s; i.e.; the The regression equation will have the parameters with
parameters are raised to the first degree higher than 1, involving terms of the type β 2 , β 3,
power only. It may or may not be β 1 β 2 , β 1∨β 2 , etc.
linear in the explanatory variables,
the X’s.

Dependent Variable Independent Variable

 Effect  Cause
 Explained Variable  Explanatory Variable
 Predictand  Predictor
 Regressand  Regressor
 Response  Stimulus
 Endogenous  Exogenous
 Outcome  Covariate
 Controlled Variable  Control Variable

2
Regression Analysis

Historical Background

 Traditionally, regression meant tending towards average.

 The term was first introduced by Sir Francis Galton in the study of hereditary.
 In a population, if you take any child’s height; it will tend towards the population average.
In other words, taller parents had taller child and shorter parents had shorter child. It was
“regression to mediocrity”.
 Karl Pearson confirmed this in his study where he found, “average height of sons of tall
fathers was less than their father’s height and average height of sons of short fathers
was greater than their father’s height, thus “regressing” tall and short sons toward
average height of all men”.

How do you proceed?

 Consider two statements:

o S1: model generates data or
o S2 : data generates the model.
 Obviously, S1 is correct.
 It can be broadly thought that the model exists in nature but is unknown to the
experimenter.
 When some values to the explanatory variables are provided, then the values for the
output or study variable are generated accordingly, depending on the form of the
function f and the nature of the phenomenon.
 So ideally, the pre-existing model gives rise to the data. Our objective is to determine the
functional form of this model.
 Now we move in the backward direction. We propose to first collect the data on study
and explanatory variables. Then we employ some statistical techniques and use this
data to know the form of function f.
 Equivalently, the data from the model is recorded first and then used to determine the
parameters of the model.
 Thus, the literal meaning of regression analysis is to move in backward direction (used
to determine unknown parameters)

Example

 Suppose the yield of the crop (Y) depends linearly on two explanatory variables, viz., the
quantity of fertilizer ( X 1 ) and level of irrigation ( X 2 ) as

Y=bX +bX +
1 1 2 2

 There exist the true values of β 1 and β 2 in nature but are unknown to the experimenter.
 Some values on Y are recorded by providing different values to X 1 and X 2 . There exists
some relationship between Y and X 1 , X 2 which gives rise to a systematically behaved
data on Y, X 1 and X 2 . Such a relationship is unknown to the experimenter.

3
Regression Analysis

 To determine the model, we move in the backward direction in the sense that the
collected data is used to determine the unknown parameters β 1 and β 2of the model.
 In this sense, such an approach is termed as regression analysis.

Steps in Regression Analysis

 Statement of the problem under consideration

For example, the height and weight of children are related. Now there can be two issues
to be addressed.
(i) Determination of height for a given weight, or
(ii) Determination of weight for a given height.
 Choice of relevant variables
For example, in any agricultural experiment, the yield depends on explanatory variables
like quantity of fertilizer, rainfall, irrigation, temperature etc. These variables are denoted
by X 1 , X 2, X 3 , X 4, ….., X kas a set of k explanatory variables.
 Collection of data on relevant variables
For example, suppose we want to collect the data on age. For this, it is important to
know how to record the data on age. Then either the date of birth can be recorded which
will provide the exact age on any specific date or the age in terms of completed years as
on specific date can be recorded. Moreover, it is also important to decide whether the
data has to be collected on variables as quantitative variables or qualitative variables.
For example, if the ages (in years) are 15,17,19,21,23, then these are quantitative
values. If the ages are defined by a variable that takes value 1 if ages are less than 18
years and 0 if the ages are more than 18 years, then the earlier recorded data is
converted to 1,1,0,0,0.
 If the study variable is binary, then logistic and probit regressions etc. are used.
 If all explanatory variables are quantitative, then analysis of variance technique
is used.
 If some explanatory variables are qualitative and others are quantitative, then
analysis of covariance technique is used.
 Specification of model
Only the form of the tentative model can be ascertained, and it will depend on some
unknown parameters. For example, a general form will be like Y = f ( X , X ,..., X ; b , b
1 2 k 1 2

,..., b ) +  where  is the random error reflecting mainly the difference in the observed
k

value of Y and the value of Y obtained through the model. The form of f ( X , X ,..., X ; b ,
1 2 k 1

b ,..., b ) can be linear as well as non-linear depending on the form of parameters b , b

2 k 1 2

,..., b A model is said to be linear if it is linear in parameters.

 Choice of method for fitting the data

After the model has been defined, and the data have been collected, the next task is to
estimate the parameters of the model based on the collected data. This is also referred
to as parameter estimation or model fitting. The most commonly used method of
estimation is the least-squares method. Under certain assumptions, the least-squares
method produces estimators with desirable properties. The other estimation methods are

4
Regression Analysis

the maximum likelihood method (needs knowledge of distribution of Y), principle of least
squares, method of moments, ridge method, principal components method etc.
 Fitting of model
The estimation of unknown parameters using appropriate method provides the values of
the parameter. Substituting these values in the equation gives us a usable model. This is
termed as model fitting. Estimates of parameters b , b ,..., b in the model Y = f ( X , X
1 2 k. 1 2

,..., X ; b , b ,..., b ) +  are denoted by bˆ 1, bˆ2 ,..., bˆk which gives the fitted model as Y
k 1 2 k 1

= f ( X , X ,..., X ; bˆ 1, bˆ2 ,..., bˆk) . When the value of Y is obtained for the given values
1 2 k 1

X1 X 2,..., X k , it is denoted as Yˆ and called as fitted value.

 Model validation and criticism

The validation of the assumptions must be made before drawing any statistical
conclusion. Regression analysis is an iterative process where the outputs are used to
diagnose, validate, criticize and modify the inputs.

 Using the chosen model(s) for the solution of the posed problem.
 The determination of the explicit form of the regression equation is the ultimate
objective of regression analysis.
 To determine the role of any explanatory variable in the joint relationship in any
policy formulation,
 To forecast the values of the response variable for a given set of values of
explanatory variables.

Regression v/s Correlation

Regression Correlation
Purpose Predicts the average value of one Measures the direction and strength or
variable on the basis of fixed values of degree of linear association between the
other variables. two variables.
Usage There is an asymmetry in the way the Variables are treated symmetrically, i.e.;
dependent and explanatory variables are there is no difference between the
treated. dependent and explanatory variables.
The dependent variable is assumed to Both variables are assumed to be random.
be statistical, random or stochastic (i.e.,
to have a probability distribution). The
explanatory variables, are assumed to
have fixed values (in repeated sampling).

5
Regression Analysis

Coefficient Represented by b Represented by r

value Only one of the regression coefficients Can be between -1 to 1
can be greater than one.
Origin and Regression coefficients are independent Correlation coefficient is independent of
scale of change of origin but not of scale both change of origin and scale
Cause Can be used to establish cause effect Does not establish
and effect relationship

15 Types of Regression You Should Know
No ratings yet
15 Types of Regression You Should Know
30 pages
Regression and Analysis
No ratings yet
Regression and Analysis
132 pages
CHAPTER 2 Tesfaye Final - New Slide
No ratings yet
CHAPTER 2 Tesfaye Final - New Slide
159 pages
Advancedeconometricsl3!4!240128102442 58a0f1f1
No ratings yet
Advancedeconometricsl3!4!240128102442 58a0f1f1
58 pages
Lec1 ppt2019
No ratings yet
Lec1 ppt2019
23 pages
Mda-Session-7 Simple Linear Regression
No ratings yet
Mda-Session-7 Simple Linear Regression
75 pages
Mathematical Modeling Using Linear Regresion
No ratings yet
Mathematical Modeling Using Linear Regresion
52 pages
Bi - Variate Data Analysis - II Regression Analysis
No ratings yet
Bi - Variate Data Analysis - II Regression Analysis
37 pages
STAT22209 - Chapter 02-Regression Analyisis - 2022
No ratings yet
STAT22209 - Chapter 02-Regression Analyisis - 2022
41 pages
Regression Analysis
No ratings yet
Regression Analysis
10 pages
Introduction To Econometrics Chapt 1,2,3
No ratings yet
Introduction To Econometrics Chapt 1,2,3
41 pages
Reg 01
No ratings yet
Reg 01
17 pages
Econometrics I Handout
No ratings yet
Econometrics I Handout
41 pages
Regression Analysis-1 JJ
No ratings yet
Regression Analysis-1 JJ
17 pages
1 - Stat-701 Regression
No ratings yet
1 - Stat-701 Regression
18 pages
Regression Analysis
No ratings yet
Regression Analysis
54 pages
Unit 2
No ratings yet
Unit 2
76 pages
Regression
No ratings yet
Regression
11 pages
Regression Analysis
No ratings yet
Regression Analysis
11 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
70 pages
Unit 2 Regression
No ratings yet
Unit 2 Regression
31 pages
Regression Course For Second Year (Chap 1-3)
No ratings yet
Regression Course For Second Year (Chap 1-3)
59 pages
Econometrics Session
No ratings yet
Econometrics Session
43 pages
Aalysis
No ratings yet
Aalysis
16 pages
DISCRETE MATH Chapter-8
No ratings yet
DISCRETE MATH Chapter-8
34 pages
Regression PDF
No ratings yet
Regression PDF
16 pages
Intro Regression Modeling
No ratings yet
Intro Regression Modeling
11 pages
Chapter1 Regression Introduction
No ratings yet
Chapter1 Regression Introduction
8 pages
Chapter1 Regression Introduction
No ratings yet
Chapter1 Regression Introduction
8 pages
Chapter 5
No ratings yet
Chapter 5
14 pages
Regression Analysis
No ratings yet
Regression Analysis
4 pages
2020 Physics Is FUNdamental
No ratings yet
2020 Physics Is FUNdamental
119 pages
Chapter 3 - Classical Simple Linear Regression
No ratings yet
Chapter 3 - Classical Simple Linear Regression
52 pages
STAT630Slide Adv Data Analysis
No ratings yet
STAT630Slide Adv Data Analysis
238 pages
Chapter Two: Simple Linear Regression Model: 2.1 Introduction To Regression Analysis
No ratings yet
Chapter Two: Simple Linear Regression Model: 2.1 Introduction To Regression Analysis
7 pages
Regression Analysis: From Wikipedia, The Free Encyclopedia
No ratings yet
Regression Analysis: From Wikipedia, The Free Encyclopedia
10 pages
Regression Analysis
No ratings yet
Regression Analysis
9 pages
W6 - L4 - Simple Linear Regression
No ratings yet
W6 - L4 - Simple Linear Regression
4 pages
Multiple Regression Analysis
No ratings yet
Multiple Regression Analysis
14 pages
Linear Regression Analysis: Module - I
No ratings yet
Linear Regression Analysis: Module - I
13 pages
Regression Analysis
No ratings yet
Regression Analysis
10 pages
Regression Analysis Is
No ratings yet
Regression Analysis Is
16 pages
Econometrics 2
No ratings yet
Econometrics 2
27 pages
Untitled 472
No ratings yet
Untitled 472
13 pages
Regression Analysis
No ratings yet
Regression Analysis
65 pages
Regression Analysis - Wikipedia
No ratings yet
Regression Analysis - Wikipedia
10 pages
Econometrics Chapter Two
No ratings yet
Econometrics Chapter Two
92 pages
Regression Analysis
No ratings yet
Regression Analysis
41 pages
Engineering Research Models Evaluation Methods
No ratings yet
Engineering Research Models Evaluation Methods
4 pages
Lecture 6 Simple Linear Regression
No ratings yet
Lecture 6 Simple Linear Regression
36 pages
Regression Analysis: Mathematical Methods of Cognitive Science
100% (1)
Regression Analysis: Mathematical Methods of Cognitive Science
12 pages
Regression Analysis
No ratings yet
Regression Analysis
12 pages
Chapter1 Regression Introduction PDF
No ratings yet
Chapter1 Regression Introduction PDF
8 pages
325unit 1 Simple Regression Analysis
No ratings yet
325unit 1 Simple Regression Analysis
10 pages
Correlation and Linear Regression
No ratings yet
Correlation and Linear Regression
25 pages
Topic0 Introduction
No ratings yet
Topic0 Introduction
9 pages
Lecture #1
No ratings yet
Lecture #1
22 pages
Regression: 9.1.1 Definition
No ratings yet
Regression: 9.1.1 Definition
20 pages
BDA (18CS72) Module-5
No ratings yet
BDA (18CS72) Module-5
52 pages
Regression: by Vijeta Gupta Amity University
No ratings yet
Regression: by Vijeta Gupta Amity University
15 pages
Final Lesson Q1 W4 SLM
No ratings yet
Final Lesson Q1 W4 SLM
9 pages
QAM Chapter 4
No ratings yet
QAM Chapter 4
71 pages
Multiple Regression
No ratings yet
Multiple Regression
67 pages
Clustering and Risk Factor Analysis of Pulmonary Tuberculosis in A District in Ethiopia A Population-Based Cohort Study
No ratings yet
Clustering and Risk Factor Analysis of Pulmonary Tuberculosis in A District in Ethiopia A Population-Based Cohort Study
9 pages
Why Nurses Need Statistics
No ratings yet
Why Nurses Need Statistics
24 pages
Regression With A Binary Dependent Variable
No ratings yet
Regression With A Binary Dependent Variable
55 pages
Econometrics Chapter One
No ratings yet
Econometrics Chapter One
11 pages
2008 - Flow, Performance and Moderators of Challenge-Skill Balance - Stefan Engeser Falko Rheinberg
No ratings yet
2008 - Flow, Performance and Moderators of Challenge-Skill Balance - Stefan Engeser Falko Rheinberg
15 pages
Phy ATP (5054) Class 10
No ratings yet
Phy ATP (5054) Class 10
57 pages
1 s2.0 S2214509523007374 Main
No ratings yet
1 s2.0 S2214509523007374 Main
26 pages
Simple Linear Regression Model Ordinary Least Square (OLS) Method
No ratings yet
Simple Linear Regression Model Ordinary Least Square (OLS) Method
18 pages
Discriminant Research Paper
No ratings yet
Discriminant Research Paper
12 pages
Psychological Needs Predict Fanship and Fandom in Anime Fans
No ratings yet
Psychological Needs Predict Fanship and Fandom in Anime Fans
14 pages
Two Variables Regression Equation
No ratings yet
Two Variables Regression Equation
44 pages
Financial Machine Learning 1690482448
No ratings yet
Financial Machine Learning 1690482448
23 pages
The Urban Informal Sector As A Means of Livelihood Improvement Among Youth Evidence From Hawassa City Ethiopia
No ratings yet
The Urban Informal Sector As A Means of Livelihood Improvement Among Youth Evidence From Hawassa City Ethiopia
20 pages
Concept Bottleneck Models
No ratings yet
Concept Bottleneck Models
19 pages
FR15
No ratings yet
FR15
21 pages
The University of Auckland
No ratings yet
The University of Auckland
24 pages
CHAPTER 6lesson 4
No ratings yet
CHAPTER 6lesson 4
11 pages
Coyne 2017 Pow Boom Kablam Effects of Viewing
No ratings yet
Coyne 2017 Pow Boom Kablam Effects of Viewing
13 pages
NBE Key - Unit 2 - Data Literacy
No ratings yet
NBE Key - Unit 2 - Data Literacy
5 pages
Effect of Greenhouse Height
No ratings yet
Effect of Greenhouse Height
9 pages
Regression Analysis LAB - Session 17 - 1
No ratings yet
Regression Analysis LAB - Session 17 - 1
2 pages
Blood Transfusions in Severe Burn Patients
No ratings yet
Blood Transfusions in Severe Burn Patients
7 pages
1 Model Building and Application in Logistic Regression
No ratings yet
1 Model Building and Application in Logistic Regression
7 pages
Applied Probability Models with Optimization Applications
From Everand
Applied Probability Models with Optimization Applications
Sheldon M. Ross
2.5/5 (3)
Co-Clustering: Models, Algorithms and Applications
From Everand
Co-Clustering: Models, Algorithms and Applications
Gérard Govaert
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Exercises of Statistical Inference
From Everand
Exercises of Statistical Inference
Simone Malacrida
No ratings yet

Regression Analysis

Uploaded by

Regression Analysis

Uploaded by

Study Notes

 Regression is considered as analysis of dependence of dependent variable on the

Simple Regression Multiple Regression

Linear Regression Non-linear or Curvilinear Regression

Dependent Variable Independent Variable

 Traditionally, regression meant tending towards average.

How do you proceed?

 Consider two statements:

Steps in Regression Analysis

 Statement of the problem under consideration

b ,..., b ) can be linear as well as non-linear depending on the form of parameters b , b

,..., b A model is said to be linear if it is linear in parameters.

 Choice of method for fitting the data

X1 X 2,..., X k , it is denoted as Yˆ and called as fitted value.

 Model validation and criticism

Regression v/s Correlation

Coefficient Represented by b Represented by r

You might also like