0% found this document useful (0 votes)

23 views17 pages

Reg 01

Regression

Uploaded by

biniase669

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views17 pages

Reg 01

Regression

Uploaded by

biniase669

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Regression Analysis(Math4319)

Introduction

Instructor: Tatek Getachew(PhD)

Tatek () Math4319 1 / 17
Outline

1 Regression and Model Building

2 Data Collection

3 Uses of Regression

4 Role of the Computer

Tatek () Math4319 2 / 17
Regression and Model Building

Regression and Model Building

Regression analysis is a statistical technique for investigating and
modeling the relationship between variables.
Applications of regression are numerous and occur in almost every
field, including engineering, the physical and chemical sciences,
economics, management, life and biological sciences, and the social
sciences.
Regression analysis is used extensively in data mining and is a basic
tool of data science and analytics.
regression analysis may be the most widely used statistical technique.
it is used to answer questions such as
Does yield in quintal depend on amount of rainfall, temperature,
fertilizer use and number of times the cultivation is made?
Does change in cholesterol level depend on diet change, age, sex and
amount of exercise?
Does changing class size affect success of students?
Does students GPA affected by amount of time for studied, age, sex,
economic status of the parent, field of study, . . .
Tatek () Math4319 3 / 17
Eg. Suppose that an industrial engineer employed by a soft drink
beverage, The engineer visits 25 randomly chosen retail outlets having
vending machines, and the in - outlet delivery time (in minutes) and the
volume of product delivered (in cases)
If we let y represent delivery time and x represent delivery volume,
then the equation of a straight line relating these two variables is
y = β0 + β1 x
where β0 is the intercept and β1 is the slope. Now the data points do
not fall exactly on a straight line, so
We use Scatter Plot to display the relationship between two variables

Tatek () Math4319 4 / 17
the difference between the observed value of y and the straight line
(β0 + β1 x) be an error .
It is convenient to think of as a statistical error; that is, it is a
random variable that accounts for the failure of the model to fit the
data exactly.
The error may be made up of the effects of other variables on delivery
time, measurement errors, and so forth.
Thus, a more plausible model for the delivery time data is
y = β0 + β1 x +
The above Equation is called a linear regression model.
x is called the independent variable and y is called the dependent
variable.
we refer to x as the predictor or regressor variable and y as the
response variable.
The Equation involves only one regressor variable, it is called a
simple linear regression model.
Tatek () Math4319 5 / 17
We assume x is fixed, the random component on the right-hand side
of Eq. determines the properties of y.
Suppose that the mean and variance of are 0 and σ 2 , respectively.
Then the mean response at any value of the regressor variable is

E (y /x) = µy /x = E [β0 + β1 x + ] = β0 + β1 x

The variance of y given any value of x is

Var (y /x) = σy2/x = Var [β0 + β1 x + ] = σ 2

Thus, the true regression model µy /x = β0 + β1 x is a line of mean

values, that is, the height of the regression line at any value of x is
just the expected value of y for that x.
The slope, β1 can be interpreted as the change in the mean of y for a
unit change in x.
Furthermore, the variability of y at a particular value of x is
determined by the variance of the error component of the model, σ 2 .
Tatek () Math4319 6 / 17
This implies that there is a distribution of y values at each x and that
the variance of this distribution is the same at each x.
The variance σ 2 determines the amount of variability or noise in the
observations y on delivery time.
When σ 2 is small, the observed values of delivery time will fall close
to the line, and when σ 2 is large, the observed values of delivery time
may deviate considerably from the line.

Tatek () Math4319 7 / 17
These functional relationships are often based on physical, chemical,
or other engineering or scientific theory, that is, knowledge of the
underlying mechanism.
these types of models are often called mechanistic models.
Regression models, are thought of as empirical models.
Figure 1.3 illustrates a situation where the true relationship between y
and x is relatively complex, yet it may be approximated quite well by
a linear regression equation.
Sometimes the underlying mechanism is more complex, resulting in
the need for a more complex approximating function,
in Figure 1.4, where a ”piecewise linear” regression function is used to
approximate the true relationship between y and x.
Generally regression equations are valid only over the region of the
regressor variables contained in the observed data.

Tatek () Math4319 8 / 17
For example, consider Figure 1.5. Suppose that data on y and x were
collected in the interval x1 ≤ x ≤ x2 .
Over this interval the linear regression equation shown in Figure 1.5 is
a good approximation of the true relationship.
However, suppose this equation were used to predict values of y for
values of the regressor variable in the region x2 ≤ x ≤ x3 .
Clearly the linear regression model is not going to perform well over
this range of x because of model error or equation error.

Tatek () Math4319 9 / 17
In general, the response variable y may be related to k regressors,
x1 , x2 , . . . , xk , so that

y = β0 + β1 x1 + β2 x2 + · + βk xk +

This is called a multiple linear regression model because more than

one regressor is involved.

Tatek () Math4319 10 / 17
The adjective linear is employed to indicate that the model is linear in
the parameters β0 , β1 , . . . , βk , not because y is a linear function of
the x’s.
An important objective of regression analysis is to estimate the
unknown parameters in the regression model.
This process is also called fitting the model to the data.
We study several parameter estimation techniques in this book. One
of these techniques is the method of least squares (introduced in
Chapter 2 ).
The next phase of a regression analysis is called model adequacy
checking, in which the appropriateness of the model is studied and
the quality of the fit ascertained.
Through such analyses the usefulness of the regression model may be
determined.
The outcome of adequacy checking may indicate either that the
model is reasonable or that the original fit must be modified.
Thus, regression analysis is an iterative procedure, in which data lead
to a model and a fit of the model to the data is produced.
Tatek () Math4319 11 / 17
The quality of the fit is then investigated, leading either to
modification of the model or the fit or to adoption of the model.
A regression model does not imply a cause - and - effect relationship
between the variables.
Finally it is important to remember that regression analysis is part of
a broader data - analytic approach to problem solving.
That is, the regression equation itself may not be the primary
objective of the study.
It is usually more important to gain insight and understanding
concerning the system generating the data.

Tatek () Math4319 12 / 17
Data Collection

Data Collection

An essential aspect of regression analysis is data collection. Any

regression analysis is only as good as the data on which it is based.
Three basic methods for collecting data are as follows:
A retrospective study based on historical data
An observational study
A designed experiment
A good data collection scheme can ensure a simplified and a generally
more applicable model.
A poor data collection scheme can result in serious problems for the
analysis and its interpretation.

Tatek () Math4319 13 / 17
Data Collection

Retrospective Study:- use either all or a sample of the historical process

data over some period of time to determine the relationships among the
two variables
In general, their primary disadvantages are as follows:
Some of the relevant data often are missing.
The reliability and quality of the data are often highly questionable.
The nature of the data often may not allow us to address the problem
at hand.
The analyst often tries to use the data in ways they were never
intended to be used.
Logs, notebooks, and memories may not explain interesting phenomena
identified by the data analysis
Using historical data always involves the risk that, for whatever
reason, some of the data were not recorded or were lost.
These errors make historical data prone to outliers, or observations
that are very different from the bulk of the data.

Tatek () Math4319 14 / 17
Data Collection

Observational Study:- an observational study simply observes the

process or population. We interact or disturb the process only as
much as is required to obtain relevant data.
With proper planning, these studies can ensure accurate, complete,
and reliable data.
On the other hand, these studies often provide very limited
information about specific relationships among the data.
Designed Experiment:- The best data collection strategy for this
problem uses a designed experiment where we would manipulate the
response and which we would call the factors, according to a well -
defined strategy, called the experimental design.
The experimental design or plan consists of a series of runs.

Tatek () Math4319 15 / 17
Uses of Regression

Uses of Regression
Regression models are used for several purposes, including the
following:
1. Data description
2. Parameter estimation
3. Prediction and estimation
4. Control
Regression analysis is helpful in developing use equations to
summarize or describe a set of data.
regression model would probably be a much more convenient and
useful summary of those data than a table or even a graph.
Sometimes parameter estimation problems can be solved by regression
methods
Many applications of regression involve prediction of the response
variable.
For example, we may wish to predict delivery time for a specified
number of cases of soft drinks to be delivered.
Regression models may be used for control purposes.
Tatek () Math4319 16 / 17
Role of the Computer

Role of the Computer

Building a regression model is an iterative process.
The model - building process is illustrated in Figure below. It begins
by using any theoretical knowledge of the process that is being
studied and available data to specify an initial regression model.
A good regression computer program is a necessary tool in the model
- building process.
We must learn how to interpret what the computer is telling us and
how to incorporate that information in subsequent models.
Generally, regression computer programs are part of more general
statistics software packages, such as Minitab, SAS, JMP, and R.

Tatek () Math4319 17 / 17

SOURCE CODE Telecom
No ratings yet
SOURCE CODE Telecom
30 pages
(Riccardo Scarpa, Anna A. Alberini) Applications o
No ratings yet
(Riccardo Scarpa, Anna A. Alberini) Applications o
431 pages
Econometrics ch4
No ratings yet
Econometrics ch4
66 pages
Hasil Uji Validitas Ria 10 Responden
No ratings yet
Hasil Uji Validitas Ria 10 Responden
4 pages
Fundamentals of Epidemiology (EPID 610) Exercise 12 Screening Learning Objectives
100% (1)
Fundamentals of Epidemiology (EPID 610) Exercise 12 Screening Learning Objectives
4 pages
Time Value of Money
No ratings yet
Time Value of Money
8 pages
Unit 5 Assignment Problems: Structure
No ratings yet
Unit 5 Assignment Problems: Structure
34 pages
KTN Omitted Variables
No ratings yet
KTN Omitted Variables
6 pages
Economics 675 Syllabus Fall 09 J Smith
No ratings yet
Economics 675 Syllabus Fall 09 J Smith
9 pages
Rancangan Nested
No ratings yet
Rancangan Nested
23 pages
Econ 582 Forecasting: Eric Zivot
No ratings yet
Econ 582 Forecasting: Eric Zivot
20 pages
Mathematical Modeling Using Linear Regresion
No ratings yet
Mathematical Modeling Using Linear Regresion
52 pages
Lecture Notes in Empirical Macroeconomics (Miqef, MSC Course at Unisg)
No ratings yet
Lecture Notes in Empirical Macroeconomics (Miqef, MSC Course at Unisg)
58 pages
15 Types of Regression You Should Know
No ratings yet
15 Types of Regression You Should Know
30 pages
Introduction To Statistical Learning
No ratings yet
Introduction To Statistical Learning
16 pages
EffectSize - CBU Statistics Wiki
No ratings yet
EffectSize - CBU Statistics Wiki
3 pages
Levi's Lexicographical Presentation (IMS555)
No ratings yet
Levi's Lexicographical Presentation (IMS555)
6 pages
Reading 07-Correlation and Regression
No ratings yet
Reading 07-Correlation and Regression
18 pages
Module 2 Part 1 - Types of Forecasting Models and Simple Linear Regression
No ratings yet
Module 2 Part 1 - Types of Forecasting Models and Simple Linear Regression
71 pages
Regression PDF
No ratings yet
Regression PDF
16 pages
Tutorial2 Solution Jan21
No ratings yet
Tutorial2 Solution Jan21
5 pages
14 Statistics and Probability
No ratings yet
14 Statistics and Probability
37 pages
Chapter1 Regression Introduction
No ratings yet
Chapter1 Regression Introduction
8 pages
Randomized Block Design and Latin Square
No ratings yet
Randomized Block Design and Latin Square
2 pages
Analisis Pengaruh Marketing Mix Terhadap Keputusan Konsumen Dalam Membeli Produk Susu Milo Di Hypermarket
No ratings yet
Analisis Pengaruh Marketing Mix Terhadap Keputusan Konsumen Dalam Membeli Produk Susu Milo Di Hypermarket
10 pages
What Is Regression Analysis
No ratings yet
What Is Regression Analysis
4 pages
Regression Analysis
No ratings yet
Regression Analysis
65 pages
Forecasting Methods
No ratings yet
Forecasting Methods
20 pages
Lecture 4 MLR - 1
No ratings yet
Lecture 4 MLR - 1
30 pages
Statistics in Project Management
No ratings yet
Statistics in Project Management
1 page
Preview-9780429554490 A36645845
No ratings yet
Preview-9780429554490 A36645845
32 pages
Lecture9 Regression1 PDF
No ratings yet
Lecture9 Regression1 PDF
22 pages
Multiple Linear Regression 1
No ratings yet
Multiple Linear Regression 1
8 pages
003-Forecasting Techniques Detailed
No ratings yet
003-Forecasting Techniques Detailed
20 pages
Chapter1 Regression Introduction PDF
No ratings yet
Chapter1 Regression Introduction PDF
8 pages
Student Notes Madule 2
No ratings yet
Student Notes Madule 2
12 pages
Statistical Modeling
No ratings yet
Statistical Modeling
22 pages
Cs3491 - Aiml - Unit III - Probabilistic Discriminative Model
No ratings yet
Cs3491 - Aiml - Unit III - Probabilistic Discriminative Model
9 pages
CHAPTER 1 - INTRODUCTION - Introduction To Linear Regression Analysis, 5th Edition
No ratings yet
CHAPTER 1 - INTRODUCTION - Introduction To Linear Regression Analysis, 5th Edition
11 pages
CH 4 Decision Theory
No ratings yet
CH 4 Decision Theory
37 pages
Regression Analysis Is
No ratings yet
Regression Analysis Is
16 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
White Paper On Regression
No ratings yet
White Paper On Regression
14 pages
Chapter 3 - Classical Simple Linear Regression
No ratings yet
Chapter 3 - Classical Simple Linear Regression
52 pages
Regression Analysis: From Wikipedia, The Free Encyclopedia
No ratings yet
Regression Analysis: From Wikipedia, The Free Encyclopedia
10 pages
2023 Statistics Fin 10
No ratings yet
2023 Statistics Fin 10
14 pages
Chapter 6
No ratings yet
Chapter 6
58 pages
Ees 400 - Topic Three - Simple Regression
No ratings yet
Ees 400 - Topic Three - Simple Regression
36 pages
OLS Assumptions and Diagnostics
No ratings yet
OLS Assumptions and Diagnostics
18 pages
Econometrics Session
No ratings yet
Econometrics Session
43 pages
FM Textbook Solutions Chapter 1 Second Edition
No ratings yet
FM Textbook Solutions Chapter 1 Second Edition
6 pages
3 Regression Analysis
No ratings yet
3 Regression Analysis
6 pages
Unit 2
No ratings yet
Unit 2
76 pages
Regression Analysis
100% (2)
Regression Analysis
11 pages
Topic0 Introduction
No ratings yet
Topic0 Introduction
9 pages
Chapter 4 Demand Estimation
No ratings yet
Chapter 4 Demand Estimation
9 pages
Solution Manual For Elementary Statistics, 13th Edition Mario F. Triola
100% (1)
Solution Manual For Elementary Statistics, 13th Edition Mario F. Triola
36 pages
An Introduction To Regression Analysis
No ratings yet
An Introduction To Regression Analysis
7 pages
Linear Regression Analysis: Module - I
No ratings yet
Linear Regression Analysis: Module - I
13 pages
Unit-III (Data Analytics)
50% (2)
Unit-III (Data Analytics)
15 pages
Introduction To Econometrics Chapt 1,2,3
No ratings yet
Introduction To Econometrics Chapt 1,2,3
41 pages
Regression Course For Second Year (Chap 1-3)
No ratings yet
Regression Course For Second Year (Chap 1-3)
59 pages
Regression Techniques
No ratings yet
Regression Techniques
14 pages
Chapter 0
No ratings yet
Chapter 0
10 pages
Chapter 3
No ratings yet
Chapter 3
67 pages
Reg 02
No ratings yet
Reg 02
46 pages
JOY Das
No ratings yet
JOY Das
10 pages
Intro To Reg Models
No ratings yet
Intro To Reg Models
27 pages
Reg 04
No ratings yet
Reg 04
24 pages
Chapter 4
No ratings yet
Chapter 4
23 pages
Chapter 2
No ratings yet
Chapter 2
20 pages
Bini Proposal 4
No ratings yet
Bini Proposal 4
18 pages
Ida Unit-3
No ratings yet
Ida Unit-3
34 pages
1 - Introduction To Statistical Modelling
No ratings yet
1 - Introduction To Statistical Modelling
12 pages
Lse Ppa M4u3 Notes
No ratings yet
Lse Ppa M4u3 Notes
15 pages
Digital Competency Levels - Online Student Checklist
No ratings yet
Digital Competency Levels - Online Student Checklist
7 pages
Regression Analysis
No ratings yet
Regression Analysis
6 pages
Finals-Predictive-Time-Series-Analysis - Module
No ratings yet
Finals-Predictive-Time-Series-Analysis - Module
14 pages
Management Science Notes
No ratings yet
Management Science Notes
13 pages
Internship Supervision Report Format
No ratings yet
Internship Supervision Report Format
1 page
Quantitative Techniques AMC 301
No ratings yet
Quantitative Techniques AMC 301
20 pages
Unit III
No ratings yet
Unit III
18 pages
Bayesian Inference With INLA, 1st Edition Exclusive Download
100% (9)
Bayesian Inference With INLA, 1st Edition Exclusive Download
14 pages
Slides
No ratings yet
Slides
39 pages
Regression
No ratings yet
Regression
11 pages
Semester 2 - Actuarial Science
No ratings yet
Semester 2 - Actuarial Science
1 page
Data Analysis Coca Cola
No ratings yet
Data Analysis Coca Cola
7 pages
DA-3rd Unit
No ratings yet
DA-3rd Unit
16 pages
Regression Analysis
No ratings yet
Regression Analysis
10 pages
Chapter1 Regression Introduction
No ratings yet
Chapter1 Regression Introduction
8 pages
Demand Estimation
No ratings yet
Demand Estimation
4 pages
WEEK 4 St.
No ratings yet
WEEK 4 St.
7 pages
Ssdma Unit 2 Part1
No ratings yet
Ssdma Unit 2 Part1
20 pages
Unit III
No ratings yet
Unit III
13 pages
Topic 3 - Simple Regression Analysis
No ratings yet
Topic 3 - Simple Regression Analysis
37 pages
Assignment Group C
No ratings yet
Assignment Group C
8 pages
Da Unit 3 R22
No ratings yet
Da Unit 3 R22
15 pages
00 MMS Regression For Economics
No ratings yet
00 MMS Regression For Economics
24 pages
Unit 3 Da
No ratings yet
Unit 3 Da
20 pages
Mathematical Methods for Physicists and Engineers: Second Corrected Edition
From Everand
Mathematical Methods for Physicists and Engineers: Second Corrected Edition
Royal Eugene Collins
No ratings yet
Co-Clustering: Models, Algorithms and Applications
From Everand
Co-Clustering: Models, Algorithms and Applications
Gérard Govaert
No ratings yet

Reg 01

Uploaded by

Reg 01

Uploaded by

Regression Analysis(Math4319)

Instructor: Tatek Getachew(PhD)

1 Regression and Model Building

4 Role of the Computer

Regression and Model Building

The variance of y given any value of x is

Var (y /x) = σy2/x = Var [β0 + β1 x + ] = σ 2

Thus, the true regression model µy /x = β0 + β1 x is a line of mean

This is called a multiple linear regression model because more than

An essential aspect of regression analysis is data collection. Any

Retrospective Study:- use either all or a sample of the historical process

Observational Study:- an observational study simply observes the

Role of the Computer

You might also like

Var (y /x) = σy2/x = Var [β0 + β1 x + ] = σ 2