Running A Proper Regression Analysis: V G R Chandran Govindaraju, UITM

This document provides an overview of running a proper regression analysis. It discusses exploring the data, developing regression models, checking assumptions like linearity, normality, autocorrelation and heteroskedasticity. It also covers using dummy variables, interactions, and time series techniques like unit root tests and cointegration. Key steps outlined are exploring the data, developing one or more regression models, identifying the most suitable model, and making inferences. Recommended books on regression analysis and econometrics are also listed.

Uploaded by

Arafatul Alam Patwary


RUNNING A PROPER REGRESSION ANALYSIS

V G R CHANDRAN GOVINDARAJU UITM Email: [email protected] Website: www.vgrchandran.com/default.html

Topics
Running a proper regression analysis
First half of the day:
1. What is regression?
2. How to estimate? (Simple and Multiple Regression)
3. Checking the assumptions of regression

Second half of the day

1. Regression with dummy variables
2. Recap: Time Series Econometrics

Types of data
Cross-sectional
Time series
Panel data
Where to get the data: DOS and BNM
Let's download some data
Data transformation: level data, growth rates, index numbers, nominal to real values, exponential to linear models, etc.
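The transformations listed above can be sketched in a few lines. This is only an illustration: the function names and the small series are hypothetical, and the price index is assumed to have its base period set to 100.

```python
import math

def growth_rates(levels):
    """Period-on-period growth in percent: 100 * (x_t - x_{t-1}) / x_{t-1}."""
    return [100.0 * (cur - prev) / prev for prev, cur in zip(levels, levels[1:])]

def to_index(levels, base_position=0):
    """Index numbers with the chosen base period set to 100."""
    base = levels[base_position]
    return [100.0 * v / base for v in levels]

def nominal_to_real(nominal, price_index):
    """Deflate nominal values by a price index (assumed base period = 100)."""
    return [100.0 * n / p for n, p in zip(nominal, price_index)]

def log_levels(levels):
    """Natural logs turn an exponential trend y = A*exp(g*t) into a linear one."""
    return [math.log(v) for v in levels]

# Hypothetical nominal series and price index (illustrative numbers only).
nominal_gdp = [100.0, 110.0, 121.0]
cpi = [100.0, 105.0, 110.0]

print(growth_rates(nominal_gdp))
print(to_index(nominal_gdp))
print(nominal_to_real(nominal_gdp, cpi))
print(log_levels(nominal_gdp))
```

Taking logs is what is meant by moving from an exponential to a linear model: a constant growth rate shows up as a straight line in the logged series.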

What to do after obtaining your data?

START
1. EXPLORE DATA
2. DEVELOP ONE OR MORE REGRESSION MODELS
3. IS ONE OR MORE REG. MODELS SUITABLE FOR THE DATA?
   NO: REVISE THE MODEL / DEVELOP A NEW MODEL, THEN RETURN TO STEP 3
   YES: CONTINUE
4. IDENTIFY MOST SUITABLE MODEL
5. MAKE INFERENCES & REPORT

Explore the data

Data cleaning
Get a feel for your data:
  Descriptive statistics
  Correlations and plots

What is regression?
Relationship between two variables (simple) or more than two variables (multiple)
Model: $Y = \alpha + \beta X + \varepsilon$
$\alpha$ is the intercept
$\beta$ is the coefficient
$\varepsilon$ is the error term

Regression (Simple Example)

 n    y    x
 1   23    1
 2   29    2
 3   49    3
 4   64    4
 5   74    4
 6   87    5
 7   96    6
 8   97    6
 9  109    7
10  119    8
11  149    9
12  145    9
13  154   10
14  166   10

What is regression? (continued) SIMPLE EXAMPLE

Let's plot the data: scatter plots
Fitting a regression line
Finding the error term (residuals)
Residuals are the most important part of regression (you will see why later)

Estimating alpha and beta

Using Excel, SPSS, EViews, Microfit, Stata, S-PLUS
We will cover how to use Excel, EViews and SPSS (just an overview)
Interpreting the outputs
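For readers without the packages above, the least-squares estimates can also be computed by hand. The sketch below applies the textbook formulas beta = Sxy / Sxx and alpha = mean(y) - beta * mean(x) to the 14 observations of the simple example. The function name `simple_ols` is an assumption for illustration, not something from the slides.

```python
def simple_ols(x, y):
    """Least-squares fit of y = alpha + beta * x; returns (alpha, beta, residuals)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    beta = s_xy / s_xx
    alpha = y_bar - beta * x_bar
    residuals = [yi - (alpha + beta * xi) for xi, yi in zip(x, y)]
    return alpha, beta, residuals

# Data from the simple example slide.
x = [1, 2, 3, 4, 4, 5, 6, 6, 7, 8, 9, 9, 10, 10]
y = [23, 29, 49, 64, 74, 87, 96, 97, 109, 119, 149, 145, 154, 166]

alpha, beta, residuals = simple_ols(x, y)
print(f"alpha = {alpha:.3f}, beta = {beta:.3f}")  # alpha = 4.162, beta = 15.509
```

The residuals returned here are the input to the diagnostic checks discussed in the rest of the session.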

Things to evaluate (output)

Economic criteria: the signs and sizes of the effects (coefficients) should follow economic theory, e.g. demand for food (price variable)
Coefficient of determination (R-squared)
Significance tests on parameters (also joint tests)
Model selection criteria
Functional form
Econometric criteria: the assumptions (do not violate them)
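Two of the items above, the coefficient of determination and the significance test on the slope, can be illustrated on the simple example data. This is a minimal sketch using the standard simple-regression formulas. The function name and the returned dictionary keys are assumptions for illustration.

```python
import math

def fit_summary(x, y):
    """R-squared, slope standard error and t-statistic for y = alpha + beta * x."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    beta = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / s_xx
    alpha = y_bar - beta * x_bar
    rss = sum((yi - alpha - beta * xi) ** 2 for xi, yi in zip(x, y))  # unexplained
    tss = sum((yi - y_bar) ** 2 for yi in y)                          # total
    r2 = 1.0 - rss / tss
    sigma2 = rss / (n - 2)               # residual variance, n - 2 degrees of freedom
    se_beta = math.sqrt(sigma2 / s_xx)
    return {"beta": beta, "r2": r2, "se_beta": se_beta, "t_beta": beta / se_beta}

x = [1, 2, 3, 4, 4, 5, 6, 6, 7, 8, 9, 9, 10, 10]
y = [23, 29, 49, 64, 74, 87, 96, 97, 109, 119, 149, 145, 154, 166]
summary = fit_summary(x, y)
print(summary)
```

The t-statistic is compared with a Student-t critical value with n - 2 degrees of freedom. A large absolute value suggests the slope differs from zero.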

Assumptions of linear regression

Linearity
Normality
Autocorrelation (serial correlation)
Heteroskedasticity
Multicollinearity

Linearity
"Straight enough" condition (scatter plots)
SPSS: Graphs: Scatter: Matrix: enter the dependent (outcome) variable first and then each of the independent variables (categorical/nominal variables don't need to be entered, but do it anyway to see what it looks like).
SPSS: Analyze: Regression: Linear: Ramsey RESET test.

Normality
We do not need to test each series; just test the residuals
Jarque-Bera statistic, or the Q-Q and P-P plots
We use the JB statistic. Null hypothesis: the residuals are normally distributed
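The JB statistic combines the skewness S and kurtosis K of the residuals: JB = n/6 * (S^2 + (K - 3)^2 / 4). The sketch below computes it from a list of residuals. The function name is an assumption, and the comparison value 5.99 is the 5% critical value of a chi-square distribution with 2 degrees of freedom, which holds only asymptotically.

```python
def jarque_bera(residuals):
    """Jarque-Bera statistic: n/6 * (S^2 + (K - 3)^2 / 4)."""
    n = len(residuals)
    mean = sum(residuals) / n
    m2 = sum((e - mean) ** 2 for e in residuals) / n   # variance (population form)
    m3 = sum((e - mean) ** 3 for e in residuals) / n
    m4 = sum((e - mean) ** 4 for e in residuals) / n
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2
    return n / 6.0 * (skewness ** 2 + (kurtosis - 3.0) ** 2 / 4.0)

# Hypothetical residuals, for illustration only.
example_residuals = [-1.0, 0.0, 1.0]
jb = jarque_bera(example_residuals)
print(jb, "reject normality at 5%" if jb > 5.99 else "do not reject normality at 5%")
```

In small samples the chi-square approximation is rough, so the result is best read alongside the Q-Q or P-P plot.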

What to do if data is not normal?

Increase the sample size
Transform the data, e.g. take log values
Data may not be normal because of specification problems or functional form. Remember linearity

Serial Correlation/AutoCorrelation
Likely a problem in time series data, especially data observed at short intervals
What causes autocorrelation?
  Omitted variables
  Misspecification

Consequences of autocorrelation
OLS estimators will be inefficient
The estimated variances of the coefficients will be biased and inconsistent, so the usual t and F tests become unreliable

How to detect autocorrelation

Graphical methods: plot the residuals over time, and also draw a scatter plot of the residuals against their first lag, residual(-1)
Durbin-Watson test in EViews (null hypothesis: no autocorrelation)
Applicable when the model includes a constant, for first-order autocorrelation only, and with no lagged dependent variables
We have a table to compare the DW statistic with the critical values (rule of thumb: if the value is near 2, it is OK)
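The Durbin-Watson statistic itself is easy to compute from the residuals: DW = sum((e_t - e_{t-1})^2) / sum(e_t^2). A minimal sketch follows. The function name and the two residual series are hypothetical and chosen only to show the extremes.

```python
def durbin_watson(residuals):
    """DW = sum of squared first differences of residuals / sum of squared residuals."""
    numerator = sum((cur - prev) ** 2 for prev, cur in zip(residuals, residuals[1:]))
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator

# Hypothetical residual patterns (illustrative only).
alternating = [1.0, -1.0, 1.0, -1.0]   # strong negative autocorrelation
persistent = [1.0, 1.0, 1.0, 1.0]      # strong positive autocorrelation

print(durbin_watson(alternating))  # 3.0
print(durbin_watson(persistent))   # 0.0
```

Values well below 2 point to positive autocorrelation and values well above 2 to negative autocorrelation, which is where the "near 2 is OK" rule of thumb comes from.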

How to detect autocorrelation (continued)

Breusch-Godfrey test for serial correlation
It can test higher orders
EViews: View / Residual Tests / Serial Correlation LM Test

How to solve autocorrelation?

Cochrane-Orcutt iterative procedure (beyond our scope; remember, I said regression in plain English)
AR(1)

Test for specification

Ramsey's RESET test
We include powers of the predicted dependent variable (e.g. the squared fitted values) as additional regressors and test whether they are significant
Let's do it in EViews

Heteroskedasticity
The opposite of homoskedasticity
"Hetero" means unequal; "homo" means equal
The second part of the word, "skedasticity", means spread (variance)
Example: consumption of the rich and the poor. The rich have a wider spread (between saving and consumption); the poor have a narrower spread
There are many ways to test for heteroskedasticity
Graphically: plot the squared residuals against the dependent or an independent variable; there must not be a systematic pattern
However, graphical methods are of limited use in multiple regression

Heteroskedasticity
The following tests can be used:
Breusch-Pagan LM test, Glejser LM test, Harvey-Godfrey LM test, Park test, Goldfeld-Quandt test, and White test
Let's use the White test
Null hypothesis: no heteroskedasticity (i.e. homoskedasticity)
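The mechanics of the White test can be sketched for the one-regressor case: regress y on x, then regress the squared residuals on a constant, x and x squared, and compute LM = n * R-squared from that auxiliary regression. The helper names below are assumptions, and the small normal-equations solver is a plain-Python stand-in for what EViews does internally. Under the null, LM is asymptotically chi-square with 2 degrees of freedom (5% critical value 5.99).

```python
def ols(X, y):
    """OLS via the normal equations. X is a list of rows that already includes a constant."""
    n, k = len(X), len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(X[t][i] * X[t][j] for t in range(n)) for j in range(k)]
         + [sum(X[t][i] * y[t] for t in range(n))] for i in range(k)]
    for c in range(k):  # Gauss-Jordan elimination with partial pivoting
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        for r in range(k):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    coef = [A[i][k] / A[i][i] for i in range(k)]
    resid = [yt - sum(b * v for b, v in zip(coef, row)) for row, yt in zip(X, y)]
    y_bar = sum(y) / n
    r2 = 1.0 - sum(e ** 2 for e in resid) / sum((yt - y_bar) ** 2 for yt in y)
    return coef, resid, r2

def white_test_lm(x, y):
    """White LM statistic for a simple regression: n * R^2 of e^2 on (1, x, x^2)."""
    _, resid, _ = ols([[1.0, xi] for xi in x], y)
    squared_resid = [e ** 2 for e in resid]
    _, _, r2_aux = ols([[1.0, xi, xi ** 2] for xi in x], squared_resid)
    return len(x) * r2_aux

# Data from the simple example slide.
x = [1, 2, 3, 4, 4, 5, 6, 6, 7, 8, 9, 9, 10, 10]
y = [23, 29, 49, 64, 74, 87, 96, 97, 109, 119, 149, 145, 154, 166]
lm = white_test_lm(x, y)
print(lm, "reject homoskedasticity at 5%" if lm > 5.99 else "do not reject at 5%")
```

With more regressors, the auxiliary regression also includes the squares and cross-products of all regressors, and the degrees of freedom grow accordingly.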

Consequences of heteroskedasticity and ways to correct it

No change in the estimated parameters, but the standard errors are affected (and so is the significance)
Generalized (or weighted) least squares (beyond our discussion)
Run a heteroskedasticity-corrected regression (let's do a simple one: White-corrected standard error estimates)
Alternatively, we can also use dummy variables to account for heteroskedasticity
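To show what "White-corrected" means, the sketch below computes the slope and its heteroskedasticity-robust standard error for a simple regression, using the basic (HC0) form: Var(beta) = sum((x_i - mean(x))^2 * e_i^2) / Sxx^2. The function name and the four data points are hypothetical. Software packages may apply small-sample adjustments, so their numbers can differ slightly.

```python
import math

def white_se_slope(x, y):
    """Slope of y = alpha + beta * x and its White (HC0) robust standard error."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    beta = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / s_xx
    alpha = y_bar - beta * x_bar
    resid = [yi - alpha - beta * xi for xi, yi in zip(x, y)]
    # Each squared residual is weighted by its own squared deviation of x.
    var_hc0 = sum((xi - x_bar) ** 2 * e ** 2 for xi, e in zip(x, resid)) / s_xx ** 2
    return beta, math.sqrt(var_hc0)

# Hypothetical data, for illustration only.
x_demo = [1.0, 2.0, 3.0, 4.0]
y_demo = [2.0, 4.0, 5.0, 9.0]
beta_demo, robust_se = white_se_slope(x_demo, y_demo)
print(beta_demo, robust_se)
```

The coefficient is the same as in ordinary OLS. Only the standard error, and hence the t-statistic, changes.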

Multicollinearity
Whether there is any relationship between the regressors
Consequences: the parameters are indeterminate under perfect multicollinearity (however, real data rarely have perfect multicollinearity)
Imperfect multicollinearity: when regressors are correlated, but less than perfectly
How to detect?
  Correlation matrix
  Check the significance of the individual coefficients (t-test) and their joint significance (F-test)
  Run the regression separately for each regressor
  VIF in EViews or SPSS (a VIF value of less than 10 is OK)
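The VIF for regressor j is 1 / (1 - R_j^2), where R_j^2 comes from regressing that regressor on all the others. With only two regressors, R_j^2 is just their squared correlation, which keeps the sketch short. The function names and the small series are hypothetical.

```python
import math

def correlation(a, b):
    """Pearson correlation coefficient between two equal-length series."""
    n = len(a)
    a_bar, b_bar = sum(a) / n, sum(b) / n
    s_ab = sum((ai - a_bar) * (bi - b_bar) for ai, bi in zip(a, b))
    s_aa = sum((ai - a_bar) ** 2 for ai in a)
    s_bb = sum((bi - b_bar) ** 2 for bi in b)
    return s_ab / math.sqrt(s_aa * s_bb)

def vif_two_regressors(x1, x2):
    """VIF = 1 / (1 - r^2); valid as written only for a model with two regressors."""
    r = correlation(x1, x2)
    return 1.0 / (1.0 - r ** 2)

# Hypothetical regressors (illustrative only).
unrelated = vif_two_regressors([1, 2, 3, 4], [1, -1, -1, 1])
closely_related = vif_two_regressors([1, 2, 3, 4], [1, 2, 3, 5])
print(unrelated, closely_related)
```

A VIF of 1 means the regressors are uncorrelated. The second pair gives a value above the rule-of-thumb threshold of 10.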

Structural break and parameter stability test

Aim is to see whether the parameters of the model have been constant over the period
Chow test: we have to know the point of the break
CUSUM and CUSUM of squares (CUSUMSQ) tests: test parameter stability
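The Chow test compares the residual sum of squares (RSS) of one pooled regression with the combined RSS of separate regressions before and after the known break: F = ((RSS_pooled - (RSS_1 + RSS_2)) / k) / ((RSS_1 + RSS_2) / (n - 2k)), with k parameters per regression. The sketch below does this for a simple regression (k = 2). The function names and the data, which contain a deliberate jump after the fifth observation, are hypothetical.

```python
def rss_simple(x, y):
    """Residual sum of squares from fitting y = alpha + beta * x by OLS."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    beta = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / s_xx
    alpha = y_bar - beta * x_bar
    return sum((yi - alpha - beta * xi) ** 2 for xi, yi in zip(x, y))

def chow_f(x, y, break_at, k=2):
    """Chow F-statistic for a break after the first `break_at` observations."""
    rss_pooled = rss_simple(x, y)
    rss_split = (rss_simple(x[:break_at], y[:break_at])
                 + rss_simple(x[break_at:], y[break_at:]))
    n = len(x)
    return ((rss_pooled - rss_split) / k) / (rss_split / (n - 2 * k))

# Hypothetical series with a structural break after observation 5.
x_break = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y_break = [1.1, 1.9, 3.2, 3.8, 5.1, 22.0, 23.9, 26.1, 28.0, 29.9]
f_stat = chow_f(x_break, y_break, break_at=5)
print(f_stat)
```

The statistic is compared with an F critical value with (k, n - 2k) degrees of freedom. A large value suggests the parameters differ across the two sub-periods.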

Regression Analysis with Dummy Variables
y = b0 + b1x1 + b2D2 + . . . + bkxk + u

Dummy Variables
A dummy variable is a variable that takes on the value 1 or 0
Examples: male (= 1 if male, 0 otherwise), south (= 1 if in the south, 0 otherwise), etc.
Dummy variables are also called binary variables, for obvious reasons

A Dummy Independent Variable

Consider a simple model with one continuous variable (x) and one dummy (d)
y = b0 + d0d + b1x + u (d0 is the coefficient on the dummy d)
This can be interpreted as an intercept shift
If d = 0, then y = b0 + b1x + u
If d = 1, then y = (b0 + d0) + b1x + u
The case of d = 0 is the base group

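Example of d0 > 0
[Figure: two parallel lines with slope b1 plotted against x. The upper line, for d = 1, is y = (b0 + d0) + b1x; the lower line, for d = 0, is y = b0 + b1x. The vertical gap between them is d0.]
Economics 20 - Prof. Anderson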

Dummies for Multiple Categories

We can use dummy variables to control for something with multiple categories
Suppose everyone in your data is either a HS dropout, HS grad only, or college grad
To compare HS and college grads to HS dropouts, include 2 dummy variables
hsgrad = 1 if HS grad only, 0 otherwise; and colgrad = 1 if college grad, 0 otherwise

Multiple Categories (cont)

Any categorical variable can be turned into a set of dummy variables
Because the base group is represented by the intercept, if there are n categories there should be n - 1 dummy variables
If there are a lot of categories, it may make sense to group some together
Example: top 10 ranking, 11 - 25, etc.
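The n - 1 rule can be seen by building the dummies directly. In this sketch the helper `make_dummies` and the small education series are hypothetical. The base group is simply left out, so it is absorbed by the intercept.

```python
def make_dummies(values, base):
    """One 0/1 column per category, except the base category."""
    categories = sorted(set(values) - {base})
    return {c: [1 if v == c else 0 for v in values] for c in categories}

# Hypothetical sample with three education categories.
education = ["dropout", "hsgrad", "colgrad", "hsgrad", "dropout"]
dummies = make_dummies(education, base="dropout")

for name, column in dummies.items():
    print(name, column)
```

Three categories give two dummy columns. Including a third column for the base group alongside the intercept would create perfect multicollinearity (the dummy variable trap).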

Interactions Among Dummies

Interacting dummy variables is like subdividing the group
Example: have dummies for male, as well as hsgrad and colgrad
Add male*hsgrad and male*colgrad, for a total of 5 dummy variables -> 6 categories
Base group is female HS dropouts
hsgrad is for female HS grads, colgrad is for female college grads
The interactions reflect male HS grads and male college grads

More on Dummy Interactions

Formally, the model is y = b0 + d1male + d2hsgrad + d3colgrad + d4male*hsgrad + d5male*colgrad + b1x + u, then, for example:
If male = 0 and hsgrad = 0 and colgrad = 0: y = b0 + b1x + u
If male = 0 and hsgrad = 1 and colgrad = 0: y = b0 + d2hsgrad + b1x + u
If male = 1 and hsgrad = 0 and colgrad = 1: y = b0 + d1male + d3colgrad + d5male*colgrad + b1x + u
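To see how five dummies give six group intercepts, the sketch below evaluates the intercept part of the model for every combination of gender and education. The coefficient values are made up purely for illustration. Only the structure of the formula comes from the slide.

```python
from itertools import product

def group_intercept(male, hsgrad, colgrad, b0, d1, d2, d3, d4, d5):
    """Intercept of y for one group: everything in the model except b1*x and u."""
    return (b0 + d1 * male + d2 * hsgrad + d3 * colgrad
            + d4 * male * hsgrad + d5 * male * colgrad)

# Hypothetical coefficient values (not estimates from any data).
coefs = dict(b0=1.0, d1=0.5, d2=0.3, d3=0.8, d4=0.1, d5=0.2)

# Education is coded as (hsgrad, colgrad); dropout is the base category.
education = {"dropout": (0, 0), "hsgrad": (1, 0), "colgrad": (0, 1)}

intercepts = {}
for male, (label, (hs, col)) in product((0, 1), education.items()):
    group = ("male" if male else "female", label)
    intercepts[group] = group_intercept(male, hs, col, **coefs)

for group, value in intercepts.items():
    print(group, round(value, 2))
```

The base group (female HS dropouts) has intercept b0, and every other group adds the relevant d coefficients to it.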

Other Interactions with Dummies

Can also consider interacting a dummy variable, d, with a continuous variable, x
y = b0 + d1d + b1x + d2d*x + u
If d = 0, then y = b0 + b1x + u
If d = 1, then y = (b0 + d1) + (b1 + d2) x + u
This is interpreted as a change in the slope
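A small numerical check makes the two shifts concrete. The sketch below evaluates the systematic part of the model (ignoring u) for both groups. The coefficient values are hypothetical.

```python
def predict(x, d, b0, b1, d1, d2):
    """Systematic part of y = b0 + d1*d + b1*x + d2*d*x (error term omitted)."""
    return b0 + d1 * d + b1 * x + d2 * d * x

# Hypothetical coefficients, for illustration only.
params = dict(b0=1.0, b1=2.0, d1=0.5, d2=0.25)

# Slope for each group = change in prediction when x rises by one unit.
slope_base = predict(3, 0, **params) - predict(2, 0, **params)   # b1
slope_d1 = predict(3, 1, **params) - predict(2, 1, **params)     # b1 + d2

# Intercept for each group = prediction at x = 0.
intercept_base = predict(0, 0, **params)                         # b0
intercept_d1 = predict(0, 1, **params)                           # b0 + d1

print(slope_base, slope_d1, intercept_base, intercept_d1)
```

Setting d2 = 0 reduces this to the pure intercept-shift model, where the two lines are parallel.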

Other use of dummy variables

Seasonal dummies
Structural breaks
Shocks, etc.

Let's Recap Our Time Series Analysis

Unit root test
Cointegration
Vector error correction model
Granger causality

Must Have Books (for new researchers)

Practical Data Analysis
  Gary Koop (2004) Analysis of Economic Data, John Wiley.
Basic Econometrics
  Gary Koop (2008) Introduction to Econometrics, John Wiley.
  Samprit Chatterjee, Ali S. Hadi, Bertram Price (2000) Regression Analysis by Example, John Wiley.
  Dimitrios Asteriou and Stephen G. Hall (2007) Applied Econometrics: A Modern Approach Using Eviews and Microfit, Palgrave.
Basic Statistics and Regression Models
  De Veaux, Paul Velleman and David Bock, Stats: Data and Models, Pearson. (for basic statistics)

Thank you
QUESTIONS PLEASE
More materials will soon be available (by end of the month) through my website:

www.vgrchandran.com/default.html
