0% found this document useful (0 votes)

52 views32 pages

CE687A Lecture23

The document provides an overview of count models and model selection for modeling average annual daily traffic (AADT). It discusses Poisson and negative binomial regression models as well as the use of log transformations and different model specifications. Key aspects covered include heteroskedasticity, empirical Bayes estimation to update the mean based on observed counts, and comparing models on training and test data to avoid overfitting.

Uploaded by

varun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views32 pages

CE687A Lecture23

Uploaded by

varun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Statistical and Econometric

Methods for Transportation

Engineering (CE687A)
Count models: application and
model selection
Aditya Medury
Lecture 23

2022-23, Semester I
IIT Kanpur
1
Disclaimer

This course material is being distributed as part of CE687A, titled “Statistical and
Econometric Methods for Transportation Engineering ", at IIT Kanpur during
semester I of the academic year 2022-23. Its contents are being shared in
confidence, for the sole purpose of instruction, and are only meant for the
students registered in this course. Any form of distribution, reproduction or
uploading of these materials anywhere, or with anyone, outside this course is
strictly prohibited.

2
Poisson-Gamma mixing

• Let 𝜇’s in the population be distributed by Gamma distribution

• Let crash counts 𝑍|𝜇 be distributed by Poisson distribution

𝜇𝑘 −𝜇
𝑃𝑟𝑜𝑏 𝑍 = 𝑘|𝜇 = 𝑒
𝑘!

𝐸 𝑍|𝜇 = 𝜇, 𝑉𝑎𝑟 𝑍|𝜇 = 𝜇

𝛽 𝛼 −𝛽𝜇 𝛼−1
𝑓 𝜇 = 𝑒 𝜇
Γ(𝛼)

𝛼 𝛼
𝐸 𝜇 = , 𝑉𝑎𝑟 𝜇 = 2
𝛽 𝛽
3
Poisson-Gamma mixing

∞
𝑃𝑟𝑜𝑏 𝑍 = 𝑘 = න 𝑃𝑟𝑜𝑏 𝑍 = 𝑘|𝜇 𝑓 𝜇 𝑑𝜇
0

∞
𝜇𝑘 −𝜇 𝛽 𝛼 −𝛽𝜇 𝛼−1
𝑃𝑟𝑜𝑏 𝑍 = 𝑘 = න 𝑒 𝑒 𝜇 𝑑𝜇
0 𝑘! Γ(𝛼)

∞
𝛽𝛼
= න 𝑒 −𝜇(1+𝛽) 𝜇𝑘+𝛼−1 𝑑𝜇
Γ(𝛼)𝑘! 0

Resembles a gamma distribution

4
Negative binomial distribution

Γ(𝑘 + 𝛼) 𝛽𝛼
𝑃𝑟𝑜𝑏 𝑍 = 𝑘 =
Γ(𝛼)𝑘! 𝛽 + 1 𝑘−𝛼

𝛼 𝛼 𝛼
𝐸𝑍 = , 𝑉𝑎𝑟 𝑍 = + 2 = 𝐸 𝜇 + 𝑉𝑎𝑟[𝜇]
𝛽 𝛽 𝛽

5
Re-parameterized NB distribution
(as used in R)

𝜃 𝑘
Γ 𝑘+𝜃 𝜃 𝜇
𝑃𝑟𝑜𝑏 𝑍 = 𝑘 =
Γ 𝜃 𝑘! 𝜃+𝜇 𝜃+𝜇

Where,

𝛼 1 2 𝛼 𝛼
𝐸 𝑍 =𝜇 = , 𝑉𝑎𝑟 𝑍 = 𝜇 + 𝜇 = + 2
𝛽 𝜃 𝛽 𝛽

6
𝑉𝑎𝑟 𝑍 > 𝑉𝑎𝑟 𝜇

7
Image source: Hauer, E. (2015). The art of regression modeling in road safety (Vol. 38). New York: Springer.
Recap: Law of iterated expectations

𝐸 𝑌 = 𝐸𝑋 𝐸 𝑌|𝑋

• 𝐸𝑋 . is the expectation over the values of 𝑋.

8
Law of total variance

𝑉𝑎𝑟 𝑌 = 𝐸 𝑉𝑎𝑟 𝑌|𝑋 + 𝑉𝑎𝑟 𝐸 𝑌 𝑋

9
Deriving the law of total variance (I)

𝑉𝑎𝑟 𝑌|𝑋 = 𝐸 𝑌 2 𝑋 − 𝐸 𝑌|𝑋 2 (𝑐𝑜𝑛𝑑𝑖𝑡𝑖𝑜𝑛𝑎𝑙 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒)

⇒ 𝐸 𝑉𝑎𝑟 𝑌|𝑋 = 𝐸 𝐸 𝑌 2 |𝑋 − 𝐸[ 𝐸 𝑌|𝑋 2 ]

= 𝐸 𝑌 2 − 𝐸 𝐸 𝑌|𝑋 2 (𝑖𝑡𝑒𝑟𝑎𝑡𝑒𝑑 𝑒𝑥𝑝𝑒𝑐𝑡𝑎𝑡𝑖𝑜𝑛𝑠)

= 𝐸 𝑌2 − 𝐸 𝑌 2 − 𝐸 𝐸 𝑌|𝑋 2 − 𝐸𝑌 2

⇒ 𝑉𝑎𝑟 𝑌 = 𝐸 𝑉𝑎𝑟 𝑌|𝑋 + 𝑉𝑎𝑟 𝐸 𝑌|𝑋

11
What is the distribution of mean conditional on observing
crashes?

• 𝑘|𝜇~𝑃𝑜𝑖𝑠𝑠𝑜𝑛[𝜇] 𝜇|𝑘 ∼ ?
• 𝜇~𝐺𝑎𝑚𝑚𝑎[𝛼, 𝛽]
𝑃 𝑘 𝜇 𝑓(𝜇)
𝑓 𝜇|𝑘 = ∝ 𝑃 𝑘 𝜇 𝑓(𝜇)
𝑃(𝑘)

𝜇𝑘 −𝜇 𝛽 𝛼−1 −𝛽𝜇 𝛼−1

∝ 𝑒 𝑒 𝜇
𝑘! Γ 𝛼

∝ 𝑒 −(1+𝛽)𝜇 𝜇 (𝛼+𝑘)−1

~Γ[𝛼 + 𝑘, 𝛽 + 1]
12
Combining information from 𝝁 and 𝒌

𝑓 𝜇|𝑘 ~Γ[𝛼 + 𝑘, 𝛽 + 1]

𝛼+𝑘 𝛼+𝑘
𝐸 𝜇|𝑘 = , 𝑉𝑎𝑟 𝜇|𝑘 =
𝛽+1 (𝛽 + 1)2

13
𝑬 𝝁|𝒌 is a weighted average of 𝑬 𝝁 and 𝒌

𝛼 𝑘
𝐸 𝜇|𝑘 = +
𝛽+1 𝛽+1

𝛼 𝛼
Since,𝐸 𝜇 = 𝛽, substituting 𝛽=𝐸 𝜇 ,

𝛼 𝑘 𝛼𝐸 𝜇 𝑘𝐸 𝜇
𝐸 𝜇|𝑘 = 𝛼 + 𝛼 = +
+1 +1 𝛼+𝐸 𝜇 𝛼+𝐸 𝜇
𝐸𝜇 𝐸𝜇

𝛼 1 1
𝑤= = =
𝛼+𝐸 𝜇 1 + 𝐸 𝜇 /𝛼 1 + 𝐸 𝜇 𝜃

14
𝑬 𝝁|𝒌 is a weighted average of 𝑬 𝝁 and 𝒌

𝐸 𝜇|𝑘 = 𝒘𝐸 𝜇 + 1 − 𝒘 𝑘

where,
𝛼 1 1
𝒘= = =
𝛼+𝐸 𝜇 1 + 𝐸 𝜇 /𝛼 1 + 𝐸 𝜇 𝜃

• 𝐸 𝜇 ↑ 𝒘 ↓: 𝐸 𝜇|𝑘 → 𝛼 + 𝑘
• 𝜃 ↑ 𝒘 ↓: 𝐸 𝜇|𝑘 → 𝑘

15
𝑬 𝝁|𝒌 is an empirical Bayes estimate of 𝝁

• 𝐸 𝜇 : prior, obtained from empirical data/modeling

• 𝐸 𝜇|𝑘 : posterior estimate

16
Let us revisit the AADT model

Let’s compare three models

• Model 1: CNTYPOP, NUMLANES2, NUMCLASS3, FUNCLASS3

• Model 2: CNTYPOP, CNTYPOP_Urb, Urban, NUMLANES2, NUMCLASS3,

FUNCLASS3

• Model 3: log(CNTYPOP), NUMLANES2, NUMCLASS3, FUNCLASS3

17
What is the variation across urban/rural segments?

• Other variables are also likely to be correlated with Urban (e.g., FUNCLASS3)

18
Comparison of linear models

• Which model would you prefer? Why?

• Are the relevant variables significant

with appropriate signs?

19
Checking for heteroskedasticity

Null hypothesis assumes

homoskedasticity, which can be rejected
for all three models at low significance
levels

20
Recap: Heteroskedasticity Consistent Estimator

−1 𝑛 −1
1 1 ′ 1 1 ′
෡
Estimate. Asy. Var 𝛃|𝐗 = 𝐗𝐗 ෍ 𝑒𝑖2 𝐱 𝐢 𝐱 𝐢′ 𝐗𝐗
𝑛 𝑛 𝑛 𝑛
𝑖=1

• The above estimator is also known as Eicker-Huber-White heteroscedasticity consistent

estimator
• This estimator is robust to unknown heteroskedasticity, and provides “robust” standard
errors for confidence intervals.
• Other techniques include weighted least squares (WLS), if the nature of
heteroskedasticity is known (see 4.4.2 of Washington et al.)

Additional reference for more mathematical background: http://people.stern.nyu.edu/wgreene/MathStat/GreeneChapter9.pdf 21

How different are robust standard errors?

Before After

22
Considering Poisson and NB alternatives

Let’s compare three models

• Model 1 (linear): CNTYPOP, NUMLANES2, NUMCLASS3, FUNCLASS3

• Model 4 (Poisson): CNTYPOP, NUMLANES2, NUMCLASS3, FUNCLASS3

• Model 5 (Poisson): log(CNTYPOP), NUMLANES2, NUMCLASS3, FUNCLASS3

• Model 6 (NB): log(CNTYPOP), NUMLANES2, NUMCLASS3, FUNCLASS3

23
Comparison of generalized linear
models

• Linear model is a special case of

generalized linear models

• Would you prefer log(CNTYPOP) over

CNTYPOP as an explanatory for Poisson
and NB?
𝐾−1
❑ 𝐸[𝜇𝑖 ] = 𝑒 σ𝑗=0 β𝑗 𝑥𝑖𝑗

• Would you prefer NB over Poisson?

24
So which model would you prefer for modelling AADT?
(Model selection)

25
Training vs test data

• Out-of-sample predictions are expected to be worse than in-sample predictions due to

the possibility of overfitting when seeking models with favourable goodness-of-fit
criteria
❑ Penalizing for increase in variables helps mitigate this issue.

Image source: https://medium.com/greyatom/what-is-underfitting-and-overfitting-in-machine-learning-and-how-to-deal-with-it-6803a989c76 26

Estimating training vs test data differences for AADT models

• Given the small sample size, we can split the data as 90% training and 10% test data
• We run 100 iterations of train-test splits, and estimate the training and test RMSE and
MAD for:
• Model 1 (preferred linear model out of three)
• Model 2 (preferred Poisson model out of two)
• Model 3 (preferred NB model)

27
Mean Absolute Deviation

Train Test

28
Root Mean Squared Error

Train Test

29
Overdispersion tests

• When data are overdispersed, the estimated variance is larger than expected from a true
Poisson process → standard errors get inflated.

Given a Poisson regression

𝐻0 : 𝑣𝑎𝑟 𝑦𝑖 = 𝜇𝑖
𝐻𝐴 : 𝑣𝑎𝑟 𝑦𝑖 = 𝜇𝑖 + 𝛼𝜇𝑖2
Regression-based tests can be undertaken to test 𝛼 = 0 assumption. (see section 11.5 of
Washington et al. or Cameron and Trivedi (1990))
• AER package in R has a function dispersiontest

30
A new goodness-of-fit statistic: deviance

ෝ = 2 𝐿𝐿𝑚𝑎𝑥 𝐲 − 𝐿𝐿𝑓𝑖𝑡𝑡𝑒𝑑 (ෝ
𝐷 𝛍 𝛍)
𝐿𝐿𝑚𝑎𝑥 : Maximum possible value of likelihood (𝜇ෝ𝑖 = 𝑦𝑖 )
• For Poisson distribution:𝐿𝐿𝑚𝑎𝑥 = σ𝑁
𝑖=1 𝑦𝑖 log 𝑦𝑖 − 𝑦𝑖 − log 𝑦𝑖 !

• For normal distribution: 𝐷𝑁 = σ𝑁

𝑖 𝑦𝑖 − 𝜇𝑖
2

𝑦𝑖
• For Poisson distribution: 𝐷𝑁 = σ𝑁
𝑖 𝑦𝑖 log − (𝑦𝑖 − 𝜇𝑖 )
𝜇𝑖

• Null Deviance: 𝜇ෝ𝑖 = 𝑦ത (also the outcome of an intercept-only model)

31
Comments
Discussion
Questions

E-mail: [email protected]
32

Formula Sheet - Quantitative Analysis
100% (1)
Formula Sheet - Quantitative Analysis
11 pages
(Ebook PDF) Essentials of Modern Business Statistics With Microsoft Office Excel 7th Editioninstant Download
100% (5)
(Ebook PDF) Essentials of Modern Business Statistics With Microsoft Office Excel 7th Editioninstant Download
51 pages
LectureNotes22 WI4455
No ratings yet
LectureNotes22 WI4455
154 pages
Statistical Methods in Data Analysis - W. J. Metzger
No ratings yet
Statistical Methods in Data Analysis - W. J. Metzger
278 pages
Statistical Methods in Experimental Chemistry
100% (1)
Statistical Methods in Experimental Chemistry
103 pages
Fundamentals of Statistics (18.6501x)
No ratings yet
Fundamentals of Statistics (18.6501x)
20 pages
Cimentaciones Maquinas
100% (1)
Cimentaciones Maquinas
235 pages
Sasin DECS 434 Session 1 and 2 - Probability and Excel
No ratings yet
Sasin DECS 434 Session 1 and 2 - Probability and Excel
104 pages
Statistics, Probability, Distributions, & Error Propagation: James R. Graham 9/2/09
No ratings yet
Statistics, Probability, Distributions, & Error Propagation: James R. Graham 9/2/09
39 pages
Fall 2018 Statistics 201A Aditya Guntuboyina
No ratings yet
Fall 2018 Statistics 201A Aditya Guntuboyina
101 pages
Basic Stats Session
No ratings yet
Basic Stats Session
16 pages
Formuleblad Statistiek
No ratings yet
Formuleblad Statistiek
10 pages
Mathematical Statistics Intro Course 1713243381
No ratings yet
Mathematical Statistics Intro Course 1713243381
142 pages
Problem Set 2
No ratings yet
Problem Set 2
18 pages
L19 CountDataModels v2
No ratings yet
L19 CountDataModels v2
36 pages
Stats Formula Sheet
No ratings yet
Stats Formula Sheet
28 pages
Ikaj Stochmod Lectnotes
No ratings yet
Ikaj Stochmod Lectnotes
114 pages
Project Report
No ratings yet
Project Report
56 pages
Stat 2013
No ratings yet
Stat 2013
132 pages
Lecture Notes
No ratings yet
Lecture Notes
138 pages
Statlearn PDF
No ratings yet
Statlearn PDF
123 pages
Solution
No ratings yet
Solution
148 pages
DS ML Probability Statistics Interview
No ratings yet
DS ML Probability Statistics Interview
6 pages
All Lectures 2018 Fall 201 A
No ratings yet
All Lectures 2018 Fall 201 A
100 pages
Math and Statistics PDF
No ratings yet
Math and Statistics PDF
192 pages
Handbook Statistical Foundations of Machine Learning
No ratings yet
Handbook Statistical Foundations of Machine Learning
267 pages
Statistics A. Introduction
50% (2)
Statistics A. Introduction
24 pages
Statistics
No ratings yet
Statistics
53 pages
Introduction To Probability Theory and S
No ratings yet
Introduction To Probability Theory and S
127 pages
Chapter 3 Randomized Experiments Advanced Inference
No ratings yet
Chapter 3 Randomized Experiments Advanced Inference
13 pages
Error Propagation
No ratings yet
Error Propagation
22 pages
Probability and Statistics Ii: George Deligiannidis Module Lecturer 2020/21: Kalliopi Mylona
No ratings yet
Probability and Statistics Ii: George Deligiannidis Module Lecturer 2020/21: Kalliopi Mylona
107 pages
Advanced Econometrics PDF
No ratings yet
Advanced Econometrics PDF
58 pages
STAT359 Study Guide
No ratings yet
STAT359 Study Guide
7 pages
Lecture Notes For STAT2602
No ratings yet
Lecture Notes For STAT2602
104 pages
Lecture 11: Standard Error, Propagation of Error, Central Limit Theorem in The Real World
No ratings yet
Lecture 11: Standard Error, Propagation of Error, Central Limit Theorem in The Real World
13 pages
Lecture 4
No ratings yet
Lecture 4
8 pages
Block 05d ControChartAdvanced
No ratings yet
Block 05d ControChartAdvanced
98 pages
Words of Wisdom
No ratings yet
Words of Wisdom
17 pages
Statistics and Econometrics
No ratings yet
Statistics and Econometrics
12 pages
(University of Wisconsin-Madison, Shalizi) CSSS 2000-2001 Math Review Lectures - Probability, Statistics, and Stochastic Processes
No ratings yet
(University of Wisconsin-Madison, Shalizi) CSSS 2000-2001 Math Review Lectures - Probability, Statistics, and Stochastic Processes
71 pages
Introduction
No ratings yet
Introduction
11 pages
Machine Learning Lecture Notes Undergrad
No ratings yet
Machine Learning Lecture Notes Undergrad
19 pages
Statistic and Probability
No ratings yet
Statistic and Probability
83 pages
Introduction To Probability Theory and Statistics
No ratings yet
Introduction To Probability Theory and Statistics
127 pages
Probability and Statistics: Cookbook
No ratings yet
Probability and Statistics: Cookbook
28 pages
MS Theory Exam Study Guide
No ratings yet
MS Theory Exam Study Guide
50 pages
Solutions - Week 6
No ratings yet
Solutions - Week 6
5 pages
Problem Set 1 - Answers
No ratings yet
Problem Set 1 - Answers
7 pages
Machine Learning and Pattern Recognition Week 2 Error Bars
No ratings yet
Machine Learning and Pattern Recognition Week 2 Error Bars
3 pages
Lecture 1
No ratings yet
Lecture 1
8 pages
SST 306 LECTURE NOTES TWO (Power of A Test)
No ratings yet
SST 306 LECTURE NOTES TWO (Power of A Test)
22 pages
SDM 1 Formula
No ratings yet
SDM 1 Formula
9 pages
Reliability & Maintainability Engineering Ebeling Chapter 12 Book Solutions - Data Collection ..
No ratings yet
Reliability & Maintainability Engineering Ebeling Chapter 12 Book Solutions - Data Collection ..
15 pages
ACTL30004 Assignment
No ratings yet
ACTL30004 Assignment
15 pages
STAT270 Formula Booklet Vretta Updated
No ratings yet
STAT270 Formula Booklet Vretta Updated
10 pages
A Handbook To Conquer Casella and Berger Book in Ten Days: Oliver Y. Chén Last Update: June 25, 2016
No ratings yet
A Handbook To Conquer Casella and Berger Book in Ten Days: Oliver Y. Chén Last Update: June 25, 2016
15 pages
Stat 1116-BHS20100 - M Assignment
No ratings yet
Stat 1116-BHS20100 - M Assignment
7 pages
MECH 262 - Notes (Statistics)
No ratings yet
MECH 262 - Notes (Statistics)
7 pages
Lin Mod Book
No ratings yet
Lin Mod Book
567 pages
Wilkins, A Zurn Company: Demand Forecasting: Submitted By: Group-8 Section-C
No ratings yet
Wilkins, A Zurn Company: Demand Forecasting: Submitted By: Group-8 Section-C
6 pages
Sampling Design and Analysis MTH 494: Ossam Chohan Assistant Professor CIIT Abbottabad
No ratings yet
Sampling Design and Analysis MTH 494: Ossam Chohan Assistant Professor CIIT Abbottabad
34 pages
CH 00
No ratings yet
CH 00
4 pages
Statistical Questions For Practice Exercises
No ratings yet
Statistical Questions For Practice Exercises
7 pages
Audit Sampling 50 Points
100% (1)
Audit Sampling 50 Points
18 pages
Statistics For Management and Economics, Sixth Edition: Formulas
No ratings yet
Statistics For Management and Economics, Sixth Edition: Formulas
15 pages
Statistics Summative 3
No ratings yet
Statistics Summative 3
5 pages
Chapter 7 Exercises
No ratings yet
Chapter 7 Exercises
4 pages
Measure of Central Tendency
No ratings yet
Measure of Central Tendency
8 pages
wst03 01 Que 2023.01
No ratings yet
wst03 01 Que 2023.01
28 pages
Cook 2008
No ratings yet
Cook 2008
27 pages
Example PPT Case Study 2
No ratings yet
Example PPT Case Study 2
10 pages
Using The Leapfrog Design As A Simple Form of Ad
No ratings yet
Using The Leapfrog Design As A Simple Form of Ad
17 pages
Stats Crib Sheet Exam
No ratings yet
Stats Crib Sheet Exam
2 pages
Answer Key Testname: UNTITLED1.TST: ESSAY. Write Your Answer in The Space Provided
No ratings yet
Answer Key Testname: UNTITLED1.TST: ESSAY. Write Your Answer in The Space Provided
6 pages
Clustering-Based Undersampling With Random Over Sampling Examples and Support Vector Machine For Imbalanced Classification of Breast Cancer Diagnosis
No ratings yet
Clustering-Based Undersampling With Random Over Sampling Examples and Support Vector Machine For Imbalanced Classification of Breast Cancer Diagnosis
12 pages
Tripod Cluster Checklist
No ratings yet
Tripod Cluster Checklist
2 pages
Five Number Summary
No ratings yet
Five Number Summary
8 pages
Statistics Chapter1
No ratings yet
Statistics Chapter1
3 pages
Smbi TD1
No ratings yet
Smbi TD1
4 pages
Assignment 4 Chapter 4
No ratings yet
Assignment 4 Chapter 4
4 pages
Simple Linear Regression Interpretation PDF
No ratings yet
Simple Linear Regression Interpretation PDF
2 pages
Examination 2 STAT 285: Business Statistics Spring 2020: Raehslerr@duq - Edu
No ratings yet
Examination 2 STAT 285: Business Statistics Spring 2020: Raehslerr@duq - Edu
3 pages
Villanueva BSA22 LaboratoryExercise3
No ratings yet
Villanueva BSA22 LaboratoryExercise3
2 pages
Assignment 5 - 2020
No ratings yet
Assignment 5 - 2020
2 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Calculus Volume1
From Everand
Calculus Volume1
Ming Yao Tsai
No ratings yet
Speed Mathamatics
From Everand
Speed Mathamatics
Naila Hina
1/5 (1)
Exact Trigonometry Table for All Angles
From Everand
Exact Trigonometry Table for All Angles
Bhava Nath Dahal
No ratings yet
Exact Trigonometric Table for all Angles
From Everand
Exact Trigonometric Table for all Angles
Bhava Nath Dahal
No ratings yet

CE687A Lecture23

Uploaded by

CE687A Lecture23

Uploaded by

Statistical and Econometric

Methods for Transportation

• Let 𝜇’s in the population be distributed by Gamma distribution

𝐸 𝑍|𝜇 = 𝜇, 𝑉𝑎𝑟 𝑍|𝜇 = 𝜇

Resembles a gamma distribution

• 𝐸𝑋 . is the expectation over the values of 𝑋.

𝑉𝑎𝑟 𝑌 = 𝐸 𝑉𝑎𝑟 𝑌|𝑋 + 𝑉𝑎𝑟 𝐸 𝑌 𝑋

𝑉𝑎𝑟 𝑌|𝑋 = 𝐸 𝑌 2 𝑋 − 𝐸 𝑌|𝑋 2 (𝑐𝑜𝑛𝑑𝑖𝑡𝑖𝑜𝑛𝑎𝑙 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒)

= 𝐸 𝑌 2 − 𝐸 𝐸 𝑌|𝑋 2 (𝑖𝑡𝑒𝑟𝑎𝑡𝑒𝑑 𝑒𝑥𝑝𝑒𝑐𝑡𝑎𝑡𝑖𝑜𝑛𝑠)

⇒ 𝑉𝑎𝑟 𝑌 = 𝐸 𝑉𝑎𝑟 𝑌|𝑋 + 𝑉𝑎𝑟 𝐸 𝑌|𝑋

𝜇𝑘 −𝜇 𝛽 𝛼−1 −𝛽𝜇 𝛼−1

• 𝐸 𝜇 : prior, obtained from empirical data/modeling

Let’s compare three models

• Model 2: CNTYPOP, CNTYPOP_Urb, Urban, NUMLANES2, NUMCLASS3,

• Model 3: log(CNTYPOP), NUMLANES2, NUMCLASS3, FUNCLASS3

• Which model would you prefer? Why?

• Are the relevant variables significant

Null hypothesis assumes

• The above estimator is also known as Eicker-Huber-White heteroscedasticity consistent

Additional reference for more mathematical background: http://people.stern.nyu.edu/wgreene/MathStat/GreeneChapter9.pdf 21

Let’s compare three models

• Model 4 (Poisson): CNTYPOP, NUMLANES2, NUMCLASS3, FUNCLASS3

• Model 5 (Poisson): log(CNTYPOP), NUMLANES2, NUMCLASS3, FUNCLASS3

• Model 6 (NB): log(CNTYPOP), NUMLANES2, NUMCLASS3, FUNCLASS3

• Linear model is a special case of

• Would you prefer log(CNTYPOP) over

• Would you prefer NB over Poisson?

• Out-of-sample predictions are expected to be worse than in-sample predictions due to

Image source: https://medium.com/greyatom/what-is-underfitting-and-overfitting-in-machine-learning-and-how-to-deal-with-it-6803a989c76 26

Given a Poisson regression

• For normal distribution: 𝐷𝑁 = σ𝑁

• Null Deviance: 𝜇ෝ𝑖 = 𝑦ത (also the outcome of an intercept-only model)

You might also like