
18.650 – Fundamentals of Statistics

7. Generalized linear models

1/32
Linear model

A linear model assumes

$$Y \mid X = x \sim \mathcal{N}(\mu(x), \sigma^2 I),$$

and¹

$$\mathbb{E}(Y \mid X = x) = \mu(x) = x^\top \beta.$$

¹ Throughout we drop the boldface notation for vectors.

2/32
Components of a linear model

The two model components (that we are going to relax) are:

1. Random component: the response variable $Y$ is continuous and $Y \mid X = x$ is Gaussian with mean $\mu(x)$.

2. Regression function: $\mu(x) = x^\top \beta$.
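
As a point of reference before relaxing these assumptions, here is a minimal sketch of fitting this model by least squares on synthetic data (all names and values are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])  # intercept + 2 covariates
beta_true = np.array([1.0, 2.0, -0.5])
Y = X @ beta_true + rng.normal(size=n)       # Y | X = x ~ N(x^T beta, 1)

beta_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)  # least squares = Gaussian MLE
print(beta_hat)                              # close to beta_true
```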

3/32
Kyphosis

The Kyphosis data consist of measurements on 81 children following corrective spinal surgery. The binary response variable, $Y$, indicates the presence or absence of a postoperative deformity. The three covariates are:

- $X^{(1)}$: age of the child in months,
- $X^{(2)}$: number of vertebrae involved in the operation, and
- $X^{(3)}$: start of the range of the vertebrae involved.

Write $X = (1, X^{(1)}, X^{(2)}, X^{(3)})^\top \in \mathbb{R}^4$.

4/32
Kyphosis

- The response variable is binary, so there is no choice: $Y \mid X = x$ is Bernoulli with expected value $\mu(x) = \mathbb{E}[Y \mid X = x] \in (0, 1)$.
- We cannot write $\mu(x) = x^\top \beta$, because the right-hand side ranges through all of $\mathbb{R}$.
- We need an invertible function $f$ such that $f(x^\top \beta) \in (0, 1)$, such as the sigmoid sketched below.
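
A minimal sketch of one such $f$, the sigmoid (the inverse of the logit link introduced later in these slides); the values are illustrative:

```python
import numpy as np

def sigmoid(t):
    """Map any real t into (0, 1); the inverse of the logit link."""
    return 1 / (1 + np.exp(-t))

t = np.array([-10.0, -1.0, 0.0, 1.0, 10.0])   # possible values of x^T beta, unbounded
print(sigmoid(t))   # all in (0, 1), increasing and invertible
```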

5/32
Generalization

A generalized linear model (GLM) generalizes normal linear regression models in the following directions.

1. Random component:

$$Y \mid X = x \sim \text{some distribution}$$

(e.g. Bernoulli, exponential, Poisson)

2. Regression function:

$$g(\mu(x)) = x^\top \beta,$$

where $g$ is called the link function and $\mu(x) = \mathbb{E}(Y \mid X = x)$ is the conditional mean of $Y$.

6/32
Predator/Prey
Consider the following model for the number of prey $Y$ that a predator (a hawk) catches per day, given a number $X$ of prey (mice) in its hunting territory.

Random component: $Y > 0$, and the variance of the capture rate is known to be approximately equal to its expectation, so we propose the following model:

$$Y \mid X = x \sim \text{Poisson}(\mu(x)),$$

where $\mu(x) = \mathbb{E}[Y \mid X = x]$.

Regression function: we assume

$$\mu(x) = \frac{mx}{h + x}, \quad \text{for some unknown } m, h > 0,$$

where:

- $m$ is the maximum expected number of prey per day the predator can cope with, and
- $h$ is the number of prey such that $\mu(h) = m/2$.
7/32
The regression function $\mu(x)$ for $m = h = 10$

8/32
Example 2: Prey Capture Rate

Obviously $\mu(x)$ is not linear, but using the reciprocal link $g(x) = 1/x$, the right-hand side can be made linear in the parameters:

$$g(\mu(x)) = \frac{1}{\mu(x)} = \frac{h + x}{mx} = \beta_0 + \beta_1 \frac{1}{x},$$

with $\beta_0 = 1/m$ and $\beta_1 = h/m$.
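
A quick numerical check of this linearization, using the hypothetical values $m = h = 10$ from the preceding figure: $1/\mu(x)$ is exactly affine in $1/x$.

```python
import numpy as np

m, h = 10.0, 10.0                      # hypothetical values, as in the figure
x = np.linspace(1.0, 50.0, 200)
mu = m * x / (h + x)                   # the regression function

# fit 1/mu against 1/x: recovers beta_1 = h/m and beta_0 = 1/m exactly
slope, intercept = np.polyfit(1.0 / x, 1.0 / mu, deg=1)
print(slope, intercept)                # 1.0 (= h/m), 0.1 (= 1/m)
print(m * h / (h + h))                 # mu(h) = m/2 = 5.0
```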

9/32
Exponential Family

A family of distributions $\{\mathbb{P}_\theta : \theta \in \Theta\}$, $\Theta \subset \mathbb{R}^k$, is said to be a $k$-parameter exponential family on $\mathbb{R}^q$ if there exist real-valued functions:

- $\eta_1, \eta_2, \cdots, \eta_k$ and $B$ of $\theta$,
- $T_1, T_2, \cdots, T_k$, and $h$ of $y \in \mathbb{R}^q$, such that the density function (pmf or pdf) of $\mathbb{P}_\theta$ can be written as

$$f_\theta(y) = \exp\Big[\sum_{i=1}^k \eta_i(\theta) T_i(y) - B(\theta)\Big]\, h(y).$$

10/32
Normal distribution example
- Consider $Y \sim \mathcal{N}(\mu, \sigma^2)$, $\theta = (\mu, \sigma^2)$. The density is

$$f_\theta(y) = \exp\Big(\frac{\mu}{\sigma^2}\, y - \frac{1}{2\sigma^2}\, y^2 - \frac{\mu^2}{2\sigma^2}\Big) \frac{1}{\sqrt{2\pi}\,\sigma},$$

which forms a two-parameter exponential family with

$$\eta_1 = \frac{\mu}{\sigma^2}, \quad \eta_2 = -\frac{1}{2\sigma^2}, \quad T_1(y) = y, \quad T_2(y) = y^2,$$

$$B(\theta) = \frac{\mu^2}{2\sigma^2} + \log(\sigma\sqrt{2\pi}), \quad h(y) = 1.$$

- When $\sigma^2$ is known, it becomes a one-parameter exponential family on $\mathbb{R}$:

$$\eta = \frac{\mu}{\sigma^2}, \quad T(y) = y, \quad B(\theta) = \frac{\mu^2}{2\sigma^2}, \quad h(y) = \frac{e^{-\frac{y^2}{2\sigma^2}}}{\sqrt{2\pi}\,\sigma}.$$
11/32
Examples of discrete distributions

The following distributions form discrete exponential families of distributions with pmf:

- Bernoulli($p$): $p^y (1-p)^{1-y}$, $y \in \{0, 1\}$;
- Poisson($\lambda$): $e^{-\lambda}\, \dfrac{\lambda^y}{y!}$, $y = 0, 1, \ldots$.
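
As a worked step, the Bernoulli pmf fits the definition on the previous slide via

$$p^y (1-p)^{1-y} = \exp\Big(y \log\frac{p}{1-p} + \log(1-p)\Big),$$

so $\eta(p) = \log\frac{p}{1-p}$, $T(y) = y$, $B(p) = -\log(1-p)$, $h(y) = 1$. Likewise for the Poisson, $e^{-\lambda}\lambda^y/y! = \exp(y\log\lambda - \lambda)\cdot\frac{1}{y!}$ gives $\eta(\lambda) = \log\lambda$, $T(y) = y$, $B(\lambda) = \lambda$, $h(y) = 1/y!$.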

12/32
Examples of Continuous distributions
The following distributions form continuous exponential families
of distributions with pdf:
- Gamma($a, b$): $\dfrac{1}{\Gamma(a)\, b^a}\, y^{a-1} e^{-y/b}$;
  - above: $a$: shape parameter, $b$: scale parameter;
  - reparametrize with the mean parameter $\mu = ab$:

$$\frac{1}{\Gamma(a)} \Big(\frac{a}{\mu}\Big)^a y^{a-1} e^{-\frac{ay}{\mu}}.$$

- Inverse Gamma($\alpha, \beta$): $\dfrac{\beta^\alpha}{\Gamma(\alpha)}\, y^{-\alpha-1} e^{-\beta/y}$.

- Inverse Gaussian($\mu, \sigma^2$): $\sqrt{\dfrac{\sigma^2}{2\pi y^3}}\, e^{-\frac{\sigma^2 (y-\mu)^2}{2\mu^2 y}}$.

Others: Chi-square, Beta, Binomial, Negative binomial distributions.
13/32
One-parameter canonical exponential family

- Canonical exponential family for $k = 1$, $y \in \mathbb{R}$:

$$f_\theta(y) = \exp\Big(\frac{y\theta - b(\theta)}{\phi} + c(y, \phi)\Big)$$

for some known functions $b(\cdot)$ and $c(\cdot, \cdot)$.

- If $\phi$ is known, this is a one-parameter exponential family with $\theta$ being the canonical parameter.
- If $\phi$ is unknown, this may or may not be a two-parameter exponential family.
- $\phi$ is called the dispersion parameter.
- In this class, we always assume that $\phi$ is known.
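
As a sanity check, here is a sketch evaluating this canonical form with the Gaussian choices $b(\theta) = \theta^2/2$, $\phi = \sigma^2$ derived on the next slide, compared against scipy's normal density (names are illustrative):

```python
import numpy as np
from scipy.stats import norm

def canonical_density(y, theta, phi, b, c):
    """exp((y*theta - b(theta))/phi + c(y, phi)): the canonical form above."""
    return np.exp((y * theta - b(theta)) / phi + c(y, phi))

# Gaussian case (derived on the next slide): theta = mu, phi = sigma^2
def b(theta):
    return theta**2 / 2

def c(y, phi):
    return -0.5 * (y**2 / phi + np.log(2 * np.pi * phi))

y, mu, sigma2 = 1.3, 0.5, 2.0
print(canonical_density(y, mu, sigma2, b, c))      # matches the line below
print(norm.pdf(y, loc=mu, scale=np.sqrt(sigma2)))
```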

14/32
Normal distribution example

- Consider the following Normal density function with known variance $\sigma^2$:

$$f_\theta(y) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(y-\mu)^2}{2\sigma^2}} = \exp\Big\{\frac{y\mu - \frac{\mu^2}{2}}{\sigma^2} - \frac{1}{2}\Big(\frac{y^2}{\sigma^2} + \log(2\pi\sigma^2)\Big)\Big\}.$$

- Therefore $\theta = \mu$, $\phi = \sigma^2$, $b(\theta) = \frac{\theta^2}{2}$, and

$$c(y, \phi) = -\frac{1}{2}\Big(\frac{y^2}{\phi} + \log(2\pi\phi)\Big).$$

15/32
Other distributions

Table 1: Exponential Family

              Normal                            Poisson     Bernoulli
Notation      N(µ, σ²)                          P(µ)        B(p)
Range of y    (−∞, ∞)                           [0, ∞)      {0, 1}
φ             σ²                                1           1
b(θ)          θ²/2                              e^θ         log(1 + e^θ)
c(y, φ)       −(1/2)(y²/φ + log(2πφ))           −log y!     0

16/32
Likelihood

Let $\ell(\theta) = \log f_\theta(Y)$ denote the log-likelihood function.

The mean $\mathbb{E}(Y)$ and the variance $\operatorname{var}(Y)$ can be derived from the following identities:

- First identity:

$$\mathbb{E}\Big(\frac{\partial \ell}{\partial \theta}\Big) = 0.$$

- Second identity:

$$\mathbb{E}\Big(\frac{\partial^2 \ell}{\partial \theta^2}\Big) + \mathbb{E}\Big[\Big(\frac{\partial \ell}{\partial \theta}\Big)^2\Big] = 0.$$

17/32
Expected value

Note that

$$\ell(\theta) = \frac{Y\theta - b(\theta)}{\phi} + c(Y; \phi).$$

Therefore

$$\frac{\partial \ell}{\partial \theta} = \frac{Y - b'(\theta)}{\phi}.$$

It yields

$$0 = \mathbb{E}\Big(\frac{\partial \ell}{\partial \theta}\Big) = \frac{\mathbb{E}(Y) - b'(\theta)}{\phi},$$

which leads to

$$\mathbb{E}(Y) = b'(\theta).$$

18/32
Variance
On the other hand, we have

$$\frac{\partial^2 \ell}{\partial \theta^2} = -\frac{b''(\theta)}{\phi},$$

and from the previous result,

$$\frac{\partial \ell}{\partial \theta} = \frac{Y - b'(\theta)}{\phi} = \frac{Y - \mathbb{E}(Y)}{\phi}.$$

Together with the second identity, this yields

$$0 = -\frac{b''(\theta)}{\phi} + \frac{\operatorname{var}(Y)}{\phi^2},$$

which leads to

$$\operatorname{var}(Y) = \phi\, b''(\theta).$$

19/32
Example: Poisson distribution

Example: Consider a Poisson likelihood,

$$f(y) = \frac{\mu^y}{y!}\, e^{-\mu} = \exp\big(y \log\mu - \mu - \log(y!)\big).$$

Thus,

$$\theta = \log\mu, \quad b(\theta) = e^\theta, \quad \phi = 1, \quad c(y, \phi) = -\log(y!),$$

so

$$\mu = e^\theta, \quad b'(\theta) = b''(\theta) = e^\theta = \mu, \quad \text{hence } \mathbb{E}(Y) = \operatorname{var}(Y) = \mu.$$
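
The formulas $\mathbb{E}(Y) = b'(\theta)$ and $\operatorname{var}(Y) = \phi\, b''(\theta)$ can be checked by simulation; a quick sketch:

```python
import numpy as np

rng = np.random.default_rng(0)
mu = 4.0                           # theta = log 4, b'(theta) = b''(theta) = mu
Y = rng.poisson(mu, size=500_000)
print(Y.mean(), Y.var())           # both ~ 4.0: IE(Y) = var(Y) = mu
```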

20/32
Link function

- $\beta$ is the parameter of interest, and needs to appear somehow in the likelihood function for us to use maximum likelihood.
- A link function $g$ relates the linear predictor $X^\top \beta$ to the mean parameter $\mu$:

$$X^\top \beta = g(\mu).$$

- $g$ is required to be monotone increasing and differentiable, so that

$$\mu = g^{-1}(X^\top \beta).$$

21/32
Examples of link functions

- For the LM, $g(\cdot)$ = identity.
- Poisson data. Suppose $Y \mid X \sim \text{Poisson}(\mu(X))$.
  - $\mu(X) > 0$;
  - $\log(\mu(X)) = X^\top \beta$;
  - In general, a link function for count data should map $(0, +\infty)$ to $\mathbb{R}$.
  - The log link is a natural one.
- Bernoulli/Binomial data.
  - $0 < \mu < 1$;
  - $g$ should map $(0, 1)$ to $\mathbb{R}$;
  - 3 choices (see the sketch after this list):
    1. logit: $\log\Big(\dfrac{\mu(X)}{1 - \mu(X)}\Big) = X^\top \beta$;
    2. probit: $\Phi^{-1}(\mu(X)) = X^\top \beta$, where $\Phi(\cdot)$ is the standard normal cdf;
    3. complementary log-log: $\log\big(-\log(1 - \mu(X))\big) = X^\top \beta$.
  - The logit link is the natural choice.
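
A sketch of these three Bernoulli links using scipy ($\Phi^{-1}$ is `norm.ppf`); the values of $\mu$ are illustrative:

```python
import numpy as np
from scipy.special import logit      # logit(mu) = log(mu / (1 - mu))
from scipy.stats import norm

mu = np.array([0.1, 0.5, 0.9])
print(logit(mu))                     # logit link, maps (0, 1) to IR
print(norm.ppf(mu))                  # probit link: Phi^{-1}(mu)
print(np.log(-np.log(1 - mu)))       # complementary log-log link
```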

22/32
Examples of link functions for Bernoulli response
[Figure: the logit link $g_1(x) = f_1^{-1}(x) = \log\frac{x}{1-x}$ (in blue) and the probit link $g_2(x) = f_2^{-1}(x) = \Phi^{-1}(x)$ (in red), plotted for $x \in (0, 1)$; the vertical axis ranges from $-5$ to $5$.]

23/32
Examples of link functions for Bernoulli response
[Figure: the inverse link functions $f_1(x) = \dfrac{e^x}{1 + e^x}$ (in blue, inverse logit) and $f_2(x) = \Phi(x)$ (in red, Gaussian CDF), plotted for $x \in [-5, 5]$; both map $\mathbb{R}$ to $(0, 1)$.]
24/32
Canonical Link

- The function $g$ that links the mean $\mu$ to the canonical parameter $\theta$ is called the canonical link:

$$g(\mu) = \theta.$$

- Since $\mu = b'(\theta)$, the canonical link is given by

$$g(\mu) = (b')^{-1}(\mu).$$

- If $\phi > 0$, the canonical link function is strictly increasing. Why? Because $\operatorname{var}(Y) = \phi\, b''(\theta) > 0$, so $b'$ is strictly increasing, and hence so is its inverse $(b')^{-1}$.
25/32
Example: the Bernoulli distribution

- We can check that

$$b(\theta) = \log(1 + e^\theta).$$

- Hence we solve

$$b'(\theta) = \frac{e^\theta}{1 + e^\theta} = \mu \iff \theta = \log\frac{\mu}{1 - \mu}.$$

- The canonical link for the Bernoulli distribution is therefore the logit link.
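
The same computation can be reproduced symbolically; a sketch with sympy:

```python
import sympy as sp

theta, mu = sp.symbols('theta mu', real=True)
b = sp.log(1 + sp.exp(theta))                    # Bernoulli b(theta)

# canonical link g(mu) = (b')^{-1}(mu): solve b'(theta) = mu for theta
sol = sp.solve(sp.Eq(sp.diff(b, theta), mu), theta)
print(sol)   # [log(-mu/(mu - 1))], i.e. theta = log(mu/(1 - mu)): the logit
```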

26/32
Other examples

            b(θ)              g(µ)
Normal      θ²/2              µ
Poisson     exp(θ)            log µ
Bernoulli   log(1 + e^θ)      log(µ/(1 − µ))
Gamma       −log(−θ)          −1/µ

27/32
Model and notation

- Let $(X_i, Y_i) \in \mathbb{R}^p \times \mathbb{R}$, $i = 1, \ldots, n$, be independent random pairs such that the conditional distribution of $Y_i$ given $X_i = x_i$ has density in the canonical exponential family:

$$f_{\theta_i}(y_i) = \exp\Big\{\frac{y_i \theta_i - b(\theta_i)}{\phi} + c(y_i, \phi)\Big\}.$$

- $Y = (Y_1, \ldots, Y_n)^\top$, $X = (X_1, \ldots, X_n)^\top$.
- Here the mean $\mu_i = \mathbb{E}[Y_i \mid X_i]$ is related to the canonical parameter $\theta_i$ via

$$\mu_i = b'(\theta_i),$$

- and $\mu_i$ depends linearly on the covariates through a link function $g$:

$$g(\mu_i) = X_i^\top \beta.$$

28/32
Back to β

- Given a link function $g$, note the following relationship between $\beta$ and $\theta$:

$$\theta_i = (b')^{-1}(\mu_i) = (b')^{-1}\big(g^{-1}(X_i^\top \beta)\big) \equiv h(X_i^\top \beta),$$

where $h$ is defined as

$$h = (b')^{-1} \circ g^{-1} = (g \circ b')^{-1}.$$

- Remark: if $g$ is the canonical link function, $h$ is the identity.

29/32
Log-likelihood

- The log-likelihood is given by

$$\ell_n(Y, X, \beta) = \sum_i \frac{Y_i \theta_i - b(\theta_i)}{\phi} = \sum_i \frac{Y_i\, h(X_i^\top \beta) - b(h(X_i^\top \beta))}{\phi},$$

up to a constant term.

- Note that when we use the canonical link function, we obtain the simpler expression

$$\ell_n(Y, X, \beta) = \sum_i \frac{Y_i\, X_i^\top \beta - b(X_i^\top \beta)}{\phi}.$$
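
With Bernoulli responses and the canonical (logit) link, $b(t) = \log(1 + e^t)$ and $\phi = 1$, so this expression can be maximized directly; a minimal sketch on synthetic data (all names and values illustrative):

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
n = 500
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta_true = np.array([-0.5, 1.5])
Y = rng.binomial(1, 1 / (1 + np.exp(-X @ beta_true)))

def neg_loglik(beta):
    eta = X @ beta                 # X_i^T beta for all i
    # -l_n = -sum_i (Y_i X_i^T beta - b(X_i^T beta)), b(t) = log(1 + e^t)
    return -(Y @ eta - np.logaddexp(0.0, eta).sum())

beta_hat = minimize(neg_loglik, x0=np.zeros(2)).x
print(beta_hat)                    # close to beta_true
```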

30/32
Strict concavity

- The log-likelihood $\ell_n(Y, X, \beta)$ is strictly concave in $\beta$ when using the canonical link function and $\phi > 0$. Why? Its Hessian is $-\frac{1}{\phi}\sum_i b''(X_i^\top \beta)\, X_i X_i^\top$, which is negative definite since $b'' > 0$ (provided the $X_i$ span $\mathbb{R}^p$).
- As a consequence, the maximum likelihood estimator, when it exists, is unique.
- On the other hand, if another parameterization is used, the likelihood function may not be strictly concave, leading to several local maxima.

31/32
Concluding remarks

- Maximum likelihood for Bernoulli $Y$ and the logit link is called logistic regression.
- In general, there is no closed form for the MLE, and we have to use iterative optimization methods (e.g., Newton-Raphson or Fisher scoring, also known as iteratively reweighted least squares).
- The asymptotic normality of the MLE also applies to GLMs.
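
In practice one relies on library implementations of these iterative methods; a sketch with statsmodels on synthetic data (the library fits GLMs by iteratively reweighted least squares):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
X = sm.add_constant(rng.normal(size=(200, 2)))
Y = rng.binomial(1, 1 / (1 + np.exp(-X @ np.array([0.3, 1.0, -0.7]))))

# Binomial family with its default (canonical) logit link = logistic regression
res = sm.GLM(Y, X, family=sm.families.Binomial()).fit()
print(res.params)    # MLE, asymptotically normal
```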

32/32
