Generalized Linear Models
The usual linear regression model assumes that the study variable is normally distributed, whereas the nonlinear logistic and Poisson regressions are based on the Bernoulli and Poisson distributions, respectively, of the study variable. As in logistic and Poisson regressions, the study variable can follow other probability distributions such as the exponential, gamma, inverse normal, etc. One family describing such distributions is the exponential family of distributions. The generalized linear model is based on this family and unifies linear and nonlinear regression models. It assumes that the distribution of the study variable is a member of the exponential family of distributions, i.e., its probability density (or mass) function can be written as
$$f(x; \theta) = \exp\left[ a(x)\, b(\theta) + c(\theta) + d(x) \right].$$
If $a(X) = X$, the distribution is said to be in canonical form. The function $b(\theta)$ is called the natural parameter of the distribution. The parameter $\theta$ is of interest, and all other parameters, which are not of interest, are called nuisance parameters.
Example:
Normal distribution
$$f(x; \mu, \sigma) = \frac{1}{\sigma\sqrt{2\pi}} \exp\left[ -\frac{1}{2\sigma^2}(x - \mu)^2 \right]; \quad -\infty < x < \infty,\; -\infty < \mu < \infty,\; \sigma^2 > 0$$
$$= \exp\left[ x\,\frac{\mu}{\sigma^2} - \frac{\mu^2}{2\sigma^2} - \frac{1}{2}\ln\left(2\pi\sigma^2\right) - \frac{x^2}{2\sigma^2} \right].$$
Here $a(x) = x$ and $b(\theta) = \dfrac{\mu}{\sigma^2}$.
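As a quick numerical sanity check, the sketch below (Python with NumPy/SciPy; the values of $\mu$ and $\sigma$ are arbitrary) verifies that the exponential-family rewrite above reproduces the usual normal density:

```python
import numpy as np
from scipy.stats import norm

# Check numerically that the exponential-family rewrite of the normal
# density matches the usual form (parameter values chosen arbitrarily).
mu, sigma = 1.5, 2.0
x = np.linspace(-5.0, 8.0, 7)

usual = norm.pdf(x, loc=mu, scale=sigma)
# exp[ x*mu/sigma^2 - mu^2/(2 sigma^2) - (1/2) ln(2 pi sigma^2) - x^2/(2 sigma^2) ]
ef_form = np.exp(x * mu / sigma**2
                 - mu**2 / (2 * sigma**2)
                 - 0.5 * np.log(2 * np.pi * sigma**2)
                 - x**2 / (2 * sigma**2))

assert np.allclose(usual, ef_form)
```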
Let
$$U = \frac{dL}{d\theta};$$
then for any distribution
$$E(U) = 0$$
$$\operatorname{Var}(U) = E(U^2) = E(-U')$$
where $U' = \dfrac{dU}{d\theta}$. The function $U$ is called the score, and $\operatorname{Var}(U)$ is called the information.
The log-likelihood function is
$$L = \ln[f(X, \theta)] = a(X)\, b(\theta) + c(\theta) + d(X)$$
and then
$$U = \frac{dL}{d\theta} = a(X)\, b'(\theta) + c'(\theta)$$
$$U' = \frac{d^2 L}{d\theta^2} = a(X)\, b''(\theta) + c''(\theta)$$
where $b'(\theta) = \dfrac{db(\theta)}{d\theta}$, $b''(\theta) = \dfrac{d^2 b(\theta)}{d\theta^2}$, $c'(\theta) = \dfrac{dc(\theta)}{d\theta}$ and $c''(\theta) = \dfrac{d^2 c(\theta)}{d\theta^2}$.
Since $E(U) = 0$,
$$E[a(X)] = -\frac{c'(\theta)}{b'(\theta)}.$$
Further,
$$E(-U') = -b''(\theta)\, E[a(X)] - c''(\theta)$$
and, because $U = a(X)\, b'(\theta) + c'(\theta)$,
$$\operatorname{Var}(U) = [b'(\theta)]^2 \operatorname{Var}[a(X)] = E(-U')$$
$$\Rightarrow\; \operatorname{Var}[a(X)] = \frac{-b''(\theta)\, E[a(X)] - c''(\theta)}{[b'(\theta)]^2} = \frac{b''(\theta)\, c'(\theta) - c''(\theta)\, b'(\theta)}{[b'(\theta)]^3}.$$
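These identities can be illustrated by simulation. The sketch below takes the normal family with $\theta = \mu$ and known $\sigma^2$ as a working example, so that $U = (X - \mu)/\sigma^2$ and $-U' = 1/\sigma^2$; the sample size and parameter values are arbitrary:

```python
import numpy as np

# Monte Carlo illustration of E(U) = 0 and Var(U) = E(-U') for the
# N(mu, sigma^2) family with theta = mu (sigma^2 treated as known).
rng = np.random.default_rng(0)
mu, sigma, m = 1.5, 2.0, 200_000

x = rng.normal(mu, sigma, size=m)
U = (x - mu) / sigma**2          # score: dL/dmu = (x - mu)/sigma^2
minus_U_prime = 1 / sigma**2     # -U' = 1/sigma^2 (constant here)

print(U.mean())                  # approx 0
print(U.var(), minus_U_prime)    # both approx 1/sigma^2 = 0.25
```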
Now we consider two examples which illustrate how other distributions and their properties can be obtained as particular cases:
Binomial distribution
$$f(x, \pi) = \binom{n}{x} \pi^x (1 - \pi)^{n - x}, \quad x = 0, 1, \ldots, n.$$
Here $a(x) = x$, $\theta = \pi$, $b(\theta) = \ln\dfrac{\pi}{1 - \pi}$, $c(\theta) = n\ln(1 - \pi)$, $d(x) = \ln\dbinom{n}{x}$, and
$$L = \ln f(x, \pi) = x\ln\pi - x\ln(1 - \pi) + n\ln(1 - \pi) + \ln\binom{n}{x}.$$
This is the canonical form of $f(x, \pi)$ with natural parameter $\ln\dfrac{\pi}{1 - \pi}$.
Then
$$U = \frac{dL}{d\pi} = \frac{x - n\pi}{\pi(1 - \pi)}$$
and
$$E(U) = \frac{E(x) - n\pi}{\pi(1 - \pi)} = \frac{n\pi - n\pi}{\pi(1 - \pi)} = 0$$
$$\operatorname{Var}(U) = \frac{\operatorname{Var}(x)}{\pi^2(1 - \pi)^2} = \frac{n\pi(1 - \pi)}{\pi^2(1 - \pi)^2} = \frac{n}{\pi(1 - \pi)}$$
$$E(-U') = E\left[\frac{x}{\pi^2} + \frac{n - x}{(1 - \pi)^2}\right] = \frac{n}{\pi} + \frac{n}{1 - \pi} = \frac{n}{\pi(1 - \pi)}.$$
Also
$$b'(\theta) = b'(\pi) = \frac{1}{\pi(1 - \pi)}$$
$$b''(\theta) = b''(\pi) = \frac{2\pi - 1}{[\pi(1 - \pi)]^2}$$
$$c'(\theta) = c'(\pi) = -\frac{n}{1 - \pi}$$
$$c''(\theta) = c''(\pi) = -\frac{n}{(1 - \pi)^2}.$$
Thus
$$E[a(X)] = E(X) = -\frac{c'(\pi)}{b'(\pi)} = n\pi$$
$$\operatorname{Var}[a(X)] = \operatorname{Var}(X) = \frac{b''(\pi)\, c'(\pi) - c''(\pi)\, b'(\pi)}{[b'(\pi)]^3} = n\pi(1 - \pi).$$
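A quick numerical check of these results: plugging the derivatives $b'$, $b''$, $c'$, $c''$ into the general formulas recovers $E(X) = n\pi$ and $\operatorname{Var}(X) = n\pi(1 - \pi)$ (the values of $n$ and $\pi$ below are arbitrary):

```python
import numpy as np

# Numerical check of the binomial results above: plug b', b'', c', c''
# into E[a(X)] = -c'/b' and Var[a(X)] = (b''c' - c''b')/(b')^3.
n, pi = 10, 0.3

b1 = 1 / (pi * (1 - pi))                 # b'(pi)
b2 = (2 * pi - 1) / (pi * (1 - pi))**2   # b''(pi)
c1 = -n / (1 - pi)                       # c'(pi)
c2 = -n / (1 - pi)**2                    # c''(pi)

E_X = -c1 / b1
Var_X = (b2 * c1 - c2 * b1) / b1**3

assert np.isclose(E_X, n * pi)                 # n*pi = 3.0
assert np.isclose(Var_X, n * pi * (1 - pi))    # n*pi*(1-pi) = 2.1
```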
Poisson distribution
$$f(x, \lambda) = \frac{e^{-\lambda}\lambda^x}{x!}, \quad x = 0, 1, 2, \ldots$$
Here $a(x) = x$, $\theta = \lambda$, $b(\theta) = \ln\lambda$, $c(\theta) = -\lambda$, $d(x) = -\ln(x!)$, and
$$L = \ln f(x, \lambda) = x\ln\lambda - \lambda - \ln(x!).$$
This is the canonical form of $f(x, \lambda)$ with natural parameter $\ln\lambda$. Then
$$U = \frac{dL}{d\lambda} = \frac{x}{\lambda} - 1, \qquad E(U) = \frac{\lambda}{\lambda} - 1 = 0$$
$$\operatorname{Var}(U) = \frac{\operatorname{Var}(x)}{\lambda^2} = \frac{\lambda}{\lambda^2} = \frac{1}{\lambda}$$
$$E(-U') = E\left[-\frac{d}{d\lambda}\left(\frac{x}{\lambda} - 1\right)\right] = E\left(\frac{x}{\lambda^2}\right) = \frac{\lambda}{\lambda^2} = \frac{1}{\lambda}$$
$$b'(\theta) = b'(\lambda) = \frac{1}{\lambda}, \qquad b''(\theta) = b''(\lambda) = -\frac{1}{\lambda^2}$$
$$c'(\theta) = c'(\lambda) = -1, \qquad c''(\theta) = c''(\lambda) = 0.$$
Thus
$$E[a(X)] = E(X) = -\frac{c'(\lambda)}{b'(\lambda)} = \lambda$$
$$\operatorname{Var}[a(X)] = \operatorname{Var}(X) = \frac{b''(\lambda)\, c'(\lambda) - c''(\lambda)\, b'(\lambda)}{[b'(\lambda)]^3} = \frac{\frac{1}{\lambda^2} - 0}{\frac{1}{\lambda^3}} = \lambda.$$
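Again these results can be checked by simulation; the sketch below draws Poisson variates with an arbitrary $\lambda$ and verifies $E(U) \approx 0$ and $\operatorname{Var}(U) \approx E(-U') \approx 1/\lambda$:

```python
import numpy as np

# Monte Carlo check of the Poisson score results: E(U) = 0 and
# Var(U) = E(-U') = 1/lambda, with U = x/lambda - 1.
rng = np.random.default_rng(1)
lam, m = 4.0, 200_000

x = rng.poisson(lam, size=m)
U = x / lam - 1

print(U.mean())                        # approx 0
print(U.var(), (x / lam**2).mean())    # both approx 1/lambda = 0.25
```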
Let $\eta_i$ denote the linear predictor, which relates to the expected value of the study variable; it is expressed as
$$\eta_i = g[E(y_i)] = g(\mu_i) = x_i'\beta$$
where $x_i'\beta = \beta_0 + \sum_{j=1}^{k} \beta_j x_{ij}$. Thus
$$E(y_i) = g^{-1}(\eta_i) = g^{-1}(x_i'\beta)$$
where $g$ is called the link function.
If the study variable follows the:
• Binomial distribution, then logistic regression is used, and the logistic (logit) link is used as the canonical link, which is defined as $\eta_i = \ln\dfrac{\pi_i}{1 - \pi_i}$.
• Poisson distribution, then the log link is used as the canonical link, which is given as $\eta_i = \ln\lambda_i$.
• Exponential or gamma distribution, then the canonical link function used is the reciprocal link, given by $\eta_i = \dfrac{1}{\lambda_i}$.
Other types of link functions are
- the probit link, given as $\eta_i = \Phi^{-1}[E(y_i)]$, where $\Phi$ is the cumulative distribution function of the $N(0, 1)$ distribution;
- the complementary log-log link, given by $\eta_i = \ln\left[-\ln\{1 - E(y_i)\}\right]$.
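For concreteness, the sketch below collects these link functions and their inverses as plain Python functions; the function names are illustrative, not a library API:

```python
import numpy as np
from scipy.stats import norm

# Link functions mapping the mean mu to the linear predictor eta, and back.
def logit(mu):        return np.log(mu / (1 - mu))       # canonical: binomial
def inv_logit(eta):   return 1 / (1 + np.exp(-eta))

def log_link(mu):     return np.log(mu)                  # canonical: Poisson
def inv_log(eta):     return np.exp(eta)

def reciprocal(mu):   return 1 / mu                      # canonical: exponential/gamma
def probit(mu):       return norm.ppf(mu)                # Phi^{-1}
def cloglog(mu):      return np.log(-np.log(1 - mu))     # complementary log-log

mu = 0.7
assert np.isclose(inv_logit(logit(mu)), mu)
```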
A link is preferable if it maps the range of $\mu_i$ onto the whole real line and provides a good empirical approximation. It should also carry a meaningful interpretation in the case of real applications. The choice of link function is like choosing an appropriate transformation of the study variable. The link function takes advantage of the natural distribution of the study variable. An incorrect choice of link function can give rise to incorrect statistical inferences.
Maximum likelihood estimation of GLM:
The least squares method cannot be directly applied when the study variable is not continuous. So we use the maximum likelihood estimation method in GLM, which has a close connection with the iteratively reweighted least squares method.
Given the data $(x_i, y_i)$, $i = 1, 2, \ldots, n$, with $y$ following an exponential family distribution, the joint p.d.f. is
$$f(y; \theta, \phi) = \exp\left[\sum_{i=1}^{n} y_i\, b(\theta_i) + \sum_{i=1}^{n} c(\theta_i) + \sum_{i=1}^{n} d(y_i)\right]$$
where $\theta$ is the parameter of interest and $\phi$ is the nuisance parameter. Both $\theta$ and $\phi$ can also be vectors.
Consider a smaller set of parameters $\beta = (\beta_1, \beta_2, \ldots, \beta_k)'$ which relates a function $g(\mu_i)$ of the mean $\mu_i$ to the linear predictor. For example, if data on $y_i$ and $n_i$ are available such that $y_i \sim Bin(n_i, \pi_i)$, where $y_i$ is the number of successes in $n_i$ trials and $\pi_i$ is the probability of success, then the joint p.d.f. of all $n$ observations is
$$\exp\left[\sum_{i=1}^{n} y_i \ln\frac{\pi_i}{1 - \pi_i} + \sum_{i=1}^{n} n_i \ln(1 - \pi_i) + \sum_{i=1}^{n} \ln\binom{n_i}{y_i}\right].$$
Assuming that the variation in $\pi_i$ is explained by the $x_i$ values, choose a suitable link function $g(\pi_i) = x_i'\beta$. A sensible link function is the log-odds,
$$g(\pi_i) = \ln\frac{\pi_i}{1 - \pi_i}.$$
Now the objective is to fit the model
$$\ln\frac{\pi_i}{1 - \pi_i} = x_i'\beta = \beta_0 + \beta_1 x_{i1} + \ldots + \beta_k x_{ik}$$
or, equivalently,
$$\pi_i = \frac{\exp(x_i'\beta)}{1 + \exp(x_i'\beta)}.$$
The general log-likelihood function is
$$L(\beta) = \ln f(y; \theta, \phi) = \sum_{i=1}^{n} L_i = \sum_{i=1}^{n} y_i\, b(\theta_i) + \sum_{i=1}^{n} c(\theta_i) + \sum_{i=1}^{n} d(y_i).$$
Suppose $\hat\beta$ is the final value obtained after optimization and is the maximum likelihood estimator of $\beta$; then, asymptotically,
$$E(\hat\beta) = \beta$$
$$V(\hat\beta) = a(\phi)(X'V^{-1}X)^{-1}$$
where $V$ is a diagonal matrix formed by the variances of the estimated parameters in the linear predictor, apart from $a(\phi)$. The covariance matrix can be estimated by replacing $V$ by its estimate $\hat V$.
In GLM, the variance of $y_i$ is not constant, so generalized least squares estimation is used to obtain more efficient estimators.
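A minimal sketch of maximum likelihood fitting of such a logistic GLM, using the GLM routine of the statsmodels package (which fits by iteratively reweighted least squares); the data are simulated purely for illustration and the true coefficients are arbitrary:

```python
import numpy as np
import statsmodels.api as sm

# Simulate Bernoulli data from a logistic model and fit it by ML.
rng = np.random.default_rng(2)
n = 500
X = sm.add_constant(rng.normal(size=(n, 2)))     # columns: 1, x1, x2
beta_true = np.array([-0.5, 1.0, -2.0])
pi = 1 / (1 + np.exp(-X @ beta_true))
y = rng.binomial(1, pi)

fit = sm.GLM(y, X, family=sm.families.Binomial()).fit()
print(fit.params)         # beta_hat, approximately beta_true
print(fit.cov_params())   # estimated asymptotic covariance of beta_hat
```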
To conduct tests of hypothesis in GLM, the model deviance is used for testing the goodness of the model fit. The difference in the deviances of the full and reduced models is used to decide on a subset model.
The Wald inference can be applied for testing hypotheses and for confidence interval estimation about individual model parameters. The Wald statistic for testing the null hypothesis $H_0: R\beta = r$, where $R$ is $q \times (k + 1)$ with $\operatorname{rank}(R) = q$, is
$$W = (R\hat\beta - r)'\left[R(X'\hat V^{-1}X)^{-1}R'\right]^{-1}(R\hat\beta - r).$$
The distribution of $W$ under $H_0$ is the $\chi^2$ distribution with $q$ degrees of freedom.
For an individual parameter, the test of $H_0: \beta_j = \beta_{j0}$ uses
$$Z = \sqrt{W} = \frac{\hat\beta_j - \beta_{j0}}{se(\hat\beta_j)}$$
which has the $N(0, 1)$ distribution under $H_0$, where $se(\hat\beta_j)$ is the standard error of $\hat\beta_j$. Confidence intervals can be constructed using the Wald test. For example, a $100(1 - \alpha)\%$ confidence interval for $\beta_j$ is
$$\hat\beta_j \pm Z_{\alpha/2}\, se(\hat\beta_j).$$
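The Wald computation is direct matrix algebra. In the sketch below, $\hat\beta$ and its estimated covariance matrix are illustrative placeholders standing in for $a(\phi)(X'\hat V^{-1}X)^{-1}$ from a real fit:

```python
import numpy as np
from scipy.stats import chi2, norm

# Wald test W = (R b - r)' [R Cov(b) R']^{-1} (R b - r), illustrative values.
beta_hat = np.array([-0.4, 1.1, -1.9])
cov_hat = np.diag([0.04, 0.03, 0.05])       # placeholder for Cov(beta_hat)

R = np.array([[0.0, 1.0, 0.0],              # H0: beta_1 = 0
              [0.0, 0.0, 1.0]])             #     beta_2 = 0
r = np.zeros(2)

d = R @ beta_hat - r
W = d @ np.linalg.solve(R @ cov_hat @ R.T, d)
p_value = chi2.sf(W, df=R.shape[0])         # chi-square with q = 2 d.f.

# 95% Wald confidence interval for an individual coefficient:
se1 = np.sqrt(cov_hat[1, 1])
ci = beta_hat[1] + np.array([-1, 1]) * norm.ppf(0.975) * se1
```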
For comparing nested models, the likelihood ratio test statistic is
$$LR = 2\ln\frac{\hat L_{full}}{\hat L_{reduced}}$$
where $\hat L_{full}$ and $\hat L_{reduced}$ are the maximized likelihood functions under the full and reduced models, respectively. The likelihood ratio test statistic has a $\chi^2$ distribution with degrees of freedom equal to the difference in the degrees of freedom of the full and reduced models.
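A sketch of the computation, with placeholder values for the two maximized log-likelihoods and for the difference in degrees of freedom:

```python
from scipy.stats import chi2

# Likelihood ratio test; the log-likelihoods below are hypothetical
# maximized values under the full and reduced models.
llf_full, llf_reduced = -210.3, -218.9
df_diff = 2                                 # extra parameters in the full model

LR = 2 * (llf_full - llf_reduced)           # = 2 ln(L_full / L_reduced)
p_value = chi2.sf(LR, df=df_diff)
```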
The estimate of the mean response at a point $x_0$ is
$$\hat y_0 = \hat\mu_0 = g^{-1}(x_0'\hat\beta).$$
It is understood that $x_0$ is expandable to the model form if more terms, e.g., interaction terms, are to be accommodated in the linear predictor.
To find the confidence interval, the asymptotic covariance matrix of $\hat\beta$, given by $\Omega = a(\phi)(X'V^{-1}X)^{-1}$, is used; a $100(1 - \alpha)\%$ confidence interval for the mean response at $x_0$ is
$$g^{-1}\left[x_0'\hat\beta - Z_{\alpha/2}\sqrt{x_0'\hat\Omega\, x_0}\right] \le \mu(x_0) \le g^{-1}\left[x_0'\hat\beta + Z_{\alpha/2}\sqrt{x_0'\hat\Omega\, x_0}\right].$$
This approach usually works well in practice because $\hat\beta$ is the maximum likelihood estimate of $\beta$, so any function of $\hat\beta$ is also a maximum likelihood estimate. This method constructs the confidence interval in the space of the linear predictor and then transforms the interval back into the original metric. The Wald method can also be used to derive an approximate confidence interval for the mean response.
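The sketch below illustrates this construction for the logit link, with illustrative placeholder values for $\hat\beta$, $\hat\Omega$, and $x_0$:

```python
import numpy as np
from scipy.stats import norm

# CI for the mean response at x0: build the interval on the linear
# predictor, then map it back through g^{-1} (here the inverse logit).
beta_hat = np.array([-0.4, 1.1, -1.9])      # illustrative estimates
Omega_hat = np.diag([0.04, 0.03, 0.05])     # asymptotic Cov(beta_hat)
x0 = np.array([1.0, 0.5, -0.2])             # includes the intercept term

eta0 = x0 @ beta_hat
half = norm.ppf(0.975) * np.sqrt(x0 @ Omega_hat @ x0)
inv_logit = lambda e: 1 / (1 + np.exp(-e))
lower, upper = inv_logit(eta0 - half), inv_logit(eta0 + half)
```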
In the case of logistic regression, the deviance residual is
$$d_i = \pm\left\{2\left[y_i \ln\left(\frac{y_i}{n_i\hat\pi_i}\right) + (n_i - y_i)\ln\left(\frac{n_i - y_i}{n_i(1 - \hat\pi_i)}\right)\right]\right\}^{1/2}$$
where
$$\hat\pi_i = \frac{1}{1 + \exp(-x_i'\hat\beta)}.$$
As the fit of the model to the data becomes better, $\hat\pi_i$ approaches $\dfrac{y_i}{n_i}$ and the deviance residuals become smaller and close to zero.
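A sketch computing these deviance residuals, with illustrative values of $y_i$, $n_i$, and $\hat\pi_i$; the sign is taken as the sign of $y_i/n_i - \hat\pi_i$, a common convention:

```python
import numpy as np

# Logistic-regression deviance residuals for illustrative (not real) data.
y = np.array([3.0, 7.0, 2.0, 9.0])           # successes
n = np.array([10.0, 12.0, 8.0, 15.0])        # trials
pi_hat = np.array([0.28, 0.60, 0.22, 0.63])  # fitted probabilities

inner = (y * np.log(y / (n * pi_hat))
         + (n - y) * np.log((n - y) / (n * (1 - pi_hat))))
d = np.sign(y / n - pi_hat) * np.sqrt(2 * inner)
print(d)   # residuals shrink toward zero as pi_hat_i approaches y_i / n_i
```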
In the case of Poisson regression,
$$d_i = y_i \ln\left[\frac{y_i}{\exp(x_i'\hat\beta)}\right] - \left[y_i - \exp(x_i'\hat\beta)\right], \quad i = 1, 2, \ldots, n.$$
Here $y_i$ and $\hat y_i = \exp(x_i'\hat\beta)$ become close to each other as the deviance residuals approach zero.
The behaviour of deviance residuals is like the behaviour of ordinary residuals in the standard normal linear regression model. The normal probability plot is obtained by plotting the deviance residuals on a normal probability scale versus the fitted values. Usually, the fitted values are transformed to a constant information scale before plotting, so
• $\hat y_i$ is used in the case of usual regression with the normal distribution,