Newsom
Psy 525/625 Categorical Data Analysis, Spring 2021
ln[π / (1 − π)] = α + βX
The left-hand side of the equation represents the logit transformation, which takes the natural log of the
ratio of the probability that Y is equal to 1 to the probability that it is not. As we know, the probability, π, is
just the mean of the Y values, assuming 0,1 coding, which is often expressed as µ. The logit
transformation could then be written in terms of the mean rather than the probability,
ln[µ / (1 − µ)] = α + βX
The transformation of the mean represents a link to the central tendency of the distribution, sometimes
called the location, one of the important defining aspects of any given probability distribution. The log
transformation represents a kind of link function (often a canonical link function) 1 that is sometimes given
more generally as g(.), with the letter g used as an arbitrary name for a mathematical function and the
use of the “.” within the parentheses to suggest that any variable, value, or function (the argument) could
be placed within. For logistic regression, this is known as the logit link function. The right-hand side of the
equation, α + βX, is the familiar equation for the regression line and represents a linear combination of
the parameters for the regression.
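To make the direction of the transformation concrete, here is a minimal numeric sketch in Python (the values of α, β, and X are made up purely for illustration), showing that the logit of the expected probability equals the linear combination α + βX and that the inverse logit recovers the probability:

import numpy as np

def logit(p):
    # Logit link: natural log of the odds, p / (1 - p)
    return np.log(p / (1 - p))

def inv_logit(eta):
    # Inverse logit: converts a linear predictor back to a probability
    return 1 / (1 + np.exp(-eta))

# Hypothetical values chosen only for illustration
alpha, beta, x = -1.0, 0.5, 2.0
eta = alpha + beta * x      # the linear predictor, alpha + beta*X
pi = inv_logit(eta)         # the expected probability, pi
print(eta, pi, logit(pi))   # logit(pi) returns the linear predictor again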
The concept of this logistic link function can be generalized to any other distribution, with the simplest,
most familiar case being the ordinary least squares or linear regression model. For the linear regression
model, the link function is called the identity link function, because no transformation is needed to get
from the linear combination of parameters on the right-hand side of the equation to the expected value of
the normally distributed response.
Ŷ = E(Y) = g(E(Y)) = g(µ) = α + βX
I give four equivalent terms here, Yˆ , E (Y ) , g ( E (Y ) ) , and g ( µ ) , just to illustrate all of the different
notations that might be used to express the linear regression model—we don’t need them all. The
expected value E(Y) or mean, µ, of the response is plugged into the g(.) function because the predicted value
is the expected value for Y when X is equal to some particular value.
The general concept that we can use a variety of link functions on the left-hand side of the equation and
still keep the linear parameters on the right is referred to as the generalized linear model (Nelder &
Wedderburn, 1972). The term should not be confused with the term “general linear model” used to refer
generally to regression and ANOVA and their equivalence, which are special cases of the generalized
linear model. 2
1 Technically, there is a distinction between a link function generally speaking and a canonical link function (see Agresti, 2015, pp. 3, 123, 142). A
canonical link function is one that transforms the mean, µ = E(yi), to the natural exponential (location) parameter for the exponential family of
distributions (e.g., normal, binomial, Poisson, gamma). The canonical link function is the most commonly used link form in generalized linear
models.
2 GLM is sometimes used for either generalized linear model or general linear model. GLIM is another abbreviation that is used only for the
generalized linear model.
Random Component
The second component of the generalized linear model is the probability distribution associated with a
particular type of variable—the distribution that the errors from the model are expected to follow.
There are a number of distributions that fall under the exponential family of distributions, whose densities
are all described with specific mathematical equations for the shape(s) of the distribution (see the
overhead “Some Members of the Exponential Distribution Family”). All of the exponential family of
distributions can be expressed in a very general form that has two parameters, the natural or canonical
parameter (the location, which is some function of the mean) and the variance parameter. 3 For ordinary
least squares, it is the normal distribution. For logistic regression, it is the logistic distribution. Several
other distributions are commonly used, including the Poisson for count variables, the inverse normal for
the probit model, and the log-normal and log-logistic distributions used in survival analysis.
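As one worked illustration of the natural parameter idea (this is standard exponential-family algebra rather than anything specific to this handout), the Bernoulli probability function for a binary Y can be rewritten as

f(y; π) = π^y (1 − π)^(1 − y) = exp[ y ln(π / (1 − π)) + ln(1 − π) ]

so the term multiplying y, ln[π / (1 − π)], is the natural (canonical) parameter, which is exactly the logit link used for logistic regression.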
Generalized linear models are specified by indicating both the link function and the residual distribution.
Sometimes a particular link is always used with a particular distribution, but sometimes there may be
several possible distributions for a certain link. Maximum likelihood estimation is used for generalized
linear models, with the usual significance tests for overall model fit and for the coefficients (Wald,
likelihood ratio, and score tests; see Agresti, 2015, Chapter 4 for details on estimation and standard errors). Software
packages, such as SPSS (Genlin), SAS (PROC GENMOD), and glm in R, allow users to specify link
functions and distributions for a particular analytic circumstance.
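As a brief sketch of what such a specification looks like, the Python statsmodels package can serve as a stand-in for any of these programs (the data and variable names below are invented for illustration; the same family and link choices are available in Genlin, GENMOD, and R's glm):

import numpy as np
import statsmodels.api as sm

# Hypothetical data: a binary outcome y and a single predictor x
rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = rng.binomial(1, 1 / (1 + np.exp(-(-0.5 + 1.0 * x))))
X = sm.add_constant(x)  # adds the intercept (alpha) column

# Logistic regression: binomial family with the logit link
logit_fit = sm.GLM(y, X, family=sm.families.Binomial(link=sm.families.links.Logit())).fit()

# Probit regression: same binomial family, probit link
probit_fit = sm.GLM(y, X, family=sm.families.Binomial(link=sm.families.links.Probit())).fit()

# Ordinary linear regression as a GLM: Gaussian (normal) family with the identity link
ols_fit = sm.GLM(y, X, family=sm.families.Gaussian()).fit()

print(logit_fit.params, probit_fit.params, ols_fit.params)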
3 The two parameters for a distribution are usually given using a function notation, such as f(y; µ, σ), where µ is the natural parameter and σ is
the variance parameter.
the binary variable instead 0 (married). This underlying continuous variable is often called a latent or
unobserved variable, 4 and the probit link function can be conceptualized as the link between the linear
combination of parameters on the right-hand side and some unobserved continuum on the left-hand side of
the generalized linear model. Below, I use Y* (the Greek letter eta, η, is often used instead) to refer to
the latent predicted score.
Y* = α + βX
The figure below illustrates the concept, using Y for the observed score, Y* for the unobserved score, and τ (tau) for the threshold.
[Figure: density of the unobserved Y* with threshold τ; the region Y* < τ corresponds to observed Y = 0, and τ < Y* corresponds to observed Y = 1.]
Because the Y* distribution is assumed to be normal, the unstandardized probit coefficients represent a
change in the z-score for Y* for each unit change in X. You can think about this as a partially
standardized solution, with the dependent but not the independent variable standardized. The next step
is to standardize X, to obtain a fully standardized solution, which provides a familiar metric and a
convenient magnitude of effect for the association between each predictor and the response.
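The threshold conception can also be made concrete with a small simulation sketch (the parameter values and the standard normal error below are assumptions chosen only to illustrate the Y* idea):

import numpy as np

rng = np.random.default_rng(1)

# Hypothetical values chosen only for illustration
alpha, beta, tau = 0.0, 0.8, 0.0

x = rng.normal(size=5)
y_star = alpha + beta * x + rng.normal(size=5)  # latent Y* with a standard normal error
y = (y_star > tau).astype(int)                  # observed Y is 1 only when Y* exceeds the threshold tau
print(np.round(y_star, 2), y)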
If the true underlying variable we are predicting is continuous, we can assume the errors are normally
distributed. The probit regression model uses an (inverse) normal distribution link for a binary variable
instead of the logit link, where Y* = Φ⁻¹(π). The −1 superscript refers to the inverse of the cdf, corresponding
to the cumulative probability that Y is equal to 1.
π = Φ(Y*) = (1 / √(2π*)) ∫_{−∞}^{Y*} e^{−z²/2} dz
π* is used in the above for the mathematical constant, to distinguish it from the probability. Φ⁻¹ is the probit,
and, like the simpler logit, it connects the linear model with the expected probability.
Using the inverse normal function (in a statistical package or spreadsheet) for an observed probability
returns a z-score on the normal distribution. The complementary function to the inverse normal cdf is the
normal cdf, Φ, which can be used to transform a z-score back to a probability (i.e., the underlying
mathematical transformation behind conversions obtained from a z statistical table).
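For instance, using scipy in Python (any package or spreadsheet with the normal cdf and its inverse behaves the same way), the two functions are exact complements and reproduce the familiar z-table values:

from scipy.stats import norm

p = 0.975
z = norm.ppf(p)       # inverse normal cdf: probability -> z-score (about 1.96)
p_back = norm.cdf(z)  # normal cdf: z-score -> probability (0.975 again)
print(z, p_back)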
Φ⁻¹(πi) = α + βXi
πi = Φ(α + βXi)
The transformation, thus, represents the translation of Y to Y* and back in the figure above. Similarly,
values from the logistic model can be used to return an expected probability for a given value of X from
the model, except that a simpler mathematical transformation is used to obtain the predicted probability
from the cdf, e^(α + βX) / (1 + e^(α + βX)).
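Both conversions amount to one line in most software; here is a minimal Python sketch with hypothetical coefficients (not fitted values), converting the linear predictor to a predicted probability under the probit and logit links:

import numpy as np
from scipy.stats import norm
from scipy.special import expit  # logistic cdf, e^x / (1 + e^x)

# Hypothetical coefficients and predictor values for illustration
alpha_probit, beta_probit = -0.4, 0.6
alpha_logit, beta_logit = -0.7, 1.0
x = np.array([-1.0, 0.0, 1.0, 2.0])

pi_probit = norm.cdf(alpha_probit + beta_probit * x)  # pi = Phi(alpha + beta*X)
pi_logit = expit(alpha_logit + beta_logit * x)        # pi = e^(alpha + beta*X) / (1 + e^(alpha + beta*X))
print(np.round(pi_probit, 3), np.round(pi_logit, 3))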
4 Not really referring to the same concept as the term “latent variable” used in structural equation modeling, where a latent variable is estimated
by a set of observed indicators assessing the same construct.
The probit regression model is related to polychoric correlations, which do not require designation of an
explanatory and response variable. Polychoric correlations were originally developed by Karl Pearson
(1901) to correct for the loss of information in the usual Pearson correlations due to categorization of a
continuous variable (see Olsson, 1979; MacCallum, Zhang, Preacher, & Rucker, 2002). The term
polychoric is used more generally, but tetrachoric correlations are a special case of polychoric
correlations involving only binary variables, and polyserial correlations are those involving the correlation
between an ordinal (or binary) variable and a continuous variable (see also the “Analysis of Ordinal Contingency Tables”
handout for more information). The concept of Y* is the same as that invoked to conceptualize probit
analysis, where the polychoric correlation represents the correlation between two Y* variables. The
variable Y* is a true value that is not observed but leads to the observed response of Y, which is binary or
ordinal.
As this figure suggests, probit and logistic regression models nearly always produce the same statistical
result. The unstandardized coefficient estimates from the two modeling approaches are on a different
scale, given the different link functions (logit vs. probit), although the logistic coefficients tend to be
approximately 1.7 times larger than probit coefficients. 5 Different disciplines tend to use one more frequently
than the other, although logistic regression is by far the most common. Logistic regression provides odds
ratios, and probit models produce easily defined standardized coefficients.
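The rough 1.7 rescaling can be checked directly; the sketch below simply compares the logistic cdf with the normal cdf after dividing by 1.7 (the 1.7 value is the rule of thumb from the text, not an exact constant):

import numpy as np
from scipy.stats import norm
from scipy.special import expit

z = np.linspace(-3, 3, 7)
print(np.round(expit(z), 3))           # logistic cdf
print(np.round(norm.cdf(z / 1.7), 3))  # normal cdf on the rescaled values; close, but not identical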
5 The difference tends to vary between about 1.6 and 1.8 and depends on the overall proportion of the outcome. This difference in units is
connected to the variances of the logistic and normal probability distributions, where the proportion and the variance for binary variables are
interdependent. The standard logistic distribution has variance (π*)²/3 ≈ 3.29, or a standard deviation of about 1.81, which leads to a cdf that is
very close to the normal cdf, but this is based