
Linear and generalized linear models

Monday 3 June, 2024


Esa Läärä

Statistical Practice in Epidemiology using R


3 to 7 June, 2024
International Agency for Research on Cancer, Lyon, France
Outline
▶ Simple linear regression.
▶ Fitting a regression model and extracting results.
▶ Predictions and diagnostics.
▶ Categorical factors and contrast matrices.
▶ Main effects and interactions.
▶ Modelling curved effects.
▶ Generalized linear models.
▶ Binary regression and Poisson regression.



Variables in generalized linear models
▶ The outcome or response variable must be numeric.
▶ Main types of response variables are
– Metric or continuous (a measurement with units).
– Binary (“yes” vs. “no”, coded 1/0), or proportion.
– Failure in person-time, or incidence rate.
▶ Explanatory variables or regressors can be
– Numeric or quantitative variables
– Categorical factors, represented by class indicators or contrast matrices.



The births data in Epi
id: Identity number for mother and baby.
bweight: Birth weight of baby.
lowbw: Indicator for birth weight less than 2500 g.
gestwks: Gestation period in weeks.
preterm: Indicator for gestation period less than 37 weeks.
matage: Maternal age.
hyp: Indicator for maternal hypertension (0 = no, 1 = yes).
sex: Sex of baby (1 = male, 2 = female).

Declaring and transforming some variables as factors:


> library(Epi) ; data(births)
> births <- transform(births,
+ hyp = factor(hyp, labels=c("N", "H")),
+ sex = factor(sex, labels=c("M", "F")),
+ gest4 = cut(gestwks,breaks=c(20, 35, 37, 39, 45), right=FALSE) )
> births <- subset(births, !is.na(gestwks))



Birth weight and gestational age
[Scatter plot: Birth weight (g) on the vertical axis vs. Gestational age (wk) on the horizontal axis, births data]

> with(births, plot(bweight ~ gestwks, xlim = c(24,45), pch = 16, cex.axis=1.5, cex.lab = 1.5,
+ xlab= "Gestational age (wk)", ylab= "Birth weight (g)" ) )



Metric response, numeric explanatory variable
Roughly linear relationship between bweight and gestwks
→ Simple linear regression model fitted.
> m <- lm(bweight ~ gestwks, data=births)
▶ lm() is the function that fits linear regression models, assuming
Gaussian distribution or family for error terms.
▶ bweight ~ gestwks is the model formula
▶ m is a model object belonging to class “lm”.
> coef(m) – Printing the estimated regression coefficients
(Intercept) gestwks
-4489.1 197.0

Interpretation of intercept and slope?


Model object and extractor functions
Model object = list of different elements, each being separately accessible.
– See str(m) for the full list.

Functions that extract results from the fitted model object


▶ summary(m) – lots of output
▶ coef(m) – beta-hats only (see above)

▶ ci.lin(m)[,c(1,5,6)] – the β̂j's plus confidence limits


Estimate 2.5% 97.5%
(Intercept) -4489.1 -5157.3 -3821.0
gestwks 197.0 179.7 214.2
Function ci.lin() is found in the Epi package.
▶ anova(m) – Analysis of Variance Table
Other extractor functions, for example
▶ fitted(m), resid(m), vcov(m), . . .
▶ predict(m, ...) – or ci.pred(m, ...) in Epi (see the sketch below)
– Predicted responses for desired combinations of new values of the
regressors, supplied in the argument newdata.
– Argument interval specifies whether confidence intervals for the
mean response or prediction intervals for individual responses are
returned.
▶ plot(m) – produces various diagnostic plots based on residuals
(raw, standardized or studentized residuals).
Many of these are special methods for certain generic functions, aimed at
acting on objects of class “lm”.
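For example, a minimal sketch of such predictions (nd0 is a small, hypothetical grid of new gestwks values; ci.pred() is in the Epi package):

> nd0 <- data.frame(gestwks = c(32, 36, 40))
> ci.pred(m, newdata = nd0)                       # predicted means with 95% confidence limits
> predict(m, newdata = nd0, interval = "pred")    # prediction intervals for individual responses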



Fitted values, confidence & prediction intervals
[Scatter plot of Birth weight (g) vs. Gestational age (wk) with the fitted regression line, 95% confidence band and 95% prediction band]

> nd <- data.frame( gestwks = seq(24, 45, by = 0.25 ) )


> pr.c1 <- predict( m, newdata=nd, interval="conf" )
> pr.p1 <- predict( m, newdata=nd, interval="pred" )
> with(births, plot(bweight ~ gestwks, xlim = c(24,45), cex.axis=1.5, cex.lab = 1.5,
+      xlab = "Gestational age (wk)", ylab = "Birth weight (g)" ) )
> matlines( nd$gestwks, pr.c1, lty=1, lwd=c(3,2,2), col=c("red","blue","blue"))
> matlines( nd$gestwks, pr.p1, lty=1, lwd=c(3,2,2), col=c("red","green","green"))
A couple of diagnostic plots
[Two diagnostic plots: Residuals vs Fitted values, and a normal Q–Q plot of the standardized residuals; observations 78, 124 and 30 are flagged]

> par(mfrow=c(1,2))
> plot(m, 1:2, cex.lab = 1.5, cex.axis=1.5, cex.caption=1.5, lwd=2)

▶ Some deviation from linearity?


▶ Reasonable agreement with Gaussian error assumption?
Factor as an explanatory variable
▶ How does bweight depend on maternal hypertension?
> mh <- lm( bweight ~ hyp, data=births)

Estimate 2.5% 97.5%


(Intercept) 3198.9 3140.2 3257.6
hypH -430.7 -585.4 -275.9
▶ Removal of intercept → mean bweights by hyp:
> mh2 <- lm( bweight ~ -1 + hyp, data = births)
> coef(mh2)
hypN hypH
3198.9 2768.2
▶ Interpretation: -430.7 = 2768.2 - 3198.9
= difference between level 2 (“H”) vs. reference level 1 (“N”) of factor hyp.
Additive model with both gestwks and hyp
▶ Joint effect of hyp and gestwks is modelled e.g. by updating:
> mhg <- update(mh, . ~ . + gestwks)
Estimate 2.5% 97.5%
(Intercept) -4285.0 -4969.7 -3600.3
hypH -143.7 -259.0 -28.4
gestwks 192.2 174.7 209.8
▶ The coefficient for hyp: H vs. N is attenuated (from −430.7 to −143.7).
▶ Does −143.7 estimate the causal effect of hyp adjusted for gestwks?
▶ No, as gestwks is most likely a mediator. – Much of the effect of hyp on
bweight is mediated via shorter gestwks in hypertensive mothers.
▶ Instead, for total causal effect of hyp, adjustment for at least age is
needed, but adjusting for gestwks is overadjustment.
▶ Yet, for predictive modelling it is OK to keep gestwks.
Model with interaction of hyp and gestwks
▶ mhgi <- lm(bweight ~ hyp + gestwks + hyp:gestwks, ...)
▶ Or with shorter formula: bweight ~ hyp * gestwks
Estimate 2.5% 97.5%
(Intercept) -3960.8 -4758.0 -3163.6
hypH -1332.7 -2841.0 175.7
gestwks 183.9 163.5 204.4
hypH:gestwks 31.4 -8.3 71.1

▶ Estimated slope: 183.9 g/wk in reference group N of normotensive mothers and


183.9 + 31.4 = 215.3 g/wk in hypertensive mothers.

⇔ For each additional week the difference in mean bweight between the H and N groups
increases by 31.4 g.
▶ Interpretation of Intercept and “main effect” hypH?
Model with interaction (cont’d)
A more interpretable parametrization is obtained if gestwks is centered at some
reference value, using e.g. the insulate operator I() for an explicit transformation
of an original term.
▶ mi2 <- lm(bweight ~ hyp*I(gestwks-40), ...)
Estimate 2.5% 97.5%
(Intercept) 3395.6 3347.5 3443.7
hypH -77.3 -219.8 65.3
I(gestwks - 40) 183.9 163.5 204.4
hypH:I(gestwks - 40) 31.4 -8.3 71.1
▶ The “main effect” of hyp = −77.3 is the difference between H and N
at the reference value gestwks = 40.
▶ Intercept = 3395.6 is the estimated mean bweight
at the reference value 40 of gestwks in group N.
Factors and contrasts in R
▶ A categorical explanatory variable or factor with L levels will be
represented by L − 1 linearly independent columns in the
model matrix of a linear model.
▶ These columns can be defined in various ways implying alternative
parametrizations for the effect of the factor.
▶ The parametrization is defined by the chosen type of contrasts.
▶ Default: treatment contrasts, in which the 1st class is the reference, and the
regression coefficient βk for class k is interpreted as βk = µk − µ1 (see the sketch below).
▶ An own parametrization may be tailored with ci.lin(mod, ctr.mat=CM) after
fitting, the pertinent contrast matrix CM being given as the 2nd (ctr.mat) argument.
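A quick way to see the default treatment-contrast columns is to inspect the model matrix directly; a small sketch using the gest4 factor created earlier (model.matrix() is base R):

> ## the 1st level of gest4 is the reference; each other level gets its own 0/1 indicator column
> head(model.matrix(~ gest4, data = births))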



Two factors: additive effects
▶ Factor X has 3 levels, Z has 2 levels – Model:
µ = α + β1X1 + β2X2 + β3X3 + γ1Z1 + γ2Z2
▶ X1 (reference), X2, X3 are the indicators for X,
▶ Z1 (reference), Z2 are the indicators for Z.
▶ Omitting X1 and Z1 the model for the mean is:
µ = α + β2X2 + β3X3 + γ2Z2
with predicted means µjk (j = 1, 2, 3; k = 1, 2):

             Z = 1               Z = 2
X = 1   µ11 = α             µ12 = α + γ2
X = 2   µ21 = α + β2        µ22 = α + β2 + γ2
X = 3   µ31 = α + β3        µ32 = α + β3 + γ2
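A concrete sketch with the births data, letting gest4 (4 levels, hence three β's) play the role of X and hyp (2 levels) that of Z:

> m2f <- lm(bweight ~ gest4 + hyp, data = births)
> ci.lin(m2f)[, c(1,5,6)]    # estimated α, β's and γ with 95% CIs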
Two factors with interaction
▶ Effect of Z differs at different levels of X:

             Z = 1               Z = 2
X = 1   µ11 = α             µ12 = α + γ2
X = 2   µ21 = α + β2        µ22 = α + β2 + γ2 + δ22
X = 3   µ31 = α + β3        µ32 = α + β3 + γ2 + δ32
▶ How much the effect of Z (level 2 vs. 1)
changes when the level of X is changed from 1 to 3:
δ32 = (µ32 − µ31 ) − (µ12 − µ11 )
= (µ32 − µ12 ) − (µ31 − µ11 ),
= how much the effect of X (level 3 vs. 1)
changes when the level of Z is changed from 1 to 2.
▶ See the exercise: interaction of hyp and gest4.
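A sketch of what that exercise involves (the * operator adds the product terms that carry the δ's):

> m2fi <- lm(bweight ~ gest4 * hyp, data = births)
> round(ci.lin(m2fi)[, c(1,5,6)], 1)    # includes the interaction terms gest4...:hypH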
Contrasts in R
▶ All contrasts can be implemented by supplying a suitable
contrast function giving the contrast matrix, e.g.:
> contr.cum(3)
1 0 0
2 1 0
3 1 1
> contr.sum(3)
1  1  0
2  0  1
3 -1 -1

▶ In a model formula, the factor name (here faktori) can be replaced by an expression like
C(faktori, contr.cum).
▶ Function ci.lin() can calculate CIs for linear functions of the parameters
of a fitted model mall, when supplied with a relevant contrast matrix:
> ci.lin(mall, ctr.mat = CM)[ , c(1,5,6)]
→ No need to specify contrasts in model formula!
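For example, a hand-made contrast matrix can extract the gestwks slopes in the N and H groups from the interaction model mhgi fitted earlier (a sketch; the coefficient order of mhgi is assumed to be Intercept, hypH, gestwks, hypH:gestwks):

> CM <- rbind("slope N" = c(0, 0, 1, 0),
+             "slope H" = c(0, 0, 1, 1))
> ci.lin(mhgi, ctr.mat = CM)[, c(1,5,6)]    # 183.9 and 215.3 g/wk with 95% CIs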
More about numeric regressors
What if dependence of Y on X is non-linear?
▶ Categorize the values of X into a factor.
– Continuous effects violently discretized by often
arbitrary cutpoints. This is inefficient.
▶ Fit a low-degree (e.g. 2 to 4) polynomial of X .
– Tail behaviour may be problematic.
▶ Use fractional polynomials.
– Invariance problems. Only useful if X = 0 is well-defined.
▶ Use a spline model: smooth function s(X; β). – See Martyn’s lecture.
– More flexible models that act locally.
– Effect of X reported by graphing ŝ(X; β) and its CI.
Mean bweight as a 3rd order polynomial of gestwks
[Scatter plot of bweight vs. gestwks with the fitted 3rd order polynomial curve]

> mp3 <- update( m, . ~ . - gestwks + poly(gestwks, 3) )

▶ The model is linear in parameters with 4 terms & 4 df.


▶ Otherwise good, but the tails do not behave well.
Penalized spline model with cross-validation
[Scatter plot of bweight vs. gestwks with the fitted penalized spline curve]

> library(mgcv)
> mpen <- gam( bweight ~ s(gestwks), data = births)

▶ Looks quite nice.


▶ Model df ≈ 4.2; close to 4, as in the 3rd degree polynomial model.
From linear to generalized linear models
▶ An alternative way of fitting our 1st Gaussian model:
> m <- glm(bweight ~ gestwks, family=gaussian, data=births)

▶ Function glm() fits generalized linear models (GLM).


▶ Requires specification of the
– family – i.e. the assumed “error” distribution for the Yi's,
– link function – a transformation of the expected Yi .
▶ Covers common models for other types of response variables and
distributions, too, e.g. logistic regression for binary responses and
Poisson regression for counts.
▶ Fitting: method of maximum likelihood.
▶ Many extractor functions for a glm object similar to those for an lm object.
Generalized linear models
Modelling how expected values, risks, rates, etc. depend on explanatory variables
or regressors X = (X1 , . . . , Xp ). – Common elements:
▶ Each subject i (i = 1, . . . , N) has their own regressor profile, i.e. the vector
xiᵀ = (xi1, . . . , xip) of values of X.
▶ Let the vector βᵀ = (β0, β1, . . . , βp) contain the regression coefficients.
The linear predictor is a linear combination of the βj's and the xij's:
ηi = β0 + β1xi1 + · · · + βpxip
▶ Some Xj s can be product terms for interactions and modifications if
needed, and splines may be used for continuous covariates.
▶ Further model specification depends on the type of outcome variable,
assumed error distribution or family, desired interpretation of coefficients,
and importance and choice of time scale(s).
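In matrix form the linear predictors are just the model matrix times the coefficient vector; for a Gaussian model with the identity link they coincide with the fitted values. A quick sketch with the simple model m fitted earlier:

> eta <- model.matrix(m) %*% coef(m)             # ηi = β0 + β1 * gestwks_i
> all.equal(as.vector(eta), unname(fitted(m)))   # TRUE for the identity link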
Binary regression and interpretations of coefficients
▶ Basic model for risks π(xi) = P{Yi = 1 | X = xi} = E(Yi | X = xi) with a
fixed risk period and complete follow-up (no censoring, no competing events):
g{π(xi)} = β0 + β1xi1 + · · · + βpxip ,  i = 1, . . . , N.
▶ Link g(·) and interpretation of the βj's, assuming the validity of the model
(including homogeneity or non-modification of the coefficient in question):
– id ⇒ βj = adjusted risk difference (RD) for Xj = 1 vs. Xj = 0,
– log ⇒ βj = adjusted log of risk ratio (RR), ditto,
– logit ⇒ βj = adjusted log of odds ratio (OR), ditto.
▶ Fitting: glm(..., family=binomial(link=...), ...)
▶ Issues with the id & log links in keeping the predicted π̂(·) between 0 and 1.
– A solution for RR: Doubling the cases & logit-link! (Ning et al. 2022).
– A solution for RD exists, too (Battey et al. 2019).
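A sketch with the births data, using the binary indicator lowbw as the response (logit link shown; link = "log" or "identity" would follow the same pattern but may fail to converge for the reasons above; ci.exp() is in Epi):

> mb <- glm(lowbw ~ gestwks + hyp, family = binomial(link = "logit"), data = births)
> ci.exp(mb)    # adjusted odds ratios with 95% CIs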
Poisson regression – model for rates
▶ A common outcome variable is a pair (D, Y ) = (no. of cases, person-time),
from which the incidence rate = D/Y (see Janne’s lecture on Monday).
▶ The Poisson regression model specifies how the theoretical hazard rates
λ(xi) are assumed to depend on the values of X.
▶ Some components of X represent the relevant time scales
(as in the exercise of today; more details in Bendix’s lecture on Wednesday).
▶ Linear predictor as above – Link g(·) and interpretation of the βj's:
– id ⇒ βj = adjusted rate difference (RD) for Xj = 1 vs. Xj = 0,
– log ⇒ βj = adjusted log of rate ratio (RR), ditto.

▶ Fitting – our recommended approach using Epi:


glm(cbind(d,y) ~ ..., family=poisreg(link=...),...)
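A minimal sketch with hypothetical aggregated data: d events in y person-years in two exposure groups; poisreg (in Epi) takes the two-column response cbind(d, y):

> dd <- data.frame(d = c(12, 25), y = c(1000, 800), expo = factor(c("no", "yes")))
> mr <- glm(cbind(d, y) ~ expo, family = poisreg(link = "log"), data = dd)
> ci.exp(mr)    # rate ratio, exposed vs. unexposed, with 95% CI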
What was covered
▶ A wide range of models from simple linear regression to splines.
▶ Gaussian family for continuous outcomes, binomial for binary outcomes, and
Poisson family for rates.
▶ Various link functions for different parametrizations.
▶ R functions for fitting linear and generalized linear models: lm() and glm().
▶ Parametrization of categorical explanatory factors; contrast matrices.
▶ Extracting results and predictions: ci.lin(), fitted(), predict().
▶ Model diagnostics: resid(), plot.lm(), ... .

